TOP ACTIVATIONS
MAX = 3.349

said , " You are both good children . I love
" You 're both very lovely children . I
said , " You are both winners . You are both
" Wow , you are very good helpers ," the
" You are also very kind and careful ,"
S ue says , " Yes , I want to
on the tall rail near the pond . Then
z ig zag path on the grass . She thinks it
spark ly â just like Mam ma liked
Tom say , " You are welcome !" Then
" Wow , they are so pretty ," Anna says
and says , " You are the best friend I ever
and say , " You 're welcome , Mom my .
was a little creature on the rail . It was a
and says , " You 're welcome , my sweet ies
" Thank you , you are very nice ," Lily says
two friends , Tony and Lucy , were walking in a
happy here . We have each other and we will make
saw some soft parts on the handle . It felt nice
. " You are very kind and generous ,"

MIN ACTIVATIONS
MIN = -3.731

was all better . The charming soldier opened the window ,
The noisy brother and the quiet brother celebrated with a big
a strong and brave bear on it . Benny wanted to
to the bowl . The quiet brother used the shield to
the day playing with the yellow shield and having lots of
cool shirt with a dinosaur on it . Tim
river separates us !" The quiet bunny was sad . They
with pictures of cute animals on it . He loved to
ran away , and the blue bird saved its eggs .
. When the white all igator got out of
it ! The white all igator was scared so
with his favorite cartoon character on the cover . He asked
to go home . The polite boy was sad . He
the dragon again , but this time it was awake !
pick you up !" The quiet bunny was happy . The
big red nose and he wore funny clothes . He liked
calendar with pictures of animals on it . Every morning ,
and had fun . The anxious feeling was gone . The
borrow the cane . The poor man was kind and said
play with her dolls and wear pretty dresses . One day

INTERVAL 1.579 - 3.349
CONTAINS 0.565%

girl named Lucy . Lucy was nervous to go outside and
their map . They looked everywhere , but it was gone
was her daughter 's favourite toy - a big , red
you something to help ." When they arrived home
Suddenly , the jog ger stopped running and waved with a

INTERVAL -0.191 - 1.579
CONTAINS 62.591%

it in her bucket with the other shells she found .
The woman smiled at the girl and said , " How
notice that too !" They started to play ,
your cars , I promise ." He showed them
to do it . She was so proud of helping to

INTERVAL -1.961 - -0.191
CONTAINS 36.723%

quickly unt ied the ribbon and Benny felt much better .
quarrel and was grateful the little child showed him the right
and over until the sink is almost empty . They use
the table and grab some papers . They start to fold
scared , but he was brave and wanted to know what

INTERVAL -3.731 - -1.961
CONTAINS 0.120%

again tomorrow , but not too much ." B
very excited and quickly picked up the cord . She took
ball all day . The rude big dog saw them and
my . Tim my loved wearing his favorite red socks every
ocean and look for seas hell s . One day ,
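
The four INTERVAL bounds above are consistent with splitting the range from MIN = -3.731 to MAX = 3.349 into four equal-width bins. The sketch below reproduces those bounds under that assumption. The function name `interval_edges` and the bin count of 4 are illustrative choices, not taken from the tool that produced this dump.

```python
def interval_edges(lo, hi, bins=4):
    """Return the bins + 1 edges of equal-width intervals spanning [lo, hi].

    Assumption: the dashboard uses equal-width bins; the listed bounds
    match this, but the dump itself does not state how they were chosen.
    """
    width = (hi - lo) / bins
    # Round to three decimals to match the precision shown in the dump.
    return [round(lo + i * width, 3) for i in range(bins + 1)]


# MIN and MAX as reported in the dump.
edges = interval_edges(-3.731, 3.349)

# Pair consecutive edges into (lower, upper) intervals, highest first,
# mirroring the order in which the dump lists them.
intervals = list(zip(edges[:-1], edges[1:]))[::-1]

# Shares of activations per interval as listed (highest interval first).
shares = [0.565, 62.591, 36.723, 0.120]
total_share = round(sum(shares), 3)

for (low, high), share in zip(intervals, shares):
    print(f"INTERVAL {low} - {high}  CONTAINS {share}%")
```

Under this assumption each bin is 1.77 wide, and the listed shares sum to 99.999%, with the shortfall from 100% presumably due to rounding in the display.
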
Token said
Token ID531
Feature activation+6.68e-01

Pos logit contributions
 least+1.088
oh+0.814
 ourselves+0.809
 very+0.768
 cle+0.754
Neg logit contributions
les-1.475
ork-1.421
lights-1.402
pron-1.346
lf-1.240
Token,
Token ID11
Feature activation-4.69e-01

Pos logit contributions
 least+1.014
oh+0.746
 ourselves+0.721
 cle+0.707
 very+0.706
Neg logit contributions
les-1.235
ork-1.214
lights-1.184
pron-1.108
adoes-1.042
Token "
Token ID366
Feature activation+5.03e-01

Pos logit contributions
les+0.856
ork+0.849
lights+0.829
pron+0.770
adoes+0.735
Neg logit contributions
 least-0.690
 very-0.540
oh-0.532
 ourselves-0.508
 cle-0.497
TokenYou
Token ID1639
Feature activation+1.68e+00

Pos logit contributions
 least+0.527
oh+0.334
 very+0.328
 ourselves+0.321
 cle+0.308
Neg logit contributions
les-1.140
ork-1.121
lights-1.107
pron-1.068
adoes-1.004
Token are
Token ID389
Feature activation+2.56e+00

Pos logit contributions
 least+1.404
oh+0.966
 oh+0.882
 ourselves+0.870
 cle+0.802
Neg logit contributions
les-4.035
ork-3.830
lights-3.686
pron-3.672
adoes-3.454
Token both
Token ID1111
Feature activation+3.35e+00

Pos logit contributions
 least+2.896
oh+2.141
 oh+1.822
 cle+1.740
'm+1.729
Neg logit contributions
les-4.983
pron-4.968
ork-4.929
lights-4.818
lf-4.597
Token good
Token ID922
Feature activation+9.98e-01

Pos logit contributions
 least+5.362
oh+4.446
 Wait+3.840
Pro+3.807
 ourselves+3.765
Neg logit contributions
pron-5.202
les-5.036
lights-4.742
ork-4.736
adoes-4.467
Token children
Token ID1751
Feature activation+6.46e-01

Pos logit contributions
 least+1.550
oh+1.238
 ourselves+1.160
 very+1.155
 cle+1.148
Neg logit contributions
les-1.784
ork-1.672
lights-1.669
pron-1.544
adoes-1.483
Token.
Token ID13
Feature activation+3.82e-01

Pos logit contributions
 least+1.127
oh+0.895
 ourselves+0.864
 cle+0.858
 very+0.846
Neg logit contributions
les-1.000
ork-0.974
lights-0.953
pron-0.897
adoes-0.822
Token I
Token ID314
Feature activation+1.45e+00

Pos logit contributions
 least+0.423
oh+0.283
 very+0.281
 ourselves+0.271
 cle+0.263
Neg logit contributions
les-0.827
ork-0.823
lights-0.808
pron-0.772
adoes-0.737
Token love
Token ID1842
Feature activation+5.86e-01

Pos logit contributions
 least+1.524
 ourselves+1.119
oh+1.064
 oh+1.060
 cle+1.006
Neg logit contributions
les-3.153
ork-3.124
pron-2.917
lights-2.879
le-2.617
Token
Token ID198
Feature activation+6.54e-01

Pos logit contributions
les+0.344
ork+0.342
lights+0.333
pron+0.319
adoes+0.299
Neg logit contributions
 least-0.239
 very-0.174
oh-0.174
 ourselves-0.170
 cle-0.163
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
 least+1.069
oh+0.817
 cle+0.791
 ourselves+0.786
 very+0.782
Neg logit contributions
ork-1.110
les-1.105
lights-1.066
pron-1.000
lf-0.932
Token"
Token ID1
Feature activation-1.92e-02

Pos logit contributions
 least+2.043
 cle+1.606
 very+1.585
oh+1.581
 ourselves+1.567
Neg logit contributions
les-1.334
ork-1.334
lights-1.264
pron-1.191
lf-1.099
TokenYou
Token ID1639
Feature activation+9.26e-01

Pos logit contributions
les+0.042
ork+0.041
lights+0.041
pron+0.039
adoes+0.037
Neg logit contributions
 least-0.022
oh-0.015
 very-0.015
 ourselves-0.015
 cle-0.014
Token're
Token ID821
Feature activation+2.06e+00

Pos logit contributions
 least+0.759
oh+0.476
 ourselves+0.446
 oh+0.397
 very+0.393
Neg logit contributions
les-2.251
ork-2.199
lights-2.136
pron-2.096
adoes-1.989
Token both
Token ID1111
Feature activation+3.14e+00

Pos logit contributions
 least+2.002
oh+1.456
 am+1.186
 oh+1.183
 cle+1.177
Neg logit contributions
pron-4.328
ork-4.302
les-4.221
lights-4.184
lf-4.015
Token very
Token ID845
Feature activation+1.03e+00

Pos logit contributions
 least+4.628
oh+3.962
 ourselves+3.425
 am+3.226
 Wait+3.170
Neg logit contributions
pron-5.212
les-4.969
ork-4.870
lights-4.806
adoes-4.524
Token lovely
Token ID14081
Feature activation+6.26e-01

Pos logit contributions
 least+1.643
oh+1.201
 ourselves+1.181
 cle+1.147
 very+1.129
Neg logit contributions
les-1.772
ork-1.745
lights-1.692
pron-1.637
lf-1.496
Token children
Token ID1751
Feature activation+1.13e+00

Pos logit contributions
 least+0.960
oh+0.735
 ourselves+0.696
 very+0.695
 cle+0.690
Neg logit contributions
les-1.129
ork-1.092
lights-1.088
pron-1.005
adoes-0.962
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
 least+1.853
oh+1.437
 cle+1.390
 ourselves+1.367
 oh+1.340
Neg logit contributions
les-1.888
ork-1.796
lights-1.781
pron-1.671
lf-1.545
Token I
Token ID314
Feature activation+1.22e+00

Pos logit contributions
 least+0.248
oh+0.164
 very+0.161
 ourselves+0.156
 cle+0.150
Neg logit contributions
les-0.507
ork-0.505
lights-0.497
pron-0.475
adoes-0.453
Token said
Token ID531
Feature activation+4.68e-01

Pos logit contributions
 least+0.795
 ourselves+0.589
oh+0.589
 very+0.554
 cle+0.540
Neg logit contributions
les-1.192
ork-1.168
lights-1.140
pron-1.097
lf-1.021
Token,
Token ID11
Feature activation-5.34e-01

Pos logit contributions
 least+0.684
oh+0.503
 ourselves+0.486
 very+0.481
 cle+0.474
Neg logit contributions
les-0.876
ork-0.870
lights-0.847
pron-0.801
adoes-0.753
Token "
Token ID366
Feature activation+3.21e-02

Pos logit contributions
les+0.984
ork+0.968
lights+0.952
pron+0.877
adoes+0.844
Neg logit contributions
 least-0.778
 very-0.608
oh-0.594
 ourselves-0.570
 cle-0.559
TokenYou
Token ID1639
Feature activation+6.84e-01

Pos logit contributions
 least+0.034
oh+0.023
 very+0.022
 ourselves+0.022
 cle+0.020
Neg logit contributions
les-0.072
ork-0.072
lights-0.071
pron-0.068
adoes-0.064
Token are
Token ID389
Feature activation+2.39e+00

Pos logit contributions
 least+0.573
oh+0.349
 ourselves+0.325
 very+0.304
 cle+0.292
Neg logit contributions
les-1.665
ork-1.642
lights-1.599
pron-1.561
adoes-1.476
Token both
Token ID1111
Feature activation+3.11e+00

Pos logit contributions
 least+2.900
oh+2.258
'm+1.955
 oh+1.889
 am+1.888
Neg logit contributions
pron-4.494
les-4.391
ork-4.363
lights-4.315
lf-4.201
Token winners
Token ID14591
Feature activation+8.90e-01

Pos logit contributions
 least+4.948
oh+4.045
 ourselves+3.475
 Wait+3.412
Pro+3.299
Neg logit contributions
pron-5.119
les-4.811
ork-4.697
lights-4.628
adoes-4.340
Token.
Token ID13
Feature activation+3.18e-01

Pos logit contributions
 least+1.416
oh+1.091
 ourselves+1.051
 cle+1.048
 very+1.033
Neg logit contributions
les-1.529
ork-1.470
lights-1.466
pron-1.366
adoes-1.275
Token You
Token ID921
Feature activation+9.10e-01

Pos logit contributions
 least+0.332
oh+0.216
 very+0.213
 ourselves+0.206
 cle+0.198
Neg logit contributions
les-0.712
ork-0.710
lights-0.697
pron-0.667
adoes-0.637
Token are
Token ID389
Feature activation+1.39e+00

Pos logit contributions
 least+0.901
oh+0.621
 ourselves+0.592
 oh+0.560
 very+0.546
Neg logit contributions
les-2.060
ork-2.014
lights-1.941
pron-1.901
adoes-1.791
Token both
Token ID1111
Feature activation+2.64e+00

Pos logit contributions
 least+1.681
oh+1.296
 am+1.139
 ourselves+1.126
'm+1.125
Neg logit contributions
les-2.682
ork-2.656
pron-2.631
lights-2.612
lf-2.453
Token
Token ID198
Feature activation-1.64e-01

Pos logit contributions
 least+0.076
oh+0.057
 very+0.056
 ourselves+0.056
 cle+0.054
Neg logit contributions
les-0.093
ork-0.093
lights-0.091
pron-0.086
adoes-0.081
Token"
Token ID1
Feature activation+3.02e-02

Pos logit contributions
les+0.272
ork+0.270
lights+0.266
pron+0.249
adoes+0.234
Neg logit contributions
 least-0.270
oh-0.213
 very-0.209
 ourselves-0.208
 am-0.199
TokenWow
Token ID22017
Feature activation+1.17e+00

Pos logit contributions
 least+0.039
oh+0.028
 very+0.027
 ourselves+0.027
 cle+0.026
Neg logit contributions
les-0.061
ork-0.061
lights-0.060
pron-0.057
adoes-0.054
Token,
Token ID11
Feature activation+6.50e-01

Pos logit contributions
 least+1.854
 cle+1.358
oh+1.349
 very+1.316
 ourselves+1.308
Neg logit contributions
les-2.090
ork-2.003
lights-1.953
pron-1.808
lf-1.723
Token you
Token ID345
Feature activation+1.19e+00

Pos logit contributions
 least+0.931
oh+0.706
 ourselves+0.676
 very+0.675
 cle+0.654
Neg logit contributions
ork-1.193
les-1.192
lights-1.159
pron-1.090
lf-1.031
Token are
Token ID389
Feature activation+3.00e+00

Pos logit contributions
 least+1.306
oh+0.906
 ourselves+0.906
 oh+0.851
 very+0.842
Neg logit contributions
les-2.596
ork-2.517
lights-2.396
pron-2.381
lf-2.226
Token very
Token ID845
Feature activation+1.37e+00

Pos logit contributions
 least+3.199
oh+2.202
'm+1.823
 oh+1.767
 am+1.763
Neg logit contributions
les-6.266
ork-6.207
lf-6.182
pron-6.053
lights-6.037
Token good
Token ID922
Feature activation+8.59e-01

Pos logit contributions
 least+2.163
oh+1.530
 ourselves+1.498
 oh+1.449
 cle+1.423
Neg logit contributions
les-2.407
ork-2.405
lights-2.291
pron-2.240
lf-2.131
Token helpers
Token ID49385
Feature activation+9.58e-01

Pos logit contributions
 least+1.418
oh+1.113
 ourselves+1.062
 very+1.061
 cle+1.054
Neg logit contributions
les-1.453
ork-1.395
lights-1.365
pron-1.267
adoes-1.214
Token,"
Token ID553
Feature activation+1.52e+00

Pos logit contributions
 least+1.453
oh+1.078
 cle+1.069
 ourselves+1.051
 very+1.048
Neg logit contributions
les-1.716
ork-1.669
lights-1.634
pron-1.532
lf-1.467
Token the
Token ID262
Feature activation-9.22e-02

Pos logit contributions
 least+2.479
oh+1.954
 ourselves+1.837
 cle+1.780
 am+1.777
Neg logit contributions
les-2.591
lights-2.537
ork-2.448
pron-2.256
lf-2.186
Token
Token ID198
Feature activation+1.92e-02

Pos logit contributions
les+0.958
ork+0.938
lights+0.921
pron+0.885
adoes+0.827
Neg logit contributions
 least-0.659
 very-0.509
oh-0.492
 ourselves-0.475
 am-0.463
Token
Token ID198
Feature activation+5.76e-01

Pos logit contributions
 least+0.028
oh+0.021
 very+0.021
 ourselves+0.021
 cle+0.020
Neg logit contributions
les-0.035
ork-0.035
lights-0.034
pron-0.032
adoes-0.030
Token"
Token ID1
Feature activation+9.10e-01

Pos logit contributions
 least+1.018
oh+0.778
 very+0.773
 cle+0.768
 ourselves+0.754
Neg logit contributions
ork-0.912
les-0.904
lights-0.886
pron-0.822
lf-0.766
TokenYou
Token ID1639
Feature activation-8.39e-02

Pos logit contributions
 least+1.193
oh+0.851
 very+0.825
 ourselves+0.811
 cle+0.774
Neg logit contributions
les-1.794
ork-1.784
lights-1.765
pron-1.723
lf-1.585
Token are
Token ID389
Feature activation+2.22e+00

Pos logit contributions
les+0.200
ork+0.200
lights+0.197
pron+0.189
adoes+0.179
Neg logit contributions
 least-0.079
oh-0.047
 very-0.046
 ourselves-0.045
 cle-0.042
Token also
Token ID635
Feature activation+2.97e+00

Pos logit contributions
 least+2.582
oh+1.993
 ourselves+1.681
'm+1.661
 am+1.645
Neg logit contributions
ork-4.333
les-4.255
pron-4.217
lights-4.132
lf-4.119
Token very
Token ID845
Feature activation+1.00e+00

Pos logit contributions
 least+4.259
oh+3.424
 ourselves+3.089
 am+2.788
 Wait+2.749
Neg logit contributions
pron-5.290
ork-5.076
les-4.970
lf-4.749
lights-4.743
Token kind
Token ID1611
Feature activation+3.29e-01

Pos logit contributions
 least+1.478
oh+1.048
 ourselves+1.032
 very+0.975
 cle+0.974
Neg logit contributions
les-1.860
ork-1.855
lights-1.772
pron-1.731
lf-1.622
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
 least+0.558
oh+0.437
 very+0.424
 ourselves+0.422
 cle+0.414
Neg logit contributions
les-0.536
ork-0.528
lights-0.518
pron-0.480
adoes-0.453
Token careful
Token ID8161
Feature activation+2.19e-01

Pos logit contributions
 least+2.647
oh+1.997
 ourselves+1.878
 am+1.799
 very+1.774
Neg logit contributions
ork-2.643
les-2.622
pron-2.621
lights-2.566
lf-2.373
Token,"
Token ID553
Feature activation+1.23e+00

Pos logit contributions
 least+0.336
oh+0.254
 very+0.248
 ourselves+0.246
 cle+0.241
Neg logit contributions
les-0.389
ork-0.387
lights-0.379
pron-0.358
adoes-0.335
Token
Token ID198
Feature activation+2.98e-01

Pos logit contributions
les+0.407
ork+0.400
lights+0.395
pron+0.376
adoes+0.354
Neg logit contributions
 least-0.305
oh-0.233
 very-0.231
 ourselves-0.223
 am-0.219
TokenS
Token ID50
Feature activation-4.60e-01

Pos logit contributions
 least+0.595
oh+0.478
 ourselves+0.473
 very+0.471
 cle+0.462
Neg logit contributions
ork-0.395
les-0.394
lights-0.381
pron-0.352
adoes-0.321
Tokenue
Token ID518
Feature activation-3.30e-01

Pos logit contributions
les+0.450
ork+0.439
lights+0.437
pron+0.394
adoes+0.345
Neg logit contributions
 least-1.062
oh-0.911
 very-0.884
 ourselves-0.883
 am-0.870
Token says
Token ID1139
Feature activation+7.58e-01

Pos logit contributions
ork+0.751
lights+0.741
les+0.740
pron+0.711
adoes+0.671
Neg logit contributions
 least-0.339
oh-0.223
 very-0.217
 ourselves-0.203
 cle-0.197
Token,
Token ID11
Feature activation+2.98e-01

Pos logit contributions
 least+1.181
 ourselves+0.852
oh+0.850
 cle+0.811
 very+0.807
Neg logit contributions
ork-1.375
les-1.366
lights-1.319
pron-1.252
adoes-1.161
Token "
Token ID366
Feature activation+2.97e+00

Pos logit contributions
 least+0.497
oh+0.383
 ourselves+0.378
 very+0.372
 cle+0.363
Neg logit contributions
les-0.490
ork-0.487
lights-0.475
pron-0.453
adoes-0.415
TokenYes
Token ID5297
Feature activation+3.86e-01

Pos logit contributions
 least+4.685
 cle+3.089
 ourselves+3.066
 kidding+3.002
 very+2.832
Neg logit contributions
ork-5.418
les-5.412
pron-5.183
lights-4.961
pool-4.600
Token,
Token ID11
Feature activation-2.04e-01

Pos logit contributions
 least+0.550
oh+0.404
 very+0.394
 ourselves+0.394
 cle+0.386
Neg logit contributions
les-0.725
ork-0.722
lights-0.706
pron-0.668
adoes-0.626
Token I
Token ID314
Feature activation+1.75e-01

Pos logit contributions
les+0.386
ork+0.383
lights+0.375
pron+0.355
adoes+0.333
Neg logit contributions
 least-0.290
oh-0.217
 very-0.216
 ourselves-0.211
 cle-0.203
Token want
Token ID765
Feature activation+5.62e-01

Pos logit contributions
 least+0.218
oh+0.153
 ourselves+0.149
 very+0.148
 cle+0.141
Neg logit contributions
les-0.363
ork-0.363
lights-0.353
pron-0.339
adoes-0.317
Token to
Token ID284
Feature activation-6.85e-01

Pos logit contributions
 least+0.906
oh+0.708
 ourselves+0.676
 very+0.669
 cle+0.665
Neg logit contributions
les-0.947
ork-0.939
lights-0.914
pron-0.856
adoes-0.797
Token on
Token ID319
Feature activation+3.91e-01

Pos logit contributions
 least+0.078
oh+0.061
 very+0.060
 ourselves+0.059
 cle+0.058
Neg logit contributions
les-0.078
ork-0.078
lights-0.076
pron-0.072
adoes-0.066
Token the
Token ID262
Feature activation+1.70e+00

Pos logit contributions
 least+0.527
oh+0.390
 very+0.370
 ourselves+0.366
 cle+0.363
Neg logit contributions
ork-0.762
les-0.760
lights-0.744
pron-0.699
adoes-0.665
Token tall
Token ID7331
Feature activation+1.29e-01

Pos logit contributions
 least+2.958
oh+2.375
 ourselves+2.241
'm+2.213
 am+2.213
Neg logit contributions
les-2.731
ork-2.705
lights-2.507
pron-2.394
lf-2.305
Token rail
Token ID6787
Feature activation+1.90e-01

Pos logit contributions
 least+0.213
oh+0.166
 very+0.163
 ourselves+0.162
 cle+0.157
Neg logit contributions
les-0.214
ork-0.212
lights-0.207
pron-0.195
adoes-0.181
Token near
Token ID1474
Feature activation+8.46e-01

Pos logit contributions
 least+0.318
oh+0.250
 very+0.245
 ourselves+0.244
 cle+0.238
Neg logit contributions
les-0.307
ork-0.305
lights-0.298
pron-0.279
adoes-0.259
Token the
Token ID262
Feature activation+2.93e+00

Pos logit contributions
 least+1.245
oh+0.952
 cle+0.899
 very+0.882
 ourselves+0.874
Neg logit contributions
ork-1.544
les-1.522
lights-1.494
pron-1.381
adoes-1.342
Token pond
Token ID16723
Feature activation-5.12e-02

Pos logit contributions
 least+5.428
 oh+4.669
oh+4.567
 ourselves+4.566
 cle+4.553
Neg logit contributions
ork-4.276
les-4.002
lights-3.630
pron-3.265
lf-3.178
Token.
Token ID13
Feature activation-8.67e-01

Pos logit contributions
les+0.084
ork+0.084
lights+0.082
pron+0.077
adoes+0.072
Neg logit contributions
 least-0.084
oh-0.066
 very-0.065
 ourselves-0.064
 cle-0.062
Token
Token ID198
Feature activation-1.01e+00

Pos logit contributions
les+1.658
ork+1.623
lights+1.578
pron+1.529
adoes+1.407
Neg logit contributions
 least-1.233
 very-0.954
oh-0.932
 ourselves-0.912
 so-0.886
Token
Token ID198
Feature activation-1.36e+00

Pos logit contributions
les+2.081
lights+2.035
ork+1.979
pron+1.910
adoes+1.872
Neg logit contributions
 least-1.232
 very-0.981
oh-0.963
 am-0.900
 ourselves-0.872
TokenThen
Token ID6423
Feature activation-2.63e-02

Pos logit contributions
les+2.352
lights+2.321
ork+2.196
pron+2.088
adoes+1.989
Neg logit contributions
 least-2.096
oh-1.804
 very-1.754
 ourselves-1.638
 oh-1.628
Token z
Token ID1976
Feature activation-7.71e-01

Pos logit contributions
 least+0.100
oh+0.077
 very+0.076
 ourselves+0.075
 cle+0.073
Neg logit contributions
les-0.106
ork-0.106
lights-0.103
pron-0.097
adoes-0.091
Tokenig
Token ID328
Feature activation-5.02e-01

Pos logit contributions
les+1.184
lights+1.161
ork+1.157
pron+1.072
adoes+1.011
Neg logit contributions
 least-1.369
oh-1.107
 very-1.058
 am-1.045
 ourselves-1.038
Tokenzag
Token ID50183
Feature activation-8.56e-02

Pos logit contributions
les+0.799
ork+0.794
lights+0.777
pron+0.738
adoes+0.682
Neg logit contributions
 least-0.840
oh-0.673
 very-0.670
 ourselves-0.642
 am-0.630
Token path
Token ID3108
Feature activation+1.24e-01

Pos logit contributions
les+0.143
ork+0.142
lights+0.138
pron+0.131
adoes+0.121
Neg logit contributions
 least-0.139
oh-0.108
 very-0.107
 ourselves-0.105
 cle-0.102
Token on
Token ID319
Feature activation+4.92e-02

Pos logit contributions
 least+0.193
oh+0.147
 very+0.144
 ourselves+0.144
 cle+0.139
Neg logit contributions
les-0.218
ork-0.216
lights-0.212
pron-0.200
adoes-0.186
Token the
Token ID262
Feature activation+2.89e+00

Pos logit contributions
 least+0.062
oh+0.044
 very+0.043
 ourselves+0.043
 cle+0.041
Neg logit contributions
les-0.101
ork-0.100
lights-0.098
pron-0.094
adoes-0.088
Token grass
Token ID8701
Feature activation+1.62e-01

Pos logit contributions
 least+6.019
oh+5.018
'm+4.952
 ourselves+4.941
 oh+4.922
Neg logit contributions
ork-3.744
les-3.742
lights-3.348
lf-3.062
pron-3.040
Token.
Token ID13
Feature activation-1.92e-01

Pos logit contributions
 least+0.273
oh+0.213
 very+0.210
 ourselves+0.209
 cle+0.204
Neg logit contributions
les-0.261
ork-0.260
lights-0.253
pron-0.237
adoes-0.220
Token She
Token ID1375
Feature activation+2.44e-02

Pos logit contributions
les+0.403
ork+0.397
lights+0.393
pron+0.376
adoes+0.352
Neg logit contributions
 least-0.232
oh-0.164
 very-0.163
 ourselves-0.157
 am-0.150
Token thinks
Token ID6834
Feature activation+1.72e-01

Pos logit contributions
 least+0.032
oh+0.023
 very+0.022
 ourselves+0.022
 cle+0.021
Neg logit contributions
les-0.049
ork-0.049
lights-0.048
pron-0.046
adoes-0.043
Token it
Token ID340
Feature activation+6.09e-01

Pos logit contributions
 least+0.350
oh+0.287
 very+0.281
 ourselves+0.281
 cle+0.274
Neg logit contributions
les-0.219
ork-0.218
lights-0.211
pron-0.194
adoes-0.175
Token spark
Token ID9009
Feature activation-2.73e-01

Pos logit contributions
 least+1.247
oh+0.969
 ourselves+0.939
 very+0.909
 am+0.905
Neg logit contributions
les-1.275
lights-1.270
ork-1.260
pron-1.181
lf-1.088
Tokenly
Token ID306
Feature activation+6.89e-02

Pos logit contributions
ork+0.260
les+0.250
lights+0.244
pron+0.228
adoes+0.193
Neg logit contributions
 least-0.634
oh-0.539
 very-0.535
 ourselves-0.529
 cle-0.519
Tokenâ
Token ID22940
Feature activation+6.04e-01

Pos logit contributions
 least+0.114
oh+0.089
 very+0.087
 ourselves+0.087
 cle+0.084
Neg logit contributions
les-0.113
ork-0.112
lights-0.110
pron-0.103
adoes-0.096
Token
Token ID26391
Feature activation-2.69e-01

Pos logit contributions
 least+0.982
 very+0.760
 ourselves+0.745
oh+0.739
 cle+0.731
Neg logit contributions
les-1.020
ork-1.012
lights-0.965
pron-0.930
adoes-0.853
Token
Token ID447
Feature activation+5.57e-01

Pos logit contributions
les+0.441
ork+0.432
lights+0.427
pron+0.403
adoes+0.373
Neg logit contributions
 least-0.446
oh-0.355
 very-0.348
 ourselves-0.344
 am-0.332
Token
Token ID251
Feature activation+2.88e+00

Pos logit contributions
 least+1.159
oh+0.940
 ourselves+0.937
 very+0.934
 cle+0.925
Neg logit contributions
ork-0.691
les-0.687
lights-0.651
pron-0.614
adoes-0.538
Tokenjust
Token ID3137
Feature activation+1.39e+00

Pos logit contributions
 least+5.535
 cle+4.523
 ourselves+4.320
oh+4.228
 kidding+4.159
Neg logit contributions
les-4.130
ork-4.055
lights-3.917
pron-3.514
anne-3.034
Token like
Token ID588
Feature activation+1.16e+00

Pos logit contributions
 least+2.393
oh+1.929
 ourselves+1.920
 am+1.883
 cle+1.849
Neg logit contributions
les-2.152
lights-2.026
ork-2.019
pron-1.932
lf-1.731
Token Mam
Token ID29926
Feature activation+1.38e+00

Pos logit contributions
 least+1.722
oh+1.312
 ourselves+1.286
 cle+1.206
 very+1.196
Neg logit contributions
ork-2.090
les-2.066
lights-2.033
pron-1.914
lf-1.785
Tokenma
Token ID2611
Feature activation+5.72e-01

Pos logit contributions
 least+2.296
 ourselves+1.881
 cle+1.778
 very+1.737
oh+1.715
Neg logit contributions
les-2.334
ork-2.151
lights-2.080
pron-2.059
lf-1.878
Token liked
Token ID8288
Feature activation+7.82e-01

Pos logit contributions
 least+0.848
 ourselves+0.640
oh+0.637
 very+0.606
 cle+0.600
Neg logit contributions
les-1.043
ork-1.023
lights-0.996
pron-0.950
adoes-0.876
Token Tom
Token ID4186
Feature activation+1.21e+00

Pos logit contributions
 least+1.012
oh+0.788
 very+0.765
 ourselves+0.764
 cle+0.745
Neg logit contributions
les-1.094
ork-1.072
lights-1.031
pron-0.980
adoes-0.927
Token say
Token ID910
Feature activation+8.68e-01

Pos logit contributions
 least+1.635
oh+1.248
 oh+1.223
 ourselves+1.200
 cle+1.160
Neg logit contributions
les-2.397
ork-2.202
lights-2.174
pron-2.038
adoes-1.927
Token,
Token ID11
Feature activation-4.57e-01

Pos logit contributions
 least+1.375
oh+1.004
 ourselves+0.961
 cle+0.947
 very+0.922
Neg logit contributions
les-1.558
ork-1.551
lights-1.508
pron-1.402
adoes-1.326
Token "
Token ID366
Feature activation+1.54e+00

Pos logit contributions
les+0.823
ork+0.820
lights+0.804
pron+0.745
adoes+0.709
Neg logit contributions
 least-0.684
 very-0.536
oh-0.524
 ourselves-0.503
 cle-0.496
TokenYou
Token ID1639
Feature activation+1.30e+00

Pos logit contributions
 least+1.928
 cle+1.213
 very+1.198
 ourselves+1.144
oh+1.132
Neg logit contributions
les-3.236
ork-3.181
lights-3.087
pron-3.061
lf-2.831
Token are
Token ID389
Feature activation+2.88e+00

Pos logit contributions
 least+1.073
oh+0.641
 ourselves+0.628
 oh+0.559
 cle+0.521
Neg logit contributions
les-3.170
ork-3.083
lights-2.961
pron-2.941
adoes-2.768
Token welcome
Token ID7062
Feature activation+3.52e-01

Pos logit contributions
 least+3.746
oh+2.742
'm+2.445
 oh+2.420
 am+2.376
Neg logit contributions
les-5.131
pron-5.087
lf-5.081
ork-5.053
lights-5.050
Token!"
Token ID2474
Feature activation+2.30e-01

Pos logit contributions
 least+0.515
oh+0.381
 very+0.372
 ourselves+0.371
 cle+0.363
Neg logit contributions
les-0.647
ork-0.644
lights-0.627
pron-0.594
adoes-0.558
Token
Token ID198
Feature activation+4.94e-01

Pos logit contributions
 least+0.366
oh+0.282
 very+0.274
 ourselves+0.272
 cle+0.267
Neg logit contributions
les-0.399
ork-0.396
lights-0.388
pron-0.364
adoes-0.339
Token
Token ID198
Feature activation+5.75e-01

Pos logit contributions
 least+0.790
oh+0.602
 cle+0.578
 ourselves+0.577
 very+0.577
Neg logit contributions
ork-0.854
les-0.853
lights-0.827
pron-0.773
lf-0.718
TokenThen
Token ID6423
Feature activation+8.06e-01

Pos logit contributions
 least+0.887
oh+0.655
 very+0.637
 cle+0.636
 ourselves+0.630
Neg logit contributions
les-1.036
ork-1.034
lights-1.003
pron-0.953
lf-0.885
Token
Token ID198
Feature activation+7.35e-02

Pos logit contributions
les+0.597
ork+0.586
lights+0.581
pron+0.555
adoes+0.525
Neg logit contributions
 least-0.432
 very-0.326
oh-0.325
 ourselves-0.317
 am-0.305
Token"
Token ID1
Feature activation+1.02e+00

Pos logit contributions
 least+0.128
oh+0.101
 very+0.099
 ourselves+0.099
 cle+0.096
Neg logit contributions
les-0.116
ork-0.115
lights-0.112
pron-0.105
adoes-0.097
TokenWow
Token ID22017
Feature activation+5.55e-01

Pos logit contributions
 least+1.644
oh+1.219
 very+1.208
 ourselves+1.198
 cle+1.178
Neg logit contributions
les-1.742
ork-1.729
lights-1.700
pron-1.612
adoes-1.480
Token,
Token ID11
Feature activation+1.63e-01

Pos logit contributions
 least+0.805
oh+0.591
 very+0.576
 ourselves+0.573
 cle+0.570
Neg logit contributions
les-1.036
ork-1.019
lights-0.991
pron-0.937
adoes-0.882
Token they
Token ID484
Feature activation+1.64e+00

Pos logit contributions
 least+0.229
oh+0.171
 very+0.166
 ourselves+0.165
 cle+0.159
Neg logit contributions
les-0.309
ork-0.308
lights-0.301
pron-0.286
adoes-0.268
Token are
Token ID389
Feature activation+2.87e+00

Pos logit contributions
 least+1.295
 oh+0.843
 ourselves+0.753
oh+0.751
 very+0.694
Neg logit contributions
les-4.049
ork-3.920
lights-3.657
pron-3.646
ilt-3.484
Token so
Token ID523
Feature activation+8.15e-01

Pos logit contributions
 least+4.413
oh+3.508
 oh+3.176
'm+3.131
 am+3.113
Neg logit contributions
lights-4.862
les-4.818
ork-4.754
lf-4.539
adoes-4.227
Token pretty
Token ID2495
Feature activation-2.59e-01

Pos logit contributions
 least+1.435
oh+1.081
 ourselves+1.054
 cle+1.041
 oh+1.041
Neg logit contributions
ork-1.294
les-1.292
lights-1.257
pron-1.155
lf-1.081
Token,"
Token ID553
Feature activation+7.64e-01

Pos logit contributions
ork+0.500
les+0.495
lights+0.485
pron+0.466
adoes+0.434
Neg logit contributions
 least-0.350
oh-0.260
 very-0.259
 ourselves-0.254
 cle-0.238
Token Anna
Token ID11735
Feature activation+6.61e-01

Pos logit contributions
 least+1.395
oh+1.116
 ourselves+1.065
 very+1.057
 cle+1.049
Neg logit contributions
les-1.147
lights-1.118
ork-1.116
pron-1.033
lf-0.952
Token says
Token ID1139
Feature activation+2.64e-01

Pos logit contributions
 least+0.660
oh+0.418
 ourselves+0.409
 very+0.392
 oh+0.376
Neg logit contributions
les-1.549
ork-1.503
lights-1.470
pron-1.399
adoes-1.340
Token and
Token ID290
Feature activation+3.08e-01

Pos logit contributions
les+0.010
ork+0.010
lights+0.010
pron+0.009
adoes+0.008
Neg logit contributions
 least-0.010
oh-0.007
 very-0.007
 ourselves-0.007
 cle-0.007
Token says
Token ID1139
Feature activation+7.17e-02

Pos logit contributions
 least+0.407
oh+0.303
 very+0.289
 ourselves+0.289
 cle+0.274
Neg logit contributions
les-0.610
ork-0.600
lights-0.591
pron-0.560
adoes-0.530
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
 least+0.100
oh+0.074
 very+0.072
 ourselves+0.072
 cle+0.069
Neg logit contributions
les-0.137
ork-0.137
lights-0.134
pron-0.127
adoes-0.119
Token "
Token ID366
Feature activation+7.33e-01

Pos logit contributions
ork+2.216
les+2.215
lights+2.173
adoes+1.949
pron+1.942
Neg logit contributions
 least-1.561
 very-1.268
oh-1.159
 so-1.120
 cle-1.091
TokenYou
Token ID1639
Feature activation+9.47e-01

Pos logit contributions
 least+0.668
oh+0.367
 very+0.365
 ourselves+0.353
 cle+0.349
Neg logit contributions
les-1.761
ork-1.730
lights-1.703
pron-1.679
lf-1.566
Token are
Token ID389
Feature activation+2.86e+00

Pos logit contributions
 least+0.765
oh+0.456
 ourselves+0.431
 very+0.385
 oh+0.380
Neg logit contributions
les-2.329
ork-2.270
lights-2.203
pron-2.184
adoes-2.035
Token the
Token ID262
Feature activation+1.62e+00

Pos logit contributions
 least+3.502
oh+2.648
 oh+2.427
'm+2.334
 Wait+2.329
Neg logit contributions
pron-5.327
les-5.201
lights-5.129
ork-5.112
lf-4.992
Token best
Token ID1266
Feature activation+4.02e-01

Pos logit contributions
 least+1.430
oh+1.006
 ourselves+0.908
 am+0.787
 oh+0.774
Neg logit contributions
les-3.705
ork-3.663
pron-3.658
lights-3.600
adoes-3.356
Token friend
Token ID1545
Feature activation+5.59e-01

Pos logit contributions
 least+0.641
oh+0.491
 ourselves+0.480
 very+0.475
 cle+0.466
Neg logit contributions
les-0.698
ork-0.694
lights-0.676
pron-0.645
adoes-0.595
Token I
Token ID314
Feature activation+1.23e+00

Pos logit contributions
 least+0.701
oh+0.486
 ourselves+0.479
 very+0.466
 cle+0.461
Neg logit contributions
les-1.140
ork-1.126
lights-1.101
pron-1.054
lf-0.990
Token ever
Token ID1683
Feature activation+1.18e+00

Pos logit contributions
 least+1.588
 ourselves+1.179
oh+1.128
 cle+1.111
 oh+1.086
Neg logit contributions
les-2.457
ork-2.418
pron-2.292
lights-2.222
lf-2.072
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
 least+0.169
oh+0.133
 very+0.130
 ourselves+0.129
 cle+0.126
Neg logit contributions
les-0.164
ork-0.163
lights-0.159
pron-0.149
adoes-0.138
Token say
Token ID910
Feature activation+5.56e-01

Pos logit contributions
 least+1.830
oh+1.524
 ourselves+1.400
 oh+1.394
 very+1.367
Neg logit contributions
les-1.725
ork-1.652
lights-1.645
pron-1.543
adoes-1.438
Token,
Token ID11
Feature activation-6.83e-01

Pos logit contributions
 least+0.906
oh+0.691
 ourselves+0.660
 very+0.651
 cle+0.648
Neg logit contributions
les-0.958
ork-0.954
lights-0.930
pron-0.878
adoes-0.811
Token "
Token ID366
Feature activation+1.37e+00

Pos logit contributions
les+1.283
ork+1.260
lights+1.235
pron+1.131
adoes+1.098
Neg logit contributions
 least-0.978
 very-0.781
oh-0.750
 ourselves-0.725
 cle-0.704
TokenYou
Token ID1639
Feature activation+1.17e+00

Pos logit contributions
 least+1.662
 cle+1.043
 very+1.022
oh+1.008
 ourselves+0.975
Neg logit contributions
les-2.870
ork-2.859
pron-2.810
lights-2.798
lf-2.546
Token're
Token ID821
Feature activation+2.85e+00

Pos logit contributions
 least+1.017
oh+0.638
 ourselves+0.614
 oh+0.548
 cle+0.535
Neg logit contributions
les-2.791
ork-2.718
pron-2.633
lights-2.618
adoes-2.422
Token welcome
Token ID7062
Feature activation+6.16e-01

Pos logit contributions
 least+3.117
oh+2.214
 oh+1.959
 cle+1.901
 am+1.842
Neg logit contributions
pron-5.720
les-5.449
ork-5.436
lights-5.355
lf-5.295
Token,
Token ID11
Feature activation-2.02e-01

Pos logit contributions
 least+0.928
oh+0.692
 ourselves+0.669
 cle+0.666
 very+0.664
Neg logit contributions
les-1.104
ork-1.102
lights-1.076
pron-1.016
adoes-0.952
Token Mom
Token ID11254
Feature activation+3.73e-01

Pos logit contributions
les+0.378
ork+0.377
lights+0.364
pron+0.346
adoes+0.324
Neg logit contributions
 least-0.297
 very-0.220
oh-0.220
 ourselves-0.217
 cle-0.207
Tokenmy
Token ID1820
Feature activation+3.00e-01

Pos logit contributions
 least+0.619
oh+0.481
 ourselves+0.471
 very+0.470
 cle+0.465
Neg logit contributions
les-0.610
ork-0.595
lights-0.591
pron-0.555
adoes-0.513
Token.
Token ID13
Feature activation+9.05e-01

Pos logit contributions
 least+0.475
oh+0.364
 very+0.355
 ourselves+0.355
 cle+0.348
Neg logit contributions
les-0.515
ork-0.508
lights-0.498
pron-0.471
adoes-0.438
Token was
Token ID373
Feature activation+8.28e-01

Pos logit contributions
 least+0.810
oh+0.576
 ourselves+0.570
 very+0.548
 cle+0.543
Neg logit contributions
ork-1.154
les-1.153
lights-1.121
pron-1.047
lf-1.006
Token a
Token ID257
Feature activation+1.76e-01

Pos logit contributions
 least+0.940
oh+0.665
 ourselves+0.621
 am+0.568
 cle+0.560
Neg logit contributions
ork-1.785
les-1.764
lights-1.757
pron-1.658
adoes-1.596
Token little
Token ID1310
Feature activation+9.75e-01

Pos logit contributions
 least+0.285
oh+0.221
 ourselves+0.216
 very+0.215
 cle+0.208
Neg logit contributions
ork-0.294
les-0.294
lights-0.285
pron-0.271
adoes-0.250
Token creature
Token ID7185
Feature activation+1.97e-01

Pos logit contributions
 least+1.853
oh+1.490
 ourselves+1.474
 cle+1.414
 am+1.401
Neg logit contributions
les-1.390
ork-1.364
lights-1.319
pron-1.219
adoes-1.149
Token on
Token ID319
Feature activation-1.11e+00

Pos logit contributions
 least+0.295
oh+0.222
 ourselves+0.218
 very+0.217
 cle+0.210
Neg logit contributions
les-0.354
ork-0.352
lights-0.343
pron-0.325
adoes-0.304
Token the
Token ID262
Feature activation+2.84e+00

Pos logit contributions
les+2.549
ork+2.480
lights+2.475
pron+2.414
ravings+2.175
Neg logit contributions
 least-1.185
 very-0.849
 ourselves-0.763
oh-0.733
 so-0.718
Token rail
Token ID6787
Feature activation-1.53e-01

Pos logit contributions
 least+5.062
 am+4.295
'm+4.193
 kidding+4.187
 ourselves+4.182
Neg logit contributions
ork-4.479
les-4.390
lights-3.758
lf-3.670
pron-3.590
Token.
Token ID13
Feature activation-4.25e-01

Pos logit contributions
les+0.262
ork+0.261
lights+0.255
pron+0.242
adoes+0.225
Neg logit contributions
 least-0.242
oh-0.187
 very-0.186
 ourselves-0.181
 cle-0.174
Token It
Token ID632
Feature activation+2.57e-01

Pos logit contributions
les+0.780
ork+0.757
lights+0.754
pron+0.718
adoes+0.664
Neg logit contributions
 least-0.624
 very-0.488
oh-0.487
 ourselves-0.461
 am-0.455
Token was
Token ID373
Feature activation+9.97e-01

Pos logit contributions
 least+0.354
oh+0.257
 ourselves+0.254
 very+0.249
 cle+0.241
Neg logit contributions
les-0.496
ork-0.492
lights-0.481
pron-0.456
adoes-0.430
Token a
Token ID257
Feature activation+5.42e-01

Pos logit contributions
 least+1.304
oh+0.968
 ourselves+0.922
 am+0.849
'm+0.818
Neg logit contributions
ork-1.990
les-1.952
lights-1.927
pron-1.839
adoes-1.752
Token and
Token ID290
Feature activation-1.02e-01

Pos logit contributions
 least+1.264
oh+1.032
 ourselves+0.984
 oh+0.965
 cle+0.965
Neg logit contributions
les-1.146
ork-1.079
lights-1.062
pron-0.993
adoes-0.911
Token says
Token ID1139
Feature activation+6.31e-01

Pos logit contributions
ork+0.224
les+0.224
lights+0.220
pron+0.211
adoes+0.199
Neg logit contributions
 least-0.114
oh-0.075
 very-0.075
 ourselves-0.073
 cle-0.070
Token,
Token ID11
Feature activation-2.08e-01

Pos logit contributions
 least+0.954
oh+0.696
 ourselves+0.670
 cle+0.662
 very+0.656
Neg logit contributions
les-1.159
ork-1.157
lights-1.129
pron-1.061
adoes-0.991
Token "
Token ID366
Feature activation+5.10e-01

Pos logit contributions
les+0.369
ork+0.364
lights+0.357
pron+0.333
adoes+0.315
Neg logit contributions
 least-0.319
oh-0.247
 very-0.247
 ourselves-0.238
 cle-0.231
TokenYou
Token ID1639
Feature activation+1.50e+00

Pos logit contributions
 least+0.538
oh+0.337
 very+0.332
 ourselves+0.325
 cle+0.314
Neg logit contributions
les-1.159
ork-1.138
lights-1.127
pron-1.091
adoes-1.019
Token're
Token ID821
Feature activation+2.84e+00

Pos logit contributions
 least+1.277
oh+0.843
 ourselves+0.785
 oh+0.737
 cle+0.714
Neg logit contributions
les-3.532
ork-3.425
pron-3.306
lights-3.301
adoes-3.061
Token welcome
Token ID7062
Feature activation+7.99e-01

Pos logit contributions
 least+3.073
oh+2.164
 cle+1.974
 oh+1.965
 ourselves+1.807
Neg logit contributions
pron-5.693
ork-5.548
les-5.391
lights-5.356
lf-5.239
Token,
Token ID11
Feature activation-5.99e-01

Pos logit contributions
 least+1.211
oh+0.897
 ourselves+0.868
 cle+0.866
 very+0.855
Neg logit contributions
les-1.433
ork-1.421
lights-1.390
pron-1.302
adoes-1.225
Token my
Token ID616
Feature activation+1.30e+00

Pos logit contributions
ork+1.216
les+1.188
lights+1.150
pron+1.105
adoes+1.038
Neg logit contributions
 least-0.812
 very-0.584
 ourselves-0.582
oh-0.571
 cle-0.546
Token sweet
Token ID6029
Feature activation-8.35e-01

Pos logit contributions
 least+2.052
oh+1.600
 ourselves+1.560
 cle+1.448
 oh+1.440
Neg logit contributions
les-2.272
ork-2.235
lights-2.176
pron-2.109
adoes-1.862
Tokenies
Token ID444
Feature activation+1.43e+00

Pos logit contributions
ork+1.425
lights+1.358
les+1.350
pron+1.301
adoes+1.207
Neg logit contributions
 least-1.352
 very-1.058
oh-1.042
 ourselves-1.031
 am-1.007
Token"
Token ID1
Feature activation+1.54e+00

Pos logit contributions
 least+1.541
oh+1.163
 cle+1.158
 very+1.155
 ourselves+1.145
Neg logit contributions
les-1.279
ork-1.268
lights-1.240
pron-1.133
adoes-1.065
TokenThank
Token ID10449
Feature activation+1.47e+00

Pos logit contributions
 least+2.133
 ourselves+1.440
oh+1.434
 very+1.429
 cle+1.365
Neg logit contributions
les-2.974
ork-2.908
pron-2.894
lights-2.863
lf-2.579
Token you
Token ID345
Feature activation+1.79e+00

Pos logit contributions
 least+1.580
 cle+0.937
 very+0.900
oh+0.815
'm+0.788
Neg logit contributions
les-3.357
ork-3.310
lights-3.201
pron-3.023
lf-2.957
Token,
Token ID11
Feature activation-4.48e-01

Pos logit contributions
 least+3.060
oh+2.322
 cle+2.298
 am+2.267
 ourselves+2.236
Neg logit contributions
les-2.918
ork-2.913
lights-2.895
pron-2.543
lf-2.461
Token you
Token ID345
Feature activation+1.44e+00

Pos logit contributions
ork+0.871
les+0.867
lights+0.843
pron+0.806
adoes+0.754
Neg logit contributions
 least-0.625
 very-0.460
oh-0.459
 ourselves-0.446
 cle-0.432
Token are
Token ID389
Feature activation+2.83e+00

Pos logit contributions
 least+1.397
oh+0.823
 ourselves+0.819
 cle+0.741
 oh+0.704
Neg logit contributions
les-3.389
ork-3.301
lights-3.149
pron-3.108
adoes-2.953
Token very
Token ID845
Feature activation+1.27e+00

Pos logit contributions
 least+3.753
oh+2.386
'm+2.225
 am+2.178
 Wait+2.147
Neg logit contributions
les-5.604
pron-5.528
ork-5.415
lf-5.368
lights-5.336
Token nice
Token ID3621
Feature activation+8.76e-01

Pos logit contributions
 least+2.093
 ourselves+1.460
oh+1.457
 cle+1.424
 oh+1.399
Neg logit contributions
les-2.161
ork-2.151
lights-2.072
pron-2.033
lf-1.866
Token,"
Token ID553
Feature activation+1.37e+00

Pos logit contributions
 least+1.452
oh+1.107
 cle+1.069
 ourselves+1.066
 very+1.053
Neg logit contributions
les-1.491
ork-1.442
lights-1.420
pron-1.304
adoes-1.233
Token Lily
Token ID20037
Feature activation+3.63e-01

Pos logit contributions
 least+2.186
oh+1.724
 ourselves+1.637
 very+1.575
 oh+1.566
Neg logit contributions
les-2.351
lights-2.333
ork-2.280
pron-2.055
lf-2.019
Token says
Token ID1139
Feature activation-1.94e-01

Pos logit contributions
 least+0.453
oh+0.325
 ourselves+0.312
 very+0.311
 cle+0.297
Neg logit contributions
les-0.749
ork-0.733
lights-0.719
pron-0.684
adoes-0.648
Token two
Token ID734
Feature activation+2.83e-01

Pos logit contributions
 least+0.430
oh+0.352
 ourselves+0.346
 very+0.346
 cle+0.339
Neg logit contributions
les-0.265
ork-0.263
lights-0.255
pron-0.236
adoes-0.213
Token friends
Token ID2460
Feature activation+1.40e+00

Pos logit contributions
 least+0.482
oh+0.380
 ourselves+0.371
 very+0.368
 cle+0.357
Neg logit contributions
les-0.456
ork-0.449
lights-0.439
pron-0.411
adoes-0.382
Token,
Token ID11
Feature activation+1.51e+00

Pos logit contributions
 least+2.209
oh+1.728
 ourselves+1.724
 cle+1.713
 very+1.630
Neg logit contributions
les-2.391
ork-2.295
lights-2.201
lf-2.063
pron-2.030
Token Tony
Token ID8832
Feature activation+6.66e-01

Pos logit contributions
 least+2.700
oh+2.105
 ourselves+2.059
 cle+2.011
 very+1.908
Neg logit contributions
ork-2.258
les-2.247
lights-2.175
pron-2.066
lf-1.954
Token and
Token ID290
Feature activation+1.62e+00

Pos logit contributions
 least+1.131
 ourselves+0.869
oh+0.869
 cle+0.850
 very+0.845
Neg logit contributions
les-1.097
ork-1.063
lights-1.012
pron-0.969
lf-0.905
Token Lucy
Token ID22162
Feature activation+2.82e+00

Pos logit contributions
 least+3.337
oh+2.749
 ourselves+2.634
 cle+2.563
 very+2.505
Neg logit contributions
les-2.014
ork-1.886
lights-1.860
adoes-1.715
pron-1.700
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
 least+4.827
 ourselves+4.011
 cle+3.967
 oh+3.950
oh+3.926
Neg logit contributions
les-4.579
ork-3.946
lights-3.594
le-3.528
oom-3.469
Token were
Token ID547
Feature activation+1.31e+00

Pos logit contributions
 least+2.157
oh+1.678
 ourselves+1.677
 cle+1.608
 oh+1.521
Neg logit contributions
les-2.359
ork-2.267
lights-2.193
pron-2.123
lf-2.002
Token walking
Token ID6155
Feature activation+1.06e-01

Pos logit contributions
 least+1.535
oh+1.103
 ourselves+1.072
 am+1.028
 cle+1.008
Neg logit contributions
les-2.711
ork-2.651
lights-2.591
pron-2.535
lf-2.392
Token in
Token ID287
Feature activation+3.22e-01

Pos logit contributions
 least+0.154
oh+0.115
 very+0.113
 ourselves+0.112
 cle+0.108
Neg logit contributions
les-0.197
ork-0.196
lights-0.192
pron-0.182
adoes-0.171
Token a
Token ID257
Feature activation-4.01e-01

Pos logit contributions
 least+0.414
oh+0.302
 ourselves+0.285
 very+0.284
 cle+0.276
Neg logit contributions
les-0.643
ork-0.641
lights-0.630
pron-0.599
adoes-0.564
Token happy
Token ID3772
Feature activation-3.38e-01

Pos logit contributions
ork+1.614
les+1.597
lights+1.567
pron+1.475
lf+1.406
Neg logit contributions
 least-0.912
 very-0.680
 ourselves-0.617
oh-0.589
 so-0.588
Token here
Token ID994
Feature activation+5.66e-01

Pos logit contributions
les+0.623
ork+0.622
lights+0.601
pron+0.575
adoes+0.534
Neg logit contributions
 least-0.504
 very-0.378
oh-0.372
 ourselves-0.372
 am-0.352
Token.
Token ID13
Feature activation-4.30e-01

Pos logit contributions
 least+0.956
oh+0.752
 ourselves+0.722
 cle+0.720
 very+0.719
Neg logit contributions
ork-0.898
les-0.897
lights-0.871
pron-0.822
adoes-0.753
Token We
Token ID775
Feature activation-1.13e+00

Pos logit contributions
les+0.921
ork+0.911
lights+0.890
pron+0.852
lf+0.803
Neg logit contributions
 least-0.519
oh-0.360
 very-0.350
 ourselves-0.348
 cle-0.323
Token have
Token ID423
Feature activation+5.27e-01

Pos logit contributions
lights+2.521
ork+2.461
les+2.446
pron+2.356
lf+2.235
Neg logit contributions
 least-1.396
 very-0.966
 am-0.923
oh-0.894
'm-0.872
Token each
Token ID1123
Feature activation+2.82e+00

Pos logit contributions
 least+0.811
oh+0.636
 ourselves+0.602
 very+0.598
 cle+0.588
Neg logit contributions
ork-0.915
les-0.910
lights-0.882
pron-0.842
adoes-0.782
Token other
Token ID584
Feature activation-2.63e-01

Pos logit contributions
 least+4.571
oh+3.907
 ourselves+3.662
 cle+3.569
Pro+3.477
Neg logit contributions
les-4.902
ork-4.579
pron-4.089
lights-3.929
lf-3.861
Token and
Token ID290
Feature activation+6.18e-01

Pos logit contributions
ork+0.478
les+0.478
lights+0.465
pron+0.442
adoes+0.412
Neg logit contributions
 least-0.389
 very-0.295
oh-0.289
 ourselves-0.288
 cle-0.274
Token we
Token ID356
Feature activation-1.07e+00

Pos logit contributions
 least+0.809
oh+0.607
 very+0.565
 ourselves+0.560
 cle+0.553
Neg logit contributions
ork-1.196
les-1.194
lights-1.174
pron-1.121
adoes-1.059
Token will
Token ID481
Feature activation-3.06e-01

Pos logit contributions
lights+2.381
ork+2.350
les+2.327
pron+2.233
lf+2.118
Neg logit contributions
 least-1.303
 very-0.903
 am-0.860
oh-0.836
 are-0.809
Token make
Token ID787
Feature activation+3.33e-01

Pos logit contributions
les+0.626
ork+0.615
lights+0.609
pron+0.577
adoes+0.544
Neg logit contributions
 least-0.400
 very-0.283
oh-0.280
 ourselves-0.279
 am-0.265
Token saw
Token ID2497
Feature activation+9.42e-01

Pos logit contributions
 least+0.400
oh+0.308
 ourselves+0.303
 very+0.301
 cle+0.292
Neg logit contributions
les-0.435
ork-0.430
lights-0.420
pron-0.398
adoes-0.370
Token some
Token ID617
Feature activation+6.56e-01

Pos logit contributions
 least+1.300
oh+0.988
 ourselves+0.931
 am+0.900
 very+0.883
Neg logit contributions
les-1.812
ork-1.790
lights-1.751
pron-1.632
adoes-1.590
Token soft
Token ID2705
Feature activation-7.88e-01

Pos logit contributions
 least+1.124
oh+0.893
 ourselves+0.863
 very+0.827
 am+0.811
Neg logit contributions
les-1.074
ork-1.048
lights-1.027
pron-0.956
adoes-0.895
Token parts
Token ID3354
Feature activation+3.37e-01

Pos logit contributions
ork+1.355
les+1.313
lights+1.308
pron+1.271
lf+1.140
Neg logit contributions
 least-1.216
 very-0.990
oh-0.944
 ourselves-0.934
 am-0.912
Token on
Token ID319
Feature activation-3.15e-01

Pos logit contributions
 least+0.573
oh+0.452
 ourselves+0.439
 very+0.437
 cle+0.428
Neg logit contributions
les-0.540
ork-0.532
lights-0.523
pron-0.486
adoes-0.451
Token the
Token ID262
Feature activation+2.82e+00

Pos logit contributions
les+0.657
ork+0.648
lights+0.638
pron+0.616
adoes+0.572
Neg logit contributions
 least-0.385
 very-0.271
oh-0.267
 ourselves-0.265
 cle-0.247
Token handle
Token ID5412
Feature activation+8.03e-01

Pos logit contributions
 least+5.397
 ourselves+4.680
oh+4.507
 am+4.409
 kidding+4.311
Neg logit contributions
ork-4.154
les-3.870
lights-3.456
pron-3.447
lf-3.249
Token.
Token ID13
Feature activation+6.41e-02

Pos logit contributions
 least+1.420
oh+1.111
 ourselves+1.095
 cle+1.084
 very+1.073
Neg logit contributions
les-1.245
ork-1.235
lights-1.222
pron-1.103
adoes-1.032
Token It
Token ID632
Feature activation-1.40e-01

Pos logit contributions
 least+0.080
oh+0.057
 very+0.055
 ourselves+0.055
 cle+0.052
Neg logit contributions
les-0.132
ork-0.131
lights-0.129
pron-0.123
adoes-0.116
Token felt
Token ID2936
Feature activation+1.36e+00

Pos logit contributions
les+0.278
ork+0.277
lights+0.271
pron+0.259
adoes+0.243
Neg logit contributions
 least-0.185
oh-0.135
 very-0.134
 ourselves-0.128
 cle-0.126
Token nice
Token ID3621
Feature activation+6.86e-02

Pos logit contributions
 least+2.018
oh+1.627
 ourselves+1.494
 am+1.351
 oh+1.334
Neg logit contributions
les-2.463
ork-2.444
lights-2.396
pron-2.258
adoes-2.134
Token.
Token ID13
Feature activation-7.79e-01

Pos logit contributions
 least+0.252
oh+0.197
 very+0.192
 ourselves+0.192
 cle+0.186
Neg logit contributions
les-0.247
ork-0.245
lights-0.239
pron-0.225
adoes-0.209
Token\n
Token ID198
Feature activation+4.68e-01

Pos logit contributions
les+1.560
ork+1.527
lights+1.479
pron+1.466
lf+1.369
Neg logit contributions
 least-1.032
 very-0.786
 ourselves-0.754
oh-0.731
 am-0.713
Token\n
Token ID198
Feature activation+6.20e-01

Pos logit contributions
 least+0.739
oh+0.560
 ourselves+0.541
 very+0.541
 cle+0.540
Neg logit contributions
ork-0.816
les-0.813
lights-0.788
pron-0.741
lf-0.688
Token"
Token ID1
Feature activation+4.52e-01

Pos logit contributions
 least+1.132
oh+0.878
 very+0.866
 cle+0.864
 ourselves+0.855
Neg logit contributions
ork-0.936
les-0.933
lights-0.895
pron-0.852
lf-0.781
TokenYou
Token ID1639
Feature activation+5.41e-01

Pos logit contributions
 least+0.600
oh+0.433
 very+0.424
 ourselves+0.417
 cle+0.401
Neg logit contributions
les-0.895
ork-0.885
lights-0.870
pron-0.839
adoes-0.776
Token are
Token ID389
Feature activation+2.82e+00

Pos logit contributions
 least+0.437
oh+0.254
 ourselves+0.239
 very+0.227
 cle+0.210
Neg logit contributions
les-1.340
ork-1.317
lights-1.288
pron-1.249
adoes-1.190
Token very
Token ID845
Feature activation+1.17e+00

Pos logit contributions
 least+3.293
oh+2.485
 oh+2.134
'm+2.090
 am+2.019
Neg logit contributions
les-5.381
pron-5.328
ork-5.327
lights-5.262
lf-5.186
Token kind
Token ID1611
Feature activation+4.88e-01

Pos logit contributions
 least+1.877
oh+1.332
 ourselves+1.312
 oh+1.284
 cle+1.280
Neg logit contributions
les-2.047
ork-2.026
lights-1.943
pron-1.886
lf-1.759
Token and
Token ID290
Feature activation+1.38e+00

Pos logit contributions
 least+0.812
oh+0.630
 very+0.606
 ourselves+0.603
 cle+0.597
Neg logit contributions
les-0.817
ork-0.799
lights-0.789
pron-0.721
adoes-0.691
Token generous
Token ID14431
Feature activation+1.13e+00

Pos logit contributions
 least+2.270
oh+1.746
 ourselves+1.629
 very+1.574
 am+1.568
Neg logit contributions
les-2.199
ork-2.182
pron-2.150
lights-2.147
lf-1.926
Token,"
Token ID553
Feature activation+1.03e+00

Pos logit contributions
 least+1.817
oh+1.411
 cle+1.314
 ourselves+1.300
 very+1.294
Neg logit contributions
les-1.965
lights-1.885
ork-1.876
pron-1.686
adoes-1.656
Token was
Token ID373
Feature activation-1.42e+00

Pos logit contributions
ork+1.002
les+1.000
lights+0.991
pron+0.935
adoes+0.880
Neg logit contributions
 least-0.776
 very-0.592
oh-0.589
 ourselves-0.552
 am-0.552
Token all
Token ID477
Feature activation-9.09e-02

Pos logit contributions
les+3.190
ork+3.163
lights+3.088
pron+2.830
adoes+2.828
Neg logit contributions
 least-1.648
 very-1.334
 so-1.156
 cle-1.028
oh-1.024
Token better
Token ID1365
Feature activation-1.10e-01

Pos logit contributions
les+0.157
ork+0.157
lights+0.152
pron+0.144
adoes+0.135
Neg logit contributions
 least-0.144
 very-0.110
oh-0.110
 ourselves-0.107
 cle-0.104
Token.
Token ID13
Feature activation+9.00e-02

Pos logit contributions
les+0.193
ork+0.192
lights+0.188
pron+0.178
adoes+0.166
Neg logit contributions
 least-0.173
oh-0.132
 very-0.132
 ourselves-0.130
 cle-0.124
Token The
Token ID383
Feature activation-1.63e+00

Pos logit contributions
 least+0.136
oh+0.103
 very+0.101
 ourselves+0.100
 cle+0.097
Neg logit contributions
les-0.163
ork-0.162
lights-0.158
pron-0.150
adoes-0.140
Token charming
Token ID23332
Feature activation-3.73e+00

Pos logit contributions
ork+2.612
les+2.539
lights+2.507
pron+2.196
anne+2.139
Neg logit contributions
 least-3.088
 very-2.569
 cle-2.404
 ourselves-2.266
 am-2.228
Token soldier
Token ID10686
Feature activation-5.93e-01

Pos logit contributions
lights+8.226
ork+8.095
les+7.694
pron+7.422
ves+7.102
Neg logit contributions
 least-4.852
 very-4.622
 so-3.812
 cle-3.646
 are-3.599
Token opened
Token ID4721
Feature activation-1.66e+00

Pos logit contributions
ork+1.118
lights+1.114
les+1.107
pron+1.043
adoes+0.987
Neg logit contributions
 least-0.847
 very-0.630
oh-0.621
 cle-0.580
 am-0.580
Token the
Token ID262
Feature activation-1.96e-01

Pos logit contributions
ork+3.600
les+3.514
lights+3.462
pron+3.451
lf+3.184
Neg logit contributions
 least-2.035
 very-1.594
 so-1.483
 ourselves-1.449
oh-1.298
Token window
Token ID4324
Feature activation-2.32e-01

Pos logit contributions
les+0.314
ork+0.310
lights+0.305
pron+0.285
adoes+0.263
Neg logit contributions
 least-0.336
 very-0.263
oh-0.261
 ourselves-0.256
 cle-0.251
Token,
Token ID11
Feature activation-7.25e-01

Pos logit contributions
les+0.382
ork+0.380
lights+0.368
pron+0.351
adoes+0.326
Neg logit contributions
 least-0.384
oh-0.299
 very-0.299
 ourselves-0.290
 cle-0.282
TokenThe
Token ID464
Feature activation-4.28e-01

Pos logit contributions
 least+0.042
oh+0.032
 very+0.031
 ourselves+0.031
 cle+0.030
Neg logit contributions
les-0.050
ork-0.049
lights-0.048
pron-0.046
adoes-0.043
Token noisy
Token ID31210
Feature activation-2.14e+00

Pos logit contributions
ork+0.744
les+0.744
lights+0.718
pron+0.667
adoes+0.636
Neg logit contributions
 least-0.686
 very-0.534
oh-0.512
 cle-0.508
 ourselves-0.498
Token brother
Token ID3956
Feature activation+2.67e-01

Pos logit contributions
ork+4.178
les+3.853
lights+3.773
pron+3.461
ves+3.379
Neg logit contributions
 least-3.168
 very-2.612
 cle-2.292
 so-2.283
 oh-2.261
Token and
Token ID290
Feature activation-6.25e-01

Pos logit contributions
 least+0.405
oh+0.307
 ourselves+0.304
 very+0.301
 cle+0.292
Neg logit contributions
les-0.474
ork-0.468
lights-0.456
pron-0.434
adoes-0.402
Token the
Token ID262
Feature activation-4.52e-01

Pos logit contributions
ork+1.169
les+1.143
lights+1.117
pron+1.056
adoes+0.972
Neg logit contributions
 least-0.927
 very-0.716
 cle-0.673
oh-0.671
 ourselves-0.665
Token quiet
Token ID5897
Feature activation-3.70e+00

Pos logit contributions
ork+0.775
les+0.774
lights+0.745
pron+0.693
adoes+0.656
Neg logit contributions
 least-0.736
 very-0.577
 cle-0.551
oh-0.550
 ourselves-0.541
Token brother
Token ID3956
Feature activation+7.32e-01

Pos logit contributions
ork+7.774
les+6.813
lights+6.483
pron+6.469
ool+6.390
Neg logit contributions
 least-5.007
 very-4.694
 so-4.263
 are-3.790
 most-3.596
Token celebrated
Token ID13943
Feature activation+5.18e-02

Pos logit contributions
 least+1.186
 ourselves+0.926
oh+0.920
 cle+0.892
 very+0.889
Neg logit contributions
les-1.245
ork-1.193
lights-1.165
pron-1.109
lf-1.029
Token with
Token ID351
Feature activation-8.24e-02

Pos logit contributions
 least+0.081
oh+0.062
 very+0.061
 ourselves+0.061
 cle+0.059
Neg logit contributions
les-0.090
ork-0.090
lights-0.088
pron-0.083
adoes-0.077
Token a
Token ID257
Feature activation+9.93e-02

Pos logit contributions
les+0.172
ork+0.172
lights+0.168
pron+0.161
adoes+0.152
Neg logit contributions
 least-0.101
 very-0.070
oh-0.069
 ourselves-0.068
 cle-0.065
Token big
Token ID1263
Feature activation-4.62e-01

Pos logit contributions
 least+0.163
oh+0.127
 very+0.124
 ourselves+0.124
 cle+0.120
Neg logit contributions
les-0.165
ork-0.164
lights-0.160
pron-0.151
adoes-0.139
Token a
Token ID257
Feature activation-7.47e-01

Pos logit contributions
les+0.151
ork+0.150
lights+0.147
pron+0.141
adoes+0.133
Neg logit contributions
 least-0.077
 very-0.052
oh-0.052
 ourselves-0.050
 cle-0.048
Token strong
Token ID1913
Feature activation-1.97e+00

Pos logit contributions
les+1.198
lights+1.174
ork+1.155
pron+1.048
adoes+1.011
Neg logit contributions
 least-1.272
 very-1.058
oh-1.002
 ourselves-0.971
 cle-0.968
Token and
Token ID290
Feature activation+2.18e-01

Pos logit contributions
lights+3.692
ork+3.525
les+3.483
pron+3.294
adoes+2.990
Neg logit contributions
 least-2.964
 very-2.561
 so-2.319
 am-2.212
 say-2.211
Token brave
Token ID14802
Feature activation-2.79e+00

Pos logit contributions
 least+0.331
oh+0.252
 ourselves+0.244
 very+0.242
 cle+0.234
Neg logit contributions
les-0.389
ork-0.388
lights-0.379
pron-0.359
adoes-0.336
Token bear
Token ID6842
Feature activation-6.49e-01

Pos logit contributions
lights+5.158
ork+5.018
pron+4.886
les+4.615
lf+4.454
Neg logit contributions
 least-4.187
 very-3.833
 so-3.419
 cle-3.255
 say-3.180
Token on
Token ID319
Feature activation-3.70e+00

Pos logit contributions
lights+1.135
les+1.126
ork+1.114
pron+1.066
adoes+0.981
Neg logit contributions
 least-0.998
 very-0.792
oh-0.779
 ourselves-0.748
 am-0.737
Token it
Token ID340
Feature activation-4.31e-01

Pos logit contributions
les+7.894
lights+7.736
pron+7.066
ork+6.907
ilt+6.841
Neg logit contributions
 very-5.064
 least-4.493
 so-4.309
 am-3.644
 ourselves-3.465
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
les+0.735
ork+0.732
lights+0.721
pron+0.686
adoes+0.633
Neg logit contributions
 least-0.682
 very-0.538
oh-0.524
 ourselves-0.513
 cle-0.499
Token Benny
Token ID44275
Feature activation+5.53e-01

Pos logit contributions
lights+2.586
les+2.566
ork+2.553
pron+2.424
lf+2.274
Neg logit contributions
 least-1.672
 very-1.303
oh-1.232
 so-1.207
 am-1.140
Token wanted
Token ID2227
Feature activation-6.37e-01

Pos logit contributions
 least+0.824
 ourselves+0.615
oh+0.612
 very+0.584
 cle+0.579
Neg logit contributions
les-1.009
ork-0.999
lights-0.954
pron-0.919
lf-0.865
Token to
Token ID284
Feature activation-6.28e-02

Pos logit contributions
lights+1.241
les+1.240
ork+1.219
pron+1.158
adoes+1.087
Neg logit contributions
 least-0.895
 very-0.694
oh-0.642
 ourselves-0.622
 so-0.621
Token to
Token ID284
Feature activation+9.80e-01

Pos logit contributions
 least+0.279
oh+0.215
 very+0.210
 ourselves+0.209
 cle+0.203
Neg logit contributions
les-0.303
ork-0.301
lights-0.294
pron-0.277
adoes-0.258
Token the
Token ID262
Feature activation-7.13e-01

Pos logit contributions
 least+1.450
oh+1.146
 very+1.058
 ourselves+1.053
 cle+1.038
Neg logit contributions
ork-1.742
les-1.738
lights-1.699
pron-1.632
lf-1.521
Token bowl
Token ID9396
Feature activation+5.17e-01

Pos logit contributions
ork+1.136
les+1.123
lights+1.091
pron+1.010
adoes+0.938
Neg logit contributions
 least-1.251
 very-0.999
 ourselves-0.957
 cle-0.951
oh-0.945
Token.
Token ID13
Feature activation-4.32e-01

Pos logit contributions
 least+0.882
oh+0.689
 ourselves+0.675
 very+0.664
 cle+0.657
Neg logit contributions
les-0.825
ork-0.818
lights-0.804
pron-0.747
adoes-0.688
Token The
Token ID383
Feature activation-1.03e+00

Pos logit contributions
les+0.811
ork+0.807
lights+0.786
pron+0.749
adoes+0.700
Neg logit contributions
 least-0.620
 very-0.461
oh-0.459
 ourselves-0.444
 cle-0.435
Token quiet
Token ID5897
Feature activation-3.52e+00

Pos logit contributions
ork+1.739
les+1.697
lights+1.622
pron+1.495
adoes+1.429
Neg logit contributions
 least-1.754
 very-1.408
 cle-1.340
oh-1.274
 ourselves-1.273
Token brother
Token ID3956
Feature activation-1.55e-01

Pos logit contributions
ork+7.261
les+6.227
pron+6.108
lights+6.024
ilantro+5.796
Neg logit contributions
 least-4.747
 very-4.388
 so-4.176
 are-3.854
 most-3.596
Token used
Token ID973
Feature activation-1.06e+00

Pos logit contributions
ork+0.282
les+0.281
lights+0.277
pron+0.261
adoes+0.246
Neg logit contributions
 least-0.227
oh-0.172
 very-0.170
 ourselves-0.164
 cle-0.160
Token the
Token ID262
Feature activation-8.30e-01

Pos logit contributions
ork+2.140
les+2.130
lights+2.086
pron+1.976
adoes+1.891
Neg logit contributions
 least-1.373
 very-1.073
 cle-0.971
 so-0.967
oh-0.944
Token shield
Token ID7614
Feature activation+3.13e-01

Pos logit contributions
ork+1.255
les+1.251
lights+1.223
pron+1.094
adoes+1.031
Neg logit contributions
 least-1.501
 very-1.228
 cle-1.188
 ourselves-1.167
 am-1.148
Token to
Token ID284
Feature activation+2.40e-01

Pos logit contributions
 least+0.491
oh+0.378
 ourselves+0.370
 very+0.365
 cle+0.354
Neg logit contributions
les-0.539
ork-0.537
lights-0.522
pron-0.493
adoes-0.460
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
les+1.394
ork+1.380
lights+1.367
pron+1.308
adoes+1.220
Neg logit contributions
 least-0.734
 very-0.515
 ourselves-0.487
oh-0.466
 cle-0.451
Token day
Token ID1110
Feature activation-2.02e-01

Pos logit contributions
ork+2.345
lights+2.271
les+2.259
pron+2.063
adoes+2.014
Neg logit contributions
 least-1.780
 very-1.393
oh-1.312
 ourselves-1.280
 cle-1.269
Token playing
Token ID2712
Feature activation-1.26e+00

Pos logit contributions
ork+0.425
les+0.425
lights+0.418
pron+0.399
adoes+0.377
Neg logit contributions
 least-0.241
 very-0.167
oh-0.167
 ourselves-0.162
 cle-0.154
Token with
Token ID351
Feature activation-9.77e-01

Pos logit contributions
les+2.523
ork+2.516
lights+2.515
pron+2.393
adoes+2.162
Neg logit contributions
 least-1.573
 very-1.295
 ourselves-1.174
 so-1.128
 am-1.124
Token the
Token ID262
Feature activation-2.13e+00

Pos logit contributions
les+2.219
ork+2.194
lights+2.128
pron+2.121
lf+1.935
Neg logit contributions
 least-1.076
 very-0.771
 ourselves-0.698
 so-0.640
 cle-0.633
Token yellow
Token ID7872
Feature activation-3.46e+00

Pos logit contributions
ork+3.915
les+3.494
lights+3.458
pron+3.223
ves+3.073
Neg logit contributions
 least-3.571
 very-3.064
 cle-2.877
 so-2.709
 most-2.666
Token shield
Token ID7614
Feature activation-8.51e-02

Pos logit contributions
ork+6.053
lights+5.455
pron+5.304
anne+5.220
ves+4.830
Neg logit contributions
 least-5.923
 very-5.360
 are-5.104
 am-4.773
 so-4.699
Token and
Token ID290
Feature activation+1.47e-01

Pos logit contributions
les+0.155
ork+0.155
lights+0.151
pron+0.144
adoes+0.134
Neg logit contributions
 least-0.125
oh-0.093
 very-0.093
 ourselves-0.091
 cle-0.088
Token having
Token ID1719
Feature activation+6.84e-02

Pos logit contributions
 least+0.191
oh+0.138
 very+0.134
 ourselves+0.133
 cle+0.127
Neg logit contributions
les-0.294
ork-0.293
lights-0.287
pron-0.274
adoes-0.258
Token lots
Token ID6041
Feature activation-1.24e+00

Pos logit contributions
 least+0.094
oh+0.069
 very+0.068
 ourselves+0.067
 cle+0.064
Neg logit contributions
les-0.132
ork-0.131
lights-0.128
pron-0.122
adoes-0.114
Token of
Token ID286
Feature activation-5.85e-01

Pos logit contributions
ork+1.844
lights+1.769
les+1.755
pron+1.748
adoes+1.646
Neg logit contributions
 least-2.177
 very-1.880
 so-1.739
 am-1.722
oh-1.722
Token cool
Token ID3608
Feature activation-3.11e+00

Pos logit contributions
les+2.871
lights+2.845
ork+2.748
adoes+2.585
pron+2.499
Neg logit contributions
 least-1.985
 very-1.829
 am-1.581
oh-1.577
 so-1.530
Token shirt
Token ID10147
Feature activation-1.03e+00

Pos logit contributions
lights+6.120
les+5.772
ork+5.766
emaker+5.394
adoes+5.280
Neg logit contributions
 very-4.049
 least-3.903
 so-3.552
 am-3.220
 say-3.074
Token with
Token ID351
Feature activation-7.98e-01

Pos logit contributions
ork+1.887
les+1.874
lights+1.859
pron+1.716
adoes+1.684
Neg logit contributions
 least-1.519
 very-1.217
oh-1.135
 so-1.122
 ourselves-1.056
Token a
Token ID257
Feature activation-1.89e+00

Pos logit contributions
ork+1.652
les+1.623
lights+1.607
pron+1.553
lf+1.434
Neg logit contributions
 least-0.991
 very-0.763
 ourselves-0.694
oh-0.683
 so-0.679
Token dinosaur
Token ID28257
Feature activation-1.67e+00

Pos logit contributions
lights+3.492
les+3.480
ork+3.284
adoes+3.076
ors+2.898
Neg logit contributions
 least-2.658
 very-2.535
 so-2.141
 am-2.140
oh-2.102
Token on
Token ID319
Feature activation-3.41e+00

Pos logit contributions
lights+3.123
ork+2.992
les+2.991
pron+2.840
adoes+2.738
Neg logit contributions
 least-2.417
 very-2.033
oh-1.847
 so-1.789
 am-1.707
Token it
Token ID340
Feature activation-5.56e-01

Pos logit contributions
les+8.195
lights+7.841
pron+7.557
ork+7.500
ravings+7.072
Neg logit contributions
 very-3.577
 least-3.530
 so-3.206
 ourselves-2.487
 say-2.389
Token.
Token ID13
Feature activation-1.22e+00

Pos logit contributions
les+0.962
ork+0.960
lights+0.948
pron+0.888
adoes+0.837
Neg logit contributions
 least-0.872
 very-0.685
oh-0.667
 ourselves-0.644
 so-0.625
Token\n
Token ID198
Feature activation-7.84e-01

Pos logit contributions
les+2.474
ork+2.445
lights+2.398
pron+2.310
adoes+2.138
Neg logit contributions
 least-1.545
 very-1.220
oh-1.159
 so-1.132
 ourselves-1.072
Token\n
Token ID198
Feature activation-1.18e+00

Pos logit contributions
les+1.566
lights+1.541
ork+1.526
pron+1.460
adoes+1.420
Neg logit contributions
 least-0.989
 very-0.762
oh-0.759
 am-0.712
 ourselves-0.695
TokenTim
Token ID14967
Feature activation-5.78e-02

Pos logit contributions
les+2.121
lights+2.098
ork+2.062
pron+1.916
adoes+1.808
Neg logit contributions
 least-1.743
 very-1.474
oh-1.469
 oh-1.352
 so-1.338
Token river
Token ID7850
Feature activation-7.06e-02

Pos logit contributions
 least+0.785
oh+0.635
 ourselves+0.608
 very+0.604
 cle+0.589
Neg logit contributions
les-0.688
ork-0.684
lights-0.668
pron-0.620
adoes-0.583
Token separates
Token ID31555
Feature activation+4.60e-01

Pos logit contributions
les+0.154
ork+0.154
lights+0.151
pron+0.145
adoes+0.136
Neg logit contributions
 least-0.078
oh-0.053
 very-0.052
 ourselves-0.051
 cle-0.048
Token us
Token ID514
Feature activation+4.80e-01

Pos logit contributions
 least+0.640
oh+0.471
 very+0.461
 cle+0.453
 am+0.449
Neg logit contributions
les-0.880
ork-0.867
lights-0.853
pron-0.792
adoes-0.768
Token!"
Token ID2474
Feature activation+5.38e-01

Pos logit contributions
 least+0.731
oh+0.554
 very+0.534
 ourselves+0.531
 cle+0.529
Neg logit contributions
les-0.848
ork-0.843
lights-0.820
pron-0.765
adoes-0.731
Token The
Token ID383
Feature activation-1.04e+00

Pos logit contributions
 least+0.848
oh+0.644
 ourselves+0.625
 very+0.624
 cle+0.614
Neg logit contributions
ork-0.942
les-0.940
lights-0.906
pron-0.857
adoes-0.811
Token quiet
Token ID5897
Feature activation-3.39e+00

Pos logit contributions
les+1.864
ork+1.859
lights+1.799
pron+1.667
adoes+1.539
Neg logit contributions
 least-1.663
 very-1.362
 cle-1.272
 ourselves-1.245
oh-1.213
Token bunny
Token ID44915
Feature activation-4.00e-01

Pos logit contributions
ork+7.183
pron+7.020
lights+6.587
les+6.282
ravings+6.115
Neg logit contributions
 least-4.173
 very-4.163
 so-3.832
 am-3.611
 are-3.581
Token was
Token ID373
Feature activation+2.32e-01

Pos logit contributions
ork+0.798
les+0.797
lights+0.796
pron+0.752
adoes+0.709
Neg logit contributions
 least-0.515
 very-0.383
oh-0.381
 ourselves-0.348
 cle-0.348
Token sad
Token ID6507
Feature activation+5.37e-01

Pos logit contributions
 least+0.299
oh+0.217
 ourselves+0.208
 very+0.204
 cle+0.197
Neg logit contributions
les-0.465
ork-0.464
lights-0.455
pron-0.434
adoes-0.410
Token.
Token ID13
Feature activation+2.33e-01

Pos logit contributions
 least+0.913
oh+0.734
 ourselves+0.705
 cle+0.697
 very+0.689
Neg logit contributions
les-0.863
ork-0.849
lights-0.836
pron-0.764
adoes-0.718
Token They
Token ID1119
Feature activation+6.18e-01

Pos logit contributions
 least+0.401
oh+0.314
 very+0.308
 ourselves+0.307
 cle+0.298
Neg logit contributions
les-0.370
ork-0.370
lights-0.360
pron-0.337
adoes-0.314
Token with
Token ID351
Feature activation-3.08e-01

Pos logit contributions
les+0.084
ork+0.083
lights+0.081
pron+0.076
adoes+0.071
Neg logit contributions
 least-0.079
oh-0.061
 very-0.061
 ourselves-0.059
 cle-0.057
Token pictures
Token ID5986
Feature activation-6.61e-01

Pos logit contributions
les+0.580
ork+0.578
lights+0.570
pron+0.539
adoes+0.506
Neg logit contributions
 least-0.434
 very-0.331
oh-0.324
 ourselves-0.314
 cle-0.306
Token of
Token ID286
Feature activation-2.99e-01

Pos logit contributions
ork+1.088
les+1.080
lights+1.047
pron+1.001
adoes+0.941
Neg logit contributions
 least-1.074
 very-0.885
oh-0.842
 ourselves-0.818
 so-0.811
Token cute
Token ID13779
Feature activation-2.06e+00

Pos logit contributions
les+0.500
ork+0.500
lights+0.487
pron+0.461
adoes+0.425
Neg logit contributions
 least-0.487
 very-0.382
oh-0.373
 ourselves-0.373
 cle-0.359
Token animals
Token ID4695
Feature activation-2.97e-01

Pos logit contributions
ork+3.719
pron+3.703
lights+3.605
les+3.350
anne+3.099
Neg logit contributions
 least-2.864
 very-2.629
 so-2.372
 ourselves-2.362
oh-2.267
Token on
Token ID319
Feature activation-3.37e+00

Pos logit contributions
ork+0.506
les+0.504
lights+0.496
pron+0.468
adoes+0.436
Neg logit contributions
 least-0.474
 very-0.372
oh-0.365
 ourselves-0.358
 am-0.344
Token it
Token ID340
Feature activation-5.98e-01

Pos logit contributions
les+6.766
lights+6.565
pron+6.391
ork+6.386
ravings+6.269
Neg logit contributions
 least-4.321
 very-4.293
 so-3.569
 ourselves-3.409
 am-3.287
Token.
Token ID13
Feature activation-1.54e+00

Pos logit contributions
les+1.043
ork+1.041
lights+1.019
pron+0.970
adoes+0.904
Neg logit contributions
 least-0.933
 very-0.742
oh-0.706
 ourselves-0.694
 so-0.673
Token He
Token ID679
Feature activation+2.97e-01

Pos logit contributions
les+2.688
ork+2.626
lights+2.498
pron+2.401
adoes+2.223
Neg logit contributions
 least-2.512
 very-2.075
oh-1.937
 so-1.935
 am-1.898
Token loved
Token ID6151
Feature activation-6.73e-01

Pos logit contributions
 least+0.465
oh+0.358
 ourselves+0.355
 very+0.345
 cle+0.338
Neg logit contributions
les-0.514
ork-0.511
lights-0.495
pron-0.474
lf-0.437
Token to
Token ID284
Feature activation-6.75e-01

Pos logit contributions
les+1.276
ork+1.270
lights+1.243
pron+1.170
adoes+1.094
Neg logit contributions
 least-0.956
 very-0.754
oh-0.695
 ourselves-0.690
 cle-0.677
Token ran
Token ID4966
Feature activation-5.65e-01

Pos logit contributions
lights+2.683
ork+2.608
les+2.525
pron+2.469
adoes+2.308
Neg logit contributions
 least-1.794
 very-1.423
oh-1.343
 so-1.249
 cle-1.231
Token away
Token ID1497
Feature activation+3.24e-02

Pos logit contributions
les+1.026
lights+1.016
ork+1.013
pron+0.981
lf+0.885
Neg logit contributions
 least-0.849
 very-0.681
oh-0.638
 ourselves-0.621
 so-0.614
Token,
Token ID11
Feature activation-4.06e-01

Pos logit contributions
 least+0.050
oh+0.039
 very+0.038
 ourselves+0.038
 cle+0.036
Neg logit contributions
les-0.057
ork-0.057
lights-0.055
pron-0.052
adoes-0.049
Token and
Token ID290
Feature activation+2.87e-01

Pos logit contributions
les+0.808
ork+0.801
lights+0.787
pron+0.740
adoes+0.697
Neg logit contributions
 least-0.547
 very-0.414
oh-0.392
 ourselves-0.381
 cle-0.374
Token the
Token ID262
Feature activation-1.36e+00

Pos logit contributions
 least+0.399
oh+0.296
 ourselves+0.289
 very+0.286
 cle+0.276
Neg logit contributions
les-0.545
ork-0.541
lights-0.529
pron-0.502
adoes-0.473
Token blue
Token ID4171
Feature activation-3.36e+00

Pos logit contributions
les+2.493
ork+2.473
lights+2.419
pron+2.183
adoes+2.022
Neg logit contributions
 least-2.111
 very-1.784
 cle-1.651
 so-1.563
 oh-1.541
Token bird
Token ID6512
Feature activation-2.83e-01

Pos logit contributions
lights+6.833
les+6.539
ork+6.430
pron+6.387
lf+6.008
Neg logit contributions
 least-4.768
 very-4.400
 am-3.838
 are-3.783
 so-3.771
Token saved
Token ID7448
Feature activation-5.63e-01

Pos logit contributions
ork+0.531
les+0.529
lights+0.524
pron+0.497
adoes+0.465
Neg logit contributions
 least-0.400
 very-0.301
oh-0.299
 ourselves-0.284
 cle-0.278
Token its
Token ID663
Feature activation-3.01e-01

Pos logit contributions
les+1.106
lights+1.093
ork+1.091
pron+1.055
lf+0.958
Neg logit contributions
 least-0.774
 very-0.587
 ourselves-0.573
oh-0.566
 cle-0.529
Token eggs
Token ID9653
Feature activation-2.15e-01

Pos logit contributions
les+0.510
ork+0.504
lights+0.495
pron+0.466
adoes+0.431
Neg logit contributions
 least-0.492
 very-0.385
oh-0.375
 ourselves-0.373
 cle-0.368
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
les+0.368
ork+0.365
lights+0.357
pron+0.338
adoes+0.315
Neg logit contributions
 least-0.344
 very-0.265
oh-0.264
 ourselves-0.261
 cle-0.249
Token.
Token ID13
Feature activation+1.04e-01

Pos logit contributions
les+0.242
ork+0.240
lights+0.235
pron+0.223
adoes+0.206
Neg logit contributions
 least-0.231
oh-0.180
 very-0.179
 ourselves-0.175
 cle-0.169
Token\n
Token ID198
Feature activation+5.35e-01

Pos logit contributions
 least+0.167
oh+0.129
 very+0.127
 ourselves+0.126
 cle+0.122
Neg logit contributions
les-0.176
ork-0.175
lights-0.171
pron-0.161
adoes-0.150
Token\n
Token ID198
Feature activation-3.51e-01

Pos logit contributions
 least+0.847
oh+0.633
 ourselves+0.630
 very+0.623
 cle+0.616
Neg logit contributions
ork-0.932
les-0.917
lights-0.885
pron-0.845
lf-0.781
TokenWhen
Token ID2215
Feature activation+1.46e-01

Pos logit contributions
les+0.671
lights+0.658
ork+0.657
pron+0.614
adoes+0.583
Neg logit contributions
 least-0.496
oh-0.381
 very-0.369
 ourselves-0.360
 am-0.347
Token the
Token ID262
Feature activation-1.49e+00

Pos logit contributions
 least+0.218
oh+0.165
 very+0.161
 ourselves+0.160
 cle+0.155
Neg logit contributions
les-0.264
ork-0.263
lights-0.257
pron-0.244
adoes-0.228
Token white
Token ID2330
Feature activation-3.36e+00

Pos logit contributions
ork+2.670
les+2.481
lights+2.347
pron+2.318
adoes+2.104
Neg logit contributions
 least-2.554
 very-2.109
 cle-1.966
 so-1.925
 ourselves-1.912
Token all
Token ID477
Feature activation-2.42e-01

Pos logit contributions
ork+6.805
pron+6.219
lights+6.141
les+5.941
isdom+5.766
Neg logit contributions
 least-4.747
 very-4.464
 so-4.034
 first-3.606
'm-3.599
Tokenigator
Token ID23823
Feature activation+5.59e-02

Pos logit contributions
ork+0.382
les+0.376
lights+0.365
pron+0.350
adoes+0.319
Neg logit contributions
 least-0.424
oh-0.335
 very-0.333
 ourselves-0.325
 cle-0.317
Token got
Token ID1392
Feature activation-2.97e-01

Pos logit contributions
 least+0.087
oh+0.067
 very+0.066
 ourselves+0.065
 cle+0.063
Neg logit contributions
les-0.097
ork-0.097
lights-0.094
pron-0.089
adoes-0.083
Token out
Token ID503
Feature activation+2.85e-01

Pos logit contributions
les+0.594
ork+0.591
lights+0.577
pron+0.555
lf+0.517
Neg logit contributions
 least-0.396
 very-0.295
oh-0.285
 ourselves-0.277
 cle-0.268
Token of
Token ID286
Feature activation-5.36e-01

Pos logit contributions
 least+0.476
oh+0.375
 ourselves+0.365
 very+0.363
 cle+0.353
Neg logit contributions
les-0.467
ork-0.462
lights-0.455
pron-0.421
adoes-0.394
Token it
Token ID340
Feature activation+7.58e-01

Pos logit contributions
 least+0.551
oh+0.425
 ourselves+0.409
 very+0.405
 cle+0.398
Neg logit contributions
les-0.609
ork-0.605
lights-0.591
pron-0.555
adoes-0.522
Token!
Token ID0
Feature activation+3.05e-01

Pos logit contributions
 least+1.269
oh+1.006
 ourselves+0.964
 cle+0.943
 am+0.940
Neg logit contributions
les-1.231
ork-1.213
lights-1.190
pron-1.097
adoes-1.027
Token\n
Token ID198
Feature activation+4.41e-01

Pos logit contributions
 least+0.471
oh+0.358
 very+0.352
 ourselves+0.352
 cle+0.340
Neg logit contributions
ork-0.533
les-0.532
lights-0.517
pron-0.488
adoes-0.456
Token\n
Token ID198
Feature activation-1.60e-01

Pos logit contributions
 least+0.690
oh+0.515
 ourselves+0.511
 very+0.507
 cle+0.498
Neg logit contributions
ork-0.773
les-0.766
lights-0.740
pron-0.704
lf-0.652
TokenThe
Token ID464
Feature activation-1.53e+00

Pos logit contributions
les+0.310
ork+0.307
lights+0.303
pron+0.286
adoes+0.270
Neg logit contributions
 least-0.221
oh-0.166
 very-0.162
 ourselves-0.158
 cle-0.152
Token white
Token ID2330
Feature activation-3.34e+00

Pos logit contributions
ork+2.934
les+2.722
lights+2.643
pron+2.547
anne+2.320
Neg logit contributions
 least-2.438
 very-2.027
 cle-1.847
 so-1.806
 ourselves-1.789
Token all
Token ID477
Feature activation-1.48e-01

Pos logit contributions
ork+6.523
lights+6.091
pron+5.929
les+5.720
isdom+5.478
Neg logit contributions
 least-4.859
 very-4.620
 so-4.138
 am-3.843
'm-3.838
Tokenigator
Token ID23823
Feature activation-2.54e-02

Pos logit contributions
ork+0.218
les+0.216
lights+0.209
pron+0.197
adoes+0.181
Neg logit contributions
 least-0.272
oh-0.219
 very-0.218
 ourselves-0.212
 cle-0.208
Token was
Token ID373
Feature activation+8.44e-02

Pos logit contributions
les+0.047
ork+0.047
lights+0.046
pron+0.044
adoes+0.041
Neg logit contributions
 least-0.037
oh-0.027
 very-0.027
 ourselves-0.026
 cle-0.026
Token scared
Token ID12008
Feature activation+1.59e-01

Pos logit contributions
 least+0.077
oh+0.046
 very+0.044
 ourselves+0.043
 cle+0.040
Neg logit contributions
les-0.202
ork-0.201
lights-0.198
pron-0.190
adoes-0.181
Token so
Token ID523
Feature activation+2.03e-01

Pos logit contributions
 least+0.263
oh+0.205
 very+0.200
 ourselves+0.200
 cle+0.194
Neg logit contributions
les-0.265
ork-0.263
lights-0.257
pron-0.241
adoes-0.225
Token with
Token ID351
Feature activation-3.52e-01

Pos logit contributions
les+0.841
ork+0.836
lights+0.826
pron+0.767
adoes+0.741
Neg logit contributions
 least-0.638
 very-0.495
oh-0.477
 ourselves-0.452
 am-0.443
Token his
Token ID465
Feature activation+2.90e-01

Pos logit contributions
ork+0.690
les+0.690
lights+0.677
pron+0.646
adoes+0.603
Neg logit contributions
 least-0.463
 very-0.348
oh-0.332
 ourselves-0.329
 cle-0.319
Token favorite
Token ID4004
Feature activation-2.30e+00

Pos logit contributions
 least+0.541
oh+0.437
 ourselves+0.426
 very+0.423
 cle+0.414
Neg logit contributions
ork-0.415
les-0.415
lights-0.400
pron-0.374
adoes-0.344
Token cartoon
Token ID16251
Feature activation-7.98e-01

Pos logit contributions
lights+3.885
ork+3.691
les+3.577
pron+3.469
anne+3.186
Neg logit contributions
 least-3.560
 very-3.417
 am-3.077
 so-3.055
 are-3.032
Token character
Token ID2095
Feature activation-1.68e-01

Pos logit contributions
les+1.337
lights+1.318
ork+1.314
pron+1.227
adoes+1.154
Neg logit contributions
 least-1.249
 very-1.037
oh-0.984
 am-0.946
 ourselves-0.943
Token on
Token ID319
Feature activation-3.30e+00

Pos logit contributions
les+0.292
ork+0.292
lights+0.286
pron+0.270
adoes+0.252
Neg logit contributions
 least-0.259
oh-0.198
 very-0.198
 ourselves-0.193
 cle-0.186
Token the
Token ID262
Feature activation+7.30e-01

Pos logit contributions
lights+7.475
les+7.264
ork+6.719
anne+6.718
pron+6.698
Neg logit contributions
 least-3.817
 very-3.815
 so-3.294
 ourselves-2.909
 am-2.692
Token cover
Token ID3002
Feature activation-1.09e+00

Pos logit contributions
 least+1.451
oh+1.199
 ourselves+1.160
 am+1.147
 very+1.136
Neg logit contributions
ork-0.963
les-0.960
lights-0.890
pron-0.869
lf-0.779
Token.
Token ID13
Feature activation-9.26e-01

Pos logit contributions
ork+1.936
les+1.922
lights+1.882
pron+1.818
adoes+1.699
Neg logit contributions
 least-1.596
 very-1.309
oh-1.233
 ourselves-1.209
 so-1.193
Token He
Token ID679
Feature activation-4.39e-02

Pos logit contributions
les+1.688
ork+1.602
lights+1.600
pron+1.507
lf+1.367
Neg logit contributions
 least-1.381
 very-1.129
oh-1.072
 so-1.038
 oh-1.029
Token asked
Token ID1965
Feature activation-3.81e-01

Pos logit contributions
les+0.076
ork+0.076
lights+0.075
pron+0.070
adoes+0.066
Neg logit contributions
 least-0.069
oh-0.053
 very-0.052
 ourselves-0.051
 cle-0.050
Token to
Token ID284
Feature activation-7.13e-01

Pos logit contributions
 least+1.382
oh+1.051
 ourselves+1.020
 cle+0.999
 am+0.967
Neg logit contributions
ork-1.631
les-1.610
lights-1.557
pron-1.501
lf-1.355
Token go
Token ID467
Feature activation+4.08e-01

Pos logit contributions
les+1.512
ork+1.499
lights+1.493
pron+1.413
adoes+1.331
Neg logit contributions
 least-0.905
 very-0.611
 ourselves-0.608
oh-0.591
 am-0.556
Token home
Token ID1363
Feature activation+9.69e-01

Pos logit contributions
 least+0.641
oh+0.502
 ourselves+0.482
 very+0.480
 cle+0.476
Neg logit contributions
les-0.708
ork-0.703
lights-0.683
pron-0.639
adoes-0.596
Token.
Token ID13
Feature activation-5.67e-01

Pos logit contributions
 least+1.741
oh+1.428
 cle+1.355
 ourselves+1.344
 oh+1.318
Neg logit contributions
ork-1.453
les-1.422
lights-1.409
pron-1.282
lf-1.193
Token The
Token ID383
Feature activation-1.19e+00

Pos logit contributions
les+0.971
ork+0.952
lights+0.934
pron+0.892
adoes+0.812
Neg logit contributions
 least-0.922
 very-0.712
oh-0.698
 ourselves-0.685
 cle-0.675
Token polite
Token ID23507
Feature activation-3.28e+00

Pos logit contributions
ork+2.267
les+2.232
lights+2.216
pron+2.047
adoes+1.939
Neg logit contributions
 least-1.783
 very-1.412
 cle-1.310
 ourselves-1.275
 so-1.247
Token boy
Token ID2933
Feature activation+3.27e-01

Pos logit contributions
ork+8.480
lights+8.421
pron+8.357
les+7.752
anne+7.683
Neg logit contributions
 least-2.723
 very-2.421
 are-2.172
 so-2.072
 say-1.974
Token was
Token ID373
Feature activation+3.77e-01

Pos logit contributions
 least+0.491
oh+0.374
 ourselves+0.370
 very+0.362
 cle+0.352
Neg logit contributions
les-0.585
ork-0.579
lights-0.562
pron-0.535
adoes-0.495
Token sad
Token ID6507
Feature activation+3.56e-01

Pos logit contributions
 least+0.323
oh+0.192
 ourselves+0.183
 very+0.165
 cle+0.161
Neg logit contributions
les-0.912
ork-0.910
lights-0.895
pron-0.865
adoes-0.816
Token.
Token ID13
Feature activation+2.03e-01

Pos logit contributions
 least+0.567
oh+0.445
 ourselves+0.430
 very+0.426
 cle+0.418
Neg logit contributions
les-0.605
ork-0.601
lights-0.590
pron-0.548
adoes-0.513
Token He
Token ID679
Feature activation+2.30e-01

Pos logit contributions
 least+0.354
oh+0.281
 very+0.275
 ourselves+0.275
 cle+0.266
Neg logit contributions
les-0.317
ork-0.316
lights-0.307
pron-0.287
adoes-0.267
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
 least+0.351
oh+0.251
 ourselves+0.240
 very+0.237
 cle+0.227
Neg logit contributions
les-0.584
ork-0.579
lights-0.568
pron-0.540
adoes-0.513
Token dragon
Token ID10441
Feature activation-5.78e-01

Pos logit contributions
ork+1.951
lights+1.899
les+1.874
pron+1.689
adoes+1.591
Neg logit contributions
 least-2.201
 very-1.841
 cle-1.728
 ourselves-1.669
oh-1.667
Token again
Token ID757
Feature activation+6.91e-01

Pos logit contributions
ork+1.056
les+1.046
lights+1.036
pron+0.981
adoes+0.921
Neg logit contributions
 least-0.859
 very-0.666
oh-0.648
 ourselves-0.621
 cle-0.609
Token,
Token ID11
Feature activation-2.97e-01

Pos logit contributions
 least+1.219
oh+0.976
 ourselves+0.959
 cle+0.937
 very+0.936
Neg logit contributions
les-1.055
ork-1.042
lights-0.998
pron-0.944
lf-0.862
Token but
Token ID475
Feature activation-8.85e-02

Pos logit contributions
ork+0.564
les+0.561
lights+0.548
pron+0.519
adoes+0.485
Neg logit contributions
 least-0.424
 very-0.319
oh-0.310
 ourselves-0.303
 cle-0.296
Token this
Token ID428
Feature activation-3.28e+00

Pos logit contributions
les+0.174
ork+0.173
lights+0.170
pron+0.161
adoes+0.151
Neg logit contributions
 least-0.120
oh-0.087
 very-0.087
 ourselves-0.085
 cle-0.081
Token time
Token ID640
Feature activation-2.65e-01

Pos logit contributions
lights+5.421
ork+4.951
adoes+4.925
les+4.781
ors+4.780
Neg logit contributions
 least-6.008
 very-5.546
 am-5.101
 so-5.083
oh-4.858
Token it
Token ID340
Feature activation-6.83e-01

Pos logit contributions
les+0.525
ork+0.518
lights+0.512
pron+0.489
adoes+0.457
Neg logit contributions
 least-0.348
 very-0.259
oh-0.255
 ourselves-0.245
 cle-0.236
Token was
Token ID373
Feature activation-9.66e-01

Pos logit contributions
lights+1.413
les+1.413
ork+1.408
pron+1.343
adoes+1.272
Neg logit contributions
 least-0.852
oh-0.624
 very-0.616
 am-0.562
 cle-0.560
Token awake
Token ID21693
Feature activation-4.46e-01

Pos logit contributions
les+2.111
ork+2.062
lights+2.018
pron+1.874
adoes+1.819
Neg logit contributions
 least-1.158
 very-0.925
 so-0.790
 cle-0.780
oh-0.749
Token!
Token ID0
Feature activation+2.31e-01

Pos logit contributions
ork+0.779
les+0.778
lights+0.762
pron+0.725
adoes+0.675
Neg logit contributions
 least-0.685
 very-0.535
oh-0.519
 ourselves-0.508
 so-0.489
Token pick
Token ID2298
Feature activation+5.08e-01

Pos logit contributions
les+0.316
ork+0.315
lights+0.308
pron+0.293
adoes+0.272
Neg logit contributions
 least-0.241
oh-0.177
 very-0.177
 ourselves-0.174
 cle-0.167
Token you
Token ID345
Feature activation+2.26e-01

Pos logit contributions
 least+0.825
oh+0.643
 very+0.619
 ourselves+0.611
 cle+0.603
Neg logit contributions
les-0.867
ork-0.839
lights-0.815
pron-0.753
adoes-0.727
Token up
Token ID510
Feature activation+2.82e-01

Pos logit contributions
 least+0.344
oh+0.262
 very+0.255
 ourselves+0.252
 cle+0.246
Neg logit contributions
les-0.403
ork-0.400
lights-0.390
pron-0.368
adoes-0.347
Token!"
Token ID2474
Feature activation+5.51e-01

Pos logit contributions
 least+0.419
oh+0.318
 very+0.307
 ourselves+0.303
 cle+0.297
Neg logit contributions
les-0.510
ork-0.506
lights-0.496
pron-0.467
adoes-0.441
Token The
Token ID383
Feature activation-2.46e-01

Pos logit contributions
 least+0.764
oh+0.553
 ourselves+0.535
 very+0.527
 cle+0.512
Neg logit contributions
les-1.071
ork-1.058
lights-1.033
pron-0.972
adoes-0.931
Token quiet
Token ID5897
Feature activation-3.28e+00

Pos logit contributions
les+0.418
ork+0.417
lights+0.405
pron+0.380
adoes+0.353
Neg logit contributions
 least-0.402
 very-0.313
oh-0.309
 ourselves-0.304
 cle-0.300
Token bunny
Token ID44915
Feature activation-3.87e-01

Pos logit contributions
ork+6.871
pron+6.557
lights+6.212
les+5.903
ravings+5.800
Neg logit contributions
 least-4.136
 very-4.126
 so-3.831
 am-3.505
 are-3.442
Token was
Token ID373
Feature activation+3.94e-01

Pos logit contributions
ork+0.840
les+0.838
lights+0.837
pron+0.798
adoes+0.754
Neg logit contributions
 least-0.430
 very-0.301
oh-0.298
 ourselves-0.274
 cle-0.270
Token happy
Token ID3772
Feature activation-5.21e-01

Pos logit contributions
 least+0.472
oh+0.334
 ourselves+0.316
 very+0.300
 am+0.296
Neg logit contributions
les-0.824
ork-0.821
lights-0.807
pron-0.776
adoes-0.731
Token.
Token ID13
Feature activation+2.36e-01

Pos logit contributions
les+0.918
ork+0.917
lights+0.890
pron+0.856
adoes+0.788
Neg logit contributions
 least-0.823
 very-0.646
oh-0.620
 ourselves-0.615
 so-0.593
Token The
Token ID383
Feature activation-2.70e-01

Pos logit contributions
 least+0.388
oh+0.300
 very+0.294
 ourselves+0.293
 cle+0.283
Neg logit contributions
les-0.396
ork-0.394
lights-0.384
pron-0.361
adoes-0.338
Token big
Token ID1263
Feature activation+5.37e-01

Pos logit contributions
 least+0.205
oh+0.160
 very+0.156
 ourselves+0.156
 cle+0.151
Neg logit contributions
les-0.207
ork-0.206
lights-0.200
pron-0.190
adoes-0.175
Token red
Token ID2266
Feature activation-2.87e-01

Pos logit contributions
 least+1.006
oh+0.818
 ourselves+0.806
 very+0.776
 cle+0.771
Neg logit contributions
les-0.763
ork-0.756
lights-0.726
pron-0.699
lf-0.626
Token nose
Token ID9686
Feature activation-1.44e+00

Pos logit contributions
les+0.415
ork+0.410
lights+0.406
pron+0.371
adoes+0.345
Neg logit contributions
 least-0.527
 very-0.426
oh-0.423
 ourselves-0.412
 am-0.402
Token and
Token ID290
Feature activation+1.25e-01

Pos logit contributions
ork+2.741
les+2.722
lights+2.676
pron+2.508
adoes+2.444
Neg logit contributions
 least-1.948
 very-1.700
 so-1.516
oh-1.475
 am-1.367
Token he
Token ID339
Feature activation-7.96e-03

Pos logit contributions
 least+0.175
oh+0.129
 very+0.126
 ourselves+0.125
 cle+0.120
Neg logit contributions
les-0.237
ork-0.236
lights-0.231
pron-0.220
adoes-0.206
Token wore
Token ID12408
Feature activation-3.28e+00

Pos logit contributions
les+0.014
ork+0.014
lights+0.014
pron+0.013
adoes+0.012
Neg logit contributions
 least-0.012
oh-0.009
 very-0.009
 ourselves-0.009
 cle-0.008
Token funny
Token ID8258
Feature activation-1.59e+00

Pos logit contributions
lights+10.115
ork+9.599
les+9.561
adoes+9.119
ravings+9.083
Neg logit contributions
 very-1.772
 so-1.142
 least-0.880
 oh-0.451
 a-0.295
Token clothes
Token ID8242
Feature activation-1.37e+00

Pos logit contributions
ork+2.486
lights+2.467
les+2.412
pron+2.351
adoes+2.119
Neg logit contributions
 least-2.386
 very-2.284
 so-2.049
 am-2.010
oh-2.004
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
ork+2.620
les+2.549
lights+2.519
pron+2.407
adoes+2.313
Neg logit contributions
 least-1.971
 very-1.640
 so-1.485
 ourselves-1.413
oh-1.402
Token He
Token ID679
Feature activation-1.79e-01

Pos logit contributions
les+1.991
ork+1.953
lights+1.890
pron+1.829
adoes+1.696
Neg logit contributions
 least-1.929
 very-1.604
oh-1.515
 am-1.487
 so-1.478
Token liked
Token ID8288
Feature activation-4.71e-01

Pos logit contributions
ork+0.314
les+0.314
lights+0.308
pron+0.289
adoes+0.273
Neg logit contributions
 least-0.275
 very-0.211
oh-0.210
 ourselves-0.201
 cle-0.198
Token calendar
Token ID11845
Feature activation+2.00e-01

Pos logit contributions
 least+0.303
oh+0.246
 ourselves+0.241
 very+0.241
 cle+0.234
Neg logit contributions
les-0.219
ork-0.219
lights-0.211
pron-0.199
adoes-0.180
Token with
Token ID351
Feature activation-7.15e-02

Pos logit contributions
 least+0.325
oh+0.252
 ourselves+0.247
 very+0.245
 cle+0.239
Neg logit contributions
les-0.336
ork-0.334
lights-0.326
pron-0.309
adoes-0.285
Token pictures
Token ID5986
Feature activation-7.39e-01

Pos logit contributions
les+0.132
ork+0.131
lights+0.128
pron+0.122
adoes+0.114
Neg logit contributions
 least-0.104
oh-0.079
 very-0.078
 ourselves-0.076
 cle-0.074
Token of
Token ID286
Feature activation-5.10e-01

Pos logit contributions
ork+1.273
les+1.267
lights+1.230
pron+1.181
adoes+1.110
Neg logit contributions
 least-1.136
 very-0.932
oh-0.875
 ourselves-0.858
 so-0.849
Token animals
Token ID4695
Feature activation-3.16e-01

Pos logit contributions
les+0.877
ork+0.874
lights+0.858
pron+0.810
adoes+0.745
Neg logit contributions
 least-0.808
 very-0.642
 ourselves-0.622
oh-0.610
 cle-0.590
Token on
Token ID319
Feature activation-3.27e+00

Pos logit contributions
ork+0.545
les+0.541
lights+0.534
pron+0.503
adoes+0.469
Neg logit contributions
 least-0.500
 very-0.392
oh-0.384
 ourselves-0.377
 am-0.365
Token it
Token ID340
Feature activation-4.67e-01

Pos logit contributions
les+6.623
lights+6.412
pron+6.079
ravings+6.076
ork+6.021
Neg logit contributions
 very-4.157
 least-4.064
 so-3.414
 am-3.370
 ourselves-3.349
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
les+0.801
ork+0.796
lights+0.781
pron+0.740
adoes+0.691
Neg logit contributions
 least-0.738
 very-0.584
oh-0.562
 ourselves-0.554
 am-0.536
Token Every
Token ID3887
Feature activation-3.14e-01

Pos logit contributions
les+2.180
ork+2.120
lights+2.103
pron+1.958
adoes+1.861
Neg logit contributions
 least-1.939
 very-1.566
oh-1.501
 am-1.469
 ourselves-1.453
Token morning
Token ID3329
Feature activation-2.36e-01

Pos logit contributions
les+0.543
ork+0.535
lights+0.532
pron+0.492
adoes+0.472
Neg logit contributions
 least-0.496
 very-0.387
oh-0.383
 ourselves-0.363
 am-0.361
Token,
Token ID11
Feature activation-7.64e-01

Pos logit contributions
les+0.472
ork+0.467
lights+0.462
pron+0.437
adoes+0.416
Neg logit contributions
 least-0.305
oh-0.226
 very-0.224
 ourselves-0.213
 am-0.207
Token and
Token ID290
Feature activation+8.96e-01

Pos logit contributions
les+0.003
ork+0.003
lights+0.003
pron+0.003
adoes+0.002
Neg logit contributions
 least-0.003
oh-0.002
 very-0.002
 ourselves-0.002
 cle-0.002
Token had
Token ID550
Feature activation+1.45e+00

Pos logit contributions
 least+1.490
oh+1.169
 ourselves+1.140
 cle+1.107
 am+1.074
Neg logit contributions
les-1.471
ork-1.417
lights-1.408
pron-1.329
lf-1.251
Token fun
Token ID1257
Feature activation-1.25e+00

Pos logit contributions
 least+1.853
oh+1.369
 ourselves+1.262
 am+1.193
 oh+1.182
Neg logit contributions
les-2.922
ork-2.903
lights-2.794
pron-2.708
lf-2.524
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
ork+2.448
les+2.446
lights+2.344
pron+2.332
adoes+2.108
Neg logit contributions
 least-1.696
 very-1.395
 ourselves-1.273
 so-1.246
oh-1.201
Token The
Token ID383
Feature activation-1.14e+00

Pos logit contributions
les+0.025
ork+0.024
lights+0.024
pron+0.023
adoes+0.021
Neg logit contributions
 least-0.022
oh-0.017
 very-0.017
 ourselves-0.017
 cle-0.016
Token anxious
Token ID18116
Feature activation-3.27e+00

Pos logit contributions
les+2.241
ork+2.240
lights+2.171
pron+2.031
adoes+1.948
Neg logit contributions
 least-1.642
 very-1.280
 cle-1.168
oh-1.141
 am-1.118
Token feeling
Token ID4203
Feature activation-4.49e-01

Pos logit contributions
pron+6.746
ork+6.504
lights+6.418
les+6.079
anne+5.881
Neg logit contributions
 least-4.323
 very-3.831
 so-3.259
 am-3.184
'm-3.144
Token was
Token ID373
Feature activation-4.27e-01

Pos logit contributions
ork+0.794
les+0.790
lights+0.778
pron+0.745
adoes+0.691
Neg logit contributions
 least-0.690
 very-0.540
oh-0.529
 ourselves-0.505
 am-0.497
Token gone
Token ID3750
Feature activation-2.12e-03

Pos logit contributions
les+0.944
ork+0.938
lights+0.913
pron+0.866
adoes+0.828
Neg logit contributions
 least-0.496
 very-0.352
oh-0.327
 ourselves-0.321
 cle-0.309
Token.
Token ID13
Feature activation+2.25e-01

Pos logit contributions
les+0.004
ork+0.004
lights+0.003
pron+0.003
adoes+0.003
Neg logit contributions
 least-0.003
oh-0.003
 very-0.003
 ourselves-0.003
 cle-0.003
Token The
Token ID383
Feature activation-1.01e+00

Pos logit contributions
 least+0.351
oh+0.268
 very+0.262
 ourselves+0.261
 cle+0.254
Neg logit contributions
ork-0.393
les-0.393
lights-0.383
pron-0.361
adoes-0.338
Token borrow
Token ID8804
Feature activation-1.40e+00

Pos logit contributions
 least+0.042
oh+0.025
 very+0.025
 ourselves+0.024
 cle+0.022
Neg logit contributions
les-0.108
ork-0.107
lights-0.106
pron-0.101
adoes-0.097
Token the
Token ID262
Feature activation-7.38e-01

Pos logit contributions
lights+3.168
ork+3.103
les+3.083
pron+3.042
lf+2.826
Neg logit contributions
 least-1.546
 very-1.151
 ourselves-1.099
 oh-1.007
 so-1.003
Token cane
Token ID33009
Feature activation+3.29e-01

Pos logit contributions
lights+0.885
ork+0.872
les+0.871
pron+0.750
lf+0.682
Neg logit contributions
 least-1.562
 very-1.330
 cle-1.280
 ourselves-1.273
oh-1.264
Token.
Token ID13
Feature activation-3.37e-01

Pos logit contributions
 least+0.542
oh+0.428
 ourselves+0.413
 very+0.410
 cle+0.402
Neg logit contributions
les-0.539
ork-0.538
lights-0.521
pron-0.492
adoes-0.458
Token The
Token ID383
Feature activation-1.26e+00

Pos logit contributions
les+0.629
ork+0.616
lights+0.612
pron+0.575
adoes+0.538
Neg logit contributions
 least-0.486
oh-0.372
 very-0.370
 ourselves-0.354
 am-0.345
Token poor
Token ID3595
Feature activation-3.26e+00

Pos logit contributions
lights+2.196
ork+2.172
les+2.142
pron+1.839
adoes+1.824
Neg logit contributions
 least-2.065
 very-1.758
 cle-1.644
 oh-1.582
 am-1.533
Token man
Token ID582
Feature activation-2.83e-01

Pos logit contributions
lights+7.524
ork+7.392
les+6.692
anne+6.540
ilt+6.443
Neg logit contributions
 very-3.604
 least-3.390
 oh-3.107
 so-2.979
 say-2.876
Token was
Token ID373
Feature activation+3.31e-01

Pos logit contributions
ork+0.572
les+0.572
lights+0.568
pron+0.534
adoes+0.508
Neg logit contributions
 least-0.361
oh-0.261
 very-0.260
 cle-0.241
 ourselves-0.240
Token kind
Token ID1611
Feature activation-1.00e+00

Pos logit contributions
 least+0.408
oh+0.290
 ourselves+0.279
 very+0.266
 am+0.259
Neg logit contributions
les-0.679
ork-0.679
lights-0.663
pron-0.641
adoes-0.601
Token and
Token ID290
Feature activation+1.67e-01

Pos logit contributions
ork+1.839
lights+1.816
les+1.810
pron+1.756
adoes+1.599
Neg logit contributions
 least-1.490
 very-1.209
 so-1.133
oh-1.125
 cle-1.061
Token said
Token ID531
Feature activation+1.86e-01

Pos logit contributions
 least+0.260
oh+0.200
 ourselves+0.195
 very+0.194
 cle+0.186
Neg logit contributions
les-0.292
ork-0.290
lights-0.283
pron-0.269
adoes-0.250
Token play
Token ID711
Feature activation-1.22e+00

Pos logit contributions
les+0.028
ork+0.028
lights+0.027
pron+0.026
adoes+0.024
Neg logit contributions
 least-0.026
oh-0.020
 very-0.020
 ourselves-0.019
 cle-0.019
Token with
Token ID351
Feature activation-1.17e+00

Pos logit contributions
lights+2.296
les+2.241
ork+2.230
pron+2.041
anne+1.956
Neg logit contributions
 least-1.723
 very-1.430
oh-1.323
 oh-1.319
 ourselves-1.289
Token her
Token ID607
Feature activation-1.01e-01

Pos logit contributions
les+2.635
ork+2.635
lights+2.610
pron+2.471
anne+2.305
Neg logit contributions
 least-1.156
 very-0.891
 ourselves-0.785
oh-0.742
 oh-0.738
Token dolls
Token ID36062
Feature activation-6.18e-01

Pos logit contributions
les+0.161
ork+0.159
lights+0.155
pron+0.145
adoes+0.134
Neg logit contributions
 least-0.176
 very-0.138
oh-0.138
 ourselves-0.136
 cle-0.132
Token and
Token ID290
Feature activation+1.12e-01

Pos logit contributions
ork+1.108
les+1.102
lights+1.094
pron+1.014
adoes+0.969
Neg logit contributions
 least-0.966
 very-0.764
oh-0.712
 ourselves-0.693
 so-0.685
Token wear
Token ID5806
Feature activation-3.26e+00

Pos logit contributions
 least+0.176
oh+0.136
 very+0.133
 ourselves+0.132
 cle+0.128
Neg logit contributions
les-0.194
ork-0.192
lights-0.188
pron-0.178
adoes-0.166
Token pretty
Token ID2495
Feature activation-2.58e+00

Pos logit contributions
ork+7.351
lights+7.128
emaker+7.067
les+6.840
ravings+6.730
Neg logit contributions
 very-3.660
 so-3.089
 least-2.951
 are-2.455
 say-2.402
Token dresses
Token ID27309
Feature activation-1.77e+00

Pos logit contributions
ork+4.290
pron+3.994
lights+3.966
les+3.809
emaker+3.777
Neg logit contributions
 least-3.768
 very-3.590
 so-3.225
 ourselves-3.050
 am-3.039
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
ork+3.449
les+3.368
lights+3.330
pron+3.207
adoes+3.162
Neg logit contributions
 least-2.367
 very-2.055
 so-1.851
 ourselves-1.728
oh-1.691
Token One
Token ID1881
Feature activation-2.33e-02

Pos logit contributions
les+1.632
ork+1.601
lights+1.595
pron+1.447
adoes+1.371
Neg logit contributions
 least-2.247
 very-1.900
oh-1.823
 am-1.800
 ourselves-1.760
Token day
Token ID1110
Feature activation+2.62e-01

Pos logit contributions
les+0.039
ork+0.039
lights+0.038
pron+0.036
adoes+0.034
Neg logit contributions
 least-0.038
oh-0.029
 very-0.029
 ourselves-0.028
 cle-0.027
Token girl
Token ID2576
Feature activation+2.12e-02

Pos logit contributions
 least+0.366
oh+0.278
 ourselves+0.272
 very+0.269
 cle+0.261
Neg logit contributions
les-0.433
ork-0.429
lights-0.419
pron-0.399
adoes-0.371
Token named
Token ID3706
Feature activation+1.85e-01

Pos logit contributions
 least+0.034
oh+0.026
 very+0.026
 ourselves+0.025
 cle+0.024
Neg logit contributions
les-0.037
ork-0.036
lights-0.035
pron-0.033
adoes-0.031
Token Lucy
Token ID22162
Feature activation+7.92e-01

Pos logit contributions
 least+0.332
oh+0.262
 very+0.259
 ourselves+0.258
 cle+0.252
Neg logit contributions
ork-0.281
les-0.281
lights-0.272
pron-0.255
adoes-0.235
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
 least+1.454
oh+1.140
 ourselves+1.136
 cle+1.128
 oh+1.073
Neg logit contributions
les-1.181
ork-1.170
lights-1.102
pron-1.060
lf-0.968
Token Lucy
Token ID22162
Feature activation+1.04e+00

Pos logit contributions
les+0.539
ork+0.533
lights+0.517
pron+0.495
adoes+0.461
Neg logit contributions
 least-0.407
 very-0.312
oh-0.309
 ourselves-0.301
 am-0.292
Token was
Token ID373
Feature activation+1.85e+00

Pos logit contributions
 least+1.743
 ourselves+1.346
 cle+1.303
oh+1.294
 oh+1.263
Neg logit contributions
les-1.729
ork-1.721
lights-1.606
pron-1.577
lf-1.479
Token nervous
Token ID10927
Feature activation-2.41e-01

Pos logit contributions
 least+2.068
oh+1.541
 ourselves+1.540
 am+1.197
 cle+1.138
Neg logit contributions
ork-3.807
pron-3.720
les-3.690
lights-3.643
adoes-3.438
Token to
Token ID284
Feature activation-2.96e-01

Pos logit contributions
ork+0.432
les+0.432
lights+0.419
pron+0.403
adoes+0.372
Neg logit contributions
 least-0.368
 very-0.284
oh-0.278
 ourselves-0.272
 cle-0.261
Token go
Token ID467
Feature activation+8.74e-01

Pos logit contributions
les+0.608
ork+0.603
lights+0.600
pron+0.567
adoes+0.533
Neg logit contributions
 least-0.380
 very-0.271
oh-0.267
 ourselves-0.262
 am-0.253
Token outside
Token ID2354
Feature activation+1.30e+00

Pos logit contributions
 least+1.440
oh+1.160
 ourselves+1.107
 cle+1.083
 oh+1.059
Neg logit contributions
les-1.442
ork-1.420
lights-1.371
pron-1.284
adoes-1.188
Token and
Token ID290
Feature activation+3.26e-02

Pos logit contributions
 least+2.414
oh+2.009
 ourselves+1.959
 cle+1.927
 am+1.842
Neg logit contributions
les-1.822
ork-1.778
lights-1.764
pron-1.626
lf-1.516
Token their
Token ID511
Feature activation-1.96e-01

Pos logit contributions
ork+0.770
les+0.769
lights+0.749
pron+0.731
lf+0.672
Neg logit contributions
 least-0.577
 very-0.428
 ourselves-0.420
oh-0.415
 cle-0.395
Token map
Token ID3975
Feature activation-7.92e-01

Pos logit contributions
les+0.311
ork+0.306
lights+0.297
pron+0.282
adoes+0.258
Neg logit contributions
 least-0.341
 very-0.267
oh-0.264
 ourselves-0.263
 cle-0.256
Token.
Token ID13
Feature activation-5.26e-01

Pos logit contributions
les+1.374
ork+1.365
lights+1.339
pron+1.287
adoes+1.206
Neg logit contributions
 least-1.250
 very-0.970
oh-0.934
 ourselves-0.910
 so-0.887
Token They
Token ID1119
Feature activation+6.24e-01

Pos logit contributions
les+0.998
ork+0.964
lights+0.950
pron+0.920
lf+0.855
Neg logit contributions
 least-0.754
 very-0.573
oh-0.566
 ourselves-0.564
 am-0.544
Token looked
Token ID3114
Feature activation-3.48e-02

Pos logit contributions
 least+0.902
 ourselves+0.689
oh+0.687
 cle+0.645
 very+0.639
Neg logit contributions
les-1.141
ork-1.140
lights-1.102
pron-1.048
lf-0.982
Token everywhere
Token ID8347
Feature activation+1.81e+00

Pos logit contributions
les+0.063
ork+0.063
lights+0.062
pron+0.059
adoes+0.055
Neg logit contributions
 least-0.052
 very-0.039
oh-0.039
 ourselves-0.038
 cle-0.037
Token,
Token ID11
Feature activation+3.64e-01

Pos logit contributions
 least+3.366
oh+2.746
 cle+2.599
 ourselves+2.564
'm+2.479
Neg logit contributions
lights-2.757
ork-2.718
les-2.712
pron-2.383
lf-2.281
Token but
Token ID475
Feature activation+5.33e-01

Pos logit contributions
 least+0.512
oh+0.387
 ourselves+0.372
 very+0.366
 cle+0.357
Neg logit contributions
les-0.683
ork-0.680
lights-0.669
pron-0.633
adoes-0.594
Token it
Token ID340
Feature activation+8.24e-02

Pos logit contributions
 least+0.903
oh+0.716
 ourselves+0.688
 very+0.681
 cle+0.674
Neg logit contributions
ork-0.848
les-0.840
lights-0.834
pron-0.767
adoes-0.719
Token was
Token ID373
Feature activation-1.06e+00

Pos logit contributions
 least+0.109
oh+0.079
 very+0.077
 ourselves+0.076
 cle+0.073
Neg logit contributions
les-0.164
ork-0.163
lights-0.160
pron-0.152
adoes-0.143
Token gone
Token ID3750
Feature activation+2.11e-02

Pos logit contributions
les+1.999
ork+1.912
lights+1.823
pron+1.771
adoes+1.667
Neg logit contributions
 least-1.656
 very-1.332
 ourselves-1.182
 so-1.178
 cle-1.151
Token was
Token ID373
Feature activation-1.85e-02

Pos logit contributions
les+0.056
ork+0.056
lights+0.054
pron+0.052
adoes+0.049
Neg logit contributions
 least-0.036
oh-0.026
 very-0.026
 ourselves-0.025
 cle-0.024
Token her
Token ID607
Feature activation+1.07e+00

Pos logit contributions
les+0.040
ork+0.040
lights+0.039
pron+0.037
adoes+0.035
Neg logit contributions
 least-0.021
oh-0.014
 very-0.014
 ourselves-0.014
 cle-0.013
Token daughter
Token ID4957
Feature activation+2.42e-02

Pos logit contributions
 least+1.863
oh+1.520
 ourselves+1.491
 am+1.413
 very+1.408
Neg logit contributions
ork-1.629
les-1.591
lights-1.549
pron-1.510
adoes-1.340
Token's
Token ID338
Feature activation+1.04e+00

Pos logit contributions
 least+0.035
oh+0.026
 very+0.026
 ourselves+0.026
 cle+0.025
Neg logit contributions
les-0.045
ork-0.044
lights-0.043
pron-0.041
adoes-0.039
Token favourite
Token ID12507
Feature activation+3.31e-01

Pos logit contributions
 least+1.845
oh+1.487
 ourselves+1.464
 am+1.382
 cle+1.381
Neg logit contributions
ork-1.590
les-1.557
lights-1.504
pron-1.459
adoes-1.295
Token toy
Token ID13373
Feature activation+1.59e+00

Pos logit contributions
 least+0.588
oh+0.469
 ourselves+0.458
 very+0.453
 cle+0.445
Neg logit contributions
les-0.507
ork-0.502
lights-0.486
pron-0.460
adoes-0.421
Token -
Token ID532
Feature activation-3.11e-01

Pos logit contributions
 least+2.893
 ourselves+2.318
oh+2.303
 cle+2.252
 am+2.151
Neg logit contributions
les-2.412
ork-2.402
lights-2.294
pron-2.182
lf-2.002
Token a
Token ID257
Feature activation+4.42e-01

Pos logit contributions
les+0.659
ork+0.653
lights+0.643
pron+0.607
adoes+0.578
Neg logit contributions
 least-0.369
 very-0.260
oh-0.254
 ourselves-0.247
 cle-0.237
Token big
Token ID1263
Feature activation+1.54e-01

Pos logit contributions
 least+0.808
oh+0.644
 ourselves+0.638
 very+0.617
 cle+0.609
Neg logit contributions
ork-0.652
les-0.648
lights-0.625
pron-0.607
lf-0.537
Token,
Token ID11
Feature activation+1.31e-01

Pos logit contributions
 least+0.270
oh+0.214
 very+0.210
 ourselves+0.210
 cle+0.204
Neg logit contributions
les-0.237
ork-0.235
lights-0.229
pron-0.216
adoes-0.198
Token red
Token ID2266
Feature activation+7.04e-02

Pos logit contributions
 least+0.221
oh+0.174
 very+0.170
 ourselves+0.170
 cle+0.164
Neg logit contributions
les-0.209
ork-0.208
lights-0.202
pron-0.191
adoes-0.176
Token you
Token ID345
Feature activation+1.43e-01

Pos logit contributions
 least+0.909
oh+0.696
 very+0.670
 cle+0.657
 am+0.656
Neg logit contributions
les-1.087
ork-1.073
lights-1.035
pron-0.981
adoes-0.925
Token something
Token ID1223
Feature activation+9.42e-01

Pos logit contributions
 least+0.187
oh+0.136
 very+0.132
 ourselves+0.131
 cle+0.126
Neg logit contributions
les-0.284
ork-0.283
lights-0.277
pron-0.264
adoes-0.249
Token to
Token ID284
Feature activation-6.58e-01

Pos logit contributions
 least+1.451
oh+1.124
 ourselves+1.050
 cle+1.020
 am+1.013
Neg logit contributions
les-1.655
ork-1.636
lights-1.614
pron-1.525
adoes-1.398
Token help
Token ID1037
Feature activation+6.91e-01

Pos logit contributions
les+1.335
lights+1.314
ork+1.308
pron+1.223
adoes+1.167
Neg logit contributions
 least-0.872
 very-0.624
oh-0.618
 ourselves-0.615
 am-0.595
Token."
Token ID526
Feature activation+1.65e+00

Pos logit contributions
 least+0.887
oh+0.629
 cle+0.610
 very+0.603
 ourselves+0.578
Neg logit contributions
les-1.401
ork-1.383
lights-1.352
pron-1.287
adoes-1.227
Token\n
Token ID198
Feature activation+1.63e+00

Pos logit contributions
 least+2.933
oh+2.267
 ourselves+2.267
 cle+2.235
 very+2.205
Neg logit contributions
les-2.566
ork-2.534
lights-2.335
pron-2.208
adoes-2.046
Token\n
Token ID198
Feature activation+2.09e+00

Pos logit contributions
 least+2.983
 ourselves+2.260
 cle+2.252
oh+2.251
 oh+2.214
Neg logit contributions
ork-2.597
les-2.490
lights-2.299
pron-2.236
lf-2.062
TokenWhen
Token ID2215
Feature activation+4.73e-01

Pos logit contributions
 least+3.782
 very+2.883
 ourselves+2.802
 cle+2.762
 kidding+2.733
Neg logit contributions
ork-3.416
les-3.282
pron-2.999
lights-2.903
lf-2.733
Token they
Token ID484
Feature activation-2.70e-01

Pos logit contributions
 least+0.734
oh+0.567
 very+0.549
 ourselves+0.548
 cle+0.535
Neg logit contributions
ork-0.816
les-0.814
lights-0.797
pron-0.751
adoes-0.699
Token arrived
Token ID5284
Feature activation+3.75e-01

Pos logit contributions
les+0.456
ork+0.455
lights+0.449
pron+0.422
adoes+0.400
Neg logit contributions
 least-0.445
oh-0.343
 very-0.341
 ourselves-0.324
 am-0.322
Token home
Token ID1363
Feature activation+1.50e+00

Pos logit contributions
 least+0.558
oh+0.420
 ourselves+0.406
 very+0.406
 cle+0.396
Neg logit contributions
les-0.686
ork-0.684
lights-0.668
pron-0.626
adoes-0.589
TokenSuddenly
Token ID38582
Feature activation+5.66e-01

Pos logit contributions
les+0.648
ork+0.635
lights+0.635
pron+0.594
adoes+0.549
Neg logit contributions
 least-0.650
oh-0.535
 very-0.523
 ourselves-0.508
 am-0.494
Token,
Token ID11
Feature activation+3.75e-01

Pos logit contributions
 least+0.840
oh+0.611
 ourselves+0.605
 very+0.588
 cle+0.587
Neg logit contributions
ork-1.037
les-1.036
lights-1.009
pron-0.940
adoes-0.902
Token the
Token ID262
Feature activation-2.99e-01

Pos logit contributions
 least+0.520
oh+0.378
 ourselves+0.367
 very+0.362
 cle+0.355
Neg logit contributions
les-0.720
ork-0.718
lights-0.704
pron-0.665
adoes-0.630
Token jog
Token ID48342
Feature activation-7.68e-01

Pos logit contributions
ork+0.490
les+0.489
lights+0.482
pron+0.452
adoes+0.420
Neg logit contributions
 least-0.494
 very-0.389
oh-0.385
 ourselves-0.376
 cle-0.369
Tokenger
Token ID1362
Feature activation-8.36e-02

Pos logit contributions
lights+1.106
ork+1.097
les+1.096
pron+1.043
adoes+0.957
Neg logit contributions
 least-1.373
oh-1.157
 very-1.149
 am-1.090
 cle-1.061
Token stopped
Token ID5025
Feature activation+1.88e+00

Pos logit contributions
les+0.142
ork+0.142
lights+0.139
pron+0.131
adoes+0.122
Neg logit contributions
 least-0.133
oh-0.103
 very-0.102
 ourselves-0.099
 cle-0.097
Token running
Token ID2491
Feature activation+9.19e-01

Pos logit contributions
 least+3.664
oh+3.072
 ourselves+2.947
 cle+2.866
 oh+2.784
Neg logit contributions
les-2.624
ork-2.468
lights-2.353
pron-2.103
adoes-2.055
Token and
Token ID290
Feature activation+3.02e-01

Pos logit contributions
 least+1.663
oh+1.337
 ourselves+1.290
 cle+1.263
 am+1.244
Neg logit contributions
les-1.413
ork-1.384
lights-1.330
pron-1.176
adoes-1.168
Token waved
Token ID26834
Feature activation-6.09e-01

Pos logit contributions
 least+0.487
oh+0.375
 ourselves+0.369
 very+0.367
 cle+0.357
Neg logit contributions
les-0.511
ork-0.505
lights-0.492
pron-0.466
adoes-0.434
Token with
Token ID351
Feature activation+2.24e-01

Pos logit contributions
ork+1.092
les+1.080
lights+1.067
pron+1.035
lf+0.962
Neg logit contributions
 least-0.899
 very-0.725
oh-0.690
 ourselves-0.682
 cle-0.662
Token a
Token ID257
Feature activation+1.15e-01

Pos logit contributions
 least+0.317
oh+0.235
 very+0.227
 ourselves+0.227
 cle+0.218
Neg logit contributions
les-0.426
ork-0.423
lights-0.415
pron-0.391
adoes-0.370
Token it
Token ID340
Feature activation-4.31e-01

Pos logit contributions
ork+0.435
les+0.434
lights+0.423
pron+0.399
adoes+0.370
Neg logit contributions
 least-0.447
 very-0.351
oh-0.346
 ourselves-0.342
 cle-0.332
Token in
Token ID287
Feature activation+6.68e-01

Pos logit contributions
les+0.849
ork+0.848
lights+0.827
pron+0.788
adoes+0.742
Neg logit contributions
 least-0.570
 very-0.426
oh-0.411
 ourselves-0.402
 cle-0.388
Token her
Token ID607
Feature activation+6.33e-01

Pos logit contributions
 least+0.838
oh+0.618
 ourselves+0.570
 very+0.556
 cle+0.553
Neg logit contributions
ork-1.356
les-1.335
lights-1.315
pron-1.271
adoes-1.183
Token bucket
Token ID19236
Feature activation-2.10e-01

Pos logit contributions
 least+0.985
oh+0.773
 ourselves+0.748
 very+0.724
 am+0.707
Neg logit contributions
ork-1.104
les-1.091
lights-1.067
pron-1.030
adoes-0.933
Token with
Token ID351
Feature activation+2.65e-02

Pos logit contributions
les+0.351
ork+0.350
lights+0.339
pron+0.321
adoes+0.300
Neg logit contributions
 least-0.340
oh-0.264
 very-0.264
 ourselves-0.257
 cle-0.248
Token the
Token ID262
Feature activation+7.09e-01

Pos logit contributions
 least+0.034
oh+0.024
 very+0.024
 ourselves+0.023
 cle+0.022
Neg logit contributions
les-0.054
ork-0.053
lights-0.052
pron-0.050
adoes-0.047
Token other
Token ID584
Feature activation+8.90e-01

Pos logit contributions
 least+1.182
oh+0.943
 ourselves+0.910
 very+0.894
 am+0.874
Neg logit contributions
les-1.148
ork-1.135
lights-1.098
pron-1.053
adoes-0.948
Token shells
Token ID19679
Feature activation+1.72e-01

Pos logit contributions
 least+1.532
oh+1.221
 very+1.177
 ourselves+1.174
 cle+1.149
Neg logit contributions
les-1.429
ork-1.367
lights-1.353
pron-1.239
adoes-1.145
Token she
Token ID673
Feature activation+4.82e-01

Pos logit contributions
 least+0.280
oh+0.218
 very+0.214
 ourselves+0.213
 cle+0.206
Neg logit contributions
les-0.286
ork-0.284
lights-0.277
pron-0.261
adoes-0.242
Token found
Token ID1043
Feature activation+2.18e-01

Pos logit contributions
 least+0.779
 ourselves+0.604
oh+0.604
 very+0.587
 cle+0.570
Neg logit contributions
les-0.813
ork-0.802
lights-0.776
pron-0.745
lf-0.677
Token.
Token ID13
Feature activation-1.95e-01

Pos logit contributions
 least+0.350
oh+0.270
 very+0.264
 ourselves+0.263
 cle+0.256
Neg logit contributions
les-0.369
ork-0.366
lights-0.357
pron-0.337
adoes-0.313
TokenThe
Token ID464
Feature activation-6.60e-01

Pos logit contributions
les+0.893
lights+0.883
ork+0.876
pron+0.820
adoes+0.774
Neg logit contributions
 least-0.638
oh-0.503
 very-0.488
 ourselves-0.472
 am-0.455
Token woman
Token ID2415
Feature activation-4.68e-01

Pos logit contributions
les+1.054
ork+1.053
lights+1.044
pron+0.936
adoes+0.904
Neg logit contributions
 least-1.131
 very-0.924
oh-0.878
 cle-0.861
 ourselves-0.849
Token smiled
Token ID13541
Feature activation+1.07e-01

Pos logit contributions
ork+0.892
les+0.880
lights+0.880
pron+0.831
adoes+0.785
Neg logit contributions
 least-0.653
oh-0.494
 very-0.490
 ourselves-0.449
 cle-0.445
Token at
Token ID379
Feature activation+9.38e-02

Pos logit contributions
 least+0.184
oh+0.145
 very+0.142
 ourselves+0.141
 cle+0.137
Neg logit contributions
les-0.170
ork-0.169
lights-0.165
pron-0.155
adoes-0.143
Token the
Token ID262
Feature activation-1.06e-01

Pos logit contributions
 least+0.118
oh+0.084
 very+0.082
 ourselves+0.081
 cle+0.078
Neg logit contributions
les-0.192
ork-0.191
lights-0.188
pron-0.179
adoes-0.169
Token girl
Token ID2576
Feature activation-8.34e-02

Pos logit contributions
les+0.163
ork+0.163
lights+0.159
pron+0.148
adoes+0.137
Neg logit contributions
 least-0.186
 very-0.148
oh-0.147
 ourselves-0.144
 cle-0.141
Token and
Token ID290
Feature activation+2.15e-01

Pos logit contributions
les+0.137
ork+0.136
lights+0.133
pron+0.126
adoes+0.117
Neg logit contributions
 least-0.138
oh-0.108
 very-0.107
 ourselves-0.105
 cle-0.102
Token said
Token ID531
Feature activation+2.03e-01

Pos logit contributions
 least+0.275
oh+0.197
 ourselves+0.191
 very+0.190
 cle+0.181
Neg logit contributions
les-0.435
ork-0.431
lights-0.422
pron-0.404
adoes-0.380
Token,
Token ID11
Feature activation-5.84e-02

Pos logit contributions
 least+0.284
oh+0.208
 ourselves+0.202
 very+0.202
 cle+0.195
Neg logit contributions
les-0.388
ork-0.387
lights-0.378
pron-0.359
adoes-0.338
Token "
Token ID366
Feature activation-4.45e-01

Pos logit contributions
les+0.101
ork+0.100
lights+0.098
pron+0.092
adoes+0.086
Neg logit contributions
 least-0.091
oh-0.070
 very-0.070
 ourselves-0.068
 cle-0.066
TokenHow
Token ID2437
Feature activation+4.30e-01

Pos logit contributions
ork+0.943
les+0.929
lights+0.921
pron+0.865
adoes+0.827
Neg logit contributions
 least-0.546
oh-0.399
 very-0.372
 ourselves-0.368
 am-0.348
Token notice
Token ID4003
Feature activation-3.63e-01

Pos logit contributions
 least+0.790
 ourselves+0.579
oh+0.568
 very+0.543
 cle+0.531
Neg logit contributions
les-1.213
ork-1.211
lights-1.146
pron-1.133
lf-1.034
Token that
Token ID326
Feature activation-5.68e-01

Pos logit contributions
ork+0.671
les+0.666
lights+0.652
pron+0.625
adoes+0.573
Neg logit contributions
 least-0.536
 very-0.399
oh-0.398
 ourselves-0.396
 cle-0.371
Token too
Token ID1165
Feature activation-1.33e+00

Pos logit contributions
ork+0.993
les+0.986
lights+0.969
pron+0.912
adoes+0.839
Neg logit contributions
 least-0.897
 very-0.697
oh-0.682
 ourselves-0.678
 cle-0.649
Token!"
Token ID2474
Feature activation+9.58e-01

Pos logit contributions
ork+2.615
les+2.577
lights+2.544
pron+2.504
adoes+2.328
Neg logit contributions
 least-1.745
 very-1.429
oh-1.349
 ourselves-1.297
 so-1.247
Token\n
Token ID198
Feature activation+1.69e-01

Pos logit contributions
 least+1.591
oh+1.243
 ourselves+1.212
 cle+1.195
 very+1.160
Neg logit contributions
ork-1.586
les-1.581
lights-1.550
pron-1.451
adoes-1.341
Token\n
Token ID198
Feature activation+3.46e-01

Pos logit contributions
 least+0.255
oh+0.192
 very+0.188
 ourselves+0.187
 cle+0.182
Neg logit contributions
les-0.305
ork-0.305
lights-0.297
pron-0.281
adoes-0.262
TokenThey
Token ID2990
Feature activation-8.16e-02

Pos logit contributions
 least+0.581
oh+0.448
 very+0.441
 ourselves+0.440
 cle+0.430
Neg logit contributions
ork-0.570
les-0.568
lights-0.552
pron-0.524
lf-0.483
Token started
Token ID2067
Feature activation-3.92e-02

Pos logit contributions
les+0.152
ork+0.152
lights+0.149
pron+0.142
adoes+0.133
Neg logit contributions
 least-0.117
oh-0.087
 very-0.086
 ourselves-0.084
 cle-0.081
Token to
Token ID284
Feature activation+2.49e-01

Pos logit contributions
les+0.073
ork+0.073
lights+0.071
pron+0.067
adoes+0.063
Neg logit contributions
 least-0.057
oh-0.042
 very-0.042
 ourselves-0.041
 cle-0.040
Token play
Token ID711
Feature activation-1.78e-02

Pos logit contributions
 least+0.415
oh+0.326
 very+0.319
 ourselves+0.318
 cle+0.308
Neg logit contributions
les-0.404
ork-0.400
lights-0.389
pron-0.369
adoes-0.342
Token,
Token ID11
Feature activation+1.67e-01

Pos logit contributions
les+0.031
ork+0.031
lights+0.030
pron+0.028
adoes+0.026
Neg logit contributions
 least-0.028
oh-0.022
 very-0.021
 ourselves-0.021
 cle-0.020
Token your
Token ID534
Feature activation-3.75e-01

Pos logit contributions
les+0.953
ork+0.942
lights+0.928
pron+0.889
adoes+0.804
Neg logit contributions
 least-0.938
 very-0.745
 ourselves-0.735
oh-0.725
 so-0.684
Token cars
Token ID5006
Feature activation-4.64e-01

Pos logit contributions
les+0.606
ork+0.594
lights+0.585
pron+0.545
adoes+0.503
Neg logit contributions
 least-0.640
 very-0.505
oh-0.493
 ourselves-0.493
 cle-0.483
Token,
Token ID11
Feature activation-4.76e-01

Pos logit contributions
les+0.703
ork+0.696
lights+0.685
pron+0.639
adoes+0.589
Neg logit contributions
 least-0.842
 very-0.668
oh-0.666
 ourselves-0.655
 am-0.629
Token I
Token ID314
Feature activation-5.21e-01

Pos logit contributions
les+0.930
ork+0.919
lights+0.890
pron+0.851
lf+0.791
Neg logit contributions
 least-0.662
 very-0.491
 ourselves-0.478
oh-0.471
 cle-0.455
Token promise
Token ID6991
Feature activation-4.54e-02

Pos logit contributions
lights+1.172
les+1.169
ork+1.159
pron+1.103
adoes+1.060
Neg logit contributions
 least-0.568
oh-0.373
 very-0.369
 am-0.357
 ourselves-0.341
Token."
Token ID526
Feature activation+7.90e-01

Pos logit contributions
les+0.081
ork+0.081
lights+0.079
pron+0.075
adoes+0.070
Neg logit contributions
 least-0.069
oh-0.053
 very-0.052
 ourselves-0.051
 cle-0.049
Token\n
Token ID198
Feature activation+4.83e-01

Pos logit contributions
 least+1.284
oh+0.976
 ourselves+0.953
 cle+0.953
 very+0.951
Neg logit contributions
ork-1.336
les-1.311
lights-1.282
pron-1.217
adoes-1.131
Token\n
Token ID198
Feature activation+6.41e-01

Pos logit contributions
 least+0.763
oh+0.572
 very+0.561
 ourselves+0.560
 cle+0.554
Neg logit contributions
ork-0.845
les-0.836
lights-0.810
pron-0.771
lf-0.716
TokenHe
Token ID1544
Feature activation-4.70e-01

Pos logit contributions
 least+1.149
 very+0.880
oh+0.879
 ourselves+0.867
 cle+0.867
Neg logit contributions
ork-0.996
les-0.970
lights-0.941
pron-0.905
lf-0.837
Token showed
Token ID3751
Feature activation-1.22e+00

Pos logit contributions
ork+0.848
les+0.840
lights+0.839
pron+0.794
adoes+0.741
Neg logit contributions
 least-0.703
 very-0.530
oh-0.523
 cle-0.499
 ourselves-0.493
Token them
Token ID606
Feature activation-2.72e-01

Pos logit contributions
ork+2.373
pron+2.365
les+2.353
lights+2.338
lf+2.098
Neg logit contributions
 least-1.655
 very-1.239
 ourselves-1.195
oh-1.194
 cle-1.128
Token to
Token ID284
Feature activation-6.09e-01

Pos logit contributions
les+0.133
ork+0.132
lights+0.129
pron+0.123
adoes+0.115
Neg logit contributions
 least-0.106
oh-0.080
 very-0.079
 ourselves-0.078
 cle-0.075
Token do
Token ID466
Feature activation+8.58e-01

Pos logit contributions
les+1.362
lights+1.341
ork+1.338
pron+1.259
adoes+1.196
Neg logit contributions
 least-0.700
 very-0.466
oh-0.461
 ourselves-0.452
 am-0.429
Token it
Token ID340
Feature activation+1.01e+00

Pos logit contributions
 least+1.423
oh+1.149
 cle+1.053
 ourselves+1.052
 very+1.044
Neg logit contributions
les-1.399
ork-1.397
lights-1.336
pron-1.258
adoes-1.180
Token.
Token ID13
Feature activation+4.31e-01

Pos logit contributions
 least+2.107
oh+1.739
 ourselves+1.697
 cle+1.681
 very+1.651
Neg logit contributions
les-1.228
ork-1.212
lights-1.150
pron-1.037
adoes-0.948
Token She
Token ID1375
Feature activation-1.10e-01

Pos logit contributions
 least+0.611
oh+0.451
 ourselves+0.443
 very+0.440
 cle+0.427
Neg logit contributions
ork-0.812
les-0.809
lights-0.791
pron-0.751
adoes-0.703
Token was
Token ID373
Feature activation+1.50e-01

Pos logit contributions
les+0.194
ork+0.194
lights+0.190
pron+0.179
adoes+0.168
Neg logit contributions
 least-0.167
oh-0.128
 very-0.126
 ourselves-0.122
 cle-0.120
Token so
Token ID523
Feature activation+2.94e-01

Pos logit contributions
 least+0.158
oh+0.104
 ourselves+0.099
 very+0.099
 cle+0.093
Neg logit contributions
les-0.336
ork-0.334
lights-0.328
pron-0.316
adoes-0.299
Token proud
Token ID6613
Feature activation-1.74e-02

Pos logit contributions
 least+0.405
oh+0.294
 ourselves+0.288
 very+0.282
 cle+0.273
Neg logit contributions
les-0.570
ork-0.567
lights-0.554
pron-0.530
adoes-0.494
Token of
Token ID286
Feature activation+3.62e-01

Pos logit contributions
les+0.023
ork+0.023
lights+0.022
pron+0.020
adoes+0.018
Neg logit contributions
 least-0.035
oh-0.028
 very-0.028
 ourselves-0.028
 cle-0.027
Token helping
Token ID5742
Feature activation+6.85e-01

Pos logit contributions
 least+0.437
oh+0.312
 very+0.296
 ourselves+0.292
 cle+0.284
Neg logit contributions
les-0.756
ork-0.753
lights-0.739
pron-0.704
adoes-0.666
Token to
Token ID284
Feature activation-2.99e-01

Pos logit contributions
 least+1.035
oh+0.793
 ourselves+0.754
 cle+0.746
 very+0.744
Neg logit contributions
les-1.222
ork-1.219
lights-1.175
pron-1.124
adoes-1.049
Token quickly
Token ID2952
Feature activation-3.49e-01

Pos logit contributions
ork+0.760
les+0.757
lights+0.753
pron+0.712
adoes+0.664
Neg logit contributions
 least-0.614
oh-0.459
 very-0.455
 ourselves-0.430
 am-0.424
Token unt
Token ID1418
Feature activation-4.34e-01

Pos logit contributions
les+0.604
ork+0.602
lights+0.599
pron+0.564
adoes+0.523
Neg logit contributions
 least-0.546
oh-0.420
 very-0.417
 ourselves-0.400
 cle-0.394
Tokenied
Token ID798
Feature activation-8.28e-01

Pos logit contributions
ork+0.420
les+0.398
lights+0.396
pron+0.372
lf+0.312
Neg logit contributions
 least-1.014
oh-0.855
 ourselves-0.849
 very-0.844
 cle-0.817
Token the
Token ID262
Feature activation+5.72e-01

Pos logit contributions
ork+1.517
les+1.504
lights+1.487
pron+1.455
lf+1.325
Neg logit contributions
 least-1.216
 very-0.934
 ourselves-0.912
oh-0.888
 cle-0.858
Token ribbon
Token ID29092
Feature activation+1.16e-01

Pos logit contributions
 least+1.044
oh+0.851
 ourselves+0.826
 very+0.814
 am+0.793
Neg logit contributions
ork-0.832
les-0.831
lights-0.798
pron-0.761
adoes-0.694
Token and
Token ID290
Feature activation-5.27e-01

Pos logit contributions
 least+0.192
oh+0.150
 very+0.147
 ourselves+0.146
 cle+0.142
Neg logit contributions
les-0.190
ork-0.189
lights-0.184
pron-0.173
adoes-0.161
Token Benny
Token ID44275
Feature activation+7.38e-01

Pos logit contributions
ork+0.926
les+0.921
lights+0.903
pron+0.857
adoes+0.786
Neg logit contributions
 least-0.834
 very-0.631
oh-0.627
 ourselves-0.614
 cle-0.599
Token felt
Token ID2936
Feature activation+5.86e-01

Pos logit contributions
 least+1.146
 ourselves+0.888
oh+0.864
 very+0.837
 cle+0.827
Neg logit contributions
les-1.310
ork-1.286
lights-1.210
pron-1.177
lf-1.089
Token much
Token ID881
Feature activation-3.59e-02

Pos logit contributions
 least+0.800
oh+0.604
 ourselves+0.579
 very+0.540
 cle+0.538
Neg logit contributions
les-1.135
ork-1.123
lights-1.091
pron-1.042
adoes-0.975
Token better
Token ID1365
Feature activation+5.22e-01

Pos logit contributions
les+0.077
ork+0.076
lights+0.075
pron+0.072
adoes+0.068
Neg logit contributions
 least-0.042
oh-0.029
 very-0.028
 ourselves-0.028
 cle-0.027
Token.
Token ID13
Feature activation-3.56e-01

Pos logit contributions
 least+0.894
oh+0.707
 ourselves+0.689
 very+0.683
 cle+0.677
Neg logit contributions
les-0.827
ork-0.823
lights-0.786
pron-0.742
adoes-0.687
Token quarrel
Token ID41998
Feature activation+1.96e-01

Pos logit contributions
les+0.294
ork+0.293
lights+0.289
pron+0.276
adoes+0.259
Neg logit contributions
 least-0.150
oh-0.101
 very-0.100
 ourselves-0.099
 am-0.092
Token and
Token ID290
Feature activation-1.11e-01

Pos logit contributions
 least+0.337
oh+0.266
 very+0.260
 ourselves+0.259
 cle+0.253
Neg logit contributions
les-0.308
ork-0.305
lights-0.298
pron-0.278
adoes-0.259
Token was
Token ID373
Feature activation-5.32e-01

Pos logit contributions
les+0.204
ork+0.203
lights+0.198
pron+0.188
adoes+0.175
Neg logit contributions
 least-0.164
oh-0.123
 very-0.122
 ourselves-0.120
 cle-0.116
Token grateful
Token ID14066
Feature activation-2.37e-01

Pos logit contributions
ork+1.101
les+1.098
lights+1.076
pron+1.016
adoes+0.954
Neg logit contributions
 least-0.688
 very-0.514
oh-0.471
 ourselves-0.469
 cle-0.459
Token the
Token ID262
Feature activation-8.38e-01

Pos logit contributions
les+0.439
ork+0.436
lights+0.424
pron+0.407
adoes+0.378
Neg logit contributions
 least-0.347
 very-0.261
oh-0.257
 ourselves-0.255
 cle-0.244
Token little
Token ID1310
Feature activation-4.84e-01

Pos logit contributions
les+1.403
ork+1.398
lights+1.376
pron+1.262
adoes+1.197
Neg logit contributions
 least-1.389
 very-1.103
 cle-1.049
oh-1.032
 ourselves-1.028
Token child
Token ID1200
Feature activation+7.27e-02

Pos logit contributions
ork+0.831
les+0.825
lights+0.804
pron+0.758
adoes+0.706
Neg logit contributions
 least-0.773
 very-0.596
oh-0.586
 ourselves-0.573
 cle-0.570
Token showed
Token ID3751
Feature activation-1.18e-01

Pos logit contributions
 least+0.104
oh+0.077
 very+0.076
 ourselves+0.075
 cle+0.072
Neg logit contributions
les-0.136
ork-0.135
lights-0.132
pron-0.126
adoes-0.118
Token him
Token ID683
Feature activation-6.42e-01

Pos logit contributions
les+0.219
ork+0.219
lights+0.214
pron+0.205
adoes+0.190
Neg logit contributions
 least-0.169
oh-0.126
 very-0.125
 ourselves-0.123
 cle-0.118
Token the
Token ID262
Feature activation-2.17e-01

Pos logit contributions
les+1.135
ork+1.133
lights+1.108
pron+1.070
lf+0.982
Neg logit contributions
 least-0.986
 very-0.772
 ourselves-0.747
oh-0.720
 cle-0.714
Token right
Token ID826
Feature activation-1.73e+00

Pos logit contributions
les+0.346
ork+0.342
lights+0.336
pron+0.312
adoes+0.290
Neg logit contributions
 least-0.373
 very-0.292
oh-0.289
 ourselves-0.285
 cle-0.281
Token and
Token ID290
Feature activation+3.45e-01

Pos logit contributions
 least+0.788
oh+0.609
 very+0.587
 ourselves+0.586
 cle+0.576
Neg logit contributions
les-0.878
ork-0.876
lights-0.854
pron-0.804
adoes-0.744
Token over
Token ID625
Feature activation+1.08e+00

Pos logit contributions
 least+0.502
oh+0.381
 ourselves+0.367
 very+0.367
 cle+0.354
Neg logit contributions
les-0.634
ork-0.624
lights-0.616
pron-0.582
adoes-0.546
Token until
Token ID1566
Feature activation+6.48e-02

Pos logit contributions
 least+1.956
oh+1.579
 cle+1.526
 ourselves+1.513
 very+1.487
Neg logit contributions
ork-1.617
les-1.610
lights-1.580
pron-1.443
lf-1.293
Token the
Token ID262
Feature activation+4.56e-01

Pos logit contributions
 least+0.109
oh+0.086
 very+0.084
 ourselves+0.084
 cle+0.081
Neg logit contributions
les-0.105
ork-0.105
lights-0.102
pron-0.096
adoes-0.089
Token sink
Token ID14595
Feature activation-8.11e-02

Pos logit contributions
 least+0.803
oh+0.636
 ourselves+0.626
 very+0.617
 cle+0.603
Neg logit contributions
les-0.707
ork-0.702
lights-0.685
pron-0.650
adoes-0.583
Token is
Token ID318
Feature activation-2.22e-01

Pos logit contributions
les+0.162
ork+0.162
lights+0.158
pron+0.151
adoes+0.143
Neg logit contributions
 least-0.104
oh-0.075
 very-0.074
 ourselves-0.072
 cle-0.069
Token almost
Token ID2048
Feature activation-4.97e-01

Pos logit contributions
les+0.439
ork+0.436
lights+0.424
pron+0.401
adoes+0.383
Neg logit contributions
 least-0.299
 very-0.222
oh-0.217
 ourselves-0.211
 cle-0.204
Token empty
Token ID6565
Feature activation-4.45e-02

Pos logit contributions
les+0.995
ork+0.995
lights+0.965
pron+0.912
adoes+0.877
Neg logit contributions
 least-0.662
 very-0.488
oh-0.467
 ourselves-0.456
 cle-0.442
Token.
Token ID13
Feature activation+2.23e-01

Pos logit contributions
les+0.073
ork+0.073
lights+0.071
pron+0.067
adoes+0.062
Neg logit contributions
 least-0.073
oh-0.057
 very-0.057
 ourselves-0.056
 cle-0.054
Token They
Token ID1119
Feature activation+7.95e-02

Pos logit contributions
 least+0.346
oh+0.264
 very+0.259
 ourselves+0.257
 cle+0.250
Neg logit contributions
ork-0.391
les-0.390
lights-0.381
pron-0.359
adoes-0.335
Token use
Token ID779
Feature activation-7.31e-02

Pos logit contributions
 least+0.096
oh+0.067
 very+0.066
 ourselves+0.065
 cle+0.062
Neg logit contributions
les-0.166
ork-0.165
lights-0.162
pron-0.154
adoes-0.146
Token the
Token ID262
Feature activation+7.08e-01

Pos logit contributions
 least+0.848
oh+0.651
 ourselves+0.603
 very+0.597
 cle+0.597
Neg logit contributions
ork-1.196
les-1.186
lights-1.158
pron-1.106
adoes-1.038
Token table
Token ID3084
Feature activation-3.81e-01

Pos logit contributions
 least+1.213
oh+0.955
 ourselves+0.941
 very+0.905
 cle+0.905
Neg logit contributions
ork-1.146
les-1.132
lights-1.093
pron-1.070
lf-0.968
Token and
Token ID290
Feature activation+7.43e-02

Pos logit contributions
les+0.643
ork+0.634
lights+0.621
pron+0.590
adoes+0.552
Neg logit contributions
 least-0.613
 very-0.483
oh-0.476
 ourselves-0.463
 am-0.447
Token grab
Token ID5552
Feature activation-8.24e-01

Pos logit contributions
 least+0.108
oh+0.081
 very+0.079
 ourselves+0.079
 cle+0.076
Neg logit contributions
les-0.137
ork-0.137
lights-0.134
pron-0.127
adoes-0.119
Token some
Token ID617
Feature activation-1.54e-01

Pos logit contributions
ork+1.680
les+1.664
lights+1.650
pron+1.628
lf+1.518
Neg logit contributions
 least-1.011
 very-0.771
 ourselves-0.725
oh-0.718
 so-0.672
Token papers
Token ID9473
Feature activation-6.30e-01

Pos logit contributions
les+0.239
ork+0.239
lights+0.232
pron+0.219
adoes+0.203
Neg logit contributions
 least-0.265
 very-0.209
oh-0.209
 ourselves-0.205
 cle-0.200
Token.
Token ID13
Feature activation-5.25e-02

Pos logit contributions
les+1.101
ork+1.095
lights+1.078
pron+1.032
adoes+0.970
Neg logit contributions
 least-0.970
 very-0.761
oh-0.741
 ourselves-0.735
 so-0.694
Token They
Token ID1119
Feature activation-1.12e-01

Pos logit contributions
les+0.093
ork+0.092
lights+0.090
pron+0.086
adoes+0.079
Neg logit contributions
 least-0.081
oh-0.062
 very-0.061
 ourselves-0.061
 cle-0.058
Token start
Token ID923
Feature activation-6.89e-01

Pos logit contributions
les+0.218
ork+0.217
lights+0.213
pron+0.202
adoes+0.191
Neg logit contributions
 least-0.153
oh-0.111
 very-0.110
 ourselves-0.107
 cle-0.104
Token to
Token ID284
Feature activation-3.09e-01

Pos logit contributions
les+1.340
lights+1.303
ork+1.290
pron+1.217
adoes+1.161
Neg logit contributions
 least-0.944
 very-0.750
oh-0.701
 ourselves-0.677
 so-0.656
Token fold
Token ID5591
Feature activation-4.38e-01

Pos logit contributions
les+0.578
ork+0.573
lights+0.566
pron+0.527
adoes+0.499
Neg logit contributions
 least-0.449
oh-0.336
 very-0.335
 ourselves-0.329
 am-0.316
Token scared
Token ID12008
Feature activation+1.49e-01

Pos logit contributions
 least+0.235
oh+0.137
 ourselves+0.130
 very+0.123
 cle+0.115
Neg logit contributions
les-0.668
ork-0.666
lights-0.654
pron-0.630
adoes-0.599
Token,
Token ID11
Feature activation+9.69e-01

Pos logit contributions
 least+0.240
oh+0.186
 very+0.182
 ourselves+0.181
 cle+0.176
Neg logit contributions
les-0.252
ork-0.249
lights-0.244
pron-0.229
adoes-0.214
Token but
Token ID475
Feature activation+5.97e-01

Pos logit contributions
 least+1.347
oh+1.017
 ourselves+0.965
 am+0.933
 cle+0.911
Neg logit contributions
les-1.848
ork-1.823
lights-1.792
pron-1.682
adoes-1.612
Token he
Token ID339
Feature activation+1.56e-01

Pos logit contributions
 least+1.041
oh+0.825
 ourselves+0.812
 am+0.781
 very+0.781
Neg logit contributions
les-0.920
ork-0.910
lights-0.890
pron-0.834
lf-0.771
Token was
Token ID373
Feature activation-5.83e-01

Pos logit contributions
 least+0.220
oh+0.163
 ourselves+0.160
 very+0.159
 cle+0.152
Neg logit contributions
les-0.296
ork-0.295
lights-0.287
pron-0.274
adoes-0.256
Token brave
Token ID14802
Feature activation-7.51e-01

Pos logit contributions
les+1.323
ork+1.320
lights+1.304
pron+1.230
adoes+1.178
Neg logit contributions
 least-0.646
 very-0.468
oh-0.412
 cle-0.403
 so-0.398
Token and
Token ID290
Feature activation+7.28e-02

Pos logit contributions
ork+1.349
lights+1.300
les+1.298
pron+1.264
lf+1.156
Neg logit contributions
 least-1.161
 very-0.921
oh-0.867
 so-0.854
 cle-0.847
Token wanted
Token ID2227
Feature activation+1.95e-01

Pos logit contributions
 least+0.107
oh+0.081
 very+0.079
 ourselves+0.079
 cle+0.076
Neg logit contributions
les-0.133
ork-0.132
lights-0.129
pron-0.123
adoes-0.115
Token to
Token ID284
Feature activation-1.41e-01

Pos logit contributions
 least+0.304
oh+0.234
 very+0.227
 ourselves+0.227
 cle+0.219
Neg logit contributions
les-0.341
ork-0.341
lights-0.331
pron-0.313
adoes-0.292
Token know
Token ID760
Feature activation+6.11e-02

Pos logit contributions
les+0.284
ork+0.281
lights+0.279
pron+0.265
adoes+0.248
Neg logit contributions
 least-0.186
oh-0.133
 very-0.132
 ourselves-0.130
 am-0.125
Token what
Token ID644
Feature activation+5.01e-01

Pos logit contributions
 least+0.087
oh+0.065
 very+0.064
 ourselves+0.063
 cle+0.061
Neg logit contributions
les-0.114
ork-0.113
lights-0.111
pron-0.105
adoes-0.099
Token again
Token ID757
Feature activation-3.24e-01

Pos logit contributions
les+0.201
ork+0.200
lights+0.196
pron+0.185
adoes+0.171
Neg logit contributions
 least-0.197
oh-0.153
 very-0.153
 ourselves-0.150
 cle-0.144
Token tomorrow
Token ID9439
Feature activation-6.38e-02

Pos logit contributions
les+0.555
ork+0.554
lights+0.543
pron+0.510
adoes+0.477
Neg logit contributions
 least-0.516
 very-0.397
oh-0.397
 ourselves-0.391
 am-0.371
Token,
Token ID11
Feature activation-3.41e-01

Pos logit contributions
les+0.106
ork+0.105
lights+0.103
pron+0.097
adoes+0.090
Neg logit contributions
 least-0.104
oh-0.081
 very-0.080
 ourselves-0.079
 cle-0.076
Token but
Token ID475
Feature activation-8.81e-02

Pos logit contributions
les+0.664
ork+0.662
lights+0.646
pron+0.614
adoes+0.572
Neg logit contributions
 least-0.472
 very-0.344
oh-0.340
 ourselves-0.335
 cle-0.321
Token not
Token ID407
Feature activation-1.97e-01

Pos logit contributions
les+0.187
ork+0.186
lights+0.183
pron+0.174
adoes+0.165
Neg logit contributions
 least-0.105
oh-0.073
 very-0.071
 ourselves-0.070
 cle-0.067
Token too
Token ID1165
Feature activation-2.02e+00

Pos logit contributions
les+0.381
ork+0.379
lights+0.370
pron+0.351
adoes+0.329
Neg logit contributions
 least-0.275
 very-0.202
oh-0.201
 ourselves-0.197
 cle-0.188
Token much
Token ID881
Feature activation-7.16e-01

Pos logit contributions
les+4.147
ork+4.086
lights+4.070
pron+3.820
adoes+3.693
Neg logit contributions
 least-2.700
 very-2.315
 so-2.023
oh-1.975
 ourselves-1.921
Token."
Token ID526
Feature activation+8.89e-01

Pos logit contributions
ork+1.254
les+1.234
lights+1.210
pron+1.159
adoes+1.068
Neg logit contributions
 least-1.131
 very-0.875
 ourselves-0.853
oh-0.830
 cle-0.820
Token
Token ID198
Feature activation+1.04e+00

Pos logit contributions
 least+1.464
oh+1.132
 ourselves+1.106
 very+1.079
 cle+1.078
Neg logit contributions
ork-1.504
les-1.496
lights-1.463
pron-1.324
adoes-1.272
Token
Token ID198
Feature activation+1.56e+00

Pos logit contributions
 least+1.893
oh+1.485
 cle+1.444
 ourselves+1.441
 oh+1.414
Neg logit contributions
ork-1.594
les-1.586
lights-1.515
pron-1.386
lf-1.300
TokenB
Token ID33
Feature activation+6.57e-01

Pos logit contributions
 least+3.056
oh+2.341
 cle+2.316
 ourselves+2.306
 very+2.303
Neg logit contributions
ork-2.236
les-2.198
lights-2.135
pron-1.957
lf-1.863
Token very
Token ID845
Feature activation+1.26e+00

Pos logit contributions
 least+1.742
oh+1.338
 ourselves+1.326
 am+1.188
 cle+1.108
Neg logit contributions
ork-2.328
les-2.292
lights-2.244
pron-2.185
adoes-2.089
Token excited
Token ID6568
Feature activation-4.05e-01

Pos logit contributions
 least+2.048
oh+1.503
 ourselves+1.489
 cle+1.346
 am+1.317
Neg logit contributions
ork-2.188
les-2.145
lights-2.048
pron-2.001
adoes-1.847
Token and
Token ID290
Feature activation+3.87e-01

Pos logit contributions
ork+0.709
les+0.705
lights+0.692
pron+0.660
adoes+0.610
Neg logit contributions
 least-0.631
 very-0.493
oh-0.484
 ourselves-0.472
 so-0.455
Token quickly
Token ID2952
Feature activation-4.54e-01

Pos logit contributions
 least+0.626
oh+0.481
 ourselves+0.475
 very+0.461
 cle+0.453
Neg logit contributions
les-0.656
ork-0.649
lights-0.630
pron-0.603
adoes-0.557
Token picked
Token ID6497
Feature activation-3.71e-01

Pos logit contributions
les+0.822
ork+0.816
lights+0.815
pron+0.764
adoes+0.719
Neg logit contributions
 least-0.670
 very-0.520
oh-0.512
 ourselves-0.485
 cle-0.484
Token up
Token ID510
Feature activation-2.03e+00

Pos logit contributions
lights+0.531
ork+0.528
les+0.524
pron+0.490
lf+0.442
Neg logit contributions
 least-0.693
 very-0.565
oh-0.561
 ourselves-0.548
 cle-0.533
Token the
Token ID262
Feature activation+7.77e-02

Pos logit contributions
lights+5.028
ork+4.983
les+4.917
pron+4.887
lf+4.541
Neg logit contributions
 least-1.756
 very-1.428
 so-1.240
 ourselves-1.125
 oh-1.010
Token cord
Token ID15050
Feature activation+4.06e-01

Pos logit contributions
 least+0.143
oh+0.115
 very+0.114
 ourselves+0.113
 cle+0.110
Neg logit contributions
les-0.114
ork-0.113
lights-0.110
pron-0.103
adoes-0.094
Token.
Token ID13
Feature activation-6.15e-01

Pos logit contributions
 least+0.700
oh+0.552
 ourselves+0.540
 very+0.530
 cle+0.527
Neg logit contributions
les-0.638
ork-0.638
lights-0.621
pron-0.578
adoes-0.537
Token She
Token ID1375
Feature activation-5.01e-01

Pos logit contributions
les+1.249
lights+1.224
ork+1.224
pron+1.160
adoes+1.090
Neg logit contributions
 least-0.784
 very-0.579
oh-0.577
 ourselves-0.542
 am-0.528
Token took
Token ID1718
Feature activation-6.27e-01

Pos logit contributions
ork+0.888
les+0.888
lights+0.881
pron+0.817
adoes+0.776
Neg logit contributions
 least-0.763
 very-0.594
oh-0.586
 cle-0.550
 ourselves-0.540
Token ball
Token ID2613
Feature activation+2.37e-01

Pos logit contributions
lights+2.530
les+2.459
ork+2.385
pron+2.294
lf+2.148
Neg logit contributions
 least-2.604
 very-2.235
 am-2.105
oh-2.062
 ourselves-2.054
Token all
Token ID477
Feature activation-1.40e-01

Pos logit contributions
 least+0.352
oh+0.267
 ourselves+0.259
 very+0.257
 cle+0.250
Neg logit contributions
les-0.428
ork-0.425
lights-0.416
pron-0.391
adoes-0.368
Token day
Token ID1110
Feature activation-1.07e+00

Pos logit contributions
ork+0.240
les+0.239
lights+0.232
pron+0.221
adoes+0.205
Neg logit contributions
 least-0.225
oh-0.174
 very-0.174
 ourselves-0.170
 am-0.163
Token.
Token ID13
Feature activation-1.89e-01

Pos logit contributions
les+1.983
lights+1.939
ork+1.917
pron+1.841
adoes+1.769
Neg logit contributions
 least-1.532
 very-1.300
 ourselves-1.194
oh-1.177
 so-1.164
Token The
Token ID383
Feature activation-1.01e+00

Pos logit contributions
les+0.355
ork+0.352
lights+0.345
pron+0.331
adoes+0.308
Neg logit contributions
 least-0.271
oh-0.201
 very-0.200
 ourselves-0.197
 am-0.187
Token rude
Token ID22066
Feature activation-2.95e+00

Pos logit contributions
les+1.770
ork+1.729
lights+1.719
pron+1.551
adoes+1.505
Neg logit contributions
 least-1.615
 very-1.328
 cle-1.231
oh-1.195
 am-1.178
Token big
Token ID1263
Feature activation-1.16e+00

Pos logit contributions
lights+6.444
les+6.163
rily+5.854
pron+5.811
ork+5.756
Neg logit contributions
 very-3.442
 least-3.345
 am-2.880
 are-2.814
oh-2.681
Token dog
Token ID3290
Feature activation+5.01e-01

Pos logit contributions
les+2.018
ork+1.983
lights+1.981
pron+1.851
adoes+1.676
Neg logit contributions
 least-1.838
 very-1.541
oh-1.403
 am-1.388
 cle-1.378
Token saw
Token ID2497
Feature activation+3.39e-01

Pos logit contributions
 least+0.767
oh+0.585
 ourselves+0.585
 very+0.561
 cle+0.555
Neg logit contributions
les-0.886
ork-0.874
lights-0.839
pron-0.797
adoes-0.747
Token them
Token ID606
Feature activation+5.18e-01

Pos logit contributions
 least+0.479
oh+0.355
 ourselves+0.342
 very+0.342
 cle+0.331
Neg logit contributions
les-0.653
ork-0.644
lights-0.629
pron-0.594
adoes-0.564
Token and
Token ID290
Feature activation+3.77e-02

Pos logit contributions
 least+0.865
oh+0.696
 ourselves+0.666
 very+0.652
 cle+0.650
Neg logit contributions
les-0.834
ork-0.817
lights-0.796
pron-0.749
adoes-0.696
Tokenmy
Token ID1820
Feature activation-1.12e-01

Pos logit contributions
 least+0.197
oh+0.153
 very+0.150
 ourselves+0.149
 cle+0.145
Neg logit contributions
les-0.205
ork-0.203
lights-0.198
pron-0.187
adoes-0.174
Token.
Token ID13
Feature activation-4.20e-01

Pos logit contributions
les+0.186
ork+0.186
lights+0.182
pron+0.171
adoes+0.159
Neg logit contributions
 least-0.183
oh-0.142
 very-0.142
 ourselves-0.138
 cle-0.133
Token Tim
Token ID5045
Feature activation+4.19e-01

Pos logit contributions
les+0.773
ork+0.765
lights+0.737
pron+0.707
adoes+0.652
Neg logit contributions
 least-0.625
oh-0.479
 very-0.478
 ourselves-0.465
 am-0.447
Tokenmy
Token ID1820
Feature activation+5.87e-01

Pos logit contributions
 least+0.698
oh+0.545
 ourselves+0.541
 very+0.529
 cle+0.527
Neg logit contributions
les-0.688
ork-0.672
lights-0.654
pron-0.624
adoes-0.572
Token loved
Token ID6151
Feature activation-2.11e-01

Pos logit contributions
 least+0.912
 ourselves+0.688
oh+0.680
 cle+0.666
 very+0.643
Neg logit contributions
les-1.033
ork-1.029
lights-0.984
pron-0.956
lf-0.892
Token wearing
Token ID5762
Feature activation-2.18e+00

Pos logit contributions
les+0.384
ork+0.382
lights+0.375
pron+0.351
adoes+0.330
Neg logit contributions
 least-0.313
 very-0.238
oh-0.234
 ourselves-0.230
 cle-0.224
Token his
Token ID465
Feature activation-3.93e-01

Pos logit contributions
lights+4.566
ork+4.546
les+4.505
anne+4.135
pron+4.054
Neg logit contributions
 least-2.509
 very-2.374
 so-2.031
 cle-1.910
 oh-1.908
Token favorite
Token ID4004
Feature activation-9.07e-01

Pos logit contributions
les+0.626
ork+0.621
lights+0.609
pron+0.555
adoes+0.531
Neg logit contributions
 least-0.666
 very-0.539
oh-0.518
 ourselves-0.511
 cle-0.508
Token red
Token ID2266
Feature activation-7.46e-01

Pos logit contributions
ork+1.402
les+1.393
lights+1.392
pron+1.234
adoes+1.212
Neg logit contributions
 least-1.523
 very-1.297
oh-1.201
 cle-1.186
 am-1.180
Token socks
Token ID24359
Feature activation+4.20e-01

Pos logit contributions
les+1.154
lights+1.145
ork+1.138
pron+1.035
adoes+0.983
Neg logit contributions
 least-1.266
 very-1.037
oh-1.003
 ourselves-0.985
 am-0.969
Token every
Token ID790
Feature activation-5.68e-01

Pos logit contributions
 least+0.718
oh+0.571
 ourselves+0.558
 cle+0.544
 very+0.544
Neg logit contributions
les-0.666
ork-0.657
lights-0.644
pron-0.608
lf-0.553
Token ocean
Token ID9151
Feature activation-2.78e-01

Pos logit contributions
 least+2.400
oh+1.920
 ourselves+1.864
 am+1.843
 cle+1.836
Neg logit contributions
ork-1.917
les-1.890
lights-1.757
pron-1.754
lf-1.573
Token and
Token ID290
Feature activation-2.50e-01

Pos logit contributions
ork+0.459
les+0.457
lights+0.445
pron+0.421
adoes+0.393
Neg logit contributions
 least-0.458
 very-0.362
oh-0.357
 ourselves-0.350
 cle-0.337
Token look
Token ID804
Feature activation-2.83e-01

Pos logit contributions
les+0.480
ork+0.480
lights+0.469
pron+0.442
adoes+0.417
Neg logit contributions
 least-0.350
 very-0.260
oh-0.255
 ourselves-0.249
 am-0.242
Token for
Token ID329
Feature activation-8.48e-01

Pos logit contributions
ork+0.518
les+0.512
lights+0.504
pron+0.481
adoes+0.447
Neg logit contributions
 least-0.417
 very-0.322
oh-0.311
 ourselves-0.306
 cle-0.294
Token seas
Token ID21547
Feature activation-2.52e+00

Pos logit contributions
ork+1.517
les+1.476
lights+1.454
pron+1.414
adoes+1.291
Neg logit contributions
 least-1.299
 very-1.042
 ourselves-0.981
 cle-0.976
 so-0.951
Tokenhell
Token ID12758
Feature activation-1.99e+00

Pos logit contributions
lights+2.282
pron+2.208
adoes+2.154
ork+2.039
les+1.958
Neg logit contributions
 least-5.631
 very-5.526
oh-5.200
 ourselves-5.132
 so-5.097
Tokens
Token ID82
Feature activation+2.67e-02

Pos logit contributions
lights+3.820
pron+3.813
ork+3.718
les+3.624
lf+3.547
Neg logit contributions
 least-2.537
 very-2.224
oh-2.086
 ourselves-2.015
 so-1.859
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
 least+0.044
oh+0.034
 very+0.034
 ourselves+0.034
 cle+0.033
Neg logit contributions
les-0.044
ork-0.044
lights-0.043
pron-0.040
adoes-0.037
Token One
Token ID1881
Feature activation+2.63e-01

Pos logit contributions
les+1.579
lights+1.547
ork+1.537
pron+1.406
adoes+1.334
Neg logit contributions
 least-1.925
 very-1.582
oh-1.535
 am-1.507
 ourselves-1.482
Token day
Token ID1110
Feature activation+2.28e-01

Pos logit contributions
 least+0.439
oh+0.343
 ourselves+0.338
 very+0.334
 cle+0.326
Neg logit contributions
les-0.430
ork-0.426
lights-0.415
pron-0.393
adoes-0.359
Token,
Token ID11
Feature activation-2.07e-01

Pos logit contributions
 least+0.321
oh+0.235
 ourselves+0.230
 very+0.228
 cle+0.222
Neg logit contributions
les-0.436
ork-0.435
lights-0.422
pron-0.403
adoes-0.377