TOP ACTIVATIONS
MAX = 3.349

said , " You are both good children . I love
" You 're both very lovely children . I
said , " You are both winners . You are both
" Wow , you are very good helpers ," the
" You are also very kind and careful ,"
S ue says , " Yes , I want to
on the tall rail near the pond . Then
z ig zag path on the grass . She thinks it
spark ly â just like Mam ma liked
Tom say , " You are welcome !" Then
" Wow , they are so pretty ," Anna says
and says , " You are the best friend I ever
and say , " You 're welcome , Mom my .
was a little creature on the rail . It was a
and says , " You 're welcome , my sweet ies
" Thank you , you are very nice ," Lily says
two friends , Tony and Lucy , were walking in a
happy here . We have each other and we will make
saw some soft parts on the handle . It felt nice
. " You are very kind and generous ,"

MIN ACTIVATIONS
MIN = -3.731

was all better . The charming soldier opened the window ,
The noisy brother and the quiet brother celebrated with a big
a strong and brave bear on it . Benny wanted to
to the bowl . The quiet brother used the shield to
the day playing with the yellow shield and having lots of
cool shirt with a dinosaur on it . Tim
river separates us !" The quiet bunny was sad . They
with pictures of cute animals on it . He loved to
ran away , and the blue bird saved its eggs .
. When the white all igator got out of
it ! The white all igator was scared so
with his favorite cartoon character on the cover . He asked
to go home . The polite boy was sad . He
the dragon again , but this time it was awake !
pick you up !" The quiet bunny was happy . The
big red nose and he wore funny clothes . He liked
calendar with pictures of animals on it . Every morning ,
and had fun . The anxious feeling was gone . The
borrow the cane . The poor man was kind and said
play with her dolls and wear pretty dresses . One day

INTERVAL 1.579 - 3.349
CONTAINS 0.565%

" No , it 's mine !" Tom said . "
Oh no !" Lily and Ben say . They
a huge lion appeared with an angry roar . It was
S andy had an amazing time at the park
, Lily found a big toy fish that was red and

INTERVAL -0.191 - 1.579
CONTAINS 62.591%

a special den in the mountain . The copper
each other . He was so happy ! Next
scared and wants to run away , but he is too
. " They are expensive , and you should take care
boy named John . He was only three years old and

INTERVAL -1.961 - -0.191
CONTAINS 36.723%

. " I just wanted to play ." Mom
toys when she cleaned up . <|endoftext|> Once upon a time
running , he saw a big mill nearby . The mill
try to score . One day , they play
from running any further . He had gotten so scared he

INTERVAL -3.731 - -1.961
CONTAINS 0.120%

it up and started to play with it . Tim my
Look , I am a graceful dancer !" Lily said .
explore the ground and play with his friends . One day
The boy was small and wore a modest white shirt .
smiles . The gentle van drove away , to
Token said
Token ID531
Feature activation+6.68e-01

Pos logit contributions
 least+1.088
oh+0.814
 ourselves+0.809
 very+0.768
 cle+0.754
Neg logit contributions
les-1.475
ork-1.421
lights-1.402
pron-1.346
lf-1.240
Token,
Token ID11
Feature activation-4.69e-01

Pos logit contributions
 least+1.014
oh+0.746
 ourselves+0.721
 cle+0.707
 very+0.706
Neg logit contributions
les-1.235
ork-1.214
lights-1.184
pron-1.108
adoes-1.042
Token "
Token ID366
Feature activation+5.03e-01

Pos logit contributions
les+0.856
ork+0.849
lights+0.829
pron+0.770
adoes+0.735
Neg logit contributions
 least-0.690
 very-0.540
oh-0.532
 ourselves-0.508
 cle-0.497
TokenYou
Token ID1639
Feature activation+1.68e+00

Pos logit contributions
 least+0.527
oh+0.334
 very+0.328
 ourselves+0.321
 cle+0.308
Neg logit contributions
les-1.140
ork-1.121
lights-1.107
pron-1.068
adoes-1.004
Token are
Token ID389
Feature activation+2.56e+00

Pos logit contributions
 least+1.404
oh+0.966
 oh+0.882
 ourselves+0.870
 cle+0.802
Neg logit contributions
les-4.035
ork-3.830
lights-3.686
pron-3.672
adoes-3.454
Token both
Token ID1111
Feature activation+3.35e+00

Pos logit contributions
 least+2.896
oh+2.141
 oh+1.822
 cle+1.740
'm+1.729
Neg logit contributions
les-4.983
pron-4.968
ork-4.929
lights-4.818
lf-4.597
Token good
Token ID922
Feature activation+9.98e-01

Pos logit contributions
 least+5.362
oh+4.446
 Wait+3.840
Pro+3.807
 ourselves+3.765
Neg logit contributions
pron-5.202
les-5.036
lights-4.742
ork-4.736
adoes-4.467
Token children
Token ID1751
Feature activation+6.46e-01

Pos logit contributions
 least+1.550
oh+1.238
 ourselves+1.160
 very+1.155
 cle+1.148
Neg logit contributions
les-1.784
ork-1.672
lights-1.669
pron-1.544
adoes-1.483
Token.
Token ID13
Feature activation+3.82e-01

Pos logit contributions
 least+1.127
oh+0.895
 ourselves+0.864
 cle+0.858
 very+0.846
Neg logit contributions
les-1.000
ork-0.974
lights-0.953
pron-0.897
adoes-0.822
Token I
Token ID314
Feature activation+1.45e+00

Pos logit contributions
 least+0.423
oh+0.283
 very+0.281
 ourselves+0.271
 cle+0.263
Neg logit contributions
les-0.827
ork-0.823
lights-0.808
pron-0.772
adoes-0.737
Token love
Token ID1842
Feature activation+5.86e-01

Pos logit contributions
 least+1.524
 ourselves+1.119
oh+1.064
 oh+1.060
 cle+1.006
Neg logit contributions
les-3.153
ork-3.124
pron-2.917
lights-2.879
le-2.617
Token
Token ID198
Feature activation+6.54e-01

Pos logit contributions
les+0.344
ork+0.342
lights+0.333
pron+0.319
adoes+0.299
Neg logit contributions
 least-0.239
 very-0.174
oh-0.174
 ourselves-0.170
 cle-0.163
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
 least+1.069
oh+0.817
 cle+0.791
 ourselves+0.786
 very+0.782
Neg logit contributions
ork-1.110
les-1.105
lights-1.066
pron-1.000
lf-0.932
Token"
Token ID1
Feature activation-1.92e-02

Pos logit contributions
 least+2.043
 cle+1.606
 very+1.585
oh+1.581
 ourselves+1.567
Neg logit contributions
les-1.334
ork-1.334
lights-1.264
pron-1.191
lf-1.099
TokenYou
Token ID1639
Feature activation+9.26e-01

Pos logit contributions
les+0.042
ork+0.041
lights+0.041
pron+0.039
adoes+0.037
Neg logit contributions
 least-0.022
oh-0.015
 very-0.015
 ourselves-0.015
 cle-0.014
Token're
Token ID821
Feature activation+2.06e+00

Pos logit contributions
 least+0.759
oh+0.476
 ourselves+0.446
 oh+0.397
 very+0.393
Neg logit contributions
les-2.251
ork-2.199
lights-2.136
pron-2.096
adoes-1.989
Token both
Token ID1111
Feature activation+3.14e+00

Pos logit contributions
 least+2.002
oh+1.456
 am+1.186
 oh+1.183
 cle+1.177
Neg logit contributions
pron-4.328
ork-4.302
les-4.221
lights-4.184
lf-4.015
Token very
Token ID845
Feature activation+1.03e+00

Pos logit contributions
 least+4.628
oh+3.962
 ourselves+3.425
 am+3.226
 Wait+3.170
Neg logit contributions
pron-5.212
les-4.969
ork-4.870
lights-4.806
adoes-4.524
Token lovely
Token ID14081
Feature activation+6.26e-01

Pos logit contributions
 least+1.643
oh+1.201
 ourselves+1.181
 cle+1.147
 very+1.129
Neg logit contributions
les-1.772
ork-1.745
lights-1.692
pron-1.637
lf-1.496
Token children
Token ID1751
Feature activation+1.13e+00

Pos logit contributions
 least+0.960
oh+0.735
 ourselves+0.696
 very+0.695
 cle+0.690
Neg logit contributions
les-1.129
ork-1.092
lights-1.088
pron-1.005
adoes-0.962
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
 least+1.853
oh+1.437
 cle+1.390
 ourselves+1.367
 oh+1.340
Neg logit contributions
les-1.888
ork-1.796
lights-1.781
pron-1.671
lf-1.545
Token I
Token ID314
Feature activation+1.22e+00

Pos logit contributions
 least+0.248
oh+0.164
 very+0.161
 ourselves+0.156
 cle+0.150
Neg logit contributions
les-0.507
ork-0.505
lights-0.497
pron-0.475
adoes-0.453
Token said
Token ID531
Feature activation+4.68e-01

Pos logit contributions
 least+0.795
 ourselves+0.589
oh+0.589
 very+0.554
 cle+0.540
Neg logit contributions
les-1.192
ork-1.168
lights-1.140
pron-1.097
lf-1.021
Token,
Token ID11
Feature activation-5.34e-01

Pos logit contributions
 least+0.684
oh+0.503
 ourselves+0.486
 very+0.481
 cle+0.474
Neg logit contributions
les-0.876
ork-0.870
lights-0.847
pron-0.801
adoes-0.753
Token "
Token ID366
Feature activation+3.21e-02

Pos logit contributions
les+0.984
ork+0.968
lights+0.952
pron+0.877
adoes+0.844
Neg logit contributions
 least-0.778
 very-0.608
oh-0.594
 ourselves-0.570
 cle-0.559
TokenYou
Token ID1639
Feature activation+6.84e-01

Pos logit contributions
 least+0.034
oh+0.023
 very+0.022
 ourselves+0.022
 cle+0.020
Neg logit contributions
les-0.072
ork-0.072
lights-0.071
pron-0.068
adoes-0.064
Token are
Token ID389
Feature activation+2.39e+00

Pos logit contributions
 least+0.573
oh+0.349
 ourselves+0.325
 very+0.304
 cle+0.292
Neg logit contributions
les-1.665
ork-1.642
lights-1.599
pron-1.561
adoes-1.476
Token both
Token ID1111
Feature activation+3.11e+00

Pos logit contributions
 least+2.900
oh+2.258
'm+1.955
 oh+1.889
 am+1.888
Neg logit contributions
pron-4.494
les-4.391
ork-4.363
lights-4.315
lf-4.201
Token winners
Token ID14591
Feature activation+8.90e-01

Pos logit contributions
 least+4.948
oh+4.045
 ourselves+3.475
 Wait+3.412
Pro+3.299
Neg logit contributions
pron-5.119
les-4.811
ork-4.697
lights-4.628
adoes-4.340
Token.
Token ID13
Feature activation+3.18e-01

Pos logit contributions
 least+1.416
oh+1.091
 ourselves+1.051
 cle+1.048
 very+1.033
Neg logit contributions
les-1.529
ork-1.470
lights-1.466
pron-1.366
adoes-1.275
Token You
Token ID921
Feature activation+9.10e-01

Pos logit contributions
 least+0.332
oh+0.216
 very+0.213
 ourselves+0.206
 cle+0.198
Neg logit contributions
les-0.712
ork-0.710
lights-0.697
pron-0.667
adoes-0.637
Token are
Token ID389
Feature activation+1.39e+00

Pos logit contributions
 least+0.901
oh+0.621
 ourselves+0.592
 oh+0.560
 very+0.546
Neg logit contributions
les-2.060
ork-2.014
lights-1.941
pron-1.901
adoes-1.791
Token both
Token ID1111
Feature activation+2.64e+00

Pos logit contributions
 least+1.681
oh+1.296
 am+1.139
 ourselves+1.126
'm+1.125
Neg logit contributions
les-2.682
ork-2.656
pron-2.631
lights-2.612
lf-2.453
Token
Token ID198
Feature activation-1.64e-01

Pos logit contributions
 least+0.076
oh+0.057
 very+0.056
 ourselves+0.056
 cle+0.054
Neg logit contributions
les-0.093
ork-0.093
lights-0.091
pron-0.086
adoes-0.081
Token"
Token ID1
Feature activation+3.02e-02

Pos logit contributions
les+0.272
ork+0.270
lights+0.266
pron+0.249
adoes+0.234
Neg logit contributions
 least-0.270
oh-0.213
 very-0.209
 ourselves-0.208
 am-0.199
TokenWow
Token ID22017
Feature activation+1.17e+00

Pos logit contributions
 least+0.039
oh+0.028
 very+0.027
 ourselves+0.027
 cle+0.026
Neg logit contributions
les-0.061
ork-0.061
lights-0.060
pron-0.057
adoes-0.054
Token,
Token ID11
Feature activation+6.50e-01

Pos logit contributions
 least+1.854
 cle+1.358
oh+1.349
 very+1.316
 ourselves+1.308
Neg logit contributions
les-2.090
ork-2.003
lights-1.953
pron-1.808
lf-1.723
Token you
Token ID345
Feature activation+1.19e+00

Pos logit contributions
 least+0.931
oh+0.706
 ourselves+0.676
 very+0.675
 cle+0.654
Neg logit contributions
ork-1.193
les-1.192
lights-1.159
pron-1.090
lf-1.031
Token are
Token ID389
Feature activation+3.00e+00

Pos logit contributions
 least+1.306
oh+0.906
 ourselves+0.906
 oh+0.851
 very+0.842
Neg logit contributions
les-2.596
ork-2.517
lights-2.396
pron-2.381
lf-2.226
Token very
Token ID845
Feature activation+1.37e+00

Pos logit contributions
 least+3.199
oh+2.202
'm+1.823
 oh+1.767
 am+1.763
Neg logit contributions
les-6.266
ork-6.207
lf-6.182
pron-6.053
lights-6.037
Token good
Token ID922
Feature activation+8.59e-01

Pos logit contributions
 least+2.163
oh+1.530
 ourselves+1.498
 oh+1.449
 cle+1.423
Neg logit contributions
les-2.407
ork-2.405
lights-2.291
pron-2.240
lf-2.131
Token helpers
Token ID49385
Feature activation+9.58e-01

Pos logit contributions
 least+1.418
oh+1.113
 ourselves+1.062
 very+1.061
 cle+1.054
Neg logit contributions
les-1.453
ork-1.395
lights-1.365
pron-1.267
adoes-1.214
Token,"
Token ID553
Feature activation+1.52e+00

Pos logit contributions
 least+1.453
oh+1.078
 cle+1.069
 ourselves+1.051
 very+1.048
Neg logit contributions
les-1.716
ork-1.669
lights-1.634
pron-1.532
lf-1.467
Token the
Token ID262
Feature activation-9.22e-02

Pos logit contributions
 least+2.479
oh+1.954
 ourselves+1.837
 cle+1.780
 am+1.777
Neg logit contributions
les-2.591
lights-2.537
ork-2.448
pron-2.256
lf-2.186
Token
Token ID198
Feature activation+1.92e-02

Pos logit contributions
les+0.958
ork+0.938
lights+0.921
pron+0.885
adoes+0.827
Neg logit contributions
 least-0.659
 very-0.509
oh-0.492
 ourselves-0.475
 am-0.463
Token
Token ID198
Feature activation+5.76e-01

Pos logit contributions
 least+0.028
oh+0.021
 very+0.021
 ourselves+0.021
 cle+0.020
Neg logit contributions
les-0.035
ork-0.035
lights-0.034
pron-0.032
adoes-0.030
Token"
Token ID1
Feature activation+9.10e-01

Pos logit contributions
 least+1.018
oh+0.778
 very+0.773
 cle+0.768
 ourselves+0.754
Neg logit contributions
ork-0.912
les-0.904
lights-0.886
pron-0.822
lf-0.766
TokenYou
Token ID1639
Feature activation-8.39e-02

Pos logit contributions
 least+1.193
oh+0.851
 very+0.825
 ourselves+0.811
 cle+0.774
Neg logit contributions
les-1.794
ork-1.784
lights-1.765
pron-1.723
lf-1.585
Token are
Token ID389
Feature activation+2.22e+00

Pos logit contributions
les+0.200
ork+0.200
lights+0.197
pron+0.189
adoes+0.179
Neg logit contributions
 least-0.079
oh-0.047
 very-0.046
 ourselves-0.045
 cle-0.042
Token also
Token ID635
Feature activation+2.97e+00

Pos logit contributions
 least+2.582
oh+1.993
 ourselves+1.681
'm+1.661
 am+1.645
Neg logit contributions
ork-4.333
les-4.255
pron-4.217
lights-4.132
lf-4.119
Token very
Token ID845
Feature activation+1.00e+00

Pos logit contributions
 least+4.259
oh+3.424
 ourselves+3.089
 am+2.788
 Wait+2.749
Neg logit contributions
pron-5.290
ork-5.076
les-4.970
lf-4.749
lights-4.743
Token kind
Token ID1611
Feature activation+3.29e-01

Pos logit contributions
 least+1.478
oh+1.048
 ourselves+1.032
 very+0.975
 cle+0.974
Neg logit contributions
les-1.860
ork-1.855
lights-1.772
pron-1.731
lf-1.622
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
 least+0.558
oh+0.437
 very+0.424
 ourselves+0.422
 cle+0.414
Neg logit contributions
les-0.536
ork-0.528
lights-0.518
pron-0.480
adoes-0.453
Token careful
Token ID8161
Feature activation+2.19e-01

Pos logit contributions
 least+2.647
oh+1.997
 ourselves+1.878
 am+1.799
 very+1.774
Neg logit contributions
ork-2.643
les-2.622
pron-2.621
lights-2.566
lf-2.373
Token,"
Token ID553
Feature activation+1.23e+00

Pos logit contributions
 least+0.336
oh+0.254
 very+0.248
 ourselves+0.246
 cle+0.241
Neg logit contributions
les-0.389
ork-0.387
lights-0.379
pron-0.358
adoes-0.335
Token
Token ID198
Feature activation+2.98e-01

Pos logit contributions
les+0.407
ork+0.400
lights+0.395
pron+0.376
adoes+0.354
Neg logit contributions
 least-0.305
oh-0.233
 very-0.231
 ourselves-0.223
 am-0.219
TokenS
Token ID50
Feature activation-4.60e-01

Pos logit contributions
 least+0.595
oh+0.478
 ourselves+0.473
 very+0.471
 cle+0.462
Neg logit contributions
ork-0.395
les-0.394
lights-0.381
pron-0.352
adoes-0.321
Tokenue
Token ID518
Feature activation-3.30e-01

Pos logit contributions
les+0.450
ork+0.439
lights+0.437
pron+0.394
adoes+0.345
Neg logit contributions
 least-1.062
oh-0.911
 very-0.884
 ourselves-0.883
 am-0.870
Token says
Token ID1139
Feature activation+7.58e-01

Pos logit contributions
ork+0.751
lights+0.741
les+0.740
pron+0.711
adoes+0.671
Neg logit contributions
 least-0.339
oh-0.223
 very-0.217
 ourselves-0.203
 cle-0.197
Token,
Token ID11
Feature activation+2.98e-01

Pos logit contributions
 least+1.181
 ourselves+0.852
oh+0.850
 cle+0.811
 very+0.807
Neg logit contributions
ork-1.375
les-1.366
lights-1.319
pron-1.252
adoes-1.161
Token "
Token ID366
Feature activation+2.97e+00

Pos logit contributions
 least+0.497
oh+0.383
 ourselves+0.378
 very+0.372
 cle+0.363
Neg logit contributions
les-0.490
ork-0.487
lights-0.475
pron-0.453
adoes-0.415
TokenYes
Token ID5297
Feature activation+3.86e-01

Pos logit contributions
 least+4.685
 cle+3.089
 ourselves+3.066
 kidding+3.002
 very+2.832
Neg logit contributions
ork-5.418
les-5.412
pron-5.183
lights-4.961
pool-4.600
Token,
Token ID11
Feature activation-2.04e-01

Pos logit contributions
 least+0.550
oh+0.404
 very+0.394
 ourselves+0.394
 cle+0.386
Neg logit contributions
les-0.725
ork-0.722
lights-0.706
pron-0.668
adoes-0.626
Token I
Token ID314
Feature activation+1.75e-01

Pos logit contributions
les+0.386
ork+0.383
lights+0.375
pron+0.355
adoes+0.333
Neg logit contributions
 least-0.290
oh-0.217
 very-0.216
 ourselves-0.211
 cle-0.203
Token want
Token ID765
Feature activation+5.62e-01

Pos logit contributions
 least+0.218
oh+0.153
 ourselves+0.149
 very+0.148
 cle+0.141
Neg logit contributions
les-0.363
ork-0.363
lights-0.353
pron-0.339
adoes-0.317
Token to
Token ID284
Feature activation-6.85e-01

Pos logit contributions
 least+0.906
oh+0.708
 ourselves+0.676
 very+0.669
 cle+0.665
Neg logit contributions
les-0.947
ork-0.939
lights-0.914
pron-0.856
adoes-0.797
Token on
Token ID319
Feature activation+3.91e-01

Pos logit contributions
 least+0.078
oh+0.061
 very+0.060
 ourselves+0.059
 cle+0.058
Neg logit contributions
les-0.078
ork-0.078
lights-0.076
pron-0.072
adoes-0.066
Token the
Token ID262
Feature activation+1.70e+00

Pos logit contributions
 least+0.527
oh+0.390
 very+0.370
 ourselves+0.366
 cle+0.363
Neg logit contributions
ork-0.762
les-0.760
lights-0.744
pron-0.699
adoes-0.665
Token tall
Token ID7331
Feature activation+1.29e-01

Pos logit contributions
 least+2.958
oh+2.375
 ourselves+2.241
'm+2.213
 am+2.213
Neg logit contributions
les-2.731
ork-2.705
lights-2.507
pron-2.394
lf-2.305
Token rail
Token ID6787
Feature activation+1.90e-01

Pos logit contributions
 least+0.213
oh+0.166
 very+0.163
 ourselves+0.162
 cle+0.157
Neg logit contributions
les-0.214
ork-0.212
lights-0.207
pron-0.195
adoes-0.181
Token near
Token ID1474
Feature activation+8.46e-01

Pos logit contributions
 least+0.318
oh+0.250
 very+0.245
 ourselves+0.244
 cle+0.238
Neg logit contributions
les-0.307
ork-0.305
lights-0.298
pron-0.279
adoes-0.259
Token the
Token ID262
Feature activation+2.93e+00

Pos logit contributions
 least+1.245
oh+0.952
 cle+0.899
 very+0.882
 ourselves+0.874
Neg logit contributions
ork-1.544
les-1.522
lights-1.494
pron-1.381
adoes-1.342
Token pond
Token ID16723
Feature activation-5.12e-02

Pos logit contributions
 least+5.428
 oh+4.669
oh+4.567
 ourselves+4.566
 cle+4.553
Neg logit contributions
ork-4.276
les-4.002
lights-3.630
pron-3.265
lf-3.178
Token.
Token ID13
Feature activation-8.67e-01

Pos logit contributions
les+0.084
ork+0.084
lights+0.082
pron+0.077
adoes+0.072
Neg logit contributions
 least-0.084
oh-0.066
 very-0.065
 ourselves-0.064
 cle-0.062
Token
Token ID198
Feature activation-1.01e+00

Pos logit contributions
les+1.658
ork+1.623
lights+1.578
pron+1.529
adoes+1.407
Neg logit contributions
 least-1.233
 very-0.954
oh-0.932
 ourselves-0.912
 so-0.886
Token
Token ID198
Feature activation-1.36e+00

Pos logit contributions
les+2.081
lights+2.035
ork+1.979
pron+1.910
adoes+1.872
Neg logit contributions
 least-1.232
 very-0.981
oh-0.963
 am-0.900
 ourselves-0.872
TokenThen
Token ID6423
Feature activation-2.63e-02

Pos logit contributions
les+2.352
lights+2.321
ork+2.196
pron+2.088
adoes+1.989
Neg logit contributions
 least-2.096
oh-1.804
 very-1.754
 ourselves-1.638
 oh-1.628
Token z
Token ID1976
Feature activation-7.71e-01

Pos logit contributions
 least+0.100
oh+0.077
 very+0.076
 ourselves+0.075
 cle+0.073
Neg logit contributions
les-0.106
ork-0.106
lights-0.103
pron-0.097
adoes-0.091
Tokenig
Token ID328
Feature activation-5.02e-01

Pos logit contributions
les+1.184
lights+1.161
ork+1.157
pron+1.072
adoes+1.011
Neg logit contributions
 least-1.369
oh-1.107
 very-1.058
 am-1.045
 ourselves-1.038
Tokenzag
Token ID50183
Feature activation-8.56e-02

Pos logit contributions
les+0.799
ork+0.794
lights+0.777
pron+0.738
adoes+0.682
Neg logit contributions
 least-0.840
oh-0.673
 very-0.670
 ourselves-0.642
 am-0.630
Token path
Token ID3108
Feature activation+1.24e-01

Pos logit contributions
les+0.143
ork+0.142
lights+0.138
pron+0.131
adoes+0.121
Neg logit contributions
 least-0.139
oh-0.108
 very-0.107
 ourselves-0.105
 cle-0.102
Token on
Token ID319
Feature activation+4.92e-02

Pos logit contributions
 least+0.193
oh+0.147
 very+0.144
 ourselves+0.144
 cle+0.139
Neg logit contributions
les-0.218
ork-0.216
lights-0.212
pron-0.200
adoes-0.186
Token the
Token ID262
Feature activation+2.89e+00

Pos logit contributions
 least+0.062
oh+0.044
 very+0.043
 ourselves+0.043
 cle+0.041
Neg logit contributions
les-0.101
ork-0.100
lights-0.098
pron-0.094
adoes-0.088
Token grass
Token ID8701
Feature activation+1.62e-01

Pos logit contributions
 least+6.019
oh+5.018
'm+4.952
 ourselves+4.941
 oh+4.922
Neg logit contributions
ork-3.744
les-3.742
lights-3.348
lf-3.062
pron-3.040
Token.
Token ID13
Feature activation-1.92e-01

Pos logit contributions
 least+0.273
oh+0.213
 very+0.210
 ourselves+0.209
 cle+0.204
Neg logit contributions
les-0.261
ork-0.260
lights-0.253
pron-0.237
adoes-0.220
Token She
Token ID1375
Feature activation+2.44e-02

Pos logit contributions
les+0.403
ork+0.397
lights+0.393
pron+0.376
adoes+0.352
Neg logit contributions
 least-0.232
oh-0.164
 very-0.163
 ourselves-0.157
 am-0.150
Token thinks
Token ID6834
Feature activation+1.72e-01

Pos logit contributions
 least+0.032
oh+0.023
 very+0.022
 ourselves+0.022
 cle+0.021
Neg logit contributions
les-0.049
ork-0.049
lights-0.048
pron-0.046
adoes-0.043
Token it
Token ID340
Feature activation+6.09e-01

Pos logit contributions
 least+0.350
oh+0.287
 very+0.281
 ourselves+0.281
 cle+0.274
Neg logit contributions
les-0.219
ork-0.218
lights-0.211
pron-0.194
adoes-0.175
Token spark
Token ID9009
Feature activation-2.73e-01

Pos logit contributions
 least+1.247
oh+0.969
 ourselves+0.939
 very+0.909
 am+0.905
Neg logit contributions
les-1.275
lights-1.270
ork-1.260
pron-1.181
lf-1.088
Tokenly
Token ID306
Feature activation+6.89e-02

Pos logit contributions
ork+0.260
les+0.250
lights+0.244
pron+0.228
adoes+0.193
Neg logit contributions
 least-0.634
oh-0.539
 very-0.535
 ourselves-0.529
 cle-0.519
Tokenâ
Token ID22940
Feature activation+6.04e-01

Pos logit contributions
 least+0.114
oh+0.089
 very+0.087
 ourselves+0.087
 cle+0.084
Neg logit contributions
les-0.113
ork-0.112
lights-0.110
pron-0.103
adoes-0.096
Token
Token ID26391
Feature activation-2.69e-01

Pos logit contributions
 least+0.982
 very+0.760
 ourselves+0.745
oh+0.739
 cle+0.731
Neg logit contributions
les-1.020
ork-1.012
lights-0.965
pron-0.930
adoes-0.853
Token
Token ID447
Feature activation+5.57e-01

Pos logit contributions
les+0.441
ork+0.432
lights+0.427
pron+0.403
adoes+0.373
Neg logit contributions
 least-0.446
oh-0.355
 very-0.348
 ourselves-0.344
 am-0.332
Token
Token ID251
Feature activation+2.88e+00

Pos logit contributions
 least+1.159
oh+0.940
 ourselves+0.937
 very+0.934
 cle+0.925
Neg logit contributions
ork-0.691
les-0.687
lights-0.651
pron-0.614
adoes-0.538
Tokenjust
Token ID3137
Feature activation+1.39e+00

Pos logit contributions
 least+5.535
 cle+4.523
 ourselves+4.320
oh+4.228
 kidding+4.159
Neg logit contributions
les-4.130
ork-4.055
lights-3.917
pron-3.514
anne-3.034
Token like
Token ID588
Feature activation+1.16e+00

Pos logit contributions
 least+2.393
oh+1.929
 ourselves+1.920
 am+1.883
 cle+1.849
Neg logit contributions
les-2.152
lights-2.026
ork-2.019
pron-1.932
lf-1.731
Token Mam
Token ID29926
Feature activation+1.38e+00

Pos logit contributions
 least+1.722
oh+1.312
 ourselves+1.286
 cle+1.206
 very+1.196
Neg logit contributions
ork-2.090
les-2.066
lights-2.033
pron-1.914
lf-1.785
Tokenma
Token ID2611
Feature activation+5.72e-01

Pos logit contributions
 least+2.296
 ourselves+1.881
 cle+1.778
 very+1.737
oh+1.715
Neg logit contributions
les-2.334
ork-2.151
lights-2.080
pron-2.059
lf-1.878
Token liked
Token ID8288
Feature activation+7.82e-01

Pos logit contributions
 least+0.848
 ourselves+0.640
oh+0.637
 very+0.606
 cle+0.600
Neg logit contributions
les-1.043
ork-1.023
lights-0.996
pron-0.950
adoes-0.876
Token Tom
Token ID4186
Feature activation+1.21e+00

Pos logit contributions
 least+1.012
oh+0.788
 very+0.765
 ourselves+0.764
 cle+0.745
Neg logit contributions
les-1.094
ork-1.072
lights-1.031
pron-0.980
adoes-0.927
Token say
Token ID910
Feature activation+8.68e-01

Pos logit contributions
 least+1.635
oh+1.248
 oh+1.223
 ourselves+1.200
 cle+1.160
Neg logit contributions
les-2.397
ork-2.202
lights-2.174
pron-2.038
adoes-1.927
Token,
Token ID11
Feature activation-4.57e-01

Pos logit contributions
 least+1.375
oh+1.004
 ourselves+0.961
 cle+0.947
 very+0.922
Neg logit contributions
les-1.558
ork-1.551
lights-1.508
pron-1.402
adoes-1.326
Token "
Token ID366
Feature activation+1.54e+00

Pos logit contributions
les+0.823
ork+0.820
lights+0.804
pron+0.745
adoes+0.709
Neg logit contributions
 least-0.684
 very-0.536
oh-0.524
 ourselves-0.503
 cle-0.496
TokenYou
Token ID1639
Feature activation+1.30e+00

Pos logit contributions
 least+1.928
 cle+1.213
 very+1.198
 ourselves+1.144
oh+1.132
Neg logit contributions
les-3.236
ork-3.181
lights-3.087
pron-3.061
lf-2.831
Token are
Token ID389
Feature activation+2.88e+00

Pos logit contributions
 least+1.073
oh+0.641
 ourselves+0.628
 oh+0.559
 cle+0.521
Neg logit contributions
les-3.170
ork-3.083
lights-2.961
pron-2.941
adoes-2.768
Token welcome
Token ID7062
Feature activation+3.52e-01

Pos logit contributions
 least+3.746
oh+2.742
'm+2.445
 oh+2.420
 am+2.376
Neg logit contributions
les-5.131
pron-5.087
lf-5.081
ork-5.053
lights-5.050
Token!"
Token ID2474
Feature activation+2.30e-01

Pos logit contributions
 least+0.515
oh+0.381
 very+0.372
 ourselves+0.371
 cle+0.363
Neg logit contributions
les-0.647
ork-0.644
lights-0.627
pron-0.594
adoes-0.558
Token
Token ID198
Feature activation+4.94e-01

Pos logit contributions
 least+0.366
oh+0.282
 very+0.274
 ourselves+0.272
 cle+0.267
Neg logit contributions
les-0.399
ork-0.396
lights-0.388
pron-0.364
adoes-0.339
Token
Token ID198
Feature activation+5.75e-01

Pos logit contributions
 least+0.790
oh+0.602
 cle+0.578
 ourselves+0.577
 very+0.577
Neg logit contributions
ork-0.854
les-0.853
lights-0.827
pron-0.773
lf-0.718
TokenThen
Token ID6423
Feature activation+8.06e-01

Pos logit contributions
 least+0.887
oh+0.655
 very+0.637
 cle+0.636
 ourselves+0.630
Neg logit contributions
les-1.036
ork-1.034
lights-1.003
pron-0.953
lf-0.885
Token
Token ID198
Feature activation+7.35e-02

Pos logit contributions
les+0.597
ork+0.586
lights+0.581
pron+0.555
adoes+0.525
Neg logit contributions
 least-0.432
 very-0.326
oh-0.325
 ourselves-0.317
 am-0.305
Token"
Token ID1
Feature activation+1.02e+00

Pos logit contributions
 least+0.128
oh+0.101
 very+0.099
 ourselves+0.099
 cle+0.096
Neg logit contributions
les-0.116
ork-0.115
lights-0.112
pron-0.105
adoes-0.097
TokenWow
Token ID22017
Feature activation+5.55e-01

Pos logit contributions
 least+1.644
oh+1.219
 very+1.208
 ourselves+1.198
 cle+1.178
Neg logit contributions
les-1.742
ork-1.729
lights-1.700
pron-1.612
adoes-1.480
Token,
Token ID11
Feature activation+1.63e-01

Pos logit contributions
 least+0.805
oh+0.591
 very+0.576
 ourselves+0.573
 cle+0.570
Neg logit contributions
les-1.036
ork-1.019
lights-0.991
pron-0.937
adoes-0.882
Token they
Token ID484
Feature activation+1.64e+00

Pos logit contributions
 least+0.229
oh+0.171
 very+0.166
 ourselves+0.165
 cle+0.159
Neg logit contributions
les-0.309
ork-0.308
lights-0.301
pron-0.286
adoes-0.268
Token are
Token ID389
Feature activation+2.87e+00

Pos logit contributions
 least+1.295
 oh+0.843
 ourselves+0.753
oh+0.751
 very+0.694
Neg logit contributions
les-4.049
ork-3.920
lights-3.657
pron-3.646
ilt-3.484
Token so
Token ID523
Feature activation+8.15e-01

Pos logit contributions
 least+4.413
oh+3.508
 oh+3.176
'm+3.131
 am+3.113
Neg logit contributions
lights-4.862
les-4.818
ork-4.754
lf-4.539
adoes-4.227
Token pretty
Token ID2495
Feature activation-2.59e-01

Pos logit contributions
 least+1.435
oh+1.081
 ourselves+1.054
 cle+1.041
 oh+1.041
Neg logit contributions
ork-1.294
les-1.292
lights-1.257
pron-1.155
lf-1.081
Token,"
Token ID553
Feature activation+7.64e-01

Pos logit contributions
ork+0.500
les+0.495
lights+0.485
pron+0.466
adoes+0.434
Neg logit contributions
 least-0.350
oh-0.260
 very-0.259
 ourselves-0.254
 cle-0.238
Token Anna
Token ID11735
Feature activation+6.61e-01

Pos logit contributions
 least+1.395
oh+1.116
 ourselves+1.065
 very+1.057
 cle+1.049
Neg logit contributions
les-1.147
lights-1.118
ork-1.116
pron-1.033
lf-0.952
Token says
Token ID1139
Feature activation+2.64e-01

Pos logit contributions
 least+0.660
oh+0.418
 ourselves+0.409
 very+0.392
 oh+0.376
Neg logit contributions
les-1.549
ork-1.503
lights-1.470
pron-1.399
adoes-1.340
Token and
Token ID290
Feature activation+3.08e-01

Pos logit contributions
les+0.010
ork+0.010
lights+0.010
pron+0.009
adoes+0.008
Neg logit contributions
 least-0.010
oh-0.007
 very-0.007
 ourselves-0.007
 cle-0.007
Token says
Token ID1139
Feature activation+7.17e-02

Pos logit contributions
 least+0.407
oh+0.303
 very+0.289
 ourselves+0.289
 cle+0.274
Neg logit contributions
les-0.610
ork-0.600
lights-0.591
pron-0.560
adoes-0.530
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
 least+0.100
oh+0.074
 very+0.072
 ourselves+0.072
 cle+0.069
Neg logit contributions
les-0.137
ork-0.137
lights-0.134
pron-0.127
adoes-0.119
Token "
Token ID366
Feature activation+7.33e-01

Pos logit contributions
ork+2.216
les+2.215
lights+2.173
adoes+1.949
pron+1.942
Neg logit contributions
 least-1.561
 very-1.268
oh-1.159
 so-1.120
 cle-1.091
TokenYou
Token ID1639
Feature activation+9.47e-01

Pos logit contributions
 least+0.668
oh+0.367
 very+0.365
 ourselves+0.353
 cle+0.349
Neg logit contributions
les-1.761
ork-1.730
lights-1.703
pron-1.679
lf-1.566
Token are
Token ID389
Feature activation+2.86e+00

Pos logit contributions
 least+0.765
oh+0.456
 ourselves+0.431
 very+0.385
 oh+0.380
Neg logit contributions
les-2.329
ork-2.270
lights-2.203
pron-2.184
adoes-2.035
Token the
Token ID262
Feature activation+1.62e+00

Pos logit contributions
 least+3.502
oh+2.648
 oh+2.427
'm+2.334
 Wait+2.329
Neg logit contributions
pron-5.327
les-5.201
lights-5.129
ork-5.112
lf-4.992
Token best
Token ID1266
Feature activation+4.02e-01

Pos logit contributions
 least+1.430
oh+1.006
 ourselves+0.908
 am+0.787
 oh+0.774
Neg logit contributions
les-3.705
ork-3.663
pron-3.658
lights-3.600
adoes-3.356
Token friend
Token ID1545
Feature activation+5.59e-01

Pos logit contributions
 least+0.641
oh+0.491
 ourselves+0.480
 very+0.475
 cle+0.466
Neg logit contributions
les-0.698
ork-0.694
lights-0.676
pron-0.645
adoes-0.595
Token I
Token ID314
Feature activation+1.23e+00

Pos logit contributions
 least+0.701
oh+0.486
 ourselves+0.479
 very+0.466
 cle+0.461
Neg logit contributions
les-1.140
ork-1.126
lights-1.101
pron-1.054
lf-0.990
Token ever
Token ID1683
Feature activation+1.18e+00

Pos logit contributions
 least+1.588
 ourselves+1.179
oh+1.128
 cle+1.111
 oh+1.086
Neg logit contributions
les-2.457
ork-2.418
pron-2.292
lights-2.222
lf-2.072
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
 least+0.169
oh+0.133
 very+0.130
 ourselves+0.129
 cle+0.126
Neg logit contributions
les-0.164
ork-0.163
lights-0.159
pron-0.149
adoes-0.138
Token say
Token ID910
Feature activation+5.56e-01

Pos logit contributions
 least+1.830
oh+1.524
 ourselves+1.400
 oh+1.394
 very+1.367
Neg logit contributions
les-1.725
ork-1.652
lights-1.645
pron-1.543
adoes-1.438
Token,
Token ID11
Feature activation-6.83e-01

Pos logit contributions
 least+0.906
oh+0.691
 ourselves+0.660
 very+0.651
 cle+0.648
Neg logit contributions
les-0.958
ork-0.954
lights-0.930
pron-0.878
adoes-0.811
Token "
Token ID366
Feature activation+1.37e+00

Pos logit contributions
les+1.283
ork+1.260
lights+1.235
pron+1.131
adoes+1.098
Neg logit contributions
 least-0.978
 very-0.781
oh-0.750
 ourselves-0.725
 cle-0.704
TokenYou
Token ID1639
Feature activation+1.17e+00

Pos logit contributions
 least+1.662
 cle+1.043
 very+1.022
oh+1.008
 ourselves+0.975
Neg logit contributions
les-2.870
ork-2.859
pron-2.810
lights-2.798
lf-2.546
Token're
Token ID821
Feature activation+2.85e+00

Pos logit contributions
 least+1.017
oh+0.638
 ourselves+0.614
 oh+0.548
 cle+0.535
Neg logit contributions
les-2.791
ork-2.718
pron-2.633
lights-2.618
adoes-2.422
Token welcome
Token ID7062
Feature activation+6.16e-01

Pos logit contributions
 least+3.117
oh+2.214
 oh+1.959
 cle+1.901
 am+1.842
Neg logit contributions
pron-5.720
les-5.449
ork-5.436
lights-5.355
lf-5.295
Token,
Token ID11
Feature activation-2.02e-01

Pos logit contributions
 least+0.928
oh+0.692
 ourselves+0.669
 cle+0.666
 very+0.664
Neg logit contributions
les-1.104
ork-1.102
lights-1.076
pron-1.016
adoes-0.952
Token Mom
Token ID11254
Feature activation+3.73e-01

Pos logit contributions
les+0.378
ork+0.377
lights+0.364
pron+0.346
adoes+0.324
Neg logit contributions
 least-0.297
 very-0.220
oh-0.220
 ourselves-0.217
 cle-0.207
Tokenmy
Token ID1820
Feature activation+3.00e-01

Pos logit contributions
 least+0.619
oh+0.481
 ourselves+0.471
 very+0.470
 cle+0.465
Neg logit contributions
les-0.610
ork-0.595
lights-0.591
pron-0.555
adoes-0.513
Token.
Token ID13
Feature activation+9.05e-01

Pos logit contributions
 least+0.475
oh+0.364
 very+0.355
 ourselves+0.355
 cle+0.348
Neg logit contributions
les-0.515
ork-0.508
lights-0.498
pron-0.471
adoes-0.438
Token was
Token ID373
Feature activation+8.28e-01

Pos logit contributions
 least+0.810
oh+0.576
 ourselves+0.570
 very+0.548
 cle+0.543
Neg logit contributions
ork-1.154
les-1.153
lights-1.121
pron-1.047
lf-1.006
Token a
Token ID257
Feature activation+1.76e-01

Pos logit contributions
 least+0.940
oh+0.665
 ourselves+0.621
 am+0.568
 cle+0.560
Neg logit contributions
ork-1.785
les-1.764
lights-1.757
pron-1.658
adoes-1.596
Token little
Token ID1310
Feature activation+9.75e-01

Pos logit contributions
 least+0.285
oh+0.221
 ourselves+0.216
 very+0.215
 cle+0.208
Neg logit contributions
ork-0.294
les-0.294
lights-0.285
pron-0.271
adoes-0.250
Token creature
Token ID7185
Feature activation+1.97e-01

Pos logit contributions
 least+1.853
oh+1.490
 ourselves+1.474
 cle+1.414
 am+1.401
Neg logit contributions
les-1.390
ork-1.364
lights-1.319
pron-1.219
adoes-1.149
Token on
Token ID319
Feature activation-1.11e+00

Pos logit contributions
 least+0.295
oh+0.222
 ourselves+0.218
 very+0.217
 cle+0.210
Neg logit contributions
les-0.354
ork-0.352
lights-0.343
pron-0.325
adoes-0.304
Token the
Token ID262
Feature activation+2.84e+00

Pos logit contributions
les+2.549
ork+2.480
lights+2.475
pron+2.414
ravings+2.175
Neg logit contributions
 least-1.185
 very-0.849
 ourselves-0.763
oh-0.733
 so-0.718
Token rail
Token ID6787
Feature activation-1.53e-01

Pos logit contributions
 least+5.062
 am+4.295
'm+4.193
 kidding+4.187
 ourselves+4.182
Neg logit contributions
ork-4.479
les-4.390
lights-3.758
lf-3.670
pron-3.590
Token.
Token ID13
Feature activation-4.25e-01

Pos logit contributions
les+0.262
ork+0.261
lights+0.255
pron+0.242
adoes+0.225
Neg logit contributions
 least-0.242
oh-0.187
 very-0.186
 ourselves-0.181
 cle-0.174
Token It
Token ID632
Feature activation+2.57e-01

Pos logit contributions
les+0.780
ork+0.757
lights+0.754
pron+0.718
adoes+0.664
Neg logit contributions
 least-0.624
 very-0.488
oh-0.487
 ourselves-0.461
 am-0.455
Token was
Token ID373
Feature activation+9.97e-01

Pos logit contributions
 least+0.354
oh+0.257
 ourselves+0.254
 very+0.249
 cle+0.241
Neg logit contributions
les-0.496
ork-0.492
lights-0.481
pron-0.456
adoes-0.430
Token a
Token ID257
Feature activation+5.42e-01

Pos logit contributions
 least+1.304
oh+0.968
 ourselves+0.922
 am+0.849
'm+0.818
Neg logit contributions
ork-1.990
les-1.952
lights-1.927
pron-1.839
adoes-1.752
Token and
Token ID290
Feature activation-1.02e-01

Pos logit contributions
 least+1.264
oh+1.032
 ourselves+0.984
 oh+0.965
 cle+0.965
Neg logit contributions
les-1.146
ork-1.079
lights-1.062
pron-0.993
adoes-0.911
Token says
Token ID1139
Feature activation+6.31e-01

Pos logit contributions
ork+0.224
les+0.224
lights+0.220
pron+0.211
adoes+0.199
Neg logit contributions
 least-0.114
oh-0.075
 very-0.075
 ourselves-0.073
 cle-0.070
Token,
Token ID11
Feature activation-2.08e-01

Pos logit contributions
 least+0.954
oh+0.696
 ourselves+0.670
 cle+0.662
 very+0.656
Neg logit contributions
les-1.159
ork-1.157
lights-1.129
pron-1.061
adoes-0.991
Token "
Token ID366
Feature activation+5.10e-01

Pos logit contributions
les+0.369
ork+0.364
lights+0.357
pron+0.333
adoes+0.315
Neg logit contributions
 least-0.319
oh-0.247
 very-0.247
 ourselves-0.238
 cle-0.231
TokenYou
Token ID1639
Feature activation+1.50e+00

Pos logit contributions
 least+0.538
oh+0.337
 very+0.332
 ourselves+0.325
 cle+0.314
Neg logit contributions
les-1.159
ork-1.138
lights-1.127
pron-1.091
adoes-1.019
Token're
Token ID821
Feature activation+2.84e+00

Pos logit contributions
 least+1.277
oh+0.843
 ourselves+0.785
 oh+0.737
 cle+0.714
Neg logit contributions
les-3.532
ork-3.425
pron-3.306
lights-3.301
adoes-3.061
Token welcome
Token ID7062
Feature activation+7.99e-01

Pos logit contributions
 least+3.073
oh+2.164
 cle+1.974
 oh+1.965
 ourselves+1.807
Neg logit contributions
pron-5.693
ork-5.548
les-5.391
lights-5.356
lf-5.239
Token,
Token ID11
Feature activation-5.99e-01

Pos logit contributions
 least+1.211
oh+0.897
 ourselves+0.868
 cle+0.866
 very+0.855
Neg logit contributions
les-1.433
ork-1.421
lights-1.390
pron-1.302
adoes-1.225
Token my
Token ID616
Feature activation+1.30e+00

Pos logit contributions
ork+1.216
les+1.188
lights+1.150
pron+1.105
adoes+1.038
Neg logit contributions
 least-0.812
 very-0.584
 ourselves-0.582
oh-0.571
 cle-0.546
Token sweet
Token ID6029
Feature activation-8.35e-01

Pos logit contributions
 least+2.052
oh+1.600
 ourselves+1.560
 cle+1.448
 oh+1.440
Neg logit contributions
les-2.272
ork-2.235
lights-2.176
pron-2.109
adoes-1.862
Tokenies
Token ID444
Feature activation+1.43e+00

Pos logit contributions
ork+1.425
lights+1.358
les+1.350
pron+1.301
adoes+1.207
Neg logit contributions
 least-1.352
 very-1.058
oh-1.042
 ourselves-1.031
 am-1.007
Token"
Token ID1
Feature activation+1.54e+00

Pos logit contributions
 least+1.541
oh+1.163
 cle+1.158
 very+1.155
 ourselves+1.145
Neg logit contributions
les-1.279
ork-1.268
lights-1.240
pron-1.133
adoes-1.065
TokenThank
Token ID10449
Feature activation+1.47e+00

Pos logit contributions
 least+2.133
 ourselves+1.440
oh+1.434
 very+1.429
 cle+1.365
Neg logit contributions
les-2.974
ork-2.908
pron-2.894
lights-2.863
lf-2.579
Token you
Token ID345
Feature activation+1.79e+00

Pos logit contributions
 least+1.580
 cle+0.937
 very+0.900
oh+0.815
'm+0.788
Neg logit contributions
les-3.357
ork-3.310
lights-3.201
pron-3.023
lf-2.957
Token,
Token ID11
Feature activation-4.48e-01

Pos logit contributions
 least+3.060
oh+2.322
 cle+2.298
 am+2.267
 ourselves+2.236
Neg logit contributions
les-2.918
ork-2.913
lights-2.895
pron-2.543
lf-2.461
Token you
Token ID345
Feature activation+1.44e+00

Pos logit contributions
ork+0.871
les+0.867
lights+0.843
pron+0.806
adoes+0.754
Neg logit contributions
 least-0.625
 very-0.460
oh-0.459
 ourselves-0.446
 cle-0.432
Token are
Token ID389
Feature activation+2.83e+00

Pos logit contributions
 least+1.397
oh+0.823
 ourselves+0.819
 cle+0.741
 oh+0.704
Neg logit contributions
les-3.389
ork-3.301
lights-3.149
pron-3.108
adoes-2.953
Token very
Token ID845
Feature activation+1.27e+00

Pos logit contributions
 least+3.753
oh+2.386
'm+2.225
 am+2.178
 Wait+2.147
Neg logit contributions
les-5.604
pron-5.528
ork-5.415
lf-5.368
lights-5.336
Token nice
Token ID3621
Feature activation+8.76e-01

Pos logit contributions
 least+2.093
 ourselves+1.460
oh+1.457
 cle+1.424
 oh+1.399
Neg logit contributions
les-2.161
ork-2.151
lights-2.072
pron-2.033
lf-1.866
Token,"
Token ID553
Feature activation+1.37e+00

Pos logit contributions
 least+1.452
oh+1.107
 cle+1.069
 ourselves+1.066
 very+1.053
Neg logit contributions
les-1.491
ork-1.442
lights-1.420
pron-1.304
adoes-1.233
Token Lily
Token ID20037
Feature activation+3.63e-01

Pos logit contributions
 least+2.186
oh+1.724
 ourselves+1.637
 very+1.575
 oh+1.566
Neg logit contributions
les-2.351
lights-2.333
ork-2.280
pron-2.055
lf-2.019
Token says
Token ID1139
Feature activation-1.94e-01

Pos logit contributions
 least+0.453
oh+0.325
 ourselves+0.312
 very+0.311
 cle+0.297
Neg logit contributions
les-0.749
ork-0.733
lights-0.719
pron-0.684
adoes-0.648
Token two
Token ID734
Feature activation+2.83e-01

Pos logit contributions
 least+0.430
oh+0.352
 ourselves+0.346
 very+0.346
 cle+0.339
Neg logit contributions
les-0.265
ork-0.263
lights-0.255
pron-0.236
adoes-0.213
Token friends
Token ID2460
Feature activation+1.40e+00

Pos logit contributions
 least+0.482
oh+0.380
 ourselves+0.371
 very+0.368
 cle+0.357
Neg logit contributions
les-0.456
ork-0.449
lights-0.439
pron-0.411
adoes-0.382
Token,
Token ID11
Feature activation+1.51e+00

Pos logit contributions
 least+2.209
oh+1.728
 ourselves+1.724
 cle+1.713
 very+1.630
Neg logit contributions
les-2.391
ork-2.295
lights-2.201
lf-2.063
pron-2.030
Token Tony
Token ID8832
Feature activation+6.66e-01

Pos logit contributions
 least+2.700
oh+2.105
 ourselves+2.059
 cle+2.011
 very+1.908
Neg logit contributions
ork-2.258
les-2.247
lights-2.175
pron-2.066
lf-1.954
Token and
Token ID290
Feature activation+1.62e+00

Pos logit contributions
 least+1.131
 ourselves+0.869
oh+0.869
 cle+0.850
 very+0.845
Neg logit contributions
les-1.097
ork-1.063
lights-1.012
pron-0.969
lf-0.905
Token Lucy
Token ID22162
Feature activation+2.82e+00

Pos logit contributions
 least+3.337
oh+2.749
 ourselves+2.634
 cle+2.563
 very+2.505
Neg logit contributions
les-2.014
ork-1.886
lights-1.860
adoes-1.715
pron-1.700
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
 least+4.827
 ourselves+4.011
 cle+3.967
 oh+3.950
oh+3.926
Neg logit contributions
les-4.579
ork-3.946
lights-3.594
le-3.528
oom-3.469
Token were
Token ID547
Feature activation+1.31e+00

Pos logit contributions
 least+2.157
oh+1.678
 ourselves+1.677
 cle+1.608
 oh+1.521
Neg logit contributions
les-2.359
ork-2.267
lights-2.193
pron-2.123
lf-2.002
Token walking
Token ID6155
Feature activation+1.06e-01

Pos logit contributions
 least+1.535
oh+1.103
 ourselves+1.072
 am+1.028
 cle+1.008
Neg logit contributions
les-2.711
ork-2.651
lights-2.591
pron-2.535
lf-2.392
Token in
Token ID287
Feature activation+3.22e-01

Pos logit contributions
 least+0.154
oh+0.115
 very+0.113
 ourselves+0.112
 cle+0.108
Neg logit contributions
les-0.197
ork-0.196
lights-0.192
pron-0.182
adoes-0.171
Token a
Token ID257
Feature activation-4.01e-01

Pos logit contributions
 least+0.414
oh+0.302
 ourselves+0.285
 very+0.284
 cle+0.276
Neg logit contributions
les-0.643
ork-0.641
lights-0.630
pron-0.599
adoes-0.564
Token happy
Token ID3772
Feature activation-3.38e-01

Pos logit contributions
ork+1.614
les+1.597
lights+1.567
pron+1.475
lf+1.406
Neg logit contributions
 least-0.912
 very-0.680
 ourselves-0.617
oh-0.589
 so-0.588
Token here
Token ID994
Feature activation+5.66e-01

Pos logit contributions
les+0.623
ork+0.622
lights+0.601
pron+0.575
adoes+0.534
Neg logit contributions
 least-0.504
 very-0.378
oh-0.372
 ourselves-0.372
 am-0.352
Token.
Token ID13
Feature activation-4.30e-01

Pos logit contributions
 least+0.956
oh+0.752
 ourselves+0.722
 cle+0.720
 very+0.719
Neg logit contributions
ork-0.898
les-0.897
lights-0.871
pron-0.822
adoes-0.753
Token We
Token ID775
Feature activation-1.13e+00

Pos logit contributions
les+0.921
ork+0.911
lights+0.890
pron+0.852
lf+0.803
Neg logit contributions
 least-0.519
oh-0.360
 very-0.350
 ourselves-0.348
 cle-0.323
Token have
Token ID423
Feature activation+5.27e-01

Pos logit contributions
lights+2.521
ork+2.461
les+2.446
pron+2.356
lf+2.235
Neg logit contributions
 least-1.396
 very-0.966
 am-0.923
oh-0.894
'm-0.872
Token each
Token ID1123
Feature activation+2.82e+00

Pos logit contributions
 least+0.811
oh+0.636
 ourselves+0.602
 very+0.598
 cle+0.588
Neg logit contributions
ork-0.915
les-0.910
lights-0.882
pron-0.842
adoes-0.782
Token other
Token ID584
Feature activation-2.63e-01

Pos logit contributions
 least+4.571
oh+3.907
 ourselves+3.662
 cle+3.569
Pro+3.477
Neg logit contributions
les-4.902
ork-4.579
pron-4.089
lights-3.929
lf-3.861
Token and
Token ID290
Feature activation+6.18e-01

Pos logit contributions
ork+0.478
les+0.478
lights+0.465
pron+0.442
adoes+0.412
Neg logit contributions
 least-0.389
 very-0.295
oh-0.289
 ourselves-0.288
 cle-0.274
Token we
Token ID356
Feature activation-1.07e+00

Pos logit contributions
 least+0.809
oh+0.607
 very+0.565
 ourselves+0.560
 cle+0.553
Neg logit contributions
ork-1.196
les-1.194
lights-1.174
pron-1.121
adoes-1.059
Token will
Token ID481
Feature activation-3.06e-01

Pos logit contributions
lights+2.381
ork+2.350
les+2.327
pron+2.233
lf+2.118
Neg logit contributions
 least-1.303
 very-0.903
 am-0.860
oh-0.836
 are-0.809
Token make
Token ID787
Feature activation+3.33e-01

Pos logit contributions
les+0.626
ork+0.615
lights+0.609
pron+0.577
adoes+0.544
Neg logit contributions
 least-0.400
 very-0.283
oh-0.280
 ourselves-0.279
 am-0.265
Token saw
Token ID2497
Feature activation+9.42e-01

Pos logit contributions
 least+0.400
oh+0.308
 ourselves+0.303
 very+0.301
 cle+0.292
Neg logit contributions
les-0.435
ork-0.430
lights-0.420
pron-0.398
adoes-0.370
Token some
Token ID617
Feature activation+6.56e-01

Pos logit contributions
 least+1.300
oh+0.988
 ourselves+0.931
 am+0.900
 very+0.883
Neg logit contributions
les-1.812
ork-1.790
lights-1.751
pron-1.632
adoes-1.590
Token soft
Token ID2705
Feature activation-7.88e-01

Pos logit contributions
 least+1.124
oh+0.893
 ourselves+0.863
 very+0.827
 am+0.811
Neg logit contributions
les-1.074
ork-1.048
lights-1.027
pron-0.956
adoes-0.895
Token parts
Token ID3354
Feature activation+3.37e-01

Pos logit contributions
ork+1.355
les+1.313
lights+1.308
pron+1.271
lf+1.140
Neg logit contributions
 least-1.216
 very-0.990
oh-0.944
 ourselves-0.934
 am-0.912
Token on
Token ID319
Feature activation-3.15e-01

Pos logit contributions
 least+0.573
oh+0.452
 ourselves+0.439
 very+0.437
 cle+0.428
Neg logit contributions
les-0.540
ork-0.532
lights-0.523
pron-0.486
adoes-0.451
Token the
Token ID262
Feature activation+2.82e+00

Pos logit contributions
les+0.657
ork+0.648
lights+0.638
pron+0.616
adoes+0.572
Neg logit contributions
 least-0.385
 very-0.271
oh-0.267
 ourselves-0.265
 cle-0.247
Token handle
Token ID5412
Feature activation+8.03e-01

Pos logit contributions
 least+5.397
 ourselves+4.680
oh+4.507
 am+4.409
 kidding+4.311
Neg logit contributions
ork-4.154
les-3.870
lights-3.456
pron-3.447
lf-3.249
Token.
Token ID13
Feature activation+6.41e-02

Pos logit contributions
 least+1.420
oh+1.111
 ourselves+1.095
 cle+1.084
 very+1.073
Neg logit contributions
les-1.245
ork-1.235
lights-1.222
pron-1.103
adoes-1.032
Token It
Token ID632
Feature activation-1.40e-01

Pos logit contributions
 least+0.080
oh+0.057
 very+0.055
 ourselves+0.055
 cle+0.052
Neg logit contributions
les-0.132
ork-0.131
lights-0.129
pron-0.123
adoes-0.116
Token felt
Token ID2936
Feature activation+1.36e+00

Pos logit contributions
les+0.278
ork+0.277
lights+0.271
pron+0.259
adoes+0.243
Neg logit contributions
 least-0.185
oh-0.135
 very-0.134
 ourselves-0.128
 cle-0.126
Token nice
Token ID3621
Feature activation+6.86e-02

Pos logit contributions
 least+2.018
oh+1.627
 ourselves+1.494
 am+1.351
 oh+1.334
Neg logit contributions
les-2.463
ork-2.444
lights-2.396
pron-2.258
adoes-2.134
Token.
Token ID13
Feature activation-7.79e-01

Pos logit contributions
 least+0.252
oh+0.197
 very+0.192
 ourselves+0.192
 cle+0.186
Neg logit contributions
les-0.247
ork-0.245
lights-0.239
pron-0.225
adoes-0.209
Token
Token ID198
Feature activation+4.68e-01

Pos logit contributions
les+1.560
ork+1.527
lights+1.479
pron+1.466
lf+1.369
Neg logit contributions
 least-1.032
 very-0.786
 ourselves-0.754
oh-0.731
 am-0.713
Token
Token ID198
Feature activation+6.20e-01

Pos logit contributions
 least+0.739
oh+0.560
 ourselves+0.541
 very+0.541
 cle+0.540
Neg logit contributions
ork-0.816
les-0.813
lights-0.788
pron-0.741
lf-0.688
Token"
Token ID1
Feature activation+4.52e-01

Pos logit contributions
 least+1.132
oh+0.878
 very+0.866
 cle+0.864
 ourselves+0.855
Neg logit contributions
ork-0.936
les-0.933
lights-0.895
pron-0.852
lf-0.781
TokenYou
Token ID1639
Feature activation+5.41e-01

Pos logit contributions
 least+0.600
oh+0.433
 very+0.424
 ourselves+0.417
 cle+0.401
Neg logit contributions
les-0.895
ork-0.885
lights-0.870
pron-0.839
adoes-0.776
Token are
Token ID389
Feature activation+2.82e+00

Pos logit contributions
 least+0.437
oh+0.254
 ourselves+0.239
 very+0.227
 cle+0.210
Neg logit contributions
les-1.340
ork-1.317
lights-1.288
pron-1.249
adoes-1.190
Token very
Token ID845
Feature activation+1.17e+00

Pos logit contributions
 least+3.293
oh+2.485
 oh+2.134
'm+2.090
 am+2.019
Neg logit contributions
les-5.381
pron-5.328
ork-5.327
lights-5.262
lf-5.186
Token kind
Token ID1611
Feature activation+4.88e-01

Pos logit contributions
 least+1.877
oh+1.332
 ourselves+1.312
 oh+1.284
 cle+1.280
Neg logit contributions
les-2.047
ork-2.026
lights-1.943
pron-1.886
lf-1.759
Token and
Token ID290
Feature activation+1.38e+00

Pos logit contributions
 least+0.812
oh+0.630
 very+0.606
 ourselves+0.603
 cle+0.597
Neg logit contributions
les-0.817
ork-0.799
lights-0.789
pron-0.721
adoes-0.691
Token generous
Token ID14431
Feature activation+1.13e+00

Pos logit contributions
 least+2.270
oh+1.746
 ourselves+1.629
 very+1.574
 am+1.568
Neg logit contributions
les-2.199
ork-2.182
pron-2.150
lights-2.147
lf-1.926
Token,"
Token ID553
Feature activation+1.03e+00

Pos logit contributions
 least+1.817
oh+1.411
 cle+1.314
 ourselves+1.300
 very+1.294
Neg logit contributions
les-1.965
lights-1.885
ork-1.876
pron-1.686
adoes-1.656
Token was
Token ID373
Feature activation-1.42e+00

Pos logit contributions
ork+1.002
les+1.000
lights+0.991
pron+0.935
adoes+0.880
Neg logit contributions
 least-0.776
 very-0.592
oh-0.589
 ourselves-0.552
 am-0.552
Token all
Token ID477
Feature activation-9.09e-02

Pos logit contributions
les+3.190
ork+3.163
lights+3.088
pron+2.830
adoes+2.828
Neg logit contributions
 least-1.648
 very-1.334
 so-1.156
 cle-1.028
oh-1.024
Token better
Token ID1365
Feature activation-1.10e-01

Pos logit contributions
les+0.157
ork+0.157
lights+0.152
pron+0.144
adoes+0.135
Neg logit contributions
 least-0.144
 very-0.110
oh-0.110
 ourselves-0.107
 cle-0.104
Token.
Token ID13
Feature activation+9.00e-02

Pos logit contributions
les+0.193
ork+0.192
lights+0.188
pron+0.178
adoes+0.166
Neg logit contributions
 least-0.173
oh-0.132
 very-0.132
 ourselves-0.130
 cle-0.124
Token The
Token ID383
Feature activation-1.63e+00

Pos logit contributions
 least+0.136
oh+0.103
 very+0.101
 ourselves+0.100
 cle+0.097
Neg logit contributions
les-0.163
ork-0.162
lights-0.158
pron-0.150
adoes-0.140
Token charming
Token ID23332
Feature activation-3.73e+00

Pos logit contributions
ork+2.612
les+2.539
lights+2.507
pron+2.196
anne+2.139
Neg logit contributions
 least-3.088
 very-2.569
 cle-2.404
 ourselves-2.266
 am-2.228
Token soldier
Token ID10686
Feature activation-5.93e-01

Pos logit contributions
lights+8.226
ork+8.095
les+7.694
pron+7.422
ves+7.102
Neg logit contributions
 least-4.852
 very-4.622
 so-3.812
 cle-3.646
 are-3.599
Token opened
Token ID4721
Feature activation-1.66e+00

Pos logit contributions
ork+1.118
lights+1.114
les+1.107
pron+1.043
adoes+0.987
Neg logit contributions
 least-0.847
 very-0.630
oh-0.621
 cle-0.580
 am-0.580
Token the
Token ID262
Feature activation-1.96e-01

Pos logit contributions
ork+3.600
les+3.514
lights+3.462
pron+3.451
lf+3.184
Neg logit contributions
 least-2.035
 very-1.594
 so-1.483
 ourselves-1.449
oh-1.298
Token window
Token ID4324
Feature activation-2.32e-01

Pos logit contributions
les+0.314
ork+0.310
lights+0.305
pron+0.285
adoes+0.263
Neg logit contributions
 least-0.336
 very-0.263
oh-0.261
 ourselves-0.256
 cle-0.251
Token,
Token ID11
Feature activation-7.25e-01

Pos logit contributions
les+0.382
ork+0.380
lights+0.368
pron+0.351
adoes+0.326
Neg logit contributions
 least-0.384
oh-0.299
 very-0.299
 ourselves-0.290
 cle-0.282
TokenThe
Token ID464
Feature activation-4.28e-01

Pos logit contributions
 least+0.042
oh+0.032
 very+0.031
 ourselves+0.031
 cle+0.030
Neg logit contributions
les-0.050
ork-0.049
lights-0.048
pron-0.046
adoes-0.043
Token noisy
Token ID31210
Feature activation-2.14e+00

Pos logit contributions
ork+0.744
les+0.744
lights+0.718
pron+0.667
adoes+0.636
Neg logit contributions
 least-0.686
 very-0.534
oh-0.512
 cle-0.508
 ourselves-0.498
Token brother
Token ID3956
Feature activation+2.67e-01

Pos logit contributions
ork+4.178
les+3.853
lights+3.773
pron+3.461
ves+3.379
Neg logit contributions
 least-3.168
 very-2.612
 cle-2.292
 so-2.283
 oh-2.261
Token and
Token ID290
Feature activation-6.25e-01

Pos logit contributions
 least+0.405
oh+0.307
 ourselves+0.304
 very+0.301
 cle+0.292
Neg logit contributions
les-0.474
ork-0.468
lights-0.456
pron-0.434
adoes-0.402
Token the
Token ID262
Feature activation-4.52e-01

Pos logit contributions
ork+1.169
les+1.143
lights+1.117
pron+1.056
adoes+0.972
Neg logit contributions
 least-0.927
 very-0.716
 cle-0.673
oh-0.671
 ourselves-0.665
Token quiet
Token ID5897
Feature activation-3.70e+00

Pos logit contributions
ork+0.775
les+0.774
lights+0.745
pron+0.693
adoes+0.656
Neg logit contributions
 least-0.736
 very-0.577
 cle-0.551
oh-0.550
 ourselves-0.541
Token brother
Token ID3956
Feature activation+7.32e-01

Pos logit contributions
ork+7.774
les+6.813
lights+6.483
pron+6.469
ool+6.390
Neg logit contributions
 least-5.007
 very-4.694
 so-4.263
 are-3.790
 most-3.596
Token celebrated
Token ID13943
Feature activation+5.18e-02

Pos logit contributions
 least+1.186
 ourselves+0.926
oh+0.920
 cle+0.892
 very+0.889
Neg logit contributions
les-1.245
ork-1.193
lights-1.165
pron-1.109
lf-1.029
Token with
Token ID351
Feature activation-8.24e-02

Pos logit contributions
 least+0.081
oh+0.062
 very+0.061
 ourselves+0.061
 cle+0.059
Neg logit contributions
les-0.090
ork-0.090
lights-0.088
pron-0.083
adoes-0.077
Token a
Token ID257
Feature activation+9.93e-02

Pos logit contributions
les+0.172
ork+0.172
lights+0.168
pron+0.161
adoes+0.152
Neg logit contributions
 least-0.101
 very-0.070
oh-0.069
 ourselves-0.068
 cle-0.065
Token big
Token ID1263
Feature activation-4.62e-01

Pos logit contributions
 least+0.163
oh+0.127
 very+0.124
 ourselves+0.124
 cle+0.120
Neg logit contributions
les-0.165
ork-0.164
lights-0.160
pron-0.151
adoes-0.139
Token a
Token ID257
Feature activation-7.47e-01

Pos logit contributions
les+0.151
ork+0.150
lights+0.147
pron+0.141
adoes+0.133
Neg logit contributions
 least-0.077
 very-0.052
oh-0.052
 ourselves-0.050
 cle-0.048
Token strong
Token ID1913
Feature activation-1.97e+00

Pos logit contributions
les+1.198
lights+1.174
ork+1.155
pron+1.048
adoes+1.011
Neg logit contributions
 least-1.272
 very-1.058
oh-1.002
 ourselves-0.971
 cle-0.968
Token and
Token ID290
Feature activation+2.18e-01

Pos logit contributions
lights+3.692
ork+3.525
les+3.483
pron+3.294
adoes+2.990
Neg logit contributions
 least-2.964
 very-2.561
 so-2.319
 am-2.212
 say-2.211
Token brave
Token ID14802
Feature activation-2.79e+00

Pos logit contributions
 least+0.331
oh+0.252
 ourselves+0.244
 very+0.242
 cle+0.234
Neg logit contributions
les-0.389
ork-0.388
lights-0.379
pron-0.359
adoes-0.336
Token bear
Token ID6842
Feature activation-6.49e-01

Pos logit contributions
lights+5.158
ork+5.018
pron+4.886
les+4.615
lf+4.454
Neg logit contributions
 least-4.187
 very-3.833
 so-3.419
 cle-3.255
 say-3.180
Token on
Token ID319
Feature activation-3.70e+00

Pos logit contributions
lights+1.135
les+1.126
ork+1.114
pron+1.066
adoes+0.981
Neg logit contributions
 least-0.998
 very-0.792
oh-0.779
 ourselves-0.748
 am-0.737
Token it
Token ID340
Feature activation-4.31e-01

Pos logit contributions
les+7.894
lights+7.736
pron+7.066
ork+6.907
ilt+6.841
Neg logit contributions
 very-5.064
 least-4.493
 so-4.309
 am-3.644
 ourselves-3.465
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
les+0.735
ork+0.732
lights+0.721
pron+0.686
adoes+0.633
Neg logit contributions
 least-0.682
 very-0.538
oh-0.524
 ourselves-0.513
 cle-0.499
Token Benny
Token ID44275
Feature activation+5.53e-01

Pos logit contributions
lights+2.586
les+2.566
ork+2.553
pron+2.424
lf+2.274
Neg logit contributions
 least-1.672
 very-1.303
oh-1.232
 so-1.207
 am-1.140
Token wanted
Token ID2227
Feature activation-6.37e-01

Pos logit contributions
 least+0.824
 ourselves+0.615
oh+0.612
 very+0.584
 cle+0.579
Neg logit contributions
les-1.009
ork-0.999
lights-0.954
pron-0.919
lf-0.865
Token to
Token ID284
Feature activation-6.28e-02

Pos logit contributions
lights+1.241
les+1.240
ork+1.219
pron+1.158
adoes+1.087
Neg logit contributions
 least-0.895
 very-0.694
oh-0.642
 ourselves-0.622
 so-0.621
Token to
Token ID284
Feature activation+9.80e-01

Pos logit contributions
 least+0.279
oh+0.215
 very+0.210
 ourselves+0.209
 cle+0.203
Neg logit contributions
les-0.303
ork-0.301
lights-0.294
pron-0.277
adoes-0.258
Token the
Token ID262
Feature activation-7.13e-01

Pos logit contributions
 least+1.450
oh+1.146
 very+1.058
 ourselves+1.053
 cle+1.038
Neg logit contributions
ork-1.742
les-1.738
lights-1.699
pron-1.632
lf-1.521
Token bowl
Token ID9396
Feature activation+5.17e-01

Pos logit contributions
ork+1.136
les+1.123
lights+1.091
pron+1.010
adoes+0.938
Neg logit contributions
 least-1.251
 very-0.999
 ourselves-0.957
 cle-0.951
oh-0.945
Token.
Token ID13
Feature activation-4.32e-01

Pos logit contributions
 least+0.882
oh+0.689
 ourselves+0.675
 very+0.664
 cle+0.657
Neg logit contributions
les-0.825
ork-0.818
lights-0.804
pron-0.747
adoes-0.688
Token The
Token ID383
Feature activation-1.03e+00

Pos logit contributions
les+0.811
ork+0.807
lights+0.786
pron+0.749
adoes+0.700
Neg logit contributions
 least-0.620
 very-0.461
oh-0.459
 ourselves-0.444
 cle-0.435
Token quiet
Token ID5897
Feature activation-3.52e+00

Pos logit contributions
ork+1.739
les+1.697
lights+1.622
pron+1.495
adoes+1.429
Neg logit contributions
 least-1.754
 very-1.408
 cle-1.340
oh-1.274
 ourselves-1.273
Token brother
Token ID3956
Feature activation-1.55e-01

Pos logit contributions
ork+7.261
les+6.227
pron+6.108
lights+6.024
ilantro+5.796
Neg logit contributions
 least-4.747
 very-4.388
 so-4.176
 are-3.854
 most-3.596
Token used
Token ID973
Feature activation-1.06e+00

Pos logit contributions
ork+0.282
les+0.281
lights+0.277
pron+0.261
adoes+0.246
Neg logit contributions
 least-0.227
oh-0.172
 very-0.170
 ourselves-0.164
 cle-0.160
Token the
Token ID262
Feature activation-8.30e-01

Pos logit contributions
ork+2.140
les+2.130
lights+2.086
pron+1.976
adoes+1.891
Neg logit contributions
 least-1.373
 very-1.073
 cle-0.971
 so-0.967
oh-0.944
Token shield
Token ID7614
Feature activation+3.13e-01

Pos logit contributions
ork+1.255
les+1.251
lights+1.223
pron+1.094
adoes+1.031
Neg logit contributions
 least-1.501
 very-1.228
 cle-1.188
 ourselves-1.167
 am-1.148
Token to
Token ID284
Feature activation+2.40e-01

Pos logit contributions
 least+0.491
oh+0.378
 ourselves+0.370
 very+0.365
 cle+0.354
Neg logit contributions
les-0.539
ork-0.537
lights-0.522
pron-0.493
adoes-0.460
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
les+1.394
ork+1.380
lights+1.367
pron+1.308
adoes+1.220
Neg logit contributions
 least-0.734
 very-0.515
 ourselves-0.487
oh-0.466
 cle-0.451
Token day
Token ID1110
Feature activation-2.02e-01

Pos logit contributions
ork+2.345
lights+2.271
les+2.259
pron+2.063
adoes+2.014
Neg logit contributions
 least-1.780
 very-1.393
oh-1.312
 ourselves-1.280
 cle-1.269
Token playing
Token ID2712
Feature activation-1.26e+00

Pos logit contributions
ork+0.425
les+0.425
lights+0.418
pron+0.399
adoes+0.377
Neg logit contributions
 least-0.241
 very-0.167
oh-0.167
 ourselves-0.162
 cle-0.154
Token with
Token ID351
Feature activation-9.77e-01

Pos logit contributions
les+2.523
ork+2.516
lights+2.515
pron+2.393
adoes+2.162
Neg logit contributions
 least-1.573
 very-1.295
 ourselves-1.174
 so-1.128
 am-1.124
Token the
Token ID262
Feature activation-2.13e+00

Pos logit contributions
les+2.219
ork+2.194
lights+2.128
pron+2.121
lf+1.935
Neg logit contributions
 least-1.076
 very-0.771
 ourselves-0.698
 so-0.640
 cle-0.633
Token yellow
Token ID7872
Feature activation-3.46e+00

Pos logit contributions
ork+3.915
les+3.494
lights+3.458
pron+3.223
ves+3.073
Neg logit contributions
 least-3.571
 very-3.064
 cle-2.877
 so-2.709
 most-2.666
Token shield
Token ID7614
Feature activation-8.51e-02

Pos logit contributions
ork+6.053
lights+5.455
pron+5.304
anne+5.220
ves+4.830
Neg logit contributions
 least-5.923
 very-5.360
 are-5.104
 am-4.773
 so-4.699
Token and
Token ID290
Feature activation+1.47e-01

Pos logit contributions
les+0.155
ork+0.155
lights+0.151
pron+0.144
adoes+0.134
Neg logit contributions
 least-0.125
oh-0.093
 very-0.093
 ourselves-0.091
 cle-0.088
Token having
Token ID1719
Feature activation+6.84e-02

Pos logit contributions
 least+0.191
oh+0.138
 very+0.134
 ourselves+0.133
 cle+0.127
Neg logit contributions
les-0.294
ork-0.293
lights-0.287
pron-0.274
adoes-0.258
Token lots
Token ID6041
Feature activation-1.24e+00

Pos logit contributions
 least+0.094
oh+0.069
 very+0.068
 ourselves+0.067
 cle+0.064
Neg logit contributions
les-0.132
ork-0.131
lights-0.128
pron-0.122
adoes-0.114
Token of
Token ID286
Feature activation-5.85e-01

Pos logit contributions
ork+1.844
lights+1.769
les+1.755
pron+1.748
adoes+1.646
Neg logit contributions
 least-2.177
 very-1.880
 so-1.739
 am-1.722
oh-1.722
Token cool
Token ID3608
Feature activation-3.11e+00

Pos logit contributions
les+2.871
lights+2.845
ork+2.748
adoes+2.585
pron+2.499
Neg logit contributions
 least-1.985
 very-1.829
 am-1.581
oh-1.577
 so-1.530
Token shirt
Token ID10147
Feature activation-1.03e+00

Pos logit contributions
lights+6.120
les+5.772
ork+5.766
emaker+5.394
adoes+5.280
Neg logit contributions
 very-4.049
 least-3.903
 so-3.552
 am-3.220
 say-3.074
Token with
Token ID351
Feature activation-7.98e-01

Pos logit contributions
ork+1.887
les+1.874
lights+1.859
pron+1.716
adoes+1.684
Neg logit contributions
 least-1.519
 very-1.217
oh-1.135
 so-1.122
 ourselves-1.056
Token a
Token ID257
Feature activation-1.89e+00

Pos logit contributions
ork+1.652
les+1.623
lights+1.607
pron+1.553
lf+1.434
Neg logit contributions
 least-0.991
 very-0.763
 ourselves-0.694
oh-0.683
 so-0.679
Token dinosaur
Token ID28257
Feature activation-1.67e+00

Pos logit contributions
lights+3.492
les+3.480
ork+3.284
adoes+3.076
ors+2.898
Neg logit contributions
 least-2.658
 very-2.535
 so-2.141
 am-2.140
oh-2.102
Token on
Token ID319
Feature activation-3.41e+00

Pos logit contributions
lights+3.123
ork+2.992
les+2.991
pron+2.840
adoes+2.738
Neg logit contributions
 least-2.417
 very-2.033
oh-1.847
 so-1.789
 am-1.707
Token it
Token ID340
Feature activation-5.56e-01

Pos logit contributions
les+8.195
lights+7.841
pron+7.557
ork+7.500
ravings+7.072
Neg logit contributions
 very-3.577
 least-3.530
 so-3.206
 ourselves-2.487
 say-2.389
Token.
Token ID13
Feature activation-1.22e+00

Pos logit contributions
les+0.962
ork+0.960
lights+0.948
pron+0.888
adoes+0.837
Neg logit contributions
 least-0.872
 very-0.685
oh-0.667
 ourselves-0.644
 so-0.625
Token
Token ID198
Feature activation-7.84e-01

Pos logit contributions
les+2.474
ork+2.445
lights+2.398
pron+2.310
adoes+2.138
Neg logit contributions
 least-1.545
 very-1.220
oh-1.159
 so-1.132
 ourselves-1.072
Token
Token ID198
Feature activation-1.18e+00

Pos logit contributions
les+1.566
lights+1.541
ork+1.526
pron+1.460
adoes+1.420
Neg logit contributions
 least-0.989
 very-0.762
oh-0.759
 am-0.712
 ourselves-0.695
TokenTim
Token ID14967
Feature activation-5.78e-02

Pos logit contributions
les+2.121
lights+2.098
ork+2.062
pron+1.916
adoes+1.808
Neg logit contributions
 least-1.743
 very-1.474
oh-1.469
 oh-1.352
 so-1.338
Token river
Token ID7850
Feature activation-7.06e-02

Pos logit contributions
 least+0.785
oh+0.635
 ourselves+0.608
 very+0.604
 cle+0.589
Neg logit contributions
les-0.688
ork-0.684
lights-0.668
pron-0.620
adoes-0.583
Token separates
Token ID31555
Feature activation+4.60e-01

Pos logit contributions
les+0.154
ork+0.154
lights+0.151
pron+0.145
adoes+0.136
Neg logit contributions
 least-0.078
oh-0.053
 very-0.052
 ourselves-0.051
 cle-0.048
Token us
Token ID514
Feature activation+4.80e-01

Pos logit contributions
 least+0.640
oh+0.471
 very+0.461
 cle+0.453
 am+0.449
Neg logit contributions
les-0.880
ork-0.867
lights-0.853
pron-0.792
adoes-0.768
Token!"
Token ID2474
Feature activation+5.38e-01

Pos logit contributions
 least+0.731
oh+0.554
 very+0.534
 ourselves+0.531
 cle+0.529
Neg logit contributions
les-0.848
ork-0.843
lights-0.820
pron-0.765
adoes-0.731
Token The
Token ID383
Feature activation-1.04e+00

Pos logit contributions
 least+0.848
oh+0.644
 ourselves+0.625
 very+0.624
 cle+0.614
Neg logit contributions
ork-0.942
les-0.940
lights-0.906
pron-0.857
adoes-0.811
Token quiet
Token ID5897
Feature activation-3.39e+00

Pos logit contributions
les+1.864
ork+1.859
lights+1.799
pron+1.667
adoes+1.539
Neg logit contributions
 least-1.663
 very-1.362
 cle-1.272
 ourselves-1.245
oh-1.213
Token bunny
Token ID44915
Feature activation-4.00e-01

Pos logit contributions
ork+7.183
pron+7.020
lights+6.587
les+6.282
ravings+6.115
Neg logit contributions
 least-4.173
 very-4.163
 so-3.832
 am-3.611
 are-3.581
Token was
Token ID373
Feature activation+2.32e-01

Pos logit contributions
ork+0.798
les+0.797
lights+0.796
pron+0.752
adoes+0.709
Neg logit contributions
 least-0.515
 very-0.383
oh-0.381
 ourselves-0.348
 cle-0.348
Token sad
Token ID6507
Feature activation+5.37e-01

Pos logit contributions
 least+0.299
oh+0.217
 ourselves+0.208
 very+0.204
 cle+0.197
Neg logit contributions
les-0.465
ork-0.464
lights-0.455
pron-0.434
adoes-0.410
Token.
Token ID13
Feature activation+2.33e-01

Pos logit contributions
 least+0.913
oh+0.734
 ourselves+0.705
 cle+0.697
 very+0.689
Neg logit contributions
les-0.863
ork-0.849
lights-0.836
pron-0.764
adoes-0.718
Token They
Token ID1119
Feature activation+6.18e-01

Pos logit contributions
 least+0.401
oh+0.314
 very+0.308
 ourselves+0.307
 cle+0.298
Neg logit contributions
les-0.370
ork-0.370
lights-0.360
pron-0.337
adoes-0.314
Token with
Token ID351
Feature activation-3.08e-01

Pos logit contributions
les+0.084
ork+0.083
lights+0.081
pron+0.076
adoes+0.071
Neg logit contributions
 least-0.079
oh-0.061
 very-0.061
 ourselves-0.059
 cle-0.057
Token pictures
Token ID5986
Feature activation-6.61e-01

Pos logit contributions
les+0.580
ork+0.578
lights+0.570
pron+0.539
adoes+0.506
Neg logit contributions
 least-0.434
 very-0.331
oh-0.324
 ourselves-0.314
 cle-0.306
Token of
Token ID286
Feature activation-2.99e-01

Pos logit contributions
ork+1.088
les+1.080
lights+1.047
pron+1.001
adoes+0.941
Neg logit contributions
 least-1.074
 very-0.885
oh-0.842
 ourselves-0.818
 so-0.811
Token cute
Token ID13779
Feature activation-2.06e+00

Pos logit contributions
les+0.500
ork+0.500
lights+0.487
pron+0.461
adoes+0.425
Neg logit contributions
 least-0.487
 very-0.382
oh-0.373
 ourselves-0.373
 cle-0.359
Token animals
Token ID4695
Feature activation-2.97e-01

Pos logit contributions
ork+3.719
pron+3.703
lights+3.605
les+3.350
anne+3.099
Neg logit contributions
 least-2.864
 very-2.629
 so-2.372
 ourselves-2.362
oh-2.267
Token on
Token ID319
Feature activation-3.37e+00

Pos logit contributions
ork+0.506
les+0.504
lights+0.496
pron+0.468
adoes+0.436
Neg logit contributions
 least-0.474
 very-0.372
oh-0.365
 ourselves-0.358
 am-0.344
Token it
Token ID340
Feature activation-5.98e-01

Pos logit contributions
les+6.766
lights+6.565
pron+6.391
ork+6.386
ravings+6.269
Neg logit contributions
 least-4.321
 very-4.293
 so-3.569
 ourselves-3.409
 am-3.287
Token.
Token ID13
Feature activation-1.54e+00

Pos logit contributions
les+1.043
ork+1.041
lights+1.019
pron+0.970
adoes+0.904
Neg logit contributions
 least-0.933
 very-0.742
oh-0.706
 ourselves-0.694
 so-0.673
Token He
Token ID679
Feature activation+2.97e-01

Pos logit contributions
les+2.688
ork+2.626
lights+2.498
pron+2.401
adoes+2.223
Neg logit contributions
 least-2.512
 very-2.075
oh-1.937
 so-1.935
 am-1.898
Token loved
Token ID6151
Feature activation-6.73e-01

Pos logit contributions
 least+0.465
oh+0.358
 ourselves+0.355
 very+0.345
 cle+0.338
Neg logit contributions
les-0.514
ork-0.511
lights-0.495
pron-0.474
lf-0.437
Token to
Token ID284
Feature activation-6.75e-01

Pos logit contributions
les+1.276
ork+1.270
lights+1.243
pron+1.170
adoes+1.094
Neg logit contributions
 least-0.956
 very-0.754
oh-0.695
 ourselves-0.690
 cle-0.677
Token ran
Token ID4966
Feature activation-5.65e-01

Pos logit contributions
lights+2.683
ork+2.608
les+2.525
pron+2.469
adoes+2.308
Neg logit contributions
 least-1.794
 very-1.423
oh-1.343
 so-1.249
 cle-1.231
Token away
Token ID1497
Feature activation+3.24e-02

Pos logit contributions
les+1.026
lights+1.016
ork+1.013
pron+0.981
lf+0.885
Neg logit contributions
 least-0.849
 very-0.681
oh-0.638
 ourselves-0.621
 so-0.614
Token,
Token ID11
Feature activation-4.06e-01

Pos logit contributions
 least+0.050
oh+0.039
 very+0.038
 ourselves+0.038
 cle+0.036
Neg logit contributions
les-0.057
ork-0.057
lights-0.055
pron-0.052
adoes-0.049
Token and
Token ID290
Feature activation+2.87e-01

Pos logit contributions
les+0.808
ork+0.801
lights+0.787
pron+0.740
adoes+0.697
Neg logit contributions
 least-0.547
 very-0.414
oh-0.392
 ourselves-0.381
 cle-0.374
Token the
Token ID262
Feature activation-1.36e+00

Pos logit contributions
 least+0.399
oh+0.296
 ourselves+0.289
 very+0.286
 cle+0.276
Neg logit contributions
les-0.545
ork-0.541
lights-0.529
pron-0.502
adoes-0.473
Token blue
Token ID4171
Feature activation-3.36e+00

Pos logit contributions
les+2.493
ork+2.473
lights+2.419
pron+2.183
adoes+2.022
Neg logit contributions
 least-2.111
 very-1.784
 cle-1.651
 so-1.563
 oh-1.541
Token bird
Token ID6512
Feature activation-2.83e-01

Pos logit contributions
lights+6.833
les+6.539
ork+6.430
pron+6.387
lf+6.008
Neg logit contributions
 least-4.768
 very-4.400
 am-3.838
 are-3.783
 so-3.771
Token saved
Token ID7448
Feature activation-5.63e-01

Pos logit contributions
ork+0.531
les+0.529
lights+0.524
pron+0.497
adoes+0.465
Neg logit contributions
 least-0.400
 very-0.301
oh-0.299
 ourselves-0.284
 cle-0.278
Token its
Token ID663
Feature activation-3.01e-01

Pos logit contributions
les+1.106
lights+1.093
ork+1.091
pron+1.055
lf+0.958
Neg logit contributions
 least-0.774
 very-0.587
 ourselves-0.573
oh-0.566
 cle-0.529
Token eggs
Token ID9653
Feature activation-2.15e-01

Pos logit contributions
les+0.510
ork+0.504
lights+0.495
pron+0.466
adoes+0.431
Neg logit contributions
 least-0.492
 very-0.385
oh-0.375
 ourselves-0.373
 cle-0.368
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
les+0.368
ork+0.365
lights+0.357
pron+0.338
adoes+0.315
Neg logit contributions
 least-0.344
 very-0.265
oh-0.264
 ourselves-0.261
 cle-0.249
Token.
Token ID13
Feature activation+1.04e-01

Pos logit contributions
les+0.242
ork+0.240
lights+0.235
pron+0.223
adoes+0.206
Neg logit contributions
 least-0.231
oh-0.180
 very-0.179
 ourselves-0.175
 cle-0.169
Token
Token ID198
Feature activation+5.35e-01

Pos logit contributions
 least+0.167
oh+0.129
 very+0.127
 ourselves+0.126
 cle+0.122
Neg logit contributions
les-0.176
ork-0.175
lights-0.171
pron-0.161
adoes-0.150
Token
Token ID198
Feature activation-3.51e-01

Pos logit contributions
 least+0.847
oh+0.633
 ourselves+0.630
 very+0.623
 cle+0.616
Neg logit contributions
ork-0.932
les-0.917
lights-0.885
pron-0.845
lf-0.781
TokenWhen
Token ID2215
Feature activation+1.46e-01

Pos logit contributions
les+0.671
lights+0.658
ork+0.657
pron+0.614
adoes+0.583
Neg logit contributions
 least-0.496
oh-0.381
 very-0.369
 ourselves-0.360
 am-0.347
Token the
Token ID262
Feature activation-1.49e+00

Pos logit contributions
 least+0.218
oh+0.165
 very+0.161
 ourselves+0.160
 cle+0.155
Neg logit contributions
les-0.264
ork-0.263
lights-0.257
pron-0.244
adoes-0.228
Token white
Token ID2330
Feature activation-3.36e+00

Pos logit contributions
ork+2.670
les+2.481
lights+2.347
pron+2.318
adoes+2.104
Neg logit contributions
 least-2.554
 very-2.109
 cle-1.966
 so-1.925
 ourselves-1.912
Token all
Token ID477
Feature activation-2.42e-01

Pos logit contributions
ork+6.805
pron+6.219
lights+6.141
les+5.941
isdom+5.766
Neg logit contributions
 least-4.747
 very-4.464
 so-4.034
 first-3.606
'm-3.599
Tokenigator
Token ID23823
Feature activation+5.59e-02

Pos logit contributions
ork+0.382
les+0.376
lights+0.365
pron+0.350
adoes+0.319
Neg logit contributions
 least-0.424
oh-0.335
 very-0.333
 ourselves-0.325
 cle-0.317
Token got
Token ID1392
Feature activation-2.97e-01

Pos logit contributions
 least+0.087
oh+0.067
 very+0.066
 ourselves+0.065
 cle+0.063
Neg logit contributions
les-0.097
ork-0.097
lights-0.094
pron-0.089
adoes-0.083
Token out
Token ID503
Feature activation+2.85e-01

Pos logit contributions
les+0.594
ork+0.591
lights+0.577
pron+0.555
lf+0.517
Neg logit contributions
 least-0.396
 very-0.295
oh-0.285
 ourselves-0.277
 cle-0.268
Token of
Token ID286
Feature activation-5.36e-01

Pos logit contributions
 least+0.476
oh+0.375
 ourselves+0.365
 very+0.363
 cle+0.353
Neg logit contributions
les-0.467
ork-0.462
lights-0.455
pron-0.421
adoes-0.394
Token it
Token ID340
Feature activation+7.58e-01

Pos logit contributions
 least+0.551
oh+0.425
 ourselves+0.409
 very+0.405
 cle+0.398
Neg logit contributions
les-0.609
ork-0.605
lights-0.591
pron-0.555
adoes-0.522
Token!
Token ID0
Feature activation+3.05e-01

Pos logit contributions
 least+1.269
oh+1.006
 ourselves+0.964
 cle+0.943
 am+0.940
Neg logit contributions
les-1.231
ork-1.213
lights-1.190
pron-1.097
adoes-1.027
Token
Token ID198
Feature activation+4.41e-01

Pos logit contributions
 least+0.471
oh+0.358
 very+0.352
 ourselves+0.352
 cle+0.340
Neg logit contributions
ork-0.533
les-0.532
lights-0.517
pron-0.488
adoes-0.456
Token
Token ID198
Feature activation-1.60e-01

Pos logit contributions
 least+0.690
oh+0.515
 ourselves+0.511
 very+0.507
 cle+0.498
Neg logit contributions
ork-0.773
les-0.766
lights-0.740
pron-0.704
lf-0.652
TokenThe
Token ID464
Feature activation-1.53e+00

Pos logit contributions
les+0.310
ork+0.307
lights+0.303
pron+0.286
adoes+0.270
Neg logit contributions
 least-0.221
oh-0.166
 very-0.162
 ourselves-0.158
 cle-0.152
Token white
Token ID2330
Feature activation-3.34e+00

Pos logit contributions
ork+2.934
les+2.722
lights+2.643
pron+2.547
anne+2.320
Neg logit contributions
 least-2.438
 very-2.027
 cle-1.847
 so-1.806
 ourselves-1.789
Token all
Token ID477
Feature activation-1.48e-01

Pos logit contributions
ork+6.523
lights+6.091
pron+5.929
les+5.720
isdom+5.478
Neg logit contributions
 least-4.859
 very-4.620
 so-4.138
 am-3.843
'm-3.838
Tokenigator
Token ID23823
Feature activation-2.54e-02

Pos logit contributions
ork+0.218
les+0.216
lights+0.209
pron+0.197
adoes+0.181
Neg logit contributions
 least-0.272
oh-0.219
 very-0.218
 ourselves-0.212
 cle-0.208
Token was
Token ID373
Feature activation+8.44e-02

Pos logit contributions
les+0.047
ork+0.047
lights+0.046
pron+0.044
adoes+0.041
Neg logit contributions
 least-0.037
oh-0.027
 very-0.027
 ourselves-0.026
 cle-0.026
Token scared
Token ID12008
Feature activation+1.59e-01

Pos logit contributions
 least+0.077
oh+0.046
 very+0.044
 ourselves+0.043
 cle+0.040
Neg logit contributions
les-0.202
ork-0.201
lights-0.198
pron-0.190
adoes-0.181
Token so
Token ID523
Feature activation+2.03e-01

Pos logit contributions
 least+0.263
oh+0.205
 very+0.200
 ourselves+0.200
 cle+0.194
Neg logit contributions
les-0.265
ork-0.263
lights-0.257
pron-0.241
adoes-0.225
Token with
Token ID351
Feature activation-3.52e-01

Pos logit contributions
les+0.841
ork+0.836
lights+0.826
pron+0.767
adoes+0.741
Neg logit contributions
 least-0.638
 very-0.495
oh-0.477
 ourselves-0.452
 am-0.443
Token his
Token ID465
Feature activation+2.90e-01

Pos logit contributions
ork+0.690
les+0.690
lights+0.677
pron+0.646
adoes+0.603
Neg logit contributions
 least-0.463
 very-0.348
oh-0.332
 ourselves-0.329
 cle-0.319
Token favorite
Token ID4004
Feature activation-2.30e+00

Pos logit contributions
 least+0.541
oh+0.437
 ourselves+0.426
 very+0.423
 cle+0.414
Neg logit contributions
ork-0.415
les-0.415
lights-0.400
pron-0.374
adoes-0.344
Token cartoon
Token ID16251
Feature activation-7.98e-01

Pos logit contributions
lights+3.885
ork+3.691
les+3.577
pron+3.469
anne+3.186
Neg logit contributions
 least-3.560
 very-3.417
 am-3.077
 so-3.055
 are-3.032
Token character
Token ID2095
Feature activation-1.68e-01

Pos logit contributions
les+1.337
lights+1.318
ork+1.314
pron+1.227
adoes+1.154
Neg logit contributions
 least-1.249
 very-1.037
oh-0.984
 am-0.946
 ourselves-0.943
Token on
Token ID319
Feature activation-3.30e+00

Pos logit contributions
les+0.292
ork+0.292
lights+0.286
pron+0.270
adoes+0.252
Neg logit contributions
 least-0.259
oh-0.198
 very-0.198
 ourselves-0.193
 cle-0.186
Token the
Token ID262
Feature activation+7.30e-01

Pos logit contributions
lights+7.475
les+7.264
ork+6.719
anne+6.718
pron+6.698
Neg logit contributions
 least-3.817
 very-3.815
 so-3.294
 ourselves-2.909
 am-2.692
Token cover
Token ID3002
Feature activation-1.09e+00

Pos logit contributions
 least+1.451
oh+1.199
 ourselves+1.160
 am+1.147
 very+1.136
Neg logit contributions
ork-0.963
les-0.960
lights-0.890
pron-0.869
lf-0.779
Token.
Token ID13
Feature activation-9.26e-01

Pos logit contributions
ork+1.936
les+1.922
lights+1.882
pron+1.818
adoes+1.699
Neg logit contributions
 least-1.596
 very-1.309
oh-1.233
 ourselves-1.209
 so-1.193
Token He
Token ID679
Feature activation-4.39e-02

Pos logit contributions
les+1.688
ork+1.602
lights+1.600
pron+1.507
lf+1.367
Neg logit contributions
 least-1.381
 very-1.129
oh-1.072
 so-1.038
 oh-1.029
Token asked
Token ID1965
Feature activation-3.81e-01

Pos logit contributions
les+0.076
ork+0.076
lights+0.075
pron+0.070
adoes+0.066
Neg logit contributions
 least-0.069
oh-0.053
 very-0.052
 ourselves-0.051
 cle-0.050
Token to
Token ID284
Feature activation-7.13e-01

Pos logit contributions
 least+1.382
oh+1.051
 ourselves+1.020
 cle+0.999
 am+0.967
Neg logit contributions
ork-1.631
les-1.610
lights-1.557
pron-1.501
lf-1.355
Token go
Token ID467
Feature activation+4.08e-01

Pos logit contributions
les+1.512
ork+1.499
lights+1.493
pron+1.413
adoes+1.331
Neg logit contributions
 least-0.905
 very-0.611
 ourselves-0.608
oh-0.591
 am-0.556
Token home
Token ID1363
Feature activation+9.69e-01

Pos logit contributions
 least+0.641
oh+0.502
 ourselves+0.482
 very+0.480
 cle+0.476
Neg logit contributions
les-0.708
ork-0.703
lights-0.683
pron-0.639
adoes-0.596
Token.
Token ID13
Feature activation-5.67e-01

Pos logit contributions
 least+1.741
oh+1.428
 cle+1.355
 ourselves+1.344
 oh+1.318
Neg logit contributions
ork-1.453
les-1.422
lights-1.409
pron-1.282
lf-1.193
Token The
Token ID383
Feature activation-1.19e+00

Pos logit contributions
les+0.971
ork+0.952
lights+0.934
pron+0.892
adoes+0.812
Neg logit contributions
 least-0.922
 very-0.712
oh-0.698
 ourselves-0.685
 cle-0.675
Token polite
Token ID23507
Feature activation-3.28e+00

Pos logit contributions
ork+2.267
les+2.232
lights+2.216
pron+2.047
adoes+1.939
Neg logit contributions
 least-1.783
 very-1.412
 cle-1.310
 ourselves-1.275
 so-1.247
Token boy
Token ID2933
Feature activation+3.27e-01

Pos logit contributions
ork+8.480
lights+8.421
pron+8.357
les+7.752
anne+7.683
Neg logit contributions
 least-2.723
 very-2.421
 are-2.172
 so-2.072
 say-1.974
Token was
Token ID373
Feature activation+3.77e-01

Pos logit contributions
 least+0.491
oh+0.374
 ourselves+0.370
 very+0.362
 cle+0.352
Neg logit contributions
les-0.585
ork-0.579
lights-0.562
pron-0.535
adoes-0.495
Token sad
Token ID6507
Feature activation+3.56e-01

Pos logit contributions
 least+0.323
oh+0.192
 ourselves+0.183
 very+0.165
 cle+0.161
Neg logit contributions
les-0.912
ork-0.910
lights-0.895
pron-0.865
adoes-0.816
Token.
Token ID13
Feature activation+2.03e-01

Pos logit contributions
 least+0.567
oh+0.445
 ourselves+0.430
 very+0.426
 cle+0.418
Neg logit contributions
les-0.605
ork-0.601
lights-0.590
pron-0.548
adoes-0.513
Token He
Token ID679
Feature activation+2.30e-01

Pos logit contributions
 least+0.354
oh+0.281
 very+0.275
 ourselves+0.275
 cle+0.266
Neg logit contributions
les-0.317
ork-0.316
lights-0.307
pron-0.287
adoes-0.267
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
 least+0.351
oh+0.251
 ourselves+0.240
 very+0.237
 cle+0.227
Neg logit contributions
les-0.584
ork-0.579
lights-0.568
pron-0.540
adoes-0.513
Token dragon
Token ID10441
Feature activation-5.78e-01

Pos logit contributions
ork+1.951
lights+1.899
les+1.874
pron+1.689
adoes+1.591
Neg logit contributions
 least-2.201
 very-1.841
 cle-1.728
 ourselves-1.669
oh-1.667
Token again
Token ID757
Feature activation+6.91e-01

Pos logit contributions
ork+1.056
les+1.046
lights+1.036
pron+0.981
adoes+0.921
Neg logit contributions
 least-0.859
 very-0.666
oh-0.648
 ourselves-0.621
 cle-0.609
Token,
Token ID11
Feature activation-2.97e-01

Pos logit contributions
 least+1.219
oh+0.976
 ourselves+0.959
 cle+0.937
 very+0.936
Neg logit contributions
les-1.055
ork-1.042
lights-0.998
pron-0.944
lf-0.862
Token but
Token ID475
Feature activation-8.85e-02

Pos logit contributions
ork+0.564
les+0.561
lights+0.548
pron+0.519
adoes+0.485
Neg logit contributions
 least-0.424
 very-0.319
oh-0.310
 ourselves-0.303
 cle-0.296
Token this
Token ID428
Feature activation-3.28e+00

Pos logit contributions
les+0.174
ork+0.173
lights+0.170
pron+0.161
adoes+0.151
Neg logit contributions
 least-0.120
oh-0.087
 very-0.087
 ourselves-0.085
 cle-0.081
Token time
Token ID640
Feature activation-2.65e-01

Pos logit contributions
lights+5.421
ork+4.951
adoes+4.925
les+4.781
ors+4.780
Neg logit contributions
 least-6.008
 very-5.546
 am-5.101
 so-5.083
oh-4.858
Token it
Token ID340
Feature activation-6.83e-01

Pos logit contributions
les+0.525
ork+0.518
lights+0.512
pron+0.489
adoes+0.457
Neg logit contributions
 least-0.348
 very-0.259
oh-0.255
 ourselves-0.245
 cle-0.236
Token was
Token ID373
Feature activation-9.66e-01

Pos logit contributions
lights+1.413
les+1.413
ork+1.408
pron+1.343
adoes+1.272
Neg logit contributions
 least-0.852
oh-0.624
 very-0.616
 am-0.562
 cle-0.560
Token awake
Token ID21693
Feature activation-4.46e-01

Pos logit contributions
les+2.111
ork+2.062
lights+2.018
pron+1.874
adoes+1.819
Neg logit contributions
 least-1.158
 very-0.925
 so-0.790
 cle-0.780
oh-0.749
Token!
Token ID0
Feature activation+2.31e-01

Pos logit contributions
ork+0.779
les+0.778
lights+0.762
pron+0.725
adoes+0.675
Neg logit contributions
 least-0.685
 very-0.535
oh-0.519
 ourselves-0.508
 so-0.489
Token pick
Token ID2298
Feature activation+5.08e-01

Pos logit contributions
les+0.316
ork+0.315
lights+0.308
pron+0.293
adoes+0.272
Neg logit contributions
 least-0.241
oh-0.177
 very-0.177
 ourselves-0.174
 cle-0.167
Token you
Token ID345
Feature activation+2.26e-01

Pos logit contributions
 least+0.825
oh+0.643
 very+0.619
 ourselves+0.611
 cle+0.603
Neg logit contributions
les-0.867
ork-0.839
lights-0.815
pron-0.753
adoes-0.727
Token up
Token ID510
Feature activation+2.82e-01

Pos logit contributions
 least+0.344
oh+0.262
 very+0.255
 ourselves+0.252
 cle+0.246
Neg logit contributions
les-0.403
ork-0.400
lights-0.390
pron-0.368
adoes-0.347
Token!"
Token ID2474
Feature activation+5.51e-01

Pos logit contributions
 least+0.419
oh+0.318
 very+0.307
 ourselves+0.303
 cle+0.297
Neg logit contributions
les-0.510
ork-0.506
lights-0.496
pron-0.467
adoes-0.441
Token The
Token ID383
Feature activation-2.46e-01

Pos logit contributions
 least+0.764
oh+0.553
 ourselves+0.535
 very+0.527
 cle+0.512
Neg logit contributions
les-1.071
ork-1.058
lights-1.033
pron-0.972
adoes-0.931
Token quiet
Token ID5897
Feature activation-3.28e+00

Pos logit contributions
les+0.418
ork+0.417
lights+0.405
pron+0.380
adoes+0.353
Neg logit contributions
 least-0.402
 very-0.313
oh-0.309
 ourselves-0.304
 cle-0.300
Token bunny
Token ID44915
Feature activation-3.87e-01

Pos logit contributions
ork+6.871
pron+6.557
lights+6.212
les+5.903
ravings+5.800
Neg logit contributions
 least-4.136
 very-4.126
 so-3.831
 am-3.505
 are-3.442
Token was
Token ID373
Feature activation+3.94e-01

Pos logit contributions
ork+0.840
les+0.838
lights+0.837
pron+0.798
adoes+0.754
Neg logit contributions
 least-0.430
 very-0.301
oh-0.298
 ourselves-0.274
 cle-0.270
Token happy
Token ID3772
Feature activation-5.21e-01

Pos logit contributions
 least+0.472
oh+0.334
 ourselves+0.316
 very+0.300
 am+0.296
Neg logit contributions
les-0.824
ork-0.821
lights-0.807
pron-0.776
adoes-0.731
Token.
Token ID13
Feature activation+2.36e-01

Pos logit contributions
les+0.918
ork+0.917
lights+0.890
pron+0.856
adoes+0.788
Neg logit contributions
 least-0.823
 very-0.646
oh-0.620
 ourselves-0.615
 so-0.593
Token The
Token ID383
Feature activation-2.70e-01

Pos logit contributions
 least+0.388
oh+0.300
 very+0.294
 ourselves+0.293
 cle+0.283
Neg logit contributions
les-0.396
ork-0.394
lights-0.384
pron-0.361
adoes-0.338
Token big
Token ID1263
Feature activation+5.37e-01

Pos logit contributions
 least+0.205
oh+0.160
 very+0.156
 ourselves+0.156
 cle+0.151
Neg logit contributions
les-0.207
ork-0.206
lights-0.200
pron-0.190
adoes-0.175
Token red
Token ID2266
Feature activation-2.87e-01

Pos logit contributions
 least+1.006
oh+0.818
 ourselves+0.806
 very+0.776
 cle+0.771
Neg logit contributions
les-0.763
ork-0.756
lights-0.726
pron-0.699
lf-0.626
Token nose
Token ID9686
Feature activation-1.44e+00

Pos logit contributions
les+0.415
ork+0.410
lights+0.406
pron+0.371
adoes+0.345
Neg logit contributions
 least-0.527
 very-0.426
oh-0.423
 ourselves-0.412
 am-0.402
Token and
Token ID290
Feature activation+1.25e-01

Pos logit contributions
ork+2.741
les+2.722
lights+2.676
pron+2.508
adoes+2.444
Neg logit contributions
 least-1.948
 very-1.700
 so-1.516
oh-1.475
 am-1.367
Token he
Token ID339
Feature activation-7.96e-03

Pos logit contributions
 least+0.175
oh+0.129
 very+0.126
 ourselves+0.125
 cle+0.120
Neg logit contributions
les-0.237
ork-0.236
lights-0.231
pron-0.220
adoes-0.206
Token wore
Token ID12408
Feature activation-3.28e+00

Pos logit contributions
les+0.014
ork+0.014
lights+0.014
pron+0.013
adoes+0.012
Neg logit contributions
 least-0.012
oh-0.009
 very-0.009
 ourselves-0.009
 cle-0.008
Token funny
Token ID8258
Feature activation-1.59e+00

Pos logit contributions
lights+10.115
ork+9.599
les+9.561
adoes+9.119
ravings+9.083
Neg logit contributions
 very-1.772
 so-1.142
 least-0.880
 oh-0.451
 a-0.295
Token clothes
Token ID8242
Feature activation-1.37e+00

Pos logit contributions
ork+2.486
lights+2.467
les+2.412
pron+2.351
adoes+2.119
Neg logit contributions
 least-2.386
 very-2.284
 so-2.049
 am-2.010
oh-2.004
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
ork+2.620
les+2.549
lights+2.519
pron+2.407
adoes+2.313
Neg logit contributions
 least-1.971
 very-1.640
 so-1.485
 ourselves-1.413
oh-1.402
Token He
Token ID679
Feature activation-1.79e-01

Pos logit contributions
les+1.991
ork+1.953
lights+1.890
pron+1.829
adoes+1.696
Neg logit contributions
 least-1.929
 very-1.604
oh-1.515
 am-1.487
 so-1.478
Token liked
Token ID8288
Feature activation-4.71e-01

Pos logit contributions
ork+0.314
les+0.314
lights+0.308
pron+0.289
adoes+0.273
Neg logit contributions
 least-0.275
 very-0.211
oh-0.210
 ourselves-0.201
 cle-0.198
Token calendar
Token ID11845
Feature activation+2.00e-01

Pos logit contributions
 least+0.303
oh+0.246
 ourselves+0.241
 very+0.241
 cle+0.234
Neg logit contributions
les-0.219
ork-0.219
lights-0.211
pron-0.199
adoes-0.180
Token with
Token ID351
Feature activation-7.15e-02

Pos logit contributions
 least+0.325
oh+0.252
 ourselves+0.247
 very+0.245
 cle+0.239
Neg logit contributions
les-0.336
ork-0.334
lights-0.326
pron-0.309
adoes-0.285
Token pictures
Token ID5986
Feature activation-7.39e-01

Pos logit contributions
les+0.132
ork+0.131
lights+0.128
pron+0.122
adoes+0.114
Neg logit contributions
 least-0.104
oh-0.079
 very-0.078
 ourselves-0.076
 cle-0.074
Token of
Token ID286
Feature activation-5.10e-01

Pos logit contributions
ork+1.273
les+1.267
lights+1.230
pron+1.181
adoes+1.110
Neg logit contributions
 least-1.136
 very-0.932
oh-0.875
 ourselves-0.858
 so-0.849
Token animals
Token ID4695
Feature activation-3.16e-01

Pos logit contributions
les+0.877
ork+0.874
lights+0.858
pron+0.810
adoes+0.745
Neg logit contributions
 least-0.808
 very-0.642
 ourselves-0.622
oh-0.610
 cle-0.590
Token on
Token ID319
Feature activation-3.27e+00

Pos logit contributions
ork+0.545
les+0.541
lights+0.534
pron+0.503
adoes+0.469
Neg logit contributions
 least-0.500
 very-0.392
oh-0.384
 ourselves-0.377
 am-0.365
Token it
Token ID340
Feature activation-4.67e-01

Pos logit contributions
les+6.623
lights+6.412
pron+6.079
ravings+6.076
ork+6.021
Neg logit contributions
 very-4.157
 least-4.064
 so-3.414
 am-3.370
 ourselves-3.349
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
les+0.801
ork+0.796
lights+0.781
pron+0.740
adoes+0.691
Neg logit contributions
 least-0.738
 very-0.584
oh-0.562
 ourselves-0.554
 am-0.536
Token Every
Token ID3887
Feature activation-3.14e-01

Pos logit contributions
les+2.180
ork+2.120
lights+2.103
pron+1.958
adoes+1.861
Neg logit contributions
 least-1.939
 very-1.566
oh-1.501
 am-1.469
 ourselves-1.453
Token morning
Token ID3329
Feature activation-2.36e-01

Pos logit contributions
les+0.543
ork+0.535
lights+0.532
pron+0.492
adoes+0.472
Neg logit contributions
 least-0.496
 very-0.387
oh-0.383
 ourselves-0.363
 am-0.361
Token,
Token ID11
Feature activation-7.64e-01

Pos logit contributions
les+0.472
ork+0.467
lights+0.462
pron+0.437
adoes+0.416
Neg logit contributions
 least-0.305
oh-0.226
 very-0.224
 ourselves-0.213
 am-0.207
Token and
Token ID290
Feature activation+8.96e-01

Pos logit contributions
les+0.003
ork+0.003
lights+0.003
pron+0.003
adoes+0.002
Neg logit contributions
 least-0.003
oh-0.002
 very-0.002
 ourselves-0.002
 cle-0.002
Token had
Token ID550
Feature activation+1.45e+00

Pos logit contributions
 least+1.490
oh+1.169
 ourselves+1.140
 cle+1.107
 am+1.074
Neg logit contributions
les-1.471
ork-1.417
lights-1.408
pron-1.329
lf-1.251
Token fun
Token ID1257
Feature activation-1.25e+00

Pos logit contributions
 least+1.853
oh+1.369
 ourselves+1.262
 am+1.193
 oh+1.182
Neg logit contributions
les-2.922
ork-2.903
lights-2.794
pron-2.708
lf-2.524
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
ork+2.448
les+2.446
lights+2.344
pron+2.332
adoes+2.108
Neg logit contributions
 least-1.696
 very-1.395
 ourselves-1.273
 so-1.246
oh-1.201
Token The
Token ID383
Feature activation-1.14e+00

Pos logit contributions
les+0.025
ork+0.024
lights+0.024
pron+0.023
adoes+0.021
Neg logit contributions
 least-0.022
oh-0.017
 very-0.017
 ourselves-0.017
 cle-0.016
Token anxious
Token ID18116
Feature activation-3.27e+00

Pos logit contributions
les+2.241
ork+2.240
lights+2.171
pron+2.031
adoes+1.948
Neg logit contributions
 least-1.642
 very-1.280
 cle-1.168
oh-1.141
 am-1.118
Token feeling
Token ID4203
Feature activation-4.49e-01

Pos logit contributions
pron+6.746
ork+6.504
lights+6.418
les+6.079
anne+5.881
Neg logit contributions
 least-4.323
 very-3.831
 so-3.259
 am-3.184
'm-3.144
Token was
Token ID373
Feature activation-4.27e-01

Pos logit contributions
ork+0.794
les+0.790
lights+0.778
pron+0.745
adoes+0.691
Neg logit contributions
 least-0.690
 very-0.540
oh-0.529
 ourselves-0.505
 am-0.497
Token gone
Token ID3750
Feature activation-2.12e-03

Pos logit contributions
les+0.944
ork+0.938
lights+0.913
pron+0.866
adoes+0.828
Neg logit contributions
 least-0.496
 very-0.352
oh-0.327
 ourselves-0.321
 cle-0.309
Token.
Token ID13
Feature activation+2.25e-01

Pos logit contributions
les+0.004
ork+0.004
lights+0.003
pron+0.003
adoes+0.003
Neg logit contributions
 least-0.003
oh-0.003
 very-0.003
 ourselves-0.003
 cle-0.003
Token The
Token ID383
Feature activation-1.01e+00

Pos logit contributions
 least+0.351
oh+0.268
 very+0.262
 ourselves+0.261
 cle+0.254
Neg logit contributions
ork-0.393
les-0.393
lights-0.383
pron-0.361
adoes-0.338
Token borrow
Token ID8804
Feature activation-1.40e+00

Pos logit contributions
 least+0.042
oh+0.025
 very+0.025
 ourselves+0.024
 cle+0.022
Neg logit contributions
les-0.108
ork-0.107
lights-0.106
pron-0.101
adoes-0.097
Token the
Token ID262
Feature activation-7.38e-01

Pos logit contributions
lights+3.168
ork+3.103
les+3.083
pron+3.042
lf+2.826
Neg logit contributions
 least-1.546
 very-1.151
 ourselves-1.099
 oh-1.007
 so-1.003
Token cane
Token ID33009
Feature activation+3.29e-01

Pos logit contributions
lights+0.885
ork+0.872
les+0.871
pron+0.750
lf+0.682
Neg logit contributions
 least-1.562
 very-1.330
 cle-1.280
 ourselves-1.273
oh-1.264
Token.
Token ID13
Feature activation-3.37e-01

Pos logit contributions
 least+0.542
oh+0.428
 ourselves+0.413
 very+0.410
 cle+0.402
Neg logit contributions
les-0.539
ork-0.538
lights-0.521
pron-0.492
adoes-0.458
Token The
Token ID383
Feature activation-1.26e+00

Pos logit contributions
les+0.629
ork+0.616
lights+0.612
pron+0.575
adoes+0.538
Neg logit contributions
 least-0.486
oh-0.372
 very-0.370
 ourselves-0.354
 am-0.345
Token poor
Token ID3595
Feature activation-3.26e+00

Pos logit contributions
lights+2.196
ork+2.172
les+2.142
pron+1.839
adoes+1.824
Neg logit contributions
 least-2.065
 very-1.758
 cle-1.644
 oh-1.582
 am-1.533
Token man
Token ID582
Feature activation-2.83e-01

Pos logit contributions
lights+7.524
ork+7.392
les+6.692
anne+6.540
ilt+6.443
Neg logit contributions
 very-3.604
 least-3.390
 oh-3.107
 so-2.979
 say-2.876
Token was
Token ID373
Feature activation+3.31e-01

Pos logit contributions
ork+0.572
les+0.572
lights+0.568
pron+0.534
adoes+0.508
Neg logit contributions
 least-0.361
oh-0.261
 very-0.260
 cle-0.241
 ourselves-0.240
Token kind
Token ID1611
Feature activation-1.00e+00

Pos logit contributions
 least+0.408
oh+0.290
 ourselves+0.279
 very+0.266
 am+0.259
Neg logit contributions
les-0.679
ork-0.679
lights-0.663
pron-0.641
adoes-0.601
Token and
Token ID290
Feature activation+1.67e-01

Pos logit contributions
ork+1.839
lights+1.816
les+1.810
pron+1.756
adoes+1.599
Neg logit contributions
 least-1.490
 very-1.209
 so-1.133
oh-1.125
 cle-1.061
Token said
Token ID531
Feature activation+1.86e-01

Pos logit contributions
 least+0.260
oh+0.200
 ourselves+0.195
 very+0.194
 cle+0.186
Neg logit contributions
les-0.292
ork-0.290
lights-0.283
pron-0.269
adoes-0.250
Token play
Token ID711
Feature activation-1.22e+00

Pos logit contributions
les+0.028
ork+0.028
lights+0.027
pron+0.026
adoes+0.024
Neg logit contributions
 least-0.026
oh-0.020
 very-0.020
 ourselves-0.019
 cle-0.019
Token with
Token ID351
Feature activation-1.17e+00

Pos logit contributions
lights+2.296
les+2.241
ork+2.230
pron+2.041
anne+1.956
Neg logit contributions
 least-1.723
 very-1.430
oh-1.323
 oh-1.319
 ourselves-1.289
Token her
Token ID607
Feature activation-1.01e-01

Pos logit contributions
les+2.635
ork+2.635
lights+2.610
pron+2.471
anne+2.305
Neg logit contributions
 least-1.156
 very-0.891
 ourselves-0.785
oh-0.742
 oh-0.738
Token dolls
Token ID36062
Feature activation-6.18e-01

Pos logit contributions
les+0.161
ork+0.159
lights+0.155
pron+0.145
adoes+0.134
Neg logit contributions
 least-0.176
 very-0.138
oh-0.138
 ourselves-0.136
 cle-0.132
Token and
Token ID290
Feature activation+1.12e-01

Pos logit contributions
ork+1.108
les+1.102
lights+1.094
pron+1.014
adoes+0.969
Neg logit contributions
 least-0.966
 very-0.764
oh-0.712
 ourselves-0.693
 so-0.685
Token wear
Token ID5806
Feature activation-3.26e+00

Pos logit contributions
 least+0.176
oh+0.136
 very+0.133
 ourselves+0.132
 cle+0.128
Neg logit contributions
les-0.194
ork-0.192
lights-0.188
pron-0.178
adoes-0.166
Token pretty
Token ID2495
Feature activation-2.58e+00

Pos logit contributions
ork+7.351
lights+7.128
emaker+7.067
les+6.840
ravings+6.730
Neg logit contributions
 very-3.660
 so-3.089
 least-2.951
 are-2.455
 say-2.402
Token dresses
Token ID27309
Feature activation-1.77e+00

Pos logit contributions
ork+4.290
pron+3.994
lights+3.966
les+3.809
emaker+3.777
Neg logit contributions
 least-3.768
 very-3.590
 so-3.225
 ourselves-3.050
 am-3.039
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
ork+3.449
les+3.368
lights+3.330
pron+3.207
adoes+3.162
Neg logit contributions
 least-2.367
 very-2.055
 so-1.851
 ourselves-1.728
oh-1.691
Token One
Token ID1881
Feature activation-2.33e-02

Pos logit contributions
les+1.632
ork+1.601
lights+1.595
pron+1.447
adoes+1.371
Neg logit contributions
 least-2.247
 very-1.900
oh-1.823
 am-1.800
 ourselves-1.760
Token day
Token ID1110
Feature activation+2.62e-01

Pos logit contributions
les+0.039
ork+0.039
lights+0.038
pron+0.036
adoes+0.034
Neg logit contributions
 least-0.038
oh-0.029
 very-0.029
 ourselves-0.028
 cle-0.027
Token"
Token ID1
Feature activation+1.37e+00

Pos logit contributions
 least+0.437
oh+0.342
 very+0.339
 ourselves+0.334
 cle+0.329
Neg logit contributions
ork-0.386
les-0.385
lights-0.372
pron-0.352
adoes-0.325
TokenNo
Token ID2949
Feature activation+9.03e-01

Pos logit contributions
 least+2.027
 very+1.405
 ourselves+1.396
oh+1.393
 cle+1.342
Neg logit contributions
ork-2.553
les-2.521
pron-2.462
lights-2.440
lf-2.197
Token,
Token ID11
Feature activation-1.90e-03

Pos logit contributions
 least+1.395
oh+1.041
 ourselves+1.013
 cle+1.000
 very+0.999
Neg logit contributions
ork-1.623
les-1.623
lights-1.551
pron-1.475
lf-1.398
Token it
Token ID340
Feature activation+3.51e-01

Pos logit contributions
les+0.003
ork+0.003
lights+0.003
pron+0.003
adoes+0.003
Neg logit contributions
 least-0.003
oh-0.002
 very-0.002
 ourselves-0.002
 cle-0.002
Token's
Token ID338
Feature activation-4.02e-01

Pos logit contributions
 least+0.466
oh+0.329
 ourselves+0.328
 very+0.324
 cle+0.312
Neg logit contributions
les-0.696
ork-0.692
lights-0.669
pron-0.640
adoes-0.606
Token mine
Token ID6164
Feature activation+1.81e+00

Pos logit contributions
les+0.728
ork+0.710
lights+0.706
pron+0.661
adoes+0.612
Neg logit contributions
 least-0.621
 very-0.489
oh-0.476
 ourselves-0.470
 cle-0.447
Token!"
Token ID2474
Feature activation+8.05e-01

Pos logit contributions
 least+3.082
 cle+2.365
oh+2.227
'm+2.145
 Wait+2.133
Neg logit contributions
ork-2.979
les-2.954
lights-2.930
adoes-2.559
pron-2.547
Token Tom
Token ID4186
Feature activation+1.34e+00

Pos logit contributions
 least+1.327
oh+1.033
 cle+0.986
 ourselves+0.984
 very+0.974
Neg logit contributions
ork-1.351
les-1.328
lights-1.302
pron-1.214
adoes-1.148
Token said
Token ID531
Feature activation+7.86e-03

Pos logit contributions
 least+1.851
 ourselves+1.345
oh+1.240
 am+1.226
 cle+1.221
Neg logit contributions
les-2.688
ork-2.590
lights-2.475
pron-2.370
lf-2.310
Token.
Token ID13
Feature activation-5.07e-01

Pos logit contributions
 least+0.013
oh+0.010
 very+0.010
 ourselves+0.010
 cle+0.010
Neg logit contributions
les-0.013
ork-0.013
lights-0.013
pron-0.012
adoes-0.011
Token "
Token ID366
Feature activation+8.52e-01

Pos logit contributions
les+0.936
ork+0.910
lights+0.894
pron+0.857
adoes+0.795
Neg logit contributions
 least-0.737
 very-0.586
oh-0.569
 ourselves-0.546
 am-0.539
TokenOh
Token ID5812
Feature activation+1.05e+00

Pos logit contributions
 least+0.146
oh+0.103
 very+0.101
 ourselves+0.100
 cle+0.096
Neg logit contributions
les-0.236
ork-0.235
lights-0.230
pron-0.220
adoes-0.207
Token no
Token ID645
Feature activation+1.13e+00

Pos logit contributions
 least+1.314
 ourselves+0.911
oh+0.910
 cle+0.880
 very+0.851
Neg logit contributions
ork-2.154
les-2.095
lights-2.082
pron-1.995
adoes-1.897
Token!"
Token ID2474
Feature activation-1.54e-01

Pos logit contributions
 least+1.771
 cle+1.313
 ourselves+1.295
oh+1.287
 very+1.261
Neg logit contributions
les-2.021
ork-1.993
lights-1.899
pron-1.808
lf-1.727
Token Lily
Token ID20037
Feature activation+1.62e+00

Pos logit contributions
les+0.246
ork+0.245
lights+0.239
pron+0.226
adoes+0.209
Neg logit contributions
 least-0.261
 very-0.205
oh-0.204
 ourselves-0.201
 cle-0.194
Token and
Token ID290
Feature activation+4.20e-01

Pos logit contributions
 least+2.475
 ourselves+1.845
oh+1.812
 cle+1.790
 oh+1.728
Neg logit contributions
les-3.026
ork-2.823
lights-2.725
adoes-2.533
pron-2.462
Token Ben
Token ID3932
Feature activation+2.10e+00

Pos logit contributions
 least+0.624
oh+0.475
 very+0.464
 ourselves+0.459
 cle+0.447
Neg logit contributions
les-0.763
ork-0.750
lights-0.731
pron-0.698
adoes-0.654
Token say
Token ID910
Feature activation-1.57e-01

Pos logit contributions
 least+2.717
oh+1.870
 ourselves+1.794
 cle+1.776
 oh+1.752
Neg logit contributions
les-4.426
ork-4.169
lights-3.967
adoes-3.740
le-3.703
Token.
Token ID13
Feature activation+7.46e-01

Pos logit contributions
les+0.264
ork+0.261
lights+0.256
pron+0.243
adoes+0.224
Neg logit contributions
 least-0.254
 very-0.199
oh-0.199
 ourselves-0.195
 cle-0.187
Token
Token ID198
Feature activation+4.33e-01

Pos logit contributions
 least+1.255
oh+0.941
 ourselves+0.920
 cle+0.912
 very+0.893
Neg logit contributions
ork-1.266
les-1.236
lights-1.208
pron-1.126
adoes-1.067
Token
Token ID198
Feature activation+1.77e-01

Pos logit contributions
 least+0.683
oh+0.514
 very+0.500
 cle+0.497
 ourselves+0.496
Neg logit contributions
ork-0.759
les-0.755
lights-0.733
pron-0.688
adoes-0.641
TokenThey
Token ID2990
Feature activation+6.12e-01

Pos logit contributions
 least+0.288
oh+0.222
 very+0.219
 ourselves+0.216
 cle+0.212
Neg logit contributions
les-0.300
ork-0.298
lights-0.290
pron-0.274
adoes-0.255
Token a
Token ID257
Feature activation+3.80e-03

Pos logit contributions
les+0.550
ork+0.546
lights+0.536
pron+0.509
adoes+0.476
Neg logit contributions
 least-0.389
 very-0.288
oh-0.281
 ourselves-0.275
 cle-0.267
Token huge
Token ID3236
Feature activation-6.08e-01

Pos logit contributions
 least+0.006
oh+0.005
 very+0.005
 ourselves+0.005
 cle+0.005
Neg logit contributions
les-0.006
ork-0.006
lights-0.006
pron-0.006
adoes-0.005
Token lion
Token ID18744
Feature activation-4.43e-01

Pos logit contributions
les+0.938
ork+0.935
lights+0.928
pron+0.845
adoes+0.789
Neg logit contributions
 least-1.062
 very-0.864
oh-0.844
 cle-0.812
 am-0.805
Token appeared
Token ID4120
Feature activation+5.26e-01

Pos logit contributions
ork+0.748
lights+0.744
les+0.741
pron+0.704
adoes+0.648
Neg logit contributions
 least-0.704
oh-0.560
 very-0.550
 ourselves-0.520
 am-0.517
Token with
Token ID351
Feature activation+1.90e-01

Pos logit contributions
 least+0.879
oh+0.685
 ourselves+0.667
 very+0.655
 cle+0.651
Neg logit contributions
les-0.868
ork-0.857
lights-0.830
pron-0.770
adoes-0.733
Token an
Token ID281
Feature activation+1.58e+00

Pos logit contributions
 least+0.223
oh+0.154
 ourselves+0.149
 very+0.148
 cle+0.140
Neg logit contributions
les-0.402
ork-0.400
lights-0.392
pron-0.375
adoes-0.355
Token angry
Token ID7954
Feature activation-6.30e-01

Pos logit contributions
 least+2.928
oh+2.420
 ourselves+2.407
 am+2.275
'm+2.218
Neg logit contributions
les-2.374
ork-2.286
lights-2.130
pron-2.031
lf-1.992
Token roar
Token ID34532
Feature activation-7.41e-01

Pos logit contributions
lights+1.004
les+1.001
ork+0.989
pron+0.927
adoes+0.841
Neg logit contributions
 least-1.060
 very-0.861
oh-0.848
 cle-0.806
 ourselves-0.804
Token.
Token ID13
Feature activation-4.80e-01

Pos logit contributions
ork+1.326
les+1.308
lights+1.303
pron+1.258
adoes+1.154
Neg logit contributions
 least-1.087
 very-0.865
oh-0.849
 ourselves-0.797
 am-0.782
Token It
Token ID632
Feature activation-4.20e-01

Pos logit contributions
les+0.913
ork+0.900
lights+0.899
pron+0.851
adoes+0.791
Neg logit contributions
 least-0.665
 very-0.506
oh-0.503
 ourselves-0.472
 cle-0.466
Token was
Token ID373
Feature activation+2.89e-01

Pos logit contributions
ork+0.791
les+0.784
lights+0.779
pron+0.740
adoes+0.687
Neg logit contributions
 least-0.594
 very-0.456
oh-0.452
 cle-0.420
 ourselves-0.416
Token
Token ID198
Feature activation+9.29e-01

Pos logit contributions
 least+0.714
oh+0.537
 very+0.530
 ourselves+0.523
 cle+0.516
Neg logit contributions
ork-0.813
les-0.806
lights-0.781
pron-0.740
adoes-0.692
Token
Token ID198
Feature activation+8.82e-01

Pos logit contributions
 least+1.605
oh+1.219
 ourselves+1.212
 cle+1.206
 very+1.183
Neg logit contributions
ork-1.512
les-1.484
lights-1.397
pron-1.333
lf-1.222
TokenS
Token ID50
Feature activation+2.84e-01

Pos logit contributions
 least+1.554
 ourselves+1.198
oh+1.187
 very+1.178
 cle+1.175
Neg logit contributions
ork-1.400
les-1.362
lights-1.282
pron-1.237
lf-1.114
Tokenandy
Token ID10757
Feature activation+5.64e-01

Pos logit contributions
 least+0.562
 very+0.456
oh+0.455
 ourselves+0.452
 cle+0.441
Neg logit contributions
les-0.379
ork-0.375
lights-0.358
pron-0.333
adoes-0.299
Token had
Token ID550
Feature activation+7.85e-01

Pos logit contributions
 least+0.822
 ourselves+0.620
oh+0.610
 very+0.592
 cle+0.581
Neg logit contributions
les-1.045
ork-1.025
lights-0.994
pron-0.948
lf-0.886
Token an
Token ID281
Feature activation+1.73e+00

Pos logit contributions
 least+0.932
oh+0.660
 ourselves+0.650
 very+0.594
 cle+0.591
Neg logit contributions
ork-1.639
les-1.638
lights-1.564
pron-1.531
adoes-1.419
Token amazing
Token ID4998
Feature activation-3.66e-01

Pos logit contributions
 least+2.809
oh+2.503
 ourselves+2.394
 am+2.236
'm+2.179
Neg logit contributions
les-2.722
ork-2.629
lights-2.558
pron-2.467
lf-2.240
Token time
Token ID640
Feature activation+1.75e-01

Pos logit contributions
les+0.495
ork+0.485
lights+0.478
pron+0.434
adoes+0.405
Neg logit contributions
 least-0.716
 very-0.587
oh-0.580
 ourselves-0.563
 am-0.561
Token at
Token ID379
Feature activation+1.32e+00

Pos logit contributions
 least+0.274
oh+0.211
 very+0.206
 ourselves+0.206
 cle+0.199
Neg logit contributions
les-0.302
ork-0.300
lights-0.293
pron-0.277
adoes-0.257
Token the
Token ID262
Feature activation+5.81e-01

Pos logit contributions
 least+1.825
oh+1.523
 ourselves+1.413
 cle+1.374
 oh+1.368
Neg logit contributions
ork-2.356
les-2.352
lights-2.295
pron-2.173
lf-2.096
Token park
Token ID3952
Feature activation+3.15e-01

Pos logit contributions
 least+0.955
oh+0.753
 ourselves+0.750
 very+0.721
 cle+0.714
Neg logit contributions
ork-0.947
les-0.943
lights-0.906
pron-0.863
lf-0.790
Token,
Token ID11
Feature activation-5.41e-01

Pos logit contributions
 least+0.107
oh+0.078
 very+0.077
 ourselves+0.076
 cle+0.073
Neg logit contributions
les-0.151
ork-0.151
lights-0.148
pron-0.140
adoes-0.132
Token Lily
Token ID20037
Feature activation-1.87e-01

Pos logit contributions
les+0.915
ork+0.906
lights+0.891
pron+0.822
adoes+0.778
Neg logit contributions
 least-0.891
 very-0.705
oh-0.684
 ourselves-0.673
 oh-0.663
Token found
Token ID1043
Feature activation+9.50e-01

Pos logit contributions
ork+0.332
les+0.331
lights+0.327
pron+0.309
adoes+0.289
Neg logit contributions
 least-0.283
oh-0.216
 very-0.213
 ourselves-0.205
 am-0.199
Token a
Token ID257
Feature activation+4.66e-01

Pos logit contributions
 least+1.109
oh+0.797
 ourselves+0.753
 am+0.712
 cle+0.708
Neg logit contributions
les-2.038
ork-2.012
lights-1.956
pron-1.866
adoes-1.780
Token big
Token ID1263
Feature activation+4.40e-01

Pos logit contributions
 least+0.789
oh+0.620
 ourselves+0.611
 very+0.590
 cle+0.580
Neg logit contributions
ork-0.750
les-0.742
lights-0.718
pron-0.697
lf-0.624
Token toy
Token ID13373
Feature activation+1.99e+00

Pos logit contributions
 least+0.781
oh+0.624
 ourselves+0.614
 very+0.603
 cle+0.592
Neg logit contributions
les-0.669
ork-0.663
lights-0.642
pron-0.611
lf-0.555
Token fish
Token ID5916
Feature activation+2.19e-01

Pos logit contributions
 least+3.666
oh+3.049
 ourselves+3.038
 cle+2.981
'm+2.907
Neg logit contributions
les-2.928
ork-2.896
lights-2.702
pron-2.493
lf-2.458
Token that
Token ID326
Feature activation+9.37e-02

Pos logit contributions
 least+0.330
oh+0.249
 ourselves+0.244
 very+0.244
 cle+0.236
Neg logit contributions
les-0.394
ork-0.392
lights-0.381
pron-0.362
adoes-0.337
Token was
Token ID373
Feature activation-3.13e-02

Pos logit contributions
 least+0.139
oh+0.105
 very+0.103
 ourselves+0.102
 cle+0.098
Neg logit contributions
les-0.170
ork-0.169
lights-0.165
pron-0.157
adoes-0.147
Token red
Token ID2266
Feature activation+4.86e-01

Pos logit contributions
les+0.062
ork+0.061
lights+0.060
pron+0.057
adoes+0.054
Neg logit contributions
 least-0.042
oh-0.030
 very-0.030
 ourselves-0.029
 cle-0.028
Token and
Token ID290
Feature activation+2.46e-01

Pos logit contributions
 least+0.847
oh+0.674
 ourselves+0.658
 very+0.650
 cle+0.644
Neg logit contributions
les-0.766
ork-0.748
lights-0.731
pron-0.678
adoes-0.634
Token a
Token ID257
Feature activation+5.23e-01

Pos logit contributions
 least+0.118
oh+0.090
 very+0.088
 ourselves+0.087
 cle+0.084
Neg logit contributions
les-0.139
ork-0.138
lights-0.135
pron-0.128
adoes-0.119
Token special
Token ID2041
Feature activation+5.01e-01

Pos logit contributions
 least+0.912
oh+0.729
 ourselves+0.712
 very+0.694
 cle+0.682
Neg logit contributions
ork-0.809
les-0.805
lights-0.771
pron-0.737
adoes-0.668
Token den
Token ID2853
Feature activation+2.42e-01

Pos logit contributions
 least+0.937
oh+0.765
 ourselves+0.740
 very+0.731
 cle+0.717
Neg logit contributions
ork-0.709
les-0.709
lights-0.682
pron-0.645
adoes-0.583
Token in
Token ID287
Feature activation+3.17e-01

Pos logit contributions
 least+0.382
oh+0.296
 very+0.287
 ourselves+0.287
 cle+0.278
Neg logit contributions
les-0.413
ork-0.411
lights-0.402
pron-0.379
adoes-0.352
Token the
Token ID262
Feature activation+9.06e-01

Pos logit contributions
 least+0.422
oh+0.313
 very+0.299
 ourselves+0.296
 cle+0.287
Neg logit contributions
les-0.619
ork-0.617
lights-0.607
pron-0.572
adoes-0.542
Token mountain
Token ID8598
Feature activation+3.50e-01

Pos logit contributions
 least+1.640
oh+1.353
 ourselves+1.343
 am+1.274
 very+1.270
Neg logit contributions
les-1.328
ork-1.308
lights-1.257
pron-1.192
lf-1.064
Token.
Token ID13
Feature activation+1.72e-01

Pos logit contributions
 least+0.578
oh+0.453
 very+0.441
 ourselves+0.441
 cle+0.430
Neg logit contributions
les-0.570
ork-0.566
lights-0.554
pron-0.519
adoes-0.480
Token
Token ID198
Feature activation+2.64e-01

Pos logit contributions
 least+0.270
oh+0.207
 very+0.204
 ourselves+0.202
 cle+0.195
Neg logit contributions
les-0.297
ork-0.295
lights-0.288
pron-0.272
adoes-0.254
Token
Token ID198
Feature activation+4.71e-01

Pos logit contributions
 least+0.404
oh+0.304
 very+0.299
 ourselves+0.299
 cle+0.289
Neg logit contributions
ork-0.472
les-0.471
lights-0.456
pron-0.434
adoes-0.401
TokenThe
Token ID464
Feature activation+3.89e-01

Pos logit contributions
 least+0.700
oh+0.513
 very+0.510
 ourselves+0.509
 cle+0.489
Neg logit contributions
ork-0.865
les-0.862
lights-0.829
pron-0.802
lf-0.738
Token copper
Token ID15317
Feature activation-9.62e-01

Pos logit contributions
 least+0.664
oh+0.525
 ourselves+0.516
 very+0.508
 am+0.490
Neg logit contributions
les-0.629
ork-0.617
lights-0.596
pron-0.568
adoes-0.521
Token each
Token ID1123
Feature activation+5.99e-01

Pos logit contributions
les+0.074
ork+0.074
lights+0.072
pron+0.068
adoes+0.063
Neg logit contributions
 least-0.071
oh-0.055
 very-0.054
 ourselves-0.054
 cle-0.052
Token other
Token ID584
Feature activation+9.34e-04

Pos logit contributions
 least+1.044
oh+0.826
 ourselves+0.809
 very+0.788
 cle+0.785
Neg logit contributions
les-0.953
ork-0.927
lights-0.900
pron-0.864
lf-0.778
Token.
Token ID13
Feature activation-8.86e-01

Pos logit contributions
 least+0.001
oh+0.001
 very+0.001
 ourselves+0.001
 cle+0.001
Neg logit contributions
les-0.002
ork-0.002
lights-0.002
pron-0.001
adoes-0.001
Token He
Token ID679
Feature activation-2.70e-01

Pos logit contributions
les+1.607
ork+1.588
lights+1.510
pron+1.449
adoes+1.339
Neg logit contributions
 least-1.345
 very-1.062
oh-1.026
 ourselves-1.015
 so-0.985
Token was
Token ID373
Feature activation+8.16e-01

Pos logit contributions
ork+0.498
les+0.497
lights+0.489
pron+0.458
adoes+0.436
Neg logit contributions
 least-0.391
oh-0.295
 very-0.295
 ourselves-0.279
 cle-0.274
Token so
Token ID523
Feature activation+6.79e-01

Pos logit contributions
 least+0.673
oh+0.395
 ourselves+0.386
 am+0.333
 cle+0.301
Neg logit contributions
les-1.985
ork-1.957
lights-1.945
pron-1.917
lf-1.797
Token happy
Token ID3772
Feature activation+1.02e-01

Pos logit contributions
 least+0.976
oh+0.710
 ourselves+0.699
 cle+0.655
 am+0.648
Neg logit contributions
les-1.282
ork-1.262
lights-1.246
pron-1.205
lf-1.105
Token!
Token ID0
Feature activation-5.22e-01

Pos logit contributions
 least+0.165
oh+0.127
 very+0.125
 ourselves+0.124
 cle+0.120
Neg logit contributions
les-0.174
ork-0.173
lights-0.169
pron-0.160
adoes-0.148
Token
Token ID198
Feature activation-9.33e-01

Pos logit contributions
les+0.974
ork+0.961
lights+0.931
pron+0.891
adoes+0.831
Neg logit contributions
 least-0.763
 very-0.579
oh-0.575
 ourselves-0.560
 am-0.540
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
les+1.910
ork+1.861
lights+1.860
pron+1.767
adoes+1.719
Neg logit contributions
 least-1.154
 very-0.899
oh-0.873
 am-0.822
 ourselves-0.816
TokenNext
Token ID10019
Feature activation+1.04e+00

Pos logit contributions
les+1.802
ork+1.780
lights+1.749
pron+1.598
adoes+1.532
Neg logit contributions
 least-1.652
 very-1.371
oh-1.360
 oh-1.275
 ourselves-1.272
Token scared
Token ID12008
Feature activation-3.68e-02

Pos logit contributions
 least+0.449
oh+0.319
 ourselves+0.305
 very+0.292
 cle+0.286
Neg logit contributions
les-0.763
ork-0.760
lights-0.749
pron-0.716
adoes-0.676
Token and
Token ID290
Feature activation+4.49e-01

Pos logit contributions
les+0.059
ork+0.059
lights+0.057
pron+0.054
adoes+0.050
Neg logit contributions
 least-0.063
oh-0.049
 very-0.049
 ourselves-0.048
 cle-0.047
Token wants
Token ID3382
Feature activation-3.39e-01

Pos logit contributions
 least+0.595
oh+0.436
 ourselves+0.418
 very+0.407
 cle+0.397
Neg logit contributions
les-0.891
ork-0.875
lights-0.864
pron-0.815
adoes-0.777
Token to
Token ID284
Feature activation-3.61e-01

Pos logit contributions
les+0.632
ork+0.621
lights+0.612
pron+0.586
adoes+0.542
Neg logit contributions
 least-0.494
 very-0.376
oh-0.366
 ourselves-0.362
 cle-0.348
Token run
Token ID1057
Feature activation+4.88e-02

Pos logit contributions
les+0.700
ork+0.689
lights+0.687
pron+0.650
adoes+0.607
Neg logit contributions
 least-0.504
oh-0.369
 very-0.368
 ourselves-0.361
 am-0.351
Token away
Token ID1497
Feature activation+3.42e-01

Pos logit contributions
 least+0.078
oh+0.060
 very+0.060
 ourselves+0.059
 cle+0.057
Neg logit contributions
les-0.083
ork-0.083
lights-0.081
pron-0.077
adoes-0.071
Token,
Token ID11
Feature activation+8.11e-01

Pos logit contributions
 least+0.572
oh+0.448
 ourselves+0.432
 very+0.431
 cle+0.424
Neg logit contributions
ork-0.558
les-0.558
lights-0.545
pron-0.505
adoes-0.476
Token but
Token ID475
Feature activation+7.30e-01

Pos logit contributions
 least+1.182
oh+0.917
 ourselves+0.863
 cle+0.824
 am+0.823
Neg logit contributions
ork-1.460
les-1.453
lights-1.441
pron-1.353
adoes-1.286
Token he
Token ID339
Feature activation-2.23e-01

Pos logit contributions
 least+1.321
oh+1.056
 ourselves+1.026
 cle+0.999
 very+0.990
Neg logit contributions
ork-1.072
les-1.068
lights-1.060
pron-0.985
lf-0.925
Token is
Token ID318
Feature activation-1.01e+00

Pos logit contributions
ork+0.480
les+0.479
lights+0.473
pron+0.453
adoes+0.427
Neg logit contributions
 least-0.258
oh-0.175
 very-0.173
 ourselves-0.165
 am-0.160
Token too
Token ID1165
Feature activation-1.26e+00

Pos logit contributions
ork+2.289
les+2.271
lights+2.209
pron+2.091
adoes+1.991
Neg logit contributions
 least-1.164
 very-0.908
 so-0.764
 cle-0.746
oh-0.724
Token.
Token ID13
Feature activation-3.25e-01

Pos logit contributions
les+0.445
ork+0.441
lights+0.431
pron+0.410
adoes+0.379
Neg logit contributions
 least-0.415
 very-0.329
oh-0.323
 ourselves-0.318
 cle-0.306
Token "
Token ID366
Feature activation-3.23e-02

Pos logit contributions
les+0.602
ork+0.589
lights+0.580
pron+0.551
adoes+0.512
Neg logit contributions
 least-0.472
 very-0.366
oh-0.358
 ourselves-0.347
 am-0.339
TokenThey
Token ID2990
Feature activation+4.32e-01

Pos logit contributions
les+0.073
ork+0.073
lights+0.071
pron+0.068
adoes+0.065
Neg logit contributions
 least-0.034
oh-0.022
 very-0.022
 ourselves-0.021
 cle-0.020
Token are
Token ID389
Feature activation-5.18e-02

Pos logit contributions
 least+0.342
oh+0.190
 very+0.176
 ourselves+0.176
 cle+0.164
Neg logit contributions
les-1.075
ork-1.066
lights-1.040
pron-1.007
adoes-0.959
Token expensive
Token ID5789
Feature activation+7.47e-01

Pos logit contributions
les+0.103
ork+0.102
lights+0.100
pron+0.095
adoes+0.089
Neg logit contributions
 least-0.069
 very-0.050
oh-0.050
 ourselves-0.049
 cle-0.047
Token,
Token ID11
Feature activation+1.29e-01

Pos logit contributions
 least+1.293
oh+1.053
 ourselves+0.991
 cle+0.990
 very+0.978
Neg logit contributions
les-1.176
lights-1.142
ork-1.139
pron-1.034
adoes-0.990
Token and
Token ID290
Feature activation+8.32e-01

Pos logit contributions
 least+0.187
oh+0.141
 very+0.138
 ourselves+0.137
 cle+0.132
Neg logit contributions
les-0.236
ork-0.235
lights-0.230
pron-0.218
adoes-0.204
Token you
Token ID345
Feature activation-3.98e-01

Pos logit contributions
 least+1.205
oh+0.917
 ourselves+0.851
 cle+0.833
 very+0.830
Neg logit contributions
ork-1.523
les-1.514
lights-1.510
pron-1.415
adoes-1.343
Token should
Token ID815
Feature activation-1.28e+00

Pos logit contributions
les+0.791
ork+0.784
lights+0.781
pron+0.735
adoes+0.693
Neg logit contributions
 least-0.537
 very-0.390
oh-0.386
 ourselves-0.370
 am-0.366
Token take
Token ID1011
Feature activation-2.88e-01

Pos logit contributions
les+2.719
lights+2.664
ork+2.644
pron+2.444
adoes+2.340
Neg logit contributions
 least-1.690
 ourselves-1.250
 very-1.226
 am-1.165
oh-1.163
Token care
Token ID1337
Feature activation+5.55e-01

Pos logit contributions
les+0.450
ork+0.448
lights+0.440
pron+0.418
lf+0.379
Neg logit contributions
 least-0.504
 very-0.399
oh-0.395
 ourselves-0.392
 cle-0.375
Token boy
Token ID2933
Feature activation+2.32e-01

Pos logit contributions
 least+0.346
oh+0.261
 ourselves+0.256
 very+0.254
 cle+0.246
Neg logit contributions
les-0.414
ork-0.410
lights-0.401
pron-0.382
adoes-0.355
Token named
Token ID3706
Feature activation-4.75e-01

Pos logit contributions
 least+0.377
oh+0.292
 ourselves+0.287
 very+0.285
 cle+0.279
Neg logit contributions
les-0.389
ork-0.387
lights-0.376
pron-0.356
adoes-0.329
Token John
Token ID1757
Feature activation+3.59e-01

Pos logit contributions
les+0.883
ork+0.878
lights+0.861
pron+0.811
adoes+0.766
Neg logit contributions
 least-0.714
oh-0.546
 very-0.544
 ourselves-0.523
 am-0.491
Token.
Token ID13
Feature activation-5.95e-01

Pos logit contributions
 least+0.616
oh+0.484
 ourselves+0.478
 very+0.468
 cle+0.466
Neg logit contributions
les-0.574
ork-0.563
lights-0.546
pron-0.518
adoes-0.474
Token He
Token ID679
Feature activation+9.20e-01

Pos logit contributions
les+1.031
ork+1.026
lights+0.972
pron+0.936
adoes+0.858
Neg logit contributions
 least-0.943
 very-0.748
oh-0.744
 ourselves-0.726
 am-0.699
Token was
Token ID373
Feature activation+1.25e+00

Pos logit contributions
 least+1.527
 ourselves+1.197
oh+1.143
 cle+1.109
 oh+1.054
Neg logit contributions
ork-1.564
les-1.547
pron-1.459
lights-1.445
lf-1.364
Token only
Token ID691
Feature activation+1.07e+00

Pos logit contributions
 least+1.260
oh+0.868
 ourselves+0.860
 am+0.724
 cle+0.660
Neg logit contributions
ork-2.745
les-2.742
pron-2.687
lights-2.680
lf-2.494
Token three
Token ID1115
Feature activation-1.79e-01

Pos logit contributions
 least+1.416
oh+1.057
 ourselves+1.027
 am+0.931
 very+0.878
Neg logit contributions
les-2.121
ork-2.077
lights-2.013
pron-1.979
adoes-1.858
Token years
Token ID812
Feature activation-1.01e+00

Pos logit contributions
ork+0.313
les+0.312
lights+0.304
pron+0.291
adoes+0.269
Neg logit contributions
 least-0.273
 very-0.211
oh-0.209
 ourselves-0.204
 cle-0.199
Token old
Token ID1468
Feature activation+1.53e+00

Pos logit contributions
les+1.844
ork+1.837
lights+1.769
pron+1.723
adoes+1.633
Neg logit contributions
 least-1.442
 very-1.177
oh-1.066
 ourselves-1.047
 so-1.034
Token and
Token ID290
Feature activation+6.17e-01

Pos logit contributions
 least+2.921
oh+2.455
 ourselves+2.331
 cle+2.263
 am+2.214
Neg logit contributions
les-2.232
ork-2.072
lights-2.057
adoes-1.824
pron-1.795
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
les+0.667
ork+0.663
lights+0.646
pron+0.617
adoes+0.573
Neg logit contributions
 least-0.603
 very-0.481
oh-0.469
 ourselves-0.461
 cle-0.445
Token "
Token ID366
Feature activation-2.08e-01

Pos logit contributions
les+0.240
ork+0.238
lights+0.231
pron+0.220
adoes+0.204
Neg logit contributions
 least-0.226
 very-0.177
oh-0.176
 ourselves-0.171
 cle-0.167
TokenI
Token ID40
Feature activation-5.28e-01

Pos logit contributions
les+0.436
ork+0.433
lights+0.424
pron+0.402
adoes+0.383
Neg logit contributions
 least-0.254
oh-0.180
 very-0.177
 ourselves-0.173
 am-0.165
Token just
Token ID655
Feature activation+5.10e-01

Pos logit contributions
les+1.128
lights+1.123
ork+1.109
pron+1.056
adoes+1.009
Neg logit contributions
 least-0.632
oh-0.446
 very-0.446
 am-0.431
'm-0.414
Token wanted
Token ID2227
Feature activation+4.10e-01

Pos logit contributions
 least+0.828
oh+0.648
 ourselves+0.636
 very+0.618
 cle+0.614
Neg logit contributions
les-0.854
ork-0.832
lights-0.810
pron-0.769
adoes-0.720
Token to
Token ID284
Feature activation-2.11e-01

Pos logit contributions
 least+0.638
oh+0.495
 ourselves+0.474
 very+0.472
 cle+0.463
Neg logit contributions
les-0.715
ork-0.712
lights-0.691
pron-0.653
adoes-0.611
Token play
Token ID711
Feature activation+1.01e-01

Pos logit contributions
les+0.415
ork+0.409
lights+0.406
pron+0.383
adoes+0.360
Neg logit contributions
 least-0.289
oh-0.210
 very-0.208
 ourselves-0.206
 am-0.198
Token."
Token ID526
Feature activation-2.28e-01

Pos logit contributions
 least+0.175
oh+0.139
 very+0.136
 ourselves+0.135
 cle+0.132
Neg logit contributions
les-0.158
ork-0.157
lights-0.152
pron-0.143
adoes-0.132
Token
Token ID198
Feature activation+1.69e-01

Pos logit contributions
les+0.426
ork+0.420
lights+0.411
pron+0.393
adoes+0.368
Neg logit contributions
 least-0.327
 very-0.249
oh-0.247
 ourselves-0.241
 am-0.232
Token
Token ID198
Feature activation-1.64e-01

Pos logit contributions
 least+0.255
oh+0.193
 very+0.188
 ourselves+0.187
 cle+0.182
Neg logit contributions
les-0.304
ork-0.304
lights-0.296
pron-0.280
adoes-0.261
TokenMom
Token ID29252
Feature activation-5.16e-01

Pos logit contributions
les+0.281
ork+0.276
lights+0.271
pron+0.255
adoes+0.240
Neg logit contributions
 least-0.263
oh-0.206
 very-0.204
 ourselves-0.201
 am-0.193
Token toys
Token ID14958
Feature activation+1.66e-01

Pos logit contributions
 least+0.292
oh+0.234
 very+0.229
 ourselves+0.228
 cle+0.221
Neg logit contributions
les-0.248
ork-0.246
lights-0.240
pron-0.225
adoes-0.207
Token when
Token ID618
Feature activation-2.50e-01

Pos logit contributions
 least+0.266
oh+0.207
 very+0.202
 ourselves+0.201
 cle+0.196
Neg logit contributions
les-0.280
ork-0.277
lights-0.271
pron-0.256
adoes-0.238
Token she
Token ID673
Feature activation+3.58e-01

Pos logit contributions
les+0.481
ork+0.476
lights+0.465
pron+0.442
adoes+0.415
Neg logit contributions
 least-0.346
 very-0.255
oh-0.254
 ourselves-0.250
 cle-0.237
Token cleaned
Token ID20750
Feature activation-4.31e-01

Pos logit contributions
 least+0.546
oh+0.417
 ourselves+0.415
 very+0.409
 cle+0.396
Neg logit contributions
les-0.634
ork-0.626
lights-0.608
pron-0.576
adoes-0.534
Token up
Token ID510
Feature activation-4.87e-02

Pos logit contributions
ork+0.749
les+0.745
lights+0.728
pron+0.698
adoes+0.639
Neg logit contributions
 least-0.666
 very-0.517
oh-0.509
 ourselves-0.506
 cle-0.474
Token.
Token ID13
Feature activation-3.93e-01

Pos logit contributions
les+0.092
ork+0.091
lights+0.089
pron+0.085
adoes+0.079
Neg logit contributions
 least-0.069
oh-0.051
 very-0.051
 ourselves-0.050
 cle-0.048
Token<|endoftext|>
Token ID50256
Feature activation-4.71e-01

Pos logit contributions
les+0.752
ork+0.740
lights+0.730
pron+0.691
adoes+0.649
Neg logit contributions
 least-0.550
oh-0.411
 very-0.404
 ourselves-0.394
 am-0.377
TokenOnce
Token ID7454
Feature activation-1.45e+00

Pos logit contributions
les+0.936
lights+0.928
ork+0.925
pron+0.881
adoes+0.835
Neg logit contributions
 least-0.608
oh-0.451
 very-0.445
 am-0.424
 ourselves-0.422
Token upon
Token ID2402
Feature activation-1.01e-01

Pos logit contributions
les+2.463
ork+2.403
lights+2.387
pron+2.278
adoes+2.257
Neg logit contributions
 least-2.351
 very-1.949
oh-1.845
 am-1.762
 so-1.719
Token a
Token ID257
Feature activation-7.86e-01

Pos logit contributions
les+0.238
ork+0.236
lights+0.233
pron+0.222
adoes+0.212
Neg logit contributions
 least-0.096
 very-0.060
oh-0.059
 ourselves-0.057
 cle-0.053
Token time
Token ID640
Feature activation+5.11e-01

Pos logit contributions
les+0.674
lights+0.661
ork+0.641
pron+0.529
adoes+0.525
Neg logit contributions
 least-1.925
 very-1.679
oh-1.663
 am-1.590
 cle-1.589
Token running
Token ID2491
Feature activation+6.62e-01

Pos logit contributions
les+1.072
ork+1.065
lights+1.064
pron+0.999
adoes+0.943
Neg logit contributions
 least-0.690
 very-0.513
oh-0.480
 ourselves-0.469
 cle-0.459
Token,
Token ID11
Feature activation+3.73e-01

Pos logit contributions
 least+1.056
oh+0.816
 ourselves+0.800
 cle+0.769
 very+0.752
Neg logit contributions
les-1.144
ork-1.128
lights-1.095
pron-0.999
adoes-0.965
Token he
Token ID339
Feature activation-4.06e-01

Pos logit contributions
 least+0.642
oh+0.511
 ourselves+0.496
 very+0.490
 cle+0.482
Neg logit contributions
ork-0.582
les-0.581
lights-0.568
pron-0.529
adoes-0.495
Token saw
Token ID2497
Feature activation+2.38e-01

Pos logit contributions
ork+0.722
lights+0.720
les+0.720
pron+0.672
adoes+0.636
Neg logit contributions
 least-0.623
 very-0.478
oh-0.477
 ourselves-0.443
 cle-0.440
Token a
Token ID257
Feature activation-2.96e-02

Pos logit contributions
 least+0.289
oh+0.204
 ourselves+0.195
 very+0.194
 cle+0.185
Neg logit contributions
les-0.500
ork-0.497
lights-0.488
pron-0.464
adoes-0.441
Token big
Token ID1263
Feature activation-2.31e-01

Pos logit contributions
les+0.047
ork+0.047
lights+0.046
pron+0.043
adoes+0.040
Neg logit contributions
 least-0.050
oh-0.039
 very-0.039
 ourselves-0.038
 cle-0.037
Token mill
Token ID3939
Feature activation-7.27e-01

Pos logit contributions
les+0.375
ork+0.375
lights+0.367
pron+0.343
adoes+0.318
Neg logit contributions
 least-0.387
 very-0.303
oh-0.303
 ourselves-0.293
 cle-0.287
Token nearby
Token ID6716
Feature activation-2.19e-02

Pos logit contributions
ork+1.365
les+1.359
lights+1.333
pron+1.272
adoes+1.198
Neg logit contributions
 least-1.019
 very-0.796
oh-0.782
 ourselves-0.735
 so-0.709
Token.
Token ID13
Feature activation-1.28e-01

Pos logit contributions
les+0.036
ork+0.036
lights+0.035
pron+0.033
adoes+0.031
Neg logit contributions
 least-0.036
oh-0.028
 very-0.028
 ourselves-0.028
 cle-0.027
Token The
Token ID383
Feature activation+3.26e-01

Pos logit contributions
les+0.211
ork+0.207
lights+0.204
pron+0.192
adoes+0.177
Neg logit contributions
 least-0.214
oh-0.168
 very-0.167
 ourselves-0.163
 cle-0.159
Token mill
Token ID3939
Feature activation-1.25e+00

Pos logit contributions
 least+0.467
oh+0.351
 ourselves+0.341
 very+0.337
 cle+0.325
Neg logit contributions
les-0.608
ork-0.607
lights-0.590
pron-0.562
adoes-0.526
Token try
Token ID1949
Feature activation-5.08e-01

Pos logit contributions
ork+0.166
les+0.166
lights+0.162
pron+0.153
adoes+0.142
Neg logit contributions
 least-0.153
oh-0.117
 very-0.117
 ourselves-0.115
 cle-0.111
Token to
Token ID284
Feature activation+1.89e-01

Pos logit contributions
les+0.859
lights+0.855
ork+0.855
pron+0.806
adoes+0.744
Neg logit contributions
 least-0.802
 very-0.647
oh-0.622
 ourselves-0.619
 am-0.592
Token score
Token ID4776
Feature activation-4.25e-01

Pos logit contributions
 least+0.347
oh+0.279
 very+0.274
 ourselves+0.273
 cle+0.266
Neg logit contributions
les-0.274
ork-0.271
lights-0.262
pron-0.247
adoes-0.225
Token.
Token ID13
Feature activation-1.62e-01

Pos logit contributions
ork+0.782
les+0.779
lights+0.767
pron+0.744
lf+0.681
Neg logit contributions
 least-0.609
 very-0.473
 ourselves-0.457
oh-0.454
 cle-0.437
Token
Token ID198
Feature activation-7.77e-01

Pos logit contributions
les+0.295
ork+0.292
lights+0.286
pron+0.272
adoes+0.253
Neg logit contributions
 least-0.243
oh-0.185
 very-0.182
 ourselves-0.180
 am-0.172
Token
Token ID198
Feature activation-9.22e-01

Pos logit contributions
les+1.567
lights+1.527
ork+1.498
pron+1.436
adoes+1.409
Neg logit contributions
 least-0.988
oh-0.772
 very-0.763
 am-0.719
 ourselves-0.711
TokenOne
Token ID3198
Feature activation+1.69e-01

Pos logit contributions
les+1.348
lights+1.317
ork+1.292
pron+1.171
adoes+1.143
Neg logit contributions
 least-1.689
oh-1.459
 very-1.422
 ourselves-1.376
 am-1.358
Token day
Token ID1110
Feature activation-5.13e-02

Pos logit contributions
 least+0.283
oh+0.221
 ourselves+0.216
 very+0.216
 cle+0.209
Neg logit contributions
les-0.278
ork-0.276
lights-0.269
pron-0.254
adoes-0.234
Token,
Token ID11
Feature activation-3.92e-01

Pos logit contributions
les+0.101
ork+0.100
lights+0.098
pron+0.094
adoes+0.088
Neg logit contributions
 least-0.068
oh-0.050
 very-0.049
 ourselves-0.049
 cle-0.046
Token they
Token ID484
Feature activation+5.99e-01

Pos logit contributions
les+0.749
ork+0.741
lights+0.722
pron+0.689
adoes+0.642
Neg logit contributions
 least-0.552
 very-0.419
oh-0.411
 ourselves-0.406
 oh-0.388
Token play
Token ID711
Feature activation+3.01e-01

Pos logit contributions
 least+0.783
oh+0.585
 ourselves+0.570
 very+0.553
 cle+0.545
Neg logit contributions
les-1.182
ork-1.153
lights-1.115
pron-1.068
adoes-1.009
Token from
Token ID422
Feature activation+6.60e-01

Pos logit contributions
 least+1.020
oh+0.790
 ourselves+0.742
 very+0.726
 am+0.721
Neg logit contributions
les-1.214
ork-1.211
lights-1.178
pron-1.113
adoes-1.044
Token running
Token ID2491
Feature activation+3.12e-01

Pos logit contributions
 least+0.967
oh+0.763
 ourselves+0.722
 very+0.706
 am+0.698
Neg logit contributions
les-1.179
ork-1.166
lights-1.141
pron-1.077
adoes-1.012
Token any
Token ID597
Feature activation+3.57e-01

Pos logit contributions
 least+0.519
oh+0.407
 ourselves+0.394
 very+0.392
 cle+0.384
Neg logit contributions
les-0.511
ork-0.507
lights-0.494
pron-0.459
adoes-0.432
Token further
Token ID2252
Feature activation+6.54e-01

Pos logit contributions
 least+0.516
oh+0.393
 ourselves+0.381
 very+0.377
 am+0.362
Neg logit contributions
les-0.665
ork-0.653
lights-0.642
pron-0.603
adoes-0.566
Token.
Token ID13
Feature activation-4.12e-02

Pos logit contributions
 least+1.096
oh+0.869
 ourselves+0.829
 very+0.819
 cle+0.818
Neg logit contributions
les-1.076
ork-1.047
lights-1.036
pron-0.933
adoes-0.897
Token He
Token ID679
Feature activation-2.06e-01

Pos logit contributions
les+0.067
ork+0.066
lights+0.065
pron+0.061
adoes+0.056
Neg logit contributions
 least-0.070
oh-0.054
 very-0.054
 ourselves-0.053
 cle-0.052
Token had
Token ID550
Feature activation+7.11e-02

Pos logit contributions
ork+0.379
les+0.379
lights+0.373
pron+0.355
adoes+0.332
Neg logit contributions
 least-0.300
oh-0.225
 very-0.223
 ourselves-0.214
 cle-0.211
Token gotten
Token ID7891
Feature activation+3.98e-01

Pos logit contributions
 least+0.101
oh+0.075
 very+0.074
 ourselves+0.073
 cle+0.070
Neg logit contributions
les-0.133
ork-0.133
lights-0.130
pron-0.123
adoes-0.115
Token so
Token ID523
Feature activation+1.64e-01

Pos logit contributions
 least+0.539
oh+0.401
 ourselves+0.386
 very+0.375
 am+0.366
Neg logit contributions
les-0.770
ork-0.763
lights-0.745
pron-0.707
adoes-0.668
Token scared
Token ID12008
Feature activation+6.22e-02

Pos logit contributions
 least+0.251
oh+0.192
 very+0.186
 ourselves+0.186
 cle+0.179
Neg logit contributions
les-0.292
ork-0.291
lights-0.284
pron-0.268
adoes-0.251
Token he
Token ID339
Feature activation+3.88e-02

Pos logit contributions
 least+0.099
oh+0.077
 very+0.076
 ourselves+0.075
 cle+0.073
Neg logit contributions
les-0.106
ork-0.105
lights-0.102
pron-0.097
adoes-0.090
Token it
Token ID340
Feature activation+1.16e-01

Pos logit contributions
lights+1.305
ork+1.261
les+1.212
pron+1.207
lf+1.075
Neg logit contributions
 least-1.628
 very-1.373
oh-1.334
 ourselves-1.289
 so-1.271
Token up
Token ID510
Feature activation+4.08e-01

Pos logit contributions
 least+0.191
oh+0.148
 very+0.146
 ourselves+0.145
 cle+0.141
Neg logit contributions
les-0.192
ork-0.190
lights-0.185
pron-0.175
adoes-0.162
Token and
Token ID290
Feature activation-3.52e-01

Pos logit contributions
 least+0.725
oh+0.573
 ourselves+0.563
 very+0.551
 cle+0.549
Neg logit contributions
ork-0.624
les-0.621
lights-0.600
pron-0.564
adoes-0.521
Token started
Token ID2067
Feature activation-6.92e-01

Pos logit contributions
ork+0.642
les+0.633
lights+0.626
pron+0.592
adoes+0.547
Neg logit contributions
 least-0.529
 very-0.402
oh-0.397
 ourselves-0.385
 cle-0.375
Token to
Token ID284
Feature activation-3.42e-01

Pos logit contributions
lights+1.352
les+1.344
ork+1.324
pron+1.248
adoes+1.171
Neg logit contributions
 least-0.936
 very-0.734
oh-0.692
 cle-0.669
 ourselves-0.662
Token play
Token ID711
Feature activation-2.05e+00

Pos logit contributions
ork+0.563
les+0.562
lights+0.555
pron+0.510
adoes+0.473
Neg logit contributions
 least-0.580
 very-0.452
oh-0.449
 ourselves-0.436
 cle-0.433
Token with
Token ID351
Feature activation-2.17e+00

Pos logit contributions
lights+4.114
les+3.968
ork+3.848
pron+3.839
anne+3.675
Neg logit contributions
 least-2.738
 very-2.552
 so-2.277
 oh-2.234
 am-2.188
Token it
Token ID340
Feature activation-1.86e-01

Pos logit contributions
lights+3.689
pron+3.657
ork+3.557
les+3.448
anne+3.289
Neg logit contributions
 least-3.562
 very-3.199
 ourselves-2.914
 so-2.893
 oh-2.825
Token.
Token ID13
Feature activation-9.74e-01

Pos logit contributions
les+0.313
ork+0.311
lights+0.305
pron+0.289
adoes+0.267
Neg logit contributions
 least-0.302
 very-0.237
oh-0.234
 ourselves-0.228
 cle-0.223
Token Tim
Token ID5045
Feature activation+6.67e-01

Pos logit contributions
les+1.949
lights+1.895
ork+1.879
pron+1.791
adoes+1.678
Neg logit contributions
 least-1.284
 very-1.020
oh-0.992
 so-0.926
 am-0.920
Tokenmy
Token ID1820
Feature activation-8.17e-02

Pos logit contributions
 least+1.141
 ourselves+0.914
oh+0.895
 cle+0.883
 very+0.881
Neg logit contributions
les-1.070
ork-1.018
lights-0.990
pron-0.946
lf-0.877
TokenLook
Token ID8567
Feature activation+4.97e-02

Pos logit contributions
 least+0.359
oh+0.260
 very+0.256
 ourselves+0.254
 cle+0.244
Neg logit contributions
les-0.516
ork-0.513
lights-0.502
pron-0.479
adoes-0.449
Token,
Token ID11
Feature activation-4.01e-02

Pos logit contributions
 least+0.069
oh+0.051
 very+0.050
 ourselves+0.049
 cle+0.047
Neg logit contributions
les-0.095
ork-0.095
lights-0.093
pron-0.088
adoes-0.083
Token I
Token ID314
Feature activation+4.54e-01

Pos logit contributions
les+0.070
ork+0.069
lights+0.068
pron+0.064
adoes+0.060
Neg logit contributions
 least-0.063
oh-0.048
 very-0.048
 ourselves-0.047
 cle-0.045
Token am
Token ID716
Feature activation+3.86e-01

Pos logit contributions
 least+0.711
 ourselves+0.548
oh+0.545
 very+0.537
 cle+0.520
Neg logit contributions
les-0.780
ork-0.778
lights-0.742
pron-0.720
lf-0.658
Token a
Token ID257
Feature activation+5.71e-01

Pos logit contributions
 least+0.481
oh+0.345
 ourselves+0.329
 very+0.321
 am+0.313
Neg logit contributions
les-0.785
ork-0.783
lights-0.768
pron-0.739
adoes-0.692
Token graceful
Token ID44363
Feature activation-2.29e+00

Pos logit contributions
 least+1.174
oh+0.973
 ourselves+0.964
 very+0.933
 cle+0.923
Neg logit contributions
ork-0.700
les-0.689
lights-0.658
pron-0.637
lf-0.556
Token dancer
Token ID38619
Feature activation+2.71e-01

Pos logit contributions
lights+3.774
ork+3.707
les+3.589
pron+3.228
adoes+3.169
Neg logit contributions
 least-3.948
 very-3.263
oh-3.012
 oh-2.967
 am-2.958
Token!"
Token ID2474
Feature activation+1.68e-01

Pos logit contributions
 least+0.355
oh+0.255
 very+0.248
 ourselves+0.248
 cle+0.239
Neg logit contributions
les-0.537
ork-0.532
lights-0.522
pron-0.497
adoes-0.467
Token Lily
Token ID20037
Feature activation+4.12e-01

Pos logit contributions
 least+0.295
oh+0.234
 very+0.229
 ourselves+0.228
 cle+0.222
Neg logit contributions
les-0.262
ork-0.260
lights-0.253
pron-0.237
adoes-0.219
Token said
Token ID531
Feature activation+3.15e-02

Pos logit contributions
 least+0.485
oh+0.328
 ourselves+0.324
 very+0.313
 cle+0.304
Neg logit contributions
les-0.884
ork-0.873
lights-0.856
pron-0.820
lf-0.775
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
 least+0.052
oh+0.041
 very+0.040
 ourselves+0.040
 cle+0.039
Neg logit contributions
les-0.052
ork-0.052
lights-0.050
pron-0.048
adoes-0.044
Token explore
Token ID7301
Feature activation-5.49e-02

Pos logit contributions
les+0.055
ork+0.055
lights+0.054
pron+0.051
adoes+0.047
Neg logit contributions
 least-0.046
oh-0.035
 very-0.034
 ourselves-0.034
 cle-0.033
Token the
Token ID262
Feature activation+1.89e-01

Pos logit contributions
les+0.097
ork+0.097
lights+0.094
pron+0.090
adoes+0.083
Neg logit contributions
 least-0.084
oh-0.064
 very-0.064
 ourselves-0.063
 cle-0.061
Token ground
Token ID2323
Feature activation-4.28e-01

Pos logit contributions
 least+0.326
oh+0.257
 very+0.251
 ourselves+0.251
 cle+0.244
Neg logit contributions
les-0.300
ork-0.297
lights-0.288
pron-0.271
adoes-0.252
Token and
Token ID290
Feature activation-2.58e-01

Pos logit contributions
ork+0.766
les+0.761
lights+0.742
pron+0.715
adoes+0.664
Neg logit contributions
 least-0.652
 very-0.516
oh-0.492
 ourselves-0.482
 cle-0.466
Token play
Token ID711
Feature activation-6.78e-01

Pos logit contributions
les+0.500
ork+0.500
lights+0.489
pron+0.466
adoes+0.435
Neg logit contributions
 least-0.359
 very-0.265
oh-0.259
 ourselves-0.254
 am-0.247
Token with
Token ID351
Feature activation-2.11e+00

Pos logit contributions
ork+1.192
les+1.187
lights+1.187
pron+1.132
adoes+1.030
Neg logit contributions
 least-1.012
 very-0.834
 ourselves-0.778
oh-0.776
 am-0.762
Token his
Token ID465
Feature activation+5.16e-01

Pos logit contributions
pron+4.619
ork+4.515
lights+4.492
les+4.323
anne+4.064
Neg logit contributions
 least-2.460
 very-2.275
 so-2.029
 ourselves-1.916
 oh-1.864
Token friends
Token ID2460
Feature activation-4.40e-01

Pos logit contributions
 least+0.958
oh+0.778
 ourselves+0.738
 very+0.732
 cle+0.715
Neg logit contributions
ork-0.755
les-0.750
lights-0.725
pron-0.675
adoes-0.637
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
ork+0.771
les+0.765
lights+0.755
pron+0.714
adoes+0.660
Neg logit contributions
 least-0.693
 very-0.548
oh-0.527
 ourselves-0.520
 am-0.501
Token One
Token ID1881
Feature activation+1.69e-01

Pos logit contributions
les+2.066
ork+2.027
lights+2.024
pron+1.958
lf+1.831
Neg logit contributions
 least-1.298
 very-1.003
 am-0.949
oh-0.937
 ourselves-0.907
Token day
Token ID1110
Feature activation+2.51e-01

Pos logit contributions
 least+0.281
oh+0.219
 ourselves+0.215
 very+0.214
 cle+0.208
Neg logit contributions
les-0.280
ork-0.279
lights-0.271
pron-0.256
adoes-0.236
TokenThe
Token ID464
Feature activation+4.56e-01

Pos logit contributions
 least+0.683
oh+0.510
 ourselves+0.509
 very+0.507
 cle+0.494
Neg logit contributions
ork-0.728
les-0.718
lights-0.694
pron-0.665
lf-0.618
Token boy
Token ID2933
Feature activation+6.23e-01

Pos logit contributions
 least+0.796
oh+0.639
 ourselves+0.622
 very+0.617
 cle+0.600
Neg logit contributions
les-0.697
ork-0.691
lights-0.671
pron-0.636
adoes-0.581
Token was
Token ID373
Feature activation-9.14e-03

Pos logit contributions
 least+0.887
 ourselves+0.664
oh+0.649
 cle+0.622
 very+0.621
Neg logit contributions
les-1.173
ork-1.152
lights-1.120
pron-1.069
lf-1.001
Token small
Token ID1402
Feature activation+7.61e-01

Pos logit contributions
les+0.021
ork+0.021
lights+0.021
pron+0.020
adoes+0.019
Neg logit contributions
 least-0.009
oh-0.006
 very-0.005
 ourselves-0.005
 cle-0.005
Token and
Token ID290
Feature activation-1.10e-01

Pos logit contributions
 least+1.321
oh+1.068
 ourselves+1.046
 am+1.000
 very+0.992
Neg logit contributions
les-1.186
ork-1.145
lights-1.140
pron-1.046
adoes-0.987
Token wore
Token ID12408
Feature activation-2.52e+00

Pos logit contributions
les+0.191
ork+0.190
lights+0.185
pron+0.174
adoes+0.163
Neg logit contributions
 least-0.173
oh-0.133
 very-0.133
 ourselves-0.129
 cle-0.126
Token a
Token ID257
Feature activation-2.32e+00

Pos logit contributions
ork+6.203
les+6.055
lights+6.006
pron+5.602
ravings+5.467
Neg logit contributions
 very-2.170
 least-2.057
 so-1.822
 cle-1.462
 oh-1.413
Token modest
Token ID12949
Feature activation-1.41e+00

Pos logit contributions
lights+4.326
les+4.310
ork+4.194
adoes+4.011
ors+3.813
Neg logit contributions
 very-3.216
 least-3.149
 so-2.632
 am-2.582
 cle-2.554
Token white
Token ID2330
Feature activation-1.71e+00

Pos logit contributions
les+2.163
lights+2.157
ork+2.116
adoes+1.856
pron+1.843
Neg logit contributions
 least-2.454
 very-2.136
 so-1.925
 am-1.921
 cle-1.918
Token shirt
Token ID10147
Feature activation-1.13e+00

Pos logit contributions
lights+2.768
les+2.737
ork+2.677
adoes+2.379
pron+2.297
Neg logit contributions
 least-2.768
 very-2.427
 am-2.186
oh-2.145
 oh-2.139
Token.
Token ID13
Feature activation-8.23e-01

Pos logit contributions
ork+2.042
les+2.030
lights+1.983
pron+1.856
adoes+1.815
Neg logit contributions
 least-1.664
 very-1.370
 so-1.243
oh-1.235
 ourselves-1.198
Token smiles
Token ID21845
Feature activation-1.02e-01

Pos logit contributions
ork+1.274
les+1.254
lights+1.229
pron+1.179
adoes+1.082
Neg logit contributions
 least-1.393
 very-1.131
 ourselves-1.078
oh-1.069
 cle-1.064
Token.
Token ID13
Feature activation-4.89e-01

Pos logit contributions
ork+0.177
les+0.176
lights+0.172
pron+0.163
adoes+0.152
Neg logit contributions
 least-0.160
oh-0.123
 very-0.123
 ourselves-0.120
 cle-0.116
Token
Token ID198
Feature activation+3.37e-01

Pos logit contributions
les+0.911
ork+0.910
lights+0.892
pron+0.857
lf+0.806
Neg logit contributions
 least-0.703
 very-0.525
oh-0.513
 ourselves-0.513
 am-0.485
Token
Token ID198
Feature activation-4.91e-01

Pos logit contributions
 least+0.536
oh+0.411
 ourselves+0.398
 very+0.398
 cle+0.391
Neg logit contributions
les-0.579
ork-0.578
lights-0.559
pron-0.529
adoes-0.489
TokenThe
Token ID464
Feature activation-1.45e+00

Pos logit contributions
ork+0.920
les+0.920
lights+0.917
pron+0.854
adoes+0.812
Neg logit contributions
 least-0.695
 very-0.520
oh-0.519
 ourselves-0.513
 am-0.485
Token gentle
Token ID10296
Feature activation-2.38e+00

Pos logit contributions
ork+2.528
lights+2.472
les+2.466
pron+2.272
adoes+2.203
Neg logit contributions
 least-2.395
 very-1.965
 cle-1.843
oh-1.725
 so-1.706
Token van
Token ID5719
Feature activation-7.39e-01

Pos logit contributions
ork+4.205
lights+4.175
pron+4.095
les+4.037
lf+3.575
Neg logit contributions
 least-3.573
 very-3.001
 say-2.847
 am-2.756
 cle-2.728
Token drove
Token ID10357
Feature activation-5.92e-02

Pos logit contributions
ork+1.380
les+1.373
lights+1.347
pron+1.292
adoes+1.215
Neg logit contributions
 least-1.058
oh-0.800
 very-0.800
 cle-0.743
 ourselves-0.733
Token away
Token ID1497
Feature activation+9.06e-01

Pos logit contributions
les+0.100
ork+0.099
lights+0.097
pron+0.092
adoes+0.085
Neg logit contributions
 least-0.095
oh-0.074
 very-0.074
 ourselves-0.072
 cle-0.070
Token,
Token ID11
Feature activation-4.94e-01

Pos logit contributions
 least+1.433
oh+1.120
 ourselves+1.067
 cle+1.040
 oh+1.027
Neg logit contributions
ork-1.543
les-1.540
lights-1.515
pron-1.375
adoes-1.314
Token to
Token ID284
Feature activation+6.17e-01

Pos logit contributions
ork+0.908
les+0.907
lights+0.883
pron+0.833
adoes+0.785
Neg logit contributions
 least-0.736
 very-0.561
oh-0.547
 ourselves-0.532
 cle-0.520