TOP ACTIVATIONS
MAX = 3.684

and they were able to un pack it . Inside they
filled the hole in Tim my 's yard . Tim my
opened the box and found Lily 's favorite stuffed lion .
takes a cloth and wipes Ben 's shirt . But she
No , thank you . Cig arettes are stupid . They
opens her eyes and sees Ben with something in his mouth
not want to be a tatt let ale . She decided
big rock and put his father 's jacket on it .
opened the box and to her surprise , it was full
They all agreed and helped her open the chest . Inside
used the glove to clean her room . She wiped the
Mom smiles and wipes Mia 's face . She throws
opens her eyes and sees Tom . She rub s her
asked her dad to help her remove the lid of the
and placed something into his son 's hand , they looked
her mom my to help her order a new chair .
and gently wiped the little girl 's skin .
toy box , and to her surprise , the magnet was
her mom my to help her . " Mom
sink and mom my cleaned Lily 's shirt with soap and

MIN ACTIVATIONS
MIN = -4.040

bush . Every day , lots of different animals would gather
One day , a tall gir affe named Gig i went
a time there was a gir affe who loved to get
, there was a baby gir affe named Pr une .
girl named Mary . Mary dreamed of finding a diamond .
a time there was a gir affe named Sam and a
clock went " tick - t ock , tick - t
, there was a little gir affe named Gig i .
was a big , tall gir affe named Jerry . Jerry
was a monkey and a gir affe . The gir affe
, there was a tall gir affe named Jerry . Jerry
a bridge . Everyday , lots of people would cross the
One day , a tall gir affe was walking through the
adventure story , but after an hour they were both tired
Every morning , lots of boats arrived and left
just went " tick - t ock , tick - t
lively little bird named Ch ir py . Ch ir py
she asked her mom my lots of questions about marriage .
toys . Sally said yes , and they made a trade
time , there was a gir affe named Jerry . Jerry

INTERVAL 1.753 - 3.684
CONTAINS 1.017%

Anna says . " Let 's make bubbles with it !"
day . She was outside in the big backyard in the
Tom said , " I promise . And I will wash
it was useful to help her mom my .
Once upon a time , there was a little girl named

INTERVAL -0.178 - 1.753
CONTAINS 58.800%

, and just passing it makes you feel good ." The
Lily . She loved to play outside with her friends .
ran around the park in excitement . The
she found an old book in her grandma 's attic .
, I 'll make one for you . But you have

INTERVAL -2.109 - -0.178
CONTAINS 39.801%

forest ran away , but the oak tree stayed strong and
day . The sun was shining , and the birds were
, and Fl uffy played all day . They ran and
the restaurant . But when they got there , they saw
if he wanted to play a game . Rob agreed and

INTERVAL -4.040 - -2.109
CONTAINS 0.383%

. The neighbor was a kind lady . She saw the
day , a big bird heard her singing and said ,
too dangerous . Lily felt sad again , but she knew
ster was not being nice , but it did not say
having so much fun . They had a great day together
Token and
Token ID290
Feature activation+4.63e-01

Pos logit contributions
bes+2.678
vered+2.476
es+2.445
e+2.443
antly+2.410
Neg logit contributions
aff-1.993
ock-1.904
weet-1.830
iki-1.722
oto-1.667
Token they
Token ID484
Feature activation+1.19e-01

Pos logit contributions
bes+0.808
vered+0.750
antly+0.748
es+0.732
e+0.727
Neg logit contributions
aff-0.804
ock-0.801
weet-0.792
iki-0.736
oto-0.710
Token were
Token ID547
Feature activation+1.20e+00

Pos logit contributions
bes+0.193
antly+0.184
vered+0.180
es+0.176
e+0.172
Neg logit contributions
aff-0.221
ock-0.219
weet-0.216
iki-0.202
oto-0.196
Token able
Token ID1498
Feature activation+5.05e-01

Pos logit contributions
bes+2.392
antly+2.291
vered+2.273
es+2.199
 kit+2.166
Neg logit contributions
aff-1.756
weet-1.742
ock-1.734
iki-1.582
ules-1.529
Token to
Token ID284
Feature activation+1.41e+00

Pos logit contributions
bes+1.045
antly+1.002
vered+0.982
es+0.971
e+0.961
Neg logit contributions
aff-0.711
weet-0.703
ock-0.698
iki-0.640
aa-0.613
Token un
Token ID555
Feature activation+3.68e+00

Pos logit contributions
bes+2.247
es+2.150
antly+2.112
vered+2.074
 kit+1.965
Neg logit contributions
ock-2.686
aff-2.612
weet-2.557
iki-2.431
oto-2.381
Tokenpack
Token ID8002
Feature activation+9.81e-01

Pos logit contributions
bes+3.974
 Something+3.600
 elbows+3.422
vered+3.328
es+3.277
Neg logit contributions
aff-9.578
ock-8.869
aa-8.826
add-8.629
oot-8.493
Token it
Token ID340
Feature activation+1.19e+00

Pos logit contributions
bes+1.782
antly+1.669
vered+1.649
es+1.616
nob+1.613
Neg logit contributions
aff-1.614
ock-1.601
weet-1.526
iki-1.423
oto-1.394
Token.
Token ID13
Feature activation+3.61e-01

Pos logit contributions
bes+2.073
vered+1.946
antly+1.938
e+1.901
es+1.858
Neg logit contributions
ock-2.051
aff-2.017
weet-1.949
iki-1.830
oto-1.779
Token Inside
Token ID14384
Feature activation+5.72e-02

Pos logit contributions
bes+0.574
antly+0.540
vered+0.539
es+0.526
e+0.519
Neg logit contributions
aff-0.684
ock-0.667
weet-0.667
iki-0.623
ules-0.604
Token they
Token ID484
Feature activation+8.70e-01

Pos logit contributions
bes+0.086
antly+0.082
vered+0.080
es+0.079
e+0.077
Neg logit contributions
aff-0.112
ock-0.111
weet-0.110
iki-0.103
oto-0.100
Token filled
Token ID5901
Feature activation+1.55e+00

Pos logit contributions
bes+1.198
antly+1.151
vered+1.131
e+1.102
es+1.094
Neg logit contributions
ock-1.197
aff-1.193
weet-1.181
iki-1.091
oto-1.058
Token the
Token ID262
Feature activation+1.35e+00

Pos logit contributions
bes+2.780
vered+2.679
antly+2.677
 kit+2.558
es+2.556
Neg logit contributions
aff-2.655
ock-2.570
weet-2.534
oto-2.363
iki-2.275
Token hole
Token ID7604
Feature activation+1.16e+00

Pos logit contributions
bes+1.510
antly+1.449
es+1.407
vered+1.402
e+1.336
Neg logit contributions
aff-3.162
ock-3.126
weet-3.083
iki-2.901
arl-2.847
Token in
Token ID287
Feature activation+1.53e+00

Pos logit contributions
bes+1.371
antly+1.284
vered+1.249
 kit+1.179
es+1.162
Neg logit contributions
aff-2.631
ock-2.602
weet-2.562
iki-2.406
oto-2.354
Token Tim
Token ID5045
Feature activation+2.34e+00

Pos logit contributions
bes+2.979
antly+2.879
es+2.832
vered+2.819
e+2.677
Neg logit contributions
ock-2.342
weet-2.309
aff-2.260
iki-2.002
oto-1.959
Tokenmy
Token ID1820
Feature activation+3.68e+00

Pos logit contributions
bes+4.325
antly+4.043
vered+4.032
 Something+3.832
es+3.766
Neg logit contributions
weet-4.232
ock-4.199
aff-3.965
iki-3.700
aa-3.646
Token's
Token ID338
Feature activation+2.01e+00

Pos logit contributions
antly+3.914
bes+3.758
vered+3.419
 Something+3.174
 Hide+3.121
Neg logit contributions
weet-9.187
ock-9.015
aff-8.799
aa-8.214
 tour-8.086
Token yard
Token ID12699
Feature activation+1.27e+00

Pos logit contributions
es+2.926
bes+2.913
vered+2.815
antly+2.809
e+2.682
Neg logit contributions
aff-3.837
weet-3.787
ock-3.766
iki-3.755
 tour-3.614
Token.
Token ID13
Feature activation-5.89e-01

Pos logit contributions
bes+2.075
antly+2.036
vered+2.001
es+1.901
e+1.880
Neg logit contributions
aff-2.301
ock-2.235
weet-2.215
iki-2.013
oto-1.986
Token Tim
Token ID5045
Feature activation-4.43e-01

Pos logit contributions
ock+1.201
aff+1.186
weet+1.177
iki+1.106
oto+1.076
Neg logit contributions
bes-0.854
antly-0.821
es-0.791
vered-0.778
e-0.750
Tokenmy
Token ID1820
Feature activation+7.03e-01

Pos logit contributions
aff+0.792
ock+0.779
weet+0.772
iki+0.716
ules+0.697
Neg logit contributions
bes-0.743
antly-0.723
vered-0.705
es-0.704
e-0.689
Token opened
Token ID4721
Feature activation+1.48e+00

Pos logit contributions
bes+1.649
antly+1.558
vered+1.503
e+1.457
es+1.449
Neg logit contributions
aff-1.675
ock-1.661
weet-1.609
iki-1.542
ules-1.490
Token the
Token ID262
Feature activation+2.33e+00

Pos logit contributions
bes+3.155
vered+2.940
antly+2.929
es+2.863
e+2.858
Neg logit contributions
aff-2.070
ock-1.969
weet-1.920
iki-1.831
aa-1.730
Token box
Token ID3091
Feature activation+8.57e-01

Pos logit contributions
bes+3.401
vered+3.203
e+3.179
es+3.050
antly+2.967
Neg logit contributions
aff-4.954
ock-4.879
arl-4.646
weet-4.613
 tour-4.499
Token and
Token ID290
Feature activation+1.59e+00

Pos logit contributions
bes+1.866
vered+1.776
antly+1.771
e+1.727
es+1.697
Neg logit contributions
aff-1.113
ock-1.099
weet-1.059
iki-0.979
oto-0.921
Token found
Token ID1043
Feature activation+1.63e+00

Pos logit contributions
bes+2.880
vered+2.682
antly+2.625
e+2.571
 wrist+2.505
Neg logit contributions
aff-2.660
ock-2.650
weet-2.599
iki-2.458
aa-2.389
Token Lily
Token ID20037
Feature activation+3.65e+00

Pos logit contributions
bes+2.937
vered+2.663
 wrist+2.643
antly+2.603
e+2.549
Neg logit contributions
aff-2.878
ock-2.785
weet-2.696
iki-2.561
oto-2.467
Token's
Token ID338
Feature activation+1.96e+00

Pos logit contributions
antly+7.779
bes+7.410
vered+7.340
ASH+7.187
 Something+7.011
Neg logit contributions
aff-5.059
weet-5.024
ock-4.971
iki-4.731
 tour-4.341
Token favorite
Token ID4004
Feature activation+2.53e+00

Pos logit contributions
bes+2.952
antly+2.932
vered+2.889
es+2.865
e+2.718
Neg logit contributions
aff-3.843
ock-3.720
weet-3.647
iki-3.584
oto-3.522
Token stuffed
Token ID22259
Feature activation+1.04e+00

Pos logit contributions
bes+4.287
vered+4.109
antly+4.093
es+3.979
 elbows+3.809
Neg logit contributions
aff-4.714
ock-4.666
weet-4.375
oto-4.130
 tour-4.128
Token lion
Token ID18744
Feature activation+2.58e+00

Pos logit contributions
bes+1.776
antly+1.707
vered+1.679
es+1.610
e+1.591
Neg logit contributions
aff-1.878
ock-1.845
weet-1.755
iki-1.669
oto-1.662
Token.
Token ID13
Feature activation+4.57e-01

Pos logit contributions
antly+4.967
vered+4.843
bes+4.785
ASH+4.601
 Something+4.506
Neg logit contributions
aff-4.172
ock-4.005
weet-3.983
iki-3.692
oto-3.558
Token takes
Token ID2753
Feature activation+5.79e-01

Pos logit contributions
bes+1.102
antly+1.067
vered+1.050
es+1.020
e+1.018
Neg logit contributions
aff-0.915
ock-0.903
weet-0.879
iki-0.820
oto-0.796
Token a
Token ID257
Feature activation-1.06e-01

Pos logit contributions
bes+1.069
antly+1.004
vered+0.998
es+0.975
 wrist+0.952
Neg logit contributions
ock-0.960
aff-0.958
weet-0.937
iki-0.853
aa-0.848
Token cloth
Token ID16270
Feature activation-5.85e-01

Pos logit contributions
aff+0.229
ock+0.227
weet+0.225
iki+0.212
oto+0.207
Neg logit contributions
bes-0.138
antly-0.132
vered-0.128
es-0.126
e-0.122
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
aff+0.974
weet+0.954
ock+0.949
iki+0.879
ules+0.849
Neg logit contributions
bes-1.051
antly-1.025
es-1.004
vered-0.995
 kit-0.971
Token wipes
Token ID46142
Feature activation+7.36e-01

Pos logit contributions
bes+1.678
antly+1.593
vered+1.593
e+1.542
es+1.521
Neg logit contributions
ock-2.032
weet-2.004
aff-1.974
iki-1.844
aa-1.829
Token Ben
Token ID3932
Feature activation+3.57e+00

Pos logit contributions
bes+1.404
vered+1.317
antly+1.309
es+1.285
nob+1.268
Neg logit contributions
ock-1.163
aff-1.152
weet-1.122
iki-1.037
aa-1.016
Token's
Token ID338
Feature activation+1.47e+00

Pos logit contributions
bes+4.440
antly+4.403
ograph+4.107
vered+4.031
 Hide+3.998
Neg logit contributions
ock-7.881
weet-7.720
aa-7.459
aff-7.240
 wind-7.003
Token shirt
Token ID10147
Feature activation+9.36e-01

Pos logit contributions
vered+2.375
es+2.355
antly+2.343
bes+2.342
nob+2.238
Neg logit contributions
ock-2.591
aff-2.578
weet-2.558
aa-2.413
oto-2.412
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
bes+1.509
vered+1.451
antly+1.448
e+1.381
 searches+1.374
Neg logit contributions
ock-1.724
aff-1.690
weet-1.631
iki-1.506
oto-1.504
Token But
Token ID887
Feature activation+1.00e+00

Pos logit contributions
aff+0.547
ock+0.546
weet+0.535
iki+0.496
oto+0.479
Neg logit contributions
bes-0.617
antly-0.596
vered-0.581
es-0.579
e-0.561
Token she
Token ID673
Feature activation+3.03e-01

Pos logit contributions
bes+2.065
antly+1.969
vered+1.945
nob+1.919
es+1.873
Neg logit contributions
ock-1.432
weet-1.398
aff-1.382
iki-1.282
oto-1.235
TokenNo
Token ID2949
Feature activation-8.59e-01

Pos logit contributions
aff+0.177
ock+0.176
weet+0.174
iki+0.164
oto+0.160
Neg logit contributions
bes-0.105
antly-0.100
vered-0.097
es-0.096
e-0.093
Token,
Token ID11
Feature activation+8.96e-02

Pos logit contributions
aff+1.666
ock+1.632
weet+1.614
iki+1.508
ules+1.470
Neg logit contributions
bes-1.323
antly-1.280
es-1.248
vered-1.243
 kit-1.207
Token thank
Token ID5875
Feature activation+5.52e-01

Pos logit contributions
bes+0.151
antly+0.146
vered+0.142
es+0.140
e+0.137
Neg logit contributions
aff-0.160
ock-0.159
weet-0.157
iki-0.146
oto-0.142
Token you
Token ID345
Feature activation-7.24e-01

Pos logit contributions
bes+0.966
antly+0.924
vered+0.904
es+0.882
nob+0.879
Neg logit contributions
aff-0.960
ock-0.947
weet-0.942
iki-0.871
aa-0.857
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
aff+1.478
ock+1.466
weet+1.444
iki+1.371
oto+1.326
Neg logit contributions
bes-1.022
antly-0.988
es-0.970
vered-0.946
e-0.928
Token Cig
Token ID32616
Feature activation+3.56e+00

Pos logit contributions
bes+1.905
antly+1.796
vered+1.786
es+1.729
nob+1.709
Neg logit contributions
weet-2.346
aff-2.335
ock-2.265
aa-2.162
iki-2.142
Tokenarettes
Token ID13890
Feature activation-1.44e+00

Pos logit contributions
vered+5.290
bes+4.761
antly+4.667
 searches+4.506
 wrist+4.469
Neg logit contributions
aa-7.316
iki-7.055
aff-7.039
oto-6.849
ules-6.846
Token are
Token ID389
Feature activation-7.36e-01

Pos logit contributions
aff+3.138
weet+3.052
ock+2.989
iki+2.935
oto+2.825
Neg logit contributions
bes-1.841
es-1.814
antly-1.782
e-1.727
 kit-1.681
Token stupid
Token ID8531
Feature activation-1.03e-01

Pos logit contributions
aff+1.215
ock+1.203
weet+1.178
iki+1.105
ules+1.055
Neg logit contributions
bes-1.353
antly-1.296
es-1.271
vered-1.251
e-1.248
Token.
Token ID13
Feature activation+8.09e-01

Pos logit contributions
aff+0.197
ock+0.195
weet+0.193
iki+0.181
oto+0.176
Neg logit contributions
bes-0.159
antly-0.152
vered-0.149
es-0.147
e-0.143
Token They
Token ID1119
Feature activation-4.28e-01

Pos logit contributions
bes+1.450
antly+1.381
vered+1.377
es+1.334
nob+1.327
Neg logit contributions
aff-1.359
weet-1.333
ock-1.324
aa-1.219
iki-1.215
Token opens
Token ID9808
Feature activation+8.64e-01

Pos logit contributions
bes+2.547
antly+2.436
vered+2.381
e+2.294
es+2.221
Neg logit contributions
aff-3.183
ock-3.096
weet-3.028
iki-2.975
aa-2.806
Token her
Token ID607
Feature activation+1.87e+00

Pos logit contributions
bes+1.714
antly+1.629
vered+1.616
es+1.589
house+1.561
Neg logit contributions
aff-1.290
ock-1.282
weet-1.253
iki-1.159
oto-1.139
Token eyes
Token ID2951
Feature activation+3.20e-01

Pos logit contributions
bes+3.345
vered+3.238
antly+3.113
es+3.070
e+3.054
Neg logit contributions
aff-3.146
weet-2.970
iki-2.962
ock-2.878
arl-2.754
Token and
Token ID290
Feature activation+1.93e+00

Pos logit contributions
bes+0.655
antly+0.623
vered+0.619
es+0.606
e+0.601
Neg logit contributions
aff-0.459
ock-0.456
weet-0.445
iki-0.403
oto-0.387
Token sees
Token ID7224
Feature activation-2.38e-01

Pos logit contributions
bes+3.085
antly+2.871
vered+2.864
e+2.756
house+2.700
Neg logit contributions
weet-3.570
ock-3.539
aff-3.524
iki-3.381
aa-3.186
Token Ben
Token ID3932
Feature activation+3.50e+00

Pos logit contributions
aff+0.467
ock+0.463
weet+0.458
iki+0.429
aa+0.417
Neg logit contributions
bes-0.357
antly-0.342
vered-0.334
es-0.332
e-0.321
Token with
Token ID351
Feature activation+6.05e-01

Pos logit contributions
bes+6.718
antly+6.673
vered+6.303
ASH+6.295
 kit+5.944
Neg logit contributions
weet-5.838
aff-5.680
ock-5.675
iki-5.320
oto-4.924
Token something
Token ID1223
Feature activation+1.56e+00

Pos logit contributions
bes+1.004
antly+0.965
vered+0.933
es+0.923
nob+0.898
Neg logit contributions
ock-1.093
aff-1.085
weet-1.085
iki-1.003
oto-0.982
Token in
Token ID287
Feature activation+1.37e+00

Pos logit contributions
bes+3.135
antly+2.979
vered+2.956
e+2.853
nob+2.853
Neg logit contributions
aff-2.376
ock-2.357
weet-2.275
iki-2.145
oto-2.115
Token his
Token ID465
Feature activation+1.91e+00

Pos logit contributions
bes+2.671
antly+2.609
vered+2.560
es+2.514
house+2.478
Neg logit contributions
ock-2.071
weet-2.046
aff-2.008
iki-1.878
oto-1.854
Token mouth
Token ID5422
Feature activation+6.68e-01

Pos logit contributions
bes+2.311
vered+2.210
antly+2.200
es+2.184
nob+1.954
Neg logit contributions
weet-4.207
aff-4.134
ock-4.122
iki-4.116
aa-3.939
Token not
Token ID407
Feature activation-1.05e+00

Pos logit contributions
antly+2.408
bes+2.379
vered+2.370
es+2.277
 kit+2.240
Neg logit contributions
ock-1.817
aff-1.790
weet-1.770
oto-1.636
iki-1.606
Token want
Token ID765
Feature activation-7.51e-01

Pos logit contributions
aff+1.852
weet+1.761
ock+1.755
iki+1.689
ules+1.614
Neg logit contributions
bes-1.785
e-1.759
es-1.713
antly-1.696
nob-1.693
Token to
Token ID284
Feature activation+1.44e-01

Pos logit contributions
aff+1.284
ock+1.266
weet+1.250
iki+1.178
ules+1.137
Neg logit contributions
bes-1.296
antly-1.273
vered-1.229
es-1.228
e-1.215
Token be
Token ID307
Feature activation-4.73e-03

Pos logit contributions
bes+0.236
antly+0.227
vered+0.222
es+0.220
nob+0.210
Neg logit contributions
aff-0.264
ock-0.263
weet-0.260
iki-0.240
oto-0.235
Token a
Token ID257
Feature activation-2.51e-01

Pos logit contributions
aff+0.008
ock+0.008
weet+0.008
iki+0.007
oto+0.007
Neg logit contributions
bes-0.009
antly-0.008
vered-0.008
es-0.008
e-0.008
Token tatt
Token ID14399
Feature activation+3.48e+00

Pos logit contributions
aff+0.395
ock+0.393
weet+0.387
iki+0.356
oto+0.343
Neg logit contributions
bes-0.471
antly-0.457
vered-0.447
es-0.442
e-0.434
Tokenlet
Token ID1616
Feature activation-4.29e-01

Pos logit contributions
bes+6.596
vered+5.950
 searches+5.884
antly+5.583
 Something+5.570
Neg logit contributions
aff-6.254
iki-6.073
aa-5.931
ock-5.923
weet-5.914
Tokenale
Token ID1000
Feature activation-8.87e-01

Pos logit contributions
aff+0.853
ock+0.840
weet+0.837
iki+0.785
oto+0.761
Neg logit contributions
bes-0.627
antly-0.607
es-0.590
vered-0.587
e-0.574
Token.
Token ID13
Feature activation+7.80e-01

Pos logit contributions
aff+1.826
ock+1.823
weet+1.815
iki+1.704
aa+1.652
Neg logit contributions
bes-1.206
es-1.175
antly-1.168
vered-1.129
e-1.105
Token She
Token ID1375
Feature activation-2.00e-01

Pos logit contributions
bes+1.629
vered+1.563
antly+1.563
es+1.525
e+1.514
Neg logit contributions
aff-1.087
weet-1.073
ock-1.043
iki-0.975
aa-0.932
Token decided
Token ID3066
Feature activation-6.52e-01

Pos logit contributions
aff+0.350
ock+0.348
weet+0.344
iki+0.317
oto+0.308
Neg logit contributions
bes-0.342
antly-0.330
vered-0.322
es-0.319
e-0.312
Token big
Token ID1263
Feature activation+5.45e-01

Pos logit contributions
bes+1.568
es+1.504
antly+1.492
vered+1.487
e+1.450
Neg logit contributions
ock-1.531
weet-1.512
aff-1.510
iki-1.409
aa-1.385
Token rock
Token ID3881
Feature activation+3.44e-01

Pos logit contributions
bes+0.868
antly+0.832
es+0.813
vered+0.807
e+0.776
Neg logit contributions
ock-1.023
aff-1.011
weet-1.005
iki-0.937
aa-0.917
Token and
Token ID290
Feature activation+4.79e-01

Pos logit contributions
bes+0.592
antly+0.570
vered+0.556
es+0.546
e+0.533
Neg logit contributions
ock-0.598
aff-0.595
weet-0.585
iki-0.543
aa-0.527
Token put
Token ID1234
Feature activation+9.39e-01

Pos logit contributions
bes+0.802
antly+0.762
vered+0.751
es+0.744
e+0.729
Neg logit contributions
ock-0.851
weet-0.843
aff-0.843
iki-0.780
aa-0.761
Token his
Token ID465
Feature activation+7.09e-01

Pos logit contributions
bes+1.491
vered+1.367
antly+1.367
es+1.353
house+1.316
Neg logit contributions
ock-1.805
aff-1.767
weet-1.735
iki-1.618
oto-1.602
Token father
Token ID2988
Feature activation+3.48e+00

Pos logit contributions
bes+1.027
antly+0.982
es+0.981
vered+0.951
house+0.928
Neg logit contributions
ock-1.411
aff-1.385
weet-1.375
iki-1.321
oto-1.294
Token's
Token ID338
Feature activation+1.71e+00

Pos logit contributions
bes+6.379
 Something+6.001
antly+5.993
 searches+5.773
ASH+5.612
Neg logit contributions
ock-6.407
weet-6.042
aff-5.576
aa-5.534
iki-5.410
Token jacket
Token ID15224
Feature activation+1.08e+00

Pos logit contributions
bes+2.555
es+2.532
antly+2.425
house+2.340
e+2.325
Neg logit contributions
ock-3.363
weet-3.196
aff-3.190
aa-3.152
iki-3.122
Token on
Token ID319
Feature activation+8.44e-01

Pos logit contributions
bes+1.593
antly+1.500
vered+1.496
 searches+1.435
 Something+1.428
Neg logit contributions
ock-2.115
aff-2.073
weet-2.021
aa-1.904
iki-1.897
Token it
Token ID340
Feature activation+9.20e-01

Pos logit contributions
bes+1.527
antly+1.442
vered+1.438
es+1.433
 wrist+1.363
Neg logit contributions
ock-1.404
aff-1.390
weet-1.358
iki-1.233
oto-1.225
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
bes+1.482
antly+1.419
vered+1.399
es+1.352
e+1.332
Neg logit contributions
ock-1.695
aff-1.674
weet-1.652
iki-1.504
aa-1.479
Token opened
Token ID4721
Feature activation+1.16e+00

Pos logit contributions
bes+1.530
antly+1.426
vered+1.401
es+1.361
 kit+1.358
Neg logit contributions
aff-1.846
ock-1.824
weet-1.811
iki-1.721
aa-1.651
Token the
Token ID262
Feature activation+2.21e+00

Pos logit contributions
bes+2.328
antly+2.164
vered+2.147
es+2.135
e+2.100
Neg logit contributions
aff-1.699
ock-1.683
weet-1.657
iki-1.566
oto-1.484
Token box
Token ID3091
Feature activation+2.68e-01

Pos logit contributions
bes+3.070
vered+2.726
es+2.671
e+2.636
antly+2.588
Neg logit contributions
ock-4.914
aff-4.824
weet-4.707
iki-4.538
andel-4.402
Token and
Token ID290
Feature activation+1.55e+00

Pos logit contributions
bes+0.557
antly+0.535
vered+0.529
es+0.519
e+0.514
Neg logit contributions
aff-0.370
ock-0.366
weet-0.360
iki-0.330
oto-0.314
Token to
Token ID284
Feature activation+2.55e-01

Pos logit contributions
bes+2.624
vered+2.367
antly+2.332
e+2.279
 wrist+2.231
Neg logit contributions
ock-2.792
weet-2.782
aff-2.782
iki-2.652
aa-2.540
Token her
Token ID607
Feature activation+3.48e+00

Pos logit contributions
bes+0.456
antly+0.437
vered+0.429
es+0.427
e+0.410
Neg logit contributions
aff-0.428
ock-0.428
weet-0.426
iki-0.393
oto-0.381
Token surprise
Token ID5975
Feature activation+1.24e+00

Pos logit contributions
bes+7.698
es+7.083
vered+6.977
nob+6.775
antly+6.713
Neg logit contributions
iki-5.075
aff-4.609
weet-4.572
 tour-4.560
 storm-4.349
Token,
Token ID11
Feature activation+7.42e-01

Pos logit contributions
bes+2.436
vered+2.280
antly+2.268
e+2.194
nob+2.175
Neg logit contributions
aff-1.921
ock-1.878
weet-1.812
iki-1.778
oto-1.665
Token it
Token ID340
Feature activation+1.91e+00

Pos logit contributions
bes+1.474
antly+1.432
vered+1.406
es+1.372
e+1.365
Neg logit contributions
aff-1.071
weet-1.070
ock-1.060
iki-0.972
oto-0.927
Token was
Token ID373
Feature activation+2.69e-01

Pos logit contributions
bes+2.892
vered+2.687
 wrist+2.684
e+2.647
antly+2.506
Neg logit contributions
aff-3.711
ock-3.654
iki-3.524
weet-3.520
oto-3.314
Token full
Token ID1336
Feature activation-3.90e-01

Pos logit contributions
bes+0.426
antly+0.404
vered+0.401
es+0.394
e+0.382
Neg logit contributions
aff-0.509
ock-0.506
weet-0.502
iki-0.468
oto-0.457
Token They
Token ID1119
Feature activation-1.37e+00

Pos logit contributions
aff+0.142
ock+0.141
weet+0.139
iki+0.129
oto+0.125
Neg logit contributions
bes-0.154
antly-0.148
vered-0.145
es-0.144
e-0.141
Token all
Token ID477
Feature activation-1.24e+00

Pos logit contributions
aff+2.580
ock+2.568
weet+2.540
arl+2.288
andel+2.285
Neg logit contributions
es-2.174
bes-2.168
antly-2.148
vered-2.113
 kit-2.026
Token agreed
Token ID4987
Feature activation-1.81e-01

Pos logit contributions
aff+2.383
ock+2.320
weet+2.317
arl+2.109
iki+2.101
Neg logit contributions
bes-1.913
vered-1.897
antly-1.871
es-1.825
e-1.806
Token and
Token ID290
Feature activation-1.81e-01

Pos logit contributions
aff+0.283
ock+0.280
weet+0.277
iki+0.254
oto+0.245
Neg logit contributions
bes-0.345
antly-0.333
vered-0.326
es-0.324
e-0.317
Token helped
Token ID4193
Feature activation+8.15e-01

Pos logit contributions
aff+0.293
ock+0.291
weet+0.286
iki+0.263
oto+0.255
Neg logit contributions
bes-0.337
antly-0.326
vered-0.319
es-0.317
e-0.309
Token her
Token ID607
Feature activation+3.46e+00

Pos logit contributions
bes+1.639
antly+1.581
vered+1.554
es+1.517
nob+1.479
Neg logit contributions
aff-1.219
ock-1.181
weet-1.160
aa-1.061
iki-1.056
Token open
Token ID1280
Feature activation+2.31e+00

Pos logit contributions
bes+7.371
vered+6.695
ASH+6.582
antly+6.498
ograph+6.379
Neg logit contributions
iki-5.320
aff-5.107
 tour-5.037
ock-5.014
aa-4.882
Token the
Token ID262
Feature activation+1.82e+00

Pos logit contributions
bes+4.952
antly+4.687
vered+4.540
e+4.403
 Something+4.320
Neg logit contributions
aff-3.263
ock-3.142
weet-3.076
iki-2.913
andel-2.733
Token chest
Token ID7721
Feature activation+1.06e+00

Pos logit contributions
bes+1.749
vered+1.572
antly+1.554
e+1.501
es+1.457
Neg logit contributions
aff-4.717
ock-4.690
weet-4.585
oto-4.337
iki-4.311
Token.
Token ID13
Feature activation+2.04e-01

Pos logit contributions
bes+1.782
vered+1.678
antly+1.666
e+1.615
es+1.573
Neg logit contributions
aff-1.906
ock-1.879
weet-1.846
iki-1.752
oto-1.672
Token Inside
Token ID14384
Feature activation-4.21e-01

Pos logit contributions
bes+0.386
antly+0.372
vered+0.368
es+0.362
e+0.358
Neg logit contributions
aff-0.322
ock-0.314
weet-0.311
iki-0.285
aa-0.277
Token used
Token ID973
Feature activation+2.97e-01

Pos logit contributions
bes+1.240
antly+1.179
vered+1.145
es+1.128
nob+1.106
Neg logit contributions
aff-1.234
ock-1.229
weet-1.216
iki-1.145
aa-1.120
Token the
Token ID262
Feature activation+1.15e+00

Pos logit contributions
bes+0.536
antly+0.515
vered+0.503
es+0.500
e+0.487
Neg logit contributions
ock-0.492
aff-0.490
weet-0.486
iki-0.446
aa-0.433
Token glove
Token ID29144
Feature activation+5.37e-01

Pos logit contributions
bes+1.780
es+1.724
vered+1.688
antly+1.671
e+1.627
Neg logit contributions
ock-2.287
weet-2.207
aff-2.131
oto-2.094
iki-2.077
Token to
Token ID284
Feature activation-2.79e-01

Pos logit contributions
bes+1.084
antly+1.041
vered+1.026
es+1.004
e+1.000
Neg logit contributions
ock-0.757
aff-0.757
weet-0.742
iki-0.682
aa-0.661
Token clean
Token ID3424
Feature activation+1.68e+00

Pos logit contributions
aff+0.597
ock+0.588
weet+0.583
iki+0.550
oto+0.536
Neg logit contributions
bes-0.372
antly-0.352
vered-0.342
es-0.336
e-0.332
Token her
Token ID607
Feature activation+3.45e+00

Pos logit contributions
bes+3.024
antly+2.943
vered+2.943
house+2.820
es+2.812
Neg logit contributions
ock-2.786
weet-2.633
aff-2.629
oto-2.396
iki-2.373
Token room
Token ID2119
Feature activation+2.89e-01

Pos logit contributions
vered+5.607
antly+5.563
es+5.461
nob+5.302
bes+5.059
Neg logit contributions
aff-6.189
oto-6.159
ock-6.083
weet-6.027
aa-5.873
Token.
Token ID13
Feature activation-2.99e-01

Pos logit contributions
bes+0.493
antly+0.477
vered+0.467
es+0.459
e+0.452
Neg logit contributions
aff-0.499
ock-0.499
weet-0.488
iki-0.452
oto-0.440
Token She
Token ID1375
Feature activation+3.12e-01

Pos logit contributions
aff+0.478
ock+0.476
weet+0.466
iki+0.433
oto+0.420
Neg logit contributions
bes-0.558
antly-0.540
vered-0.528
es-0.523
e-0.510
Token wiped
Token ID21122
Feature activation+4.74e-01

Pos logit contributions
bes+0.476
antly+0.457
vered+0.445
es+0.437
e+0.427
Neg logit contributions
ock-0.597
aff-0.596
weet-0.587
iki-0.552
aa-0.538
Token the
Token ID262
Feature activation+1.46e+00

Pos logit contributions
bes+0.860
antly+0.807
vered+0.799
es+0.789
nob+0.771
Neg logit contributions
ock-0.794
aff-0.784
weet-0.768
iki-0.704
aa-0.691
Token\n
Token ID198
Feature activation-9.07e-02

Pos logit contributions
aff+2.187
ock+2.169
weet+2.131
iki+2.029
aa+1.975
Neg logit contributions
bes-1.326
antly-1.247
es-1.216
e-1.179
vered-1.163
TokenMom
Token ID29252
Feature activation+8.09e-02

Pos logit contributions
aff+0.188
ock+0.186
weet+0.184
iki+0.173
oto+0.169
Neg logit contributions
bes-0.128
antly-0.123
vered-0.119
es-0.118
e-0.114
Token smiles
Token ID21845
Feature activation+1.42e-02

Pos logit contributions
bes+0.121
antly+0.116
vered+0.113
es+0.111
e+0.108
Neg logit contributions
aff-0.160
ock-0.159
weet-0.157
iki-0.147
oto-0.143
Token and
Token ID290
Feature activation+3.34e-01

Pos logit contributions
bes+0.028
antly+0.027
vered+0.027
es+0.026
e+0.026
Neg logit contributions
aff-0.021
ock-0.021
weet-0.021
iki-0.019
oto-0.018
Token wipes
Token ID46142
Feature activation+3.58e-01

Pos logit contributions
bes+0.560
antly+0.537
vered+0.528
es+0.515
e+0.512
Neg logit contributions
aff-0.600
ock-0.597
weet-0.590
iki-0.549
oto-0.535
Token Mia
Token ID32189
Feature activation+3.44e+00

Pos logit contributions
bes+0.672
antly+0.637
vered+0.629
es+0.617
nob+0.607
Neg logit contributions
aff-0.577
ock-0.571
weet-0.559
iki-0.517
aa-0.500
Token's
Token ID338
Feature activation+1.83e+00

Pos logit contributions
bes+3.687
antly+3.353
 Hide+3.116
vered+3.089
ASH+3.074
Neg logit contributions
ock-8.501
weet-8.414
aa-8.053
aff-7.861
iki-7.480
Token face
Token ID1986
Feature activation+6.60e-01

Pos logit contributions
es+2.755
bes+2.693
vered+2.683
antly+2.640
house+2.531
Neg logit contributions
ock-3.541
weet-3.469
aff-3.408
arl-3.220
oto-3.213
Token.
Token ID13
Feature activation-7.62e-01

Pos logit contributions
bes+1.059
vered+0.990
antly+0.990
es+0.949
e+0.937
Neg logit contributions
ock-1.223
aff-1.221
weet-1.196
iki-1.102
aa-1.090
Token She
Token ID1375
Feature activation-2.43e-01

Pos logit contributions
ock+1.272
aff+1.264
weet+1.222
iki+1.131
oto+1.108
Neg logit contributions
bes-1.379
antly-1.343
es-1.306
vered-1.294
 Something-1.271
Token throws
Token ID12542
Feature activation+4.03e-01

Pos logit contributions
aff+0.449
ock+0.445
weet+0.440
iki+0.409
aa+0.397
Neg logit contributions
bes-0.393
antly-0.378
vered-0.368
es-0.366
e-0.355
Token opens
Token ID9808
Feature activation+1.35e+00

Pos logit contributions
bes+2.683
vered+2.507
antly+2.473
e+2.400
 kit+2.364
Neg logit contributions
aff-3.366
ock-3.345
weet-3.342
iki-3.225
aa-3.064
Token her
Token ID607
Feature activation+2.09e+00

Pos logit contributions
bes+2.787
vered+2.654
antly+2.641
es+2.575
house+2.550
Neg logit contributions
aff-1.888
ock-1.873
weet-1.831
iki-1.702
oto-1.686
Token eyes
Token ID2951
Feature activation+4.55e-01

Pos logit contributions
vered+3.917
bes+3.893
es+3.733
antly+3.719
e+3.616
Neg logit contributions
aff-3.222
weet-3.187
ock-3.117
iki-3.112
aa-2.999
Token and
Token ID290
Feature activation+2.01e+00

Pos logit contributions
bes+0.955
vered+0.907
antly+0.906
es+0.882
e+0.878
Neg logit contributions
aff-0.628
ock-0.627
weet-0.610
iki-0.549
aa-0.529
Token sees
Token ID7224
Feature activation+1.79e-01

Pos logit contributions
bes+3.282
vered+3.153
antly+3.066
e+2.974
house+2.971
Neg logit contributions
weet-3.622
ock-3.549
aff-3.499
iki-3.381
aa-3.279
Token Tom
Token ID4186
Feature activation+3.44e+00

Pos logit contributions
bes+0.314
antly+0.303
vered+0.296
es+0.290
e+0.285
Neg logit contributions
aff-0.307
ock-0.305
weet-0.301
iki-0.278
oto-0.271
Token.
Token ID13
Feature activation-1.48e-01

Pos logit contributions
bes+6.541
antly+6.342
vered+6.171
ASH+6.139
 Something+5.812
Neg logit contributions
weet-5.953
ock-5.874
aff-5.553
iki-5.346
aa-5.042
Token She
Token ID1375
Feature activation+1.40e+00

Pos logit contributions
aff+0.252
ock+0.251
weet+0.247
iki+0.229
oto+0.221
Neg logit contributions
bes-0.261
antly-0.251
vered-0.245
es-0.243
e-0.237
Token rub
Token ID6437
Feature activation+7.56e-01

Pos logit contributions
bes+2.089
antly+1.992
vered+1.981
e+1.897
 kit+1.896
Neg logit contributions
aff-2.739
weet-2.687
ock-2.660
iki-2.573
oto-2.458
Tokens
Token ID82
Feature activation+2.30e+00

Pos logit contributions
bes+1.229
antly+1.134
vered+1.130
nob+1.109
 wrist+1.094
Neg logit contributions
ock-1.408
aff-1.392
weet-1.354
aa-1.280
iki-1.273
Token her
Token ID607
Feature activation+2.23e+00

Pos logit contributions
bes+4.490
house+4.217
vered+4.160
antly+4.026
ograph+4.021
Neg logit contributions
ock-3.726
weet-3.444
aa-3.177
aff-3.166
oto-3.163
Token asked
Token ID1965
Feature activation-1.59e-01

Pos logit contributions
bes+1.798
antly+1.651
vered+1.609
 wrist+1.606
es+1.558
Neg logit contributions
aff-2.224
ock-2.212
weet-2.175
iki-2.091
aa-2.041
Token her
Token ID607
Feature activation+5.11e-01

Pos logit contributions
aff+0.271
ock+0.268
weet+0.265
iki+0.246
oto+0.237
Neg logit contributions
bes-0.284
antly-0.274
vered-0.268
es-0.267
e-0.260
Token dad
Token ID9955
Feature activation+2.29e-01

Pos logit contributions
bes+0.868
antly+0.823
vered+0.815
e+0.785
es+0.779
Neg logit contributions
aff-0.931
ock-0.917
weet-0.910
iki-0.864
oto-0.841
Token to
Token ID284
Feature activation+1.71e+00

Pos logit contributions
bes+0.440
antly+0.420
vered+0.413
es+0.403
e+0.399
Neg logit contributions
aff-0.359
ock-0.355
weet-0.350
iki-0.322
oto-0.312
Token help
Token ID1037
Feature activation+1.67e+00

Pos logit contributions
bes+3.251
antly+3.042
es+3.029
vered+3.011
house+2.730
Neg logit contributions
ock-2.932
weet-2.866
aff-2.736
iki-2.601
oto-2.597
Token her
Token ID607
Feature activation+3.43e+00

Pos logit contributions
bes+3.401
antly+3.354
vered+3.323
es+3.142
 wrist+3.070
Neg logit contributions
ock-2.470
aff-2.369
weet-2.333
aa-2.114
oto-2.049
Token remove
Token ID4781
Feature activation+7.48e-01

Pos logit contributions
bes+7.686
vered+7.099
antly+6.857
ASH+6.849
es+6.740
Neg logit contributions
ock-4.836
 tour-4.314
oto-4.266
weet-4.263
iki-4.240
Token the
Token ID262
Feature activation+2.17e+00

Pos logit contributions
bes+1.416
antly+1.361
vered+1.327
es+1.301
e+1.286
Neg logit contributions
ock-1.183
aff-1.176
weet-1.136
iki-1.040
andel-1.016
Token lid
Token ID19789
Feature activation-1.23e-01

Pos logit contributions
bes+2.971
antly+2.871
vered+2.866
e+2.811
es+2.718
Neg logit contributions
ock-4.662
aff-4.642
weet-4.459
 tour-4.214
 storm-4.156
Token of
Token ID286
Feature activation+3.46e-01

Pos logit contributions
aff+0.237
ock+0.235
weet+0.233
iki+0.218
oto+0.212
Neg logit contributions
bes-0.189
antly-0.181
vered-0.176
es-0.175
e-0.170
Token the
Token ID262
Feature activation+2.16e+00

Pos logit contributions
bes+0.626
antly+0.610
vered+0.594
es+0.584
e+0.572
Neg logit contributions
aff-0.571
ock-0.569
weet-0.563
iki-0.514
oto-0.498
Token and
Token ID290
Feature activation+1.19e+00

Pos logit contributions
aff+0.355
ock+0.351
weet+0.346
iki+0.317
oto+0.306
Neg logit contributions
bes-0.453
antly-0.438
vered-0.429
es-0.427
e-0.417
Token placed
Token ID4624
Feature activation+4.75e-01

Pos logit contributions
bes+2.408
antly+2.280
vered+2.275
e+2.190
nob+2.187
Neg logit contributions
aff-1.704
weet-1.662
ock-1.654
iki-1.527
aa-1.502
Token something
Token ID1223
Feature activation+1.40e+00

Pos logit contributions
bes+0.808
antly+0.764
vered+0.754
es+0.739
nob+0.723
Neg logit contributions
aff-0.839
ock-0.837
weet-0.823
iki-0.764
oto-0.750
Token into
Token ID656
Feature activation+9.95e-01

Pos logit contributions
bes+2.519
vered+2.333
antly+2.315
e+2.275
es+2.272
Neg logit contributions
aff-2.376
ock-2.351
weet-2.237
iki-2.127
oto-2.111
Token his
Token ID465
Feature activation+1.90e+00

Pos logit contributions
bes+1.799
antly+1.728
vered+1.704
es+1.656
nob+1.630
Neg logit contributions
weet-1.650
ock-1.642
aff-1.637
iki-1.464
oto-1.462
Token son
Token ID3367
Feature activation+3.43e+00

Pos logit contributions
es+2.558
bes+2.535
antly+2.487
vered+2.453
nob+2.358
Neg logit contributions
aff-3.935
ock-3.865
weet-3.849
aa-3.738
iki-3.714
Token's
Token ID338
Feature activation+2.12e+00

Pos logit contributions
bes+5.070
antly+4.838
vered+4.805
ASH+4.466
 searches+4.260
Neg logit contributions
aff-7.313
weet-7.308
ock-7.099
aa-6.804
iki-6.712
Token hand
Token ID1021
Feature activation+1.31e+00

Pos logit contributions
es+2.884
antly+2.819
bes+2.811
vered+2.667
nob+2.602
Neg logit contributions
aff-4.392
ock-4.314
weet-4.259
aa-4.152
andel-4.080
Token,
Token ID11
Feature activation+6.11e-01

Pos logit contributions
bes+2.284
vered+2.198
antly+2.160
 searches+2.080
es+2.055
Neg logit contributions
aff-2.261
ock-2.247
weet-2.212
iki-2.018
aa-2.000
Token they
Token ID484
Feature activation-7.93e-02

Pos logit contributions
bes+1.110
antly+1.071
vered+1.063
es+1.038
e+1.010
Neg logit contributions
aff-0.999
weet-0.992
ock-0.985
iki-0.903
oto-0.883
Token looked
Token ID3114
Feature activation-1.12e-01

Pos logit contributions
aff+0.137
ock+0.136
weet+0.134
iki+0.124
oto+0.120
Neg logit contributions
bes-0.138
antly-0.132
vered-0.130
es-0.128
e-0.125
Token her
Token ID607
Feature activation+5.51e-01

Pos logit contributions
aff+0.559
ock+0.550
weet+0.544
iki+0.508
oto+0.488
Neg logit contributions
bes-0.565
antly-0.548
vered-0.536
es-0.535
e-0.520
Token mom
Token ID1995
Feature activation-1.71e-01

Pos logit contributions
bes+0.945
antly+0.895
vered+0.886
e+0.857
es+0.849
Neg logit contributions
aff-0.998
ock-0.987
weet-0.977
iki-0.923
oto-0.904
Tokenmy
Token ID1820
Feature activation+2.98e-01

Pos logit contributions
aff+0.271
ock+0.267
weet+0.264
iki+0.243
oto+0.235
Neg logit contributions
bes-0.322
antly-0.312
vered-0.305
es-0.304
e-0.296
Token to
Token ID284
Feature activation+1.97e+00

Pos logit contributions
bes+0.619
antly+0.594
vered+0.585
es+0.572
e+0.569
Neg logit contributions
aff-0.418
ock-0.417
weet-0.409
iki-0.368
oto-0.356
Token help
Token ID1037
Feature activation+1.60e+00

Pos logit contributions
bes+3.626
antly+3.460
es+3.449
vered+3.436
ASH+3.015
Neg logit contributions
ock-3.533
weet-3.366
aff-3.186
oto-2.991
arl-2.966
Token her
Token ID607
Feature activation+3.42e+00

Pos logit contributions
bes+3.168
antly+3.120
vered+3.105
es+2.957
 Something+2.850
Neg logit contributions
ock-2.431
aff-2.320
weet-2.300
aa-2.101
andel-2.010
Token order
Token ID1502
Feature activation+6.49e-01

Pos logit contributions
bes+7.290
vered+6.929
antly+6.847
es+6.717
ASH+6.576
Neg logit contributions
ock-5.227
weet-4.546
aff-4.290
oto-4.186
 tour-4.142
Token a
Token ID257
Feature activation+1.28e+00

Pos logit contributions
bes+1.102
antly+1.051
vered+1.048
es+1.016
e+1.004
Neg logit contributions
ock-1.152
aff-1.138
weet-1.115
iki-1.021
oto-1.009
Token new
Token ID649
Feature activation+1.40e+00

Pos logit contributions
bes+2.100
es+1.947
vered+1.938
antly+1.930
e+1.869
Neg logit contributions
ock-2.470
weet-2.418
aff-2.375
iki-2.227
aa-2.211
Token chair
Token ID5118
Feature activation-1.92e-01

Pos logit contributions
bes+1.951
antly+1.856
vered+1.821
es+1.808
e+1.725
Neg logit contributions
ock-3.005
weet-2.906
aff-2.819
iki-2.714
oto-2.692
Token.
Token ID13
Feature activation+8.31e-01

Pos logit contributions
aff+0.365
ock+0.362
weet+0.358
iki+0.335
oto+0.326
Neg logit contributions
bes-0.298
antly-0.285
vered-0.278
es-0.276
e-0.269
Token and
Token ID290
Feature activation+1.51e+00

Pos logit contributions
aff+0.386
ock+0.381
weet+0.377
iki+0.346
oto+0.334
Neg logit contributions
bes-0.489
antly-0.472
vered-0.463
es-0.460
e-0.449
Token gently
Token ID15165
Feature activation+1.58e+00

Pos logit contributions
bes+2.699
vered+2.524
antly+2.521
e+2.454
nob+2.428
Neg logit contributions
ock-2.478
weet-2.461
aff-2.402
iki-2.270
aa-2.269
Token wiped
Token ID21122
Feature activation+1.35e+00

Pos logit contributions
bes+2.483
antly+2.374
vered+2.330
es+2.276
 wrist+2.247
Neg logit contributions
ock-2.885
aff-2.836
weet-2.801
aa-2.736
iki-2.668
Token the
Token ID262
Feature activation+3.39e-01

Pos logit contributions
bes+2.439
vered+2.198
 wrist+2.179
es+2.178
house+2.174
Neg logit contributions
ock-2.360
aff-2.246
weet-2.200
iki-2.050
aa-2.047
Token little
Token ID1310
Feature activation+1.63e+00

Pos logit contributions
bes+0.539
antly+0.519
vered+0.508
es+0.507
e+0.493
Neg logit contributions
ock-0.633
aff-0.631
weet-0.622
iki-0.581
aa-0.567
Token girl
Token ID2576
Feature activation+3.41e+00

Pos logit contributions
bes+3.445
es+3.273
antly+3.257
vered+3.205
e+3.205
Neg logit contributions
ock-2.288
aff-2.207
weet-2.174
iki-2.031
aa-1.983
Token's
Token ID338
Feature activation+1.63e+00

Pos logit contributions
bes+4.428
ASH+4.047
antly+3.958
 Hide+3.938
es+3.786
Neg logit contributions
ock-7.620
weet-7.601
aa-7.141
aff-7.106
iki-6.970
Token skin
Token ID4168
Feature activation+1.08e+00

Pos logit contributions
bes+2.275
es+2.261
house+2.176
vered+2.144
antly+2.133
Neg logit contributions
ock-3.323
weet-3.222
aff-3.185
aa-3.113
iki-3.109
Token.
Token ID13
Feature activation+4.55e-01

Pos logit contributions
bes+1.772
antly+1.670
vered+1.647
 searches+1.600
es+1.595
Neg logit contributions
ock-1.992
aff-1.912
weet-1.900
aa-1.745
iki-1.744
Token \n
Token ID198
Feature activation-4.09e-01

Pos logit contributions
bes+0.743
antly+0.701
vered+0.697
es+0.682
e+0.670
Neg logit contributions
aff-0.841
weet-0.824
ock-0.820
iki-0.766
oto-0.744
Token \n
Token ID198
Feature activation+2.85e-02

Pos logit contributions
aff+0.865
ock+0.861
weet+0.850
iki+0.801
aa+0.780
Neg logit contributions
bes-0.547
antly-0.525
es-0.502
vered-0.501
e-0.487
Token toy
Token ID13373
Feature activation+9.96e-01

Pos logit contributions
bes+1.600
vered+1.523
antly+1.516
es+1.506
e+1.405
Neg logit contributions
aff-2.171
ock-2.151
weet-2.133
iki-1.927
oto-1.919
Token box
Token ID3091
Feature activation-5.66e-01

Pos logit contributions
bes+1.150
vered+1.088
antly+1.067
e+0.974
es+0.958
Neg logit contributions
aff-2.317
ock-2.309
weet-2.241
iki-2.102
oto-2.061
Token,
Token ID11
Feature activation+5.50e-02

Pos logit contributions
aff+1.005
ock+0.983
weet+0.972
iki+0.910
aa+0.888
Neg logit contributions
bes-0.962
antly-0.930
es-0.908
vered-0.898
house-0.885
Token and
Token ID290
Feature activation+7.49e-01

Pos logit contributions
bes+0.096
antly+0.093
vered+0.091
es+0.090
e+0.087
Neg logit contributions
aff-0.094
ock-0.094
weet-0.093
iki-0.086
oto-0.083
Token to
Token ID284
Feature activation-3.16e-01

Pos logit contributions
bes+1.452
antly+1.405
vered+1.385
e+1.349
es+1.322
Neg logit contributions
aff-1.123
weet-1.123
ock-1.120
iki-0.999
aa-0.983
Token her
Token ID607
Feature activation+3.40e+00

Pos logit contributions
aff+0.559
ock+0.550
weet+0.541
iki+0.507
aa+0.492
Neg logit contributions
bes-0.541
antly-0.520
vered-0.510
es-0.502
e-0.497
Token surprise
Token ID5975
Feature activation+1.07e+00

Pos logit contributions
bes+7.728
vered+7.069
antly+6.910
es+6.889
 elbows+6.695
Neg logit contributions
aff-4.612
iki-4.601
 tour-4.249
weet-4.247
ock-4.136
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
bes+2.005
vered+1.911
antly+1.898
e+1.811
es+1.810
Neg logit contributions
ock-1.728
aff-1.723
weet-1.652
iki-1.584
oto-1.535
Token the
Token ID262
Feature activation+2.02e+00

Pos logit contributions
bes+2.012
antly+1.971
vered+1.953
e+1.896
es+1.885
Neg logit contributions
weet-1.419
aff-1.401
ock-1.391
iki-1.264
oto-1.226
Token magnet
Token ID19972
Feature activation+1.58e+00

Pos logit contributions
bes+3.377
es+3.343
antly+3.307
vered+3.258
e+3.255
Neg logit contributions
ock-3.709
aff-3.702
weet-3.665
 hurricane-3.419
 wind-3.361
Token was
Token ID373
Feature activation+9.05e-01

Pos logit contributions
bes+2.604
antly+2.422
vered+2.406
e+2.326
 wrist+2.324
Neg logit contributions
aff-2.868
ock-2.798
weet-2.736
iki-2.652
aa-2.548
Token her
Token ID607
Feature activation+3.60e-01

Pos logit contributions
aff+1.318
ock+1.291
weet+1.280
iki+1.209
ules+1.162
Neg logit contributions
bes-1.190
antly-1.161
es-1.152
vered-1.132
e-1.110
Token mom
Token ID1995
Feature activation-4.95e-01

Pos logit contributions
bes+0.594
antly+0.567
vered+0.560
es+0.540
e+0.536
Neg logit contributions
aff-0.668
ock-0.657
weet-0.652
iki-0.614
oto-0.601
Tokenmy
Token ID1820
Feature activation-2.54e-01

Pos logit contributions
aff+0.830
ock+0.817
weet+0.807
iki+0.752
aa+0.726
Neg logit contributions
bes-0.866
antly-0.852
es-0.835
vered-0.822
e-0.804
Token to
Token ID284
Feature activation+1.21e+00

Pos logit contributions
aff+0.398
ock+0.394
weet+0.389
iki+0.359
oto+0.346
Neg logit contributions
bes-0.478
antly-0.465
vered-0.454
es-0.454
e-0.441
Token help
Token ID1037
Feature activation+1.26e+00

Pos logit contributions
bes+2.097
antly+1.998
vered+1.996
es+1.976
nob+1.787
Neg logit contributions
ock-2.191
weet-2.160
aff-2.143
oto-1.954
iki-1.943
Token her
Token ID607
Feature activation+3.39e+00

Pos logit contributions
bes+2.490
antly+2.455
vered+2.448
es+2.334
 wrist+2.238
Neg logit contributions
ock-1.872
aff-1.839
weet-1.815
aa-1.640
oto-1.590
Token.
Token ID13
Feature activation+9.14e-01

Pos logit contributions
bes+7.320
vered+7.141
antly+6.805
 elbows+6.625
ASH+6.584
Neg logit contributions
ock-4.742
aff-4.545
 tour-4.344
weet-4.306
oto-4.260
Token \n
Token ID198
Feature activation-1.15e-01

Pos logit contributions
bes+1.564
vered+1.523
antly+1.470
es+1.467
e+1.440
Neg logit contributions
aff-1.634
weet-1.568
ock-1.560
iki-1.456
aa-1.442
Token \n
Token ID198
Feature activation+1.36e+00

Pos logit contributions
aff+0.237
ock+0.236
weet+0.233
iki+0.219
oto+0.213
Neg logit contributions
bes-0.160
antly-0.153
vered-0.148
es-0.147
e-0.142
Token"
Token ID1
Feature activation+8.05e-01

Pos logit contributions
bes+2.252
vered+2.213
antly+2.154
es+2.132
 searches+2.076
Neg logit contributions
ock-2.495
aff-2.474
weet-2.472
aa-2.260
iki-2.232
TokenMom
Token ID29252
Feature activation+3.54e-01

Pos logit contributions
bes+1.052
vered+0.994
es+0.980
antly+0.978
e+0.935
Neg logit contributions
ock-1.770
aff-1.750
weet-1.739
iki-1.633
oto-1.614
Token sink
Token ID14595
Feature activation+7.63e-01

Pos logit contributions
bes+2.401
vered+2.322
es+2.301
antly+2.291
e+2.197
Neg logit contributions
weet-2.234
ock-2.228
aff-2.225
iki-2.020
oto-2.002
Token and
Token ID290
Feature activation+1.05e+00

Pos logit contributions
bes+1.591
antly+1.542
vered+1.533
es+1.477
e+1.472
Neg logit contributions
ock-1.042
aff-1.028
weet-1.003
iki-0.904
aa-0.888
Token mom
Token ID1995
Feature activation+1.57e+00

Pos logit contributions
bes+1.843
vered+1.768
antly+1.755
es+1.728
e+1.647
Neg logit contributions
ock-1.777
weet-1.758
aff-1.725
iki-1.609
aa-1.609
Tokenmy
Token ID1820
Feature activation+2.09e+00

Pos logit contributions
bes+3.417
vered+3.246
 wrist+3.154
antly+3.129
e+3.021
Neg logit contributions
ock-2.357
weet-2.181
aff-2.100
aa-1.967
ules-1.913
Token cleaned
Token ID20750
Feature activation+1.77e+00

Pos logit contributions
bes+3.976
vered+3.790
antly+3.735
 wrist+3.726
e+3.523
Neg logit contributions
ock-3.389
aff-3.184
weet-3.164
aa-3.135
ules-2.981
Token Lily
Token ID20037
Feature activation+3.39e+00

Pos logit contributions
bes+3.322
vered+3.201
antly+3.094
 wrist+3.070
es+3.031
Neg logit contributions
ock-2.830
aff-2.790
weet-2.634
aa-2.514
iki-2.391
Token's
Token ID338
Feature activation+2.57e+00

Pos logit contributions
antly+4.796
vered+4.659
 Something+4.448
bes+4.447
 Hide+4.330
Neg logit contributions
ock-6.969
weet-6.650
aff-6.353
aa-6.244
ashing-6.102
Token shirt
Token ID10147
Feature activation+3.81e-01

Pos logit contributions
antly+4.293
es+4.269
vered+4.099
e+3.856
bes+3.816
Neg logit contributions
aff-4.583
weet-4.425
ock-4.374
aa-4.322
oto-4.305
Token with
Token ID351
Feature activation+2.76e-01

Pos logit contributions
bes+0.579
antly+0.557
vered+0.554
es+0.532
e+0.527
Neg logit contributions
ock-0.734
aff-0.730
weet-0.712
iki-0.662
oto-0.653
Token soap
Token ID19533
Feature activation-2.66e-01

Pos logit contributions
bes+0.446
antly+0.428
vered+0.418
es+0.416
e+0.403
Neg logit contributions
ock-0.505
aff-0.504
weet-0.497
iki-0.460
aa-0.450
Token and
Token ID290
Feature activation+1.15e+00

Pos logit contributions
aff+0.533
ock+0.528
weet+0.526
iki+0.493
oto+0.479
Neg logit contributions
bes-0.388
antly-0.371
vered-0.359
es-0.359
e-0.345
Token bush
Token ID24484
Feature activation-1.25e+00

Pos logit contributions
ock+3.659
aff+3.614
weet+3.588
ules+3.339
iki+3.326
Neg logit contributions
bes-2.886
house-2.848
antly-2.716
 wrist-2.710
e-2.695
Token.
Token ID13
Feature activation-2.80e-01

Pos logit contributions
aff+2.765
ock+2.725
weet+2.702
iki+2.551
oto+2.506
Neg logit contributions
bes-1.587
house-1.516
antly-1.494
es-1.492
e-1.466
Token Every
Token ID3887
Feature activation-2.18e+00

Pos logit contributions
aff+0.528
ock+0.525
weet+0.514
iki+0.485
oto+0.471
Neg logit contributions
bes-0.443
antly-0.424
vered-0.414
es-0.410
e-0.403
Token day
Token ID1110
Feature activation+4.07e-01

Pos logit contributions
aff+3.384
ock+2.865
weet+2.855
ules+2.625
andel+2.500
Neg logit contributions
antly-4.625
e-4.528
house-4.438
bes-4.432
nob-4.344
Token,
Token ID11
Feature activation+3.08e-02

Pos logit contributions
bes+0.727
antly+0.684
vered+0.677
es+0.661
nob+0.647
Neg logit contributions
aff-0.693
ock-0.691
weet-0.685
iki-0.627
aa-0.612
Token lots
Token ID6041
Feature activation-4.04e+00

Pos logit contributions
bes+0.049
antly+0.047
vered+0.046
es+0.045
e+0.044
Neg logit contributions
aff-0.058
ock-0.057
weet-0.057
iki-0.053
oto-0.051
Token of
Token ID286
Feature activation-1.03e+00

Pos logit contributions
aff+7.431
arl+7.286
iki+6.900
oto+6.671
weet+6.608
Neg logit contributions
e-7.602
es-7.534
's-7.467
house-7.045
vered-6.686
Token different
Token ID1180
Feature activation-6.61e-01

Pos logit contributions
aff+1.817
ock+1.762
weet+1.720
iki+1.615
oto+1.576
Neg logit contributions
bes-1.770
vered-1.688
es-1.671
antly-1.665
e-1.664
Token animals
Token ID4695
Feature activation-1.24e+00

Pos logit contributions
aff+1.210
ock+1.209
weet+1.190
iki+1.104
oto+1.085
Neg logit contributions
bes-1.075
antly-1.020
vered-1.018
es-1.015
e-0.971
Token would
Token ID561
Feature activation-1.44e+00

Pos logit contributions
aff+2.517
ock+2.473
weet+2.448
iki+2.294
oto+2.280
Neg logit contributions
es-1.765
bes-1.761
e-1.706
vered-1.699
antly-1.691
Token gather
Token ID6431
Feature activation-1.69e-01

Pos logit contributions
aff+3.008
ock+2.953
weet+2.851
iki+2.747
aa+2.699
Neg logit contributions
bes-2.007
e-1.888
vered-1.812
nob-1.807
antly-1.802
TokenOne
Token ID3198
Feature activation-1.30e+00

Pos logit contributions
bes+1.683
vered+1.604
antly+1.567
 wrist+1.520
es+1.484
Neg logit contributions
aff-1.877
ock-1.856
weet-1.804
iki-1.685
oto-1.650
Token day
Token ID1110
Feature activation+2.73e-01

Pos logit contributions
aff+1.696
ock+1.523
weet+1.476
iki+1.334
ules+1.314
Neg logit contributions
bes-2.874
e-2.864
antly-2.847
house-2.818
vered-2.788
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
bes+0.480
antly+0.460
vered+0.451
es+0.442
nob+0.434
Neg logit contributions
aff-0.464
ock-0.459
weet-0.453
iki-0.422
aa-0.411
Token a
Token ID257
Feature activation-9.40e-01

Pos logit contributions
bes+1.619
antly+1.512
vered+1.468
es+1.467
 kit+1.428
Neg logit contributions
aff-1.897
weet-1.876
ock-1.871
iki-1.784
aa-1.728
Token tall
Token ID7331
Feature activation-2.41e+00

Pos logit contributions
ock+1.482
aff+1.480
weet+1.449
iki+1.318
ules+1.305
Neg logit contributions
bes-1.738
antly-1.696
e-1.632
house-1.630
vered-1.629
Token gir
Token ID37370
Feature activation-4.00e+00

Pos logit contributions
ock+4.929
weet+4.888
aff+4.794
ules+4.563
iki+4.438
Neg logit contributions
house-3.506
bes-3.393
 wrist-3.304
 wound-3.278
antly-3.256
Tokenaffe
Token ID21223
Feature activation+2.59e-01

Pos logit contributions
weet+5.364
ock+5.035
ules+4.889
add+4.870
iki+4.756
Neg logit contributions
es-8.902
e-8.885
oc-8.833
house-8.673
's-8.346
Token named
Token ID3706
Feature activation-1.36e+00

Pos logit contributions
bes+0.528
antly+0.508
vered+0.497
es+0.489
 wrist+0.480
Neg logit contributions
aff-0.372
ock-0.369
weet-0.364
iki-0.331
aa-0.319
Token Gig
Token ID19525
Feature activation-1.13e+00

Pos logit contributions
ock+2.213
aff+2.135
weet+2.088
iki+1.980
ules+1.938
Neg logit contributions
bes-2.450
es-2.387
antly-2.384
e-2.364
nob-2.347
Tokeni
Token ID72
Feature activation+1.93e-01

Pos logit contributions
ock+1.618
weet+1.582
aff+1.575
andel+1.375
ules+1.372
Neg logit contributions
bes-2.273
antly-2.229
es-2.216
vered-2.179
e-2.161
Token went
Token ID1816
Feature activation-2.28e+00

Pos logit contributions
bes+0.284
antly+0.270
vered+0.263
es+0.256
nob+0.250
Neg logit contributions
aff-0.385
ock-0.381
weet-0.377
iki-0.354
aa-0.345
Token a
Token ID257
Feature activation-5.22e-01

Pos logit contributions
bes+0.625
antly+0.602
vered+0.587
es+0.571
 kit+0.550
Neg logit contributions
ock-0.871
weet-0.864
aff-0.860
iki-0.795
oto-0.788
Token time
Token ID640
Feature activation+3.05e-01

Pos logit contributions
aff+0.722
ock+0.707
weet+0.696
iki+0.629
ules+0.609
Neg logit contributions
bes-1.091
antly-1.053
vered-1.034
e-1.027
es-1.021
Token there
Token ID612
Feature activation+1.37e+00

Pos logit contributions
bes+0.541
antly+0.520
vered+0.508
es+0.496
nob+0.487
Neg logit contributions
aff-0.516
ock-0.511
weet-0.506
iki-0.473
oto-0.454
Token was
Token ID373
Feature activation-2.96e-01

Pos logit contributions
bes+1.768
antly+1.547
vered+1.519
 wrist+1.494
 kit+1.469
Neg logit contributions
ock-3.065
aff-3.024
weet-2.979
iki-2.923
oto-2.810
Token a
Token ID257
Feature activation-8.76e-01

Pos logit contributions
aff+0.644
ock+0.636
weet+0.627
iki+0.594
aa+0.578
Neg logit contributions
bes-0.388
antly-0.368
vered-0.357
es-0.352
e-0.348
Token gir
Token ID37370
Feature activation-4.00e+00

Pos logit contributions
ock+1.333
aff+1.325
weet+1.296
iki+1.172
ules+1.153
Neg logit contributions
bes-1.681
antly-1.624
e-1.577
house-1.575
vered-1.570
Tokenaffe
Token ID21223
Feature activation-2.54e-01

Pos logit contributions
weet+4.726
arl+4.569
ock+4.444
add+4.362
iki+4.246
Neg logit contributions
e-9.514
es-9.370
house-9.050
oc-8.977
's-8.752
Token who
Token ID508
Feature activation+3.73e-01

Pos logit contributions
aff+0.412
ock+0.406
weet+0.400
iki+0.371
oto+0.359
Neg logit contributions
bes-0.468
antly-0.451
es-0.441
vered-0.441
e-0.433
Token loved
Token ID6151
Feature activation-1.68e+00

Pos logit contributions
bes+0.640
antly+0.617
vered+0.600
es+0.592
 wrist+0.578
Neg logit contributions
aff-0.650
ock-0.644
weet-0.634
iki-0.590
aa-0.575
Token to
Token ID284
Feature activation-7.83e-01

Pos logit contributions
ock+3.054
aff+3.001
weet+2.826
iki+2.808
arl+2.688
Neg logit contributions
bes-2.779
es-2.602
vered-2.554
e-2.553
 kit-2.550
Token get
Token ID651
Feature activation-3.59e-01

Pos logit contributions
aff+1.532
ock+1.517
weet+1.479
iki+1.408
aa+1.368
Neg logit contributions
bes-1.194
antly-1.105
e-1.102
vered-1.090
es-1.082
Token,
Token ID11
Feature activation+6.11e-01

Pos logit contributions
bes+0.095
antly+0.092
vered+0.090
es+0.088
e+0.086
Neg logit contributions
aff-0.097
ock-0.097
weet-0.095
iki-0.089
oto-0.086
Token there
Token ID612
Feature activation+1.38e+00

Pos logit contributions
bes+0.813
antly+0.755
vered+0.743
es+0.717
nob+0.700
Neg logit contributions
ock-1.299
aff-1.295
weet-1.287
iki-1.221
aa-1.183
Token was
Token ID373
Feature activation-2.52e-01

Pos logit contributions
bes+1.769
antly+1.560
vered+1.550
 wrist+1.499
nob+1.469
Neg logit contributions
ock-3.077
aff-3.006
weet-2.990
iki-2.910
oto-2.816
Token a
Token ID257
Feature activation-7.81e-01

Pos logit contributions
aff+0.546
ock+0.541
weet+0.533
iki+0.505
aa+0.492
Neg logit contributions
bes-0.330
antly-0.313
vered-0.303
es-0.300
e-0.294
Token baby
Token ID5156
Feature activation-2.26e-01

Pos logit contributions
ock+1.192
aff+1.188
weet+1.161
iki+1.055
ules+1.032
Neg logit contributions
bes-1.494
antly-1.441
es-1.399
e-1.398
vered-1.397
Token gir
Token ID37370
Feature activation-3.95e+00

Pos logit contributions
aff+0.374
ock+0.370
weet+0.365
iki+0.337
oto+0.327
Neg logit contributions
bes-0.410
antly-0.396
vered-0.386
es-0.386
e-0.378
Tokenaffe
Token ID21223
Feature activation-1.39e-01

Pos logit contributions
weet+4.578
 hurricane+4.377
ock+4.350
arl+4.258
add+4.139
Neg logit contributions
es-9.692
e-9.580
house-8.852
bes-8.694
oc-8.652
Token named
Token ID3706
Feature activation-1.40e+00

Pos logit contributions
aff+0.217
ock+0.215
weet+0.211
iki+0.195
oto+0.188
Neg logit contributions
bes-0.263
antly-0.254
vered-0.249
es-0.247
e-0.243
Token Pr
Token ID1736
Feature activation-2.29e+00

Pos logit contributions
ock+2.428
aff+2.341
weet+2.297
iki+2.146
ules+2.101
Neg logit contributions
bes-2.431
antly-2.318
es-2.287
e-2.273
nob-2.235
Tokenune
Token ID1726
Feature activation-1.23e+00

Pos logit contributions
weet+3.831
ock+3.781
aff+3.647
 hurricane+3.489
ules+3.338
Neg logit contributions
antly-4.076
es-3.952
bes-3.905
e-3.835
 kit-3.766
Token.
Token ID13
Feature activation-9.64e-01

Pos logit contributions
aff+2.783
weet+2.782
ock+2.777
andel+2.570
arl+2.570
Neg logit contributions
bes-1.472
es-1.445
antly-1.413
vered-1.369
e-1.364
Token girl
Token ID2576
Feature activation+2.15e-01

Pos logit contributions
weet+6.205
ock+6.176
aff+5.934
ules+5.836
iki+5.772
Neg logit contributions
house-4.367
 kit-4.215
 wrist-4.207
's-4.132
e-4.095
Token named
Token ID3706
Feature activation-1.09e+00

Pos logit contributions
bes+0.458
antly+0.445
vered+0.438
es+0.430
nob+0.420
Neg logit contributions
aff-0.289
ock-0.286
weet-0.283
iki-0.256
aa-0.246
Token Mary
Token ID5335
Feature activation-4.01e-02

Pos logit contributions
ock+1.771
aff+1.731
weet+1.668
ules+1.562
iki+1.552
Neg logit contributions
bes-2.059
antly-1.958
house-1.935
es-1.933
nob-1.922
Token.
Token ID13
Feature activation-8.20e-01

Pos logit contributions
aff+0.080
ock+0.080
weet+0.079
iki+0.074
oto+0.072
Neg logit contributions
bes-0.059
antly-0.056
vered-0.054
es-0.054
e-0.052
Token Mary
Token ID5335
Feature activation+8.46e-01

Pos logit contributions
aff+1.347
ock+1.344
weet+1.299
iki+1.181
aa+1.168
Neg logit contributions
bes-1.492
antly-1.457
es-1.433
vered-1.403
nob-1.389
Token dreamed
Token ID27947
Feature activation-3.93e+00

Pos logit contributions
bes+1.483
antly+1.457
vered+1.431
es+1.373
 kit+1.341
Neg logit contributions
aff-1.407
ock-1.393
weet-1.374
iki-1.292
oto-1.253
Token of
Token ID286
Feature activation-9.10e-01

Pos logit contributions
weet+7.192
aff+7.185
iki+6.941
ock+6.352
aph+6.347
Neg logit contributions
es-7.314
antly-7.142
bes-7.017
 kit-6.952
's-6.639
Token finding
Token ID4917
Feature activation+2.94e-01

Pos logit contributions
aff+1.879
ock+1.850
weet+1.773
iki+1.729
aa+1.688
Neg logit contributions
bes-1.303
es-1.185
e-1.176
antly-1.174
vered-1.169
Token a
Token ID257
Feature activation+4.55e-02

Pos logit contributions
bes+0.483
antly+0.463
vered+0.452
es+0.442
e+0.432
Neg logit contributions
aff-0.545
ock-0.542
weet-0.536
iki-0.494
oto-0.485
Token diamond
Token ID15291
Feature activation-6.44e-01

Pos logit contributions
bes+0.088
antly+0.085
vered+0.084
es+0.083
e+0.081
Neg logit contributions
aff-0.070
ock-0.069
weet-0.068
iki-0.062
oto-0.060
Token.
Token ID13
Feature activation-2.33e-01

Pos logit contributions
aff+1.357
ock+1.344
weet+1.338
iki+1.249
ules+1.231
Neg logit contributions
bes-0.881
antly-0.835
es-0.819
vered-0.799
house-0.794
Token a
Token ID257
Feature activation-4.12e-01

Pos logit contributions
bes+1.081
antly+1.046
vered+1.026
es+0.983
 kit+0.962
Neg logit contributions
ock-1.405
weet-1.399
aff-1.368
oto-1.268
iki-1.266
Token time
Token ID640
Feature activation+2.31e-01

Pos logit contributions
aff+0.560
ock+0.549
weet+0.541
iki+0.488
ules+0.472
Neg logit contributions
bes-0.868
antly-0.839
vered-0.823
es-0.814
e-0.813
Token there
Token ID612
Feature activation+1.33e+00

Pos logit contributions
bes+0.406
antly+0.390
vered+0.380
es+0.373
e+0.366
Neg logit contributions
aff-0.395
ock-0.391
weet-0.387
iki-0.362
oto-0.348
Token was
Token ID373
Feature activation+9.78e-03

Pos logit contributions
bes+1.715
antly+1.506
vered+1.472
 wrist+1.438
 kit+1.421
Neg logit contributions
ock-2.970
aff-2.933
weet-2.895
iki-2.832
oto-2.731
Token a
Token ID257
Feature activation-7.17e-01

Pos logit contributions
bes+0.013
antly+0.013
vered+0.012
es+0.012
e+0.012
Neg logit contributions
aff-0.021
ock-0.020
weet-0.020
iki-0.019
oto-0.019
Token gir
Token ID37370
Feature activation-3.88e+00

Pos logit contributions
ock+1.076
aff+1.073
weet+1.046
iki+0.951
ules+0.930
Neg logit contributions
bes-1.393
antly-1.347
vered-1.304
es-1.302
e-1.301
Tokenaffe
Token ID21223
Feature activation-2.16e-01

Pos logit contributions
weet+4.465
arl+4.310
ock+4.294
add+4.204
iki+4.057
Neg logit contributions
e-9.235
es-9.214
house-8.825
oc-8.758
's-8.552
Token named
Token ID3706
Feature activation-1.54e+00

Pos logit contributions
aff+0.348
ock+0.344
weet+0.339
iki+0.313
oto+0.303
Neg logit contributions
bes-0.401
antly-0.387
vered-0.378
es-0.378
e-0.371
Token Sam
Token ID3409
Feature activation-8.46e-01

Pos logit contributions
ock+2.565
aff+2.446
weet+2.408
iki+2.257
aa+2.205
Neg logit contributions
bes-2.810
antly-2.673
e-2.653
es-2.648
house-2.631
Token and
Token ID290
Feature activation-1.19e+00

Pos logit contributions
aff+1.840
ock+1.786
weet+1.781
iki+1.688
andel+1.666
Neg logit contributions
bes-1.099
antly-1.037
es-1.028
e-1.022
vered-1.009
Token a
Token ID257
Feature activation-1.41e+00

Pos logit contributions
aff+2.459
ock+2.380
weet+2.284
andel+2.176
iki+2.174
Neg logit contributions
bes-1.735
es-1.635
e-1.632
antly-1.627
vered-1.623
Token clock
Token ID8801
Feature activation-3.08e-02

Pos logit contributions
bes+1.061
antly+1.013
vered+1.010
es+0.979
e+0.944
Neg logit contributions
ock-1.301
weet-1.282
aff-1.264
iki-1.154
oto-1.149
Token went
Token ID1816
Feature activation-7.70e-01

Pos logit contributions
aff+0.044
ock+0.043
weet+0.043
iki+0.039
oto+0.037
Neg logit contributions
bes-0.063
antly-0.061
vered-0.060
es-0.059
e-0.058
Token "
Token ID366
Feature activation+1.37e-01

Pos logit contributions
aff+1.575
ock+1.536
weet+1.528
iki+1.469
oto+1.411
Neg logit contributions
bes-1.118
antly-1.073
es-1.035
nob-1.028
 kit-1.027
Tokentick
Token ID42298
Feature activation-1.20e+00

Pos logit contributions
bes+0.201
antly+0.191
vered+0.186
es+0.184
e+0.177
Neg logit contributions
aff-0.272
ock-0.271
weet-0.268
iki-0.250
aa-0.244
Token-
Token ID12
Feature activation-1.40e+00

Pos logit contributions
aff+2.495
weet+2.447
ock+2.429
iki+2.418
oto+2.323
Neg logit contributions
bes-1.686
es-1.615
e-1.578
antly-1.556
vered-1.547
Tokent
Token ID83
Feature activation-3.84e+00

Pos logit contributions
aff+2.202
ock+2.154
weet+2.145
iki+2.088
oto+1.985
Neg logit contributions
bes-2.669
e-2.581
es-2.576
antly-2.565
vered-2.530
Tokenock
Token ID735
Feature activation-5.97e-01

Pos logit contributions
iki+5.730
aff+5.654
oto+5.064
ules+5.054
add+4.703
Neg logit contributions
e-8.820
es-8.513
nob-8.448
bes-8.342
house-8.267
Token,
Token ID11
Feature activation-1.59e+00

Pos logit contributions
aff+1.193
ock+1.172
weet+1.164
iki+1.104
oto+1.079
Neg logit contributions
bes-0.884
antly-0.840
es-0.830
vered-0.825
e-0.809
Token tick
Token ID4378
Feature activation-6.93e-01

Pos logit contributions
aff+3.168
ock+3.003
weet+2.935
iki+2.907
oto+2.844
Neg logit contributions
bes-2.449
e-2.321
antly-2.310
vered-2.296
es-2.272
Token-
Token ID12
Feature activation-1.24e+00

Pos logit contributions
aff+1.413
weet+1.392
ock+1.384
iki+1.324
oto+1.273
Neg logit contributions
bes-1.012
antly-0.963
es-0.955
vered-0.944
e-0.921
Tokent
Token ID83
Feature activation-3.27e+00

Pos logit contributions
aff+2.072
weet+1.993
ock+1.989
iki+1.944
oto+1.840
Neg logit contributions
bes-2.259
antly-2.188
e-2.180
es-2.174
vered-2.143
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
bes+0.852
antly+0.823
vered+0.799
es+0.777
e+0.769
Neg logit contributions
aff-0.778
ock-0.769
weet-0.762
iki-0.714
oto-0.684
Token there
Token ID612
Feature activation+2.02e+00

Pos logit contributions
bes+2.004
antly+1.833
vered+1.778
 wrist+1.766
es+1.736
Neg logit contributions
ock-2.576
weet-2.552
aff-2.528
iki-2.433
aa-2.325
Token was
Token ID373
Feature activation+7.62e-03

Pos logit contributions
bes+2.952
antly+2.576
 wrist+2.525
vered+2.514
 kit+2.505
Neg logit contributions
ock-4.142
aff-4.115
weet-3.990
iki-3.980
oto-3.829
Token a
Token ID257
Feature activation-7.18e-01

Pos logit contributions
bes+0.010
antly+0.010
vered+0.010
es+0.009
e+0.009
Neg logit contributions
aff-0.016
ock-0.016
weet-0.016
iki-0.015
oto-0.014
Token little
Token ID1310
Feature activation-1.15e+00

Pos logit contributions
ock+1.087
aff+1.085
weet+1.059
iki+0.969
ules+0.942
Neg logit contributions
bes-1.383
antly-1.336
house-1.292
es-1.292
vered-1.291
Token gir
Token ID37370
Feature activation-3.84e+00

Pos logit contributions
aff+2.148
ock+2.107
weet+2.091
iki+1.944
ules+1.926
Neg logit contributions
bes-1.759
antly-1.706
house-1.681
 kit-1.670
es-1.666
Tokenaffe
Token ID21223
Feature activation-2.06e-01

Pos logit contributions
weet+4.947
ock+4.608
arl+4.514
 hurricane+4.369
add+4.339
Neg logit contributions
es-8.838
e-8.586
oc-8.430
house-8.304
bes-8.067
Token named
Token ID3706
Feature activation-1.25e+00

Pos logit contributions
aff+0.309
ock+0.305
weet+0.300
iki+0.275
oto+0.266
Neg logit contributions
bes-0.404
antly-0.391
vered-0.383
es-0.382
e-0.375
Token Gig
Token ID19525
Feature activation-1.45e+00

Pos logit contributions
ock+2.040
aff+1.977
weet+1.928
iki+1.803
aa+1.762
Neg logit contributions
bes-2.311
antly-2.198
es-2.180
nob-2.162
e-2.158
Tokeni
Token ID72
Feature activation-5.78e-01

Pos logit contributions
ock+2.086
aff+2.047
weet+2.047
andel+1.841
arl+1.812
Neg logit contributions
bes-2.892
es-2.835
antly-2.832
e-2.743
vered-2.721
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
aff+1.204
ock+1.201
weet+1.189
iki+1.109
oto+1.088
Neg logit contributions
bes-0.791
antly-0.756
es-0.750
vered-0.732
e-0.727
Token was
Token ID373
Feature activation-1.85e-01

Pos logit contributions
bes+2.600
antly+2.250
vered+2.241
 wrist+2.224
 kit+2.201
Neg logit contributions
ock-3.897
aff-3.820
weet-3.774
iki-3.705
oto-3.574
Token a
Token ID257
Feature activation-7.03e-01

Pos logit contributions
aff+0.396
ock+0.393
weet+0.388
iki+0.366
oto+0.357
Neg logit contributions
bes-0.244
antly-0.232
vered-0.225
es-0.222
e-0.217
Token big
Token ID1263
Feature activation-1.75e+00

Pos logit contributions
ock+1.063
aff+1.062
weet+1.036
iki+0.945
ules+0.922
Neg logit contributions
bes-1.355
antly-1.308
vered-1.269
es-1.266
e-1.264
Token,
Token ID11
Feature activation-6.69e-01

Pos logit contributions
aff+3.549
ock+3.487
weet+3.423
iki+3.234
ules+3.234
Neg logit contributions
house-2.471
bes-2.455
e-2.406
 kit-2.362
 wrist-2.329
Token tall
Token ID7331
Feature activation-2.02e+00

Pos logit contributions
aff+1.139
ock+1.120
weet+1.092
iki+1.018
oto+0.983
Neg logit contributions
bes-1.193
antly-1.143
vered-1.106
es-1.105
house-1.101
Token gir
Token ID37370
Feature activation-3.81e+00

Pos logit contributions
ock+3.981
weet+3.931
aff+3.890
iki+3.655
ules+3.635
Neg logit contributions
bes-3.022
house-2.943
 wrist-2.857
antly-2.791
e-2.783
Tokenaffe
Token ID21223
Feature activation-6.86e-02

Pos logit contributions
weet+4.456
ock+4.343
arl+4.205
add+4.074
 hurricane+4.052
Neg logit contributions
es-9.302
e-8.972
house-8.514
oc-8.469
bes-8.435
Token named
Token ID3706
Feature activation-1.46e+00

Pos logit contributions
aff+0.115
ock+0.114
weet+0.112
iki+0.104
oto+0.101
Neg logit contributions
bes-0.123
antly-0.118
vered-0.116
es-0.115
e-0.112
Token Jerry
Token ID13075
Feature activation-1.75e-01

Pos logit contributions
ock+2.560
aff+2.408
weet+2.401
iki+2.246
aa+2.208
Neg logit contributions
bes-2.547
antly-2.407
es-2.396
nob-2.364
e-2.362
Token.
Token ID13
Feature activation-7.84e-01

Pos logit contributions
aff+0.356
ock+0.353
weet+0.350
iki+0.328
oto+0.319
Neg logit contributions
bes-0.251
antly-0.240
vered-0.234
es-0.232
e-0.226
Token Jerry
Token ID13075
Feature activation+3.50e-01

Pos logit contributions
ock+1.270
aff+1.249
weet+1.232
iki+1.151
aa+1.123
Neg logit contributions
bes-1.452
antly-1.396
es-1.368
vered-1.367
house-1.348
Token was
Token ID373
Feature activation+2.85e-03

Pos logit contributions
bes+2.440
antly+2.141
 kit+2.087
vered+2.083
 wrist+2.078
Neg logit contributions
ock-3.753
aff-3.699
weet-3.641
iki-3.589
oto-3.446
Token a
Token ID257
Feature activation-7.91e-01

Pos logit contributions
bes+0.004
antly+0.004
vered+0.004
es+0.004
e+0.003
Neg logit contributions
aff-0.006
ock-0.006
weet-0.006
iki-0.006
oto-0.005
Token monkey
Token ID21657
Feature activation-2.35e-01

Pos logit contributions
ock+1.230
aff+1.227
weet+1.197
iki+1.093
ules+1.070
Neg logit contributions
bes-1.491
antly-1.438
house-1.396
e-1.392
es-1.390
Token and
Token ID290
Feature activation-6.71e-01

Pos logit contributions
aff+0.352
ock+0.348
weet+0.342
iki+0.314
oto+0.303
Neg logit contributions
bes-0.463
antly-0.446
vered-0.438
es-0.438
e-0.430
Token a
Token ID257
Feature activation-1.08e+00

Pos logit contributions
aff+1.370
ock+1.340
weet+1.307
iki+1.234
ules+1.208
Neg logit contributions
bes-0.977
antly-0.928
vered-0.908
es-0.908
e-0.900
Token gir
Token ID37370
Feature activation-3.79e+00

Pos logit contributions
ock+1.844
aff+1.817
weet+1.767
ules+1.613
iki+1.613
Neg logit contributions
bes-1.886
antly-1.819
house-1.779
e-1.775
es-1.761
Tokenaffe
Token ID21223
Feature activation-3.21e-01

Pos logit contributions
weet+4.808
ock+4.390
ules+4.131
oto+4.079
andel+4.061
Neg logit contributions
es-8.825
e-8.750
oc-8.497
house-8.421
's-7.983
Token.
Token ID13
Feature activation-5.92e-01

Pos logit contributions
aff+0.523
ock+0.517
weet+0.510
iki+0.471
oto+0.458
Neg logit contributions
bes-0.588
antly-0.569
es-0.557
vered-0.557
e-0.547
Token The
Token ID383
Feature activation-1.82e+00

Pos logit contributions
ock+1.096
aff+1.082
weet+1.067
iki+1.005
aa+0.977
Neg logit contributions
bes-0.961
antly-0.921
vered-0.903
es-0.898
e-0.886
Token gir
Token ID37370
Feature activation-3.60e+00

Pos logit contributions
ock+3.396
aff+3.280
weet+3.243
ules+3.023
andel+2.977
Neg logit contributions
bes-2.905
house-2.833
e-2.807
antly-2.780
es-2.742
Tokenaffe
Token ID21223
Feature activation-3.26e-01

Pos logit contributions
weet+5.687
ock+5.328
arl+5.242
andel+5.150
 hurricane+5.138
Neg logit contributions
es-7.819
e-7.256
oc-7.185
house-7.158
bes-7.011
Token,
Token ID11
Feature activation+9.55e-01

Pos logit contributions
bes+0.221
antly+0.213
vered+0.208
es+0.204
e+0.200
Neg logit contributions
aff-0.222
ock-0.220
weet-0.217
iki-0.203
oto-0.196
Token there
Token ID612
Feature activation+1.62e+00

Pos logit contributions
bes+1.378
antly+1.252
vered+1.229
 wrist+1.190
es+1.184
Neg logit contributions
ock-1.941
weet-1.928
aff-1.925
iki-1.836
aa-1.763
Token was
Token ID373
Feature activation+4.35e-02

Pos logit contributions
bes+2.202
antly+1.943
vered+1.898
 wrist+1.857
 kit+1.852
Neg logit contributions
ock-3.503
aff-3.466
weet-3.412
iki-3.335
oto-3.230
Token a
Token ID257
Feature activation-8.23e-01

Pos logit contributions
bes+0.059
antly+0.057
vered+0.055
es+0.054
e+0.052
Neg logit contributions
aff-0.091
ock-0.091
weet-0.090
iki-0.084
oto-0.082
Token tall
Token ID7331
Feature activation-2.01e+00

Pos logit contributions
ock+1.273
aff+1.269
weet+1.235
iki+1.132
ules+1.107
Neg logit contributions
bes-1.563
antly-1.499
house-1.462
e-1.459
es-1.457
Token gir
Token ID37370
Feature activation-3.76e+00

Pos logit contributions
ock+3.990
weet+3.912
aff+3.901
iki+3.654
ules+3.641
Neg logit contributions
bes-3.026
house-2.970
 wrist-2.853
e-2.823
 wound-2.787
Tokenaffe
Token ID21223
Feature activation-2.21e-01

Pos logit contributions
weet+4.292
ock+4.171
add+4.125
arl+3.973
iki+3.916
Neg logit contributions
es-8.948
e-8.794
house-8.530
oc-8.467
's-8.287
Token named
Token ID3706
Feature activation-1.42e+00

Pos logit contributions
aff+0.345
ock+0.341
weet+0.336
iki+0.310
oto+0.299
Neg logit contributions
bes-0.420
antly-0.405
vered-0.397
es-0.396
e-0.389
Token Jerry
Token ID13075
Feature activation-3.29e-01

Pos logit contributions
ock+2.468
aff+2.348
weet+2.303
iki+2.176
aa+2.130
Neg logit contributions
bes-2.502
es-2.370
antly-2.364
e-2.344
nob-2.334
Token.
Token ID13
Feature activation-7.42e-01

Pos logit contributions
aff+0.678
ock+0.672
weet+0.665
iki+0.624
ules+0.609
Neg logit contributions
bes-0.459
antly-0.438
es-0.427
vered-0.426
e-0.417
Token Jerry
Token ID13075
Feature activation+5.61e-01

Pos logit contributions
ock+1.188
aff+1.176
weet+1.155
iki+1.081
aa+1.049
Neg logit contributions
bes-1.379
antly-1.325
vered-1.302
es-1.296
house-1.273
Token a
Token ID257
Feature activation-8.34e-01

Pos logit contributions
bes+0.049
antly+0.047
vered+0.046
es+0.045
e+0.043
Neg logit contributions
aff-0.076
ock-0.075
weet-0.075
iki-0.070
oto-0.068
Token bridge
Token ID7696
Feature activation-1.29e+00

Pos logit contributions
ock+1.287
aff+1.286
weet+1.249
iki+1.139
ules+1.117
Neg logit contributions
bes-1.588
antly-1.531
house-1.489
e-1.482
vered-1.477
Token.
Token ID13
Feature activation-1.88e-01

Pos logit contributions
aff+2.888
ock+2.838
weet+2.818
iki+2.694
oto+2.662
Neg logit contributions
bes-1.614
antly-1.513
house-1.509
e-1.494
es-1.487
Token Everyday
Token ID48154
Feature activation-3.15e-01

Pos logit contributions
aff+0.356
ock+0.354
weet+0.348
iki+0.327
oto+0.318
Neg logit contributions
bes-0.293
antly-0.281
vered-0.274
es-0.271
e-0.266
Token,
Token ID11
Feature activation+2.26e-01

Pos logit contributions
aff+0.523
ock+0.514
weet+0.507
iki+0.468
oto+0.454
Neg logit contributions
bes-0.572
antly-0.558
vered-0.543
es-0.540
e-0.531
Token lots
Token ID6041
Feature activation-3.76e+00

Pos logit contributions
bes+0.360
antly+0.344
vered+0.336
es+0.330
e+0.321
Neg logit contributions
aff-0.420
ock-0.418
weet-0.416
iki-0.386
oto-0.375
Token of
Token ID286
Feature activation-1.46e+00

Pos logit contributions
aff+6.503
arl+6.396
iki+6.104
weet+6.069
oto+6.044
Neg logit contributions
es-7.442
e-7.248
's-7.143
house-6.908
vered-6.578
Token people
Token ID661
Feature activation-1.22e+00

Pos logit contributions
aff+2.857
ock+2.688
weet+2.680
iki+2.562
oto+2.548
Neg logit contributions
bes-2.259
vered-2.220
e-2.179
es-2.171
antly-2.153
Token would
Token ID561
Feature activation-1.62e+00

Pos logit contributions
aff+2.423
ock+2.361
weet+2.361
iki+2.234
oto+2.215
Neg logit contributions
bes-1.772
es-1.760
e-1.735
vered-1.723
antly-1.716
Token cross
Token ID3272
Feature activation-3.33e-01

Pos logit contributions
aff+3.368
ock+3.215
weet+3.116
iki+3.055
oto+3.003
Neg logit contributions
bes-2.299
e-2.202
house-2.149
antly-2.135
vered-2.116
Token the
Token ID262
Feature activation-1.37e+00

Pos logit contributions
aff+0.629
ock+0.621
weet+0.614
iki+0.577
oto+0.561
Neg logit contributions
bes-0.526
antly-0.505
vered-0.494
es-0.490
e-0.479
TokenOne
Token ID3198
Feature activation-1.11e+00

Pos logit contributions
bes+1.608
vered+1.504
antly+1.467
 wrist+1.442
es+1.403
Neg logit contributions
aff-1.910
ock-1.899
weet-1.853
iki-1.721
ules-1.679
Token day
Token ID1110
Feature activation+3.15e-01

Pos logit contributions
aff+1.400
ock+1.246
weet+1.203
iki+1.087
ules+1.045
Neg logit contributions
bes-2.494
e-2.483
antly-2.483
house-2.427
vered-2.424
Token,
Token ID11
Feature activation+6.06e-01

Pos logit contributions
bes+0.554
antly+0.529
vered+0.521
es+0.510
nob+0.501
Neg logit contributions
aff-0.535
ock-0.529
weet-0.523
iki-0.489
aa-0.475
Token a
Token ID257
Feature activation-1.02e+00

Pos logit contributions
bes+0.917
antly+0.862
vered+0.844
es+0.836
 kit+0.808
Neg logit contributions
aff-1.179
ock-1.171
weet-1.165
iki-1.097
oto-1.065
Token tall
Token ID7331
Feature activation-2.40e+00

Pos logit contributions
aff+1.639
ock+1.630
weet+1.594
iki+1.447
ules+1.441
Neg logit contributions
bes-1.865
antly-1.826
house-1.759
e-1.755
vered-1.744
Token gir
Token ID37370
Feature activation-3.73e+00

Pos logit contributions
ock+4.892
weet+4.869
aff+4.787
ules+4.528
iki+4.406
Neg logit contributions
house-3.501
bes-3.374
 wrist-3.311
antly-3.269
 wound-3.263
Tokenaffe
Token ID21223
Feature activation+9.12e-02

Pos logit contributions
weet+4.829
ock+4.622
arl+4.530
ules+4.369
add+4.321
Neg logit contributions
e-8.256
es-8.176
oc-8.097
house-8.040
's-7.716
Token was
Token ID373
Feature activation-4.61e-01

Pos logit contributions
bes+0.186
antly+0.180
vered+0.176
es+0.174
e+0.170
Neg logit contributions
aff-0.130
ock-0.129
weet-0.127
iki-0.116
aa-0.111
Token walking
Token ID6155
Feature activation-7.66e-01

Pos logit contributions
aff+0.924
ock+0.914
weet+0.896
iki+0.848
oto+0.828
Neg logit contributions
bes-0.682
antly-0.651
vered-0.632
es-0.630
e-0.628
Token through
Token ID832
Feature activation+1.31e+00

Pos logit contributions
aff+1.645
weet+1.616
ock+1.615
iki+1.535
ules+1.497
Neg logit contributions
bes-1.009
antly-0.993
vered-0.966
e-0.950
es-0.945
Token the
Token ID262
Feature activation-1.13e+00

Pos logit contributions
bes+2.568
antly+2.521
vered+2.408
es+2.401
 kit+2.361
Neg logit contributions
weet-2.065
ock-1.999
aff-1.930
iki-1.727
oto-1.661
Token adventure
Token ID8855
Feature activation-5.02e-01

Pos logit contributions
aff+1.565
ock+1.532
weet+1.531
iki+1.443
ules+1.409
Neg logit contributions
bes-1.322
antly-1.231
es-1.193
vered-1.192
e-1.191
Token story
Token ID1621
Feature activation-6.77e-01

Pos logit contributions
aff+1.071
ock+1.059
weet+1.055
iki+0.996
oto+0.973
Neg logit contributions
bes-0.668
antly-0.636
es-0.620
vered-0.617
e-0.601
Token,
Token ID11
Feature activation-8.09e-01

Pos logit contributions
aff+1.362
ock+1.358
weet+1.351
iki+1.273
aa+1.234
Neg logit contributions
bes-0.997
antly-0.937
es-0.926
vered-0.900
e-0.880
Token but
Token ID475
Feature activation-4.37e-01

Pos logit contributions
aff+1.369
ock+1.359
weet+1.312
iki+1.238
oto+1.206
Neg logit contributions
bes-1.445
antly-1.372
es-1.365
vered-1.354
e-1.351
Token after
Token ID706
Feature activation-1.60e+00

Pos logit contributions
aff+0.737
ock+0.733
weet+0.718
iki+0.660
oto+0.643
Neg logit contributions
bes-0.781
antly-0.757
vered-0.742
es-0.735
e-0.722
Token an
Token ID281
Feature activation-3.71e+00

Pos logit contributions
aff+3.703
ock+3.678
weet+3.556
iki+3.429
arl+3.386
Neg logit contributions
bes-2.005
es-1.864
antly-1.817
e-1.780
vered-1.745
Token hour
Token ID1711
Feature activation+4.50e-01

Pos logit contributions
aff+6.336
aa+6.076
ules+5.930
weet+5.764
iki+5.754
Neg logit contributions
house-7.024
nob-7.007
e-6.840
bes-6.797
vered-6.540
Token they
Token ID484
Feature activation-1.79e+00

Pos logit contributions
bes+0.820
antly+0.778
vered+0.762
es+0.745
nob+0.739
Neg logit contributions
aff-0.749
ock-0.742
weet-0.727
iki-0.677
ules-0.655
Token were
Token ID547
Feature activation-6.46e-01

Pos logit contributions
aff+3.678
ock+3.618
weet+3.614
andel+3.338
iki+3.331
Neg logit contributions
e-2.544
es-2.532
antly-2.524
bes-2.521
vered-2.486
Token both
Token ID1111
Feature activation+3.26e-01

Pos logit contributions
aff+1.176
ock+1.157
weet+1.132
iki+1.070
oto+1.041
Neg logit contributions
bes-1.076
antly-1.031
es-1.012
vered-1.006
e-0.999
Token tired
Token ID10032
Feature activation-9.94e-01

Pos logit contributions
bes+0.628
antly+0.607
vered+0.591
es+0.586
nob+0.573
Neg logit contributions
aff-0.500
ock-0.498
weet-0.494
iki-0.451
aa-0.436
Token\n
Token ID198
Feature activation-5.27e-01

Pos logit contributions
ock+1.544
aff+1.542
weet+1.512
iki+1.435
oto+1.415
Neg logit contributions
bes-0.920
antly-0.888
vered-0.843
es-0.842
e-0.823
Token\n
Token ID198
Feature activation-9.72e-01

Pos logit contributions
aff+1.130
ock+1.119
weet+1.110
iki+1.044
aa+1.018
Neg logit contributions
bes-0.692
antly-0.665
vered-0.637
es-0.636
e-0.621
TokenEvery
Token ID6109
Feature activation-1.82e+00

Pos logit contributions
aff+1.743
ock+1.686
weet+1.674
iki+1.583
aa+1.544
Neg logit contributions
bes-1.656
antly-1.598
vered-1.565
e-1.539
house-1.519
Token morning
Token ID3329
Feature activation-7.12e-02

Pos logit contributions
aff+2.725
weet+2.457
ock+2.455
arl+2.187
iki+2.178
Neg logit contributions
antly-3.880
e-3.757
bes-3.680
vered-3.654
house-3.617
Token,
Token ID11
Feature activation-8.37e-01

Pos logit contributions
aff+0.127
ock+0.126
weet+0.124
iki+0.116
oto+0.112
Neg logit contributions
bes-0.120
antly-0.115
vered-0.113
es-0.112
e-0.109
Token lots
Token ID6041
Feature activation-3.70e+00

Pos logit contributions
aff+1.589
ock+1.572
weet+1.547
iki+1.446
oto+1.422
Neg logit contributions
bes-1.324
antly-1.289
vered-1.258
es-1.256
e-1.230
Token of
Token ID286
Feature activation-1.53e+00

Pos logit contributions
aff+6.033
arl+5.965
weet+5.799
iki+5.672
oto+5.550
Neg logit contributions
es-7.322
e-7.173
's-6.926
vered-6.636
antly-6.629
Token boats
Token ID16659
Feature activation-1.78e+00

Pos logit contributions
aff+2.946
weet+2.766
ock+2.766
iki+2.671
oto+2.554
Neg logit contributions
bes-2.374
antly-2.318
vered-2.305
e-2.295
es-2.279
Token arrived
Token ID5284
Feature activation-5.21e-01

Pos logit contributions
aff+3.388
weet+3.265
ock+3.217
iki+3.075
andel+3.041
Neg logit contributions
es-2.780
antly-2.767
bes-2.753
e-2.705
vered-2.681
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
aff+0.918
ock+0.912
weet+0.899
iki+0.834
oto+0.808
Neg logit contributions
bes-0.890
antly-0.871
es-0.844
vered-0.842
e-0.814
Token left
Token ID1364
Feature activation-7.54e-01

Pos logit contributions
aff+1.968
ock+1.911
weet+1.859
iki+1.774
oto+1.728
Neg logit contributions
bes-1.721
antly-1.701
es-1.661
vered-1.646
e-1.612
Token just
Token ID655
Feature activation-7.93e-01

Pos logit contributions
bes+0.279
antly+0.265
vered+0.259
es+0.255
e+0.249
Neg logit contributions
aff-0.342
ock-0.341
weet-0.337
iki-0.316
oto-0.306
Token went
Token ID1816
Feature activation-1.09e+00

Pos logit contributions
aff+1.559
ock+1.534
weet+1.500
iki+1.412
ules+1.399
Neg logit contributions
bes-1.201
antly-1.178
vered-1.150
es-1.139
e-1.113
Token "
Token ID366
Feature activation-3.82e-01

Pos logit contributions
aff+2.299
ock+2.255
weet+2.232
iki+2.144
ules+2.085
Neg logit contributions
bes-1.514
antly-1.475
 kit-1.413
nob-1.407
house-1.400
Tokentick
Token ID42298
Feature activation-1.18e+00

Pos logit contributions
aff+0.779
ock+0.767
weet+0.758
iki+0.719
oto+0.702
Neg logit contributions
bes-0.541
antly-0.522
vered-0.509
es-0.500
e-0.493
Token-
Token ID12
Feature activation-1.43e+00

Pos logit contributions
aff+2.427
weet+2.373
ock+2.356
iki+2.350
oto+2.257
Neg logit contributions
bes-1.663
es-1.597
antly-1.553
e-1.545
vered-1.537
Tokent
Token ID83
Feature activation-3.70e+00

Pos logit contributions
aff+2.069
ock+2.018
weet+2.012
iki+1.945
oto+1.831
Neg logit contributions
bes-2.898
antly-2.821
e-2.818
es-2.807
vered-2.780
Tokenock
Token ID735
Feature activation-3.75e-01

Pos logit contributions
iki+6.291
aff+6.235
ules+5.645
oto+5.543
add+5.458
Neg logit contributions
e-7.454
nob-7.295
es-7.278
bes-7.130
house-7.049
Token,
Token ID11
Feature activation-8.41e-01

Pos logit contributions
aff+0.739
ock+0.729
weet+0.721
iki+0.680
oto+0.663
Neg logit contributions
bes-0.561
antly-0.536
vered-0.523
es-0.522
e-0.508
Token tick
Token ID4378
Feature activation-4.58e-01

Pos logit contributions
aff+1.500
ock+1.465
weet+1.429
iki+1.373
oto+1.332
Neg logit contributions
bes-1.431
antly-1.364
vered-1.340
es-1.328
e-1.315
Token-
Token ID12
Feature activation-1.11e+00

Pos logit contributions
aff+0.882
ock+0.866
weet+0.865
iki+0.815
oto+0.786
Neg logit contributions
bes-0.717
antly-0.688
vered-0.672
es-0.671
e-0.652
Tokent
Token ID83
Feature activation-2.82e+00

Pos logit contributions
aff+1.747
ock+1.685
weet+1.679
iki+1.621
oto+1.534
Neg logit contributions
bes-2.105
antly-2.057
e-2.022
es-2.020
vered-2.015
Token lively
Token ID29696
Feature activation-1.92e+00

Pos logit contributions
ock+1.296
aff+1.295
weet+1.260
iki+1.154
ules+1.135
Neg logit contributions
bes-1.507
antly-1.446
house-1.411
e-1.404
es-1.398
Token little
Token ID1310
Feature activation-1.62e+00

Pos logit contributions
ock+3.568
aff+3.524
weet+3.456
ules+3.302
iki+3.242
Neg logit contributions
house-3.025
bes-3.015
 wrist-2.869
e-2.859
 Hide-2.812
Token bird
Token ID6512
Feature activation-1.03e+00

Pos logit contributions
aff+3.204
ock+3.133
weet+3.110
ules+2.916
iki+2.881
Neg logit contributions
bes-2.318
house-2.302
 kit-2.248
antly-2.222
e-2.208
Token named
Token ID3706
Feature activation-1.03e+00

Pos logit contributions
aff+1.804
ock+1.744
weet+1.724
iki+1.610
ules+1.598
Neg logit contributions
bes-1.772
e-1.713
antly-1.697
es-1.690
house-1.677
Token Ch
Token ID609
Feature activation-2.19e+00

Pos logit contributions
ock+1.698
aff+1.679
weet+1.580
ules+1.502
iki+1.493
Neg logit contributions
bes-1.896
antly-1.818
e-1.797
house-1.780
es-1.771
Tokenir
Token ID343
Feature activation-3.68e+00

Pos logit contributions
ock+1.920
aff+1.917
weet+1.896
oto+1.583
ules+1.544
Neg logit contributions
bes-5.792
e-5.669
antly-5.519
es-5.471
vered-5.449
Tokenpy
Token ID9078
Feature activation-2.04e+00

Pos logit contributions
 hurricane+6.673
arl+6.480
weet+6.472
aff+6.329
 tour+6.306
Neg logit contributions
es-6.587
e-6.137
bes-6.094
antly-6.075
nob-5.894
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
aff+4.828
weet+4.741
ock+4.702
ules+4.494
andel+4.471
Neg logit contributions
e-2.292
es-2.262
bes-2.205
antly-2.174
house-2.125
Token Ch
Token ID609
Feature activation-2.29e+00

Pos logit contributions
aff+1.983
ock+1.930
weet+1.899
iki+1.765
oto+1.744
Neg logit contributions
bes-1.896
antly-1.862
es-1.845
vered-1.780
e-1.766
Tokenir
Token ID343
Feature activation-3.48e+00

Pos logit contributions
ock+2.078
weet+2.057
aff+1.962
oto+1.694
ules+1.694
Neg logit contributions
bes-6.224
e-5.924
vered-5.849
es-5.839
house-5.801
Tokenpy
Token ID9078
Feature activation-1.26e+00

Pos logit contributions
weet+4.678
 hurricane+4.625
ock+4.595
aff+4.498
arl+4.454
Neg logit contributions
es-7.702
bes-7.482
antly-7.305
e-7.176
nob-7.172
Token she
Token ID673
Feature activation+2.11e-01

Pos logit contributions
aff+1.653
ock+1.606
weet+1.537
iki+1.424
andel+1.424
Neg logit contributions
bes-1.854
es-1.776
antly-1.772
e-1.735
vered-1.733
Token asked
Token ID1965
Feature activation-5.20e-01

Pos logit contributions
bes+0.388
antly+0.375
vered+0.368
es+0.362
nob+0.351
Neg logit contributions
aff-0.341
ock-0.339
weet-0.335
iki-0.310
oto-0.299
Token her
Token ID607
Feature activation-4.06e-01

Pos logit contributions
aff+0.907
ock+0.901
weet+0.881
iki+0.826
aa+0.795
Neg logit contributions
bes-0.894
antly-0.872
es-0.865
vered-0.852
e-0.830
Token mom
Token ID1995
Feature activation-8.66e-01

Pos logit contributions
ock+0.795
aff+0.794
weet+0.781
iki+0.724
aa+0.704
Neg logit contributions
bes-0.614
antly-0.592
es-0.577
vered-0.571
e-0.552
Tokenmy
Token ID1820
Feature activation-4.93e-01

Pos logit contributions
aff+1.542
ock+1.509
weet+1.495
iki+1.408
aa+1.372
Neg logit contributions
bes-1.402
antly-1.387
es-1.377
house-1.322
 kit-1.320
Token lots
Token ID6041
Feature activation-3.65e+00

Pos logit contributions
aff+0.759
ock+0.751
weet+0.739
iki+0.686
aa+0.659
Neg logit contributions
bes-0.934
antly-0.908
es-0.897
vered-0.884
e-0.866
Token of
Token ID286
Feature activation-1.91e+00

Pos logit contributions
arl+5.672
aff+5.654
iki+5.552
weet+5.515
aph+5.478
Neg logit contributions
es-7.244
's-6.876
e-6.636
bes-6.472
house-6.438
Token questions
Token ID2683
Feature activation-5.54e-01

Pos logit contributions
aff+3.087
ock+3.044
weet+2.959
iki+2.824
arl+2.675
Neg logit contributions
bes-3.327
e-3.290
 kit-3.240
vered-3.230
es-3.227
Token about
Token ID546
Feature activation+2.23e-02

Pos logit contributions
aff+1.115
ock+1.104
weet+1.095
iki+1.034
aa+1.008
Neg logit contributions
bes-0.795
antly-0.759
es-0.750
vered-0.733
e-0.720
Token marriage
Token ID4845
Feature activation-3.50e-01

Pos logit contributions
bes+0.038
antly+0.037
vered+0.036
es+0.035
e+0.034
Neg logit contributions
aff-0.039
ock-0.039
weet-0.038
iki-0.036
oto-0.035
Token.
Token ID13
Feature activation+1.96e-01

Pos logit contributions
aff+0.709
ock+0.707
weet+0.701
iki+0.657
aa+0.638
Neg logit contributions
bes-0.498
antly-0.475
es-0.463
vered-0.458
e-0.445
Token toys
Token ID14958
Feature activation-4.96e-01

Pos logit contributions
bes+0.756
antly+0.728
vered+0.708
es+0.683
nob+0.677
Neg logit contributions
aff-0.895
ock-0.883
weet-0.861
iki-0.805
oto-0.791
Token.
Token ID13
Feature activation-8.89e-01

Pos logit contributions
aff+0.966
weet+0.960
ock+0.956
iki+0.896
aa+0.869
Neg logit contributions
bes-0.741
antly-0.714
es-0.699
vered-0.693
e-0.675
Token Sally
Token ID25737
Feature activation-2.87e-01

Pos logit contributions
ock+1.522
weet+1.471
aff+1.459
iki+1.327
oto+1.290
Neg logit contributions
bes-1.594
antly-1.557
es-1.487
vered-1.477
house-1.467
Token said
Token ID531
Feature activation-8.86e-01

Pos logit contributions
aff+0.536
ock+0.534
weet+0.529
iki+0.490
aa+0.476
Neg logit contributions
bes-0.459
antly-0.440
vered-0.429
es-0.427
e-0.414
Token yes
Token ID3763
Feature activation-1.19e+00

Pos logit contributions
aff+1.652
ock+1.639
weet+1.613
iki+1.515
aa+1.455
Neg logit contributions
bes-1.421
antly-1.391
es-1.380
vered-1.369
e-1.323
Token,
Token ID11
Feature activation-3.64e+00

Pos logit contributions
aff+2.149
weet+2.144
ock+2.127
iki+2.014
andel+1.959
Neg logit contributions
bes-1.924
antly-1.881
es-1.867
's-1.843
e-1.816
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
aff+7.910
ock+7.564
iki+7.520
arl+7.451
andel+7.338
Neg logit contributions
's-5.630
es-5.232
bes-5.181
 wrist-5.082
 kit-5.064
Token they
Token ID484
Feature activation-6.23e-01

Pos logit contributions
ock+1.432
aff+1.413
weet+1.378
iki+1.243
arl+1.202
Neg logit contributions
bes-2.134
antly-2.107
es-2.082
vered-2.056
house-2.025
Token made
Token ID925
Feature activation+1.14e+00

Pos logit contributions
ock+1.034
aff+1.033
weet+1.027
iki+0.951
oto+0.915
Neg logit contributions
bes-1.116
antly-1.086
es-1.065
vered-1.062
e-1.047
Token a
Token ID257
Feature activation+6.91e-01

Pos logit contributions
bes+2.072
antly+1.980
vered+1.939
nob+1.889
es+1.886
Neg logit contributions
aff-1.947
weet-1.880
ock-1.876
oto-1.691
iki-1.687
Token trade
Token ID3292
Feature activation+8.90e-01

Pos logit contributions
bes+1.418
antly+1.363
vered+1.360
es+1.347
nob+1.309
Neg logit contributions
aff-1.009
weet-0.967
ock-0.966
iki-0.871
oto-0.846
Token time
Token ID640
Feature activation+4.55e-01

Pos logit contributions
aff+0.827
ock+0.806
weet+0.796
iki+0.720
ules+0.699
Neg logit contributions
bes-1.224
antly-1.176
vered-1.153
e-1.146
es-1.139
Token,
Token ID11
Feature activation+1.23e+00

Pos logit contributions
bes+0.824
antly+0.795
vered+0.773
es+0.752
nob+0.744
Neg logit contributions
aff-0.753
ock-0.746
weet-0.739
iki-0.690
oto-0.662
Token there
Token ID612
Feature activation+1.95e+00

Pos logit contributions
bes+1.865
antly+1.695
vered+1.652
 wrist+1.615
es+1.610
Neg logit contributions
ock-2.434
weet-2.410
aff-2.382
iki-2.290
aa-2.197
Token was
Token ID373
Feature activation+3.56e-02

Pos logit contributions
bes+2.827
antly+2.477
vered+2.407
 kit+2.384
 wrist+2.372
Neg logit contributions
ock-4.071
aff-4.011
weet-3.939
iki-3.868
oto-3.757
Token a
Token ID257
Feature activation-8.16e-01

Pos logit contributions
bes+0.049
antly+0.046
vered+0.045
es+0.044
e+0.043
Neg logit contributions
aff-0.075
ock-0.074
weet-0.074
iki-0.069
oto-0.068
Token gir
Token ID37370
Feature activation-3.64e+00

Pos logit contributions
ock+1.271
aff+1.265
weet+1.236
iki+1.132
ules+1.105
Neg logit contributions
bes-1.536
antly-1.479
house-1.438
es-1.436
e-1.431
Tokenaffe
Token ID21223
Feature activation-1.64e-01

Pos logit contributions
weet+4.413
ock+4.137
add+4.054
arl+4.017
iki+3.994
Neg logit contributions
es-8.412
e-8.281
house-8.087
oc-8.049
bes-7.809
Token named
Token ID3706
Feature activation-1.31e+00

Pos logit contributions
aff+0.250
ock+0.247
weet+0.243
iki+0.224
oto+0.216
Neg logit contributions
bes-0.317
antly-0.306
vered-0.300
es-0.299
e-0.293
Token Jerry
Token ID13075
Feature activation-2.36e-01

Pos logit contributions
ock+2.256
aff+2.164
weet+2.120
iki+1.999
aa+1.953
Neg logit contributions
bes-2.302
es-2.182
antly-2.177
e-2.162
house-2.135
Token.
Token ID13
Feature activation-6.50e-01

Pos logit contributions
aff+0.483
ock+0.478
weet+0.474
iki+0.445
oto+0.433
Neg logit contributions
bes-0.335
antly-0.319
vered-0.311
es-0.310
e-0.302
Token Jerry
Token ID13075
Feature activation+5.66e-01

Pos logit contributions
ock+1.034
aff+1.027
weet+1.007
iki+0.940
aa+0.912
Neg logit contributions
bes-1.211
antly-1.166
vered-1.145
es-1.139
house-1.116
Token Anna
Token ID11735
Feature activation+9.86e-01

Pos logit contributions
aff+0.639
ock+0.638
weet+0.626
iki+0.577
oto+0.555
Neg logit contributions
bes-0.730
antly-0.702
es-0.683
vered-0.682
e-0.667
Token says
Token ID1139
Feature activation+1.11e+00

Pos logit contributions
bes+1.907
antly+1.829
vered+1.810
nob+1.735
house+1.714
Neg logit contributions
aff-1.561
ock-1.519
weet-1.508
iki-1.393
aa-1.364
Token.
Token ID13
Feature activation-2.93e-01

Pos logit contributions
bes+1.914
antly+1.806
vered+1.796
nob+1.718
 searches+1.693
Neg logit contributions
aff-1.962
ock-1.929
weet-1.899
aa-1.776
iki-1.760
Token "
Token ID366
Feature activation+7.30e-01

Pos logit contributions
ock+0.520
aff+0.519
weet+0.510
iki+0.472
oto+0.460
Neg logit contributions
bes-0.499
antly-0.482
es-0.468
vered-0.467
e-0.452
TokenLet
Token ID5756
Feature activation+4.39e-01

Pos logit contributions
bes+0.987
antly+0.943
vered+0.925
es+0.899
nob+0.884
Neg logit contributions
aff-1.533
ock-1.533
weet-1.527
iki-1.396
aa-1.389
Token's
Token ID338
Feature activation+1.86e+00

Pos logit contributions
bes+0.311
antly+0.269
vered+0.266
es+0.251
nob+0.226
Neg logit contributions
ock-1.231
aff-1.224
weet-1.214
iki-1.157
aa-1.143
Token make
Token ID787
Feature activation+1.59e+00

Pos logit contributions
bes+3.460
antly+3.338
vered+3.326
es+3.299
nob+3.055
Neg logit contributions
ock-3.159
aff-3.086
weet-3.049
oto-2.742
aa-2.726
Token bubbles
Token ID25037
Feature activation+1.80e-01

Pos logit contributions
bes+2.638
vered+2.571
antly+2.559
nob+2.509
house+2.423
Neg logit contributions
ock-2.954
aff-2.917
weet-2.912
oto-2.680
aa-2.592
Token with
Token ID351
Feature activation+2.09e-02

Pos logit contributions
bes+0.303
antly+0.293
vered+0.287
es+0.279
e+0.273
Neg logit contributions
aff-0.320
ock-0.319
weet-0.314
iki-0.289
oto-0.283
Token it
Token ID340
Feature activation-2.80e-01

Pos logit contributions
bes+0.032
antly+0.031
vered+0.030
es+0.030
e+0.029
Neg logit contributions
aff-0.040
ock-0.040
weet-0.040
iki-0.037
oto-0.036
Token!"
Token ID2474
Feature activation-5.57e-01

Pos logit contributions
aff+0.522
ock+0.516
weet+0.510
iki+0.479
oto+0.464
Neg logit contributions
bes-0.448
antly-0.431
es-0.419
vered-0.419
e-0.406
Token day
Token ID1110
Feature activation-2.43e-01

Pos logit contributions
aff+0.945
ock+0.904
weet+0.893
iki+0.786
ules+0.770
Neg logit contributions
bes-1.812
antly-1.738
vered-1.711
house-1.698
e-1.695
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
aff+0.478
ock+0.473
weet+0.467
iki+0.439
oto+0.428
Neg logit contributions
bes-0.365
antly-0.349
vered-0.339
es-0.337
e-0.330
Token She
Token ID1375
Feature activation-7.74e-01

Pos logit contributions
ock+1.862
aff+1.859
weet+1.773
iki+1.648
aa+1.640
Neg logit contributions
bes-1.835
antly-1.801
 Something-1.741
e-1.738
vered-1.730
Token was
Token ID373
Feature activation-8.27e-01

Pos logit contributions
aff+1.484
ock+1.451
weet+1.435
iki+1.350
oto+1.321
Neg logit contributions
bes-1.227
antly-1.173
vered-1.154
es-1.137
e-1.133
Token outside
Token ID2354
Feature activation+3.56e-01

Pos logit contributions
aff+1.621
ock+1.581
weet+1.526
iki+1.452
oto+1.420
Neg logit contributions
bes-1.282
antly-1.230
e-1.212
vered-1.190
es-1.182
Token in
Token ID287
Feature activation+2.01e+00

Pos logit contributions
bes+0.588
antly+0.562
vered+0.545
es+0.541
nob+0.525
Neg logit contributions
ock-0.638
aff-0.636
weet-0.633
iki-0.585
aa-0.570
Token the
Token ID262
Feature activation+6.49e-01

Pos logit contributions
bes+4.519
antly+4.353
es+4.333
vered+4.220
 kit+3.979
Neg logit contributions
weet-2.734
ock-2.662
iki-2.252
aff-2.178
ules-2.131
Token big
Token ID1263
Feature activation+5.20e-01

Pos logit contributions
bes+1.177
es+1.115
vered+1.112
antly+1.105
e+1.046
Neg logit contributions
ock-1.103
weet-1.097
aff-1.071
iki-0.994
aa-0.961
Token backyard
Token ID24296
Feature activation+9.29e-01

Pos logit contributions
bes+0.775
antly+0.735
vered+0.716
es+0.710
e+0.675
Neg logit contributions
ock-1.037
weet-1.030
aff-1.027
iki-0.955
aa-0.930
Token in
Token ID287
Feature activation+2.25e+00

Pos logit contributions
bes+1.673
antly+1.607
vered+1.581
es+1.538
nob+1.483
Neg logit contributions
ock-1.556
weet-1.549
aff-1.534
iki-1.412
ules-1.364
Token the
Token ID262
Feature activation+7.98e-01

Pos logit contributions
bes+5.132
antly+4.882
es+4.865
vered+4.771
 kit+4.635
Neg logit contributions
weet-3.061
ock-2.991
iki-2.525
aff-2.357
ules-2.336
TokenTom
Token ID13787
Feature activation-3.36e-01

Pos logit contributions
bes+0.938
antly+0.916
vered+0.906
es+0.876
nob+0.865
Neg logit contributions
aff-0.933
ock-0.931
weet-0.920
iki-0.841
oto-0.829
Token said
Token ID531
Feature activation+4.57e-01

Pos logit contributions
aff+0.550
ock+0.546
weet+0.541
iki+0.496
ules+0.479
Neg logit contributions
bes-0.615
antly-0.596
es-0.582
vered-0.580
e-0.566
Token,
Token ID11
Feature activation-6.31e-01

Pos logit contributions
bes+0.829
antly+0.784
vered+0.771
es+0.751
nob+0.751
Neg logit contributions
aff-0.767
ock-0.756
weet-0.738
iki-0.686
aa-0.677
Token "
Token ID366
Feature activation+4.30e-01

Pos logit contributions
aff+1.206
ock+1.204
weet+1.184
iki+1.112
oto+1.075
Neg logit contributions
bes-0.994
antly-0.953
es-0.930
vered-0.911
 kit-0.889
TokenI
Token ID40
Feature activation-4.29e-01

Pos logit contributions
bes+0.714
antly+0.678
es+0.664
vered+0.664
nob+0.646
Neg logit contributions
ock-0.789
aff-0.785
weet-0.772
iki-0.718
aa-0.702
Token promise
Token ID6991
Feature activation+2.10e+00

Pos logit contributions
aff+0.898
ock+0.892
weet+0.886
iki+0.824
ules+0.804
Neg logit contributions
bes-0.592
antly-0.575
vered-0.556
es-0.554
e-0.538
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
bes+4.398
vered+4.242
antly+4.143
es+4.023
e+3.971
Neg logit contributions
ock-3.046
aff-2.990
weet-2.937
aa-2.752
oto-2.732
Token And
Token ID843
Feature activation+1.26e+00

Pos logit contributions
bes+2.655
antly+2.484
vered+2.480
es+2.451
nob+2.393
Neg logit contributions
weet-2.419
aff-2.407
ock-2.363
iki-2.198
aa-2.149
Token I
Token ID314
Feature activation+2.19e-01

Pos logit contributions
bes+2.444
antly+2.318
vered+2.262
nob+2.248
es+2.235
Neg logit contributions
weet-1.963
ock-1.937
aff-1.922
iki-1.799
oto-1.744
Token will
Token ID481
Feature activation+1.48e-01

Pos logit contributions
bes+0.308
antly+0.290
vered+0.283
es+0.278
e+0.269
Neg logit contributions
aff-0.458
ock-0.454
weet-0.448
iki-0.424
aa-0.414
Token wash
Token ID13502
Feature activation+5.08e-01

Pos logit contributions
bes+0.233
antly+0.223
vered+0.219
es+0.216
e+0.208
Neg logit contributions
aff-0.280
ock-0.278
weet-0.275
iki-0.256
oto-0.250
Token it
Token ID340
Feature activation+8.77e-01

Pos logit contributions
aff+0.028
ock+0.028
weet+0.027
iki+0.025
oto+0.024
Neg logit contributions
bes-0.031
antly-0.030
vered-0.029
es-0.029
e-0.028
Token was
Token ID373
Feature activation-3.16e-01

Pos logit contributions
bes+1.277
vered+1.189
antly+1.181
 wrist+1.146
e+1.125
Neg logit contributions
ock-1.756
aff-1.721
weet-1.702
iki-1.599
aa-1.582
Token useful
Token ID4465
Feature activation+6.38e-01

Pos logit contributions
aff+0.548
ock+0.538
weet+0.531
iki+0.496
ules+0.477
Neg logit contributions
bes-0.553
antly-0.533
vered-0.517
es-0.515
e-0.504
Token to
Token ID284
Feature activation+2.21e-01

Pos logit contributions
bes+1.267
antly+1.232
vered+1.201
e+1.163
es+1.161
Neg logit contributions
ock-0.954
aff-0.941
weet-0.922
iki-0.829
oto-0.819
Token help
Token ID1037
Feature activation+9.71e-01

Pos logit contributions
bes+0.331
antly+0.317
vered+0.311
es+0.305
e+0.292
Neg logit contributions
ock-0.438
aff-0.437
weet-0.432
iki-0.401
oto-0.394
Token her
Token ID607
Feature activation+2.34e+00

Pos logit contributions
bes+1.686
antly+1.613
vered+1.609
es+1.525
 searches+1.478
Neg logit contributions
ock-1.753
weet-1.718
aff-1.713
aa-1.520
iki-1.498
Token mom
Token ID1995
Feature activation+1.48e+00

Pos logit contributions
bes+4.767
vered+4.758
antly+4.595
nob+4.457
ASH+4.380
Neg logit contributions
ock-3.355
aff-3.245
aa-3.172
weet-3.152
oto-3.095
Tokenmy
Token ID1820
Feature activation+1.89e+00

Pos logit contributions
bes+3.102
vered+2.997
antly+2.931
es+2.790
 searches+2.732
Neg logit contributions
ock-2.280
weet-2.148
aff-2.107
aa-1.847
oto-1.838
Token.
Token ID13
Feature activation+5.69e-01

Pos logit contributions
bes+3.402
antly+3.243
vered+3.216
ASH+3.037
es+3.024
Neg logit contributions
ock-3.287
weet-3.128
aff-3.114
aa-2.810
ules-2.729
Token
Token ID198
Feature activation-5.74e-01

Pos logit contributions
bes+0.870
vered+0.838
antly+0.827
es+0.805
e+0.801
Neg logit contributions
aff-1.113
ock-1.089
weet-1.084
iki-1.018
aa-0.993
Token
Token ID198
Feature activation+3.82e-02

Pos logit contributions
aff+1.237
ock+1.224
weet+1.218
iki+1.146
aa+1.111
Neg logit contributions
bes-0.743
antly-0.711
es-0.686
vered-0.671
e-0.660
TokenOnce
Token ID7454
Feature activation-1.61e+00

Pos logit contributions
bes+1.819
vered+1.764
antly+1.712
 wrist+1.651
es+1.621
Neg logit contributions
aff-1.972
ock-1.963
weet-1.912
iki-1.757
oto-1.751
Token upon
Token ID2402
Feature activation+8.34e-01

Pos logit contributions
aff+2.791
weet+2.732
ock+2.702
iki+2.397
andel+2.392
Neg logit contributions
antly-2.783
es-2.772
bes-2.745
vered-2.625
's-2.620
Token a
Token ID257
Feature activation-6.72e-01

Pos logit contributions
bes+1.266
antly+1.238
vered+1.220
es+1.162
 kit+1.139
Neg logit contributions
ock-1.622
weet-1.613
aff-1.568
oto-1.459
iki-1.442
Token time
Token ID640
Feature activation+4.57e-01

Pos logit contributions
aff+0.958
ock+0.931
weet+0.921
iki+0.833
ules+0.813
Neg logit contributions
bes-1.388
antly-1.332
vered-1.302
e-1.302
es-1.287
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
bes+0.826
antly+0.798
vered+0.777
es+0.754
nob+0.746
Neg logit contributions
aff-0.754
ock-0.747
weet-0.739
iki-0.691
oto-0.664
Token there
Token ID612
Feature activation+1.97e+00

Pos logit contributions
bes+1.546
antly+1.415
vered+1.389
 wrist+1.358
es+1.357
Neg logit contributions
ock-2.112
weet-2.081
aff-2.057
iki-1.957
aa-1.905
Token was
Token ID373
Feature activation-2.71e-01

Pos logit contributions
bes+2.877
antly+2.499
 wrist+2.471
vered+2.462
 kit+2.443
Neg logit contributions
ock-4.066
aff-3.996
weet-3.910
iki-3.839
oto-3.735
Token a
Token ID257
Feature activation-7.32e-01

Pos logit contributions
aff+0.589
ock+0.583
weet+0.575
iki+0.546
aa+0.532
Neg logit contributions
bes-0.352
antly-0.334
vered-0.322
es-0.320
e-0.314
Token little
Token ID1310
Feature activation-1.11e+00

Pos logit contributions
ock+1.134
aff+1.134
weet+1.104
iki+1.018
ules+0.990
Neg logit contributions
bes-1.381
antly-1.336
house-1.293
es-1.292
vered-1.290
Token girl
Token ID2576
Feature activation+4.64e-01

Pos logit contributions
aff+2.127
ock+2.089
weet+2.073
iki+1.943
ules+1.921
Neg logit contributions
bes-1.680
antly-1.627
house-1.610
 kit-1.594
es-1.592
Token named
Token ID3706
Feature activation-8.02e-01

Pos logit contributions
bes+1.013
antly+0.988
vered+0.973
es+0.948
nob+0.927
Neg logit contributions
aff-0.592
ock-0.589
weet-0.588
iki-0.525
aa-0.511
Token,
Token ID11
Feature activation-7.67e-01

Pos logit contributions
aff+0.999
ock+0.988
weet+0.975
iki+0.922
oto+0.893
Neg logit contributions
bes-0.838
antly-0.803
es-0.786
vered-0.780
e-0.758
Token and
Token ID290
Feature activation+2.45e-01

Pos logit contributions
aff+1.274
ock+1.261
weet+1.221
iki+1.143
aa+1.114
Neg logit contributions
bes-1.384
antly-1.329
es-1.300
vered-1.297
e-1.271
Token just
Token ID655
Feature activation-1.70e-02

Pos logit contributions
bes+0.415
antly+0.397
vered+0.388
es+0.383
e+0.375
Neg logit contributions
aff-0.433
ock-0.430
weet-0.428
iki-0.394
oto-0.382
Token passing
Token ID6427
Feature activation-2.14e-01

Pos logit contributions
aff+0.026
ock+0.026
weet+0.025
iki+0.023
oto+0.022
Neg logit contributions
bes-0.033
antly-0.032
vered-0.031
es-0.031
e-0.030
Token it
Token ID340
Feature activation-3.12e-01

Pos logit contributions
aff+0.398
ock+0.395
weet+0.390
iki+0.365
oto+0.355
Neg logit contributions
bes-0.341
antly-0.330
vered-0.321
es-0.318
e-0.311
Token makes
Token ID1838
Feature activation-1.56e-01

Pos logit contributions
aff+0.556
ock+0.551
weet+0.543
iki+0.507
oto+0.494
Neg logit contributions
bes-0.523
antly-0.505
vered-0.493
es-0.490
e-0.476
Token you
Token ID345
Feature activation+2.48e-01

Pos logit contributions
aff+0.274
ock+0.271
weet+0.267
iki+0.250
oto+0.242
Neg logit contributions
bes-0.266
antly-0.257
vered-0.251
es-0.249
e-0.243
Token feel
Token ID1254
Feature activation+3.80e-02

Pos logit contributions
bes+0.449
antly+0.433
vered+0.422
es+0.417
nob+0.405
Neg logit contributions
aff-0.405
ock-0.405
weet-0.402
iki-0.367
aa-0.356
Token good
Token ID922
Feature activation-3.53e-01

Pos logit contributions
bes+0.070
antly+0.067
vered+0.066
es+0.065
e+0.064
Neg logit contributions
aff-0.062
ock-0.061
weet-0.060
iki-0.055
oto-0.054
Token."
Token ID526
Feature activation-3.77e-01

Pos logit contributions
aff+0.660
ock+0.653
weet+0.648
iki+0.609
oto+0.588
Neg logit contributions
bes-0.562
antly-0.543
vered-0.528
es-0.528
e-0.509
Token The
Token ID383
Feature activation-4.07e-01

Pos logit contributions
ock+0.800
aff+0.800
weet+0.786
iki+0.742
oto+0.726
Neg logit contributions
bes-0.507
antly-0.483
es-0.463
vered-0.462
e-0.447
Token Lily
Token ID20037
Feature activation+6.47e-01

Pos logit contributions
aff+0.428
ock+0.425
weet+0.414
iki+0.379
oto+0.368
Neg logit contributions
bes-0.603
antly-0.583
vered-0.570
es-0.569
e-0.559
Token.
Token ID13
Feature activation-3.64e-01

Pos logit contributions
bes+1.042
antly+1.003
vered+0.981
es+0.948
 kit+0.909
Neg logit contributions
ock-1.222
weet-1.209
aff-1.201
iki-1.110
aa-1.088
Token She
Token ID1375
Feature activation+2.59e-01

Pos logit contributions
aff+0.563
ock+0.559
weet+0.547
iki+0.508
oto+0.492
Neg logit contributions
bes-0.701
antly-0.675
vered-0.660
es-0.657
e-0.645
Token loved
Token ID6151
Feature activation-1.19e+00

Pos logit contributions
bes+0.477
antly+0.463
vered+0.457
es+0.449
e+0.433
Neg logit contributions
aff-0.415
ock-0.414
weet-0.409
iki-0.376
aa-0.366
Token to
Token ID284
Feature activation-5.64e-01

Pos logit contributions
aff+2.126
ock+2.096
weet+1.984
iki+1.950
ules+1.828
Neg logit contributions
bes-2.001
es-1.865
antly-1.846
e-1.842
 kit-1.837
Token play
Token ID711
Feature activation+1.29e+00

Pos logit contributions
aff+1.180
ock+1.159
weet+1.136
iki+1.084
ules+1.048
Neg logit contributions
bes-0.795
antly-0.738
e-0.723
vered-0.719
es-0.714
Token outside
Token ID2354
Feature activation+2.69e-01

Pos logit contributions
antly+1.860
vered+1.812
bes+1.792
es+1.627
 Something+1.601
Neg logit contributions
ock-2.677
aff-2.631
weet-2.552
oto-2.370
aa-2.341
Token with
Token ID351
Feature activation-1.26e-01

Pos logit contributions
bes+0.393
antly+0.376
vered+0.367
es+0.357
nob+0.345
Neg logit contributions
ock-0.537
aff-0.536
weet-0.529
iki-0.494
aa-0.481
Token her
Token ID607
Feature activation+3.83e-01

Pos logit contributions
aff+0.212
ock+0.209
weet+0.207
iki+0.191
oto+0.185
Neg logit contributions
bes-0.227
antly-0.219
vered-0.214
es-0.212
e-0.208
Token friends
Token ID2460
Feature activation+3.49e-01

Pos logit contributions
bes+0.610
antly+0.592
vered+0.581
es+0.563
nob+0.558
Neg logit contributions
aff-0.716
ock-0.709
weet-0.703
iki-0.668
aa-0.649
Token.
Token ID13
Feature activation-7.97e-01

Pos logit contributions
bes+0.563
antly+0.538
vered+0.526
es+0.512
nob+0.499
Neg logit contributions
ock-0.651
aff-0.651
weet-0.638
iki-0.595
aa-0.579
Token ran
Token ID4966
Feature activation+2.41e-01

Pos logit contributions
bes+2.007
antly+1.907
vered+1.871
e+1.823
es+1.819
Neg logit contributions
aff-1.839
weet-1.823
ock-1.817
iki-1.682
aa-1.653
Token around
Token ID1088
Feature activation+1.15e+00

Pos logit contributions
bes+0.351
antly+0.333
vered+0.326
es+0.322
e+0.310
Neg logit contributions
aff-0.489
ock-0.484
weet-0.476
iki-0.447
aa-0.436
Token the
Token ID262
Feature activation+1.86e+00

Pos logit contributions
bes+2.051
antly+1.980
vered+1.896
es+1.863
nob+1.830
Neg logit contributions
aff-1.934
ock-1.896
weet-1.896
iki-1.710
aa-1.664
Token park
Token ID3952
Feature activation+5.88e-01

Pos logit contributions
bes+3.488
es+3.418
antly+3.340
vered+3.291
nob+3.107
Neg logit contributions
ock-3.158
weet-3.153
aff-3.108
iki-2.670
oto-2.605
Token in
Token ID287
Feature activation+1.59e+00

Pos logit contributions
bes+0.933
antly+0.904
vered+0.880
es+0.853
e+0.837
Neg logit contributions
aff-1.102
ock-1.076
weet-1.068
iki-0.998
aa-0.965
Token excitement
Token ID14067
Feature activation+6.22e-01

Pos logit contributions
bes+3.239
antly+2.998
es+2.937
vered+2.907
 kit+2.841
Neg logit contributions
weet-2.482
ock-2.460
aff-2.370
iki-2.181
oto-2.039
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
bes+0.994
antly+0.941
vered+0.921
es+0.894
e+0.888
Neg logit contributions
aff-1.174
ock-1.158
weet-1.135
iki-1.058
ules-1.029
Token
Token ID220
Feature activation+1.01e+00

Pos logit contributions
bes+0.267
antly+0.253
vered+0.248
es+0.244
e+0.238
Neg logit contributions
aff-0.389
ock-0.383
weet-0.381
iki-0.358
aa-0.348
Token
Token ID198
Feature activation+1.67e-01

Pos logit contributions
bes+1.605
vered+1.536
antly+1.522
e+1.443
 wrist+1.439
Neg logit contributions
aff-1.901
weet-1.849
ock-1.846
iki-1.713
ules-1.684
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
bes+0.242
antly+0.230
vered+0.226
es+0.221
e+0.215
Neg logit contributions
aff-0.336
ock-0.333
weet-0.330
iki-0.309
oto-0.301
TokenThe
Token ID464
Feature activation+7.45e-01

Pos logit contributions
bes+0.345
antly+0.331
vered+0.323
es+0.319
e+0.310
Neg logit contributions
aff-0.436
ock-0.434
weet-0.430
iki-0.399
oto-0.390
Token she
Token ID673
Feature activation+9.68e-01

Pos logit contributions
aff+0.102
ock+0.101
weet+0.099
iki+0.090
oto+0.086
Neg logit contributions
bes-0.157
antly-0.152
vered-0.149
es-0.148
e-0.145
Token found
Token ID1043
Feature activation-5.02e-03

Pos logit contributions
bes+1.836
antly+1.778
vered+1.756
es+1.696
 wrist+1.675
Neg logit contributions
weet-1.498
ock-1.498
aff-1.469
aa-1.359
iki-1.338
Token an
Token ID281
Feature activation-1.41e+00

Pos logit contributions
aff+0.011
ock+0.011
weet+0.010
iki+0.010
oto+0.010
Neg logit contributions
bes-0.007
antly-0.007
vered-0.006
es-0.006
e-0.006
Token old
Token ID1468
Feature activation+1.95e-01

Pos logit contributions
aff+2.464
ock+2.363
weet+2.339
iki+2.191
aa+2.148
Neg logit contributions
bes-2.432
antly-2.322
vered-2.320
nob-2.319
 wrist-2.312
Token book
Token ID1492
Feature activation-2.00e-01

Pos logit contributions
bes+0.283
antly+0.272
vered+0.265
es+0.262
e+0.254
Neg logit contributions
aff-0.389
ock-0.387
weet-0.383
iki-0.359
oto-0.351
Token in
Token ID287
Feature activation+3.85e-01

Pos logit contributions
aff+0.421
ock+0.417
weet+0.413
iki+0.389
oto+0.379
Neg logit contributions
bes-0.274
antly-0.261
vered-0.252
es-0.251
e-0.243
Token her
Token ID607
Feature activation+1.59e+00

Pos logit contributions
bes+0.719
antly+0.703
vered+0.688
es+0.680
e+0.657
Neg logit contributions
ock-0.616
weet-0.613
aff-0.612
iki-0.552
oto-0.540
Token grandma
Token ID49890
Feature activation+2.45e+00

Pos logit contributions
vered+1.881
bes+1.872
antly+1.867
es+1.758
nob+1.632
Neg logit contributions
weet-3.595
aff-3.540
ock-3.507
iki-3.405
oto-3.291
Token's
Token ID338
Feature activation+1.44e+00

Pos logit contributions
bes+1.998
vered+1.997
antly+1.971
 Something+1.558
es+1.441
Neg logit contributions
weet-7.039
aff-6.830
ock-6.779
aa-6.364
iki-6.300
Token attic
Token ID46113
Feature activation-7.46e-02

Pos logit contributions
antly+1.608
vered+1.559
bes+1.545
es+1.473
e+1.351
Neg logit contributions
aff-3.380
weet-3.343
ock-3.264
iki-3.141
aa-3.075
Token.
Token ID13
Feature activation+8.94e-01

Pos logit contributions
aff+0.150
ock+0.149
weet+0.147
iki+0.138
oto+0.135
Neg logit contributions
bes-0.108
antly-0.103
vered-0.100
es-0.099
e-0.096
Token,
Token ID11
Feature activation-2.98e-01

Pos logit contributions
bes+0.270
antly+0.259
vered+0.255
es+0.250
e+0.245
Neg logit contributions
aff-0.276
ock-0.275
weet-0.270
iki-0.251
oto-0.244
Token I
Token ID314
Feature activation+4.23e-01

Pos logit contributions
aff+0.507
ock+0.499
weet+0.492
iki+0.455
aa+0.442
Neg logit contributions
bes-0.525
antly-0.508
vered-0.497
es-0.494
e-0.484
Token'll
Token ID1183
Feature activation+6.12e-01

Pos logit contributions
bes+0.686
antly+0.644
vered+0.632
es+0.624
e+0.605
Neg logit contributions
aff-0.798
ock-0.791
weet-0.777
iki-0.733
aa-0.714
Token make
Token ID787
Feature activation+1.14e+00

Pos logit contributions
bes+1.004
antly+0.963
vered+0.944
es+0.941
nob+0.891
Neg logit contributions
aff-1.131
ock-1.124
weet-1.106
iki-1.029
aa-1.011
Token one
Token ID530
Feature activation+6.26e-01

Pos logit contributions
bes+1.878
antly+1.795
vered+1.779
nob+1.706
house+1.658
Neg logit contributions
aff-2.137
ock-2.132
weet-2.128
oto-1.905
iki-1.889
Token for
Token ID329
Feature activation+4.47e-01

Pos logit contributions
bes+0.983
antly+0.927
vered+0.923
es+0.886
nob+0.872
Neg logit contributions
ock-1.206
aff-1.206
weet-1.187
iki-1.099
oto-1.079
Token you
Token ID345
Feature activation+8.34e-02

Pos logit contributions
bes+0.758
antly+0.742
vered+0.723
es+0.701
nob+0.692
Neg logit contributions
aff-0.789
ock-0.788
weet-0.787
iki-0.712
oto-0.701
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
bes+0.131
antly+0.126
vered+0.123
es+0.121
e+0.117
Neg logit contributions
aff-0.158
ock-0.157
weet-0.155
iki-0.144
oto-0.141
Token But
Token ID887
Feature activation+7.18e-01

Pos logit contributions
bes+2.598
vered+2.455
es+2.416
antly+2.403
nob+2.306
Neg logit contributions
weet-2.348
aff-2.333
ock-2.296
iki-2.118
aa-2.044
Token you
Token ID345
Feature activation-6.37e-01

Pos logit contributions
bes+1.339
antly+1.258
vered+1.244
es+1.230
nob+1.200
Neg logit contributions
ock-1.174
weet-1.157
aff-1.156
iki-1.082
oto-1.035
Token have
Token ID423
Feature activation-8.10e-01

Pos logit contributions
aff+1.360
ock+1.349
weet+1.337
iki+1.262
oto+1.238
Neg logit contributions
bes-0.841
antly-0.806
vered-0.787
es-0.786
e-0.783
Token forest
Token ID8222
Feature activation-2.58e-01

Pos logit contributions
aff+1.529
ock+1.514
weet+1.478
oto+1.381
iki+1.377
Neg logit contributions
bes-1.365
antly-1.343
vered-1.314
house-1.291
e-1.276
Token ran
Token ID4966
Feature activation-7.45e-01

Pos logit contributions
aff+0.467
ock+0.464
weet+0.457
iki+0.426
oto+0.416
Neg logit contributions
bes-0.423
antly-0.409
vered-0.401
es-0.398
e-0.387
Token away
Token ID1497
Feature activation-4.14e-01

Pos logit contributions
aff+1.438
ock+1.430
weet+1.425
iki+1.338
ules+1.304
Neg logit contributions
bes-1.133
antly-1.099
vered-1.074
es-1.066
e-1.038
Token,
Token ID11
Feature activation-1.13e+00

Pos logit contributions
aff+0.776
ock+0.774
weet+0.763
iki+0.711
oto+0.695
Neg logit contributions
bes-0.662
antly-0.636
vered-0.624
es-0.620
e-0.599
Token but
Token ID475
Feature activation-1.23e-01

Pos logit contributions
aff+2.138
ock+2.114
weet+2.024
iki+1.940
oto+1.903
Neg logit contributions
bes-1.857
antly-1.735
vered-1.709
es-1.707
house-1.675
Token the
Token ID262
Feature activation-1.48e+00

Pos logit contributions
aff+0.207
ock+0.205
weet+0.202
iki+0.187
oto+0.181
Neg logit contributions
bes-0.219
antly-0.212
vered-0.207
es-0.205
e-0.201
Token oak
Token ID26210
Feature activation-3.63e-01

Pos logit contributions
ock+2.507
aff+2.454
weet+2.386
ules+2.224
aa+2.214
Neg logit contributions
bes-2.648
antly-2.579
house-2.571
vered-2.506
nob-2.490
Token tree
Token ID5509
Feature activation+2.60e-01

Pos logit contributions
aff+0.682
ock+0.676
weet+0.671
iki+0.625
oto+0.612
Neg logit contributions
bes-0.574
antly-0.551
vered-0.539
es-0.537
e-0.520
Token stayed
Token ID9658
Feature activation+7.71e-01

Pos logit contributions
bes+0.307
antly+0.288
vered+0.277
es+0.272
e+0.264
Neg logit contributions
aff-0.588
ock-0.585
weet-0.577
iki-0.550
aa-0.535
Token strong
Token ID1913
Feature activation-9.14e-03

Pos logit contributions
bes+1.275
antly+1.220
vered+1.209
es+1.196
e+1.149
Neg logit contributions
ock-1.383
aff-1.354
weet-1.345
iki-1.248
oto-1.222
Token and
Token ID290
Feature activation+3.58e-03

Pos logit contributions
aff+0.016
ock+0.016
weet+0.016
iki+0.014
oto+0.014
Neg logit contributions
bes-0.016
antly-0.015
vered-0.015
es-0.015
e-0.014
Token day
Token ID1110
Feature activation-2.91e-02

Pos logit contributions
aff+1.509
weet+1.459
ock+1.430
iki+1.278
ules+1.260
Neg logit contributions
bes-2.405
antly-2.261
house-2.243
e-2.230
vered-2.217
Token.
Token ID13
Feature activation+6.45e-02

Pos logit contributions
aff+0.053
ock+0.053
weet+0.052
iki+0.048
oto+0.047
Neg logit contributions
bes-0.048
antly-0.046
vered-0.045
es-0.044
e-0.043
Token The
Token ID383
Feature activation-1.15e+00

Pos logit contributions
bes+0.093
antly+0.089
vered+0.087
es+0.085
e+0.083
Neg logit contributions
aff-0.131
ock-0.130
weet-0.128
iki-0.120
oto-0.118
Token sun
Token ID4252
Feature activation+7.04e-01

Pos logit contributions
aff+1.837
ock+1.721
weet+1.710
iki+1.587
ules+1.559
Neg logit contributions
bes-2.229
antly-2.128
 wrist-2.055
e-2.050
house-2.038
Token was
Token ID373
Feature activation-1.83e+00

Pos logit contributions
bes+0.800
vered+0.740
antly+0.721
es+0.690
 wrist+0.673
Neg logit contributions
ock-1.663
aff-1.639
weet-1.626
iki-1.563
aa-1.534
Token shining
Token ID22751
Feature activation-1.32e+00

Pos logit contributions
aff+3.901
ock+3.611
weet+3.525
iki+3.447
ules+3.327
Neg logit contributions
antly-2.565
bes-2.549
e-2.459
house-2.442
 kit-2.351
Token,
Token ID11
Feature activation-1.73e+00

Pos logit contributions
aff+2.448
weet+2.404
ock+2.343
iki+2.246
ules+2.163
Neg logit contributions
antly-2.168
bes-2.055
es-2.053
house-1.984
vered-1.972
Token and
Token ID290
Feature activation-4.45e-01

Pos logit contributions
aff+3.275
ock+2.950
weet+2.914
iki+2.891
andel+2.834
Neg logit contributions
bes-3.018
antly-2.799
es-2.788
's-2.766
e-2.739
Token the
Token ID262
Feature activation-9.75e-01

Pos logit contributions
aff+0.814
ock+0.792
weet+0.784
iki+0.739
aa+0.713
Neg logit contributions
bes-0.736
antly-0.706
es-0.688
vered-0.684
e-0.667
Token birds
Token ID10087
Feature activation-1.77e+00

Pos logit contributions
aff+1.549
ock+1.487
weet+1.456
iki+1.373
ules+1.334
Neg logit contributions
bes-1.852
antly-1.769
vered-1.710
 wrist-1.709
e-1.708
Token were
Token ID547
Feature activation-7.46e-01

Pos logit contributions
aff+3.426
weet+3.298
ock+3.268
oto+3.125
andel+3.112
Neg logit contributions
e-2.796
es-2.786
bes-2.706
antly-2.663
vered-2.639
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
aff+0.127
ock+0.126
weet+0.124
iki+0.116
oto+0.113
Neg logit contributions
bes-0.106
antly-0.102
vered-0.100
es-0.098
e-0.096
Token and
Token ID290
Feature activation-2.68e-01

Pos logit contributions
aff+1.994
ock+1.956
weet+1.903
iki+1.805
andel+1.761
Neg logit contributions
bes-1.746
es-1.705
antly-1.699
vered-1.664
e-1.644
Token Fl
Token ID1610
Feature activation+1.48e-01

Pos logit contributions
aff+0.444
ock+0.442
weet+0.434
iki+0.400
oto+0.388
Neg logit contributions
bes-0.484
antly-0.468
vered-0.459
es-0.457
e-0.446
Tokenuffy
Token ID15352
Feature activation-1.96e-01

Pos logit contributions
bes+0.247
antly+0.238
vered+0.233
es+0.228
nob+0.224
Neg logit contributions
aff-0.267
ock-0.262
weet-0.259
iki-0.244
aa-0.236
Token played
Token ID2826
Feature activation+8.61e-01

Pos logit contributions
aff+0.341
ock+0.339
weet+0.334
iki+0.310
oto+0.302
Neg logit contributions
bes-0.335
antly-0.323
vered-0.316
es-0.314
e-0.307
Token all
Token ID477
Feature activation-1.28e+00

Pos logit contributions
bes+1.075
antly+1.064
vered+1.010
nob+0.952
es+0.938
Neg logit contributions
aff-1.880
ock-1.864
weet-1.818
iki-1.703
oto-1.694
Token day
Token ID1110
Feature activation+1.84e-01

Pos logit contributions
aff+1.745
weet+1.639
ock+1.612
iki+1.553
ules+1.545
Neg logit contributions
bes-2.696
e-2.655
antly-2.637
es-2.578
house-2.577
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
bes+0.279
antly+0.271
vered+0.264
es+0.257
e+0.251
Neg logit contributions
aff-0.357
ock-0.353
weet-0.349
iki-0.325
aa-0.318
Token They
Token ID1119
Feature activation-1.58e+00

Pos logit contributions
ock+2.105
weet+2.004
aff+1.999
iki+1.877
aa+1.846
Neg logit contributions
bes-2.025
antly-1.928
house-1.913
es-1.883
vered-1.848
Token ran
Token ID4966
Feature activation-1.07e+00

Pos logit contributions
ock+3.206
aff+3.187
weet+3.143
andel+2.965
arl+2.958
Neg logit contributions
bes-2.365
es-2.302
e-2.292
vered-2.205
house-2.175
Token and
Token ID290
Feature activation-1.60e-01

Pos logit contributions
weet+1.977
ock+1.974
aff+1.948
iki+1.844
ules+1.805
Neg logit contributions
bes-1.742
es-1.658
antly-1.652
vered-1.625
house-1.607
Token the
Token ID262
Feature activation+6.53e-01

Pos logit contributions
bes+0.618
antly+0.597
vered+0.587
es+0.580
e+0.555
Neg logit contributions
ock-0.568
aff-0.568
weet-0.566
iki-0.514
oto-0.501
Token restaurant
Token ID7072
Feature activation+1.84e-01

Pos logit contributions
bes+1.034
vered+0.991
antly+0.987
es+0.968
e+0.932
Neg logit contributions
aff-1.251
ock-1.229
weet-1.220
iki-1.126
oto-1.117
Token.
Token ID13
Feature activation-4.31e-01

Pos logit contributions
bes+0.284
antly+0.272
vered+0.267
es+0.260
e+0.254
Neg logit contributions
aff-0.352
ock-0.350
weet-0.345
iki-0.322
oto-0.315
Token But
Token ID887
Feature activation+5.64e-01

Pos logit contributions
ock+0.709
aff+0.701
weet+0.693
iki+0.645
oto+0.625
Neg logit contributions
bes-0.796
antly-0.764
vered-0.740
es-0.738
house-0.722
Token when
Token ID618
Feature activation+3.16e-01

Pos logit contributions
bes+1.103
antly+1.057
vered+1.047
es+1.031
nob+1.015
Neg logit contributions
aff-0.836
ock-0.831
weet-0.820
iki-0.759
aa-0.732
Token they
Token ID484
Feature activation-3.73e-01

Pos logit contributions
bes+0.719
antly+0.695
vered+0.691
es+0.680
e+0.671
Neg logit contributions
aff-0.373
ock-0.368
weet-0.363
iki-0.325
aa-0.310
Token got
Token ID1392
Feature activation+1.27e+00

Pos logit contributions
aff+0.677
ock+0.670
weet+0.665
iki+0.614
oto+0.600
Neg logit contributions
bes-0.613
antly-0.597
vered-0.584
es-0.578
e-0.569
Token there
Token ID612
Feature activation+2.95e-01

Pos logit contributions
bes+2.358
vered+2.181
antly+2.165
es+2.099
 wrist+2.081
Neg logit contributions
ock-2.121
aff-2.073
weet-2.040
aa-1.990
iki-1.971
Token,
Token ID11
Feature activation+5.21e-01

Pos logit contributions
bes+0.524
antly+0.497
vered+0.487
es+0.480
e+0.473
Neg logit contributions
aff-0.502
ock-0.496
weet-0.486
iki-0.457
aa-0.441
Token they
Token ID484
Feature activation-6.82e-01

Pos logit contributions
bes+1.173
antly+1.138
vered+1.121
es+1.102
e+1.096
Neg logit contributions
aff-0.618
ock-0.614
weet-0.600
iki-0.539
oto-0.514
Token saw
Token ID2497
Feature activation-3.83e-01

Pos logit contributions
aff+1.402
ock+1.388
weet+1.378
iki+1.282
oto+1.253
Neg logit contributions
bes-0.967
antly-0.936
es-0.921
vered-0.909
e-0.893
Token if
Token ID611
Feature activation+3.47e-01

Pos logit contributions
bes+1.147
antly+1.087
vered+1.076
e+1.038
es+1.033
Neg logit contributions
aff-0.894
ock-0.878
weet-0.864
iki-0.810
oto-0.788
Token he
Token ID339
Feature activation+1.01e-01

Pos logit contributions
bes+0.744
antly+0.716
vered+0.704
es+0.694
e+0.686
Neg logit contributions
aff-0.459
ock-0.455
weet-0.448
iki-0.411
oto-0.397
Token wanted
Token ID2227
Feature activation+2.47e-01

Pos logit contributions
bes+0.168
antly+0.161
vered+0.157
es+0.155
e+0.151
Neg logit contributions
aff-0.182
ock-0.181
weet-0.178
iki-0.167
oto-0.162
Token to
Token ID284
Feature activation+3.21e-01

Pos logit contributions
bes+0.488
antly+0.469
vered+0.462
es+0.455
e+0.444
Neg logit contributions
aff-0.373
ock-0.371
weet-0.366
iki-0.333
oto-0.324
Token play
Token ID711
Feature activation+1.26e+00

Pos logit contributions
bes+0.506
antly+0.484
vered+0.475
es+0.469
 kit+0.446
Neg logit contributions
aff-0.609
ock-0.605
weet-0.600
iki-0.558
oto-0.545
Token a
Token ID257
Feature activation-5.73e-01

Pos logit contributions
bes+1.718
antly+1.712
vered+1.646
es+1.540
nob+1.480
Neg logit contributions
ock-2.680
aff-2.561
weet-2.538
oto-2.341
iki-2.320
Token game
Token ID983
Feature activation-2.03e-02

Pos logit contributions
aff+1.065
ock+1.047
weet+1.038
iki+0.980
ules+0.954
Neg logit contributions
bes-0.914
antly-0.869
vered-0.846
es-0.836
house-0.829
Token.
Token ID13
Feature activation+6.85e-01

Pos logit contributions
aff+0.039
ock+0.039
weet+0.039
iki+0.036
oto+0.035
Neg logit contributions
bes-0.031
antly-0.029
vered-0.029
es-0.028
e-0.027
Token Rob
Token ID3851
Feature activation+1.13e-01

Pos logit contributions
bes+1.157
antly+1.115
vered+1.102
es+1.069
e+1.060
Neg logit contributions
aff-1.241
weet-1.189
ock-1.187
iki-1.122
oto-1.086
Token agreed
Token ID4987
Feature activation-8.10e-01

Pos logit contributions
bes+0.204
antly+0.195
vered+0.191
es+0.189
e+0.184
Neg logit contributions
aff-0.189
ock-0.186
weet-0.183
iki-0.171
oto-0.166
Token and
Token ID290
Feature activation-2.76e-01

Pos logit contributions
aff+1.343
weet+1.315
ock+1.313
iki+1.208
aa+1.156
Neg logit contributions
bes-1.492
antly-1.439
es-1.412
vered-1.397
e-1.364
Token.
Token ID13
Feature activation-7.01e-01

Pos logit contributions
bes+1.255
antly+1.224
vered+1.196
es+1.151
e+1.115
Neg logit contributions
aff-1.441
ock-1.433
weet-1.428
iki-1.316
aa-1.279
Token The
Token ID383
Feature activation-1.27e+00

Pos logit contributions
ock+1.373
aff+1.354
weet+1.327
iki+1.263
oto+1.232
Neg logit contributions
bes-1.069
antly-1.032
vered-1.000
es-0.998
house-0.975
Token neighbor
Token ID4780
Feature activation+4.19e-01

Pos logit contributions
aff+2.489
ock+2.451
weet+2.342
ules+2.201
iki+2.152
Neg logit contributions
bes-1.921
antly-1.886
house-1.861
 wrist-1.823
 wound-1.805
Token was
Token ID373
Feature activation+4.45e-01

Pos logit contributions
bes+0.744
antly+0.709
vered+0.697
es+0.679
e+0.667
Neg logit contributions
aff-0.711
ock-0.704
weet-0.701
iki-0.654
aa-0.629
Token a
Token ID257
Feature activation-2.97e-01

Pos logit contributions
bes+0.862
antly+0.840
vered+0.824
es+0.808
nob+0.783
Neg logit contributions
aff-0.676
weet-0.673
ock-0.671
iki-0.612
oto-0.595
Token kind
Token ID1611
Feature activation-2.60e+00

Pos logit contributions
aff+0.494
ock+0.491
weet+0.481
iki+0.444
oto+0.431
Neg logit contributions
bes-0.535
antly-0.517
vered-0.505
es-0.499
e-0.492
Token lady
Token ID10846
Feature activation+1.39e+00

Pos logit contributions
ock+5.074
weet+5.059
aff+5.013
ules+4.743
iki+4.615
Neg logit contributions
house-3.745
 kit-3.594
 wound-3.563
 wrist-3.516
's-3.512
Token.
Token ID13
Feature activation-8.54e-01

Pos logit contributions
bes+2.549
antly+2.416
vered+2.337
ASH+2.248
 Something+2.232
Neg logit contributions
weet-2.410
ock-2.391
aff-2.385
iki-2.219
aa-2.134
Token She
Token ID1375
Feature activation-1.05e+00

Pos logit contributions
ock+1.635
aff+1.595
weet+1.576
iki+1.490
oto+1.466
Neg logit contributions
bes-1.340
antly-1.296
es-1.267
vered-1.254
house-1.243
Token saw
Token ID2497
Feature activation-1.75e+00

Pos logit contributions
aff+2.044
ock+2.035
weet+1.997
andel+1.836
iki+1.828
Neg logit contributions
bes-1.617
antly-1.565
es-1.551
vered-1.497
e-1.478
Token the
Token ID262
Feature activation-1.02e+00

Pos logit contributions
aff+3.553
ock+3.544
weet+3.356
iki+3.289
aa+3.228
Neg logit contributions
es-2.529
bes-2.408
vered-2.351
antly-2.338
e-2.287
Token day
Token ID1110
Feature activation+2.69e-01

Pos logit contributions
aff+2.420
ock+2.036
weet+1.967
ules+1.923
arl+1.883
Neg logit contributions
e-3.799
antly-3.692
house-3.689
bes-3.677
's-3.623
Token,
Token ID11
Feature activation-5.54e-01

Pos logit contributions
bes+0.479
antly+0.456
vered+0.446
es+0.438
nob+0.431
Neg logit contributions
aff-0.458
ock-0.453
weet-0.448
iki-0.416
aa-0.406
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
aff+0.883
ock+0.874
weet+0.851
iki+0.782
ules+0.761
Neg logit contributions
bes-1.055
antly-1.020
vered-1.000
es-0.995
e-0.982
Token big
Token ID1263
Feature activation-1.81e+00

Pos logit contributions
aff+1.612
ock+1.590
weet+1.537
iki+1.422
ules+1.403
Neg logit contributions
bes-1.825
antly-1.816
house-1.733
 kit-1.729
vered-1.727
Token bird
Token ID6512
Feature activation-5.85e-01

Pos logit contributions
aff+3.274
ock+3.056
weet+2.945
ules+2.882
iki+2.844
Neg logit contributions
house-3.018
bes-2.989
e-2.974
 kit-2.964
antly-2.942
Token heard
Token ID2982
Feature activation-2.48e+00

Pos logit contributions
aff+1.061
ock+1.034
weet+1.018
iki+0.956
oto+0.945
Neg logit contributions
bes-0.967
antly-0.942
es-0.921
vered-0.920
e-0.910
Token her
Token ID607
Feature activation-8.67e-01

Pos logit contributions
aff+5.194
ock+5.058
iki+5.016
arl+4.933
weet+4.786
Neg logit contributions
es-3.459
e-3.305
bes-3.229
's-3.213
antly-3.198
Token singing
Token ID13777
Feature activation-1.12e+00

Pos logit contributions
aff+1.680
ock+1.673
weet+1.631
iki+1.511
ules+1.497
Neg logit contributions
bes-1.330
antly-1.293
es-1.262
e-1.238
vered-1.227
Token and
Token ID290
Feature activation-5.17e-01

Pos logit contributions
aff+2.359
ock+2.232
weet+2.220
iki+2.178
ules+2.104
Neg logit contributions
bes-1.591
antly-1.568
es-1.537
nob-1.501
e-1.495
Token said
Token ID531
Feature activation+1.43e-01

Pos logit contributions
aff+0.937
ock+0.931
weet+0.904
iki+0.848
oto+0.830
Neg logit contributions
bes-0.854
antly-0.828
es-0.809
vered-0.805
e-0.777
Token,
Token ID11
Feature activation-7.69e-01

Pos logit contributions
bes+0.249
antly+0.238
vered+0.233
es+0.229
e+0.226
Neg logit contributions
aff-0.248
ock-0.244
weet-0.242
iki-0.225
oto-0.218
Token too
Token ID1165
Feature activation-2.32e+00

Pos logit contributions
aff+2.698
ock+2.553
weet+2.478
iki+2.356
ules+2.346
Neg logit contributions
bes-2.440
antly-2.321
house-2.308
nob-2.257
 kit-2.254
Token dangerous
Token ID4923
Feature activation-1.84e-01

Pos logit contributions
aff+4.388
ock+4.116
weet+4.021
iki+3.853
ules+3.780
Neg logit contributions
bes-3.902
house-3.806
antly-3.696
nob-3.667
es-3.560
Token.
Token ID13
Feature activation-4.48e-01

Pos logit contributions
aff+0.362
ock+0.358
weet+0.355
iki+0.333
oto+0.324
Neg logit contributions
bes-0.276
antly-0.264
vered-0.257
es-0.256
e-0.248
Token Lily
Token ID20037
Feature activation+6.94e-03

Pos logit contributions
ock+0.706
aff+0.700
weet+0.690
iki+0.640
oto+0.621
Neg logit contributions
bes-0.859
antly-0.827
vered-0.804
es-0.802
house-0.786
Token felt
Token ID2936
Feature activation-5.68e-01

Pos logit contributions
bes+0.010
antly+0.010
vered+0.010
es+0.009
e+0.009
Neg logit contributions
aff-0.014
ock-0.014
weet-0.014
iki-0.013
oto-0.012
Token sad
Token ID6507
Feature activation-2.14e+00

Pos logit contributions
aff+0.826
ock+0.801
weet+0.792
iki+0.731
ules+0.698
Neg logit contributions
bes-1.164
antly-1.128
vered-1.118
es-1.100
e-1.092
Token again
Token ID757
Feature activation-1.13e+00

Pos logit contributions
aff+3.998
weet+3.893
ock+3.683
iki+3.599
ules+3.499
Neg logit contributions
es-3.625
antly-3.593
bes-3.525
's-3.473
 searches-3.436
Token,
Token ID11
Feature activation-2.18e+00

Pos logit contributions
aff+1.982
ock+1.940
weet+1.872
iki+1.775
oto+1.724
Neg logit contributions
bes-1.996
antly-1.899
es-1.898
vered-1.868
e-1.853
Token but
Token ID475
Feature activation-4.62e-02

Pos logit contributions
aff+4.105
ock+3.883
weet+3.684
iki+3.614
ules+3.530
Neg logit contributions
bes-3.888
's-3.637
 wrist-3.601
 wound-3.588
antly-3.557
Token she
Token ID673
Feature activation+1.97e-01

Pos logit contributions
aff+0.070
ock+0.070
weet+0.069
iki+0.063
oto+0.061
Neg logit contributions
bes-0.091
antly-0.088
vered-0.086
es-0.085
e-0.083
Token knew
Token ID2993
Feature activation-3.01e-01

Pos logit contributions
bes+0.388
antly+0.372
vered+0.366
es+0.362
e+0.354
Neg logit contributions
aff-0.295
ock-0.294
weet-0.289
iki-0.268
oto-0.259
Tokenster
Token ID1706
Feature activation-2.65e-01

Pos logit contributions
bes+1.810
vered+1.668
nob+1.614
antly+1.603
e+1.528
Neg logit contributions
aff-3.865
ock-3.655
iki-3.628
oto-3.557
weet-3.546
Token was
Token ID373
Feature activation-2.24e-01

Pos logit contributions
aff+0.539
ock+0.534
weet+0.529
iki+0.495
oto+0.483
Neg logit contributions
bes-0.380
antly-0.364
vered-0.354
es-0.352
e-0.341
Token not
Token ID407
Feature activation-1.34e+00

Pos logit contributions
aff+0.394
ock+0.389
weet+0.383
iki+0.358
oto+0.346
Neg logit contributions
bes-0.385
antly-0.370
vered-0.362
es-0.359
e-0.352
Token being
Token ID852
Feature activation-1.16e+00

Pos logit contributions
aff+2.539
ock+2.430
weet+2.363
iki+2.353
ules+2.240
Neg logit contributions
bes-2.093
es-2.004
e-1.989
 kit-1.984
antly-1.976
Token nice
Token ID3621
Feature activation-6.23e-01

Pos logit contributions
aff+2.006
ock+1.951
weet+1.907
iki+1.848
oto+1.773
Neg logit contributions
bes-1.995
es-1.967
e-1.945
vered-1.940
antly-1.922
Token,
Token ID11
Feature activation-2.27e+00

Pos logit contributions
aff+1.055
ock+1.044
weet+1.035
iki+0.963
oto+0.937
Neg logit contributions
bes-1.080
antly-1.055
es-1.047
vered-1.041
e-1.002
Token but
Token ID475
Feature activation+6.75e-02

Pos logit contributions
aff+4.334
ock+3.991
iki+3.878
weet+3.838
arl+3.753
Neg logit contributions
bes-3.791
's-3.746
es-3.722
vered-3.683
e-3.676
Token it
Token ID340
Feature activation+3.58e-01

Pos logit contributions
bes+0.108
antly+0.103
vered+0.101
es+0.099
e+0.097
Neg logit contributions
aff-0.128
ock-0.127
weet-0.126
iki-0.117
oto-0.114
Token did
Token ID750
Feature activation+1.38e-02

Pos logit contributions
bes+0.610
antly+0.580
vered+0.566
es+0.558
nob+0.545
Neg logit contributions
ock-0.628
aff-0.627
weet-0.616
iki-0.578
oto-0.557
Token not
Token ID407
Feature activation-1.60e+00

Pos logit contributions
bes+0.023
antly+0.023
vered+0.022
es+0.022
e+0.021
Neg logit contributions
aff-0.025
ock-0.024
weet-0.024
iki-0.022
oto-0.022
Token say
Token ID910
Feature activation-5.08e-01

Pos logit contributions
aff+3.362
ock+3.227
weet+3.223
iki+3.161
ules+3.154
Neg logit contributions
bes-2.082
e-2.053
antly-2.005
vered-2.003
house-1.979
Token having
Token ID1719
Feature activation+4.27e-01

Pos logit contributions
bes+0.675
antly+0.648
vered+0.632
es+0.614
nob+0.604
Neg logit contributions
aff-0.680
ock-0.673
weet-0.667
iki-0.618
oto-0.598
Token so
Token ID523
Feature activation-7.43e-01

Pos logit contributions
bes+0.893
antly+0.881
vered+0.856
es+0.841
nob+0.831
Neg logit contributions
aff-0.584
ock-0.583
weet-0.574
iki-0.512
oto-0.502
Token much
Token ID881
Feature activation+2.17e-02

Pos logit contributions
aff+1.039
ock+1.015
weet+0.992
iki+0.913
oto+0.876
Neg logit contributions
bes-1.548
vered-1.489
es-1.488
antly-1.487
e-1.465
Token fun
Token ID1257
Feature activation+1.76e+00

Pos logit contributions
bes+0.046
antly+0.045
vered+0.044
es+0.043
e+0.043
Neg logit contributions
aff-0.029
ock-0.029
weet-0.029
iki-0.026
oto-0.025
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
antly+3.133
bes+2.993
vered+2.874
 Something+2.825
nob+2.803
Neg logit contributions
ock-3.070
aff-2.997
weet-2.939
iki-2.655
ules-2.643
Token They
Token ID1119
Feature activation-2.35e+00

Pos logit contributions
ock+2.626
aff+2.528
weet+2.501
iki+2.343
andel+2.338
Neg logit contributions
bes-1.711
antly-1.618
es-1.617
house-1.549
e-1.526
Token had
Token ID550
Feature activation-3.08e-01

Pos logit contributions
aff+4.903
ock+4.850
weet+4.776
arl+4.661
andel+4.615
Neg logit contributions
e-3.596
es-3.409
bes-3.331
's-3.329
antly-3.253
Token a
Token ID257
Feature activation-9.18e-01

Pos logit contributions
aff+0.559
ock+0.554
weet+0.543
iki+0.509
aa+0.493
Neg logit contributions
bes-0.514
antly-0.494
vered-0.483
es-0.480
e-0.472
Token great
Token ID1049
Feature activation-7.64e-01

Pos logit contributions
aff+1.573
ock+1.562
weet+1.518
ules+1.405
iki+1.394
Neg logit contributions
bes-1.622
antly-1.562
house-1.526
vered-1.514
e-1.512
Token day
Token ID1110
Feature activation+3.69e-01

Pos logit contributions
aff+1.009
ock+0.984
weet+0.974
iki+0.873
ules+0.859
Neg logit contributions
bes-1.662
antly-1.597
vered-1.565
e-1.562
house-1.560
Token together
Token ID1978
Feature activation-2.44e-01

Pos logit contributions
bes+0.624
antly+0.604
vered+0.587
es+0.569
nob+0.567
Neg logit contributions
aff-0.654
ock-0.648
weet-0.644
iki-0.596
aa-0.583