TOP ACTIVATIONS
MAX = 3.684

and they were able to un pack it . Inside they
filled the hole in Tim my 's yard . Tim my
opened the box and found Lily 's favorite stuffed lion .
takes a cloth and wipes Ben 's shirt . But she
No , thank you . Cig arettes are stupid . They
opens her eyes and sees Ben with something in his mouth
not want to be a tatt let ale . She decided
big rock and put his father 's jacket on it .
opened the box and to her surprise , it was full
They all agreed and helped her open the chest . Inside
used the glove to clean her room . She wiped the
Mom smiles and wipes Mia 's face . She throws
opens her eyes and sees Tom . She rub s her
asked her dad to help her remove the lid of the
and placed something into his son 's hand , they looked
her mom my to help her order a new chair .
and gently wiped the little girl 's skin .
toy box , and to her surprise , the magnet was
her mom my to help her . " Mom
sink and mom my cleaned Lily 's shirt with soap and
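
The snippets above appear to be fixed-width windows centred on the peak token: in the first three, the highest-activation token (" un", "my", " Lily") sits at index 5 with five tokens on either side. A minimal sketch of that windowing follows, assuming a radius of 5 inferred from the listing; `window_around_peak` and its parameters are hypothetical names, not the dashboard's own code. The activations are the per-token values reported for the first snippet.

```python
def window_around_peak(tokens, activations, radius=5):
    """Return the token window centred on the highest activation.

    `radius` is an assumption read off the listing (5 tokens per side).
    Also returns the index of the peak token inside the window.
    """
    peak = max(range(len(activations)), key=lambda i: activations[i])
    start = max(0, peak - radius)
    end = min(len(tokens), peak + radius + 1)
    return tokens[start:end], peak - start


# First snippet, with the per-token feature activations from the listing.
tokens = [" and", " they", " were", " able", " to", " un",
          "pack", " it", ".", " Inside", " they"]
activations = [0.463, 0.119, 1.20, 0.505, 1.41, 3.68,
               0.981, 1.19, 0.361, 0.0572, 0.870]

window, peak_pos = window_around_peak(tokens, activations)
print("".join(window).strip(), "| peak token:", repr(window[peak_pos]))
```

Near the start or end of a document the window would simply be shorter, which may explain why some snippets in the listing have fewer tokens.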

MIN ACTIVATIONS
MIN = -4.040

bush . Every day , lots of different animals would gather
One day , a tall gir affe named Gig i went
a time there was a gir affe who loved to get
, there was a baby gir affe named Pr une .
girl named Mary . Mary dreamed of finding a diamond .
a time there was a gir affe named Sam and a
clock went " tick - t ock , tick - t
, there was a little gir affe named Gig i .
was a big , tall gir affe named Jerry . Jerry
was a monkey and a gir affe . The gir affe
, there was a tall gir affe named Jerry . Jerry
a bridge . Everyday , lots of people would cross the
One day , a tall gir affe was walking through the
adventure story , but after an hour they were both tired
Every morning , lots of boats arrived and left
just went " tick - t ock , tick - t
lively little bird named Ch ir py . Ch ir py
she asked her mom my lots of questions about marriage .
toys . Sally said yes , and they made a trade
time , there was a gir affe named Jerry . Jerry

INTERVAL 1.753 - 3.684
CONTAINS 1.017%

deliver wonderful things all around the sky . The moon smiled
juicy . They see Mr . Brown walking to his house
<|endoftext|> Once upon a time there was a little girl .
happy and he promised to always remember where he put his
charge and she had to be obedient . Later

INTERVAL -0.178 - 1.753
CONTAINS 58.800%

The mom looked up and shrugged her shoulders . " Not
. Sara thought it was a graceful cat and wanted to
The dentist fixed her tooth and Lily felt much better .
would not play in the bathroom . But it
ay on . ” That

INTERVAL -2.109 - -0.178
CONTAINS 39.801%

on before we buy it ." They examined the
" Let 's climb the tree !" Sam says .
. He put on his clothes , and his mom put
all of these le mons ?" Lily looked at the le
touch . Tim my liked it and said , " Mom

INTERVAL -4.040 - -2.109
CONTAINS 0.383%

fluffy bunny , a w add ling duck , and a
best of friends , and they loved to dance together in
there was a young , spoiled girl . She was living
His mom my said yes , but told him to be
The moral of the story is that when you share with
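
The four INTERVAL ranges are consistent with splitting [MIN, MAX] = [-4.040, 3.684] into four equal-width bins of width 1.931. This is an inference from the printed numbers, not a documented property of the tool. A short sketch that reproduces the boundaries, with `interval_edges` as a hypothetical helper name:

```python
def interval_edges(lo, hi, n_bins=4):
    """Equal-width bin edges between lo and hi, rounded to 3 decimals.

    n_bins=4 is an assumption matching the four INTERVAL blocks shown.
    """
    step = (hi - lo) / n_bins
    return [round(lo + i * step, 3) for i in range(n_bins + 1)]


edges = interval_edges(-4.040, 3.684)
print(edges)

# Reported share of activations per interval, highest interval first.
shares = [1.017, 58.800, 39.801, 0.383]
print(round(sum(shares), 3))  # close to 100; the small excess is rounding
```

The shares sum to roughly 100%, so the four intervals together appear to cover every sampled activation.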

TOKEN DETAILS

Token and
Token ID290
Feature activation+4.63e-01

Pos logit contributions
bes+2.678
vered+2.476
es+2.445
e+2.443
antly+2.410
Neg logit contributions
aff-1.993
ock-1.904
weet-1.830
iki-1.722
oto-1.667
Token they
Token ID484
Feature activation+1.19e-01

Pos logit contributions
bes+0.808
vered+0.750
antly+0.748
es+0.732
e+0.727
Neg logit contributions
aff-0.804
ock-0.801
weet-0.792
iki-0.736
oto-0.710
Token were
Token ID547
Feature activation+1.20e+00

Pos logit contributions
bes+0.193
antly+0.184
vered+0.180
es+0.176
e+0.172
Neg logit contributions
aff-0.221
ock-0.219
weet-0.216
iki-0.202
oto-0.196
Token able
Token ID1498
Feature activation+5.05e-01

Pos logit contributions
bes+2.392
antly+2.291
vered+2.273
es+2.199
 kit+2.166
Neg logit contributions
aff-1.756
weet-1.742
ock-1.734
iki-1.582
ules-1.529
Token to
Token ID284
Feature activation+1.41e+00

Pos logit contributions
bes+1.045
antly+1.002
vered+0.982
es+0.971
e+0.961
Neg logit contributions
aff-0.711
weet-0.703
ock-0.698
iki-0.640
aa-0.613
Token un
Token ID555
Feature activation+3.68e+00

Pos logit contributions
bes+2.247
es+2.150
antly+2.112
vered+2.074
 kit+1.965
Neg logit contributions
ock-2.686
aff-2.612
weet-2.557
iki-2.431
oto-2.381
Tokenpack
Token ID8002
Feature activation+9.81e-01

Pos logit contributions
bes+3.974
 Something+3.600
 elbows+3.422
vered+3.328
es+3.277
Neg logit contributions
aff-9.578
ock-8.869
aa-8.826
add-8.629
oot-8.493
Token it
Token ID340
Feature activation+1.19e+00

Pos logit contributions
bes+1.782
antly+1.669
vered+1.649
es+1.616
nob+1.613
Neg logit contributions
aff-1.614
ock-1.601
weet-1.526
iki-1.423
oto-1.394
Token.
Token ID13
Feature activation+3.61e-01

Pos logit contributions
bes+2.073
vered+1.946
antly+1.938
e+1.901
es+1.858
Neg logit contributions
ock-2.051
aff-2.017
weet-1.949
iki-1.830
oto-1.779
Token Inside
Token ID14384
Feature activation+5.72e-02

Pos logit contributions
bes+0.574
antly+0.540
vered+0.539
es+0.526
e+0.519
Neg logit contributions
aff-0.684
ock-0.667
weet-0.667
iki-0.623
ules-0.604
Token they
Token ID484
Feature activation+8.70e-01

Pos logit contributions
bes+0.086
antly+0.082
vered+0.080
es+0.079
e+0.077
Neg logit contributions
aff-0.112
ock-0.111
weet-0.110
iki-0.103
oto-0.100
Token filled
Token ID5901
Feature activation+1.55e+00

Pos logit contributions
bes+1.198
antly+1.151
vered+1.131
e+1.102
es+1.094
Neg logit contributions
ock-1.197
aff-1.193
weet-1.181
iki-1.091
oto-1.058
Token the
Token ID262
Feature activation+1.35e+00

Pos logit contributions
bes+2.780
vered+2.679
antly+2.677
 kit+2.558
es+2.556
Neg logit contributions
aff-2.655
ock-2.570
weet-2.534
oto-2.363
iki-2.275
Token hole
Token ID7604
Feature activation+1.16e+00

Pos logit contributions
bes+1.510
antly+1.449
es+1.407
vered+1.402
e+1.336
Neg logit contributions
aff-3.162
ock-3.126
weet-3.083
iki-2.901
arl-2.847
Token in
Token ID287
Feature activation+1.53e+00

Pos logit contributions
bes+1.371
antly+1.284
vered+1.249
 kit+1.179
es+1.162
Neg logit contributions
aff-2.631
ock-2.602
weet-2.562
iki-2.406
oto-2.354
Token Tim
Token ID5045
Feature activation+2.34e+00

Pos logit contributions
bes+2.979
antly+2.879
es+2.832
vered+2.819
e+2.677
Neg logit contributions
ock-2.342
weet-2.309
aff-2.260
iki-2.002
oto-1.959
Tokenmy
Token ID1820
Feature activation+3.68e+00

Pos logit contributions
bes+4.325
antly+4.043
vered+4.032
 Something+3.832
es+3.766
Neg logit contributions
weet-4.232
ock-4.199
aff-3.965
iki-3.700
aa-3.646
Token's
Token ID338
Feature activation+2.01e+00

Pos logit contributions
antly+3.914
bes+3.758
vered+3.419
 Something+3.174
 Hide+3.121
Neg logit contributions
weet-9.187
ock-9.015
aff-8.799
aa-8.214
 tour-8.086
Token yard
Token ID12699
Feature activation+1.27e+00

Pos logit contributions
es+2.926
bes+2.913
vered+2.815
antly+2.809
e+2.682
Neg logit contributions
aff-3.837
weet-3.787
ock-3.766
iki-3.755
 tour-3.614
Token.
Token ID13
Feature activation-5.89e-01

Pos logit contributions
bes+2.075
antly+2.036
vered+2.001
es+1.901
e+1.880
Neg logit contributions
aff-2.301
ock-2.235
weet-2.215
iki-2.013
oto-1.986
Token Tim
Token ID5045
Feature activation-4.43e-01

Pos logit contributions
ock+1.201
aff+1.186
weet+1.177
iki+1.106
oto+1.076
Neg logit contributions
bes-0.854
antly-0.821
es-0.791
vered-0.778
e-0.750
Tokenmy
Token ID1820
Feature activation+7.03e-01

Pos logit contributions
aff+0.792
ock+0.779
weet+0.772
iki+0.716
ules+0.697
Neg logit contributions
bes-0.743
antly-0.723
vered-0.705
es-0.704
e-0.689
Token opened
Token ID4721
Feature activation+1.48e+00

Pos logit contributions
bes+1.649
antly+1.558
vered+1.503
e+1.457
es+1.449
Neg logit contributions
aff-1.675
ock-1.661
weet-1.609
iki-1.542
ules-1.490
Token the
Token ID262
Feature activation+2.33e+00

Pos logit contributions
bes+3.155
vered+2.940
antly+2.929
es+2.863
e+2.858
Neg logit contributions
aff-2.070
ock-1.969
weet-1.920
iki-1.831
aa-1.730
Token box
Token ID3091
Feature activation+8.57e-01

Pos logit contributions
bes+3.401
vered+3.203
e+3.179
es+3.050
antly+2.967
Neg logit contributions
aff-4.954
ock-4.879
arl-4.646
weet-4.613
 tour-4.499
Token and
Token ID290
Feature activation+1.59e+00

Pos logit contributions
bes+1.866
vered+1.776
antly+1.771
e+1.727
es+1.697
Neg logit contributions
aff-1.113
ock-1.099
weet-1.059
iki-0.979
oto-0.921
Token found
Token ID1043
Feature activation+1.63e+00

Pos logit contributions
bes+2.880
vered+2.682
antly+2.625
e+2.571
 wrist+2.505
Neg logit contributions
aff-2.660
ock-2.650
weet-2.599
iki-2.458
aa-2.389
Token Lily
Token ID20037
Feature activation+3.65e+00

Pos logit contributions
bes+2.937
vered+2.663
 wrist+2.643
antly+2.603
e+2.549
Neg logit contributions
aff-2.878
ock-2.785
weet-2.696
iki-2.561
oto-2.467
Token's
Token ID338
Feature activation+1.96e+00

Pos logit contributions
antly+7.779
bes+7.410
vered+7.340
ASH+7.187
 Something+7.011
Neg logit contributions
aff-5.059
weet-5.024
ock-4.971
iki-4.731
 tour-4.341
Token favorite
Token ID4004
Feature activation+2.53e+00

Pos logit contributions
bes+2.952
antly+2.932
vered+2.889
es+2.865
e+2.718
Neg logit contributions
aff-3.843
ock-3.720
weet-3.647
iki-3.584
oto-3.522
Token stuffed
Token ID22259
Feature activation+1.04e+00

Pos logit contributions
bes+4.287
vered+4.109
antly+4.093
es+3.979
 elbows+3.809
Neg logit contributions
aff-4.714
ock-4.666
weet-4.375
oto-4.130
 tour-4.128
Token lion
Token ID18744
Feature activation+2.58e+00

Pos logit contributions
bes+1.776
antly+1.707
vered+1.679
es+1.610
e+1.591
Neg logit contributions
aff-1.878
ock-1.845
weet-1.755
iki-1.669
oto-1.662
Token.
Token ID13
Feature activation+4.57e-01

Pos logit contributions
antly+4.967
vered+4.843
bes+4.785
ASH+4.601
 Something+4.506
Neg logit contributions
aff-4.172
ock-4.005
weet-3.983
iki-3.692
oto-3.558
Token takes
Token ID2753
Feature activation+5.79e-01

Pos logit contributions
bes+1.102
antly+1.067
vered+1.050
es+1.020
e+1.018
Neg logit contributions
aff-0.915
ock-0.903
weet-0.879
iki-0.820
oto-0.796
Token a
Token ID257
Feature activation-1.06e-01

Pos logit contributions
bes+1.069
antly+1.004
vered+0.998
es+0.975
 wrist+0.952
Neg logit contributions
ock-0.960
aff-0.958
weet-0.937
iki-0.853
aa-0.848
Token cloth
Token ID16270
Feature activation-5.85e-01

Pos logit contributions
aff+0.229
ock+0.227
weet+0.225
iki+0.212
oto+0.207
Neg logit contributions
bes-0.138
antly-0.132
vered-0.128
es-0.126
e-0.122
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
aff+0.974
weet+0.954
ock+0.949
iki+0.879
ules+0.849
Neg logit contributions
bes-1.051
antly-1.025
es-1.004
vered-0.995
 kit-0.971
Token wipes
Token ID46142
Feature activation+7.36e-01

Pos logit contributions
bes+1.678
antly+1.593
vered+1.593
e+1.542
es+1.521
Neg logit contributions
ock-2.032
weet-2.004
aff-1.974
iki-1.844
aa-1.829
Token Ben
Token ID3932
Feature activation+3.57e+00

Pos logit contributions
bes+1.404
vered+1.317
antly+1.309
es+1.285
nob+1.268
Neg logit contributions
ock-1.163
aff-1.152
weet-1.122
iki-1.037
aa-1.016
Token's
Token ID338
Feature activation+1.47e+00

Pos logit contributions
bes+4.440
antly+4.403
ograph+4.107
vered+4.031
 Hide+3.998
Neg logit contributions
ock-7.881
weet-7.720
aa-7.459
aff-7.240
 wind-7.003
Token shirt
Token ID10147
Feature activation+9.36e-01

Pos logit contributions
vered+2.375
es+2.355
antly+2.343
bes+2.342
nob+2.238
Neg logit contributions
ock-2.591
aff-2.578
weet-2.558
aa-2.413
oto-2.412
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
bes+1.509
vered+1.451
antly+1.448
e+1.381
 searches+1.374
Neg logit contributions
ock-1.724
aff-1.690
weet-1.631
iki-1.506
oto-1.504
Token But
Token ID887
Feature activation+1.00e+00

Pos logit contributions
aff+0.547
ock+0.546
weet+0.535
iki+0.496
oto+0.479
Neg logit contributions
bes-0.617
antly-0.596
vered-0.581
es-0.579
e-0.561
Token she
Token ID673
Feature activation+3.03e-01

Pos logit contributions
bes+2.065
antly+1.969
vered+1.945
nob+1.919
es+1.873
Neg logit contributions
ock-1.432
weet-1.398
aff-1.382
iki-1.282
oto-1.235
TokenNo
Token ID2949
Feature activation-8.59e-01

Pos logit contributions
aff+0.177
ock+0.176
weet+0.174
iki+0.164
oto+0.160
Neg logit contributions
bes-0.105
antly-0.100
vered-0.097
es-0.096
e-0.093
Token,
Token ID11
Feature activation+8.96e-02

Pos logit contributions
aff+1.666
ock+1.632
weet+1.614
iki+1.508
ules+1.470
Neg logit contributions
bes-1.323
antly-1.280
es-1.248
vered-1.243
 kit-1.207
Token thank
Token ID5875
Feature activation+5.52e-01

Pos logit contributions
bes+0.151
antly+0.146
vered+0.142
es+0.140
e+0.137
Neg logit contributions
aff-0.160
ock-0.159
weet-0.157
iki-0.146
oto-0.142
Token you
Token ID345
Feature activation-7.24e-01

Pos logit contributions
bes+0.966
antly+0.924
vered+0.904
es+0.882
nob+0.879
Neg logit contributions
aff-0.960
ock-0.947
weet-0.942
iki-0.871
aa-0.857
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
aff+1.478
ock+1.466
weet+1.444
iki+1.371
oto+1.326
Neg logit contributions
bes-1.022
antly-0.988
es-0.970
vered-0.946
e-0.928
Token Cig
Token ID32616
Feature activation+3.56e+00

Pos logit contributions
bes+1.905
antly+1.796
vered+1.786
es+1.729
nob+1.709
Neg logit contributions
weet-2.346
aff-2.335
ock-2.265
aa-2.162
iki-2.142
Tokenarettes
Token ID13890
Feature activation-1.44e+00

Pos logit contributions
vered+5.290
bes+4.761
antly+4.667
 searches+4.506
 wrist+4.469
Neg logit contributions
aa-7.316
iki-7.055
aff-7.039
oto-6.849
ules-6.846
Token are
Token ID389
Feature activation-7.36e-01

Pos logit contributions
aff+3.138
weet+3.052
ock+2.989
iki+2.935
oto+2.825
Neg logit contributions
bes-1.841
es-1.814
antly-1.782
e-1.727
 kit-1.681
Token stupid
Token ID8531
Feature activation-1.03e-01

Pos logit contributions
aff+1.215
ock+1.203
weet+1.178
iki+1.105
ules+1.055
Neg logit contributions
bes-1.353
antly-1.296
es-1.271
vered-1.251
e-1.248
Token.
Token ID13
Feature activation+8.09e-01

Pos logit contributions
aff+0.197
ock+0.195
weet+0.193
iki+0.181
oto+0.176
Neg logit contributions
bes-0.159
antly-0.152
vered-0.149
es-0.147
e-0.143
Token They
Token ID1119
Feature activation-4.28e-01

Pos logit contributions
bes+1.450
antly+1.381
vered+1.377
es+1.334
nob+1.327
Neg logit contributions
aff-1.359
weet-1.333
ock-1.324
aa-1.219
iki-1.215
Token opens
Token ID9808
Feature activation+8.64e-01

Pos logit contributions
bes+2.547
antly+2.436
vered+2.381
e+2.294
es+2.221
Neg logit contributions
aff-3.183
ock-3.096
weet-3.028
iki-2.975
aa-2.806
Token her
Token ID607
Feature activation+1.87e+00

Pos logit contributions
bes+1.714
antly+1.629
vered+1.616
es+1.589
house+1.561
Neg logit contributions
aff-1.290
ock-1.282
weet-1.253
iki-1.159
oto-1.139
Token eyes
Token ID2951
Feature activation+3.20e-01

Pos logit contributions
bes+3.345
vered+3.238
antly+3.113
es+3.070
e+3.054
Neg logit contributions
aff-3.146
weet-2.970
iki-2.962
ock-2.878
arl-2.754
Token and
Token ID290
Feature activation+1.93e+00

Pos logit contributions
bes+0.655
antly+0.623
vered+0.619
es+0.606
e+0.601
Neg logit contributions
aff-0.459
ock-0.456
weet-0.445
iki-0.403
oto-0.387
Token sees
Token ID7224
Feature activation-2.38e-01

Pos logit contributions
bes+3.085
antly+2.871
vered+2.864
e+2.756
house+2.700
Neg logit contributions
weet-3.570
ock-3.539
aff-3.524
iki-3.381
aa-3.186
Token Ben
Token ID3932
Feature activation+3.50e+00

Pos logit contributions
aff+0.467
ock+0.463
weet+0.458
iki+0.429
aa+0.417
Neg logit contributions
bes-0.357
antly-0.342
vered-0.334
es-0.332
e-0.321
Token with
Token ID351
Feature activation+6.05e-01

Pos logit contributions
bes+6.718
antly+6.673
vered+6.303
ASH+6.295
 kit+5.944
Neg logit contributions
weet-5.838
aff-5.680
ock-5.675
iki-5.320
oto-4.924
Token something
Token ID1223
Feature activation+1.56e+00

Pos logit contributions
bes+1.004
antly+0.965
vered+0.933
es+0.923
nob+0.898
Neg logit contributions
ock-1.093
aff-1.085
weet-1.085
iki-1.003
oto-0.982
Token in
Token ID287
Feature activation+1.37e+00

Pos logit contributions
bes+3.135
antly+2.979
vered+2.956
e+2.853
nob+2.853
Neg logit contributions
aff-2.376
ock-2.357
weet-2.275
iki-2.145
oto-2.115
Token his
Token ID465
Feature activation+1.91e+00

Pos logit contributions
bes+2.671
antly+2.609
vered+2.560
es+2.514
house+2.478
Neg logit contributions
ock-2.071
weet-2.046
aff-2.008
iki-1.878
oto-1.854
Token mouth
Token ID5422
Feature activation+6.68e-01

Pos logit contributions
bes+2.311
vered+2.210
antly+2.200
es+2.184
nob+1.954
Neg logit contributions
weet-4.207
aff-4.134
ock-4.122
iki-4.116
aa-3.939
Token not
Token ID407
Feature activation-1.05e+00

Pos logit contributions
antly+2.408
bes+2.379
vered+2.370
es+2.277
 kit+2.240
Neg logit contributions
ock-1.817
aff-1.790
weet-1.770
oto-1.636
iki-1.606
Token want
Token ID765
Feature activation-7.51e-01

Pos logit contributions
aff+1.852
weet+1.761
ock+1.755
iki+1.689
ules+1.614
Neg logit contributions
bes-1.785
e-1.759
es-1.713
antly-1.696
nob-1.693
Token to
Token ID284
Feature activation+1.44e-01

Pos logit contributions
aff+1.284
ock+1.266
weet+1.250
iki+1.178
ules+1.137
Neg logit contributions
bes-1.296
antly-1.273
vered-1.229
es-1.228
e-1.215
Token be
Token ID307
Feature activation-4.73e-03

Pos logit contributions
bes+0.236
antly+0.227
vered+0.222
es+0.220
nob+0.210
Neg logit contributions
aff-0.264
ock-0.263
weet-0.260
iki-0.240
oto-0.235
Token a
Token ID257
Feature activation-2.51e-01

Pos logit contributions
aff+0.008
ock+0.008
weet+0.008
iki+0.007
oto+0.007
Neg logit contributions
bes-0.009
antly-0.008
vered-0.008
es-0.008
e-0.008
Token tatt
Token ID14399
Feature activation+3.48e+00

Pos logit contributions
aff+0.395
ock+0.393
weet+0.387
iki+0.356
oto+0.343
Neg logit contributions
bes-0.471
antly-0.457
vered-0.447
es-0.442
e-0.434
Tokenlet
Token ID1616
Feature activation-4.29e-01

Pos logit contributions
bes+6.596
vered+5.950
 searches+5.884
antly+5.583
 Something+5.570
Neg logit contributions
aff-6.254
iki-6.073
aa-5.931
ock-5.923
weet-5.914
Tokenale
Token ID1000
Feature activation-8.87e-01

Pos logit contributions
aff+0.853
ock+0.840
weet+0.837
iki+0.785
oto+0.761
Neg logit contributions
bes-0.627
antly-0.607
es-0.590
vered-0.587
e-0.574
Token.
Token ID13
Feature activation+7.80e-01

Pos logit contributions
aff+1.826
ock+1.823
weet+1.815
iki+1.704
aa+1.652
Neg logit contributions
bes-1.206
es-1.175
antly-1.168
vered-1.129
e-1.105
Token She
Token ID1375
Feature activation-2.00e-01

Pos logit contributions
bes+1.629
vered+1.563
antly+1.563
es+1.525
e+1.514
Neg logit contributions
aff-1.087
weet-1.073
ock-1.043
iki-0.975
aa-0.932
Token decided
Token ID3066
Feature activation-6.52e-01

Pos logit contributions
aff+0.350
ock+0.348
weet+0.344
iki+0.317
oto+0.308
Neg logit contributions
bes-0.342
antly-0.330
vered-0.322
es-0.319
e-0.312
Token big
Token ID1263
Feature activation+5.45e-01

Pos logit contributions
bes+1.568
es+1.504
antly+1.492
vered+1.487
e+1.450
Neg logit contributions
ock-1.531
weet-1.512
aff-1.510
iki-1.409
aa-1.385
Token rock
Token ID3881
Feature activation+3.44e-01

Pos logit contributions
bes+0.868
antly+0.832
es+0.813
vered+0.807
e+0.776
Neg logit contributions
ock-1.023
aff-1.011
weet-1.005
iki-0.937
aa-0.917
Token and
Token ID290
Feature activation+4.79e-01

Pos logit contributions
bes+0.592
antly+0.570
vered+0.556
es+0.546
e+0.533
Neg logit contributions
ock-0.598
aff-0.595
weet-0.585
iki-0.543
aa-0.527
Token put
Token ID1234
Feature activation+9.39e-01

Pos logit contributions
bes+0.802
antly+0.762
vered+0.751
es+0.744
e+0.729
Neg logit contributions
ock-0.851
weet-0.843
aff-0.843
iki-0.780
aa-0.761
Token his
Token ID465
Feature activation+7.09e-01

Pos logit contributions
bes+1.491
vered+1.367
antly+1.367
es+1.353
house+1.316
Neg logit contributions
ock-1.805
aff-1.767
weet-1.735
iki-1.618
oto-1.602
Token father
Token ID2988
Feature activation+3.48e+00

Pos logit contributions
bes+1.027
antly+0.982
es+0.981
vered+0.951
house+0.928
Neg logit contributions
ock-1.411
aff-1.385
weet-1.375
iki-1.321
oto-1.294
Token's
Token ID338
Feature activation+1.71e+00

Pos logit contributions
bes+6.379
 Something+6.001
antly+5.993
 searches+5.773
ASH+5.612
Neg logit contributions
ock-6.407
weet-6.042
aff-5.576
aa-5.534
iki-5.410
Token jacket
Token ID15224
Feature activation+1.08e+00

Pos logit contributions
bes+2.555
es+2.532
antly+2.425
house+2.340
e+2.325
Neg logit contributions
ock-3.363
weet-3.196
aff-3.190
aa-3.152
iki-3.122
Token on
Token ID319
Feature activation+8.44e-01

Pos logit contributions
bes+1.593
antly+1.500
vered+1.496
 searches+1.435
 Something+1.428
Neg logit contributions
ock-2.115
aff-2.073
weet-2.021
aa-1.904
iki-1.897
Token it
Token ID340
Feature activation+9.20e-01

Pos logit contributions
bes+1.527
antly+1.442
vered+1.438
es+1.433
 wrist+1.363
Neg logit contributions
ock-1.404
aff-1.390
weet-1.358
iki-1.233
oto-1.225
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
bes+1.482
antly+1.419
vered+1.399
es+1.352
e+1.332
Neg logit contributions
ock-1.695
aff-1.674
weet-1.652
iki-1.504
aa-1.479
Token opened
Token ID4721
Feature activation+1.16e+00

Pos logit contributions
bes+1.530
antly+1.426
vered+1.401
es+1.361
 kit+1.358
Neg logit contributions
aff-1.846
ock-1.824
weet-1.811
iki-1.721
aa-1.651
Token the
Token ID262
Feature activation+2.21e+00

Pos logit contributions
bes+2.328
antly+2.164
vered+2.147
es+2.135
e+2.100
Neg logit contributions
aff-1.699
ock-1.683
weet-1.657
iki-1.566
oto-1.484
Token box
Token ID3091
Feature activation+2.68e-01

Pos logit contributions
bes+3.070
vered+2.726
es+2.671
e+2.636
antly+2.588
Neg logit contributions
ock-4.914
aff-4.824
weet-4.707
iki-4.538
andel-4.402
Token and
Token ID290
Feature activation+1.55e+00

Pos logit contributions
bes+0.557
antly+0.535
vered+0.529
es+0.519
e+0.514
Neg logit contributions
aff-0.370
ock-0.366
weet-0.360
iki-0.330
oto-0.314
Token to
Token ID284
Feature activation+2.55e-01

Pos logit contributions
bes+2.624
vered+2.367
antly+2.332
e+2.279
 wrist+2.231
Neg logit contributions
ock-2.792
weet-2.782
aff-2.782
iki-2.652
aa-2.540
Token her
Token ID607
Feature activation+3.48e+00

Pos logit contributions
bes+0.456
antly+0.437
vered+0.429
es+0.427
e+0.410
Neg logit contributions
aff-0.428
ock-0.428
weet-0.426
iki-0.393
oto-0.381
Token surprise
Token ID5975
Feature activation+1.24e+00

Pos logit contributions
bes+7.698
es+7.083
vered+6.977
nob+6.775
antly+6.713
Neg logit contributions
iki-5.075
aff-4.609
weet-4.572
 tour-4.560
 storm-4.349
Token,
Token ID11
Feature activation+7.42e-01

Pos logit contributions
bes+2.436
vered+2.280
antly+2.268
e+2.194
nob+2.175
Neg logit contributions
aff-1.921
ock-1.878
weet-1.812
iki-1.778
oto-1.665
Token it
Token ID340
Feature activation+1.91e+00

Pos logit contributions
bes+1.474
antly+1.432
vered+1.406
es+1.372
e+1.365
Neg logit contributions
aff-1.071
weet-1.070
ock-1.060
iki-0.972
oto-0.927
Token was
Token ID373
Feature activation+2.69e-01

Pos logit contributions
bes+2.892
vered+2.687
 wrist+2.684
e+2.647
antly+2.506
Neg logit contributions
aff-3.711
ock-3.654
iki-3.524
weet-3.520
oto-3.314
Token full
Token ID1336
Feature activation-3.90e-01

Pos logit contributions
bes+0.426
antly+0.404
vered+0.401
es+0.394
e+0.382
Neg logit contributions
aff-0.509
ock-0.506
weet-0.502
iki-0.468
oto-0.457
Token They
Token ID1119
Feature activation-1.37e+00

Pos logit contributions
aff+0.142
ock+0.141
weet+0.139
iki+0.129
oto+0.125
Neg logit contributions
bes-0.154
antly-0.148
vered-0.145
es-0.144
e-0.141
Token all
Token ID477
Feature activation-1.24e+00

Pos logit contributions
aff+2.580
ock+2.568
weet+2.540
arl+2.288
andel+2.285
Neg logit contributions
es-2.174
bes-2.168
antly-2.148
vered-2.113
 kit-2.026
Token agreed
Token ID4987
Feature activation-1.81e-01

Pos logit contributions
aff+2.383
ock+2.320
weet+2.317
arl+2.109
iki+2.101
Neg logit contributions
bes-1.913
vered-1.897
antly-1.871
es-1.825
e-1.806
Token and
Token ID290
Feature activation-1.81e-01

Pos logit contributions
aff+0.283
ock+0.280
weet+0.277
iki+0.254
oto+0.245
Neg logit contributions
bes-0.345
antly-0.333
vered-0.326
es-0.324
e-0.317
Token helped
Token ID4193
Feature activation+8.15e-01

Pos logit contributions
aff+0.293
ock+0.291
weet+0.286
iki+0.263
oto+0.255
Neg logit contributions
bes-0.337
antly-0.326
vered-0.319
es-0.317
e-0.309
Token her
Token ID607
Feature activation+3.46e+00

Pos logit contributions
bes+1.639
antly+1.581
vered+1.554
es+1.517
nob+1.479
Neg logit contributions
aff-1.219
ock-1.181
weet-1.160
aa-1.061
iki-1.056
Token open
Token ID1280
Feature activation+2.31e+00

Pos logit contributions
bes+7.371
vered+6.695
ASH+6.582
antly+6.498
ograph+6.379
Neg logit contributions
iki-5.320
aff-5.107
 tour-5.037
ock-5.014
aa-4.882
Token the
Token ID262
Feature activation+1.82e+00

Pos logit contributions
bes+4.952
antly+4.687
vered+4.540
e+4.403
 Something+4.320
Neg logit contributions
aff-3.263
ock-3.142
weet-3.076
iki-2.913
andel-2.733
Token chest
Token ID7721
Feature activation+1.06e+00

Pos logit contributions
bes+1.749
vered+1.572
antly+1.554
e+1.501
es+1.457
Neg logit contributions
aff-4.717
ock-4.690
weet-4.585
oto-4.337
iki-4.311
Token.
Token ID13
Feature activation+2.04e-01

Pos logit contributions
bes+1.782
vered+1.678
antly+1.666
e+1.615
es+1.573
Neg logit contributions
aff-1.906
ock-1.879
weet-1.846
iki-1.752
oto-1.672
Token Inside
Token ID14384
Feature activation-4.21e-01

Pos logit contributions
bes+0.386
antly+0.372
vered+0.368
es+0.362
e+0.358
Neg logit contributions
aff-0.322
ock-0.314
weet-0.311
iki-0.285
aa-0.277
Token used
Token ID973
Feature activation+2.97e-01

Pos logit contributions
bes+1.240
antly+1.179
vered+1.145
es+1.128
nob+1.106
Neg logit contributions
aff-1.234
ock-1.229
weet-1.216
iki-1.145
aa-1.120
Token the
Token ID262
Feature activation+1.15e+00

Pos logit contributions
bes+0.536
antly+0.515
vered+0.503
es+0.500
e+0.487
Neg logit contributions
ock-0.492
aff-0.490
weet-0.486
iki-0.446
aa-0.433
Token glove
Token ID29144
Feature activation+5.37e-01

Pos logit contributions
bes+1.780
es+1.724
vered+1.688
antly+1.671
e+1.627
Neg logit contributions
ock-2.287
weet-2.207
aff-2.131
oto-2.094
iki-2.077
Token to
Token ID284
Feature activation-2.79e-01

Pos logit contributions
bes+1.084
antly+1.041
vered+1.026
es+1.004
e+1.000
Neg logit contributions
ock-0.757
aff-0.757
weet-0.742
iki-0.682
aa-0.661
Token clean
Token ID3424
Feature activation+1.68e+00

Pos logit contributions
aff+0.597
ock+0.588
weet+0.583
iki+0.550
oto+0.536
Neg logit contributions
bes-0.372
antly-0.352
vered-0.342
es-0.336
e-0.332
Token her
Token ID607
Feature activation+3.45e+00

Pos logit contributions
bes+3.024
antly+2.943
vered+2.943
house+2.820
es+2.812
Neg logit contributions
ock-2.786
weet-2.633
aff-2.629
oto-2.396
iki-2.373
Token room
Token ID2119
Feature activation+2.89e-01

Pos logit contributions
vered+5.607
antly+5.563
es+5.461
nob+5.302
bes+5.059
Neg logit contributions
aff-6.189
oto-6.159
ock-6.083
weet-6.027
aa-5.873
Token.
Token ID13
Feature activation-2.99e-01

Pos logit contributions
bes+0.493
antly+0.477
vered+0.467
es+0.459
e+0.452
Neg logit contributions
aff-0.499
ock-0.499
weet-0.488
iki-0.452
oto-0.440
Token She
Token ID1375
Feature activation+3.12e-01

Pos logit contributions
aff+0.478
ock+0.476
weet+0.466
iki+0.433
oto+0.420
Neg logit contributions
bes-0.558
antly-0.540
vered-0.528
es-0.523
e-0.510
Token wiped
Token ID21122
Feature activation+4.74e-01

Pos logit contributions
bes+0.476
antly+0.457
vered+0.445
es+0.437
e+0.427
Neg logit contributions
ock-0.597
aff-0.596
weet-0.587
iki-0.552
aa-0.538
Token the
Token ID262
Feature activation+1.46e+00

Pos logit contributions
bes+0.860
antly+0.807
vered+0.799
es+0.789
nob+0.771
Neg logit contributions
ock-0.794
aff-0.784
weet-0.768
iki-0.704
aa-0.691
Token\n
Token ID198
Feature activation-9.07e-02

Pos logit contributions
aff+2.187
ock+2.169
weet+2.131
iki+2.029
aa+1.975
Neg logit contributions
bes-1.326
antly-1.247
es-1.216
e-1.179
vered-1.163
TokenMom
Token ID29252
Feature activation+8.09e-02

Pos logit contributions
aff+0.188
ock+0.186
weet+0.184
iki+0.173
oto+0.169
Neg logit contributions
bes-0.128
antly-0.123
vered-0.119
es-0.118
e-0.114
Token smiles
Token ID21845
Feature activation+1.42e-02

Pos logit contributions
bes+0.121
antly+0.116
vered+0.113
es+0.111
e+0.108
Neg logit contributions
aff-0.160
ock-0.159
weet-0.157
iki-0.147
oto-0.143
Token and
Token ID290
Feature activation+3.34e-01

Pos logit contributions
bes+0.028
antly+0.027
vered+0.027
es+0.026
e+0.026
Neg logit contributions
aff-0.021
ock-0.021
weet-0.021
iki-0.019
oto-0.018
Token wipes
Token ID46142
Feature activation+3.58e-01

Pos logit contributions
bes+0.560
antly+0.537
vered+0.528
es+0.515
e+0.512
Neg logit contributions
aff-0.600
ock-0.597
weet-0.590
iki-0.549
oto-0.535
Token Mia
Token ID32189
Feature activation+3.44e+00

Pos logit contributions
bes+0.672
antly+0.637
vered+0.629
es+0.617
nob+0.607
Neg logit contributions
aff-0.577
ock-0.571
weet-0.559
iki-0.517
aa-0.500
Token's
Token ID338
Feature activation+1.83e+00

Pos logit contributions
bes+3.687
antly+3.353
 Hide+3.116
vered+3.089
ASH+3.074
Neg logit contributions
ock-8.501
weet-8.414
aa-8.053
aff-7.861
iki-7.480
Token face
Token ID1986
Feature activation+6.60e-01

Pos logit contributions
es+2.755
bes+2.693
vered+2.683
antly+2.640
house+2.531
Neg logit contributions
ock-3.541
weet-3.469
aff-3.408
arl-3.220
oto-3.213
Token.
Token ID13
Feature activation-7.62e-01

Pos logit contributions
bes+1.059
vered+0.990
antly+0.990
es+0.949
e+0.937
Neg logit contributions
ock-1.223
aff-1.221
weet-1.196
iki-1.102
aa-1.090
Token She
Token ID1375
Feature activation-2.43e-01

Pos logit contributions
ock+1.272
aff+1.264
weet+1.222
iki+1.131
oto+1.108
Neg logit contributions
bes-1.379
antly-1.343
es-1.306
vered-1.294
 Something-1.271
Token throws
Token ID12542
Feature activation+4.03e-01

Pos logit contributions
aff+0.449
ock+0.445
weet+0.440
iki+0.409
aa+0.397
Neg logit contributions
bes-0.393
antly-0.378
vered-0.368
es-0.366
e-0.355
Token opens
Token ID9808
Feature activation+1.35e+00

Pos logit contributions
bes+2.683
vered+2.507
antly+2.473
e+2.400
 kit+2.364
Neg logit contributions
aff-3.366
ock-3.345
weet-3.342
iki-3.225
aa-3.064
Token her
Token ID607
Feature activation+2.09e+00

Pos logit contributions
bes+2.787
vered+2.654
antly+2.641
es+2.575
house+2.550
Neg logit contributions
aff-1.888
ock-1.873
weet-1.831
iki-1.702
oto-1.686
Token eyes
Token ID2951
Feature activation+4.55e-01

Pos logit contributions
vered+3.917
bes+3.893
es+3.733
antly+3.719
e+3.616
Neg logit contributions
aff-3.222
weet-3.187
ock-3.117
iki-3.112
aa-2.999
Token and
Token ID290
Feature activation+2.01e+00

Pos logit contributions
bes+0.955
vered+0.907
antly+0.906
es+0.882
e+0.878
Neg logit contributions
aff-0.628
ock-0.627
weet-0.610
iki-0.549
aa-0.529
Token sees
Token ID7224
Feature activation+1.79e-01

Pos logit contributions
bes+3.282
vered+3.153
antly+3.066
e+2.974
house+2.971
Neg logit contributions
weet-3.622
ock-3.549
aff-3.499
iki-3.381
aa-3.279
Token Tom
Token ID4186
Feature activation+3.44e+00

Pos logit contributions
bes+0.314
antly+0.303
vered+0.296
es+0.290
e+0.285
Neg logit contributions
aff-0.307
ock-0.305
weet-0.301
iki-0.278
oto-0.271
Token.
Token ID13
Feature activation-1.48e-01

Pos logit contributions
bes+6.541
antly+6.342
vered+6.171
ASH+6.139
 Something+5.812
Neg logit contributions
weet-5.953
ock-5.874
aff-5.553
iki-5.346
aa-5.042
Token She
Token ID1375
Feature activation+1.40e+00

Pos logit contributions
aff+0.252
ock+0.251
weet+0.247
iki+0.229
oto+0.221
Neg logit contributions
bes-0.261
antly-0.251
vered-0.245
es-0.243
e-0.237
Token rub
Token ID6437
Feature activation+7.56e-01

Pos logit contributions
bes+2.089
antly+1.992
vered+1.981
e+1.897
 kit+1.896
Neg logit contributions
aff-2.739
weet-2.687
ock-2.660
iki-2.573
oto-2.458
Tokens
Token ID82
Feature activation+2.30e+00

Pos logit contributions
bes+1.229
antly+1.134
vered+1.130
nob+1.109
 wrist+1.094
Neg logit contributions
ock-1.408
aff-1.392
weet-1.354
aa-1.280
iki-1.273
Token her
Token ID607
Feature activation+2.23e+00

Pos logit contributions
bes+4.490
house+4.217
vered+4.160
antly+4.026
ograph+4.021
Neg logit contributions
ock-3.726
weet-3.444
aa-3.177
aff-3.166
oto-3.163
Token asked
Token ID1965
Feature activation-1.59e-01

Pos logit contributions
bes+1.798
antly+1.651
vered+1.609
 wrist+1.606
es+1.558
Neg logit contributions
aff-2.224
ock-2.212
weet-2.175
iki-2.091
aa-2.041
Token her
Token ID607
Feature activation+5.11e-01

Pos logit contributions
aff+0.271
ock+0.268
weet+0.265
iki+0.246
oto+0.237
Neg logit contributions
bes-0.284
antly-0.274
vered-0.268
es-0.267
e-0.260
Token dad
Token ID9955
Feature activation+2.29e-01

Pos logit contributions
bes+0.868
antly+0.823
vered+0.815
e+0.785
es+0.779
Neg logit contributions
aff-0.931
ock-0.917
weet-0.910
iki-0.864
oto-0.841
Token to
Token ID284
Feature activation+1.71e+00

Pos logit contributions
bes+0.440
antly+0.420
vered+0.413
es+0.403
e+0.399
Neg logit contributions
aff-0.359
ock-0.355
weet-0.350
iki-0.322
oto-0.312
Token help
Token ID1037
Feature activation+1.67e+00

Pos logit contributions
bes+3.251
antly+3.042
es+3.029
vered+3.011
house+2.730
Neg logit contributions
ock-2.932
weet-2.866
aff-2.736
iki-2.601
oto-2.597
Token her
Token ID607
Feature activation+3.43e+00

Pos logit contributions
bes+3.401
antly+3.354
vered+3.323
es+3.142
 wrist+3.070
Neg logit contributions
ock-2.470
aff-2.369
weet-2.333
aa-2.114
oto-2.049
Token remove
Token ID4781
Feature activation+7.48e-01

Pos logit contributions
bes+7.686
vered+7.099
antly+6.857
ASH+6.849
es+6.740
Neg logit contributions
ock-4.836
 tour-4.314
oto-4.266
weet-4.263
iki-4.240
Token the
Token ID262
Feature activation+2.17e+00

Pos logit contributions
bes+1.416
antly+1.361
vered+1.327
es+1.301
e+1.286
Neg logit contributions
ock-1.183
aff-1.176
weet-1.136
iki-1.040
andel-1.016
Token lid
Token ID19789
Feature activation-1.23e-01

Pos logit contributions
bes+2.971
antly+2.871
vered+2.866
e+2.811
es+2.718
Neg logit contributions
ock-4.662
aff-4.642
weet-4.459
 tour-4.214
 storm-4.156
Token of
Token ID286
Feature activation+3.46e-01

Pos logit contributions
aff+0.237
ock+0.235
weet+0.233
iki+0.218
oto+0.212
Neg logit contributions
bes-0.189
antly-0.181
vered-0.176
es-0.175
e-0.170
Token the
Token ID262
Feature activation+2.16e+00

Pos logit contributions
bes+0.626
antly+0.610
vered+0.594
es+0.584
e+0.572
Neg logit contributions
aff-0.571
ock-0.569
weet-0.563
iki-0.514
oto-0.498
Token and
Token ID290
Feature activation+1.19e+00

Pos logit contributions
aff+0.355
ock+0.351
weet+0.346
iki+0.317
oto+0.306
Neg logit contributions
bes-0.453
antly-0.438
vered-0.429
es-0.427
e-0.417
Token placed
Token ID4624
Feature activation+4.75e-01

Pos logit contributions
bes+2.408
antly+2.280
vered+2.275
e+2.190
nob+2.187
Neg logit contributions
aff-1.704
weet-1.662
ock-1.654
iki-1.527
aa-1.502
Token something
Token ID1223
Feature activation+1.40e+00

Pos logit contributions
bes+0.808
antly+0.764
vered+0.754
es+0.739
nob+0.723
Neg logit contributions
aff-0.839
ock-0.837
weet-0.823
iki-0.764
oto-0.750
Token into
Token ID656
Feature activation+9.95e-01

Pos logit contributions
bes+2.519
vered+2.333
antly+2.315
e+2.275
es+2.272
Neg logit contributions
aff-2.376
ock-2.351
weet-2.237
iki-2.127
oto-2.111
Token his
Token ID465
Feature activation+1.90e+00

Pos logit contributions
bes+1.799
antly+1.728
vered+1.704
es+1.656
nob+1.630
Neg logit contributions
weet-1.650
ock-1.642
aff-1.637
iki-1.464
oto-1.462
Token son
Token ID3367
Feature activation+3.43e+00

Pos logit contributions
es+2.558
bes+2.535
antly+2.487
vered+2.453
nob+2.358
Neg logit contributions
aff-3.935
ock-3.865
weet-3.849
aa-3.738
iki-3.714
Token's
Token ID338
Feature activation+2.12e+00

Pos logit contributions
bes+5.070
antly+4.838
vered+4.805
ASH+4.466
 searches+4.260
Neg logit contributions
aff-7.313
weet-7.308
ock-7.099
aa-6.804
iki-6.712
Token hand
Token ID1021
Feature activation+1.31e+00

Pos logit contributions
es+2.884
antly+2.819
bes+2.811
vered+2.667
nob+2.602
Neg logit contributions
aff-4.392
ock-4.314
weet-4.259
aa-4.152
andel-4.080
Token,
Token ID11
Feature activation+6.11e-01

Pos logit contributions
bes+2.284
vered+2.198
antly+2.160
 searches+2.080
es+2.055
Neg logit contributions
aff-2.261
ock-2.247
weet-2.212
iki-2.018
aa-2.000
Token they
Token ID484
Feature activation-7.93e-02

Pos logit contributions
bes+1.110
antly+1.071
vered+1.063
es+1.038
e+1.010
Neg logit contributions
aff-0.999
weet-0.992
ock-0.985
iki-0.903
oto-0.883
Token looked
Token ID3114
Feature activation-1.12e-01

Pos logit contributions
aff+0.137
ock+0.136
weet+0.134
iki+0.124
oto+0.120
Neg logit contributions
bes-0.138
antly-0.132
vered-0.130
es-0.128
e-0.125
Token her
Token ID607
Feature activation+5.51e-01

Pos logit contributions
aff+0.559
ock+0.550
weet+0.544
iki+0.508
oto+0.488
Neg logit contributions
bes-0.565
antly-0.548
vered-0.536
es-0.535
e-0.520
Token mom
Token ID1995
Feature activation-1.71e-01

Pos logit contributions
bes+0.945
antly+0.895
vered+0.886
e+0.857
es+0.849
Neg logit contributions
aff-0.998
ock-0.987
weet-0.977
iki-0.923
oto-0.904
Tokenmy
Token ID1820
Feature activation+2.98e-01

Pos logit contributions
aff+0.271
ock+0.267
weet+0.264
iki+0.243
oto+0.235
Neg logit contributions
bes-0.322
antly-0.312
vered-0.305
es-0.304
e-0.296
Token to
Token ID284
Feature activation+1.97e+00

Pos logit contributions
bes+0.619
antly+0.594
vered+0.585
es+0.572
e+0.569
Neg logit contributions
aff-0.418
ock-0.417
weet-0.409
iki-0.368
oto-0.356
Token help
Token ID1037
Feature activation+1.60e+00

Pos logit contributions
bes+3.626
antly+3.460
es+3.449
vered+3.436
ASH+3.015
Neg logit contributions
ock-3.533
weet-3.366
aff-3.186
oto-2.991
arl-2.966
Token her
Token ID607
Feature activation+3.42e+00

Pos logit contributions
bes+3.168
antly+3.120
vered+3.105
es+2.957
 Something+2.850
Neg logit contributions
ock-2.431
aff-2.320
weet-2.300
aa-2.101
andel-2.010
Token order
Token ID1502
Feature activation+6.49e-01

Pos logit contributions
bes+7.290
vered+6.929
antly+6.847
es+6.717
ASH+6.576
Neg logit contributions
ock-5.227
weet-4.546
aff-4.290
oto-4.186
 tour-4.142
Token a
Token ID257
Feature activation+1.28e+00

Pos logit contributions
bes+1.102
antly+1.051
vered+1.048
es+1.016
e+1.004
Neg logit contributions
ock-1.152
aff-1.138
weet-1.115
iki-1.021
oto-1.009
Token new
Token ID649
Feature activation+1.40e+00

Pos logit contributions
bes+2.100
es+1.947
vered+1.938
antly+1.930
e+1.869
Neg logit contributions
ock-2.470
weet-2.418
aff-2.375
iki-2.227
aa-2.211
Token chair
Token ID5118
Feature activation-1.92e-01

Pos logit contributions
bes+1.951
antly+1.856
vered+1.821
es+1.808
e+1.725
Neg logit contributions
ock-3.005
weet-2.906
aff-2.819
iki-2.714
oto-2.692
Token.
Token ID13
Feature activation+8.31e-01

Pos logit contributions
aff+0.365
ock+0.362
weet+0.358
iki+0.335
oto+0.326
Neg logit contributions
bes-0.298
antly-0.285
vered-0.278
es-0.276
e-0.269
Token and
Token ID290
Feature activation+1.51e+00

Pos logit contributions
aff+0.386
ock+0.381
weet+0.377
iki+0.346
oto+0.334
Neg logit contributions
bes-0.489
antly-0.472
vered-0.463
es-0.460
e-0.449
Token gently
Token ID15165
Feature activation+1.58e+00

Pos logit contributions
bes+2.699
vered+2.524
antly+2.521
e+2.454
nob+2.428
Neg logit contributions
ock-2.478
weet-2.461
aff-2.402
iki-2.270
aa-2.269
Token wiped
Token ID21122
Feature activation+1.35e+00

Pos logit contributions
bes+2.483
antly+2.374
vered+2.330
es+2.276
 wrist+2.247
Neg logit contributions
ock-2.885
aff-2.836
weet-2.801
aa-2.736
iki-2.668
Token the
Token ID262
Feature activation+3.39e-01

Pos logit contributions
bes+2.439
vered+2.198
 wrist+2.179
es+2.178
house+2.174
Neg logit contributions
ock-2.360
aff-2.246
weet-2.200
iki-2.050
aa-2.047
Token little
Token ID1310
Feature activation+1.63e+00

Pos logit contributions
bes+0.539
antly+0.519
vered+0.508
es+0.507
e+0.493
Neg logit contributions
ock-0.633
aff-0.631
weet-0.622
iki-0.581
aa-0.567
Token girl
Token ID2576
Feature activation+3.41e+00

Pos logit contributions
bes+3.445
es+3.273
antly+3.257
vered+3.205
e+3.205
Neg logit contributions
ock-2.288
aff-2.207
weet-2.174
iki-2.031
aa-1.983
Token's
Token ID338
Feature activation+1.63e+00

Pos logit contributions
bes+4.428
ASH+4.047
antly+3.958
 Hide+3.938
es+3.786
Neg logit contributions
ock-7.620
weet-7.601
aa-7.141
aff-7.106
iki-6.970
Token skin
Token ID4168
Feature activation+1.08e+00

Pos logit contributions
bes+2.275
es+2.261
house+2.176
vered+2.144
antly+2.133
Neg logit contributions
ock-3.323
weet-3.222
aff-3.185
aa-3.113
iki-3.109
Token.
Token ID13
Feature activation+4.55e-01

Pos logit contributions
bes+1.772
antly+1.670
vered+1.647
 searches+1.600
es+1.595
Neg logit contributions
ock-1.992
aff-1.912
weet-1.900
aa-1.745
iki-1.744
Token\n
Token ID198
Feature activation-4.09e-01

Pos logit contributions
bes+0.743
antly+0.701
vered+0.697
es+0.682
e+0.670
Neg logit contributions
aff-0.841
weet-0.824
ock-0.820
iki-0.766
oto-0.744
Token\n
Token ID198
Feature activation+2.85e-02

Pos logit contributions
aff+0.865
ock+0.861
weet+0.850
iki+0.801
aa+0.780
Neg logit contributions
bes-0.547
antly-0.525
es-0.502
vered-0.501
e-0.487
Token toy
Token ID13373
Feature activation+9.96e-01

Pos logit contributions
bes+1.600
vered+1.523
antly+1.516
es+1.506
e+1.405
Neg logit contributions
aff-2.171
ock-2.151
weet-2.133
iki-1.927
oto-1.919
Token box
Token ID3091
Feature activation-5.66e-01

Pos logit contributions
bes+1.150
vered+1.088
antly+1.067
e+0.974
es+0.958
Neg logit contributions
aff-2.317
ock-2.309
weet-2.241
iki-2.102
oto-2.061
Token,
Token ID11
Feature activation+5.50e-02

Pos logit contributions
aff+1.005
ock+0.983
weet+0.972
iki+0.910
aa+0.888
Neg logit contributions
bes-0.962
antly-0.930
es-0.908
vered-0.898
house-0.885
Token and
Token ID290
Feature activation+7.49e-01

Pos logit contributions
bes+0.096
antly+0.093
vered+0.091
es+0.090
e+0.087
Neg logit contributions
aff-0.094
ock-0.094
weet-0.093
iki-0.086
oto-0.083
Token to
Token ID284
Feature activation-3.16e-01

Pos logit contributions
bes+1.452
antly+1.405
vered+1.385
e+1.349
es+1.322
Neg logit contributions
aff-1.123
weet-1.123
ock-1.120
iki-0.999
aa-0.983
Token her
Token ID607
Feature activation+3.40e+00

Pos logit contributions
aff+0.559
ock+0.550
weet+0.541
iki+0.507
aa+0.492
Neg logit contributions
bes-0.541
antly-0.520
vered-0.510
es-0.502
e-0.497
Token surprise
Token ID5975
Feature activation+1.07e+00

Pos logit contributions
bes+7.728
vered+7.069
antly+6.910
es+6.889
 elbows+6.695
Neg logit contributions
aff-4.612
iki-4.601
 tour-4.249
weet-4.247
ock-4.136
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
bes+2.005
vered+1.911
antly+1.898
e+1.811
es+1.810
Neg logit contributions
ock-1.728
aff-1.723
weet-1.652
iki-1.584
oto-1.535
Token the
Token ID262
Feature activation+2.02e+00

Pos logit contributions
bes+2.012
antly+1.971
vered+1.953
e+1.896
es+1.885
Neg logit contributions
weet-1.419
aff-1.401
ock-1.391
iki-1.264
oto-1.226
Token magnet
Token ID19972
Feature activation+1.58e+00

Pos logit contributions
bes+3.377
es+3.343
antly+3.307
vered+3.258
e+3.255
Neg logit contributions
ock-3.709
aff-3.702
weet-3.665
 hurricane-3.419
 wind-3.361
Token was
Token ID373
Feature activation+9.05e-01

Pos logit contributions
bes+2.604
antly+2.422
vered+2.406
e+2.326
 wrist+2.324
Neg logit contributions
aff-2.868
ock-2.798
weet-2.736
iki-2.652
aa-2.548
Token her
Token ID607
Feature activation+3.60e-01

Pos logit contributions
aff+1.318
ock+1.291
weet+1.280
iki+1.209
ules+1.162
Neg logit contributions
bes-1.190
antly-1.161
es-1.152
vered-1.132
e-1.110
Token mom
Token ID1995
Feature activation-4.95e-01

Pos logit contributions
bes+0.594
antly+0.567
vered+0.560
es+0.540
e+0.536
Neg logit contributions
aff-0.668
ock-0.657
weet-0.652
iki-0.614
oto-0.601
Tokenmy
Token ID1820
Feature activation-2.54e-01

Pos logit contributions
aff+0.830
ock+0.817
weet+0.807
iki+0.752
aa+0.726
Neg logit contributions
bes-0.866
antly-0.852
es-0.835
vered-0.822
e-0.804
Token to
Token ID284
Feature activation+1.21e+00

Pos logit contributions
aff+0.398
ock+0.394
weet+0.389
iki+0.359
oto+0.346
Neg logit contributions
bes-0.478
antly-0.465
vered-0.454
es-0.454
e-0.441
Token help
Token ID1037
Feature activation+1.26e+00

Pos logit contributions
bes+2.097
antly+1.998
vered+1.996
es+1.976
nob+1.787
Neg logit contributions
ock-2.191
weet-2.160
aff-2.143
oto-1.954
iki-1.943
Token her
Token ID607
Feature activation+3.39e+00

Pos logit contributions
bes+2.490
antly+2.455
vered+2.448
es+2.334
 wrist+2.238
Neg logit contributions
ock-1.872
aff-1.839
weet-1.815
aa-1.640
oto-1.590
Token.
Token ID13
Feature activation+9.14e-01

Pos logit contributions
bes+7.320
vered+7.141
antly+6.805
 elbows+6.625
ASH+6.584
Neg logit contributions
ock-4.742
aff-4.545
 tour-4.344
weet-4.306
oto-4.260
Token\n
Token ID198
Feature activation-1.15e-01

Pos logit contributions
bes+1.564
vered+1.523
antly+1.470
es+1.467
e+1.440
Neg logit contributions
aff-1.634
weet-1.568
ock-1.560
iki-1.456
aa-1.442
Token\n
Token ID198
Feature activation+1.36e+00

Pos logit contributions
aff+0.237
ock+0.236
weet+0.233
iki+0.219
oto+0.213
Neg logit contributions
bes-0.160
antly-0.153
vered-0.148
es-0.147
e-0.142
Token"
Token ID1
Feature activation+8.05e-01

Pos logit contributions
bes+2.252
vered+2.213
antly+2.154
es+2.132
 searches+2.076
Neg logit contributions
ock-2.495
aff-2.474
weet-2.472
aa-2.260
iki-2.232
TokenMom
Token ID29252
Feature activation+3.54e-01

Pos logit contributions
bes+1.052
vered+0.994
es+0.980
antly+0.978
e+0.935
Neg logit contributions
ock-1.770
aff-1.750
weet-1.739
iki-1.633
oto-1.614
Token sink
Token ID14595
Feature activation+7.63e-01

Pos logit contributions
bes+2.401
vered+2.322
es+2.301
antly+2.291
e+2.197
Neg logit contributions
weet-2.234
ock-2.228
aff-2.225
iki-2.020
oto-2.002
Token and
Token ID290
Feature activation+1.05e+00

Pos logit contributions
bes+1.591
antly+1.542
vered+1.533
es+1.477
e+1.472
Neg logit contributions
ock-1.042
aff-1.028
weet-1.003
iki-0.904
aa-0.888
Token mom
Token ID1995
Feature activation+1.57e+00

Pos logit contributions
bes+1.843
vered+1.768
antly+1.755
es+1.728
e+1.647
Neg logit contributions
ock-1.777
weet-1.758
aff-1.725
iki-1.609
aa-1.609
Tokenmy
Token ID1820
Feature activation+2.09e+00

Pos logit contributions
bes+3.417
vered+3.246
 wrist+3.154
antly+3.129
e+3.021
Neg logit contributions
ock-2.357
weet-2.181
aff-2.100
aa-1.967
ules-1.913
Token cleaned
Token ID20750
Feature activation+1.77e+00

Pos logit contributions
bes+3.976
vered+3.790
antly+3.735
 wrist+3.726
e+3.523
Neg logit contributions
ock-3.389
aff-3.184
weet-3.164
aa-3.135
ules-2.981
Token Lily
Token ID20037
Feature activation+3.39e+00

Pos logit contributions
bes+3.322
vered+3.201
antly+3.094
 wrist+3.070
es+3.031
Neg logit contributions
ock-2.830
aff-2.790
weet-2.634
aa-2.514
iki-2.391
Token's
Token ID338
Feature activation+2.57e+00

Pos logit contributions
antly+4.796
vered+4.659
 Something+4.448
bes+4.447
 Hide+4.330
Neg logit contributions
ock-6.969
weet-6.650
aff-6.353
aa-6.244
ashing-6.102
Token shirt
Token ID10147
Feature activation+3.81e-01

Pos logit contributions
antly+4.293
es+4.269
vered+4.099
e+3.856
bes+3.816
Neg logit contributions
aff-4.583
weet-4.425
ock-4.374
aa-4.322
oto-4.305
Token with
Token ID351
Feature activation+2.76e-01

Pos logit contributions
bes+0.579
antly+0.557
vered+0.554
es+0.532
e+0.527
Neg logit contributions
ock-0.734
aff-0.730
weet-0.712
iki-0.662
oto-0.653
Token soap
Token ID19533
Feature activation-2.66e-01

Pos logit contributions
bes+0.446
antly+0.428
vered+0.418
es+0.416
e+0.403
Neg logit contributions
ock-0.505
aff-0.504
weet-0.497
iki-0.460
aa-0.450
Token and
Token ID290
Feature activation+1.15e+00

Pos logit contributions
aff+0.533
ock+0.528
weet+0.526
iki+0.493
oto+0.479
Neg logit contributions
bes-0.388
antly-0.371
vered-0.359
es-0.359
e-0.345
Token bush
Token ID24484
Feature activation-1.25e+00

Pos logit contributions
ock+3.659
aff+3.614
weet+3.588
ules+3.339
iki+3.326
Neg logit contributions
bes-2.886
house-2.848
antly-2.716
 wrist-2.710
e-2.695
Token.
Token ID13
Feature activation-2.80e-01

Pos logit contributions
aff+2.765
ock+2.725
weet+2.702
iki+2.551
oto+2.506
Neg logit contributions
bes-1.587
house-1.516
antly-1.494
es-1.492
e-1.466
Token Every
Token ID3887
Feature activation-2.18e+00

Pos logit contributions
aff+0.528
ock+0.525
weet+0.514
iki+0.485
oto+0.471
Neg logit contributions
bes-0.443
antly-0.424
vered-0.414
es-0.410
e-0.403
Token day
Token ID1110
Feature activation+4.07e-01

Pos logit contributions
aff+3.384
ock+2.865
weet+2.855
ules+2.625
andel+2.500
Neg logit contributions
antly-4.625
e-4.528
house-4.438
bes-4.432
nob-4.344
Token,
Token ID11
Feature activation+3.08e-02

Pos logit contributions
bes+0.727
antly+0.684
vered+0.677
es+0.661
nob+0.647
Neg logit contributions
aff-0.693
ock-0.691
weet-0.685
iki-0.627
aa-0.612
Token lots
Token ID6041
Feature activation-4.04e+00

Pos logit contributions
bes+0.049
antly+0.047
vered+0.046
es+0.045
e+0.044
Neg logit contributions
aff-0.058
ock-0.057
weet-0.057
iki-0.053
oto-0.051
Token of
Token ID286
Feature activation-1.03e+00

Pos logit contributions
aff+7.431
arl+7.286
iki+6.900
oto+6.671
weet+6.608
Neg logit contributions
e-7.602
es-7.534
's-7.467
house-7.045
vered-6.686
Token different
Token ID1180
Feature activation-6.61e-01

Pos logit contributions
aff+1.817
ock+1.762
weet+1.720
iki+1.615
oto+1.576
Neg logit contributions
bes-1.770
vered-1.688
es-1.671
antly-1.665
e-1.664
Token animals
Token ID4695
Feature activation-1.24e+00

Pos logit contributions
aff+1.210
ock+1.209
weet+1.190
iki+1.104
oto+1.085
Neg logit contributions
bes-1.075
antly-1.020
vered-1.018
es-1.015
e-0.971
Token would
Token ID561
Feature activation-1.44e+00

Pos logit contributions
aff+2.517
ock+2.473
weet+2.448
iki+2.294
oto+2.280
Neg logit contributions
es-1.765
bes-1.761
e-1.706
vered-1.699
antly-1.691
Token gather
Token ID6431
Feature activation-1.69e-01

Pos logit contributions
aff+3.008
ock+2.953
weet+2.851
iki+2.747
aa+2.699
Neg logit contributions
bes-2.007
e-1.888
vered-1.812
nob-1.807
antly-1.802
TokenOne
Token ID3198
Feature activation-1.30e+00

Pos logit contributions
bes+1.683
vered+1.604
antly+1.567
 wrist+1.520
es+1.484
Neg logit contributions
aff-1.877
ock-1.856
weet-1.804
iki-1.685
oto-1.650
Token day
Token ID1110
Feature activation+2.73e-01

Pos logit contributions
aff+1.696
ock+1.523
weet+1.476
iki+1.334
ules+1.314
Neg logit contributions
bes-2.874
e-2.864
antly-2.847
house-2.818
vered-2.788
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
bes+0.480
antly+0.460
vered+0.451
es+0.442
nob+0.434
Neg logit contributions
aff-0.464
ock-0.459
weet-0.453
iki-0.422
aa-0.411
Token a
Token ID257
Feature activation-9.40e-01

Pos logit contributions
bes+1.619
antly+1.512
vered+1.468
es+1.467
 kit+1.428
Neg logit contributions
aff-1.897
weet-1.876
ock-1.871
iki-1.784
aa-1.728
Token tall
Token ID7331
Feature activation-2.41e+00

Pos logit contributions
ock+1.482
aff+1.480
weet+1.449
iki+1.318
ules+1.305
Neg logit contributions
bes-1.738
antly-1.696
e-1.632
house-1.630
vered-1.629
Token gir
Token ID37370
Feature activation-4.00e+00

Pos logit contributions
ock+4.929
weet+4.888
aff+4.794
ules+4.563
iki+4.438
Neg logit contributions
house-3.506
bes-3.393
 wrist-3.304
 wound-3.278
antly-3.256
Tokenaffe
Token ID21223
Feature activation+2.59e-01

Pos logit contributions
weet+5.364
ock+5.035
ules+4.889
add+4.870
iki+4.756
Neg logit contributions
es-8.902
e-8.885
oc-8.833
house-8.673
's-8.346
Token named
Token ID3706
Feature activation-1.36e+00

Pos logit contributions
bes+0.528
antly+0.508
vered+0.497
es+0.489
 wrist+0.480
Neg logit contributions
aff-0.372
ock-0.369
weet-0.364
iki-0.331
aa-0.319
Token Gig
Token ID19525
Feature activation-1.13e+00

Pos logit contributions
ock+2.213
aff+2.135
weet+2.088
iki+1.980
ules+1.938
Neg logit contributions
bes-2.450
es-2.387
antly-2.384
e-2.364
nob-2.347
Tokeni
Token ID72
Feature activation+1.93e-01

Pos logit contributions
ock+1.618
weet+1.582
aff+1.575
andel+1.375
ules+1.372
Neg logit contributions
bes-2.273
antly-2.229
es-2.216
vered-2.179
e-2.161
Token went
Token ID1816
Feature activation-2.28e+00

Pos logit contributions
bes+0.284
antly+0.270
vered+0.263
es+0.256
nob+0.250
Neg logit contributions
aff-0.385
ock-0.381
weet-0.377
iki-0.354
aa-0.345
Token a
Token ID257
Feature activation-5.22e-01

Pos logit contributions
bes+0.625
antly+0.602
vered+0.587
es+0.571
 kit+0.550
Neg logit contributions
ock-0.871
weet-0.864
aff-0.860
iki-0.795
oto-0.788
Token time
Token ID640
Feature activation+3.05e-01

Pos logit contributions
aff+0.722
ock+0.707
weet+0.696
iki+0.629
ules+0.609
Neg logit contributions
bes-1.091
antly-1.053
vered-1.034
e-1.027
es-1.021
Token there
Token ID612
Feature activation+1.37e+00

Pos logit contributions
bes+0.541
antly+0.520
vered+0.508
es+0.496
nob+0.487
Neg logit contributions
aff-0.516
ock-0.511
weet-0.506
iki-0.473
oto-0.454
Token was
Token ID373
Feature activation-2.96e-01

Pos logit contributions
bes+1.768
antly+1.547
vered+1.519
 wrist+1.494
 kit+1.469
Neg logit contributions
ock-3.065
aff-3.024
weet-2.979
iki-2.923
oto-2.810
Token a
Token ID257
Feature activation-8.76e-01

Pos logit contributions
aff+0.644
ock+0.636
weet+0.627
iki+0.594
aa+0.578
Neg logit contributions
bes-0.388
antly-0.368
vered-0.357
es-0.352
e-0.348
Token gir
Token ID37370
Feature activation-4.00e+00

Pos logit contributions
ock+1.333
aff+1.325
weet+1.296
iki+1.172
ules+1.153
Neg logit contributions
bes-1.681
antly-1.624
e-1.577
house-1.575
vered-1.570
Tokenaffe
Token ID21223
Feature activation-2.54e-01

Pos logit contributions
weet+4.726
arl+4.569
ock+4.444
add+4.362
iki+4.246
Neg logit contributions
e-9.514
es-9.370
house-9.050
oc-8.977
's-8.752
Token who
Token ID508
Feature activation+3.73e-01

Pos logit contributions
aff+0.412
ock+0.406
weet+0.400
iki+0.371
oto+0.359
Neg logit contributions
bes-0.468
antly-0.451
es-0.441
vered-0.441
e-0.433
Token loved
Token ID6151
Feature activation-1.68e+00

Pos logit contributions
bes+0.640
antly+0.617
vered+0.600
es+0.592
 wrist+0.578
Neg logit contributions
aff-0.650
ock-0.644
weet-0.634
iki-0.590
aa-0.575
Token to
Token ID284
Feature activation-7.83e-01

Pos logit contributions
ock+3.054
aff+3.001
weet+2.826
iki+2.808
arl+2.688
Neg logit contributions
bes-2.779
es-2.602
vered-2.554
e-2.553
 kit-2.550
Token get
Token ID651
Feature activation-3.59e-01

Pos logit contributions
aff+1.532
ock+1.517
weet+1.479
iki+1.408
aa+1.368
Neg logit contributions
bes-1.194
antly-1.105
e-1.102
vered-1.090
es-1.082
Token,
Token ID11
Feature activation+6.11e-01

Pos logit contributions
bes+0.095
antly+0.092
vered+0.090
es+0.088
e+0.086
Neg logit contributions
aff-0.097
ock-0.097
weet-0.095
iki-0.089
oto-0.086
Token there
Token ID612
Feature activation+1.38e+00

Pos logit contributions
bes+0.813
antly+0.755
vered+0.743
es+0.717
nob+0.700
Neg logit contributions
ock-1.299
aff-1.295
weet-1.287
iki-1.221
aa-1.183
Token was
Token ID373
Feature activation-2.52e-01

Pos logit contributions
bes+1.769
antly+1.560
vered+1.550
 wrist+1.499
nob+1.469
Neg logit contributions
ock-3.077
aff-3.006
weet-2.990
iki-2.910
oto-2.816
Token a
Token ID257
Feature activation-7.81e-01

Pos logit contributions
aff+0.546
ock+0.541
weet+0.533
iki+0.505
aa+0.492
Neg logit contributions
bes-0.330
antly-0.313
vered-0.303
es-0.300
e-0.294
Token baby
Token ID5156
Feature activation-2.26e-01

Pos logit contributions
ock+1.192
aff+1.188
weet+1.161
iki+1.055
ules+1.032
Neg logit contributions
bes-1.494
antly-1.441
es-1.399
e-1.398
vered-1.397
Token gir
Token ID37370
Feature activation-3.95e+00

Pos logit contributions
aff+0.374
ock+0.370
weet+0.365
iki+0.337
oto+0.327
Neg logit contributions
bes-0.410
antly-0.396
vered-0.386
es-0.386
e-0.378
Tokenaffe
Token ID21223
Feature activation-1.39e-01

Pos logit contributions
weet+4.578
 hurricane+4.377
ock+4.350
arl+4.258
add+4.139
Neg logit contributions
es-9.692
e-9.580
house-8.852
bes-8.694
oc-8.652
Token named
Token ID3706
Feature activation-1.40e+00

Pos logit contributions
aff+0.217
ock+0.215
weet+0.211
iki+0.195
oto+0.188
Neg logit contributions
bes-0.263
antly-0.254
vered-0.249
es-0.247
e-0.243
Token Pr
Token ID1736
Feature activation-2.29e+00

Pos logit contributions
ock+2.428
aff+2.341
weet+2.297
iki+2.146
ules+2.101
Neg logit contributions
bes-2.431
antly-2.318
es-2.287
e-2.273
nob-2.235
Tokenune
Token ID1726
Feature activation-1.23e+00

Pos logit contributions
weet+3.831
ock+3.781
aff+3.647
 hurricane+3.489
ules+3.338
Neg logit contributions
antly-4.076
es-3.952
bes-3.905
e-3.835
 kit-3.766
Token.
Token ID13
Feature activation-9.64e-01

Pos logit contributions
aff+2.783
weet+2.782
ock+2.777
andel+2.570
arl+2.570
Neg logit contributions
bes-1.472
es-1.445
antly-1.413
vered-1.369
e-1.364
Token girl
Token ID2576
Feature activation+2.15e-01

Pos logit contributions
weet+6.205
ock+6.176
aff+5.934
ules+5.836
iki+5.772
Neg logit contributions
house-4.367
 kit-4.215
 wrist-4.207
's-4.132
e-4.095
Token named
Token ID3706
Feature activation-1.09e+00

Pos logit contributions
bes+0.458
antly+0.445
vered+0.438
es+0.430
nob+0.420
Neg logit contributions
aff-0.289
ock-0.286
weet-0.283
iki-0.256
aa-0.246
Token Mary
Token ID5335
Feature activation-4.01e-02

Pos logit contributions
ock+1.771
aff+1.731
weet+1.668
ules+1.562
iki+1.552
Neg logit contributions
bes-2.059
antly-1.958
house-1.935
es-1.933
nob-1.922
Token.
Token ID13
Feature activation-8.20e-01

Pos logit contributions
aff+0.080
ock+0.080
weet+0.079
iki+0.074
oto+0.072
Neg logit contributions
bes-0.059
antly-0.056
vered-0.054
es-0.054
e-0.052
Token Mary
Token ID5335
Feature activation+8.46e-01

Pos logit contributions
aff+1.347
ock+1.344
weet+1.299
iki+1.181
aa+1.168
Neg logit contributions
bes-1.492
antly-1.457
es-1.433
vered-1.403
nob-1.389
Token dreamed
Token ID27947
Feature activation-3.93e+00

Pos logit contributions
bes+1.483
antly+1.457
vered+1.431
es+1.373
 kit+1.341
Neg logit contributions
aff-1.407
ock-1.393
weet-1.374
iki-1.292
oto-1.253
Token of
Token ID286
Feature activation-9.10e-01

Pos logit contributions
weet+7.192
aff+7.185
iki+6.941
ock+6.352
aph+6.347
Neg logit contributions
es-7.314
antly-7.142
bes-7.017
 kit-6.952
's-6.639
Token finding
Token ID4917
Feature activation+2.94e-01

Pos logit contributions
aff+1.879
ock+1.850
weet+1.773
iki+1.729
aa+1.688
Neg logit contributions
bes-1.303
es-1.185
e-1.176
antly-1.174
vered-1.169
Token a
Token ID257
Feature activation+4.55e-02

Pos logit contributions
bes+0.483
antly+0.463
vered+0.452
es+0.442
e+0.432
Neg logit contributions
aff-0.545
ock-0.542
weet-0.536
iki-0.494
oto-0.485
Token diamond
Token ID15291
Feature activation-6.44e-01

Pos logit contributions
bes+0.088
antly+0.085
vered+0.084
es+0.083
e+0.081
Neg logit contributions
aff-0.070
ock-0.069
weet-0.068
iki-0.062
oto-0.060
Token.
Token ID13
Feature activation-2.33e-01

Pos logit contributions
aff+1.357
ock+1.344
weet+1.338
iki+1.249
ules+1.231
Neg logit contributions
bes-0.881
antly-0.835
es-0.819
vered-0.799
house-0.794
Token a
Token ID257
Feature activation-4.12e-01

Pos logit contributions
bes+1.081
antly+1.046
vered+1.026
es+0.983
 kit+0.962
Neg logit contributions
ock-1.405
weet-1.399
aff-1.368
oto-1.268
iki-1.266
Token time
Token ID640
Feature activation+2.31e-01

Pos logit contributions
aff+0.560
ock+0.549
weet+0.541
iki+0.488
ules+0.472
Neg logit contributions
bes-0.868
antly-0.839
vered-0.823
es-0.814
e-0.813
Token there
Token ID612
Feature activation+1.33e+00

Pos logit contributions
bes+0.406
antly+0.390
vered+0.380
es+0.373
e+0.366
Neg logit contributions
aff-0.395
ock-0.391
weet-0.387
iki-0.362
oto-0.348
Token was
Token ID373
Feature activation+9.78e-03

Pos logit contributions
bes+1.715
antly+1.506
vered+1.472
 wrist+1.438
 kit+1.421
Neg logit contributions
ock-2.970
aff-2.933
weet-2.895
iki-2.832
oto-2.731
Token a
Token ID257
Feature activation-7.17e-01

Pos logit contributions
bes+0.013
antly+0.013
vered+0.012
es+0.012
e+0.012
Neg logit contributions
aff-0.021
ock-0.020
weet-0.020
iki-0.019
oto-0.019
Token gir
Token ID37370
Feature activation-3.88e+00

Pos logit contributions
ock+1.076
aff+1.073
weet+1.046
iki+0.951
ules+0.930
Neg logit contributions
bes-1.393
antly-1.347
vered-1.304
es-1.302
e-1.301
Tokenaffe
Token ID21223
Feature activation-2.16e-01

Pos logit contributions
weet+4.465
arl+4.310
ock+4.294
add+4.204
iki+4.057
Neg logit contributions
e-9.235
es-9.214
house-8.825
oc-8.758
's-8.552
Token named
Token ID3706
Feature activation-1.54e+00

Pos logit contributions
aff+0.348
ock+0.344
weet+0.339
iki+0.313
oto+0.303
Neg logit contributions
bes-0.401
antly-0.387
vered-0.378
es-0.378
e-0.371
Token Sam
Token ID3409
Feature activation-8.46e-01

Pos logit contributions
ock+2.565
aff+2.446
weet+2.408
iki+2.257
aa+2.205
Neg logit contributions
bes-2.810
antly-2.673
e-2.653
es-2.648
house-2.631
Token and
Token ID290
Feature activation-1.19e+00

Pos logit contributions
aff+1.840
ock+1.786
weet+1.781
iki+1.688
andel+1.666
Neg logit contributions
bes-1.099
antly-1.037
es-1.028
e-1.022
vered-1.009
Token a
Token ID257
Feature activation-1.41e+00

Pos logit contributions
aff+2.459
ock+2.380
weet+2.284
andel+2.176
iki+2.174
Neg logit contributions
bes-1.735
es-1.635
e-1.632
antly-1.627
vered-1.623
Token clock
Token ID8801
Feature activation-3.08e-02

Pos logit contributions
bes+1.061
antly+1.013
vered+1.010
es+0.979
e+0.944
Neg logit contributions
ock-1.301
weet-1.282
aff-1.264
iki-1.154
oto-1.149
Token went
Token ID1816
Feature activation-7.70e-01

Pos logit contributions
aff+0.044
ock+0.043
weet+0.043
iki+0.039
oto+0.037
Neg logit contributions
bes-0.063
antly-0.061
vered-0.060
es-0.059
e-0.058
Token "
Token ID366
Feature activation+1.37e-01

Pos logit contributions
aff+1.575
ock+1.536
weet+1.528
iki+1.469
oto+1.411
Neg logit contributions
bes-1.118
antly-1.073
es-1.035
nob-1.028
 kit-1.027
Tokentick
Token ID42298
Feature activation-1.20e+00

Pos logit contributions
bes+0.201
antly+0.191
vered+0.186
es+0.184
e+0.177
Neg logit contributions
aff-0.272
ock-0.271
weet-0.268
iki-0.250
aa-0.244
Token-
Token ID12
Feature activation-1.40e+00

Pos logit contributions
aff+2.495
weet+2.447
ock+2.429
iki+2.418
oto+2.323
Neg logit contributions
bes-1.686
es-1.615
e-1.578
antly-1.556
vered-1.547
Tokent
Token ID83
Feature activation-3.84e+00

Pos logit contributions
aff+2.202
ock+2.154
weet+2.145
iki+2.088
oto+1.985
Neg logit contributions
bes-2.669
e-2.581
es-2.576
antly-2.565
vered-2.530
Tokenock
Token ID735
Feature activation-5.97e-01

Pos logit contributions
iki+5.730
aff+5.654
oto+5.064
ules+5.054
add+4.703
Neg logit contributions
e-8.820
es-8.513
nob-8.448
bes-8.342
house-8.267
Token,
Token ID11
Feature activation-1.59e+00

Pos logit contributions
aff+1.193
ock+1.172
weet+1.164
iki+1.104
oto+1.079
Neg logit contributions
bes-0.884
antly-0.840
es-0.830
vered-0.825
e-0.809
Token tick
Token ID4378
Feature activation-6.93e-01

Pos logit contributions
aff+3.168
ock+3.003
weet+2.935
iki+2.907
oto+2.844
Neg logit contributions
bes-2.449
e-2.321
antly-2.310
vered-2.296
es-2.272
Token-
Token ID12
Feature activation-1.24e+00

Pos logit contributions
aff+1.413
weet+1.392
ock+1.384
iki+1.324
oto+1.273
Neg logit contributions
bes-1.012
antly-0.963
es-0.955
vered-0.944
e-0.921
Tokent
Token ID83
Feature activation-3.27e+00

Pos logit contributions
aff+2.072
weet+1.993
ock+1.989
iki+1.944
oto+1.840
Neg logit contributions
bes-2.259
antly-2.188
e-2.180
es-2.174
vered-2.143
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
bes+0.852
antly+0.823
vered+0.799
es+0.777
e+0.769
Neg logit contributions
aff-0.778
ock-0.769
weet-0.762
iki-0.714
oto-0.684
Token there
Token ID612
Feature activation+2.02e+00

Pos logit contributions
bes+2.004
antly+1.833
vered+1.778
 wrist+1.766
es+1.736
Neg logit contributions
ock-2.576
weet-2.552
aff-2.528
iki-2.433
aa-2.325
Token was
Token ID373
Feature activation+7.62e-03

Pos logit contributions
bes+2.952
antly+2.576
 wrist+2.525
vered+2.514
 kit+2.505
Neg logit contributions
ock-4.142
aff-4.115
weet-3.990
iki-3.980
oto-3.829
Token a
Token ID257
Feature activation-7.18e-01

Pos logit contributions
bes+0.010
antly+0.010
vered+0.010
es+0.009
e+0.009
Neg logit contributions
aff-0.016
ock-0.016
weet-0.016
iki-0.015
oto-0.014
Token little
Token ID1310
Feature activation-1.15e+00

Pos logit contributions
ock+1.087
aff+1.085
weet+1.059
iki+0.969
ules+0.942
Neg logit contributions
bes-1.383
antly-1.336
house-1.292
es-1.292
vered-1.291
Token gir
Token ID37370
Feature activation-3.84e+00

Pos logit contributions
aff+2.148
ock+2.107
weet+2.091
iki+1.944
ules+1.926
Neg logit contributions
bes-1.759
antly-1.706
house-1.681
 kit-1.670
es-1.666
Tokenaffe
Token ID21223
Feature activation-2.06e-01

Pos logit contributions
weet+4.947
ock+4.608
arl+4.514
 hurricane+4.369
add+4.339
Neg logit contributions
es-8.838
e-8.586
oc-8.430
house-8.304
bes-8.067
Token named
Token ID3706
Feature activation-1.25e+00

Pos logit contributions
aff+0.309
ock+0.305
weet+0.300
iki+0.275
oto+0.266
Neg logit contributions
bes-0.404
antly-0.391
vered-0.383
es-0.382
e-0.375
Token Gig
Token ID19525
Feature activation-1.45e+00

Pos logit contributions
ock+2.040
aff+1.977
weet+1.928
iki+1.803
aa+1.762
Neg logit contributions
bes-2.311
antly-2.198
es-2.180
nob-2.162
e-2.158
Tokeni
Token ID72
Feature activation-5.78e-01

Pos logit contributions
ock+2.086
aff+2.047
weet+2.047
andel+1.841
arl+1.812
Neg logit contributions
bes-2.892
es-2.835
antly-2.832
e-2.743
vered-2.721
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
aff+1.204
ock+1.201
weet+1.189
iki+1.109
oto+1.088
Neg logit contributions
bes-0.791
antly-0.756
es-0.750
vered-0.732
e-0.727
Token was
Token ID373
Feature activation-1.85e-01

Pos logit contributions
bes+2.600
antly+2.250
vered+2.241
 wrist+2.224
 kit+2.201
Neg logit contributions
ock-3.897
aff-3.820
weet-3.774
iki-3.705
oto-3.574
Token a
Token ID257
Feature activation-7.03e-01

Pos logit contributions
aff+0.396
ock+0.393
weet+0.388
iki+0.366
oto+0.357
Neg logit contributions
bes-0.244
antly-0.232
vered-0.225
es-0.222
e-0.217
Token big
Token ID1263
Feature activation-1.75e+00

Pos logit contributions
ock+1.063
aff+1.062
weet+1.036
iki+0.945
ules+0.922
Neg logit contributions
bes-1.355
antly-1.308
vered-1.269
es-1.266
e-1.264
Token,
Token ID11
Feature activation-6.69e-01

Pos logit contributions
aff+3.549
ock+3.487
weet+3.423
iki+3.234
ules+3.234
Neg logit contributions
house-2.471
bes-2.455
e-2.406
 kit-2.362
 wrist-2.329
Token tall
Token ID7331
Feature activation-2.02e+00

Pos logit contributions
aff+1.139
ock+1.120
weet+1.092
iki+1.018
oto+0.983
Neg logit contributions
bes-1.193
antly-1.143
vered-1.106
es-1.105
house-1.101
Token gir
Token ID37370
Feature activation-3.81e+00

Pos logit contributions
ock+3.981
weet+3.931
aff+3.890
iki+3.655
ules+3.635
Neg logit contributions
bes-3.022
house-2.943
 wrist-2.857
antly-2.791
e-2.783
Tokenaffe
Token ID21223
Feature activation-6.86e-02

Pos logit contributions
weet+4.456
ock+4.343
arl+4.205
add+4.074
 hurricane+4.052
Neg logit contributions
es-9.302
e-8.972
house-8.514
oc-8.469
bes-8.435
Token named
Token ID3706
Feature activation-1.46e+00

Pos logit contributions
aff+0.115
ock+0.114
weet+0.112
iki+0.104
oto+0.101
Neg logit contributions
bes-0.123
antly-0.118
vered-0.116
es-0.115
e-0.112
Token Jerry
Token ID13075
Feature activation-1.75e-01

Pos logit contributions
ock+2.560
aff+2.408
weet+2.401
iki+2.246
aa+2.208
Neg logit contributions
bes-2.547
antly-2.407
es-2.396
nob-2.364
e-2.362
Token.
Token ID13
Feature activation-7.84e-01

Pos logit contributions
aff+0.356
ock+0.353
weet+0.350
iki+0.328
oto+0.319
Neg logit contributions
bes-0.251
antly-0.240
vered-0.234
es-0.232
e-0.226
Token Jerry
Token ID13075
Feature activation+3.50e-01

Pos logit contributions
ock+1.270
aff+1.249
weet+1.232
iki+1.151
aa+1.123
Neg logit contributions
bes-1.452
antly-1.396
es-1.368
vered-1.367
house-1.348
Token was
Token ID373
Feature activation+2.85e-03

Pos logit contributions
bes+2.440
antly+2.141
 kit+2.087
vered+2.083
 wrist+2.078
Neg logit contributions
ock-3.753
aff-3.699
weet-3.641
iki-3.589
oto-3.446
Token a
Token ID257
Feature activation-7.91e-01

Pos logit contributions
bes+0.004
antly+0.004
vered+0.004
es+0.004
e+0.003
Neg logit contributions
aff-0.006
ock-0.006
weet-0.006
iki-0.006
oto-0.005
Token monkey
Token ID21657
Feature activation-2.35e-01

Pos logit contributions
ock+1.230
aff+1.227
weet+1.197
iki+1.093
ules+1.070
Neg logit contributions
bes-1.491
antly-1.438
house-1.396
e-1.392
es-1.390
Token and
Token ID290
Feature activation-6.71e-01

Pos logit contributions
aff+0.352
ock+0.348
weet+0.342
iki+0.314
oto+0.303
Neg logit contributions
bes-0.463
antly-0.446
vered-0.438
es-0.438
e-0.430
Token a
Token ID257
Feature activation-1.08e+00

Pos logit contributions
aff+1.370
ock+1.340
weet+1.307
iki+1.234
ules+1.208
Neg logit contributions
bes-0.977
antly-0.928
vered-0.908
es-0.908
e-0.900
Token gir
Token ID37370
Feature activation-3.79e+00

Pos logit contributions
ock+1.844
aff+1.817
weet+1.767
ules+1.613
iki+1.613
Neg logit contributions
bes-1.886
antly-1.819
house-1.779
e-1.775
es-1.761
Tokenaffe
Token ID21223
Feature activation-3.21e-01

Pos logit contributions
weet+4.808
ock+4.390
ules+4.131
oto+4.079
andel+4.061
Neg logit contributions
es-8.825
e-8.750
oc-8.497
house-8.421
's-7.983
Token.
Token ID13
Feature activation-5.92e-01

Pos logit contributions
aff+0.523
ock+0.517
weet+0.510
iki+0.471
oto+0.458
Neg logit contributions
bes-0.588
antly-0.569
es-0.557
vered-0.557
e-0.547
Token The
Token ID383
Feature activation-1.82e+00

Pos logit contributions
ock+1.096
aff+1.082
weet+1.067
iki+1.005
aa+0.977
Neg logit contributions
bes-0.961
antly-0.921
vered-0.903
es-0.898
e-0.886
Token gir
Token ID37370
Feature activation-3.60e+00

Pos logit contributions
ock+3.396
aff+3.280
weet+3.243
ules+3.023
andel+2.977
Neg logit contributions
bes-2.905
house-2.833
e-2.807
antly-2.780
es-2.742
Tokenaffe
Token ID21223
Feature activation-3.26e-01

Pos logit contributions
weet+5.687
ock+5.328
arl+5.242
andel+5.150
 hurricane+5.138
Neg logit contributions
es-7.819
e-7.256
oc-7.185
house-7.158
bes-7.011
Token,
Token ID11
Feature activation+9.55e-01

Pos logit contributions
bes+0.221
antly+0.213
vered+0.208
es+0.204
e+0.200
Neg logit contributions
aff-0.222
ock-0.220
weet-0.217
iki-0.203
oto-0.196
Token there
Token ID612
Feature activation+1.62e+00

Pos logit contributions
bes+1.378
antly+1.252
vered+1.229
 wrist+1.190
es+1.184
Neg logit contributions
ock-1.941
weet-1.928
aff-1.925
iki-1.836
aa-1.763
Token was
Token ID373
Feature activation+4.35e-02

Pos logit contributions
bes+2.202
antly+1.943
vered+1.898
 wrist+1.857
 kit+1.852
Neg logit contributions
ock-3.503
aff-3.466
weet-3.412
iki-3.335
oto-3.230
Token a
Token ID257
Feature activation-8.23e-01

Pos logit contributions
bes+0.059
antly+0.057
vered+0.055
es+0.054
e+0.052
Neg logit contributions
aff-0.091
ock-0.091
weet-0.090
iki-0.084
oto-0.082
Token tall
Token ID7331
Feature activation-2.01e+00

Pos logit contributions
ock+1.273
aff+1.269
weet+1.235
iki+1.132
ules+1.107
Neg logit contributions
bes-1.563
antly-1.499
house-1.462
e-1.459
es-1.457
Token gir
Token ID37370
Feature activation-3.76e+00

Pos logit contributions
ock+3.990
weet+3.912
aff+3.901
iki+3.654
ules+3.641
Neg logit contributions
bes-3.026
house-2.970
 wrist-2.853
e-2.823
 wound-2.787
Tokenaffe
Token ID21223
Feature activation-2.21e-01

Pos logit contributions
weet+4.292
ock+4.171
add+4.125
arl+3.973
iki+3.916
Neg logit contributions
es-8.948
e-8.794
house-8.530
oc-8.467
's-8.287
Token named
Token ID3706
Feature activation-1.42e+00

Pos logit contributions
aff+0.345
ock+0.341
weet+0.336
iki+0.310
oto+0.299
Neg logit contributions
bes-0.420
antly-0.405
vered-0.397
es-0.396
e-0.389
Token Jerry
Token ID13075
Feature activation-3.29e-01

Pos logit contributions
ock+2.468
aff+2.348
weet+2.303
iki+2.176
aa+2.130
Neg logit contributions
bes-2.502
es-2.370
antly-2.364
e-2.344
nob-2.334
Token.
Token ID13
Feature activation-7.42e-01

Pos logit contributions
aff+0.678
ock+0.672
weet+0.665
iki+0.624
ules+0.609
Neg logit contributions
bes-0.459
antly-0.438
es-0.427
vered-0.426
e-0.417
Token Jerry
Token ID13075
Feature activation+5.61e-01

Pos logit contributions
ock+1.188
aff+1.176
weet+1.155
iki+1.081
aa+1.049
Neg logit contributions
bes-1.379
antly-1.325
vered-1.302
es-1.296
house-1.273
Token a
Token ID257
Feature activation-8.34e-01

Pos logit contributions
bes+0.049
antly+0.047
vered+0.046
es+0.045
e+0.043
Neg logit contributions
aff-0.076
ock-0.075
weet-0.075
iki-0.070
oto-0.068
Token bridge
Token ID7696
Feature activation-1.29e+00

Pos logit contributions
ock+1.287
aff+1.286
weet+1.249
iki+1.139
ules+1.117
Neg logit contributions
bes-1.588
antly-1.531
house-1.489
e-1.482
vered-1.477
Token.
Token ID13
Feature activation-1.88e-01

Pos logit contributions
aff+2.888
ock+2.838
weet+2.818
iki+2.694
oto+2.662
Neg logit contributions
bes-1.614
antly-1.513
house-1.509
e-1.494
es-1.487
Token Everyday
Token ID48154
Feature activation-3.15e-01

Pos logit contributions
aff+0.356
ock+0.354
weet+0.348
iki+0.327
oto+0.318
Neg logit contributions
bes-0.293
antly-0.281
vered-0.274
es-0.271
e-0.266
Token,
Token ID11
Feature activation+2.26e-01

Pos logit contributions
aff+0.523
ock+0.514
weet+0.507
iki+0.468
oto+0.454
Neg logit contributions
bes-0.572
antly-0.558
vered-0.543
es-0.540
e-0.531
Token lots
Token ID6041
Feature activation-3.76e+00

Pos logit contributions
bes+0.360
antly+0.344
vered+0.336
es+0.330
e+0.321
Neg logit contributions
aff-0.420
ock-0.418
weet-0.416
iki-0.386
oto-0.375
Token of
Token ID286
Feature activation-1.46e+00

Pos logit contributions
aff+6.503
arl+6.396
iki+6.104
weet+6.069
oto+6.044
Neg logit contributions
es-7.442
e-7.248
's-7.143
house-6.908
vered-6.578
Token people
Token ID661
Feature activation-1.22e+00

Pos logit contributions
aff+2.857
ock+2.688
weet+2.680
iki+2.562
oto+2.548
Neg logit contributions
bes-2.259
vered-2.220
e-2.179
es-2.171
antly-2.153
Token would
Token ID561
Feature activation-1.62e+00

Pos logit contributions
aff+2.423
ock+2.361
weet+2.361
iki+2.234
oto+2.215
Neg logit contributions
bes-1.772
es-1.760
e-1.735
vered-1.723
antly-1.716
Token cross
Token ID3272
Feature activation-3.33e-01

Pos logit contributions
aff+3.368
ock+3.215
weet+3.116
iki+3.055
oto+3.003
Neg logit contributions
bes-2.299
e-2.202
house-2.149
antly-2.135
vered-2.116
Token the
Token ID262
Feature activation-1.37e+00

Pos logit contributions
aff+0.629
ock+0.621
weet+0.614
iki+0.577
oto+0.561
Neg logit contributions
bes-0.526
antly-0.505
vered-0.494
es-0.490
e-0.479
TokenOne
Token ID3198
Feature activation-1.11e+00

Pos logit contributions
bes+1.608
vered+1.504
antly+1.467
 wrist+1.442
es+1.403
Neg logit contributions
aff-1.910
ock-1.899
weet-1.853
iki-1.721
ules-1.679
Token day
Token ID1110
Feature activation+3.15e-01

Pos logit contributions
aff+1.400
ock+1.246
weet+1.203
iki+1.087
ules+1.045
Neg logit contributions
bes-2.494
e-2.483
antly-2.483
house-2.427
vered-2.424
Token,
Token ID11
Feature activation+6.06e-01

Pos logit contributions
bes+0.554
antly+0.529
vered+0.521
es+0.510
nob+0.501
Neg logit contributions
aff-0.535
ock-0.529
weet-0.523
iki-0.489
aa-0.475
Token a
Token ID257
Feature activation-1.02e+00

Pos logit contributions
bes+0.917
antly+0.862
vered+0.844
es+0.836
 kit+0.808
Neg logit contributions
aff-1.179
ock-1.171
weet-1.165
iki-1.097
oto-1.065
Token tall
Token ID7331
Feature activation-2.40e+00

Pos logit contributions
aff+1.639
ock+1.630
weet+1.594
iki+1.447
ules+1.441
Neg logit contributions
bes-1.865
antly-1.826
house-1.759
e-1.755
vered-1.744
Token gir
Token ID37370
Feature activation-3.73e+00

Pos logit contributions
ock+4.892
weet+4.869
aff+4.787
ules+4.528
iki+4.406
Neg logit contributions
house-3.501
bes-3.374
 wrist-3.311
antly-3.269
 wound-3.263
Tokenaffe
Token ID21223
Feature activation+9.12e-02

Pos logit contributions
weet+4.829
ock+4.622
arl+4.530
ules+4.369
add+4.321
Neg logit contributions
e-8.256
es-8.176
oc-8.097
house-8.040
's-7.716
Token was
Token ID373
Feature activation-4.61e-01

Pos logit contributions
bes+0.186
antly+0.180
vered+0.176
es+0.174
e+0.170
Neg logit contributions
aff-0.130
ock-0.129
weet-0.127
iki-0.116
aa-0.111
Token walking
Token ID6155
Feature activation-7.66e-01

Pos logit contributions
aff+0.924
ock+0.914
weet+0.896
iki+0.848
oto+0.828
Neg logit contributions
bes-0.682
antly-0.651
vered-0.632
es-0.630
e-0.628
Token through
Token ID832
Feature activation+1.31e+00

Pos logit contributions
aff+1.645
weet+1.616
ock+1.615
iki+1.535
ules+1.497
Neg logit contributions
bes-1.009
antly-0.993
vered-0.966
e-0.950
es-0.945
Token the
Token ID262
Feature activation-1.13e+00

Pos logit contributions
bes+2.568
antly+2.521
vered+2.408
es+2.401
 kit+2.361
Neg logit contributions
weet-2.065
ock-1.999
aff-1.930
iki-1.727
oto-1.661
Token adventure
Token ID8855
Feature activation-5.02e-01

Pos logit contributions
aff+1.565
ock+1.532
weet+1.531
iki+1.443
ules+1.409
Neg logit contributions
bes-1.322
antly-1.231
es-1.193
vered-1.192
e-1.191
Token story
Token ID1621
Feature activation-6.77e-01

Pos logit contributions
aff+1.071
ock+1.059
weet+1.055
iki+0.996
oto+0.973
Neg logit contributions
bes-0.668
antly-0.636
es-0.620
vered-0.617
e-0.601
Token,
Token ID11
Feature activation-8.09e-01

Pos logit contributions
aff+1.362
ock+1.358
weet+1.351
iki+1.273
aa+1.234
Neg logit contributions
bes-0.997
antly-0.937
es-0.926
vered-0.900
e-0.880
Token but
Token ID475
Feature activation-4.37e-01

Pos logit contributions
aff+1.369
ock+1.359
weet+1.312
iki+1.238
oto+1.206
Neg logit contributions
bes-1.445
antly-1.372
es-1.365
vered-1.354
e-1.351
Token after
Token ID706
Feature activation-1.60e+00

Pos logit contributions
aff+0.737
ock+0.733
weet+0.718
iki+0.660
oto+0.643
Neg logit contributions
bes-0.781
antly-0.757
vered-0.742
es-0.735
e-0.722
Token an
Token ID281
Feature activation-3.71e+00

Pos logit contributions
aff+3.703
ock+3.678
weet+3.556
iki+3.429
arl+3.386
Neg logit contributions
bes-2.005
es-1.864
antly-1.817
e-1.780
vered-1.745
Token hour
Token ID1711
Feature activation+4.50e-01

Pos logit contributions
aff+6.336
aa+6.076
ules+5.930
weet+5.764
iki+5.754
Neg logit contributions
house-7.024
nob-7.007
e-6.840
bes-6.797
vered-6.540
Token they
Token ID484
Feature activation-1.79e+00

Pos logit contributions
bes+0.820
antly+0.778
vered+0.762
es+0.745
nob+0.739
Neg logit contributions
aff-0.749
ock-0.742
weet-0.727
iki-0.677
ules-0.655
Token were
Token ID547
Feature activation-6.46e-01

Pos logit contributions
aff+3.678
ock+3.618
weet+3.614
andel+3.338
iki+3.331
Neg logit contributions
e-2.544
es-2.532
antly-2.524
bes-2.521
vered-2.486
Token both
Token ID1111
Feature activation+3.26e-01

Pos logit contributions
aff+1.176
ock+1.157
weet+1.132
iki+1.070
oto+1.041
Neg logit contributions
bes-1.076
antly-1.031
es-1.012
vered-1.006
e-0.999
Token tired
Token ID10032
Feature activation-9.94e-01

Pos logit contributions
bes+0.628
antly+0.607
vered+0.591
es+0.586
nob+0.573
Neg logit contributions
aff-0.500
ock-0.498
weet-0.494
iki-0.451
aa-0.436
Token
Token ID198
Feature activation-5.27e-01

Pos logit contributions
ock+1.544
aff+1.542
weet+1.512
iki+1.435
oto+1.415
Neg logit contributions
bes-0.920
antly-0.888
vered-0.843
es-0.842
e-0.823
Token
Token ID198
Feature activation-9.72e-01

Pos logit contributions
aff+1.130
ock+1.119
weet+1.110
iki+1.044
aa+1.018
Neg logit contributions
bes-0.692
antly-0.665
vered-0.637
es-0.636
e-0.621
TokenEvery
Token ID6109
Feature activation-1.82e+00

Pos logit contributions
aff+1.743
ock+1.686
weet+1.674
iki+1.583
aa+1.544
Neg logit contributions
bes-1.656
antly-1.598
vered-1.565
e-1.539
house-1.519
Token morning
Token ID3329
Feature activation-7.12e-02

Pos logit contributions
aff+2.725
weet+2.457
ock+2.455
arl+2.187
iki+2.178
Neg logit contributions
antly-3.880
e-3.757
bes-3.680
vered-3.654
house-3.617
Token,
Token ID11
Feature activation-8.37e-01

Pos logit contributions
aff+0.127
ock+0.126
weet+0.124
iki+0.116
oto+0.112
Neg logit contributions
bes-0.120
antly-0.115
vered-0.113
es-0.112
e-0.109
Token lots
Token ID6041
Feature activation-3.70e+00

Pos logit contributions
aff+1.589
ock+1.572
weet+1.547
iki+1.446
oto+1.422
Neg logit contributions
bes-1.324
antly-1.289
vered-1.258
es-1.256
e-1.230
Token of
Token ID286
Feature activation-1.53e+00

Pos logit contributions
aff+6.033
arl+5.965
weet+5.799
iki+5.672
oto+5.550
Neg logit contributions
es-7.322
e-7.173
's-6.926
vered-6.636
antly-6.629
Token boats
Token ID16659
Feature activation-1.78e+00

Pos logit contributions
aff+2.946
weet+2.766
ock+2.766
iki+2.671
oto+2.554
Neg logit contributions
bes-2.374
antly-2.318
vered-2.305
e-2.295
es-2.279
Token arrived
Token ID5284
Feature activation-5.21e-01

Pos logit contributions
aff+3.388
weet+3.265
ock+3.217
iki+3.075
andel+3.041
Neg logit contributions
es-2.780
antly-2.767
bes-2.753
e-2.705
vered-2.681
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
aff+0.918
ock+0.912
weet+0.899
iki+0.834
oto+0.808
Neg logit contributions
bes-0.890
antly-0.871
es-0.844
vered-0.842
e-0.814
Token left
Token ID1364
Feature activation-7.54e-01

Pos logit contributions
aff+1.968
ock+1.911
weet+1.859
iki+1.774
oto+1.728
Neg logit contributions
bes-1.721
antly-1.701
es-1.661
vered-1.646
e-1.612
Token just
Token ID655
Feature activation-7.93e-01

Pos logit contributions
bes+0.279
antly+0.265
vered+0.259
es+0.255
e+0.249
Neg logit contributions
aff-0.342
ock-0.341
weet-0.337
iki-0.316
oto-0.306
Token went
Token ID1816
Feature activation-1.09e+00

Pos logit contributions
aff+1.559
ock+1.534
weet+1.500
iki+1.412
ules+1.399
Neg logit contributions
bes-1.201
antly-1.178
vered-1.150
es-1.139
e-1.113
Token "
Token ID366
Feature activation-3.82e-01

Pos logit contributions
aff+2.299
ock+2.255
weet+2.232
iki+2.144
ules+2.085
Neg logit contributions
bes-1.514
antly-1.475
 kit-1.413
nob-1.407
house-1.400
Tokentick
Token ID42298
Feature activation-1.18e+00

Pos logit contributions
aff+0.779
ock+0.767
weet+0.758
iki+0.719
oto+0.702
Neg logit contributions
bes-0.541
antly-0.522
vered-0.509
es-0.500
e-0.493
Token-
Token ID12
Feature activation-1.43e+00

Pos logit contributions
aff+2.427
weet+2.373
ock+2.356
iki+2.350
oto+2.257
Neg logit contributions
bes-1.663
es-1.597
antly-1.553
e-1.545
vered-1.537
Tokent
Token ID83
Feature activation-3.70e+00

Pos logit contributions
aff+2.069
ock+2.018
weet+2.012
iki+1.945
oto+1.831
Neg logit contributions
bes-2.898
antly-2.821
e-2.818
es-2.807
vered-2.780
Tokenock
Token ID735
Feature activation-3.75e-01

Pos logit contributions
iki+6.291
aff+6.235
ules+5.645
oto+5.543
add+5.458
Neg logit contributions
e-7.454
nob-7.295
es-7.278
bes-7.130
house-7.049
Token,
Token ID11
Feature activation-8.41e-01

Pos logit contributions
aff+0.739
ock+0.729
weet+0.721
iki+0.680
oto+0.663
Neg logit contributions
bes-0.561
antly-0.536
vered-0.523
es-0.522
e-0.508
Token tick
Token ID4378
Feature activation-4.58e-01

Pos logit contributions
aff+1.500
ock+1.465
weet+1.429
iki+1.373
oto+1.332
Neg logit contributions
bes-1.431
antly-1.364
vered-1.340
es-1.328
e-1.315
Token-
Token ID12
Feature activation-1.11e+00

Pos logit contributions
aff+0.882
ock+0.866
weet+0.865
iki+0.815
oto+0.786
Neg logit contributions
bes-0.717
antly-0.688
vered-0.672
es-0.671
e-0.652
Tokent
Token ID83
Feature activation-2.82e+00

Pos logit contributions
aff+1.747
ock+1.685
weet+1.679
iki+1.621
oto+1.534
Neg logit contributions
bes-2.105
antly-2.057
e-2.022
es-2.020
vered-2.015
Token lively
Token ID29696
Feature activation-1.92e+00

Pos logit contributions
ock+1.296
aff+1.295
weet+1.260
iki+1.154
ules+1.135
Neg logit contributions
bes-1.507
antly-1.446
house-1.411
e-1.404
es-1.398
Token little
Token ID1310
Feature activation-1.62e+00

Pos logit contributions
ock+3.568
aff+3.524
weet+3.456
ules+3.302
iki+3.242
Neg logit contributions
house-3.025
bes-3.015
 wrist-2.869
e-2.859
 Hide-2.812
Token bird
Token ID6512
Feature activation-1.03e+00

Pos logit contributions
aff+3.204
ock+3.133
weet+3.110
ules+2.916
iki+2.881
Neg logit contributions
bes-2.318
house-2.302
 kit-2.248
antly-2.222
e-2.208
Token named
Token ID3706
Feature activation-1.03e+00

Pos logit contributions
aff+1.804
ock+1.744
weet+1.724
iki+1.610
ules+1.598
Neg logit contributions
bes-1.772
e-1.713
antly-1.697
es-1.690
house-1.677
Token Ch
Token ID609
Feature activation-2.19e+00

Pos logit contributions
ock+1.698
aff+1.679
weet+1.580
ules+1.502
iki+1.493
Neg logit contributions
bes-1.896
antly-1.818
e-1.797
house-1.780
es-1.771
Tokenir
Token ID343
Feature activation-3.68e+00

Pos logit contributions
ock+1.920
aff+1.917
weet+1.896
oto+1.583
ules+1.544
Neg logit contributions
bes-5.792
e-5.669
antly-5.519
es-5.471
vered-5.449
Tokenpy
Token ID9078
Feature activation-2.04e+00

Pos logit contributions
 hurricane+6.673
arl+6.480
weet+6.472
aff+6.329
 tour+6.306
Neg logit contributions
es-6.587
e-6.137
bes-6.094
antly-6.075
nob-5.894
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
aff+4.828
weet+4.741
ock+4.702
ules+4.494
andel+4.471
Neg logit contributions
e-2.292
es-2.262
bes-2.205
antly-2.174
house-2.125
Token Ch
Token ID609
Feature activation-2.29e+00

Pos logit contributions
aff+1.983
ock+1.930
weet+1.899
iki+1.765
oto+1.744
Neg logit contributions
bes-1.896
antly-1.862
es-1.845
vered-1.780
e-1.766
Tokenir
Token ID343
Feature activation-3.48e+00

Pos logit contributions
ock+2.078
weet+2.057
aff+1.962
oto+1.694
ules+1.694
Neg logit contributions
bes-6.224
e-5.924
vered-5.849
es-5.839
house-5.801
Tokenpy
Token ID9078
Feature activation-1.26e+00

Pos logit contributions
weet+4.678
 hurricane+4.625
ock+4.595
aff+4.498
arl+4.454
Neg logit contributions
es-7.702
bes-7.482
antly-7.305
e-7.176
nob-7.172
Token she
Token ID673
Feature activation+2.11e-01

Pos logit contributions
aff+1.653
ock+1.606
weet+1.537
iki+1.424
andel+1.424
Neg logit contributions
bes-1.854
es-1.776
antly-1.772
e-1.735
vered-1.733
Token asked
Token ID1965
Feature activation-5.20e-01

Pos logit contributions
bes+0.388
antly+0.375
vered+0.368
es+0.362
nob+0.351
Neg logit contributions
aff-0.341
ock-0.339
weet-0.335
iki-0.310
oto-0.299
Token her
Token ID607
Feature activation-4.06e-01

Pos logit contributions
aff+0.907
ock+0.901
weet+0.881
iki+0.826
aa+0.795
Neg logit contributions
bes-0.894
antly-0.872
es-0.865
vered-0.852
e-0.830
Token mom
Token ID1995
Feature activation-8.66e-01

Pos logit contributions
ock+0.795
aff+0.794
weet+0.781
iki+0.724
aa+0.704
Neg logit contributions
bes-0.614
antly-0.592
es-0.577
vered-0.571
e-0.552
Tokenmy
Token ID1820
Feature activation-4.93e-01

Pos logit contributions
aff+1.542
ock+1.509
weet+1.495
iki+1.408
aa+1.372
Neg logit contributions
bes-1.402
antly-1.387
es-1.377
house-1.322
 kit-1.320
Token lots
Token ID6041
Feature activation-3.65e+00

Pos logit contributions
aff+0.759
ock+0.751
weet+0.739
iki+0.686
aa+0.659
Neg logit contributions
bes-0.934
antly-0.908
es-0.897
vered-0.884
e-0.866
Token of
Token ID286
Feature activation-1.91e+00

Pos logit contributions
arl+5.672
aff+5.654
iki+5.552
weet+5.515
aph+5.478
Neg logit contributions
es-7.244
's-6.876
e-6.636
bes-6.472
house-6.438
Token questions
Token ID2683
Feature activation-5.54e-01

Pos logit contributions
aff+3.087
ock+3.044
weet+2.959
iki+2.824
arl+2.675
Neg logit contributions
bes-3.327
e-3.290
 kit-3.240
vered-3.230
es-3.227
Token about
Token ID546
Feature activation+2.23e-02

Pos logit contributions
aff+1.115
ock+1.104
weet+1.095
iki+1.034
aa+1.008
Neg logit contributions
bes-0.795
antly-0.759
es-0.750
vered-0.733
e-0.720
Token marriage
Token ID4845
Feature activation-3.50e-01

Pos logit contributions
bes+0.038
antly+0.037
vered+0.036
es+0.035
e+0.034
Neg logit contributions
aff-0.039
ock-0.039
weet-0.038
iki-0.036
oto-0.035
Token.
Token ID13
Feature activation+1.96e-01

Pos logit contributions
aff+0.709
ock+0.707
weet+0.701
iki+0.657
aa+0.638
Neg logit contributions
bes-0.498
antly-0.475
es-0.463
vered-0.458
e-0.445
Token toys
Token ID14958
Feature activation-4.96e-01

Pos logit contributions
bes+0.756
antly+0.728
vered+0.708
es+0.683
nob+0.677
Neg logit contributions
aff-0.895
ock-0.883
weet-0.861
iki-0.805
oto-0.791
Token.
Token ID13
Feature activation-8.89e-01

Pos logit contributions
aff+0.966
weet+0.960
ock+0.956
iki+0.896
aa+0.869
Neg logit contributions
bes-0.741
antly-0.714
es-0.699
vered-0.693
e-0.675
Token Sally
Token ID25737
Feature activation-2.87e-01

Pos logit contributions
ock+1.522
weet+1.471
aff+1.459
iki+1.327
oto+1.290
Neg logit contributions
bes-1.594
antly-1.557
es-1.487
vered-1.477
house-1.467
Token said
Token ID531
Feature activation-8.86e-01

Pos logit contributions
aff+0.536
ock+0.534
weet+0.529
iki+0.490
aa+0.476
Neg logit contributions
bes-0.459
antly-0.440
vered-0.429
es-0.427
e-0.414
Token yes
Token ID3763
Feature activation-1.19e+00

Pos logit contributions
aff+1.652
ock+1.639
weet+1.613
iki+1.515
aa+1.455
Neg logit contributions
bes-1.421
antly-1.391
es-1.380
vered-1.369
e-1.323
Token,
Token ID11
Feature activation-3.64e+00

Pos logit contributions
aff+2.149
weet+2.144
ock+2.127
iki+2.014
andel+1.959
Neg logit contributions
bes-1.924
antly-1.881
es-1.867
's-1.843
e-1.816
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
aff+7.910
ock+7.564
iki+7.520
arl+7.451
andel+7.338
Neg logit contributions
's-5.630
es-5.232
bes-5.181
 wrist-5.082
 kit-5.064
Token they
Token ID484
Feature activation-6.23e-01

Pos logit contributions
ock+1.432
aff+1.413
weet+1.378
iki+1.243
arl+1.202
Neg logit contributions
bes-2.134
antly-2.107
es-2.082
vered-2.056
house-2.025
Token made
Token ID925
Feature activation+1.14e+00

Pos logit contributions
ock+1.034
aff+1.033
weet+1.027
iki+0.951
oto+0.915
Neg logit contributions
bes-1.116
antly-1.086
es-1.065
vered-1.062
e-1.047
Token a
Token ID257
Feature activation+6.91e-01

Pos logit contributions
bes+2.072
antly+1.980
vered+1.939
nob+1.889
es+1.886
Neg logit contributions
aff-1.947
weet-1.880
ock-1.876
oto-1.691
iki-1.687
Token trade
Token ID3292
Feature activation+8.90e-01

Pos logit contributions
bes+1.418
antly+1.363
vered+1.360
es+1.347
nob+1.309
Neg logit contributions
aff-1.009
weet-0.967
ock-0.966
iki-0.871
oto-0.846
Token time
Token ID640
Feature activation+4.55e-01

Pos logit contributions
aff+0.827
ock+0.806
weet+0.796
iki+0.720
ules+0.699
Neg logit contributions
bes-1.224
antly-1.176
vered-1.153
e-1.146
es-1.139
Token,
Token ID11
Feature activation+1.23e+00

Pos logit contributions
bes+0.824
antly+0.795
vered+0.773
es+0.752
nob+0.744
Neg logit contributions
aff-0.753
ock-0.746
weet-0.739
iki-0.690
oto-0.662
Token there
Token ID612
Feature activation+1.95e+00

Pos logit contributions
bes+1.865
antly+1.695
vered+1.652
 wrist+1.615
es+1.610
Neg logit contributions
ock-2.434
weet-2.410
aff-2.382
iki-2.290
aa-2.197
Token was
Token ID373
Feature activation+3.56e-02

Pos logit contributions
bes+2.827
antly+2.477
vered+2.407
 kit+2.384
 wrist+2.372
Neg logit contributions
ock-4.071
aff-4.011
weet-3.939
iki-3.868
oto-3.757
Token a
Token ID257
Feature activation-8.16e-01

Pos logit contributions
bes+0.049
antly+0.046
vered+0.045
es+0.044
e+0.043
Neg logit contributions
aff-0.075
ock-0.074
weet-0.074
iki-0.069
oto-0.068
Token gir
Token ID37370
Feature activation-3.64e+00

Pos logit contributions
ock+1.271
aff+1.265
weet+1.236
iki+1.132
ules+1.105
Neg logit contributions
bes-1.536
antly-1.479
house-1.438
es-1.436
e-1.431
Tokenaffe
Token ID21223
Feature activation-1.64e-01

Pos logit contributions
weet+4.413
ock+4.137
add+4.054
arl+4.017
iki+3.994
Neg logit contributions
es-8.412
e-8.281
house-8.087
oc-8.049
bes-7.809
Token named
Token ID3706
Feature activation-1.31e+00

Pos logit contributions
aff+0.250
ock+0.247
weet+0.243
iki+0.224
oto+0.216
Neg logit contributions
bes-0.317
antly-0.306
vered-0.300
es-0.299
e-0.293
Token Jerry
Token ID13075
Feature activation-2.36e-01

Pos logit contributions
ock+2.256
aff+2.164
weet+2.120
iki+1.999
aa+1.953
Neg logit contributions
bes-2.302
es-2.182
antly-2.177
e-2.162
house-2.135
Token.
Token ID13
Feature activation-6.50e-01

Pos logit contributions
aff+0.483
ock+0.478
weet+0.474
iki+0.445
oto+0.433
Neg logit contributions
bes-0.335
antly-0.319
vered-0.311
es-0.310
e-0.302
Token Jerry
Token ID13075
Feature activation+5.66e-01

Pos logit contributions
ock+1.034
aff+1.027
weet+1.007
iki+0.940
aa+0.912
Neg logit contributions
bes-1.211
antly-1.166
vered-1.145
es-1.139
house-1.116
Token deliver
Token ID5203
Feature activation+1.59e+00

Pos logit contributions
bes+2.838
antly+2.719
es+2.697
vered+2.653
 kit+2.453
Neg logit contributions
weet-3.307
ock-3.245
aff-3.125
iki-2.903
 tour-2.855
Token wonderful
Token ID7932
Feature activation+1.85e+00

Pos logit contributions
bes+2.977
vered+2.694
antly+2.689
es+2.603
nob+2.593
Neg logit contributions
ock-2.701
aff-2.681
weet-2.593
oto-2.326
iki-2.323
Token things
Token ID1243
Feature activation+6.34e-01

Pos logit contributions
bes+2.924
antly+2.709
vered+2.605
e+2.605
es+2.583
Neg logit contributions
aff-3.692
ock-3.661
weet-3.536
iki-3.212
ules-3.195
Token all
Token ID477
Feature activation-4.41e-02

Pos logit contributions
bes+1.139
antly+1.087
vered+1.069
e+1.035
es+1.034
Neg logit contributions
aff-1.067
ock-1.066
weet-1.031
iki-0.956
oto-0.934
Token around
Token ID1088
Feature activation+1.05e+00

Pos logit contributions
aff+0.080
ock+0.080
weet+0.079
iki+0.073
oto+0.071
Neg logit contributions
bes-0.073
antly-0.070
vered-0.068
es-0.067
e-0.066
Token the
Token ID262
Feature activation+1.89e+00

Pos logit contributions
bes+1.846
antly+1.745
vered+1.723
es+1.657
 wrist+1.654
Neg logit contributions
ock-1.818
aff-1.805
weet-1.764
iki-1.603
ules-1.589
Token sky
Token ID6766
Feature activation+1.13e+00

Pos logit contributions
bes+3.673
vered+3.483
es+3.477
antly+3.330
 wrist+3.265
Neg logit contributions
ock-3.247
aff-3.138
weet-3.084
 storm-2.683
oto-2.666
Token.
Token ID13
Feature activation+1.64e-01

Pos logit contributions
bes+1.869
vered+1.767
antly+1.760
es+1.666
 wrist+1.664
Neg logit contributions
aff-2.057
ock-2.048
weet-2.015
iki-1.850
ules-1.802
Token The
Token ID383
Feature activation+1.15e+00

Pos logit contributions
bes+0.276
antly+0.264
vered+0.260
es+0.256
e+0.250
Neg logit contributions
aff-0.294
ock-0.290
weet-0.287
iki-0.265
ules-0.258
Token moon
Token ID8824
Feature activation+1.31e+00

Pos logit contributions
bes+2.048
es+1.941
vered+1.928
antly+1.893
e+1.855
Neg logit contributions
ock-2.009
aff-1.984
weet-1.938
iki-1.785
oto-1.754
Token smiled
Token ID13541
Feature activation+3.66e-01

Pos logit contributions
bes+2.061
vered+1.873
antly+1.847
nob+1.814
 wrist+1.806
Neg logit contributions
ock-2.513
aff-2.509
weet-2.447
iki-2.338
aa-2.236
Token juicy
Token ID35075
Feature activation+2.56e-02

Pos logit contributions
bes+0.321
antly+0.311
vered+0.304
es+0.302
e+0.295
Neg logit contributions
aff-0.260
ock-0.257
weet-0.255
iki-0.234
aa-0.227
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
bes+0.040
antly+0.039
vered+0.038
es+0.037
e+0.037
Neg logit contributions
aff-0.048
ock-0.048
weet-0.047
iki-0.044
oto-0.043
Token They
Token ID1119
Feature activation-8.63e-01

Pos logit contributions
bes+2.198
vered+2.134
antly+2.109
nob+2.072
es+2.057
Neg logit contributions
aff-1.526
weet-1.452
ock-1.436
iki-1.312
aa-1.290
Token see
Token ID766
Feature activation-3.48e-01

Pos logit contributions
ock+1.582
aff+1.567
weet+1.553
iki+1.434
ules+1.421
Neg logit contributions
bes-1.406
antly-1.332
vered-1.323
e-1.311
 searches-1.298
Token Mr
Token ID1770
Feature activation+1.09e+00

Pos logit contributions
aff+0.684
ock+0.679
weet+0.671
iki+0.632
aa+0.612
Neg logit contributions
bes-0.522
antly-0.500
vered-0.486
es-0.486
e-0.469
Token.
Token ID13
Feature activation+2.21e+00

Pos logit contributions
bes+1.828
vered+1.731
antly+1.708
 wrist+1.619
e+1.614
Neg logit contributions
aff-2.070
ock-2.012
weet-1.938
iki-1.850
oto-1.807
Token Brown
Token ID4373
Feature activation+9.83e-01

Pos logit contributions
bes+3.600
vered+3.587
antly+3.382
e+3.283
es+3.210
Neg logit contributions
aff-4.201
weet-4.021
ock-3.949
iki-3.824
oto-3.696
Token walking
Token ID6155
Feature activation+1.32e-01

Pos logit contributions
bes+1.651
antly+1.596
vered+1.539
nob+1.498
 wrist+1.472
Neg logit contributions
aff-1.805
ock-1.767
weet-1.717
iki-1.623
aa-1.567
Token to
Token ID284
Feature activation-1.83e-02

Pos logit contributions
bes+0.204
antly+0.194
vered+0.189
es+0.186
e+0.181
Neg logit contributions
aff-0.257
ock-0.254
weet-0.251
iki-0.235
aa-0.229
Token his
Token ID465
Feature activation+1.19e+00

Pos logit contributions
aff+0.032
ock+0.032
weet+0.032
iki+0.029
oto+0.028
Neg logit contributions
bes-0.031
antly-0.030
vered-0.029
es-0.029
e-0.028
Token house
Token ID2156
Feature activation+2.33e-01

Pos logit contributions
bes+1.784
antly+1.772
vered+1.707
es+1.700
e+1.617
Neg logit contributions
aff-2.374
weet-2.317
ock-2.285
iki-2.198
oto-2.113
Token<|endoftext|>
Token ID50256
Feature activation+1.42e+00

Pos logit contributions
aff+1.225
ock+1.220
weet+1.195
iki+1.129
oto+1.107
Neg logit contributions
bes-0.744
antly-0.721
es-0.688
vered-0.682
e-0.669
TokenOnce
Token ID7454
Feature activation-1.70e+00

Pos logit contributions
bes+2.414
vered+2.261
 wrist+2.200
antly+2.182
es+2.084
Neg logit contributions
ock-2.613
aff-2.599
weet-2.526
iki-2.330
ules-2.288
Token upon
Token ID2402
Feature activation+7.35e-01

Pos logit contributions
aff+2.790
ock+2.719
weet+2.710
oto+2.371
andel+2.360
Neg logit contributions
antly-3.104
es-3.093
bes-3.030
vered-2.980
's-2.954
Token a
Token ID257
Feature activation-6.59e-01

Pos logit contributions
bes+1.109
antly+1.077
vered+1.054
es+1.021
 kit+0.993
Neg logit contributions
ock-1.438
weet-1.431
aff-1.393
oto-1.294
iki-1.292
Token time
Token ID640
Feature activation+9.60e-02

Pos logit contributions
aff+0.941
ock+0.907
weet+0.895
iki+0.811
ules+0.791
Neg logit contributions
bes-1.359
antly-1.311
vered-1.287
e-1.285
es-1.267
Token there
Token ID612
Feature activation+1.76e+00

Pos logit contributions
bes+0.165
antly+0.159
vered+0.155
es+0.153
e+0.150
Neg logit contributions
aff-0.167
ock-0.166
weet-0.164
iki-0.152
oto-0.147
Token was
Token ID373
Feature activation-3.09e-01

Pos logit contributions
bes+2.482
antly+2.135
vered+2.123
 wrist+2.109
 kit+2.099
Neg logit contributions
ock-3.775
aff-3.671
weet-3.654
iki-3.602
oto-3.463
Token a
Token ID257
Feature activation-7.61e-01

Pos logit contributions
aff+0.674
ock+0.665
weet+0.655
iki+0.621
aa+0.605
Neg logit contributions
bes-0.402
antly-0.381
vered-0.369
es-0.364
e-0.360
Token little
Token ID1310
Feature activation-1.19e+00

Pos logit contributions
aff+1.166
ock+1.162
weet+1.131
iki+1.029
ules+1.009
Neg logit contributions
bes-1.452
antly-1.405
vered-1.361
house-1.360
es-1.353
Token girl
Token ID2576
Feature activation+6.02e-01

Pos logit contributions
aff+2.246
ock+2.200
weet+2.184
iki+2.020
ules+2.015
Neg logit contributions
bes-1.818
antly-1.765
house-1.754
vered-1.726
 kit-1.726
Token.
Token ID13
Feature activation-7.82e-01

Pos logit contributions
bes+1.337
antly+1.301
vered+1.281
es+1.251
nob+1.223
Neg logit contributions
aff-0.749
weet-0.748
ock-0.748
iki-0.670
aa-0.652
Token happy
Token ID3772
Feature activation+6.43e-01

Pos logit contributions
bes+0.825
antly+0.799
vered+0.781
es+0.774
nob+0.762
Neg logit contributions
aff-0.582
ock-0.570
weet-0.569
iki-0.524
aa-0.508
Token and
Token ID290
Feature activation+5.05e-01

Pos logit contributions
bes+1.250
antly+1.215
vered+1.192
es+1.141
e+1.138
Neg logit contributions
aff-0.988
ock-0.985
weet-0.967
iki-0.888
oto-0.870
Token he
Token ID339
Feature activation-3.36e-01

Pos logit contributions
bes+0.944
antly+0.917
vered+0.907
es+0.877
nob+0.864
Neg logit contributions
aff-0.796
weet-0.786
ock-0.780
iki-0.720
oto-0.708
Token promised
Token ID8072
Feature activation+1.24e-01

Pos logit contributions
aff+0.616
ock+0.613
weet+0.605
iki+0.561
ules+0.544
Neg logit contributions
bes-0.554
antly-0.531
vered-0.518
es-0.518
e-0.504
Token to
Token ID284
Feature activation-5.66e-01

Pos logit contributions
bes+0.244
antly+0.235
vered+0.231
es+0.229
e+0.224
Neg logit contributions
aff-0.189
ock-0.186
weet-0.184
iki-0.169
oto-0.164
Token always
Token ID1464
Feature activation+2.00e+00

Pos logit contributions
aff+1.149
ock+1.132
weet+1.114
iki+1.050
ules+1.031
Neg logit contributions
bes-0.815
antly-0.778
e-0.755
vered-0.751
es-0.741
Token remember
Token ID3505
Feature activation-3.17e-01

Pos logit contributions
bes+3.743
antly+3.725
es+3.630
vered+3.605
ASH+3.299
Neg logit contributions
ock-3.385
weet-3.287
aff-3.160
iki-2.990
oto-2.977
Token where
Token ID810
Feature activation+8.10e-01

Pos logit contributions
aff+0.596
ock+0.586
weet+0.580
iki+0.542
aa+0.528
Neg logit contributions
bes-0.509
antly-0.490
vered-0.476
es-0.474
e-0.462
Token he
Token ID339
Feature activation+7.99e-01

Pos logit contributions
bes+1.592
antly+1.499
vered+1.473
es+1.449
e+1.425
Neg logit contributions
weet-1.246
ock-1.243
aff-1.224
iki-1.125
oto-1.093
Token put
Token ID1234
Feature activation+8.00e-01

Pos logit contributions
bes+1.313
antly+1.228
vered+1.176
es+1.164
 wrist+1.156
Neg logit contributions
aff-1.478
ock-1.472
weet-1.430
iki-1.357
aa-1.300
Token his
Token ID465
Feature activation+6.72e-01

Pos logit contributions
bes+1.299
antly+1.228
vered+1.204
es+1.188
nob+1.148
Neg logit contributions
ock-1.499
aff-1.481
weet-1.443
iki-1.333
oto-1.321
Token charge
Token ID3877
Feature activation-1.37e+00

Pos logit contributions
aff+0.722
ock+0.712
weet+0.702
iki+0.661
ules+0.643
Neg logit contributions
bes-0.548
antly-0.522
vered-0.512
es-0.503
e-0.497
Token and
Token ID290
Feature activation-3.21e-01

Pos logit contributions
aff+2.318
weet+2.256
ock+2.196
iki+2.174
oto+2.078
Neg logit contributions
bes-2.437
es-2.414
vered-2.368
antly-2.367
e-2.312
Token she
Token ID673
Feature activation+3.53e-02

Pos logit contributions
aff+0.549
ock+0.543
weet+0.533
iki+0.495
oto+0.481
Neg logit contributions
bes-0.568
antly-0.548
vered-0.537
es-0.533
e-0.520
Token had
Token ID550
Feature activation-9.86e-02

Pos logit contributions
bes+0.063
antly+0.060
vered+0.059
es+0.058
e+0.057
Neg logit contributions
aff-0.060
ock-0.059
weet-0.058
iki-0.054
oto-0.052
Token to
Token ID284
Feature activation+5.57e-01

Pos logit contributions
aff+0.172
ock+0.171
weet+0.168
iki+0.156
oto+0.152
Neg logit contributions
bes-0.171
antly-0.164
vered-0.161
es-0.159
e-0.156
Token be
Token ID307
Feature activation+1.90e+00

Pos logit contributions
bes+0.901
antly+0.875
es+0.859
vered+0.853
nob+0.799
Neg logit contributions
ock-1.022
aff-1.016
weet-1.005
iki-0.937
oto-0.917
Token obedient
Token ID41652
Feature activation+6.89e-01

Pos logit contributions
bes+3.464
antly+3.370
vered+3.245
es+3.224
nob+3.108
Neg logit contributions
ock-3.113
weet-3.064
aff-2.905
iki-2.737
aa-2.735
Token.
Token ID13
Feature activation-3.66e-01

Pos logit contributions
bes+1.135
antly+1.084
vered+1.051
es+1.022
e+1.014
Neg logit contributions
aff-1.262
ock-1.259
weet-1.226
iki-1.137
oto-1.119
Token
Token ID198
Feature activation-7.46e-01

Pos logit contributions
ock+0.763
aff+0.762
weet+0.751
iki+0.706
oto+0.690
Neg logit contributions
bes-0.507
antly-0.485
vered-0.465
es-0.463
e-0.444
Token
Token ID198
Feature activation-2.69e-01

Pos logit contributions
aff+1.632
ock+1.616
weet+1.606
iki+1.519
aa+1.473
Neg logit contributions
bes-0.957
antly-0.909
es-0.874
vered-0.855
e-0.839
TokenLater
Token ID18602
Feature activation-1.68e+00

Pos logit contributions
aff+0.489
ock+0.483
weet+0.478
iki+0.447
oto+0.432
Neg logit contributions
bes-0.451
antly-0.432
vered-0.421
es-0.416
e-0.407
TokenThe
Token ID464
Feature activation-2.49e-01

Pos logit contributions
bes+1.668
vered+1.543
antly+1.530
es+1.517
 searches+1.485
Neg logit contributions
aff-2.041
ock-2.033
weet-2.012
iki-1.852
ules-1.848
Token mom
Token ID1995
Feature activation-9.02e-01

Pos logit contributions
aff+0.426
ock+0.422
weet+0.416
iki+0.385
oto+0.371
Neg logit contributions
bes-0.440
antly-0.427
vered-0.415
es-0.409
e-0.401
Token looked
Token ID3114
Feature activation-1.02e+00

Pos logit contributions
aff+1.577
ock+1.551
weet+1.546
iki+1.412
andel+1.390
Neg logit contributions
bes-1.523
antly-1.502
es-1.479
vered-1.435
e-1.423
Token up
Token ID510
Feature activation-1.23e+00

Pos logit contributions
aff+2.206
weet+2.193
ock+2.185
iki+2.056
oto+1.996
Neg logit contributions
antly-1.323
bes-1.314
vered-1.261
es-1.247
 kit-1.199
Token and
Token ID290
Feature activation+7.43e-01

Pos logit contributions
ock+2.234
weet+2.231
aff+2.203
iki+2.010
ules+1.954
Neg logit contributions
antly-2.060
bes-1.997
vered-1.952
es-1.916
e-1.872
Token shrugged
Token ID28271
Feature activation+4.96e-01

Pos logit contributions
bes+1.436
vered+1.340
antly+1.332
e+1.306
es+1.296
Neg logit contributions
aff-1.163
ock-1.153
weet-1.143
iki-1.070
ules-1.036
Token her
Token ID607
Feature activation+7.95e-01

Pos logit contributions
bes+0.837
antly+0.789
vered+0.776
es+0.764
e+0.750
Neg logit contributions
aff-0.878
ock-0.878
weet-0.860
iki-0.803
oto-0.787
Token shoulders
Token ID12450
Feature activation-1.01e-01

Pos logit contributions
bes+0.917
vered+0.835
e+0.822
antly+0.821
es+0.815
Neg logit contributions
aff-1.842
ock-1.810
weet-1.779
iki-1.727
oto-1.694
Token.
Token ID13
Feature activation+2.09e-01

Pos logit contributions
aff+0.185
ock+0.183
weet+0.181
iki+0.169
oto+0.164
Neg logit contributions
bes-0.164
antly-0.158
vered-0.154
es-0.152
e-0.148
Token "
Token ID366
Feature activation-6.83e-02

Pos logit contributions
bes+0.361
antly+0.343
vered+0.340
es+0.334
e+0.330
Neg logit contributions
aff-0.365
ock-0.358
weet-0.356
iki-0.331
ules-0.320
TokenNot
Token ID3673
Feature activation-4.78e-01

Pos logit contributions
aff+0.147
ock+0.146
weet+0.145
iki+0.136
oto+0.133
Neg logit contributions
bes-0.090
antly-0.085
vered-0.083
es-0.082
e-0.079
Token.
Token ID13
Feature activation-1.75e-01

Pos logit contributions
bes+0.584
antly+0.559
vered+0.547
es+0.530
e+0.522
Neg logit contributions
aff-0.733
ock-0.724
weet-0.717
iki-0.668
aa-0.646
Token Sara
Token ID24799
Feature activation+8.64e-01

Pos logit contributions
aff+0.316
ock+0.315
weet+0.311
iki+0.289
oto+0.281
Neg logit contributions
bes-0.290
antly-0.278
vered-0.272
es-0.269
e-0.262
Token thought
Token ID1807
Feature activation-1.11e+00

Pos logit contributions
bes+1.632
antly+1.589
vered+1.547
es+1.489
nob+1.488
Neg logit contributions
aff-1.361
ock-1.305
weet-1.300
iki-1.225
aa-1.171
Token it
Token ID340
Feature activation+1.77e+00

Pos logit contributions
aff+2.459
ock+2.423
weet+2.381
iki+2.252
oto+2.209
Neg logit contributions
bes-1.449
es-1.370
antly-1.355
vered-1.347
e-1.319
Token was
Token ID373
Feature activation+1.76e-01

Pos logit contributions
bes+2.815
vered+2.601
antly+2.560
 wrist+2.526
e+2.473
Neg logit contributions
ock-3.308
aff-3.246
weet-3.149
iki-3.037
oto-2.908
Token a
Token ID257
Feature activation+3.78e-01

Pos logit contributions
bes+0.298
antly+0.288
vered+0.283
es+0.278
e+0.270
Neg logit contributions
aff-0.311
ock-0.308
weet-0.306
iki-0.283
oto-0.275
Token graceful
Token ID44363
Feature activation-3.39e-01

Pos logit contributions
bes+0.686
antly+0.660
vered+0.651
es+0.648
e+0.623
Neg logit contributions
aff-0.622
ock-0.614
weet-0.611
iki-0.561
oto-0.544
Token cat
Token ID3797
Feature activation+5.69e-01

Pos logit contributions
aff+0.628
ock+0.625
weet+0.616
iki+0.575
ules+0.563
Neg logit contributions
bes-0.544
antly-0.522
vered-0.510
es-0.504
e-0.494
Token and
Token ID290
Feature activation-2.93e-01

Pos logit contributions
bes+0.969
antly+0.942
vered+0.920
es+0.874
nob+0.867
Neg logit contributions
aff-1.005
ock-0.995
weet-0.984
iki-0.914
oto-0.881
Token wanted
Token ID2227
Feature activation+1.91e-01

Pos logit contributions
aff+0.497
ock+0.495
weet+0.484
iki+0.448
oto+0.435
Neg logit contributions
bes-0.519
antly-0.500
vered-0.489
es-0.487
e-0.474
Token to
Token ID284
Feature activation+4.26e-01

Pos logit contributions
bes+0.380
antly+0.365
vered+0.358
es+0.354
e+0.347
Neg logit contributions
aff-0.284
ock-0.280
weet-0.278
iki-0.253
oto-0.244
TokenThe
Token ID464
Feature activation+4.96e-01

Pos logit contributions
bes+0.138
antly+0.133
vered+0.131
es+0.129
e+0.125
Neg logit contributions
aff-0.142
ock-0.141
weet-0.140
iki-0.129
oto-0.125
Token dentist
Token ID38408
Feature activation+2.25e+00

Pos logit contributions
bes+0.872
antly+0.835
vered+0.831
es+0.814
e+0.791
Neg logit contributions
ock-0.860
aff-0.846
weet-0.841
iki-0.767
oto-0.761
Token fixed
Token ID5969
Feature activation+3.13e-01

Pos logit contributions
bes+4.293
vered+3.975
 elbows+3.847
 wrist+3.816
antly+3.815
Neg logit contributions
aff-3.776
ock-3.585
weet-3.526
iki-3.309
aa-3.137
Token her
Token ID607
Feature activation+2.01e+00

Pos logit contributions
bes+0.566
antly+0.546
vered+0.535
es+0.523
e+0.514
Neg logit contributions
aff-0.522
ock-0.517
weet-0.509
iki-0.467
oto-0.453
Token tooth
Token ID16162
Feature activation+1.13e+00

Pos logit contributions
bes+2.890
antly+2.786
vered+2.755
es+2.704
 Hide+2.549
Neg logit contributions
aff-4.003
weet-3.926
ock-3.915
iki-3.709
oto-3.692
Token and
Token ID290
Feature activation+4.40e-01

Pos logit contributions
bes+2.139
antly+2.072
vered+2.067
 Hide+1.977
 searches+1.966
Neg logit contributions
ock-1.730
aff-1.720
weet-1.650
iki-1.512
oto-1.466
Token Lily
Token ID20037
Feature activation+1.51e+00

Pos logit contributions
bes+0.800
antly+0.768
vered+0.760
es+0.741
e+0.734
Neg logit contributions
ock-0.720
aff-0.719
weet-0.712
iki-0.645
oto-0.630
Token felt
Token ID2936
Feature activation+4.29e-01

Pos logit contributions
bes+2.301
antly+2.172
vered+2.117
 wrist+2.049
 Something+2.037
Neg logit contributions
aff-2.879
ock-2.839
weet-2.801
iki-2.601
aa-2.566
Token much
Token ID881
Feature activation+1.03e+00

Pos logit contributions
bes+0.876
antly+0.842
vered+0.827
es+0.821
nob+0.796
Neg logit contributions
ock-0.614
aff-0.610
weet-0.603
iki-0.534
oto-0.527
Token better
Token ID1365
Feature activation+1.15e+00

Pos logit contributions
bes+1.854
vered+1.715
es+1.707
antly+1.706
e+1.677
Neg logit contributions
ock-1.792
aff-1.745
weet-1.710
iki-1.535
oto-1.531
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
bes+2.036
antly+1.959
vered+1.902
nob+1.877
es+1.810
Neg logit contributions
ock-1.926
aff-1.920
weet-1.872
iki-1.755
aa-1.720
Token would
Token ID561
Feature activation-2.91e-01

Pos logit contributions
aff+0.790
ock+0.776
weet+0.775
iki+0.707
ules+0.684
Neg logit contributions
bes-0.891
antly-0.862
vered-0.849
es-0.846
e-0.825
Token not
Token ID407
Feature activation-1.00e-01

Pos logit contributions
aff+0.571
ock+0.565
weet+0.558
iki+0.523
ules+0.510
Neg logit contributions
bes-0.442
antly-0.421
vered-0.409
es-0.405
e-0.399
Token play
Token ID711
Feature activation+2.11e+00

Pos logit contributions
aff+0.193
ock+0.191
weet+0.189
iki+0.177
oto+0.172
Neg logit contributions
bes-0.154
antly-0.148
vered-0.144
es-0.142
e-0.139
Token in
Token ID287
Feature activation+8.72e-01

Pos logit contributions
antly+3.013
bes+2.943
vered+2.830
nob+2.698
es+2.594
Neg logit contributions
ock-4.404
aff-4.167
weet-4.028
aa-3.763
oto-3.729
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
bes+1.680
antly+1.655
vered+1.600
es+1.580
nob+1.543
Neg logit contributions
ock-1.332
weet-1.299
aff-1.298
iki-1.153
aa-1.125
Token bathroom
Token ID12436
Feature activation-1.00e-01

Pos logit contributions
bes+1.493
vered+1.459
es+1.435
antly+1.426
e+1.360
Neg logit contributions
ock-2.677
aff-2.608
weet-2.600
aa-2.436
iki-2.417
Token.
Token ID13
Feature activation-3.05e-01

Pos logit contributions
aff+0.197
ock+0.195
weet+0.193
iki+0.181
oto+0.176
Neg logit contributions
bes-0.150
antly-0.144
vered-0.140
es-0.138
e-0.135
Token
Token ID198
Feature activation-7.67e-01

Pos logit contributions
aff+0.530
ock+0.528
weet+0.517
iki+0.482
oto+0.467
Neg logit contributions
bes-0.531
antly-0.513
vered-0.496
es-0.494
e-0.482
Token
Token ID198
Feature activation+6.81e-02

Pos logit contributions
aff+1.659
ock+1.645
weet+1.626
iki+1.538
ules+1.495
Neg logit contributions
bes-1.009
antly-0.974
es-0.924
vered-0.899
e-0.898
TokenBut
Token ID1537
Feature activation-1.23e-01

Pos logit contributions
bes+0.075
antly+0.070
vered+0.069
es+0.067
e+0.064
Neg logit contributions
aff-0.163
ock-0.162
weet-0.160
iki-0.152
oto-0.149
Token it
Token ID340
Feature activation+6.11e-01

Pos logit contributions
aff+0.219
ock+0.217
weet+0.214
iki+0.199
oto+0.193
Neg logit contributions
bes-0.208
antly-0.201
vered-0.195
es-0.194
e-0.189
Tokenay
Token ID323
Feature activation+1.82e-02

Pos logit contributions
aff+1.307
ock+1.299
weet+1.295
iki+1.216
aa+1.176
Neg logit contributions
bes-1.671
antly-1.646
es-1.603
e-1.562
nob-1.549
Tokenon
Token ID261
Feature activation+2.25e-01

Pos logit contributions
bes+0.018
antly+0.016
vered+0.016
es+0.015
e+0.015
Neg logit contributions
aff-0.045
ock-0.045
weet-0.045
iki-0.043
oto-0.042
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
bes+0.376
antly+0.359
vered+0.354
es+0.346
e+0.339
Neg logit contributions
aff-0.404
ock-0.401
weet-0.395
iki-0.367
oto-0.356
Token
Token ID220
Feature activation+8.49e-01

Pos logit contributions
aff+0.032
ock+0.032
weet+0.032
iki+0.030
oto+0.029
Neg logit contributions
bes-0.022
antly-0.021
vered-0.020
es-0.020
e-0.019
Token
Token ID198
Feature activation+2.39e-01

Pos logit contributions
bes+1.326
vered+1.268
antly+1.256
e+1.205
es+1.198
Neg logit contributions
aff-1.632
ock-1.597
weet-1.575
iki-1.464
ules-1.457
Token
Token ID198
Feature activation+4.53e-01

Pos logit contributions
bes+0.349
antly+0.334
vered+0.329
es+0.320
e+0.311
Neg logit contributions
aff-0.479
ock-0.475
weet-0.470
iki-0.438
oto-0.431
Tokenâ
Token ID22940
Feature activation-5.09e-01

Pos logit contributions
bes+0.733
antly+0.700
vered+0.692
es+0.678
nob+0.657
Neg logit contributions
ock-0.844
aff-0.841
weet-0.827
iki-0.763
oto-0.756
Token€
Token ID26391
Feature activation+1.00e+00

Pos logit contributions
aff+0.939
ock+0.919
weet+0.917
iki+0.861
oto+0.831
Neg logit contributions
bes-0.831
antly-0.799
vered-0.774
e-0.761
es-0.758
Token�
Token ID129
Feature activation-3.55e-01

Pos logit contributions
antly+1.515
bes+1.513
vered+1.499
nob+1.402
es+1.399
Neg logit contributions
aff-1.894
weet-1.862
ock-1.856
iki-1.735
aa-1.731
Token�
Token ID241
Feature activation+4.33e-01

Pos logit contributions
aff+0.792
ock+0.789
weet+0.781
iki+0.740
aa+0.718
Neg logit contributions
bes-0.440
antly-0.410
vered-0.397
es-0.388
e-0.384
TokenThat
Token ID2504
Feature activation+8.29e-01

Pos logit contributions
bes+0.631
antly+0.600
es+0.588
vered+0.583
nob+0.559
Neg logit contributions
ock-0.866
aff-0.860
weet-0.854
iki-0.798
oto-0.784
Tokenon
Token ID261
Feature activation-1.59e-02

Pos logit contributions
bes+0.316
antly+0.295
vered+0.289
es+0.270
e+0.260
Neg logit contributions
aff-0.772
ock-0.769
weet-0.759
iki-0.720
oto-0.707
Token before
Token ID878
Feature activation+4.17e-02

Pos logit contributions
aff+0.029
ock+0.029
weet+0.028
iki+0.026
oto+0.025
Neg logit contributions
bes-0.026
antly-0.025
vered-0.025
es-0.024
e-0.024
Token we
Token ID356
Feature activation-3.69e-01

Pos logit contributions
bes+0.068
antly+0.065
vered+0.064
es+0.063
e+0.061
Neg logit contributions
aff-0.077
ock-0.076
weet-0.075
iki-0.070
oto-0.068
Token buy
Token ID2822
Feature activation+1.00e-01

Pos logit contributions
aff+0.729
ock+0.722
weet+0.714
iki+0.671
oto+0.655
Neg logit contributions
bes-0.546
antly-0.527
vered-0.513
es-0.507
e-0.501
Token it
Token ID340
Feature activation-3.99e-01

Pos logit contributions
bes+0.146
antly+0.138
vered+0.135
es+0.133
e+0.129
Neg logit contributions
aff-0.203
ock-0.202
weet-0.199
iki-0.186
oto-0.182
Token."
Token ID526
Feature activation-2.68e-01

Pos logit contributions
aff+0.738
ock+0.731
weet+0.726
iki+0.682
oto+0.656
Neg logit contributions
bes-0.647
antly-0.622
es-0.606
vered-0.600
e-0.584
Token
Token ID198
Feature activation+3.54e-02

Pos logit contributions
aff+0.560
ock+0.559
weet+0.552
iki+0.520
oto+0.507
Neg logit contributions
bes-0.371
antly-0.355
vered-0.340
es-0.340
e-0.327
Token
Token ID198
Feature activation+6.60e-01

Pos logit contributions
bes+0.051
antly+0.048
vered+0.047
es+0.047
e+0.045
Neg logit contributions
aff-0.072
ock-0.072
weet-0.071
iki-0.066
oto-0.065
TokenThey
Token ID2990
Feature activation-4.16e-01

Pos logit contributions
bes+1.217
antly+1.179
vered+1.169
es+1.153
e+1.128
Neg logit contributions
ock-1.072
aff-1.059
weet-1.053
iki-0.949
oto-0.926
Token examined
Token ID11068
Feature activation+1.07e-01

Pos logit contributions
aff+0.789
ock+0.785
weet+0.777
iki+0.722
oto+0.704
Neg logit contributions
bes-0.651
antly-0.631
vered-0.617
es-0.611
e-0.599
Token the
Token ID262
Feature activation-8.64e-01

Pos logit contributions
bes+0.192
antly+0.184
vered+0.180
es+0.178
e+0.174
Neg logit contributions
aff-0.178
ock-0.176
weet-0.174
iki-0.161
oto-0.155
Token
Token ID198
Feature activation+6.07e-02

Pos logit contributions
bes+0.098
antly+0.094
vered+0.091
es+0.090
e+0.087
Neg logit contributions
aff-0.138
ock-0.137
weet-0.136
iki-0.127
oto-0.124
Token"
Token ID1
Feature activation+3.41e-01

Pos logit contributions
bes+0.083
antly+0.079
vered+0.077
es+0.076
e+0.073
Neg logit contributions
aff-0.127
ock-0.127
weet-0.125
iki-0.118
aa-0.115
TokenLet
Token ID5756
Feature activation+5.56e-01

Pos logit contributions
bes+0.446
antly+0.419
vered+0.409
es+0.406
e+0.387
Neg logit contributions
aff-0.743
ock-0.738
weet-0.734
iki-0.691
oto-0.671
Token's
Token ID338
Feature activation+5.95e-01

Pos logit contributions
bes+0.254
antly+0.198
vered+0.192
es+0.174
 wrist+0.151
Neg logit contributions
ock-1.692
aff-1.688
weet-1.677
iki-1.609
aa-1.593
Token climb
Token ID12080
Feature activation-7.47e-01

Pos logit contributions
bes+1.059
antly+1.017
vered+0.996
es+0.994
 kit+0.943
Neg logit contributions
aff-1.011
ock-1.007
weet-1.004
iki-0.926
oto-0.893
Token the
Token ID262
Feature activation-1.41e+00

Pos logit contributions
aff+1.591
ock+1.576
weet+1.565
iki+1.475
oto+1.446
Neg logit contributions
bes-0.993
antly-0.965
es-0.929
vered-0.929
e-0.899
Token tree
Token ID5509
Feature activation-4.33e-01

Pos logit contributions
aff+2.866
ock+2.838
weet+2.794
ules+2.646
oto+2.617
Neg logit contributions
bes-1.986
antly-1.946
nob-1.897
house-1.878
vered-1.865
Token!"
Token ID2474
Feature activation-1.34e+00

Pos logit contributions
aff+0.833
ock+0.827
weet+0.816
iki+0.767
oto+0.751
Neg logit contributions
bes-0.660
antly-0.635
es-0.618
vered-0.617
house-0.599
Token Sam
Token ID3409
Feature activation-7.50e-02

Pos logit contributions
aff+2.213
ock+2.196
weet+2.147
iki+1.941
oto+1.922
Neg logit contributions
bes-2.462
antly-2.362
vered-2.333
house-2.332
es-2.322
Token says
Token ID1139
Feature activation+1.17e+00

Pos logit contributions
aff+0.116
ock+0.114
weet+0.113
iki+0.104
oto+0.100
Neg logit contributions
bes-0.145
antly-0.140
vered-0.137
es-0.136
e-0.133
Token.
Token ID13
Feature activation-4.22e-01

Pos logit contributions
bes+1.995
antly+1.890
vered+1.862
 searches+1.781
es+1.760
Neg logit contributions
aff-2.079
ock-2.021
weet-1.998
aa-1.884
iki-1.868
Token.
Token ID13
Feature activation+7.40e-01

Pos logit contributions
aff+0.185
ock+0.183
weet+0.181
iki+0.169
oto+0.164
Neg logit contributions
bes-0.160
antly-0.154
vered-0.150
es-0.148
e-0.144
Token He
Token ID679
Feature activation-2.34e-01

Pos logit contributions
bes+1.334
vered+1.256
antly+1.249
es+1.245
e+1.225
Neg logit contributions
aff-1.240
ock-1.223
weet-1.222
iki-1.120
ules-1.105
Token put
Token ID1234
Feature activation+3.31e-01

Pos logit contributions
aff+0.420
ock+0.415
weet+0.411
iki+0.381
oto+0.370
Neg logit contributions
bes-0.393
antly-0.380
vered-0.371
es-0.366
e-0.358
Token on
Token ID319
Feature activation+1.14e+00

Pos logit contributions
bes+0.482
antly+0.454
vered+0.447
es+0.440
e+0.424
Neg logit contributions
ock-0.668
aff-0.656
weet-0.649
iki-0.608
oto-0.598
Token his
Token ID465
Feature activation-3.17e-01

Pos logit contributions
bes+1.965
es+1.878
antly+1.876
vered+1.843
e+1.765
Neg logit contributions
ock-1.987
weet-1.931
aff-1.894
oto-1.744
iki-1.736
Token clothes
Token ID8242
Feature activation-5.53e-01

Pos logit contributions
aff+0.630
ock+0.621
weet+0.616
iki+0.575
oto+0.559
Neg logit contributions
bes-0.477
antly-0.452
vered-0.441
es-0.433
e-0.426
Token,
Token ID11
Feature activation-3.88e-01

Pos logit contributions
aff+0.888
weet+0.864
ock+0.861
iki+0.797
oto+0.767
Neg logit contributions
bes-1.035
antly-1.004
es-0.979
vered-0.977
e-0.952
Token and
Token ID290
Feature activation+6.41e-01

Pos logit contributions
aff+0.724
ock+0.708
weet+0.698
iki+0.655
oto+0.636
Neg logit contributions
bes-0.633
antly-0.605
vered-0.590
es-0.586
e-0.572
Token his
Token ID465
Feature activation+1.94e-01

Pos logit contributions
bes+1.144
antly+1.083
vered+1.070
es+1.048
e+1.032
Neg logit contributions
ock-1.077
weet-1.060
aff-1.051
iki-0.970
oto-0.947
Token mom
Token ID1995
Feature activation+9.90e-01

Pos logit contributions
bes+0.288
antly+0.277
vered+0.270
es+0.269
e+0.260
Neg logit contributions
aff-0.381
ock-0.379
weet-0.375
iki-0.353
oto-0.343
Token put
Token ID1234
Feature activation+5.91e-01

Pos logit contributions
bes+1.764
antly+1.634
vered+1.622
 wrist+1.554
es+1.551
Neg logit contributions
ock-1.712
aff-1.674
weet-1.673
iki-1.542
ules-1.523
Token all
Token ID477
Feature activation-2.42e+00

Pos logit contributions
bes+0.063
antly+0.060
vered+0.059
es+0.058
e+0.057
Neg logit contributions
aff-0.066
ock-0.065
weet-0.064
iki-0.059
oto-0.058
Token of
Token ID286
Feature activation-3.18e-01

Pos logit contributions
aff+4.895
ock+4.574
weet+4.566
iki+4.494
oto+4.423
Neg logit contributions
e-3.522
's-3.485
bes-3.417
es-3.402
house-3.375
Token these
Token ID777
Feature activation-1.27e+00

Pos logit contributions
aff+0.590
ock+0.583
weet+0.576
iki+0.540
oto+0.524
Neg logit contributions
bes-0.513
antly-0.490
vered-0.478
es-0.476
e-0.468
Token le
Token ID443
Feature activation-1.12e+00

Pos logit contributions
ock+2.245
weet+2.243
aff+2.196
iki+2.059
oto+2.003
Neg logit contributions
bes-2.161
es-2.055
antly-2.021
vered-2.002
e-1.966
Tokenmons
Token ID11567
Feature activation-1.48e+00

Pos logit contributions
ock+2.168
weet+2.164
aff+2.076
iki+1.949
oto+1.925
Neg logit contributions
bes-1.758
es-1.650
antly-1.616
vered-1.597
 kit-1.580
Token?"
Token ID1701
Feature activation-3.79e-01

Pos logit contributions
aff+2.587
weet+2.562
ock+2.552
iki+2.398
andel+2.318
Neg logit contributions
bes-2.537
es-2.490
antly-2.405
 kit-2.373
e-2.355
Token Lily
Token ID20037
Feature activation+4.70e-01

Pos logit contributions
ock+0.569
aff+0.568
weet+0.556
iki+0.514
oto+0.498
Neg logit contributions
bes-0.756
antly-0.727
es-0.708
vered-0.707
e-0.692
Token looked
Token ID3114
Feature activation-9.72e-01

Pos logit contributions
bes+0.827
antly+0.794
vered+0.780
es+0.766
nob+0.748
Neg logit contributions
aff-0.800
ock-0.783
weet-0.776
iki-0.729
oto-0.707
Token at
Token ID379
Feature activation-2.27e+00

Pos logit contributions
ock+1.855
weet+1.838
aff+1.818
iki+1.704
ules+1.660
Neg logit contributions
bes-1.506
antly-1.502
vered-1.460
es-1.433
house-1.402
Token the
Token ID262
Feature activation-1.18e+00

Pos logit contributions
aff+4.849
weet+4.749
ock+4.685
ules+4.634
aa+4.473
Neg logit contributions
bes-3.198
antly-3.074
vered-3.038
e-2.958
es-2.951
Token le
Token ID443
Feature activation-9.23e-01

Pos logit contributions
ock+1.772
weet+1.731
aff+1.675
iki+1.538
ules+1.526
Neg logit contributions
bes-2.366
antly-2.244
vered-2.205
es-2.191
 kit-2.177
Token touch
Token ID3638
Feature activation+1.09e-01

Pos logit contributions
aff+1.862
weet+1.793
ock+1.790
iki+1.725
ules+1.678
Neg logit contributions
bes-1.246
antly-1.171
e-1.169
vered-1.149
es-1.129
Token.
Token ID13
Feature activation-6.41e-01

Pos logit contributions
bes+0.166
antly+0.157
vered+0.154
es+0.151
e+0.147
Neg logit contributions
aff-0.212
ock-0.211
weet-0.208
iki-0.193
oto-0.188
Token Tim
Token ID5045
Feature activation-8.12e-02

Pos logit contributions
ock+0.994
aff+0.982
weet+0.961
iki+0.898
oto+0.864
Neg logit contributions
bes-1.246
antly-1.209
vered-1.172
house-1.171
es-1.169
Tokenmy
Token ID1820
Feature activation+3.81e-01

Pos logit contributions
aff+0.140
ock+0.139
weet+0.137
iki+0.127
oto+0.123
Neg logit contributions
bes-0.142
antly-0.137
vered-0.134
es-0.132
e-0.129
Token liked
Token ID8288
Feature activation-4.71e-01

Pos logit contributions
bes+0.558
antly+0.533
vered+0.518
es+0.507
nob+0.494
Neg logit contributions
aff-0.757
ock-0.752
weet-0.745
iki-0.698
aa-0.682
Token it
Token ID340
Feature activation-4.03e-01

Pos logit contributions
aff+0.906
ock+0.894
weet+0.877
iki+0.836
oto+0.807
Neg logit contributions
bes-0.735
antly-0.696
vered-0.685
es-0.681
e-0.665
Token and
Token ID290
Feature activation-3.10e-01

Pos logit contributions
aff+0.595
ock+0.577
weet+0.569
iki+0.529
oto+0.507
Neg logit contributions
bes-0.810
antly-0.790
es-0.773
vered-0.773
e-0.757
Token said
Token ID531
Feature activation+3.96e-01

Pos logit contributions
aff+0.543
ock+0.537
weet+0.528
iki+0.493
oto+0.476
Neg logit contributions
bes-0.535
antly-0.515
vered-0.504
es-0.501
e-0.485
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
bes+0.725
antly+0.685
vered+0.672
es+0.659
e+0.655
Neg logit contributions
aff-0.657
ock-0.650
weet-0.641
iki-0.586
oto-0.575
Token "
Token ID366
Feature activation+5.27e-01

Pos logit contributions
ock+2.512
aff+2.479
weet+2.414
iki+2.337
oto+2.229
Neg logit contributions
bes-1.752
antly-1.661
es-1.618
 wrist-1.562
vered-1.550
TokenMom
Token ID29252
Feature activation+3.75e-01

Pos logit contributions
bes+0.760
es+0.705
vered+0.704
antly+0.704
 searches+0.664
Neg logit contributions
ock-1.084
aff-1.078
weet-1.070
iki-0.989
oto-0.971
Token fluffy
Token ID39145
Feature activation-4.78e-01

Pos logit contributions
ock+1.016
aff+1.005
weet+0.984
iki+0.906
ules+0.881
Neg logit contributions
bes-1.250
antly-1.213
es-1.171
vered-1.168
house-1.163
Token bunny
Token ID44915
Feature activation+9.52e-01

Pos logit contributions
ock+0.848
aff+0.843
weet+0.833
iki+0.773
ules+0.757
Neg logit contributions
bes-0.804
antly-0.766
vered-0.743
es-0.742
e-0.733
Token,
Token ID11
Feature activation-4.61e-01

Pos logit contributions
bes+1.740
antly+1.691
vered+1.659
nob+1.595
ASH+1.561
Neg logit contributions
aff-1.563
ock-1.552
weet-1.548
iki-1.443
aa-1.367
Token a
Token ID257
Feature activation+5.44e-05

Pos logit contributions
aff+0.861
ock+0.855
weet+0.833
iki+0.785
aa+0.763
Neg logit contributions
bes-0.746
antly-0.709
es-0.693
vered-0.690
e-0.676
Token w
Token ID266
Feature activation-1.88e+00

Pos logit contributions
bes+0.000
antly+0.000
vered+0.000
es+0.000
e+0.000
Neg logit contributions
aff-0.000
ock-0.000
weet-0.000
iki-0.000
oto-0.000
Tokenadd
Token ID2860
Feature activation-2.31e+00

Pos logit contributions
weet+2.528
ock+2.392
aff+2.355
iki+2.077
ules+2.026
Neg logit contributions
bes-4.136
e-3.930
antly-3.825
es-3.822
vered-3.819
Tokenling
Token ID1359
Feature activation-1.01e+00

Pos logit contributions
weet+3.971
aff+3.855
ock+3.770
ules+3.588
arl+3.535
Neg logit contributions
bes-4.068
es-3.878
e-3.856
antly-3.817
house-3.750
Token duck
Token ID22045
Feature activation-2.45e-01

Pos logit contributions
aff+1.871
ock+1.858
weet+1.826
ules+1.710
iki+1.707
Neg logit contributions
bes-1.606
antly-1.546
e-1.510
vered-1.502
house-1.497
Token,
Token ID11
Feature activation-6.60e-01

Pos logit contributions
aff+0.420
ock+0.417
weet+0.411
iki+0.380
oto+0.370
Neg logit contributions
bes-0.429
antly-0.412
vered-0.402
es-0.402
e-0.393
Token and
Token ID290
Feature activation-2.86e-01

Pos logit contributions
aff+1.045
ock+1.035
weet+0.997
iki+0.929
aa+0.902
Neg logit contributions
bes-1.263
antly-1.209
es-1.193
vered-1.177
e-1.160
Token a
Token ID257
Feature activation+3.91e-02

Pos logit contributions
aff+0.490
ock+0.487
weet+0.477
iki+0.445
oto+0.430
Neg logit contributions
bes-0.501
antly-0.484
vered-0.473
es-0.472
e-0.458
Token best
Token ID1266
Feature activation-1.30e+00

Pos logit contributions
aff+2.587
weet+2.491
ock+2.479
ules+2.310
iki+2.310
Neg logit contributions
bes-1.580
antly-1.548
e-1.501
vered-1.454
 kit-1.450
Token of
Token ID286
Feature activation-8.49e-01

Pos logit contributions
aff+1.905
weet+1.829
ock+1.771
iki+1.615
oto+1.612
Neg logit contributions
bes-2.644
antly-2.604
es-2.578
vered-2.558
e-2.548
Token friends
Token ID2460
Feature activation-9.26e-01

Pos logit contributions
aff+1.768
ock+1.740
weet+1.727
iki+1.622
ules+1.595
Neg logit contributions
bes-1.197
vered-1.136
antly-1.136
es-1.121
e-1.115
Token,
Token ID11
Feature activation-1.89e+00

Pos logit contributions
aff+1.662
weet+1.637
ock+1.635
iki+1.514
oto+1.497
Neg logit contributions
bes-1.533
es-1.521
antly-1.503
vered-1.482
e-1.467
Token and
Token ID290
Feature activation-9.04e-01

Pos logit contributions
aff+3.356
ock+3.232
weet+3.073
arl+3.043
andel+3.002
Neg logit contributions
bes-3.320
es-3.258
e-3.222
house-3.138
antly-3.088
Token they
Token ID484
Feature activation-2.30e+00

Pos logit contributions
aff+1.369
ock+1.363
weet+1.316
iki+1.197
oto+1.168
Neg logit contributions
bes-1.771
es-1.734
antly-1.732
vered-1.703
e-1.681
Token loved
Token ID6151
Feature activation-2.08e-01

Pos logit contributions
aff+4.362
ock+4.213
weet+4.188
andel+4.048
arl+3.960
Neg logit contributions
es-3.884
e-3.884
bes-3.767
antly-3.695
vered-3.650
Token to
Token ID284
Feature activation-9.95e-01

Pos logit contributions
aff+0.342
ock+0.339
weet+0.334
iki+0.309
oto+0.299
Neg logit contributions
bes-0.379
antly-0.365
vered-0.358
es-0.356
e-0.348
Token dance
Token ID9280
Feature activation-8.28e-01

Pos logit contributions
aff+2.055
ock+2.006
weet+1.980
iki+1.874
ules+1.839
Neg logit contributions
bes-1.445
antly-1.361
e-1.350
vered-1.328
es-1.317
Token together
Token ID1978
Feature activation-4.60e-01

Pos logit contributions
aff+1.848
weet+1.834
ock+1.821
iki+1.744
oto+1.693
Neg logit contributions
bes-1.037
antly-0.981
es-0.972
vered-0.944
e-0.917
Token in
Token ID287
Feature activation+1.85e-01

Pos logit contributions
aff+0.859
ock+0.850
weet+0.845
iki+0.792
oto+0.769
Neg logit contributions
bes-0.735
antly-0.708
es-0.692
vered-0.691
e-0.676
Token there
Token ID612
Feature activation+1.72e+00

Pos logit contributions
bes+1.456
antly+1.321
vered+1.299
 wrist+1.268
es+1.261
Neg logit contributions
ock-2.035
weet-2.014
aff-1.998
iki-1.920
aa-1.840
Token was
Token ID373
Feature activation-6.44e-02

Pos logit contributions
bes+2.394
antly+2.101
vered+2.066
 wrist+2.046
 kit+2.044
Neg logit contributions
ock-3.679
aff-3.612
weet-3.580
iki-3.494
oto-3.379
Token a
Token ID257
Feature activation-7.85e-01

Pos logit contributions
aff+0.137
ock+0.136
weet+0.134
iki+0.126
oto+0.123
Neg logit contributions
bes-0.086
antly-0.082
vered-0.080
es-0.079
e-0.076
Token young
Token ID1862
Feature activation-7.89e-01

Pos logit contributions
ock+1.219
aff+1.215
weet+1.185
iki+1.085
ules+1.059
Neg logit contributions
bes-1.481
antly-1.427
house-1.387
es-1.382
e-1.379
Token,
Token ID11
Feature activation-8.25e-01

Pos logit contributions
ock+1.417
aff+1.417
weet+1.399
iki+1.304
ules+1.267
Neg logit contributions
bes-1.282
antly-1.217
es-1.202
vered-1.192
house-1.182
Token spoiled
Token ID34781
Feature activation-2.19e+00

Pos logit contributions
aff+1.471
ock+1.439
weet+1.394
iki+1.312
ules+1.272
Neg logit contributions
bes-1.428
antly-1.362
house-1.323
es-1.317
e-1.317
Token girl
Token ID2576
Feature activation+1.74e-01

Pos logit contributions
ock+4.401
weet+4.391
aff+4.330
iki+4.143
ules+4.038
Neg logit contributions
bes-3.077
house-3.016
 kit-2.976
 wrist-2.914
antly-2.853
Token.
Token ID13
Feature activation-3.52e-01

Pos logit contributions
bes+0.362
antly+0.353
vered+0.346
es+0.340
nob+0.332
Neg logit contributions
aff-0.240
ock-0.238
weet-0.235
iki-0.213
aa-0.205
Token She
Token ID1375
Feature activation-4.44e-01

Pos logit contributions
aff+0.549
ock+0.548
weet+0.535
iki+0.496
oto+0.481
Neg logit contributions
bes-0.672
antly-0.647
vered-0.634
es-0.630
e-0.620
Token was
Token ID373
Feature activation-8.50e-01

Pos logit contributions
aff+0.803
ock+0.795
weet+0.787
iki+0.734
ules+0.711
Neg logit contributions
bes-0.752
antly-0.716
es-0.698
vered-0.698
e-0.690
Token living
Token ID2877
Feature activation+3.82e-01

Pos logit contributions
aff+1.510
ock+1.494
weet+1.432
iki+1.354
oto+1.321
Neg logit contributions
bes-1.470
e-1.399
antly-1.390
house-1.376
es-1.376
Token His
Token ID2399
Feature activation+8.35e-01

Pos logit contributions
bes+1.251
vered+1.199
antly+1.184
es+1.164
e+1.137
Neg logit contributions
aff-1.314
ock-1.282
weet-1.273
iki-1.183
aa-1.168
Token mom
Token ID1995
Feature activation-5.94e-02

Pos logit contributions
bes+1.446
antly+1.404
vered+1.399
es+1.375
e+1.345
Neg logit contributions
aff-1.449
ock-1.429
weet-1.409
iki-1.358
oto-1.338
Tokenmy
Token ID1820
Feature activation-1.51e-01

Pos logit contributions
aff+0.102
ock+0.101
weet+0.099
iki+0.092
oto+0.089
Neg logit contributions
bes-0.104
antly-0.101
vered-0.098
es-0.097
e-0.095
Token said
Token ID531
Feature activation-3.89e-01

Pos logit contributions
aff+0.236
ock+0.234
weet+0.231
iki+0.211
oto+0.204
Neg logit contributions
bes-0.286
antly-0.277
vered-0.271
es-0.269
e-0.263
Token yes
Token ID3763
Feature activation-1.57e+00

Pos logit contributions
aff+0.675
ock+0.665
weet+0.657
iki+0.610
oto+0.587
Neg logit contributions
bes-0.679
antly-0.656
vered-0.642
es-0.641
e-0.623
Token,
Token ID11
Feature activation-2.72e+00

Pos logit contributions
weet+2.966
aff+2.963
ock+2.912
iki+2.778
andel+2.727
Neg logit contributions
bes-2.396
's-2.381
antly-2.353
house-2.293
es-2.292
Token but
Token ID475
Feature activation-2.03e-01

Pos logit contributions
aff+5.306
ock+4.931
iki+4.826
weet+4.724
arl+4.686
Neg logit contributions
bes-4.494
's-4.415
es-4.340
 kit-4.274
house-4.229
Token told
Token ID1297
Feature activation-1.24e+00

Pos logit contributions
aff+0.294
ock+0.291
weet+0.286
iki+0.260
oto+0.251
Neg logit contributions
bes-0.411
antly-0.399
vered-0.391
es-0.388
e-0.380
Token him
Token ID683
Feature activation-3.24e-01

Pos logit contributions
aff+2.659
ock+2.561
weet+2.559
iki+2.450
aa+2.390
Neg logit contributions
bes-1.617
antly-1.570
e-1.529
vered-1.519
nob-1.486
Token to
Token ID284
Feature activation-4.19e-01

Pos logit contributions
aff+0.541
ock+0.535
weet+0.528
iki+0.488
oto+0.471
Neg logit contributions
bes-0.582
antly-0.561
vered-0.550
es-0.549
e-0.532
Token be
Token ID307
Feature activation+1.28e+00

Pos logit contributions
aff+0.841
ock+0.823
weet+0.819
iki+0.769
ules+0.749
Neg logit contributions
bes-0.613
antly-0.583
vered-0.569
e-0.556
es-0.555
TokenThe
Token ID464
Feature activation-5.20e-01

Pos logit contributions
weet+1.989
aff+1.988
ock+1.987
iki+1.865
aa+1.830
Neg logit contributions
bes-1.516
antly-1.490
es-1.449
vered-1.419
e-1.398
Token moral
Token ID6573
Feature activation-1.24e+00

Pos logit contributions
aff+0.814
ock+0.804
weet+0.800
iki+0.737
aa+0.713
Neg logit contributions
bes-1.001
antly-0.963
vered-0.942
es-0.936
e-0.924
Token of
Token ID286
Feature activation-4.39e-01

Pos logit contributions
aff+1.433
weet+1.373
ock+1.345
iki+1.246
ules+1.172
Neg logit contributions
antly-2.838
bes-2.805
es-2.771
vered-2.763
e-2.739
Token the
Token ID262
Feature activation-1.48e+00

Pos logit contributions
aff+0.796
ock+0.778
weet+0.770
iki+0.720
aa+0.701
Neg logit contributions
bes-0.729
antly-0.705
es-0.685
vered-0.683
e-0.677
Token story
Token ID1621
Feature activation-5.59e-01

Pos logit contributions
aff+2.976
ock+2.928
weet+2.920
iki+2.759
aa+2.700
Neg logit contributions
bes-2.204
antly-2.056
es-2.048
e-2.016
 searches-1.964
Token is
Token ID318
Feature activation-2.16e+00

Pos logit contributions
aff+1.465
ock+1.454
weet+1.451
iki+1.379
ules+1.358
Neg logit contributions
bes-0.468
antly-0.450
es-0.426
vered-0.420
e-0.387
Token that
Token ID326
Feature activation+6.21e-01

Pos logit contributions
ock+3.692
aff+3.683
weet+3.568
andel+3.154
iki+3.148
Neg logit contributions
antly-3.933
es-3.839
bes-3.723
vered-3.721
e-3.710
Token when
Token ID618
Feature activation+3.43e-01

Pos logit contributions
bes+1.168
antly+1.118
vered+1.083
nob+1.063
es+1.063
Neg logit contributions
weet-0.999
ock-0.996
aff-0.982
iki-0.900
oto-0.872
Token you
Token ID345
Feature activation-7.26e-01

Pos logit contributions
bes+0.596
antly+0.571
vered+0.555
es+0.547
nob+0.542
Neg logit contributions
aff-0.592
ock-0.588
weet-0.584
iki-0.540
oto-0.525
Token share
Token ID2648
Feature activation+6.27e-01

Pos logit contributions
aff+1.428
ock+1.416
weet+1.403
iki+1.315
oto+1.284
Neg logit contributions
bes-1.074
antly-1.032
e-1.017
vered-1.013
es-1.003
Token with
Token ID351
Feature activation-3.77e-01

Pos logit contributions
bes+1.078
antly+1.049
vered+1.014
nob+0.985
es+0.984
Neg logit contributions
aff-1.085
ock-1.075
weet-1.061
iki-0.970
oto-0.949