TOP ACTIVATIONS
MAX = 17.132

's condition wasn 't immediately available , and it was unclear
= c v 2 . Ga ussian Bl ur ( gray
t crowd the pan ; only cook about 3
. The Syrian ambassador was summoned ; he denied Syrian involvement
possession of the Energy Committee ga vel . Now
, Gl endale police were summoned to the club by manager
officials in Manila were not available for comment Sunday .
, broadcast , rewritten or redistributed .) <|endoftext|> New castle United
index . php ? topic = 28 18 48 . 0
Glen Gab ert was not returned . Hudson County Executive Tom
spent a week using the tactics learned in this interview to
and non - fiction . Cos player and figure model .
, not only as a representative of the Democratic Party but
Washington status quo , the ga vel may have been more
. ( Jones was not available to be interviewed for this
said the injuries were not considered to be life - threatening
vegetation by molecules out - g assed from soils by the
. Shortly thereafter , the congressman would be chosen as Romney
after debate officials turned away Green Party presidential nominee Jill Stein
equ ated with an " unt ouch able " status that

MIN ACTIVATIONS
MIN = -14.950

yellow fever or something like that . That
need the eggs . Sch ind el
legions of fans who enjoy nothing more than slaughter ing zombies
figures from the Office for National Statistics said industrial output rose
ES ) US Treasury Secretary Jacob Lew speaks on the state
start off one way and end another . I find it
this new Riverside park resembles nothing so much as a cult
this was not rape or anything like that , this was
Iron Maiden , Harry Potter , need
kind of CV written by Jacob Re es - M ogg
B OM B or something else . B OM
believe in divine intervention or anything . But that was a
year - old brother , Jacob , is also living on
Mother Jones ' Foundation for National Progress , National Public Radio
edit ] References [ edit ] <|endoftext|> Z ombie
several lessons from former President Harry Truman . Ign
orative bank notes [ edit ] $ 25
, although she said the pur view of the bank is
and health care legislation to Harry Reid and Nancy Pelosi ,
his 2006 speech marking the founding of his regime , Gaddafi

INTERVAL 9.112 - 17.132
CONTAINS 0.122%

ev ict _ from _ os _ cache (); } }
marriage . It is discrimination writ into federal law . It
loss of employment , and ostr ac ism from family and
struct uring kernel rect K ernel = c v 2 .
animal , please do seek professional help . Continue Reading Below

INTERVAL 1.091 - 9.112
CONTAINS 33.042%

certain groups in certain countries . While she does
as with many parts of a campaign , it is obvious
itself , doing it well on anything other than 8 -
that had sidelined him since early July . However , there
option to " c :\ tmp ". Only a single back

INTERVAL -6.929 - 1.091
CONTAINS 65.772%

might have a flying model that is your pride
- of - war , the most complex machine his age
free , hur ray ! Your beloved lic hess team has
test chamber monitors ( 56 ) Cele brate your
prem ised on a vast m iscal culation about the origins

INTERVAL -14.950 - -6.929
CONTAINS 1.063%

disrupt the availability of child sexual abuse content in Australia ,
. [ color = red ][ b ] U f ou
war with Germany [ edit ] From 2 February
- Jimmy Tide 08 - Harry @ TMZ - V os
from healthcare ", its website adds . It says
Token's
Token ID338
Feature activation-1.39e+00

Pos logit contributions
eln+1.341
imens+1.297
earable+1.281
enne+1.227
imates+1.206
Neg logit contributions
theless-1.351
 rhy-1.243
ymes-1.185
ingly-1.073
olla-1.045
Token condition
Token ID4006
Feature activation+1.89e+00

Pos logit contributions
theless+1.033
 rhy+0.972
ymes+0.919
ingly+0.826
olla+0.809
Neg logit contributions
eln-1.072
imens-1.035
earable-1.026
enne-0.984
imates-0.962
Token wasn
Token ID2492
Feature activation+2.57e+00

Pos logit contributions
eln+1.429
imens+1.385
earable+1.367
enne+1.309
imates+1.285
Neg logit contributions
theless-1.442
 rhy-1.330
ymes-1.266
ingly-1.146
olla-1.118
Token't
Token ID470
Feature activation+1.23e+00

Pos logit contributions
eln+1.878
imens+1.819
earable+1.800
enne+1.721
imates+1.687
Neg logit contributions
theless-2.033
 rhy-1.868
ymes-1.773
ingly-1.600
olla-1.569
Token immediately
Token ID3393
Feature activation+1.15e+01

Pos logit contributions
eln+1.016
imens+0.984
earable+0.972
enne+0.938
imates+0.918
Neg logit contributions
theless-0.855
 rhy-0.792
ymes-0.738
ingly-0.667
olla-0.641
Token available
Token ID1695
Feature activation+1.71e+01

Pos logit contributions
eln+6.680
imens+6.250
earable+6.117
ainted+6.091
handled+6.086
Neg logit contributions
theless-9.638
ymes-8.853
 rhy-8.850
olla-8.107
itudinal-8.018
Token,
Token ID11
Feature activation+4.53e+00

Pos logit contributions
eln+8.464
imens+8.126
imates+7.962
handled+7.791
ainted+7.778
Neg logit contributions
theless-14.361
ymes-12.910
 rhy-12.830
itudinal-11.445
olla-11.268
Token and
Token ID290
Feature activation+5.41e+00

Pos logit contributions
eln+3.571
imens+3.449
earable+3.390
enne+3.322
imates+3.248
Neg logit contributions
theless-3.246
 rhy-2.919
ymes-2.814
ingly-2.511
olla-2.428
Token it
Token ID340
Feature activation+3.90e+00

Pos logit contributions
eln+4.094
imens+3.941
earable+3.841
enne+3.794
imates+3.666
Neg logit contributions
theless-4.034
 rhy-3.689
ymes-3.525
ingly-3.184
olla-3.103
Token was
Token ID373
Feature activation+3.70e+00

Pos logit contributions
eln+2.807
imens+2.716
earable+2.646
enne+2.546
imates+2.522
Neg logit contributions
theless-3.089
 rhy-2.824
ymes-2.711
ingly-2.457
olla-2.420
Token unclear
Token ID10061
Feature activation+8.13e+00

Pos logit contributions
eln+3.046
imens+2.953
earable+2.910
enne+2.813
imates+2.766
Neg logit contributions
theless-2.554
 rhy-2.314
ymes-2.203
ingly-1.965
olla-1.925
Token =
Token ID796
Feature activation+1.77e+00

Pos logit contributions
eln+4.616
imens+4.424
earable+4.387
enne+4.257
imates+4.108
Neg logit contributions
theless-5.410
 rhy-4.967
ymes-4.747
ingly-4.231
olla-4.219
Token c
Token ID269
Feature activation+3.22e+00

Pos logit contributions
eln+1.185
imens+1.142
earable+1.128
enne+1.075
imates+1.049
Neg logit contributions
theless-1.499
 rhy-1.398
ymes-1.334
ingly-1.225
olla-1.197
Tokenv
Token ID85
Feature activation+1.03e+00

Pos logit contributions
eln+2.092
imens+2.016
earable+1.983
enne+1.889
imates+1.848
Neg logit contributions
theless-2.818
 rhy-2.624
ymes-2.508
ingly-2.300
olla-2.250
Token2
Token ID17
Feature activation+4.43e+00

Pos logit contributions
eln+0.728
imens+0.704
earable+0.694
enne+0.662
imates+0.649
Neg logit contributions
theless-0.835
 rhy-0.779
ymes-0.746
ingly-0.683
olla-0.667
Token.
Token ID13
Feature activation+2.00e+00

Pos logit contributions
eln+3.248
imens+3.139
earable+3.133
enne+2.998
imates+2.937
Neg logit contributions
theless-3.538
 rhy-3.193
ymes-2.999
ingly-2.687
olla-2.645
TokenGa
Token ID35389
Feature activation+1.46e+01

Pos logit contributions
eln+1.274
imens+1.226
earable+1.210
enne+1.148
imates+1.121
Neg logit contributions
theless-1.767
 rhy-1.650
ymes-1.580
ingly-1.457
olla-1.424
Tokenussian
Token ID31562
Feature activation+7.02e+00

Pos logit contributions
eln+6.267
earable+5.880
imens+5.792
enne+5.602
Issue+5.148
Neg logit contributions
theless-15.338
 rhy-13.884
ymes-13.401
ingly-12.112
olla-11.950
TokenBl
Token ID3629
Feature activation+1.52e-01

Pos logit contributions
eln+4.719
earable+4.488
imens+4.465
enne+4.378
imates+4.141
Neg logit contributions
theless-5.877
 rhy-5.372
ymes-5.149
ingly-4.660
olla-4.619
Tokenur
Token ID333
Feature activation+3.25e-01

Pos logit contributions
eln+0.111
imens+0.108
earable+0.106
enne+0.100
imates+0.100
Neg logit contributions
theless-0.118
 rhy-0.112
ymes-0.106
ingly-0.097
olla-0.095
Token(
Token ID7
Feature activation+5.75e+00

Pos logit contributions
eln+0.244
imens+0.236
earable+0.233
enne+0.223
imates+0.219
Neg logit contributions
theless-0.250
 rhy-0.232
ymes-0.220
ingly-0.200
olla-0.195
Tokengray
Token ID44605
Feature activation+8.71e+00

Pos logit contributions
eln+3.811
imens+3.645
earable+3.571
enne+3.411
ainted+3.318
Neg logit contributions
theless-4.892
 rhy-4.552
ymes-4.351
ingly-3.954
olla-3.877
Token
Token ID447
Feature activation+1.29e-02

Pos logit contributions
eln+4.075
imens+3.932
earable+3.895
enne+3.757
imates+3.649
Neg logit contributions
theless-4.158
 rhy-3.798
ymes-3.633
ingly-3.247
olla-3.190
Token
Token ID247
Feature activation-2.60e-01

Pos logit contributions
eln+0.010
imens+0.010
earable+0.010
enne+0.009
imates+0.009
Neg logit contributions
theless-0.009
 rhy-0.009
ymes-0.008
ingly-0.008
olla-0.007
Tokent
Token ID83
Feature activation-2.18e+00

Pos logit contributions
theless+0.205
 rhy+0.191
ymes+0.182
ingly+0.167
olla+0.162
Neg logit contributions
eln-0.188
imens-0.182
earable-0.180
enne-0.171
imates-0.168
Token crowd
Token ID4315
Feature activation+4.36e-01

Pos logit contributions
theless+2.039
 rhy+1.926
ymes+1.822
ingly+1.713
olla+1.667
Neg logit contributions
eln-1.250
imens-1.232
earable-1.219
enne-1.133
imates-1.096
Token the
Token ID262
Feature activation+2.99e+00

Pos logit contributions
eln+0.353
imens+0.344
earable+0.339
enne+0.324
imates+0.319
Neg logit contributions
theless-0.309
 rhy-0.285
ymes-0.269
ingly-0.245
olla-0.236
Token pan
Token ID3425
Feature activation+1.46e+01

Pos logit contributions
eln+2.476
imens+2.386
earable+2.377
enne+2.296
imates+2.250
Neg logit contributions
theless-2.065
 rhy-1.838
ymes-1.778
ingly-1.557
olla-1.532
Token;
Token ID26
Feature activation-1.58e+00

Pos logit contributions
eln+6.187
imens+5.910
enne+5.855
earable+5.624
handled+5.497
Neg logit contributions
theless-14.295
 rhy-13.096
ymes-12.515
olla-11.371
tymology-11.138
Token only
Token ID691
Feature activation+2.32e-01

Pos logit contributions
theless+0.981
 rhy+0.903
ymes+0.843
ingly+0.753
olla+0.720
Neg logit contributions
eln-1.415
imens-1.383
earable-1.365
enne-1.312
imates-1.291
Token cook
Token ID4255
Feature activation+2.11e+00

Pos logit contributions
eln+0.178
imens+0.173
earable+0.171
enne+0.164
imates+0.160
Neg logit contributions
theless-0.174
 rhy-0.160
ymes-0.152
ingly-0.138
olla-0.134
Token about
Token ID546
Feature activation+4.71e+00

Pos logit contributions
eln+1.727
imens+1.673
earable+1.650
enne+1.597
imates+1.576
Neg logit contributions
theless-1.479
 rhy-1.333
ymes-1.267
ingly-1.119
olla-1.097
Token 3
Token ID513
Feature activation+2.74e+00

Pos logit contributions
eln+2.669
imens+2.566
earable+2.542
enne+2.389
imates+2.354
Neg logit contributions
theless-4.437
 rhy-4.105
ymes-3.964
ingly-3.618
olla-3.604
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
theless+0.456
 rhy+0.423
ymes+0.400
ingly+0.360
olla+0.350
Neg logit contributions
eln-0.512
imens-0.499
earable-0.496
enne-0.472
imates-0.464
Token The
Token ID383
Feature activation-3.11e-01

Pos logit contributions
theless+0.109
 rhy+0.100
ymes+0.095
ingly+0.086
olla+0.084
Neg logit contributions
eln-0.113
imens-0.110
earable-0.108
enne-0.104
imates-0.102
Token Syrian
Token ID6318
Feature activation-5.67e-01

Pos logit contributions
theless+0.230
 rhy+0.213
ymes+0.201
ingly+0.182
olla+0.177
Neg logit contributions
eln-0.243
imens-0.235
earable-0.233
enne-0.223
imates-0.219
Token ambassador
Token ID14791
Feature activation+7.45e+00

Pos logit contributions
theless+0.439
 rhy+0.407
ymes+0.387
ingly+0.352
olla+0.342
Neg logit contributions
eln-0.423
imens-0.410
earable-0.405
enne-0.387
imates-0.379
Token was
Token ID373
Feature activation+5.66e-01

Pos logit contributions
eln+4.875
imens+4.607
earable+4.510
enne+4.479
handled+4.364
Neg logit contributions
theless-6.203
 rhy-5.548
ymes-5.467
olla-4.914
ingly-4.861
Token summoned
Token ID20056
Feature activation+1.45e+01

Pos logit contributions
eln+0.386
imens+0.373
earable+0.368
enne+0.350
imates+0.343
Neg logit contributions
theless-0.474
 rhy-0.441
ymes-0.422
ingly-0.386
olla-0.377
Token;
Token ID26
Feature activation+2.46e+00

Pos logit contributions
eln+8.242
imens+7.288
ainted+7.278
earable+7.150
enne+7.063
Neg logit contributions
theless-12.153
 rhy-10.928
ymes-10.904
olla-9.766
itudinal-9.639
Token he
Token ID339
Feature activation+2.96e+00

Pos logit contributions
eln+1.914
imens+1.849
earable+1.823
enne+1.757
imates+1.728
Neg logit contributions
theless-1.795
 rhy-1.657
ymes-1.577
ingly-1.416
olla-1.386
Token denied
Token ID6699
Feature activation+5.34e+00

Pos logit contributions
eln+2.212
imens+2.130
earable+2.107
enne+2.024
imates+1.982
Neg logit contributions
theless-2.266
 rhy-2.092
ymes-2.004
ingly-1.803
olla-1.769
Token Syrian
Token ID6318
Feature activation+2.23e-01

Pos logit contributions
eln+3.769
imens+3.572
earable+3.534
enne+3.402
imates+3.374
Neg logit contributions
theless-4.234
 rhy-3.878
ymes-3.767
olla-3.361
itudinal-3.354
Token involvement
Token ID9750
Feature activation+3.10e-01

Pos logit contributions
eln+0.158
imens+0.153
earable+0.151
enne+0.144
imates+0.141
Neg logit contributions
theless-0.180
 rhy-0.167
ymes-0.159
ingly-0.145
olla-0.142
Token possession
Token ID7797
Feature activation-2.54e+00

Pos logit contributions
eln+1.892
imens+1.822
earable+1.806
enne+1.727
imates+1.694
Neg logit contributions
theless-2.015
 rhy-1.857
ymes-1.782
ingly-1.611
olla-1.576
Token of
Token ID286
Feature activation-7.96e-01

Pos logit contributions
theless+1.643
 rhy+1.516
ymes+1.416
ingly+1.283
olla+1.229
Neg logit contributions
eln-2.195
imens-2.142
earable-2.122
enne-2.021
imates-1.984
Token the
Token ID262
Feature activation+3.13e-01

Pos logit contributions
theless+0.590
 rhy+0.550
ymes+0.516
ingly+0.471
olla+0.456
Neg logit contributions
eln-0.618
imens-0.602
earable-0.593
enne-0.567
imates-0.555
Token Energy
Token ID6682
Feature activation+2.28e+00

Pos logit contributions
eln+0.237
imens+0.230
earable+0.227
enne+0.217
imates+0.213
Neg logit contributions
theless-0.239
 rhy-0.222
ymes-0.209
ingly-0.191
olla-0.186
Token Committee
Token ID4606
Feature activation+3.02e+00

Pos logit contributions
eln+1.691
imens+1.628
earable+1.607
enne+1.556
imates+1.508
Neg logit contributions
theless-1.770
 rhy-1.621
ymes-1.554
ingly-1.385
olla-1.368
Token ga
Token ID31986
Feature activation+1.40e+01

Pos logit contributions
eln+2.322
imens+2.239
earable+2.218
enne+2.141
imates+2.087
Neg logit contributions
theless-2.264
 rhy-2.075
ymes-1.982
ingly-1.776
olla-1.736
Tokenvel
Token ID626
Feature activation-4.13e-01

Pos logit contributions
eln+6.636
earable+5.877
imens+5.772
enne+5.654
ainted+5.221
Neg logit contributions
theless-14.171
 rhy-12.762
ymes-12.685
itudinal-11.104
ingly-10.955
Token.
Token ID13
Feature activation+2.59e+00

Pos logit contributions
theless+0.291
 rhy+0.268
ymes+0.252
ingly+0.232
olla+0.222
Neg logit contributions
eln-0.334
imens-0.327
earable-0.323
enne-0.307
imates-0.302
Token
Token ID198
Feature activation+1.65e+00

Pos logit contributions
eln+2.198
imens+2.131
earable+2.113
enne+2.037
imates+2.000
Neg logit contributions
theless-1.741
 rhy-1.592
ymes-1.502
ingly-1.336
olla-1.292
Token
Token ID198
Feature activation+4.45e-01

Pos logit contributions
eln+1.472
imens+1.433
earable+1.421
enne+1.372
imates+1.349
Neg logit contributions
theless-1.042
 rhy-0.939
ymes-0.879
ingly-0.770
olla-0.747
TokenNow
Token ID3844
Feature activation+1.65e-01

Pos logit contributions
eln+0.301
imens+0.293
earable+0.287
enne+0.272
imates+0.268
Neg logit contributions
theless-0.372
 rhy-0.346
ymes-0.333
ingly-0.308
olla-0.301
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
theless+1.950
 rhy+1.799
ymes+1.689
ingly+1.522
olla+1.480
Neg logit contributions
eln-2.506
imens-2.464
earable-2.432
enne-2.319
imates-2.305
Token Gl
Token ID2671
Feature activation-4.23e-02

Pos logit contributions
theless+1.132
 rhy+1.045
ymes+0.987
ingly+0.895
olla+0.873
Neg logit contributions
eln-1.214
imens-1.181
earable-1.167
enne-1.110
imates-1.097
Tokenendale
Token ID41147
Feature activation-1.06e+00

Pos logit contributions
theless+0.036
 rhy+0.034
ymes+0.033
ingly+0.030
olla+0.029
Neg logit contributions
eln-0.028
imens-0.027
earable-0.027
enne-0.025
imates-0.025
Token police
Token ID1644
Feature activation-1.22e+00

Pos logit contributions
theless+0.702
 rhy+0.645
ymes+0.609
ingly+0.542
olla+0.525
Neg logit contributions
eln-0.900
imens-0.882
earable-0.870
enne-0.835
imates-0.822
Token were
Token ID547
Feature activation+1.28e+00

Pos logit contributions
theless+0.897
 rhy+0.835
ymes+0.791
ingly+0.714
olla+0.688
Neg logit contributions
eln-0.962
imens-0.928
earable-0.914
enne-0.876
imates-0.863
Token summoned
Token ID20056
Feature activation+1.39e+01

Pos logit contributions
eln+0.907
imens+0.875
earable+0.864
enne+0.825
imates+0.808
Neg logit contributions
theless-1.042
 rhy-0.966
ymes-0.922
ingly-0.842
olla-0.823
Token to
Token ID284
Feature activation+6.57e+00

Pos logit contributions
eln+7.818
imens+7.045
enne+6.970
earable+6.913
ainted+6.561
Neg logit contributions
theless-12.251
 rhy-11.073
ymes-10.773
olla-9.739
itudinal-9.524
Token the
Token ID262
Feature activation+4.80e+00

Pos logit contributions
eln+4.627
imens+4.411
earable+4.356
enne+4.222
imates+4.094
Neg logit contributions
theless-5.298
 rhy-4.816
ymes-4.672
ingly-4.209
olla-4.087
Token club
Token ID3430
Feature activation-3.31e-01

Pos logit contributions
eln+3.647
imens+3.478
earable+3.445
enne+3.353
imates+3.239
Neg logit contributions
theless-3.600
 rhy-3.277
ymes-3.158
ingly-2.842
olla-2.775
Token by
Token ID416
Feature activation+2.92e+00

Pos logit contributions
theless+0.245
 rhy+0.230
ymes+0.216
ingly+0.198
olla+0.191
Neg logit contributions
eln-0.254
imens-0.251
earable-0.246
enne-0.234
imates-0.231
Token manager
Token ID4706
Feature activation+3.78e+00

Pos logit contributions
eln+2.187
imens+2.097
earable+2.072
enne+2.006
imates+1.945
Neg logit contributions
theless-2.257
 rhy-2.073
ymes-1.980
ingly-1.790
olla-1.753
Token officials
Token ID2828
Feature activation+2.61e+00

Pos logit contributions
theless+0.938
 rhy+0.869
ymes+0.805
ingly+0.720
olla+0.690
Neg logit contributions
eln-1.335
imens-1.297
earable-1.283
enne-1.235
imates-1.214
Token in
Token ID287
Feature activation+3.38e-01

Pos logit contributions
eln+2.217
imens+2.159
earable+2.134
enne+2.067
imates+2.030
Neg logit contributions
theless-1.732
 rhy-1.565
ymes-1.500
ingly-1.325
olla-1.293
Token Manila
Token ID31721
Feature activation+3.16e+00

Pos logit contributions
eln+0.252
imens+0.244
earable+0.241
enne+0.231
imates+0.226
Neg logit contributions
theless-0.262
 rhy-0.242
ymes-0.230
ingly-0.209
olla-0.204
Token were
Token ID547
Feature activation+1.02e+00

Pos logit contributions
eln+2.705
imens+2.623
earable+2.599
enne+2.512
imates+2.465
Neg logit contributions
theless-2.078
 rhy-1.873
ymes-1.795
ingly-1.582
olla-1.538
Token not
Token ID407
Feature activation+1.30e+00

Pos logit contributions
eln+0.758
imens+0.734
earable+0.726
enne+0.695
imates+0.680
Neg logit contributions
theless-0.787
 rhy-0.729
ymes-0.693
ingly-0.630
olla-0.613
Token available
Token ID1695
Feature activation+1.39e+01

Pos logit contributions
eln+0.919
imens+0.887
earable+0.876
enne+0.836
imates+0.819
Neg logit contributions
theless-1.061
 rhy-0.984
ymes-0.940
ingly-0.858
olla-0.838
Token for
Token ID329
Feature activation+9.38e+00

Pos logit contributions
eln+6.538
imens+6.230
enne+6.005
ainted+5.985
imates+5.984
Neg logit contributions
theless-12.737
ymes-11.687
 rhy-11.654
itudinal-10.284
olla-10.274
Token comment
Token ID2912
Feature activation+8.38e+00

Pos logit contributions
eln+5.730
imens+5.356
earable+5.022
enne+4.919
imates+4.885
Neg logit contributions
theless-8.188
 rhy-7.527
ymes-7.429
ingly-6.670
olla-6.598
Token Sunday
Token ID3502
Feature activation+5.66e+00

Pos logit contributions
eln+5.738
imens+5.495
earable+5.329
enne+5.252
imates+5.207
Neg logit contributions
theless-6.536
 rhy-5.945
ymes-5.881
ingly-5.126
olla-5.064
Token.
Token ID13
Feature activation+1.80e+00

Pos logit contributions
eln+4.111
imens+3.896
earable+3.838
enne+3.722
imates+3.690
Neg logit contributions
theless-4.399
 rhy-4.008
ymes-3.940
ingly-3.434
olla-3.421
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
eln+1.548
imens+1.507
earable+1.489
enne+1.437
imates+1.412
Neg logit contributions
theless-1.176
 rhy-1.073
ymes-1.009
ingly-0.898
olla-0.863
Token,
Token ID11
Feature activation+4.36e+00

Pos logit contributions
eln+4.119
imens+3.980
earable+3.976
enne+3.822
imates+3.727
Neg logit contributions
theless-4.066
 rhy-3.683
ymes-3.476
ingly-3.144
olla-3.060
Token broadcast
Token ID7025
Feature activation+3.04e+00

Pos logit contributions
eln+3.005
imens+2.957
earable+2.936
enne+2.816
ainted+2.745
Neg logit contributions
theless-3.493
 rhy-3.208
ymes-3.109
ingly-2.793
olla-2.755
Token,
Token ID11
Feature activation+4.85e+00

Pos logit contributions
eln+2.445
imens+2.367
earable+2.348
enne+2.252
imates+2.210
Neg logit contributions
theless-2.177
 rhy-2.002
ymes-1.886
ingly-1.693
olla-1.650
Token rewritten
Token ID30101
Feature activation+7.10e+00

Pos logit contributions
eln+3.827
imens+3.744
earable+3.736
enne+3.569
ainted+3.524
Neg logit contributions
theless-3.432
 rhy-3.061
ymes-2.977
ingly-2.652
olla-2.572
Token or
Token ID393
Feature activation+1.22e-01

Pos logit contributions
eln+3.945
imens+3.813
earable+3.740
enne+3.594
imates+3.455
Neg logit contributions
theless-6.749
 rhy-6.228
ymes-5.936
ingly-5.469
olla-5.455
Token redistributed
Token ID38913
Feature activation+1.36e+01

Pos logit contributions
eln+0.103
imens+0.100
earable+0.098
enne+0.095
imates+0.094
Neg logit contributions
theless-0.083
 rhy-0.077
ymes-0.072
ingly-0.065
olla-0.063
Token.)
Token ID2014
Feature activation-2.59e+00

Pos logit contributions
eln+7.109
enne+6.808
imens+6.796
earable+6.706
imates+6.665
Neg logit contributions
theless-12.325
 rhy-11.125
ymes-10.520
itudinal-9.864
ingly-9.825
Token<|endoftext|>
Token ID50256
Feature activation+6.38e+00

Pos logit contributions
theless+1.605
 rhy+1.461
ymes+1.369
ingly+1.213
olla+1.177
Neg logit contributions
eln-2.320
imens-2.257
earable-2.219
enne-2.144
imates-2.114
TokenNew
Token ID3791
Feature activation-2.37e-01

Pos logit contributions
eln+4.109
earable+3.929
imens+3.914
enne+3.721
imates+3.607
Neg logit contributions
theless-5.540
 rhy-5.110
ymes-4.828
ingly-4.394
olla-4.304
Tokencastle
Token ID18676
Feature activation-4.24e-01

Pos logit contributions
theless+0.177
 rhy+0.164
ymes+0.156
ingly+0.142
olla+0.138
Neg logit contributions
eln-0.182
imens-0.176
earable-0.174
enne-0.166
imates-0.164
Token United
Token ID1578
Feature activation-2.04e+00

Pos logit contributions
theless+0.328
 rhy+0.303
ymes+0.289
ingly+0.262
olla+0.256
Neg logit contributions
eln-0.316
imens-0.306
earable-0.302
enne-0.289
imates-0.283
Tokenindex
Token ID9630
Feature activation+9.19e-01

Pos logit contributions
eln+3.427
imens+3.304
earable+3.235
enne+3.146
ainted+3.064
Neg logit contributions
theless-3.528
 rhy-3.266
ymes-3.095
ingly-2.777
olla-2.720
Token.
Token ID13
Feature activation+2.35e-01

Pos logit contributions
eln+0.716
imens+0.695
earable+0.685
enne+0.658
imates+0.647
Neg logit contributions
theless-0.687
 rhy-0.628
ymes-0.593
ingly-0.535
olla-0.521
Tokenphp
Token ID10121
Feature activation+3.28e+00

Pos logit contributions
eln+0.144
imens+0.139
earable+0.137
enne+0.129
imates+0.126
Neg logit contributions
theless-0.213
 rhy-0.200
ymes-0.191
ingly-0.177
olla-0.173
Token?
Token ID30
Feature activation+5.88e+00

Pos logit contributions
eln+2.431
imens+2.351
earable+2.329
enne+2.244
imates+2.175
Neg logit contributions
theless-2.558
 rhy-2.360
ymes-2.242
ingly-2.034
olla-1.989
Tokentopic
Token ID26652
Feature activation+3.82e+00

Pos logit contributions
eln+3.534
earable+3.330
imens+3.326
handled+3.132
enne+3.127
Neg logit contributions
theless-5.375
 rhy-5.017
ymes-4.797
ingly-4.439
olla-4.309
Token=
Token ID28
Feature activation+1.34e+01

Pos logit contributions
eln+2.419
imens+2.363
earable+2.322
enne+2.218
imates+2.172
Neg logit contributions
theless-3.390
 rhy-3.094
ymes-2.942
ingly-2.702
olla-2.618
Token28
Token ID2078
Feature activation+3.00e+00

Pos logit contributions
eln+8.161
imens+7.843
earable+7.638
enne+7.465
NVIDIA+7.115
Neg logit contributions
theless-10.793
 rhy-10.136
ymes-9.363
ingly-8.384
itudinal-8.277
Token18
Token ID1507
Feature activation+1.18e+00

Pos logit contributions
eln+2.128
imens+2.057
earable+2.030
enne+1.940
imates+1.901
Neg logit contributions
theless-2.410
 rhy-2.241
ymes-2.129
ingly-1.940
olla-1.882
Token48
Token ID2780
Feature activation+4.57e+00

Pos logit contributions
eln+0.863
imens+0.834
earable+0.824
enne+0.788
imates+0.772
Neg logit contributions
theless-0.927
 rhy-0.859
ymes-0.818
ingly-0.744
olla-0.726
Token.
Token ID13
Feature activation+3.67e+00

Pos logit contributions
eln+3.453
imens+3.328
earable+3.296
enne+3.169
imates+3.116
Neg logit contributions
theless-3.477
 rhy-3.178
ymes-3.011
ingly-2.726
olla-2.639
Token0
Token ID15
Feature activation+1.82e+00

Pos logit contributions
eln+2.234
imens+2.138
earable+2.098
enne+1.978
imates+1.941
Neg logit contributions
theless-3.359
 rhy-3.117
ymes-2.989
ingly-2.744
olla-2.685
Token Glen
Token ID20716
Feature activation+2.58e+00

Pos logit contributions
theless+1.546
 rhy+1.455
ymes+1.394
ingly+1.278
olla+1.237
Neg logit contributions
eln-1.348
imens-1.307
earable-1.286
enne-1.221
imates-1.196
Token Gab
Token ID12300
Feature activation-3.06e+00

Pos logit contributions
eln+1.901
imens+1.832
earable+1.810
enne+1.745
imates+1.698
Neg logit contributions
theless-2.029
 rhy-1.872
ymes-1.776
ingly-1.614
olla-1.570
Tokenert
Token ID861
Feature activation-1.17e+00

Pos logit contributions
theless+2.545
 rhy+2.376
ymes+2.305
olla+2.100
ingly+2.095
Neg logit contributions
eln-2.016
earable-1.975
imens-1.961
enne-1.836
imates-1.808
Token was
Token ID373
Feature activation+1.16e+00

Pos logit contributions
theless+0.847
 rhy+0.790
ymes+0.745
ingly+0.673
olla+0.655
Neg logit contributions
eln-0.920
imens-0.893
earable-0.890
enne-0.845
imates-0.830
Token not
Token ID407
Feature activation+1.70e+00

Pos logit contributions
eln+0.896
imens+0.866
earable+0.856
enne+0.822
imates+0.804
Neg logit contributions
theless-0.876
 rhy-0.813
ymes-0.770
ingly-0.703
olla-0.677
Token returned
Token ID4504
Feature activation+1.34e+01

Pos logit contributions
eln+1.387
imens+1.347
earable+1.330
enne+1.278
imates+1.254
Neg logit contributions
theless-1.201
 rhy-1.102
ymes-1.040
ingly-0.940
olla-0.908
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
eln+7.576
imens+6.984
handled+6.731
enne+6.724
earable+6.627
Neg logit contributions
theless-11.457
 rhy-10.610
ymes-10.278
itudinal-9.216
ingly-9.207
Token Hudson
Token ID19995
Feature activation+3.10e-01

Pos logit contributions
eln+1.238
imens+1.204
earable+1.190
enne+1.147
imates+1.128
Neg logit contributions
theless-0.958
 rhy-0.875
ymes-0.824
ingly-0.733
olla-0.709
Token County
Token ID3418
Feature activation-1.70e+00

Pos logit contributions
eln+0.214
imens+0.206
earable+0.204
enne+0.194
imates+0.190
Neg logit contributions
theless-0.257
 rhy-0.239
ymes-0.228
ingly-0.209
olla-0.204
Token Executive
Token ID10390
Feature activation-2.05e+00

Pos logit contributions
theless+1.209
 rhy+1.121
ymes+1.057
ingly+0.951
olla+0.921
Neg logit contributions
eln-1.369
imens-1.333
earable-1.310
enne-1.260
imates-1.228
Token Tom
Token ID4186
Feature activation+1.39e+00

Pos logit contributions
theless+1.624
 rhy+1.551
ymes+1.479
ingly+1.365
olla+1.316
Neg logit contributions
eln-1.440
imens-1.408
earable-1.382
enne-1.308
imates-1.284
Token spent
Token ID3377
Feature activation+4.74e+00

Pos logit contributions
eln+4.143
imens+4.073
earable+3.981
enne+3.907
ainted+3.780
Neg logit contributions
theless-4.804
 rhy-4.467
ymes-4.350
ingly-3.888
olla-3.824
Token a
Token ID257
Feature activation+3.28e+00

Pos logit contributions
eln+3.380
imens+3.213
earable+3.198
enne+3.078
imates+2.989
Neg logit contributions
theless-3.888
 rhy-3.469
ymes-3.348
ingly-2.997
olla-2.964
Token week
Token ID1285
Feature activation+6.92e+00

Pos logit contributions
eln+1.773
imens+1.679
earable+1.654
enne+1.573
imates+1.506
Neg logit contributions
theless-3.216
 rhy-3.001
ymes-2.908
ingly-2.679
olla-2.639
Token using
Token ID1262
Feature activation+2.83e+00

Pos logit contributions
eln+3.920
earable+3.759
imens+3.729
enne+3.615
imates+3.468
Neg logit contributions
theless-6.411
 rhy-5.913
ymes-5.743
olla-5.175
ingly-5.150
Token the
Token ID262
Feature activation+4.48e+00

Pos logit contributions
eln+2.165
imens+2.097
earable+2.077
enne+2.001
imates+1.958
Neg logit contributions
theless-2.133
 rhy-1.948
ymes-1.857
ingly-1.662
olla-1.629
Token tactics
Token ID10815
Feature activation+1.33e+01

Pos logit contributions
eln+2.992
imens+2.888
earable+2.769
enne+2.713
imates+2.638
Neg logit contributions
theless-3.775
 rhy-3.512
ymes-3.369
ingly-3.073
olla-3.040
Token learned
Token ID4499
Feature activation+1.00e-01

Pos logit contributions
eln+4.651
imens+4.544
earable+4.511
enne+4.402
ainted+4.307
Neg logit contributions
theless-14.155
 rhy-12.517
ymes-12.489
itudinal-11.682
olla-11.674
Token in
Token ID287
Feature activation+3.55e+00

Pos logit contributions
eln+0.084
imens+0.082
earable+0.081
enne+0.078
imates+0.076
Neg logit contributions
theless-0.068
 rhy-0.063
ymes-0.059
ingly-0.053
olla-0.051
Token this
Token ID428
Feature activation+4.03e+00

Pos logit contributions
eln+2.623
imens+2.539
earable+2.509
enne+2.423
imates+2.353
Neg logit contributions
theless-2.782
 rhy-2.507
ymes-2.439
ingly-2.177
olla-2.138
Token interview
Token ID2720
Feature activation+4.76e+00

Pos logit contributions
eln+1.779
imens+1.684
earable+1.662
enne+1.537
imates+1.471
Neg logit contributions
theless-4.382
 rhy-4.044
ymes-3.968
ingly-3.655
olla-3.624
Token to
Token ID284
Feature activation+6.26e+00

Pos logit contributions
eln+2.821
imens+2.684
earable+2.666
enne+2.543
imates+2.468
Neg logit contributions
theless-4.419
 rhy-4.065
ymes-3.924
ingly-3.555
olla-3.523
Token and
Token ID290
Feature activation+1.45e+00

Pos logit contributions
eln+2.243
imens+2.176
earable+2.156
enne+2.068
imates+2.025
Neg logit contributions
theless-2.183
 rhy-1.965
ymes-1.885
ingly-1.671
olla-1.657
Token non
Token ID1729
Feature activation-1.54e+00

Pos logit contributions
eln+1.225
imens+1.190
earable+1.176
enne+1.133
imates+1.112
Neg logit contributions
theless-0.978
 rhy-0.900
ymes-0.844
ingly-0.757
olla-0.732
Token-
Token ID12
Feature activation+4.88e+00

Pos logit contributions
theless+0.934
 rhy+0.879
ymes+0.826
ingly+0.753
olla+0.708
Neg logit contributions
eln-1.360
imens-1.328
earable-1.318
enne-1.258
imates-1.250
Tokenfiction
Token ID24046
Feature activation+3.70e+00

Pos logit contributions
eln+4.555
imens+4.443
earable+4.369
enne+4.231
imates+4.162
Neg logit contributions
theless-2.859
 rhy-2.560
ymes-2.397
ingly-2.028
olla-1.963
Token.
Token ID13
Feature activation+2.77e+00

Pos logit contributions
eln+2.779
imens+2.691
earable+2.679
enne+2.564
imates+2.513
Neg logit contributions
theless-2.862
 rhy-2.604
ymes-2.459
ingly-2.204
olla-2.160
Token Cos
Token ID10437
Feature activation+1.31e+01

Pos logit contributions
eln+2.457
imens+2.395
earable+2.375
enne+2.291
imates+2.262
Neg logit contributions
theless-1.737
 rhy-1.565
ymes-1.467
ingly-1.302
olla-1.238
Tokenplayer
Token ID7829
Feature activation+6.55e+00

Pos logit contributions
eln+4.245
imens+4.023
earable+3.599
enne+3.391
imates+3.065
Neg logit contributions
theless-15.075
 rhy-14.164
ymes-13.123
ingly-12.435
olla-12.323
Token and
Token ID290
Feature activation+4.58e+00

Pos logit contributions
eln+4.566
imens+4.420
earable+4.413
enne+4.227
imates+4.112
Neg logit contributions
theless-5.387
 rhy-4.758
ymes-4.536
olla-4.038
ingly-4.007
Token figure
Token ID3785
Feature activation+1.20e+01

Pos logit contributions
eln+2.857
imens+2.764
earable+2.739
enne+2.556
imates+2.547
Neg logit contributions
theless-4.102
 rhy-3.626
ymes-3.604
ingly-3.238
olla-3.224
Token model
Token ID2746
Feature activation+6.29e+00

Pos logit contributions
imens+2.989
eln+2.891
earable+2.732
imates+2.403
enne+2.302
Neg logit contributions
theless-14.805
ymes-12.946
 rhy-12.871
olla-12.141
itudinal-12.031
Token.
Token ID13
Feature activation+2.99e+00

Pos logit contributions
eln+4.349
imens+4.219
earable+4.209
enne+4.047
imates+3.945
Neg logit contributions
theless-5.257
 rhy-4.706
ymes-4.412
ingly-3.933
olla-3.887
Token,
Token ID11
Feature activation-3.00e-01

Pos logit contributions
eln+2.603
imens+2.518
earable+2.479
enne+2.390
imates+2.335
Neg logit contributions
theless-2.695
 rhy-2.481
ymes-2.369
ingly-2.137
olla-2.094
Token not
Token ID407
Feature activation-7.87e+00

Pos logit contributions
theless+0.222
 rhy+0.206
ymes+0.194
ingly+0.177
olla+0.170
Neg logit contributions
eln-0.234
imens-0.227
earable-0.226
enne-0.215
imates-0.211
Token only
Token ID691
Feature activation-6.79e-01

Pos logit contributions
theless+5.229
 rhy+4.823
ymes+4.223
ingly+4.142
olla+3.799
Neg logit contributions
eln-6.299
imens-6.229
earable-6.167
enne-5.903
ktop-5.792
Token as
Token ID355
Feature activation+2.13e+00

Pos logit contributions
theless+0.512
 rhy+0.475
ymes+0.442
ingly+0.408
olla+0.392
Neg logit contributions
eln-0.523
imens-0.509
earable-0.504
enne-0.478
imates-0.471
Token a
Token ID257
Feature activation+1.00e+00

Pos logit contributions
eln+1.632
imens+1.580
earable+1.561
enne+1.497
imates+1.468
Neg logit contributions
theless-1.602
 rhy-1.477
ymes-1.405
ingly-1.270
olla-1.238
Token representative
Token ID8852
Feature activation+1.30e+01

Pos logit contributions
eln+0.678
imens+0.655
earable+0.650
enne+0.612
imates+0.604
Neg logit contributions
theless-0.849
 rhy-0.794
ymes-0.752
ingly-0.695
olla-0.676
Token of
Token ID286
Feature activation+2.93e+00

Pos logit contributions
eln+8.322
imens+8.172
enne+7.881
earable+7.790
imates+7.447
Neg logit contributions
theless-10.277
ymes-9.040
 rhy-9.013
ingly-7.797
olla-7.776
Token the
Token ID262
Feature activation+1.70e+00

Pos logit contributions
eln+2.177
imens+2.095
earable+2.070
enne+1.988
imates+1.945
Neg logit contributions
theless-2.268
 rhy-2.083
ymes-1.998
ingly-1.800
olla-1.767
Token Democratic
Token ID4390
Feature activation+1.79e+00

Pos logit contributions
eln+1.166
imens+1.126
earable+1.111
enne+1.059
imates+1.036
Neg logit contributions
theless-1.416
 rhy-1.318
ymes-1.258
ingly-1.153
olla-1.126
Token Party
Token ID3615
Feature activation+9.01e-02

Pos logit contributions
eln+1.129
imens+1.086
earable+1.073
enne+1.019
imates+0.993
Neg logit contributions
theless-1.595
 rhy-1.487
ymes-1.427
ingly-1.311
olla-1.285
Token but
Token ID475
Feature activation+3.07e-01

Pos logit contributions
eln+0.076
imens+0.074
earable+0.074
enne+0.070
imates+0.070
Neg logit contributions
theless-0.060
 rhy-0.055
ymes-0.052
ingly-0.047
olla-0.045
Token Washington
Token ID2669
Feature activation+6.67e-01

Pos logit contributions
theless+0.291
 rhy+0.273
ymes+0.254
ingly+0.236
olla+0.228
Neg logit contributions
eln-0.269
imens-0.261
earable-0.259
enne-0.245
imates-0.242
Token status
Token ID3722
Feature activation+1.06e+00

Pos logit contributions
eln+0.464
imens+0.449
earable+0.443
enne+0.422
imates+0.414
Neg logit contributions
theless-0.549
 rhy-0.512
ymes-0.487
ingly-0.446
olla-0.435
Token quo
Token ID18658
Feature activation-8.47e-01

Pos logit contributions
eln+0.987
imens+0.962
earable+0.952
enne+0.921
imates+0.907
Neg logit contributions
theless-0.619
 rhy-0.554
ymes-0.519
ingly-0.453
olla-0.437
Token,
Token ID11
Feature activation-9.10e-01

Pos logit contributions
theless+0.570
 rhy+0.531
ymes+0.493
ingly+0.447
olla+0.430
Neg logit contributions
eln-0.712
imens-0.691
earable-0.685
enne-0.655
imates-0.645
Token the
Token ID262
Feature activation-8.45e-01

Pos logit contributions
theless+0.656
 rhy+0.607
ymes+0.569
ingly+0.521
olla+0.504
Neg logit contributions
eln-0.725
imens-0.707
earable-0.698
enne-0.665
imates-0.654
Token ga
Token ID31986
Feature activation+1.28e+01

Pos logit contributions
theless+0.665
 rhy+0.620
ymes+0.579
ingly+0.536
olla+0.519
Neg logit contributions
eln-0.617
imens-0.603
earable-0.593
enne-0.561
imates-0.555
Tokenvel
Token ID626
Feature activation+1.14e+00

Pos logit contributions
eln+6.178
imens+5.502
earable+5.315
enne+5.136
ainted+4.821
Neg logit contributions
theless-13.201
 rhy-11.996
ymes-11.521
ingly-10.355
itudinal-10.101
Token may
Token ID743
Feature activation-1.70e-01

Pos logit contributions
eln+0.877
imens+0.854
earable+0.843
enne+0.806
imates+0.790
Neg logit contributions
theless-0.853
 rhy-0.790
ymes-0.747
ingly-0.682
olla-0.659
Token have
Token ID423
Feature activation-1.20e+00

Pos logit contributions
theless+0.131
 rhy+0.123
ymes+0.115
ingly+0.106
olla+0.102
Neg logit contributions
eln-0.128
imens-0.124
earable-0.121
enne-0.117
imates-0.114
Token been
Token ID587
Feature activation-6.86e-01

Pos logit contributions
theless+0.897
 rhy+0.838
ymes+0.783
ingly+0.713
olla+0.690
Neg logit contributions
eln-0.922
imens-0.904
earable-0.885
enne-0.857
imates-0.831
Token more
Token ID517
Feature activation+8.46e-01

Pos logit contributions
theless+0.510
 rhy+0.473
ymes+0.447
ingly+0.408
olla+0.397
Neg logit contributions
eln-0.531
imens-0.518
earable-0.507
enne-0.490
imates-0.478
Token.
Token ID13
Feature activation+2.49e+00

Pos logit contributions
theless+0.385
 rhy+0.358
ymes+0.337
ingly+0.305
olla+0.295
Neg logit contributions
eln-0.422
imens-0.410
earable-0.406
enne-0.387
imates-0.381
Token (
Token ID357
Feature activation+5.04e+00

Pos logit contributions
eln+1.944
imens+1.891
earable+1.876
enne+1.809
imates+1.762
Neg logit contributions
theless-1.816
 rhy-1.673
ymes-1.582
ingly-1.423
olla-1.379
TokenJones
Token ID25784
Feature activation+1.77e+00

Pos logit contributions
eln+3.461
earable+3.326
imens+3.321
enne+3.178
imates+3.081
Neg logit contributions
theless-4.130
 rhy-3.826
ymes-3.675
ingly-3.307
olla-3.238
Token was
Token ID373
Feature activation-1.68e+00

Pos logit contributions
eln+1.399
imens+1.354
earable+1.340
enne+1.289
imates+1.262
Neg logit contributions
theless-1.284
 rhy-1.177
ymes-1.118
ingly-1.007
olla-0.980
Token not
Token ID407
Feature activation-9.09e-01

Pos logit contributions
theless+1.240
 rhy+1.164
ymes+1.098
ingly+0.991
olla+0.963
Neg logit contributions
eln-1.296
imens-1.260
earable-1.243
enne-1.188
imates-1.169
Token available
Token ID1695
Feature activation+1.27e+01

Pos logit contributions
theless+0.686
 rhy+0.641
ymes+0.606
ingly+0.550
olla+0.533
Neg logit contributions
eln-0.693
imens-0.672
earable-0.662
enne-0.632
imates-0.623
Token to
Token ID284
Feature activation+9.30e+00

Pos logit contributions
eln+6.780
imens+6.450
earable+6.258
enne+6.245
ainted+6.087
Neg logit contributions
theless-11.253
 rhy-10.152
ymes-10.029
olla-9.082
itudinal-9.005
Token be
Token ID307
Feature activation+2.04e+00

Pos logit contributions
eln+5.731
imens+5.428
earable+5.222
enne+5.137
imates+5.048
Neg logit contributions
theless-7.972
 rhy-7.287
ymes-7.240
itudinal-6.386
ingly-6.353
Token interviewed
Token ID12299
Feature activation+8.25e+00

Pos logit contributions
eln+1.372
imens+1.322
earable+1.297
enne+1.249
imates+1.215
Neg logit contributions
theless-1.732
 rhy-1.604
ymes-1.540
ingly-1.412
olla-1.376
Token for
Token ID329
Feature activation+6.69e+00

Pos logit contributions
eln+5.463
imens+5.170
earable+5.127
enne+4.995
handled+4.825
Neg logit contributions
theless-6.813
 rhy-6.133
ymes-5.969
ingly-5.335
olla-5.331
Token this
Token ID428
Feature activation+1.40e+00

Pos logit contributions
eln+4.710
imens+4.474
earable+4.433
enne+4.245
imates+4.133
Neg logit contributions
theless-5.294
 rhy-4.866
ymes-4.743
ingly-4.268
olla-4.188
Token said
Token ID531
Feature activation+7.58e-01

Pos logit contributions
eln+3.125
imens+3.051
earable+3.012
enne+2.924
imates+2.888
Neg logit contributions
theless-1.783
 rhy-1.573
ymes-1.484
ingly-1.273
olla-1.227
Token the
Token ID262
Feature activation-1.39e-01

Pos logit contributions
eln+0.603
imens+0.585
earable+0.578
enne+0.555
imates+0.544
Neg logit contributions
theless-0.549
 rhy-0.506
ymes-0.479
ingly-0.432
olla-0.420
Token injuries
Token ID6821
Feature activation+4.17e+00

Pos logit contributions
theless+0.102
 rhy+0.095
ymes+0.089
ingly+0.081
olla+0.078
Neg logit contributions
eln-0.109
imens-0.105
earable-0.104
enne-0.099
imates-0.098
Token were
Token ID547
Feature activation+1.74e+00

Pos logit contributions
eln+3.035
imens+2.950
earable+2.913
enne+2.806
imates+2.770
Neg logit contributions
theless-3.256
 rhy-2.921
ymes-2.872
ingly-2.543
olla-2.527
Token not
Token ID407
Feature activation+1.10e+00

Pos logit contributions
eln+1.308
imens+1.269
earable+1.254
enne+1.198
imates+1.180
Neg logit contributions
theless-1.336
 rhy-1.228
ymes-1.172
ingly-1.060
olla-1.038
Token considered
Token ID3177
Feature activation+1.27e+01

Pos logit contributions
eln+0.651
imens+0.625
earable+0.616
enne+0.582
imates+0.567
Neg logit contributions
theless-1.021
 rhy-0.957
ymes-0.919
ingly-0.850
olla-0.834
Token to
Token ID284
Feature activation+6.72e+00

Pos logit contributions
eln+4.993
imens+4.593
ainted+4.472
earable+4.420
handled+4.365
Neg logit contributions
theless-12.813
ymes-11.740
 rhy-11.598
olla-11.055
itudinal-10.743
Token be
Token ID307
Feature activation+3.55e+00

Pos logit contributions
eln+3.806
imens+3.695
earable+3.552
enne+3.403
ainted+3.381
Neg logit contributions
theless-6.259
ymes-5.571
 rhy-5.555
olla-5.121
itudinal-4.977
Token life
Token ID1204
Feature activation+5.00e+00

Pos logit contributions
eln+1.993
imens+1.904
earable+1.875
enne+1.757
imates+1.732
Neg logit contributions
theless-3.362
 rhy-3.123
ymes-3.036
ingly-2.788
olla-2.772
Token-
Token ID12
Feature activation+4.50e+00

Pos logit contributions
eln+3.720
imens+3.606
earable+3.586
imates+3.408
handled+3.388
Neg logit contributions
theless-3.738
 rhy-3.403
ymes-3.310
ingly-2.919
olla-2.918
Tokenthreatening
Token ID26159
Feature activation+6.87e+00

Pos logit contributions
eln+2.663
imens+2.556
earable+2.519
enne+2.378
handled+2.356
Neg logit contributions
theless-4.128
 rhy-3.854
ymes-3.738
ingly-3.365
olla-3.363
Token vegetation
Token ID28459
Feature activation+8.25e-01

Pos logit contributions
theless+0.906
 rhy+0.859
ymes+0.815
ingly+0.754
olla+0.739
Neg logit contributions
eln-0.620
imens-0.598
earable-0.589
enne-0.557
imates-0.543
Token by
Token ID416
Feature activation-1.64e+00

Pos logit contributions
eln+0.638
imens+0.617
earable+0.610
enne+0.585
imates+0.572
Neg logit contributions
theless-0.617
 rhy-0.572
ymes-0.541
ingly-0.491
olla-0.478
Token molecules
Token ID17745
Feature activation-1.54e+00

Pos logit contributions
theless+1.311
 rhy+1.235
ymes+1.159
ingly+1.066
olla+1.034
Neg logit contributions
eln-1.184
imens-1.142
earable-1.128
enne-1.077
imates-1.049
Token out
Token ID503
Feature activation+3.97e+00

Pos logit contributions
theless+1.117
 rhy+1.044
ymes+0.982
ingly+0.885
olla+0.853
Neg logit contributions
eln-1.233
imens-1.195
earable-1.182
enne-1.128
imates-1.111
Token-
Token ID12
Feature activation+6.49e+00

Pos logit contributions
eln+3.038
imens+2.955
earable+2.912
enne+2.822
imates+2.775
Neg logit contributions
theless-2.961
 rhy-2.690
ymes-2.558
ingly-2.292
olla-2.247
Tokeng
Token ID70
Feature activation+1.26e+01

Pos logit contributions
eln+4.223
imens+4.102
earable+4.036
enne+3.894
imates+3.837
Neg logit contributions
theless-5.423
 rhy-5.019
ymes-4.774
ingly-4.354
olla-4.275
Tokenassed
Token ID21390
Feature activation+3.39e+00

Pos logit contributions
eln+7.067
imens+7.016
earable+6.929
assed+6.753
imates+6.731
Neg logit contributions
theless-11.230
 rhy-10.423
ymes-9.658
ingly-8.866
olla-8.682
Token from
Token ID422
Feature activation+1.14e+00

Pos logit contributions
eln+2.425
imens+2.340
earable+2.302
enne+2.211
imates+2.172
Neg logit contributions
theless-2.724
 rhy-2.510
ymes-2.399
ingly-2.179
olla-2.139
Token soils
Token ID38750
Feature activation+1.65e-01

Pos logit contributions
eln+0.842
imens+0.815
earable+0.805
enne+0.770
imates+0.754
Neg logit contributions
theless-0.888
 rhy-0.822
ymes-0.782
ingly-0.712
olla-0.694
Token by
Token ID416
Feature activation-1.49e+00

Pos logit contributions
eln+0.130
imens+0.126
earable+0.125
enne+0.119
imates+0.117
Neg logit contributions
theless-0.122
 rhy-0.113
ymes-0.106
ingly-0.097
olla-0.093
Token the
Token ID262
Feature activation-1.65e+00

Pos logit contributions
theless+1.197
 rhy+1.130
ymes+1.061
ingly+0.975
olla+0.941
Neg logit contributions
eln-1.069
imens-1.031
earable-1.021
enne-0.975
imates-0.947
Token.
Token ID13
Feature activation-3.05e-01

Pos logit contributions
eln+2.743
imens+2.654
earable+2.624
enne+2.516
imates+2.484
Neg logit contributions
theless-2.697
 rhy-2.465
ymes-2.363
ingly-2.116
olla-2.084
Token Shortly
Token ID27265
Feature activation+4.76e-01

Pos logit contributions
theless+0.212
 rhy+0.194
ymes+0.184
ingly+0.166
olla+0.161
Neg logit contributions
eln-0.252
imens-0.245
earable-0.242
enne-0.232
imates-0.227
Token thereafter
Token ID19547
Feature activation-1.46e-01

Pos logit contributions
eln+0.416
imens+0.405
earable+0.401
enne+0.387
imates+0.380
Neg logit contributions
theless-0.306
 rhy-0.278
ymes-0.262
ingly-0.232
olla-0.225
Token,
Token ID11
Feature activation+5.13e-01

Pos logit contributions
theless+0.106
 rhy+0.097
ymes+0.092
ingly+0.082
olla+0.080
Neg logit contributions
eln-0.117
imens-0.113
earable-0.112
enne-0.108
imates-0.106
Token the
Token ID262
Feature activation+9.34e-01

Pos logit contributions
eln+0.371
imens+0.359
earable+0.354
enne+0.339
imates+0.332
Neg logit contributions
theless-0.408
 rhy-0.378
ymes-0.360
ingly-0.328
olla-0.320
Token congressman
Token ID29545
Feature activation+1.26e+01

Pos logit contributions
eln+0.702
imens+0.680
earable+0.672
enne+0.643
imates+0.630
Neg logit contributions
theless-0.718
 rhy-0.664
ymes-0.631
ingly-0.573
olla-0.558
Token would
Token ID561
Feature activation+1.26e+00

Pos logit contributions
eln+8.174
earable+8.019
ainted+7.835
imens+7.751
handled+7.750
Neg logit contributions
theless-9.101
ymes-8.135
 rhy-7.917
itudinal-7.087
olla-6.968
Token be
Token ID307
Feature activation+2.08e+00

Pos logit contributions
eln+0.977
imens+0.946
earable+0.935
enne+0.897
imates+0.880
Neg logit contributions
theless-0.941
 rhy-0.867
ymes-0.823
ingly-0.745
olla-0.725
Token chosen
Token ID7147
Feature activation+4.87e+00

Pos logit contributions
eln+1.549
imens+1.495
earable+1.478
enne+1.417
imates+1.391
Neg logit contributions
theless-1.606
 rhy-1.461
ymes-1.409
ingly-1.270
olla-1.240
Token as
Token ID355
Feature activation+1.64e+00

Pos logit contributions
eln+3.210
imens+3.073
earable+3.036
enne+2.915
imates+2.845
Neg logit contributions
theless-4.152
 rhy-3.826
ymes-3.718
ingly-3.318
olla-3.308
Token Romney
Token ID8431
Feature activation+1.10e+00

Pos logit contributions
eln+1.291
imens+1.253
earable+1.238
enne+1.187
imates+1.165
Neg logit contributions
theless-1.199
 rhy-1.106
ymes-1.047
ingly-0.947
olla-0.921
Token after
Token ID706
Feature activation+6.51e-01

Pos logit contributions
eln+0.169
imens+0.164
earable+0.162
enne+0.155
imates+0.152
Neg logit contributions
theless-0.163
 rhy-0.151
ymes-0.143
ingly-0.130
olla-0.126
Token debate
Token ID4384
Feature activation+4.58e+00

Pos logit contributions
eln+0.496
imens+0.481
earable+0.475
enne+0.455
imates+0.446
Neg logit contributions
theless-0.494
 rhy-0.456
ymes-0.433
ingly-0.393
olla-0.383
Token officials
Token ID2828
Feature activation+2.61e+00

Pos logit contributions
eln+3.021
imens+2.917
earable+2.853
enne+2.763
imates+2.650
Neg logit contributions
theless-3.884
 rhy-3.564
ymes-3.433
ingly-3.121
itudinal-3.062
Token turned
Token ID2900
Feature activation+4.88e+00

Pos logit contributions
eln+2.077
imens+2.022
earable+1.998
enne+1.926
imates+1.894
Neg logit contributions
theless-1.850
 rhy-1.680
ymes-1.617
ingly-1.438
olla-1.409
Token away
Token ID1497
Feature activation+5.27e+00

Pos logit contributions
eln+3.660
imens+3.533
earable+3.477
enne+3.418
imates+3.315
Neg logit contributions
theless-3.686
 rhy-3.316
ymes-3.204
ingly-2.835
olla-2.818
Token Green
Token ID3469
Feature activation+1.26e+01

Pos logit contributions
eln+3.671
imens+3.495
earable+3.481
enne+3.328
imates+3.283
Neg logit contributions
theless-4.251
 rhy-3.876
ymes-3.729
ingly-3.336
itudinal-3.312
Token Party
Token ID3615
Feature activation+2.62e+00

Pos logit contributions
eln+7.248
earable+6.635
imens+6.518
enne+6.418
ainted+6.244
Neg logit contributions
theless-11.066
 rhy-10.184
ymes-10.024
ingly-8.857
itudinal-8.673
Token presidential
Token ID4787
Feature activation-2.26e+00

Pos logit contributions
eln+1.773
imens+1.698
earable+1.683
enne+1.607
imates+1.566
Neg logit contributions
theless-2.208
 rhy-2.046
ymes-1.964
ingly-1.788
olla-1.750
Token nominee
Token ID10429
Feature activation+6.74e+00

Pos logit contributions
theless+2.409
 rhy+2.299
ymes+2.202
ingly+2.088
olla+2.042
Neg logit contributions
eln-1.009
imens-0.960
earable-0.927
enne-0.853
imates-0.831
Token Jill
Token ID22772
Feature activation+4.31e+00

Pos logit contributions
eln+3.577
imens+3.428
earable+3.277
enne+3.208
ainted+3.096
Neg logit contributions
theless-6.392
 rhy-5.972
ymes-5.718
olla-5.268
ingly-5.219
Token Stein
Token ID15215
Feature activation+9.16e+00

Pos logit contributions
eln+3.219
imens+3.114
earable+3.048
enne+3.032
imates+2.884
Neg logit contributions
theless-3.333
 rhy-3.064
ymes-2.878
olla-2.570
ingly-2.527
Token equ
Token ID1602
Feature activation+3.50e+00

Pos logit contributions
eln+1.927
imens+1.870
earable+1.846
enne+1.760
imates+1.728
Neg logit contributions
theless-2.162
 rhy-1.996
ymes-1.911
ingly-1.731
olla-1.699
Tokenated
Token ID515
Feature activation+4.66e+00

Pos logit contributions
eln+1.450
imens+1.404
earable+1.334
enne+1.267
imates+1.230
Neg logit contributions
theless-3.868
 rhy-3.645
ymes-3.505
ingly-3.247
olla-3.229
Token with
Token ID351
Feature activation+1.60e+00

Pos logit contributions
eln+3.244
imens+3.102
earable+3.082
enne+2.983
ainted+2.905
Neg logit contributions
theless-3.778
 rhy-3.455
ymes-3.308
olla-2.975
ingly-2.930
Token an
Token ID281
Feature activation+7.40e-01

Pos logit contributions
eln+1.135
imens+1.095
earable+1.082
enne+1.033
imates+1.012
Neg logit contributions
theless-1.298
 rhy-1.204
ymes-1.150
ingly-1.049
olla-1.025
Token "
Token ID366
Feature activation+3.36e+00

Pos logit contributions
eln+0.488
imens+0.471
earable+0.464
enne+0.441
imates+0.431
Neg logit contributions
theless-0.635
 rhy-0.593
ymes-0.567
ingly-0.521
olla-0.509
Tokenunt
Token ID2797
Feature activation+1.26e+01

Pos logit contributions
eln+2.388
imens+2.296
earable+2.274
enne+2.174
imates+2.123
Neg logit contributions
theless-2.744
 rhy-2.541
ymes-2.408
ingly-2.186
olla-2.141
Tokenouch
Token ID7673
Feature activation+2.74e+00

Pos logit contributions
imens+7.426
earable+7.368
ainted+7.257
eln+7.186
imates+6.909
Neg logit contributions
theless-11.333
 rhy-9.942
ymes-9.362
ingly-8.729
olla-8.459
Tokenable
Token ID540
Feature activation+1.71e+00

Pos logit contributions
eln+1.666
imens+1.597
earable+1.578
enne+1.499
imates+1.455
Neg logit contributions
theless-2.499
 rhy-2.341
ymes-2.244
ingly-2.059
olla-2.030
Token"
Token ID1
Feature activation+2.11e+00

Pos logit contributions
eln+1.319
imens+1.278
earable+1.264
enne+1.212
imates+1.188
Neg logit contributions
theless-1.277
 rhy-1.177
ymes-1.118
ingly-1.011
olla-0.985
Token status
Token ID3722
Feature activation+8.94e-01

Pos logit contributions
eln+1.439
imens+1.388
earable+1.371
enne+1.306
imates+1.276
Neg logit contributions
theless-1.769
 rhy-1.643
ymes-1.572
ingly-1.439
olla-1.407
Token that
Token ID326
Feature activation+1.29e+00

Pos logit contributions
eln+0.713
imens+0.690
earable+0.682
enne+0.655
imates+0.642
Neg logit contributions
theless-0.646
 rhy-0.597
ymes-0.564
ingly-0.511
olla-0.496
Token yellow
Token ID7872
Feature activation+6.61e+00

Pos logit contributions
eln+1.606
imens+1.555
earable+1.539
enne+1.483
imates+1.456
Neg logit contributions
theless-1.337
 rhy-1.217
ymes-1.158
ingly-1.034
olla-1.005
Token fever
Token ID17372
Feature activation-1.59e+00

Pos logit contributions
eln+4.971
imens+4.721
earable+4.692
enne+4.595
imates+4.492
Neg logit contributions
theless-5.010
 rhy-4.432
ymes-4.300
ingly-3.873
olla-3.810
Token
Token ID447
Feature activation-6.20e-02

Pos logit contributions
theless+1.103
 rhy+1.036
ymes+0.973
ingly+0.881
olla+0.839
Neg logit contributions
eln-1.304
imens-1.273
earable-1.267
enne-1.201
imates-1.183
Token
Token ID251
Feature activation-1.09e+00

Pos logit contributions
theless+0.043
 rhy+0.039
ymes+0.037
ingly+0.033
olla+0.032
Neg logit contributions
eln-0.051
imens-0.050
earable-0.049
enne-0.047
imates-0.046
Token or
Token ID393
Feature activation-1.33e+00

Pos logit contributions
theless+0.777
 rhy+0.721
ymes+0.680
ingly+0.613
olla+0.593
Neg logit contributions
eln-0.872
imens-0.852
earable-0.839
enne-0.801
imates-0.791
Token something
Token ID1223
Feature activation-1.49e+01

Pos logit contributions
theless+0.962
 rhy+0.902
ymes+0.844
ingly+0.765
olla+0.739
Neg logit contributions
eln-1.042
imens-1.022
earable-1.005
enne-0.961
imates-0.947
Token like
Token ID588
Feature activation-3.94e+00

Pos logit contributions
 rhy+8.330
theless+7.952
ymes+7.542
ingly+6.759
 else+6.705
Neg logit contributions
imens-12.152
eln-11.941
earable-11.815
enne-11.079
imates-11.000
Token that
Token ID326
Feature activation-4.48e+00

Pos logit contributions
theless+2.512
 rhy+2.379
ymes+2.205
ingly+2.017
olla+1.891
Neg logit contributions
eln-3.353
imens-3.340
earable-3.279
imates-3.130
enne-3.096
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
theless+2.891
 rhy+2.756
ymes+2.571
ingly+2.292
olla+2.199
Neg logit contributions
eln-3.787
imens-3.703
earable-3.661
enne-3.478
imates-3.440
Token That
Token ID1320
Feature activation+2.61e-01

Pos logit contributions
theless+1.933
 rhy+1.805
ymes+1.704
ingly+1.526
olla+1.484
Neg logit contributions
eln-2.373
imens-2.313
earable-2.262
enne-2.165
imates-2.137
Token
Token ID447
Feature activation-1.92e+00

Pos logit contributions
eln+0.211
imens+0.204
earable+0.202
enne+0.194
imates+0.190
Neg logit contributions
theless-0.185
 rhy-0.171
ymes-0.162
ingly-0.145
olla-0.141
Token need
Token ID761
Feature activation-1.11e+00

Pos logit contributions
theless+0.203
 rhy+0.189
ymes+0.179
ingly+0.163
olla+0.158
Neg logit contributions
eln-0.197
imens-0.190
earable-0.188
enne-0.179
imates-0.176
Token the
Token ID262
Feature activation-8.60e-01

Pos logit contributions
theless+0.881
 rhy+0.835
ymes+0.787
ingly+0.723
olla+0.695
Neg logit contributions
eln-0.803
imens-0.779
earable-0.766
enne-0.729
imates-0.707
Token eggs
Token ID9653
Feature activation+2.72e+00

Pos logit contributions
theless+0.674
 rhy+0.636
ymes+0.597
ingly+0.542
olla+0.530
Neg logit contributions
eln-0.632
imens-0.614
earable-0.603
enne-0.577
imates-0.562
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
eln+2.079
imens+2.005
earable+1.984
enne+1.921
imates+1.882
Neg logit contributions
theless-2.045
 rhy-1.854
ymes-1.770
ingly-1.605
olla-1.570
Token
Token ID784
Feature activation+4.70e+00

Pos logit contributions
theless+0.968
 rhy+0.898
ymes+0.847
ingly+0.764
olla+0.740
Neg logit contributions
eln-1.117
imens-1.086
earable-1.067
enne-1.022
imates-1.005
Token Sch
Token ID3059
Feature activation-1.49e+01

Pos logit contributions
eln+3.158
imens+3.048
earable+3.014
enne+2.874
imates+2.813
Neg logit contributions
theless-3.939
 rhy-3.638
ymes-3.488
ingly-3.194
olla-3.126
Tokenind
Token ID521
Feature activation-4.37e+00

Pos logit contributions
ymes+8.331
theless+8.318
 rhy+8.296
ingly+7.637
olla+7.451
Neg logit contributions
eln-10.878
imens-10.608
earable-10.388
ktop-10.320
handled-10.076
Tokenel
Token ID417
Feature activation-3.28e+00

Pos logit contributions
theless+3.729
 rhy+3.663
ymes+3.590
ingly+3.396
olla+3.308
Neg logit contributions
imens-2.501
earable-2.405
eln-2.353
 millenn-2.346
imates-2.298
Token
Token ID198
Feature activation-2.11e+00

Pos logit contributions
theless+1.958
 rhy+1.784
ymes+1.659
ingly+1.466
olla+1.412
Neg logit contributions
eln-3.004
imens-2.934
earable-2.892
enne-2.787
imates-2.747
Token
Token ID198
Feature activation-2.35e+00

Pos logit contributions
theless+1.102
 rhy+1.020
ymes+0.953
ingly+0.851
olla+0.805
Neg logit contributions
eln-2.065
imens-2.016
earable-1.982
enne-1.912
imates-1.886
Token
Token ID447
Feature activation-6.46e-01

Pos logit contributions
theless+1.575
 rhy+1.441
ymes+1.375
ingly+1.240
olla+1.209
Neg logit contributions
eln-1.952
imens-1.899
earable-1.868
enne-1.795
imates-1.778
Token legions
Token ID44395
Feature activation-5.80e-01

Pos logit contributions
theless+2.192
 rhy+2.042
ymes+1.899
ingly+1.775
olla+1.680
Neg logit contributions
eln-2.231
imens-2.190
earable-2.145
enne-2.062
ainted-2.001
Token of
Token ID286
Feature activation+4.08e-01

Pos logit contributions
theless+0.396
 rhy+0.363
ymes+0.342
ingly+0.306
olla+0.297
Neg logit contributions
eln-0.485
imens-0.472
earable-0.467
enne-0.449
imates-0.441
Token fans
Token ID3296
Feature activation-9.50e-01

Pos logit contributions
eln+0.335
imens+0.326
earable+0.322
enne+0.310
imates+0.304
Neg logit contributions
theless-0.285
 rhy-0.262
ymes-0.247
ingly-0.222
olla-0.215
Token who
Token ID508
Feature activation-6.46e-01

Pos logit contributions
theless+0.676
 rhy+0.627
ymes+0.587
ingly+0.539
olla+0.515
Neg logit contributions
eln-0.768
imens-0.744
earable-0.735
enne-0.705
imates-0.689
Token enjoy
Token ID2883
Feature activation-1.47e+00

Pos logit contributions
theless+0.462
 rhy+0.428
ymes+0.399
ingly+0.364
olla+0.351
Neg logit contributions
eln-0.522
imens-0.507
earable-0.500
enne-0.479
imates-0.471
Token nothing
Token ID2147
Feature activation-1.48e+01

Pos logit contributions
theless+1.101
 rhy+1.020
ymes+0.957
ingly+0.877
olla+0.848
Neg logit contributions
eln-1.139
imens-1.112
earable-1.094
enne-1.047
imates-1.023
Token more
Token ID517
Feature activation-2.85e+00

Pos logit contributions
 rhy+7.455
theless+7.395
ymes+6.446
ingly+6.139
 else+5.863
Neg logit contributions
imens-13.265
eln-12.891
enne-12.477
earable-12.420
Issue-11.778
Token than
Token ID621
Feature activation-1.41e+00

Pos logit contributions
theless+1.769
 rhy+1.614
ymes+1.508
ingly+1.346
olla+1.288
Neg logit contributions
eln-2.558
imens-2.495
earable-2.454
enne-2.376
imates-2.337
Token slaughter
Token ID17968
Feature activation-4.10e+00

Pos logit contributions
theless+1.078
 rhy+1.009
ymes+0.939
ingly+0.868
olla+0.840
Neg logit contributions
eln-1.059
imens-1.030
earable-1.011
enne-0.972
imates-0.945
Tokening
Token ID278
Feature activation-3.68e+00

Pos logit contributions
theless+1.496
 rhy+1.296
ymes+1.124
ingly+1.006
olla+0.791
Neg logit contributions
eln-4.665
earable-4.588
imens-4.560
enne-4.387
ainted-4.340
Token zombies
Token ID19005
Feature activation+1.90e+00

Pos logit contributions
theless+2.884
 rhy+2.734
ymes+2.556
ingly+2.403
olla+2.279
Neg logit contributions
eln-2.682
imens-2.588
earable-2.533
enne-2.431
imates-2.362
Token figures
Token ID5538
Feature activation-7.46e-01

Pos logit contributions
eln+2.509
imens+2.406
earable+2.395
enne+2.280
imates+2.276
Neg logit contributions
theless-2.394
 rhy-2.194
ymes-2.103
ingly-1.872
olla-1.852
Token from
Token ID422
Feature activation+5.65e-01

Pos logit contributions
theless+0.525
 rhy+0.483
ymes+0.452
ingly+0.410
olla+0.396
Neg logit contributions
eln-0.612
imens-0.593
earable-0.586
enne-0.564
imates-0.550
Token the
Token ID262
Feature activation+7.22e-01

Pos logit contributions
eln+0.442
imens+0.429
earable+0.424
enne+0.407
imates+0.399
Neg logit contributions
theless-0.416
 rhy-0.383
ymes-0.364
ingly-0.329
olla-0.320
Token Office
Token ID4452
Feature activation+2.08e+00

Pos logit contributions
eln+0.609
imens+0.592
earable+0.586
enne+0.564
imates+0.554
Neg logit contributions
theless-0.487
 rhy-0.445
ymes-0.420
ingly-0.375
olla-0.364
Token for
Token ID329
Feature activation-4.65e+00

Pos logit contributions
eln+1.298
imens+1.244
earable+1.232
enne+1.173
imates+1.141
Neg logit contributions
theless-1.862
 rhy-1.728
ymes-1.663
ingly-1.527
olla-1.498
Token National
Token ID2351
Feature activation-1.46e+01

Pos logit contributions
theless+2.755
 rhy+2.605
ymes+2.450
ingly+2.243
itudinal+2.024
Neg logit contributions
eln-4.266
earable-4.145
imens-4.056
enne-3.893
ainted-3.765
Token Statistics
Token ID14370
Feature activation-1.99e+00

Pos logit contributions
theless+5.345
 rhy+5.091
ymes+4.765
 Savings+4.511
ingly+4.127
Neg logit contributions
earable-15.310
eln-15.292
imens-14.633
ainted-14.207
ktop-14.178
Token said
Token ID531
Feature activation+6.65e-02

Pos logit contributions
theless+1.345
 rhy+1.230
ymes+1.137
ingly+1.039
olla+0.998
Neg logit contributions
eln-1.693
imens-1.653
earable-1.621
enne-1.565
imates-1.525
Token industrial
Token ID7593
Feature activation-4.40e-01

Pos logit contributions
eln+0.052
imens+0.050
earable+0.049
enne+0.047
imates+0.046
Neg logit contributions
theless-0.049
 rhy-0.046
ymes-0.043
ingly-0.039
olla-0.038
Token output
Token ID5072
Feature activation+3.68e+00

Pos logit contributions
theless+0.350
 rhy+0.326
ymes+0.309
ingly+0.283
olla+0.275
Neg logit contributions
eln-0.319
imens-0.309
earable-0.305
enne-0.291
imates-0.285
Token rose
Token ID8278
Feature activation+3.29e+00

Pos logit contributions
eln+2.673
imens+2.602
earable+2.573
enne+2.460
imates+2.437
Neg logit contributions
theless-2.856
 rhy-2.624
ymes-2.531
ingly-2.245
olla-2.225
TokenES
Token ID1546
Feature activation+7.09e-01

Pos logit contributions
eln+3.171
imens+3.068
earable+3.011
enne+2.887
imates+2.827
Neg logit contributions
theless-3.737
 rhy-3.449
ymes-3.271
ingly-2.968
olla-2.928
Token)
Token ID8
Feature activation+4.87e-01

Pos logit contributions
eln+0.527
imens+0.510
earable+0.504
enne+0.480
imates+0.471
Neg logit contributions
theless-0.551
 rhy-0.511
ymes-0.486
ingly-0.441
olla-0.430
TokenUS
Token ID2937
Feature activation-1.06e+00

Pos logit contributions
eln+0.378
imens+0.366
earable+0.362
enne+0.347
imates+0.340
Neg logit contributions
theless-0.362
 rhy-0.334
ymes-0.317
ingly-0.287
olla-0.279
Token Treasury
Token ID13419
Feature activation-1.01e+00

Pos logit contributions
theless+0.833
 rhy+0.770
ymes+0.738
ingly+0.676
olla+0.646
Neg logit contributions
eln-0.776
imens-0.756
earable-0.746
enne-0.713
imates-0.692
Token Secretary
Token ID4986
Feature activation+2.21e+00

Pos logit contributions
theless+0.739
 rhy+0.686
ymes+0.645
ingly+0.586
olla+0.568
Neg logit contributions
eln-0.798
imens-0.776
earable-0.766
enne-0.735
imates-0.719
Token Jacob
Token ID12806
Feature activation-1.44e+01

Pos logit contributions
eln+1.595
imens+1.537
earable+1.518
enne+1.448
imates+1.427
Neg logit contributions
theless-1.770
 rhy-1.627
ymes-1.560
ingly-1.408
olla-1.379
Token Lew
Token ID8260
Feature activation-3.58e+00

Pos logit contributions
theless+7.383
ymes+6.758
 rhy+6.691
olla+5.848
ingly+5.525
Neg logit contributions
earable-12.584
eln-12.322
imens-12.237
ainted-11.914
enne-11.820
Token speaks
Token ID9209
Feature activation+5.03e+00

Pos logit contributions
theless+2.518
 rhy+2.348
ymes+2.198
ingly+2.006
olla+1.955
Neg logit contributions
eln-2.819
imens-2.810
earable-2.770
enne-2.640
imates-2.582
Token on
Token ID319
Feature activation+8.48e+00

Pos logit contributions
eln+3.516
imens+3.361
earable+3.311
enne+3.186
ainted+3.137
Neg logit contributions
theless-4.072
 rhy-3.713
ymes-3.610
ingly-3.223
olla-3.195
Token the
Token ID262
Feature activation+3.78e+00

Pos logit contributions
eln+5.461
imens+5.105
earable+4.990
enne+4.773
ainted+4.730
Neg logit contributions
theless-7.132
 rhy-6.543
ymes-6.464
olla-5.800
ingly-5.770
Token state
Token ID1181
Feature activation-1.47e+00

Pos logit contributions
eln+2.628
imens+2.536
earable+2.493
enne+2.386
imates+2.336
Neg logit contributions
theless-3.083
 rhy-2.848
ymes-2.754
ingly-2.498
olla-2.446
Token start
Token ID923
Feature activation-8.47e+00

Pos logit contributions
theless+2.439
 rhy+2.341
ymes+2.123
ingly+1.922
olla+1.831
Neg logit contributions
eln-2.924
imens-2.863
earable-2.813
enne-2.708
ainted-2.652
Token off
Token ID572
Feature activation-6.31e+00

Pos logit contributions
 rhy+4.978
theless+4.762
ymes+4.488
ingly+3.874
olla+3.650
Neg logit contributions
eln-7.326
imens-7.253
earable-7.164
enne-6.847
imates-6.601
Token one
Token ID530
Feature activation+1.52e-01

Pos logit contributions
 rhy+4.304
theless+4.287
ymes+3.930
ingly+3.531
olla+3.253
Neg logit contributions
eln-4.976
imens-4.974
earable-4.773
enne-4.628
imates-4.462
Token way
Token ID835
Feature activation-5.52e-01

Pos logit contributions
eln+0.110
imens+0.107
earable+0.105
enne+0.100
imates+0.098
Neg logit contributions
theless-0.120
 rhy-0.113
ymes-0.107
ingly-0.098
olla-0.094
Token and
Token ID290
Feature activation-8.04e+00

Pos logit contributions
theless+0.412
 rhy+0.384
ymes+0.363
ingly+0.330
olla+0.320
Neg logit contributions
eln-0.424
imens-0.414
earable-0.407
enne-0.390
imates-0.381
Token end
Token ID886
Feature activation-1.43e+01

Pos logit contributions
theless+4.946
 rhy+4.924
ymes+4.298
ingly+3.882
olla+3.644
Neg logit contributions
eln-6.887
imens-6.683
earable-6.529
enne-6.383
ainted-6.282
Token another
Token ID1194
Feature activation-8.37e-01

Pos logit contributions
 rhy+7.744
theless+7.447
ymes+6.933
ingly+6.291
olla+5.690
Neg logit contributions
imens-11.934
eln-11.842
earable-11.606
enne-11.115
imates-10.937
Token.
Token ID13
Feature activation-8.64e-01

Pos logit contributions
theless+0.609
 rhy+0.570
ymes+0.535
ingly+0.487
olla+0.469
Neg logit contributions
eln-0.661
imens-0.642
earable-0.634
enne-0.607
imates-0.594
Token I
Token ID314
Feature activation-2.69e+00

Pos logit contributions
theless+0.643
 rhy+0.598
ymes+0.568
ingly+0.517
olla+0.504
Neg logit contributions
eln-0.669
imens-0.647
earable-0.638
enne-0.609
imates-0.596
Token find
Token ID1064
Feature activation-4.48e+00

Pos logit contributions
theless+1.794
 rhy+1.690
ymes+1.546
ingly+1.421
olla+1.355
Neg logit contributions
eln-2.295
imens-2.224
earable-2.197
enne-2.094
imates-2.060
Token it
Token ID340
Feature activation-1.80e+00

Pos logit contributions
theless+3.491
 rhy+3.358
ymes+3.093
ingly+2.912
olla+2.725
Neg logit contributions
eln-3.237
imens-3.153
earable-3.123
enne-2.947
ainted-2.853
Token this
Token ID428
Feature activation-2.81e+00

Pos logit contributions
theless+1.883
 rhy+1.760
ymes+1.645
ingly+1.511
olla+1.445
Neg logit contributions
eln-1.942
imens-1.872
earable-1.858
enne-1.761
imates-1.738
Token new
Token ID649
Feature activation-3.97e+00

Pos logit contributions
theless+2.076
 rhy+1.951
ymes+1.817
ingly+1.668
olla+1.579
Neg logit contributions
eln-2.188
imens-2.109
earable-2.093
enne-2.003
imates-1.957
Token Riverside
Token ID35597
Feature activation-2.24e+00

Pos logit contributions
theless+2.845
 rhy+2.670
ymes+2.467
ingly+2.276
olla+2.161
Neg logit contributions
eln-3.140
imens-3.035
earable-2.991
enne-2.894
imates-2.830
Token park
Token ID3952
Feature activation-1.61e+00

Pos logit contributions
theless+1.614
 rhy+1.508
ymes+1.402
ingly+1.277
olla+1.245
Neg logit contributions
eln-1.775
imens-1.727
earable-1.704
enne-1.632
imates-1.601
Token resembles
Token ID22960
Feature activation-3.80e+00

Pos logit contributions
theless+1.149
 rhy+1.071
ymes+1.000
ingly+0.909
olla+0.876
Neg logit contributions
eln-1.298
imens-1.258
earable-1.244
enne-1.197
imates-1.166
Token nothing
Token ID2147
Feature activation-1.43e+01

Pos logit contributions
theless+2.707
 rhy+2.537
ymes+2.358
ingly+2.181
olla+2.049
Neg logit contributions
eln-3.040
imens-2.932
earable-2.900
enne-2.797
imates-2.713
Token so
Token ID523
Feature activation-1.47e+00

Pos logit contributions
 rhy+8.617
theless+8.537
ymes+7.569
ingly+7.233
 else+6.814
Neg logit contributions
eln-11.762
imens-11.330
earable-10.993
enne-10.608
ktop-10.193
Token much
Token ID881
Feature activation-3.69e+00

Pos logit contributions
theless+1.207
 rhy+1.134
ymes+1.077
ingly+0.992
olla+0.957
Neg logit contributions
eln-1.019
imens-0.987
earable-0.972
enne-0.929
imates-0.904
Token as
Token ID355
Feature activation-4.20e-01

Pos logit contributions
theless+2.892
 rhy+2.711
ymes+2.558
ingly+2.346
olla+2.263
Neg logit contributions
eln-2.695
imens-2.616
earable-2.566
enne-2.467
imates-2.412
Token a
Token ID257
Feature activation+5.52e-01

Pos logit contributions
theless+0.313
 rhy+0.290
ymes+0.273
ingly+0.248
olla+0.240
Neg logit contributions
eln-0.327
imens-0.317
earable-0.312
enne-0.300
imates-0.294
Token cult
Token ID2285
Feature activation-3.58e-01

Pos logit contributions
eln+0.408
imens+0.394
earable+0.389
enne+0.373
imates+0.365
Neg logit contributions
theless-0.431
 rhy-0.400
ymes-0.380
ingly-0.346
olla-0.337
Tokenthis
Token ID5661
Feature activation+6.65e-01

Pos logit contributions
theless+2.679
 rhy+2.490
ymes+2.406
ingly+2.214
olla+2.146
Neg logit contributions
eln-2.297
imens-2.229
earable-2.206
enne-2.090
ktop-2.049
Token was
Token ID373
Feature activation+7.79e-01

Pos logit contributions
eln+0.515
imens+0.499
earable+0.494
enne+0.473
imates+0.464
Neg logit contributions
theless-0.495
 rhy-0.458
ymes-0.434
ingly-0.393
olla-0.382
Token not
Token ID407
Feature activation-8.85e-01

Pos logit contributions
eln+0.610
imens+0.591
earable+0.584
enne+0.560
imates+0.550
Neg logit contributions
theless-0.575
 rhy-0.529
ymes-0.502
ingly-0.453
olla-0.441
Token rape
Token ID7262
Feature activation+8.91e-01

Pos logit contributions
theless+0.647
 rhy+0.603
ymes+0.567
ingly+0.514
olla+0.496
Neg logit contributions
eln-0.697
imens-0.679
earable-0.671
enne-0.643
imates-0.628
Token or
Token ID393
Feature activation+8.93e-01

Pos logit contributions
eln+0.716
imens+0.694
earable+0.687
enne+0.659
imates+0.647
Neg logit contributions
theless-0.638
 rhy-0.587
ymes-0.555
ingly-0.500
olla-0.486
Token anything
Token ID1997
Feature activation-1.42e+01

Pos logit contributions
eln+0.731
imens+0.708
earable+0.699
enne+0.673
imates+0.662
Neg logit contributions
theless-0.627
 rhy-0.572
ymes-0.543
ingly-0.485
olla-0.474
Token like
Token ID588
Feature activation-1.79e+00

Pos logit contributions
theless+8.034
 rhy+7.975
ymes+7.212
ingly+6.397
 else+6.378
Neg logit contributions
imens-11.939
eln-11.589
earable-11.578
ktop-10.839
imates-10.830
Token that
Token ID326
Feature activation+1.29e+00

Pos logit contributions
theless+1.258
 rhy+1.169
ymes+1.102
ingly+0.999
olla+0.963
Neg logit contributions
eln-1.443
imens-1.408
earable-1.392
enne-1.328
imates-1.310
Token,
Token ID11
Feature activation+1.70e+00

Pos logit contributions
eln+1.221
imens+1.190
earable+1.178
enne+1.140
imates+1.122
Neg logit contributions
theless-0.741
 rhy-0.659
ymes-0.618
ingly-0.537
olla-0.519
Token this
Token ID428
Feature activation-5.13e-01

Pos logit contributions
eln+1.347
imens+1.304
earable+1.291
enne+1.243
imates+1.216
Neg logit contributions
theless-1.238
 rhy-1.130
ymes-1.077
ingly-0.967
olla-0.944
Token was
Token ID373
Feature activation-1.29e+00

Pos logit contributions
theless+0.400
 rhy+0.374
ymes+0.354
ingly+0.323
olla+0.314
Neg logit contributions
eln-0.379
imens-0.366
earable-0.363
enne-0.345
imates-0.340
Token
Token ID447
Feature activation-1.95e+00

Pos logit contributions
eln+3.755
imens+3.634
earable+3.628
enne+3.461
imates+3.380
Neg logit contributions
theless-3.749
 rhy-3.400
ymes-3.257
ingly-2.937
olla-2.869
Token
Token ID251
Feature activation+1.57e+00

Pos logit contributions
theless+0.844
 rhy+0.752
ymes+0.684
ingly+0.562
olla+0.532
Neg logit contributions
eln-2.102
imens-2.058
earable-2.031
enne-1.978
imates-1.952
Token Iron
Token ID7931
Feature activation+3.83e-01

Pos logit contributions
eln+1.220
imens+1.182
earable+1.170
enne+1.121
imates+1.100
Neg logit contributions
theless-1.172
 rhy-1.078
ymes-1.025
ingly-0.926
olla-0.902
Token Maiden
Token ID30591
Feature activation+4.64e+00

Pos logit contributions
eln+0.269
imens+0.261
earable+0.256
enne+0.244
imates+0.238
Neg logit contributions
theless-0.309
 rhy-0.292
ymes-0.278
ingly-0.257
olla-0.248
Token,
Token ID11
Feature activation+5.09e+00

Pos logit contributions
eln+3.516
imens+3.442
earable+3.420
enne+3.258
imates+3.188
Neg logit contributions
theless-3.487
 rhy-3.120
ymes-2.980
ingly-2.695
olla-2.634
Token Harry
Token ID5850
Feature activation-1.42e+01

Pos logit contributions
eln+3.774
earable+3.650
imens+3.645
enne+3.483
imates+3.399
Neg logit contributions
theless-3.884
 rhy-3.529
ymes-3.362
ingly-3.031
olla-2.963
Token Potter
Token ID14179
Feature activation-4.21e+00

Pos logit contributions
theless+8.745
 rhy+8.177
ymes+7.592
ingly+6.837
haus+6.644
Neg logit contributions
eln-11.834
imens-11.431
earable-11.402
ktop-10.914
imates-10.638
Token,
Token ID11
Feature activation+3.91e+00

Pos logit contributions
theless+2.703
 rhy+2.590
ymes+2.399
ingly+2.166
olla+2.047
Neg logit contributions
eln-3.587
imens-3.490
earable-3.398
enne-3.291
ktop-3.232
Token
Token ID564
Feature activation-1.08e+00

Pos logit contributions
eln+2.918
imens+2.819
earable+2.811
enne+2.687
imates+2.635
Neg logit contributions
theless-2.995
 rhy-2.739
ymes-2.610
ingly-2.358
olla-2.304
Token
Token ID250
Feature activation+2.69e+00

Pos logit contributions
theless+0.710
 rhy+0.663
ymes+0.629
ingly+0.564
olla+0.547
Neg logit contributions
eln-0.915
imens-0.889
earable-0.873
enne-0.843
imates-0.828
Tokenneed
Token ID31227
Feature activation-1.87e+00

Pos logit contributions
eln+1.808
imens+1.739
earable+1.723
enne+1.640
imates+1.601
Neg logit contributions
theless-2.276
 rhy-2.114
ymes-2.016
ingly-1.844
olla-1.805
Token kind
Token ID1611
Feature activation-5.52e+00

Pos logit contributions
theless+3.815
 rhy+3.720
ymes+3.365
ingly+3.107
olla+2.986
Neg logit contributions
eln-3.835
imens-3.792
earable-3.737
enne-3.555
imates-3.506
Token of
Token ID286
Feature activation-4.35e+00

Pos logit contributions
theless+3.475
 rhy+3.278
ymes+3.005
ingly+2.731
olla+2.595
Neg logit contributions
eln-4.776
imens-4.701
earable-4.663
enne-4.423
imates-4.403
Token CV
Token ID26196
Feature activation-6.97e-02

Pos logit contributions
theless+3.262
 rhy+3.162
ymes+2.874
ingly+2.656
olla+2.560
Neg logit contributions
eln-3.236
imens-3.219
earable-3.146
enne-3.002
imates-2.973
Token written
Token ID3194
Feature activation-5.48e+00

Pos logit contributions
theless+0.052
 rhy+0.048
ymes+0.046
ingly+0.041
olla+0.040
Neg logit contributions
eln-0.054
imens-0.052
earable-0.052
enne-0.049
imates-0.049
Token by
Token ID416
Feature activation-2.45e+00

Pos logit contributions
theless+3.772
 rhy+3.738
ymes+3.376
ingly+3.128
olla+2.952
Neg logit contributions
eln-4.316
imens-4.261
earable-4.174
enne-4.028
imates-3.944
Token Jacob
Token ID12806
Feature activation-1.41e+01

Pos logit contributions
theless+1.876
 rhy+1.763
ymes+1.653
ingly+1.513
olla+1.462
Neg logit contributions
eln-1.827
imens-1.785
earable-1.763
enne-1.692
imates-1.655
Token Re
Token ID797
Feature activation-5.75e+00

Pos logit contributions
theless+10.008
 rhy+9.552
ymes+9.135
ingly+8.214
olla+7.946
Neg logit contributions
earable-9.591
imens-9.543
eln-9.538
enne-8.932
imates-8.709
Tokenes
Token ID274
Feature activation-1.55e+00

Pos logit contributions
theless+4.562
 rhy+4.391
ymes+4.335
ingly+3.925
olla+3.837
Neg logit contributions
eln-3.706
earable-3.704
imens-3.649
enne-3.331
imates-3.327
Token-
Token ID12
Feature activation-1.88e+00

Pos logit contributions
theless+1.097
 rhy+1.017
ymes+0.961
ingly+0.872
olla+0.842
Neg logit contributions
eln-1.238
imens-1.216
earable-1.198
enne-1.141
imates-1.130
TokenM
Token ID44
Feature activation-3.18e+00

Pos logit contributions
theless+1.381
 rhy+1.286
ymes+1.232
ingly+1.104
olla+1.076
Neg logit contributions
eln-1.449
imens-1.418
earable-1.401
enne-1.334
imates-1.314
Tokenogg
Token ID10332
Feature activation-3.18e+00

Pos logit contributions
theless+1.701
 rhy+1.550
ymes+1.515
ingly+1.333
olla+1.285
Neg logit contributions
eln-3.004
imens-2.966
earable-2.926
imates-2.797
enne-2.792
Token
Token ID1906
Feature activation+2.48e+00

Pos logit contributions
theless+3.301
 rhy+3.097
ymes+2.885
ingly+2.553
olla+2.435
Neg logit contributions
eln-4.931
imens-4.805
earable-4.728
imates-4.569
enne-4.562
TokenB
Token ID33
Feature activation+2.59e-01

Pos logit contributions
eln+1.767
imens+1.709
earable+1.693
enne+1.615
imates+1.577
Neg logit contributions
theless-1.987
 rhy-1.837
ymes-1.751
ingly-1.594
olla-1.556
TokenOM
Token ID2662
Feature activation+4.74e-01

Pos logit contributions
eln+0.183
imens+0.176
earable+0.174
enne+0.166
imates+0.163
Neg logit contributions
theless-0.210
 rhy-0.195
ymes-0.186
ingly-0.170
olla-0.166
TokenB
Token ID33
Feature activation-1.54e+00

Pos logit contributions
eln+0.389
imens+0.377
earable+0.374
enne+0.358
imates+0.351
Neg logit contributions
theless-0.333
 rhy-0.303
ymes-0.288
ingly-0.258
olla-0.250
Token or
Token ID393
Feature activation-2.60e+00

Pos logit contributions
theless+1.087
 rhy+1.019
ymes+0.952
ingly+0.860
olla+0.836
Neg logit contributions
eln-1.242
imens-1.204
earable-1.190
enne-1.138
imates-1.123
Token something
Token ID1223
Feature activation-1.40e+01

Pos logit contributions
theless+1.916
 rhy+1.801
ymes+1.676
ingly+1.518
olla+1.469
Neg logit contributions
eln-2.016
imens-1.977
earable-1.943
enne-1.869
imates-1.831
Token else
Token ID2073
Feature activation-2.79e+00

Pos logit contributions
 rhy+8.378
theless+8.344
ymes+7.541
 else+6.898
ingly+6.662
Neg logit contributions
imens-10.895
eln-10.693
earable-10.540
enne-9.882
imates-9.796
Token.
Token ID13
Feature activation-4.74e-01

Pos logit contributions
theless+1.944
 rhy+1.824
ymes+1.704
ingly+1.525
olla+1.474
Neg logit contributions
eln-2.275
imens-2.229
earable-2.188
enne-2.090
imates-2.061
Token
Token ID784
Feature activation+2.54e+00

Pos logit contributions
theless+0.356
 rhy+0.332
ymes+0.313
ingly+0.283
olla+0.276
Neg logit contributions
eln-0.363
imens-0.353
earable-0.348
enne-0.334
imates-0.327
TokenB
Token ID33
Feature activation+4.27e-01

Pos logit contributions
eln+2.144
imens+2.077
earable+2.059
enne+1.983
imates+1.945
Neg logit contributions
theless-1.715
 rhy-1.559
ymes-1.476
ingly-1.318
olla-1.279
TokenOM
Token ID2662
Feature activation+1.33e-01

Pos logit contributions
eln+0.350
imens+0.340
earable+0.336
enne+0.323
imates+0.317
Neg logit contributions
theless-0.298
 rhy-0.274
ymes-0.259
ingly-0.232
olla-0.226
Token believe
Token ID1975
Feature activation-1.51e+00

Pos logit contributions
 rhy+3.832
theless+3.796
ymes+3.296
ingly+3.133
olla+2.907
Neg logit contributions
eln-4.931
imens-4.770
earable-4.740
enne-4.453
imates-4.423
Token in
Token ID287
Feature activation+3.35e-01

Pos logit contributions
theless+1.052
 rhy+0.991
ymes+0.922
ingly+0.844
olla+0.799
Neg logit contributions
eln-1.225
imens-1.200
earable-1.182
enne-1.126
imates-1.107
Token divine
Token ID11871
Feature activation+2.35e+00

Pos logit contributions
eln+0.245
imens+0.238
earable+0.235
enne+0.224
imates+0.220
Neg logit contributions
theless-0.264
 rhy-0.248
ymes-0.233
ingly-0.213
olla-0.207
Token intervention
Token ID9572
Feature activation+1.68e+00

Pos logit contributions
eln+1.704
imens+1.637
earable+1.608
enne+1.544
imates+1.519
Neg logit contributions
theless-1.864
 rhy-1.712
ymes-1.646
ingly-1.491
olla-1.470
Token or
Token ID393
Feature activation+1.73e+00

Pos logit contributions
eln+1.330
imens+1.290
earable+1.274
enne+1.225
imates+1.203
Neg logit contributions
theless-1.228
 rhy-1.125
ymes-1.069
ingly-0.963
olla-0.939
Token anything
Token ID1997
Feature activation-1.39e+01

Pos logit contributions
eln+1.337
imens+1.294
earable+1.280
enne+1.227
imates+1.204
Neg logit contributions
theless-1.286
 rhy-1.184
ymes-1.125
ingly-1.018
olla-0.991
Token.
Token ID13
Feature activation-3.32e-01

Pos logit contributions
 rhy+7.595
theless+7.308
ymes+6.900
ingly+5.968
 else+5.897
Neg logit contributions
imens-11.708
eln-11.622
earable-11.307
enne-10.786
ainted-10.660
Token But
Token ID887
Feature activation-7.32e-01

Pos logit contributions
theless+0.228
 rhy+0.211
ymes+0.199
ingly+0.179
olla+0.174
Neg logit contributions
eln-0.276
imens-0.268
earable-0.264
enne-0.253
imates-0.249
Token that
Token ID326
Feature activation+3.43e-01

Pos logit contributions
theless+0.527
 rhy+0.492
ymes+0.457
ingly+0.416
olla+0.400
Neg logit contributions
eln-0.588
imens-0.573
earable-0.565
enne-0.540
imates-0.532
Token was
Token ID373
Feature activation+1.95e+00

Pos logit contributions
eln+0.264
imens+0.257
earable+0.253
enne+0.242
imates+0.238
Neg logit contributions
theless-0.258
 rhy-0.241
ymes-0.227
ingly-0.206
olla-0.199
Token a
Token ID257
Feature activation+3.19e+00

Pos logit contributions
eln+1.502
imens+1.456
earable+1.437
enne+1.379
imates+1.352
Neg logit contributions
theless-1.466
 rhy-1.356
ymes-1.285
ingly-1.164
olla-1.132
Tokenyear
Token ID1941
Feature activation-6.50e+00

Pos logit contributions
eln+0.847
imens+0.819
earable+0.807
enne+0.772
imates+0.754
Neg logit contributions
theless-0.967
 rhy-0.896
ymes-0.853
ingly-0.779
olla-0.761
Token-
Token ID12
Feature activation-4.03e+00

Pos logit contributions
theless+4.016
 rhy+3.800
ymes+3.488
ingly+3.144
olla+3.009
Neg logit contributions
eln-5.688
imens-5.503
earable-5.449
enne-5.285
imates-5.199
Tokenold
Token ID727
Feature activation-8.50e+00

Pos logit contributions
theless+2.777
 rhy+2.609
ymes+2.532
ingly+2.321
olla+2.247
Neg logit contributions
eln-3.165
earable-3.059
imens-3.052
imates-2.895
enne-2.882
Token brother
Token ID3956
Feature activation-6.12e+00

Pos logit contributions
theless+4.802
 rhy+4.690
ymes+4.255
ingly+3.692
olla+3.422
Neg logit contributions
eln-7.693
imens-7.470
earable-7.469
enne-7.226
ktop-7.086
Token,
Token ID11
Feature activation-3.59e+00

Pos logit contributions
theless+4.218
 rhy+4.054
ymes+3.788
ingly+3.429
olla+3.285
Neg logit contributions
eln-4.854
imens-4.761
earable-4.744
enne-4.477
imates-4.365
Token Jacob
Token ID12806
Feature activation-1.39e+01

Pos logit contributions
theless+2.869
 rhy+2.722
ymes+2.566
ingly+2.320
olla+2.257
Neg logit contributions
eln-2.561
imens-2.463
earable-2.460
enne-2.335
imates-2.284
Token,
Token ID11
Feature activation-2.61e+00

Pos logit contributions
theless+8.710
 rhy+8.118
ymes+8.055
ingly+7.058
olla+6.813
Neg logit contributions
earable-10.434
eln-10.280
imens-10.037
enne-9.766
ktop-9.649
Token is
Token ID318
Feature activation-2.07e+00

Pos logit contributions
theless+1.816
 rhy+1.718
ymes+1.592
ingly+1.435
olla+1.389
Neg logit contributions
eln-2.125
imens-2.069
earable-2.047
enne-1.956
imates-1.908
Token also
Token ID635
Feature activation-3.37e-01

Pos logit contributions
theless+1.523
 rhy+1.439
ymes+1.348
ingly+1.218
olla+1.181
Neg logit contributions
eln-1.599
imens-1.555
earable-1.540
enne-1.471
imates-1.436
Token living
Token ID2877
Feature activation+1.25e-01

Pos logit contributions
theless+0.256
 rhy+0.238
ymes+0.225
ingly+0.204
olla+0.199
Neg logit contributions
eln-0.256
imens-0.249
earable-0.246
enne-0.235
imates-0.230
Token on
Token ID319
Feature activation+1.18e+00

Pos logit contributions
eln+0.102
imens+0.099
earable+0.098
enne+0.094
imates+0.092
Neg logit contributions
theless-0.088
 rhy-0.081
ymes-0.076
ingly-0.068
olla-0.066
Token Mother
Token ID10584
Feature activation+4.08e+00

Pos logit contributions
eln+1.219
imens+1.176
earable+1.167
enne+1.116
imates+1.098
Neg logit contributions
theless-1.245
 rhy-1.143
ymes-1.091
ingly-0.986
olla-0.964
Token Jones
Token ID5437
Feature activation-3.90e-02

Pos logit contributions
eln+2.649
imens+2.516
earable+2.492
enne+2.405
imates+2.353
Neg logit contributions
theless-3.537
 rhy-3.306
ymes-3.129
ingly-2.827
olla-2.780
Token'
Token ID6
Feature activation-2.87e+00

Pos logit contributions
theless+0.029
 rhy+0.027
ymes+0.025
ingly+0.023
olla+0.022
Neg logit contributions
eln-0.030
imens-0.029
earable-0.029
enne-0.028
imates-0.027
Token Foundation
Token ID5693
Feature activation-5.01e+00

Pos logit contributions
theless+2.127
 rhy+1.988
ymes+1.862
ingly+1.712
olla+1.663
Neg logit contributions
eln-2.199
imens-2.145
earable-2.110
enne-2.030
imates-1.975
Token for
Token ID329
Feature activation-2.57e+00

Pos logit contributions
theless+3.673
 rhy+3.483
ymes+3.226
ingly+3.047
olla+2.982
Neg logit contributions
eln-3.714
imens-3.679
earable-3.594
enne-3.398
imates-3.344
Token National
Token ID2351
Feature activation-1.38e+01

Pos logit contributions
theless+1.921
 rhy+1.799
ymes+1.698
ingly+1.575
olla+1.505
Neg logit contributions
eln-1.942
imens-1.890
earable-1.867
enne-1.805
imates-1.737
Token Progress
Token ID18387
Feature activation-2.87e+00

Pos logit contributions
 rhy+7.971
theless+7.789
ymes+7.452
ingly+7.329
itudinal+6.770
Neg logit contributions
eln-10.840
imens-10.457
earable-10.324
ktop-10.150
enne-10.040
Token,
Token ID11
Feature activation+3.50e+00

Pos logit contributions
theless+1.938
 rhy+1.801
ymes+1.697
ingly+1.539
olla+1.475
Neg logit contributions
eln-2.388
imens-2.325
earable-2.294
enne-2.206
imates-2.157
Token National
Token ID2351
Feature activation-7.48e+00

Pos logit contributions
eln+2.560
imens+2.460
earable+2.448
enne+2.338
imates+2.313
Neg logit contributions
theless-2.729
 rhy-2.502
ymes-2.398
ingly-2.153
olla-2.112
Token Public
Token ID5094
Feature activation-9.73e+00

Pos logit contributions
theless+5.045
 rhy+4.706
ymes+4.469
ingly+4.211
itudinal+4.020
Neg logit contributions
eln-6.010
imens-5.795
earable-5.720
enne-5.455
ktop-5.437
Token Radio
Token ID8829
Feature activation-1.80e+00

Pos logit contributions
theless+6.067
 rhy+5.600
ymes+5.210
ingly+5.162
itudinal+5.002
Neg logit contributions
eln-7.840
imens-7.775
earable-7.694
ktop-7.171
enne-7.116
Token edit
Token ID4370
Feature activation-3.14e+00

Pos logit contributions
theless+2.773
 rhy+2.718
ymes+2.602
ingly+2.313
olla+2.019
Neg logit contributions
 millenn-10.389
eln-10.227
imens-10.180
earable-9.810
ÍÍ-9.696
Token ]
Token ID2361
Feature activation-9.69e+00

Pos logit contributions
theless+1.339
 rhy+1.203
ymes+1.111
ingly+0.981
olla+0.903
Neg logit contributions
eln-3.327
imens-3.265
earable-3.206
enne-3.121
imates-3.082
Token
Token ID198
Feature activation-3.17e+00

Pos logit contributions
theless+5.173
 rhy+4.789
ymes+4.484
ingly+3.999
olla+3.763
Neg logit contributions
eln-8.914
imens-8.711
earable-8.553
enne-8.282
imates-8.211
Token
Token ID198
Feature activation-4.24e+00

Pos logit contributions
theless+1.623
 rhy+1.500
ymes+1.412
ingly+1.260
olla+1.187
Neg logit contributions
eln-3.133
imens-3.049
earable-2.986
enne-2.891
imates-2.857
TokenReferences
Token ID19927
Feature activation-2.71e+00

Pos logit contributions
theless+2.634
 rhy+2.441
ymes+2.381
ingly+2.147
olla+2.073
Neg logit contributions
eln-3.657
imens-3.531
earable-3.476
imates-3.341
enne-3.341
Token [
Token ID685
Feature activation-1.37e+01

Pos logit contributions
theless+1.844
 rhy+1.695
ymes+1.597
ingly+1.441
olla+1.390
Neg logit contributions
eln-2.266
imens-2.192
earable-2.170
enne-2.076
imates-2.049
Token edit
Token ID4370
Feature activation-4.91e+00

Pos logit contributions
 rhy+2.939
ymes+2.915
theless+2.756
ingly+2.633
olla+2.218
Neg logit contributions
 millenn-15.997
imens-15.040
eln-15.040
 pione-14.857
ÍÍ-14.717
Token ]
Token ID2361
Feature activation-1.03e+01

Pos logit contributions
theless+1.870
 rhy+1.677
ymes+1.562
ingly+1.391
olla+1.248
Neg logit contributions
eln-5.318
imens-5.222
earable-5.110
 millenn-5.040
enne-4.991
Token<|endoftext|>
Token ID50256
Feature activation+6.56e+00

Pos logit contributions
theless+5.311
 rhy+4.814
ymes+4.554
ingly+4.090
olla+3.863
Neg logit contributions
eln-9.642
imens-9.378
earable-9.187
enne-8.870
handled-8.824
TokenZ
Token ID57
Feature activation-5.44e+00

Pos logit contributions
eln+4.234
earable+4.041
imens+4.026
enne+3.830
imates+3.712
Neg logit contributions
theless-5.694
 rhy-5.247
ymes-4.957
ingly-4.504
olla-4.414
Tokenombie
Token ID9081
Feature activation+4.52e+00

Pos logit contributions
theless+4.165
 rhy+3.938
ymes+3.840
ingly+3.478
olla+3.467
Neg logit contributions
eln-3.778
imens-3.769
earable-3.667
imates-3.470
enne-3.451
Token several
Token ID1811
Feature activation-1.96e+00

Pos logit contributions
 rhy+5.297
theless+5.297
ymes+4.746
ingly+4.556
 Hindi+4.145
Neg logit contributions
imens-6.997
eln-6.947
earable-6.834
enne-6.746
ktop-6.351
Token lessons
Token ID11658
Feature activation-7.68e+00

Pos logit contributions
theless+1.637
 rhy+1.554
ymes+1.460
ingly+1.351
olla+1.304
Neg logit contributions
eln-1.331
imens-1.302
earable-1.280
enne-1.234
imates-1.194
Token from
Token ID422
Feature activation-2.41e+00

Pos logit contributions
theless+4.381
 rhy+4.197
ymes+3.692
ingly+3.511
olla+3.130
Neg logit contributions
eln-6.877
imens-6.821
enne-6.631
earable-6.629
imates-6.260
Token former
Token ID1966
Feature activation-1.14e+00

Pos logit contributions
theless+1.770
 rhy+1.662
ymes+1.549
ingly+1.414
olla+1.377
Neg logit contributions
eln-1.863
imens-1.833
earable-1.807
enne-1.738
imates-1.695
Token President
Token ID1992
Feature activation-2.39e-01

Pos logit contributions
theless+0.855
 rhy+0.797
ymes+0.745
ingly+0.685
olla+0.663
Neg logit contributions
eln-0.872
imens-0.849
earable-0.837
enne-0.802
imates-0.786
Token Harry
Token ID5850
Feature activation-1.37e+01

Pos logit contributions
theless+0.199
 rhy+0.187
ymes+0.179
ingly+0.164
olla+0.160
Neg logit contributions
eln-0.163
imens-0.158
earable-0.156
enne-0.149
imates-0.145
Token Truman
Token ID33027
Feature activation-4.13e+00

Pos logit contributions
theless+5.807
 rhy+4.684
ingly+4.369
ymes+4.198
haus+3.747
Neg logit contributions
eln-13.958
earable-13.504
imens-12.985
ktop-12.822
imates-12.732
Token.
Token ID13
Feature activation+2.79e-01

Pos logit contributions
theless+2.838
 rhy+2.651
ymes+2.442
ingly+2.262
olla+2.113
Neg logit contributions
eln-3.377
imens-3.281
earable-3.248
enne-3.155
imates-3.034
Token
Token ID198
Feature activation+2.71e+00

Pos logit contributions
eln+0.252
imens+0.245
earable+0.242
enne+0.234
imates+0.230
Neg logit contributions
theless-0.173
 rhy-0.157
ymes-0.147
ingly-0.130
olla-0.125
Token
Token ID198
Feature activation+6.17e-01

Pos logit contributions
eln+2.340
imens+2.276
earable+2.267
enne+2.186
imates+2.147
Neg logit contributions
theless-1.796
 rhy-1.605
ymes-1.505
ingly-1.313
olla-1.280
TokenIgn
Token ID32916
Feature activation-5.29e+00

Pos logit contributions
eln+0.456
imens+0.441
earable+0.436
enne+0.417
imates+0.409
Neg logit contributions
theless-0.482
 rhy-0.446
ymes-0.424
ingly-0.385
olla-0.376
Tokenorative
Token ID36478
Feature activation-2.03e+00

Pos logit contributions
theless+1.486
 rhy+1.352
ymes+1.266
ingly+1.139
olla+1.103
Neg logit contributions
eln-1.955
imens-1.895
earable-1.882
enne-1.802
imates-1.769
Token bank
Token ID3331
Feature activation-9.22e+00

Pos logit contributions
theless+1.554
 rhy+1.458
ymes+1.356
ingly+1.243
olla+1.212
Neg logit contributions
eln-1.527
imens-1.474
earable-1.466
enne-1.397
imates-1.362
Tokennotes
Token ID17815
Feature activation-5.08e+00

Pos logit contributions
theless+5.102
 rhy+4.929
ymes+4.503
ingly+4.055
olla+3.947
Neg logit contributions
eln-8.099
imens-8.014
earable-7.962
enne-7.502
imates-7.456
Token [
Token ID685
Feature activation-1.16e+01

Pos logit contributions
theless+3.401
 rhy+3.195
ymes+2.851
ingly+2.633
olla+2.542
Neg logit contributions
eln-4.276
imens-4.111
earable-4.091
enne-3.941
imates-3.812
Token edit
Token ID4370
Feature activation-5.99e+00

Pos logit contributions
 rhy+2.962
theless+2.847
ymes+2.699
ingly+2.387
olla+2.156
Neg logit contributions
 millenn-13.266
eln-12.604
imens-12.458
ÍÍ-12.335
earable-12.234
Token ]
Token ID2361
Feature activation-1.36e+01

Pos logit contributions
theless+2.086
 rhy+1.882
ymes+1.751
ingly+1.567
olla+1.390
Neg logit contributions
eln-6.554
imens-6.442
 millenn-6.375
earable-6.317
enne-6.147
Token
Token ID198
Feature activation-6.48e+00

Pos logit contributions
theless+3.999
 rhy+3.772
ymes+3.571
ingly+3.206
olla+3.189
Neg logit contributions
 millenn-14.689
eln-14.182
imens-13.948
earable-13.533
enne-13.119
Token
Token ID198
Feature activation-5.88e+00

Pos logit contributions
theless+2.626
 rhy+2.529
ymes+2.356
ingly+2.151
olla+1.968
Neg logit contributions
eln-6.788
 millenn-6.641
imens-6.607
earable-6.440
enne-6.264
Token$
Token ID3
Feature activation-1.01e+00

Pos logit contributions
theless+4.577
 rhy+4.285
ymes+4.140
ingly+3.848
olla+3.790
Neg logit contributions
eln-4.041
imens-3.952
earable-3.864
enne-3.671
imates-3.654
Token25
Token ID1495
Feature activation-5.72e+00

Pos logit contributions
theless+0.759
 rhy+0.703
ymes+0.669
ingly+0.610
olla+0.600
Neg logit contributions
eln-0.778
imens-0.754
earable-0.743
enne-0.708
imates-0.697
Token
Token ID784
Feature activation-1.71e+00

Pos logit contributions
theless+4.005
 rhy+3.809
ymes+3.506
ingly+3.186
olla+3.170
Neg logit contributions
eln-4.421
imens-4.393
earable-4.363
enne-4.104
imates-4.065
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
eln+0.001
imens+0.001
earable+0.000
enne+0.000
imates+0.000
Neg logit contributions
theless-0.000
 rhy-0.000
ymes-0.000
ingly-0.000
olla-0.000
Token although
Token ID3584
Feature activation-2.24e+00

Pos logit contributions
theless+0.961
 rhy+0.898
ymes+0.844
ingly+0.772
olla+0.745
Neg logit contributions
eln-0.991
imens-0.962
earable-0.952
enne-0.918
imates-0.888
Token she
Token ID673
Feature activation-5.53e-01

Pos logit contributions
theless+1.613
 rhy+1.499
ymes+1.398
ingly+1.281
olla+1.239
Neg logit contributions
eln-1.780
imens-1.751
earable-1.732
enne-1.659
imates-1.612
Token said
Token ID531
Feature activation+2.56e-02

Pos logit contributions
theless+0.396
 rhy+0.364
ymes+0.339
ingly+0.309
olla+0.297
Neg logit contributions
eln-0.449
imens-0.439
earable-0.434
enne-0.414
imates-0.405
Token the
Token ID262
Feature activation-3.54e-01

Pos logit contributions
eln+0.020
imens+0.020
earable+0.019
enne+0.019
imates+0.018
Neg logit contributions
theless-0.019
 rhy-0.017
ymes-0.016
ingly-0.015
olla-0.014
Token pur
Token ID1308
Feature activation-1.35e+01

Pos logit contributions
theless+0.273
 rhy+0.254
ymes+0.240
ingly+0.219
olla+0.212
Neg logit contributions
eln-0.265
imens-0.257
earable-0.254
enne-0.245
imates-0.237
Tokenview
Token ID1177
Feature activation-1.95e+00

Pos logit contributions
theless+8.556
 rhy+7.477
ymes+7.298
olla+6.748
ingly+6.554
Neg logit contributions
earable-10.275
eln-9.863
enne-9.334
 millenn-9.277
imens-9.170
Token of
Token ID286
Feature activation-2.97e-01

Pos logit contributions
theless+1.306
 rhy+1.217
ymes+1.128
ingly+1.041
olla+0.988
Neg logit contributions
eln-1.632
imens-1.592
earable-1.573
enne-1.524
imates-1.475
Token the
Token ID262
Feature activation-6.20e-01

Pos logit contributions
theless+0.222
 rhy+0.206
ymes+0.194
ingly+0.177
olla+0.172
Neg logit contributions
eln-0.229
imens-0.223
earable-0.220
enne-0.212
imates-0.206
Token bank
Token ID3331
Feature activation-3.74e+00

Pos logit contributions
theless+0.474
 rhy+0.442
ymes+0.418
ingly+0.380
olla+0.370
Neg logit contributions
eln-0.465
imens-0.451
earable-0.447
enne-0.429
imates-0.417
Token is
Token ID318
Feature activation-1.24e+00

Pos logit contributions
theless+2.590
 rhy+2.427
ymes+2.248
ingly+2.121
olla+1.998
Neg logit contributions
eln-2.970
imens-2.928
earable-2.871
enne-2.774
imates-2.695
Token and
Token ID290
Feature activation+2.01e+00

Pos logit contributions
eln+0.515
imens+0.499
earable+0.493
enne+0.472
imates+0.459
Neg logit contributions
theless-0.571
 rhy-0.533
ymes-0.505
ingly-0.465
olla-0.449
Token health
Token ID1535
Feature activation-2.66e+00

Pos logit contributions
eln+1.510
imens+1.451
earable+1.445
enne+1.377
imates+1.357
Neg logit contributions
theless-1.538
 rhy-1.417
ymes-1.359
ingly-1.220
olla-1.200
Token care
Token ID1337
Feature activation+1.12e+00

Pos logit contributions
theless+1.647
 rhy+1.513
ymes+1.408
ingly+1.258
olla+1.200
Neg logit contributions
eln-2.376
imens-2.314
earable-2.292
enne-2.207
imates-2.173
Token legislation
Token ID5520
Feature activation+3.07e+00

Pos logit contributions
eln+0.804
imens+0.777
earable+0.768
enne+0.733
imates+0.718
Neg logit contributions
theless-0.896
 rhy-0.832
ymes-0.792
ingly-0.724
olla-0.705
Token to
Token ID284
Feature activation-2.06e+00

Pos logit contributions
eln+2.176
imens+2.105
earable+2.079
enne+1.984
imates+1.953
Neg logit contributions
theless-2.482
 rhy-2.297
ymes-2.199
ingly-1.989
olla-1.961
Token Harry
Token ID5850
Feature activation-1.34e+01

Pos logit contributions
theless+1.569
 rhy+1.465
ymes+1.375
ingly+1.263
olla+1.218
Neg logit contributions
eln-1.539
imens-1.507
earable-1.477
enne-1.426
imates-1.393
Token Reid
Token ID16095
Feature activation-2.22e+00

Pos logit contributions
theless+6.447
 rhy+5.309
ymes+4.740
ingly+4.670
haus+4.186
Neg logit contributions
eln-13.192
earable-12.703
imens-12.434
imates-12.365
ktop-11.819
Token and
Token ID290
Feature activation+3.10e+00

Pos logit contributions
theless+1.528
 rhy+1.415
ymes+1.319
ingly+1.211
olla+1.147
Neg logit contributions
eln-1.811
imens-1.769
earable-1.760
enne-1.685
imates-1.651
Token Nancy
Token ID18496
Feature activation-2.71e-01

Pos logit contributions
eln+2.230
imens+2.137
earable+2.126
enne+2.025
imates+1.994
Neg logit contributions
theless-2.461
 rhy-2.265
ymes-2.174
ingly-1.953
olla-1.920
Token Pelosi
Token ID29611
Feature activation+4.80e+00

Pos logit contributions
theless+0.246
 rhy+0.230
ymes+0.220
ingly+0.203
olla+0.199
Neg logit contributions
eln-0.168
imens-0.160
earable-0.158
enne-0.149
imates-0.146
Token,
Token ID11
Feature activation+1.98e+00

Pos logit contributions
eln+3.623
imens+3.460
earable+3.424
enne+3.284
imates+3.242
Neg logit contributions
theless-3.643
 rhy-3.318
ymes-3.196
ingly-2.867
olla-2.825
Token his
Token ID465
Feature activation+1.06e+00

Pos logit contributions
theless+1.455
 rhy+1.367
ymes+1.281
ingly+1.185
olla+1.148
Neg logit contributions
eln-1.325
imens-1.289
earable-1.284
enne-1.218
imates-1.183
Token 2006
Token ID4793
Feature activation-1.53e-01

Pos logit contributions
eln+0.733
imens+0.708
earable+0.699
enne+0.666
imates+0.652
Neg logit contributions
theless-0.877
 rhy-0.816
ymes-0.779
ingly-0.713
olla-0.696
Token speech
Token ID4046
Feature activation+4.55e+00

Pos logit contributions
theless+0.115
 rhy+0.107
ymes+0.101
ingly+0.093
olla+0.090
Neg logit contributions
eln-0.116
imens-0.113
earable-0.112
enne-0.107
imates-0.105
Token marking
Token ID18730
Feature activation-4.21e+00

Pos logit contributions
eln+3.271
imens+3.120
earable+3.085
enne+2.964
imates+2.893
Neg logit contributions
theless-3.641
 rhy-3.299
ymes-3.209
ingly-2.873
olla-2.829
Token the
Token ID262
Feature activation-2.01e+00

Pos logit contributions
theless+2.869
 rhy+2.705
ymes+2.508
ingly+2.304
olla+2.217
Neg logit contributions
eln-3.409
imens-3.320
earable-3.289
enne-3.175
imates-3.105
Token founding
Token ID16636
Feature activation-1.33e+01

Pos logit contributions
theless+1.422
 rhy+1.331
ymes+1.241
ingly+1.126
olla+1.095
Neg logit contributions
eln-1.621
imens-1.570
earable-1.558
enne-1.498
imates-1.465
Token of
Token ID286
Feature activation-5.16e+00

Pos logit contributions
theless+7.088
 rhy+7.077
ymes+6.019
itudinal+5.490
ingly+5.474
Neg logit contributions
earable-11.151
imens-11.146
eln-11.146
handled-10.566
enne-10.525
Token his
Token ID465
Feature activation-2.43e+00

Pos logit contributions
theless+3.548
 rhy+3.398
ymes+3.155
ingly+2.841
olla+2.761
Neg logit contributions
eln-4.130
imens-4.032
earable-3.967
enne-3.837
imates-3.730
Token regime
Token ID7142
Feature activation-5.26e+00

Pos logit contributions
theless+1.780
 rhy+1.658
ymes+1.560
ingly+1.414
olla+1.375
Neg logit contributions
eln-1.899
imens-1.842
earable-1.822
enne-1.756
imates-1.713
Token,
Token ID11
Feature activation-1.39e+00

Pos logit contributions
theless+3.268
 rhy+3.142
ymes+2.861
ingly+2.561
olla+2.487
Neg logit contributions
eln-4.468
imens-4.412
earable-4.313
enne-4.153
imates-4.121
Token Gaddafi
Token ID40320
Feature activation+2.98e+00

Pos logit contributions
theless+1.051
 rhy+0.974
ymes+0.925
ingly+0.837
olla+0.814
Neg logit contributions
eln-1.060
imens-1.026
earable-1.017
enne-0.974
imates-0.954
Tokenev
Token ID1990
Feature activation-1.05e+00

Pos logit contributions
theless+0.387
 rhy+0.356
ymes+0.339
ingly+0.307
olla+0.299
Neg logit contributions
eln-0.416
imens-0.405
earable-0.401
enne-0.384
imates-0.378
Tokenict
Token ID713
Feature activation+1.14e-01

Pos logit contributions
theless+0.895
 rhy+0.829
ymes+0.792
ingly+0.733
olla+0.720
Neg logit contributions
eln-0.697
imens-0.676
earable-0.671
enne-0.639
imates-0.627
Token_
Token ID62
Feature activation+5.14e-01

Pos logit contributions
eln+0.081
imens+0.079
earable+0.078
enne+0.075
imates+0.073
Neg logit contributions
theless-0.091
 rhy-0.085
ymes-0.081
ingly-0.074
olla-0.072
Tokenfrom
Token ID6738
Feature activation+4.48e+00

Pos logit contributions
eln+0.355
imens+0.345
earable+0.342
enne+0.325
imates+0.319
Neg logit contributions
theless-0.423
 rhy-0.392
ymes-0.375
ingly-0.345
olla-0.336
Token_
Token ID62
Feature activation+2.52e+00

Pos logit contributions
eln+3.122
imens+2.978
earable+2.936
enne+2.819
imates+2.720
Neg logit contributions
theless-3.711
 rhy-3.430
ymes-3.273
ingly-2.969
olla-2.883
Tokenos
Token ID418
Feature activation+9.38e+00

Pos logit contributions
eln+1.714
imens+1.652
earable+1.628
enne+1.551
imates+1.514
Neg logit contributions
theless-2.116
 rhy-1.970
ymes-1.880
ingly-1.721
olla-1.679
Token_
Token ID62
Feature activation+8.41e-01

Pos logit contributions
eln+6.617
imens+6.238
earable+6.056
enne+5.823
ktop+5.724
Neg logit contributions
theless-7.576
 rhy-7.027
ymes-6.469
ingly-6.025
olla-5.761
Tokencache
Token ID23870
Feature activation+4.50e+00

Pos logit contributions
eln+0.579
imens+0.561
earable+0.556
enne+0.528
imates+0.518
Neg logit contributions
theless-0.696
 rhy-0.644
ymes-0.618
ingly-0.567
olla-0.553
Token();
Token ID9783
Feature activation+1.32e-01

Pos logit contributions
eln+3.124
imens+2.995
earable+2.943
enne+2.820
imates+2.743
Neg logit contributions
theless-3.731
 rhy-3.438
ymes-3.298
ingly-3.008
olla-2.927
Token }
Token ID1782
Feature activation-3.92e+00

Pos logit contributions
eln+0.104
imens+0.101
earable+0.100
enne+0.096
imates+0.094
Neg logit contributions
theless-0.096
 rhy-0.089
ymes-0.083
ingly-0.075
olla-0.074
Token }
Token ID1782
Feature activation-5.22e+00

Pos logit contributions
theless+2.745
 rhy+2.587
ymes+2.320
ingly+2.112
olla+2.044
Neg logit contributions
eln-3.147
earable-3.060
imens-3.058
enne-2.931
imates-2.845
Token marriage
Token ID4845
Feature activation-5.02e+00

Pos logit contributions
eln+1.072
imens+1.039
earable+1.030
enne+0.988
imates+0.976
Neg logit contributions
theless-0.923
 rhy-0.842
ymes-0.800
ingly-0.714
olla-0.702
Token.
Token ID13
Feature activation-2.34e+00

Pos logit contributions
theless+3.317
 rhy+3.083
ymes+2.906
ingly+2.647
olla+2.468
Neg logit contributions
eln-4.213
earable-4.070
imens-4.040
enne-3.912
ainted-3.796
Token It
Token ID632
Feature activation-2.44e+00

Pos logit contributions
theless+1.683
 rhy+1.549
ymes+1.472
ingly+1.345
olla+1.305
Neg logit contributions
eln-1.866
imens-1.819
earable-1.771
enne-1.700
ainted-1.664
Token is
Token ID318
Feature activation-5.94e-01

Pos logit contributions
theless+1.722
 rhy+1.589
ymes+1.490
ingly+1.341
olla+1.282
Neg logit contributions
eln-1.991
imens-1.952
earable-1.925
enne-1.854
ainted-1.799
Token discrimination
Token ID8839
Feature activation-1.68e+00

Pos logit contributions
theless+0.437
 rhy+0.404
ymes+0.382
ingly+0.346
olla+0.333
Neg logit contributions
eln-0.467
imens-0.454
earable-0.447
enne-0.431
imates-0.421
Token writ
Token ID1991
Feature activation+9.43e+00

Pos logit contributions
theless+1.171
 rhy+1.082
ymes+1.013
ingly+0.920
olla+0.879
Neg logit contributions
eln-1.392
imens-1.350
earable-1.334
enne-1.281
imates-1.257
Token into
Token ID656
Feature activation+1.45e+00

Pos logit contributions
eln+5.495
imens+4.905
enne+4.825
earable+4.786
ainted+4.468
Neg logit contributions
theless-8.737
 rhy-8.048
ymes-7.918
olla-7.162
itudinal-6.928
Token federal
Token ID2717
Feature activation-2.98e+00

Pos logit contributions
eln+1.105
imens+1.070
earable+1.063
enne+1.015
imates+1.000
Neg logit contributions
theless-1.087
 rhy-0.996
ymes-0.956
ingly-0.858
olla-0.844
Token law
Token ID1099
Feature activation-2.65e+00

Pos logit contributions
theless+1.771
 rhy+1.620
ymes+1.501
ingly+1.336
olla+1.253
Neg logit contributions
eln-2.752
imens-2.689
earable-2.659
enne-2.552
imates-2.511
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
theless+1.816
 rhy+1.686
ymes+1.574
ingly+1.423
olla+1.358
Neg logit contributions
eln-2.192
imens-2.137
earable-2.113
enne-2.044
imates-1.982
Token It
Token ID632
Feature activation+3.96e-01

Pos logit contributions
theless+0.751
 rhy+0.692
ymes+0.657
ingly+0.598
olla+0.580
Neg logit contributions
eln-0.799
imens-0.777
earable-0.762
enne-0.732
imates-0.715
Token loss
Token ID2994
Feature activation+1.87e+00

Pos logit contributions
eln+2.491
imens+2.390
earable+2.381
enne+2.272
imates+2.235
Neg logit contributions
theless-2.657
 rhy-2.438
ymes-2.349
ingly-2.100
olla-2.075
Token of
Token ID286
Feature activation+1.68e+00

Pos logit contributions
eln+1.491
imens+1.438
earable+1.429
enne+1.367
imates+1.354
Neg logit contributions
theless-1.346
 rhy-1.232
ymes-1.173
ingly-1.036
olla-1.021
Token employment
Token ID7184
Feature activation+1.72e+00

Pos logit contributions
eln+1.182
imens+1.140
earable+1.128
enne+1.077
imates+1.059
Neg logit contributions
theless-1.372
 rhy-1.267
ymes-1.217
ingly-1.107
olla-1.083
Token,
Token ID11
Feature activation+3.97e+00

Pos logit contributions
eln+1.365
imens+1.326
earable+1.311
enne+1.260
imates+1.240
Neg logit contributions
theless-1.247
 rhy-1.141
ymes-1.086
ingly-0.971
olla-0.948
Token and
Token ID290
Feature activation+2.04e+00

Pos logit contributions
eln+2.892
imens+2.782
earable+2.775
enne+2.642
imates+2.601
Neg logit contributions
theless-3.102
 rhy-2.845
ymes-2.752
ingly-2.451
olla-2.427
Token ostr
Token ID42387
Feature activation+1.09e+01

Pos logit contributions
eln+1.490
imens+1.439
earable+1.424
enne+1.359
imates+1.335
Neg logit contributions
theless-1.607
 rhy-1.485
ymes-1.421
ingly-1.286
olla-1.261
Tokenac
Token ID330
Feature activation+2.51e+00

Pos logit contributions
imens+5.885
earable+5.565
eln+5.522
imates+5.358
ainted+5.325
Neg logit contributions
theless-10.329
 rhy-9.654
ymes-9.154
ingly-8.538
olla-8.211
Tokenism
Token ID1042
Feature activation+1.97e+00

Pos logit contributions
eln+2.066
imens+2.002
earable+1.963
enne+1.903
imates+1.875
Neg logit contributions
theless-1.754
 rhy-1.600
ymes-1.513
ingly-1.345
olla-1.315
Token from
Token ID422
Feature activation+2.02e+00

Pos logit contributions
eln+1.916
imens+1.870
earable+1.852
enne+1.792
imates+1.767
Neg logit contributions
theless-1.075
 rhy-0.960
ymes-0.892
ingly-0.767
olla-0.737
Token family
Token ID1641
Feature activation+3.45e+00

Pos logit contributions
eln+1.419
imens+1.370
earable+1.357
enne+1.293
imates+1.272
Neg logit contributions
theless-1.639
 rhy-1.514
ymes-1.452
ingly-1.324
olla-1.293
Token and
Token ID290
Feature activation+3.31e+00

Pos logit contributions
eln+2.945
imens+2.860
earable+2.817
enne+2.737
imates+2.701
Neg logit contributions
theless-2.286
 rhy-2.065
ymes-1.964
ingly-1.721
olla-1.683
Token struct
Token ID2878
Feature activation-1.18e+00

Pos logit contributions
eln+0.244
imens+0.231
earable+0.227
enne+0.207
imates+0.202
Neg logit contributions
theless-0.642
 rhy-0.612
ymes-0.588
ingly-0.554
olla-0.544
Tokenuring
Token ID870
Feature activation-2.51e+00

Pos logit contributions
theless+0.898
 rhy+0.853
ymes+0.804
ingly+0.735
olla+0.714
Neg logit contributions
eln-0.869
imens-0.837
earable-0.836
enne-0.803
imates-0.780
Token kernel
Token ID9720
Feature activation+5.36e+00

Pos logit contributions
theless+2.390
 rhy+2.309
ymes+2.176
ingly+2.050
olla+1.983
Neg logit contributions
eln-1.379
imens-1.323
earable-1.321
enne-1.228
imates-1.199
Token rect
Token ID13621
Feature activation+2.54e+00

Pos logit contributions
eln+3.351
imens+3.217
earable+3.143
enne+2.995
imates+2.942
Neg logit contributions
theless-4.785
 rhy-4.328
ymes-4.235
ingly-3.870
olla-3.793
TokenK
Token ID42
Feature activation+5.70e+00

Pos logit contributions
eln+1.767
imens+1.707
earable+1.673
enne+1.607
imates+1.571
Neg logit contributions
theless-2.100
 rhy-1.942
ymes-1.852
ingly-1.697
olla-1.655
Tokenernel
Token ID7948
Feature activation+9.23e+00

Pos logit contributions
eln+3.999
imens+3.836
earable+3.709
enne+3.635
imates+3.511
Neg logit contributions
theless-4.730
 rhy-4.277
ymes-4.059
ingly-3.690
olla-3.596
Token =
Token ID796
Feature activation+2.30e+00

Pos logit contributions
eln+5.619
imens+5.317
earable+5.171
enne+5.043
imates+4.852
Neg logit contributions
theless-8.341
 rhy-7.409
ymes-7.178
ingly-6.589
olla-6.381
Token c
Token ID269
Feature activation+4.29e+00

Pos logit contributions
eln+1.528
imens+1.473
earable+1.452
enne+1.381
imates+1.351
Neg logit contributions
theless-1.966
 rhy-1.830
ymes-1.752
ingly-1.608
olla-1.572
Tokenv
Token ID85
Feature activation+4.27e-01

Pos logit contributions
eln+2.756
imens+2.665
earable+2.606
enne+2.482
imates+2.433
Neg logit contributions
theless-3.767
 rhy-3.509
ymes-3.360
ingly-3.084
olla-3.019
Token2
Token ID17
Feature activation+3.01e+00

Pos logit contributions
eln+0.302
imens+0.293
earable+0.289
enne+0.276
imates+0.270
Neg logit contributions
theless-0.342
 rhy-0.320
ymes-0.306
ingly-0.279
olla-0.273
Token.
Token ID13
Feature activation+1.22e-01

Pos logit contributions
eln+2.310
imens+2.231
earable+2.212
enne+2.117
imates+2.079
Neg logit contributions
theless-2.266
 rhy-2.082
ymes-1.978
ingly-1.785
olla-1.740
Token animal
Token ID5044
Feature activation+1.10e+00

Pos logit contributions
eln+0.521
imens+0.503
earable+0.497
enne+0.474
imates+0.465
Neg logit contributions
theless-0.587
 rhy-0.547
ymes-0.521
ingly-0.476
olla-0.463
Token,
Token ID11
Feature activation+2.38e+00

Pos logit contributions
eln+0.894
imens+0.867
earable+0.858
enne+0.824
imates+0.809
Neg logit contributions
theless-0.783
 rhy-0.720
ymes-0.681
ingly-0.613
olla-0.595
Token please
Token ID3387
Feature activation+5.43e+00

Pos logit contributions
eln+1.810
imens+1.757
earable+1.736
enne+1.661
imates+1.634
Neg logit contributions
theless-1.797
 rhy-1.642
ymes-1.569
ingly-1.416
olla-1.387
Token do
Token ID466
Feature activation+1.15e+00

Pos logit contributions
eln+3.896
imens+3.829
earable+3.725
enne+3.596
imates+3.555
Neg logit contributions
theless-4.146
 rhy-3.696
ymes-3.636
ingly-3.229
olla-3.216
Token seek
Token ID5380
Feature activation+2.88e+00

Pos logit contributions
eln+0.872
imens+0.844
earable+0.833
enne+0.798
imates+0.784
Neg logit contributions
theless-0.879
 rhy-0.809
ymes-0.771
ingly-0.697
olla-0.681
Token professional
Token ID4708
Feature activation+1.03e+01

Pos logit contributions
eln+1.867
imens+1.796
earable+1.770
enne+1.683
imates+1.645
Neg logit contributions
theless-2.517
 rhy-2.344
ymes-2.245
ingly-2.064
olla-2.019
Token help
Token ID1037
Feature activation+6.43e+00

Pos logit contributions
eln+4.800
imens+4.589
earable+4.372
enne+4.166
handled+4.166
Neg logit contributions
theless-10.289
 rhy-9.263
ymes-9.138
ingly-8.487
olla-8.394
Token.
Token ID13
Feature activation+2.99e+00

Pos logit contributions
eln+4.634
imens+4.541
earable+4.405
enne+4.246
imates+4.206
Neg logit contributions
theless-4.988
 rhy-4.557
ymes-4.321
ingly-3.925
olla-3.854
Token Continue
Token ID10054
Feature activation-3.09e-01

Pos logit contributions
eln+2.198
imens+2.137
earable+2.111
enne+2.019
imates+1.988
Neg logit contributions
theless-2.329
 rhy-2.143
ymes-2.040
ingly-1.843
olla-1.799
Token Reading
Token ID11725
Feature activation-5.39e+00

Pos logit contributions
theless+0.208
 rhy+0.192
ymes+0.180
ingly+0.162
olla+0.156
Neg logit contributions
eln-0.262
imens-0.254
earable-0.252
enne-0.242
imates-0.238
Token Below
Token ID10383
Feature activation-4.75e+00

Pos logit contributions
theless+3.803
 rhy+3.566
ymes+3.361
ingly+3.107
olla+2.987
Neg logit contributions
eln-4.210
imens-4.095
earable-4.056
enne-3.847
ainted-3.790
Token certain
Token ID1728
Feature activation-3.79e+00

Pos logit contributions
theless+1.227
 rhy+1.155
ymes+1.077
ingly+0.976
olla+0.952
Neg logit contributions
eln-1.251
imens-1.234
earable-1.214
enne-1.162
imates-1.143
Token groups
Token ID2628
Feature activation+7.05e-03

Pos logit contributions
theless+2.917
 rhy+2.750
ymes+2.606
ingly+2.375
olla+2.294
Neg logit contributions
eln-2.741
imens-2.679
earable-2.635
enne-2.530
imates-2.491
Token in
Token ID287
Feature activation-1.63e+00

Pos logit contributions
eln+0.006
imens+0.005
earable+0.005
enne+0.005
imates+0.005
Neg logit contributions
theless-0.005
 rhy-0.005
ymes-0.005
ingly-0.004
olla-0.004
Token certain
Token ID1728
Feature activation-3.01e+00

Pos logit contributions
theless+1.244
 rhy+1.170
ymes+1.093
ingly+1.000
olla+0.976
Neg logit contributions
eln-1.214
imens-1.191
earable-1.174
enne-1.122
imates-1.099
Token countries
Token ID2678
Feature activation-1.84e+00

Pos logit contributions
theless+2.522
 rhy+2.392
ymes+2.261
ingly+2.088
olla+2.012
Neg logit contributions
eln-1.995
imens-1.944
earable-1.921
enne-1.811
imates-1.784
Token.
Token ID13
Feature activation+3.55e+00

Pos logit contributions
theless+1.250
 rhy+1.168
ymes+1.098
ingly+0.995
olla+0.947
Neg logit contributions
eln-1.518
imens-1.479
earable-1.469
enne-1.401
imates-1.372
Token
Token ID447
Feature activation+4.06e+00

Pos logit contributions
eln+2.580
imens+2.484
earable+2.470
enne+2.374
imates+2.313
Neg logit contributions
theless-2.799
 rhy-2.586
ymes-2.467
ingly-2.231
olla-2.153
Token
Token ID251
Feature activation+3.15e+00

Pos logit contributions
eln+4.253
earable+4.129
imens+4.114
enne+4.010
imates+3.936
Neg logit contributions
theless-1.908
 rhy-1.607
ymes-1.468
ingly-1.216
olla-1.137
Token While
Token ID2893
Feature activation+2.02e+00

Pos logit contributions
eln+2.265
imens+2.177
earable+2.164
enne+2.080
imates+2.026
Neg logit contributions
theless-2.511
 rhy-2.330
ymes-2.217
ingly-2.012
olla-1.950
Token she
Token ID673
Feature activation+1.91e+00

Pos logit contributions
eln+1.510
imens+1.457
earable+1.439
enne+1.382
imates+1.354
Neg logit contributions
theless-1.563
 rhy-1.440
ymes-1.375
ingly-1.245
olla-1.212
Token does
Token ID857
Feature activation+1.29e+00

Pos logit contributions
eln+1.544
imens+1.497
earable+1.480
enne+1.425
imates+1.400
Neg logit contributions
theless-1.362
 rhy-1.250
ymes-1.185
ingly-1.065
olla-1.036
Token as
Token ID355
Feature activation-9.25e-01

Pos logit contributions
eln+0.263
imens+0.255
earable+0.252
enne+0.242
imates+0.237
Neg logit contributions
theless-0.248
 rhy-0.229
ymes-0.217
ingly-0.196
olla-0.191
Token with
Token ID351
Feature activation-2.84e+00

Pos logit contributions
theless+0.686
 rhy+0.638
ymes+0.601
ingly+0.548
olla+0.532
Neg logit contributions
eln-0.717
imens-0.699
earable-0.691
enne-0.658
imates-0.645
Token many
Token ID867
Feature activation-4.43e+00

Pos logit contributions
theless+2.024
 rhy+1.919
ymes+1.770
ingly+1.628
olla+1.571
Neg logit contributions
eln-2.237
imens-2.196
earable-2.173
enne-2.056
imates-2.016
Token parts
Token ID3354
Feature activation-4.53e-01

Pos logit contributions
theless+3.301
 rhy+3.104
ymes+2.862
ingly+2.623
olla+2.543
Neg logit contributions
eln-3.368
imens-3.285
earable-3.242
enne-3.116
imates-3.034
Token of
Token ID286
Feature activation+2.20e+00

Pos logit contributions
theless+0.317
 rhy+0.288
ymes+0.274
ingly+0.244
olla+0.238
Neg logit contributions
eln-0.374
imens-0.362
earable-0.359
enne-0.345
imates-0.339
Token a
Token ID257
Feature activation+2.48e+00

Pos logit contributions
eln+1.686
imens+1.630
earable+1.612
enne+1.544
imates+1.513
Neg logit contributions
theless-1.655
 rhy-1.515
ymes-1.449
ingly-1.310
olla-1.275
Token campaign
Token ID1923
Feature activation-2.62e+00

Pos logit contributions
eln+2.024
imens+1.961
earable+1.941
enne+1.866
imates+1.828
Neg logit contributions
theless-1.745
 rhy-1.587
ymes-1.513
ingly-1.355
olla-1.318
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
theless+1.736
 rhy+1.650
ymes+1.523
ingly+1.382
olla+1.321
Neg logit contributions
eln-2.196
imens-2.147
earable-2.114
imates-2.029
enne-2.024
Token it
Token ID340
Feature activation+2.69e+00

Pos logit contributions
theless+1.116
 rhy+1.052
ymes+0.980
ingly+0.889
olla+0.861
Neg logit contributions
eln-1.233
imens-1.201
earable-1.189
enne-1.136
imates-1.115
Token is
Token ID318
Feature activation+3.70e+00

Pos logit contributions
eln+2.075
imens+2.010
earable+1.984
enne+1.905
imates+1.875
Neg logit contributions
theless-2.000
 rhy-1.836
ymes-1.747
ingly-1.579
olla-1.542
Token obvious
Token ID3489
Feature activation+1.74e+00

Pos logit contributions
eln+2.679
imens+2.579
earable+2.556
enne+2.440
imates+2.394
Neg logit contributions
theless-2.930
 rhy-2.682
ymes-2.578
ingly-2.336
olla-2.299
Token itself
Token ID2346
Feature activation-3.29e+00

Pos logit contributions
theless+1.893
 rhy+1.766
ymes+1.670
ingly+1.506
olla+1.441
Neg logit contributions
eln-2.405
imens-2.327
earable-2.299
enne-2.184
imates-2.179
Token,
Token ID11
Feature activation+1.21e+00

Pos logit contributions
theless+2.128
 rhy+1.987
ymes+1.876
ingly+1.666
olla+1.594
Neg logit contributions
eln-2.798
imens-2.738
earable-2.704
enne-2.577
imates-2.571
Token doing
Token ID1804
Feature activation-2.88e+00

Pos logit contributions
eln+0.932
imens+0.904
earable+0.893
enne+0.854
imates+0.840
Neg logit contributions
theless-0.909
 rhy-0.841
ymes-0.796
ingly-0.721
olla-0.702
Token it
Token ID340
Feature activation-1.86e+00

Pos logit contributions
theless+2.107
 rhy+1.972
ymes+1.855
ingly+1.686
olla+1.635
Neg logit contributions
eln-2.213
imens-2.188
earable-2.138
enne-2.045
imates-2.030
Token well
Token ID880
Feature activation+2.56e+00

Pos logit contributions
theless+1.499
 rhy+1.387
ymes+1.311
ingly+1.210
olla+1.178
Neg logit contributions
eln-1.338
imens-1.304
earable-1.284
enne-1.203
imates-1.200
Token on
Token ID319
Feature activation+3.46e+00

Pos logit contributions
eln+1.994
imens+1.933
earable+1.910
enne+1.832
imates+1.798
Neg logit contributions
theless-1.893
 rhy-1.745
ymes-1.656
ingly-1.496
olla-1.456
Token anything
Token ID1997
Feature activation-5.17e+00

Pos logit contributions
eln+2.466
imens+2.353
earable+2.330
enne+2.238
imates+2.177
Neg logit contributions
theless-2.772
 rhy-2.556
ymes-2.470
ingly-2.232
olla-2.182
Token other
Token ID584
Feature activation+4.18e+00

Pos logit contributions
theless+3.447
 rhy+3.186
ymes+2.956
ingly+2.626
olla+2.533
Neg logit contributions
eln-4.353
imens-4.301
earable-4.188
imates-3.997
enne-3.991
Token than
Token ID621
Feature activation+2.56e+00

Pos logit contributions
eln+3.504
imens+3.411
earable+3.378
enne+3.247
imates+3.180
Neg logit contributions
theless-2.833
 rhy-2.575
ymes-2.435
ingly-2.151
olla-2.097
Token 8
Token ID807
Feature activation+3.06e+00

Pos logit contributions
eln+1.854
imens+1.786
earable+1.769
enne+1.695
imates+1.651
Neg logit contributions
theless-2.026
 rhy-1.870
ymes-1.796
ingly-1.628
olla-1.591
Token-
Token ID12
Feature activation+1.23e+00

Pos logit contributions
eln+2.428
imens+2.340
earable+2.316
enne+2.229
imates+2.190
Neg logit contributions
theless-2.198
 rhy-2.017
ymes-1.900
ingly-1.730
olla-1.660
Token that
Token ID326
Feature activation+3.62e+00

Pos logit contributions
eln+3.962
imens+3.860
earable+3.811
enne+3.684
imates+3.625
Neg logit contributions
theless-4.128
 rhy-3.663
ymes-3.544
ingly-3.173
olla-3.117
Token had
Token ID550
Feature activation+2.36e+00

Pos logit contributions
eln+2.742
imens+2.659
earable+2.624
enne+2.532
imates+2.483
Neg logit contributions
theless-2.721
 rhy-2.478
ymes-2.363
ingly-2.139
olla-2.106
Token sidelined
Token ID39632
Feature activation+2.38e+00

Pos logit contributions
eln+1.754
imens+1.698
earable+1.676
enne+1.605
imates+1.573
Neg logit contributions
theless-1.826
 rhy-1.687
ymes-1.606
ingly-1.458
olla-1.423
Token him
Token ID683
Feature activation+1.56e+00

Pos logit contributions
eln+1.894
imens+1.844
earable+1.820
enne+1.752
imates+1.713
Neg logit contributions
theless-1.700
 rhy-1.553
ymes-1.479
ingly-1.321
olla-1.294
Token since
Token ID1201
Feature activation+2.87e+00

Pos logit contributions
eln+1.166
imens+1.129
earable+1.115
enne+1.066
imates+1.045
Neg logit contributions
theless-1.210
 rhy-1.121
ymes-1.064
ingly-0.969
olla-0.943
Token early
Token ID1903
Feature activation+1.72e+00

Pos logit contributions
eln+2.089
imens+2.017
earable+2.000
enne+1.905
imates+1.872
Neg logit contributions
theless-2.267
 rhy-2.086
ymes-2.003
ingly-1.812
olla-1.776
Token July
Token ID2901
Feature activation+3.89e+00

Pos logit contributions
eln+1.157
imens+1.116
earable+1.102
enne+1.050
imates+1.026
Neg logit contributions
theless-1.458
 rhy-1.355
ymes-1.300
ingly-1.188
olla-1.164
Token.
Token ID13
Feature activation+2.97e+00

Pos logit contributions
eln+2.974
imens+2.876
earable+2.837
enne+2.740
imates+2.686
Neg logit contributions
theless-2.912
 rhy-2.672
ymes-2.563
ingly-2.280
olla-2.236
Token However
Token ID2102
Feature activation+9.11e-01

Pos logit contributions
eln+2.422
imens+2.358
earable+2.331
enne+2.246
imates+2.204
Neg logit contributions
theless-2.061
 rhy-1.895
ymes-1.788
ingly-1.601
olla-1.545
Token,
Token ID11
Feature activation+4.56e-01

Pos logit contributions
eln+0.725
imens+0.703
earable+0.694
enne+0.667
imates+0.653
Neg logit contributions
theless-0.656
 rhy-0.602
ymes-0.570
ingly-0.515
olla-0.500
Token there
Token ID612
Feature activation+2.75e+00

Pos logit contributions
eln+0.345
imens+0.334
earable+0.330
enne+0.316
imates+0.310
Neg logit contributions
theless-0.349
 rhy-0.322
ymes-0.306
ingly-0.278
olla-0.271
Token option
Token ID3038
Feature activation+3.09e+00

Pos logit contributions
eln+0.140
imens+0.135
earable+0.133
enne+0.126
imates+0.124
Neg logit contributions
theless-0.168
 rhy-0.157
ymes-0.149
ingly-0.137
olla-0.133
Token to
Token ID284
Feature activation-1.27e+00

Pos logit contributions
eln+2.189
imens+2.110
earable+2.082
enne+1.998
imates+1.960
Neg logit contributions
theless-2.491
 rhy-2.293
ymes-2.206
ingly-1.990
olla-1.962
Token "
Token ID366
Feature activation+3.60e+00

Pos logit contributions
theless+0.947
 rhy+0.886
ymes+0.824
ingly+0.754
olla+0.731
Neg logit contributions
eln-0.980
imens-0.954
earable-0.940
enne-0.902
imates-0.889
Tokenc
Token ID66
Feature activation+6.73e+00

Pos logit contributions
eln+2.486
imens+2.395
earable+2.363
enne+2.257
imates+2.196
Neg logit contributions
theless-2.976
 rhy-2.767
ymes-2.647
ingly-2.400
olla-2.351
Token:\
Token ID7479
Feature activation+1.10e+00

Pos logit contributions
eln+5.089
imens+4.955
earable+4.874
enne+4.704
imates+4.553
Neg logit contributions
theless-4.987
 rhy-4.607
ymes-4.417
ingly-3.919
olla-3.814
Tokentmp
Token ID22065
Feature activation+7.28e+00

Pos logit contributions
eln+0.867
imens+0.841
earable+0.831
enne+0.798
imates+0.783
Neg logit contributions
theless-0.798
 rhy-0.735
ymes-0.697
ingly-0.628
olla-0.611
Token".
Token ID1911
Feature activation-5.13e+00

Pos logit contributions
eln+5.303
imens+5.150
earable+5.019
enne+4.834
imates+4.724
Neg logit contributions
theless-5.533
 rhy-5.112
ymes-4.908
ingly-4.349
olla-4.283
Token Only
Token ID5514
Feature activation-1.11e+00

Pos logit contributions
theless+3.666
 rhy+3.467
ymes+3.209
ingly+2.918
olla+2.822
Neg logit contributions
eln-3.936
imens-3.864
earable-3.788
enne-3.601
imates-3.562
Token a
Token ID257
Feature activation+4.24e-01

Pos logit contributions
theless+0.837
 rhy+0.784
ymes+0.734
ingly+0.669
olla+0.646
Neg logit contributions
eln-0.846
imens-0.820
earable-0.815
enne-0.774
imates-0.759
Token single
Token ID2060
Feature activation+3.03e-01

Pos logit contributions
eln+0.320
imens+0.311
earable+0.308
enne+0.293
imates+0.289
Neg logit contributions
theless-0.323
 rhy-0.302
ymes-0.284
ingly-0.259
olla-0.252
Token back
Token ID736
Feature activation-2.64e+00

Pos logit contributions
eln+0.216
imens+0.210
earable+0.207
enne+0.197
imates+0.194
Neg logit contributions
theless-0.244
 rhy-0.228
ymes-0.216
ingly-0.198
olla-0.192
Token might
Token ID1244
Feature activation+1.01e+00

Pos logit contributions
theless+0.678
 rhy+0.632
ymes+0.593
ingly+0.543
olla+0.521
Neg logit contributions
eln-0.718
imens-0.698
earable-0.688
enne-0.660
imates-0.647
Token have
Token ID423
Feature activation-2.11e+00

Pos logit contributions
eln+0.790
imens+0.766
earable+0.757
enne+0.726
imates+0.713
Neg logit contributions
theless-0.738
 rhy-0.680
ymes-0.645
ingly-0.582
olla-0.566
Token a
Token ID257
Feature activation+5.79e-01

Pos logit contributions
theless+1.636
 rhy+1.534
ymes+1.430
ingly+1.315
olla+1.279
Neg logit contributions
eln-1.577
imens-1.547
earable-1.514
enne-1.449
imates-1.415
Token flying
Token ID7348
Feature activation-1.23e+00

Pos logit contributions
eln+0.433
imens+0.419
earable+0.414
enne+0.396
imates+0.388
Neg logit contributions
theless-0.447
 rhy-0.414
ymes-0.393
ingly-0.357
olla-0.348
Token model
Token ID2746
Feature activation+1.45e+00

Pos logit contributions
theless+0.953
 rhy+0.890
ymes+0.835
ingly+0.765
olla+0.744
Neg logit contributions
eln-0.916
imens-0.887
earable-0.875
enne-0.837
imates-0.825
Token that
Token ID326
Feature activation-1.82e+00

Pos logit contributions
eln+1.164
imens+1.128
earable+1.117
enne+1.073
imates+1.052
Neg logit contributions
theless-1.037
 rhy-0.949
ymes-0.904
ingly-0.809
olla-0.789
Token is
Token ID318
Feature activation-8.65e-01

Pos logit contributions
theless+1.246
 rhy+1.151
ymes+1.073
ingly+0.968
olla+0.923
Neg logit contributions
eln-1.523
imens-1.488
earable-1.475
enne-1.416
imates-1.375
Token
Token ID198
Feature activation-3.98e-01

Pos logit contributions
theless+0.600
 rhy+0.553
ymes+0.518
ingly+0.468
olla+0.449
Neg logit contributions
eln-0.712
imens-0.697
earable-0.687
enne-0.662
imates-0.648
Token
Token ID198
Feature activation+1.67e-01

Pos logit contributions
theless+0.234
 rhy+0.213
ymes+0.199
ingly+0.176
olla+0.168
Neg logit contributions
eln-0.369
imens-0.360
earable-0.356
enne-0.343
imates-0.338
Tokenyour
Token ID14108
Feature activation+3.82e-01

Pos logit contributions
eln+0.116
imens+0.113
earable+0.111
enne+0.106
imates+0.104
Neg logit contributions
theless-0.137
 rhy-0.126
ymes-0.121
ingly-0.111
olla-0.108
Token pride
Token ID11293
Feature activation-2.29e+00

Pos logit contributions
eln+0.277
imens+0.268
earable+0.265
enne+0.253
imates+0.248
Neg logit contributions
theless-0.304
 rhy-0.282
ymes-0.268
ingly-0.245
olla-0.239
Token-
Token ID12
Feature activation+4.35e+00

Pos logit contributions
eln+0.558
imens+0.541
earable+0.535
enne+0.513
imates+0.503
Neg logit contributions
theless-0.526
 rhy-0.484
ymes-0.459
ingly-0.415
olla-0.404
Tokenof
Token ID1659
Feature activation+2.31e+00

Pos logit contributions
eln+3.056
imens+2.961
earable+2.905
enne+2.805
handled+2.760
Neg logit contributions
theless-3.520
 rhy-3.273
ymes-3.137
ingly-2.813
olla-2.775
Token-
Token ID12
Feature activation+2.05e+00

Pos logit contributions
eln+1.877
imens+1.812
earable+1.795
enne+1.731
imates+1.697
Neg logit contributions
theless-1.643
 rhy-1.487
ymes-1.423
ingly-1.261
olla-1.242
Tokenwar
Token ID5767
Feature activation+3.24e+00

Pos logit contributions
eln+1.250
imens+1.197
earable+1.173
enne+1.121
imates+1.086
Neg logit contributions
theless-1.878
 rhy-1.753
ymes-1.683
ingly-1.555
olla-1.522
Token,
Token ID11
Feature activation-4.81e-01

Pos logit contributions
eln+2.501
imens+2.414
earable+2.387
enne+2.306
imates+2.247
Neg logit contributions
theless-2.432
 rhy-2.199
ymes-2.113
ingly-1.884
olla-1.860
Token the
Token ID262
Feature activation-1.33e+00

Pos logit contributions
theless+0.336
 rhy+0.310
ymes+0.292
ingly+0.263
olla+0.253
Neg logit contributions
eln-0.397
imens-0.385
earable-0.380
enne-0.365
imates-0.358
Token most
Token ID749
Feature activation-2.37e+00

Pos logit contributions
theless+1.017
 rhy+0.954
ymes+0.895
ingly+0.815
olla+0.789
Neg logit contributions
eln-0.993
imens-0.962
earable-0.952
enne-0.908
imates-0.891
Token complex
Token ID3716
Feature activation-3.96e+00

Pos logit contributions
theless+1.894
 rhy+1.796
ymes+1.687
ingly+1.572
olla+1.474
Neg logit contributions
eln-1.687
imens-1.646
earable-1.598
enne-1.544
imates-1.507
Token machine
Token ID4572
Feature activation+6.45e-01

Pos logit contributions
theless+2.936
 rhy+2.792
ymes+2.569
ingly+2.403
olla+2.247
Neg logit contributions
eln-3.038
imens-2.952
earable-2.881
enne-2.762
imates-2.714
Token his
Token ID465
Feature activation-5.67e-01

Pos logit contributions
eln+0.532
imens+0.517
earable+0.511
enne+0.491
imates+0.483
Neg logit contributions
theless-0.449
 rhy-0.411
ymes-0.389
ingly-0.348
olla-0.338
Token age
Token ID2479
Feature activation-4.60e-01

Pos logit contributions
theless+0.447
 rhy+0.416
ymes+0.395
ingly+0.360
olla+0.350
Neg logit contributions
eln-0.414
imens-0.401
earable-0.396
enne-0.378
imates-0.370
Token free
Token ID1479
Feature activation-1.42e+00

Pos logit contributions
theless+0.871
 rhy+0.814
ymes+0.766
ingly+0.701
olla+0.683
Neg logit contributions
eln-0.846
imens-0.823
earable-0.813
enne-0.774
imates-0.762
Token,
Token ID11
Feature activation-3.46e+00

Pos logit contributions
theless+1.010
 rhy+0.931
ymes+0.880
ingly+0.798
olla+0.769
Neg logit contributions
eln-1.141
imens-1.111
earable-1.100
enne-1.045
imates-1.035
Token hur
Token ID6990
Feature activation-5.44e+00

Pos logit contributions
theless+2.444
 rhy+2.290
ymes+2.119
ingly+1.931
olla+1.874
Neg logit contributions
eln-2.788
imens-2.705
earable-2.646
enne-2.538
imates-2.521
Tokenray
Token ID2433
Feature activation-5.61e+00

Pos logit contributions
theless+3.443
 rhy+3.192
ymes+2.941
ingly+2.660
olla+2.574
Neg logit contributions
eln-4.648
earable-4.591
imens-4.546
enne-4.322
imates-4.315
Token!
Token ID0
Feature activation-1.64e+00

Pos logit contributions
theless+3.749
 rhy+3.564
ymes+3.294
ingly+3.031
olla+2.893
Neg logit contributions
eln-4.522
imens-4.453
earable-4.409
imates-4.162
enne-4.119
Token Your
Token ID3406
Feature activation-4.31e-01

Pos logit contributions
theless+1.078
 rhy+0.991
ymes+0.923
ingly+0.823
olla+0.801
Neg logit contributions
eln-1.416
imens-1.379
earable-1.357
enne-1.307
imates-1.289
Token beloved
Token ID14142
Feature activation+1.40e+00

Pos logit contributions
theless+0.347
 rhy+0.325
ymes+0.308
ingly+0.282
olla+0.276
Neg logit contributions
eln-0.306
imens-0.297
earable-0.292
enne-0.279
imates-0.273
Token lic
Token ID3476
Feature activation+1.12e+00

Pos logit contributions
eln+0.994
imens+0.961
earable+0.948
enne+0.905
imates+0.886
Neg logit contributions
theless-1.133
 rhy-1.052
ymes-1.003
ingly-0.916
olla-0.894
Tokenhess
Token ID33979
Feature activation+3.96e+00

Pos logit contributions
eln+0.734
imens+0.707
earable+0.697
enne+0.662
imates+0.648
Neg logit contributions
theless-0.963
 rhy-0.899
ymes-0.862
ingly-0.792
olla-0.775
Token team
Token ID1074
Feature activation-1.01e+00

Pos logit contributions
eln+2.966
imens+2.843
earable+2.814
enne+2.708
imates+2.652
Neg logit contributions
theless-3.051
 rhy-2.802
ymes-2.657
ingly-2.407
olla-2.349
Token has
Token ID468
Feature activation-5.79e-01

Pos logit contributions
theless+0.718
 rhy+0.663
ymes+0.628
ingly+0.570
olla+0.551
Neg logit contributions
eln-0.805
imens-0.784
earable-0.779
enne-0.739
imates-0.728
Token test
Token ID1332
Feature activation-2.23e+00

Pos logit contributions
theless+0.813
 rhy+0.754
ymes+0.715
ingly+0.651
olla+0.631
Neg logit contributions
eln-0.781
imens-0.759
earable-0.748
enne-0.717
imates-0.701
Token chamber
Token ID11847
Feature activation+4.95e-01

Pos logit contributions
theless+1.809
 rhy+1.698
ymes+1.615
ingly+1.482
olla+1.422
Neg logit contributions
eln-1.564
earable-1.508
imens-1.500
enne-1.427
imates-1.379
Token monitors
Token ID19374
Feature activation+5.16e-01

Pos logit contributions
eln+0.371
imens+0.359
earable+0.355
enne+0.340
imates+0.333
Neg logit contributions
theless-0.381
 rhy-0.353
ymes-0.335
ingly-0.305
olla-0.296
Token (
Token ID357
Feature activation+2.42e+00

Pos logit contributions
eln+0.404
imens+0.391
earable+0.386
enne+0.370
imates+0.362
Neg logit contributions
theless-0.382
 rhy-0.354
ymes-0.334
ingly-0.304
olla-0.292
Token56
Token ID3980
Feature activation+1.71e+00

Pos logit contributions
eln+1.764
imens+1.706
earable+1.686
enne+1.612
imates+1.579
Neg logit contributions
theless-1.907
 rhy-1.767
ymes-1.682
ingly-1.531
olla-1.493
Token)
Token ID8
Feature activation-2.79e+00

Pos logit contributions
eln+1.261
imens+1.222
earable+1.203
enne+1.151
imates+1.133
Neg logit contributions
theless-1.330
 rhy-1.222
ymes-1.166
ingly-1.056
olla-1.033
Token
Token ID198
Feature activation+2.21e-01

Pos logit contributions
theless+1.634
 rhy+1.484
ymes+1.378
ingly+1.208
olla+1.169
Neg logit contributions
eln-2.591
imens-2.528
earable-2.495
enne-2.416
imates-2.376
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
eln+0.203
imens+0.198
earable+0.196
enne+0.189
imates+0.186
Neg logit contributions
theless-0.132
 rhy-0.120
ymes-0.112
ingly-0.099
olla-0.095
TokenCele
Token ID42741
Feature activation+1.75e+00

Pos logit contributions
eln+0.693
imens+0.668
earable+0.658
enne+0.626
imates+0.612
Neg logit contributions
theless-0.919
 rhy-0.858
ymes-0.821
ingly-0.756
olla-0.739
Tokenbrate
Token ID40804
Feature activation+3.52e+00

Pos logit contributions
eln+1.168
imens+1.123
earable+1.111
enne+1.064
imates+1.034
Neg logit contributions
theless-1.481
 rhy-1.385
ymes-1.329
ingly-1.206
olla-1.188
Token your
Token ID534
Feature activation+4.39e+00

Pos logit contributions
eln+2.590
imens+2.494
earable+2.482
enne+2.378
imates+2.324
Neg logit contributions
theless-2.713
 rhy-2.498
ymes-2.411
ingly-2.164
olla-2.132
Token prem
Token ID4199
Feature activation+3.25e+00

Pos logit contributions
theless+0.743
 rhy+0.692
ymes+0.648
ingly+0.590
olla+0.565
Neg logit contributions
eln-0.856
imens-0.833
earable-0.820
enne-0.790
imates-0.771
Tokenised
Token ID1417
Feature activation-2.93e-01

Pos logit contributions
eln+2.674
imens+2.598
earable+2.568
enne+2.470
imates+2.426
Neg logit contributions
theless-2.269
 rhy-2.080
ymes-1.967
ingly-1.763
olla-1.711
Token on
Token ID319
Feature activation-1.97e+00

Pos logit contributions
theless+0.199
 rhy+0.182
ymes+0.172
ingly+0.154
olla+0.149
Neg logit contributions
eln-0.246
imens-0.239
earable-0.236
enne-0.227
imates-0.223
Token a
Token ID257
Feature activation-4.34e+00

Pos logit contributions
theless+1.470
 rhy+1.381
ymes+1.284
ingly+1.170
olla+1.126
Neg logit contributions
eln-1.518
imens-1.482
earable-1.459
enne-1.398
imates-1.363
Token vast
Token ID5909
Feature activation-3.06e+00

Pos logit contributions
theless+3.276
 rhy+3.136
ymes+2.862
ingly+2.654
olla+2.539
Neg logit contributions
eln-3.218
imens-3.165
earable-3.114
enne-2.931
imates-2.908
Token m
Token ID285
Feature activation-2.71e+00

Pos logit contributions
theless+2.312
 rhy+2.177
ymes+2.027
ingly+1.882
olla+1.795
Neg logit contributions
eln-2.303
imens-2.259
earable-2.210
enne-2.097
imates-2.063
Tokeniscal
Token ID7860
Feature activation+5.68e-01

Pos logit contributions
theless+2.043
 rhy+1.915
ymes+1.820
ingly+1.682
olla+1.642
Neg logit contributions
eln-2.041
imens-1.954
earable-1.941
enne-1.840
imates-1.814
Tokenculation
Token ID14902
Feature activation-3.50e+00

Pos logit contributions
eln+0.420
imens+0.407
earable+0.403
enne+0.385
imates+0.376
Neg logit contributions
theless-0.443
 rhy-0.410
ymes-0.389
ingly-0.355
olla-0.347
Token about
Token ID546
Feature activation+2.77e+00

Pos logit contributions
theless+2.259
 rhy+2.123
ymes+1.947
ingly+1.791
olla+1.690
Neg logit contributions
eln-3.005
imens-2.945
earable-2.897
enne-2.778
imates-2.715
Token the
Token ID262
Feature activation+2.28e+00

Pos logit contributions
eln+2.049
imens+1.978
earable+1.959
enne+1.878
imates+1.847
Neg logit contributions
theless-2.141
 rhy-1.964
ymes-1.893
ingly-1.704
olla-1.676
Token origins
Token ID15587
Feature activation-3.33e+00

Pos logit contributions
eln+1.632
imens+1.579
earable+1.557
enne+1.492
imates+1.462
Neg logit contributions
theless-1.822
 rhy-1.679
ymes-1.612
ingly-1.465
olla-1.436
Token disrupt
Token ID10114
Feature activation-5.89e+00

Pos logit contributions
eln+0.684
imens+0.661
earable+0.653
enne+0.621
imates+0.607
Neg logit contributions
theless-0.818
 rhy-0.768
ymes-0.727
ingly-0.669
olla-0.649
Token the
Token ID262
Feature activation-1.90e+00

Pos logit contributions
theless+4.168
 rhy+3.939
ymes+3.637
ingly+3.442
olla+3.211
Neg logit contributions
eln-4.562
imens-4.522
earable-4.383
enne-4.264
ktop-4.152
Token availability
Token ID11500
Feature activation+3.37e-01

Pos logit contributions
theless+1.500
 rhy+1.407
ymes+1.317
ingly+1.212
olla+1.173
Neg logit contributions
eln-1.384
imens-1.349
earable-1.324
enne-1.275
imates-1.237
Token of
Token ID286
Feature activation+9.45e-01

Pos logit contributions
eln+0.278
imens+0.270
earable+0.267
enne+0.257
imates+0.253
Neg logit contributions
theless-0.235
 rhy-0.214
ymes-0.203
ingly-0.181
olla-0.176
Token child
Token ID1200
Feature activation-6.77e+00

Pos logit contributions
eln+0.695
imens+0.672
earable+0.664
enne+0.635
imates+0.623
Neg logit contributions
theless-0.740
 rhy-0.685
ymes-0.653
ingly-0.593
olla-0.579
Token sexual
Token ID3206
Feature activation-7.65e+00

Pos logit contributions
theless+4.907
 rhy+4.616
ymes+4.215
ingly+4.001
olla+3.688
Neg logit contributions
eln-5.282
earable-5.084
imens-5.078
enne-4.842
ktop-4.728
Token abuse
Token ID5076
Feature activation+7.74e-01

Pos logit contributions
theless+4.807
 rhy+4.563
ymes+4.063
ingly+3.771
olla+3.464
Neg logit contributions
eln-6.636
earable-6.494
imens-6.460
enne-6.096
imates-6.008
Token content
Token ID2695
Feature activation+1.23e+00

Pos logit contributions
eln+0.546
imens+0.528
earable+0.521
enne+0.498
imates+0.488
Neg logit contributions
theless-0.630
 rhy-0.582
ymes-0.557
ingly-0.508
olla-0.496
Token in
Token ID287
Feature activation-9.77e-01

Pos logit contributions
eln+0.982
imens+0.952
earable+0.941
enne+0.903
imates+0.886
Neg logit contributions
theless-0.893
 rhy-0.822
ymes-0.778
ingly-0.702
olla-0.682
Token Australia
Token ID4505
Feature activation-1.73e+00

Pos logit contributions
theless+0.735
 rhy+0.684
ymes+0.641
ingly+0.587
olla+0.569
Neg logit contributions
eln-0.749
imens-0.728
earable-0.718
enne-0.690
imates-0.675
Token,
Token ID11
Feature activation+8.70e-01

Pos logit contributions
theless+1.198
 rhy+1.109
ymes+1.040
ingly+0.947
olla+0.908
Neg logit contributions
eln-1.427
imens-1.382
earable-1.370
enne-1.313
imates-1.288
Token.
Token ID13
Feature activation-2.18e-01

Pos logit contributions
eln+4.166
imens+4.018
earable+3.990
enne+3.840
imates+3.797
Neg logit contributions
theless-4.478
 rhy-4.077
ymes-3.874
ingly-3.472
olla-3.393
Token [
Token ID685
Feature activation-3.93e+00

Pos logit contributions
theless+0.156
 rhy+0.144
ymes+0.136
ingly+0.123
olla+0.119
Neg logit contributions
eln-0.175
imens-0.170
earable-0.168
enne-0.161
imates-0.158
Tokencolor
Token ID8043
Feature activation+4.82e+00

Pos logit contributions
theless+2.905
 rhy+2.696
ymes+2.551
ingly+2.332
olla+2.287
Neg logit contributions
eln-2.968
imens-2.887
earable-2.863
enne-2.720
imates-2.698
Token=
Token ID28
Feature activation+2.35e+00

Pos logit contributions
eln+3.035
imens+2.959
earable+2.924
enne+2.800
imates+2.738
Neg logit contributions
theless-4.239
 rhy-3.918
ymes-3.752
ingly-3.386
olla-3.344
Tokenred
Token ID445
Feature activation+3.43e+00

Pos logit contributions
eln+1.915
imens+1.852
earable+1.827
enne+1.770
imates+1.724
Neg logit contributions
theless-1.657
 rhy-1.519
ymes-1.445
ingly-1.282
olla-1.247
Token][
Token ID7131
Feature activation-1.13e+01

Pos logit contributions
eln+3.332
imens+3.274
earable+3.242
enne+3.135
imates+3.077
Neg logit contributions
theless-1.821
 rhy-1.597
ymes-1.532
ingly-1.289
olla-1.241
Tokenb
Token ID65
Feature activation-3.64e-01

Pos logit contributions
theless+6.385
 rhy+5.911
ymes+5.672
ingly+5.113
olla+5.052
Neg logit contributions
eln-9.210
imens-9.154
earable-9.110
imates-8.679
enne-8.622
Token]
Token ID60
Feature activation+3.82e-01

Pos logit contributions
theless+0.164
 rhy+0.144
ymes+0.132
ingly+0.111
olla+0.104
Neg logit contributions
eln-0.387
imens-0.378
earable-0.375
enne-0.363
imates-0.359
TokenU
Token ID52
Feature activation-4.27e+00

Pos logit contributions
eln+0.262
imens+0.254
earable+0.250
enne+0.238
imates+0.233
Neg logit contributions
theless-0.317
 rhy-0.294
ymes-0.282
ingly-0.260
olla-0.254
Tokenf
Token ID69
Feature activation+9.15e-01

Pos logit contributions
theless+3.193
 rhy+3.017
ymes+2.868
ingly+2.631
olla+2.534
Neg logit contributions
eln-3.192
imens-3.106
earable-3.056
enne-2.897
imates-2.848
Tokenou
Token ID280
Feature activation-3.14e-01

Pos logit contributions
eln+0.638
imens+0.618
earable+0.611
enne+0.581
imates+0.568
Neg logit contributions
theless-0.750
 rhy-0.697
ymes-0.667
ingly-0.608
olla-0.598
Token war
Token ID1175
Feature activation-2.44e+00

Pos logit contributions
eln+0.510
imens+0.492
earable+0.486
enne+0.466
imates+0.457
Neg logit contributions
theless-0.507
 rhy-0.467
ymes-0.446
ingly-0.404
olla-0.394
Token with
Token ID351
Feature activation-2.87e+00

Pos logit contributions
theless+1.645
 rhy+1.514
ymes+1.410
ingly+1.274
olla+1.236
Neg logit contributions
eln-2.045
imens-1.997
earable-1.985
enne-1.888
imates-1.869
Token Germany
Token ID4486
Feature activation+1.06e+00

Pos logit contributions
theless+2.322
 rhy+2.162
ymes+2.045
ingly+1.889
olla+1.827
Neg logit contributions
eln-2.024
imens-1.972
earable-1.953
enne-1.855
imates-1.832
Token [
Token ID685
Feature activation-4.21e+00

Pos logit contributions
eln+0.894
imens+0.869
earable+0.859
enne+0.828
imates+0.813
Neg logit contributions
theless-0.712
 rhy-0.650
ymes-0.614
ingly-0.548
olla-0.531
Token edit
Token ID4370
Feature activation-4.80e+00

Pos logit contributions
theless+1.792
 rhy+1.591
ymes+1.462
ingly+1.249
olla+1.203
Neg logit contributions
eln-4.458
imens-4.376
earable-4.329
imates-4.200
enne-4.184
Token ]
Token ID2361
Feature activation-8.67e+00

Pos logit contributions
theless+1.993
 rhy+1.793
ymes+1.642
ingly+1.442
olla+1.323
Neg logit contributions
eln-5.070
imens-4.972
earable-4.900
enne-4.779
imates-4.702
Token
Token ID198
Feature activation-7.27e-01

Pos logit contributions
theless+3.385
 rhy+3.126
ymes+2.929
ingly+2.684
olla+2.487
Neg logit contributions
eln-9.083
 millenn-8.969
imens-8.908
earable-8.677
enne-8.345
Token
Token ID198
Feature activation-7.58e-01

Pos logit contributions
theless+0.419
 rhy+0.381
ymes+0.357
ingly+0.316
olla+0.303
Neg logit contributions
eln-0.682
imens-0.665
earable-0.656
enne-0.633
imates-0.624
TokenFrom
Token ID4863
Feature activation+4.76e+00

Pos logit contributions
theless+0.615
 rhy+0.571
ymes+0.547
ingly+0.504
olla+0.492
Neg logit contributions
eln-0.532
imens-0.516
earable-0.510
enne-0.484
imates-0.476
Token 2
Token ID362
Feature activation+6.75e+00

Pos logit contributions
eln+3.335
imens+3.235
earable+3.143
enne+3.030
imates+2.973
Neg logit contributions
theless-3.799
 rhy-3.514
ymes-3.394
ingly-3.057
olla-2.983
Token February
Token ID3945
Feature activation+3.07e+00

Pos logit contributions
eln+3.857
imens+3.631
earable+3.546
enne+3.459
ainted+3.358
Neg logit contributions
theless-6.247
 rhy-5.843
ymes-5.594
ingly-5.125
olla-5.069
Token-
Token ID12
Feature activation+6.64e+00

Pos logit contributions
theless+1.103
 rhy+1.024
ymes+0.969
ingly+0.872
olla+0.843
Neg logit contributions
eln-1.332
imens-1.285
earable-1.270
enne-1.219
imates-1.201
TokenJimmy
Token ID40335
Feature activation-4.61e+00

Pos logit contributions
eln+4.351
earable+4.174
imens+4.164
enne+3.960
imates+3.857
Neg logit contributions
theless-5.701
 rhy-5.217
ymes-4.917
ingly-4.475
olla-4.370
Token Tide
Token ID31105
Feature activation+1.29e+00

Pos logit contributions
theless+3.395
 rhy+3.193
ymes+3.008
ingly+2.774
olla+2.747
Neg logit contributions
eln-3.517
imens-3.406
earable-3.379
enne-3.198
ktop-3.098
Token 08
Token ID8487
Feature activation+8.88e-01

Pos logit contributions
eln+1.000
imens+0.968
earable+0.958
enne+0.919
imates+0.900
Neg logit contributions
theless-0.967
 rhy-0.889
ymes-0.844
ingly-0.763
olla-0.742
Token-
Token ID12
Feature activation+3.48e+00

Pos logit contributions
eln+0.731
imens+0.709
earable+0.702
enne+0.674
imates+0.662
Neg logit contributions
theless-0.619
 rhy-0.566
ymes-0.535
ingly-0.480
olla-0.465
TokenHarry
Token ID18308
Feature activation-1.26e+01

Pos logit contributions
eln+2.390
imens+2.302
earable+2.286
enne+2.172
imates+2.124
Neg logit contributions
theless-2.906
 rhy-2.686
ymes-2.552
ingly-2.332
olla-2.278
Token @
Token ID2488
Feature activation-1.27e+00

Pos logit contributions
theless+8.984
 rhy+8.457
ymes+8.065
ingly+7.454
olla+6.970
Neg logit contributions
eln-9.026
earable-8.729
imens-8.653
ktop-8.077
imates-7.959
Token TMZ
Token ID46800
Feature activation+2.19e+00

Pos logit contributions
theless+1.000
 rhy+0.943
ymes+0.897
ingly+0.820
olla+0.802
Neg logit contributions
eln-0.921
imens-0.894
earable-0.880
enne-0.838
imates-0.822
Token-
Token ID12
Feature activation+5.79e+00

Pos logit contributions
eln+1.702
imens+1.652
earable+1.632
enne+1.567
imates+1.536
Neg logit contributions
theless-1.638
 rhy-1.493
ymes-1.411
ingly-1.274
olla-1.240
TokenV
Token ID53
Feature activation-1.50e+00

Pos logit contributions
eln+3.877
imens+3.716
earable+3.711
enne+3.518
imates+3.439
Neg logit contributions
theless-4.903
 rhy-4.494
ymes-4.257
ingly-3.885
olla-3.788
Tokenos
Token ID418
Feature activation+3.50e+00

Pos logit contributions
theless+1.213
 rhy+1.137
ymes+1.098
ingly+1.004
olla+0.984
Neg logit contributions
eln-1.034
imens-0.998
earable-0.987
enne-0.928
imates-0.918
Token from
Token ID422
Feature activation+7.64e-01

Pos logit contributions
eln+2.367
imens+2.297
earable+2.250
enne+2.169
imates+2.139
Neg logit contributions
theless-2.550
 rhy-2.328
ymes-2.240
ingly-2.015
olla-1.979
Token healthcare
Token ID11409
Feature activation+1.03e+00

Pos logit contributions
eln+0.578
imens+0.560
earable+0.552
enne+0.529
imates+0.517
Neg logit contributions
theless-0.584
 rhy-0.541
ymes-0.513
ingly-0.467
olla-0.454
Token",
Token ID1600
Feature activation+7.67e-01

Pos logit contributions
eln+0.818
imens+0.792
earable+0.784
enne+0.752
imates+0.737
Neg logit contributions
theless-0.744
 rhy-0.685
ymes-0.648
ingly-0.586
olla-0.568
Token its
Token ID663
Feature activation-3.48e+00

Pos logit contributions
eln+0.617
imens+0.599
earable+0.592
enne+0.569
imates+0.558
Neg logit contributions
theless-0.548
 rhy-0.504
ymes-0.477
ingly-0.430
olla-0.417
Token website
Token ID3052
Feature activation-7.73e+00

Pos logit contributions
theless+2.644
 rhy+2.530
ymes+2.352
ingly+2.177
olla+2.111
Neg logit contributions
eln-2.569
imens-2.502
earable-2.471
enne-2.354
imates-2.285
Token adds
Token ID6673
Feature activation-7.98e+00

Pos logit contributions
theless+4.363
 rhy+4.232
ymes+3.722
ingly+3.428
olla+3.209
Neg logit contributions
eln-7.043
earable-6.893
imens-6.787
enne-6.547
ktop-6.359
Token.
Token ID13
Feature activation-3.14e+00

Pos logit contributions
theless+4.887
 rhy+4.645
ymes+4.281
ingly+3.955
olla+3.698
Neg logit contributions
eln-6.796
earable-6.622
imens-6.608
enne-6.293
imates-6.136
Token
Token ID198
Feature activation+1.64e+00

Pos logit contributions
theless+1.916
 rhy+1.734
ymes+1.620
ingly+1.445
olla+1.396
Neg logit contributions
eln-2.823
imens-2.744
earable-2.727
enne-2.629
imates-2.576
Token
Token ID198
Feature activation-1.44e+00

Pos logit contributions
eln+1.440
imens+1.401
earable+1.393
enne+1.343
imates+1.320
Neg logit contributions
theless-1.052
 rhy-0.943
ymes-0.883
ingly-0.773
olla-0.754
TokenIt
Token ID1026
Feature activation-2.43e+00

Pos logit contributions
theless+1.169
 rhy+1.090
ymes+1.048
ingly+0.967
olla+0.943
Neg logit contributions
eln-1.000
imens-0.968
earable-0.955
enne-0.911
imates-0.890
Token says
Token ID1139
Feature activation-4.61e+00

Pos logit contributions
theless+1.708
 rhy+1.576
ymes+1.485
ingly+1.347
olla+1.288
Neg logit contributions
eln-1.973
imens-1.916
earable-1.908
enne-1.813
imates-1.764