TOP ACTIVATIONS
MAX = 2.372

. Sa arak k ala , 28 , said
personal integrity ," Sa arak k ala is quoted as saying
performance , while the sax ophon ist liked this performance best
improvis ation , while the pian ist and expert use terms
, 2012 ) A 24 - year - old
as saying . Sa arak k ala , a critic of
turtles ." How much debris does it take to
claimed . Am ad ou Di allo was also the name
More : Acc used students find sympathetic ear with
s a problem before someone needs the data . When you
skillet ( don t crowd the pan ; only
avail . Ranger 6 crashed into the lunar surface at 9
distributions in multiple versions , each with their own versions of
9 th S PA . [ 59 ] Over 20 %
cl ipping the plastic cover surrounding the radio . Start from
( Na of umi )
live liest . The sax ophon ist considered Performance 3 to
ipping the plastic cover surrounding the radio . Start from the
discover his $ 2 , 200 vintage bicycle had been stolen
successful performance , while the sax ophon ist liked this performance

MIN ACTIVATIONS
MIN = -2.500

Author of The Far rah W ether s Mysteries , comic
to engage in an open and straightforward communication with staff ,
? Jay C z ark owski : I would
will open new store at Five Corn ers in Bur ien
? Jay C z ark owski : I think
? Jay C z ark owski : You know
? Jay C z ark owski : And you
Big gest G ainer 2008 01 08 O & A -
bit 256 - bit 192 - bit 256 - bit 256
- Ret arded Cook 200 80 202 O & A -
bit 192 - bit 256 - bit 256 - bit R
on the road , 38 - 19 . T
s state ( such as through a or ) must
edges that you have to be aware of . In this
I am not afraid to speak frankly to a president of
administration battles to bring the deficit down to 6 . 3
bit 128 - bit 256 - bit 192 - bit 256
on a song - by - song basis , and em
efforts to pull the Fukushima plant back from the brink .
world . ( Wa ub ges hig Rice / CBC )

INTERVAL 1.154 - 2.372
CONTAINS 0.156%

introduced to a very active November Project group in Edmonton that
distributions in multiple versions , each with their own versions of
the ability to confront one 's accuser is important not merely
that apps are isolated from each other and from the host
Award for Dist inguished Human itarian Service winner was Ar lene

INTERVAL -0.064 - 1.154
CONTAINS 57.900%

a former competition commissioner , has already proposed that EU countries
to re - im agine the condo in Toronto , and
, as well as my own experiences living in and traveling
ar ( His ar ) by blocking of a
We talk a little about her support for Planned Parenthood ,

INTERVAL -1.282 - -0.064
CONTAINS 41.888%

anyone make you feel like you don 't matter or like
Sturgeon , ' light a fire under Westminster 's back side
to keep your road trip in the right direction stay at
" w arp factor " is too large , the output
bright shade of orange and the waters are so visc ous

INTERVAL -2.500 - -1.282
CONTAINS 0.056%

values . It is a good idea to link family values
was at The Head line for the launch seemed to love
ms G 2 G AU Opt ronics AH VA ( IPS
Los Angeles 1980 84 . Copyright John Brian King
s state ( such as through a or ) must
Token.
Token ID13
Feature activation+2.06e-01

Pos logit contributions
allo+0.347
 accuser+0.339
ctica+0.331
gio+0.324
omorphic+0.323
Neg logit contributions
chery-0.366
hens-0.335
 Nether-0.332
word-0.307
iron-0.300
Token
Token ID198
Feature activation+1.82e-01

Pos logit contributions
allo+2.155
 accuser+2.103
ctica+2.052
gio+2.013
968+1.983
Neg logit contributions
chery-2.619
hens-2.397
 Nether-2.356
word-2.214
iron-2.186
Token
Token ID198
Feature activation-5.31e-02

Pos logit contributions
allo+1.851
 accuser+1.776
ctica+1.737
gio+1.724
968+1.712
Neg logit contributions
chery-2.356
hens-2.183
 Nether-2.158
word-1.993
iron-1.971
TokenSa
Token ID33890
Feature activation+1.61e-01

Pos logit contributions
chery+0.684
hens+0.631
 Nether+0.622
word+0.584
iron+0.571
Neg logit contributions
allo-0.554
 accuser-0.541
ctica-0.529
omorphic-0.515
gio-0.515
Tokenarak
Token ID30447
Feature activation+5.89e-01

Pos logit contributions
allo+1.349
 accuser+1.275
ctica+1.252
omorphic+1.218
gio+1.191
Neg logit contributions
chery-2.404
 Nether-2.249
hens-2.241
word-2.100
iron-2.039
Tokenk
Token ID74
Feature activation+2.37e+00

Pos logit contributions
allo+4.665
lake+4.427
gio+4.285
ctica+4.176
omorphic+4.140
Neg logit contributions
chery-8.341
 Nether-8.224
hens-7.935
 BT-7.246
 Worm-7.230
Tokenala
Token ID6081
Feature activation-1.82e-01

Pos logit contributions
ala+16.081
allo+13.404
alan+10.968
algia+10.421
lake+9.788
Neg logit contributions
 Nether-27.779
-27.097
hens-26.611
 comedians-25.718
chery-25.612
Token,
Token ID11
Feature activation+8.09e-02

Pos logit contributions
chery+2.180
hens+2.000
 Nether+1.984
word+1.834
iron+1.803
Neg logit contributions
allo-2.060
ctica-1.969
 accuser-1.967
gio-1.925
omorphic-1.910
Token 28
Token ID2579
Feature activation-5.51e-02

Pos logit contributions
allo+0.810
 accuser+0.794
ctica+0.768
omorphic+0.748
gio+0.748
Neg logit contributions
chery-1.075
hens-0.992
 Nether-0.980
word-0.917
iron-0.899
Token,
Token ID11
Feature activation-3.48e-01

Pos logit contributions
chery+0.649
hens+0.594
 Nether+0.589
word+0.544
iron+0.533
Neg logit contributions
allo-0.636
 accuser-0.618
ctica-0.606
gio-0.596
omorphic-0.593
Token said
Token ID531
Feature activation-3.96e-01

Pos logit contributions
chery+4.085
hens+3.765
 Nether+3.754
word+3.453
iron+3.414
Neg logit contributions
allo-3.902
ctica-3.719
 accuser-3.676
gio-3.665
omorphic-3.622
Token personal
Token ID2614
Feature activation-1.16e+00

Pos logit contributions
chery+10.354
 Nether+9.644
hens+8.868
word+8.232
iron+8.157
Neg logit contributions
allo-11.210
968-10.349
gio-10.307
lake-10.230
bably-10.195
Token integrity
Token ID11540
Feature activation-4.97e-01

Pos logit contributions
chery+13.152
 Nether+12.248
hens+11.410
iron+10.847
 BT+10.755
Neg logit contributions
allo-10.877
gio-10.563
968-10.172
bably-10.102
ctica-9.940
Token,"
Token ID553
Feature activation+4.98e-01

Pos logit contributions
chery+5.309
hens+4.728
 Nether+4.601
word+4.303
iron+4.170
Neg logit contributions
allo-6.139
 accuser-5.808
ctica-5.781
gio-5.762
bably-5.700
Token Sa
Token ID10318
Feature activation+1.13e-01

Pos logit contributions
 accuser+5.004
allo+4.936
ctica+4.673
gio+4.585
omorphic+4.582
Neg logit contributions
chery-6.283
hens-5.729
 Nether-5.650
word-5.307
iron-5.280
Tokenarak
Token ID30447
Feature activation+5.79e-01

Pos logit contributions
allo+0.952
 accuser+0.906
ctica+0.888
omorphic+0.861
gio+0.852
Neg logit contributions
chery-1.673
hens-1.561
 Nether-1.560
word-1.460
iron-1.423
Tokenk
Token ID74
Feature activation+2.27e+00

Pos logit contributions
allo+4.599
lake+4.293
gio+4.218
ctica+4.124
dri+4.029
Neg logit contributions
chery-8.193
 Nether-8.098
hens-7.802
 BT-7.166
 Worm-7.164
Tokenala
Token ID6081
Feature activation-1.04e-01

Pos logit contributions
ala+15.550
allo+13.215
alan+10.726
algia+10.209
lake+9.486
Neg logit contributions
 Nether-27.052
-26.876
hens-25.615
 comedians-25.087
chery-24.881
Token is
Token ID318
Feature activation-3.71e-01

Pos logit contributions
chery+1.314
hens+1.210
 Nether+1.199
word+1.115
iron+1.095
Neg logit contributions
allo-1.111
 accuser-1.069
ctica-1.057
gio-1.033
omorphic-1.031
Token quoted
Token ID10947
Feature activation+2.50e-01

Pos logit contributions
chery+3.682
 Nether+3.364
hens+3.348
word+3.158
iron+3.011
Neg logit contributions
allo-4.770
 accuser-4.542
gio-4.522
ctica-4.487
lake-4.426
Token as
Token ID355
Feature activation-1.64e-02

Pos logit contributions
allo+2.748
 accuser+2.743
ctica+2.658
gio+2.601
omorphic+2.567
Neg logit contributions
chery-2.979
hens-2.755
 Nether-2.702
word-2.476
iron-2.441
Token saying
Token ID2282
Feature activation+2.12e-01

Pos logit contributions
chery+0.196
hens+0.180
 Nether+0.178
word+0.165
iron+0.161
Neg logit contributions
allo-0.186
 accuser-0.181
ctica-0.178
gio-0.174
omorphic-0.174
Token performance
Token ID2854
Feature activation+4.29e-01

Pos logit contributions
allo+7.069
 accuser+6.981
ctica+6.562
gio+6.541
 Entered+6.438
Neg logit contributions
chery-7.341
hens-6.869
 Nether-6.737
iron-6.214
word-6.127
Token,
Token ID11
Feature activation+4.02e-01

Pos logit contributions
allo+4.461
 accuser+4.336
ctica+4.219
gio+4.153
omorphic+4.086
Neg logit contributions
chery-5.324
hens-4.912
 Nether-4.874
word-4.488
iron-4.457
Token while
Token ID981
Feature activation+6.46e-01

Pos logit contributions
allo+4.358
 accuser+4.294
ctica+4.125
omorphic+4.063
968+4.021
Neg logit contributions
chery-4.789
hens-4.418
 Nether-4.346
word-4.041
iron-3.930
Token the
Token ID262
Feature activation+7.65e-01

Pos logit contributions
allo+6.542
 accuser+6.440
ctica+6.087
gio+6.008
omorphic+5.967
Neg logit contributions
chery-7.689
hens-7.195
 Nether-7.084
word-6.585
iron-6.540
Token sax
Token ID46909
Feature activation+1.69e+00

Pos logit contributions
 accuser+7.707
allo+7.469
gio+6.945
ctica+6.928
omorphic+6.887
Neg logit contributions
chery-9.229
 Nether-8.431
hens-8.417
word-7.711
iron-7.657
Tokenophon
Token ID48982
Feature activation+2.00e+00

Pos logit contributions
allo+12.846
ologist+12.811
 accuser+11.937
onte+11.582
omorphic+11.462
Neg logit contributions
chery-18.277
 Nether-16.998
hens-16.783
 Onion-15.132
iron-15.096
Tokenist
Token ID396
Feature activation+1.03e+00

Pos logit contributions
allo+11.308
ologist+11.085
ist+9.767
onte+9.404
ala+9.318
Neg logit contributions
chery-24.077
hens-23.134
 Nether-22.869
 Onion-21.551
 Ukrainians-21.226
Token liked
Token ID8288
Feature activation+3.04e-01

Pos logit contributions
 accuser+9.852
allo+9.357
968+8.872
gio+8.818
omorphic+8.797
Neg logit contributions
chery-11.450
hens-10.787
 Nether-10.424
word-9.679
argon-9.646
Token this
Token ID428
Feature activation+1.51e-01

Pos logit contributions
allo+3.298
 accuser+3.158
ctica+3.108
gio+3.060
omorphic+3.017
Neg logit contributions
chery-3.718
hens-3.443
 Nether-3.404
word-3.172
iron-3.110
Token performance
Token ID2854
Feature activation+2.88e-01

Pos logit contributions
allo+1.531
 accuser+1.495
ctica+1.445
gio+1.421
omorphic+1.406
Neg logit contributions
chery-1.977
hens-1.841
 Nether-1.819
word-1.692
iron-1.666
Token best
Token ID1266
Feature activation+1.24e-01

Pos logit contributions
allo+3.167
 accuser+3.086
ctica+3.005
omorphic+2.954
gio+2.934
Neg logit contributions
chery-3.471
hens-3.199
 Nether-3.189
word-2.916
iron-2.851
Token improvis
Token ID47833
Feature activation+6.57e-01

Pos logit contributions
allo+1.860
 accuser+1.823
ctica+1.778
gio+1.745
omorphic+1.742
Neg logit contributions
chery-1.418
hens-1.285
 Nether-1.275
word-1.157
iron-1.123
Tokenation
Token ID341
Feature activation+3.58e-01

Pos logit contributions
allo+5.319
omorphic+4.637
ctica+4.608
 accuser+4.606
lake+4.511
Neg logit contributions
chery-9.297
hens-8.889
 Nether-8.794
word-8.215
iron-8.028
Token,
Token ID11
Feature activation-1.94e-01

Pos logit contributions
allo+3.904
 accuser+3.818
ctica+3.672
omorphic+3.618
gio+3.617
Neg logit contributions
chery-4.298
hens-3.983
 Nether-3.904
word-3.612
iron-3.573
Token while
Token ID981
Feature activation+9.79e-01

Pos logit contributions
chery+2.163
hens+1.967
 Nether+1.950
word+1.797
iron+1.751
Neg logit contributions
allo-2.338
 accuser-2.271
ctica-2.250
gio-2.206
omorphic-2.193
Token the
Token ID262
Feature activation+7.69e-01

Pos logit contributions
 accuser+9.267
allo+9.043
968+8.488
ctica+8.350
onte+8.314
Neg logit contributions
chery-10.952
hens-10.402
 Nether-10.236
iron-9.503
word-9.410
Token pian
Token ID43923
Feature activation+1.93e+00

Pos logit contributions
 accuser+6.816
allo+6.574
gio+6.139
ctica+6.088
omorphic+5.986
Neg logit contributions
chery-10.223
 Nether-9.375
hens-9.338
word-8.611
iron-8.503
Tokenist
Token ID396
Feature activation+6.74e-01

Pos logit contributions
ologist+11.733
allo+11.656
lake+9.896
olic+9.493
ala+9.479
Neg logit contributions
chery-23.099
 Nether-22.825
hens-21.984
 Ukrainians-20.950
ISION-20.859
Token and
Token ID290
Feature activation+7.97e-01

Pos logit contributions
 accuser+6.132
allo+5.923
gio+5.531
968+5.494
ctica+5.470
Neg logit contributions
chery-8.680
hens-8.141
 Nether-7.979
word-7.425
iron-7.309
Token expert
Token ID5887
Feature activation+1.08e+00

Pos logit contributions
 accuser+7.637
allo+7.485
ctica+7.061
gio+6.999
omorphic+6.941
Neg logit contributions
chery-9.689
 Nether-9.135
hens-9.031
word-8.217
iron-8.183
Token use
Token ID779
Feature activation+1.18e-01

Pos logit contributions
 accuser+10.825
allo+10.398
gio+10.125
ologist+9.859
968+9.838
Neg logit contributions
chery-11.583
hens-10.642
 Nether-10.622
iron-9.690
word-9.630
Token terms
Token ID2846
Feature activation+6.40e-01

Pos logit contributions
allo+1.338
 accuser+1.306
ctica+1.275
gio+1.250
omorphic+1.247
Neg logit contributions
chery-1.404
hens-1.288
 Nether-1.277
word-1.176
iron-1.154
Token,
Token ID11
Feature activation-7.85e-02

Pos logit contributions
allo+0.659
 accuser+0.644
ctica+0.629
gio+0.616
omorphic+0.612
Neg logit contributions
chery-0.721
hens-0.662
 Nether-0.656
word-0.606
iron-0.593
Token 2012
Token ID2321
Feature activation-2.82e-01

Pos logit contributions
chery+1.074
hens+0.998
 Nether+0.990
word+0.928
iron+0.911
Neg logit contributions
allo-0.751
 accuser-0.731
ctica-0.715
omorphic-0.699
gio-0.695
Token)
Token ID8
Feature activation-1.14e-01

Pos logit contributions
chery+3.284
hens+3.040
 Nether+3.024
word+2.809
iron+2.720
Neg logit contributions
allo-3.197
 accuser-3.160
ctica-3.109
omorphic-3.079
gio-3.003
Token
Token ID198
Feature activation+1.18e-02

Pos logit contributions
chery+1.342
hens+1.231
 Nether+1.218
word+1.132
iron+1.097
Neg logit contributions
allo-1.315
 accuser-1.294
ctica-1.261
omorphic-1.242
gio-1.228
Token
Token ID198
Feature activation+2.75e-01

Pos logit contributions
allo+0.130
 accuser+0.127
ctica+0.124
gio+0.121
omorphic+0.121
Neg logit contributions
chery-0.145
hens-0.134
 Nether-0.132
word-0.123
iron-0.120
TokenA
Token ID32
Feature activation+1.93e+00

Pos logit contributions
allo+2.089
 accuser+1.945
968+1.887
ctica+1.883
lake+1.851
Neg logit contributions
chery-4.267
hens-4.015
 Nether-3.977
word-3.677
iron-3.639
Token 24
Token ID1987
Feature activation+1.47e-02

Pos logit contributions
 accuser+13.581
allo+12.781
lake+12.530
 Salv+12.462
 masse+12.447
Neg logit contributions
chery-19.775
hens-18.125
argon-17.588
iron-16.540
Arsenal-16.106
Token-
Token ID12
Feature activation+1.35e-01

Pos logit contributions
allo+0.165
 accuser+0.161
ctica+0.157
gio+0.154
omorphic+0.153
Neg logit contributions
chery-0.179
hens-0.164
 Nether-0.162
word-0.151
iron-0.147
Tokenyear
Token ID1941
Feature activation+2.54e-01

Pos logit contributions
allo+1.268
 accuser+1.231
ctica+1.188
gio+1.163
lake+1.152
Neg logit contributions
chery-1.866
hens-1.734
 Nether-1.730
word-1.592
iron-1.566
Token-
Token ID12
Feature activation+3.21e-01

Pos logit contributions
allo+2.599
 accuser+2.577
ctica+2.458
omorphic+2.376
gio+2.376
Neg logit contributions
chery-3.265
hens-3.016
 Nether-2.943
word-2.754
iron-2.717
Tokenold
Token ID727
Feature activation+9.81e-01

Pos logit contributions
allo+3.411
 accuser+3.247
lake+3.173
ctica+3.167
968+3.127
Neg logit contributions
chery-3.962
 Nether-3.704
hens-3.698
word-3.294
iron-3.211
Token as
Token ID355
Feature activation-1.64e-02

Pos logit contributions
allo+2.748
 accuser+2.743
ctica+2.658
gio+2.601
omorphic+2.567
Neg logit contributions
chery-2.979
hens-2.755
 Nether-2.702
word-2.476
iron-2.441
Token saying
Token ID2282
Feature activation+2.12e-01

Pos logit contributions
chery+0.196
hens+0.180
 Nether+0.178
word+0.165
iron+0.161
Neg logit contributions
allo-0.186
 accuser-0.181
ctica-0.178
gio-0.174
omorphic-0.174
Token.
Token ID13
Feature activation+9.33e-02

Pos logit contributions
allo+2.111
 accuser+2.088
ctica+2.014
gio+1.959
omorphic+1.947
Neg logit contributions
chery-2.781
hens-2.574
 Nether-2.551
word-2.358
iron-2.319
Token Sa
Token ID10318
Feature activation+5.91e-02

Pos logit contributions
allo+1.033
 accuser+1.010
ctica+0.987
gio+0.965
omorphic+0.961
Neg logit contributions
chery-1.137
hens-1.040
 Nether-1.030
word-0.957
iron-0.937
Tokenarak
Token ID30447
Feature activation+5.98e-01

Pos logit contributions
allo+0.510
 accuser+0.491
ctica+0.478
omorphic+0.464
gio+0.463
Neg logit contributions
chery-0.866
hens-0.807
 Nether-0.804
word-0.754
iron-0.738
Tokenk
Token ID74
Feature activation+1.92e+00

Pos logit contributions
allo+4.696
lake+4.406
gio+4.287
ctica+4.204
omorphic+4.136
Neg logit contributions
chery-8.443
 Nether-8.375
hens-8.047
 Worm-7.406
 BT-7.383
Tokenala
Token ID6081
Feature activation-5.30e-01

Pos logit contributions
ala+13.654
allo+12.303
alan+10.111
algia+9.807
lake+9.325
Neg logit contributions
 Nether-24.098
hens-22.701
-22.666
chery-22.248
 comedians-21.991
Token,
Token ID11
Feature activation+4.43e-02

Pos logit contributions
chery+5.860
hens+5.313
 Nether+5.298
word+4.864
iron+4.828
Neg logit contributions
allo-6.207
ctica-5.926
gio-5.781
omorphic-5.748
968-5.731
Token a
Token ID257
Feature activation-4.24e-01

Pos logit contributions
allo+0.425
 accuser+0.414
ctica+0.402
gio+0.392
omorphic+0.391
Neg logit contributions
chery-0.607
hens-0.562
 Nether-0.557
word-0.521
iron-0.511
Token critic
Token ID4014
Feature activation+2.18e-01

Pos logit contributions
chery+4.658
 Nether+4.473
hens+4.356
word+3.965
iron+3.835
Neg logit contributions
allo-4.990
ctica-4.756
gio-4.582
 accuser-4.568
968-4.547
Token of
Token ID286
Feature activation-2.42e-01

Pos logit contributions
allo+2.249
 accuser+2.214
ctica+2.142
omorphic+2.095
gio+2.088
Neg logit contributions
chery-2.789
hens-2.577
 Nether-2.560
word-2.380
iron-2.316
Token turtles
Token ID36288
Feature activation-7.72e-02

Pos logit contributions
allo+1.294
 accuser+1.260
ctica+1.240
omorphic+1.208
gio+1.199
Neg logit contributions
chery-1.348
hens-1.231
 Nether-1.215
word-1.126
iron-1.099
Token."
Token ID526
Feature activation+2.03e-01

Pos logit contributions
chery+0.920
hens+0.838
 Nether+0.831
word+0.769
iron+0.752
Neg logit contributions
allo-0.881
 accuser-0.861
ctica-0.841
gio-0.828
omorphic-0.823
Token
Token ID198
Feature activation-8.69e-02

Pos logit contributions
allo+2.154
 accuser+2.062
ctica+2.050
omorphic+1.980
gio+1.978
Neg logit contributions
chery-2.561
hens-2.355
 Nether-2.327
word-2.171
iron-2.129
Token
Token ID198
Feature activation+4.64e-02

Pos logit contributions
chery+1.030
hens+0.942
 Nether+0.934
word+0.869
iron+0.844
Neg logit contributions
allo-0.993
 accuser-0.978
ctica-0.955
omorphic-0.935
gio-0.929
TokenHow
Token ID2437
Feature activation+9.41e-01

Pos logit contributions
allo+0.462
 accuser+0.447
ctica+0.437
gio+0.427
omorphic+0.425
Neg logit contributions
chery-0.619
hens-0.573
 Nether-0.568
word-0.530
iron-0.519
Token much
Token ID881
Feature activation+1.91e+00

Pos logit contributions
allo+9.003
 accuser+8.783
ctica+8.673
omorphic+8.619
lake+8.537
Neg logit contributions
chery-10.768
hens-9.863
 Nether-9.264
word-9.262
iron-8.738
Token debris
Token ID16468
Feature activation+1.20e+00

Pos logit contributions
 accuser+15.297
 Salv+14.588
 debris+14.568
 harms+14.500
 misconduct+14.383
Neg logit contributions
hens-17.155
chery-16.693
word-15.190
////////////////-14.243
iron-14.042
Token does
Token ID857
Feature activation+1.02e+00

Pos logit contributions
 accuser+10.043
allo+9.876
 harms+9.788
lake+9.468
omorphic+9.441
Neg logit contributions
chery-13.678
hens-13.042
word-12.225
iron-11.901
 Nether-11.859
Token it
Token ID340
Feature activation+4.12e-01

Pos logit contributions
 accuser+9.381
allo+9.301
omorphic+9.142
ctica+9.071
lake+8.955
Neg logit contributions
chery-11.495
hens-10.975
word-10.209
 Nether-9.836
iron-9.649
Token take
Token ID1011
Feature activation+2.26e-01

Pos logit contributions
allo+4.424
 accuser+4.320
ctica+4.167
omorphic+4.112
lake+4.057
Neg logit contributions
chery-4.940
hens-4.608
 Nether-4.449
word-4.241
iron-4.173
Token to
Token ID284
Feature activation+4.61e-01

Pos logit contributions
allo+2.408
 accuser+2.349
ctica+2.319
omorphic+2.264
lake+2.215
Neg logit contributions
chery-2.793
hens-2.569
 Nether-2.556
word-2.377
iron-2.307
Token claimed
Token ID4752
Feature activation+4.74e-01

Pos logit contributions
 accuser+2.659
allo+2.634
ctica+2.481
gio+2.438
968+2.395
Neg logit contributions
chery-4.367
hens-4.085
 Nether-4.026
word-3.764
iron-3.736
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
 accuser+4.609
allo+4.540
ctica+4.339
gio+4.183
omorphic+4.147
Neg logit contributions
chery-6.138
hens-5.695
 Nether-5.599
word-5.170
argon-5.127
Token Am
Token ID1703
Feature activation+5.59e-01

Pos logit contributions
allo+10.821
 accuser+10.613
 Entered+10.055
968+9.981
gio+9.939
Neg logit contributions
chery-14.510
hens-13.007
 Nether-12.577
iron-11.994
argon-11.685
Tokenad
Token ID324
Feature activation+4.53e-01

Pos logit contributions
allo+6.304
 accuser+5.819
omorphic+5.783
gio+5.729
dri+5.643
Neg logit contributions
chery-6.440
 Nether-6.120
hens-5.905
word-5.252
 Worm-5.224
Tokenou
Token ID280
Feature activation+8.74e-01

Pos logit contributions
allo+4.016
omorphic+3.636
ctica+3.580
gio+3.567
 accuser+3.554
Neg logit contributions
chery-6.400
 Nether-6.070
hens-5.920
word-5.442
iron-5.412
Token Di
Token ID6031
Feature activation+1.88e+00

Pos logit contributions
 accuser+8.078
allo+7.569
gio+7.139
ctica+7.076
 Entered+7.055
Neg logit contributions
chery-10.910
hens-10.222
 Nether-9.906
argon-9.382
word-9.251
Tokenallo
Token ID49457
Feature activation+3.23e-01

Pos logit contributions
allo+15.547
ala+10.019
onte+9.696
gio+9.577
alan+8.939
Neg logit contributions
 Nether-24.036
chery-23.172
hens-21.765
 BT-20.841
 Corn-20.753
Token was
Token ID373
Feature activation+1.19e-02

Pos logit contributions
 accuser+3.374
allo+3.334
ctica+3.160
gio+3.101
omorphic+3.081
Neg logit contributions
chery-4.084
hens-3.739
 Nether-3.687
word-3.442
iron-3.356
Token also
Token ID635
Feature activation+3.15e-02

Pos logit contributions
allo+0.143
 accuser+0.140
ctica+0.137
gio+0.134
omorphic+0.134
Neg logit contributions
chery-0.135
hens-0.123
 Nether-0.121
word-0.112
iron-0.109
Token the
Token ID262
Feature activation+1.87e-01

Pos logit contributions
allo+0.366
 accuser+0.358
ctica+0.350
gio+0.343
omorphic+0.342
Neg logit contributions
chery-0.367
hens-0.335
 Nether-0.331
word-0.306
iron-0.299
Token name
Token ID1438
Feature activation+1.62e-01

Pos logit contributions
 accuser+2.057
allo+2.049
ctica+1.930
omorphic+1.907
gio+1.905
Neg logit contributions
chery-2.289
hens-2.096
 Nether-2.046
word-1.915
iron-1.892
Token
Token ID198
Feature activation+3.35e-01

Pos logit contributions
 accuser+2.569
allo+2.515
ctica+2.412
gio+2.354
lake+2.324
Neg logit contributions
chery-2.930
hens-2.712
 Nether-2.661
word-2.458
iron-2.425
Token
Token ID198
Feature activation+7.43e-01

Pos logit contributions
allo+3.153
 accuser+3.010
gio+2.941
968+2.929
lake+2.897
Neg logit contributions
chery-4.493
hens-4.195
 Nether-4.133
word-3.800
iron-3.797
TokenMore
Token ID5167
Feature activation+4.56e-02

Pos logit contributions
allo+6.863
 accuser+6.693
968+6.486
lake+6.342
dri+6.292
Neg logit contributions
chery-9.471
 Nether-9.002
hens-8.788
word-7.764
 Worm-7.726
Token:
Token ID25
Feature activation+1.16e+00

Pos logit contributions
allo+0.496
 accuser+0.485
ctica+0.471
gio+0.461
omorphic+0.460
Neg logit contributions
chery-0.568
hens-0.521
 Nether-0.516
word-0.479
iron-0.469
Token Acc
Token ID6366
Feature activation+3.78e-01

Pos logit contributions
 accuser+12.665
allo+11.511
 Entered+11.456
lake+11.173
 Salv+11.127
Neg logit contributions
chery-11.766
hens-10.536
 Nether-9.690
iron-9.487
word-9.200
Tokenused
Token ID1484
Feature activation+1.77e+00

Pos logit contributions
 accuser+2.698
allo+2.649
ctica+2.395
gio+2.389
lake+2.381
Neg logit contributions
chery-6.052
 Nether-5.636
hens-5.613
word-5.233
iron-5.136
Token students
Token ID2444
Feature activation+9.37e-01

Pos logit contributions
 accuser+18.739
 sailor+14.985
 misconduct+14.840
 Entered+14.388
 Salv+14.371
Neg logit contributions
chery-16.350
hens-14.882
 Nether-14.122
argon-13.456
iron-13.214
Token find
Token ID1064
Feature activation+3.53e-01

Pos logit contributions
 accuser+10.421
allo+9.070
ctica+8.748
 Entered+8.637
968+8.628
Neg logit contributions
chery-10.386
hens-9.660
 Nether-9.199
iron-8.403
////////////////-8.403
Token sympathetic
Token ID22335
Feature activation+6.55e-01

Pos logit contributions
 accuser+4.030
allo+3.865
ctica+3.667
lake+3.599
gio+3.591
Neg logit contributions
chery-4.159
hens-3.901
 Nether-3.760
word-3.499
iron-3.430
Token ear
Token ID1027
Feature activation+5.39e-01

Pos logit contributions
 accuser+7.369
allo+6.516
lake+6.276
968+6.189
 Entered+6.149
Neg logit contributions
chery-7.858
hens-7.356
 Nether-7.047
iron-6.609
word-6.419
Token with
Token ID351
Feature activation+9.67e-01

Pos logit contributions
 accuser+5.509
allo+5.386
lake+5.066
ctica+5.031
gio+5.016
Neg logit contributions
chery-6.606
hens-6.389
 Nether-6.340
word-5.529
iron-5.524
Tokens
Token ID82
Feature activation+9.25e-01

Pos logit contributions
chery+1.518
hens+1.412
 Nether+1.386
word+1.301
iron+1.277
Neg logit contributions
allo-1.112
 accuser-1.085
ctica-1.053
omorphic-1.027
gio-1.023
Token a
Token ID257
Feature activation+5.71e-01

Pos logit contributions
allo+9.613
ctica+9.029
omorphic+9.028
 accuser+9.020
gio+8.698
Neg logit contributions
chery-10.277
hens-9.406
 Nether-9.251
iron-8.455
word-8.369
Token problem
Token ID1917
Feature activation+4.49e-01

Pos logit contributions
allo+5.651
 accuser+5.497
omorphic+5.248
ctica+5.225
968+5.133
Neg logit contributions
chery-7.350
hens-6.716
 Nether-6.547
iron-6.129
word-6.103
Token before
Token ID878
Feature activation+1.28e+00

Pos logit contributions
allo+4.730
 accuser+4.631
ctica+4.476
omorphic+4.420
968+4.376
Neg logit contributions
chery-5.468
hens-5.072
 Nether-5.030
word-4.620
iron-4.589
Token someone
Token ID2130
Feature activation+1.33e+00

Pos logit contributions
 accuser+11.468
allo+11.309
968+10.649
gio+10.452
 Salv+10.423
Neg logit contributions
chery-13.969
 Nether-12.551
hens-12.349
iron-11.682
argon-11.436
Token needs
Token ID2476
Feature activation+1.77e+00

Pos logit contributions
allo+11.960
 accuser+11.680
 restores+11.312
968+11.150
 harms+11.129
Neg logit contributions
chery-14.101
 Nether-12.696
hens-12.520
word-12.046
argon-11.697
Token the
Token ID262
Feature activation+1.24e+00

Pos logit contributions
allo+14.851
 accuser+13.634
dri+13.487
ctica+13.378
 restores+13.279
Neg logit contributions
chery-17.292
 Nether-15.329
hens-15.188
////////////////-13.849
word-13.846
Token data
Token ID1366
Feature activation+7.19e-01

Pos logit contributions
allo+11.905
 accuser+11.723
 restores+10.949
omorphic+10.916
dri+10.876
Neg logit contributions
chery-13.476
hens-12.255
 Nether-11.584
iron-10.900
argon-10.803
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
allo+7.097
 accuser+6.781
ctica+6.709
omorphic+6.560
968+6.430
Neg logit contributions
chery-8.711
 Nether-8.192
hens-8.120
word-7.314
iron-7.259
Token When
Token ID1649
Feature activation+6.71e-01

Pos logit contributions
allo+1.671
 accuser+1.628
ctica+1.588
gio+1.567
omorphic+1.543
Neg logit contributions
chery-1.768
hens-1.618
 Nether-1.587
word-1.476
iron-1.449
Token you
Token ID345
Feature activation+3.50e-01

Pos logit contributions
allo+6.999
 accuser+6.823
ctica+6.549
omorphic+6.516
968+6.389
Neg logit contributions
chery-7.968
hens-7.315
 Nether-7.184
word-6.638
iron-6.635
Token skillet
Token ID41306
Feature activation+6.51e-02

Pos logit contributions
 accuser+4.803
allo+4.593
lake+4.372
 stereo+3.952
968+3.925
Neg logit contributions
chery-13.465
hens-13.331
 Nether-12.674
word-12.318
 Worm-12.232
Token (
Token ID357
Feature activation-6.02e-01

Pos logit contributions
allo+0.742
 accuser+0.725
ctica+0.705
gio+0.693
omorphic+0.690
Neg logit contributions
chery-0.773
hens-0.709
 Nether-0.700
word-0.649
iron-0.633
Tokendon
Token ID9099
Feature activation+5.17e-01

Pos logit contributions
chery+7.658
hens+7.109
word+7.025
 Nether+6.730
iron+6.473
Neg logit contributions
 accuser-5.974
ctica-5.907
allo-5.780
omorphic-5.596
 Entered-5.477
Token
Token ID447
Feature activation-3.94e-01

Pos logit contributions
allo+4.652
ctica+4.255
 accuser+4.252
omorphic+4.246
gio+4.221
Neg logit contributions
chery-7.046
 Nether-6.671
hens-6.565
word-6.103
iron-5.967
Token
Token ID247
Feature activation-1.08e-02

Pos logit contributions
chery+5.064
hens+4.721
 Nether+4.625
word+4.446
iron+4.300
Neg logit contributions
 accuser-3.775
ctica-3.749
allo-3.706
omorphic-3.610
lake-3.548
Tokent
Token ID83
Feature activation+1.74e+00

Pos logit contributions
chery+0.116
hens+0.105
 Nether+0.104
word+0.096
iron+0.093
Neg logit contributions
allo-0.136
 accuser-0.134
ctica-0.131
gio-0.128
omorphic-0.128
Token crowd
Token ID4315
Feature activation-2.73e-01

Pos logit contributions
 accuser+7.695
968+7.402
allo+6.588
 lap+6.382
 stereo+6.340
Neg logit contributions
iron-23.519
hens-23.116
chery-22.431
argon-21.822
arks-21.602
Token the
Token ID262
Feature activation+5.94e-01

Pos logit contributions
chery+2.895
 Nether+2.601
hens+2.590
word+2.363
iron+2.241
Neg logit contributions
allo-3.469
 accuser-3.364
ctica-3.335
omorphic-3.272
gio-3.244
Token pan
Token ID3425
Feature activation-9.02e-01

Pos logit contributions
 accuser+3.432
allo+3.281
lake+2.898
ctica+2.870
omorphic+2.847
Neg logit contributions
chery-9.688
hens-9.327
 Nether-8.981
word-8.732
iron-8.673
Token;
Token ID26
Feature activation-8.64e-02

Pos logit contributions
chery+6.986
 Nether+6.076
hens+5.801
word+5.739
iron+5.295
Neg logit contributions
 millenn-12.532
��-12.491
allo-12.454
 accuser-12.447
ctica-12.358
Token only
Token ID691
Feature activation+2.09e-02

Pos logit contributions
chery+0.824
hens+0.730
 Nether+0.720
word+0.658
iron+0.632
Neg logit contributions
allo-1.194
 accuser-1.167
ctica-1.152
omorphic-1.128
gio-1.127
Token avail
Token ID29107
Feature activation+6.51e-01

Pos logit contributions
 accuser+5.216
allo+5.209
ctica+5.059
gio+4.945
968+4.844
Neg logit contributions
chery-6.665
hens-6.349
 Nether-6.142
word-5.683
iron-5.515
Token.
Token ID13
Feature activation+8.35e-01

Pos logit contributions
allo+5.796
 accuser+5.720
ctica+5.585
omorphic+5.509
968+5.296
Neg logit contributions
chery-8.302
hens-7.960
 Nether-7.715
word-7.184
iron-7.078
Token Ranger
Token ID21888
Feature activation+4.39e-01

Pos logit contributions
 Entered+8.042
allo+7.881
 accuser+7.825
968+7.649
ctica+7.649
Neg logit contributions
chery-9.769
hens-9.139
 Nether-8.505
iron-8.194
word-8.155
Token 6
Token ID718
Feature activation+5.52e-02

Pos logit contributions
allo+4.691
 accuser+4.644
ctica+4.533
968+4.422
gio+4.412
Neg logit contributions
chery-5.278
hens-4.807
 Nether-4.789
word-4.418
iron-4.354
Token crashed
Token ID14997
Feature activation+3.07e-01

Pos logit contributions
allo+0.598
 accuser+0.585
ctica+0.570
gio+0.558
omorphic+0.557
Neg logit contributions
chery-0.687
hens-0.632
 Nether-0.625
word-0.581
iron-0.568
Token into
Token ID656
Feature activation+1.74e+00

Pos logit contributions
allo+3.270
 accuser+3.263
ctica+3.153
omorphic+3.050
968+3.046
Neg logit contributions
chery-3.746
hens-3.474
 Nether-3.381
word-3.131
iron-3.059
Token the
Token ID262
Feature activation+1.47e+00

Pos logit contributions
ctica+15.255
 accuser+14.664
allo+14.385
 Vall+14.323
 Salv+14.259
Neg logit contributions
chery-16.137
hens-13.987
 Nether-13.318
basketball-12.968
word-12.964
Token lunar
Token ID25572
Feature activation+6.68e-01

Pos logit contributions
 accuser+14.338
allo+13.171
ctica+13.081
 Palmer+13.063
lake+13.011
Neg logit contributions
chery-14.060
hens-12.750
word-11.584
argon-11.514
 Nether-11.124
Token surface
Token ID4417
Feature activation+4.98e-01

Pos logit contributions
 accuser+6.462
allo+6.288
ctica+6.126
lake+6.017
omorphic+5.939
Neg logit contributions
chery-8.518
hens-7.789
 Nether-7.344
word-7.187
argon-6.825
Token at
Token ID379
Feature activation+4.27e-01

Pos logit contributions
 accuser+5.129
allo+5.034
ctica+4.854
lake+4.812
968+4.746
Neg logit contributions
chery-6.002
hens-5.647
 Nether-5.440
word-5.123
argon-4.932
Token 9
Token ID860
Feature activation+1.97e-02

Pos logit contributions
allo+4.675
 accuser+4.664
ctica+4.543
omorphic+4.414
968+4.409
Neg logit contributions
chery-4.966
hens-4.415
 Nether-4.403
word-4.129
iron-4.067
Token distributions
Token ID24570
Feature activation+6.72e-01

Pos logit contributions
allo+5.216
 accuser+5.153
omorphic+5.090
ctica+4.983
gio+4.890
Neg logit contributions
chery-5.956
hens-5.372
 Nether-5.286
word-4.928
iron-4.908
Token in
Token ID287
Feature activation+9.68e-01

Pos logit contributions
allo+6.559
 accuser+6.492
968+6.204
omorphic+6.151
gio+6.036
Neg logit contributions
chery-8.183
hens-7.611
 Nether-7.418
iron-6.966
argon-6.831
Token multiple
Token ID3294
Feature activation+8.90e-01

Pos logit contributions
omorphic+8.424
allo+8.365
 accuser+8.275
ctica+7.955
gio+7.900
Neg logit contributions
chery-11.972
hens-10.859
 Nether-10.621
iron-10.236
word-9.931
Token versions
Token ID6300
Feature activation+6.69e-01

Pos logit contributions
 accuser+7.846
omorphic+7.725
allo+7.676
ctica+7.397
lake+7.386
Neg logit contributions
chery-11.189
hens-10.026
 Nether-9.735
word-9.207
iron-9.195
Token,
Token ID11
Feature activation+6.97e-01

Pos logit contributions
allo+6.234
 accuser+6.150
omorphic+5.859
ctica+5.839
gio+5.811
Neg logit contributions
chery-8.367
hens-7.735
 Nether-7.597
iron-7.185
word-7.150
Token each
Token ID1123
Feature activation+1.74e+00

Pos logit contributions
allo+7.051
 accuser+6.955
omorphic+6.734
ctica+6.542
968+6.510
Neg logit contributions
chery-8.275
hens-7.650
 Nether-7.346
word-6.938
iron-6.920
Token with
Token ID351
Feature activation+1.07e+00

Pos logit contributions
 accuser+14.636
omorphic+14.066
allo+13.488
 Entered+13.255
968+13.192
Neg logit contributions
chery-15.616
hens-15.451
 Nether-14.521
iron-13.852
argon-13.595
Token their
Token ID511
Feature activation+1.03e+00

Pos logit contributions
 accuser+9.708
allo+9.570
omorphic+9.290
ctica+9.023
968+8.948
Neg logit contributions
chery-11.903
hens-11.182
 Nether-10.651
iron-10.583
word-10.252
Token own
Token ID898
Feature activation+8.05e-01

Pos logit contributions
 accuser+7.893
allo+7.730
omorphic+7.622
968+7.176
dri+7.111
Neg logit contributions
chery-13.607
hens-12.904
 Nether-12.145
iron-11.979
word-11.817
Token versions
Token ID6300
Feature activation+6.83e-01

Pos logit contributions
 accuser+8.350
allo+8.335
omorphic+8.118
ctica+7.907
lake+7.776
Neg logit contributions
chery-9.095
hens-8.247
 Nether-7.791
iron-7.657
word-7.486
Token of
Token ID286
Feature activation+9.30e-01

Pos logit contributions
allo+6.225
 accuser+5.946
ctica+5.929
gio+5.756
omorphic+5.743
Neg logit contributions
chery-8.644
 Nether-8.193
hens-8.167
iron-7.595
word-7.429
Token 9
Token ID860
Feature activation-9.75e-02

Pos logit contributions
allo+3.260
 accuser+3.195
gio+3.064
ctica+3.050
968+2.997
Neg logit contributions
chery-3.501
hens-3.158
 Nether-3.112
word-2.928
iron-2.873
Tokenth
Token ID400
Feature activation+3.42e-01

Pos logit contributions
chery+1.022
hens+0.929
 Nether+0.916
word+0.839
iron+0.819
Neg logit contributions
allo-1.245
 accuser-1.216
ctica-1.199
gio-1.173
omorphic-1.171
Token S
Token ID311
Feature activation-5.36e-01

Pos logit contributions
allo+3.780
 accuser+3.745
ctica+3.542
gio+3.528
lake+3.505
Neg logit contributions
chery-4.082
hens-3.692
 Nether-3.641
word-3.387
iron-3.335
TokenPA
Token ID4537
Feature activation+3.96e-01

Pos logit contributions
chery+7.565
hens+7.250
 Nether+6.799
word+6.551
iron+6.496
Neg logit contributions
 accuser-4.527
allo-4.432
ctica-4.349
omorphic-4.285
 Entered-4.162
Token.
Token ID13
Feature activation+3.03e-01

Pos logit contributions
allo+4.450
 accuser+4.329
gio+4.182
ctica+4.140
omorphic+4.110
Neg logit contributions
chery-4.627
hens-4.182
 Nether-4.137
word-3.868
iron-3.775
Token [
Token ID685
Feature activation+1.74e+00

Pos logit contributions
allo+2.949
 accuser+2.825
ctica+2.777
gio+2.737
968+2.733
Neg logit contributions
chery-4.048
hens-3.716
 Nether-3.631
word-3.443
iron-3.402
Token59
Token ID3270
Feature activation+7.01e-01

Pos logit contributions
allo+11.379
gio+10.463
lake+10.192
ctica+10.014
968+9.774
Neg logit contributions
chery-19.121
 Nether-18.617
hens-17.854
 BT-16.924
 Worm-16.839
Token]
Token ID60
Feature activation+4.71e-01

Pos logit contributions
allo+3.098
 accuser+2.806
968+2.732
lake+2.651
gio+2.624
Neg logit contributions
chery-12.463
hens-11.457
 Nether-11.387
iron-10.909
word-10.854
Token Over
Token ID3827
Feature activation+9.62e-01

Pos logit contributions
allo+4.613
 accuser+4.393
968+4.326
gio+4.308
ctica+4.306
Neg logit contributions
chery-6.088
hens-5.569
 Nether-5.358
word-5.143
iron-5.075
Token 20
Token ID1160
Feature activation+2.51e-01

Pos logit contributions
allo+8.712
ctica+8.357
lake+8.350
 accuser+8.242
dri+8.177
Neg logit contributions
chery-11.556
 Nether-10.471
hens-10.221
iron-9.404
argon-9.357
Token%
Token ID4
Feature activation+3.18e-01

Pos logit contributions
allo+2.661
 accuser+2.629
ctica+2.536
gio+2.490
968+2.483
Neg logit contributions
chery-3.135
hens-2.868
 Nether-2.832
word-2.634
iron-2.570
Tokencl
Token ID565
Feature activation+2.84e-01

Pos logit contributions
allo+13.416
dri+12.969
lake+12.779
latable+12.486
onte+12.462
Neg logit contributions
chery-7.934
 Nether-7.178
hens-6.933
 Ukrainians-5.711
argon-5.612
Tokenipping
Token ID4501
Feature activation+9.82e-01

Pos logit contributions
allo+2.878
 accuser+2.725
ctica+2.701
gio+2.699
omorphic+2.679
Neg logit contributions
chery-3.639
 Nether-3.395
hens-3.380
word-3.083
iron-3.036
Token the
Token ID262
Feature activation+1.37e+00

Pos logit contributions
allo+8.702
 accuser+8.666
dri+8.216
968+8.142
gio+7.885
Neg logit contributions
chery-11.882
hens-10.910
 Nether-10.537
word-10.285
argon-9.685
Token plastic
Token ID7309
Feature activation+1.24e+00

Pos logit contributions
 accuser+13.047
 stereo+12.440
allo+11.900
dri+11.715
968+11.653
Neg logit contributions
chery-14.108
hens-12.655
 Nether-12.311
word-12.106
argon-11.535
Token cover
Token ID3002
Feature activation+3.19e-02

Pos logit contributions
 accuser+11.054
allo+10.757
 stereo+10.167
968+10.108
gio+9.890
Neg logit contributions
chery-13.607
hens-12.610
 Nether-12.360
word-11.768
argon-11.322
Token surrounding
Token ID7346
Feature activation+1.73e+00

Pos logit contributions
allo+0.367
 accuser+0.358
ctica+0.350
gio+0.343
omorphic+0.342
Neg logit contributions
chery-0.377
hens-0.345
 Nether-0.341
word-0.315
iron-0.308
Token the
Token ID262
Feature activation+1.69e+00

Pos logit contributions
 accuser+12.704
 stereo+12.163
allo+12.101
dri+11.957
968+11.840
Neg logit contributions
chery-18.086
hens-16.575
 Nether-16.171
word-15.257
argon-14.714
Token radio
Token ID5243
Feature activation+7.54e-01

Pos logit contributions
 accuser+15.433
 stereo+15.220
968+13.873
dri+13.861
 infl+13.691
Neg logit contributions
chery-16.333
hens-14.535
 Nether-14.301
word-13.618
argon-13.339
Token.
Token ID13
Feature activation+8.08e-02

Pos logit contributions
 accuser+7.645
allo+7.583
968+7.424
lake+7.333
dri+7.287
Neg logit contributions
chery-8.685
hens-7.968
 Nether-7.893
word-7.141
argon-7.008
Token Start
Token ID7253
Feature activation+1.92e-01

Pos logit contributions
allo+0.884
 accuser+0.858
ctica+0.839
gio+0.824
omorphic+0.816
Neg logit contributions
chery-0.999
hens-0.917
 Nether-0.904
word-0.841
iron-0.822
Token from
Token ID422
Feature activation+2.91e-01

Pos logit contributions
allo+2.001
 accuser+1.953
ctica+1.883
omorphic+1.858
gio+1.856
Neg logit contributions
chery-2.447
hens-2.264
 Nether-2.235
word-2.066
iron-2.023
Token
Token ID1399
Feature activation+6.06e-01

Pos logit contributions
allo+0.573
 accuser+0.560
ctica+0.547
omorphic+0.535
gio+0.535
Neg logit contributions
chery-0.626
hens-0.575
 Nether-0.568
word-0.526
iron-0.516
Token
Token ID447
Feature activation-3.95e-01

Pos logit contributions
allo+6.238
 accuser+6.146
ctica+6.023
omorphic+5.985
dri+5.788
Neg logit contributions
chery-7.197
hens-6.519
 Nether-6.507
iron-5.888
word-5.779
Token
Token ID251
Feature activation+1.00e-01

Pos logit contributions
chery+5.579
hens+5.258
 Nether+5.140
word+4.943
iron+4.866
Neg logit contributions
allo-3.361
 accuser-3.346
ctica-3.239
omorphic-3.127
968-3.088
Token (
Token ID357
Feature activation+4.90e-01

Pos logit contributions
allo+1.138
 accuser+1.112
ctica+1.088
gio+1.063
omorphic+1.060
Neg logit contributions
chery-1.191
hens-1.084
 Nether-1.078
word-0.996
iron-0.975
TokenNa
Token ID26705
Feature activation+4.06e-01

Pos logit contributions
allo+3.650
 accuser+3.368
968+3.271
ctica+3.271
gio+3.223
Neg logit contributions
chery-7.471
 Nether-7.049
hens-6.988
word-6.490
 Worm-6.324
Tokenof
Token ID1659
Feature activation+1.73e+00

Pos logit contributions
allo+4.124
gio+3.702
968+3.700
lake+3.664
omorphic+3.660
Neg logit contributions
chery-5.356
 Nether-5.021
hens-4.851
word-4.397
 Chern-4.317
Tokenumi
Token ID12994
Feature activation+3.87e-01

Pos logit contributions
umi+10.950
allo+10.653
algia+9.159
gio+9.142
umar+8.905
Neg logit contributions
 Nether-21.565
chery-21.035
hens-20.766
 comedians-19.919
 Corn-19.454
Token)
Token ID8
Feature activation+1.93e-01

Pos logit contributions
allo+3.440
 accuser+3.312
968+3.241
gio+3.229
ctica+3.184
Neg logit contributions
chery-5.397
hens-4.949
 Nether-4.924
word-4.580
iron-4.510
Token
Token ID198
Feature activation+4.90e-02

Pos logit contributions
allo+1.983
 accuser+1.909
ctica+1.869
gio+1.855
968+1.834
Neg logit contributions
chery-2.480
hens-2.281
 Nether-2.270
word-2.093
iron-2.063
Token
Token ID198
Feature activation+4.84e-01

Pos logit contributions
allo+0.531
 accuser+0.516
ctica+0.504
gio+0.495
omorphic+0.491
Neg logit contributions
chery-0.610
hens-0.562
 Nether-0.556
word-0.515
iron-0.505
Token
Token ID447
Feature activation+6.16e-01

Pos logit contributions
allo+4.669
 accuser+4.369
ctica+4.324
968+4.324
dri+4.266
Neg logit contributions
chery-6.256
 Nether-5.900
hens-5.764
word-5.255
iron-5.178
Token live
Token ID2107
Feature activation+6.23e-01

Pos logit contributions
allo+4.030
 accuser+3.941
ctica+3.804
omorphic+3.766
gio+3.710
Neg logit contributions
chery-4.444
hens-4.121
 Nether-4.085
word-3.784
iron-3.714
Tokenliest
Token ID11318
Feature activation+6.10e-01

Pos logit contributions
allo+6.900
ctica+6.473
omorphic+6.409
 accuser+6.376
gio+6.354
Neg logit contributions
chery-6.985
 Nether-6.571
hens-6.466
word-5.870
argon-5.702
Token.
Token ID13
Feature activation+5.53e-01

Pos logit contributions
allo+5.824
 accuser+5.646
ctica+5.468
omorphic+5.412
gio+5.304
Neg logit contributions
chery-7.716
 Nether-7.244
hens-7.228
word-6.632
iron-6.580
Token The
Token ID383
Feature activation+4.34e-01

Pos logit contributions
allo+5.377
 Entered+5.083
 accuser+5.077
gio+5.074
ctica+5.042
Neg logit contributions
chery-7.136
hens-6.506
 Nether-6.257
word-5.985
iron-5.956
Token sax
Token ID46909
Feature activation+1.57e+00

Pos logit contributions
 accuser+4.690
allo+4.684
ctica+4.416
gio+4.398
omorphic+4.353
Neg logit contributions
chery-5.251
hens-4.751
 Nether-4.726
word-4.336
iron-4.306
Tokenophon
Token ID48982
Feature activation+1.72e+00

Pos logit contributions
allo+12.105
ologist+11.933
 accuser+11.150
omorphic+10.885
onte+10.731
Neg logit contributions
chery-17.398
 Nether-16.556
hens-15.992
 Onion-14.621
 IDF-14.439
Tokenist
Token ID396
Feature activation+1.13e+00

Pos logit contributions
allo+10.798
ologist+10.585
 accuser+9.452
onte+9.065
ala+9.038
Neg logit contributions
chery-21.139
 Nether-20.056
hens-20.053
 Onion-18.389
 Corn-17.877
Token considered
Token ID3177
Feature activation+2.09e-01

Pos logit contributions
 accuser+10.291
allo+9.787
omorphic+9.305
ologist+9.253
gio+9.180
Neg logit contributions
chery-12.672
hens-11.863
 Nether-11.491
argon-10.654
word-10.500
Token Performance
Token ID15193
Feature activation+8.50e-01

Pos logit contributions
allo+2.374
 accuser+2.304
ctica+2.257
gio+2.216
omorphic+2.194
Neg logit contributions
chery-2.473
hens-2.280
 Nether-2.246
word-2.082
iron-2.048
Token 3
Token ID513
Feature activation+5.55e-01

Pos logit contributions
allo+7.925
 accuser+7.639
gio+7.552
ctica+7.546
 Entered+7.323
Neg logit contributions
chery-10.269
hens-9.360
 Nether-9.241
word-8.665
iron-8.597
Token to
Token ID284
Feature activation+2.07e-01

Pos logit contributions
allo+5.885
 accuser+5.632
ctica+5.598
omorphic+5.510
gio+5.419
Neg logit contributions
chery-6.557
 Nether-6.123
hens-6.074
word-5.599
iron-5.493
Tokenipping
Token ID4501
Feature activation+9.82e-01

Pos logit contributions
allo+2.878
 accuser+2.725
ctica+2.701
gio+2.699
omorphic+2.679
Neg logit contributions
chery-3.639
 Nether-3.395
hens-3.380
word-3.083
iron-3.036
Token the
Token ID262
Feature activation+1.37e+00

Pos logit contributions
allo+8.702
 accuser+8.666
dri+8.216
968+8.142
gio+7.885
Neg logit contributions
chery-11.882
hens-10.910
 Nether-10.537
word-10.285
argon-9.685
Token plastic
Token ID7309
Feature activation+1.24e+00

Pos logit contributions
 accuser+13.047
 stereo+12.440
allo+11.900
dri+11.715
968+11.653
Neg logit contributions
chery-14.108
hens-12.655
 Nether-12.311
word-12.106
argon-11.535
Token cover
Token ID3002
Feature activation+3.19e-02

Pos logit contributions
 accuser+11.054
allo+10.757
 stereo+10.167
968+10.108
gio+9.890
Neg logit contributions
chery-13.607
hens-12.610
 Nether-12.360
word-11.768
argon-11.322
Token surrounding
Token ID7346
Feature activation+1.73e+00

Pos logit contributions
allo+0.367
 accuser+0.358
ctica+0.350
gio+0.343
omorphic+0.342
Neg logit contributions
chery-0.377
hens-0.345
 Nether-0.341
word-0.315
iron-0.308
Token the
Token ID262
Feature activation+1.69e+00

Pos logit contributions
 accuser+12.704
 stereo+12.163
allo+12.101
dri+11.957
968+11.840
Neg logit contributions
chery-18.086
hens-16.575
 Nether-16.171
word-15.257
argon-14.714
Token radio
Token ID5243
Feature activation+7.54e-01

Pos logit contributions
 accuser+15.433
 stereo+15.220
968+13.873
dri+13.861
 infl+13.691
Neg logit contributions
chery-16.333
hens-14.535
 Nether-14.301
word-13.618
argon-13.339
Token.
Token ID13
Feature activation+8.08e-02

Pos logit contributions
 accuser+7.645
allo+7.583
968+7.424
lake+7.333
dri+7.287
Neg logit contributions
chery-8.685
hens-7.968
 Nether-7.893
word-7.141
argon-7.008
Token Start
Token ID7253
Feature activation+1.92e-01

Pos logit contributions
allo+0.884
 accuser+0.858
ctica+0.839
gio+0.824
omorphic+0.816
Neg logit contributions
chery-0.999
hens-0.917
 Nether-0.904
word-0.841
iron-0.822
Token from
Token ID422
Feature activation+2.91e-01

Pos logit contributions
allo+2.001
 accuser+1.953
ctica+1.883
omorphic+1.858
gio+1.856
Neg logit contributions
chery-2.447
hens-2.264
 Nether-2.235
word-2.066
iron-2.023
Token the
Token ID262
Feature activation+5.87e-01

Pos logit contributions
allo+3.145
 accuser+3.098
ctica+2.992
omorphic+2.934
gio+2.923
Neg logit contributions
chery-3.524
hens-3.263
 Nether-3.206
word-2.979
iron-2.908
Token discover
Token ID7073
Feature activation+3.82e-01

Pos logit contributions
chery+2.410
 Nether+2.170
hens+2.150
word+1.956
iron+1.885
Neg logit contributions
allo-3.124
 accuser-2.983
ctica-2.978
gio-2.941
omorphic-2.921
Token his
Token ID465
Feature activation+1.14e+00

Pos logit contributions
 accuser+4.363
allo+4.313
ctica+4.065
omorphic+4.014
gio+3.983
Neg logit contributions
chery-4.357
hens-4.063
 Nether-3.935
word-3.672
iron-3.628
Token $
Token ID720
Feature activation+4.30e-01

Pos logit contributions
 accuser+10.886
 stereo+9.610
 Salv+9.333
 wrecked+9.304
allo+9.299
Neg logit contributions
chery-13.602
hens-12.697
argon-11.576
 Nether-11.357
word-11.295
Token2
Token ID17
Feature activation+1.60e+00

Pos logit contributions
allo+5.315
968+5.269
 accuser+5.093
lake+5.024
gio+5.023
Neg logit contributions
chery-4.426
hens-4.075
 Nether-3.957
word-3.485
iron-3.455
Token,
Token ID11
Feature activation+3.67e-01

Pos logit contributions
 accuser+11.938
allo+11.508
968+11.089
 stereo+11.076
lake+10.843
Neg logit contributions
chery-17.666
hens-16.702
 Nether-15.658
argon-15.318
word-14.703
Token200
Token ID2167
Feature activation+1.69e+00

Pos logit contributions
allo+4.257
968+4.149
 accuser+4.123
ctica+4.016
gio+3.969
Neg logit contributions
chery-4.126
hens-3.840
 Nether-3.728
word-3.367
iron-3.358
Token vintage
Token ID20276
Feature activation+7.50e-01

Pos logit contributions
 accuser+15.494
 stereo+14.694
 Suzuki+14.644
 Salv+14.369
 Triumph+14.005
Neg logit contributions
chery-16.434
hens-15.113
argon-13.952
 Nether-13.449
word-12.847
Token bicycle
Token ID17026
Feature activation+2.44e-01

Pos logit contributions
 accuser+7.944
allo+7.712
lake+7.280
 stereo+7.276
968+7.238
Neg logit contributions
chery-8.854
hens-7.911
 Nether-7.489
word-7.185
argon-7.156
Token had
Token ID550
Feature activation+1.06e-01

Pos logit contributions
allo+2.363
 accuser+2.362
ctica+2.227
gio+2.179
omorphic+2.170
Neg logit contributions
chery-3.235
hens-3.044
 Nether-2.962
word-2.763
iron-2.725
Token been
Token ID587
Feature activation+4.17e-01

Pos logit contributions
allo+0.958
 accuser+0.932
ctica+0.899
gio+0.880
omorphic+0.876
Neg logit contributions
chery-1.502
hens-1.401
 Nether-1.382
word-1.300
iron-1.279
Token stolen
Token ID9909
Feature activation+4.40e-01

Pos logit contributions
allo+3.218
 accuser+3.205
ctica+3.002
omorphic+2.925
gio+2.906
Neg logit contributions
chery-6.188
hens-5.868
 Nether-5.745
iron-5.356
word-5.328
Token successful
Token ID4388
Feature activation+6.49e-01

Pos logit contributions
allo+3.270
 accuser+3.215
ctica+3.077
omorphic+3.064
gio+3.032
Neg logit contributions
chery-3.755
hens-3.414
 Nether-3.394
word-3.144
iron-3.103
Token performance
Token ID2854
Feature activation+4.29e-01

Pos logit contributions
allo+7.069
 accuser+6.981
ctica+6.562
gio+6.541
 Entered+6.438
Neg logit contributions
chery-7.341
hens-6.869
 Nether-6.737
iron-6.214
word-6.127
Token,
Token ID11
Feature activation+4.02e-01

Pos logit contributions
allo+4.461
 accuser+4.336
ctica+4.219
gio+4.153
omorphic+4.086
Neg logit contributions
chery-5.324
hens-4.912
 Nether-4.874
word-4.488
iron-4.457
Token while
Token ID981
Feature activation+6.46e-01

Pos logit contributions
allo+4.358
 accuser+4.294
ctica+4.125
omorphic+4.063
968+4.021
Neg logit contributions
chery-4.789
hens-4.418
 Nether-4.346
word-4.041
iron-3.930
Token the
Token ID262
Feature activation+7.65e-01

Pos logit contributions
allo+6.542
 accuser+6.440
ctica+6.087
gio+6.008
omorphic+5.967
Neg logit contributions
chery-7.689
hens-7.195
 Nether-7.084
word-6.585
iron-6.540
Token sax
Token ID46909
Feature activation+1.69e+00

Pos logit contributions
 accuser+7.707
allo+7.469
gio+6.945
ctica+6.928
omorphic+6.887
Neg logit contributions
chery-9.229
 Nether-8.431
hens-8.417
word-7.711
iron-7.657
Tokenophon
Token ID48982
Feature activation+2.00e+00

Pos logit contributions
allo+12.846
ologist+12.811
 accuser+11.937
onte+11.582
omorphic+11.462
Neg logit contributions
chery-18.277
 Nether-16.998
hens-16.783
 Onion-15.132
iron-15.096
Tokenist
Token ID396
Feature activation+1.03e+00

Pos logit contributions
allo+11.308
ologist+11.085
ist+9.767
onte+9.404
ala+9.318
Neg logit contributions
chery-24.077
hens-23.134
 Nether-22.869
 Onion-21.551
 Ukrainians-21.226
Token liked
Token ID8288
Feature activation+3.04e-01

Pos logit contributions
 accuser+9.852
allo+9.357
968+8.872
gio+8.818
omorphic+8.797
Neg logit contributions
chery-11.450
hens-10.787
 Nether-10.424
word-9.679
argon-9.646
Token this
Token ID428
Feature activation+1.51e-01

Pos logit contributions
allo+3.298
 accuser+3.158
ctica+3.108
gio+3.060
omorphic+3.017
Neg logit contributions
chery-3.718
hens-3.443
 Nether-3.404
word-3.172
iron-3.110
Token performance
Token ID2854
Feature activation+2.88e-01

Pos logit contributions
allo+1.531
 accuser+1.495
ctica+1.445
gio+1.421
omorphic+1.406
Neg logit contributions
chery-1.977
hens-1.841
 Nether-1.819
word-1.692
iron-1.666
Token Author
Token ID6434
Feature activation-2.44e-01

Pos logit contributions
 Nether+10.355
chery+10.205
hens+9.382
word+9.149
 BT+8.856
Neg logit contributions
allo-9.734
ctica-9.601
 accuser-9.224
omorphic-9.188
gio-9.157
Token of
Token ID286
Feature activation-7.38e-01

Pos logit contributions
chery+2.769
hens+2.518
 Nether+2.497
word+2.325
iron+2.247
Neg logit contributions
allo-2.888
 accuser-2.825
ctica-2.806
omorphic-2.730
gio-2.694
Token The
Token ID383
Feature activation-4.01e-01

Pos logit contributions
chery+7.385
 Nether+7.096
hens+6.677
word+6.274
iron+6.072
Neg logit contributions
 accuser-8.328
allo-8.313
ctica-8.196
gio-8.093
968-8.056
Token Far
Token ID6755
Feature activation-7.91e-01

Pos logit contributions
chery+4.243
 Nether+4.031
hens+3.915
word+3.580
iron+3.505
Neg logit contributions
allo-4.856
 accuser-4.808
ctica-4.738
omorphic-4.694
bably-4.583
Tokenrah
Token ID11392
Feature activation-1.07e+00

Pos logit contributions
chery+7.335
hens+6.738
 Nether+6.474
word+6.400
iron+6.340
Neg logit contributions
 accuser-10.128
ctica-9.622
-9.506
allo-9.342
omorphic-9.325
Token W
Token ID370
Feature activation-2.50e+00

Pos logit contributions
 Nether+6.007
chery+5.778
 Worm+5.350
word+4.968
hens+4.807
Neg logit contributions
ctica-16.181
allo-16.162
 accuser-16.021
bably-15.848
��-15.759
Tokenether
Token ID6750
Feature activation-7.08e-02

Pos logit contributions
ether+20.180
 Nether+16.921
chery+16.801
hens+15.489
hen+15.479
Neg logit contributions
-24.493
 seiz-23.289
 millenn-22.623
 accuser-22.417
 Entered-22.296
Tokens
Token ID82
Feature activation-6.19e-01

Pos logit contributions
chery+0.974
hens+0.906
 Nether+0.891
word+0.841
iron+0.823
Neg logit contributions
allo-0.675
 accuser-0.663
ctica-0.644
omorphic-0.626
gio-0.622
Token Mysteries
Token ID28298
Feature activation-9.81e-01

Pos logit contributions
chery+5.499
 Nether+5.295
hens+5.231
word+4.602
 Worm+4.407
Neg logit contributions
 accuser-8.116
allo-7.962
ctica-7.846
bably-7.762
omorphic-7.750
Token,
Token ID11
Feature activation-1.27e+00

Pos logit contributions
chery+7.401
 Nether+6.532
hens+6.277
word+6.135
 BT+5.818
Neg logit contributions
��-13.563
bably-13.195
allo-13.169
 accuser-13.056
 millenn-12.996
Token comic
Token ID9048
Feature activation-7.40e-01

Pos logit contributions
chery+3.933
 comic+3.550
 Nether+3.426
 comics+2.953
 BT+2.450
Neg logit contributions
-21.083
olics-21.071
ctica-21.069
allo-20.874
bably-20.756
Token to
Token ID284
Feature activation-6.82e-01

Pos logit contributions
chery+3.802
hens+3.457
 Nether+3.383
word+3.150
iron+3.056
Neg logit contributions
allo-4.405
 accuser-4.306
ctica-4.197
gio-4.139
968-4.102
Token engage
Token ID8209
Feature activation+1.83e-02

Pos logit contributions
chery+6.560
 Nether+5.833
hens+5.776
word+5.529
iron+5.093
Neg logit contributions
allo-8.470
 accuser-8.340
ctica-8.194
dri-8.006
gio-8.000
Token in
Token ID287
Feature activation-9.02e-01

Pos logit contributions
allo+0.195
 accuser+0.190
ctica+0.185
gio+0.181
omorphic+0.181
Neg logit contributions
chery-0.230
hens-0.212
 Nether-0.210
word-0.195
iron-0.191
Token an
Token ID281
Feature activation-1.11e+00

Pos logit contributions
chery+7.292
hens+6.313
 Nether+6.017
word+5.550
iron+5.194
Neg logit contributions
allo-12.279
 accuser-11.666
ctica-11.607
lake-11.596
gio-11.583
Token open
Token ID1280
Feature activation-1.29e+00

Pos logit contributions
chery+8.608
hens+7.653
 Nether+7.200
word+6.775
iron+6.522
Neg logit contributions
ctica-14.019
-13.832
allo-13.747
gio-13.655
bably-13.438
Token and
Token ID290
Feature activation-2.38e+00

Pos logit contributions
chery+7.676
hens+6.437
 Nether+6.253
word+5.989
 calculation+5.609
Neg logit contributions
allo-18.480
ctica-18.227
-17.933
gio-17.867
bably-17.344
Token straightforward
Token ID15836
Feature activation-1.45e+00

Pos logit contributions
chery+11.225
 simple+10.366
hens+9.650
 reasoned+9.327
 forth+8.849
Neg logit contributions
lake-27.582
allo-26.388
-26.377
ctica-26.264
ット-25.526
Token communication
Token ID6946
Feature activation-4.56e-01

Pos logit contributions
chery+8.807
 Nether+7.197
 calculation+6.969
hens+6.753
word+6.654
Neg logit contributions
ctica-19.471
allo-19.391
-19.257
bably-18.816
gio-18.808
Token with
Token ID351
Feature activation+1.33e-01

Pos logit contributions
chery+5.046
hens+4.514
 Nether+4.467
word+4.149
iron+3.980
Neg logit contributions
allo-5.417
 accuser-5.233
gio-5.216
ctica-5.208
omorphic-5.049
Token staff
Token ID3085
Feature activation+5.58e-02

Pos logit contributions
allo+1.375
 accuser+1.351
ctica+1.298
omorphic+1.272
gio+1.267
Neg logit contributions
chery-1.725
hens-1.584
 Nether-1.569
word-1.461
iron-1.448
Token,
Token ID11
Feature activation+8.35e-02

Pos logit contributions
allo+0.622
 accuser+0.608
ctica+0.593
gio+0.580
omorphic+0.580
Neg logit contributions
chery-0.676
hens-0.620
 Nether-0.614
word-0.568
iron-0.556
Token?
Token ID30
Feature activation+1.84e-01

Pos logit contributions
allo+2.232
 accuser+2.186
ctica+2.115
gio+2.078
omorphic+2.071
Neg logit contributions
chery-2.635
hens-2.435
 Nether-2.415
word-2.252
iron-2.201
Token
Token ID198
Feature activation-1.50e-01

Pos logit contributions
allo+1.930
 accuser+1.874
ctica+1.832
gio+1.806
omorphic+1.784
Neg logit contributions
chery-2.328
hens-2.143
 Nether-2.108
word-1.982
iron-1.941
Token
Token ID198
Feature activation-1.76e-02

Pos logit contributions
chery+1.736
hens+1.581
 Nether+1.564
word+1.456
iron+1.412
Neg logit contributions
allo-1.759
 accuser-1.739
ctica-1.701
omorphic-1.672
gio-1.642
TokenJay
Token ID30568
Feature activation-5.08e-01

Pos logit contributions
chery+0.178
hens+0.160
 Nether+0.157
word+0.144
iron+0.140
Neg logit contributions
allo-0.232
 accuser-0.227
ctica-0.223
gio-0.219
omorphic-0.219
Token C
Token ID327
Feature activation-2.69e-01

Pos logit contributions
chery+3.791
 Nether+3.443
hens+3.309
 Corn+2.825
word+2.807
Neg logit contributions
allo-7.862
omorphic-7.602
bably-7.578
ctica-7.554
 accuser-7.545
Tokenz
Token ID89
Feature activation-2.06e+00

Pos logit contributions
chery+2.332
hens+2.118
 Nether+1.905
word+1.871
iron+1.804
Neg logit contributions
 accuser-3.935
allo-3.888
ctica-3.804
omorphic-3.720
gio-3.705
Tokenark
Token ID668
Feature activation-3.92e-01

Pos logit contributions
arks+12.948
chery+12.114
iron+11.386
ark+11.096
word+10.835
Neg logit contributions
 millenn-27.806
 proport-25.711
��-25.325
 therap-25.027
 destro-24.816
Tokenowski
Token ID12079
Feature activation+7.69e-02

Pos logit contributions
chery+3.854
hens+3.464
 Nether+3.267
word+3.224
iron+2.991
Neg logit contributions
 accuser-5.195
allo-5.090
ctica-4.961
omorphic-4.854
gio-4.848
Token:
Token ID25
Feature activation+4.09e-01

Pos logit contributions
allo+0.820
 accuser+0.799
ctica+0.778
gio+0.766
omorphic+0.756
Neg logit contributions
chery-0.967
hens-0.890
 Nether-0.882
word-0.818
iron-0.803
Token I
Token ID314
Feature activation-3.92e-01

Pos logit contributions
allo+4.173
 accuser+4.026
gio+3.908
968+3.893
 Entered+3.864
Neg logit contributions
chery-5.121
hens-4.704
 Nether-4.534
word-4.360
iron-4.294
Token would
Token ID561
Feature activation-2.86e-01

Pos logit contributions
chery+4.056
hens+3.613
 Nether+3.473
word+3.279
iron+3.149
Neg logit contributions
allo-4.968
 accuser-4.853
ctica-4.728
omorphic-4.676
gio-4.674
Token will
Token ID481
Feature activation-6.20e-01

Pos logit contributions
chery+10.833
 Nether+9.704
hens+9.662
word+9.121
 Corn+8.759
Neg logit contributions
allo-12.009
ctica-11.759
gio-11.326
 accuser-11.233
bably-11.093
Token open
Token ID1280
Feature activation-1.00e+00

Pos logit contributions
chery+6.401
hens+5.807
 Nether+5.769
word+5.271
iron+5.131
Neg logit contributions
 accuser-7.362
allo-7.311
ctica-7.274
gio-7.103
omorphic-7.020
Token new
Token ID649
Feature activation-4.93e-01

Pos logit contributions
chery+8.835
 Nether+7.993
hens+7.607
word+7.142
arks+6.836
Neg logit contributions
allo-12.317
 accuser-12.214
ctica-11.824
gio-11.776
968-11.670
Token store
Token ID3650
Feature activation-1.02e+00

Pos logit contributions
chery+5.779
 Nether+5.293
hens+5.185
word+4.857
 BT+4.632
Neg logit contributions
allo-5.387
 accuser-5.276
ctica-5.224
gio-5.148
omorphic-5.018
Token at
Token ID379
Feature activation-9.70e-01

Pos logit contributions
chery+8.680
 Nether+7.247
hens+7.231
 BT+6.486
iron+6.461
Neg logit contributions
allo-13.201
ctica-12.768
gio-12.669
 accuser-12.595
omorphic-12.126
Token Five
Token ID10579
Feature activation-2.00e+00

Pos logit contributions
chery+9.692
 Nether+9.106
hens+8.996
word+8.075
 Corn+7.934
Neg logit contributions
allo-10.708
 accuser-10.664
968-10.295
ctica-10.283
dri-10.126
Token Corn
Token ID11424
Feature activation-5.21e-01

Pos logit contributions
 Corn+13.900
 Nether+10.930
hens+9.798
chery+9.309
 Worm+8.941
Neg logit contributions
 satell-25.945
��-25.850
 destro-25.722
bably-25.078
 proport-25.049
Tokeners
Token ID364
Feature activation-1.28e+00

Pos logit contributions
chery+5.134
hens+4.711
 Nether+4.434
iron+4.077
word+4.067
Neg logit contributions
 accuser-6.843
allo-6.729
ctica-6.681
omorphic-6.464
gio-6.402
Token in
Token ID287
Feature activation-1.53e+00

Pos logit contributions
chery+10.463
 Nether+9.364
hens+9.285
 Corn+8.413
word+8.223
Neg logit contributions
allo-15.471
ctica-15.118
 accuser-14.725
omorphic-14.322
gio-14.273
Token Bur
Token ID5481
Feature activation-8.99e-01

Pos logit contributions
chery+9.786
 Nether+9.545
 Corn+8.268
 Banner+8.081
hens+7.976
Neg logit contributions
��-19.957
allo-19.852
 millenn-19.509
 accuser-19.269
ctica-19.246
Tokenien
Token ID2013
Feature activation-5.86e-01

Pos logit contributions
chery+5.159
hens+4.467
 Nether+4.351
iron+4.081
word+3.938
Neg logit contributions
-14.367
 accuser-14.336
ctica-13.795
ーテ-13.624
allo-13.611
Token?
Token ID30
Feature activation-7.71e-03

Pos logit contributions
chery+5.131
hens+4.651
 Nether+4.616
word+4.267
iron+4.265
Neg logit contributions
allo-5.060
 accuser-4.956
ctica-4.839
gio-4.767
omorphic-4.720
Token
Token ID198
Feature activation-2.95e-01

Pos logit contributions
chery+0.094
hens+0.086
 Nether+0.085
word+0.079
iron+0.077
Neg logit contributions
allo-0.086
 accuser-0.084
ctica-0.082
gio-0.080
omorphic-0.080
Token
Token ID198
Feature activation-9.16e-02

Pos logit contributions
chery+3.211
hens+2.904
 Nether+2.873
word+2.682
iron+2.581
Neg logit contributions
allo-3.615
 accuser-3.614
ctica-3.534
omorphic-3.495
bably-3.415
TokenJay
Token ID30568
Feature activation-3.24e-01

Pos logit contributions
chery+0.906
hens+0.812
 Nether+0.794
word+0.732
iron+0.711
Neg logit contributions
allo-1.230
 accuser-1.209
ctica-1.186
omorphic-1.163
gio-1.162
Token C
Token ID327
Feature activation-2.55e-01

Pos logit contributions
chery+2.718
 Nether+2.430
hens+2.368
word+2.070
 Corn+1.971
Neg logit contributions
allo-4.810
 accuser-4.676
ctica-4.647
omorphic-4.635
bably-4.569
Tokenz
Token ID89
Feature activation-1.93e+00

Pos logit contributions
chery+2.224
hens+1.999
 Nether+1.836
word+1.781
iron+1.722
Neg logit contributions
 accuser-3.703
allo-3.657
ctica-3.576
omorphic-3.500
gio-3.478
Tokenark
Token ID668
Feature activation-3.36e-01

Pos logit contributions
arks+12.111
chery+11.732
iron+10.638
word+10.504
ark+10.369
Neg logit contributions
 millenn-26.614
 proport-24.405
��-23.855
 therap-23.481
 destro-23.478
Tokenowski
Token ID12079
Feature activation+1.99e-01

Pos logit contributions
chery+3.406
hens+3.052
 Nether+2.898
word+2.846
iron+2.658
Neg logit contributions
 accuser-4.369
allo-4.315
ctica-4.187
omorphic-4.100
gio-4.093
Token:
Token ID25
Feature activation+1.54e-01

Pos logit contributions
allo+1.999
 accuser+1.948
ctica+1.878
gio+1.874
968+1.855
Neg logit contributions
chery-2.576
hens-2.378
 Nether-2.365
word-2.189
iron-2.161
Token I
Token ID314
Feature activation-3.31e-02

Pos logit contributions
allo+1.691
 accuser+1.636
ctica+1.588
gio+1.577
omorphic+1.559
Neg logit contributions
chery-1.895
hens-1.745
 Nether-1.701
word-1.608
iron-1.570
Token think
Token ID892
Feature activation-3.11e-01

Pos logit contributions
chery+0.364
hens+0.331
 Nether+0.326
word+0.300
iron+0.292
Neg logit contributions
allo-0.408
 accuser-0.400
ctica-0.391
gio-0.384
omorphic-0.383
Token?
Token ID30
Feature activation+2.40e-01

Pos logit contributions
allo+0.229
 accuser+0.224
ctica+0.219
gio+0.214
omorphic+0.214
Neg logit contributions
chery-0.243
hens-0.223
 Nether-0.221
word-0.204
iron-0.200
Token
Token ID198
Feature activation-1.11e-01

Pos logit contributions
allo+2.504
 accuser+2.406
ctica+2.351
gio+2.347
omorphic+2.303
Neg logit contributions
chery-3.042
hens-2.811
 Nether-2.733
word-2.588
iron-2.522
Token
Token ID198
Feature activation-7.32e-03

Pos logit contributions
chery+1.307
hens+1.193
 Nether+1.179
word+1.097
iron+1.066
Neg logit contributions
allo-1.284
 accuser-1.265
ctica-1.237
omorphic-1.213
gio-1.198
TokenJay
Token ID30568
Feature activation-3.36e-01

Pos logit contributions
chery+0.074
hens+0.067
 Nether+0.066
word+0.060
iron+0.059
Neg logit contributions
allo-0.096
 accuser-0.094
ctica-0.093
gio-0.091
omorphic-0.091
Token C
Token ID327
Feature activation-2.32e-01

Pos logit contributions
chery+2.784
 Nether+2.507
hens+2.434
word+2.128
 Corn+2.032
Neg logit contributions
allo-5.010
 accuser-4.852
ctica-4.837
omorphic-4.821
bably-4.764
Tokenz
Token ID89
Feature activation-1.92e+00

Pos logit contributions
chery+2.046
hens+1.845
 Nether+1.701
word+1.642
iron+1.585
Neg logit contributions
 accuser-3.348
allo-3.316
ctica-3.241
omorphic-3.171
gio-3.154
Tokenark
Token ID668
Feature activation-3.27e-01

Pos logit contributions
arks+11.999
chery+11.516
iron+10.486
word+10.315
ark+10.231
Neg logit contributions
 millenn-26.716
 proport-24.583
��-24.055
 therap-23.866
 destro-23.768
Tokenowski
Token ID12079
Feature activation+3.06e-01

Pos logit contributions
chery+3.317
hens+2.976
 Nether+2.832
word+2.772
iron+2.587
Neg logit contributions
 accuser-4.252
allo-4.196
ctica-4.070
omorphic-3.986
gio-3.983
Token:
Token ID25
Feature activation+5.34e-01

Pos logit contributions
allo+2.929
 accuser+2.852
gio+2.751
968+2.726
ctica+2.723
Neg logit contributions
chery-4.056
hens-3.744
 Nether-3.734
word-3.452
iron-3.420
Token You
Token ID921
Feature activation-1.31e-01

Pos logit contributions
allo+5.407
 accuser+5.061
gio+5.054
 Entered+4.983
lake+4.893
Neg logit contributions
chery-6.703
hens-6.189
 Nether-5.816
word-5.703
iron-5.539
Token know
Token ID760
Feature activation+3.47e-01

Pos logit contributions
chery+1.349
hens+1.209
 Nether+1.189
word+1.097
iron+1.058
Neg logit contributions
allo-1.700
 accuser-1.672
ctica-1.633
gio-1.608
omorphic-1.599
Token?
Token ID30
Feature activation+4.28e-01

Pos logit contributions
allo+1.473
 accuser+1.437
ctica+1.387
gio+1.369
omorphic+1.357
Neg logit contributions
chery-1.739
hens-1.613
 Nether-1.597
word-1.482
iron-1.455
Token
Token ID198
Feature activation-2.67e-01

Pos logit contributions
allo+4.152
 accuser+3.956
gio+3.891
ctica+3.858
 Entered+3.826
Neg logit contributions
chery-5.564
hens-5.192
 Nether-4.994
word-4.793
iron-4.677
Token
Token ID198
Feature activation-1.10e-01

Pos logit contributions
chery+2.934
hens+2.654
 Nether+2.623
word+2.450
iron+2.356
Neg logit contributions
allo-3.246
 accuser-3.239
ctica-3.170
omorphic-3.126
bably-3.056
TokenJay
Token ID30568
Feature activation-3.35e-01

Pos logit contributions
chery+1.077
hens+0.966
 Nether+0.942
word+0.871
iron+0.846
Neg logit contributions
allo-1.475
 accuser-1.450
ctica-1.423
omorphic-1.395
gio-1.395
Token C
Token ID327
Feature activation-2.52e-01

Pos logit contributions
chery+2.795
 Nether+2.506
hens+2.431
word+2.126
 Corn+2.032
Neg logit contributions
allo-4.992
 accuser-4.841
ctica-4.825
omorphic-4.801
bably-4.745
Tokenz
Token ID89
Feature activation-1.92e+00

Pos logit contributions
chery+2.204
hens+1.984
 Nether+1.818
word+1.765
iron+1.706
Neg logit contributions
 accuser-3.651
allo-3.607
ctica-3.529
omorphic-3.450
gio-3.429
Tokenark
Token ID668
Feature activation-3.78e-01

Pos logit contributions
arks+12.098
chery+11.789
iron+10.646
word+10.517
ark+10.354
Neg logit contributions
 millenn-26.443
 proport-24.246
��-23.736
 destro-23.434
 therap-23.421
Tokenowski
Token ID12079
Feature activation+2.44e-01

Pos logit contributions
chery+3.778
hens+3.372
 Nether+3.188
word+3.155
iron+2.934
Neg logit contributions
 accuser-4.962
allo-4.877
ctica-4.740
omorphic-4.637
gio-4.631
Token:
Token ID25
Feature activation+3.50e-01

Pos logit contributions
allo+2.410
 accuser+2.350
gio+2.263
ctica+2.253
968+2.241
Neg logit contributions
chery-3.200
hens-2.955
 Nether-2.940
word-2.723
iron-2.691
Token And
Token ID843
Feature activation-7.63e-02

Pos logit contributions
allo+3.690
 accuser+3.560
gio+3.454
968+3.403
ctica+3.400
Neg logit contributions
chery-4.348
hens-4.011
 Nether-3.842
word-3.716
iron-3.618
Token you
Token ID345
Feature activation-7.48e-02

Pos logit contributions
chery+0.889
hens+0.809
 Nether+0.803
word+0.739
iron+0.722
Neg logit contributions
allo-0.891
 accuser-0.869
ctica-0.859
gio-0.835
omorphic-0.834
TokenBig
Token ID12804
Feature activation+2.19e-01

Pos logit contributions
chery+1.246
hens+1.150
 Nether+1.137
word+1.071
iron+1.047
Neg logit contributions
allo-0.972
 accuser-0.948
ctica-0.930
omorphic-0.908
gio-0.900
Tokengest
Token ID3495
Feature activation+7.62e-02

Pos logit contributions
allo+2.659
 accuser+2.581
gio+2.520
ctica+2.505
omorphic+2.449
Neg logit contributions
chery-2.428
hens-2.196
 Nether-2.176
word-1.988
iron-1.950
Token G
Token ID402
Feature activation+1.79e-02

Pos logit contributions
allo+0.917
 accuser+0.900
ctica+0.876
gio+0.862
omorphic+0.858
Neg logit contributions
chery-0.857
hens-0.780
 Nether-0.769
word-0.709
iron-0.694
Tokenainer
Token ID10613
Feature activation-6.37e-01

Pos logit contributions
allo+0.138
 accuser+0.133
ctica+0.129
gio+0.125
omorphic+0.124
Neg logit contributions
chery-0.279
hens-0.261
 Nether-0.259
word-0.244
iron-0.240
Token 2008
Token ID3648
Feature activation-2.47e-01

Pos logit contributions
chery+6.904
 Nether+6.245
hens+6.206
word+5.758
iron+5.455
Neg logit contributions
allo-7.455
ctica-7.095
 accuser-7.035
omorphic-7.005
bably-6.969
Token01
Token ID486
Feature activation-1.86e+00

Pos logit contributions
chery+2.264
hens+2.052
 Nether+2.004
word+1.833
iron+1.736
Neg logit contributions
allo-3.449
 accuser-3.434
ctica-3.351
omorphic-3.335
bably-3.243
Token08
Token ID2919
Feature activation-4.84e-01

Pos logit contributions
 Nether+8.216
hens+8.077
chery+7.792
08+7.119
07+6.569
Neg logit contributions
omorphic-23.997
ctica-23.625
allo-23.238
bably-23.125
gio-22.679
Token O
Token ID440
Feature activation-4.01e-01

Pos logit contributions
chery+6.029
 Nether+5.635
hens+5.562
word+5.247
 BT+5.045
Neg logit contributions
allo-4.837
 accuser-4.771
ctica-4.731
omorphic-4.696
gio-4.552
Token&
Token ID5
Feature activation+3.15e-02

Pos logit contributions
chery+2.679
hens+2.197
 Nether+2.088
word+2.012
iron+1.850
Neg logit contributions
 accuser-6.504
allo-6.501
omorphic-6.368
ctica-6.354
bably-6.279
TokenA
Token ID32
Feature activation-5.55e-01

Pos logit contributions
allo+0.220
 accuser+0.211
ctica+0.203
gio+0.196
omorphic+0.194
Neg logit contributions
chery-0.513
hens-0.482
 Nether-0.478
word-0.452
iron-0.446
Token-
Token ID12
Feature activation-1.35e-01

Pos logit contributions
chery+5.376
 Nether+4.932
hens+4.902
word+4.521
iron+4.282
Neg logit contributions
 accuser-7.129
bably-7.027
allo-7.023
omorphic-6.977
ctica-6.904
Tokenbit
Token ID2545
Feature activation-8.97e-01

Pos logit contributions
chery+8.922
word+8.765
bit+8.725
hens+8.023
iron+7.112
Neg logit contributions
 accuser-22.886
 millenn-22.203
ctica-22.104
-22.034
��-21.777
Token 256
Token ID17759
Feature activation-9.33e-01

Pos logit contributions
chery+8.027
 Nether+7.868
hens+7.446
word+7.280
 BT+7.002
Neg logit contributions
 accuser-10.939
allo-10.736
ctica-10.705
-10.556
omorphic-10.407
Token-
Token ID12
Feature activation-1.71e+00

Pos logit contributions
chery+7.415
 Nether+7.194
hens+6.747
word+6.583
 BT+6.511
Neg logit contributions
allo-12.308
ctica-12.300
 accuser-12.222
-11.967
bably-11.772
Tokenbit
Token ID2545
Feature activation-9.73e-01

Pos logit contributions
word+9.378
chery+9.323
bit+9.162
hens+8.496
hen+7.525
Neg logit contributions
 millenn-23.119
 accuser-22.960
-22.842
 proport-22.535
 seiz-22.531
Token 192
Token ID17817
Feature activation-8.12e-01

Pos logit contributions
chery+8.624
 Nether+8.510
hens+7.991
word+7.786
 BT+7.584
Neg logit contributions
 accuser-11.801
allo-11.590
ctica-11.517
-11.321
omorphic-11.165
Token-
Token ID12
Feature activation-1.84e+00

Pos logit contributions
chery+6.809
 Nether+6.632
hens+6.229
word+6.086
 BT+5.935
Neg logit contributions
allo-10.855
ctica-10.785
 accuser-10.656
-10.387
bably-10.243
Tokenbit
Token ID2545
Feature activation-1.13e+00

Pos logit contributions
bit+9.536
word+9.407
chery+8.853
hens+8.082
hen+7.502
Neg logit contributions
 millenn-25.662
 proport-24.979
 seiz-24.794
-24.710
 accuser-24.505
Token 256
Token ID17759
Feature activation-8.20e-01

Pos logit contributions
 Nether+9.578
chery+9.364
hens+8.564
 BT+8.563
word+8.232
Neg logit contributions
 accuser-13.860
allo-13.602
ctica-13.410
-13.164
omorphic-12.993
Token-
Token ID12
Feature activation-1.78e+00

Pos logit contributions
chery+6.782
 Nether+6.587
hens+6.108
word+5.959
 BT+5.957
Neg logit contributions
allo-11.002
ctica-10.952
 accuser-10.811
-10.658
omorphic-10.585
Tokenbit
Token ID2545
Feature activation-8.58e-01

Pos logit contributions
word+10.277
bit+9.992
chery+9.841
hens+8.668
hen+8.226
Neg logit contributions
 millenn-24.719
 proport-23.889
��-23.837
 seiz-23.709
-23.375
Token 256
Token ID17759
Feature activation-7.80e-01

Pos logit contributions
chery+7.913
 Nether+7.800
hens+7.161
 BT+6.890
word+6.881
Neg logit contributions
 accuser-10.724
allo-10.616
ctica-10.400
-10.145
omorphic-10.142
Token-
Token ID12
Feature activation-1.95e-01

Pos logit contributions
chery+9.210
 Nether+8.283
hens+8.202
word+7.923
 BT+7.314
Neg logit contributions
allo-11.721
 accuser-11.391
omorphic-11.381
bably-11.250
gio-11.130
TokenRet
Token ID9781
Feature activation-2.37e-01

Pos logit contributions
chery+2.581
hens+2.384
 Nether+2.355
word+2.237
iron+2.186
Neg logit contributions
allo-1.953
 accuser-1.912
ctica-1.873
omorphic-1.830
gio-1.800
Tokenarded
Token ID10676
Feature activation-4.44e-01

Pos logit contributions
chery+2.660
hens+2.417
 Nether+2.368
word+2.197
iron+2.173
Neg logit contributions
allo-2.835
 accuser-2.802
ctica-2.736
omorphic-2.669
968-2.663
Token Cook
Token ID8261
Feature activation-6.57e-01

Pos logit contributions
chery+5.166
 Nether+4.793
hens+4.751
word+4.378
iron+4.234
Neg logit contributions
allo-4.914
ctica-4.745
 accuser-4.733
omorphic-4.657
gio-4.623
Token 200
Token ID939
Feature activation-5.23e-01

Pos logit contributions
chery+7.552
hens+7.096
 Nether+6.980
word+6.547
iron+6.397
Neg logit contributions
allo-7.040
ctica-6.764
omorphic-6.709
 accuser-6.705
gio-6.606
Token80
Token ID1795
Feature activation-1.81e+00

Pos logit contributions
chery+4.881
 Nether+4.432
hens+4.396
word+4.131
iron+3.865
Neg logit contributions
allo-6.896
 accuser-6.864
ctica-6.697
omorphic-6.574
gio-6.561
Token202
Token ID19004
Feature activation-4.40e-01

Pos logit contributions
chery+12.208
 Nether+11.155
hens+10.118
iron+9.440
word+9.101
Neg logit contributions
 accuser-20.280
allo-19.393
gio-19.079
ctica-18.920
bably-18.810
Token O
Token ID440
Feature activation-4.37e-01

Pos logit contributions
chery+5.315
 Nether+5.128
hens+4.903
word+4.659
 BT+4.551
Neg logit contributions
ctica-4.579
allo-4.561
 accuser-4.543
omorphic-4.505
bably-4.396
Token&
Token ID5
Feature activation+3.25e-01

Pos logit contributions
chery+2.855
hens+2.342
 Nether+2.211
word+2.175
iron+1.981
Neg logit contributions
 accuser-7.188
allo-7.114
ctica-6.998
omorphic-6.977
bably-6.892
TokenA
Token ID32
Feature activation-4.80e-01

Pos logit contributions
allo+1.840
 accuser+1.681
968+1.625
gio+1.588
ctica+1.579
Neg logit contributions
chery-5.638
hens-5.371
 Nether-5.304
word-4.943
iron-4.932
Token-
Token ID12
Feature activation-1.79e-01

Pos logit contributions
chery+4.789
 Nether+4.354
hens+4.343
word+4.038
iron+3.830
Neg logit contributions
 accuser-6.131
allo-6.064
omorphic-5.961
bably-5.958
ctica-5.946
Tokenbit
Token ID2545
Feature activation-9.73e-01

Pos logit contributions
word+9.378
chery+9.323
bit+9.162
hens+8.496
hen+7.525
Neg logit contributions
 millenn-23.119
 accuser-22.960
-22.842
 proport-22.535
 seiz-22.531
Token 192
Token ID17817
Feature activation-8.12e-01

Pos logit contributions
chery+8.624
 Nether+8.510
hens+7.991
word+7.786
 BT+7.584
Neg logit contributions
 accuser-11.801
allo-11.590
ctica-11.517
-11.321
omorphic-11.165
Token-
Token ID12
Feature activation-1.84e+00

Pos logit contributions
chery+6.809
 Nether+6.632
hens+6.229
word+6.086
 BT+5.935
Neg logit contributions
allo-10.855
ctica-10.785
 accuser-10.656
-10.387
bably-10.243
Tokenbit
Token ID2545
Feature activation-1.13e+00

Pos logit contributions
bit+9.536
word+9.407
chery+8.853
hens+8.082
hen+7.502
Neg logit contributions
 millenn-25.662
 proport-24.979
 seiz-24.794
-24.710
 accuser-24.505
Token 256
Token ID17759
Feature activation-8.20e-01

Pos logit contributions
 Nether+9.578
chery+9.364
hens+8.564
 BT+8.563
word+8.232
Neg logit contributions
 accuser-13.860
allo-13.602
ctica-13.410
-13.164
omorphic-12.993
Token-
Token ID12
Feature activation-1.78e+00

Pos logit contributions
chery+6.782
 Nether+6.587
hens+6.108
word+5.959
 BT+5.957
Neg logit contributions
allo-11.002
ctica-10.952
 accuser-10.811
-10.658
omorphic-10.585
Tokenbit
Token ID2545
Feature activation-8.58e-01

Pos logit contributions
word+10.277
bit+9.992
chery+9.841
hens+8.668
hen+8.226
Neg logit contributions
 millenn-24.719
 proport-23.889
��-23.837
 seiz-23.709
-23.375
Token 256
Token ID17759
Feature activation-7.80e-01

Pos logit contributions
chery+7.913
 Nether+7.800
hens+7.161
 BT+6.890
word+6.881
Neg logit contributions
 accuser-10.724
allo-10.616
ctica-10.400
-10.145
omorphic-10.142
Token-
Token ID12
Feature activation-1.67e+00

Pos logit contributions
chery+6.603
 Nether+6.394
hens+5.978
 BT+5.805
word+5.803
Neg logit contributions
allo-10.486
ctica-10.377
 accuser-10.248
omorphic-10.072
-10.046
Tokenbit
Token ID2545
Feature activation-7.33e-01

Pos logit contributions
word+9.958
chery+9.577
bit+9.420
hens+8.358
hen+7.861
Neg logit contributions
 millenn-23.655
��-22.980
 proport-22.867
 seiz-22.632
 therap-22.396
Token R
Token ID371
Feature activation-7.03e-02

Pos logit contributions
chery+7.022
 Nether+6.793
hens+6.377
word+6.053
 BT+5.986
Neg logit contributions
 accuser-9.182
allo-9.151
ctica-8.969
omorphic-8.759
-8.681
Token on
Token ID319
Feature activation-3.41e-01

Pos logit contributions
allo+0.870
 accuser+0.849
ctica+0.825
gio+0.806
omorphic+0.802
Neg logit contributions
chery-1.061
hens-0.976
 Nether-0.969
word-0.901
iron-0.880
Token the
Token ID262
Feature activation-6.15e-01

Pos logit contributions
chery+3.681
hens+3.329
 Nether+3.320
word+3.005
iron+2.968
Neg logit contributions
allo-4.170
ctica-4.035
 accuser-4.024
gio-3.937
omorphic-3.927
Token road
Token ID2975
Feature activation-1.59e-01

Pos logit contributions
chery+6.934
 Nether+6.314
hens+6.233
iron+5.755
 Corn+5.695
Neg logit contributions
allo-6.839
ctica-6.780
omorphic-6.609
968-6.573
gio-6.490
Token,
Token ID11
Feature activation-5.46e-02

Pos logit contributions
chery+1.861
hens+1.696
 Nether+1.667
word+1.557
iron+1.532
Neg logit contributions
allo-1.844
 accuser-1.796
ctica-1.764
gio-1.737
omorphic-1.728
Token 38
Token ID4353
Feature activation-7.00e-01

Pos logit contributions
chery+0.672
hens+0.617
 Nether+0.612
word+0.567
iron+0.557
Neg logit contributions
allo-0.601
 accuser-0.585
ctica-0.574
gio-0.561
omorphic-0.561
Token-
Token ID12
Feature activation-1.75e+00

Pos logit contributions
chery+6.285
hens+5.889
 Nether+5.715
word+5.419
iron+5.296
Neg logit contributions
allo-8.977
 accuser-8.775
ctica-8.728
gio-8.684
omorphic-8.404
Token19
Token ID1129
Feature activation-2.87e-01

Pos logit contributions
chery+9.865
word+9.558
iron+9.536
hens+8.957
hen+8.373
Neg logit contributions
��-24.316
 millenn-23.601
 satell-22.593
 proport-22.435
ctica-22.290
Token.
Token ID13
Feature activation+3.02e-01

Pos logit contributions
chery+3.192
hens+2.920
 Nether+2.877
word+2.669
iron+2.630
Neg logit contributions
allo-3.398
 accuser-3.322
ctica-3.286
omorphic-3.251
gio-3.197
Token
Token ID198
Feature activation+1.30e-02

Pos logit contributions
allo+3.265
 accuser+3.175
ctica+3.119
gio+3.064
968+3.034
Neg logit contributions
chery-3.713
hens-3.375
 Nether-3.301
word-3.104
iron-3.020
Token
Token ID198
Feature activation+5.46e-02

Pos logit contributions
allo+0.143
 accuser+0.140
ctica+0.136
gio+0.134
omorphic+0.133
Neg logit contributions
chery-0.160
hens-0.147
 Nether-0.146
word-0.135
iron-0.132
TokenT
Token ID51
Feature activation-4.87e-01

Pos logit contributions
allo+0.565
 accuser+0.549
ctica+0.536
gio+0.525
omorphic+0.521
Neg logit contributions
chery-0.707
hens-0.653
 Nether-0.647
word-0.602
iron-0.589
Token
Token ID247
Feature activation+6.47e-01

Pos logit contributions
chery+0.847
hens+0.790
 Nether+0.782
word+0.739
iron+0.724
Neg logit contributions
allo-0.494
 accuser-0.484
ctica-0.470
omorphic-0.456
gio-0.455
Tokens
Token ID82
Feature activation-2.45e-01

Pos logit contributions
allo+4.606
 accuser+4.095
dri+4.093
968+4.056
lake+3.980
Neg logit contributions
chery-9.657
 Nether-9.297
hens-8.998
iron-8.438
word-8.370
Token state
Token ID1181
Feature activation+3.43e-01

Pos logit contributions
chery+3.431
hens+3.158
 Nether+3.134
word+2.958
iron+2.847
Neg logit contributions
allo-2.257
 accuser-2.183
ctica-2.134
gio-2.103
omorphic-2.061
Token (
Token ID357
Feature activation-1.33e+00

Pos logit contributions
allo+3.515
 accuser+3.402
ctica+3.285
968+3.284
gio+3.245
Neg logit contributions
chery-4.306
hens-4.004
 Nether-3.974
iron-3.671
word-3.651
Tokensuch
Token ID10508
Feature activation-9.78e-01

Pos logit contributions
chery+7.744
hens+6.825
iron+6.804
word+6.700
hen+6.601
Neg logit contributions
 millenn-20.271
 accuser-19.918
ctica-19.238
 proport-19.094
olics-19.019
Token as
Token ID355
Feature activation-1.75e+00

Pos logit contributions
chery+7.024
word+5.721
hens+5.590
 Nether+5.478
iron+4.895
Neg logit contributions
 millenn-14.451
allo-14.110
��-13.975
 accuser-13.715
itialized-13.679
Token through
Token ID832
Feature activation-7.52e-01

Pos logit contributions
chery+8.623
 Nether+6.670
word+5.948
hens+5.772
 Corn+4.833
Neg logit contributions
allo-23.583
ctica-23.435
bably-23.374
algia-23.288
-23.012
Token a
Token ID257
Feature activation-1.37e+00

Pos logit contributions
chery+5.988
 Nether+5.388
hens+5.210
word+4.778
 BT+4.465
Neg logit contributions
allo-10.701
 accuser-10.393
ctica-10.351
omorphic-10.171
bably-9.983
Token or
Token ID393
Feature activation-5.78e-01

Pos logit contributions
chery+11.497
 Nether+10.722
hens+9.803
word+9.451
 BT+8.925
Neg logit contributions
allo-15.456
ctica-15.136
 accuser-14.969
omorphic-14.485
bably-14.304
Token )
Token ID1267
Feature activation+3.82e-01

Pos logit contributions
chery+6.084
 Nether+5.410
hens+5.302
word+4.893
 Worm+4.571
Neg logit contributions
allo-7.095
 accuser-7.020
ctica-7.001
omorphic-6.674
gio-6.647
Token must
Token ID1276
Feature activation+2.78e-01

Pos logit contributions
allo+4.301
 accuser+4.146
ctica+4.056
lake+4.019
968+3.992
Neg logit contributions
chery-4.374
hens-4.049
 Nether-4.028
word-3.727
argon-3.638
Token edges
Token ID13015
Feature activation-2.26e-01

Pos logit contributions
hens+4.733
chery+4.692
 Nether+4.209
word+3.695
iron+3.584
Neg logit contributions
allo-9.566
 accuser-9.347
ctica-9.140
gio-9.083
lake-9.066
Token that
Token ID326
Feature activation+4.61e-03

Pos logit contributions
chery+2.402
hens+2.156
 Nether+2.119
word+1.988
iron+1.890
Neg logit contributions
allo-2.881
 accuser-2.804
ctica-2.784
gio-2.695
omorphic-2.687
Token you
Token ID345
Feature activation-9.26e-01

Pos logit contributions
allo+0.058
 accuser+0.057
ctica+0.056
gio+0.055
omorphic+0.054
Neg logit contributions
chery-0.050
hens-0.045
 Nether-0.044
word-0.041
iron-0.040
Token have
Token ID423
Feature activation-8.61e-02

Pos logit contributions
chery+7.082
hens+6.310
 Nether+5.941
iron+5.520
word+5.462
Neg logit contributions
 millenn-12.789
allo-12.776
 accuser-12.521
ctica-12.518
-12.431
Token to
Token ID284
Feature activation-1.12e+00

Pos logit contributions
chery+0.996
hens+0.907
 Nether+0.894
word+0.831
iron+0.810
Neg logit contributions
allo-1.013
 accuser-0.992
ctica-0.972
omorphic-0.948
gio-0.948
Token be
Token ID307
Feature activation-1.74e+00

Pos logit contributions
chery+6.939
 Nether+5.256
word+5.081
hens+5.072
iron+4.676
Neg logit contributions
 millenn-16.498
��-16.399
allo-16.309
ctica-16.047
-16.009
Token aware
Token ID3910
Feature activation-3.39e-01

Pos logit contributions
chery+7.348
 Nether+6.296
hens+5.251
iron+4.862
hen+4.052
Neg logit contributions
allo-25.186
gio-24.933
olics-24.582
-24.502
dri-24.401
Token of
Token ID286
Feature activation-2.74e-01

Pos logit contributions
chery+3.723
hens+3.335
 Nether+3.301
word+3.099
iron+2.936
Neg logit contributions
allo-4.135
 accuser-4.085
ctica-4.025
omorphic-3.925
bably-3.889
Token.
Token ID13
Feature activation-4.34e-01

Pos logit contributions
chery+3.009
hens+2.740
 Nether+2.695
word+2.495
iron+2.448
Neg logit contributions
allo-3.333
 accuser-3.279
ctica-3.221
omorphic-3.126
gio-3.119
Token In
Token ID554
Feature activation-9.62e-01

Pos logit contributions
chery+4.744
 Nether+4.500
hens+4.378
word+4.004
iron+3.988
Neg logit contributions
allo-5.012
 accuser-4.991
ctica-4.864
omorphic-4.737
gio-4.700
Token this
Token ID428
Feature activation-9.51e-02

Pos logit contributions
chery+6.255
 Nether+5.639
hens+5.569
word+4.976
iron+4.402
Neg logit contributions
allo-14.172
ctica-14.169
 accuser-14.038
-14.032
bably-13.584
TokenI
Token ID40
Feature activation+5.58e-02

Pos logit contributions
allo+1.520
 accuser+1.460
ctica+1.423
gio+1.401
968+1.384
Neg logit contributions
chery-2.148
hens-1.990
 Nether-1.983
word-1.836
iron-1.799
Token am
Token ID716
Feature activation-4.02e-02

Pos logit contributions
allo+0.683
 accuser+0.670
ctica+0.655
gio+0.643
omorphic+0.641
Neg logit contributions
chery-0.614
hens-0.559
 Nether-0.553
word-0.507
iron-0.496
Token not
Token ID407
Feature activation+3.44e-01

Pos logit contributions
chery+0.465
hens+0.426
 Nether+0.421
word+0.390
iron+0.380
Neg logit contributions
allo-0.472
 accuser-0.459
ctica-0.451
gio-0.442
omorphic-0.439
Token afraid
Token ID7787
Feature activation-1.42e-01

Pos logit contributions
 accuser+3.629
allo+3.572
ctica+3.439
omorphic+3.384
gio+3.353
Neg logit contributions
chery-4.284
hens-3.927
 Nether-3.875
iron-3.543
word-3.524
Token to
Token ID284
Feature activation-5.20e-01

Pos logit contributions
chery+1.660
hens+1.522
 Nether+1.494
word+1.397
iron+1.355
Neg logit contributions
allo-1.656
 accuser-1.605
ctica-1.580
gio-1.552
omorphic-1.541
Token speak
Token ID2740
Feature activation-1.74e+00

Pos logit contributions
chery+5.532
 Nether+5.114
hens+5.016
word+4.795
iron+4.488
Neg logit contributions
allo-6.175
ctica-5.965
bably-5.898
 accuser-5.851
gio-5.835
Token frankly
Token ID17813
Feature activation-9.37e-01

Pos logit contributions
chery+12.101
 Nether+11.235
word+11.165
 up+11.020
hens+10.225
Neg logit contributions
itialized-19.434
-19.041
gio-18.911
bably-18.730
allo-18.500
Token to
Token ID284
Feature activation-4.32e-01

Pos logit contributions
chery+7.819
hens+6.807
 Nether+6.579
word+6.410
iron+5.961
Neg logit contributions
allo-12.223
gio-12.029
bably-11.812
itialized-11.781
ctica-11.725
Token a
Token ID257
Feature activation-3.63e-01

Pos logit contributions
chery+4.494
hens+4.214
 Nether+4.197
word+3.749
 BT+3.617
Neg logit contributions
allo-5.263
ctica-5.068
gio-4.995
 accuser-4.995
omorphic-4.936
Token president
Token ID1893
Feature activation-1.01e-01

Pos logit contributions
chery+4.317
 Nether+4.043
hens+3.968
word+3.697
iron+3.531
Neg logit contributions
allo-3.969
ctica-3.807
gio-3.710
 accuser-3.694
omorphic-3.667
Token of
Token ID286
Feature activation-2.79e-02

Pos logit contributions
chery+1.214
hens+1.114
 Nether+1.102
word+1.027
iron+0.998
Neg logit contributions
allo-1.128
 accuser-1.093
ctica-1.076
gio-1.058
omorphic-1.051
Token administration
Token ID3662
Feature activation-2.99e-01

Pos logit contributions
allo+3.471
 accuser+3.405
ctica+3.252
gio+3.170
dri+3.146
Neg logit contributions
chery-3.867
hens-3.494
 Nether-3.436
word-3.220
iron-3.169
Token battles
Token ID10181
Feature activation-2.98e-01

Pos logit contributions
chery+3.820
hens+3.565
 Nether+3.477
word+3.274
iron+3.190
Neg logit contributions
allo-3.061
 accuser-2.971
ctica-2.929
omorphic-2.874
gio-2.840
Token to
Token ID284
Feature activation-8.49e-02

Pos logit contributions
chery+3.373
hens+3.086
 Nether+3.063
word+2.800
iron+2.701
Neg logit contributions
allo-3.481
 accuser-3.389
ctica-3.378
gio-3.332
omorphic-3.331
Token bring
Token ID2222
Feature activation-7.33e-01

Pos logit contributions
chery+1.069
hens+0.984
 Nether+0.976
word+0.909
iron+0.888
Neg logit contributions
allo-0.904
 accuser-0.884
ctica-0.866
gio-0.848
omorphic-0.844
Token the
Token ID262
Feature activation-1.13e-01

Pos logit contributions
chery+7.450
hens+7.115
 Nether+6.906
word+5.971
 BT+5.878
Neg logit contributions
allo-8.494
ctica-8.346
omorphic-8.297
gio-8.151
 accuser-8.038
Token deficit
Token ID11807
Feature activation-1.73e+00

Pos logit contributions
chery+1.361
hens+1.270
 Nether+1.247
word+1.152
iron+1.119
Neg logit contributions
allo-1.258
 accuser-1.212
ctica-1.197
omorphic-1.176
gio-1.172
Token down
Token ID866
Feature activation-1.79e-01

Pos logit contributions
chery+12.953
hens+11.460
 Nether+10.948
 up+10.503
 BT+10.342
Neg logit contributions
ctica-18.046
allo-17.860
itialized-17.632
omorphic-17.496
dri-17.483
Token to
Token ID284
Feature activation-8.99e-01

Pos logit contributions
chery+2.050
hens+1.896
 Nether+1.847
word+1.725
iron+1.691
Neg logit contributions
allo-2.081
 accuser-2.050
ctica-2.004
omorphic-1.953
gio-1.953
Token 6
Token ID718
Feature activation-2.88e-01

Pos logit contributions
chery+9.152
hens+8.812
 Nether+8.399
word+7.460
 BT+7.340
Neg logit contributions
ctica-9.671
omorphic-9.657
allo-9.524
 accuser-9.507
-9.227
Token.
Token ID13
Feature activation-2.15e-01

Pos logit contributions
chery+3.135
hens+2.917
 Nether+2.830
word+2.613
iron+2.579
Neg logit contributions
allo-3.467
 accuser-3.445
ctica-3.375
omorphic-3.308
gio-3.268
Token3
Token ID18
Feature activation-3.90e-01

Pos logit contributions
chery+1.877
hens+1.674
 Nether+1.658
word+1.493
iron+1.435
Neg logit contributions
allo-3.083
 accuser-3.066
ctica-3.004
omorphic-2.968
gio-2.934
Tokenbit
Token ID2545
Feature activation-7.57e-01

Pos logit contributions
chery+7.934
word+7.264
bit+6.513
hens+6.300
 Nether+6.202
Neg logit contributions
 accuser-20.670
ctica-19.406
allo-18.999
bably-18.960
olics-18.773
Token 128
Token ID13108
Feature activation-1.22e+00

Pos logit contributions
chery+6.106
 Nether+5.862
hens+5.573
word+5.370
 BT+5.250
Neg logit contributions
 accuser-10.291
allo-10.259
ctica-10.067
-9.897
omorphic-9.844
Token-
Token ID12
Feature activation-1.68e+00

Pos logit contributions
chery+9.300
 Nether+8.980
hens+8.605
word+8.567
 BT+8.526
Neg logit contributions
 accuser-15.069
allo-15.063
ctica-14.470
gio-13.916
olics-13.913
Tokenbit
Token ID2545
Feature activation-8.97e-01

Pos logit contributions
chery+8.922
word+8.765
bit+8.725
hens+8.023
iron+7.112
Neg logit contributions
 accuser-22.886
 millenn-22.203
ctica-22.104
-22.034
��-21.777
Token 256
Token ID17759
Feature activation-9.33e-01

Pos logit contributions
chery+8.027
 Nether+7.868
hens+7.446
word+7.280
 BT+7.002
Neg logit contributions
 accuser-10.939
allo-10.736
ctica-10.705
-10.556
omorphic-10.407
Token-
Token ID12
Feature activation-1.71e+00

Pos logit contributions
chery+7.415
 Nether+7.194
hens+6.747
word+6.583
 BT+6.511
Neg logit contributions
allo-12.308
ctica-12.300
 accuser-12.222
-11.967
bably-11.772
Tokenbit
Token ID2545
Feature activation-9.73e-01

Pos logit contributions
word+9.378
chery+9.323
bit+9.162
hens+8.496
hen+7.525
Neg logit contributions
 millenn-23.119
 accuser-22.960
-22.842
 proport-22.535
 seiz-22.531
Token 192
Token ID17817
Feature activation-8.12e-01

Pos logit contributions
chery+8.624
 Nether+8.510
hens+7.991
word+7.786
 BT+7.584
Neg logit contributions
 accuser-11.801
allo-11.590
ctica-11.517
-11.321
omorphic-11.165
Token-
Token ID12
Feature activation-1.84e+00

Pos logit contributions
chery+6.809
 Nether+6.632
hens+6.229
word+6.086
 BT+5.935
Neg logit contributions
allo-10.855
ctica-10.785
 accuser-10.656
-10.387
bably-10.243
Tokenbit
Token ID2545
Feature activation-1.13e+00

Pos logit contributions
bit+9.536
word+9.407
chery+8.853
hens+8.082
hen+7.502
Neg logit contributions
 millenn-25.662
 proport-24.979
 seiz-24.794
-24.710
 accuser-24.505
Token 256
Token ID17759
Feature activation-8.20e-01

Pos logit contributions
 Nether+9.578
chery+9.364
hens+8.564
 BT+8.563
word+8.232
Neg logit contributions
 accuser-13.860
allo-13.602
ctica-13.410
-13.164
omorphic-12.993
Token on
Token ID319
Feature activation+2.18e-01

Pos logit contributions
allo+0.906
 accuser+0.882
ctica+0.861
omorphic+0.843
gio+0.842
Neg logit contributions
chery-0.946
hens-0.868
 Nether-0.858
word-0.794
iron-0.777
Token a
Token ID257
Feature activation-9.06e-03

Pos logit contributions
allo+2.414
 accuser+2.348
ctica+2.281
omorphic+2.238
gio+2.230
Neg logit contributions
chery-2.650
hens-2.430
 Nether-2.385
word-2.226
iron-2.187
Token song
Token ID3496
Feature activation+4.73e-01

Pos logit contributions
chery+0.113
hens+0.104
 Nether+0.103
word+0.095
iron+0.093
Neg logit contributions
allo-0.098
 accuser-0.096
ctica-0.094
gio-0.092
omorphic-0.091
Token-
Token ID12
Feature activation+9.42e-02

Pos logit contributions
allo+5.031
 accuser+4.900
ctica+4.825
omorphic+4.719
gio+4.658
Neg logit contributions
chery-5.555
hens-5.273
 Nether-5.235
word-4.756
iron-4.686
Tokenby
Token ID1525
Feature activation-1.08e+00

Pos logit contributions
allo+1.215
 accuser+1.183
ctica+1.161
gio+1.143
omorphic+1.139
Neg logit contributions
chery-0.974
hens-0.884
 Nether-0.879
word-0.792
iron-0.772
Token-
Token ID12
Feature activation-1.71e+00

Pos logit contributions
chery+8.478
word+7.221
hens+7.066
 Nether+6.821
 BT+6.169
Neg logit contributions
allo-14.226
��-14.119
 accuser-13.877
ctica-13.866
-13.795
Tokensong
Token ID34050
Feature activation-2.29e-01

Pos logit contributions
word+14.967
chery+13.923
iron+12.124
words+12.091
hens+11.836
Neg logit contributions
 millenn-20.853
 seiz-20.165
��-20.162
 Entered-19.825
 proport-19.777
Token basis
Token ID4308
Feature activation+1.50e-01

Pos logit contributions
chery+2.155
hens+1.917
 Nether+1.901
word+1.729
iron+1.662
Neg logit contributions
allo-3.143
 accuser-3.074
ctica-3.061
gio-3.003
omorphic-2.963
Token,
Token ID11
Feature activation+1.33e-01

Pos logit contributions
allo+1.672
 accuser+1.625
ctica+1.593
omorphic+1.561
gio+1.550
Neg logit contributions
chery-1.810
hens-1.662
 Nether-1.651
word-1.525
iron-1.498
Token and
Token ID290
Feature activation+8.23e-02

Pos logit contributions
allo+1.486
 accuser+1.443
ctica+1.410
omorphic+1.380
gio+1.376
Neg logit contributions
chery-1.610
hens-1.477
 Nether-1.462
word-1.354
iron-1.325
Token em
Token ID795
Feature activation+4.38e-01

Pos logit contributions
allo+0.913
 accuser+0.888
ctica+0.867
gio+0.849
omorphic+0.848
Neg logit contributions
chery-1.002
hens-0.921
 Nether-0.911
word-0.846
iron-0.828
Token efforts
Token ID4040
Feature activation-2.11e-01

Pos logit contributions
chery+2.758
hens+2.523
 Nether+2.515
word+2.273
iron+2.234
Neg logit contributions
allo-2.760
 accuser-2.668
ctica-2.653
gio-2.623
omorphic-2.603
Token to
Token ID284
Feature activation-7.66e-01

Pos logit contributions
chery+2.419
hens+2.190
 Nether+2.149
word+2.004
iron+1.961
Neg logit contributions
allo-2.493
 accuser-2.438
ctica-2.384
gio-2.368
omorphic-2.333
Token pull
Token ID2834
Feature activation-5.09e-01

Pos logit contributions
chery+8.794
 Nether+7.958
hens+7.872
word+7.284
iron+7.208
Neg logit contributions
allo-7.736
 accuser-7.658
omorphic-7.636
gio-7.556
ctica-7.531
Token the
Token ID262
Feature activation-2.18e-01

Pos logit contributions
chery+5.255
hens+4.777
 Nether+4.688
word+4.220
iron+4.162
Neg logit contributions
 accuser-6.171
allo-6.116
ctica-6.090
gio-5.979
968-5.923
Token Fukushima
Token ID34170
Feature activation-5.42e-01

Pos logit contributions
chery+2.625
hens+2.412
 Nether+2.405
word+2.194
iron+2.147
Neg logit contributions
allo-2.416
 accuser-2.349
ctica-2.344
gio-2.291
omorphic-2.290
Token plant
Token ID4618
Feature activation-1.70e+00

Pos logit contributions
chery+5.810
hens+5.428
 Nether+5.286
word+4.749
iron+4.731
Neg logit contributions
allo-6.306
ctica-6.143
 accuser-6.124
omorphic-6.032
gio-6.029
Token back
Token ID736
Feature activation-8.44e-01

Pos logit contributions
chery+14.858
hens+13.199
 Nether+12.655
 BT+12.610
iron+12.090
Neg logit contributions
ctica-15.464
-15.266
omorphic-15.246
bably-15.242
968-15.183
Token from
Token ID422
Feature activation-3.76e-01

Pos logit contributions
chery+8.100
hens+7.573
 Nether+7.027
word+6.791
iron+6.656
Neg logit contributions
allo-10.274
 accuser-10.160
gio-9.772
968-9.634
ctica-9.569
Token the
Token ID262
Feature activation-6.27e-01

Pos logit contributions
chery+4.344
 Nether+3.931
hens+3.897
word+3.549
 Chern+3.464
Neg logit contributions
allo-4.244
 accuser-4.214
gio-4.113
968-4.041
omorphic-4.032
Token brink
Token ID29585
Feature activation-3.78e-01

Pos logit contributions
chery+5.899
 Nether+5.443
hens+5.290
 Chern+4.693
 BT+4.594
Neg logit contributions
allo-8.065
gio-7.866
omorphic-7.778
ctica-7.775
 accuser-7.726
Token.
Token ID13
Feature activation-9.71e-02

Pos logit contributions
chery+4.265
hens+3.894
 Nether+3.714
word+3.549
iron+3.437
Neg logit contributions
allo-4.511
 accuser-4.408
gio-4.273
ctica-4.266
968-4.230
Token world
Token ID995
Feature activation+6.77e-02

Pos logit contributions
allo+3.872
 accuser+3.864
ctica+3.650
lake+3.522
gio+3.376
Neg logit contributions
chery-9.888
hens-9.150
 Nether-8.746
iron-8.593
argon-8.587
Token.
Token ID13
Feature activation-2.85e-01

Pos logit contributions
allo+0.753
 accuser+0.735
ctica+0.718
gio+0.703
omorphic+0.702
Neg logit contributions
chery-0.822
hens-0.754
 Nether-0.747
word-0.692
iron-0.676
Token (
Token ID357
Feature activation+4.03e-01

Pos logit contributions
chery+3.098
hens+2.835
 Nether+2.826
word+2.560
iron+2.533
Neg logit contributions
allo-3.452
 accuser-3.399
omorphic-3.293
ctica-3.292
gio-3.223
TokenWa
Token ID33484
Feature activation-9.14e-01

Pos logit contributions
allo+4.241
gio+4.002
 accuser+4.002
ctica+3.991
968+3.924
Neg logit contributions
chery-4.956
 Nether-4.632
hens-4.572
word-4.176
iron-4.066
Tokenub
Token ID549
Feature activation+5.99e-04

Pos logit contributions
chery+8.193
 Nether+7.177
word+7.174
hens+7.135
iron+6.713
Neg logit contributions
 accuser-11.747
ctica-11.103
-10.978
 Entered-10.798
lake-10.703
Tokenges
Token ID3212
Feature activation-1.69e+00

Pos logit contributions
allo+0.006
 accuser+0.005
ctica+0.005
gio+0.005
omorphic+0.005
Neg logit contributions
chery-0.008
hens-0.008
 Nether-0.008
word-0.007
iron-0.007
Tokenhig
Token ID25196
Feature activation+2.51e-02

Pos logit contributions
chery+9.236
hens+8.062
word+7.173
icken+6.354
hen+6.080
Neg logit contributions
 accuser-23.627
 millenn-23.182
ーテ-22.996
-22.832
 destro-22.148
Token Rice
Token ID13823
Feature activation-9.25e-01

Pos logit contributions
allo+0.199
 accuser+0.192
ctica+0.186
gio+0.181
omorphic+0.180
Neg logit contributions
chery-0.386
hens-0.361
 Nether-0.357
word-0.338
iron-0.332
Token/
Token ID14
Feature activation-5.07e-01

Pos logit contributions
chery+6.261
 Nether+5.580
hens+5.356
word+4.924
 BT+4.552
Neg logit contributions
��-13.304
ctica-13.266
bably-13.159
allo-13.146
omorphic-13.143
TokenCBC
Token ID29208
Feature activation-5.76e-01

Pos logit contributions
chery+2.781
hens+2.219
 Nether+1.985
word+1.726
iron+1.608
Neg logit contributions
 accuser-8.828
allo-8.725
omorphic-8.614
ctica-8.565
bably-8.466
Token)
Token ID8
Feature activation-1.70e-01

Pos logit contributions
chery+6.101
hens+5.604
 Nether+5.594
word+5.216
iron+5.131
Neg logit contributions
allo-6.675
 accuser-6.494
omorphic-6.409
ctica-6.388
gio-6.334
Token introduced
Token ID5495
Feature activation-2.24e-01

Pos logit contributions
chery+6.621
hens+6.010
 Nether+5.996
word+5.685
iron+5.555
Neg logit contributions
allo-6.643
ctica-6.275
 accuser-6.236
gio-6.205
omorphic-6.100
Token to
Token ID284
Feature activation-2.83e-02

Pos logit contributions
chery+2.529
hens+2.308
 Nether+2.279
word+2.121
iron+2.079
Neg logit contributions
allo-2.670
 accuser-2.609
ctica-2.544
omorphic-2.503
gio-2.500
Token a
Token ID257
Feature activation+4.79e-01

Pos logit contributions
chery+0.333
hens+0.304
 Nether+0.302
word+0.279
iron+0.272
Neg logit contributions
allo-0.326
 accuser-0.318
ctica-0.311
gio-0.305
omorphic-0.304
Token very
Token ID845
Feature activation-2.62e-02

Pos logit contributions
allo+5.072
 accuser+5.027
omorphic+4.761
ctica+4.734
gio+4.708
Neg logit contributions
chery-5.743
hens-5.332
 Nether-5.047
word-4.759
iron-4.725
Token active
Token ID4075
Feature activation+6.19e-01

Pos logit contributions
chery+0.323
hens+0.296
 Nether+0.294
word+0.273
iron+0.267
Neg logit contributions
allo-0.289
 accuser-0.281
ctica-0.276
gio-0.270
omorphic-0.269
Token November
Token ID3389
Feature activation+1.45e+00

Pos logit contributions
 accuser+6.487
allo+6.383
omorphic+5.962
ctica+5.961
lake+5.935
Neg logit contributions
chery-7.353
hens-6.810
 Nether-6.297
word-6.027
argon-5.925
Token Project
Token ID4935
Feature activation+7.33e-01

Pos logit contributions
 accuser+11.146
allo+10.243
omorphic+10.094
 participant+10.042
 Palmer+9.996
Neg logit contributions
chery-17.030
hens-15.177
argon-14.188
 Nether-14.186
word-13.562
Token group
Token ID1448
Feature activation-1.86e-01

Pos logit contributions
 accuser+7.216
allo+7.150
omorphic+6.520
ctica+6.451
 Entered+6.433
Neg logit contributions
chery-8.922
hens-8.254
 Nether-7.800
word-7.415
argon-7.275
Token in
Token ID287
Feature activation-1.44e-01

Pos logit contributions
chery+2.153
 Nether+1.949
hens+1.945
word+1.809
iron+1.766
Neg logit contributions
allo-2.165
 accuser-2.105
ctica-2.095
gio-2.049
omorphic-2.045
Token Edmonton
Token ID21165
Feature activation+1.31e-01

Pos logit contributions
chery+1.665
hens+1.538
 Nether+1.531
word+1.401
iron+1.365
Neg logit contributions
allo-1.680
 accuser-1.629
ctica-1.609
omorphic-1.574
gio-1.571
Token that
Token ID326
Feature activation+1.49e-02

Pos logit contributions
allo+1.472
 accuser+1.434
ctica+1.400
gio+1.368
omorphic+1.366
Neg logit contributions
chery-1.570
hens-1.443
 Nether-1.422
word-1.316
iron-1.284
Token distributions
Token ID24570
Feature activation+6.72e-01

Pos logit contributions
allo+5.216
 accuser+5.153
omorphic+5.090
ctica+4.983
gio+4.890
Neg logit contributions
chery-5.956
hens-5.372
 Nether-5.286
word-4.928
iron-4.908
Token in
Token ID287
Feature activation+9.68e-01

Pos logit contributions
allo+6.559
 accuser+6.492
968+6.204
omorphic+6.151
gio+6.036
Neg logit contributions
chery-8.183
hens-7.611
 Nether-7.418
iron-6.966
argon-6.831
Token multiple
Token ID3294
Feature activation+8.90e-01

Pos logit contributions
omorphic+8.424
allo+8.365
 accuser+8.275
ctica+7.955
gio+7.900
Neg logit contributions
chery-11.972
hens-10.859
 Nether-10.621
iron-10.236
word-9.931
Token versions
Token ID6300
Feature activation+6.69e-01

Pos logit contributions
 accuser+7.846
omorphic+7.725
allo+7.676
ctica+7.397
lake+7.386
Neg logit contributions
chery-11.189
hens-10.026
 Nether-9.735
word-9.207
iron-9.195
Token,
Token ID11
Feature activation+6.97e-01

Pos logit contributions
allo+6.234
 accuser+6.150
omorphic+5.859
ctica+5.839
gio+5.811
Neg logit contributions
chery-8.367
hens-7.735
 Nether-7.597
iron-7.185
word-7.150
Token each
Token ID1123
Feature activation+1.74e+00

Pos logit contributions
allo+7.051
 accuser+6.955
omorphic+6.734
ctica+6.542
968+6.510
Neg logit contributions
chery-8.275
hens-7.650
 Nether-7.346
word-6.938
iron-6.920
Token with
Token ID351
Feature activation+1.07e+00

Pos logit contributions
 accuser+14.636
omorphic+14.066
allo+13.488
 Entered+13.255
968+13.192
Neg logit contributions
chery-15.616
hens-15.451
 Nether-14.521
iron-13.852
argon-13.595
Token their
Token ID511
Feature activation+1.03e+00

Pos logit contributions
 accuser+9.708
allo+9.570
omorphic+9.290
ctica+9.023
968+8.948
Neg logit contributions
chery-11.903
hens-11.182
 Nether-10.651
iron-10.583
word-10.252
Token own
Token ID898
Feature activation+8.05e-01

Pos logit contributions
 accuser+7.893
allo+7.730
omorphic+7.622
968+7.176
dri+7.111
Neg logit contributions
chery-13.607
hens-12.904
 Nether-12.145
iron-11.979
word-11.817
Token versions
Token ID6300
Feature activation+6.83e-01

Pos logit contributions
 accuser+8.350
allo+8.335
omorphic+8.118
ctica+7.907
lake+7.776
Neg logit contributions
chery-9.095
hens-8.247
 Nether-7.791
iron-7.657
word-7.486
Token of
Token ID286
Feature activation+9.30e-01

Pos logit contributions
allo+6.225
 accuser+5.946
ctica+5.929
gio+5.756
omorphic+5.743
Neg logit contributions
chery-8.644
 Nether-8.193
hens-8.167
iron-7.595
word-7.429
Token the
Token ID262
Feature activation+1.15e+00

Pos logit contributions
 accuser+5.513
allo+5.325
lake+4.930
ctica+4.924
omorphic+4.877
Neg logit contributions
chery-5.950
hens-5.572
 Nether-5.393
iron-5.000
word-4.996
Token ability
Token ID2694
Feature activation-2.32e-02

Pos logit contributions
 accuser+12.976
allo+11.155
 misconduct+10.933
 harms+10.617
 Entered+10.609
Neg logit contributions
chery-12.224
hens-10.819
 Nether-10.652
iron-9.959
word-9.727
Token to
Token ID284
Feature activation+1.91e-01

Pos logit contributions
chery+0.280
hens+0.257
 Nether+0.253
word+0.235
iron+0.230
Neg logit contributions
allo-0.262
 accuser-0.256
ctica-0.250
gio-0.245
omorphic-0.244
Token confront
Token ID7239
Feature activation+6.39e-01

Pos logit contributions
allo+2.143
 accuser+2.141
ctica+2.043
gio+2.000
omorphic+1.993
Neg logit contributions
chery-2.293
hens-2.101
 Nether-2.067
word-1.909
iron-1.876
Token one
Token ID530
Feature activation+1.08e+00

Pos logit contributions
 accuser+6.870
allo+6.317
omorphic+5.938
ctica+5.926
lake+5.745
Neg logit contributions
chery-7.846
hens-7.191
 Nether-6.997
iron-6.649
word-6.526
Token's
Token ID338
Feature activation+1.18e+00

Pos logit contributions
 accuser+10.064
allo+7.930
 participant+7.378
 misconduct+7.343
 harms+7.314
Neg logit contributions
chery-13.849
hens-12.955
 Nether-12.680
iron-11.921
word-11.860
Token accuser
Token ID43738
Feature activation+2.55e-01

Pos logit contributions
 accuser+10.870
 misconduct+7.524
allo+7.256
 harms+6.919
lake+6.788
Neg logit contributions
chery-16.527
hens-15.019
 Nether-14.498
iron-14.174
////////////////-14.054
Token is
Token ID318
Feature activation+5.61e-01

Pos logit contributions
 accuser+2.756
allo+2.711
ctica+2.596
omorphic+2.545
gio+2.512
Neg logit contributions
chery-3.128
hens-2.889
 Nether-2.864
word-2.652
iron-2.603
Token important
Token ID1593
Feature activation+9.27e-02

Pos logit contributions
 accuser+5.779
allo+5.726
omorphic+5.484
ctica+5.435
968+5.316
Neg logit contributions
chery-6.731
hens-6.317
 Nether-6.217
word-5.721
iron-5.575
Token not
Token ID407
Feature activation-2.54e-01

Pos logit contributions
allo+1.050
 accuser+1.027
ctica+0.996
gio+0.976
omorphic+0.975
Neg logit contributions
chery-1.110
hens-1.020
 Nether-1.008
word-0.931
iron-0.913
Token merely
Token ID6974
Feature activation-6.80e-02

Pos logit contributions
chery+2.310
hens+2.038
 Nether+1.963
word+1.786
iron+1.721
Neg logit contributions
allo-3.604
 accuser-3.465
ctica-3.454
gio-3.440
omorphic-3.393
Token that
Token ID326
Feature activation+5.48e-01

Pos logit contributions
allo+6.284
 accuser+6.160
omorphic+5.980
lake+5.837
gio+5.809
Neg logit contributions
chery-6.549
hens-6.014
 Nether-5.945
word-5.497
iron-5.449
Token apps
Token ID6725
Feature activation+2.43e-01

Pos logit contributions
allo+6.140
 accuser+6.119
omorphic+5.868
gio+5.717
lake+5.714
Neg logit contributions
chery-6.078
hens-5.605
 Nether-5.463
word-5.115
iron-5.054
Token are
Token ID389
Feature activation-9.78e-02

Pos logit contributions
allo+2.553
 accuser+2.511
ctica+2.423
omorphic+2.404
gio+2.371
Neg logit contributions
chery-3.041
hens-2.818
 Nether-2.766
word-2.574
iron-2.544
Token isolated
Token ID11557
Feature activation+3.62e-01

Pos logit contributions
chery+1.169
hens+1.070
 Nether+1.059
word+0.984
iron+0.958
Neg logit contributions
allo-1.110
 accuser-1.078
ctica-1.062
gio-1.036
omorphic-1.022
Token from
Token ID422
Feature activation+1.13e+00

Pos logit contributions
allo+3.629
 accuser+3.541
omorphic+3.478
ctica+3.430
gio+3.359
Neg logit contributions
chery-4.678
hens-4.263
 Nether-4.234
word-3.918
iron-3.866
Token each
Token ID1123
Feature activation+1.30e+00

Pos logit contributions
 accuser+9.834
allo+9.571
omorphic+9.447
ctica+9.259
olics+8.935
Neg logit contributions
chery-13.345
hens-12.352
 Nether-11.487
iron-11.477
word-11.212
Token other
Token ID584
Feature activation+3.84e-01

Pos logit contributions
 accuser+8.542
allo+8.130
omorphic+8.034
ctica+8.020
lake+7.413
Neg logit contributions
chery-17.084
hens-15.962
 Nether-15.155
word-14.746
iron-14.563
Token and
Token ID290
Feature activation+9.44e-02

Pos logit contributions
allo+4.003
 accuser+3.949
omorphic+3.836
ctica+3.799
gio+3.715
Neg logit contributions
chery-4.754
hens-4.331
 Nether-4.318
word-3.992
iron-3.896
Token from
Token ID422
Feature activation+1.12e+00

Pos logit contributions
allo+1.047
 accuser+1.025
ctica+0.997
omorphic+0.982
gio+0.979
Neg logit contributions
chery-1.148
hens-1.056
 Nether-1.041
word-0.967
iron-0.950
Token the
Token ID262
Feature activation+1.15e+00

Pos logit contributions
 accuser+10.404
allo+10.157
omorphic+10.138
ctica+9.800
olics+9.581
Neg logit contributions
chery-12.530
hens-11.389
 Nether-10.856
iron-10.648
word-10.512
Token host
Token ID2583
Feature activation+1.00e+00

Pos logit contributions
 accuser+11.172
omorphic+10.342
allo+10.312
ctica+9.908
 Entered+9.867
Neg logit contributions
chery-12.936
hens-11.718
iron-11.055
word-10.952
 Nether-10.859
Token Award
Token ID11289
Feature activation+8.70e-01

Pos logit contributions
allo+3.678
 accuser+3.638
ctica+3.440
gio+3.356
 Entered+3.308
Neg logit contributions
chery-7.590
hens-7.151
 Nether-7.021
word-6.598
iron-6.565
Token for
Token ID329
Feature activation+3.27e-01

Pos logit contributions
 accuser+7.519
allo+7.487
 Entered+7.000
ologist+6.963
gio+6.927
Neg logit contributions
chery-10.925
hens-10.229
 Nether-10.177
word-9.166
ether-9.150
Token Dist
Token ID4307
Feature activation-4.90e-01

Pos logit contributions
allo+3.106
 accuser+3.044
ctica+2.947
omorphic+2.882
gio+2.880
Neg logit contributions
chery-4.386
hens-4.100
 Nether-3.982
iron-3.741
word-3.728
Tokeninguished
Token ID46709
Feature activation-3.72e-01

Pos logit contributions
chery+7.469
hens+7.013
 Nether+6.722
word+6.669
iron+6.609
Neg logit contributions
 accuser-3.789
allo-3.678
ctica-3.584
gio-3.408
-3.403
Token Human
Token ID5524
Feature activation+7.45e-01

Pos logit contributions
chery+3.456
 Nether+3.194
hens+3.044
word+2.843
iron+2.615
Neg logit contributions
allo-5.110
 accuser-4.997
ctica-4.950
gio-4.755
omorphic-4.753
Tokenitarian
Token ID8353
Feature activation+1.20e+00

Pos logit contributions
allo+0.812
omorphic+0.503
itarian+0.245
ologist+0.244
gio+0.222
Neg logit contributions
chery-15.263
hens-14.979
 Nether-14.761
word-14.024
argon-13.896
Token Service
Token ID4809
Feature activation+7.66e-01

Pos logit contributions
 Entered+5.540
 Salv+5.245
allo+4.669
gio+4.645
 accuser+4.465
Neg logit contributions
chery-18.035
hens-17.650
argon-16.992
 Nether-16.766
iron-16.454
Token winner
Token ID8464
Feature activation+2.06e-01

Pos logit contributions
 accuser+6.888
allo+6.678
 Entered+6.480
ctica+6.471
omorphic+6.437
Neg logit contributions
chery-9.604
hens-8.891
 Nether-8.858
word-8.125
iron-7.964
Token was
Token ID373
Feature activation+1.86e-01

Pos logit contributions
allo+2.296
 accuser+2.268
ctica+2.199
gio+2.153
omorphic+2.152
Neg logit contributions
chery-2.449
hens-2.250
 Nether-2.205
word-2.047
iron-2.004
Token Ar
Token ID943
Feature activation+4.82e-01

Pos logit contributions
allo+2.240
 accuser+2.209
ctica+2.150
gio+2.112
omorphic+2.111
Neg logit contributions
chery-2.068
hens-1.888
 Nether-1.841
word-1.709
iron-1.675
Tokenlene
Token ID29466
Feature activation-1.04e-01

Pos logit contributions
allo+5.382
ctica+5.182
gio+5.072
 accuser+5.010
omorphic+4.953
Neg logit contributions
chery-5.581
 Nether-5.161
hens-5.099
word-4.657
iron-4.516
Token a
Token ID257
Feature activation+2.14e-01

Pos logit contributions
allo+3.784
 accuser+3.726
ctica+3.574
gio+3.495
omorphic+3.486
Neg logit contributions
chery-4.123
hens-3.763
 Nether-3.644
word-3.396
iron-3.346
Token former
Token ID1966
Feature activation+5.98e-02

Pos logit contributions
allo+2.638
 accuser+2.602
ctica+2.512
gio+2.491
omorphic+2.469
Neg logit contributions
chery-2.357
hens-2.114
 Nether-2.043
word-1.910
iron-1.876
Token competition
Token ID5449
Feature activation+2.81e-01

Pos logit contributions
allo+0.687
 accuser+0.672
ctica+0.655
gio+0.643
omorphic+0.640
Neg logit contributions
chery-0.706
hens-0.644
 Nether-0.634
word-0.590
iron-0.576
Token commissioner
Token ID16248
Feature activation+4.60e-01

Pos logit contributions
allo+2.988
 accuser+2.984
ctica+2.855
gio+2.776
omorphic+2.750
Neg logit contributions
chery-3.501
hens-3.235
 Nether-3.161
word-2.915
iron-2.892
Token,
Token ID11
Feature activation+2.10e-01

Pos logit contributions
allo+4.664
 accuser+4.625
ctica+4.422
gio+4.306
omorphic+4.287
Neg logit contributions
chery-5.748
hens-5.330
 Nether-5.239
word-4.834
iron-4.725
Token has
Token ID468
Feature activation+3.54e-01

Pos logit contributions
allo+2.196
 accuser+2.174
ctica+2.087
gio+2.042
omorphic+2.033
Neg logit contributions
chery-2.664
hens-2.456
 Nether-2.405
word-2.236
iron-2.193
Token already
Token ID1541
Feature activation+4.76e-01

Pos logit contributions
allo+3.697
 accuser+3.650
ctica+3.508
omorphic+3.453
gio+3.415
Neg logit contributions
chery-4.392
hens-4.061
 Nether-3.980
word-3.659
iron-3.629
Token proposed
Token ID5150
Feature activation-8.19e-02

Pos logit contributions
allo+4.897
 accuser+4.850
ctica+4.628
omorphic+4.552
gio+4.497
Neg logit contributions
chery-5.850
hens-5.410
 Nether-5.302
iron-4.827
word-4.816
Token that
Token ID326
Feature activation+1.45e-01

Pos logit contributions
chery+0.927
hens+0.851
 Nether+0.841
word+0.776
iron+0.754
Neg logit contributions
allo-0.976
 accuser-0.954
ctica-0.936
gio-0.919
omorphic-0.915
Token EU
Token ID4576
Feature activation+2.26e-01

Pos logit contributions
allo+1.634
 accuser+1.599
ctica+1.558
gio+1.520
omorphic+1.520
Neg logit contributions
chery-1.726
hens-1.577
 Nether-1.556
word-1.447
iron-1.420
Token countries
Token ID2678
Feature activation+1.97e-01

Pos logit contributions
allo+2.547
 accuser+2.487
ctica+2.430
omorphic+2.365
gio+2.356
Neg logit contributions
chery-2.700
hens-2.422
 Nether-2.419
word-2.247
iron-2.197
Token to
Token ID284
Feature activation-1.32e-01

Pos logit contributions
chery+5.077
hens+4.529
 Nether+4.452
word+4.110
iron+4.005
Neg logit contributions
allo-6.165
 accuser-6.009
gio-5.970
ctica-5.905
968-5.799
Token re
Token ID302
Feature activation-2.79e-01

Pos logit contributions
chery+1.592
 Nether+1.450
hens+1.446
word+1.335
iron+1.312
Neg logit contributions
allo-1.484
 accuser-1.443
ctica-1.420
gio-1.392
omorphic-1.380
Token-
Token ID12
Feature activation-4.33e-01

Pos logit contributions
chery+3.149
hens+2.962
 Nether+2.873
word+2.770
iron+2.678
Neg logit contributions
 accuser-3.237
allo-3.202
ctica-3.128
gio-3.037
omorphic-2.995
Tokenim
Token ID320
Feature activation+3.53e-01

Pos logit contributions
chery+4.656
hens+4.410
 Nether+4.212
word+4.155
iron+4.110
Neg logit contributions
 accuser-5.186
allo-5.051
ctica-5.018
gio-4.784
 Entered-4.772
Tokenagine
Token ID12756
Feature activation+2.55e-01

Pos logit contributions
allo+3.861
omorphic+3.631
 accuser+3.625
ctica+3.580
gio+3.571
Neg logit contributions
chery-4.309
hens-3.926
 Nether-3.914
word-3.541
iron-3.454
Token the
Token ID262
Feature activation-3.80e-02

Pos logit contributions
allo+2.779
 accuser+2.737
ctica+2.651
omorphic+2.624
gio+2.609
Neg logit contributions
chery-3.095
hens-2.865
 Nether-2.766
word-2.604
iron-2.544
Token condo
Token ID32119
Feature activation-2.30e-01

Pos logit contributions
chery+0.464
hens+0.426
 Nether+0.423
word+0.392
iron+0.383
Neg logit contributions
allo-0.422
 accuser-0.410
ctica-0.402
gio-0.393
omorphic-0.392
Token in
Token ID287
Feature activation+9.97e-02

Pos logit contributions
chery+2.796
 Nether+2.561
hens+2.555
word+2.377
iron+2.332
Neg logit contributions
allo-2.533
 accuser-2.423
ctica-2.407
gio-2.337
omorphic-2.327
Token Toronto
Token ID6586
Feature activation+5.46e-02

Pos logit contributions
allo+1.172
 accuser+1.151
ctica+1.121
gio+1.101
omorphic+1.101
Neg logit contributions
chery-1.146
hens-1.046
 Nether-1.029
word-0.950
iron-0.930
Token,
Token ID11
Feature activation-6.31e-02

Pos logit contributions
allo+0.618
 accuser+0.603
ctica+0.590
gio+0.577
omorphic+0.576
Neg logit contributions
chery-0.654
hens-0.599
 Nether-0.592
word-0.548
iron-0.536
Token and
Token ID290
Feature activation-1.74e-01

Pos logit contributions
chery+0.728
hens+0.666
 Nether+0.661
word+0.608
iron+0.596
Neg logit contributions
allo-0.743
 accuser-0.723
ctica-0.709
gio-0.697
omorphic-0.694
Token,
Token ID11
Feature activation-4.03e-03

Pos logit contributions
allo+3.249
 accuser+3.177
ctica+3.103
omorphic+3.023
gio+2.989
Neg logit contributions
chery-3.792
hens-3.535
 Nether-3.510
word-3.229
iron-3.176
Token as
Token ID355
Feature activation-2.90e-02

Pos logit contributions
chery+0.050
hens+0.046
 Nether+0.046
word+0.042
iron+0.041
Neg logit contributions
allo-0.044
 accuser-0.043
ctica-0.042
gio-0.041
omorphic-0.041
Token well
Token ID880
Feature activation-4.19e-02

Pos logit contributions
chery+0.369
hens+0.340
 Nether+0.336
word+0.313
iron+0.306
Neg logit contributions
allo-0.307
 accuser-0.299
ctica-0.292
gio-0.286
omorphic-0.285
Token as
Token ID355
Feature activation-2.28e-01

Pos logit contributions
chery+0.461
hens+0.418
 Nether+0.414
word+0.381
iron+0.370
Neg logit contributions
allo-0.517
 accuser-0.505
ctica-0.495
gio-0.486
omorphic-0.486
Token my
Token ID616
Feature activation+1.68e-01

Pos logit contributions
chery+2.589
 Nether+2.359
hens+2.333
word+2.164
iron+2.074
Neg logit contributions
allo-2.687
 accuser-2.616
ctica-2.591
gio-2.555
omorphic-2.551
Token own
Token ID898
Feature activation+2.15e-01

Pos logit contributions
allo+1.843
 accuser+1.808
ctica+1.745
gio+1.713
omorphic+1.699
Neg logit contributions
chery-2.064
hens-1.902
 Nether-1.862
word-1.734
iron-1.713
Token experiences
Token ID6461
Feature activation+6.42e-01

Pos logit contributions
allo+2.303
 accuser+2.267
ctica+2.172
gio+2.123
omorphic+2.107
Neg logit contributions
chery-2.684
hens-2.501
 Nether-2.422
word-2.258
iron-2.246
Token living
Token ID2877
Feature activation+4.24e-01

Pos logit contributions
allo+6.081
 accuser+5.896
ctica+5.830
968+5.538
omorphic+5.457
Neg logit contributions
chery-7.987
hens-7.774
 Nether-7.490
iron-6.844
argon-6.808
Token in
Token ID287
Feature activation+2.61e-01

Pos logit contributions
allo+4.178
 accuser+4.098
ctica+4.029
968+3.855
omorphic+3.779
Neg logit contributions
chery-5.436
hens-5.118
 Nether-4.958
word-4.685
iron-4.583
Token and
Token ID290
Feature activation-1.80e-02

Pos logit contributions
allo+3.035
 accuser+3.003
ctica+2.946
gio+2.839
omorphic+2.823
Neg logit contributions
chery-2.978
hens-2.675
 Nether-2.638
word-2.476
iron-2.439
Token traveling
Token ID11300
Feature activation+7.01e-01

Pos logit contributions
chery+0.211
hens+0.192
 Nether+0.190
word+0.176
iron+0.172
Neg logit contributions
allo-0.208
 accuser-0.203
ctica-0.198
gio-0.195
omorphic-0.194
Tokenar
Token ID283
Feature activation+2.18e-01

Pos logit contributions
chery+0.556
hens+0.509
 Nether+0.506
word+0.470
iron+0.459
Neg logit contributions
allo-0.495
 accuser-0.484
ctica-0.472
gio-0.462
omorphic-0.461
Token (
Token ID357
Feature activation-3.68e-01

Pos logit contributions
allo+2.480
 accuser+2.409
ctica+2.356
gio+2.326
omorphic+2.304
Neg logit contributions
chery-2.547
hens-2.351
 Nether-2.316
word-2.142
iron-2.100
TokenHis
Token ID6653
Feature activation+7.01e-02

Pos logit contributions
chery+4.703
hens+4.350
 Nether+4.266
word+4.078
iron+4.019
Neg logit contributions
 accuser-3.735
allo-3.733
ctica-3.639
omorphic-3.518
gio-3.460
Tokenar
Token ID283
Feature activation+1.38e-01

Pos logit contributions
allo+0.774
 accuser+0.756
ctica+0.736
gio+0.720
omorphic+0.719
Neg logit contributions
chery-0.858
hens-0.789
 Nether-0.778
word-0.721
iron-0.708
Token)
Token ID8
Feature activation+7.78e-02

Pos logit contributions
allo+1.530
 accuser+1.492
ctica+1.465
gio+1.433
omorphic+1.427
Neg logit contributions
chery-1.677
hens-1.551
 Nether-1.530
word-1.416
iron-1.384
Token
Token ID447
Feature activation+1.27e-01

Pos logit contributions
allo+0.878
 accuser+0.858
ctica+0.836
gio+0.823
omorphic+0.818
Neg logit contributions
chery-0.932
hens-0.855
 Nether-0.844
word-0.783
iron-0.766
Token
Token ID251
Feature activation+2.34e-01

Pos logit contributions
allo+1.033
 accuser+0.995
ctica+0.973
gio+0.938
omorphic+0.934
Neg logit contributions
chery-1.924
hens-1.801
 Nether-1.787
word-1.682
iron-1.653
Token by
Token ID416
Feature activation-3.66e-01

Pos logit contributions
allo+2.646
 accuser+2.582
ctica+2.513
gio+2.476
omorphic+2.462
Neg logit contributions
chery-2.753
hens-2.510
 Nether-2.493
word-2.302
iron-2.259
Token blocking
Token ID12013
Feature activation-2.86e-01

Pos logit contributions
chery+4.236
 Nether+3.895
hens+3.866
word+3.615
iron+3.443
Neg logit contributions
allo-4.200
ctica-4.019
 accuser-4.014
gio-3.908
omorphic-3.851
Token of
Token ID286
Feature activation-4.61e-02

Pos logit contributions
chery+3.295
 Nether+3.038
hens+2.994
word+2.739
iron+2.656
Neg logit contributions
allo-3.334
 accuser-3.215
ctica-3.209
gio-3.160
omorphic-3.089
Token a
Token ID257
Feature activation-1.74e-02

Pos logit contributions
chery+0.540
hens+0.494
 Nether+0.491
word+0.452
iron+0.439
Neg logit contributions
allo-0.534
 accuser-0.520
ctica-0.511
gio-0.500
omorphic-0.497
Token We
Token ID775
Feature activation+4.86e-02

Pos logit contributions
allo+0.908
 accuser+0.884
ctica+0.866
gio+0.850
omorphic+0.842
Neg logit contributions
chery-0.971
hens-0.891
 Nether-0.876
word-0.814
iron-0.799
Token talk
Token ID1561
Feature activation+6.09e-03

Pos logit contributions
allo+0.551
 accuser+0.538
ctica+0.525
gio+0.515
omorphic+0.514
Neg logit contributions
chery-0.580
hens-0.532
 Nether-0.526
word-0.487
iron-0.476
Token a
Token ID257
Feature activation-4.55e-01

Pos logit contributions
allo+0.073
 accuser+0.072
ctica+0.070
gio+0.069
omorphic+0.069
Neg logit contributions
chery-0.069
hens-0.063
 Nether-0.062
word-0.057
iron-0.056
Token little
Token ID1310
Feature activation-5.61e-01

Pos logit contributions
chery+4.485
 Nether+3.960
hens+3.923
word+3.744
iron+3.524
Neg logit contributions
allo-5.825
 accuser-5.723
ctica-5.684
omorphic-5.594
gio-5.588
Token about
Token ID546
Feature activation+1.66e-01

Pos logit contributions
chery+5.538
 Nether+4.869
hens+4.756
word+4.528
iron+4.317
Neg logit contributions
allo-7.193
ctica-6.987
gio-6.924
 accuser-6.866
omorphic-6.820
Token her
Token ID607
Feature activation+5.47e-01

Pos logit contributions
allo+2.096
 accuser+2.052
ctica+2.003
gio+1.967
omorphic+1.965
Neg logit contributions
chery-1.771
hens-1.617
 Nether-1.572
word-1.445
iron-1.420
Token support
Token ID1104
Feature activation-1.36e-01

Pos logit contributions
allo+6.070
 accuser+6.049
ctica+5.665
gio+5.595
968+5.560
Neg logit contributions
chery-6.171
hens-5.816
 Nether-5.461
word-5.116
iron-5.084
Token for
Token ID329
Feature activation-1.17e-01

Pos logit contributions
chery+1.633
hens+1.491
 Nether+1.483
word+1.390
iron+1.340
Neg logit contributions
allo-1.517
 accuser-1.481
ctica-1.446
omorphic-1.432
gio-1.425
Token Planned
Token ID18451
Feature activation+1.59e-02

Pos logit contributions
chery+1.363
hens+1.241
 Nether+1.238
word+1.140
iron+1.107
Neg logit contributions
allo-1.365
 accuser-1.329
ctica-1.304
gio-1.281
omorphic-1.276
Token Parenthood
Token ID18461
Feature activation-4.28e-04

Pos logit contributions
allo+0.152
 accuser+0.148
ctica+0.144
gio+0.140
omorphic+0.140
Neg logit contributions
chery-0.219
hens-0.203
 Nether-0.201
word-0.189
iron-0.185
Token,
Token ID11
Feature activation-3.14e-01

Pos logit contributions
chery+0.005
hens+0.005
 Nether+0.005
word+0.004
iron+0.004
Neg logit contributions
allo-0.005
 accuser-0.005
ctica-0.005
gio-0.005
omorphic-0.005
Token anyone
Token ID2687
Feature activation+4.58e-01

Pos logit contributions
chery+4.635
 Nether+4.200
hens+4.175
word+3.856
iron+3.668
Neg logit contributions
allo-5.030
ctica-4.833
 accuser-4.767
gio-4.688
omorphic-4.638
Token make
Token ID787
Feature activation-5.69e-01

Pos logit contributions
 accuser+4.725
allo+4.563
ctica+4.378
omorphic+4.341
968+4.310
Neg logit contributions
chery-5.589
hens-5.333
 Nether-5.180
word-4.717
iron-4.696
Token you
Token ID345
Feature activation-1.09e+00

Pos logit contributions
chery+5.129
 Nether+4.613
hens+4.315
word+4.242
iron+4.107
Neg logit contributions
allo-7.759
ctica-7.577
gio-7.455
omorphic-7.274
 accuser-7.250
Token feel
Token ID1254
Feature activation-3.11e-01

Pos logit contributions
chery+8.827
 Nether+8.028
hens+7.742
word+7.557
 joke+7.240
Neg logit contributions
allo-13.955
ctica-13.446
bably-13.155
-13.122
gio-13.086
Token like
Token ID588
Feature activation+7.88e-02

Pos logit contributions
chery+3.823
hens+3.508
 Nether+3.459
word+3.251
iron+3.178
Neg logit contributions
allo-3.396
ctica-3.224
 accuser-3.177
omorphic-3.126
gio-3.105
Token you
Token ID345
Feature activation-2.73e-01

Pos logit contributions
allo+0.939
 accuser+0.922
ctica+0.896
gio+0.880
omorphic+0.877
Neg logit contributions
chery-0.894
hens-0.816
 Nether-0.806
word-0.743
iron-0.726
Token don
Token ID836
Feature activation-2.64e-01

Pos logit contributions
chery+3.112
hens+2.786
 Nether+2.767
word+2.597
iron+2.502
Neg logit contributions
allo-3.281
 accuser-3.112
ctica-3.075
gio-3.023
omorphic-3.018
Token't
Token ID470
Feature activation+1.18e-01

Pos logit contributions
chery+2.277
hens+1.977
 Nether+1.928
word+1.758
iron+1.695
Neg logit contributions
 accuser-3.879
allo-3.837
ctica-3.756
omorphic-3.700
bably-3.648
Token matter
Token ID2300
Feature activation+3.44e-01

Pos logit contributions
allo+1.179
 accuser+1.152
ctica+1.120
gio+1.097
omorphic+1.092
Neg logit contributions
chery-1.556
hens-1.443
 Nether-1.425
word-1.330
iron-1.306
Token or
Token ID393
Feature activation-6.39e-02

Pos logit contributions
allo+3.492
 accuser+3.431
ctica+3.313
omorphic+3.243
gio+3.207
Neg logit contributions
chery-4.385
hens-4.036
 Nether-4.034
word-3.710
iron-3.612
Token like
Token ID588
Feature activation-2.03e-02

Pos logit contributions
chery+0.741
hens+0.675
 Nether+0.669
word+0.618
iron+0.601
Neg logit contributions
allo-0.752
 accuser-0.730
ctica-0.717
gio-0.702
omorphic-0.701
Token Sturgeon
Token ID43623
Feature activation+6.08e-01

Pos logit contributions
 accuser+6.489
allo+6.394
ctica+6.180
omorphic+5.946
lake+5.916
Neg logit contributions
chery-7.841
hens-7.086
 Nether-6.811
iron-6.550
word-6.526
Token,
Token ID11
Feature activation+2.81e-01

Pos logit contributions
 accuser+5.750
allo+5.597
ctica+5.525
lake+5.215
gio+5.090
Neg logit contributions
chery-7.858
hens-7.244
 Nether-7.089
word-6.655
iron-6.491
Token '
Token ID705
Feature activation+2.80e-01

Pos logit contributions
allo+3.033
 accuser+2.989
ctica+2.884
omorphic+2.808
gio+2.801
Neg logit contributions
chery-3.435
hens-3.134
 Nether-3.079
word-2.863
iron-2.819
Tokenlight
Token ID2971
Feature activation-4.30e-01

Pos logit contributions
allo+3.309
 accuser+3.168
ctica+3.122
gio+3.111
lake+3.086
Neg logit contributions
chery-3.159
 Nether-2.885
hens-2.829
word-2.559
iron-2.485
Token a
Token ID257
Feature activation-4.08e-01

Pos logit contributions
chery+3.678
hens+3.469
 Nether+3.360
word+3.060
iron+2.981
Neg logit contributions
allo-6.072
 accuser-5.914
gio-5.813
ctica-5.800
omorphic-5.776
Token fire
Token ID2046
Feature activation-4.19e-01

Pos logit contributions
chery+4.394
 Nether+3.995
hens+3.972
word+3.671
iron+3.543
Neg logit contributions
allo-4.956
 accuser-4.799
ctica-4.745
gio-4.699
omorphic-4.693
Token under
Token ID739
Feature activation+7.58e-02

Pos logit contributions
chery+4.581
hens+4.223
 Nether+4.142
word+3.962
iron+3.847
Neg logit contributions
allo-4.910
 accuser-4.876
ctica-4.779
gio-4.745
omorphic-4.699
Token Westminster
Token ID20353
Feature activation+2.84e-01

Pos logit contributions
allo+0.861
 accuser+0.842
ctica+0.824
gio+0.804
omorphic+0.803
Neg logit contributions
chery-0.904
hens-0.825
 Nether-0.815
word-0.755
iron-0.740
Token's
Token ID338
Feature activation+1.14e-01

Pos logit contributions
allo+3.070
 accuser+3.010
ctica+2.922
lake+2.856
gio+2.843
Neg logit contributions
chery-3.471
hens-3.189
 Nether-3.120
word-2.936
iron-2.857
Token back
Token ID736
Feature activation-2.06e-01

Pos logit contributions
allo+1.305
 accuser+1.287
ctica+1.251
gio+1.218
omorphic+1.218
Neg logit contributions
chery-1.354
hens-1.236
 Nether-1.215
word-1.129
iron-1.105
Tokenside
Token ID1589
Feature activation-3.03e-01

Pos logit contributions
chery+2.341
hens+2.156
 Nether+2.104
word+1.990
iron+1.943
Neg logit contributions
allo-2.432
 accuser-2.374
ctica-2.324
gio-2.287
omorphic-2.260
Token to
Token ID284
Feature activation+4.40e-01

Pos logit contributions
allo+0.830
 accuser+0.813
ctica+0.793
gio+0.778
omorphic+0.775
Neg logit contributions
chery-0.799
hens-0.730
 Nether-0.719
word-0.666
iron-0.651
Token keep
Token ID1394
Feature activation+3.63e-01

Pos logit contributions
allo+4.859
 accuser+4.720
ctica+4.616
gio+4.539
omorphic+4.537
Neg logit contributions
chery-5.136
hens-4.718
 Nether-4.612
word-4.262
iron-4.250
Token your
Token ID534
Feature activation+8.21e-01

Pos logit contributions
allo+4.236
 accuser+4.186
ctica+4.039
gio+3.979
omorphic+3.930
Neg logit contributions
chery-4.078
hens-3.774
 Nether-3.605
word-3.410
iron-3.343
Token road
Token ID2975
Feature activation+4.89e-01

Pos logit contributions
 accuser+8.510
allo+8.324
968+7.804
ctica+7.782
 Entered+7.719
Neg logit contributions
chery-9.443
hens-8.769
 Nether-8.063
word-8.018
iron-7.950
Token trip
Token ID5296
Feature activation-3.11e-01

Pos logit contributions
allo+4.325
 accuser+4.183
ctica+4.099
dri+3.892
omorphic+3.881
Neg logit contributions
chery-6.845
hens-6.294
 Nether-6.288
word-5.757
iron-5.669
Token in
Token ID287
Feature activation-8.98e-02

Pos logit contributions
chery+3.562
hens+3.234
 Nether+3.223
word+2.987
iron+2.876
Neg logit contributions
allo-3.581
 accuser-3.496
ctica-3.424
gio-3.386
omorphic-3.368
Token the
Token ID262
Feature activation-1.46e-03

Pos logit contributions
chery+1.001
hens+0.908
 Nether+0.902
word+0.833
iron+0.808
Neg logit contributions
allo-1.090
 accuser-1.068
ctica-1.044
gio-1.025
omorphic-1.020
Token right
Token ID826
Feature activation+4.41e-02

Pos logit contributions
chery+0.018
hens+0.017
 Nether+0.016
word+0.015
iron+0.015
Neg logit contributions
allo-0.016
 accuser-0.016
ctica-0.015
gio-0.015
omorphic-0.015
Token direction
Token ID4571
Feature activation+8.84e-02

Pos logit contributions
allo+0.508
 accuser+0.496
ctica+0.485
gio+0.475
omorphic+0.474
Neg logit contributions
chery-0.519
hens-0.475
 Nether-0.469
word-0.435
iron-0.425
Token stay
Token ID2652
Feature activation+5.00e-01

Pos logit contributions
allo+0.990
 accuser+0.965
ctica+0.945
gio+0.923
omorphic+0.920
Neg logit contributions
chery-1.067
hens-0.980
 Nether-0.968
word-0.896
iron-0.880
Token at
Token ID379
Feature activation-2.65e-02

Pos logit contributions
allo+5.439
 accuser+5.361
ctica+5.229
omorphic+5.110
gio+5.076
Neg logit contributions
chery-5.681
hens-5.388
 Nether-5.264
word-4.847
iron-4.759
Token "
Token ID366
Feature activation-3.22e-01

Pos logit contributions
chery+3.181
 Nether+2.908
hens+2.868
word+2.710
iron+2.554
Neg logit contributions
allo-3.493
ctica-3.362
 accuser-3.342
omorphic-3.274
gio-3.265
Tokenw
Token ID86
Feature activation-3.33e-01

Pos logit contributions
chery+3.948
hens+3.582
 Nether+3.534
word+3.461
iron+3.303
Neg logit contributions
allo-3.449
 accuser-3.423
ctica-3.355
omorphic-3.261
gio-3.202
Tokenarp
Token ID5117
Feature activation+2.81e-02

Pos logit contributions
chery+2.770
hens+2.484
 Nether+2.349
word+2.207
iron+2.112
Neg logit contributions
 accuser-4.869
allo-4.834
ctica-4.726
gio-4.662
omorphic-4.653
Token factor
Token ID5766
Feature activation-6.09e-01

Pos logit contributions
allo+0.333
 accuser+0.326
ctica+0.319
gio+0.313
omorphic+0.312
Neg logit contributions
chery-0.321
hens-0.292
 Nether-0.289
word-0.266
iron-0.260
Token"
Token ID1
Feature activation-3.55e-01

Pos logit contributions
chery+5.843
hens+5.301
 Nether+5.025
word+4.897
iron+4.499
Neg logit contributions
allo-7.822
 accuser-7.519
ctica-7.455
omorphic-7.358
lake-7.315
Token is
Token ID318
Feature activation-1.89e-01

Pos logit contributions
chery+3.923
hens+3.520
 Nether+3.483
word+3.302
iron+3.111
Neg logit contributions
allo-4.321
ctica-4.171
 accuser-4.114
omorphic-4.034
gio-4.007
Token too
Token ID1165
Feature activation+3.72e-01

Pos logit contributions
chery+2.055
hens+1.872
 Nether+1.854
word+1.715
iron+1.653
Neg logit contributions
allo-2.337
 accuser-2.250
ctica-2.229
gio-2.175
lake-2.154
Token large
Token ID1588
Feature activation+1.85e-01

Pos logit contributions
allo+3.677
 accuser+3.647
ctica+3.462
omorphic+3.448
968+3.396
Neg logit contributions
chery-4.818
hens-4.413
 Nether-4.350
word-4.045
iron-3.995
Token,
Token ID11
Feature activation-2.56e-01

Pos logit contributions
allo+2.034
 accuser+1.992
ctica+1.949
omorphic+1.906
gio+1.902
Neg logit contributions
chery-2.240
hens-2.067
 Nether-2.035
word-1.869
iron-1.848
Token the
Token ID262
Feature activation-3.81e-01

Pos logit contributions
chery+2.806
hens+2.545
 Nether+2.527
word+2.364
iron+2.239
Neg logit contributions
allo-3.156
 accuser-3.051
ctica-3.020
omorphic-2.957
gio-2.957
Token output
Token ID5072
Feature activation-6.00e-01

Pos logit contributions
chery+3.800
 Nether+3.438
hens+3.410
word+3.193
iron+2.951
Neg logit contributions
allo-4.971
ctica-4.799
 accuser-4.738
gio-4.672
omorphic-4.650
Token bright
Token ID6016
Feature activation+3.73e-01

Pos logit contributions
allo+1.413
 accuser+1.381
ctica+1.344
gio+1.317
omorphic+1.316
Neg logit contributions
chery-1.641
hens-1.518
 Nether-1.493
word-1.401
iron-1.370
Token shade
Token ID17979
Feature activation+2.69e-01

Pos logit contributions
allo+3.805
 accuser+3.758
ctica+3.584
lake+3.533
gio+3.491
Neg logit contributions
chery-4.757
hens-4.361
 Nether-4.245
word-4.059
iron-3.928
Token of
Token ID286
Feature activation+3.40e-01

Pos logit contributions
allo+2.767
 accuser+2.680
ctica+2.605
lake+2.540
omorphic+2.530
Neg logit contributions
chery-3.456
hens-3.165
 Nether-3.147
word-2.943
iron-2.866
Token orange
Token ID10912
Feature activation+7.82e-02

Pos logit contributions
allo+3.445
 accuser+3.403
ctica+3.272
omorphic+3.188
gio+3.178
Neg logit contributions
chery-4.382
hens-3.984
 Nether-3.889
word-3.769
iron-3.647
Token and
Token ID290
Feature activation-1.28e-01

Pos logit contributions
allo+0.907
 accuser+0.886
ctica+0.866
gio+0.848
omorphic+0.848
Neg logit contributions
chery-0.913
hens-0.836
 Nether-0.825
word-0.765
iron-0.745
Token the
Token ID262
Feature activation-3.17e-01

Pos logit contributions
chery+1.648
hens+1.518
 Nether+1.510
word+1.391
iron+1.362
Neg logit contributions
allo-1.330
 accuser-1.290
ctica-1.271
gio-1.240
omorphic-1.236
Token waters
Token ID10150
Feature activation-5.07e-01

Pos logit contributions
chery+3.946
 Nether+3.648
hens+3.637
word+3.299
iron+3.238
Neg logit contributions
allo-3.388
ctica-3.242
 accuser-3.217
gio-3.175
omorphic-3.137
Token are
Token ID389
Feature activation+2.16e-01

Pos logit contributions
chery+6.027
hens+5.420
 Nether+5.294
word+5.050
iron+4.842
Neg logit contributions
allo-5.451
 accuser-5.426
ctica-5.287
gio-5.215
omorphic-5.148
Token so
Token ID523
Feature activation+3.50e-01

Pos logit contributions
allo+2.284
 accuser+2.240
ctica+2.161
omorphic+2.131
968+2.106
Neg logit contributions
chery-2.703
hens-2.504
 Nether-2.458
word-2.312
iron-2.257
Token visc
Token ID31116
Feature activation+1.80e-01

Pos logit contributions
allo+3.513
 accuser+3.410
ctica+3.304
omorphic+3.254
lake+3.202
Neg logit contributions
chery-4.512
hens-4.175
 Nether-4.123
word-3.924
iron-3.797
Tokenous
Token ID516
Feature activation+6.03e-02

Pos logit contributions
allo+2.254
 accuser+2.164
omorphic+2.137
ctica+2.131
gio+2.108
Neg logit contributions
chery-1.956
hens-1.765
 Nether-1.756
word-1.610
iron-1.546
Token values
Token ID3815
Feature activation-2.16e-01

Pos logit contributions
allo+2.104
 accuser+2.058
ctica+1.998
omorphic+1.964
gio+1.955
Neg logit contributions
chery-2.458
hens-2.287
 Nether-2.213
word-2.077
iron-2.043
Token.
Token ID13
Feature activation-1.44e-01

Pos logit contributions
chery+2.432
hens+2.198
 Nether+2.167
word+2.010
iron+1.958
Neg logit contributions
allo-2.598
 accuser-2.543
ctica-2.466
gio-2.439
omorphic-2.412
Token It
Token ID632
Feature activation-4.24e-01

Pos logit contributions
chery+1.767
 Nether+1.629
hens+1.616
word+1.495
iron+1.461
Neg logit contributions
allo-1.579
 accuser-1.554
ctica-1.512
omorphic-1.488
gio-1.469
Token is
Token ID318
Feature activation-6.52e-01

Pos logit contributions
chery+4.551
hens+4.121
 Nether+3.948
word+3.675
iron+3.618
Neg logit contributions
allo-5.155
 accuser-5.149
ctica-4.919
gio-4.886
968-4.844
Token a
Token ID257
Feature activation-1.05e+00

Pos logit contributions
chery+6.801
 Nether+5.979
hens+5.735
iron+5.420
word+5.381
Neg logit contributions
allo-7.826
 accuser-7.632
gio-7.563
ctica-7.533
lake-7.415
Token good
Token ID922
Feature activation-1.37e+00

Pos logit contributions
chery+10.762
 Nether+10.062
hens+9.251
word+9.045
 tru+9.044
Neg logit contributions
allo-10.962
ctica-10.815
gio-10.484
 accuser-10.446
lake-10.371
Token idea
Token ID2126
Feature activation-8.90e-01

Pos logit contributions
chery+11.582
 Nether+10.222
word+9.749
hens+9.401
 tru+9.377
Neg logit contributions
allo-15.225
gio-14.949
ctica-14.902
olics-14.598
dri-14.327
Token to
Token ID284
Feature activation-6.19e-01

Pos logit contributions
chery+8.088
hens+6.880
 Nether+6.661
word+6.254
iron+6.056
Neg logit contributions
 accuser-11.009
allo-10.993
gio-10.794
ctica-10.663
lake-10.553
Token link
Token ID2792
Feature activation-4.13e-01

Pos logit contributions
chery+6.522
 Nether+5.976
hens+5.621
word+5.366
iron+5.180
Neg logit contributions
allo-7.364
ctica-7.135
 accuser-7.094
dri-6.910
gio-6.909
Token family
Token ID1641
Feature activation-3.52e-01

Pos logit contributions
chery+4.676
 Nether+4.347
hens+4.167
word+3.911
iron+3.820
Neg logit contributions
allo-4.693
ctica-4.617
 accuser-4.587
gio-4.456
lake-4.404
Token values
Token ID3815
Feature activation-2.24e-01

Pos logit contributions
chery+3.955
 Nether+3.614
hens+3.541
word+3.300
iron+3.225
Neg logit contributions
allo-4.104
 accuser-4.057
ctica-3.980
gio-3.866
lake-3.814
Token was
Token ID373
Feature activation-8.70e-01

Pos logit contributions
chery+2.293
hens+2.100
 Nether+2.066
word+1.937
iron+1.894
Neg logit contributions
allo-1.980
 accuser-1.938
ctica-1.872
gio-1.840
omorphic-1.837
Token at
Token ID379
Feature activation-1.14e+00

Pos logit contributions
chery+9.221
hens+8.552
 Nether+8.424
word+7.737
 BT+7.465
Neg logit contributions
allo-9.486
 accuser-9.427
ctica-9.191
gio-9.154
-9.049
Token The
Token ID383
Feature activation-6.37e-01

Pos logit contributions
chery+10.623
hens+10.600
 Nether+10.502
 Worm+9.391
word+9.148
Neg logit contributions
allo-12.461
 accuser-12.308
ctica-11.787
bably-11.604
omorphic-11.489
Token Head
Token ID7123
Feature activation-3.16e-01

Pos logit contributions
chery+7.112
 Nether+6.708
hens+6.572
word+5.958
 Corn+5.786
Neg logit contributions
allo-7.150
 accuser-7.117
ctica-6.937
omorphic-6.695
968-6.648
Tokenline
Token ID1370
Feature activation-9.65e-01

Pos logit contributions
chery+3.975
hens+3.601
 Nether+3.520
word+3.399
iron+3.236
Neg logit contributions
allo-3.409
 accuser-3.331
ctica-3.273
omorphic-3.173
gio-3.159
Token for
Token ID329
Feature activation-1.52e+00

Pos logit contributions
chery+10.874
 Nether+9.720
hens+9.695
word+9.264
 Corn+8.907
Neg logit contributions
allo-9.733
ctica-9.714
 accuser-9.708
omorphic-9.312
bably-9.210
Token the
Token ID262
Feature activation-8.38e-01

Pos logit contributions
 Nether+14.307
chery+14.303
hens+14.017
 BT+13.375
word+13.333
Neg logit contributions
allo-13.654
 accuser-13.599
ctica-13.399
omorphic-13.159
gio-12.809
Token launch
Token ID4219
Feature activation-1.07e+00

Pos logit contributions
chery+9.676
 Nether+9.327
hens+9.202
word+8.493
 Corn+8.376
Neg logit contributions
allo-8.342
ctica-8.217
 accuser-8.036
omorphic-7.930
-7.791
Token seemed
Token ID3947
Feature activation-1.29e-01

Pos logit contributions
chery+11.030
hens+9.989
 Nether+9.777
word+9.413
 BT+9.144
Neg logit contributions
allo-11.354
 accuser-11.102
ctica-10.894
omorphic-10.824
bably-10.683
Token to
Token ID284
Feature activation-1.54e-01

Pos logit contributions
chery+1.531
hens+1.408
 Nether+1.384
word+1.288
iron+1.264
Neg logit contributions
allo-1.469
 accuser-1.434
ctica-1.396
gio-1.374
omorphic-1.359
Token love
Token ID1842
Feature activation-1.19e+00

Pos logit contributions
chery+1.862
hens+1.690
 Nether+1.686
word+1.562
iron+1.526
Neg logit contributions
allo-1.729
 accuser-1.694
ctica-1.655
gio-1.622
omorphic-1.607
Tokenms
Token ID907
Feature activation-5.06e-01

Pos logit contributions
chery+6.724
hens+6.351
 Nether+5.879
word+5.512
arks+5.029
Neg logit contributions
allo-13.521
 accuser-13.287
ctica-12.961
omorphic-12.734
lake-12.666
Token G
Token ID402
Feature activation-6.86e-01

Pos logit contributions
chery+5.039
 Nether+4.667
hens+4.609
word+4.175
 BT+4.107
Neg logit contributions
allo-6.273
 accuser-6.142
ctica-6.114
omorphic-5.971
gio-5.856
Token2
Token ID17
Feature activation+6.01e-02

Pos logit contributions
chery+5.444
hens+4.819
 Nether+4.644
word+4.424
iron+4.387
Neg logit contributions
 accuser-9.647
allo-9.580
ctica-9.390
bably-9.120
omorphic-9.076
TokenG
Token ID38
Feature activation-8.54e-01

Pos logit contributions
allo+0.471
 accuser+0.453
ctica+0.437
gio+0.429
omorphic+0.422
Neg logit contributions
chery-0.929
hens-0.869
 Nether-0.861
word-0.812
iron-0.799
Token AU
Token ID27548
Feature activation-7.60e-01

Pos logit contributions
chery+12.247
 Nether+12.054
hens+11.408
 BT+11.241
word+10.867
Neg logit contributions
allo-5.785
 accuser-5.511
gio-5.275
ctica-5.263
bably-5.174
Token Opt
Token ID13123
Feature activation-1.62e+00

Pos logit contributions
chery+4.913
 Nether+4.727
 BT+4.142
hens+4.133
word+3.751
Neg logit contributions
allo-11.609
ctica-11.397
 accuser-11.311
bably-11.304
omorphic-11.117
Tokenronics
Token ID20844
Feature activation-5.98e-01

Pos logit contributions
hens+10.868
chery+10.802
iron+9.227
hen+8.920
arks+8.508
Neg logit contributions
��-21.039
 accuser-20.665
 millenn-20.648
 proport-20.418
 nomine-20.399
Token AH
Token ID28159
Feature activation+2.44e-01

Pos logit contributions
chery+6.693
 Nether+6.382
hens+6.170
 BT+5.751
word+5.613
Neg logit contributions
allo-6.510
 accuser-6.371
ctica-6.240
gio-6.148
omorphic-6.105
TokenVA
Token ID11731
Feature activation-3.58e-01

Pos logit contributions
allo+2.434
 accuser+2.349
968+2.286
ctica+2.284
gio+2.271
Neg logit contributions
chery-3.194
hens-2.953
 Nether-2.943
word-2.704
iron-2.668
Token (
Token ID357
Feature activation-4.79e-01

Pos logit contributions
chery+3.715
 Nether+3.413
hens+3.402
word+3.038
iron+3.017
Neg logit contributions
allo-4.487
 accuser-4.358
ctica-4.333
gio-4.217
omorphic-4.213
TokenIPS
Token ID47643
Feature activation-2.91e-01

Pos logit contributions
chery+5.138
hens+4.702
 Nether+4.530
iron+4.203
word+4.177
Neg logit contributions
 accuser-5.804
allo-5.733
ctica-5.597
gio-5.468
 Entered-5.395
Token Los
Token ID5401
Feature activation-2.19e-01

Pos logit contributions
allo+4.638
 accuser+4.504
lake+4.390
gio+4.258
968+4.203
Neg logit contributions
chery-7.160
hens-6.486
 Nether-6.448
iron-6.071
word-6.007
Token Angeles
Token ID5652
Feature activation-3.45e-02

Pos logit contributions
chery+2.728
hens+2.539
 Nether+2.479
word+2.333
iron+2.275
Neg logit contributions
allo-2.307
 accuser-2.274
ctica-2.223
omorphic-2.187
gio-2.166
Token 1980
Token ID7169
Feature activation+2.03e-01

Pos logit contributions
chery+0.429
hens+0.396
 Nether+0.391
word+0.363
iron+0.356
Neg logit contributions
allo-0.374
 accuser-0.365
ctica-0.356
gio-0.350
omorphic-0.348
Token
Token ID1906
Feature activation+4.58e-01

Pos logit contributions
allo+2.108
 accuser+2.038
ctica+2.002
gio+1.978
968+1.961
Neg logit contributions
chery-2.573
hens-2.363
 Nether-2.341
word-2.174
iron-2.151
Token84
Token ID5705
Feature activation-9.30e-02

Pos logit contributions
allo+4.464
968+4.305
 accuser+4.273
ctica+4.150
gio+4.089
Neg logit contributions
chery-5.980
hens-5.622
 Nether-5.610
word-4.995
iron-4.915
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
chery+1.089
hens+0.997
 Nether+0.985
word+0.912
iron+0.889
Neg logit contributions
allo-1.078
 accuser-1.058
ctica-1.032
omorphic-1.013
gio-1.006
Token Copyright
Token ID15069
Feature activation-4.14e-01

Pos logit contributions
 Nether+7.622
chery+7.179
 Corn+5.693
word+5.551
 Gun+5.503
Neg logit contributions
 millenn-21.060
��-20.908
bably-20.721
 satell-20.620
 accuser-20.324
Token John
Token ID1757
Feature activation-5.51e-01

Pos logit contributions
chery+3.732
 Nether+3.534
hens+3.401
word+3.117
iron+2.881
Neg logit contributions
allo-5.631
 accuser-5.560
ctica-5.540
omorphic-5.489
bably-5.440
Token Brian
Token ID8403
Feature activation+1.03e-01

Pos logit contributions
chery+2.939
 Nether+2.569
hens+2.415
word+2.205
 BT+2.068
Neg logit contributions
allo-9.356
ctica-9.283
omorphic-9.210
 accuser-9.173
bably-8.943
Token King
Token ID2677
Feature activation-2.49e-01

Pos logit contributions
allo+1.109
 accuser+1.078
ctica+1.051
gio+1.036
omorphic+1.024
Neg logit contributions
chery-1.281
hens-1.181
 Nether-1.157
word-1.080
iron-1.062
Token
Token ID198
Feature activation-2.49e-01

Pos logit contributions
chery+2.859
hens+2.619
 Nether+2.589
word+2.415
iron+2.336
Neg logit contributions
allo-2.872
 accuser-2.828
ctica-2.732
omorphic-2.720
gio-2.703
Token
Token ID247
Feature activation+6.47e-01

Pos logit contributions
chery+0.847
hens+0.790
 Nether+0.782
word+0.739
iron+0.724
Neg logit contributions
allo-0.494
 accuser-0.484
ctica-0.470
omorphic-0.456
gio-0.455
Tokens
Token ID82
Feature activation-2.45e-01

Pos logit contributions
allo+4.606
 accuser+4.095
dri+4.093
968+4.056
lake+3.980
Neg logit contributions
chery-9.657
 Nether-9.297
hens-8.998
iron-8.438
word-8.370
Token state
Token ID1181
Feature activation+3.43e-01

Pos logit contributions
chery+3.431
hens+3.158
 Nether+3.134
word+2.958
iron+2.847
Neg logit contributions
allo-2.257
 accuser-2.183
ctica-2.134
gio-2.103
omorphic-2.061
Token (
Token ID357
Feature activation-1.33e+00

Pos logit contributions
allo+3.515
 accuser+3.402
ctica+3.285
968+3.284
gio+3.245
Neg logit contributions
chery-4.306
hens-4.004
 Nether-3.974
iron-3.671
word-3.651
Tokensuch
Token ID10508
Feature activation-9.78e-01

Pos logit contributions
chery+7.744
hens+6.825
iron+6.804
word+6.700
hen+6.601
Neg logit contributions
 millenn-20.271
 accuser-19.918
ctica-19.238
 proport-19.094
olics-19.019
Token as
Token ID355
Feature activation-1.75e+00

Pos logit contributions
chery+7.024
word+5.721
hens+5.590
 Nether+5.478
iron+4.895
Neg logit contributions
 millenn-14.451
allo-14.110
��-13.975
 accuser-13.715
itialized-13.679
Token through
Token ID832
Feature activation-7.52e-01

Pos logit contributions
chery+8.623
 Nether+6.670
word+5.948
hens+5.772
 Corn+4.833
Neg logit contributions
allo-23.583
ctica-23.435
bably-23.374
algia-23.288
-23.012
Token a
Token ID257
Feature activation-1.37e+00

Pos logit contributions
chery+5.988
 Nether+5.388
hens+5.210
word+4.778
 BT+4.465
Neg logit contributions
allo-10.701
 accuser-10.393
ctica-10.351
omorphic-10.171
bably-9.983
Token or
Token ID393
Feature activation-5.78e-01

Pos logit contributions
chery+11.497
 Nether+10.722
hens+9.803
word+9.451
 BT+8.925
Neg logit contributions
allo-15.456
ctica-15.136
 accuser-14.969
omorphic-14.485
bably-14.304
Token )
Token ID1267
Feature activation+3.82e-01

Pos logit contributions
chery+6.084
 Nether+5.410
hens+5.302
word+4.893
 Worm+4.571
Neg logit contributions
allo-7.095
 accuser-7.020
ctica-7.001
omorphic-6.674
gio-6.647
Token must
Token ID1276
Feature activation+2.78e-01

Pos logit contributions
allo+4.301
 accuser+4.146
ctica+4.056
lake+4.019
968+3.992
Neg logit contributions
chery-4.374
hens-4.049
 Nether-4.028
word-3.727
argon-3.638