TOP ACTIVATIONS
MAX = 3.963

the crazy Finnish crowd at ASUS R OG just over five
Strip Club - Chuck L idel - Nick Dip alo -
fuel . Exercise also improves insulin sensitivity . Cal
of the above by doing ABS OL UT ELY NOT H
, and this morning I clot he myself with Christ ,
s just totally uncomp romising . It
massive airflow for fast heat diss ipation while remaining whisper silent
helped stop Firefox users from defect ing to other browsers ,
. Decre ases insulin level . Increases
the tender merc ies of incomp ent fake Republican and Democratic
mainstream UK media and the diss ident Real News Network ,
joined seconds later by a deaf ening th ud from the
P ipes to enhance heat diss ipation . MSI has also
the same panel as the Asus . Both performed very similarly
public force to silence that diss ension ," he said .
philosophy of improvement that values defect prevention over defect detection .
comparison against the very popular Asus R OG Swift PG 278
players on why they had diss ented . For
their jobs , other face incomp rehens ion in their families
for CPUs and memory ( Intel VT - x , AMD

MIN ACTIVATIONS
MIN = -3.790

It should be noted that Bour dain isn t
Bryce Harper , Den ard Span , Jays on Wer th
from the Harvard - Smith sonian Center for Ast roph ysics
. ( 2002 ). A longitudinal study of bullying , dominance
ED MON TON Omar Kh adr has agreed to remain
. ( 2010 ). A longitudinal investigation of young adolescents
ively simple U sonian houses , a
some bad choices , Michael Vick has devoted time to speaking
from the Harvard - Smith sonian Center for Ast roph ysics
t say anything negative about Bour dain in public p
B SP - r uled Uttar Pradesh banned it and And
to fly back to Che chn ya on Tuesday night with
The truth is that Bour dain has always had a
recognition as his highly paid linem ates David De Cast ro
the Moon , fighting Darth Vader and wearing the face of
to sleep in a U sonian home , just 45 minutes
will never be celebrated by Bour dain in an episode of
Falling water , aka The Edgar J . Kau f mann
losing ground to rivals like Dom ino 's Pizza .
C rouch reports from Van Nu ys for the NBC 4

INTERVAL 2.025 - 3.963
CONTAINS 0.265%

cases of severe treatment of tourists have so far mostly involved
4 5 6 7 8 9 10 11 12 13 14
. It s hard to believe that of Arkansas
seven years old ," Florida Eagle Forum lobbyist Randy Osborne said
a single picture . Ranger 6 was definitely a very successful

INTERVAL 0.087 - 2.025
CONTAINS 40.757%

Lab arc a 's latest book appears to show him doing
Today there is a general consensus that equipment
of their daughter s distress . M
New York Times portrayed him as a man who has endured
cup qu inoa and 2 cups of water into a sauce

INTERVAL -1.852 - 0.087
CONTAINS 58.506%

input in compiling a list of possible Supreme Court nominees .
link provides a copy of the Excel work book I created
profession , by femin izing it in unacceptable ways .
the 2008 film Kung Fu Panda , is an anthrop omorph
couple : Del . Todd C . Gilbert ( R -

INTERVAL -3.790 - -1.852
CONTAINS 0.471%

politicians . Comed ian Kathy Griffin , who lost her CNN
READ : Cohen testimony alleges Trump knew Stone talked with WikiLeaks
4 . BET TY & VER ON ICA ( Ar chie
earn and make them feel rewarded at the end of the
division Hel fen oh ne Gren zen e . V .
Token the
Token ID262
Feature activation+2.41e-01

Pos logit contributions
vier+0.157
ashtra+0.155
amera+0.155
ersen+0.152
itably+0.150
Neg logit contributions
Reviewed-0.162
spot-0.152
worms-0.151
lication-0.149
lessness-0.146
Token crazy
Token ID7165
Feature activation+8.62e-01

Pos logit contributions
Reviewed+0.371
spot+0.348
worms+0.345
lication+0.340
lessness+0.334
Neg logit contributions
vier-0.385
ashtra-0.381
amera-0.380
ersen-0.373
itably-0.368
Token Finnish
Token ID26838
Feature activation+4.18e-01

Pos logit contributions
Reviewed+1.404
spot+1.321
worms+1.308
lication+1.289
lessness+1.271
Neg logit contributions
vier-1.302
ashtra-1.295
amera-1.282
ersen-1.257
itably-1.241
Token crowd
Token ID4315
Feature activation-3.28e-02

Pos logit contributions
Reviewed+0.695
spot+0.653
worms+0.647
lication+0.640
lessness+0.630
Neg logit contributions
vier-0.617
ashtra-0.612
amera-0.609
ersen-0.598
itably-0.586
Token at
Token ID379
Feature activation+5.47e-01

Pos logit contributions
vier+0.055
ashtra+0.054
amera+0.054
ersen+0.053
orah+0.052
Neg logit contributions
Reviewed-0.048
spot-0.044
worms-0.044
lication-0.044
lessness-0.042
Token ASUS
Token ID38100
Feature activation+3.96e+00

Pos logit contributions
Reviewed+0.849
spot+0.797
worms+0.790
lication+0.777
lessness+0.765
Neg logit contributions
vier-0.867
ashtra-0.860
amera-0.855
ersen-0.839
itably-0.828
Token R
Token ID371
Feature activation+6.82e-01

Pos logit contributions
Reviewed+5.694
spot+5.268
worms+5.241
lication+5.151
lessness+5.076
Neg logit contributions
vier-6.542
ashtra-6.459
itably-6.305
amera-6.274
ersen-6.202
TokenOG
Token ID7730
Feature activation-1.26e-01

Pos logit contributions
Reviewed+1.073
spot+1.006
worms+0.991
lication+0.977
lessness+0.964
Neg logit contributions
ashtra-1.065
vier-1.060
amera-1.044
ersen-1.031
itably-1.017
Token just
Token ID655
Feature activation+1.85e-02

Pos logit contributions
vier+0.196
amera+0.195
ashtra+0.194
ersen+0.191
itably+0.188
Neg logit contributions
Reviewed-0.198
worms-0.187
spot-0.186
lication-0.183
lessness-0.180
Token over
Token ID625
Feature activation+7.84e-01

Pos logit contributions
Reviewed+0.029
spot+0.027
worms+0.027
lication+0.027
lessness+0.026
Neg logit contributions
vier-0.029
ashtra-0.029
amera-0.029
ersen-0.028
itably-0.028
Token five
Token ID1936
Feature activation+5.77e-01

Pos logit contributions
Reviewed+1.243
spot+1.167
worms+1.154
lication+1.138
lessness+1.120
Neg logit contributions
vier-1.220
ashtra-1.212
amera-1.202
ersen-1.179
itably-1.164
Token Strip
Token ID18508
Feature activation-9.08e-01

Pos logit contributions
vier+1.949
ashtra+1.924
amera+1.915
ersen+1.864
orah+1.853
Neg logit contributions
Reviewed-1.899
worms-1.757
spot-1.751
lication-1.726
lessness-1.706
Token Club
Token ID6289
Feature activation+1.96e-01

Pos logit contributions
vier+1.427
ashtra+1.394
amera+1.389
ersen+1.352
orah+1.351
Neg logit contributions
Reviewed-1.431
spot-1.319
worms-1.318
lication-1.298
lessness-1.277
Token-
Token ID12
Feature activation+1.59e-01

Pos logit contributions
Reviewed+0.267
spot+0.248
worms+0.245
lication+0.241
lessness+0.237
Neg logit contributions
vier-0.348
ashtra-0.346
amera-0.343
ersen-0.337
itably-0.334
TokenChuck
Token ID44324
Feature activation-7.00e-01

Pos logit contributions
Reviewed+0.233
spot+0.218
worms+0.218
lication+0.215
lessness+0.211
Neg logit contributions
vier-0.265
ashtra-0.261
amera-0.259
ersen-0.253
orah-0.251
Token L
Token ID406
Feature activation+1.05e+00

Pos logit contributions
vier+1.186
amera+1.174
ashtra+1.174
ersen+1.154
orah+1.137
Neg logit contributions
Reviewed-1.009
spot-0.935
worms-0.923
lication-0.910
lessness-0.892
Tokenidel
Token ID5943
Feature activation+3.90e+00

Pos logit contributions
Reviewed+1.858
spot+1.758
worms+1.741
lication+1.717
lessness+1.696
Neg logit contributions
vier-1.446
ashtra-1.437
amera-1.421
ersen-1.389
itably-1.371
Token-
Token ID12
Feature activation+1.39e-01

Pos logit contributions
Reviewed+5.864
spot+5.498
worms+5.497
lication+5.334
lessness+5.299
Neg logit contributions
ashtra-6.202
vier-6.059
amera-6.022
itably-5.909
ersen-5.878
TokenNick
Token ID23609
Feature activation+6.90e-02

Pos logit contributions
Reviewed+0.205
spot+0.191
worms+0.191
lication+0.188
lessness+0.185
Neg logit contributions
vier-0.231
ashtra-0.228
amera-0.227
ersen-0.221
orah-0.220
Token Dip
Token ID43945
Feature activation+3.41e-01

Pos logit contributions
Reviewed+0.106
spot+0.099
worms+0.098
lication+0.096
lessness+0.095
Neg logit contributions
vier-0.111
ashtra-0.110
amera-0.109
ersen-0.107
itably-0.106
Tokenalo
Token ID7335
Feature activation-2.22e-01

Pos logit contributions
Reviewed+0.473
spot+0.437
worms+0.430
lication+0.423
lessness+0.417
Neg logit contributions
vier-0.600
amera-0.594
ashtra-0.594
ersen-0.586
orah-0.579
Token-
Token ID12
Feature activation+1.10e-01

Pos logit contributions
vier+0.357
amera+0.356
ashtra+0.354
ersen+0.348
orah+0.346
Neg logit contributions
Reviewed-0.337
spot-0.315
worms-0.311
lication-0.307
lessness-0.302
Token fuel
Token ID5252
Feature activation+6.47e-01

Pos logit contributions
vier+0.090
ashtra+0.089
amera+0.088
ersen+0.087
itably+0.086
Neg logit contributions
Reviewed-0.100
spot-0.094
worms-0.094
lication-0.092
lessness-0.090
Token.
Token ID13
Feature activation-3.14e-01

Pos logit contributions
Reviewed+0.933
spot+0.870
worms+0.861
lication+0.845
lessness+0.828
Neg logit contributions
vier-1.098
ashtra-1.093
amera-1.082
ersen-1.065
itably-1.055
Token Exercise
Token ID32900
Feature activation-3.11e-01

Pos logit contributions
vier+0.457
ashtra+0.450
amera+0.442
itably+0.440
ersen+0.439
Neg logit contributions
Reviewed-0.524
spot-0.495
worms-0.493
lication-0.486
lessness-0.475
Token also
Token ID635
Feature activation-1.42e-01

Pos logit contributions
vier+0.515
ashtra+0.507
amera+0.500
ersen+0.498
itably+0.497
Neg logit contributions
Reviewed-0.465
spot-0.429
worms-0.420
lication-0.418
lessness-0.401
Token improves
Token ID19575
Feature activation-2.18e-01

Pos logit contributions
vier+0.258
ashtra+0.255
amera+0.252
ersen+0.251
itably+0.250
Neg logit contributions
Reviewed-0.189
spot-0.171
lication-0.169
worms-0.168
lessness-0.161
Token insulin
Token ID14588
Feature activation+3.79e+00

Pos logit contributions
vier+0.338
ashtra+0.334
amera+0.330
ersen+0.326
itably+0.324
Neg logit contributions
Reviewed-0.346
spot-0.323
worms-0.321
lication-0.317
lessness-0.310
Token sensitivity
Token ID14233
Feature activation+4.85e-01

Pos logit contributions
Reviewed+3.919
spot+3.623
worms+3.557
lessness+3.454
lication+3.407
Neg logit contributions
ashtra-7.721
vier-7.702
amera-7.659
ersen-7.609
itably-7.462
Token.
Token ID13
Feature activation-5.56e-03

Pos logit contributions
Reviewed+0.708
spot+0.661
worms+0.654
lication+0.643
lessness+0.629
Neg logit contributions
vier-0.814
ashtra-0.810
amera-0.802
ersen-0.788
itably-0.782
Token
Token ID198
Feature activation-4.05e-02

Pos logit contributions
vier+0.008
ashtra+0.008
amera+0.008
ersen+0.008
itably+0.008
Neg logit contributions
Reviewed-0.009
spot-0.009
worms-0.009
lication-0.008
lessness-0.008
Token
Token ID198
Feature activation+1.14e-01

Pos logit contributions
vier+0.062
ashtra+0.061
amera+0.060
ersen+0.060
itably+0.059
Neg logit contributions
Reviewed-0.065
spot-0.061
worms-0.061
lication-0.060
lessness-0.058
TokenCal
Token ID9771
Feature activation+9.37e-01

Pos logit contributions
Reviewed+0.164
spot+0.155
worms+0.154
lication+0.152
lessness+0.149
Neg logit contributions
vier-0.189
ashtra-0.187
amera-0.185
itably-0.182
ersen-0.182
Token of
Token ID286
Feature activation-1.81e-01

Pos logit contributions
vier+0.688
ashtra+0.683
amera+0.677
ersen+0.667
itably+0.658
Neg logit contributions
Reviewed-0.768
spot-0.722
worms-0.714
lication-0.707
lessness-0.691
Token the
Token ID262
Feature activation+1.07e-01

Pos logit contributions
vier+0.211
ashtra+0.209
amera+0.207
ersen+0.202
itably+0.199
Neg logit contributions
Reviewed-0.357
spot-0.339
worms-0.336
lication-0.333
lessness-0.328
Token above
Token ID2029
Feature activation+4.41e-01

Pos logit contributions
Reviewed+0.153
spot+0.142
worms+0.141
lication+0.139
lessness+0.136
Neg logit contributions
vier-0.184
ashtra-0.182
amera-0.181
ersen-0.179
itably-0.177
Token by
Token ID416
Feature activation+1.51e-01

Pos logit contributions
Reviewed+0.694
spot+0.651
worms+0.644
lication+0.636
lessness+0.624
Neg logit contributions
vier-0.690
ashtra-0.685
amera-0.679
ersen-0.667
itably-0.660
Token doing
Token ID1804
Feature activation+4.86e-01

Pos logit contributions
Reviewed+0.233
spot+0.216
worms+0.215
lication+0.212
lessness+0.207
Neg logit contributions
vier-0.243
ashtra-0.240
amera-0.239
ersen-0.235
itably-0.233
Token ABS
Token ID29950
Feature activation+3.78e+00

Pos logit contributions
Reviewed+0.757
spot+0.703
worms+0.699
lication+0.689
lessness+0.673
Neg logit contributions
vier-0.776
ashtra-0.767
amera-0.762
ersen-0.749
itably-0.742
TokenOL
Token ID3535
Feature activation+9.32e-01

Pos logit contributions
Reviewed+4.529
spot+4.127
worms+4.045
lessness+4.014
lication+4.014
Neg logit contributions
vier-7.211
ashtra-7.175
amera-7.041
ersen-6.978
orah-6.906
TokenUT
Token ID3843
Feature activation+8.62e-01

Pos logit contributions
Reviewed+1.413
spot+1.325
worms+1.311
lication+1.290
lessness+1.270
Neg logit contributions
vier-1.512
ashtra-1.505
amera-1.491
ersen-1.462
itably-1.447
TokenELY
Token ID30943
Feature activation+5.82e-01

Pos logit contributions
Reviewed+1.386
spot+1.303
worms+1.290
lication+1.271
lessness+1.252
Neg logit contributions
vier-1.318
ashtra-1.311
amera-1.299
ersen-1.273
itably-1.259
Token NOT
Token ID5626
Feature activation-2.84e-01

Pos logit contributions
Reviewed+0.887
spot+0.827
worms+0.821
lication+0.810
lessness+0.790
Neg logit contributions
vier-0.944
ashtra-0.938
amera-0.930
ersen-0.910
itably-0.904
TokenH
Token ID39
Feature activation-2.71e-01

Pos logit contributions
vier+0.456
ashtra+0.449
amera+0.445
ersen+0.438
itably+0.435
Neg logit contributions
Reviewed-0.439
spot-0.408
worms-0.406
lication-0.400
lessness-0.389
Token,
Token ID11
Feature activation+1.31e-01

Pos logit contributions
Reviewed+0.455
spot+0.425
worms+0.420
lication+0.415
lessness+0.406
Neg logit contributions
vier-0.484
ashtra-0.481
amera-0.477
ersen-0.468
itably-0.462
Token and
Token ID290
Feature activation+9.58e-02

Pos logit contributions
Reviewed+0.217
spot+0.205
worms+0.202
lication+0.200
lessness+0.197
Neg logit contributions
vier-0.194
ashtra-0.193
amera-0.191
ersen-0.187
itably-0.185
Token this
Token ID428
Feature activation-1.49e-01

Pos logit contributions
Reviewed+0.154
spot+0.144
worms+0.143
lication+0.141
lessness+0.138
Neg logit contributions
vier-0.147
ashtra-0.146
amera-0.145
ersen-0.143
itably-0.140
Token morning
Token ID3329
Feature activation+8.99e-01

Pos logit contributions
vier+0.243
ashtra+0.242
amera+0.241
ersen+0.236
itably+0.234
Neg logit contributions
Reviewed-0.225
spot-0.208
worms-0.208
lication-0.202
lessness-0.199
Token I
Token ID314
Feature activation-2.15e-01

Pos logit contributions
Reviewed+1.398
spot+1.312
worms+1.298
lication+1.277
lessness+1.259
Neg logit contributions
vier-1.422
ashtra-1.415
amera-1.401
ersen-1.375
itably-1.359
Token clot
Token ID5148
Feature activation+3.76e+00

Pos logit contributions
vier+0.334
ashtra+0.333
amera+0.330
ersen+0.325
orah+0.320
Neg logit contributions
Reviewed-0.338
spot-0.316
worms-0.315
lication-0.308
lessness-0.302
Tokenhe
Token ID258
Feature activation+7.14e-01

Pos logit contributions
Reviewed+3.448
spot+3.064
lessness+3.041
worms+3.030
lication+3.011
Neg logit contributions
vier-8.230
ashtra-8.110
ersen-7.853
amera-7.812
orah-7.713
Token myself
Token ID3589
Feature activation+7.55e-01

Pos logit contributions
Reviewed+1.099
spot+1.030
worms+1.019
lication+1.003
lessness+0.987
Neg logit contributions
vier-1.141
ashtra-1.136
amera-1.126
ersen-1.104
itably-1.092
Token with
Token ID351
Feature activation+8.52e-02

Pos logit contributions
Reviewed+1.131
spot+1.059
worms+1.046
lication+1.031
lessness+1.018
Neg logit contributions
vier-1.236
ashtra-1.227
amera-1.217
ersen-1.198
itably-1.182
Token Christ
Token ID1951
Feature activation+2.77e-01

Pos logit contributions
Reviewed+0.138
spot+0.130
worms+0.128
lication+0.126
lessness+0.124
Neg logit contributions
vier-0.130
ashtra-0.129
amera-0.128
ersen-0.125
itably-0.124
Token,
Token ID11
Feature activation+1.79e-01

Pos logit contributions
Reviewed+0.411
spot+0.382
worms+0.378
lication+0.371
lessness+0.363
Neg logit contributions
vier-0.458
ashtra-0.458
amera-0.454
ersen-0.445
orah-0.441
Token
Token ID447
Feature activation-2.49e-01

Pos logit contributions
vier+0.446
ashtra+0.443
amera+0.442
ersen+0.432
orah+0.429
Neg logit contributions
Reviewed-0.379
spot-0.350
worms-0.347
lication-0.344
lessness-0.335
Token
Token ID247
Feature activation-3.95e-01

Pos logit contributions
vier+0.405
ashtra+0.402
amera+0.400
ersen+0.394
orah+0.390
Neg logit contributions
Reviewed-0.373
worms-0.348
spot-0.348
lication-0.342
lessness-0.337
Tokens
Token ID82
Feature activation+2.68e-01

Pos logit contributions
vier+0.650
ashtra+0.645
amera+0.641
ersen+0.634
itably+0.626
Neg logit contributions
Reviewed-0.582
spot-0.543
worms-0.537
lication-0.534
lessness-0.521
Token just
Token ID655
Feature activation+1.53e-02

Pos logit contributions
Reviewed+0.415
spot+0.388
worms+0.384
lication+0.380
lessness+0.373
Neg logit contributions
vier-0.423
ashtra-0.423
amera-0.419
ersen-0.410
itably-0.407
Token totally
Token ID6635
Feature activation+3.16e-01

Pos logit contributions
Reviewed+0.024
spot+0.022
worms+0.022
lication+0.022
lessness+0.022
Neg logit contributions
ashtra-0.024
vier-0.024
amera-0.024
ersen-0.023
itably-0.023
Token uncomp
Token ID34318
Feature activation+3.73e+00

Pos logit contributions
Reviewed+0.468
spot+0.435
worms+0.431
lication+0.428
lessness+0.418
Neg logit contributions
vier-0.520
ashtra-0.519
amera-0.513
ersen-0.505
itably-0.500
Tokenromising
Token ID47112
Feature activation+4.08e-01

Pos logit contributions
Reviewed+4.949
lication+4.587
spot+4.537
worms+4.521
lessness+4.420
Neg logit contributions
vier-6.654
ashtra-6.635
amera-6.395
ersen-6.268
orah-6.194
Token.
Token ID13
Feature activation-1.26e-01

Pos logit contributions
Reviewed+0.582
spot+0.543
worms+0.538
lication+0.529
lessness+0.518
Neg logit contributions
vier-0.696
ashtra-0.693
amera-0.687
ersen-0.674
itably-0.668
Token It
Token ID632
Feature activation-3.56e-01

Pos logit contributions
vier+0.191
ashtra+0.189
amera+0.188
ersen+0.184
itably+0.182
Neg logit contributions
Reviewed-0.203
spot-0.191
worms-0.191
lication-0.188
lessness-0.184
Token
Token ID447
Feature activation-1.25e-02

Pos logit contributions
vier+0.604
ashtra+0.601
amera+0.598
ersen+0.585
itably+0.579
Neg logit contributions
Reviewed-0.511
spot-0.473
worms-0.470
lication-0.464
lessness-0.454
Token
Token ID247
Feature activation-3.75e-01

Pos logit contributions
vier+0.020
ashtra+0.020
amera+0.019
ersen+0.019
itably+0.019
Neg logit contributions
Reviewed-0.019
spot-0.018
worms-0.018
lication-0.018
lessness-0.018
Token massive
Token ID4858
Feature activation+1.38e-01

Pos logit contributions
Reviewed+0.284
spot+0.265
worms+0.263
lication+0.260
lessness+0.254
Neg logit contributions
vier-0.278
ashtra-0.277
amera-0.273
ersen-0.268
itably-0.268
Token airflow
Token ID45771
Feature activation+6.67e-01

Pos logit contributions
Reviewed+0.218
spot+0.202
worms+0.200
lication+0.199
lessness+0.193
Neg logit contributions
ashtra-0.216
vier-0.215
amera-0.212
ersen-0.210
itably-0.208
Token for
Token ID329
Feature activation+2.01e-01

Pos logit contributions
Reviewed+1.017
spot+0.945
worms+0.934
lication+0.927
lessness+0.901
Neg logit contributions
vier-1.080
ashtra-1.073
amera-1.062
ersen-1.049
itably-1.036
Token fast
Token ID3049
Feature activation+9.79e-01

Pos logit contributions
Reviewed+0.311
spot+0.292
worms+0.289
lication+0.285
lessness+0.279
Neg logit contributions
vier-0.321
ashtra-0.319
amera-0.316
ersen-0.311
itably-0.307
Token heat
Token ID4894
Feature activation+1.34e+00

Pos logit contributions
Reviewed+1.447
spot+1.347
worms+1.335
lication+1.316
lessness+1.294
Neg logit contributions
vier-1.622
ashtra-1.619
amera-1.602
ersen-1.581
itably-1.559
Token diss
Token ID6249
Feature activation+3.73e+00

Pos logit contributions
Reviewed+1.678
spot+1.533
worms+1.519
lication+1.499
lessness+1.454
Neg logit contributions
vier-2.516
ashtra-2.511
amera-2.482
ersen-2.458
itably-2.431
Tokenipation
Token ID25857
Feature activation+1.18e+00

Pos logit contributions
Reviewed+2.964
spot+2.811
lication+2.756
worms+2.689
lessness+2.654
Neg logit contributions
vier-8.376
ashtra-8.353
amera-8.295
ersen-8.075
orah-8.065
Token while
Token ID981
Feature activation+4.45e-01

Pos logit contributions
Reviewed+1.700
spot+1.586
worms+1.568
lication+1.542
lessness+1.516
Neg logit contributions
vier-2.015
ashtra-2.006
amera-1.988
ersen-1.955
itably-1.933
Token remaining
Token ID5637
Feature activation+9.01e-01

Pos logit contributions
Reviewed+0.711
spot+0.665
worms+0.660
lication+0.651
lessness+0.637
Neg logit contributions
vier-0.685
ashtra-0.682
amera-0.677
ersen-0.665
itably-0.657
Token whisper
Token ID31992
Feature activation-8.94e-01

Pos logit contributions
Reviewed+1.456
spot+1.361
worms+1.354
lication+1.335
lessness+1.307
Neg logit contributions
vier-1.375
ashtra-1.372
amera-1.356
ersen-1.330
itably-1.319
Token silent
Token ID10574
Feature activation+5.71e-01

Pos logit contributions
vier+1.174
ashtra+1.153
amera+1.141
itably+1.117
ersen+1.115
Neg logit contributions
Reviewed-1.639
lication-1.535
spot-1.533
worms-1.528
lessness-1.467
Token helped
Token ID4193
Feature activation+3.47e-02

Pos logit contributions
vier+0.527
ashtra+0.521
amera+0.519
ersen+0.511
orah+0.504
Neg logit contributions
Reviewed-0.577
spot-0.540
worms-0.534
lication-0.532
lessness-0.517
Token stop
Token ID2245
Feature activation+7.47e-01

Pos logit contributions
Reviewed+0.053
spot+0.049
worms+0.049
lication+0.049
lessness+0.047
Neg logit contributions
vier-0.056
ashtra-0.055
amera-0.055
ersen-0.054
itably-0.054
Token Firefox
Token ID16802
Feature activation+9.88e-01

Pos logit contributions
Reviewed+1.201
spot+1.117
worms+1.111
lication+1.096
lessness+1.074
Neg logit contributions
vier-1.149
ashtra-1.139
amera-1.129
ersen-1.110
itably-1.095
Token users
Token ID2985
Feature activation-2.21e-01

Pos logit contributions
Reviewed+1.436
spot+1.345
worms+1.331
lication+1.302
lessness+1.288
Neg logit contributions
vier-1.664
ashtra-1.659
amera-1.642
ersen-1.611
itably-1.595
Token from
Token ID422
Feature activation+1.06e+00

Pos logit contributions
vier+0.358
ashtra+0.354
amera+0.350
ersen+0.345
orah+0.342
Neg logit contributions
Reviewed-0.338
spot-0.316
worms-0.312
lication-0.309
lessness-0.301
Token defect
Token ID11855
Feature activation+3.70e+00

Pos logit contributions
Reviewed+1.664
spot+1.566
worms+1.548
lication+1.522
lessness+1.505
Neg logit contributions
vier-1.658
ashtra-1.652
amera-1.636
ersen-1.603
itably-1.584
Tokening
Token ID278
Feature activation+7.87e-01

Pos logit contributions
Reviewed+2.890
spot+2.571
worms+2.462
lessness+2.403
lication+2.316
Neg logit contributions
ashtra-8.598
vier-8.439
amera-8.401
ersen-8.338
itably-8.207
Token to
Token ID284
Feature activation+2.07e-01

Pos logit contributions
Reviewed+1.223
spot+1.148
worms+1.136
lication+1.121
lessness+1.098
Neg logit contributions
vier-1.251
ashtra-1.238
amera-1.231
ersen-1.206
itably-1.194
Token other
Token ID584
Feature activation+3.88e-01

Pos logit contributions
Reviewed+0.329
spot+0.304
worms+0.302
lication+0.300
lessness+0.291
Neg logit contributions
vier-0.325
ashtra-0.318
amera-0.317
ersen-0.312
orah-0.310
Token browsers
Token ID22616
Feature activation-3.58e-02

Pos logit contributions
Reviewed+0.665
spot+0.624
worms+0.619
lication+0.611
lessness+0.600
Neg logit contributions
vier-0.557
ashtra-0.544
amera-0.542
ersen-0.534
itably-0.527
Token,
Token ID11
Feature activation+1.77e-02

Pos logit contributions
vier+0.060
ashtra+0.059
amera+0.058
ersen+0.057
itably+0.057
Neg logit contributions
Reviewed-0.053
spot-0.049
worms-0.049
lication-0.048
lessness-0.047
Token.
Token ID13
Feature activation+2.95e-01

Pos logit contributions
Reviewed+3.398
spot+3.184
worms+3.148
lessness+3.075
lication+3.035
Neg logit contributions
vier-4.457
ashtra-4.441
amera-4.416
ersen-4.375
orah-4.271
Token
Token ID198
Feature activation+3.03e-01

Pos logit contributions
Reviewed+0.478
spot+0.450
worms+0.446
lication+0.442
lessness+0.432
Neg logit contributions
vier-0.446
ashtra-0.439
amera-0.435
ersen-0.430
itably-0.429
Token
Token ID198
Feature activation+3.34e-01

Pos logit contributions
Reviewed+0.484
spot+0.455
worms+0.452
lication+0.446
lessness+0.436
Neg logit contributions
vier-0.467
ashtra-0.461
amera-0.456
ersen-0.450
itably-0.445
TokenDecre
Token ID43198
Feature activation+1.27e+00

Pos logit contributions
Reviewed+0.466
spot+0.437
worms+0.432
lication+0.429
lessness+0.417
Neg logit contributions
vier-0.575
ashtra-0.567
amera-0.563
itably-0.555
ersen-0.554
Tokenases
Token ID1386
Feature activation+1.38e-01

Pos logit contributions
Reviewed+2.565
spot+2.447
worms+2.439
lication+2.409
lessness+2.381
Neg logit contributions
ashtra-1.400
vier-1.396
amera-1.367
ersen-1.334
itably-1.312
Token insulin
Token ID14588
Feature activation+3.60e+00

Pos logit contributions
Reviewed+0.211
spot+0.197
worms+0.195
lication+0.193
lessness+0.188
Neg logit contributions
vier-0.220
ashtra-0.219
amera-0.216
ersen-0.213
itably-0.212
Token level
Token ID1241
Feature activation-9.10e-02

Pos logit contributions
Reviewed+3.644
worms+3.373
spot+3.362
lessness+3.242
lication+3.166
Neg logit contributions
ashtra-7.421
vier-7.417
amera-7.353
ersen-7.285
itably-7.156
Token.
Token ID13
Feature activation+3.88e-01

Pos logit contributions
vier+0.152
ashtra+0.151
amera+0.149
itably+0.147
ersen+0.147
Neg logit contributions
Reviewed-0.133
spot-0.124
worms-0.121
lication-0.121
lessness-0.116
Token
Token ID198
Feature activation+4.07e-01

Pos logit contributions
Reviewed+0.587
spot+0.550
worms+0.545
lication+0.534
lessness+0.529
Neg logit contributions
vier-0.631
ashtra-0.630
amera-0.625
ersen-0.610
orah-0.600
Token
Token ID198
Feature activation+4.26e-01

Pos logit contributions
Reviewed+0.644
spot+0.605
worms+0.600
lication+0.592
lessness+0.580
Neg logit contributions
vier-0.633
ashtra-0.627
amera-0.621
ersen-0.611
itably-0.604
TokenIncreases
Token ID28544
Feature activation-1.08e-01

Pos logit contributions
Reviewed+0.564
spot+0.527
worms+0.522
lication+0.515
lessness+0.502
Neg logit contributions
vier-0.765
ashtra-0.755
amera-0.750
ersen-0.738
itably-0.738
Token the
Token ID262
Feature activation+1.07e-01

Pos logit contributions
Reviewed+0.267
spot+0.250
worms+0.248
lication+0.245
lessness+0.240
Neg logit contributions
vier-0.266
ashtra-0.265
amera-0.261
ersen-0.257
itably-0.255
Token tender
Token ID15403
Feature activation+8.71e-01

Pos logit contributions
Reviewed+0.166
spot+0.155
worms+0.153
lication+0.152
lessness+0.149
Neg logit contributions
vier-0.170
ashtra-0.170
amera-0.168
ersen-0.165
itably-0.164
Token merc
Token ID11991
Feature activation-5.26e-01

Pos logit contributions
Reviewed+1.283
spot+1.197
worms+1.183
lication+1.166
lessness+1.147
Neg logit contributions
vier-1.449
ashtra-1.443
amera-1.429
ersen-1.404
itably-1.389
Tokenies
Token ID444
Feature activation-4.80e-01

Pos logit contributions
vier+1.055
amera+1.047
ashtra+1.047
ersen+1.036
itably+1.020
Neg logit contributions
Reviewed-0.589
spot-0.534
worms-0.529
lication-0.519
lessness-0.503
Token of
Token ID286
Feature activation-7.92e-02

Pos logit contributions
vier+0.770
ashtra+0.766
amera+0.760
ersen+0.743
itably+0.735
Neg logit contributions
Reviewed-0.737
spot-0.692
worms-0.685
lication-0.674
lessness-0.667
Token incomp
Token ID13352
Feature activation+3.55e+00

Pos logit contributions
vier+0.117
ashtra+0.116
amera+0.115
ersen+0.112
itably+0.111
Neg logit contributions
Reviewed-0.132
spot-0.124
worms-0.123
lication-0.121
lessness-0.119
Tokenent
Token ID298
Feature activation-9.64e-01

Pos logit contributions
Reviewed+3.787
lication+3.672
spot+3.671
lessness+3.528
worms+3.495
Neg logit contributions
ashtra-7.083
vier-7.016
amera-6.870
ersen-6.681
orah-6.664
Token fake
Token ID8390
Feature activation-1.25e+00

Pos logit contributions
vier+1.489
ashtra+1.472
amera+1.467
ersen+1.438
itably+1.424
Neg logit contributions
Reviewed-1.534
spot-1.442
worms-1.428
lication-1.406
lessness-1.378
Token Republican
Token ID3415
Feature activation-1.06e-01

Pos logit contributions
vier+1.991
amera+1.957
ashtra+1.938
ersen+1.908
itably+1.899
Neg logit contributions
Reviewed-1.922
spot-1.786
lication-1.774
worms-1.742
lessness-1.711
Token and
Token ID290
Feature activation+3.38e-01

Pos logit contributions
vier+0.172
amera+0.169
ashtra+0.169
ersen+0.165
itably+0.163
Neg logit contributions
Reviewed-0.159
spot-0.149
lication-0.147
worms-0.146
lessness-0.142
Token Democratic
Token ID4390
Feature activation+1.95e-01

Pos logit contributions
Reviewed+0.517
spot+0.483
worms+0.479
lication+0.473
lessness+0.462
Neg logit contributions
vier-0.547
ashtra-0.542
amera-0.538
ersen-0.530
itably-0.523
Token mainstream
Token ID8661
Feature activation-7.72e-01

Pos logit contributions
vier+0.029
ashtra+0.029
amera+0.029
ersen+0.028
itably+0.028
Neg logit contributions
Reviewed-0.031
spot-0.029
worms-0.029
lication-0.028
lessness-0.028
Token UK
Token ID3482
Feature activation-5.34e-01

Pos logit contributions
vier+1.238
ashtra+1.213
amera+1.197
ersen+1.191
itably+1.172
Neg logit contributions
Reviewed-1.184
spot-1.106
lication-1.098
worms-1.095
lessness-1.048
Token media
Token ID2056
Feature activation-9.54e-01

Pos logit contributions
vier+0.896
ashtra+0.882
amera+0.871
ersen+0.864
itably+0.851
Neg logit contributions
Reviewed-0.779
spot-0.727
worms-0.723
lication-0.719
lessness-0.693
Token and
Token ID290
Feature activation+3.61e-01

Pos logit contributions
vier+1.550
ashtra+1.525
amera+1.506
ersen+1.485
itably+1.473
Neg logit contributions
Reviewed-1.439
spot-1.336
worms-1.331
lication-1.326
lessness-1.284
Token the
Token ID262
Feature activation+2.73e-01

Pos logit contributions
Reviewed+0.622
spot+0.587
worms+0.581
lication+0.574
lessness+0.566
Neg logit contributions
vier-0.511
ashtra-0.508
amera-0.503
ersen-0.492
itably-0.486
Token diss
Token ID6249
Feature activation+3.53e+00

Pos logit contributions
Reviewed+0.436
spot+0.409
worms+0.405
lication+0.401
lessness+0.392
Neg logit contributions
vier-0.424
ashtra-0.419
amera-0.416
ersen-0.409
itably-0.405
Tokenident
Token ID738
Feature activation+5.42e-01

Pos logit contributions
Reviewed+4.254
spot+3.967
worms+3.810
lication+3.794
lessness+3.748
Neg logit contributions
ashtra-6.657
vier-6.601
amera-6.579
ersen-6.448
itably-6.434
Token Real
Token ID6416
Feature activation+4.10e-01

Pos logit contributions
Reviewed+0.845
spot+0.794
worms+0.786
lication+0.780
lessness+0.757
Neg logit contributions
vier-0.861
ashtra-0.854
amera-0.845
ersen-0.830
itably-0.823
Token News
Token ID3000
Feature activation+5.27e-01

Pos logit contributions
Reviewed+0.660
spot+0.621
worms+0.614
lication+0.605
lessness+0.597
Neg logit contributions
vier-0.626
ashtra-0.623
amera-0.617
ersen-0.605
itably-0.598
Token Network
Token ID7311
Feature activation-1.67e-01

Pos logit contributions
Reviewed+0.812
spot+0.759
worms+0.751
lication+0.744
lessness+0.728
Neg logit contributions
vier-0.846
ashtra-0.837
amera-0.830
ersen-0.814
itably-0.804
Token,
Token ID11
Feature activation+2.10e-01

Pos logit contributions
vier+0.277
ashtra+0.273
amera+0.271
ersen+0.266
itably+0.263
Neg logit contributions
Reviewed-0.248
spot-0.230
worms-0.229
lication-0.226
lessness-0.221
Token joined
Token ID5399
Feature activation+7.88e-01

Pos logit contributions
Reviewed+0.512
spot+0.480
worms+0.475
lication+0.467
lessness+0.461
Neg logit contributions
vier-0.519
ashtra-0.518
amera-0.513
ersen-0.504
itably-0.497
Token seconds
Token ID4201
Feature activation-4.11e-02

Pos logit contributions
Reviewed+1.348
spot+1.271
worms+1.257
lication+1.241
lessness+1.229
Neg logit contributions
vier-1.126
ashtra-1.118
amera-1.106
ersen-1.082
itably-1.067
Token later
Token ID1568
Feature activation+4.16e-01

Pos logit contributions
vier+0.063
ashtra+0.063
amera+0.062
ersen+0.061
itably+0.060
Neg logit contributions
Reviewed-0.066
spot-0.062
worms-0.061
lication-0.060
lessness-0.059
Token by
Token ID416
Feature activation+3.26e-01

Pos logit contributions
Reviewed+0.737
spot+0.697
worms+0.690
lication+0.681
lessness+0.672
Neg logit contributions
vier-0.569
ashtra-0.566
amera-0.560
ersen-0.548
itably-0.540
Token a
Token ID257
Feature activation+3.98e-01

Pos logit contributions
Reviewed+0.530
spot+0.497
worms+0.492
lication+0.484
lessness+0.477
Neg logit contributions
vier-0.495
ashtra-0.493
amera-0.488
ersen-0.478
itably-0.472
Token deaf
Token ID21815
Feature activation+3.50e+00

Pos logit contributions
Reviewed+0.585
spot+0.542
worms+0.539
lication+0.528
lessness+0.518
Neg logit contributions
vier-0.668
ashtra-0.665
amera-0.659
ersen-0.650
itably-0.641
Tokenening
Token ID3101
Feature activation+3.35e-01

Pos logit contributions
Reviewed+1.062
spot+0.884
worms+0.754
lication+0.700
lessness+0.667
Neg logit contributions
vier-9.781
ashtra-9.755
amera-9.702
ersen-9.540
itably-9.472
Token th
Token ID294
Feature activation-7.72e-01

Pos logit contributions
Reviewed+0.486
spot+0.450
worms+0.446
lication+0.438
lessness+0.428
Neg logit contributions
vier-0.566
ashtra-0.566
amera-0.558
ersen-0.550
itably-0.545
Tokenud
Token ID463
Feature activation-1.53e+00

Pos logit contributions
ashtra+0.852
amera+0.845
vier+0.843
ersen+0.816
orah+0.813
Neg logit contributions
Reviewed-1.578
spot-1.469
worms-1.465
lication-1.457
lessness-1.429
Token from
Token ID422
Feature activation+1.10e+00

Pos logit contributions
vier+2.421
ashtra+2.407
amera+2.393
ersen+2.376
orah+2.342
Neg logit contributions
Reviewed-2.359
spot-2.161
worms-2.131
lication-2.123
lessness-2.038
Token the
Token ID262
Feature activation+2.52e-01

Pos logit contributions
Reviewed+1.885
spot+1.780
worms+1.764
lication+1.740
lessness+1.721
Neg logit contributions
vier-1.569
ashtra-1.560
amera-1.543
ersen-1.508
itably-1.488
TokenP
Token ID47
Feature activation+9.81e-01

Pos logit contributions
Reviewed+1.638
spot+1.534
worms+1.513
lication+1.489
lessness+1.465
Neg logit contributions
vier-1.854
ashtra-1.845
amera-1.827
ersen-1.796
itably-1.774
Tokenipes
Token ID18636
Feature activation+6.47e-01

Pos logit contributions
Reviewed+1.780
spot+1.685
worms+1.671
lication+1.647
lessness+1.628
Neg logit contributions
vier-1.300
ashtra-1.292
amera-1.279
ersen-1.249
itably-1.232
Token to
Token ID284
Feature activation+2.51e-01

Pos logit contributions
Reviewed+0.953
spot+0.891
worms+0.881
lication+0.867
lessness+0.854
Neg logit contributions
vier-1.078
ashtra-1.073
amera-1.062
ersen-1.043
itably-1.032
Token enhance
Token ID9494
Feature activation-9.35e-01

Pos logit contributions
Reviewed+0.401
spot+0.377
worms+0.373
lication+0.368
lessness+0.362
Neg logit contributions
vier-0.386
ashtra-0.384
amera-0.380
ersen-0.373
itably-0.369
Token heat
Token ID4894
Feature activation+1.19e+00

Pos logit contributions
ashtra+1.430
vier+1.425
amera+1.410
ersen+1.394
itably+1.380
Neg logit contributions
Reviewed-1.488
worms-1.390
spot-1.387
lication-1.372
lessness-1.334
Token diss
Token ID6249
Feature activation+3.47e+00

Pos logit contributions
Reviewed+1.330
spot+1.208
worms+1.195
lication+1.171
lessness+1.135
Neg logit contributions
vier-2.419
ashtra-2.413
amera-2.391
ersen-2.362
itably-2.342
Tokenipation
Token ID25857
Feature activation+1.12e+00

Pos logit contributions
Reviewed+2.935
spot+2.766
lication+2.703
lessness+2.631
worms+2.602
Neg logit contributions
vier-7.692
ashtra-7.647
amera-7.614
ersen-7.460
orah-7.376
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
Reviewed+1.594
spot+1.490
worms+1.471
lication+1.445
lessness+1.427
Neg logit contributions
vier-1.933
ashtra-1.923
amera-1.906
ersen-1.872
itably-1.850
Token MSI
Token ID30781
Feature activation+1.62e+00

Pos logit contributions
vier+0.166
ashtra+0.165
amera+0.163
ersen+0.160
itably+0.158
Neg logit contributions
Reviewed-0.190
spot-0.179
worms-0.179
lication-0.176
lessness-0.173
Token has
Token ID468
Feature activation+1.00e-01

Pos logit contributions
Reviewed+2.436
spot+2.289
worms+2.246
lication+2.206
lessness+2.203
Neg logit contributions
vier-2.643
ashtra-2.618
amera-2.579
ersen-2.520
itably-2.496
Token also
Token ID635
Feature activation+6.80e-02

Pos logit contributions
Reviewed+0.157
spot+0.146
worms+0.146
lication+0.144
lessness+0.140
Neg logit contributions
vier-0.157
ashtra-0.157
amera-0.155
ersen-0.154
itably-0.151
Token the
Token ID262
Feature activation-1.48e-02

Pos logit contributions
vier+0.848
ashtra+0.842
amera+0.833
ersen+0.814
itably+0.809
Neg logit contributions
Reviewed-0.950
spot-0.891
worms-0.886
lication-0.871
lessness-0.858
Token same
Token ID976
Feature activation+4.24e-04

Pos logit contributions
vier+0.025
ashtra+0.025
amera+0.024
ersen+0.024
itably+0.024
Neg logit contributions
Reviewed-0.022
spot-0.020
worms-0.020
lication-0.020
lessness-0.020
Token panel
Token ID6103
Feature activation+8.90e-01

Pos logit contributions
Reviewed+0.001
spot+0.001
worms+0.001
lication+0.001
lessness+0.001
Neg logit contributions
vier-0.001
ashtra-0.001
amera-0.001
ersen-0.001
itably-0.001
Token as
Token ID355
Feature activation+7.20e-01

Pos logit contributions
Reviewed+1.317
spot+1.235
worms+1.219
lication+1.198
lessness+1.182
Neg logit contributions
vier-1.474
ashtra-1.468
amera-1.452
ersen-1.427
itably-1.411
Token the
Token ID262
Feature activation+8.19e-02

Pos logit contributions
Reviewed+1.268
spot+1.202
worms+1.188
lication+1.172
lessness+1.160
Neg logit contributions
vier-0.988
ashtra-0.983
amera-0.969
ersen-0.950
orah-0.934
Token Asus
Token ID46301
Feature activation+3.47e+00

Pos logit contributions
Reviewed+0.123
spot+0.114
worms+0.114
lication+0.112
lessness+0.110
Neg logit contributions
vier-0.134
ashtra-0.133
amera-0.133
ersen-0.130
itably-0.129
Token.
Token ID13
Feature activation+8.30e-02

Pos logit contributions
Reviewed+5.055
spot+4.721
worms+4.650
lication+4.567
lessness+4.541
Neg logit contributions
vier-5.793
ashtra-5.735
amera-5.604
ersen-5.550
orah-5.482
Token Both
Token ID5747
Feature activation-8.23e-01

Pos logit contributions
Reviewed+0.140
spot+0.132
worms+0.132
lication+0.130
lessness+0.128
Neg logit contributions
vier-0.120
ashtra-0.119
amera-0.118
ersen-0.115
itably-0.113
Token performed
Token ID6157
Feature activation+8.27e-01

Pos logit contributions
ashtra+1.201
vier+1.198
amera+1.197
ersen+1.172
itably+1.152
Neg logit contributions
Reviewed-1.366
spot-1.279
worms-1.271
lication-1.261
lessness-1.231
Token very
Token ID845
Feature activation-6.14e-01

Pos logit contributions
Reviewed+1.298
spot+1.219
worms+1.207
lication+1.188
lessness+1.171
Neg logit contributions
vier-1.296
ashtra-1.290
amera-1.277
ersen-1.253
itably-1.238
Token similarly
Token ID12470
Feature activation+6.04e-01

Pos logit contributions
vier+0.995
ashtra+0.982
amera+0.976
ersen+0.962
itably+0.956
Neg logit contributions
Reviewed-0.933
worms-0.871
lication-0.859
spot-0.858
lessness-0.833
Token public
Token ID1171
Feature activation+1.88e-01

Pos logit contributions
vier+1.644
amera+1.619
ashtra+1.618
ersen+1.584
itably+1.575
Neg logit contributions
Reviewed-1.921
worms-1.785
spot-1.779
lication-1.759
lessness-1.683
Token force
Token ID2700
Feature activation-3.05e-01

Pos logit contributions
Reviewed+0.285
worms+0.262
spot+0.260
lication+0.255
Spot+0.246
Neg logit contributions
vier-0.302
amera-0.300
ashtra-0.299
ersen-0.295
orah-0.291
Token to
Token ID284
Feature activation-1.06e-01

Pos logit contributions
vier+0.466
amera+0.459
ashtra+0.458
ersen+0.448
orah+0.443
Neg logit contributions
Reviewed-0.496
spot-0.458
worms-0.453
lication-0.451
Spot-0.432
Token silence
Token ID9550
Feature activation+2.46e-01

Pos logit contributions
vier+0.163
ashtra+0.163
amera+0.160
ersen+0.159
itably+0.157
Neg logit contributions
Reviewed-0.169
spot-0.156
worms-0.156
lication-0.155
lessness-0.149
Token that
Token ID326
Feature activation+1.66e-01

Pos logit contributions
Reviewed+0.398
spot+0.372
worms+0.369
lication+0.364
lessness+0.354
Neg logit contributions
vier-0.375
ashtra-0.371
amera-0.369
ersen-0.365
itably-0.360
Token diss
Token ID6249
Feature activation+3.47e+00

Pos logit contributions
Reviewed+0.246
worms+0.229
spot+0.226
lication+0.223
lessness+0.212
Neg logit contributions
vier-0.273
ashtra-0.270
amera-0.269
ersen-0.266
orah-0.262
Tokenension
Token ID3004
Feature activation+8.87e-01

Pos logit contributions
Reviewed+4.219
spot+3.944
lication+3.854
worms+3.787
lessness+3.784
Neg logit contributions
ashtra-6.559
vier-6.532
amera-6.451
ersen-6.323
itably-6.320
Token,"
Token ID553
Feature activation-4.74e-01

Pos logit contributions
Reviewed+1.309
spot+1.216
worms+1.203
lication+1.187
lessness+1.158
Neg logit contributions
vier-1.479
ashtra-1.470
amera-1.460
ersen-1.434
itably-1.417
Token he
Token ID339
Feature activation+2.43e-01

Pos logit contributions
vier+0.733
ashtra+0.729
amera+0.722
ersen+0.709
itably+0.701
Neg logit contributions
Reviewed-0.753
spot-0.707
worms-0.700
lication-0.688
lessness-0.678
Token said
Token ID531
Feature activation-3.70e-01

Pos logit contributions
Reviewed+0.326
spot+0.307
worms+0.302
lication+0.296
lessness+0.293
Neg logit contributions
ashtra-0.435
vier-0.435
amera-0.430
ersen-0.421
itably-0.416
Token.
Token ID13
Feature activation-1.38e-02

Pos logit contributions
vier+0.664
ashtra+0.663
amera+0.654
ersen+0.642
itably+0.637
Neg logit contributions
Reviewed-0.497
spot-0.466
worms-0.459
lication-0.450
lessness-0.445
Token philosophy
Token ID8876
Feature activation+3.86e-01

Pos logit contributions
vier+0.702
ashtra+0.693
amera+0.684
ersen+0.673
itably+0.666
Neg logit contributions
Reviewed-0.761
spot-0.715
worms-0.712
lication-0.699
lessness-0.684
Token of
Token ID286
Feature activation+8.84e-02

Pos logit contributions
Reviewed+0.593
spot+0.554
worms+0.551
lication+0.540
lessness+0.530
Neg logit contributions
vier-0.620
ashtra-0.614
amera-0.608
ersen-0.598
itably-0.592
Token improvement
Token ID9025
Feature activation-2.94e-02

Pos logit contributions
Reviewed+0.141
spot+0.132
worms+0.131
lication+0.129
lessness+0.127
Neg logit contributions
vier-0.137
ashtra-0.136
amera-0.134
ersen-0.132
itably-0.130
Token that
Token ID326
Feature activation+1.81e-01

Pos logit contributions
vier+0.047
ashtra+0.047
amera+0.046
ersen+0.046
itably+0.045
Neg logit contributions
Reviewed-0.045
spot-0.042
worms-0.042
lication-0.041
lessness-0.040
Token values
Token ID3815
Feature activation-7.15e-01

Pos logit contributions
Reviewed+0.259
spot+0.239
worms+0.237
lication+0.234
lessness+0.228
Neg logit contributions
vier-0.312
ashtra-0.309
amera-0.307
ersen-0.302
itably-0.299
Token defect
Token ID11855
Feature activation+3.46e+00

Pos logit contributions
vier+1.086
ashtra+1.072
amera+1.064
ersen+1.044
itably+1.040
Neg logit contributions
Reviewed-1.154
spot-1.084
worms-1.076
lication-1.064
lessness-1.028
Token prevention
Token ID14196
Feature activation+1.60e-01

Pos logit contributions
Reviewed+4.997
spot+4.683
lessness+4.634
worms+4.557
lication+4.470
Neg logit contributions
ashtra-5.753
vier-5.729
amera-5.677
ersen-5.643
itably-5.496
Token over
Token ID625
Feature activation+9.85e-01

Pos logit contributions
Reviewed+0.237
spot+0.221
worms+0.219
lication+0.215
lessness+0.211
Neg logit contributions
vier-0.266
ashtra-0.265
amera-0.263
ersen-0.258
itably-0.255
Token defect
Token ID11855
Feature activation+2.80e+00

Pos logit contributions
Reviewed+1.548
spot+1.454
worms+1.438
lication+1.416
lessness+1.398
Neg logit contributions
vier-1.542
ashtra-1.536
amera-1.521
ersen-1.491
itably-1.473
Token detection
Token ID13326
Feature activation+2.32e-01

Pos logit contributions
Reviewed+4.307
spot+4.044
worms+3.980
lessness+3.964
lication+3.929
Neg logit contributions
ashtra-4.405
vier-4.380
amera-4.337
ersen-4.288
itably-4.205
Token.
Token ID13
Feature activation+7.65e-02

Pos logit contributions
Reviewed+0.329
spot+0.307
worms+0.303
lication+0.297
lessness+0.292
Neg logit contributions
vier-0.398
ashtra-0.396
amera-0.393
ersen-0.386
itably-0.382
Token comparison
Token ID7208
Feature activation-2.13e-01

Pos logit contributions
Reviewed+0.067
spot+0.063
worms+0.062
lication+0.061
lessness+0.060
Neg logit contributions
vier-0.071
ashtra-0.071
amera-0.071
ersen-0.069
itably-0.069
Token against
Token ID1028
Feature activation+9.45e-01

Pos logit contributions
vier+0.340
amera+0.337
ashtra+0.337
ersen+0.329
itably+0.326
Neg logit contributions
Reviewed-0.327
spot-0.305
worms-0.304
lication-0.299
lessness-0.293
Token the
Token ID262
Feature activation-2.56e-02

Pos logit contributions
Reviewed+1.623
spot+1.534
worms+1.518
lication+1.496
lessness+1.477
Neg logit contributions
vier-1.341
ashtra-1.334
amera-1.316
ersen-1.291
orah-1.271
Token very
Token ID845
Feature activation-7.29e-01

Pos logit contributions
vier+0.043
amera+0.043
ashtra+0.043
ersen+0.042
itably+0.042
Neg logit contributions
Reviewed-0.037
spot-0.035
worms-0.034
lication-0.034
lessness-0.033
Token popular
Token ID2968
Feature activation+7.25e-01

Pos logit contributions
vier+1.258
ashtra+1.245
amera+1.242
ersen+1.221
itably+1.217
Neg logit contributions
Reviewed-1.022
worms-0.947
lication-0.943
spot-0.943
lessness-0.921
Token Asus
Token ID46301
Feature activation+3.46e+00

Pos logit contributions
Reviewed+1.065
spot+0.995
worms+0.984
lication+0.968
lessness+0.953
Neg logit contributions
vier-1.211
ashtra-1.206
amera-1.195
ersen-1.174
itably-1.161
Token R
Token ID371
Feature activation+8.00e-01

Pos logit contributions
Reviewed+5.111
spot+4.791
worms+4.742
lication+4.616
lessness+4.607
Neg logit contributions
vier-5.682
ashtra-5.599
amera-5.465
ersen-5.409
orah-5.361
TokenOG
Token ID7730
Feature activation+2.72e-01

Pos logit contributions
Reviewed+1.273
spot+1.198
worms+1.184
lication+1.166
lessness+1.150
Neg logit contributions
vier-1.234
ashtra-1.229
amera-1.216
ersen-1.193
itably-1.178
Token Swift
Token ID15608
Feature activation+1.93e+00

Pos logit contributions
Reviewed+0.415
worms+0.393
spot+0.392
lication+0.385
lessness+0.377
Neg logit contributions
vier-0.432
amera-0.429
ashtra-0.427
ersen-0.422
itably-0.415
Token PG
Token ID23842
Feature activation+1.21e+00

Pos logit contributions
Reviewed+3.105
spot+2.942
worms+2.887
lication+2.849
lessness+2.807
Neg logit contributions
vier-2.945
ashtra-2.938
amera-2.897
ersen-2.855
itably-2.813
Token278
Token ID25870
Feature activation+8.01e-02

Pos logit contributions
Reviewed+1.591
spot+1.475
worms+1.463
lication+1.436
lessness+1.405
Neg logit contributions
vier-2.199
ashtra-2.179
amera-2.172
ersen-2.128
itably-2.100
Token players
Token ID1938
Feature activation+1.47e-01

Pos logit contributions
Reviewed+0.304
spot+0.284
worms+0.283
lication+0.278
lessness+0.274
Neg logit contributions
vier-0.290
ashtra-0.287
amera-0.284
ersen-0.280
itably-0.276
Token on
Token ID319
Feature activation+8.20e-02

Pos logit contributions
Reviewed+0.221
spot+0.206
worms+0.205
lication+0.201
lessness+0.198
Neg logit contributions
vier-0.239
ashtra-0.238
amera-0.236
ersen-0.232
orah-0.229
Token why
Token ID1521
Feature activation+7.62e-01

Pos logit contributions
Reviewed+0.133
spot+0.125
worms+0.124
lication+0.122
lessness+0.121
Neg logit contributions
vier-0.124
ashtra-0.123
amera-0.122
ersen-0.120
itably-0.118
Token they
Token ID484
Feature activation+5.67e-01

Pos logit contributions
Reviewed+1.183
spot+1.108
worms+1.099
lication+1.081
lessness+1.063
Neg logit contributions
vier-1.209
ashtra-1.201
amera-1.191
ersen-1.169
itably-1.154
Token had
Token ID550
Feature activation+5.21e-01

Pos logit contributions
Reviewed+0.805
worms+0.748
spot+0.746
lication+0.734
lessness+0.714
Neg logit contributions
vier-0.969
ashtra-0.966
amera-0.958
ersen-0.945
orah-0.935
Token diss
Token ID6249
Feature activation+3.45e+00

Pos logit contributions
Reviewed+0.796
worms+0.742
spot+0.742
lication+0.731
lessness+0.711
Neg logit contributions
vier-0.837
ashtra-0.832
amera-0.822
ersen-0.812
itably-0.805
Tokenented
Token ID4714
Feature activation-3.86e-01

Pos logit contributions
Reviewed+4.611
spot+4.285
worms+4.163
lication+4.151
lessness+4.068
Neg logit contributions
ashtra-6.127
vier-6.112
amera-6.048
ersen-5.887
itably-5.868
Token.
Token ID13
Feature activation+5.87e-03

Pos logit contributions
vier+0.624
ashtra+0.617
amera+0.617
ersen+0.606
orah+0.597
Neg logit contributions
Reviewed-0.584
worms-0.546
spot-0.541
lication-0.533
lessness-0.519
Token
Token ID198
Feature activation-6.97e-02

Pos logit contributions
Reviewed+0.009
spot+0.009
worms+0.009
lication+0.009
lessness+0.008
Neg logit contributions
vier-0.009
ashtra-0.009
amera-0.009
ersen-0.009
itably-0.009
Token
Token ID198
Feature activation+3.47e-02

Pos logit contributions
vier+0.107
ashtra+0.105
amera+0.104
ersen+0.103
itably+0.102
Neg logit contributions
Reviewed-0.112
spot-0.105
worms-0.104
lication-0.103
lessness-0.100
TokenFor
Token ID1890
Feature activation+3.73e-01

Pos logit contributions
Reviewed+0.052
spot+0.049
worms+0.049
lication+0.048
lessness+0.047
Neg logit contributions
vier-0.056
ashtra-0.056
amera-0.055
ersen-0.054
itably-0.054
Token their
Token ID511
Feature activation+3.26e-01

Pos logit contributions
Reviewed+3.724
spot+3.512
worms+3.461
lication+3.413
lessness+3.380
Neg logit contributions
ashtra-3.600
vier-3.587
amera-3.557
ersen-3.477
itably-3.443
Token jobs
Token ID3946
Feature activation+7.20e-01

Pos logit contributions
Reviewed+0.474
worms+0.440
spot+0.437
lication+0.428
lessness+0.419
Neg logit contributions
vier-0.553
ashtra-0.545
amera-0.540
ersen-0.535
orah-0.530
Token,
Token ID11
Feature activation+1.70e-01

Pos logit contributions
Reviewed+1.048
spot+0.977
worms+0.967
lication+0.950
lessness+0.933
Neg logit contributions
vier-1.215
ashtra-1.208
amera-1.198
ersen-1.177
itably-1.162
Token other
Token ID584
Feature activation+5.38e-01

Pos logit contributions
Reviewed+0.260
spot+0.242
worms+0.241
lication+0.237
lessness+0.231
Neg logit contributions
vier-0.273
ashtra-0.271
amera-0.269
ersen-0.267
orah-0.262
Token face
Token ID1986
Feature activation+4.42e-01

Pos logit contributions
Reviewed+0.836
spot+0.781
worms+0.769
lication+0.760
lessness+0.741
Neg logit contributions
vier-0.855
ashtra-0.846
amera-0.842
ersen-0.830
orah-0.817
Token incomp
Token ID13352
Feature activation+3.41e+00

Pos logit contributions
Reviewed+0.678
spot+0.632
worms+0.628
lication+0.616
lessness+0.601
Neg logit contributions
vier-0.711
ashtra-0.704
amera-0.699
ersen-0.687
itably-0.679
Tokenrehens
Token ID7345
Feature activation+1.01e+00

Pos logit contributions
Reviewed+3.947
lication+3.829
spot+3.806
lessness+3.706
worms+3.643
Neg logit contributions
vier-6.512
ashtra-6.452
amera-6.265
ersen-6.128
orah-6.088
Tokenion
Token ID295
Feature activation+3.78e-01

Pos logit contributions
Reviewed+1.427
spot+1.335
worms+1.319
lication+1.302
lessness+1.288
Neg logit contributions
ashtra-1.732
vier-1.730
amera-1.713
ersen-1.687
orah-1.668
Token in
Token ID287
Feature activation-1.10e-01

Pos logit contributions
Reviewed+0.565
spot+0.525
worms+0.519
lication+0.508
lessness+0.497
Neg logit contributions
vier-0.627
ashtra-0.621
amera-0.617
ersen-0.608
itably-0.598
Token their
Token ID511
Feature activation+4.47e-01

Pos logit contributions
vier+0.159
ashtra+0.157
amera+0.156
ersen+0.153
itably+0.151
Neg logit contributions
Reviewed-0.185
spot-0.173
worms-0.173
lication-0.169
lessness-0.167
Token families
Token ID4172
Feature activation-1.05e+00

Pos logit contributions
Reviewed+0.698
spot+0.654
worms+0.649
lication+0.634
lessness+0.623
Neg logit contributions
vier-0.705
ashtra-0.700
amera-0.692
ersen-0.683
itably-0.675
Token for
Token ID329
Feature activation+8.40e-01

Pos logit contributions
Reviewed+0.724
spot+0.674
worms+0.666
lication+0.654
lessness+0.644
Neg logit contributions
vier-0.925
ashtra-0.921
amera-0.913
ersen-0.897
itably-0.887
Token CPUs
Token ID32340
Feature activation+2.23e+00

Pos logit contributions
Reviewed+1.317
spot+1.238
worms+1.225
lication+1.207
lessness+1.189
Neg logit contributions
vier-1.315
ashtra-1.311
amera-1.297
ersen-1.272
itably-1.257
Token and
Token ID290
Feature activation+5.28e-01

Pos logit contributions
Reviewed+2.957
spot+2.786
worms+2.739
lication+2.684
lessness+2.669
Neg logit contributions
ashtra-3.986
vier-3.960
amera-3.926
ersen-3.856
itably-3.819
Token memory
Token ID4088
Feature activation+1.12e+00

Pos logit contributions
Reviewed+0.815
spot+0.763
worms+0.753
lication+0.738
lessness+0.729
Neg logit contributions
vier-0.846
ashtra-0.838
amera-0.833
ersen-0.821
orah-0.809
Token (
Token ID357
Feature activation-2.95e-01

Pos logit contributions
Reviewed+1.584
spot+1.479
worms+1.462
lication+1.435
lessness+1.418
Neg logit contributions
vier-1.919
ashtra-1.913
amera-1.895
ersen-1.860
itably-1.840
TokenIntel
Token ID24123
Feature activation+3.38e+00

Pos logit contributions
vier+0.475
ashtra+0.469
amera+0.466
ersen+0.458
orah+0.454
Neg logit contributions
Reviewed-0.448
spot-0.420
worms-0.418
lication-0.411
lessness-0.404
Token VT
Token ID32751
Feature activation+1.42e-01

Pos logit contributions
Reviewed+4.939
spot+4.595
worms+4.569
lication+4.527
lessness+4.437
Neg logit contributions
ashtra-5.596
vier-5.501
amera-5.404
ersen-5.325
orah-5.222
Token-
Token ID12
Feature activation+2.51e-01

Pos logit contributions
Reviewed+0.187
spot+0.174
worms+0.172
lication+0.168
lessness+0.165
Neg logit contributions
vier-0.259
ashtra-0.258
amera-0.257
ersen-0.252
itably-0.249
Tokenx
Token ID87
Feature activation+4.67e-01

Pos logit contributions
Reviewed+0.406
spot+0.382
worms+0.377
lication+0.373
lessness+0.369
Neg logit contributions
ashtra-0.381
vier-0.380
amera-0.375
ersen-0.369
itably-0.364
Token,
Token ID11
Feature activation-1.12e-01

Pos logit contributions
Reviewed+0.733
spot+0.689
worms+0.682
lication+0.671
lessness+0.660
Neg logit contributions
vier-0.731
ashtra-0.726
amera-0.719
ersen-0.707
itably-0.698
Token AMD
Token ID10324
Feature activation+2.01e+00

Pos logit contributions
vier+0.181
ashtra+0.179
amera+0.178
ersen+0.176
itably+0.173
Neg logit contributions
Reviewed-0.171
spot-0.161
worms-0.160
lication-0.156
lessness-0.154
Token It
Token ID632
Feature activation-2.09e-01

Pos logit contributions
vier+0.070
ashtra+0.069
amera+0.069
ersen+0.068
itably+0.067
Neg logit contributions
Reviewed-0.072
spot-0.068
worms-0.068
lication-0.066
lessness-0.065
Token should
Token ID815
Feature activation-3.48e-01

Pos logit contributions
vier+0.357
ashtra+0.355
amera+0.353
ersen+0.347
itably+0.343
Neg logit contributions
Reviewed-0.298
spot-0.277
worms-0.274
lication-0.271
lessness-0.264
Token be
Token ID307
Feature activation-1.42e-01

Pos logit contributions
vier+0.530
ashtra+0.529
amera+0.523
ersen+0.514
itably+0.506
Neg logit contributions
Reviewed-0.561
spot-0.525
worms-0.521
lication-0.515
lessness-0.504
Token noted
Token ID4367
Feature activation-2.20e-01

Pos logit contributions
vier+0.220
ashtra+0.218
amera+0.216
ersen+0.212
itably+0.211
Neg logit contributions
Reviewed-0.226
spot-0.212
worms-0.211
lication-0.207
lessness-0.203
Token that
Token ID326
Feature activation-4.12e-02

Pos logit contributions
vier+0.338
ashtra+0.336
amera+0.332
ersen+0.328
itably+0.322
Neg logit contributions
Reviewed-0.351
spot-0.330
worms-0.327
lication-0.321
lessness-0.316
Token Bour
Token ID20576
Feature activation-3.79e+00

Pos logit contributions
vier+0.062
ashtra+0.061
amera+0.061
ersen+0.060
itably+0.059
Neg logit contributions
Reviewed-0.067
spot-0.063
worms-0.063
lication-0.062
lessness-0.061
Tokendain
Token ID27162
Feature activation-8.01e-01

Pos logit contributions
vier+6.193
ashtra+6.072
ersen+6.017
amera+5.978
orah+5.872
Neg logit contributions
Reviewed-5.551
worms-5.123
lication-5.040
spot-4.984
lessness-4.861
Token isn
Token ID2125
Feature activation+7.91e-01

Pos logit contributions
vier+1.363
ashtra+1.349
amera+1.340
ersen+1.328
itably+1.304
Neg logit contributions
Reviewed-1.149
spot-1.056
worms-1.048
lication-1.042
lessness-1.009
Token
Token ID447
Feature activation+1.97e-01

Pos logit contributions
Reviewed+1.293
spot+1.217
worms+1.206
lication+1.188
lessness+1.171
Neg logit contributions
vier-1.188
ashtra-1.179
amera-1.168
ersen-1.146
itably-1.132
Token
Token ID247
Feature activation-3.90e-02

Pos logit contributions
Reviewed+0.330
spot+0.312
worms+0.309
lication+0.304
lessness+0.300
Neg logit contributions
vier-0.286
ashtra-0.285
amera-0.282
ersen-0.277
itably-0.272
Tokent
Token ID83
Feature activation+6.77e-01

Pos logit contributions
vier+0.067
ashtra+0.067
amera+0.066
ersen+0.065
itably+0.064
Neg logit contributions
Reviewed-0.056
spot-0.051
worms-0.051
lication-0.050
lessness-0.049
Token Bryce
Token ID39560
Feature activation-1.00e+00

Pos logit contributions
vier+0.153
ashtra+0.152
amera+0.151
ersen+0.149
itably+0.147
Neg logit contributions
Reviewed-0.146
spot-0.136
worms-0.135
lication-0.134
lessness-0.129
Token Harper
Token ID12686
Feature activation-1.47e+00

Pos logit contributions
vier+1.519
ashtra+1.505
amera+1.492
ersen+1.466
itably+1.445
Neg logit contributions
Reviewed-1.632
spot-1.523
worms-1.515
lication-1.490
lessness-1.467
Token,
Token ID11
Feature activation-3.05e-02

Pos logit contributions
vier+2.414
ashtra+2.376
amera+2.358
ersen+2.344
itably+2.277
Neg logit contributions
Reviewed-2.191
spot-2.020
worms-2.014
lication-2.006
lessness-1.916
Token Den
Token ID5601
Feature activation-2.75e-02

Pos logit contributions
vier+0.049
ashtra+0.049
amera+0.049
ersen+0.048
itably+0.047
Neg logit contributions
Reviewed-0.046
spot-0.043
worms-0.043
lication-0.042
lessness-0.041
Tokenard
Token ID446
Feature activation-4.53e-01

Pos logit contributions
vier+0.050
ashtra+0.049
amera+0.049
ersen+0.048
itably+0.048
Neg logit contributions
Reviewed-0.037
spot-0.034
worms-0.034
lication-0.033
lessness-0.032
Token Span
Token ID49101
Feature activation-3.73e+00

Pos logit contributions
vier+0.388
amera+0.376
ashtra+0.374
ersen+0.367
orah+0.356
Neg logit contributions
Reviewed-1.041
spot-0.987
worms-0.984
lication-0.974
lessness-0.959
Token,
Token ID11
Feature activation+1.24e-02

Pos logit contributions
vier+5.813
amera+5.644
ersen+5.521
ashtra+5.446
ager+5.400
Neg logit contributions
Reviewed-5.735
worms-5.333
spot-5.287
lication-5.183
Spot-4.998
Token Jays
Token ID23288
Feature activation-1.49e+00

Pos logit contributions
Reviewed+0.019
spot+0.018
worms+0.017
lication+0.017
lessness+0.017
Neg logit contributions
vier-0.020
ashtra-0.020
amera-0.020
ersen-0.019
itably-0.019
Tokenon
Token ID261
Feature activation+2.57e-01

Pos logit contributions
vier+2.317
amera+2.310
ashtra+2.284
ersen+2.272
orah+2.266
Neg logit contributions
Reviewed-2.303
worms-2.148
lication-2.115
spot-2.114
lessness-2.049
Token Wer
Token ID27648
Feature activation-1.99e+00

Pos logit contributions
Reviewed+0.375
spot+0.352
worms+0.348
lication+0.342
lessness+0.336
Neg logit contributions
ashtra-0.432
vier-0.430
amera-0.425
ersen-0.417
itably-0.413
Tokenth
Token ID400
Feature activation-1.13e+00

Pos logit contributions
vier+2.744
ashtra+2.722
amera+2.699
ersen+2.648
itably+2.594
Neg logit contributions
Reviewed-3.498
spot-3.291
worms-3.269
lication-3.222
lessness-3.177
Token from
Token ID422
Feature activation+1.18e+00

Pos logit contributions
Reviewed+0.619
spot+0.577
worms+0.571
lication+0.563
lessness+0.552
Neg logit contributions
vier-0.704
ashtra-0.698
amera-0.693
ersen-0.682
itably-0.673
Token the
Token ID262
Feature activation+8.84e-02

Pos logit contributions
Reviewed+1.970
spot+1.865
worms+1.854
lication+1.817
lessness+1.804
Neg logit contributions
ashtra-1.705
vier-1.703
amera-1.678
ersen-1.643
orah-1.627
Token Harvard
Token ID11131
Feature activation-1.13e+00

Pos logit contributions
Reviewed+0.145
spot+0.136
worms+0.135
lication+0.134
lessness+0.131
Neg logit contributions
vier-0.133
ashtra-0.131
amera-0.131
ersen-0.128
itably-0.126
Token-
Token ID12
Feature activation+7.26e-03

Pos logit contributions
vier+1.851
ashtra+1.833
amera+1.814
ersen+1.806
orah+1.783
Neg logit contributions
Reviewed-1.689
spot-1.570
lication-1.562
worms-1.543
lessness-1.499
TokenSmith
Token ID17919
Feature activation+9.45e-01

Pos logit contributions
Reviewed+0.008
spot+0.008
worms+0.008
lication+0.008
lessness+0.007
Neg logit contributions
vier-0.014
ashtra-0.014
amera-0.014
ersen-0.014
itably-0.014
Tokensonian
Token ID35202
Feature activation-3.66e+00

Pos logit contributions
Reviewed+1.155
spot+1.065
worms+1.051
lication+1.046
lessness+1.012
Neg logit contributions
vier-1.808
ashtra-1.797
amera-1.767
ersen-1.755
orah-1.743
Token Center
Token ID3337
Feature activation+3.53e-01

Pos logit contributions
vier+3.970
ersen+3.826
amera+3.709
ashtra+3.702
ager+3.686
Neg logit contributions
Reviewed-7.268
worms-6.988
lication-6.952
spot-6.908
lessness-6.750
Token for
Token ID329
Feature activation+3.07e-01

Pos logit contributions
Reviewed+0.550
spot+0.525
worms+0.513
lication+0.504
lessness+0.498
Neg logit contributions
ashtra-0.552
vier-0.552
amera-0.547
ersen-0.537
itably-0.530
Token Ast
Token ID8304
Feature activation-8.92e-01

Pos logit contributions
Reviewed+0.548
spot+0.520
worms+0.514
lication+0.507
lessness+0.501
Neg logit contributions
vier-0.415
ashtra-0.414
amera-0.408
ersen-0.399
itably-0.394
Tokenroph
Token ID10051
Feature activation-1.30e+00

Pos logit contributions
vier+1.505
ashtra+1.496
amera+1.486
ersen+1.459
itably+1.443
Neg logit contributions
Reviewed-1.293
spot-1.205
worms-1.193
lication-1.173
lessness-1.155
Tokenysics
Token ID23154
Feature activation+6.02e-01

Pos logit contributions
amera+2.094
vier+2.078
ersen+2.066
ashtra+2.063
orah+2.046
Neg logit contributions
Reviewed-2.029
spot-1.823
worms-1.815
lication-1.761
lessness-1.760
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
vier+0.823
ashtra+0.823
amera+0.812
ersen+0.794
itably+0.789
Neg logit contributions
Reviewed-0.718
spot-0.674
worms-0.664
lication-0.656
lessness-0.647
Token (
Token ID357
Feature activation-5.65e-01

Pos logit contributions
Reviewed+1.546
spot+1.440
worms+1.417
lication+1.394
lessness+1.375
Neg logit contributions
vier-2.070
ashtra-2.068
amera-2.047
ersen-2.006
itably-1.991
Token 2002
Token ID6244
Feature activation+1.89e-01

Pos logit contributions
vier+0.753
ashtra+0.723
amera+0.721
ersen+0.719
orah+0.709
Neg logit contributions
Reviewed-1.024
spot-0.972
worms-0.966
lication-0.959
lessness-0.927
Token ).
Token ID6739
Feature activation+4.43e-01

Pos logit contributions
Reviewed+0.289
spot+0.272
worms+0.268
lication+0.264
lessness+0.260
Neg logit contributions
vier-0.302
ashtra-0.302
amera-0.298
ersen-0.292
itably-0.289
Token A
Token ID317
Feature activation-3.71e-02

Pos logit contributions
Reviewed+0.733
spot+0.691
worms+0.684
lication+0.674
lessness+0.665
Neg logit contributions
vier-0.657
ashtra-0.654
amera-0.647
ersen-0.634
itably-0.626
Token longitudinal
Token ID36211
Feature activation-3.62e+00

Pos logit contributions
vier+0.057
ashtra+0.057
amera+0.056
ersen+0.056
itably+0.054
Neg logit contributions
Reviewed-0.059
spot-0.055
worms-0.055
lication-0.054
lessness-0.053
Token study
Token ID2050
Feature activation-1.50e+00

Pos logit contributions
ashtra+5.320
ersen+5.260
amera+5.214
vier+5.207
ager+5.170
Neg logit contributions
Reviewed-5.583
worms-5.326
spot-5.269
lication-5.148
lessness-5.043
Token of
Token ID286
Feature activation-4.04e-01

Pos logit contributions
vier+2.292
ashtra+2.282
amera+2.263
ersen+2.224
itably+2.185
Neg logit contributions
Reviewed-2.408
spot-2.256
worms-2.241
lication-2.194
lessness-2.160
Token bullying
Token ID20714
Feature activation-1.46e-02

Pos logit contributions
vier+0.580
ashtra+0.578
amera+0.570
ersen+0.563
itably+0.555
Neg logit contributions
Reviewed-0.690
spot-0.645
worms-0.639
lication-0.637
lessness-0.619
Token,
Token ID11
Feature activation-1.25e-01

Pos logit contributions
ashtra+0.021
vier+0.021
amera+0.021
ersen+0.021
itably+0.020
Neg logit contributions
Reviewed-0.025
spot-0.023
worms-0.023
lication-0.023
lessness-0.022
Token dominance
Token ID18648
Feature activation-1.60e+00

Pos logit contributions
vier+0.189
ashtra+0.189
amera+0.186
ersen+0.185
itably+0.181
Neg logit contributions
Reviewed-0.203
spot-0.189
worms-0.188
lication-0.187
lessness-0.181
TokenED
Token ID1961
Feature activation+1.67e-01

Pos logit contributions
vier+0.300
ashtra+0.297
amera+0.294
ersen+0.288
itably+0.286
Neg logit contributions
Reviewed-0.272
spot-0.256
worms-0.255
lication-0.252
lessness-0.248
TokenMON
Token ID27857
Feature activation-2.50e-01

Pos logit contributions
Reviewed+0.255
spot+0.238
worms+0.237
lication+0.233
lessness+0.229
Neg logit contributions
vier-0.270
ashtra-0.268
amera-0.266
ersen-0.261
itably-0.258
TokenTON
Token ID11357
Feature activation+2.95e-01

Pos logit contributions
ashtra+0.463
vier+0.461
amera+0.456
ersen+0.453
itably+0.450
Neg logit contributions
Reviewed-0.320
worms-0.293
spot-0.289
lication-0.288
lessness-0.277
Token
Token ID784
Feature activation+2.81e-02

Pos logit contributions
Reviewed+0.407
spot+0.378
worms+0.375
lication+0.368
lessness+0.360
Neg logit contributions
vier-0.517
ashtra-0.516
amera-0.509
ersen-0.502
itably-0.496
Token Omar
Token ID24980
Feature activation-1.30e+00

Pos logit contributions
Reviewed+0.044
spot+0.042
worms+0.041
lication+0.041
lessness+0.040
Neg logit contributions
vier-0.044
ashtra-0.044
amera-0.043
ersen-0.042
itably-0.042
Token Kh
Token ID5311
Feature activation-3.56e+00

Pos logit contributions
vier+2.099
ashtra+2.093
amera+2.082
ersen+2.030
itably+2.004
Neg logit contributions
Reviewed-1.978
worms-1.837
spot-1.818
lication-1.775
lessness-1.758
Tokenadr
Token ID41909
Feature activation-2.12e+00

Pos logit contributions
amera+3.202
vier+3.100
aida+3.047
orah+2.985
ashtra+2.969
Neg logit contributions
Reviewed-8.209
spot-7.502
lication-7.290
worms-7.286
lessness-7.264
Token has
Token ID468
Feature activation-5.66e-02

Pos logit contributions
ashtra+3.479
vier+3.448
ersen+3.426
amera+3.383
orah+3.327
Neg logit contributions
Reviewed-3.130
spot-2.886
lication-2.850
worms-2.822
lessness-2.706
Token agreed
Token ID4987
Feature activation-3.80e-01

Pos logit contributions
vier+0.093
ashtra+0.092
amera+0.091
ersen+0.091
itably+0.089
Neg logit contributions
Reviewed-0.085
spot-0.079
worms-0.078
lication-0.077
lessness-0.075
Token to
Token ID284
Feature activation-2.57e-01

Pos logit contributions
vier+0.553
ashtra+0.550
amera+0.545
ersen+0.534
itably+0.525
Neg logit contributions
Reviewed-0.638
spot-0.600
worms-0.596
lication-0.584
lessness-0.576
Token remain
Token ID3520
Feature activation+1.54e-01

Pos logit contributions
vier+0.408
ashtra+0.406
amera+0.401
ersen+0.395
itably+0.389
Neg logit contributions
Reviewed-0.400
spot-0.373
worms-0.370
lication-0.363
lessness-0.355
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
vier+0.358
ashtra+0.358
amera+0.353
ersen+0.346
itably+0.343
Neg logit contributions
Reviewed-0.323
spot-0.304
worms-0.299
lication-0.296
lessness-0.292
Token (
Token ID357
Feature activation-5.41e-01

Pos logit contributions
Reviewed+1.441
spot+1.342
worms+1.317
lication+1.295
lessness+1.281
Neg logit contributions
ashtra-1.996
vier-1.995
amera-1.977
ersen-1.931
itably-1.922
Token 2010
Token ID3050
Feature activation+1.13e-01

Pos logit contributions
vier+0.743
ashtra+0.713
amera+0.710
ersen+0.708
orah+0.699
Neg logit contributions
Reviewed-0.957
spot-0.908
worms-0.904
lication-0.895
lessness-0.867
Token ).
Token ID6739
Feature activation+4.93e-01

Pos logit contributions
Reviewed+0.179
spot+0.167
worms+0.166
lication+0.163
lessness+0.161
Neg logit contributions
vier-0.176
ashtra-0.174
amera-0.173
ersen-0.170
itably-0.167
Token A
Token ID317
Feature activation-6.90e-02

Pos logit contributions
Reviewed+0.819
spot+0.772
worms+0.765
lication+0.753
lessness+0.744
Neg logit contributions
vier-0.727
ashtra-0.723
amera-0.716
ersen-0.701
itably-0.693
Token longitudinal
Token ID36211
Feature activation-3.56e+00

Pos logit contributions
vier+0.109
ashtra+0.108
amera+0.107
ersen+0.105
orah+0.103
Neg logit contributions
Reviewed-0.108
spot-0.101
worms-0.100
lication-0.099
lessness-0.097
Token investigation
Token ID3645
Feature activation-1.57e+00

Pos logit contributions
ashtra+5.185
ersen+5.145
vier+5.113
amera+5.075
ager+5.054
Neg logit contributions
Reviewed-5.498
worms-5.185
spot-5.145
lication-5.105
lessness-4.905
Token of
Token ID286
Feature activation-6.78e-01

Pos logit contributions
vier+2.338
ashtra+2.321
amera+2.306
ersen+2.276
orah+2.225
Neg logit contributions
Reviewed-2.550
spot-2.383
worms-2.370
lication-2.326
lessness-2.274
Token young
Token ID1862
Feature activation-1.31e+00

Pos logit contributions
vier+0.976
ashtra+0.972
amera+0.955
ersen+0.949
itably+0.931
Neg logit contributions
Reviewed-1.157
spot-1.077
worms-1.069
lication-1.068
lessness-1.035
Token adolescents
Token ID25447
Feature activation-1.13e+00

Pos logit contributions
vier+1.843
ashtra+1.836
ersen+1.808
amera+1.796
agers+1.779
Neg logit contributions
Reviewed-2.262
lication-2.103
spot-2.093
worms-2.065
lessness-2.016
Token
Token ID447
Feature activation-6.36e-01

Pos logit contributions
ashtra+1.552
vier+1.547
amera+1.514
ersen+1.506
orah+1.481
Neg logit contributions
Reviewed-1.991
spot-1.851
worms-1.840
lication-1.835
lessness-1.779
Tokenively
Token ID2280
Feature activation+7.28e-01

Pos logit contributions
Reviewed+1.219
spot+1.156
worms+1.145
lication+1.133
lessness+1.117
Neg logit contributions
vier-0.808
ashtra-0.802
amera-0.793
ersen-0.774
itably-0.764
Token simple
Token ID2829
Feature activation-2.14e-01

Pos logit contributions
Reviewed+1.072
spot+0.999
worms+0.990
lication+0.975
lessness+0.957
Neg logit contributions
vier-1.213
ashtra-1.205
amera-1.197
ersen-1.175
itably-1.164
Token
Token ID564
Feature activation-1.10e-01

Pos logit contributions
vier+0.336
ashtra+0.334
amera+0.332
ersen+0.327
itably+0.323
Neg logit contributions
Reviewed-0.337
spot-0.314
worms-0.311
lication-0.307
lessness-0.300
Token
Token ID250
Feature activation-7.32e-01

Pos logit contributions
vier+0.173
ashtra+0.172
amera+0.171
ersen+0.168
orah+0.165
Neg logit contributions
Reviewed-0.171
spot-0.160
worms-0.159
lication-0.157
lessness-0.154
TokenU
Token ID52
Feature activation-1.06e+00

Pos logit contributions
vier+1.217
ashtra+1.202
amera+1.196
ersen+1.184
orah+1.164
Neg logit contributions
Reviewed-1.084
worms-0.999
spot-0.996
lication-0.982
lessness-0.960
Tokensonian
Token ID35202
Feature activation-3.55e+00

Pos logit contributions
vier+1.749
ashtra+1.725
amera+1.712
ersen+1.706
orah+1.669
Neg logit contributions
Reviewed-1.586
worms-1.464
spot-1.460
lication-1.446
lessness-1.410
Token
Token ID447
Feature activation-5.67e-01

Pos logit contributions
vier+5.415
amera+5.364
ersen+5.332
ashtra+5.270
itably+5.149
Neg logit contributions
Reviewed-5.442
spot-5.058
worms-4.964
lication-4.955
lessness-4.784
Token
Token ID251
Feature activation+2.27e-01

Pos logit contributions
vier+0.942
ashtra+0.938
amera+0.929
ersen+0.912
itably+0.902
Neg logit contributions
Reviewed-0.838
spot-0.784
worms-0.775
lication-0.762
lessness-0.751
Token houses
Token ID7777
Feature activation-1.48e+00

Pos logit contributions
Reviewed+0.343
spot+0.319
worms+0.317
lication+0.312
lessness+0.306
Neg logit contributions
vier-0.371
ashtra-0.368
amera-0.366
ersen-0.360
itably-0.355
Token,
Token ID11
Feature activation+7.19e-02

Pos logit contributions
vier+2.362
ashtra+2.340
amera+2.330
ersen+2.280
orah+2.272
Neg logit contributions
Reviewed-2.262
worms-2.078
spot-2.073
lication-2.053
lessness-1.992
Token a
Token ID257
Feature activation+7.96e-02

Pos logit contributions
Reviewed+0.111
spot+0.104
worms+0.103
lication+0.101
lessness+0.099
Neg logit contributions
vier-0.114
ashtra-0.114
amera-0.113
ersen-0.111
itably-0.110
Token some
Token ID617
Feature activation-3.60e-01

Pos logit contributions
vier+1.002
amera+0.993
ashtra+0.989
ersen+0.969
itably+0.951
Neg logit contributions
Reviewed-1.262
spot-1.184
worms-1.174
lication-1.159
lessness-1.146
Token bad
Token ID2089
Feature activation-2.96e-01

Pos logit contributions
vier+0.562
amera+0.559
ashtra+0.559
ersen+0.555
itably+0.541
Neg logit contributions
Reviewed-0.565
spot-0.524
worms-0.521
lication-0.511
lessness-0.503
Token choices
Token ID7747
Feature activation-7.76e-01

Pos logit contributions
vier+0.449
ashtra+0.445
amera+0.443
ersen+0.436
itably+0.426
Neg logit contributions
Reviewed-0.479
spot-0.444
worms-0.444
lication-0.440
lessness-0.431
Token,
Token ID11
Feature activation-3.05e-01

Pos logit contributions
vier+1.222
ashtra+1.204
amera+1.196
ersen+1.179
itably+1.159
Neg logit contributions
Reviewed-1.221
spot-1.130
worms-1.129
lication-1.109
lessness-1.076
Token Michael
Token ID3899
Feature activation-1.12e+00

Pos logit contributions
vier+0.483
ashtra+0.479
amera+0.475
ersen+0.469
itably+0.461
Neg logit contributions
Reviewed-0.475
spot-0.443
worms-0.439
lication-0.434
lessness-0.425
Token Vick
Token ID33347
Feature activation-3.54e+00

Pos logit contributions
vier+1.840
ashtra+1.819
amera+1.796
ersen+1.780
itably+1.761
Neg logit contributions
Reviewed-1.667
spot-1.541
worms-1.538
lication-1.509
lessness-1.462
Token has
Token ID468
Feature activation+3.61e-02

Pos logit contributions
vier+5.451
ersen+5.409
ashtra+5.329
amera+5.311
itably+5.201
Neg logit contributions
Reviewed-5.373
worms-4.896
spot-4.886
lication-4.877
herent-4.616
Token devoted
Token ID13378
Feature activation+1.81e-01

Pos logit contributions
Reviewed+0.054
spot+0.050
worms+0.050
lication+0.049
lessness+0.048
Neg logit contributions
vier-0.059
ashtra-0.059
amera-0.058
ersen-0.058
itably-0.057
Token time
Token ID640
Feature activation+4.39e-01

Pos logit contributions
Reviewed+0.303
spot+0.285
worms+0.283
lication+0.279
lessness+0.273
Neg logit contributions
vier-0.266
ashtra-0.264
amera-0.262
ersen-0.257
itably-0.254
Token to
Token ID284
Feature activation-2.72e-01

Pos logit contributions
Reviewed+0.669
spot+0.627
worms+0.620
lication+0.610
lessness+0.602
Neg logit contributions
vier-0.709
ashtra-0.706
amera-0.699
ersen-0.686
itably-0.678
Token speaking
Token ID5486
Feature activation+5.62e-01

Pos logit contributions
vier+0.408
ashtra+0.404
amera+0.403
ersen+0.395
itably+0.390
Neg logit contributions
Reviewed-0.445
spot-0.417
worms-0.412
lication-0.408
lessness-0.399
Token from
Token ID422
Feature activation+1.27e+00

Pos logit contributions
vier+0.633
ashtra+0.626
amera+0.623
ersen+0.617
itably+0.608
Neg logit contributions
Reviewed-0.602
spot-0.554
worms-0.552
lication-0.545
lessness-0.534
Token the
Token ID262
Feature activation+4.98e-02

Pos logit contributions
Reviewed+2.107
spot+1.997
worms+1.978
lication+1.943
lessness+1.931
Neg logit contributions
vier-1.855
ashtra-1.849
amera-1.819
ersen-1.775
itably-1.765
Token Harvard
Token ID11131
Feature activation-8.63e-01

Pos logit contributions
Reviewed+0.080
spot+0.075
worms+0.075
lication+0.074
lessness+0.073
Neg logit contributions
vier-0.076
ashtra-0.075
amera-0.075
ersen-0.073
itably-0.072
Token-
Token ID12
Feature activation+2.04e-01

Pos logit contributions
vier+1.451
ashtra+1.433
amera+1.424
ersen+1.414
itably+1.395
Neg logit contributions
Reviewed-1.246
spot-1.154
lication-1.145
worms-1.143
lessness-1.107
TokenSmith
Token ID17919
Feature activation+1.29e+00

Pos logit contributions
Reviewed+0.234
spot+0.215
worms+0.213
lication+0.209
lessness+0.204
Neg logit contributions
vier-0.407
ashtra-0.404
amera-0.400
ersen-0.395
itably-0.391
Tokensonian
Token ID35202
Feature activation-3.51e+00

Pos logit contributions
Reviewed+1.472
spot+1.350
worms+1.329
lication+1.300
lessness+1.274
Neg logit contributions
vier-2.567
ashtra-2.557
amera-2.538
ersen-2.500
itably-2.477
Token Center
Token ID3337
Feature activation+6.91e-01

Pos logit contributions
vier+3.879
ersen+3.739
amera+3.684
ashtra+3.636
ager+3.560
Neg logit contributions
Reviewed-6.852
worms-6.665
lication-6.585
spot-6.475
lessness-6.375
Token for
Token ID329
Feature activation+3.93e-01

Pos logit contributions
Reviewed+1.067
spot+1.020
worms+0.987
lication+0.978
lessness+0.965
Neg logit contributions
vier-1.088
ashtra-1.087
amera-1.081
ersen-1.054
itably-1.049
Token Ast
Token ID8304
Feature activation-6.18e-01

Pos logit contributions
Reviewed+0.709
spot+0.671
worms+0.666
lication+0.657
lessness+0.649
Neg logit contributions
vier-0.525
ashtra-0.521
amera-0.516
ersen-0.505
itably-0.497
Tokenroph
Token ID10051
Feature activation-9.85e-01

Pos logit contributions
vier+1.007
ashtra+0.992
amera+0.992
ersen+0.976
itably+0.965
Neg logit contributions
Reviewed-0.939
spot-0.868
worms-0.864
lication-0.856
lessness-0.833
Tokenysics
Token ID23154
Feature activation+6.84e-01

Pos logit contributions
vier+1.619
amera+1.611
ashtra+1.610
ersen+1.582
orah+1.558
Neg logit contributions
Reviewed-1.485
spot-1.352
worms-1.349
lication-1.320
lessness-1.295
Tokent
Token ID83
Feature activation+5.42e-01

Pos logit contributions
vier+0.158
ashtra+0.157
amera+0.155
ersen+0.154
itably+0.151
Neg logit contributions
Reviewed-0.119
spot-0.110
worms-0.109
lication-0.107
lessness-0.105
Token say
Token ID910
Feature activation-4.97e-01

Pos logit contributions
Reviewed+0.875
spot+0.822
worms+0.814
lication+0.803
lessness+0.790
Neg logit contributions
vier-0.824
ashtra-0.820
amera-0.812
ersen-0.798
itably-0.788
Token anything
Token ID1997
Feature activation+6.78e-03

Pos logit contributions
vier+0.779
ashtra+0.770
amera+0.766
ersen+0.752
itably+0.743
Neg logit contributions
Reviewed-0.783
spot-0.735
worms-0.727
lication-0.715
lessness-0.702
Token negative
Token ID4633
Feature activation+1.26e+00

Pos logit contributions
Reviewed+0.011
spot+0.010
worms+0.010
lication+0.010
lessness+0.009
Neg logit contributions
vier-0.011
ashtra-0.011
amera-0.011
ersen-0.010
orah-0.010
Token about
Token ID546
Feature activation+7.02e-01

Pos logit contributions
Reviewed+1.935
spot+1.824
worms+1.795
lication+1.773
lessness+1.757
Neg logit contributions
vier-1.999
ashtra-1.992
amera-1.963
ersen-1.930
itably-1.912
Token Bour
Token ID20576
Feature activation-3.48e+00

Pos logit contributions
Reviewed+1.133
spot+1.066
worms+1.055
lication+1.040
lessness+1.026
Neg logit contributions
vier-1.069
ashtra-1.063
amera-1.052
ersen-1.031
itably-1.019
Tokendain
Token ID27162
Feature activation-5.65e-01

Pos logit contributions
vier+5.972
ashtra+5.897
amera+5.854
ersen+5.724
orah+5.692
Neg logit contributions
Reviewed-4.831
worms-4.439
lication-4.374
spot-4.371
lessness-4.236
Token in
Token ID287
Feature activation-3.79e-02

Pos logit contributions
vier+0.892
ashtra+0.879
amera+0.877
ersen+0.867
itably+0.845
Neg logit contributions
Reviewed-0.885
spot-0.823
worms-0.814
lication-0.803
lessness-0.788
Token public
Token ID1171
Feature activation+6.96e-01

Pos logit contributions
vier+0.056
ashtra+0.056
amera+0.055
ersen+0.054
itably+0.054
Neg logit contributions
Reviewed-0.063
spot-0.059
worms-0.059
lication-0.058
lessness-0.057
Token
Token ID960
Feature activation+5.95e-02

Pos logit contributions
Reviewed+1.023
spot+0.952
worms+0.945
lication+0.922
lessness+0.910
Neg logit contributions
vier-1.160
ashtra-1.157
amera-1.150
ersen-1.130
itably-1.117
Tokenp
Token ID79
Feature activation+5.97e-02

Pos logit contributions
Reviewed+0.091
spot+0.085
worms+0.085
lication+0.083
lessness+0.082
Neg logit contributions
vier-0.096
ashtra-0.095
amera-0.094
ersen-0.092
orah-0.091
Token B
Token ID347
Feature activation+2.87e-01

Pos logit contributions
vier+1.044
ashtra+1.033
amera+1.028
ersen+1.020
orah+1.000
Neg logit contributions
Reviewed-1.061
spot-0.992
worms-0.980
lication-0.967
lessness-0.944
TokenSP
Token ID4303
Feature activation+8.38e-01

Pos logit contributions
Reviewed+0.465
spot+0.436
worms+0.433
lication+0.426
lessness+0.419
Neg logit contributions
vier-0.438
ashtra-0.435
amera-0.433
ersen-0.424
itably-0.418
Token-
Token ID12
Feature activation+1.73e-01

Pos logit contributions
Reviewed+1.217
spot+1.135
worms+1.125
lication+1.104
lessness+1.086
Neg logit contributions
vier-1.413
ashtra-1.408
amera-1.394
ersen-1.373
itably-1.355
Tokenr
Token ID81
Feature activation-1.82e-01

Pos logit contributions
Reviewed+0.253
spot+0.236
worms+0.235
lication+0.231
lessness+0.227
Neg logit contributions
vier-0.289
ashtra-0.288
amera-0.285
ersen-0.281
itably-0.276
Tokenuled
Token ID6309
Feature activation+9.72e-01

Pos logit contributions
vier+0.266
ashtra+0.266
amera+0.260
ersen+0.253
itably+0.253
Neg logit contributions
Reviewed-0.307
spot-0.290
worms-0.286
lication-0.282
lessness-0.279
Token Uttar
Token ID44744
Feature activation-3.48e+00

Pos logit contributions
Reviewed+1.587
spot+1.497
worms+1.481
lication+1.459
lessness+1.441
Neg logit contributions
vier-1.463
ashtra-1.453
amera-1.440
ersen-1.409
itably-1.392
Token Pradesh
Token ID25945
Feature activation-6.12e-01

Pos logit contributions
ashtra+0.942
vier+0.725
amera+0.651
orah+0.563
ersen+0.517
Neg logit contributions
Reviewed-10.043
worms-9.652
spot-9.566
lication-9.562
lessness-9.417
Token banned
Token ID9301
Feature activation+2.49e-01

Pos logit contributions
ashtra+1.052
vier+1.041
amera+1.028
ersen+1.020
itably+1.005
Neg logit contributions
Reviewed-0.869
worms-0.806
spot-0.805
lication-0.787
lessness-0.760
Token it
Token ID340
Feature activation-2.59e-01

Pos logit contributions
Reviewed+0.405
spot+0.381
worms+0.377
lication+0.372
lessness+0.366
Neg logit contributions
vier-0.375
ashtra-0.373
amera-0.369
ersen-0.363
itably-0.358
Token and
Token ID290
Feature activation+1.73e-02

Pos logit contributions
vier+0.416
ashtra+0.413
amera+0.410
ersen+0.401
itably+0.396
Neg logit contributions
Reviewed-0.400
spot-0.372
worms-0.369
lication-0.364
lessness-0.355
Token And
Token ID843
Feature activation-1.83e+00

Pos logit contributions
Reviewed+0.027
spot+0.026
worms+0.025
lication+0.025
lessness+0.024
Neg logit contributions
ashtra-0.027
vier-0.027
amera-0.027
ersen-0.026
itably-0.026
Token to
Token ID284
Feature activation-2.03e-01

Pos logit contributions
Reviewed+0.963
spot+0.909
worms+0.898
lication+0.886
lessness+0.874
Neg logit contributions
vier-0.933
ashtra-0.929
amera-0.918
ersen-0.899
itably-0.892
Token fly
Token ID6129
Feature activation+2.62e-01

Pos logit contributions
vier+0.311
ashtra+0.311
amera+0.306
ersen+0.304
orah+0.301
Neg logit contributions
Reviewed-0.327
spot-0.303
worms-0.302
lication-0.298
lessness-0.289
Token back
Token ID736
Feature activation-4.98e-01

Pos logit contributions
Reviewed+0.427
spot+0.399
worms+0.395
lication+0.391
lessness+0.381
Neg logit contributions
vier-0.396
ashtra-0.394
amera-0.391
ersen-0.386
itably-0.379
Token to
Token ID284
Feature activation+1.83e-02

Pos logit contributions
vier+0.720
ashtra+0.715
amera+0.707
ersen+0.692
itably+0.684
Neg logit contributions
Reviewed-0.845
spot-0.795
worms-0.790
lication-0.782
lessness-0.760
Token Che
Token ID2580
Feature activation-8.14e-01

Pos logit contributions
Reviewed+0.033
spot+0.031
worms+0.031
lication+0.031
lessness+0.030
Neg logit contributions
vier-0.025
ashtra-0.024
amera-0.024
ersen-0.024
itably-0.023
Tokenchn
Token ID1349
Feature activation-3.46e+00

Pos logit contributions
vier+0.746
amera+0.743
ashtra+0.725
ersen+0.704
orah+0.694
Neg logit contributions
Reviewed-1.818
worms-1.716
spot-1.715
lication-1.698
lessness-1.672
Tokenya
Token ID3972
Feature activation-1.18e+00

Pos logit contributions
amera+4.309
orah+4.283
ersen+4.220
vier+4.174
aida+4.066
Neg logit contributions
Reviewed-6.498
worms-5.848
spot-5.841
lication-5.817
lessness-5.738
Token on
Token ID319
Feature activation+5.43e-02

Pos logit contributions
vier+1.806
ashtra+1.799
amera+1.782
ersen+1.743
itably+1.714
Neg logit contributions
Reviewed-1.887
spot-1.757
worms-1.753
lication-1.717
lessness-1.668
Token Tuesday
Token ID3431
Feature activation+1.96e-01

Pos logit contributions
Reviewed+0.085
spot+0.080
worms+0.079
lication+0.078
lessness+0.077
Neg logit contributions
vier-0.085
ashtra-0.084
amera-0.084
ersen-0.082
itably-0.081
Token night
Token ID1755
Feature activation-1.57e-01

Pos logit contributions
Reviewed+0.286
spot+0.267
worms+0.264
lication+0.259
lessness+0.255
Neg logit contributions
vier-0.329
ashtra-0.327
amera-0.325
ersen-0.319
itably-0.316
Token with
Token ID351
Feature activation-9.99e-02

Pos logit contributions
vier+0.256
ashtra+0.255
amera+0.253
ersen+0.248
itably+0.245
Neg logit contributions
Reviewed-0.238
spot-0.221
worms-0.220
lication-0.215
lessness-0.212
Token
Token ID198
Feature activation+1.47e-01

Pos logit contributions
vier+0.033
ashtra+0.033
amera+0.032
ersen+0.032
itably+0.031
Neg logit contributions
Reviewed-0.035
spot-0.033
worms-0.033
lication-0.032
lessness-0.031
TokenThe
Token ID464
Feature activation+6.52e-01

Pos logit contributions
Reviewed+0.221
worms+0.209
spot+0.208
lication+0.205
lessness+0.202
Neg logit contributions
vier-0.237
ashtra-0.232
amera-0.232
ersen-0.227
itably-0.226
Token truth
Token ID3872
Feature activation+1.07e+00

Pos logit contributions
Reviewed+0.997
spot+0.932
worms+0.922
lication+0.909
lessness+0.894
Neg logit contributions
vier-1.052
ashtra-1.045
amera-1.036
ersen-1.020
itably-1.008
Token is
Token ID318
Feature activation+1.61e-01

Pos logit contributions
Reviewed+1.443
spot+1.352
worms+1.332
lication+1.307
lessness+1.295
Neg logit contributions
vier-1.891
ashtra-1.883
amera-1.866
ersen-1.831
itably-1.813
Token that
Token ID326
Feature activation+3.63e-01

Pos logit contributions
Reviewed+0.246
spot+0.231
worms+0.228
lication+0.225
lessness+0.221
Neg logit contributions
vier-0.258
ashtra-0.256
amera-0.254
ersen-0.249
itably-0.247
Token Bour
Token ID20576
Feature activation-3.42e+00

Pos logit contributions
Reviewed+0.598
spot+0.563
worms+0.558
lication+0.549
lessness+0.542
Neg logit contributions
vier-0.542
ashtra-0.538
amera-0.534
ersen-0.524
itably-0.516
Tokendain
Token ID27162
Feature activation-3.57e-01

Pos logit contributions
vier+5.676
ashtra+5.552
ersen+5.542
amera+5.525
orah+5.396
Neg logit contributions
Reviewed-4.936
worms-4.522
lication-4.466
spot-4.454
lessness-4.300
Token has
Token ID468
Feature activation+3.76e-01

Pos logit contributions
vier+0.621
ashtra+0.615
amera+0.611
ersen+0.605
itably+0.595
Neg logit contributions
Reviewed-0.500
spot-0.461
worms-0.455
lication-0.453
lessness-0.440
Token always
Token ID1464
Feature activation+1.80e-01

Pos logit contributions
Reviewed+0.569
spot+0.532
worms+0.527
lication+0.520
lessness+0.510
Neg logit contributions
vier-0.612
ashtra-0.607
amera-0.602
ersen-0.593
itably-0.585
Token had
Token ID550
Feature activation+4.85e-02

Pos logit contributions
Reviewed+0.268
spot+0.249
worms+0.246
lication+0.243
lessness+0.238
Neg logit contributions
vier-0.296
ashtra-0.293
amera-0.291
ersen-0.287
itably-0.284
Token a
Token ID257
Feature activation+1.55e-01

Pos logit contributions
Reviewed+0.078
spot+0.073
worms+0.073
lication+0.072
lessness+0.070
Neg logit contributions
vier-0.074
ashtra-0.073
amera-0.073
ersen-0.071
itably-0.070
Token recognition
Token ID9465
Feature activation-7.32e-01

Pos logit contributions
vier+1.678
amera+1.663
ersen+1.649
ashtra+1.641
itably+1.629
Neg logit contributions
Reviewed-1.688
worms-1.593
spot-1.562
lication-1.540
lessness-1.500
Token as
Token ID355
Feature activation+2.49e-01

Pos logit contributions
vier+1.112
ashtra+1.096
amera+1.094
ersen+1.078
itably+1.054
Neg logit contributions
Reviewed-1.183
worms-1.113
spot-1.108
lication-1.088
lessness-1.061
Token his
Token ID465
Feature activation-4.69e-01

Pos logit contributions
Reviewed+0.392
spot+0.368
worms+0.364
lication+0.359
lessness+0.353
Neg logit contributions
vier-0.390
ashtra-0.386
amera-0.384
ersen-0.377
itably-0.372
Token highly
Token ID4047
Feature activation+3.80e-01

Pos logit contributions
vier+0.741
ashtra+0.730
amera+0.729
ersen+0.724
itably+0.713
Neg logit contributions
Reviewed-0.731
spot-0.680
worms-0.673
lication-0.665
lessness-0.651
Token paid
Token ID3432
Feature activation-1.37e+00

Pos logit contributions
Reviewed+0.568
spot+0.529
worms+0.527
lication+0.519
lessness+0.509
Neg logit contributions
vier-0.624
ashtra-0.618
amera-0.612
ersen-0.605
itably-0.598
Token linem
Token ID22774
Feature activation-3.42e+00

Pos logit contributions
vier+2.166
amera+2.126
ashtra+2.121
ersen+2.102
itably+2.067
Neg logit contributions
Reviewed-2.107
spot-1.944
lication-1.919
worms-1.916
lessness-1.854
Tokenates
Token ID689
Feature activation-1.11e+00

Pos logit contributions
amera+4.925
ersen+4.897
ashtra+4.861
vier+4.858
ager+4.805
Neg logit contributions
Reviewed-5.685
lication-5.187
worms-5.102
spot-5.079
lessness-5.067
Token David
Token ID3271
Feature activation-3.20e-01

Pos logit contributions
vier+1.772
ashtra+1.758
amera+1.749
ersen+1.729
itably+1.699
Neg logit contributions
Reviewed-1.687
worms-1.575
spot-1.568
lication-1.533
lessness-1.502
Token De
Token ID1024
Feature activation+6.52e-02

Pos logit contributions
vier+0.525
ashtra+0.516
amera+0.509
ersen+0.506
orah+0.501
Neg logit contributions
Reviewed-0.487
worms-0.449
spot-0.448
lication-0.437
lessness-0.428
TokenCast
Token ID19248
Feature activation+7.93e-01

Pos logit contributions
Reviewed+0.100
spot+0.094
worms+0.093
lication+0.091
lessness+0.090
Neg logit contributions
vier-0.105
ashtra-0.104
amera-0.103
ersen-0.101
itably-0.100
Tokenro
Token ID305
Feature activation+6.43e-01

Pos logit contributions
Reviewed+0.944
spot+0.859
worms+0.847
lication+0.845
lessness+0.841
Neg logit contributions
ashtra-1.551
vier-1.522
amera-1.509
ersen-1.473
itably-1.471
Token the
Token ID262
Feature activation+2.00e-01

Pos logit contributions
Reviewed+0.123
spot+0.116
worms+0.115
lication+0.113
lessness+0.112
Neg logit contributions
vier-0.102
ashtra-0.102
amera-0.101
ersen-0.098
itably-0.097
Token Moon
Token ID6869
Feature activation+3.19e-01

Pos logit contributions
Reviewed+0.309
spot+0.289
worms+0.287
lication+0.282
lessness+0.278
Neg logit contributions
vier-0.319
ashtra-0.317
amera-0.314
ersen-0.309
itably-0.305
Token,
Token ID11
Feature activation-2.73e-02

Pos logit contributions
Reviewed+0.465
spot+0.430
worms+0.427
lication+0.422
lessness+0.414
Neg logit contributions
vier-0.537
ashtra-0.532
amera-0.529
ersen-0.522
orah-0.514
Token fighting
Token ID4330
Feature activation+5.49e-02

Pos logit contributions
vier+0.045
ashtra+0.044
amera+0.044
ersen+0.044
itably+0.043
Neg logit contributions
Reviewed-0.041
spot-0.038
worms-0.038
lication-0.037
lessness-0.037
Token Darth
Token ID31152
Feature activation-2.50e+00

Pos logit contributions
Reviewed+0.086
spot+0.081
worms+0.080
lication+0.079
lessness+0.077
Neg logit contributions
vier-0.086
ashtra-0.085
amera-0.085
ersen-0.083
itably-0.082
Token Vader
Token ID27403
Feature activation-3.41e+00

Pos logit contributions
vier+2.442
ashtra+2.435
amera+2.414
orah+2.319
ersen+2.298
Neg logit contributions
Reviewed-5.280
lication-5.005
spot-4.986
worms-4.938
lessness-4.842
Token and
Token ID290
Feature activation+4.21e-02

Pos logit contributions
vier+5.178
ashtra+5.097
amera+5.070
orah+5.011
ersen+4.988
Neg logit contributions
Reviewed-5.319
lication-4.870
spot-4.861
worms-4.793
lessness-4.599
Token wearing
Token ID5762
Feature activation+7.02e-01

Pos logit contributions
Reviewed+0.066
spot+0.062
worms+0.061
lication+0.060
lessness+0.059
Neg logit contributions
vier-0.066
ashtra-0.066
amera-0.065
ersen-0.064
itably-0.063
Token the
Token ID262
Feature activation-6.99e-02

Pos logit contributions
Reviewed+1.113
spot+1.048
worms+1.037
lication+1.020
lessness+1.007
Neg logit contributions
vier-1.086
ashtra-1.081
amera-1.071
ersen-1.050
itably-1.037
Token face
Token ID1986
Feature activation-3.26e-01

Pos logit contributions
vier+0.108
ashtra+0.108
amera+0.106
ersen+0.105
itably+0.103
Neg logit contributions
Reviewed-0.112
spot-0.104
worms-0.103
lication-0.102
lessness-0.101
Token of
Token ID286
Feature activation-2.20e-01

Pos logit contributions
vier+0.542
ashtra+0.536
amera+0.533
ersen+0.523
orah+0.519
Neg logit contributions
Reviewed-0.483
spot-0.446
worms-0.443
lication-0.436
lessness-0.425
Token to
Token ID284
Feature activation-1.70e-01

Pos logit contributions
vier+1.133
ashtra+1.125
amera+1.116
ersen+1.093
itably+1.080
Neg logit contributions
Reviewed-1.253
spot-1.178
worms-1.164
lication-1.151
lessness-1.132
Token sleep
Token ID3993
Feature activation-3.56e-01

Pos logit contributions
vier+0.256
ashtra+0.255
amera+0.252
ersen+0.249
itably+0.245
Neg logit contributions
Reviewed-0.276
spot-0.258
worms-0.257
lication-0.254
lessness-0.249
Token in
Token ID287
Feature activation-2.26e-01

Pos logit contributions
vier+0.582
ashtra+0.581
amera+0.572
ersen+0.563
orah+0.557
Neg logit contributions
Reviewed-0.534
spot-0.492
worms-0.490
lication-0.486
lessness-0.474
Token a
Token ID257
Feature activation+2.53e-02

Pos logit contributions
vier+0.344
ashtra+0.342
amera+0.339
ersen+0.333
itably+0.330
Neg logit contributions
Reviewed-0.365
spot-0.342
worms-0.338
lication-0.334
lessness-0.328
Token U
Token ID471
Feature activation-1.40e+00

Pos logit contributions
Reviewed+0.038
spot+0.035
worms+0.035
lication+0.034
lessness+0.034
Neg logit contributions
vier-0.042
ashtra-0.041
amera-0.041
ersen-0.041
itably-0.040
Tokensonian
Token ID35202
Feature activation-3.36e+00

Pos logit contributions
vier+2.463
ashtra+2.415
amera+2.412
ersen+2.378
itably+2.350
Neg logit contributions
Reviewed-1.960
worms-1.787
spot-1.780
lication-1.773
lessness-1.715
Token home
Token ID1363
Feature activation-1.32e+00

Pos logit contributions
vier+5.238
ersen+5.207
amera+5.180
ashtra+5.098
itably+5.046
Neg logit contributions
Reviewed-5.028
worms-4.706
spot-4.649
lication-4.579
lessness-4.461
Token,
Token ID11
Feature activation-2.16e-01

Pos logit contributions
vier+2.099
ashtra+2.090
amera+2.064
ersen+2.058
itably+2.027
Neg logit contributions
Reviewed-2.004
spot-1.840
worms-1.836
lication-1.826
lessness-1.758
Token just
Token ID655
Feature activation-3.47e-01

Pos logit contributions
vier+0.336
ashtra+0.334
amera+0.332
ersen+0.327
itably+0.320
Neg logit contributions
Reviewed-0.343
spot-0.320
worms-0.317
lication-0.314
lessness-0.308
Token 45
Token ID4153
Feature activation-6.53e-01

Pos logit contributions
vier+0.531
ashtra+0.529
amera+0.525
ersen+0.517
orah+0.510
Neg logit contributions
Reviewed-0.558
spot-0.519
worms-0.517
lication-0.512
lessness-0.499
Token minutes
Token ID2431
Feature activation+8.28e-01

Pos logit contributions
vier+1.136
amera+1.127
ashtra+1.126
ersen+1.103
itably+1.093
Neg logit contributions
Reviewed-0.907
worms-0.844
spot-0.842
lication-0.824
lessness-0.804
Token will
Token ID481
Feature activation+5.43e-01

Pos logit contributions
Reviewed+0.823
spot+0.768
worms+0.763
lication+0.751
lessness+0.734
Neg logit contributions
vier-0.865
ashtra-0.860
amera-0.852
ersen-0.839
itably-0.826
Token never
Token ID1239
Feature activation+3.39e-01

Pos logit contributions
Reviewed+0.868
spot+0.813
worms+0.806
lication+0.797
lessness+0.780
Neg logit contributions
vier-0.838
ashtra-0.834
amera-0.824
ersen-0.812
itably-0.801
Token be
Token ID307
Feature activation-3.40e-01

Pos logit contributions
Reviewed+0.549
spot+0.518
worms+0.512
lication+0.504
lessness+0.498
Neg logit contributions
vier-0.513
ashtra-0.511
amera-0.505
ersen-0.495
itably-0.489
Token celebrated
Token ID13943
Feature activation-5.97e-01

Pos logit contributions
vier+0.532
amera+0.523
ashtra+0.522
ersen+0.514
itably+0.509
Neg logit contributions
Reviewed-0.534
spot-0.499
worms-0.498
lication-0.493
lessness-0.480
Token by
Token ID416
Feature activation-7.93e-02

Pos logit contributions
vier+0.943
ashtra+0.931
amera+0.931
ersen+0.914
itably+0.903
Neg logit contributions
Reviewed-0.930
worms-0.867
spot-0.866
lication-0.849
lessness-0.823
Token Bour
Token ID20576
Feature activation-3.36e+00

Pos logit contributions
vier+0.114
ashtra+0.112
amera+0.112
ersen+0.110
itably+0.109
Neg logit contributions
Reviewed-0.135
spot-0.127
worms-0.126
lication-0.124
lessness-0.123
Tokendain
Token ID27162
Feature activation-4.53e-01

Pos logit contributions
vier+5.552
ashtra+5.443
ersen+5.425
amera+5.380
orah+5.286
Neg logit contributions
Reviewed-4.841
worms-4.484
lication-4.417
spot-4.396
lessness-4.267
Token in
Token ID287
Feature activation-2.29e-01

Pos logit contributions
vier+0.752
ashtra+0.743
amera+0.741
ersen+0.733
itably+0.717
Neg logit contributions
Reviewed-0.672
spot-0.626
worms-0.615
lication-0.611
lessness-0.598
Token an
Token ID281
Feature activation-1.87e-01

Pos logit contributions
vier+0.326
ashtra+0.320
amera+0.320
ersen+0.314
itably+0.312
Neg logit contributions
Reviewed-0.392
spot-0.371
worms-0.368
lication-0.360
lessness-0.358
Token episode
Token ID4471
Feature activation+6.91e-01

Pos logit contributions
vier+0.280
ashtra+0.277
amera+0.276
ersen+0.271
itably+0.267
Neg logit contributions
Reviewed-0.306
spot-0.291
worms-0.288
lication-0.282
lessness-0.278
Token of
Token ID286
Feature activation+5.95e-02

Pos logit contributions
Reviewed+1.060
spot+0.993
worms+0.984
lication+0.968
lessness+0.953
Neg logit contributions
vier-1.106
ashtra-1.099
amera-1.091
ersen-1.071
itably-1.059
Token Falling
Token ID42914
Feature activation+1.57e-02

Pos logit contributions
vier+0.813
ashtra+0.805
amera+0.804
ersen+0.794
itably+0.781
Neg logit contributions
Reviewed-0.783
spot-0.723
worms-0.715
lication-0.710
lessness-0.694
Tokenwater
Token ID7050
Feature activation-2.57e-01

Pos logit contributions
Reviewed+0.025
spot+0.023
worms+0.023
lication+0.023
lessness+0.022
Neg logit contributions
vier-0.025
ashtra-0.024
amera-0.024
ersen-0.024
orah-0.024
Token,
Token ID11
Feature activation-2.02e-01

Pos logit contributions
vier+0.402
ashtra+0.400
amera+0.397
ersen+0.394
orah+0.385
Neg logit contributions
Reviewed-0.404
spot-0.375
worms-0.373
lication-0.369
lessness-0.363
Token aka
Token ID22430
Feature activation-7.84e-01

Pos logit contributions
vier+0.309
ashtra+0.309
amera+0.306
ersen+0.302
itably+0.296
Neg logit contributions
Reviewed-0.324
spot-0.301
worms-0.299
lication-0.296
lessness-0.292
Token The
Token ID383
Feature activation-6.30e-01

Pos logit contributions
vier+1.197
ashtra+1.186
amera+1.179
ersen+1.160
itably+1.141
Neg logit contributions
Reviewed-1.266
spot-1.180
worms-1.168
lication-1.154
lessness-1.134
Token Edgar
Token ID29166
Feature activation-3.33e+00

Pos logit contributions
vier+1.002
ashtra+0.988
amera+0.985
ersen+0.970
orah+0.956
Neg logit contributions
Reviewed-0.981
spot-0.906
worms-0.900
lication-0.892
lessness-0.875
Token J
Token ID449
Feature activation-4.17e-01

Pos logit contributions
vier+5.042
amera+5.039
ersen+5.039
ashtra+4.989
orah+4.892
Neg logit contributions
Reviewed-5.249
lication-4.819
spot-4.816
worms-4.788
lessness-4.592
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
vier+0.704
ashtra+0.696
amera+0.695
ersen+0.688
orah+0.677
Neg logit contributions
Reviewed-0.610
spot-0.559
worms-0.556
lication-0.548
lessness-0.537
Token Kau
Token ID28148
Feature activation-1.25e+00

Pos logit contributions
Reviewed+1.502
spot+1.403
worms+1.387
lication+1.364
lessness+1.343
Neg logit contributions
vier-1.734
ashtra-1.726
amera-1.710
ersen-1.681
itably-1.662
Tokenf
Token ID69
Feature activation-5.81e-01

Pos logit contributions
vier+2.498
amera+2.441
ersen+2.438
ashtra+2.428
orah+2.387
Neg logit contributions
Reviewed-1.456
spot-1.303
worms-1.289
lication-1.263
lessness-1.239
Tokenmann
Token ID9038
Feature activation+2.89e-01

Pos logit contributions
vier+1.429
amera+1.408
ersen+1.405
ashtra+1.400
itably+1.378
Neg logit contributions
Reviewed-0.414
worms-0.340
spot-0.339
lication-0.324
lessness-0.307
Token losing
Token ID6078
Feature activation+2.37e+00

Pos logit contributions
Reviewed+0.063
worms+0.059
lication+0.059
spot+0.059
lessness+0.057
Neg logit contributions
vier-0.045
ashtra-0.045
amera-0.045
ersen-0.044
itably-0.043
Token ground
Token ID2323
Feature activation+2.49e-01

Pos logit contributions
Reviewed+3.547
spot+3.342
worms+3.290
lication+3.237
lessness+3.204
Neg logit contributions
vier-3.858
ashtra-3.855
amera-3.809
ersen-3.736
itably-3.701
Token to
Token ID284
Feature activation-8.26e-02

Pos logit contributions
Reviewed+0.403
spot+0.374
worms+0.374
lication+0.368
lessness+0.359
Neg logit contributions
vier-0.384
ashtra-0.377
amera-0.375
ersen-0.367
itably-0.363
Token rivals
Token ID14987
Feature activation+3.30e-01

Pos logit contributions
vier+0.136
ashtra+0.135
amera+0.134
ersen+0.132
itably+0.130
Neg logit contributions
Reviewed-0.124
worms-0.114
spot-0.114
lication-0.112
lessness-0.110
Token like
Token ID588
Feature activation-3.20e-01

Pos logit contributions
Reviewed+0.541
spot+0.509
worms+0.505
lication+0.497
lessness+0.490
Neg logit contributions
vier-0.494
ashtra-0.491
amera-0.486
ersen-0.477
itably-0.470
Token Dom
Token ID9666
Feature activation-3.29e+00

Pos logit contributions
vier+0.523
ashtra+0.521
amera+0.515
ersen+0.508
itably+0.500
Neg logit contributions
Reviewed-0.483
spot-0.448
worms-0.447
lication-0.440
lessness-0.432
Tokenino
Token ID2879
Feature activation+1.75e-02

Pos logit contributions
vier+3.981
ersen+3.882
ashtra+3.858
orah+3.826
amera+3.792
Neg logit contributions
Reviewed-6.299
worms-5.917
lication-5.786
spot-5.774
lessness-5.599
Token's
Token ID338
Feature activation-6.86e-01

Pos logit contributions
Reviewed+0.025
spot+0.024
worms+0.023
lication+0.023
lessness+0.023
Neg logit contributions
vier-0.029
ashtra-0.029
amera-0.029
ersen-0.029
orah-0.028
Token Pizza
Token ID20952
Feature activation-4.40e-01

Pos logit contributions
vier+1.088
ashtra+1.082
amera+1.066
ersen+1.053
orah+1.037
Neg logit contributions
Reviewed-1.064
spot-0.993
worms-0.991
lication-0.971
lessness-0.958
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
vier+0.732
ashtra+0.729
amera+0.719
ersen+0.711
orah+0.699
Neg logit contributions
Reviewed-0.649
spot-0.605
worms-0.600
lication-0.589
lessness-0.580
Token
Token ID198
Feature activation-2.73e-01

Pos logit contributions
vier+0.359
ashtra+0.355
amera+0.351
ersen+0.346
itably+0.340
Neg logit contributions
Reviewed-0.367
spot-0.343
worms-0.342
lication-0.335
lessness-0.331
Token C
Token ID327
Feature activation+8.31e-03

Pos logit contributions
Reviewed+0.849
spot+0.795
worms+0.786
lication+0.774
lessness+0.762
Neg logit contributions
vier-0.914
ashtra-0.909
amera-0.900
ersen-0.884
itably-0.874
Tokenrouch
Token ID48626
Feature activation+1.51e-01

Pos logit contributions
Reviewed+0.013
spot+0.013
worms+0.012
lication+0.012
lessness+0.012
Neg logit contributions
vier-0.013
ashtra-0.013
amera-0.012
ersen-0.012
itably-0.012
Token reports
Token ID3136
Feature activation-1.80e-02

Pos logit contributions
Reviewed+0.226
spot+0.211
worms+0.209
lication+0.207
lessness+0.202
Neg logit contributions
vier-0.250
ashtra-0.247
amera-0.245
ersen-0.242
itably-0.238
Token from
Token ID422
Feature activation+1.34e+00

Pos logit contributions
vier+0.030
ashtra+0.030
amera+0.030
ersen+0.029
itably+0.029
Neg logit contributions
Reviewed-0.026
spot-0.025
worms-0.024
lication-0.024
lessness-0.024
Token Van
Token ID6656
Feature activation+1.30e-01

Pos logit contributions
Reviewed+1.987
spot+1.871
worms+1.846
lication+1.811
lessness+1.794
Neg logit contributions
vier-2.207
ashtra-2.207
amera-2.184
ersen-2.136
itably-2.116
Token Nu
Token ID21733
Feature activation-3.28e+00

Pos logit contributions
Reviewed+0.236
spot+0.223
worms+0.221
lication+0.218
lessness+0.216
Neg logit contributions
vier-0.173
ashtra-0.172
amera-0.170
ersen-0.166
itably-0.164
Tokenys
Token ID893
Feature activation-1.12e+00

Pos logit contributions
vier+5.597
amera+5.470
ashtra+5.457
orah+5.424
ersen+5.381
Neg logit contributions
Reviewed-4.736
lication-4.236
worms-4.190
spot-4.131
lessness-3.985
Token for
Token ID329
Feature activation+4.14e-01

Pos logit contributions
vier+1.868
ashtra+1.860
amera+1.838
ersen+1.805
orah+1.792
Neg logit contributions
Reviewed-1.617
worms-1.512
spot-1.507
lication-1.472
lessness-1.444
Token the
Token ID262
Feature activation+2.54e-02

Pos logit contributions
Reviewed+0.705
spot+0.666
worms+0.659
lication+0.650
lessness+0.642
Neg logit contributions
vier-0.591
ashtra-0.590
amera-0.581
ersen-0.570
itably-0.563
Token NBC
Token ID9716
Feature activation-7.41e-01

Pos logit contributions
Reviewed+0.044
spot+0.042
worms+0.042
lication+0.041
lessness+0.040
Neg logit contributions
vier-0.035
amera-0.035
ashtra-0.035
ersen-0.034
orah-0.033
Token4
Token ID19
Feature activation+1.52e-01

Pos logit contributions
amera+1.161
vier+1.157
ashtra+1.156
orah+1.119
itably+1.114
Neg logit contributions
Reviewed-1.159
worms-1.079
spot-1.071
lication-1.065
lessness-1.033
Token cases
Token ID2663
Feature activation+4.50e-01

Pos logit contributions
vier+0.267
ashtra+0.264
amera+0.262
ersen+0.260
itably+0.257
Neg logit contributions
Reviewed-0.270
spot-0.252
worms-0.251
lication-0.248
lessness-0.239
Token of
Token ID286
Feature activation+1.77e-01

Pos logit contributions
Reviewed+0.718
spot+0.676
worms+0.668
lication+0.658
lessness+0.649
Neg logit contributions
vier-0.695
ashtra-0.691
amera-0.685
ersen-0.671
itably-0.663
Token severe
Token ID6049
Feature activation-2.37e-02

Pos logit contributions
Reviewed+0.290
spot+0.271
worms+0.269
lication+0.266
lessness+0.260
Neg logit contributions
vier-0.266
ashtra-0.264
amera-0.262
ersen-0.258
itably-0.254
Token treatment
Token ID3513
Feature activation+4.10e-01

Pos logit contributions
ashtra+0.038
vier+0.037
amera+0.037
ersen+0.037
itably+0.036
Neg logit contributions
Reviewed-0.037
spot-0.034
worms-0.034
lication-0.033
lessness-0.032
Token of
Token ID286
Feature activation+5.20e-02

Pos logit contributions
Reviewed+0.651
spot+0.610
worms+0.604
lication+0.596
lessness+0.584
Neg logit contributions
vier-0.637
ashtra-0.633
amera-0.626
ersen-0.617
itably-0.608
Token tourists
Token ID15930
Feature activation+2.20e+00

Pos logit contributions
Reviewed+0.084
spot+0.079
worms+0.078
lication+0.077
lessness+0.075
Neg logit contributions
vier-0.079
ashtra-0.078
amera-0.078
ersen-0.077
itably-0.076
Token have
Token ID423
Feature activation-1.75e-01

Pos logit contributions
Reviewed+3.371
spot+3.209
worms+3.175
lessness+3.101
lication+3.075
Neg logit contributions
ashtra-3.447
vier-3.441
amera-3.427
ersen-3.332
itably-3.306
Token so
Token ID523
Feature activation+3.10e-01

Pos logit contributions
vier+0.274
ashtra+0.272
amera+0.271
ersen+0.266
itably+0.263
Neg logit contributions
Reviewed-0.276
spot-0.257
worms-0.255
lication-0.253
lessness-0.246
Token far
Token ID1290
Feature activation+2.40e-01

Pos logit contributions
Reviewed+0.441
spot+0.411
worms+0.407
lication+0.401
lessness+0.393
Neg logit contributions
vier-0.531
ashtra-0.528
amera-0.524
ersen-0.515
itably-0.509
Token mostly
Token ID4632
Feature activation+4.83e-01

Pos logit contributions
Reviewed+0.373
spot+0.348
worms+0.345
lication+0.341
lessness+0.333
Neg logit contributions
vier-0.382
ashtra-0.379
amera-0.376
ersen-0.369
itably-0.365
Token involved
Token ID2950
Feature activation-4.24e-01

Pos logit contributions
Reviewed+0.782
spot+0.732
worms+0.726
lication+0.716
lessness+0.704
Neg logit contributions
vier-0.736
ashtra-0.733
amera-0.726
ersen-0.712
itably-0.702
Token 4
Token ID604
Feature activation+1.37e+00

Pos logit contributions
Reviewed+1.671
spot+1.569
worms+1.531
lication+1.509
lessness+1.478
Neg logit contributions
vier-2.154
ashtra-2.143
amera-2.107
ersen-2.089
itably-2.071
Token 5
Token ID642
Feature activation+9.60e-01

Pos logit contributions
Reviewed+1.755
spot+1.657
worms+1.605
lication+1.581
lessness+1.536
Neg logit contributions
ashtra-2.509
vier-2.502
amera-2.466
ersen-2.427
orah-2.425
Token 6
Token ID718
Feature activation+2.28e+00

Pos logit contributions
Reviewed+1.197
spot+1.122
worms+1.089
lication+1.083
lessness+1.052
Neg logit contributions
vier-1.798
ashtra-1.796
amera-1.765
ersen-1.750
orah-1.735
Token 7
Token ID767
Feature activation+1.36e+00

Pos logit contributions
Reviewed+2.691
spot+2.612
worms+2.471
lication+2.434
Spot+2.361
Neg logit contributions
ashtra-4.360
vier-4.321
amera-4.252
orah-4.207
itably-4.157
Token 8
Token ID807
Feature activation+1.37e+00

Pos logit contributions
Reviewed+1.777
spot+1.684
worms+1.622
lication+1.612
lessness+1.581
Neg logit contributions
ashtra-2.479
vier-2.472
amera-2.419
ersen-2.388
orah-2.378
Token 9
Token ID860
Feature activation+2.14e+00

Pos logit contributions
Reviewed+1.738
spot+1.671
worms+1.613
lication+1.583
lessness+1.550
Neg logit contributions
ashtra-2.508
vier-2.495
amera-2.463
ersen-2.412
itably-2.404
Token 10
Token ID838
Feature activation+8.76e-01

Pos logit contributions
Reviewed+2.736
spot+2.614
worms+2.515
lication+2.494
lessness+2.424
Neg logit contributions
vier-3.906
ashtra-3.900
amera-3.807
ersen-3.782
orah-3.768
Token 11
Token ID1367
Feature activation+4.12e-01

Pos logit contributions
Reviewed+1.297
spot+1.236
worms+1.202
lication+1.196
lessness+1.163
Neg logit contributions
vier-1.440
ashtra-1.431
amera-1.409
ersen-1.381
itably-1.373
Token 12
Token ID1105
Feature activation+4.21e-01

Pos logit contributions
Reviewed+0.551
spot+0.513
worms+0.504
lication+0.496
lessness+0.487
Neg logit contributions
vier-0.740
ashtra-0.737
amera-0.730
ersen-0.718
itably-0.712
Token 13
Token ID1511
Feature activation+1.54e-01

Pos logit contributions
Reviewed+0.564
spot+0.534
worms+0.520
lication+0.512
lessness+0.498
Neg logit contributions
vier-0.751
ashtra-0.749
amera-0.742
ersen-0.725
itably-0.722
Token 14
Token ID1478
Feature activation+7.40e-01

Pos logit contributions
Reviewed+0.221
spot+0.209
worms+0.204
lication+0.202
lessness+0.198
Neg logit contributions
vier-0.261
ashtra-0.260
amera-0.257
ersen-0.253
orah-0.251
Token.
Token ID13
Feature activation-5.11e-02

Pos logit contributions
vier+0.906
ashtra+0.899
amera+0.894
ersen+0.876
itably+0.868
Neg logit contributions
Reviewed-0.913
spot-0.855
worms-0.852
lication-0.837
lessness-0.822
Token It
Token ID632
Feature activation-9.00e-01

Pos logit contributions
vier+0.082
ashtra+0.081
amera+0.081
ersen+0.079
itably+0.078
Neg logit contributions
Reviewed-0.078
spot-0.073
worms-0.073
lication-0.072
lessness-0.070
Token
Token ID447
Feature activation-3.08e-01

Pos logit contributions
vier+1.526
ashtra+1.515
amera+1.504
ersen+1.482
orah+1.466
Neg logit contributions
Reviewed-1.296
spot-1.193
worms-1.179
lication-1.173
lessness-1.133
Token
Token ID247
Feature activation-2.79e-01

Pos logit contributions
vier+0.463
amera+0.457
ashtra+0.455
ersen+0.447
orah+0.441
Neg logit contributions
Reviewed-0.502
spot-0.471
worms-0.469
lication-0.461
lessness-0.457
Tokens
Token ID82
Feature activation+5.13e-01

Pos logit contributions
vier+0.475
ashtra+0.469
amera+0.467
ersen+0.459
itably+0.453
Neg logit contributions
Reviewed-0.400
spot-0.372
worms-0.370
lication-0.364
lessness-0.357
Token hard
Token ID1327
Feature activation+3.03e+00

Pos logit contributions
Reviewed+0.810
spot+0.760
worms+0.752
lication+0.742
lessness+0.730
Neg logit contributions
vier-0.801
ashtra-0.798
amera-0.789
ersen-0.774
itably-0.765
Token to
Token ID284
Feature activation+1.95e-01

Pos logit contributions
Reviewed+4.458
spot+4.280
worms+4.143
lessness+4.116
lication+4.088
Neg logit contributions
vier-4.858
ashtra-4.847
amera-4.795
itably-4.643
orah-4.634
Token believe
Token ID1975
Feature activation+4.74e-02

Pos logit contributions
Reviewed+0.316
spot+0.296
worms+0.295
lication+0.291
lessness+0.285
Neg logit contributions
vier-0.295
ashtra-0.293
amera-0.290
ersen-0.285
itably-0.282
Token that
Token ID326
Feature activation+1.58e-01

Pos logit contributions
Reviewed+0.076
spot+0.071
worms+0.071
lication+0.070
lessness+0.068
Neg logit contributions
vier-0.073
ashtra-0.072
amera-0.072
ersen-0.071
itably-0.069
Token of
Token ID286
Feature activation-8.56e-03

Pos logit contributions
Reviewed+0.253
spot+0.237
worms+0.235
lication+0.232
lessness+0.227
Neg logit contributions
vier-0.245
ashtra-0.243
amera-0.242
ersen-0.237
itably-0.233
Token Arkansas
Token ID14538
Feature activation-1.16e-01

Pos logit contributions
vier+0.012
ashtra+0.011
amera+0.011
ersen+0.011
itably+0.011
Neg logit contributions
Reviewed-0.015
spot-0.015
worms-0.014
lication-0.014
lessness-0.014
Token seven
Token ID3598
Feature activation+4.89e-01

Pos logit contributions
Reviewed+1.276
spot+1.198
worms+1.185
lication+1.163
lessness+1.150
Neg logit contributions
vier-1.338
ashtra-1.332
amera-1.319
ersen-1.293
itably-1.277
Token years
Token ID812
Feature activation-3.03e-01

Pos logit contributions
Reviewed+0.703
spot+0.654
worms+0.645
lication+0.639
lessness+0.624
Neg logit contributions
vier-0.832
ashtra-0.828
amera-0.820
ersen-0.807
itably-0.799
Token old
Token ID1468
Feature activation+5.44e-01

Pos logit contributions
vier+0.481
ashtra+0.480
amera+0.474
ersen+0.464
itably+0.459
Neg logit contributions
Reviewed-0.469
spot-0.442
worms-0.438
lication-0.429
lessness-0.425
Token,"
Token ID553
Feature activation-5.60e-01

Pos logit contributions
Reviewed+0.787
spot+0.733
worms+0.724
lication+0.714
lessness+0.700
Neg logit contributions
vier-0.922
ashtra-0.918
amera-0.909
ersen-0.895
itably-0.884
Token Florida
Token ID4744
Feature activation+2.72e-01

Pos logit contributions
vier+0.880
ashtra+0.877
amera+0.870
ersen+0.860
itably+0.845
Neg logit contributions
Reviewed-0.873
spot-0.816
worms-0.808
lication-0.797
lessness-0.782
Token Eagle
Token ID18456
Feature activation+2.11e+00

Pos logit contributions
Reviewed+0.441
spot+0.413
worms+0.411
lication+0.404
lessness+0.396
Neg logit contributions
vier-0.415
ashtra-0.413
amera-0.408
ersen-0.402
itably-0.396
Token Forum
Token ID14867
Feature activation+5.08e-01

Pos logit contributions
Reviewed+3.300
spot+3.119
worms+3.076
lication+3.021
lessness+2.981
Neg logit contributions
ashtra-3.300
vier-3.296
amera-3.265
ersen-3.203
itably-3.164
Token lobbyist
Token ID35196
Feature activation+8.40e-01

Pos logit contributions
Reviewed+0.749
spot+0.699
worms+0.691
lication+0.682
lessness+0.671
Neg logit contributions
vier-0.845
ashtra-0.842
amera-0.833
ersen-0.820
itably-0.811
Token Randy
Token ID21248
Feature activation-1.74e-01

Pos logit contributions
Reviewed+1.256
spot+1.178
worms+1.162
lication+1.143
lessness+1.130
Neg logit contributions
vier-1.376
ashtra-1.372
amera-1.357
ersen-1.332
itably-1.316
Token Osborne
Token ID29667
Feature activation+9.36e-01

Pos logit contributions
vier+0.285
ashtra+0.282
amera+0.279
ersen+0.276
itably+0.272
Neg logit contributions
Reviewed-0.263
spot-0.246
worms-0.243
lication-0.240
lessness-0.235
Token said
Token ID531
Feature activation-5.45e-01

Pos logit contributions
Reviewed+1.228
spot+1.143
worms+1.129
lication+1.105
lessness+1.090
Neg logit contributions
vier-1.698
ashtra-1.696
amera-1.680
ersen-1.649
itably-1.635
Token a
Token ID257
Feature activation-8.64e-02

Pos logit contributions
Reviewed+0.483
spot+0.452
worms+0.448
lication+0.442
lessness+0.434
Neg logit contributions
vier-0.471
ashtra-0.467
amera-0.466
ersen-0.455
itably-0.450
Token single
Token ID2060
Feature activation-4.61e-01

Pos logit contributions
vier+0.152
ashtra+0.151
amera+0.151
ersen+0.149
itably+0.147
Neg logit contributions
Reviewed-0.119
worms-0.109
spot-0.109
lication-0.107
lessness-0.105
Token picture
Token ID4286
Feature activation-1.59e+00

Pos logit contributions
vier+0.679
ashtra+0.674
amera+0.672
ersen+0.661
itably+0.650
Neg logit contributions
Reviewed-0.767
worms-0.717
spot-0.714
lication-0.704
lessness-0.690
Token.
Token ID13
Feature activation-1.68e-01

Pos logit contributions
amera+2.497
vier+2.444
ashtra+2.428
ersen+2.396
ager+2.362
Neg logit contributions
Reviewed-2.476
worms-2.301
spot-2.256
lication-2.250
lessness-2.179
Token Ranger
Token ID21888
Feature activation-4.05e-01

Pos logit contributions
vier+0.244
ashtra+0.242
amera+0.242
ersen+0.237
orah+0.233
Neg logit contributions
Reviewed-0.280
worms-0.264
spot-0.264
lication-0.259
lessness-0.254
Token 6
Token ID718
Feature activation+2.05e+00

Pos logit contributions
vier+0.726
ashtra+0.725
amera+0.720
ersen+0.706
itably+0.699
Neg logit contributions
Reviewed-0.544
spot-0.504
worms-0.501
lication-0.488
lessness-0.478
Token was
Token ID373
Feature activation-1.04e-01

Pos logit contributions
Reviewed+2.799
spot+2.611
worms+2.560
lication+2.513
lessness+2.493
Neg logit contributions
vier-3.627
ashtra-3.595
amera-3.546
ersen-3.470
itably-3.459
Token definitely
Token ID4753
Feature activation-2.81e-01

Pos logit contributions
vier+0.162
ashtra+0.162
amera+0.162
ersen+0.158
itably+0.156
Neg logit contributions
Reviewed-0.162
spot-0.151
worms-0.151
lication-0.149
lessness-0.145
Token a
Token ID257
Feature activation+9.19e-02

Pos logit contributions
ashtra+0.421
vier+0.420
amera+0.420
ersen+0.411
itably+0.405
Neg logit contributions
Reviewed-0.458
worms-0.425
spot-0.423
lication-0.420
lessness-0.409
Token very
Token ID845
Feature activation-7.02e-01

Pos logit contributions
Reviewed+0.150
spot+0.140
worms+0.139
lication+0.137
lessness+0.135
Neg logit contributions
vier-0.139
ashtra-0.138
amera-0.137
ersen-0.135
itably-0.133
Token successful
Token ID4388
Feature activation-2.28e-01

Pos logit contributions
vier+0.809
ashtra+0.801
amera+0.793
ersen+0.785
itably+0.765
Neg logit contributions
Reviewed-1.393
spot-1.313
worms-1.309
lication-1.302
lessness-1.275
Token Lab
Token ID3498
Feature activation+7.36e-01

Pos logit contributions
vier+0.074
ashtra+0.073
amera+0.073
ersen+0.071
itably+0.070
Neg logit contributions
Reviewed-0.080
spot-0.075
worms-0.074
lication-0.074
lessness-0.072
Tokenarc
Token ID5605
Feature activation+6.18e-01

Pos logit contributions
Reviewed+1.146
spot+1.079
worms+1.065
lication+1.051
lessness+1.035
Neg logit contributions
vier-1.160
ashtra-1.156
amera-1.144
ersen-1.121
itably-1.109
Tokena
Token ID64
Feature activation+1.96e-01

Pos logit contributions
Reviewed+0.830
spot+0.769
worms+0.760
lication+0.745
lessness+0.732
Neg logit contributions
vier-1.113
ashtra-1.106
amera-1.099
ersen-1.082
itably-1.069
Token's
Token ID338
Feature activation-4.20e-01

Pos logit contributions
Reviewed+0.278
spot+0.260
worms+0.257
lication+0.253
lessness+0.249
Neg logit contributions
vier-0.337
ashtra-0.336
amera-0.333
ersen-0.327
itably-0.324
Token latest
Token ID3452
Feature activation+1.24e+00

Pos logit contributions
vier+0.709
ashtra+0.702
amera+0.700
ersen+0.683
itably+0.679
Neg logit contributions
Reviewed-0.615
spot-0.567
worms-0.565
lication-0.555
lessness-0.548
Token book
Token ID1492
Feature activation+6.62e-01

Pos logit contributions
Reviewed+1.587
spot+1.472
worms+1.450
lication+1.421
lessness+1.398
Neg logit contributions
vier-2.313
ashtra-2.306
amera-2.284
ersen-2.250
itably-2.226
Token appears
Token ID3568
Feature activation+1.27e+00

Pos logit contributions
Reviewed+0.977
spot+0.912
worms+0.902
lication+0.889
lessness+0.874
Neg logit contributions
vier-1.100
ashtra-1.093
amera-1.083
ersen-1.064
itably-1.052
Token to
Token ID284
Feature activation-1.35e-01

Pos logit contributions
Reviewed+1.914
spot+1.810
worms+1.784
lessness+1.752
lication+1.748
Neg logit contributions
vier-2.044
ashtra-2.041
amera-2.013
ersen-1.984
itably-1.955
Token show
Token ID905
Feature activation+9.53e-01

Pos logit contributions
vier+0.196
ashtra+0.195
amera+0.193
ersen+0.189
itably+0.186
Neg logit contributions
Reviewed-0.227
spot-0.214
worms-0.212
lication-0.209
lessness-0.207
Token him
Token ID683
Feature activation-2.31e-01

Pos logit contributions
Reviewed+1.493
spot+1.402
worms+1.390
lication+1.366
lessness+1.349
Neg logit contributions
vier-1.491
ashtra-1.488
amera-1.471
ersen-1.441
itably-1.427
Token doing
Token ID1804
Feature activation+6.78e-01

Pos logit contributions
vier+0.374
ashtra+0.372
amera+0.367
ersen+0.363
orah+0.359
Neg logit contributions
Reviewed-0.353
spot-0.327
worms-0.324
lication-0.320
lessness-0.312
Token
Token ID198
Feature activation-1.04e-01

Pos logit contributions
vier+0.500
ashtra+0.491
amera+0.486
ersen+0.480
itably+0.475
Neg logit contributions
Reviewed-0.536
spot-0.503
worms-0.502
lication-0.496
lessness-0.482
Token
Token ID447
Feature activation-1.93e-01

Pos logit contributions
vier+0.169
ashtra+0.165
amera+0.165
ersen+0.162
itably+0.161
Neg logit contributions
Reviewed-0.154
spot-0.146
worms-0.145
lication-0.143
lessness-0.140
Token
Token ID250
Feature activation-1.17e-01

Pos logit contributions
vier+0.304
ashtra+0.302
amera+0.300
ersen+0.295
orah+0.290
Neg logit contributions
Reviewed-0.299
spot-0.280
worms-0.278
lication-0.273
lessness-0.271
TokenToday
Token ID8888
Feature activation+2.77e-01

Pos logit contributions
vier+0.183
ashtra+0.181
amera+0.180
ersen+0.176
itably+0.174
Neg logit contributions
Reviewed-0.185
spot-0.174
worms-0.174
lication-0.171
lessness-0.168
Token there
Token ID612
Feature activation-4.33e-01

Pos logit contributions
Reviewed+0.404
spot+0.378
worms+0.373
lication+0.368
lessness+0.362
Neg logit contributions
vier-0.464
ashtra-0.463
amera-0.458
ersen-0.449
itably-0.445
Token is
Token ID318
Feature activation+1.94e-01

Pos logit contributions
vier+0.699
ashtra+0.695
amera+0.691
ersen+0.676
itably+0.669
Neg logit contributions
Reviewed-0.659
spot-0.616
worms-0.612
lication-0.600
lessness-0.590
Token a
Token ID257
Feature activation+1.00e-01

Pos logit contributions
Reviewed+0.311
spot+0.292
worms+0.290
lication+0.285
lessness+0.280
Neg logit contributions
vier-0.298
ashtra-0.296
amera-0.293
ersen-0.287
itably-0.285
Token general
Token ID2276
Feature activation-1.12e+00

Pos logit contributions
Reviewed+0.150
worms+0.139
spot+0.139
lication+0.135
lessness+0.133
Neg logit contributions
vier-0.167
ashtra-0.165
amera-0.164
ersen-0.161
itably-0.159
Token consensus
Token ID11529
Feature activation+8.95e-01

Pos logit contributions
vier+1.671
ashtra+1.661
amera+1.634
ersen+1.623
orah+1.595
Neg logit contributions
Reviewed-1.850
worms-1.738
spot-1.717
lication-1.673
lessness-1.614
Token that
Token ID326
Feature activation+5.32e-02

Pos logit contributions
Reviewed+1.357
spot+1.275
worms+1.255
lication+1.240
lessness+1.228
Neg logit contributions
vier-1.446
ashtra-1.440
amera-1.426
ersen-1.397
itably-1.383
Token equipment
Token ID5112
Feature activation-1.94e-01

Pos logit contributions
Reviewed+0.087
spot+0.081
worms+0.081
lication+0.079
lessness+0.078
Neg logit contributions
vier-0.080
ashtra-0.079
amera-0.079
ersen-0.078
itably-0.077
Token of
Token ID286
Feature activation-1.68e-01

Pos logit contributions
Reviewed+1.515
spot+1.443
worms+1.416
lessness+1.414
lication+1.393
Neg logit contributions
vier-1.750
ashtra-1.744
amera-1.737
ersen-1.698
itably-1.670
Token their
Token ID511
Feature activation+4.25e-01

Pos logit contributions
vier+0.217
ashtra+0.216
amera+0.213
ersen+0.209
itably+0.206
Neg logit contributions
Reviewed-0.312
spot-0.295
worms-0.293
lication-0.289
lessness-0.284
Token daughter
Token ID4957
Feature activation+1.07e-01

Pos logit contributions
Reviewed+0.700
spot+0.657
worms+0.653
lication+0.639
lessness+0.628
Neg logit contributions
vier-0.636
ashtra-0.635
amera-0.627
ersen-0.619
itably-0.609
Token
Token ID447
Feature activation-1.06e-01

Pos logit contributions
Reviewed+0.154
spot+0.144
worms+0.142
lication+0.140
lessness+0.137
Neg logit contributions
vier-0.181
ashtra-0.180
amera-0.179
ersen-0.175
itably-0.173
Token
Token ID247
Feature activation+3.75e-02

Pos logit contributions
vier+0.169
ashtra+0.168
amera+0.167
ersen+0.164
itably+0.162
Neg logit contributions
Reviewed-0.163
spot-0.153
worms-0.152
lication-0.148
lessness-0.147
Tokens
Token ID82
Feature activation+4.57e-01

Pos logit contributions
Reviewed+0.049
spot+0.045
worms+0.045
lication+0.044
lessness+0.043
Neg logit contributions
vier-0.069
ashtra-0.069
amera-0.068
ersen-0.067
itably-0.066
Token distress
Token ID17087
Feature activation+5.96e-01

Pos logit contributions
Reviewed+0.709
spot+0.662
worms+0.657
lication+0.644
lessness+0.630
Neg logit contributions
vier-0.725
ashtra-0.723
amera-0.714
ersen-0.706
itably-0.695
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
Reviewed+0.836
spot+0.778
worms+0.770
lication+0.756
lessness+0.743
Neg logit contributions
vier-1.034
ashtra-1.029
amera-1.020
ersen-1.003
itably-0.992
Token
Token ID564
Feature activation+6.22e-01

Pos logit contributions
Reviewed+0.232
spot+0.218
worms+0.215
lication+0.212
lessness+0.209
Neg logit contributions
vier-0.238
ashtra-0.237
amera-0.235
ersen-0.230
itably-0.228
Token
Token ID250
Feature activation-1.17e-01

Pos logit contributions
Reviewed+0.799
spot+0.739
worms+0.724
lication+0.710
lessness+0.699
Neg logit contributions
vier-1.160
ashtra-1.153
amera-1.147
ersen-1.122
itably-1.112
TokenM
Token ID44
Feature activation+2.85e-01

Pos logit contributions
vier+0.188
ashtra+0.187
amera+0.185
ersen+0.182
itably+0.181
Neg logit contributions
Reviewed-0.177
spot-0.168
worms-0.167
lication-0.163
lessness-0.161
Token New
Token ID968
Feature activation+4.92e-01

Pos logit contributions
vier+0.176
ashtra+0.171
amera+0.171
ersen+0.168
itably+0.165
Neg logit contributions
Reviewed-0.183
worms-0.172
spot-0.172
lication-0.169
lessness-0.166
Token York
Token ID1971
Feature activation-1.11e+00

Pos logit contributions
Reviewed+0.763
spot+0.714
worms+0.707
lication+0.697
lessness+0.687
Neg logit contributions
vier-0.783
ashtra-0.776
amera-0.772
ersen-0.756
itably-0.747
Token Times
Token ID3782
Feature activation+3.69e-01

Pos logit contributions
vier+1.246
amera+1.204
ashtra+1.184
orah+1.181
ersen+1.170
Neg logit contributions
Reviewed-2.216
spot-2.105
worms-2.084
lication-2.075
lessness-2.052
Token portrayed
Token ID19152
Feature activation-5.27e-01

Pos logit contributions
Reviewed+0.537
spot+0.501
worms+0.495
lication+0.489
lessness+0.480
Neg logit contributions
vier-0.622
ashtra-0.617
amera-0.612
ersen-0.601
itably-0.596
Token him
Token ID683
Feature activation+4.67e-02

Pos logit contributions
vier+0.815
ashtra+0.808
amera+0.803
ersen+0.792
itably+0.779
Neg logit contributions
Reviewed-0.838
spot-0.785
worms-0.775
lication-0.763
lessness-0.748
Token as
Token ID355
Feature activation+1.10e+00

Pos logit contributions
Reviewed+0.072
spot+0.067
worms+0.067
lication+0.065
lessness+0.065
Neg logit contributions
vier-0.075
ashtra-0.074
amera-0.074
ersen-0.072
itably-0.072
Token a
Token ID257
Feature activation+2.12e-01

Pos logit contributions
Reviewed+1.704
spot+1.608
worms+1.589
lication+1.557
lessness+1.551
Neg logit contributions
ashtra-1.742
vier-1.742
amera-1.715
ersen-1.686
itably-1.665
Token man
Token ID582
Feature activation-7.43e-01

Pos logit contributions
Reviewed+0.332
spot+0.309
worms+0.306
lication+0.304
lessness+0.295
Neg logit contributions
vier-0.338
ashtra-0.333
amera-0.331
ersen-0.326
itably-0.323
Token who
Token ID508
Feature activation+6.76e-01

Pos logit contributions
vier+1.204
ashtra+1.183
amera+1.179
ersen+1.157
orah+1.150
Neg logit contributions
Reviewed-1.134
spot-1.049
worms-1.041
lication-1.033
lessness-0.994
Token has
Token ID468
Feature activation+6.71e-01

Pos logit contributions
Reviewed+0.940
spot+0.873
worms+0.861
lication+0.849
lessness+0.830
Neg logit contributions
vier-1.185
ashtra-1.177
amera-1.168
ersen-1.150
itably-1.137
Token endured
Token ID24067
Feature activation+5.79e-01

Pos logit contributions
Reviewed+1.018
spot+0.953
worms+0.942
lication+0.929
lessness+0.911
Neg logit contributions
vier-1.090
ashtra-1.082
amera-1.073
ersen-1.055
itably-1.042
Token cup
Token ID6508
Feature activation-1.70e-02

Pos logit contributions
vier+1.046
ashtra+1.036
amera+1.027
ersen+1.013
itably+1.001
Neg logit contributions
Reviewed-0.929
spot-0.858
worms-0.858
lication-0.845
lessness-0.827
Token qu
Token ID627
Feature activation-7.60e-01

Pos logit contributions
vier+0.026
ashtra+0.026
amera+0.026
ersen+0.026
itably+0.025
Neg logit contributions
Reviewed-0.027
spot-0.025
worms-0.025
lication-0.024
lessness-0.024
Tokeninoa
Token ID40564
Feature activation+4.50e-01

Pos logit contributions
vier+1.312
ashtra+1.303
amera+1.289
ersen+1.278
itably+1.261
Neg logit contributions
Reviewed-1.076
spot-0.993
worms-0.983
lication-0.967
lessness-0.953
Token and
Token ID290
Feature activation+6.89e-02

Pos logit contributions
Reviewed+0.672
spot+0.630
lication+0.612
worms+0.612
lessness+0.597
Neg logit contributions
vier-0.743
ashtra-0.739
amera-0.732
ersen-0.716
itably-0.714
Token 2
Token ID362
Feature activation+7.40e-01

Pos logit contributions
Reviewed+0.101
spot+0.094
worms+0.093
lication+0.092
lessness+0.090
Neg logit contributions
vier-0.115
ashtra-0.114
amera-0.113
ersen-0.112
itably-0.110
Token cups
Token ID14180
Feature activation+3.07e-01

Pos logit contributions
Reviewed+0.965
spot+0.894
worms+0.881
lication+0.866
lessness+0.851
Neg logit contributions
vier-1.359
ashtra-1.352
amera-1.341
ersen-1.320
itably-1.307
Token of
Token ID286
Feature activation+4.54e-01

Pos logit contributions
Reviewed+0.476
spot+0.445
worms+0.439
lication+0.434
lessness+0.426
Neg logit contributions
vier-0.490
ashtra-0.485
amera-0.481
ersen-0.475
itably-0.468
Token water
Token ID1660
Feature activation+4.19e-01

Pos logit contributions
Reviewed+0.728
spot+0.681
worms+0.671
lication+0.665
lessness+0.656
Neg logit contributions
vier-0.700
ashtra-0.695
amera-0.689
ersen-0.676
itably-0.668
Token into
Token ID656
Feature activation+4.57e-01

Pos logit contributions
Reviewed+0.623
spot+0.582
worms+0.571
lication+0.565
lessness+0.552
Neg logit contributions
vier-0.695
ashtra-0.694
amera-0.684
ersen-0.672
itably-0.666
Token a
Token ID257
Feature activation+3.61e-01

Pos logit contributions
Reviewed+0.733
spot+0.687
worms+0.679
lication+0.669
lessness+0.662
Neg logit contributions
vier-0.701
ashtra-0.696
amera-0.691
ersen-0.677
itably-0.672
Token sauce
Token ID10746
Feature activation+6.73e-01

Pos logit contributions
Reviewed+0.552
worms+0.510
spot+0.510
lication+0.500
lessness+0.491
Neg logit contributions
vier-0.582
amera-0.578
ashtra-0.577
ersen-0.567
itably-0.561
Token input
Token ID5128
Feature activation-3.07e-01

Pos logit contributions
Reviewed+0.208
spot+0.195
worms+0.194
lication+0.191
lessness+0.187
Neg logit contributions
vier-0.184
ashtra-0.183
amera-0.181
ersen-0.178
itably-0.176
Token in
Token ID287
Feature activation-1.02e-01

Pos logit contributions
vier+0.495
ashtra+0.492
amera+0.488
ersen+0.479
itably+0.476
Neg logit contributions
Reviewed-0.469
spot-0.434
worms-0.432
lication-0.426
lessness-0.415
Token compiling
Token ID33393
Feature activation+4.38e-01

Pos logit contributions
vier+0.150
ashtra+0.148
amera+0.147
ersen+0.144
itably+0.143
Neg logit contributions
Reviewed-0.171
spot-0.160
worms-0.160
lication-0.157
lessness-0.154
Token a
Token ID257
Feature activation+1.10e-01

Pos logit contributions
Reviewed+0.714
spot+0.671
worms+0.666
lication+0.656
lessness+0.646
Neg logit contributions
vier-0.662
ashtra-0.658
amera-0.652
ersen-0.639
itably-0.632
Token list
Token ID1351
Feature activation-3.12e-01

Pos logit contributions
Reviewed+0.185
spot+0.172
worms+0.172
lication+0.170
lessness+0.167
Neg logit contributions
vier-0.163
ashtra-0.161
amera-0.160
ersen-0.157
itably-0.156
Token of
Token ID286
Feature activation-3.32e-02

Pos logit contributions
vier+0.485
ashtra+0.482
amera+0.478
ersen+0.470
itably+0.464
Neg logit contributions
Reviewed-0.492
spot-0.461
worms-0.457
lication-0.450
lessness-0.441
Token possible
Token ID1744
Feature activation+4.74e-02

Pos logit contributions
vier+0.051
ashtra+0.051
amera+0.050
ersen+0.049
itably+0.049
Neg logit contributions
Reviewed-0.053
spot-0.049
worms-0.049
lication-0.049
lessness-0.047
Token Supreme
Token ID5617
Feature activation+7.02e-02

Pos logit contributions
Reviewed+0.076
spot+0.071
worms+0.071
lication+0.070
lessness+0.068
Neg logit contributions
vier-0.073
ashtra-0.073
amera-0.072
ersen-0.071
itably-0.070
Token Court
Token ID3078
Feature activation-4.71e-01

Pos logit contributions
Reviewed+0.105
spot+0.098
worms+0.096
lication+0.096
lessness+0.093
Neg logit contributions
vier-0.115
ashtra-0.115
amera-0.114
ersen-0.112
itably-0.111
Token nominees
Token ID26456
Feature activation-5.88e-02

Pos logit contributions
ashtra+0.721
vier+0.714
amera+0.708
itably+0.708
ersen+0.699
Neg logit contributions
Reviewed-0.743
lication-0.691
worms-0.685
spot-0.682
lessness-0.656
Token.
Token ID13
Feature activation+2.56e-01

Pos logit contributions
vier+0.097
ashtra+0.096
amera+0.095
ersen+0.094
itably+0.093
Neg logit contributions
Reviewed-0.087
spot-0.081
worms-0.081
lication-0.080
lessness-0.078
Token link
Token ID2792
Feature activation+2.43e-01

Pos logit contributions
vier+0.153
amera+0.151
ashtra+0.151
ersen+0.148
itably+0.145
Neg logit contributions
Reviewed-0.166
worms-0.155
spot-0.154
lication-0.152
lessness-0.148
Token provides
Token ID3769
Feature activation+2.61e-02

Pos logit contributions
Reviewed+0.341
spot+0.315
worms+0.313
lication+0.308
lessness+0.301
Neg logit contributions
vier-0.423
ashtra-0.420
amera-0.418
ersen-0.409
itably-0.404
Token a
Token ID257
Feature activation+1.57e-02

Pos logit contributions
Reviewed+0.043
spot+0.041
worms+0.041
lication+0.040
lessness+0.039
Neg logit contributions
vier-0.039
ashtra-0.038
amera-0.038
ersen-0.037
itably-0.037
Token copy
Token ID4866
Feature activation+9.63e-01

Pos logit contributions
Reviewed+0.024
worms+0.022
spot+0.022
lication+0.022
lessness+0.021
Neg logit contributions
vier-0.026
ashtra-0.025
amera-0.025
ersen-0.025
itably-0.024
Token of
Token ID286
Feature activation-3.21e-01

Pos logit contributions
Reviewed+1.439
spot+1.358
worms+1.331
lication+1.317
lessness+1.308
Neg logit contributions
vier-1.569
ashtra-1.568
amera-1.544
ersen-1.516
itably-1.512
Token the
Token ID262
Feature activation-6.85e-02

Pos logit contributions
vier+0.436
ashtra+0.432
amera+0.428
ersen+0.419
itably+0.412
Neg logit contributions
Reviewed-0.573
spot-0.542
worms-0.539
lication-0.529
lessness-0.522
Token Excel
Token ID24134
Feature activation-1.10e+00

Pos logit contributions
vier+0.104
ashtra+0.103
amera+0.102
ersen+0.100
itably+0.098
Neg logit contributions
Reviewed-0.112
worms-0.105
spot-0.105
lication-0.103
lessness-0.101
Token work
Token ID670
Feature activation+5.12e-02

Pos logit contributions
vier+1.851
ashtra+1.824
amera+1.816
ersen+1.765
itably+1.752
Neg logit contributions
Reviewed-1.606
worms-1.490
spot-1.473
lication-1.440
lessness-1.430
Tokenbook
Token ID2070
Feature activation+1.43e+00

Pos logit contributions
Reviewed+0.048
spot+0.043
worms+0.042
lication+0.041
lessness+0.040
Neg logit contributions
vier-0.113
ashtra-0.112
amera-0.111
ersen-0.110
orah-0.109
Token I
Token ID314
Feature activation-2.31e-01

Pos logit contributions
Reviewed+2.139
spot+2.008
worms+1.978
lication+1.950
lessness+1.922
Neg logit contributions
vier-2.352
ashtra-2.343
amera-2.318
ersen-2.279
itably-2.256
Token created
Token ID2727
Feature activation-6.55e-02

Pos logit contributions
vier+0.348
ashtra+0.346
amera+0.345
ersen+0.340
orah+0.332
Neg logit contributions
Reviewed-0.376
worms-0.353
spot-0.351
lication-0.348
lessness-0.339
Token profession
Token ID7573
Feature activation+2.55e-01

Pos logit contributions
Reviewed+0.399
spot+0.376
worms+0.372
lication+0.367
lessness+0.361
Neg logit contributions
vier-0.387
ashtra-0.385
amera-0.379
ersen-0.373
itably-0.370
Token,
Token ID11
Feature activation+2.26e-01

Pos logit contributions
Reviewed+0.373
spot+0.348
worms+0.344
lication+0.340
lessness+0.331
Neg logit contributions
vier-0.429
ashtra-0.426
amera-0.422
ersen-0.415
itably-0.412
Token by
Token ID416
Feature activation+2.06e-01

Pos logit contributions
Reviewed+0.347
spot+0.324
worms+0.321
lication+0.317
lessness+0.310
Neg logit contributions
vier-0.364
ashtra-0.362
amera-0.358
ersen-0.353
itably-0.348
Token femin
Token ID7290
Feature activation-2.27e+00

Pos logit contributions
Reviewed+0.318
spot+0.296
worms+0.292
lication+0.290
lessness+0.282
Neg logit contributions
vier-0.331
ashtra-0.328
amera-0.324
ersen-0.320
itably-0.316
Tokenizing
Token ID2890
Feature activation-1.95e-01

Pos logit contributions
vier+4.208
amera+4.182
ashtra+4.107
itably+4.049
ersen+4.044
Neg logit contributions
Reviewed-2.892
spot-2.640
worms-2.629
lication-2.562
lessness-2.495
Token it
Token ID340
Feature activation-2.11e-02

Pos logit contributions
vier+0.298
ashtra+0.294
amera+0.292
ersen+0.286
itably+0.286
Neg logit contributions
Reviewed-0.314
spot-0.296
worms-0.293
lication-0.289
lessness-0.282
Token in
Token ID287
Feature activation-3.04e-02

Pos logit contributions
vier+0.035
ashtra+0.035
amera+0.035
ersen+0.034
itably+0.034
Neg logit contributions
Reviewed-0.031
spot-0.029
worms-0.029
lication-0.028
lessness-0.028
Token unacceptable
Token ID18010
Feature activation+2.61e-01

Pos logit contributions
vier+0.046
ashtra+0.046
amera+0.045
ersen+0.044
orah+0.044
Neg logit contributions
Reviewed-0.049
spot-0.046
worms-0.046
lication-0.045
lessness-0.044
Token ways
Token ID2842
Feature activation-1.11e+00

Pos logit contributions
Reviewed+0.467
spot+0.440
worms+0.437
lication+0.432
lessness+0.424
Neg logit contributions
vier-0.353
ashtra-0.350
amera-0.345
ersen-0.340
itably-0.336
Token.
Token ID13
Feature activation+1.43e-01

Pos logit contributions
vier+1.835
ashtra+1.819
amera+1.810
ersen+1.782
itably+1.763
Neg logit contributions
Reviewed-1.639
spot-1.516
worms-1.512
lication-1.484
lessness-1.442
Token
Token ID198
Feature activation+1.14e-01

Pos logit contributions
Reviewed+0.238
spot+0.224
worms+0.223
lication+0.220
lessness+0.215
Neg logit contributions
vier-0.211
ashtra-0.207
amera-0.206
ersen-0.202
orah-0.200
Token the
Token ID262
Feature activation+1.25e-01

Pos logit contributions
vier+0.142
ashtra+0.141
amera+0.140
ersen+0.137
itably+0.135
Neg logit contributions
Reviewed-0.152
spot-0.143
worms-0.141
lication-0.139
lessness-0.137
Token 2008
Token ID3648
Feature activation+5.33e-01

Pos logit contributions
Reviewed+0.192
spot+0.179
worms+0.177
lication+0.175
lessness+0.172
Neg logit contributions
vier-0.202
ashtra-0.201
amera-0.200
ersen-0.196
itably-0.193
Token film
Token ID2646
Feature activation-3.56e-02

Pos logit contributions
Reviewed+0.752
spot+0.701
worms+0.692
lication+0.682
lessness+0.670
Neg logit contributions
vier-0.919
ashtra-0.914
amera-0.908
ersen-0.892
itably-0.881
Token Kung
Token ID44753
Feature activation-1.16e+00

Pos logit contributions
vier+0.056
ashtra+0.055
amera+0.055
ersen+0.054
itably+0.053
Neg logit contributions
Reviewed-0.056
spot-0.053
worms-0.052
lication-0.051
lessness-0.050
Token Fu
Token ID13333
Feature activation-2.17e+00

Pos logit contributions
vier+1.600
amera+1.598
ashtra+1.573
ersen+1.557
orah+1.532
Neg logit contributions
Reviewed-2.020
spot-1.880
lication-1.874
worms-1.862
lessness-1.808
Token Panda
Token ID41112
Feature activation-5.81e-01

Pos logit contributions
amera+2.821
vier+2.748
ashtra+2.708
ersen+2.688
orah+2.620
Neg logit contributions
Reviewed-3.933
lication-3.722
spot-3.713
worms-3.673
lessness-3.592
Token,
Token ID11
Feature activation+1.41e-01

Pos logit contributions
vier+0.982
amera+0.975
ashtra+0.968
ersen+0.947
orah+0.940
Neg logit contributions
Reviewed-0.838
spot-0.779
worms-0.774
lication-0.759
lessness-0.744
Token is
Token ID318
Feature activation-2.11e-01

Pos logit contributions
Reviewed+0.203
spot+0.187
worms+0.185
lication+0.183
lessness+0.179
Neg logit contributions
vier-0.241
ashtra-0.240
amera-0.239
ersen-0.234
itably-0.231
Token an
Token ID281
Feature activation-4.15e-01

Pos logit contributions
vier+0.324
ashtra+0.322
amera+0.321
ersen+0.314
itably+0.311
Neg logit contributions
Reviewed-0.338
spot-0.317
worms-0.315
lication-0.310
lessness-0.305
Token anthrop
Token ID17911
Feature activation-3.93e-01

Pos logit contributions
vier+0.593
ashtra+0.588
amera+0.587
ersen+0.573
itably+0.564
Neg logit contributions
Reviewed-0.708
spot-0.670
worms-0.665
lication-0.654
lessness-0.643
Tokenomorph
Token ID25831
Feature activation-1.49e+00

Pos logit contributions
vier+0.502
ashtra+0.496
amera+0.493
ersen+0.480
itably+0.474
Neg logit contributions
Reviewed-0.734
spot-0.695
worms-0.689
lication-0.681
lessness-0.673
Token couple
Token ID3155
Feature activation-3.70e-01

Pos logit contributions
Reviewed+0.989
spot+0.921
worms+0.902
lication+0.885
lessness+0.870
Neg logit contributions
vier-1.543
ashtra-1.541
amera-1.526
ersen-1.501
itably-1.486
Token:
Token ID25
Feature activation+4.51e-01

Pos logit contributions
vier+0.608
ashtra+0.601
amera+0.599
ersen+0.591
orah+0.582
Neg logit contributions
Reviewed-0.556
worms-0.513
spot-0.513
lication-0.506
lessness-0.491
Token Del
Token ID4216
Feature activation+1.84e-02

Pos logit contributions
Reviewed+0.698
spot+0.654
worms+0.648
lication+0.638
lessness+0.628
Neg logit contributions
vier-0.718
ashtra-0.714
amera-0.708
ersen-0.694
itably-0.686
Token.
Token ID13
Feature activation+7.54e-01

Pos logit contributions
Reviewed+0.029
spot+0.027
worms+0.027
lication+0.026
lessness+0.026
Neg logit contributions
vier-0.029
ashtra-0.029
amera-0.028
ersen-0.028
orah-0.028
Token Todd
Token ID14377
Feature activation-9.47e-01

Pos logit contributions
Reviewed+1.143
spot+1.071
worms+1.061
lication+1.042
lessness+1.026
Neg logit contributions
vier-1.223
ashtra-1.215
amera-1.205
ersen-1.183
itably-1.170
Token C
Token ID327
Feature activation-3.86e-01

Pos logit contributions
vier+1.551
ashtra+1.522
amera+1.514
ersen+1.508
itably+1.483
Neg logit contributions
Reviewed-1.432
spot-1.319
worms-1.318
lication-1.292
lessness-1.257
Token.
Token ID13
Feature activation+1.34e+00

Pos logit contributions
vier+0.600
ashtra+0.595
amera+0.594
ersen+0.585
itably+0.574
Neg logit contributions
Reviewed-0.618
spot-0.571
worms-0.568
lication-0.562
lessness-0.550
Token Gilbert
Token ID24023
Feature activation+1.68e+00

Pos logit contributions
Reviewed+2.024
spot+1.901
worms+1.877
lication+1.846
lessness+1.825
Neg logit contributions
vier-2.169
ashtra-2.164
amera-2.145
ersen-2.096
itably-2.076
Token (
Token ID357
Feature activation-5.24e-01

Pos logit contributions
Reviewed+2.310
spot+2.159
worms+2.130
lication+2.096
lessness+2.070
Neg logit contributions
vier-2.936
ashtra-2.930
amera-2.906
ersen-2.845
itably-2.820
TokenR
Token ID49
Feature activation+7.32e-01

Pos logit contributions
vier+0.749
ashtra+0.747
amera+0.743
ersen+0.725
orah+0.714
Neg logit contributions
Reviewed-0.883
spot-0.833
worms-0.831
lication-0.823
lessness-0.806
Token-
Token ID12
Feature activation-3.52e-01

Pos logit contributions
Reviewed+0.971
spot+0.899
worms+0.887
lication+0.873
lessness+0.857
Neg logit contributions
vier-1.324
ashtra-1.320
amera-1.307
ersen-1.284
itably-1.272
Token politicians
Token ID7602
Feature activation+2.43e-01

Pos logit contributions
Reviewed+1.107
spot+1.039
worms+1.027
lication+1.012
lessness+0.994
Neg logit contributions
vier-1.094
ashtra-1.089
amera-1.079
ersen-1.060
itably-1.047
Token.
Token ID13
Feature activation-7.36e-02

Pos logit contributions
Reviewed+0.351
spot+0.327
worms+0.324
lication+0.319
lessness+0.312
Neg logit contributions
vier-0.413
ashtra-0.411
amera-0.407
ersen-0.400
itably-0.396
Token Comed
Token ID37024
Feature activation-3.86e-01

Pos logit contributions
vier+0.113
ashtra+0.111
amera+0.111
ersen+0.108
orah+0.107
Neg logit contributions
Reviewed-0.118
worms-0.111
spot-0.111
lication-0.108
lessness-0.107
Tokenian
Token ID666
Feature activation-7.56e-01

Pos logit contributions
vier+0.537
ashtra+0.529
amera+0.526
ersen+0.522
orah+0.510
Neg logit contributions
Reviewed-0.681
spot-0.633
worms-0.632
lication-0.619
lessness-0.612
Token Kathy
Token ID33726
Feature activation-1.37e+00

Pos logit contributions
vier+1.186
ashtra+1.185
amera+1.162
ersen+1.157
itably+1.150
Neg logit contributions
Reviewed-1.177
worms-1.093
spot-1.087
lication-1.067
lessness-1.043
Token Griffin
Token ID16525
Feature activation-2.27e+00

Pos logit contributions
vier+1.956
ashtra+1.934
amera+1.912
ersen+1.900
itably+1.874
Neg logit contributions
Reviewed-2.319
spot-2.176
worms-2.169
lication-2.140
lessness-2.098
Token,
Token ID11
Feature activation+2.79e-01

Pos logit contributions
vier+3.665
ashtra+3.568
amera+3.545
ersen+3.538
itably+3.476
Neg logit contributions
Reviewed-3.390
worms-3.194
spot-3.132
lication-3.081
lessness-2.978
Token who
Token ID508
Feature activation+4.57e-01

Pos logit contributions
Reviewed+0.427
spot+0.398
worms+0.397
lication+0.389
lessness+0.383
Neg logit contributions
vier-0.448
ashtra-0.446
amera-0.443
ersen-0.436
itably-0.428
Token lost
Token ID2626
Feature activation+1.80e+00

Pos logit contributions
Reviewed+0.647
spot+0.599
worms+0.595
lication+0.584
lessness+0.571
Neg logit contributions
vier-0.790
ashtra-0.786
amera-0.778
ersen-0.767
itably-0.758
Token her
Token ID607
Feature activation-2.60e-01

Pos logit contributions
Reviewed+2.897
spot+2.739
worms+2.695
lication+2.662
lessness+2.633
Neg logit contributions
ashtra-2.726
vier-2.723
amera-2.677
ersen-2.633
itably-2.608
Token CNN
Token ID8100
Feature activation-4.57e-01

Pos logit contributions
vier+0.385
amera+0.377
ashtra+0.375
ersen+0.372
orah+0.368
Neg logit contributions
Reviewed-0.436
worms-0.409
spot-0.398
lication-0.391
lessness-0.388
TokenREAD
Token ID15675
Feature activation+8.48e-01

Pos logit contributions
vier+2.789
amera+2.752
ashtra+2.721
orah+2.700
itably+2.697
Neg logit contributions
Reviewed-2.622
spot-2.471
worms-2.462
lessness-2.435
lication-2.422
Token:
Token ID25
Feature activation+3.51e-01

Pos logit contributions
Reviewed+1.300
spot+1.215
worms+1.200
lication+1.190
lessness+1.165
Neg logit contributions
vier-1.362
ashtra-1.358
amera-1.343
ersen-1.318
itably-1.310
Token Cohen
Token ID17154
Feature activation-1.08e+00

Pos logit contributions
Reviewed+0.566
spot+0.530
worms+0.528
lication+0.519
lessness+0.510
Neg logit contributions
vier-0.532
ashtra-0.531
amera-0.526
ersen-0.513
itably-0.513
Token testimony
Token ID9709
Feature activation+1.53e+00

Pos logit contributions
vier+1.705
ashtra+1.698
amera+1.675
orah+1.656
ersen+1.653
Neg logit contributions
Reviewed-1.651
spot-1.551
worms-1.532
lication-1.519
lessness-1.460
Token alleges
Token ID24763
Feature activation-4.14e-01

Pos logit contributions
Reviewed+1.998
spot+1.922
worms+1.885
lessness+1.825
lication+1.812
Neg logit contributions
vier-2.769
ashtra-2.732
amera-2.674
ersen-2.638
itably-2.616
Token Trump
Token ID1301
Feature activation-1.96e+00

Pos logit contributions
vier+0.665
ashtra+0.658
amera+0.652
ersen+0.641
itably+0.634
Neg logit contributions
Reviewed-0.634
spot-0.599
worms-0.590
lication-0.582
lessness-0.575
Token knew
Token ID2993
Feature activation-7.64e-01

Pos logit contributions
ashtra+3.091
vier+3.085
amera+3.048
itably+2.985
ersen+2.979
Neg logit contributions
Reviewed-3.018
spot-2.819
worms-2.755
lication-2.736
lessness-2.650
Token Stone
Token ID8026
Feature activation+3.35e-02

Pos logit contributions
ashtra+1.120
amera+1.118
vier+1.114
ersen+1.091
itably+1.070
Neg logit contributions
Reviewed-1.273
spot-1.183
worms-1.174
lication-1.167
lessness-1.130
Token talked
Token ID6619
Feature activation+1.93e-01

Pos logit contributions
Reviewed+0.046
spot+0.044
worms+0.043
lication+0.042
lessness+0.042
Neg logit contributions
vier-0.059
amera-0.058
ashtra-0.058
itably-0.056
ersen-0.056
Token with
Token ID351
Feature activation-1.95e-01

Pos logit contributions
Reviewed+0.307
spot+0.289
worms+0.285
lication+0.281
lessness+0.277
Neg logit contributions
vier-0.299
ashtra-0.297
amera-0.294
ersen-0.288
itably-0.285
Token WikiLeaks
Token ID19766
Feature activation-6.27e-01

Pos logit contributions
vier+0.316
ashtra+0.313
amera+0.309
ersen+0.304
itably+0.302
Neg logit contributions
Reviewed-0.294
spot-0.277
worms-0.273
lication-0.269
lessness-0.265
Token4
Token ID19
Feature activation+6.20e-01

Pos logit contributions
Reviewed+0.084
worms+0.079
spot+0.079
lication+0.079
lessness+0.077
Neg logit contributions
vier-0.099
ashtra-0.097
amera-0.097
orah-0.094
ersen-0.094
Token.
Token ID13
Feature activation+8.57e-01

Pos logit contributions
Reviewed+0.869
spot+0.810
worms+0.801
lication+0.787
lessness+0.774
Neg logit contributions
vier-1.077
ashtra-1.072
amera-1.062
ersen-1.044
itably-1.033
Token BET
Token ID38651
Feature activation-7.41e-01

Pos logit contributions
Reviewed+1.342
spot+1.261
worms+1.246
lication+1.227
lessness+1.210
Neg logit contributions
vier-1.348
ashtra-1.342
amera-1.328
ersen-1.303
itably-1.288
TokenTY
Token ID9936
Feature activation-7.21e-01

Pos logit contributions
vier+1.194
amera+1.188
ashtra+1.182
ersen+1.152
orah+1.139
Neg logit contributions
Reviewed-1.120
worms-1.043
spot-1.042
lication-1.027
lessness-1.004
Token &
Token ID1222
Feature activation+1.66e-01

Pos logit contributions
vier+1.152
ashtra+1.131
amera+1.118
ersen+1.103
itably+1.092
Neg logit contributions
Reviewed-1.109
spot-1.036
worms-1.032
lication-1.029
lessness-0.990
Token VER
Token ID33310
Feature activation-3.02e+00

Pos logit contributions
Reviewed+0.256
spot+0.240
worms+0.238
lication+0.234
lessness+0.230
Neg logit contributions
vier-0.266
ashtra-0.264
amera-0.262
ersen-0.257
itably-0.254
TokenON
Token ID1340
Feature activation-4.34e-01

Pos logit contributions
vier+4.916
ashtra+4.788
amera+4.760
itably+4.758
orah+4.627
Neg logit contributions
Reviewed-4.348
worms-4.198
spot-4.107
lication-4.063
herent-3.936
TokenICA
Token ID25241
Feature activation+3.54e-01

Pos logit contributions
vier+0.532
ashtra+0.524
amera+0.521
ersen+0.508
orah+0.499
Neg logit contributions
Reviewed-0.829
spot-0.784
worms-0.783
lication-0.773
lessness-0.763
Token (
Token ID357
Feature activation-6.50e-01

Pos logit contributions
Reviewed+0.533
spot+0.499
worms+0.494
lication+0.488
lessness+0.478
Neg logit contributions
vier-0.577
ashtra-0.573
amera-0.568
ersen-0.558
itably-0.551
TokenAr
Token ID3163
Feature activation-4.66e-01

Pos logit contributions
vier+1.059
ashtra+1.042
amera+1.037
ersen+1.014
orah+1.009
Neg logit contributions
Reviewed-0.963
spot-0.908
worms-0.904
lication-0.902
lessness-0.882
Tokenchie
Token ID3043
Feature activation-1.41e+00

Pos logit contributions
vier+0.695
ashtra+0.680
amera+0.677
ersen+0.664
itably+0.657
Neg logit contributions
Reviewed-0.776
spot-0.722
worms-0.720
lication-0.711
lessness-0.700
Token earn
Token ID5160
Feature activation+1.63e+00

Pos logit contributions
Reviewed+1.521
spot+1.424
worms+1.409
lication+1.387
lessness+1.365
Neg logit contributions
vier-1.609
ashtra-1.601
amera-1.586
ersen-1.558
itably-1.539
Token and
Token ID290
Feature activation+1.38e-01

Pos logit contributions
Reviewed+2.308
spot+2.162
worms+2.132
lication+2.091
lessness+2.077
Neg logit contributions
vier-2.790
ashtra-2.778
amera-2.748
ersen-2.700
itably-2.670
Token make
Token ID787
Feature activation-2.59e-02

Pos logit contributions
Reviewed+0.220
spot+0.205
worms+0.203
lication+0.200
lessness+0.195
Neg logit contributions
vier-0.215
ashtra-0.214
amera-0.211
ersen-0.209
itably-0.206
Token them
Token ID606
Feature activation+3.26e-01

Pos logit contributions
vier+0.042
ashtra+0.041
amera+0.041
ersen+0.040
itably+0.040
Neg logit contributions
Reviewed-0.040
spot-0.037
worms-0.037
lication-0.036
lessness-0.035
Token feel
Token ID1254
Feature activation+1.05e+00

Pos logit contributions
Reviewed+0.511
spot+0.479
worms+0.474
lication+0.467
lessness+0.460
Neg logit contributions
vier-0.513
ashtra-0.510
amera-0.505
ersen-0.496
itably-0.490
Token rewarded
Token ID20945
Feature activation-1.92e+00

Pos logit contributions
Reviewed+1.719
spot+1.619
worms+1.602
lication+1.579
lessness+1.557
Neg logit contributions
vier-1.578
ashtra-1.570
amera-1.554
ersen-1.523
itably-1.504
Token at
Token ID379
Feature activation+1.79e-01

Pos logit contributions
vier+3.034
ashtra+2.991
amera+2.975
ersen+2.938
itably+2.909
Neg logit contributions
Reviewed-2.957
worms-2.738
spot-2.729
lication-2.700
lessness-2.572
Token the
Token ID262
Feature activation+5.18e-03

Pos logit contributions
Reviewed+0.284
spot+0.266
worms+0.264
lication+0.259
lessness+0.255
Neg logit contributions
vier-0.279
ashtra-0.278
amera-0.275
ersen-0.270
itably-0.267
Token end
Token ID886
Feature activation-6.44e-01

Pos logit contributions
Reviewed+0.008
spot+0.007
worms+0.007
lication+0.007
lessness+0.007
Neg logit contributions
vier-0.009
ashtra-0.008
amera-0.008
ersen-0.008
itably-0.008
Token of
Token ID286
Feature activation-2.55e-01

Pos logit contributions
vier+1.057
ashtra+1.052
amera+1.047
ersen+1.019
itably+1.006
Neg logit contributions
Reviewed-0.966
spot-0.908
worms-0.895
lication-0.882
lessness-0.878
Token the
Token ID262
Feature activation+1.29e-01

Pos logit contributions
vier+0.358
ashtra+0.354
amera+0.351
ersen+0.345
itably+0.342
Neg logit contributions
Reviewed-0.444
spot-0.418
worms-0.414
lication-0.408
lessness-0.401
Token division
Token ID7297
Feature activation+4.49e-01

Pos logit contributions
Reviewed+0.814
spot+0.763
worms+0.757
lication+0.744
lessness+0.729
Neg logit contributions
vier-0.835
ashtra-0.831
amera-0.823
ersen-0.809
orah-0.797
Token Hel
Token ID5053
Feature activation+6.72e-01

Pos logit contributions
Reviewed+0.696
spot+0.653
worms+0.646
lication+0.636
lessness+0.625
Neg logit contributions
vier-0.713
ashtra-0.709
amera-0.702
ersen-0.690
itably-0.681
Tokenfen
Token ID41037
Feature activation-8.27e-01

Pos logit contributions
Reviewed+1.037
spot+0.981
worms+0.968
lication+0.949
lessness+0.940
Neg logit contributions
vier-1.060
ashtra-1.057
amera-1.046
ersen-1.030
orah-1.014
Token oh
Token ID11752
Feature activation-9.88e-01

Pos logit contributions
vier+1.357
ashtra+1.345
amera+1.335
ersen+1.313
orah+1.284
Neg logit contributions
Reviewed-1.240
spot-1.157
worms-1.154
lication-1.131
lessness-1.106
Tokenne
Token ID710
Feature activation-1.88e-01

Pos logit contributions
vier+1.859
amera+1.848
ashtra+1.821
ersen+1.808
ager+1.776
Neg logit contributions
Reviewed-1.253
worms-1.142
spot-1.132
lication-1.130
lessness-1.091
Token Gren
Token ID19674
Feature activation-1.86e+00

Pos logit contributions
vier+0.310
ashtra+0.306
amera+0.305
ersen+0.300
itably+0.296
Neg logit contributions
Reviewed-0.282
spot-0.265
worms-0.262
lication-0.258
lessness-0.253
Tokenzen
Token ID4801
Feature activation-2.06e-01

Pos logit contributions
vier+3.008
amera+2.972
ashtra+2.947
ersen+2.907
orah+2.850
Neg logit contributions
Reviewed-2.853
worms-2.649
spot-2.640
lication-2.590
lessness-2.543
Token e
Token ID304
Feature activation-6.39e-01

Pos logit contributions
vier+0.344
ashtra+0.339
amera+0.338
ersen+0.332
itably+0.326
Neg logit contributions
Reviewed-0.303
spot-0.284
worms-0.281
lication-0.277
lessness-0.271
Token.
Token ID13
Feature activation+2.87e-01

Pos logit contributions
vier+1.032
ashtra+1.020
amera+1.015
ersen+1.006
itably+0.990
Neg logit contributions
Reviewed-0.967
spot-0.910
worms-0.904
lication-0.886
lessness-0.867
TokenV
Token ID53
Feature activation-1.01e+00

Pos logit contributions
Reviewed+0.407
spot+0.379
worms+0.375
lication+0.369
lessness+0.363
Neg logit contributions
vier-0.494
ashtra-0.492
amera-0.488
ersen-0.479
itably-0.474
Token.
Token ID13
Feature activation+5.38e-01

Pos logit contributions
vier+1.701
ashtra+1.683
amera+1.677
ersen+1.659
orah+1.630
Neg logit contributions
Reviewed-1.478
spot-1.375
worms-1.363
lication-1.331
lessness-1.310