TOP ACTIVATIONS
MAX = 1.645

<|endoftext|> Haw aii Congress woman Tuls i G abb ard is
Brain ] Pixel pipe Advertisement
<|endoftext|> The Geek Z odiac THE G EEK
texture . Sun flower Seed Fl our For
make the connection between Russian hackers and the Democratic National Committee
, in conjunction with Encyclopedia Britann ica Films and an O
a transition of ownership , Civ ica , a major player
getting its students admitted into Ivy League universities . Photo :
Armenian nation s survival . That
said . G ob ler s team
, St é ph ane Dion , had already announced his
who has ever taken a CPR class , you know that
, an open - world survival title that is already in
metal ; its hard all oys destroyed dies and machinery .
ological profile of N - cyclop rop yl methyl - 7
priorities : allegiance to v oodoo tr umps all .
of God ," reads the ob it published in the Richmond
this challenge will reference the relevant passage from the United States
that the connection between Russian hackers and the DNC leak isn
awful moment when you administer CPR for the first time ,

MIN ACTIVATIONS
MIN = -1.737

the ends of their tr unks . Ly uba 's were
ateurs cher cher ont un comprom is ent re t out
Sentinel / T NS via Getty Images ) Indiana Gov .
64 - bit Hy rule Fi end , O car ina
is determined by how the carc ass is scored for things
glove comp artments of tr unks of cars that enter national
) on the Mongol ian Plate au . The Il k
Bang Theory CBS Amy Poe hler Parks and Recreation NBC
Unlike the produce section at grocery stores , sushi restaurants don
from their carry - on baggage when going through airline security
Hide Caption 7 of 15 Photos : Carly Fiorina 's political
right to personal integrity ," Sa arak k ala is quoted
our daily newsletter Upgrade your inbox and get our Daily Dispatch
Hide Caption 4 of 15 Photos : Carly Fiorina 's political
Ves a - Matt i Sa arak k ala , a
Vel ie knows the Progressive Conservative government is talking openly
the troops " to enc irc le and destroy [ the
largely in anger at Patricia Neal . He would threaten to
seems to be dis j ointed with the way the toilet
is quoted as saying . Sa arak k ala , a

INTERVAL 0.799 - 1.645
CONTAINS 0.596%

said the campaign has been looking to media reports to make
different places , and the verb is here , or the
" ex empl ary " Defense secretary , calling him critical
TC button and the other cigarette lighter And here is the
mag i probably not relevant . Move on

INTERVAL -0.046 - 0.799
CONTAINS 58.669%

to celebrate the birthday of his brother , Kim bal .
But with a change to its policies at the end of
of things : going on a holiday , finding a partner
more out there than most . - July 6 , 2010
2018 . Walker did not waste his latest chance

INTERVAL -0.892 - -0.046
CONTAINS 40.456%

yet so few have it , due to the custom apps
as it tips over , N ett les explains
no further then the river that runs through Shang ba to
agency 's chief executive Stella Av ram opoulos said the face
ort ot ft w > loving this meme [ 10 :

INTERVAL -1.737 - -0.892
CONTAINS 0.278%

Here 's a breakdown who the the targets remaining
and I just felt speech less . " Oh
and about 53 % were office workers or in the military
when a TC RI truck drove by , and a strange
less mirror ing the Rab id Pupp ies slate . Essentially
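
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX. The following is a minimal sketch that checks this reading against the listed numbers; the function name `interval_edges` and the assumption of four equal-width bins are illustrative and are not stated by the dashboard itself. The listed boundaries are rounded to three decimals, so the comparison uses a tolerance of 0.001.

```python
# Sketch: reproduce the INTERVAL boundaries from MIN and MAX,
# assuming (hypothetically) four equal-width bins.

def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# Values copied from the dump above.
MIN_ACT = -1.737
MAX_ACT = 1.645
listed_edges = [-1.737, -0.892, -0.046, 0.799, 1.645]
listed_shares = [0.278, 40.456, 58.669, 0.596]  # CONTAINS %, lowest bin first

edges = interval_edges(MIN_ACT, MAX_ACT)

# Each computed edge should match the listed one up to display rounding.
edges_match = all(abs(e - l) <= 1e-3 for e, l in zip(edges, listed_edges))

# The CONTAINS percentages should cover (almost) all sampled tokens.
total_share = sum(listed_shares)

if __name__ == "__main__":
    print("edges match listed boundaries:", edges_match)
    print("total share of tokens (%):", round(total_share, 3))
```

Under this reading, roughly 99% of token activations fall in the two middle bins, and the TOP and MIN example lists are drawn from the two sparsely populated outer bins.
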
Token<|endoftext|>
Token ID50256
Feature activation+5.13e-03

Pos logit contributions
 capped+0.040
mil+0.038
 primates+0.038
 kidnapping+0.037
 diving+0.036
Neg logit contributions
ngth-0.051
Avg-0.047
aund-0.047
esson-0.047
enegger-0.046
TokenHaw
Token ID33055
Feature activation-8.79e-02

Pos logit contributions
 capped+0.009
mil+0.009
 primates+0.009
respond+0.008
 kidnapping+0.008
Neg logit contributions
ngth-0.010
aund-0.009
enegger-0.009
esson-0.009
Avg-0.009
Tokenaii
Token ID42648
Feature activation+2.22e-01

Pos logit contributions
ngth+0.172
Avg+0.158
aund+0.157
esson+0.155
enegger+0.155
Neg logit contributions
 capped-0.158
mil-0.151
 primates-0.149
 kidnapping-0.145
 diving-0.145
Token Congress
Token ID3162
Feature activation-7.78e-02

Pos logit contributions
 capped+0.382
mil+0.360
 primates+0.359
 kidnapping+0.351
 diving+0.349
Neg logit contributions
ngth-0.456
Avg-0.417
aund-0.417
esson-0.415
enegger-0.414
Tokenwoman
Token ID8580
Feature activation-5.86e-01

Pos logit contributions
ngth+0.152
Avg+0.139
aund+0.139
esson+0.138
enegger+0.137
Neg logit contributions
 capped-0.140
mil-0.133
 primates-0.132
 kidnapping-0.129
 diving-0.128
Token Tuls
Token ID33219
Feature activation+1.64e+00

Pos logit contributions
ngth+1.255
Avg+1.161
aund+1.157
esson+1.154
enegger+1.143
Neg logit contributions
 capped-0.940
mil-0.889
 primates-0.879
 kidnapping-0.855
 diving-0.848
Tokeni
Token ID72
Feature activation+1.75e-01

Pos logit contributions
 capped+2.248
mil+2.162
 primates+2.038
rub+2.015
 diving+2.001
Neg logit contributions
ngth-3.951
enegger-3.627
aund-3.603
Avg-3.584
esson-3.581
Token G
Token ID402
Feature activation-1.43e-01

Pos logit contributions
 capped+0.257
mil+0.241
 primates+0.238
 kidnapping+0.232
 diving+0.232
Neg logit contributions
ngth-0.400
Avg-0.370
aund-0.370
enegger-0.369
esson-0.366
Tokenabb
Token ID6485
Feature activation+2.13e-01

Pos logit contributions
ngth+0.187
Avg+0.162
enegger+0.159
esson+0.148
aund+0.147
Neg logit contributions
 capped-0.350
mil-0.348
rub-0.335
 kidnapping-0.332
 primates-0.332
Tokenard
Token ID446
Feature activation-1.13e-01

Pos logit contributions
 capped+0.470
mil+0.452
 primates+0.448
 kidnapping+0.439
 diving+0.437
Neg logit contributions
ngth-0.329
Avg-0.295
aund-0.294
esson-0.293
enegger-0.289
Token is
Token ID318
Feature activation+3.87e-02

Pos logit contributions
ngth+0.229
Avg+0.211
aund+0.210
esson+0.210
enegger+0.208
Neg logit contributions
 capped-0.194
mil-0.184
 primates-0.182
 kidnapping-0.177
 diving-0.176
TokenBrain
Token ID44687
Feature activation-2.52e-01

Pos logit contributions
 capped+0.231
mil+0.219
 primates+0.214
 kidnapping+0.208
 diving+0.206
Neg logit contributions
ngth-0.358
aund-0.329
esson-0.328
Avg-0.328
enegger-0.326
Token]
Token ID60
Feature activation+2.40e-01

Pos logit contributions
ngth+0.488
Avg+0.447
aund+0.445
esson+0.443
enegger+0.439
Neg logit contributions
 capped-0.458
mil-0.436
 primates-0.431
 kidnapping-0.421
 diving-0.419
Token
Token ID198
Feature activation+5.10e-02

Pos logit contributions
 capped+0.396
mil+0.374
 primates+0.369
 kidnapping+0.359
 diving+0.357
Neg logit contributions
ngth-0.508
Avg-0.468
aund-0.467
esson-0.464
enegger-0.461
Token
Token ID198
Feature activation-3.71e-03

Pos logit contributions
 capped+0.082
mil+0.078
 primates+0.077
 kidnapping+0.075
 disadvant+0.075
Neg logit contributions
ngth-0.109
Avg-0.101
aund-0.100
esson-0.100
enegger-0.099
TokenPixel
Token ID40809
Feature activation+3.51e-01

Pos logit contributions
ngth+0.007
aund+0.007
esson+0.007
enegger+0.007
Avg+0.007
Neg logit contributions
 capped-0.007
mil-0.006
 primates-0.006
 kidnapping-0.006
respond-0.006
Tokenpipe
Token ID34360
Feature activation+1.60e+00

Pos logit contributions
 capped+0.597
mil+0.570
 primates+0.558
 diving+0.544
 kidnapping+0.544
Neg logit contributions
ngth-0.718
Avg-0.661
aund-0.660
enegger-0.659
esson-0.657
Token
Token ID198
Feature activation+8.54e-02

Pos logit contributions
 capped+2.401
mil+2.271
 primates+2.184
 diving+2.168
 kidnapping+2.123
Neg logit contributions
ngth-3.545
Avg-3.302
aund-3.283
enegger-3.267
esson-3.255
Token
Token ID198
Feature activation+1.60e-02

Pos logit contributions
 capped+0.137
mil+0.130
 primates+0.128
 kidnapping+0.125
 diving+0.124
Neg logit contributions
ngth-0.183
Avg-0.169
aund-0.169
esson-0.168
enegger-0.167
TokenAdvertisement
Token ID4723
Feature activation-8.81e-01

Pos logit contributions
 capped+0.028
mil+0.027
 primates+0.026
 kidnapping+0.025
respond+0.025
Neg logit contributions
ngth-0.032
aund-0.029
Avg-0.029
esson-0.029
enegger-0.029
Token
Token ID198
Feature activation+9.83e-02

Pos logit contributions
ngth+1.704
Avg+1.641
esson+1.615
aund+1.611
enegger+1.577
Neg logit contributions
 capped-1.514
 disadvant-1.459
 primates-1.444
mil-1.419
 kidnapping-1.405
Token
Token ID198
Feature activation+4.15e-02

Pos logit contributions
 capped+0.157
mil+0.148
 primates+0.146
 kidnapping+0.142
 diving+0.141
Neg logit contributions
ngth-0.212
Avg-0.196
aund-0.195
esson-0.194
enegger-0.193
Token
Token ID251
Feature activation-1.92e-01

Pos logit contributions
ngth+0.029
Avg+0.027
aund+0.027
esson+0.027
enegger+0.026
Neg logit contributions
 capped-0.026
mil-0.025
 primates-0.025
 kidnapping-0.024
 diving-0.024
Token<|endoftext|>
Token ID50256
Feature activation+7.22e-02

Pos logit contributions
ngth+0.417
Avg+0.383
aund+0.382
esson+0.380
enegger+0.378
Neg logit contributions
 capped-0.305
mil-0.289
 primates-0.284
 kidnapping-0.276
respond-0.275
TokenThe
Token ID464
Feature activation-4.83e-02

Pos logit contributions
 capped+0.128
mil+0.124
 primates+0.120
respond+0.118
 kidnapping+0.118
Neg logit contributions
ngth-0.144
aund-0.130
enegger-0.130
esson-0.130
Avg-0.129
Token Geek
Token ID33639
Feature activation+1.97e-01

Pos logit contributions
ngth+0.100
Avg+0.092
aund+0.092
esson+0.091
enegger+0.091
Neg logit contributions
 capped-0.082
mil-0.077
 primates-0.077
 kidnapping-0.075
 diving-0.075
Token Z
Token ID1168
Feature activation-5.26e-01

Pos logit contributions
 capped+0.332
mil+0.315
 primates+0.311
 kidnapping+0.303
 diving+0.301
Neg logit contributions
ngth-0.411
Avg-0.376
aund-0.376
esson-0.373
enegger-0.371
Tokenodiac
Token ID40096
Feature activation+1.55e+00

Pos logit contributions
ngth+1.094
Avg+1.010
aund+1.008
esson+1.004
enegger+0.994
Neg logit contributions
 capped-0.876
mil-0.831
 primates-0.822
 kidnapping-0.801
 diving-0.795
Token
Token ID198
Feature activation+9.83e-02

Pos logit contributions
 capped+2.658
 primates+2.484
mil+2.477
 diving+2.419
 kidnapping+2.414
Neg logit contributions
ngth-3.212
Avg-2.940
aund-2.915
enegger-2.884
esson-2.854
Token
Token ID198
Feature activation+5.45e-02

Pos logit contributions
 capped+0.150
mil+0.143
 primates+0.138
 kidnapping+0.134
respond+0.134
Neg logit contributions
ngth-0.223
aund-0.203
enegger-0.202
esson-0.201
Avg-0.201
TokenTHE
Token ID10970
Feature activation-2.19e-01

Pos logit contributions
 capped+0.096
mil+0.093
 primates+0.091
 kidnapping+0.088
respond+0.088
Neg logit contributions
ngth-0.109
aund-0.099
enegger-0.098
esson-0.098
Avg-0.098
Token G
Token ID402
Feature activation-2.34e-01

Pos logit contributions
ngth+0.458
Avg+0.422
aund+0.421
esson+0.419
enegger+0.415
Neg logit contributions
 capped-0.364
mil-0.344
 primates-0.341
 kidnapping-0.332
 diving-0.330
TokenEEK
Token ID33823
Feature activation+6.52e-01

Pos logit contributions
ngth+0.446
Avg+0.402
aund+0.399
esson+0.398
enegger+0.397
Neg logit contributions
 capped-0.437
mil-0.417
 primates-0.413
 kidnapping-0.402
 diving-0.402
Token texture
Token ID11743
Feature activation-3.55e-01

Pos logit contributions
 capped+0.003
mil+0.003
 primates+0.003
 kidnapping+0.003
 diving+0.003
Neg logit contributions
ngth-0.004
Avg-0.004
aund-0.004
esson-0.004
enegger-0.004
Token.
Token ID13
Feature activation+3.87e-02

Pos logit contributions
ngth+0.694
Avg+0.635
aund+0.633
esson+0.631
enegger+0.626
Neg logit contributions
 capped-0.644
mil-0.615
 primates-0.606
 diving-0.591
 kidnapping-0.589
Token
Token ID198
Feature activation+3.32e-02

Pos logit contributions
 capped+0.060
mil+0.057
 primates+0.056
 kidnapping+0.054
 diving+0.054
Neg logit contributions
ngth-0.086
aund-0.079
Avg-0.079
esson-0.078
enegger-0.078
Token
Token ID198
Feature activation-1.08e-02

Pos logit contributions
 capped+0.053
mil+0.050
 primates+0.050
 kidnapping+0.049
 diving+0.048
Neg logit contributions
ngth-0.071
Avg-0.066
aund-0.066
esson-0.065
enegger-0.065
TokenSun
Token ID16012
Feature activation+3.51e-02

Pos logit contributions
ngth+0.022
enegger+0.020
aund+0.020
esson+0.020
Avg+0.020
Neg logit contributions
 capped-0.019
mil-0.018
 primates-0.017
respond-0.017
 kidnapping-0.017
Tokenflower
Token ID25547
Feature activation+1.53e+00

Pos logit contributions
 capped+0.049
mil+0.046
 primates+0.046
 kidnapping+0.044
 diving+0.044
Neg logit contributions
ngth-0.082
Avg-0.077
aund-0.076
esson-0.076
enegger-0.076
Token Seed
Token ID23262
Feature activation+2.21e-01

Pos logit contributions
 capped+2.352
mil+2.267
 primates+2.128
 kidnapping+2.092
 Levin+2.081
Neg logit contributions
ngth-3.397
enegger-3.141
Avg-3.117
esson-3.114
aund-3.093
Token Fl
Token ID1610
Feature activation+7.28e-01

Pos logit contributions
 capped+0.339
mil+0.322
 primates+0.314
 kidnapping+0.305
 diving+0.303
Neg logit contributions
ngth-0.491
Avg-0.455
aund-0.453
esson-0.451
enegger-0.450
Tokenour
Token ID454
Feature activation+2.97e-01

Pos logit contributions
 capped+1.563
mil+1.533
 primates+1.480
rub+1.472
respond+1.465
Neg logit contributions
ngth-1.176
enegger-1.051
Avg-1.022
aund-0.999
esson-0.992
Token
Token ID960
Feature activation+1.87e-01

Pos logit contributions
 capped+0.475
mil+0.452
 primates+0.440
 kidnapping+0.429
 diving+0.427
Neg logit contributions
ngth-0.638
Avg-0.590
aund-0.588
esson-0.587
enegger-0.584
TokenFor
Token ID1890
Feature activation+3.23e-01

Pos logit contributions
 capped+0.328
mil+0.317
 primates+0.308
 kidnapping+0.301
respond+0.301
Neg logit contributions
ngth-0.375
aund-0.340
enegger-0.340
esson-0.339
Avg-0.337
Token make
Token ID787
Feature activation+2.50e-02

Pos logit contributions
 capped+0.144
mil+0.136
 primates+0.135
 kidnapping+0.131
 diving+0.130
Neg logit contributions
ngth-0.186
Avg-0.172
aund-0.171
esson-0.170
enegger-0.169
Token the
Token ID262
Feature activation+8.99e-02

Pos logit contributions
 capped+0.040
mil+0.038
 primates+0.038
 kidnapping+0.037
 diving+0.036
Neg logit contributions
ngth-0.054
Avg-0.050
aund-0.050
esson-0.049
enegger-0.049
Token connection
Token ID4637
Feature activation+8.42e-02

Pos logit contributions
 capped+0.146
mil+0.138
 primates+0.137
 kidnapping+0.133
 diving+0.132
Neg logit contributions
ngth-0.190
Avg-0.176
aund-0.176
esson-0.175
enegger-0.173
Token between
Token ID1022
Feature activation+8.36e-01

Pos logit contributions
 capped+0.144
mil+0.136
 primates+0.135
 kidnapping+0.132
 diving+0.131
Neg logit contributions
ngth-0.173
Avg-0.159
aund-0.158
esson-0.157
enegger-0.157
Token Russian
Token ID3394
Feature activation+2.71e-01

Pos logit contributions
 capped+1.240
mil+1.160
 primates+1.156
 kidnapping+1.131
 diving+1.125
Neg logit contributions
ngth-1.919
Avg-1.771
aund-1.758
enegger-1.749
esson-1.745
Token hackers
Token ID17110
Feature activation+1.50e+00

Pos logit contributions
 capped+0.430
mil+0.405
 primates+0.400
 kidnapping+0.392
 diving+0.390
Neg logit contributions
ngth-0.591
Avg-0.547
aund-0.543
enegger-0.542
esson-0.541
Token and
Token ID290
Feature activation-2.73e-02

Pos logit contributions
 capped+2.596
mil+2.442
 kidnapping+2.410
 diving+2.387
 primates+2.379
Neg logit contributions
ngth-3.028
Avg-2.834
enegger-2.793
aund-2.780
esson-2.738
Token the
Token ID262
Feature activation-6.83e-02

Pos logit contributions
ngth+0.059
Avg+0.054
aund+0.054
esson+0.053
enegger+0.053
Neg logit contributions
 capped-0.044
mil-0.042
 primates-0.041
 kidnapping-0.040
 diving-0.040
Token Democratic
Token ID4390
Feature activation-4.25e-01

Pos logit contributions
ngth+0.142
Avg+0.130
aund+0.130
esson+0.129
enegger+0.129
Neg logit contributions
 capped-0.115
mil-0.109
 primates-0.108
 kidnapping-0.106
 diving-0.105
Token National
Token ID2351
Feature activation-4.52e-01

Pos logit contributions
ngth+0.683
Avg+0.613
aund+0.610
esson+0.606
enegger+0.602
Neg logit contributions
 capped-0.914
mil-0.877
 primates-0.870
 kidnapping-0.853
 diving-0.849
Token Committee
Token ID4606
Feature activation-1.03e+00

Pos logit contributions
ngth+0.731
Avg+0.671
aund+0.667
esson+0.666
enegger+0.653
Neg logit contributions
 capped-0.945
mil-0.908
 primates-0.899
 disadvant-0.880
 kidnapping-0.878
Token,
Token ID11
Feature activation+1.45e-01

Pos logit contributions
 capped+0.357
mil+0.339
 primates+0.333
 kidnapping+0.326
 diving+0.324
Neg logit contributions
ngth-0.424
Avg-0.388
aund-0.387
esson-0.385
enegger-0.383
Token in
Token ID287
Feature activation-1.09e-02

Pos logit contributions
 capped+0.233
mil+0.218
 primates+0.216
 kidnapping+0.211
 diving+0.210
Neg logit contributions
ngth-0.316
aund-0.292
Avg-0.291
esson-0.289
enegger-0.287
Token conjunction
Token ID17856
Feature activation+3.13e-01

Pos logit contributions
ngth+0.024
Avg+0.023
aund+0.023
esson+0.022
enegger+0.022
Neg logit contributions
 capped-0.017
mil-0.016
 primates-0.016
 kidnapping-0.015
 diving-0.015
Token with
Token ID351
Feature activation+5.83e-02

Pos logit contributions
 capped+0.499
mil+0.472
 primates+0.467
 kidnapping+0.454
 diving+0.451
Neg logit contributions
ngth-0.673
Avg-0.623
aund-0.621
esson-0.618
enegger-0.613
Token Encyclopedia
Token ID36732
Feature activation+2.63e-01

Pos logit contributions
 capped+0.101
mil+0.095
 primates+0.095
 kidnapping+0.093
 diving+0.093
Neg logit contributions
ngth-0.119
Avg-0.109
aund-0.109
esson-0.108
enegger-0.107
Token Britann
Token ID46693
Feature activation+1.49e+00

Pos logit contributions
 capped+0.441
mil+0.418
 primates+0.414
 diving+0.405
 kidnapping+0.402
Neg logit contributions
ngth-0.541
aund-0.500
esson-0.497
Avg-0.497
enegger-0.495
Tokenica
Token ID3970
Feature activation-1.91e-01

Pos logit contributions
 capped+2.437
mil+2.358
 primates+2.301
 kidnapping+2.241
 diving+2.232
Neg logit contributions
ngth-3.182
enegger-2.852
Avg-2.830
esson-2.821
aund-2.788
Token Films
Token ID30198
Feature activation+1.30e-02

Pos logit contributions
ngth+0.367
Avg+0.335
aund+0.335
esson+0.335
enegger+0.332
Neg logit contributions
 capped-0.348
mil-0.331
 primates-0.329
 diving-0.321
 kidnapping-0.320
Token and
Token ID290
Feature activation+3.11e-02

Pos logit contributions
 capped+0.024
mil+0.023
 primates+0.022
 kidnapping+0.022
 diving+0.022
Neg logit contributions
ngth-0.025
Avg-0.023
aund-0.023
esson-0.023
enegger-0.023
Token an
Token ID281
Feature activation-1.72e-01

Pos logit contributions
 capped+0.053
mil+0.050
 primates+0.050
 kidnapping+0.048
 diving+0.048
Neg logit contributions
ngth-0.064
Avg-0.059
aund-0.059
esson-0.059
enegger-0.058
Token O
Token ID440
Feature activation+8.99e-03

Pos logit contributions
ngth+0.361
aund+0.329
Avg+0.328
esson+0.327
enegger+0.326
Neg logit contributions
 capped-0.287
 primates-0.270
mil-0.268
 kidnapping-0.262
 diving-0.261
Token a
Token ID257
Feature activation+4.39e-02

Pos logit contributions
 capped+0.509
mil+0.481
 primates+0.478
 kidnapping+0.465
 diving+0.464
Neg logit contributions
ngth-0.602
Avg-0.552
aund-0.550
enegger-0.546
esson-0.545
Token transition
Token ID6801
Feature activation+3.81e-01

Pos logit contributions
 capped+0.066
mil+0.061
 primates+0.061
 kidnapping+0.059
 diving+0.059
Neg logit contributions
ngth-0.100
Avg-0.093
aund-0.092
esson-0.092
enegger-0.091
Token of
Token ID286
Feature activation-3.59e-02

Pos logit contributions
 capped+0.632
mil+0.592
 primates+0.587
 kidnapping+0.573
 diving+0.568
Neg logit contributions
ngth-0.812
Avg-0.747
aund-0.741
esson-0.739
enegger-0.737
Token ownership
Token ID9238
Feature activation-4.97e-01

Pos logit contributions
ngth+0.075
Avg+0.069
aund+0.069
esson+0.069
enegger+0.068
Neg logit contributions
 capped-0.060
mil-0.057
 primates-0.056
 kidnapping-0.055
 diving-0.055
Token,
Token ID11
Feature activation-2.68e-02

Pos logit contributions
ngth+0.992
Avg+0.912
aund+0.909
esson+0.905
enegger+0.897
Neg logit contributions
 capped-0.873
mil-0.829
 primates-0.821
 kidnapping-0.801
 diving-0.796
Token Civ
Token ID47909
Feature activation+1.47e+00

Pos logit contributions
ngth+0.056
Avg+0.052
aund+0.051
esson+0.051
enegger+0.051
Neg logit contributions
 capped-0.045
mil-0.042
 primates-0.042
 kidnapping-0.041
 diving-0.041
Tokenica
Token ID3970
Feature activation+4.40e-01

Pos logit contributions
 capped+2.212
mil+2.136
 primates+2.027
ilon+1.993
 diving+1.986
Neg logit contributions
ngth-3.241
enegger-3.027
aund-2.984
Avg-2.971
esson-2.923
Token,
Token ID11
Feature activation+1.31e-01

Pos logit contributions
 capped+0.767
mil+0.717
 primates+0.712
 diving+0.691
 kidnapping+0.689
Neg logit contributions
ngth-0.899
Avg-0.825
aund-0.820
enegger-0.813
esson-0.810
Token a
Token ID257
Feature activation+1.46e-01

Pos logit contributions
 capped+0.210
mil+0.197
 primates+0.194
 kidnapping+0.189
 diving+0.188
Neg logit contributions
ngth-0.287
Avg-0.265
aund-0.264
enegger-0.262
esson-0.261
Token major
Token ID1688
Feature activation+8.91e-01

Pos logit contributions
 capped+0.233
mil+0.218
 primates+0.216
 kidnapping+0.211
 diving+0.211
Neg logit contributions
ngth-0.320
Avg-0.296
aund-0.294
esson-0.292
enegger-0.291
Token player
Token ID2137
Feature activation+1.47e-01

Pos logit contributions
 capped+1.462
mil+1.372
 primates+1.358
 kidnapping+1.341
 diving+1.340
Neg logit contributions
ngth-1.924
Avg-1.754
aund-1.739
esson-1.736
enegger-1.724
Token getting
Token ID1972
Feature activation-1.09e-01

Pos logit contributions
 capped+0.012
mil+0.012
 primates+0.012
 kidnapping+0.011
 diving+0.011
Neg logit contributions
ngth-0.018
Avg-0.016
aund-0.016
esson-0.016
enegger-0.016
Token its
Token ID663
Feature activation+1.34e-01

Pos logit contributions
ngth+0.233
Avg+0.215
aund+0.215
esson+0.214
enegger+0.212
Neg logit contributions
 capped-0.174
mil-0.165
 primates-0.163
 kidnapping-0.158
 diving-0.157
Token students
Token ID2444
Feature activation-3.52e-01

Pos logit contributions
 capped+0.221
 primates+0.207
mil+0.207
 kidnapping+0.201
 diving+0.200
Neg logit contributions
ngth-0.285
Avg-0.262
aund-0.261
esson-0.261
enegger-0.257
Token admitted
Token ID6848
Feature activation-1.45e-01

Pos logit contributions
ngth+0.770
Avg+0.716
aund+0.715
esson+0.710
enegger+0.708
Neg logit contributions
 capped-0.539
mil-0.513
 primates-0.507
 kidnapping-0.490
 disadvant-0.490
Token into
Token ID656
Feature activation-1.30e-01

Pos logit contributions
ngth+0.309
Avg+0.285
aund+0.284
esson+0.283
enegger+0.281
Neg logit contributions
 capped-0.237
mil-0.224
 primates-0.222
 kidnapping-0.216
 diving-0.214
Token Ivy
Token ID27386
Feature activation+1.46e+00

Pos logit contributions
ngth+0.266
Avg+0.245
aund+0.244
esson+0.243
enegger+0.240
Neg logit contributions
 capped-0.224
mil-0.212
 primates-0.210
 kidnapping-0.205
 diving-0.204
Token League
Token ID4041
Feature activation+5.20e-02

Pos logit contributions
 capped+0.716
mil+0.566
 primates+0.541
 diving+0.500
 kidnapping+0.466
Neg logit contributions
ngth-4.831
aund-4.547
enegger-4.535
esson-4.520
Avg-4.490
Token universities
Token ID11155
Feature activation-1.66e-03

Pos logit contributions
 capped+0.077
mil+0.073
 primates+0.072
 kidnapping+0.070
 diving+0.069
Neg logit contributions
ngth-0.118
Avg-0.109
aund-0.109
esson-0.109
enegger-0.108
Token.
Token ID13
Feature activation+3.78e-02

Pos logit contributions
ngth+0.003
Avg+0.003
aund+0.003
esson+0.003
enegger+0.003
Neg logit contributions
 capped-0.003
mil-0.003
 primates-0.003
 kidnapping-0.003
 diving-0.003
Token Photo
Token ID5555
Feature activation-3.41e-01

Pos logit contributions
 capped+0.060
mil+0.057
 primates+0.056
 kidnapping+0.054
 diving+0.054
Neg logit contributions
ngth-0.082
aund-0.075
Avg-0.075
esson-0.075
enegger-0.075
Token:
Token ID25
Feature activation+2.31e-01

Pos logit contributions
ngth+0.680
Avg+0.621
aund+0.620
esson+0.617
enegger+0.613
Neg logit contributions
 capped-0.604
mil-0.574
 primates-0.565
 kidnapping-0.553
 diving-0.549
Token Armenian
Token ID29388
Feature activation-3.41e-01

Pos logit contributions
 capped+0.035
mil+0.033
 primates+0.033
 kidnapping+0.033
 diving+0.032
Neg logit contributions
ngth-0.042
Avg-0.039
aund-0.039
esson-0.038
enegger-0.038
Token nation
Token ID3277
Feature activation+1.43e-01

Pos logit contributions
ngth+0.723
Avg+0.668
aund+0.665
esson+0.662
enegger+0.659
Neg logit contributions
 capped-0.556
mil-0.527
 primates-0.522
 kidnapping-0.508
 diving-0.504
Token
Token ID447
Feature activation+3.18e-01

Pos logit contributions
 capped+0.257
mil+0.244
 primates+0.241
 kidnapping+0.237
 diving+0.235
Neg logit contributions
ngth-0.282
Avg-0.259
aund-0.257
enegger-0.257
esson-0.254
Token
Token ID247
Feature activation+2.24e-01

Pos logit contributions
 capped+0.414
mil+0.387
 primates+0.381
 kidnapping+0.368
 diving+0.364
Neg logit contributions
ngth-0.780
Avg-0.727
aund-0.725
esson-0.723
enegger-0.720
Tokens
Token ID82
Feature activation+1.33e-01

Pos logit contributions
 capped+0.375
mil+0.358
 primates+0.351
 kidnapping+0.344
respond+0.341
Neg logit contributions
ngth-0.465
Avg-0.429
aund-0.428
enegger-0.426
esson-0.425
Token survival
Token ID9441
Feature activation+1.46e+00

Pos logit contributions
 capped+0.221
mil+0.209
 primates+0.208
 kidnapping+0.203
 diving+0.201
Neg logit contributions
ngth-0.281
Avg-0.259
aund-0.258
esson-0.256
enegger-0.255
Token.
Token ID13
Feature activation+1.74e-01

Pos logit contributions
 capped+2.471
mil+2.354
 primates+2.307
 kidnapping+2.251
 diving+2.244
Neg logit contributions
ngth-2.962
enegger-2.730
Avg-2.724
aund-2.688
esson-2.688
Token
Token ID198
Feature activation+2.23e-01

Pos logit contributions
 capped+0.272
mil+0.260
 primates+0.252
 kidnapping+0.247
respond+0.245
Neg logit contributions
ngth-0.382
aund-0.351
esson-0.351
enegger-0.349
Avg-0.349
Token
Token ID198
Feature activation+1.62e-01

Pos logit contributions
 capped+0.352
mil+0.334
 primates+0.328
 kidnapping+0.319
respond+0.317
Neg logit contributions
ngth-0.488
aund-0.447
Avg-0.447
esson-0.445
enegger-0.443
TokenThat
Token ID2504
Feature activation+4.72e-01

Pos logit contributions
 capped+0.271
mil+0.262
 primates+0.253
respond+0.249
 kidnapping+0.249
Neg logit contributions
ngth-0.337
aund-0.306
enegger-0.306
esson-0.306
Avg-0.303
Token
Token ID447
Feature activation+2.43e-01

Pos logit contributions
 capped+0.855
mil+0.814
 primates+0.803
 kidnapping+0.795
 diving+0.780
Neg logit contributions
ngth-0.925
aund-0.842
enegger-0.842
Avg-0.838
esson-0.830
Token said
Token ID531
Feature activation-1.47e-01

Pos logit contributions
ngth+1.159
Avg+1.079
aund+1.075
esson+1.074
enegger+1.063
Neg logit contributions
 capped-0.931
mil-0.888
 primates-0.878
 kidnapping-0.860
 disadvant-0.852
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
ngth+0.272
Avg+0.248
aund+0.247
esson+0.246
enegger+0.243
Neg logit contributions
 capped-0.281
mil-0.268
 primates-0.266
 kidnapping-0.259
 diving-0.258
Token
Token ID198
Feature activation-4.52e-03

Pos logit contributions
ngth+0.293
Avg+0.271
aund+0.270
esson+0.269
enegger+0.266
Neg logit contributions
 capped-0.220
mil-0.208
 primates-0.206
 kidnapping-0.201
 diving-0.199
Token
Token ID198
Feature activation-2.63e-02

Pos logit contributions
ngth+0.010
Avg+0.009
aund+0.009
esson+0.009
enegger+0.009
Neg logit contributions
 capped-0.007
mil-0.007
 primates-0.007
 kidnapping-0.007
 disadvant-0.007
TokenG
Token ID38
Feature activation-3.28e-01

Pos logit contributions
ngth+0.056
aund+0.051
esson+0.051
Avg+0.051
enegger+0.051
Neg logit contributions
 capped-0.043
mil-0.041
 primates-0.040
 kidnapping-0.039
respond-0.039
Tokenob
Token ID672
Feature activation+1.45e+00

Pos logit contributions
ngth+0.815
Avg+0.757
aund+0.754
esson+0.749
enegger+0.746
Neg logit contributions
 capped-0.419
mil-0.393
 primates-0.386
respond-0.373
 kidnapping-0.367
Tokenler
Token ID1754
Feature activation-3.67e-01

Pos logit contributions
 capped+1.503
mil+1.406
 primates+1.318
respond+1.308
rub+1.279
Neg logit contributions
ngth-3.999
Avg-3.683
aund-3.652
enegger-3.643
acebook-3.596
Token
Token ID447
Feature activation-9.49e-02

Pos logit contributions
ngth+0.726
Avg+0.669
aund+0.667
esson+0.665
enegger+0.658
Neg logit contributions
 capped-0.644
mil-0.615
 primates-0.608
 kidnapping-0.594
 disadvant-0.590
Token
Token ID247
Feature activation-6.83e-02

Pos logit contributions
ngth+0.233
Avg+0.218
aund+0.217
esson+0.217
enegger+0.215
Neg logit contributions
 capped-0.123
mil-0.114
 primates-0.113
 kidnapping-0.109
 diving-0.108
Tokens
Token ID82
Feature activation-1.34e-01

Pos logit contributions
ngth+0.143
Avg+0.132
aund+0.132
esson+0.131
enegger+0.130
Neg logit contributions
 capped-0.113
mil-0.107
 primates-0.106
 kidnapping-0.103
 diving-0.102
Token team
Token ID1074
Feature activation+1.81e-02

Pos logit contributions
ngth+0.295
Avg+0.274
aund+0.273
esson+0.272
enegger+0.269
Neg logit contributions
 capped-0.206
mil-0.194
 primates-0.192
 kidnapping-0.187
 diving-0.186
Token,
Token ID11
Feature activation-1.44e-02

Pos logit contributions
ngth+0.211
Avg+0.195
aund+0.194
esson+0.193
enegger+0.192
Neg logit contributions
 capped-0.160
mil-0.151
 primates-0.149
 kidnapping-0.145
 diving-0.144
Token St
Token ID520
Feature activation+1.21e-01

Pos logit contributions
ngth+0.029
Avg+0.027
aund+0.027
esson+0.026
enegger+0.026
Neg logit contributions
 capped-0.025
mil-0.024
 primates-0.024
 kidnapping-0.023
 diving-0.023
Tokené
Token ID2634
Feature activation+1.22e-01

Pos logit contributions
 capped+0.298
mil+0.293
 primates+0.285
respond+0.281
 hon+0.279
Neg logit contributions
ngth-0.158
Avg-0.135
enegger-0.134
esson-0.130
aund-0.127
Tokenph
Token ID746
Feature activation+7.54e-01

Pos logit contributions
 capped+0.201
mil+0.195
 primates+0.188
 kidnapping+0.182
respond+0.182
Neg logit contributions
ngth-0.257
Avg-0.240
esson-0.235
enegger-0.235
aund-0.234
Tokenane
Token ID1531
Feature activation+7.71e-01

Pos logit contributions
 capped+0.857
mil+0.812
 primates+0.763
respond+0.752
 kidnapping+0.742
Neg logit contributions
ngth-2.001
enegger-1.843
Avg-1.842
esson-1.839
aund-1.824
Token Dion
Token ID29628
Feature activation+1.45e+00

Pos logit contributions
 capped+1.101
mil+1.050
 primates+1.019
 Levin+0.993
 diving+0.986
Neg logit contributions
ngth-1.860
enegger-1.684
esson-1.680
Avg-1.671
aund-1.667
Token,
Token ID11
Feature activation+1.21e-01

Pos logit contributions
 capped+2.495
mil+2.376
 primates+2.285
 kidnapping+2.230
 diving+2.226
Neg logit contributions
ngth-2.976
Avg-2.703
esson-2.664
aund-2.663
enegger-2.615
Token had
Token ID550
Feature activation-1.34e-01

Pos logit contributions
 capped+0.205
mil+0.194
 primates+0.192
 kidnapping+0.187
 diving+0.186
Neg logit contributions
ngth-0.251
Avg-0.231
aund-0.230
esson-0.229
enegger-0.227
Token already
Token ID1541
Feature activation-2.98e-02

Pos logit contributions
ngth+0.294
Avg+0.273
aund+0.272
esson+0.271
enegger+0.269
Neg logit contributions
 capped-0.208
mil-0.197
 primates-0.195
 kidnapping-0.189
 diving-0.188
Token announced
Token ID3414
Feature activation-2.10e-01

Pos logit contributions
ngth+0.064
Avg+0.060
aund+0.059
esson+0.059
enegger+0.059
Neg logit contributions
 capped-0.047
mil-0.045
 primates-0.044
 kidnapping-0.043
 diving-0.043
Token his
Token ID465
Feature activation+1.27e-01

Pos logit contributions
ngth+0.462
Avg+0.427
aund+0.424
esson+0.423
enegger+0.420
Neg logit contributions
 capped-0.331
mil-0.311
 primates-0.308
 kidnapping-0.300
 diving-0.298
Token who
Token ID508
Feature activation+3.81e-01

Pos logit contributions
 capped+0.186
mil+0.176
 primates+0.174
 kidnapping+0.169
 diving+0.168
Neg logit contributions
ngth-0.232
Avg-0.214
aund-0.213
esson-0.213
enegger-0.211
Token has
Token ID468
Feature activation-8.84e-03

Pos logit contributions
 capped+0.672
mil+0.637
 primates+0.631
 kidnapping+0.614
 diving+0.612
Neg logit contributions
ngth-0.761
Avg-0.699
aund-0.696
esson-0.691
enegger-0.687
Token ever
Token ID1683
Feature activation-2.58e-01

Pos logit contributions
ngth+0.019
Avg+0.018
aund+0.018
esson+0.018
enegger+0.018
Neg logit contributions
 capped-0.014
mil-0.013
 primates-0.013
 kidnapping-0.013
 diving-0.012
Token taken
Token ID2077
Feature activation+9.80e-02

Pos logit contributions
ngth+0.574
Avg+0.533
aund+0.532
esson+0.530
enegger+0.526
Neg logit contributions
 capped-0.389
mil-0.368
 primates-0.363
 kidnapping-0.352
 disadvant-0.350
Token a
Token ID257
Feature activation+1.35e-01

Pos logit contributions
 capped+0.159
mil+0.150
 primates+0.149
 kidnapping+0.145
 diving+0.145
Neg logit contributions
ngth-0.210
Avg-0.193
aund-0.192
esson-0.192
enegger-0.191
Token CPR
Token ID42920
Feature activation+1.44e+00

Pos logit contributions
 capped+0.204
mil+0.192
 primates+0.190
 kidnapping+0.185
 diving+0.184
Neg logit contributions
ngth-0.303
Avg-0.280
aund-0.279
esson-0.279
enegger-0.277
Token class
Token ID1398
Feature activation-2.64e-01

Pos logit contributions
 capped+2.201
mil+2.076
 diving+2.032
 primates+2.030
 kidnapping+2.009
Neg logit contributions
ngth-3.230
Avg-2.955
aund-2.898
enegger-2.880
esson-2.877
Token,
Token ID11
Feature activation-2.77e-02

Pos logit contributions
ngth+0.528
Avg+0.485
aund+0.482
esson+0.479
enegger+0.475
Neg logit contributions
 capped-0.467
mil-0.442
 primates-0.438
 kidnapping-0.428
 diving-0.427
Token you
Token ID345
Feature activation-3.63e-02

Pos logit contributions
ngth+0.062
Avg+0.057
aund+0.057
esson+0.057
enegger+0.057
Neg logit contributions
 capped-0.043
mil-0.040
 primates-0.039
 diving-0.039
 kidnapping-0.038
Token know
Token ID760
Feature activation+1.22e-01

Pos logit contributions
ngth+0.068
aund+0.063
Avg+0.063
esson+0.062
enegger+0.062
Neg logit contributions
 capped-0.068
mil-0.065
 primates-0.064
 diving-0.063
 kidnapping-0.062
Token that
Token ID326
Feature activation+8.51e-02

Pos logit contributions
 capped+0.207
mil+0.195
 primates+0.194
 kidnapping+0.190
 diving+0.189
Neg logit contributions
ngth-0.253
Avg-0.232
aund-0.231
esson-0.230
enegger-0.230
Token,
Token ID11
Feature activation-1.87e-03

Pos logit contributions
ngth+0.006
Avg+0.005
aund+0.005
esson+0.005
enegger+0.005
Neg logit contributions
 capped-0.005
mil-0.005
 primates-0.005
 kidnapping-0.005
 diving-0.005
Token an
Token ID281
Feature activation-1.79e-01

Pos logit contributions
ngth+0.004
Avg+0.004
aund+0.004
esson+0.004
enegger+0.004
Neg logit contributions
 capped-0.003
mil-0.003
 primates-0.003
 kidnapping-0.003
 diving-0.003
Token open
Token ID1280
Feature activation+1.99e-01

Pos logit contributions
ngth+0.386
aund+0.353
Avg+0.353
esson+0.350
enegger+0.349
Neg logit contributions
 capped-0.292
 primates-0.274
mil-0.273
 kidnapping-0.264
 diving-0.264
Token-
Token ID12
Feature activation+6.50e-02

Pos logit contributions
 capped+0.289
mil+0.269
 primates+0.267
 kidnapping+0.259
 diving+0.259
Neg logit contributions
ngth-0.465
Avg-0.430
aund-0.429
esson-0.428
enegger-0.425
Tokenworld
Token ID6894
Feature activation+5.28e-01

Pos logit contributions
 capped+0.116
mil+0.111
 primates+0.108
respond+0.106
 kidnapping+0.106
Neg logit contributions
ngth-0.129
aund-0.118
esson-0.117
Avg-0.117
enegger-0.117
Token survival
Token ID9441
Feature activation+1.44e+00

Pos logit contributions
 capped+0.675
mil+0.625
 primates+0.623
 kidnapping+0.602
 diving+0.600
Neg logit contributions
ngth-1.316
Avg-1.225
aund-1.222
esson-1.216
enegger-1.209
Token title
Token ID3670
Feature activation+4.29e-01

Pos logit contributions
 capped+1.385
 primates+1.245
mil+1.225
 diving+1.207
 kidnapping+1.190
Neg logit contributions
ngth-4.062
Avg-3.792
aund-3.789
enegger-3.769
esson-3.759
Token that
Token ID326
Feature activation+3.52e-02

Pos logit contributions
 capped+0.713
mil+0.672
 primates+0.666
 kidnapping+0.648
 diving+0.647
Neg logit contributions
ngth-0.904
Avg-0.834
aund-0.831
esson-0.826
enegger-0.822
Token is
Token ID318
Feature activation+6.49e-02

Pos logit contributions
 capped+0.062
mil+0.058
 primates+0.058
 kidnapping+0.056
 diving+0.056
Neg logit contributions
ngth-0.071
Avg-0.065
aund-0.065
esson-0.064
enegger-0.064
Token already
Token ID1541
Feature activation-1.65e-01

Pos logit contributions
 capped+0.104
mil+0.098
 primates+0.097
 kidnapping+0.094
 diving+0.094
Neg logit contributions
ngth-0.140
Avg-0.129
aund-0.129
esson-0.128
enegger-0.127
Token in
Token ID287
Feature activation+2.68e-02

Pos logit contributions
ngth+0.345
Avg+0.319
aund+0.318
esson+0.316
enegger+0.314
Neg logit contributions
 capped-0.274
mil-0.260
 primates-0.257
 kidnapping-0.250
 diving-0.249
Token metal
Token ID6147
Feature activation+2.86e-02

Pos logit contributions
ngth+0.459
Avg+0.424
aund+0.421
enegger+0.418
esson+0.417
Neg logit contributions
 capped-0.352
mil-0.331
 primates-0.328
 kidnapping-0.320
 diving-0.318
Token;
Token ID26
Feature activation-7.26e-01

Pos logit contributions
 capped+0.051
mil+0.048
 primates+0.047
 kidnapping+0.046
 diving+0.046
Neg logit contributions
ngth-0.057
Avg-0.053
aund-0.052
enegger-0.052
esson-0.052
Token its
Token ID663
Feature activation-1.10e-01

Pos logit contributions
ngth+1.534
Avg+1.418
aund+1.413
esson+1.407
enegger+1.395
Neg logit contributions
 capped-1.188
mil-1.125
 primates-1.113
 kidnapping-1.083
 diving-1.076
Token hard
Token ID1327
Feature activation+1.06e-01

Pos logit contributions
ngth+0.229
Avg+0.211
aund+0.210
esson+0.209
enegger+0.207
Neg logit contributions
 capped-0.187
mil-0.175
 primates-0.175
 kidnapping-0.170
 diving-0.170
Token all
Token ID477
Feature activation-2.07e-02

Pos logit contributions
 capped+0.187
mil+0.175
 primates+0.172
 kidnapping+0.170
 diving+0.170
Neg logit contributions
ngth-0.216
Avg-0.199
aund-0.198
esson-0.197
enegger-0.194
Tokenoys
Token ID19417
Feature activation+1.43e+00

Pos logit contributions
ngth+0.052
Avg+0.048
aund+0.048
esson+0.048
enegger+0.048
Neg logit contributions
 capped-0.026
mil-0.024
 primates-0.024
 diving-0.023
 kidnapping-0.023
Token destroyed
Token ID6572
Feature activation-1.49e-01

Pos logit contributions
 capped+2.667
 primates+2.454
mil+2.447
 diving+2.391
 disadvant+2.389
Neg logit contributions
ngth-2.759
Avg-2.548
aund-2.529
enegger-2.507
esson-2.481
Token dies
Token ID10564
Feature activation+6.02e-01

Pos logit contributions
ngth+0.318
Avg+0.295
aund+0.293
esson+0.292
enegger+0.290
Neg logit contributions
 capped-0.243
mil-0.228
 primates-0.228
 kidnapping-0.220
 diving-0.220
Token and
Token ID290
Feature activation+3.49e-02

Pos logit contributions
 capped+1.027
mil+0.960
 primates+0.931
 diving+0.921
 kidnapping+0.916
Neg logit contributions
ngth-1.241
Avg-1.176
enegger-1.140
aund-1.139
esson-1.127
Token machinery
Token ID20230
Feature activation-4.03e-01

Pos logit contributions
 capped+0.061
mil+0.057
 primates+0.057
 kidnapping+0.055
 diving+0.055
Neg logit contributions
ngth-0.072
Avg-0.066
aund-0.066
esson-0.065
enegger-0.065
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
ngth+0.793
Avg+0.727
aund+0.726
enegger+0.720
esson+0.719
Neg logit contributions
 capped-0.729
mil-0.687
 primates-0.681
 diving-0.666
 kidnapping-0.665
Tokenological
Token ID2770
Feature activation-1.84e-02

Pos logit contributions
 capped+0.378
mil+0.365
 primates+0.359
respond+0.352
 kidnapping+0.351
Neg logit contributions
ngth-0.389
Avg-0.361
aund-0.355
enegger-0.354
esson-0.351
Token profile
Token ID7034
Feature activation-2.05e-01

Pos logit contributions
ngth+0.038
Avg+0.035
aund+0.035
esson+0.035
enegger+0.034
Neg logit contributions
 capped-0.031
mil-0.029
 primates-0.029
 kidnapping-0.029
 diving-0.028
Token of
Token ID286
Feature activation+1.44e-01

Pos logit contributions
ngth+0.433
Avg+0.398
aund+0.397
esson+0.397
enegger+0.391
Neg logit contributions
 capped-0.336
mil-0.319
 primates-0.315
 kidnapping-0.307
 diving-0.305
Token N
Token ID399
Feature activation-1.78e-01

Pos logit contributions
 capped+0.227
 primates+0.213
mil+0.212
 kidnapping+0.205
 diving+0.203
Neg logit contributions
ngth-0.315
Avg-0.292
aund-0.292
esson-0.289
enegger-0.287
Token-
Token ID12
Feature activation+1.01e-01

Pos logit contributions
ngth+0.392
Avg+0.363
aund+0.361
enegger+0.358
esson+0.357
Neg logit contributions
 capped-0.275
mil-0.262
 primates-0.256
 kidnapping-0.248
respond-0.248
Tokencyclop
Token ID22873
Feature activation+1.43e+00

Pos logit contributions
 capped+0.160
mil+0.153
 primates+0.149
respond+0.145
 kidnapping+0.144
Neg logit contributions
ngth-0.218
Avg-0.201
aund-0.201
esson-0.200
enegger-0.200
Tokenrop
Token ID1773
Feature activation-3.66e-01

Pos logit contributions
 capped+1.815
mil+1.728
 primates+1.652
respond+1.612
 kidnapping+1.579
Neg logit contributions
ngth-3.529
Avg-3.264
aund-3.259
enegger-3.255
esson-3.247
Tokenyl
Token ID2645
Feature activation+6.61e-02

Pos logit contributions
ngth+0.608
aund+0.550
Avg+0.550
esson+0.547
enegger+0.537
Neg logit contributions
 capped-0.762
mil-0.728
 primates-0.725
 kidnapping-0.712
 diving-0.709
Tokenmethyl
Token ID43654
Feature activation+4.19e-01

Pos logit contributions
 capped+0.102
mil+0.096
 primates+0.095
 kidnapping+0.092
 diving+0.092
Neg logit contributions
ngth-0.146
Avg-0.136
aund-0.135
esson-0.135
enegger-0.134
Token-
Token ID12
Feature activation+3.50e-01

Pos logit contributions
 capped+0.646
mil+0.618
 primates+0.604
respond+0.582
 kidnapping+0.581
Neg logit contributions
ngth-0.930
Avg-0.859
esson-0.855
enegger-0.854
aund-0.848
Token7
Token ID22
Feature activation-1.15e-01

Pos logit contributions
 capped+0.610
mil+0.586
 primates+0.575
respond+0.559
 kidnapping+0.558
Neg logit contributions
ngth-0.704
Avg-0.643
esson-0.642
aund-0.640
enegger-0.637
Token priorities
Token ID15369
Feature activation+2.38e-01

Pos logit contributions
ngth+0.078
Avg+0.072
aund+0.071
esson+0.071
enegger+0.071
Neg logit contributions
 capped-0.057
mil-0.054
 primates-0.053
 kidnapping-0.052
 diving-0.052
Token:
Token ID25
Feature activation+1.38e-01

Pos logit contributions
 capped+0.414
mil+0.390
 primates+0.388
 kidnapping+0.378
 diving+0.377
Neg logit contributions
ngth-0.486
Avg-0.446
aund-0.442
esson-0.439
enegger-0.439
Token allegiance
Token ID25696
Feature activation+3.69e-01

Pos logit contributions
 capped+0.222
mil+0.209
 primates+0.207
 kidnapping+0.202
 diving+0.201
Neg logit contributions
ngth-0.297
Avg-0.274
aund-0.273
esson-0.272
enegger-0.270
Token to
Token ID284
Feature activation+5.55e-02

Pos logit contributions
 capped+0.562
mil+0.523
 primates+0.521
 kidnapping+0.504
 diving+0.500
Neg logit contributions
ngth-0.832
Avg-0.766
esson-0.766
aund-0.765
enegger-0.762
Token v
Token ID410
Feature activation+2.43e-01

Pos logit contributions
 capped+0.089
mil+0.084
 primates+0.083
 kidnapping+0.081
 diving+0.081
Neg logit contributions
ngth-0.120
Avg-0.110
aund-0.110
esson-0.109
enegger-0.108
Tokenoodoo
Token ID36038
Feature activation+1.43e+00

Pos logit contributions
 capped+0.454
mil+0.435
 primates+0.428
 kidnapping+0.416
respond+0.416
Neg logit contributions
ngth-0.462
Avg-0.417
aund-0.409
enegger-0.407
esson-0.406
Token tr
Token ID491
Feature activation+2.97e-01

Pos logit contributions
 capped+2.404
 primates+2.285
mil+2.227
 kidnapping+2.194
 diving+2.177
Neg logit contributions
ngth-3.035
Avg-2.760
enegger-2.735
aund-2.731
esson-2.702
Tokenumps
Token ID8142
Feature activation-1.08e+00

Pos logit contributions
 capped+0.491
mil+0.482
 primates+0.465
respond+0.451
 kidnapping+0.447
Neg logit contributions
ngth-0.620
enegger-0.569
Avg-0.568
esson-0.557
aund-0.556
Token all
Token ID477
Feature activation-2.50e-02

Pos logit contributions
ngth+2.313
Avg+2.143
aund+2.138
esson+2.137
enegger+2.109
Neg logit contributions
 capped-1.701
mil-1.616
 primates-1.585
respond-1.542
 kidnapping-1.540
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
ngth+0.054
Avg+0.050
aund+0.050
esson+0.049
enegger+0.049
Neg logit contributions
 capped-0.040
 primates-0.038
mil-0.038
 kidnapping-0.037
 diving-0.037
Token\n
Token ID198
Feature activation+1.37e-02

Pos logit contributions
ngth+0.028
Avg+0.025
aund+0.025
esson+0.025
enegger+0.025
Neg logit contributions
 capped-0.020
mil-0.019
 primates-0.019
 kidnapping-0.018
 diving-0.018
Token of
Token ID286
Feature activation+7.94e-02

Pos logit contributions
ngth+0.729
aund+0.672
Avg+0.672
esson+0.671
enegger+0.658
Neg logit contributions
 capped-0.578
mil-0.547
 primates-0.543
 kidnapping-0.524
 diving-0.524
Token God
Token ID1793
Feature activation+1.11e-01

Pos logit contributions
 capped+0.130
 primates+0.122
mil+0.122
 kidnapping+0.119
 diving+0.118
Neg logit contributions
ngth-0.170
Avg-0.156
aund-0.155
esson-0.155
enegger-0.154
Token,"
Token ID553
Feature activation-1.90e-01

Pos logit contributions
 capped+0.188
mil+0.176
 primates+0.175
 kidnapping+0.171
 diving+0.170
Neg logit contributions
ngth-0.234
Avg-0.215
enegger-0.213
aund-0.213
esson-0.211
Token reads
Token ID9743
Feature activation-4.39e-01

Pos logit contributions
ngth+0.417
Avg+0.383
aund+0.383
esson+0.381
enegger+0.378
Neg logit contributions
 capped-0.300
mil-0.283
 primates-0.280
 kidnapping-0.272
 diving-0.270
Token the
Token ID262
Feature activation+1.12e-01

Pos logit contributions
ngth+0.953
Avg+0.878
aund+0.876
esson+0.871
enegger+0.862
Neg logit contributions
 capped-0.702
mil-0.663
 primates-0.655
 kidnapping-0.637
 diving-0.634
Token ob
Token ID909
Feature activation+1.42e+00

Pos logit contributions
 capped+0.154
mil+0.143
 primates+0.142
 kidnapping+0.138
 diving+0.137
Neg logit contributions
ngth-0.269
Avg-0.249
aund-0.249
esson-0.248
enegger-0.245
Tokenit
Token ID270
Feature activation+3.75e-02

Pos logit contributions
 capped+1.902
mil+1.814
 primates+1.744
respond+1.716
rub+1.684
Neg logit contributions
ngth-3.398
Avg-3.163
enegger-3.126
aund-3.119
esson-3.106
Token published
Token ID3199
Feature activation+4.04e-01

Pos logit contributions
 capped+0.082
mil+0.078
 primates+0.077
 kidnapping+0.076
 diving+0.076
Neg logit contributions
ngth-0.060
Avg-0.053
aund-0.053
enegger-0.052
esson-0.052
Token in
Token ID287
Feature activation+6.15e-02

Pos logit contributions
 capped+0.719
mil+0.678
 primates+0.672
 kidnapping+0.658
 diving+0.655
Neg logit contributions
ngth-0.809
Avg-0.739
aund-0.738
esson-0.735
enegger-0.727
Token the
Token ID262
Feature activation+9.51e-02

Pos logit contributions
 capped+0.100
mil+0.094
 primates+0.093
 kidnapping+0.091
 diving+0.090
Neg logit contributions
ngth-0.132
Avg-0.121
aund-0.121
esson-0.120
enegger-0.119
Token Richmond
Token ID16940
Feature activation+4.05e-01

Pos logit contributions
 capped+0.173
mil+0.164
 primates+0.163
 kidnapping+0.159
 diving+0.159
Neg logit contributions
ngth-0.186
Avg-0.169
aund-0.169
esson-0.169
enegger-0.167
Token this
Token ID428
Feature activation+4.28e-02

Pos logit contributions
 capped+0.029
mil+0.027
 primates+0.027
 kidnapping+0.026
 diving+0.026
Neg logit contributions
ngth-0.041
Avg-0.038
aund-0.038
esson-0.038
enegger-0.038
Token challenge
Token ID4427
Feature activation+5.58e-01

Pos logit contributions
 capped+0.065
mil+0.061
 primates+0.061
 kidnapping+0.059
 diving+0.059
Neg logit contributions
ngth-0.096
Avg-0.089
aund-0.089
esson-0.089
enegger-0.088
Token will
Token ID481
Feature activation+2.43e-02

Pos logit contributions
 capped+1.011
mil+0.952
 primates+0.947
 kidnapping+0.927
 diving+0.922
Neg logit contributions
ngth-1.098
Avg-1.007
aund-1.003
esson-0.998
enegger-0.983
Token reference
Token ID4941
Feature activation+2.50e-01

Pos logit contributions
 capped+0.044
mil+0.042
 primates+0.042
 kidnapping+0.041
 diving+0.041
Neg logit contributions
ngth-0.047
Avg-0.043
aund-0.043
esson-0.043
enegger-0.042
Token the
Token ID262
Feature activation+2.90e-03

Pos logit contributions
 capped+0.398
mil+0.372
 primates+0.371
 kidnapping+0.363
 diving+0.357
Neg logit contributions
ngth-0.551
Avg-0.506
esson-0.504
aund-0.504
enegger-0.500
Token relevant
Token ID5981
Feature activation+1.41e+00

Pos logit contributions
 capped+0.005
mil+0.005
 primates+0.005
 kidnapping+0.004
 diving+0.004
Neg logit contributions
ngth-0.006
Avg-0.006
aund-0.006
esson-0.006
enegger-0.005
Token passage
Token ID10066
Feature activation+1.59e-01

Pos logit contributions
 capped+2.360
 primates+2.212
mil+2.201
 kidnapping+2.184
 diving+2.127
Neg logit contributions
ngth-3.053
Avg-2.760
esson-2.749
aund-2.744
enegger-2.710
Token from
Token ID422
Feature activation+4.26e-02

Pos logit contributions
 capped+0.274
mil+0.257
 primates+0.256
 kidnapping+0.249
 diving+0.248
Neg logit contributions
ngth-0.327
Avg-0.301
aund-0.299
esson-0.298
enegger-0.296
Token the
Token ID262
Feature activation+3.01e-02

Pos logit contributions
 capped+0.070
mil+0.066
 primates+0.065
 kidnapping+0.063
 diving+0.063
Neg logit contributions
ngth-0.091
Avg-0.084
aund-0.084
esson-0.083
enegger-0.083
Token United
Token ID1578
Feature activation-2.29e-01

Pos logit contributions
 capped+0.053
mil+0.051
 primates+0.050
 kidnapping+0.049
 diving+0.049
Neg logit contributions
ngth-0.059
Avg-0.055
aund-0.054
esson-0.054
enegger-0.054
Token States
Token ID1829
Feature activation+5.40e-01

Pos logit contributions
ngth+0.607
Avg+0.573
aund+0.572
esson+0.570
enegger+0.564
Neg logit contributions
 capped-0.247
mil-0.227
 primates-0.223
 kidnapping-0.215
 diving-0.213
Token that
Token ID326
Feature activation-9.97e-02

Pos logit contributions
ngth+0.210
Avg+0.194
aund+0.193
esson+0.192
enegger+0.190
Neg logit contributions
 capped-0.190
mil-0.181
 primates-0.179
 kidnapping-0.175
 diving-0.173
Token the
Token ID262
Feature activation-9.71e-05

Pos logit contributions
ngth+0.227
Avg+0.211
aund+0.210
esson+0.209
enegger+0.208
Neg logit contributions
 capped-0.147
mil-0.138
 primates-0.137
 kidnapping-0.133
 diving-0.132
Token connection
Token ID4637
Feature activation+1.67e-01

Pos logit contributions
ngth+0.000
Avg+0.000
aund+0.000
esson+0.000
enegger+0.000
Neg logit contributions
 capped-0.000
mil-0.000
 primates-0.000
 kidnapping-0.000
 diving-0.000
Token between
Token ID1022
Feature activation+8.80e-01

Pos logit contributions
 capped+0.294
mil+0.279
 primates+0.276
 kidnapping+0.270
 diving+0.268
Neg logit contributions
ngth-0.333
Avg-0.305
aund-0.304
esson-0.303
enegger-0.301
Token Russian
Token ID3394
Feature activation+3.22e-01

Pos logit contributions
 capped+1.269
mil+1.186
 primates+1.183
 kidnapping+1.154
 diving+1.146
Neg logit contributions
ngth-2.047
Avg-1.898
aund-1.888
esson-1.875
enegger-1.872
Token hackers
Token ID17110
Feature activation+1.41e+00

Pos logit contributions
 capped+0.438
mil+0.409
 primates+0.404
 kidnapping+0.393
 diving+0.390
Neg logit contributions
ngth-0.773
Avg-0.722
aund-0.718
esson-0.715
enegger-0.713
Token and
Token ID290
Feature activation-5.25e-02

Pos logit contributions
 capped+2.465
mil+2.320
 primates+2.282
 kidnapping+2.278
 diving+2.257
Neg logit contributions
ngth-2.784
Avg-2.611
aund-2.568
enegger-2.549
esson-2.543
Token the
Token ID262
Feature activation-9.99e-02

Pos logit contributions
ngth+0.112
Avg+0.103
aund+0.103
esson+0.103
enegger+0.102
Neg logit contributions
 capped-0.085
mil-0.080
 primates-0.080
 kidnapping-0.077
 diving-0.077
Token DNC
Token ID17824
Feature activation-1.59e-01

Pos logit contributions
ngth+0.206
Avg+0.190
aund+0.190
esson+0.189
enegger+0.187
Neg logit contributions
 capped-0.168
mil-0.160
 primates-0.158
 kidnapping-0.154
 diving-0.153
Token leak
Token ID13044
Feature activation+3.30e-02

Pos logit contributions
ngth+0.368
Avg+0.342
aund+0.341
esson+0.339
enegger+0.338
Neg logit contributions
 capped-0.232
mil-0.217
 primates-0.214
 kidnapping-0.209
 diving-0.207
Token isn
Token ID2125
Feature activation-1.02e-01

Pos logit contributions
 capped+0.059
mil+0.055
 primates+0.055
 kidnapping+0.054
 diving+0.053
Neg logit contributions
ngth-0.066
Avg-0.061
aund-0.060
esson-0.060
enegger-0.060
Token awful
Token ID12659
Feature activation-2.76e-01

Pos logit contributions
ngth+0.045
Avg+0.042
aund+0.041
esson+0.041
enegger+0.041
Neg logit contributions
 capped-0.036
 primates-0.034
mil-0.034
 kidnapping-0.034
 diving-0.033
Token moment
Token ID2589
Feature activation+1.81e-01

Pos logit contributions
ngth+0.561
Avg+0.517
aund+0.515
esson+0.510
enegger+0.509
Neg logit contributions
 capped-0.477
 primates-0.452
mil-0.452
 kidnapping-0.444
 diving-0.441
Token when
Token ID618
Feature activation-1.13e-01

Pos logit contributions
 capped+0.287
mil+0.269
 primates+0.268
 kidnapping+0.263
 diving+0.261
Neg logit contributions
ngth-0.395
Avg-0.366
aund-0.363
esson-0.361
enegger-0.360
Token you
Token ID345
Feature activation+3.17e-02

Pos logit contributions
ngth+0.265
Avg+0.246
aund+0.245
esson+0.244
enegger+0.243
Neg logit contributions
 capped-0.160
mil-0.150
 primates-0.148
 kidnapping-0.144
 diving-0.143
Token administer
Token ID21766
Feature activation+1.01e+00

Pos logit contributions
 capped+0.059
mil+0.055
 primates+0.055
 kidnapping+0.054
 diving+0.054
Neg logit contributions
ngth-0.061
aund-0.056
Avg-0.056
esson-0.055
enegger-0.055
Token CPR
Token ID42920
Feature activation+1.40e+00

Pos logit contributions
 capped+1.416
 primates+1.312
mil+1.302
 kidnapping+1.267
 diving+1.266
Neg logit contributions
ngth-2.444
Avg-2.251
aund-2.240
enegger-2.235
esson-2.219
Token for
Token ID329
Feature activation+1.09e-01

Pos logit contributions
 capped+2.264
mil+2.132
 primates+2.087
 kidnapping+2.072
 diving+2.071
Neg logit contributions
ngth-3.015
Avg-2.758
esson-2.726
aund-2.710
enegger-2.697
Token the
Token ID262
Feature activation-3.06e-03

Pos logit contributions
 capped+0.171
mil+0.161
 primates+0.160
 kidnapping+0.156
 diving+0.155
Neg logit contributions
ngth-0.239
Avg-0.219
aund-0.219
esson-0.218
enegger-0.216
Token first
Token ID717
Feature activation+5.04e-02

Pos logit contributions
ngth+0.007
Avg+0.007
aund+0.007
esson+0.006
enegger+0.006
Neg logit contributions
 capped-0.004
mil-0.004
 primates-0.004
 kidnapping-0.004
 diving-0.004
Token time
Token ID640
Feature activation+1.16e-01

Pos logit contributions
 capped+0.064
mil+0.059
 primates+0.058
 kidnapping+0.056
 diving+0.056
Neg logit contributions
ngth-0.126
Avg-0.118
aund-0.117
esson-0.117
enegger-0.116
Token,
Token ID11
Feature activation-2.26e-03

Pos logit contributions
 capped+0.206
mil+0.195
 primates+0.192
 kidnapping+0.188
 diving+0.188
Neg logit contributions
ngth-0.230
Avg-0.211
aund-0.210
esson-0.209
enegger-0.206
Token the
Token ID262
Feature activation-1.73e-01

Pos logit contributions
ngth+0.603
Avg+0.556
aund+0.554
esson+0.553
enegger+0.548
Neg logit contributions
 capped-0.476
mil-0.451
 primates-0.446
 kidnapping-0.434
 diving-0.432
Token ends
Token ID5645
Feature activation-5.58e-01

Pos logit contributions
ngth+0.393
Avg+0.365
aund+0.364
esson+0.363
enegger+0.360
Neg logit contributions
 capped-0.255
mil-0.240
 primates-0.237
 kidnapping-0.230
 diving-0.228
Token of
Token ID286
Feature activation-8.76e-02

Pos logit contributions
ngth+1.153
Avg+1.056
aund+1.055
esson+1.054
enegger+1.036
Neg logit contributions
 capped-0.924
mil-0.883
 primates-0.875
 kidnapping-0.852
 disadvant-0.846
Token their
Token ID511
Feature activation-2.11e-01

Pos logit contributions
ngth+0.171
Avg+0.157
aund+0.156
esson+0.156
enegger+0.154
Neg logit contributions
 capped-0.160
mil-0.151
 primates-0.150
 kidnapping-0.146
 diving-0.146
Token tr
Token ID491
Feature activation+1.17e-01

Pos logit contributions
ngth+0.460
Avg+0.425
aund+0.424
esson+0.423
enegger+0.419
Neg logit contributions
 capped-0.335
mil-0.315
 primates-0.313
 kidnapping-0.304
 diving-0.303
Tokenunks
Token ID14125
Feature activation-1.74e+00

Pos logit contributions
 capped+0.216
mil+0.208
 primates+0.204
respond+0.199
 kidnapping+0.198
Neg logit contributions
ngth-0.222
Avg-0.202
aund-0.200
esson-0.200
enegger-0.199
Token.
Token ID13
Feature activation-3.52e-02

Pos logit contributions
ngth+3.165
esson+2.869
Avg+2.867
aund+2.866
enegger+2.800
Neg logit contributions
 capped-3.197
mil-3.089
 primates-3.055
 kidnapping-3.000
respond-2.986
Token Ly
Token ID9334
Feature activation-2.11e-01

Pos logit contributions
ngth+0.076
Avg+0.070
aund+0.070
esson+0.070
enegger+0.069
Neg logit contributions
 capped-0.057
mil-0.054
 primates-0.053
 kidnapping-0.051
 diving-0.051
Tokenuba
Token ID22013
Feature activation-1.97e-01

Pos logit contributions
ngth+0.357
Avg+0.323
aund+0.322
esson+0.320
enegger+0.317
Neg logit contributions
 capped-0.433
mil-0.414
 primates-0.411
 kidnapping-0.402
 diving-0.400
Token's
Token ID338
Feature activation+3.03e-02

Pos logit contributions
ngth+0.362
Avg+0.330
aund+0.329
esson+0.327
enegger+0.324
Neg logit contributions
 capped-0.375
mil-0.358
 primates-0.355
 kidnapping-0.346
 diving-0.344
Token were
Token ID547
Feature activation-2.07e-01

Pos logit contributions
 capped+0.047
mil+0.044
 primates+0.044
 kidnapping+0.042
 diving+0.042
Neg logit contributions
ngth-0.068
Avg-0.063
aund-0.063
esson-0.062
enegger-0.062
Tokenateurs
Token ID45211
Feature activation-1.37e-01

Pos logit contributions
ngth+0.426
Avg+0.391
aund+0.384
esson+0.384
enegger+0.382
Neg logit contributions
 capped-0.401
mil-0.386
 primates-0.382
 kidnapping-0.370
respond-0.369
Token cher
Token ID21382
Feature activation-8.39e-02

Pos logit contributions
ngth+0.290
Avg+0.267
aund+0.265
esson+0.264
enegger+0.262
Neg logit contributions
 capped-0.225
mil-0.213
 primates-0.212
 kidnapping-0.206
 diving-0.204
Tokencher
Token ID2044
Feature activation-1.95e-01

Pos logit contributions
ngth+0.186
Avg+0.173
aund+0.172
esson+0.171
enegger+0.170
Neg logit contributions
 capped-0.128
mil-0.122
 primates-0.120
 kidnapping-0.116
respond-0.115
Tokenont
Token ID756
Feature activation-3.50e-01

Pos logit contributions
ngth+0.404
Avg+0.371
aund+0.368
esson+0.367
enegger+0.364
Neg logit contributions
 capped-0.332
mil-0.316
 primates-0.313
 kidnapping-0.303
 diving-0.301
Token un
Token ID555
Feature activation-3.98e-01

Pos logit contributions
ngth+0.731
Avg+0.672
aund+0.666
esson+0.664
enegger+0.661
Neg logit contributions
 capped-0.586
mil-0.555
 primates-0.550
 kidnapping-0.536
 diving-0.532
Token comprom
Token ID8382
Feature activation-1.65e+00

Pos logit contributions
ngth+0.900
Avg+0.835
aund+0.830
esson+0.827
enegger+0.823
Neg logit contributions
 capped-0.597
mil-0.563
 primates-0.556
 kidnapping-0.539
 diving-0.535
Tokenis
Token ID271
Feature activation-2.98e-01

Pos logit contributions
ngth+3.359
aund+3.120
Avg+3.104
esson+3.100
enegger+3.052
Neg logit contributions
 capped-2.822
mil-2.664
 primates-2.656
 kidnapping-2.589
 diving-2.571
Token ent
Token ID920
Feature activation-3.59e-01

Pos logit contributions
ngth+0.555
Avg+0.507
aund+0.501
esson+0.499
enegger+0.494
Neg logit contributions
 capped-0.563
mil-0.539
 primates-0.534
 kidnapping-0.520
respond-0.517
Tokenre
Token ID260
Feature activation-7.77e-02

Pos logit contributions
ngth+0.891
Avg+0.833
aund+0.827
esson+0.824
enegger+0.820
Neg logit contributions
 capped-0.458
mil-0.428
 primates-0.421
 kidnapping-0.406
respond-0.403
Token t
Token ID256
Feature activation-1.52e-01

Pos logit contributions
ngth+0.168
Avg+0.156
aund+0.155
esson+0.154
enegger+0.153
Neg logit contributions
 capped-0.124
mil-0.117
 primates-0.116
 kidnapping-0.112
 diving-0.112
Tokenout
Token ID448
Feature activation+2.54e-01

Pos logit contributions
ngth+0.322
Avg+0.296
aund+0.290
enegger+0.290
esson+0.290
Neg logit contributions
 capped-0.249
mil-0.240
 primates-0.233
respond-0.228
 kidnapping-0.227
Token Sentinel
Token ID26716
Feature activation-6.04e-01

Pos logit contributions
 capped+0.841
mil+0.769
 primates+0.762
 kidnapping+0.752
 diving+0.730
Neg logit contributions
ngth-1.837
enegger-1.716
esson-1.705
Avg-1.683
aund-1.681
Token/
Token ID14
Feature activation-3.05e-01

Pos logit contributions
ngth+1.145
Avg+1.050
aund+1.046
esson+1.041
enegger+1.032
Neg logit contributions
 capped-1.115
mil-1.061
 primates-1.053
 kidnapping-1.026
 disadvant-1.022
TokenT
Token ID51
Feature activation+1.46e-01

Pos logit contributions
ngth+0.562
aund+0.511
Avg+0.510
esson+0.507
enegger+0.503
Neg logit contributions
 capped-0.583
mil-0.560
 primates-0.550
 kidnapping-0.539
 diving-0.535
TokenNS
Token ID8035
Feature activation+1.72e-01

Pos logit contributions
 capped+0.266
mil+0.257
 primates+0.251
 kidnapping+0.247
respond+0.245
Neg logit contributions
ngth-0.280
Avg-0.256
enegger-0.254
aund-0.253
esson-0.252
Token via
Token ID2884
Feature activation-1.03e-01

Pos logit contributions
 capped+0.315
mil+0.303
 primates+0.297
 kidnapping+0.291
 diving+0.290
Neg logit contributions
ngth-0.330
Avg-0.303
aund-0.301
esson-0.299
enegger-0.295
Token Getty
Token ID7136
Feature activation-1.62e+00

Pos logit contributions
ngth+0.202
Avg+0.185
aund+0.184
esson+0.183
enegger+0.182
Neg logit contributions
 capped-0.185
mil-0.176
 primates-0.174
 kidnapping-0.170
 diving-0.169
Token Images
Token ID5382
Feature activation-1.16e+00

Pos logit contributions
ngth+2.395
Avg+2.179
esson+2.150
aund+2.138
enegger+2.112
Neg logit contributions
 capped-3.597
 primates-3.440
mil-3.415
 disadvant-3.381
 kidnapping-3.339
Token)
Token ID8
Feature activation-1.50e-01

Pos logit contributions
ngth+2.015
esson+1.838
Avg+1.837
aund+1.832
enegger+1.825
Neg logit contributions
 capped-2.308
mil-2.193
 primates-2.190
 disadvant-2.141
 kidnapping-2.126
Token Indiana
Token ID10278
Feature activation-2.13e-01

Pos logit contributions
ngth+0.314
Avg+0.287
aund+0.287
esson+0.285
enegger+0.282
Neg logit contributions
 capped-0.251
mil-0.238
 primates-0.234
 kidnapping-0.229
 diving-0.227
Token Gov
Token ID10964
Feature activation+2.82e-01

Pos logit contributions
ngth+0.454
Avg+0.418
aund+0.417
esson+0.414
enegger+0.412
Neg logit contributions
 capped-0.348
mil-0.329
 primates-0.325
 kidnapping-0.318
 diving-0.315
Token.
Token ID13
Feature activation-2.11e-01

Pos logit contributions
 capped+0.513
mil+0.492
 primates+0.479
 kidnapping+0.472
 diving+0.468
Neg logit contributions
ngth-0.551
Avg-0.500
aund-0.498
esson-0.490
enegger-0.490
Token 64
Token ID5598
Feature activation+1.06e-01

Pos logit contributions
 capped+0.102
mil+0.096
 primates+0.095
 kidnapping+0.092
respond+0.091
Neg logit contributions
ngth-0.164
Avg-0.153
aund-0.152
esson-0.152
enegger-0.150
Token-
Token ID12
Feature activation-4.85e-02

Pos logit contributions
 capped+0.179
mil+0.170
 primates+0.167
 kidnapping+0.162
 diving+0.162
Neg logit contributions
ngth-0.220
Avg-0.202
aund-0.201
enegger-0.200
esson-0.200
Tokenbit
Token ID2545
Feature activation-2.05e-01

Pos logit contributions
ngth+0.125
Avg+0.117
aund+0.117
esson+0.116
enegger+0.116
Neg logit contributions
 capped-0.057
mil-0.053
 primates-0.052
 kidnapping-0.050
 diving-0.050
Token Hy
Token ID6707
Feature activation+5.94e-01

Pos logit contributions
ngth+0.458
Avg+0.425
aund+0.424
esson+0.423
enegger+0.419
Neg logit contributions
 capped-0.308
mil-0.291
 primates-0.287
 kidnapping-0.278
 diving-0.276
Tokenrule
Token ID25135
Feature activation+9.31e-02

Pos logit contributions
 capped+0.384
mil+0.338
 primates+0.321
 kidnapping+0.293
respond+0.292
Neg logit contributions
ngth-1.848
Avg-1.752
enegger-1.745
esson-1.741
aund-1.739
Token Fi
Token ID23238
Feature activation-1.62e+00

Pos logit contributions
 capped+0.159
mil+0.151
 primates+0.149
 kidnapping+0.145
 diving+0.144
Neg logit contributions
ngth-0.191
Avg-0.176
aund-0.175
esson-0.175
enegger-0.174
Tokenend
Token ID437
Feature activation-5.61e-01

Pos logit contributions
ngth+2.878
aund+2.617
Avg+2.614
esson+2.604
enegger+2.563
Neg logit contributions
 capped-3.162
mil-3.037
 primates-3.008
 kidnapping-2.934
 disadvant-2.915
Token,
Token ID11
Feature activation-5.15e-02

Pos logit contributions
ngth+1.166
aund+1.075
Avg+1.074
esson+1.070
enegger+1.058
Neg logit contributions
 capped-0.933
mil-0.883
 primates-0.878
 kidnapping-0.852
respond-0.848
Token O
Token ID440
Feature activation-1.32e-01

Pos logit contributions
ngth+0.115
Avg+0.107
aund+0.106
esson+0.106
enegger+0.105
Neg logit contributions
 capped-0.078
mil-0.074
 primates-0.073
 kidnapping-0.071
 diving-0.070
Tokencar
Token ID7718
Feature activation+6.83e-01

Pos logit contributions
ngth+0.374
Avg+0.353
aund+0.352
esson+0.352
enegger+0.348
Neg logit contributions
 capped-0.121
mil-0.109
 primates-0.108
 kidnapping-0.102
 disadvant-0.102
Tokenina
Token ID1437
Feature activation+3.63e-01

Pos logit contributions
 capped+0.768
mil+0.706
 primates+0.699
 disadvant+0.670
 kidnapping+0.670
Neg logit contributions
ngth-1.786
Avg-1.685
aund-1.683
esson-1.678
enegger-1.659
Token is
Token ID318
Feature activation-2.01e-01

Pos logit contributions
ngth+0.178
Avg+0.163
aund+0.162
esson+0.162
enegger+0.160
Neg logit contributions
 capped-0.155
mil-0.146
 primates-0.145
 kidnapping-0.141
 diving-0.141
Token determined
Token ID5295
Feature activation-2.90e-01

Pos logit contributions
ngth+0.439
Avg+0.407
aund+0.406
esson+0.405
enegger+0.401
Neg logit contributions
 capped-0.314
mil-0.298
 primates-0.294
 kidnapping-0.285
 diving-0.283
Token by
Token ID416
Feature activation-3.40e-01

Pos logit contributions
ngth+0.537
Avg+0.490
aund+0.488
esson+0.486
enegger+0.481
Neg logit contributions
 capped-0.553
mil-0.527
 primates-0.522
 kidnapping-0.510
 diving-0.507
Token how
Token ID703
Feature activation-1.64e-01

Pos logit contributions
ngth+0.707
Avg+0.650
aund+0.648
esson+0.646
enegger+0.641
Neg logit contributions
 capped-0.573
mil-0.542
 primates-0.539
 kidnapping-0.524
 diving-0.521
Token the
Token ID262
Feature activation-2.41e-01

Pos logit contributions
ngth+0.383
Avg+0.357
aund+0.356
esson+0.355
enegger+0.352
Neg logit contributions
 capped-0.229
mil-0.216
 primates-0.212
 kidnapping-0.206
 diving-0.204
Token carc
Token ID36756
Feature activation-1.59e+00

Pos logit contributions
ngth+0.570
Avg+0.530
aund+0.528
esson+0.527
enegger+0.523
Neg logit contributions
 capped-0.336
mil-0.314
 primates-0.312
 kidnapping-0.301
 diving-0.299
Tokenass
Token ID562
Feature activation-4.83e-01

Pos logit contributions
ngth+2.676
aund+2.440
esson+2.425
Avg+2.399
enegger+2.348
Neg logit contributions
 capped-3.264
 primates-3.101
mil-3.099
 disadvant-3.067
 kidnapping-3.056
Token is
Token ID318
Feature activation-2.36e-01

Pos logit contributions
ngth+0.971
Avg+0.894
aund+0.889
esson+0.886
enegger+0.880
Neg logit contributions
 capped-0.847
mil-0.804
 primates-0.795
 kidnapping-0.773
 diving-0.771
Token scored
Token ID7781
Feature activation-5.11e-02

Pos logit contributions
ngth+0.536
Avg+0.497
aund+0.495
esson+0.493
enegger+0.490
Neg logit contributions
 capped-0.353
mil-0.331
 primates-0.327
 kidnapping-0.317
 diving-0.315
Token for
Token ID329
Feature activation-6.17e-03

Pos logit contributions
ngth+0.100
Avg+0.092
aund+0.091
esson+0.091
enegger+0.090
Neg logit contributions
 capped-0.093
mil-0.088
 primates-0.087
 kidnapping-0.085
 diving-0.085
Token things
Token ID1243
Feature activation-2.60e-02

Pos logit contributions
ngth+0.013
Avg+0.012
aund+0.012
esson+0.012
enegger+0.012
Neg logit contributions
 capped-0.010
mil-0.010
 primates-0.010
 kidnapping-0.009
 diving-0.009
Token glove
Token ID29144
Feature activation-8.42e-01

Pos logit contributions
ngth+0.198
Avg+0.184
aund+0.183
esson+0.182
enegger+0.181
Neg logit contributions
 capped-0.138
mil-0.131
 primates-0.129
 kidnapping-0.125
 diving-0.124
Token comp
Token ID552
Feature activation-6.79e-01

Pos logit contributions
ngth+1.745
Avg+1.605
aund+1.601
esson+1.595
enegger+1.582
Neg logit contributions
 capped-1.428
mil-1.348
 primates-1.338
 kidnapping-1.302
 diving-1.297
Tokenartments
Token ID32514
Feature activation-7.39e-01

Pos logit contributions
ngth+1.001
Avg+0.879
aund+0.864
enegger+0.863
esson+0.857
Neg logit contributions
 capped-1.568
mil-1.500
 primates-1.496
 kidnapping-1.471
 diving-1.467
Token of
Token ID286
Feature activation-1.76e-01

Pos logit contributions
ngth+1.414
Avg+1.296
aund+1.291
esson+1.291
enegger+1.274
Neg logit contributions
 capped-1.341
mil-1.285
 primates-1.273
 kidnapping-1.240
respond-1.233
Token tr
Token ID491
Feature activation+2.38e-01

Pos logit contributions
ngth+0.376
Avg+0.348
aund+0.347
esson+0.346
enegger+0.343
Neg logit contributions
 capped-0.280
mil-0.266
 primates-0.262
 kidnapping-0.255
respond-0.253
Tokenunks
Token ID14125
Feature activation-1.54e+00

Pos logit contributions
 capped+0.431
mil+0.416
 primates+0.407
respond+0.396
 kidnapping+0.392
Neg logit contributions
ngth-0.463
Avg-0.425
enegger-0.419
aund-0.417
esson-0.413
Token of
Token ID286
Feature activation-9.17e-02

Pos logit contributions
ngth+2.864
esson+2.637
Avg+2.623
aund+2.615
enegger+2.572
Neg logit contributions
 capped-2.809
mil-2.714
 primates-2.687
respond-2.620
 kidnapping-2.619
Token cars
Token ID5006
Feature activation+2.89e-02

Pos logit contributions
ngth+0.200
Avg+0.186
aund+0.185
esson+0.185
enegger+0.183
Neg logit contributions
 capped-0.142
mil-0.135
 primates-0.133
 kidnapping-0.129
 disadvant-0.128
Token that
Token ID326
Feature activation-2.02e-01

Pos logit contributions
 capped+0.052
mil+0.049
 primates+0.049
 kidnapping+0.048
 diving+0.047
Neg logit contributions
ngth-0.057
Avg-0.052
aund-0.052
esson-0.052
enegger-0.051
Token enter
Token ID3802
Feature activation+4.06e-01

Pos logit contributions
ngth+0.381
Avg+0.346
aund+0.346
esson+0.341
enegger+0.341
Neg logit contributions
 capped-0.386
 primates-0.363
mil-0.363
 kidnapping-0.353
 diving-0.352
Token national
Token ID2260
Feature activation+1.51e-01

Pos logit contributions
 capped+0.672
 primates+0.626
mil+0.625
 kidnapping+0.609
 diving+0.606
Neg logit contributions
ngth-0.875
Avg-0.801
aund-0.799
esson-0.791
enegger-0.785
Token)
Token ID8
Feature activation-1.91e-01

Pos logit contributions
 capped+0.694
mil+0.665
 primates+0.649
respond+0.633
 kidnapping+0.633
Neg logit contributions
ngth-0.805
Avg-0.748
aund-0.737
esson-0.734
enegger-0.729
Token on
Token ID319
Feature activation-1.37e-01

Pos logit contributions
ngth+0.391
Avg+0.360
aund+0.359
esson+0.357
enegger+0.355
Neg logit contributions
 capped-0.326
mil-0.308
 primates-0.305
 kidnapping-0.297
 diving-0.296
Token the
Token ID262
Feature activation-1.29e-01

Pos logit contributions
ngth+0.291
Avg+0.269
aund+0.268
esson+0.267
enegger+0.265
Neg logit contributions
 capped-0.223
mil-0.211
 primates-0.209
 kidnapping-0.203
 diving-0.202
Token Mongol
Token ID42188
Feature activation-3.56e-01

Pos logit contributions
ngth+0.265
Avg+0.244
aund+0.243
esson+0.242
enegger+0.240
Neg logit contributions
 capped-0.219
mil-0.208
 primates-0.206
 kidnapping-0.201
 diving-0.199
Tokenian
Token ID666
Feature activation-3.35e-01

Pos logit contributions
ngth+0.753
Avg+0.695
aund+0.693
esson+0.690
enegger+0.684
Neg logit contributions
 capped-0.585
mil-0.554
 primates-0.547
 kidnapping-0.533
 diving-0.529
Token Plate
Token ID23648
Feature activation-1.53e+00

Pos logit contributions
ngth+0.690
Avg+0.637
aund+0.634
esson+0.632
enegger+0.626
Neg logit contributions
 capped-0.565
mil-0.536
 primates-0.530
 kidnapping-0.516
 diving-0.512
Tokenau
Token ID559
Feature activation-9.82e-01

Pos logit contributions
ngth+1.905
Avg+1.742
aund+1.702
esson+1.699
avid+1.658
Neg logit contributions
 capped-3.700
 primates-3.565
mil-3.555
 disadvant-3.512
 kidnapping-3.493
Token.
Token ID13
Feature activation+9.93e-02

Pos logit contributions
ngth+1.860
Avg+1.700
aund+1.692
esson+1.690
enegger+1.661
Neg logit contributions
 capped-1.802
mil-1.712
 primates-1.703
 kidnapping-1.664
respond-1.654
Token The
Token ID383
Feature activation-3.10e-02

Pos logit contributions
 capped+0.159
mil+0.152
 primates+0.148
 kidnapping+0.144
respond+0.143
Neg logit contributions
ngth-0.215
aund-0.197
Avg-0.197
esson-0.196
enegger-0.196
Token Il
Token ID13778
Feature activation-5.52e-01

Pos logit contributions
ngth+0.064
Avg+0.059
aund+0.058
esson+0.058
enegger+0.058
Neg logit contributions
 capped-0.053
mil-0.050
 primates-0.050
 kidnapping-0.049
 diving-0.048
Tokenk
Token ID74
Feature activation-1.67e-01

Pos logit contributions
ngth+1.164
Avg+1.075
aund+1.073
esson+1.068
enegger+1.058
Neg logit contributions
 capped-0.904
mil-0.852
 primates-0.847
 kidnapping-0.824
 diving-0.818
Token Bang
Token ID9801
Feature activation-3.10e-02

Pos logit contributions
 capped+0.105
mil+0.099
 primates+0.098
 kidnapping+0.095
 diving+0.095
Neg logit contributions
ngth-0.145
Avg-0.135
aund-0.135
esson-0.134
enegger-0.133
Token Theory
Token ID17003
Feature activation-3.10e-02

Pos logit contributions
ngth+0.063
Avg+0.057
aund+0.057
esson+0.057
enegger+0.055
Neg logit contributions
 capped-0.055
mil-0.052
 primates-0.052
 kidnapping-0.050
respond-0.049
Token CBS
Token ID11133
Feature activation+3.76e-01

Pos logit contributions
ngth+0.065
aund+0.060
Avg+0.059
esson+0.059
enegger+0.059
Neg logit contributions
 capped-0.052
mil-0.049
 primates-0.049
 kidnapping-0.047
 diving-0.047
Token Amy
Token ID14235
Feature activation-2.76e-01

Pos logit contributions
 capped+0.648
mil+0.619
 primates+0.605
 kidnapping+0.593
 diving+0.585
Neg logit contributions
ngth-0.776
aund-0.702
Avg-0.700
esson-0.693
enegger-0.690
Token Poe
Token ID34815
Feature activation+3.34e-01

Pos logit contributions
ngth+0.521
Avg+0.475
aund+0.473
esson+0.470
enegger+0.466
Neg logit contributions
 capped-0.516
mil-0.492
 primates-0.486
 kidnapping-0.476
 diving-0.472
Tokenhler
Token ID49737
Feature activation-1.52e+00

Pos logit contributions
 capped+0.670
mil+0.640
 primates+0.633
 kidnapping+0.620
 diving+0.619
Neg logit contributions
ngth-0.589
Avg-0.531
esson-0.529
aund-0.528
enegger-0.524
Token Parks
Token ID20604
Feature activation-6.16e-02

Pos logit contributions
ngth+3.114
Avg+2.946
esson+2.927
aund+2.899
enegger+2.877
Neg logit contributions
 capped-2.477
mil-2.349
 primates-2.344
 disadvant-2.308
 diving-2.259
Token and
Token ID290
Feature activation-5.47e-02

Pos logit contributions
ngth+0.123
aund+0.111
Avg+0.111
esson+0.110
enegger+0.110
Neg logit contributions
 capped-0.111
mil-0.105
 primates-0.103
 kidnapping-0.101
respond-0.101
Token Recreation
Token ID34285
Feature activation-5.20e-01

Pos logit contributions
ngth+0.177
Avg+0.168
aund+0.168
esson+0.167
enegger+0.166
Neg logit contributions
 capped-0.028
mil-0.023
 primates-0.022
 kidnapping-0.020
 disadvant-0.019
Token NBC
Token ID9716
Feature activation+3.36e-02

Pos logit contributions
ngth+1.096
Avg+1.016
aund+1.011
esson+1.009
enegger+0.999
Neg logit contributions
 capped-0.848
mil-0.802
 primates-0.794
 kidnapping-0.772
 disadvant-0.768
Token\n
Token ID198
Feature activation+1.21e-01

Pos logit contributions
 capped+0.058
mil+0.055
 primates+0.054
 kidnapping+0.053
 diving+0.053
Neg logit contributions
ngth-0.069
aund-0.063
Avg-0.063
esson-0.062
enegger-0.062
TokenUnlike
Token ID18521
Feature activation-9.54e-02

Pos logit contributions
 capped+0.165
mil+0.161
 primates+0.155
respond+0.153
 kidnapping+0.152
Neg logit contributions
ngth-0.183
aund-0.166
enegger-0.165
esson-0.165
Avg-0.163
Token the
Token ID262
Feature activation-1.63e-01

Pos logit contributions
ngth+0.208
Avg+0.193
aund+0.192
esson+0.192
enegger+0.190
Neg logit contributions
 capped-0.150
mil-0.142
 primates-0.141
 kidnapping-0.136
 diving-0.136
Token produce
Token ID4439
Feature activation-3.48e-01

Pos logit contributions
ngth+0.356
Avg+0.328
aund+0.327
esson+0.326
enegger+0.324
Neg logit contributions
 capped-0.258
mil-0.243
 primates-0.242
 kidnapping-0.234
 diving-0.234
Token section
Token ID2665
Feature activation-6.05e-01

Pos logit contributions
ngth+0.749
Avg+0.693
aund+0.690
esson+0.688
enegger+0.683
Neg logit contributions
 capped-0.558
mil-0.527
 primates-0.521
 kidnapping-0.506
 diving-0.503
Token at
Token ID379
Feature activation-3.83e-01

Pos logit contributions
ngth+1.253
Avg+1.154
aund+1.150
esson+1.146
enegger+1.139
Neg logit contributions
 capped-1.019
mil-0.965
 primates-0.955
 kidnapping-0.929
 diving-0.926
Token grocery
Token ID16918
Feature activation-1.52e+00

Pos logit contributions
ngth+0.790
Avg+0.728
aund+0.725
esson+0.723
enegger+0.717
Neg logit contributions
 capped-0.649
mil-0.616
 primates-0.609
 kidnapping-0.593
 diving-0.591
Token stores
Token ID7000
Feature activation-1.00e+00

Pos logit contributions
ngth+2.730
aund+2.501
Avg+2.500
esson+2.484
enegger+2.428
Neg logit contributions
 capped-2.928
mil-2.796
 primates-2.776
 kidnapping-2.712
 disadvant-2.695
Token,
Token ID11
Feature activation-3.56e-02

Pos logit contributions
ngth+1.961
Avg+1.799
aund+1.792
esson+1.784
enegger+1.768
Neg logit contributions
 capped-1.805
mil-1.718
 primates-1.701
 kidnapping-1.659
 diving-1.649
Token sushi
Token ID36324
Feature activation-1.75e-01

Pos logit contributions
ngth+0.079
Avg+0.073
aund+0.072
esson+0.072
enegger+0.072
Neg logit contributions
 capped-0.056
mil-0.053
 primates-0.052
 diving-0.050
 kidnapping-0.050
Token restaurants
Token ID10808
Feature activation-4.75e-01

Pos logit contributions
ngth+0.360
Avg+0.331
esson+0.329
aund+0.329
enegger+0.326
Neg logit contributions
 capped-0.298
mil-0.281
 primates-0.279
 kidnapping-0.271
 diving-0.271
Token don
Token ID836
Feature activation-6.30e-02

Pos logit contributions
ngth+0.937
Avg+0.856
aund+0.855
esson+0.850
enegger+0.845
Neg logit contributions
 capped-0.852
mil-0.808
 primates-0.800
 kidnapping-0.780
 diving-0.777
Token from
Token ID422
Feature activation+2.32e-02

Pos logit contributions
 capped+0.414
mil+0.387
 primates+0.385
 diving+0.376
 kidnapping+0.375
Neg logit contributions
ngth-0.494
Avg-0.453
aund-0.449
esson-0.446
enegger-0.444
Token their
Token ID511
Feature activation-2.45e-01

Pos logit contributions
 capped+0.041
 primates+0.039
mil+0.039
 kidnapping+0.038
 diving+0.038
Neg logit contributions
ngth-0.047
Avg-0.043
aund-0.043
esson-0.042
enegger-0.042
Token carry
Token ID3283
Feature activation-8.83e-01

Pos logit contributions
ngth+0.463
Avg+0.422
aund+0.420
esson+0.419
enegger+0.414
Neg logit contributions
 capped-0.458
mil-0.435
 primates-0.432
 kidnapping-0.423
 diving-0.421
Token-
Token ID12
Feature activation-2.94e-02

Pos logit contributions
ngth+1.987
Avg+1.852
aund+1.847
esson+1.839
enegger+1.820
Neg logit contributions
 capped-1.307
mil-1.235
 primates-1.218
 kidnapping-1.182
 disadvant-1.173
Tokenon
Token ID261
Feature activation+7.24e-02

Pos logit contributions
ngth+0.056
Avg+0.051
enegger+0.051
aund+0.051
esson+0.051
Neg logit contributions
 capped-0.054
mil-0.052
 primates-0.051
respond-0.050
 kidnapping-0.050
Token baggage
Token ID30466
Feature activation-1.52e+00

Pos logit contributions
 capped+0.141
 primates+0.133
mil+0.132
 diving+0.130
 kidnapping+0.130
Neg logit contributions
ngth-0.135
Avg-0.122
enegger-0.121
aund-0.121
esson-0.120
Token when
Token ID618
Feature activation-6.19e-02

Pos logit contributions
ngth+2.872
aund+2.647
Avg+2.644
esson+2.632
enegger+2.578
Neg logit contributions
 capped-2.720
mil-2.628
 primates-2.585
 kidnapping-2.532
 disadvant-2.520
Token going
Token ID1016
Feature activation-3.48e-01

Pos logit contributions
ngth+0.133
Avg+0.122
aund+0.121
esson+0.121
enegger+0.120
Neg logit contributions
 capped-0.101
mil-0.095
 primates-0.095
 kidnapping-0.092
 diving-0.092
Token through
Token ID832
Feature activation-2.33e-01

Pos logit contributions
ngth+0.712
Avg+0.656
aund+0.654
esson+0.651
enegger+0.645
Neg logit contributions
 capped-0.594
mil-0.564
 primates-0.557
 kidnapping-0.543
 diving-0.539
Token airline
Token ID18091
Feature activation-4.91e-01

Pos logit contributions
ngth+0.495
Avg+0.455
aund+0.451
esson+0.450
enegger+0.447
Neg logit contributions
 capped-0.388
mil-0.365
 primates-0.364
 kidnapping-0.355
 diving-0.353
Token security
Token ID2324
Feature activation-5.19e-02

Pos logit contributions
ngth+1.131
Avg+1.051
aund+1.048
esson+1.044
enegger+1.037
Neg logit contributions
 capped-0.714
mil-0.670
 primates-0.662
 kidnapping-0.642
 diving-0.637
Token Hide
Token ID10415
Feature activation-7.04e-01

Pos logit contributions
ngth+0.254
Avg+0.232
aund+0.231
esson+0.230
enegger+0.227
Neg logit contributions
 capped-0.227
mil-0.216
 primates-0.212
 kidnapping-0.207
respond-0.206
Token Caption
Token ID11260
Feature activation-1.15e+00

Pos logit contributions
ngth+1.020
Avg+0.923
aund+0.916
esson+0.915
enegger+0.897
Neg logit contributions
 capped-1.600
mil-1.539
 primates-1.531
 disadvant-1.506
 kidnapping-1.500
Token 7
Token ID767
Feature activation-2.01e-01

Pos logit contributions
ngth+1.748
Avg+1.613
esson+1.603
enegger+1.584
aund+1.584
Neg logit contributions
 capped-2.493
mil-2.395
 primates-2.386
 disadvant-2.375
 kidnapping-2.345
Token of
Token ID286
Feature activation-1.64e-01

Pos logit contributions
ngth+0.421
Avg+0.392
aund+0.390
esson+0.389
enegger+0.384
Neg logit contributions
 capped-0.330
mil-0.312
 primates-0.311
 disadvant-0.304
 kidnapping-0.301
Token 15
Token ID1315
Feature activation+5.90e-02

Pos logit contributions
ngth+0.282
aund+0.252
Avg+0.252
esson+0.249
enegger+0.248
Neg logit contributions
 capped-0.336
mil-0.321
 primates-0.318
 kidnapping-0.311
 diving-0.310
Token Photos
Token ID9434
Feature activation-1.51e+00

Pos logit contributions
 capped+0.123
mil+0.118
 primates+0.117
 kidnapping+0.114
 diving+0.114
Neg logit contributions
ngth-0.099
Avg-0.089
aund-0.088
esson-0.088
enegger-0.087
Token:
Token ID25
Feature activation+8.39e-02

Pos logit contributions
ngth+2.813
Avg+2.777
esson+2.711
aund+2.686
avid+2.633
Neg logit contributions
 capped-2.620
 disadvant-2.592
 primates-2.528
 describ-2.501
 therap-2.460
Token Carly
Token ID35982
Feature activation+5.18e-02

Pos logit contributions
 capped+0.130
mil+0.123
 primates+0.121
 kidnapping+0.118
 diving+0.117
Neg logit contributions
ngth-0.185
Avg-0.171
aund-0.171
esson-0.170
enegger-0.169
Token Fiorina
Token ID49028
Feature activation-5.53e-02

Pos logit contributions
 capped+0.071
 primates+0.067
mil+0.066
 disadvant+0.065
 kidnapping+0.064
Neg logit contributions
ngth-0.122
Avg-0.114
aund-0.114
esson-0.114
enegger-0.113
Token's
Token ID338
Feature activation+1.28e-01

Pos logit contributions
ngth+0.093
Avg+0.083
aund+0.083
esson+0.082
enegger+0.081
Neg logit contributions
 capped-0.116
mil-0.111
 primates-0.110
 kidnapping-0.108
 diving-0.107
Token political
Token ID1964
Feature activation-3.57e-01

Pos logit contributions
 capped+0.183
mil+0.174
 primates+0.170
 disadvant+0.166
 kidnapping+0.165
Neg logit contributions
ngth-0.291
Avg-0.272
aund-0.271
esson-0.270
enegger-0.268
Token right
Token ID826
Feature activation+4.89e-01

Pos logit contributions
 capped+0.238
mil+0.222
 primates+0.219
 kidnapping+0.212
 diving+0.210
Neg logit contributions
ngth-0.432
Avg-0.403
aund-0.402
esson-0.400
enegger-0.397
Token to
Token ID284
Feature activation+6.95e-02

Pos logit contributions
 capped+0.739
mil+0.694
 primates+0.686
 kidnapping+0.666
 diving+0.660
Neg logit contributions
ngth-1.099
Avg-1.022
aund-1.016
esson-1.013
enegger-1.007
Token personal
Token ID2614
Feature activation-1.07e-01

Pos logit contributions
 capped+0.111
mil+0.104
 primates+0.103
 kidnapping+0.101
 diving+0.100
Neg logit contributions
ngth-0.152
Avg-0.140
aund-0.139
esson-0.138
enegger-0.138
Token integrity
Token ID11540
Feature activation-2.73e-01

Pos logit contributions
ngth+0.245
Avg+0.227
aund+0.226
esson+0.225
enegger+0.224
Neg logit contributions
 capped-0.159
mil-0.150
 primates-0.149
 kidnapping-0.144
 diving-0.143
Token,"
Token ID553
Feature activation+1.67e-01

Pos logit contributions
ngth+0.547
Avg+0.504
aund+0.497
esson+0.497
enegger+0.495
Neg logit contributions
 capped-0.485
mil-0.458
 primates-0.455
 kidnapping-0.445
 diving-0.442
Token Sa
Token ID10318
Feature activation-1.51e+00

Pos logit contributions
 capped+0.289
mil+0.271
 primates+0.270
 kidnapping+0.264
 diving+0.261
Neg logit contributions
ngth-0.343
Avg-0.313
aund-0.313
esson-0.310
enegger-0.310
Tokenarak
Token ID30447
Feature activation+3.65e-01

Pos logit contributions
ngth+2.411
esson+2.289
aund+2.248
avid+2.233
Avg+2.200
Neg logit contributions
 capped-3.140
 disadvant-3.022
 primates-2.985
 diving-2.967
mil-2.965
Tokenk
Token ID74
Feature activation-1.22e-01

Pos logit contributions
 capped+0.521
mil+0.492
 primates+0.481
 kidnapping+0.467
respond+0.463
Neg logit contributions
ngth-0.849
Avg-0.788
aund-0.786
esson-0.782
enegger-0.778
Tokenala
Token ID6081
Feature activation-1.19e-01

Pos logit contributions
ngth+0.241
Avg+0.222
aund+0.222
esson+0.222
enegger+0.217
Neg logit contributions
 capped-0.217
 primates-0.205
mil-0.205
 disadvant-0.200
 kidnapping-0.200
Token is
Token ID318
Feature activation+9.14e-03

Pos logit contributions
ngth+0.264
Avg+0.244
aund+0.243
esson+0.241
enegger+0.239
Neg logit contributions
 capped-0.187
mil-0.175
 primates-0.173
 kidnapping-0.168
 diving-0.167
Token quoted
Token ID10947
Feature activation+2.41e-02

Pos logit contributions
 capped+0.015
mil+0.014
 primates+0.014
 kidnapping+0.014
 diving+0.014
Neg logit contributions
ngth-0.019
Avg-0.018
aund-0.018
esson-0.018
enegger-0.017
Token our
Token ID674
Feature activation+7.10e-02

Pos logit contributions
ngth+0.652
Avg+0.605
aund+0.602
esson+0.601
enegger+0.592
Neg logit contributions
 capped-0.489
mil-0.466
 primates-0.458
 kidnapping-0.446
 disadvant-0.444
Token daily
Token ID4445
Feature activation-6.07e-01

Pos logit contributions
 capped+0.162
 primates+0.156
mil+0.155
 kidnapping+0.153
 diving+0.152
Neg logit contributions
ngth-0.107
Avg-0.095
enegger-0.094
aund-0.092
esson-0.091
Token newsletter
Token ID13129
Feature activation-5.56e-01

Pos logit contributions
ngth+1.211
Avg+1.108
aund+1.102
enegger+1.094
esson+1.092
Neg logit contributions
 capped-1.084
mil-1.023
 primates-1.018
 kidnapping-0.992
 diving-0.983
Token Upgrade
Token ID24236
Feature activation-5.95e-01

Pos logit contributions
ngth+1.272
Avg+1.181
esson+1.177
aund+1.176
enegger+1.158
Neg logit contributions
 capped-0.811
mil-0.762
 primates-0.753
 kidnapping-0.731
 diving-0.725
Token your
Token ID534
Feature activation-1.94e-01

Pos logit contributions
ngth+1.322
esson+1.233
Avg+1.231
aund+1.230
enegger+1.217
Neg logit contributions
 capped-0.898
mil-0.847
 primates-0.834
 disadvant-0.811
 kidnapping-0.804
Token inbox
Token ID13734
Feature activation-1.49e+00

Pos logit contributions
ngth+0.384
Avg+0.351
aund+0.349
enegger+0.347
esson+0.345
Neg logit contributions
 capped-0.347
mil-0.330
 primates-0.327
 kidnapping-0.319
 diving-0.318
Token and
Token ID290
Feature activation-4.57e-02

Pos logit contributions
ngth+2.594
Avg+2.377
aund+2.373
esson+2.368
enegger+2.291
Neg logit contributions
 capped-2.940
mil-2.795
 primates-2.794
 disadvant-2.756
 kidnapping-2.727
Token get
Token ID651
Feature activation-1.43e-02

Pos logit contributions
ngth+0.098
Avg+0.089
aund+0.089
enegger+0.089
esson+0.088
Neg logit contributions
 capped-0.075
mil-0.071
 primates-0.070
 kidnapping-0.068
 diving-0.068
Token our
Token ID674
Feature activation+2.02e-01

Pos logit contributions
ngth+0.032
Avg+0.030
aund+0.030
enegger+0.030
esson+0.029
Neg logit contributions
 capped-0.021
mil-0.020
 primates-0.020
 kidnapping-0.019
 diving-0.019
Token Daily
Token ID6714
Feature activation-6.68e-01

Pos logit contributions
 capped+0.336
mil+0.318
 primates+0.316
 kidnapping+0.307
 diving+0.304
Neg logit contributions
ngth-0.425
Avg-0.389
aund-0.386
enegger-0.385
esson-0.382
Token Dispatch
Token ID35934
Feature activation-5.12e-02

Pos logit contributions
ngth+1.430
Avg+1.321
aund+1.317
esson+1.311
enegger+1.301
Neg logit contributions
 capped-1.077
mil-1.019
 primates-1.008
 kidnapping-0.980
 diving-0.973
Token Hide
Token ID10415
Feature activation-5.65e-01

Pos logit contributions
 capped+0.229
mil+0.218
 primates+0.211
 kidnapping+0.207
respond+0.205
Neg logit contributions
ngth-0.314
aund-0.286
Avg-0.284
esson-0.284
enegger-0.283
Token Caption
Token ID11260
Feature activation-1.28e+00

Pos logit contributions
ngth+0.878
aund+0.779
Avg+0.779
esson+0.774
enegger+0.766
Neg logit contributions
 capped-1.250
mil-1.197
 primates-1.188
 kidnapping-1.166
 diving-1.160
Token 4
Token ID604
Feature activation-2.11e-01

Pos logit contributions
ngth+2.427
Avg+2.228
aund+2.217
esson+2.209
enegger+2.185
Neg logit contributions
 capped-2.345
mil-2.237
 primates-2.216
 kidnapping-2.163
 diving-2.151
Token of
Token ID286
Feature activation-1.62e-01

Pos logit contributions
ngth+0.460
Avg+0.424
aund+0.424
esson+0.421
enegger+0.419
Neg logit contributions
 capped-0.333
mil-0.315
 primates-0.311
 kidnapping-0.302
 diving-0.300
Token 15
Token ID1315
Feature activation+9.43e-02

Pos logit contributions
ngth+0.320
aund+0.287
Avg+0.284
esson+0.284
enegger+0.281
Neg logit contributions
 capped-0.299
mil-0.282
 primates-0.278
 kidnapping-0.273
 diving-0.270
Token Photos
Token ID9434
Feature activation-1.49e+00

Pos logit contributions
 capped+0.191
mil+0.182
 primates+0.180
 kidnapping+0.176
 diving+0.175
Neg logit contributions
ngth-0.165
Avg-0.149
aund-0.149
esson-0.147
enegger-0.146
Token:
Token ID25
Feature activation+1.44e-01

Pos logit contributions
ngth+2.861
Avg+2.765
esson+2.716
aund+2.675
enegger+2.617
Neg logit contributions
 capped-2.535
 disadvant-2.463
 primates-2.434
mil-2.430
 describ-2.367
Token Carly
Token ID35982
Feature activation+2.11e-01

Pos logit contributions
 capped+0.254
mil+0.242
 primates+0.238
 kidnapping+0.233
 diving+0.229
Neg logit contributions
ngth-0.291
Avg-0.264
aund-0.263
esson-0.261
enegger-0.259
Token Fiorina
Token ID49028
Feature activation-6.17e-02

Pos logit contributions
 capped+0.275
mil+0.257
 primates+0.252
 kidnapping+0.246
 diving+0.243
Neg logit contributions
ngth-0.518
Avg-0.482
aund-0.479
esson-0.478
enegger-0.478
Token's
Token ID338
Feature activation+1.53e-01

Pos logit contributions
ngth+0.120
Avg+0.110
aund+0.110
esson+0.109
enegger+0.108
Neg logit contributions
 capped-0.112
mil-0.106
 primates-0.105
 kidnapping-0.103
 diving-0.102
Token political
Token ID1964
Feature activation-4.21e-01

Pos logit contributions
 capped+0.224
mil+0.207
 primates+0.206
 kidnapping+0.203
 diving+0.201
Neg logit contributions
ngth-0.355
Avg-0.328
aund-0.326
esson-0.326
enegger-0.322
Token Ves
Token ID36448
Feature activation+4.74e-01

Pos logit contributions
ngth+0.180
Avg+0.166
aund+0.165
esson+0.164
enegger+0.164
Neg logit contributions
 capped-0.152
mil-0.142
 primates-0.142
 kidnapping-0.138
 diving-0.138
Tokena
Token ID64
Feature activation-1.73e-01

Pos logit contributions
 capped+0.729
mil+0.706
 primates+0.676
respond+0.660
rub+0.654
Neg logit contributions
ngth-1.046
Avg-0.967
enegger-0.950
esson-0.948
aund-0.945
Token-
Token ID12
Feature activation+3.68e-02

Pos logit contributions
ngth+0.355
Avg+0.328
aund+0.326
enegger+0.324
esson+0.324
Neg logit contributions
 capped-0.292
mil-0.277
 primates-0.273
 kidnapping-0.267
 diving-0.266
TokenMatt
Token ID13448
Feature activation-2.04e-02

Pos logit contributions
 capped+0.066
mil+0.064
 primates+0.062
respond+0.061
 kidnapping+0.061
Neg logit contributions
ngth-0.072
aund-0.065
Avg-0.065
enegger-0.065
esson-0.065
Tokeni
Token ID72
Feature activation-2.75e-01

Pos logit contributions
ngth+0.045
Avg+0.042
aund+0.041
enegger+0.041
esson+0.041
Neg logit contributions
 capped-0.032
mil-0.030
 primates-0.029
 diving-0.028
 kidnapping-0.028
Token Sa
Token ID10318
Feature activation-1.49e+00

Pos logit contributions
ngth+0.562
Avg+0.517
aund+0.513
esson+0.511
enegger+0.510
Neg logit contributions
 capped-0.473
mil-0.449
 primates-0.443
 kidnapping-0.432
 diving-0.430
Tokenarak
Token ID30447
Feature activation+1.77e-01

Pos logit contributions
ngth+3.008
aund+2.764
Avg+2.762
esson+2.760
enegger+2.711
Neg logit contributions
 capped-2.573
mil-2.440
 primates-2.422
 kidnapping-2.365
 diving-2.346
Tokenk
Token ID74
Feature activation-1.41e-01

Pos logit contributions
 capped+0.316
mil+0.303
 primates+0.296
respond+0.288
 kidnapping+0.287
Neg logit contributions
ngth-0.347
Avg-0.320
aund-0.316
enegger-0.314
esson-0.310
Tokenala
Token ID6081
Feature activation-2.57e-01

Pos logit contributions
ngth+0.289
Avg+0.268
aund+0.264
enegger+0.263
esson+0.259
Neg logit contributions
 capped-0.239
mil-0.229
 primates-0.223
respond-0.219
 kidnapping-0.216
Token,
Token ID11
Feature activation+4.29e-02

Pos logit contributions
ngth+0.515
Avg+0.472
aund+0.470
esson+0.467
enegger+0.464
Neg logit contributions
 capped-0.451
mil-0.427
 primates-0.423
 kidnapping-0.411
 diving-0.410
Token a
Token ID257
Feature activation-2.14e-02

Pos logit contributions
 capped+0.070
mil+0.065
 primates+0.065
 kidnapping+0.063
 diving+0.063
Neg logit contributions
ngth-0.092
Avg-0.085
aund-0.085
esson-0.084
enegger-0.084
Token\n
Token ID198
Feature activation-7.47e-02

Pos logit contributions
ngth+0.169
Avg+0.156
aund+0.156
esson+0.155
enegger+0.154
Neg logit contributions
 capped-0.127
mil-0.120
 primates-0.119
 kidnapping-0.116
 diving-0.115
TokenVel
Token ID46261
Feature activation-1.65e-01

Pos logit contributions
ngth+0.151
enegger+0.136
esson+0.136
aund+0.135
Avg+0.133
Neg logit contributions
 capped-0.132
mil-0.129
 primates-0.122
respond-0.122
rub-0.121
Tokenie
Token ID494
Feature activation+1.61e-01

Pos logit contributions
ngth+0.351
Avg+0.324
aund+0.324
esson+0.322
enegger+0.319
Neg logit contributions
 capped-0.267
mil-0.252
 primates-0.250
 kidnapping-0.243
 diving-0.241
Token knows
Token ID4206
Feature activation-3.16e-02

Pos logit contributions
 capped+0.262
mil+0.245
 primates+0.242
 kidnapping+0.236
 diving+0.235
Neg logit contributions
ngth-0.347
Avg-0.319
aund-0.318
esson-0.316
enegger-0.313
Token the
Token ID262
Feature activation+1.43e-01

Pos logit contributions
ngth+0.067
Avg+0.061
aund+0.061
esson+0.061
enegger+0.061
Neg logit contributions
 capped-0.053
mil-0.050
 primates-0.049
 kidnapping-0.048
 diving-0.048
Token Progressive
Token ID25852
Feature activation-1.48e+00

Pos logit contributions
 capped+0.254
mil+0.238
 primates+0.238
 kidnapping+0.232
 diving+0.231
Neg logit contributions
ngth-0.290
esson-0.263
aund-0.263
Avg-0.263
enegger-0.261
Token Conservative
Token ID11132
Feature activation-6.05e-01

Pos logit contributions
ngth+2.585
Avg+2.347
aund+2.338
esson+2.326
enegger+2.301
Neg logit contributions
 capped-2.960
mil-2.832
 primates-2.807
 kidnapping-2.745
 diving-2.731
Token government
Token ID1230
Feature activation-2.98e-02

Pos logit contributions
ngth+1.282
Avg+1.183
aund+1.179
esson+1.175
enegger+1.166
Neg logit contributions
 capped-0.991
mil-0.936
 primates-0.927
 kidnapping-0.902
 diving-0.896
Token is
Token ID318
Feature activation+7.87e-02

Pos logit contributions
ngth+0.060
Avg+0.055
aund+0.055
esson+0.054
enegger+0.054
Neg logit contributions
 capped-0.053
mil-0.050
 primates-0.049
 kidnapping-0.048
 diving-0.048
Token talking
Token ID3375
Feature activation+2.40e-01

Pos logit contributions
 capped+0.121
mil+0.114
 primates+0.113
 kidnapping+0.109
respond+0.108
Neg logit contributions
ngth-0.173
Avg-0.161
aund-0.160
esson-0.160
enegger-0.159
Token openly
Token ID11764
Feature activation-1.56e-02

Pos logit contributions
 capped+0.440
 primates+0.413
mil+0.410
 kidnapping+0.403
 diving+0.400
Neg logit contributions
ngth-0.470
Avg-0.429
aund-0.427
esson-0.425
enegger-0.422
Token the
Token ID262
Feature activation-1.21e-01

Pos logit contributions
ngth+0.587
Avg+0.542
aund+0.539
esson+0.538
enegger+0.535
Neg logit contributions
 capped-0.459
mil-0.432
 primates-0.430
 kidnapping-0.418
 diving-0.417
Token troops
Token ID6553
Feature activation+5.61e-02

Pos logit contributions
ngth+0.259
Avg+0.238
aund+0.237
esson+0.236
enegger+0.235
Neg logit contributions
 capped-0.199
mil-0.188
 primates-0.187
 kidnapping-0.182
 diving-0.181
Token "
Token ID366
Feature activation-9.58e-02

Pos logit contributions
 capped+0.090
mil+0.085
 primates+0.084
 kidnapping+0.082
 diving+0.081
Neg logit contributions
ngth-0.121
Avg-0.112
aund-0.112
esson-0.111
enegger-0.110
Tokento
Token ID1462
Feature activation-2.64e-01

Pos logit contributions
ngth+0.225
Avg+0.209
aund+0.208
esson+0.208
enegger+0.206
Neg logit contributions
 capped-0.135
mil-0.127
 primates-0.125
 kidnapping-0.121
respond-0.120
Token enc
Token ID2207
Feature activation-5.26e-01

Pos logit contributions
ngth+0.593
Avg+0.551
aund+0.549
esson+0.547
enegger+0.542
Neg logit contributions
 capped-0.396
mil-0.373
 primates-0.369
 kidnapping-0.357
 diving-0.355
Tokenirc
Token ID1980
Feature activation-1.47e+00

Pos logit contributions
ngth+0.972
Avg+0.889
aund+0.883
esson+0.879
enegger+0.872
Neg logit contributions
 capped-0.999
mil-0.956
 primates-0.943
 kidnapping-0.922
respond-0.918
Tokenle
Token ID293
Feature activation-3.80e-01

Pos logit contributions
ngth+3.114
aund+2.919
esson+2.897
Avg+2.882
enegger+2.813
Neg logit contributions
 capped-2.326
 primates-2.211
mil-2.168
 disadvant-2.142
 diving-2.116
Token and
Token ID290
Feature activation-1.12e-01

Pos logit contributions
ngth+0.779
Avg+0.719
aund+0.716
esson+0.714
enegger+0.707
Neg logit contributions
 capped-0.649
mil-0.614
 primates-0.609
 kidnapping-0.594
 diving-0.591
Token destroy
Token ID4117
Feature activation-2.69e-02

Pos logit contributions
ngth+0.248
Avg+0.229
aund+0.229
esson+0.228
enegger+0.226
Neg logit contributions
 capped-0.172
mil-0.162
 primates-0.160
 kidnapping-0.156
 diving-0.155
Token [
Token ID685
Feature activation+6.25e-02

Pos logit contributions
ngth+0.058
Avg+0.054
aund+0.054
esson+0.054
enegger+0.053
Neg logit contributions
 capped-0.043
mil-0.040
 primates-0.040
 kidnapping-0.039
 diving-0.039
Tokenthe
Token ID1169
Feature activation-2.26e-01

Pos logit contributions
 capped+0.107
mil+0.104
 primates+0.100
respond+0.099
 kidnapping+0.098
Neg logit contributions
ngth-0.127
aund-0.117
esson-0.116
Avg-0.116
enegger-0.115
Token largely
Token ID5688
Feature activation+6.08e-01

Pos logit contributions
 capped+0.906
mil+0.850
 primates+0.844
 kidnapping+0.829
 diving+0.825
Neg logit contributions
ngth-1.122
Avg-1.040
aund-1.029
enegger-1.024
esson-1.021
Token in
Token ID287
Feature activation+5.05e-02

Pos logit contributions
 capped+1.057
mil+0.999
 primates+0.990
 kidnapping+0.974
 diving+0.968
Neg logit contributions
ngth-1.234
Avg-1.135
aund-1.124
esson-1.116
enegger-1.116
Token anger
Token ID8993
Feature activation-1.00e-01

Pos logit contributions
 capped+0.081
mil+0.076
 primates+0.076
 kidnapping+0.074
 diving+0.074
Neg logit contributions
ngth-0.109
Avg-0.100
aund-0.100
esson-0.099
enegger-0.099
Token at
Token ID379
Feature activation+1.27e-01

Pos logit contributions
ngth+0.211
Avg+0.195
aund+0.193
esson+0.193
enegger+0.192
Neg logit contributions
 capped-0.167
mil-0.157
 primates-0.155
 kidnapping-0.152
 diving-0.151
Token Patricia
Token ID30223
Feature activation-2.57e-01

Pos logit contributions
 capped+0.198
 primates+0.186
mil+0.185
 kidnapping+0.183
 diving+0.181
Neg logit contributions
ngth-0.281
Avg-0.258
aund-0.258
esson-0.255
enegger-0.253
Token Neal
Token ID29189
Feature activation-1.46e+00

Pos logit contributions
ngth+0.473
Avg+0.429
aund+0.428
esson+0.424
enegger+0.421
Neg logit contributions
 capped-0.494
mil-0.472
 primates-0.467
 kidnapping-0.460
 diving-0.456
Token.
Token ID13
Feature activation+3.65e-02

Pos logit contributions
ngth+2.676
Avg+2.444
aund+2.441
esson+2.437
enegger+2.409
Neg logit contributions
 capped-2.765
mil-2.639
 primates-2.621
 kidnapping-2.548
 diving-2.536
Token He
Token ID679
Feature activation+2.51e-01

Pos logit contributions
 capped+0.060
mil+0.058
 primates+0.055
 kidnapping+0.054
respond+0.054
Neg logit contributions
ngth-0.078
aund-0.071
Avg-0.071
enegger-0.070
esson-0.070
Token would
Token ID561
Feature activation-4.21e-01

Pos logit contributions
 capped+0.423
mil+0.398
 primates+0.394
 kidnapping+0.385
 diving+0.382
Neg logit contributions
ngth-0.525
Avg-0.483
aund-0.481
esson-0.478
enegger-0.474
Token threaten
Token ID16180
Feature activation+1.39e-01

Pos logit contributions
ngth+0.893
Avg+0.825
aund+0.823
esson+0.819
enegger+0.812
Neg logit contributions
 capped-0.686
mil-0.649
 primates-0.642
 kidnapping-0.625
 diving-0.620
Token to
Token ID284
Feature activation-1.67e-01

Pos logit contributions
 capped+0.211
mil+0.198
 primates+0.197
 kidnapping+0.194
 diving+0.191
Neg logit contributions
ngth-0.316
aund-0.290
Avg-0.290
esson-0.289
enegger-0.285
Token seems
Token ID2331
Feature activation-7.01e-02

Pos logit contributions
 capped+0.230
mil+0.217
 primates+0.215
 kidnapping+0.210
 diving+0.209
Neg logit contributions
ngth-0.271
Avg-0.248
esson-0.246
aund-0.246
enegger-0.245
Token to
Token ID284
Feature activation+5.84e-02

Pos logit contributions
ngth+0.157
Avg+0.146
aund+0.145
esson+0.145
enegger+0.144
Neg logit contributions
 capped-0.107
mil-0.100
 primates-0.099
 kidnapping-0.097
 diving-0.096
Token be
Token ID307
Feature activation+6.70e-02

Pos logit contributions
 capped+0.107
mil+0.102
 primates+0.101
 kidnapping+0.099
 diving+0.098
Neg logit contributions
ngth-0.113
Avg-0.103
aund-0.103
esson-0.103
enegger-0.102
Token dis
Token ID595
Feature activation-3.51e-01

Pos logit contributions
 capped+0.095
mil+0.088
 primates+0.087
 kidnapping+0.085
 diving+0.084
Neg logit contributions
ngth-0.159
Avg-0.147
aund-0.147
esson-0.146
enegger-0.146
Tokenj
Token ID73
Feature activation-6.67e-02

Pos logit contributions
ngth+0.743
Avg+0.686
aund+0.682
esson+0.680
enegger+0.676
Neg logit contributions
 capped-0.574
mil-0.544
 primates-0.536
 kidnapping-0.521
 diving-0.518
Tokenointed
Token ID20909
Feature activation-1.45e+00

Pos logit contributions
ngth+0.031
Avg+0.021
esson+0.018
enegger+0.018
aund+0.016
Neg logit contributions
 capped-0.220
mil-0.214
 primates-0.212
 kidnapping-0.209
respond-0.208
Token with
Token ID351
Feature activation-7.61e-02

Pos logit contributions
ngth+2.717
Avg+2.493
aund+2.490
esson+2.475
enegger+2.441
Neg logit contributions
 capped-2.667
mil-2.562
 primates-2.532
respond-2.467
 kidnapping-2.465
Token the
Token ID262
Feature activation-1.31e-01

Pos logit contributions
ngth+0.165
Avg+0.152
aund+0.151
esson+0.151
enegger+0.150
Neg logit contributions
 capped-0.122
 primates-0.115
mil-0.115
 kidnapping-0.112
 diving-0.111
Token way
Token ID835
Feature activation-1.93e-01

Pos logit contributions
ngth+0.283
Avg+0.261
esson+0.259
aund+0.259
enegger+0.258
Neg logit contributions
 capped-0.212
 primates-0.200
mil-0.199
 kidnapping-0.194
 diving-0.193
Token the
Token ID262
Feature activation-4.44e-02

Pos logit contributions
ngth+0.415
Avg+0.385
esson+0.381
aund+0.381
enegger+0.380
Neg logit contributions
 capped-0.311
mil-0.293
 primates-0.293
 kidnapping-0.284
 diving-0.283
Token toilet
Token ID16146
Feature activation+2.40e-01

Pos logit contributions
ngth+0.096
Avg+0.089
aund+0.088
esson+0.088
enegger+0.087
Neg logit contributions
 capped-0.071
mil-0.067
 primates-0.067
 kidnapping-0.065
 diving-0.064
Token is
Token ID318
Feature activation+9.14e-03

Pos logit contributions
ngth+0.264
Avg+0.244
aund+0.243
esson+0.241
enegger+0.239
Neg logit contributions
 capped-0.187
mil-0.175
 primates-0.173
 kidnapping-0.168
 diving-0.167
Token quoted
Token ID10947
Feature activation+2.41e-02

Pos logit contributions
 capped+0.015
mil+0.014
 primates+0.014
 kidnapping+0.014
 diving+0.014
Neg logit contributions
ngth-0.019
Avg-0.018
aund-0.018
esson-0.018
enegger-0.017
Token as
Token ID355
Feature activation+2.92e-03

Pos logit contributions
 capped+0.042
mil+0.040
 primates+0.040
 kidnapping+0.039
 diving+0.039
Neg logit contributions
ngth-0.048
Avg-0.044
aund-0.044
esson-0.044
enegger-0.044
Token saying
Token ID2282
Feature activation+3.26e-01

Pos logit contributions
 capped+0.006
mil+0.006
 primates+0.006
 kidnapping+0.005
 diving+0.005
Neg logit contributions
ngth-0.005
Avg-0.005
aund-0.005
esson-0.005
enegger-0.005
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 capped+0.605
mil+0.573
 primates+0.568
 diving+0.556
 kidnapping+0.555
Neg logit contributions
ngth-0.628
Avg-0.572
aund-0.570
esson-0.564
enegger-0.560
Token Sa
Token ID10318
Feature activation-1.44e+00

Pos logit contributions
 capped+0.184
mil+0.175
 primates+0.172
 kidnapping+0.167
respond+0.166
Neg logit contributions
ngth-0.233
Avg-0.214
aund-0.214
esson-0.212
enegger-0.211
Tokenarak
Token ID30447
Feature activation+3.10e-01

Pos logit contributions
ngth+2.349
esson+2.211
aund+2.178
avid+2.157
Avg+2.136
Neg logit contributions
 capped-2.962
 disadvant-2.833
 primates-2.808
mil-2.807
 diving-2.785
Tokenk
Token ID74
Feature activation-1.47e-01

Pos logit contributions
 capped+0.440
mil+0.417
 primates+0.407
 kidnapping+0.395
respond+0.392
Neg logit contributions
ngth-0.723
Avg-0.671
aund-0.669
esson-0.665
enegger-0.662
Tokenala
Token ID6081
Feature activation-1.21e-01

Pos logit contributions
ngth+0.292
Avg+0.270
aund+0.269
esson+0.269
enegger+0.264
Neg logit contributions
 capped-0.260
mil-0.245
 primates-0.245
 disadvant-0.239
 kidnapping-0.238
Token,
Token ID11
Feature activation+4.99e-02

Pos logit contributions
ngth+0.249
Avg+0.229
aund+0.228
esson+0.227
enegger+0.224
Neg logit contributions
 capped-0.206
mil-0.194
 primates-0.192
 kidnapping-0.187
 diving-0.186
Token a
Token ID257
Feature activation+1.23e-01

Pos logit contributions
 capped+0.083
mil+0.077
 primates+0.077
 kidnapping+0.075
 diving+0.074
Neg logit contributions
ngth-0.107
Avg-0.098
aund-0.098
esson-0.097
enegger-0.096
Token said
Token ID531
Feature activation+1.93e-02

Pos logit contributions
ngth+0.296
Avg+0.274
aund+0.274
esson+0.274
enegger+0.271
Neg logit contributions
 capped-0.221
mil-0.210
 primates-0.207
 kidnapping-0.202
 disadvant-0.200
Token the
Token ID262
Feature activation-6.29e-02

Pos logit contributions
 capped+0.032
mil+0.030
 primates+0.030
 kidnapping+0.029
 diving+0.029
Neg logit contributions
ngth-0.041
Avg-0.037
aund-0.037
esson-0.037
enegger-0.037
Token campaign
Token ID1923
Feature activation-4.47e-01

Pos logit contributions
ngth+0.129
Avg+0.118
aund+0.117
esson+0.117
enegger+0.116
Neg logit contributions
 capped-0.108
mil-0.103
 primates-0.102
 kidnapping-0.100
 diving-0.099
Token has
Token ID468
Feature activation-1.18e-01

Pos logit contributions
ngth+0.856
Avg+0.785
aund+0.782
esson+0.782
enegger+0.770
Neg logit contributions
 capped-0.809
mil-0.774
 primates-0.766
 kidnapping-0.746
 disadvant-0.741
Token been
Token ID587
Feature activation+4.11e-01

Pos logit contributions
ngth+0.254
Avg+0.236
aund+0.235
esson+0.235
enegger+0.232
Neg logit contributions
 capped-0.187
mil-0.178
 primates-0.175
 kidnapping-0.170
 disadvant-0.169
Token looking
Token ID2045
Feature activation+8.12e-01

Pos logit contributions
 capped+0.629
mil+0.593
 primates+0.586
 kidnapping+0.569
 diving+0.565
Neg logit contributions
ngth-0.912
Avg-0.846
aund-0.843
esson-0.840
enegger-0.833
Token to
Token ID284
Feature activation+1.46e-01

Pos logit contributions
 capped+1.091
mil+1.017
 primates+1.004
 kidnapping+0.978
 diving+0.973
Neg logit contributions
ngth-1.958
Avg-1.833
aund-1.826
enegger-1.811
esson-1.806
Token media
Token ID2056
Feature activation+2.79e-01

Pos logit contributions
 capped+0.231
mil+0.218
 primates+0.216
 kidnapping+0.210
 diving+0.208
Neg logit contributions
ngth-0.315
Avg-0.292
aund-0.291
esson-0.290
enegger-0.287
Token reports
Token ID3136
Feature activation-1.69e-01

Pos logit contributions
 capped+0.401
mil+0.375
 primates+0.372
 kidnapping+0.361
 diving+0.360
Neg logit contributions
ngth-0.649
Avg-0.603
aund-0.600
esson-0.599
enegger-0.596
Token to
Token ID284
Feature activation+8.79e-02

Pos logit contributions
ngth+0.346
Avg+0.319
aund+0.317
esson+0.316
enegger+0.313
Neg logit contributions
 capped-0.288
mil-0.273
 primates-0.270
 kidnapping-0.263
 diving-0.262
Token make
Token ID787
Feature activation+2.50e-02

Pos logit contributions
 capped+0.144
mil+0.136
 primates+0.135
 kidnapping+0.131
 diving+0.130
Neg logit contributions
ngth-0.186
Avg-0.172
aund-0.171
esson-0.170
enegger-0.169
Token different
Token ID1180
Feature activation+1.99e-02

Pos logit contributions
ngth+0.046
Avg+0.043
aund+0.042
esson+0.042
enegger+0.042
Neg logit contributions
 capped-0.032
mil-0.030
 primates-0.030
 kidnapping-0.029
 diving-0.029
Token places
Token ID4113
Feature activation-2.77e-01

Pos logit contributions
 capped+0.025
mil+0.023
 primates+0.023
 kidnapping+0.022
 diving+0.022
Neg logit contributions
ngth-0.050
Avg-0.047
aund-0.047
esson-0.046
enegger-0.046
Token,
Token ID11
Feature activation-1.72e-02

Pos logit contributions
ngth+0.536
Avg+0.492
aund+0.490
esson+0.487
enegger+0.483
Neg logit contributions
 capped-0.505
mil-0.480
 primates-0.476
 kidnapping-0.464
 diving-0.462
Token and
Token ID290
Feature activation-3.87e-02

Pos logit contributions
ngth+0.036
Avg+0.033
aund+0.033
esson+0.033
enegger+0.032
Neg logit contributions
 capped-0.029
mil-0.027
 primates-0.027
 kidnapping-0.027
 diving-0.026
Token the
Token ID262
Feature activation-2.37e-02

Pos logit contributions
ngth+0.079
Avg+0.073
aund+0.073
esson+0.072
enegger+0.072
Neg logit contributions
 capped-0.066
mil-0.063
 primates-0.062
 kidnapping-0.061
 diving-0.060
Token verb
Token ID15942
Feature activation+9.13e-01

Pos logit contributions
ngth+0.052
Avg+0.048
aund+0.048
esson+0.048
enegger+0.048
Neg logit contributions
 capped-0.037
 primates-0.035
mil-0.035
 kidnapping-0.034
 diving-0.033
Token is
Token ID318
Feature activation+5.15e-02

Pos logit contributions
 capped+1.518
mil+1.422
 primates+1.416
respond+1.379
 kidnapping+1.376
Neg logit contributions
ngth-1.896
Avg-1.780
enegger-1.766
esson-1.747
aund-1.742
Token here
Token ID994
Feature activation-1.24e-01

Pos logit contributions
 capped+0.079
mil+0.074
 primates+0.073
 kidnapping+0.071
 diving+0.071
Neg logit contributions
ngth-0.114
Avg-0.106
aund-0.106
esson-0.105
enegger-0.105
Token,
Token ID11
Feature activation+2.99e-02

Pos logit contributions
ngth+0.250
Avg+0.230
aund+0.229
esson+0.228
enegger+0.226
Neg logit contributions
 capped-0.217
mil-0.206
 primates-0.204
 kidnapping-0.199
 diving-0.198
Token or
Token ID393
Feature activation-1.86e-01

Pos logit contributions
 capped+0.049
mil+0.046
 primates+0.046
 kidnapping+0.045
 diving+0.044
Neg logit contributions
ngth-0.064
Avg-0.059
aund-0.059
esson-0.058
enegger-0.058
Token the
Token ID262
Feature activation+1.04e-02

Pos logit contributions
ngth+0.387
Avg+0.358
aund+0.356
esson+0.355
enegger+0.352
Neg logit contributions
 capped-0.311
mil-0.294
 primates-0.292
 kidnapping-0.284
 diving-0.282
Token "
Token ID366
Feature activation+1.84e-01

Pos logit contributions
ngth+0.285
Avg+0.262
aund+0.262
esson+0.260
enegger+0.259
Neg logit contributions
 capped-0.216
mil-0.201
 primates-0.201
 kidnapping-0.195
 diving-0.194
Tokenex
Token ID1069
Feature activation-1.21e-02

Pos logit contributions
 capped+0.294
mil+0.282
respond+0.275
 primates+0.273
rub+0.265
Neg logit contributions
ngth-0.397
enegger-0.362
aund-0.359
Avg-0.357
esson-0.356
Tokenempl
Token ID18856
Feature activation-1.18e-01

Pos logit contributions
ngth+0.028
Avg+0.026
enegger+0.025
esson+0.025
aund+0.025
Neg logit contributions
 capped-0.018
mil-0.017
 primates-0.017
respond-0.016
 kidnapping-0.016
Tokenary
Token ID560
Feature activation+1.38e-01

Pos logit contributions
ngth+0.270
Avg+0.250
aund+0.248
esson+0.248
enegger+0.246
Neg logit contributions
 capped-0.172
mil-0.163
 primates-0.160
 kidnapping-0.155
respond-0.154
Token"
Token ID1
Feature activation+3.03e-01

Pos logit contributions
 capped+0.223
mil+0.210
 primates+0.207
 kidnapping+0.203
 diving+0.202
Neg logit contributions
ngth-0.301
aund-0.274
Avg-0.274
esson-0.273
enegger-0.272
Token Defense
Token ID5947
Feature activation+9.46e-01

Pos logit contributions
 capped+0.477
mil+0.447
 primates+0.439
 kidnapping+0.430
 diving+0.430
Neg logit contributions
ngth-0.674
aund-0.617
Avg-0.616
esson-0.614
enegger-0.612
Token secretary
Token ID7705
Feature activation+3.37e-01

Pos logit contributions
 capped+0.962
mil+0.913
 primates+0.859
respond+0.832
 diving+0.828
Neg logit contributions
ngth-2.596
enegger-2.441
aund-2.420
esson-2.414
Avg-2.408
Token,
Token ID11
Feature activation-4.38e-02

Pos logit contributions
 capped+0.606
mil+0.570
 primates+0.564
 kidnapping+0.553
 diving+0.552
Neg logit contributions
ngth-0.669
Avg-0.612
aund-0.610
esson-0.602
enegger-0.601
Token calling
Token ID4585
Feature activation-3.64e-01

Pos logit contributions
ngth+0.088
Avg+0.081
aund+0.081
esson+0.080
enegger+0.080
Neg logit contributions
 capped-0.076
mil-0.072
 primates-0.072
 kidnapping-0.070
 diving-0.070
Token him
Token ID683
Feature activation-1.48e-01

Pos logit contributions
ngth+0.921
Avg+0.864
aund+0.863
esson+0.860
enegger+0.853
Neg logit contributions
 capped-0.437
mil-0.407
 primates-0.401
 kidnapping-0.385
 disadvant-0.382
Token critical
Token ID4688
Feature activation+1.11e+00

Pos logit contributions
ngth+0.303
Avg+0.279
aund+0.278
esson+0.277
enegger+0.274
Neg logit contributions
 capped-0.254
mil-0.241
 primates-0.239
 kidnapping-0.232
 diving-0.231
Token TC
Token ID17283
Feature activation-1.55e-01

Pos logit contributions
ngth+0.157
Avg+0.145
aund+0.145
esson+0.144
enegger+0.143
Neg logit contributions
 capped-0.119
mil-0.112
 primates-0.111
 kidnapping-0.108
 diving-0.107
Token button
Token ID4936
Feature activation+3.23e-01

Pos logit contributions
ngth+0.316
Avg+0.291
esson+0.289
enegger+0.288
aund+0.288
Neg logit contributions
 capped-0.267
mil-0.254
 primates-0.249
 kidnapping-0.244
 diving-0.244
Token and
Token ID290
Feature activation-1.01e-01

Pos logit contributions
 capped+0.564
mil+0.532
 primates+0.526
 diving+0.516
 kidnapping+0.515
Neg logit contributions
ngth-0.652
Avg-0.600
esson-0.599
aund-0.597
enegger-0.591
Token the
Token ID262
Feature activation-1.49e-01

Pos logit contributions
ngth+0.218
Avg+0.201
aund+0.201
esson+0.200
enegger+0.199
Neg logit contributions
 capped-0.161
mil-0.152
 primates-0.150
 kidnapping-0.146
 diving-0.146
Token other
Token ID584
Feature activation-1.89e-01

Pos logit contributions
ngth+0.324
Avg+0.300
aund+0.299
esson+0.298
enegger+0.296
Neg logit contributions
 capped-0.234
mil-0.221
 primates-0.218
 kidnapping-0.212
 diving-0.211
Token cigarette
Token ID17779
Feature activation+9.69e-01

Pos logit contributions
ngth+0.407
Avg+0.376
aund+0.375
esson+0.373
enegger+0.371
Neg logit contributions
 capped-0.304
mil-0.287
 primates-0.284
 kidnapping-0.276
 diving-0.275
Token lighter
Token ID14871
Feature activation+7.84e-01

Pos logit contributions
 capped+1.390
mil+1.291
 primates+1.259
 diving+1.247
 kidnapping+1.237
Neg logit contributions
ngth-2.266
Avg-2.111
enegger-2.091
esson-2.082
aund-2.075
TokenAnd
Token ID1870
Feature activation+2.32e-01

Pos logit contributions
 capped+1.282
mil+1.205
 primates+1.177
 diving+1.167
 kidnapping+1.160
Neg logit contributions
ngth-1.666
enegger-1.537
esson-1.536
Avg-1.536
aund-1.518
Token here
Token ID994
Feature activation-3.32e-01

Pos logit contributions
 capped+0.371
mil+0.349
 primates+0.344
 kidnapping+0.336
 diving+0.335
Neg logit contributions
ngth-0.503
Avg-0.465
aund-0.462
esson-0.461
enegger-0.457
Token is
Token ID318
Feature activation-3.43e-01

Pos logit contributions
ngth+0.806
Avg+0.752
aund+0.750
esson+0.748
enegger+0.742
Neg logit contributions
 capped-0.440
mil-0.411
 primates-0.406
 kidnapping-0.392
 diving-0.388
Token the
Token ID262
Feature activation-1.17e-01

Pos logit contributions
ngth+0.746
Avg+0.690
aund+0.686
esson+0.685
enegger+0.681
Neg logit contributions
 capped-0.545
mil-0.513
 primates-0.505
 kidnapping-0.493
 diving-0.491
Token mag
Token ID2153
Feature activation+4.16e-01

Pos logit contributions
 capped+0.093
mil+0.088
 primates+0.087
 kidnapping+0.085
 diving+0.084
Neg logit contributions
ngth-0.125
Avg-0.115
aund-0.115
esson-0.114
enegger-0.114
Tokeni
Token ID72
Feature activation-3.60e-01

Pos logit contributions
 capped+0.688
mil+0.658
 primates+0.645
 kidnapping+0.626
 diving+0.624
Neg logit contributions
ngth-0.876
Avg-0.807
enegger-0.803
aund-0.798
esson-0.793
Token
Token ID784
Feature activation+4.23e-01

Pos logit contributions
ngth+0.747
Avg+0.690
aund+0.685
esson+0.680
enegger+0.679
Neg logit contributions
 capped-0.607
mil-0.576
 primates-0.569
 kidnapping-0.555
 diving-0.553
Token probably
Token ID2192
Feature activation+4.63e-01

Pos logit contributions
 capped+0.688
mil+0.651
 primates+0.643
 kidnapping+0.626
 diving+0.624
Neg logit contributions
ngth-0.901
Avg-0.836
aund-0.832
esson-0.828
enegger-0.828
Token not
Token ID407
Feature activation+5.07e-02

Pos logit contributions
 capped+0.759
mil+0.715
 primates+0.710
 kidnapping+0.690
 diving+0.686
Neg logit contributions
ngth-0.988
Avg-0.913
aund-0.908
esson-0.901
enegger-0.899
Token relevant
Token ID5981
Feature activation+1.06e+00

Pos logit contributions
 capped+0.084
mil+0.079
 primates+0.079
 kidnapping+0.077
 diving+0.076
Neg logit contributions
ngth-0.107
Avg-0.099
aund-0.098
esson-0.098
enegger-0.097
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
 capped+1.782
 primates+1.696
mil+1.690
 diving+1.646
 kidnapping+1.632
Neg logit contributions
ngth-2.198
Avg-2.023
aund-2.010
esson-2.003
enegger-1.984
Token Move
Token ID10028
Feature activation-2.31e-01

Pos logit contributions
 capped+0.209
mil+0.198
 primates+0.196
 kidnapping+0.190
 diving+0.190
Neg logit contributions
ngth-0.254
Avg-0.234
aund-0.233
esson-0.232
enegger-0.231
Token on
Token ID319
Feature activation+8.04e-02

Pos logit contributions
ngth+0.487
Avg+0.450
aund+0.448
esson+0.447
enegger+0.444
Neg logit contributions
 capped-0.382
mil-0.361
 primates-0.358
 kidnapping-0.347
 diving-0.346
Token
Token ID198
Feature activation-7.94e-02

Pos logit contributions
 capped+0.134
mil+0.127
 primates+0.126
 kidnapping+0.122
 diving+0.122
Neg logit contributions
ngth-0.167
Avg-0.155
aund-0.154
esson-0.153
enegger-0.152
Token
Token ID198
Feature activation-1.33e-01

Pos logit contributions
ngth+0.168
Avg+0.156
aund+0.155
esson+0.155
enegger+0.153
Neg logit contributions
 capped-0.129
mil-0.122
 primates-0.121
 kidnapping-0.117
 disadvant-0.117
Token to
Token ID284
Feature activation+1.74e-01

Pos logit contributions
ngth+0.697
Avg+0.642
aund+0.640
esson+0.638
enegger+0.632
Neg logit contributions
 capped-0.549
mil-0.519
 primates-0.515
 kidnapping-0.501
 diving-0.498
Token celebrate
Token ID10648
Feature activation-2.37e-01

Pos logit contributions
 capped+0.288
mil+0.272
 primates+0.272
 kidnapping+0.264
 diving+0.264
Neg logit contributions
ngth-0.364
Avg-0.336
aund-0.335
esson-0.335
enegger-0.331
Token the
Token ID262
Feature activation+1.09e-01

Pos logit contributions
ngth+0.514
Avg+0.474
aund+0.472
esson+0.471
enegger+0.467
Neg logit contributions
 capped-0.379
mil-0.358
 primates-0.355
 kidnapping-0.346
 diving-0.343
Token birthday
Token ID10955
Feature activation+1.41e-01

Pos logit contributions
 capped+0.184
mil+0.174
 primates+0.173
 kidnapping+0.169
 diving+0.167
Neg logit contributions
ngth-0.229
Avg-0.210
aund-0.209
esson-0.209
enegger-0.207
Token of
Token ID286
Feature activation+1.59e-01

Pos logit contributions
 capped+0.230
mil+0.218
 primates+0.216
 kidnapping+0.209
 diving+0.208
Neg logit contributions
ngth-0.295
Avg-0.273
aund-0.273
esson-0.271
enegger-0.268
Token his
Token ID465
Feature activation+2.15e-01

Pos logit contributions
 capped+0.248
mil+0.234
 primates+0.233
 kidnapping+0.226
 diving+0.225
Neg logit contributions
ngth-0.351
Avg-0.324
aund-0.323
esson-0.322
enegger-0.319
Token brother
Token ID3956
Feature activation-3.67e-01

Pos logit contributions
 capped+0.316
mil+0.294
 primates+0.293
 kidnapping+0.285
 diving+0.284
Neg logit contributions
ngth-0.499
Avg-0.461
aund-0.459
esson-0.458
enegger-0.453
Token,
Token ID11
Feature activation+7.00e-02

Pos logit contributions
ngth+0.721
Avg+0.662
aund+0.659
esson+0.655
enegger+0.651
Neg logit contributions
 capped-0.658
mil-0.625
 primates-0.619
 kidnapping-0.605
 diving-0.601
Token Kim
Token ID6502
Feature activation-4.12e-01

Pos logit contributions
 capped+0.121
mil+0.114
 primates+0.114
 kidnapping+0.111
 diving+0.111
Neg logit contributions
ngth-0.143
Avg-0.131
aund-0.131
esson-0.130
enegger-0.129
Tokenbal
Token ID6893
Feature activation-1.37e-01

Pos logit contributions
ngth+0.805
aund+0.740
Avg+0.740
esson+0.734
enegger+0.729
Neg logit contributions
 capped-0.738
mil-0.700
 primates-0.696
 kidnapping-0.679
 diving-0.674
Token.
Token ID13
Feature activation+2.85e-02

Pos logit contributions
ngth+0.268
Avg+0.245
aund+0.244
esson+0.243
enegger+0.241
Neg logit contributions
 capped-0.248
mil-0.236
 primates-0.233
 kidnapping-0.228
 diving-0.226
Token But
Token ID887
Feature activation+2.34e-01

Pos logit contributions
 capped+0.062
mil+0.059
 primates+0.057
 kidnapping+0.056
respond+0.055
Neg logit contributions
ngth-0.084
aund-0.077
Avg-0.077
esson-0.077
enegger-0.077
Token with
Token ID351
Feature activation+1.01e-02

Pos logit contributions
 capped+0.387
mil+0.360
 primates+0.359
 kidnapping+0.348
 diving+0.346
Neg logit contributions
ngth-0.502
Avg-0.458
aund-0.457
enegger-0.456
esson-0.454
Token a
Token ID257
Feature activation-6.91e-02

Pos logit contributions
 capped+0.016
mil+0.015
 primates+0.015
 kidnapping+0.015
 diving+0.015
Neg logit contributions
ngth-0.022
Avg-0.020
aund-0.020
esson-0.020
enegger-0.020
Token change
Token ID1487
Feature activation-3.84e-01

Pos logit contributions
ngth+0.152
Avg+0.140
aund+0.140
esson+0.139
enegger+0.138
Neg logit contributions
 capped-0.108
mil-0.102
 primates-0.101
 kidnapping-0.098
 diving-0.097
Token to
Token ID284
Feature activation+1.24e-02

Pos logit contributions
ngth+0.806
Avg+0.746
aund+0.744
esson+0.741
enegger+0.733
Neg logit contributions
 capped-0.627
mil-0.597
 primates-0.591
 kidnapping-0.574
 diving-0.570
Token its
Token ID663
Feature activation+1.15e-03

Pos logit contributions
 capped+0.020
mil+0.019
 primates+0.019
 kidnapping+0.018
 diving+0.018
Neg logit contributions
ngth-0.027
Avg-0.025
aund-0.025
esson-0.024
enegger-0.024
Token policies
Token ID4788
Feature activation-4.29e-01

Pos logit contributions
 capped+0.002
mil+0.002
 primates+0.002
 kidnapping+0.002
 diving+0.002
Neg logit contributions
ngth-0.002
Avg-0.002
aund-0.002
esson-0.002
enegger-0.002
Token at
Token ID379
Feature activation-3.81e-01

Pos logit contributions
ngth+0.854
Avg+0.785
aund+0.782
esson+0.779
enegger+0.771
Neg logit contributions
 capped-0.755
mil-0.718
 primates-0.711
 kidnapping-0.693
 diving-0.688
Token the
Token ID262
Feature activation-1.71e-01

Pos logit contributions
ngth+0.830
Avg+0.768
aund+0.764
esson+0.760
enegger+0.760
Neg logit contributions
 capped-0.602
mil-0.567
 primates-0.560
 kidnapping-0.544
 diving-0.542
Token end
Token ID886
Feature activation-3.69e-01

Pos logit contributions
ngth+0.348
Avg+0.319
aund+0.318
esson+0.317
enegger+0.315
Neg logit contributions
 capped-0.296
mil-0.280
 primates-0.277
 kidnapping-0.270
 diving-0.269
Token of
Token ID286
Feature activation-1.07e-02

Pos logit contributions
ngth+0.759
Avg+0.706
aund+0.705
esson+0.703
enegger+0.688
Neg logit contributions
 capped-0.612
mil-0.580
 primates-0.578
 disadvant-0.565
 kidnapping-0.561
Token of
Token ID286
Feature activation+1.07e-01

Pos logit contributions
 capped+0.190
mil+0.179
 primates+0.177
 diving+0.173
 kidnapping+0.173
Neg logit contributions
ngth-0.267
Avg-0.249
aund-0.247
esson-0.246
enegger-0.245
Token things
Token ID1243
Feature activation+2.01e-01

Pos logit contributions
 capped+0.173
mil+0.164
 primates+0.163
 kidnapping+0.159
 diving+0.159
Neg logit contributions
ngth-0.230
Avg-0.212
aund-0.212
esson-0.210
enegger-0.209
Token:
Token ID25
Feature activation+3.54e-01

Pos logit contributions
 capped+0.347
mil+0.328
 primates+0.325
 kidnapping+0.318
 diving+0.317
Neg logit contributions
ngth-0.407
Avg-0.376
aund-0.373
esson-0.371
enegger-0.370
Token going
Token ID1016
Feature activation-1.90e-01

Pos logit contributions
 capped+0.555
mil+0.526
 primates+0.518
 kidnapping+0.507
 diving+0.506
Neg logit contributions
ngth-0.768
Avg-0.713
aund-0.710
esson-0.709
enegger-0.703
Token on
Token ID319
Feature activation+6.09e-03

Pos logit contributions
ngth+0.420
Avg+0.389
aund+0.389
esson+0.387
enegger+0.384
Neg logit contributions
 capped-0.290
mil-0.274
 primates-0.270
 kidnapping-0.262
respond-0.261
Token a
Token ID257
Feature activation+2.21e-01

Pos logit contributions
 capped+0.009
mil+0.008
 primates+0.008
 kidnapping+0.008
 diving+0.008
Neg logit contributions
ngth-0.014
Avg-0.013
aund-0.013
esson-0.013
enegger-0.013
Token holiday
Token ID9912
Feature activation-3.24e-01

Pos logit contributions
 capped+0.285
mil+0.266
 primates+0.262
 kidnapping+0.253
 diving+0.251
Neg logit contributions
ngth-0.545
Avg-0.509
aund-0.508
esson-0.506
enegger-0.502
Token,
Token ID11
Feature activation+1.00e-01

Pos logit contributions
ngth+0.636
Avg+0.584
aund+0.582
esson+0.579
enegger+0.574
Neg logit contributions
 capped-0.579
mil-0.550
 primates-0.545
 kidnapping-0.532
 diving-0.529
Token finding
Token ID4917
Feature activation-5.15e-02

Pos logit contributions
 capped+0.154
mil+0.145
 primates+0.143
 kidnapping+0.140
 diving+0.139
Neg logit contributions
ngth-0.223
Avg-0.206
aund-0.206
esson-0.205
enegger-0.204
Token a
Token ID257
Feature activation+3.17e-01

Pos logit contributions
ngth+0.113
Avg+0.105
aund+0.104
esson+0.104
enegger+0.103
Neg logit contributions
 capped-0.080
mil-0.075
 primates-0.074
 kidnapping-0.072
 disadvant-0.072
Token partner
Token ID5212
Feature activation-1.99e-01

Pos logit contributions
 capped+0.450
mil+0.422
 primates+0.416
 kidnapping+0.404
 diving+0.401
Neg logit contributions
ngth-0.741
Avg-0.691
aund-0.688
esson-0.685
enegger-0.680
Token more
Token ID517
Feature activation-6.11e-02

Pos logit contributions
 capped+0.843
mil+0.787
 primates+0.781
 kidnapping+0.764
 diving+0.756
Neg logit contributions
ngth-1.217
Avg-1.128
aund-1.123
esson-1.113
enegger-1.106
Token out
Token ID503
Feature activation+1.49e-01

Pos logit contributions
ngth+0.136
Avg+0.126
aund+0.126
esson+0.125
enegger+0.124
Neg logit contributions
 capped-0.094
mil-0.088
 primates-0.087
 kidnapping-0.085
 diving-0.084
Token there
Token ID612
Feature activation+3.75e-02

Pos logit contributions
 capped+0.243
mil+0.231
 primates+0.228
 kidnapping+0.222
 diving+0.221
Neg logit contributions
ngth-0.313
Avg-0.291
aund-0.290
esson-0.289
enegger-0.285
Token than
Token ID621
Feature activation+2.22e-01

Pos logit contributions
 capped+0.065
mil+0.061
 primates+0.060
 kidnapping+0.059
 diving+0.059
Neg logit contributions
ngth-0.076
Avg-0.071
aund-0.070
esson-0.070
enegger-0.069
Token most
Token ID749
Feature activation-5.69e-02

Pos logit contributions
 capped+0.356
mil+0.337
 primates+0.334
 kidnapping+0.326
 diving+0.323
Neg logit contributions
ngth-0.479
Avg-0.444
aund-0.441
esson-0.439
enegger-0.435
Token.
Token ID13
Feature activation+1.71e-01

Pos logit contributions
ngth+0.118
Avg+0.109
aund+0.108
esson+0.108
enegger+0.107
Neg logit contributions
 capped-0.096
mil-0.091
 primates-0.090
 kidnapping-0.088
 diving-0.087
Token -
Token ID532
Feature activation+5.03e-01

Pos logit contributions
 capped+0.278
mil+0.265
 primates+0.259
 kidnapping+0.253
 diving+0.251
Neg logit contributions
ngth-0.365
Avg-0.335
aund-0.334
esson-0.332
enegger-0.330
Token July
Token ID2901
Feature activation+9.79e-02

Pos logit contributions
 capped+0.943
mil+0.889
 primates+0.874
 kidnapping+0.857
 diving+0.854
Neg logit contributions
ngth-0.979
aund-0.878
Avg-0.872
enegger-0.867
esson-0.861
Token 6
Token ID718
Feature activation-1.27e-01

Pos logit contributions
 capped+0.153
mil+0.145
 primates+0.145
 disadvant+0.142
 kidnapping+0.140
Neg logit contributions
ngth-0.210
Avg-0.196
aund-0.195
esson-0.195
enegger-0.193
Token,
Token ID11
Feature activation+9.24e-02

Pos logit contributions
ngth+0.254
aund+0.229
Avg+0.228
esson+0.227
enegger+0.226
Neg logit contributions
 capped-0.226
mil-0.216
 primates-0.211
 kidnapping-0.206
respond-0.205
Token 2010
Token ID3050
Feature activation+2.07e-01

Pos logit contributions
 capped+0.190
mil+0.182
 primates+0.178
 kidnapping+0.175
 diving+0.174
Neg logit contributions
ngth-0.162
Avg-0.143
aund-0.143
esson-0.141
enegger-0.141
Token 2018
Token ID2864
Feature activation+2.21e-01

Pos logit contributions
ngth+0.179
Avg+0.165
aund+0.165
esson+0.164
enegger+0.163
Neg logit contributions
 capped-0.142
mil-0.135
 primates-0.134
 kidnapping-0.130
 diving-0.129
Token.
Token ID13
Feature activation-1.78e-02

Pos logit contributions
 capped+0.404
mil+0.380
 primates+0.377
 kidnapping+0.368
 diving+0.367
Neg logit contributions
ngth-0.433
Avg-0.396
aund-0.394
esson-0.392
enegger-0.391
Token
Token ID198
Feature activation+3.33e-02

Pos logit contributions
ngth+0.039
Avg+0.035
aund+0.035
esson+0.035
enegger+0.035
Neg logit contributions
 capped-0.029
mil-0.027
 primates-0.027
 kidnapping-0.026
 diving-0.026
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
 capped+0.054
mil+0.051
 primates+0.051
 kidnapping+0.049
 disadvant+0.049
Neg logit contributions
ngth-0.071
Avg-0.066
aund-0.065
esson-0.065
enegger-0.064
TokenWalker
Token ID39950
Feature activation-2.23e-01

Pos logit contributions
ngth+0.024
aund+0.022
Avg+0.022
esson+0.022
enegger+0.022
Neg logit contributions
 capped-0.020
mil-0.020
 primates-0.019
 kidnapping-0.019
respond-0.019
Token did
Token ID750
Feature activation+7.14e-02

Pos logit contributions
ngth+0.458
Avg+0.425
aund+0.424
esson+0.423
enegger+0.419
Neg logit contributions
 capped-0.369
mil-0.355
 primates-0.350
 kidnapping-0.341
 disadvant-0.339
Token not
Token ID407
Feature activation-6.17e-02

Pos logit contributions
 capped+0.112
mil+0.105
 primates+0.104
 kidnapping+0.101
 diving+0.101
Neg logit contributions
ngth-0.158
Avg-0.146
aund-0.145
esson-0.144
enegger-0.144
Token waste
Token ID7030
Feature activation-8.26e-01

Pos logit contributions
ngth+0.128
Avg+0.118
aund+0.117
esson+0.117
enegger+0.116
Neg logit contributions
 capped-0.105
mil-0.099
 primates-0.098
 kidnapping-0.095
 diving-0.095
Token his
Token ID465
Feature activation+1.11e-01

Pos logit contributions
ngth+1.853
aund+1.724
Avg+1.724
esson+1.718
enegger+1.702
Neg logit contributions
 capped-1.226
mil-1.159
 primates-1.143
 kidnapping-1.110
respond-1.103
Token latest
Token ID3452
Feature activation-5.32e-01

Pos logit contributions
 capped+0.152
mil+0.143
 primates+0.141
 kidnapping+0.136
 diving+0.135
Neg logit contributions
ngth-0.262
Avg-0.244
aund-0.243
esson-0.243
enegger-0.241
Token chance
Token ID2863
Feature activation-1.74e-01

Pos logit contributions
ngth+1.120
Avg+1.036
aund+1.032
esson+1.027
enegger+1.017
Neg logit contributions
 capped-0.866
mil-0.826
 primates-0.815
 kidnapping-0.792
 disadvant-0.788
Token yet
Token ID1865
Feature activation+3.19e-01

Pos logit contributions
ngth+0.323
Avg+0.297
aund+0.296
esson+0.295
enegger+0.292
Neg logit contributions
 capped-0.271
mil-0.257
 primates-0.255
 kidnapping-0.248
 diving-0.247
Token so
Token ID523
Feature activation+1.55e-01

Pos logit contributions
 capped+0.522
mil+0.488
 primates+0.486
 kidnapping+0.475
 diving+0.472
Neg logit contributions
ngth-0.684
Avg-0.631
aund-0.629
esson-0.625
enegger-0.622
Token few
Token ID1178
Feature activation+1.54e-01

Pos logit contributions
 capped+0.226
mil+0.212
 primates+0.208
 kidnapping+0.202
 diving+0.202
Neg logit contributions
ngth-0.360
Avg-0.334
aund-0.333
esson-0.331
enegger-0.329
Token have
Token ID423
Feature activation-1.82e-01

Pos logit contributions
 capped+0.257
 primates+0.241
mil+0.240
 kidnapping+0.233
 diving+0.233
Neg logit contributions
ngth-0.324
aund-0.298
Avg-0.297
esson-0.295
enegger-0.294
Token it
Token ID340
Feature activation-2.32e-01

Pos logit contributions
ngth+0.384
Avg+0.354
aund+0.353
esson+0.351
enegger+0.348
Neg logit contributions
 capped-0.299
mil-0.283
 primates-0.280
 kidnapping-0.272
 diving-0.271
Token,
Token ID11
Feature activation-1.53e-01

Pos logit contributions
ngth+0.479
Avg+0.440
aund+0.438
esson+0.437
enegger+0.432
Neg logit contributions
 capped-0.396
mil-0.373
 primates-0.369
 kidnapping-0.360
 diving-0.360
Token due
Token ID2233
Feature activation-7.51e-01

Pos logit contributions
ngth+0.318
Avg+0.293
aund+0.292
esson+0.291
enegger+0.288
Neg logit contributions
 capped-0.259
mil-0.244
 primates-0.242
 kidnapping-0.236
 diving-0.234
Token to
Token ID284
Feature activation-1.29e-01

Pos logit contributions
ngth+1.612
Avg+1.485
aund+1.481
esson+1.473
enegger+1.451
Neg logit contributions
 capped-1.191
mil-1.137
 primates-1.125
 kidnapping-1.089
respond-1.081
Token the
Token ID262
Feature activation-2.07e-01

Pos logit contributions
ngth+0.279
Avg+0.258
aund+0.257
esson+0.256
enegger+0.254
Neg logit contributions
 capped-0.207
mil-0.194
 primates-0.193
 kidnapping-0.188
 diving-0.187
Token custom
Token ID2183
Feature activation+2.70e-01

Pos logit contributions
ngth+0.433
Avg+0.398
aund+0.397
esson+0.395
enegger+0.393
Neg logit contributions
 capped-0.349
mil-0.326
 primates-0.326
 kidnapping-0.318
 diving-0.316
Token apps
Token ID6725
Feature activation-4.84e-02

Pos logit contributions
 capped+0.442
mil+0.413
 primates+0.409
 kidnapping+0.401
 diving+0.397
Neg logit contributions
ngth-0.578
Avg-0.535
aund-0.530
esson-0.527
enegger-0.524
Token as
Token ID355
Feature activation-1.28e-01

Pos logit contributions
ngth+0.513
Avg+0.473
aund+0.471
esson+0.469
enegger+0.465
Neg logit contributions
 capped-0.425
mil-0.403
 primates-0.398
 kidnapping-0.388
 diving-0.386
Token it
Token ID340
Feature activation-2.57e-01

Pos logit contributions
ngth+0.283
Avg+0.262
aund+0.262
esson+0.261
enegger+0.259
Neg logit contributions
 capped-0.199
mil-0.187
 primates-0.185
 diving-0.180
 kidnapping-0.180
Token tips
Token ID9040
Feature activation-7.41e-01

Pos logit contributions
ngth+0.569
Avg+0.529
aund+0.526
esson+0.524
enegger+0.520
Neg logit contributions
 capped-0.390
mil-0.369
 primates-0.366
 kidnapping-0.355
 disadvant-0.353
Token over
Token ID625
Feature activation-4.44e-02

Pos logit contributions
ngth+1.538
Avg+1.418
aund+1.413
esson+1.408
enegger+1.395
Neg logit contributions
 capped-1.239
mil-1.175
 primates-1.163
 kidnapping-1.132
 diving-1.124
Token,
Token ID11
Feature activation-2.04e-01

Pos logit contributions
ngth+0.090
Avg+0.083
aund+0.083
esson+0.083
enegger+0.082
Neg logit contributions
 capped-0.076
mil-0.072
 primates-0.071
 diving-0.070
 kidnapping-0.070
Token
Token ID447
Feature activation-2.63e-01

Pos logit contributions
ngth+0.437
Avg+0.404
aund+0.403
esson+0.401
enegger+0.397
Neg logit contributions
 capped-0.329
mil-0.312
 primates-0.308
 kidnapping-0.300
 diving-0.297
Token
Token ID251
Feature activation-3.78e-01

Pos logit contributions
ngth+0.570
Avg+0.530
aund+0.526
esson+0.525
enegger+0.519
Neg logit contributions
 capped-0.416
mil-0.393
 primates-0.388
 kidnapping-0.378
 diving-0.376
Token N
Token ID399
Feature activation-4.47e-01

Pos logit contributions
ngth+0.789
Avg+0.728
aund+0.725
esson+0.723
enegger+0.716
Neg logit contributions
 capped-0.629
mil-0.596
 primates-0.590
 kidnapping-0.574
 diving-0.570
Tokenett
Token ID3087
Feature activation-5.80e-01

Pos logit contributions
ngth+0.735
Avg+0.669
aund+0.667
esson+0.664
enegger+0.652
Neg logit contributions
 capped-0.938
mil-0.895
 primates-0.892
 kidnapping-0.873
 disadvant-0.872
Tokenles
Token ID829
Feature activation-2.48e-01

Pos logit contributions
ngth+0.969
esson+0.904
Avg+0.899
aund+0.899
enegger+0.872
Neg logit contributions
 capped-1.178
 disadvant-1.128
 primates-1.119
mil-1.116
 kidnapping-1.104
Token explains
Token ID6688
Feature activation-2.12e-01

Pos logit contributions
ngth+0.460
Avg+0.422
aund+0.420
esson+0.418
enegger+0.414
Neg logit contributions
 capped-0.467
mil-0.445
 primates-0.441
 kidnapping-0.431
 diving-0.428
Token no
Token ID645
Feature activation-1.19e-01

Pos logit contributions
 capped+0.583
mil+0.544
 primates+0.541
 diving+0.525
 kidnapping+0.525
Neg logit contributions
ngth-0.919
Avg-0.858
aund-0.852
esson-0.849
enegger-0.843
Token further
Token ID2252
Feature activation+5.49e-01

Pos logit contributions
ngth+0.273
Avg+0.253
aund+0.252
esson+0.251
enegger+0.249
Neg logit contributions
 capped-0.176
mil-0.166
 primates-0.165
 kidnapping-0.159
 diving-0.159
Token then
Token ID788
Feature activation+1.13e-01

Pos logit contributions
 capped+0.862
 primates+0.816
mil+0.815
 diving+0.791
 kidnapping+0.788
Neg logit contributions
ngth-1.205
Avg-1.119
esson-1.106
aund-1.104
enegger-1.091
Token the
Token ID262
Feature activation-1.56e-02

Pos logit contributions
 capped+0.189
 primates+0.179
mil+0.178
 diving+0.175
 kidnapping+0.174
Neg logit contributions
ngth-0.235
esson-0.217
Avg-0.217
aund-0.215
enegger-0.215
Token river
Token ID7850
Feature activation+3.10e-01

Pos logit contributions
ngth+0.033
esson+0.030
Avg+0.030
aund+0.030
enegger+0.030
Neg logit contributions
 capped-0.026
 primates-0.025
mil-0.025
 kidnapping-0.025
 diving-0.024
Token that
Token ID326
Feature activation-7.45e-02

Pos logit contributions
 capped+0.519
mil+0.493
 primates+0.481
 diving+0.479
 kidnapping+0.473
Neg logit contributions
ngth-0.649
Avg-0.600
aund-0.599
esson-0.598
enegger-0.596
Token runs
Token ID4539
Feature activation-3.03e-01

Pos logit contributions
ngth+0.156
Avg+0.144
aund+0.144
esson+0.143
enegger+0.142
Neg logit contributions
 capped-0.123
mil-0.117
 primates-0.116
 kidnapping-0.112
 diving-0.112
Token through
Token ID832
Feature activation-3.13e-01

Pos logit contributions
ngth+0.564
Avg+0.516
aund+0.514
esson+0.511
enegger+0.505
Neg logit contributions
 capped-0.567
mil-0.542
 primates-0.538
 kidnapping-0.524
 disadvant-0.522
Token Shang
Token ID45797
Feature activation+8.40e-01

Pos logit contributions
ngth+0.633
Avg+0.582
aund+0.579
esson+0.579
enegger+0.572
Neg logit contributions
 capped-0.548
mil-0.518
 primates-0.515
 kidnapping-0.503
 diving-0.502
Tokenba
Token ID7012
Feature activation-4.59e-01

Pos logit contributions
 capped+1.354
mil+1.305
 primates+1.254
 kidnapping+1.227
 diving+1.224
Neg logit contributions
ngth-1.787
Avg-1.664
enegger-1.663
esson-1.658
aund-1.635
Token to
Token ID284
Feature activation-1.04e-02

Pos logit contributions
ngth+0.942
Avg+0.872
esson+0.865
aund+0.864
enegger+0.861
Neg logit contributions
 capped-0.783
mil-0.739
 primates-0.732
 diving-0.715
 kidnapping-0.714
Token agency
Token ID4086
Feature activation-1.41e-01

Pos logit contributions
ngth+0.293
Avg+0.269
aund+0.268
esson+0.267
enegger+0.265
Neg logit contributions
 capped-0.235
mil-0.223
 primates-0.221
 kidnapping-0.215
 diving-0.214
Token's
Token ID338
Feature activation-3.04e-02

Pos logit contributions
ngth+0.295
Avg+0.272
aund+0.271
esson+0.270
enegger+0.268
Neg logit contributions
 capped-0.235
mil-0.222
 primates-0.220
 kidnapping-0.214
 diving-0.213
Token chief
Token ID4039
Feature activation+3.83e-02

Pos logit contributions
ngth+0.065
Avg+0.060
aund+0.060
esson+0.059
enegger+0.059
Neg logit contributions
 capped-0.049
mil-0.047
 primates-0.046
 kidnapping-0.045
 diving-0.045
Token executive
Token ID4640
Feature activation-1.92e-01

Pos logit contributions
 capped+0.060
mil+0.056
 primates+0.056
 kidnapping+0.054
 diving+0.054
Neg logit contributions
ngth-0.084
Avg-0.078
aund-0.078
esson-0.077
enegger-0.077
Token Stella
Token ID45856
Feature activation-1.18e-01

Pos logit contributions
ngth+0.380
Avg+0.350
aund+0.346
esson+0.345
enegger+0.343
Neg logit contributions
 capped-0.344
mil-0.327
 primates-0.324
 kidnapping-0.316
 diving-0.314
Token Av
Token ID5184
Feature activation-6.18e-01

Pos logit contributions
ngth+0.244
Avg+0.224
aund+0.223
esson+0.222
enegger+0.220
Neg logit contributions
 capped-0.199
mil-0.190
 primates-0.186
 kidnapping-0.182
 diving-0.180
Tokenram
Token ID859
Feature activation-2.37e-01

Pos logit contributions
ngth+1.310
Avg+1.211
aund+1.206
esson+1.201
enegger+1.191
Neg logit contributions
 capped-1.006
mil-0.953
 primates-0.941
 kidnapping-0.916
 diving-0.910
Tokenopoulos
Token ID20338
Feature activation+1.63e-01

Pos logit contributions
ngth+0.512
Avg+0.473
aund+0.469
esson+0.466
enegger+0.462
Neg logit contributions
 capped-0.383
mil-0.363
 primates-0.355
 kidnapping-0.346
 diving-0.344
Token said
Token ID531
Feature activation-2.59e-02

Pos logit contributions
 capped+0.253
mil+0.238
 primates+0.235
 kidnapping+0.228
 diving+0.226
Neg logit contributions
ngth-0.363
Avg-0.335
aund-0.333
esson-0.331
enegger-0.328
Token the
Token ID262
Feature activation-9.07e-02

Pos logit contributions
ngth+0.056
Avg+0.051
aund+0.051
esson+0.051
enegger+0.051
Neg logit contributions
 capped-0.043
 primates-0.040
mil-0.040
 kidnapping-0.039
 diving-0.039
Token face
Token ID1986
Feature activation-4.21e-01

Pos logit contributions
ngth+0.189
Avg+0.173
aund+0.172
esson+0.172
enegger+0.171
Neg logit contributions
 capped-0.153
mil-0.145
 primates-0.144
 kidnapping-0.140
 diving-0.139
Tokenort
Token ID419
Feature activation+1.24e-01

Pos logit contributions
 capped+0.182
mil+0.177
 primates+0.173
respond+0.170
 kidnapping+0.170
Neg logit contributions
ngth-0.126
Avg-0.113
enegger-0.111
esson-0.111
aund-0.110
Tokenot
Token ID313
Feature activation+1.63e-01

Pos logit contributions
 capped+0.194
mil+0.186
 primates+0.181
 kidnapping+0.176
respond+0.175
Neg logit contributions
ngth-0.269
Avg-0.250
aund-0.248
esson-0.247
enegger-0.245
Tokenft
Token ID701
Feature activation+2.41e-01

Pos logit contributions
 capped+0.263
mil+0.256
 primates+0.247
respond+0.241
rub+0.241
Neg logit contributions
ngth-0.347
Avg-0.317
aund-0.315
esson-0.314
enegger-0.312
Tokenw
Token ID86
Feature activation+1.67e-01

Pos logit contributions
 capped+0.340
mil+0.323
 primates+0.314
 kidnapping+0.303
respond+0.303
Neg logit contributions
ngth-0.562
Avg-0.524
aund-0.521
esson-0.518
enegger-0.517
Token>
Token ID29
Feature activation+6.12e-01

Pos logit contributions
 capped+0.205
mil+0.195
 primates+0.189
respond+0.181
 kidnapping+0.180
Neg logit contributions
ngth-0.418
Avg-0.392
aund-0.390
esson-0.388
enegger-0.387
Token loving
Token ID14442
Feature activation-5.85e-01

Pos logit contributions
 capped+1.024
mil+0.968
 primates+0.956
 kidnapping+0.934
 diving+0.928
Neg logit contributions
ngth-1.286
Avg-1.185
aund-1.178
esson-1.175
enegger-1.167
Token this
Token ID428
Feature activation+9.71e-02

Pos logit contributions
ngth+1.282
Avg+1.188
aund+1.185
esson+1.180
enegger+1.170
Neg logit contributions
 capped-0.910
mil-0.859
 primates-0.849
 kidnapping-0.825
 diving-0.819
Token meme
Token ID25336
Feature activation+4.52e-01

Pos logit contributions
 capped+0.147
mil+0.139
 primates+0.137
 kidnapping+0.133
 diving+0.132
Neg logit contributions
ngth-0.219
Avg-0.203
aund-0.202
esson-0.201
enegger-0.200
Token [
Token ID685
Feature activation+6.00e-01

Pos logit contributions
 capped+0.773
mil+0.732
 primates+0.720
 kidnapping+0.704
 diving+0.702
Neg logit contributions
ngth-0.931
Avg-0.863
aund-0.854
esson-0.846
enegger-0.841
Token10
Token ID940
Feature activation-2.62e-01

Pos logit contributions
 capped+0.994
mil+0.957
 primates+0.924
respond+0.909
 kidnapping+0.904
Neg logit contributions
ngth-1.271
aund-1.153
enegger-1.149
Avg-1.147
esson-1.140
Token:
Token ID25
Feature activation+3.64e-01

Pos logit contributions
ngth+0.561
Avg+0.522
aund+0.520
esson+0.518
enegger+0.512
Neg logit contributions
 capped-0.419
mil-0.396
 primates-0.393
 kidnapping-0.382
 disadvant-0.381
Token\n
Token ID198
Feature activation+5.65e-02

Pos logit contributions
 capped+0.083
mil+0.078
 primates+0.076
 kidnapping+0.074
 diving+0.074
Neg logit contributions
ngth-0.119
Avg-0.110
aund-0.110
esson-0.109
enegger-0.108
Token\n
Token ID198
Feature activation+3.91e-02

Pos logit contributions
 capped+0.091
mil+0.086
 primates+0.086
 kidnapping+0.083
 disadvant+0.083
Neg logit contributions
ngth-0.120
Avg-0.112
aund-0.111
esson-0.111
enegger-0.109
TokenHere
Token ID4342
Feature activation-4.35e-02

Pos logit contributions
 capped+0.068
mil+0.066
 primates+0.063
respond+0.062
 kidnapping+0.062
Neg logit contributions
ngth-0.080
enegger-0.072
aund-0.072
esson-0.072
Avg-0.071
Token's
Token ID338
Feature activation+5.39e-02

Pos logit contributions
ngth+0.077
Avg+0.070
aund+0.069
esson+0.069
enegger+0.068
Neg logit contributions
 capped-0.087
mil-0.083
 primates-0.082
 kidnapping-0.080
 diving-0.080
Token a
Token ID257
Feature activation+6.92e-03

Pos logit contributions
 capped+0.086
mil+0.081
 primates+0.080
 kidnapping+0.078
 diving+0.078
Neg logit contributions
ngth-0.117
Avg-0.108
aund-0.107
esson-0.107
enegger-0.107
Token breakdown
Token ID14608
Feature activation-9.70e-01

Pos logit contributions
 capped+0.012
mil+0.011
 primates+0.011
 kidnapping+0.011
 diving+0.011
Neg logit contributions
ngth-0.014
Avg-0.013
aund-0.013
esson-0.013
enegger-0.013
Token who
Token ID508
Feature activation+2.39e-01

Pos logit contributions
ngth+1.978
aund+1.831
Avg+1.828
esson+1.822
enegger+1.781
Neg logit contributions
 capped-1.611
mil-1.547
 primates-1.531
 disadvant-1.487
 kidnapping-1.483
Token the
Token ID262
Feature activation+5.59e-02

Pos logit contributions
 capped+0.421
mil+0.395
 primates+0.394
 kidnapping+0.385
 diving+0.384
Neg logit contributions
ngth-0.482
Avg-0.443
aund-0.441
enegger-0.439
esson-0.438
Token the
Token ID262
Feature activation+9.55e-02

Pos logit contributions
 capped+0.089
mil+0.083
 primates+0.083
 kidnapping+0.080
 diving+0.080
Neg logit contributions
ngth-0.123
Avg-0.113
aund-0.113
esson-0.113
enegger-0.113
Token targets
Token ID6670
Feature activation+6.41e-01

Pos logit contributions
 capped+0.151
mil+0.141
 primates+0.141
 kidnapping+0.136
 diving+0.136
Neg logit contributions
ngth-0.211
Avg-0.194
aund-0.194
esson-0.193
enegger-0.193
Token remaining
Token ID5637
Feature activation+3.31e-01

Pos logit contributions
 capped+1.263
mil+1.186
 primates+1.180
 kidnapping+1.155
 diving+1.155
Neg logit contributions
ngth-1.166
Avg-1.064
aund-1.061
enegger-1.045
esson-1.041
Token and
Token ID290
Feature activation-1.73e-01

Pos logit contributions
 capped+0.365
mil+0.343
 primates+0.341
 kidnapping+0.336
 diving+0.332
Neg logit contributions
ngth-0.417
Avg-0.384
aund-0.381
esson-0.379
enegger-0.374
Token I
Token ID314
Feature activation-5.58e-02

Pos logit contributions
ngth+0.363
Avg+0.334
aund+0.333
esson+0.331
enegger+0.329
Neg logit contributions
 capped-0.288
mil-0.273
 primates-0.269
 kidnapping-0.263
 diving-0.261
Token just
Token ID655
Feature activation+9.12e-02

Pos logit contributions
ngth+0.117
Avg+0.108
aund+0.108
esson+0.107
enegger+0.106
Neg logit contributions
 capped-0.093
mil-0.088
 primates-0.087
 kidnapping-0.084
 diving-0.084
Token felt
Token ID2936
Feature activation-1.89e-01

Pos logit contributions
 capped+0.149
mil+0.141
 primates+0.140
 kidnapping+0.136
 diving+0.135
Neg logit contributions
ngth-0.195
Avg-0.179
aund-0.179
esson-0.178
enegger-0.177
Token speech
Token ID4046
Feature activation+1.22e-01

Pos logit contributions
ngth+0.410
Avg+0.380
aund+0.377
esson+0.376
enegger+0.375
Neg logit contributions
 capped-0.304
mil-0.287
 primates-0.283
 kidnapping-0.277
 diving-0.274
Tokenless
Token ID1203
Feature activation-9.25e-01

Pos logit contributions
 capped+0.245
mil+0.233
 primates+0.230
 kidnapping+0.226
 diving+0.225
Neg logit contributions
ngth-0.217
Avg-0.195
enegger-0.195
aund-0.194
esson-0.194
Token.
Token ID13
Feature activation+4.37e-02

Pos logit contributions
ngth+1.731
Avg+1.582
aund+1.577
esson+1.571
enegger+1.557
Neg logit contributions
 capped-1.730
mil-1.651
 primates-1.635
 kidnapping-1.595
 disadvant-1.586
Token\n
Token ID198
Feature activation-5.42e-02

Pos logit contributions
 capped+0.071
mil+0.067
 primates+0.065
 kidnapping+0.064
 diving+0.063
Neg logit contributions
ngth-0.095
Avg-0.087
aund-0.087
esson-0.086
enegger-0.085
Token\n
Token ID198
Feature activation-1.25e-01

Pos logit contributions
ngth+0.116
Avg+0.107
aund+0.107
esson+0.106
enegger+0.105
Neg logit contributions
 capped-0.088
mil-0.083
 primates-0.082
 kidnapping-0.080
 diving-0.079
Token"
Token ID1
Feature activation-2.67e-01

Pos logit contributions
ngth+0.248
aund+0.223
Avg+0.222
esson+0.222
enegger+0.222
Neg logit contributions
 capped-0.224
mil-0.217
 primates-0.210
respond-0.206
 kidnapping-0.206
TokenOh
Token ID5812
Feature activation+9.38e-02

Pos logit contributions
ngth+0.531
aund+0.483
Avg+0.483
esson+0.480
enegger+0.479
Neg logit contributions
 capped-0.474
mil-0.456
 primates-0.445
 kidnapping-0.436
respond-0.434
Token and
Token ID290
Feature activation-1.81e-01

Pos logit contributions
ngth+0.274
Avg+0.250
aund+0.249
esson+0.248
enegger+0.246
Neg logit contributions
 capped-0.256
mil-0.242
 primates-0.241
 kidnapping-0.236
 diving-0.234
Token about
Token ID546
Feature activation-3.91e-02

Pos logit contributions
ngth+0.361
Avg+0.330
aund+0.329
esson+0.328
enegger+0.325
Neg logit contributions
 capped-0.322
mil-0.304
 primates-0.302
 kidnapping-0.295
 diving-0.293
Token 53
Token ID7192
Feature activation-4.85e-01

Pos logit contributions
ngth+0.077
Avg+0.070
aund+0.070
esson+0.070
enegger+0.069
Neg logit contributions
 capped-0.070
mil-0.067
 primates-0.066
 kidnapping-0.065
 diving-0.064
Token%
Token ID4
Feature activation-4.56e-01

Pos logit contributions
ngth+0.653
Avg+0.578
aund+0.577
esson+0.570
enegger+0.560
Neg logit contributions
 capped-1.158
mil-1.115
 primates-1.107
 kidnapping-1.090
 diving-1.085
Token were
Token ID547
Feature activation-2.43e-01

Pos logit contributions
ngth+0.714
Avg+0.639
aund+0.635
esson+0.633
enegger+0.625
Neg logit contributions
 capped-0.999
mil-0.958
 primates-0.951
 kidnapping-0.932
 diving-0.928
Token office
Token ID2607
Feature activation-1.00e+00

Pos logit contributions
ngth+0.511
Avg+0.469
aund+0.468
esson+0.467
enegger+0.463
Neg logit contributions
 capped-0.406
mil-0.383
 primates-0.381
 kidnapping-0.371
 diving-0.369
Token workers
Token ID3259
Feature activation-1.43e-01

Pos logit contributions
ngth+1.634
Avg+1.475
aund+1.467
esson+1.458
enegger+1.439
Neg logit contributions
 capped-2.125
mil-2.039
 primates-2.019
 kidnapping-1.982
 disadvant-1.972
Token or
Token ID393
Feature activation-3.68e-01

Pos logit contributions
ngth+0.273
Avg+0.248
aund+0.247
esson+0.247
enegger+0.244
Neg logit contributions
 capped-0.266
mil-0.253
 primates-0.250
 kidnapping-0.245
 diving-0.244
Token in
Token ID287
Feature activation-2.58e-01

Pos logit contributions
ngth+0.772
Avg+0.712
aund+0.709
esson+0.707
enegger+0.702
Neg logit contributions
 capped-0.611
mil-0.578
 primates-0.573
 kidnapping-0.557
 diving-0.554
Token the
Token ID262
Feature activation-1.90e-01

Pos logit contributions
ngth+0.562
Avg+0.521
aund+0.520
esson+0.517
enegger+0.513
Neg logit contributions
 capped-0.403
mil-0.381
 primates-0.376
 kidnapping-0.365
 diving-0.363
Token military
Token ID2422
Feature activation+2.72e-02

Pos logit contributions
ngth+0.414
Avg+0.383
aund+0.381
esson+0.380
enegger+0.377
Neg logit contributions
 capped-0.299
mil-0.282
 primates-0.280
 kidnapping-0.272
 diving-0.270
Token when
Token ID618
Feature activation+1.29e-01

Pos logit contributions
 capped+0.584
mil+0.547
 primates+0.538
 diving+0.531
 kidnapping+0.531
Neg logit contributions
ngth-0.779
Avg-0.723
enegger-0.714
aund-0.713
esson-0.709
Token a
Token ID257
Feature activation+2.66e-01

Pos logit contributions
 capped+0.215
mil+0.203
 primates+0.201
 kidnapping+0.198
 diving+0.197
Neg logit contributions
ngth-0.273
Avg-0.252
aund-0.250
esson-0.249
enegger-0.248
Token TC
Token ID17283
Feature activation-4.37e-01

Pos logit contributions
 capped+0.407
mil+0.378
 primates+0.376
 kidnapping+0.374
 diving+0.369
Neg logit contributions
ngth-0.596
Avg-0.558
aund-0.551
esson-0.550
enegger-0.549
TokenRI
Token ID7112
Feature activation-5.17e-01

Pos logit contributions
ngth+0.889
Avg+0.817
aund+0.812
esson+0.809
enegger+0.803
Neg logit contributions
 capped-0.754
mil-0.716
 primates-0.707
 kidnapping-0.692
 diving-0.688
Token truck
Token ID7779
Feature activation-2.58e-01

Pos logit contributions
ngth+1.133
Avg+1.050
aund+1.046
esson+1.042
enegger+1.034
Neg logit contributions
 capped-0.808
mil-0.762
 primates-0.753
 kidnapping-0.734
 diving-0.728
Token drove
Token ID10357
Feature activation-1.03e+00

Pos logit contributions
ngth+0.567
Avg+0.526
aund+0.521
esson+0.520
enegger+0.516
Neg logit contributions
 capped-0.408
mil-0.382
 primates-0.377
 kidnapping-0.370
 diving-0.366
Token by
Token ID416
Feature activation-1.17e-01

Pos logit contributions
ngth+2.097
Avg+1.931
aund+1.930
esson+1.919
enegger+1.894
Neg logit contributions
 capped-1.746
mil-1.668
 primates-1.645
 disadvant-1.598
respond-1.593
Token,
Token ID11
Feature activation+8.64e-03

Pos logit contributions
ngth+0.241
Avg+0.223
enegger+0.220
esson+0.220
aund+0.219
Neg logit contributions
 capped-0.202
mil-0.189
 primates-0.188
 kidnapping-0.187
 diving-0.185
Token and
Token ID290
Feature activation+4.57e-02

Pos logit contributions
 capped+0.014
mil+0.014
 primates+0.013
 kidnapping+0.013
 diving+0.013
Neg logit contributions
ngth-0.018
Avg-0.017
aund-0.017
enegger-0.017
esson-0.017
Token a
Token ID257
Feature activation+1.59e-01

Pos logit contributions
 capped+0.077
mil+0.072
 primates+0.072
 kidnapping+0.071
 diving+0.070
Neg logit contributions
ngth-0.096
Avg-0.088
aund-0.087
enegger-0.087
esson-0.087
Token strange
Token ID6283
Feature activation-2.34e-01

Pos logit contributions
 capped+0.241
mil+0.224
 primates+0.222
 kidnapping+0.220
 diving+0.218
Neg logit contributions
ngth-0.359
Avg-0.335
aund-0.332
esson-0.331
enegger-0.331
Token less
Token ID1342
Feature activation+1.53e-01

Pos logit contributions
ngth+0.272
Avg+0.253
aund+0.253
esson+0.252
enegger+0.250
Neg logit contributions
 capped-0.160
mil-0.150
 primates-0.148
 kidnapping-0.144
 diving-0.142
Token mirror
Token ID10162
Feature activation-1.83e-01

Pos logit contributions
 capped+0.235
mil+0.219
 primates+0.217
 kidnapping+0.212
 diving+0.210
Neg logit contributions
ngth-0.344
Avg-0.318
aund-0.317
esson-0.316
enegger-0.314
Tokening
Token ID278
Feature activation+1.02e-02

Pos logit contributions
ngth+0.411
Avg+0.380
aund+0.379
esson+0.378
enegger+0.375
Neg logit contributions
 capped-0.278
mil-0.262
 primates-0.259
 kidnapping-0.251
 diving-0.250
Token the
Token ID262
Feature activation+1.29e-02

Pos logit contributions
 capped+0.016
mil+0.015
 primates+0.015
 kidnapping+0.015
 diving+0.015
Neg logit contributions
ngth-0.022
Avg-0.020
aund-0.020
esson-0.020
enegger-0.020
Token Rab
Token ID28559
Feature activation-2.37e-01

Pos logit contributions
 capped+0.022
mil+0.021
 primates+0.021
 kidnapping+0.020
 diving+0.020
Neg logit contributions
ngth-0.027
aund-0.024
Avg-0.024
esson-0.024
enegger-0.024
Tokenid
Token ID312
Feature activation-1.12e+00

Pos logit contributions
ngth+0.508
aund+0.462
enegger+0.461
Avg+0.461
esson+0.453
Neg logit contributions
 capped-0.391
mil-0.372
 primates-0.364
respond-0.354
 kidnapping-0.353
Token Pupp
Token ID20926
Feature activation+9.76e-01

Pos logit contributions
ngth+2.072
Avg+1.934
esson+1.917
aund+1.910
enegger+1.874
Neg logit contributions
 capped-2.074
mil-1.959
 primates-1.944
 kidnapping-1.912
 disadvant-1.910
Tokenies
Token ID444
Feature activation-1.44e-01

Pos logit contributions
 capped+1.883
mil+1.824
 primates+1.789
respond+1.762
 hon+1.752
Neg logit contributions
ngth-1.790
Avg-1.599
enegger-1.573
aund-1.554
esson-1.527
Token slate
Token ID27589
Feature activation-4.89e-01

Pos logit contributions
ngth+0.311
Avg+0.286
aund+0.285
esson+0.283
enegger+0.283
Neg logit contributions
 capped-0.233
mil-0.220
 primates-0.218
 kidnapping-0.211
 diving-0.210
Token.
Token ID13
Feature activation-9.10e-02

Pos logit contributions
ngth+0.971
Avg+0.891
aund+0.887
esson+0.881
enegger+0.878
Neg logit contributions
 capped-0.870
mil-0.825
 primates-0.818
 kidnapping-0.797
 diving-0.792
Token Essentially
Token ID34039
Feature activation+3.23e-01

Pos logit contributions
ngth+0.197
aund+0.180
Avg+0.179
esson+0.179
enegger+0.178
Neg logit contributions
 capped-0.147
mil-0.140
 primates-0.137
 kidnapping-0.132
respond-0.132