TOP ACTIVATIONS
MAX = 2.385

a commissioned aud itors Ernst & Young to look into allegations
t ... 15 1 of 15 Photo : Le
tweets asking " when Sam Bac ile would be arrested "
You can check out my Twitter feed by going here
person of interest at Brigham and Women 's Hospital , two
ian . com or call 03 30 333 68 46 .
David Brock 's Media Matters for America . 2001
director of Concern ed Women for America , we pt as
that also brought Joe N ieu w end yk to the
Width , Height Write - Ver b ose - Message "
Republican nominee Donald Trump Donald John Trump House committee believes it
ahl ia Lith wick has a rundown of the theory ,
of taxes ' Former Sanders campaign spokesman : Clinton staff are
more to help us continue to write about the issues that
Sync This slideshow requires JavaScript . Out
DNC in 2020 after ' not quite even handed ' 2016
former executive at Bear St ear ns , a Goldman competitor
I ve written a couple of books , which
Send email to this address Enter Your Name Add a comment
he was heard over a hot mic saying the Republican Party

MIN ACTIVATIONS
MIN = -2.243

Park under frost ed sequ o ias and frozen water falls
to look to Ant wan Good ley in key moments ,
was spun off from Cond e N ast 's Wired Digital
' Former Sanders campaign spokesman : Clinton staff are ' big
man , and F . Scott Fitzgerald . If
fair treatment by DNC in 2020 after ' not quite even
' Former Sanders campaign spokesman : Clinton staff are ' big
Fore word by Patrick J . Buchanan , and critical essays
' Former Sanders campaign spokesman : Clinton staff are ' big
fair treatment by DNC in 2020 after ' not quite even
fair treatment by DNC in 2020 after ' not quite even
of Murray N . Roth bard ( Prom etheus Books ,
than the previously planned Russian Black Sea resort of Sochi ,
Prom etheus Books , 2000 ), my biography of the great
in the use of cart o ony expressions and slap stick
The lesson of the Gang of Four is you don 't
one source , Penguins GM Jim Rutherford approached Fle ury leading
enty ev / R IA Nov ost i / K remlin
I TA 6 Jac op o Gu arn ieri ( Ast
- rent al service Air B n B and discount shopping

INTERVAL 1.228 - 2.385
CONTAINS 0.039%

am ok . CNN president Jeff Zucker s then
. Ted Cruz speaks to potential supporters
Sign up for our Wine Club today . Did you know
become an In Daily supporter . Powered by Press Pat ron
Democratic Party . Lat ino voters have leaned more

INTERVAL 0.071 - 1.228
CONTAINS 34.999%

filmed . It is our collaboration which makes this kind of
materials such as earth and builders ' rubble is likely to
, who is one of the finest intuitive mathematic ians I
white male perspective we are most used to , but rather
like Carl Hag elin and Nick Bon ino . No mention

INTERVAL -1.086 - 0.071
CONTAINS 64.903%

to several media outlets , including The Herald .
says F AS affects only a small area of speech
Protection Agency has retained some staff to look after its environmental
I can to lower outrage ously high gasoline and heating -
( The SF PD en forces moving violations , while SF

INTERVAL -2.243 - -1.086
CONTAINS 0.058%

headed chance and forcing Gu iller mo O cho a into
odd echoes of M . Night Shy am alan
previously , the " R eth inking Mathematics : Teaching Social
to some extent between wet and dry seasons and the situation
. Car la del P onte , a member
Tokena
Token ID64
Feature activation+7.46e-02

Pos logit contributions
alog+0.106
 chars+0.104
 Guilty+0.094
aukee+0.094
 illum+0.090
Neg logit contributions
                -0.129
hai-0.128
kef-0.127
ynski-0.116
erb-0.115
Token commissioned
Token ID20462
Feature activation+2.55e-01

Pos logit contributions
alog+0.049
 chars+0.048
 Guilty+0.044
aukee+0.044
 illum+0.042
Neg logit contributions
                -0.060
hai-0.059
kef-0.059
ynski-0.054
erb-0.053
Token aud
Token ID2709
Feature activation-6.12e-02

Pos logit contributions
alog+0.181
 chars+0.177
aukee+0.162
 Guilty+0.162
 illum+0.155
Neg logit contributions
                -0.192
hai-0.191
kef-0.190
ynski-0.172
erb-0.171
Tokenitors
Token ID6742
Feature activation+6.85e-02

Pos logit contributions
                +0.046
hai+0.046
kef+0.046
ynski+0.042
erb+0.041
Neg logit contributions
alog-0.043
 chars-0.042
 Guilty-0.039
aukee-0.039
 illum-0.037
Token Ernst
Token ID38129
Feature activation-3.53e-01

Pos logit contributions
alog+0.048
 chars+0.047
 Guilty+0.043
aukee+0.043
 illum+0.041
Neg logit contributions
                -0.052
hai-0.052
kef-0.052
ynski-0.047
erb-0.047
Token &
Token ID1222
Feature activation+2.39e+00

Pos logit contributions
                +0.267
hai+0.266
kef+0.264
ynski+0.239
erb+0.237
Neg logit contributions
alog-0.249
 chars-0.244
 Guilty-0.223
aukee-0.222
 illum-0.214
Token Young
Token ID6960
Feature activation-1.60e-02

Pos logit contributions
alog+1.377
 chars+1.338
aukee+1.230
 Guilty+1.199
 illum+1.144
Neg logit contributions
                -2.110
hai-2.104
kef-2.066
erb-1.914
ynski-1.867
Token to
Token ID284
Feature activation+3.91e-01

Pos logit contributions
                +0.012
hai+0.012
kef+0.012
ynski+0.011
erb+0.011
Neg logit contributions
alog-0.011
 chars-0.011
 Guilty-0.010
aukee-0.010
 illum-0.009
Token look
Token ID804
Feature activation-1.58e-02

Pos logit contributions
alog+0.235
 chars+0.230
aukee+0.207
 Guilty+0.207
 illum+0.196
Neg logit contributions
                -0.336
hai-0.335
kef-0.333
ynski-0.305
erb-0.303
Token into
Token ID656
Feature activation+3.49e-01

Pos logit contributions
                +0.012
hai+0.012
kef+0.011
ynski+0.010
erb+0.010
Neg logit contributions
alog-0.012
 chars-0.011
 Guilty-0.010
aukee-0.010
 illum-0.010
Token allegations
Token ID7636
Feature activation+1.91e-01

Pos logit contributions
alog+0.254
 chars+0.249
aukee+0.229
 Guilty+0.229
 illum+0.219
Neg logit contributions
                -0.255
hai-0.255
kef-0.253
ynski-0.228
erb-0.226
Token
Token ID447
Feature activation+2.26e-01

Pos logit contributions
                +0.098
hai+0.098
kef+0.097
ynski+0.087
erb+0.087
Neg logit contributions
alog-0.104
 chars-0.102
 Guilty-0.094
aukee-0.094
 illum-0.090
Token
Token ID247
Feature activation-9.34e-02

Pos logit contributions
alog+0.144
 chars+0.141
 Guilty+0.128
aukee+0.127
 illum+0.122
Neg logit contributions
                -0.186
hai-0.186
kef-0.184
ynski-0.168
erb-0.167
Tokent
Token ID83
Feature activation-4.27e-02

Pos logit contributions
                +0.072
hai+0.071
kef+0.071
ynski+0.064
erb+0.064
Neg logit contributions
alog-0.065
 chars-0.064
 Guilty-0.058
aukee-0.058
 illum-0.055
Token...
Token ID986
Feature activation+1.70e-01

Pos logit contributions
                +0.033
hai+0.033
kef+0.033
ynski+0.030
erb+0.029
Neg logit contributions
alog-0.029
 chars-0.029
 Guilty-0.026
aukee-0.026
 illum-0.025
Token 15
Token ID1315
Feature activation+9.35e-01

Pos logit contributions
alog+0.114
 chars+0.112
 Guilty+0.102
aukee+0.102
 illum+0.097
Neg logit contributions
                -0.135
hai-0.134
kef-0.133
ynski-0.121
erb-0.120
Token 1
Token ID352
Feature activation+2.13e+00

Pos logit contributions
alog+0.615
 chars+0.600
aukee+0.552
 Guilty+0.543
 illum+0.522
Neg logit contributions
                -0.750
hai-0.748
kef-0.742
ynski-0.676
erb-0.669
Token of
Token ID286
Feature activation+6.06e-01

Pos logit contributions
alog+1.191
 chars+1.152
aukee+1.027
 Guilty+1.004
 illum+0.957
Neg logit contributions
hai-1.949
                -1.939
kef-1.917
ynski-1.762
erb-1.757
Token 15
Token ID1315
Feature activation+9.47e-01

Pos logit contributions
alog+0.407
 chars+0.398
aukee+0.364
 Guilty+0.361
 illum+0.348
Neg logit contributions
                -0.480
hai-0.477
kef-0.473
ynski-0.430
erb-0.426
Token Photo
Token ID5555
Feature activation+5.62e-01

Pos logit contributions
alog+0.587
 chars+0.573
aukee+0.519
 Guilty+0.516
 illum+0.494
Neg logit contributions
                -0.796
hai-0.791
kef-0.787
ynski-0.719
erb-0.712
Token:
Token ID25
Feature activation+1.48e-01

Pos logit contributions
alog+0.377
 chars+0.371
aukee+0.337
 Guilty+0.336
 illum+0.322
Neg logit contributions
                -0.444
hai-0.442
kef-0.437
ynski-0.398
erb-0.395
Token Le
Token ID1004
Feature activation+8.29e-01

Pos logit contributions
alog+0.100
 chars+0.098
aukee+0.090
 Guilty+0.090
 illum+0.086
Neg logit contributions
                -0.116
hai-0.116
kef-0.115
ynski-0.104
erb-0.103
Token tweets
Token ID12665
Feature activation+2.47e-01

Pos logit contributions
alog+0.240
 chars+0.235
aukee+0.214
 Guilty+0.214
 illum+0.205
Neg logit contributions
                -0.280
hai-0.279
kef-0.278
ynski-0.251
erb-0.250
Token asking
Token ID4737
Feature activation+1.91e-01

Pos logit contributions
alog+0.167
 chars+0.163
 Guilty+0.149
aukee+0.148
 illum+0.142
Neg logit contributions
                -0.195
hai-0.195
kef-0.193
ynski-0.175
erb-0.174
Token "
Token ID366
Feature activation-1.15e-01

Pos logit contributions
alog+0.132
 chars+0.130
aukee+0.118
 Guilty+0.118
 illum+0.113
Neg logit contributions
                -0.147
hai-0.146
kef-0.145
ynski-0.131
erb-0.131
Tokenwhen
Token ID12518
Feature activation+4.36e-03

Pos logit contributions
                +0.085
hai+0.085
kef+0.085
ynski+0.076
erb+0.076
Neg logit contributions
alog-0.082
 chars-0.080
 Guilty-0.074
aukee-0.073
 illum-0.070
Token Sam
Token ID3409
Feature activation+6.27e-01

Pos logit contributions
alog+0.003
 chars+0.003
 Guilty+0.003
aukee+0.003
 illum+0.002
Neg logit contributions
                -0.003
hai-0.003
kef-0.003
ynski-0.003
erb-0.003
Token Bac
Token ID35583
Feature activation+2.02e+00

Pos logit contributions
alog+0.423
 chars+0.415
aukee+0.379
 Guilty+0.377
 illum+0.361
Neg logit contributions
                -0.493
hai-0.493
kef-0.488
ynski-0.443
erb-0.440
Tokenile
Token ID576
Feature activation+3.58e-02

Pos logit contributions
alog+1.057
 chars+1.048
aukee+0.944
 Guilty+0.922
 illum+0.896
Neg logit contributions
hai-1.878
kef-1.872
                -1.861
ynski-1.737
erb-1.719
Token would
Token ID561
Feature activation-2.12e-01

Pos logit contributions
alog+0.024
 chars+0.023
 Guilty+0.021
aukee+0.021
 illum+0.020
Neg logit contributions
                -0.029
hai-0.029
kef-0.028
ynski-0.026
erb-0.026
Token be
Token ID307
Feature activation+2.95e-02

Pos logit contributions
                +0.171
hai+0.171
kef+0.169
ynski+0.154
erb+0.153
Neg logit contributions
alog-0.139
 chars-0.136
 Guilty-0.124
aukee-0.123
 illum-0.118
Token arrested
Token ID5169
Feature activation-9.79e-02

Pos logit contributions
alog+0.020
 chars+0.020
 Guilty+0.018
aukee+0.018
 illum+0.017
Neg logit contributions
                -0.023
hai-0.023
kef-0.023
ynski-0.021
erb-0.020
Token"
Token ID1
Feature activation+7.92e-02

Pos logit contributions
                +0.074
hai+0.074
kef+0.073
ynski+0.066
erb+0.066
Neg logit contributions
alog-0.069
 chars-0.068
 Guilty-0.062
aukee-0.062
 illum-0.059
Token
Token ID198
Feature activation-6.42e-01

Pos logit contributions
alog+0.034
 chars+0.033
 Guilty+0.030
aukee+0.030
 illum+0.029
Neg logit contributions
                -0.039
hai-0.039
kef-0.039
ynski-0.035
erb-0.035
TokenYou
Token ID1639
Feature activation+5.64e-01

Pos logit contributions
hai+0.474
                +0.473
kef+0.470
ynski+0.426
erb+0.422
Neg logit contributions
alog-0.464
 chars-0.455
aukee-0.417
 Guilty-0.416
 illum-0.399
Token can
Token ID460
Feature activation-6.62e-01

Pos logit contributions
alog+0.378
 chars+0.368
 Guilty+0.337
aukee+0.336
 illum+0.318
Neg logit contributions
                -0.448
hai-0.444
kef-0.441
ynski-0.402
erb-0.401
Token check
Token ID2198
Feature activation+3.90e-02

Pos logit contributions
hai+0.545
                +0.544
kef+0.540
ynski+0.492
erb+0.489
Neg logit contributions
alog-0.423
 chars-0.414
 Guilty-0.372
aukee-0.370
 illum-0.356
Token out
Token ID503
Feature activation+4.22e-02

Pos logit contributions
alog+0.025
 chars+0.025
 Guilty+0.023
aukee+0.023
 illum+0.021
Neg logit contributions
                -0.032
hai-0.032
kef-0.031
ynski-0.029
erb-0.028
Token my
Token ID616
Feature activation+2.00e+00

Pos logit contributions
alog+0.032
 chars+0.032
 Guilty+0.029
aukee+0.029
 illum+0.028
Neg logit contributions
                -0.029
hai-0.029
kef-0.029
ynski-0.026
erb-0.026
Token Twitter
Token ID3009
Feature activation+1.61e+00

Pos logit contributions
alog+1.350
 chars+1.299
 Guilty+1.215
aukee+1.197
 illum+1.137
Neg logit contributions
hai-1.581
kef-1.561
                -1.559
erb-1.394
ynski-1.393
Token feed
Token ID3745
Feature activation+2.78e-01

Pos logit contributions
alog+0.841
 chars+0.799
aukee+0.731
 Guilty+0.729
 illum+0.670
Neg logit contributions
hai-1.516
                -1.515
kef-1.500
erb-1.379
ynski-1.374
Token by
Token ID416
Feature activation+5.63e-01

Pos logit contributions
alog+0.208
 chars+0.204
aukee+0.189
 Guilty+0.188
 illum+0.181
Neg logit contributions
                -0.198
hai-0.197
kef-0.196
ynski-0.175
erb-0.174
Token going
Token ID1016
Feature activation+5.89e-01

Pos logit contributions
alog+0.354
 chars+0.346
 Guilty+0.314
aukee+0.313
 illum+0.299
Neg logit contributions
                -0.468
hai-0.465
kef-0.464
ynski-0.423
erb-0.420
Token here
Token ID994
Feature activation+1.75e-01

Pos logit contributions
alog+0.362
 chars+0.351
aukee+0.321
 Guilty+0.318
 illum+0.304
Neg logit contributions
                -0.498
hai-0.495
kef-0.494
ynski-0.450
erb-0.448
Token person
Token ID1048
Feature activation-1.36e-01

Pos logit contributions
                +0.072
hai+0.072
kef+0.072
ynski+0.065
erb+0.064
Neg logit contributions
alog-0.066
 chars-0.064
 Guilty-0.059
aukee-0.059
 illum-0.056
Token of
Token ID286
Feature activation+2.61e-01

Pos logit contributions
                +0.105
hai+0.105
kef+0.104
ynski+0.095
erb+0.094
Neg logit contributions
alog-0.093
 chars-0.092
 Guilty-0.084
aukee-0.083
 illum-0.080
Token interest
Token ID1393
Feature activation-2.49e-01

Pos logit contributions
alog+0.211
 chars+0.207
 Guilty+0.192
aukee+0.192
 illum+0.185
Neg logit contributions
                -0.171
hai-0.170
kef-0.169
ynski-0.150
erb-0.149
Token at
Token ID379
Feature activation-1.96e-01

Pos logit contributions
                +0.191
hai+0.191
kef+0.190
ynski+0.172
erb+0.171
Neg logit contributions
alog-0.173
 chars-0.169
 Guilty-0.155
aukee-0.154
 illum-0.148
Token Brigham
Token ID37434
Feature activation-4.67e-01

Pos logit contributions
                +0.147
hai+0.146
kef+0.145
ynski+0.131
erb+0.130
Neg logit contributions
alog-0.139
 chars-0.136
 Guilty-0.125
aukee-0.125
 illum-0.119
Token and
Token ID290
Feature activation+2.00e+00

Pos logit contributions
                +0.414
hai+0.413
kef+0.411
ynski+0.377
erb+0.375
Neg logit contributions
alog-0.269
 chars-0.262
 Guilty-0.235
aukee-0.234
 illum-0.222
Token Women
Token ID6926
Feature activation+3.04e-01

Pos logit contributions
alog+1.219
 chars+1.210
 Guilty+1.085
aukee+1.081
 illum+1.018
Neg logit contributions
                -1.700
hai-1.694
kef-1.673
ynski-1.537
erb-1.507
Token's
Token ID338
Feature activation+1.03e+00

Pos logit contributions
alog+0.182
 chars+0.178
 Guilty+0.159
aukee+0.159
 illum+0.151
Neg logit contributions
                -0.263
hai-0.262
kef-0.260
ynski-0.239
erb-0.237
Token Hospital
Token ID9256
Feature activation+4.58e-02

Pos logit contributions
alog+0.691
 chars+0.678
aukee+0.615
 Guilty+0.614
 illum+0.590
Neg logit contributions
                -0.814
hai-0.813
kef-0.808
ynski-0.731
erb-0.727
Token,
Token ID11
Feature activation-2.41e-03

Pos logit contributions
alog+0.031
 chars+0.031
 Guilty+0.028
aukee+0.028
 illum+0.027
Neg logit contributions
                -0.036
hai-0.036
kef-0.035
ynski-0.032
erb-0.032
Token two
Token ID734
Feature activation+2.06e-02

Pos logit contributions
                +0.002
hai+0.002
kef+0.002
ynski+0.002
erb+0.002
Neg logit contributions
alog-0.002
 chars-0.002
 Guilty-0.001
aukee-0.001
 illum-0.001
Tokenian
Token ID666
Feature activation+2.79e-01

Pos logit contributions
alog+0.080
 chars+0.078
aukee+0.072
 Guilty+0.072
 illum+0.070
Neg logit contributions
                -0.068
hai-0.068
kef-0.067
ynski-0.060
erb-0.060
Token.
Token ID13
Feature activation+1.34e+00

Pos logit contributions
alog+0.179
 chars+0.175
aukee+0.159
 Guilty+0.159
 illum+0.152
Neg logit contributions
                -0.229
hai-0.228
kef-0.226
ynski-0.206
erb-0.205
Tokencom
Token ID785
Feature activation+2.75e-01

Pos logit contributions
alog+0.762
 chars+0.738
aukee+0.677
 Guilty+0.668
 illum+0.635
Neg logit contributions
                -1.192
hai-1.188
kef-1.173
ynski-1.087
erb-1.085
Token or
Token ID393
Feature activation+2.35e-01

Pos logit contributions
alog+0.186
 chars+0.182
 Guilty+0.166
aukee+0.166
 illum+0.158
Neg logit contributions
                -0.215
hai-0.214
kef-0.213
ynski-0.193
erb-0.192
Token call
Token ID869
Feature activation+1.67e+00

Pos logit contributions
alog+0.182
 chars+0.179
aukee+0.165
 Guilty+0.165
 illum+0.159
Neg logit contributions
                -0.161
hai-0.160
kef-0.159
ynski-0.142
erb-0.141
Token 03
Token ID7643
Feature activation+1.96e+00

Pos logit contributions
alog+1.080
 chars+1.066
aukee+0.997
 Guilty+0.967
 illum+0.931
Neg logit contributions
                -1.345
hai-1.333
kef-1.323
ynski-1.215
erb-1.201
Token30
Token ID1270
Feature activation+2.66e-02

Pos logit contributions
alog+1.428
 chars+1.393
aukee+1.325
 Guilty+1.278
 illum+1.249
Neg logit contributions
                -1.438
kef-1.410
hai-1.403
ynski-1.288
erb-1.264
Token 333
Token ID23460
Feature activation+1.35e+00

Pos logit contributions
alog+0.018
 chars+0.017
aukee+0.016
 Guilty+0.016
 illum+0.015
Neg logit contributions
                -0.021
hai-0.021
kef-0.021
ynski-0.019
erb-0.019
Token 68
Token ID8257
Feature activation-3.10e-01

Pos logit contributions
alog+0.933
 chars+0.898
aukee+0.845
 Guilty+0.817
 illum+0.792
Neg logit contributions
                -1.051
hai-1.043
kef-1.039
ynski-0.937
erb-0.927
Token46
Token ID3510
Feature activation+6.94e-01

Pos logit contributions
                +0.237
hai+0.236
kef+0.234
ynski+0.212
erb+0.211
Neg logit contributions
alog-0.216
 chars-0.213
 Guilty-0.194
aukee-0.193
 illum-0.185
Token.
Token ID13
Feature activation+3.53e-01

Pos logit contributions
alog+0.452
 chars+0.441
aukee+0.404
 Guilty+0.402
 illum+0.383
Neg logit contributions
                -0.563
hai-0.559
kef-0.557
erb-0.505
ynski-0.505
Token David
Token ID3271
Feature activation+3.08e-01

Pos logit contributions
alog+0.102
 chars+0.100
aukee+0.091
 Guilty+0.091
 illum+0.087
Neg logit contributions
                -0.112
hai-0.112
kef-0.111
ynski-0.101
erb-0.100
Token Brock
Token ID20501
Feature activation+4.59e-01

Pos logit contributions
alog+0.187
 chars+0.183
aukee+0.165
 Guilty+0.164
 illum+0.156
Neg logit contributions
                -0.264
hai-0.263
kef-0.261
ynski-0.239
erb-0.238
Token's
Token ID338
Feature activation+2.69e-01

Pos logit contributions
alog+0.295
 chars+0.289
aukee+0.262
 Guilty+0.261
 illum+0.250
Neg logit contributions
                -0.375
hai-0.375
kef-0.372
ynski-0.339
erb-0.337
Token Media
Token ID6343
Feature activation-1.95e-01

Pos logit contributions
alog+0.180
 chars+0.177
aukee+0.161
 Guilty+0.160
 illum+0.153
Neg logit contributions
                -0.213
hai-0.212
kef-0.211
ynski-0.191
erb-0.190
Token Matters
Token ID30587
Feature activation+3.24e-02

Pos logit contributions
                +0.157
hai+0.157
kef+0.156
ynski+0.142
erb+0.141
Neg logit contributions
alog-0.128
 chars-0.125
 Guilty-0.114
aukee-0.113
 illum-0.108
Token for
Token ID329
Feature activation+1.91e+00

Pos logit contributions
alog+0.023
 chars+0.022
 Guilty+0.020
aukee+0.020
 illum+0.020
Neg logit contributions
                -0.025
hai-0.025
kef-0.024
ynski-0.022
erb-0.022
Token America
Token ID2253
Feature activation+7.57e-02

Pos logit contributions
alog+1.266
 chars+1.230
aukee+1.105
 Guilty+1.100
 illum+1.077
Neg logit contributions
                -1.531
hai-1.528
kef-1.521
erb-1.377
ynski-1.369
Token.
Token ID13
Feature activation+1.95e-01

Pos logit contributions
alog+0.049
 chars+0.048
 Guilty+0.044
aukee+0.043
 illum+0.042
Neg logit contributions
                -0.061
hai-0.061
kef-0.061
ynski-0.055
erb-0.055
Token
Token ID198
Feature activation-2.54e-02

Pos logit contributions
alog+0.130
 chars+0.128
aukee+0.116
 Guilty+0.116
 illum+0.111
Neg logit contributions
                -0.155
hai-0.154
kef-0.153
ynski-0.139
erb-0.138
Token
Token ID198
Feature activation+1.06e-01

Pos logit contributions
                +0.020
hai+0.020
kef+0.020
ynski+0.018
erb+0.018
Neg logit contributions
alog-0.017
 chars-0.017
 Guilty-0.015
aukee-0.015
 illum-0.014
Token2001
Token ID14585
Feature activation+3.03e-01

Pos logit contributions
alog+0.075
 chars+0.074
 Guilty+0.068
aukee+0.067
 illum+0.065
Neg logit contributions
                -0.080
hai-0.080
kef-0.079
ynski-0.072
erb-0.071
Token director
Token ID3437
Feature activation-9.23e-02

Pos logit contributions
alog+0.447
 chars+0.438
 Guilty+0.401
aukee+0.400
 illum+0.382
Neg logit contributions
                -0.508
hai-0.506
kef-0.502
ynski-0.455
erb-0.452
Token of
Token ID286
Feature activation-8.58e-02

Pos logit contributions
                +0.070
hai+0.069
kef+0.069
ynski+0.062
erb+0.062
Neg logit contributions
alog-0.065
 chars-0.064
 Guilty-0.059
aukee-0.058
 illum-0.056
Token Concern
Token ID32265
Feature activation+3.19e-01

Pos logit contributions
                +0.064
hai+0.064
kef+0.064
ynski+0.058
erb+0.057
Neg logit contributions
alog-0.061
 chars-0.060
 Guilty-0.055
aukee-0.055
 illum-0.052
Tokened
Token ID276
Feature activation-2.93e-01

Pos logit contributions
alog+0.256
 chars+0.253
aukee+0.234
 Guilty+0.233
 illum+0.225
Neg logit contributions
                -0.210
hai-0.209
kef-0.207
ynski-0.184
erb-0.183
Token Women
Token ID6926
Feature activation-1.82e-02

Pos logit contributions
                +0.250
hai+0.249
kef+0.248
ynski+0.227
erb+0.225
Neg logit contributions
alog-0.179
 chars-0.175
 Guilty-0.158
aukee-0.157
 illum-0.149
Token for
Token ID329
Feature activation+1.89e+00

Pos logit contributions
                +0.014
hai+0.013
kef+0.013
ynski+0.012
erb+0.012
Neg logit contributions
alog-0.013
 chars-0.013
 Guilty-0.012
aukee-0.012
 illum-0.011
Token America
Token ID2253
Feature activation+1.57e-01

Pos logit contributions
alog+1.241
 chars+1.225
 Guilty+1.100
aukee+1.099
 illum+1.065
Neg logit contributions
                -1.527
hai-1.506
kef-1.496
erb-1.379
ynski-1.376
Token,
Token ID11
Feature activation+7.36e-02

Pos logit contributions
alog+0.107
 chars+0.105
 Guilty+0.096
aukee+0.096
 illum+0.092
Neg logit contributions
                -0.122
hai-0.121
kef-0.121
ynski-0.109
erb-0.108
Token we
Token ID356
Feature activation+2.34e-01

Pos logit contributions
alog+0.048
 chars+0.047
 Guilty+0.042
aukee+0.042
 illum+0.040
Neg logit contributions
                -0.060
hai-0.060
kef-0.059
ynski-0.054
erb-0.053
Tokenpt
Token ID457
Feature activation-1.03e-01

Pos logit contributions
alog+0.167
 chars+0.164
 Guilty+0.150
aukee+0.150
 illum+0.144
Neg logit contributions
                -0.175
hai-0.175
kef-0.173
ynski-0.156
erb-0.155
Token as
Token ID355
Feature activation-6.09e-02

Pos logit contributions
                +0.080
hai+0.080
kef+0.080
ynski+0.072
erb+0.072
Neg logit contributions
alog-0.070
 chars-0.069
 Guilty-0.063
aukee-0.063
 illum-0.060
Token that
Token ID326
Feature activation-1.98e-01

Pos logit contributions
alog+0.079
 chars+0.077
 Guilty+0.071
aukee+0.071
 illum+0.068
Neg logit contributions
                -0.076
hai-0.076
kef-0.075
ynski-0.068
erb-0.067
Token also
Token ID635
Feature activation-2.00e-01

Pos logit contributions
                +0.159
hai+0.158
kef+0.157
ynski+0.143
erb+0.142
Neg logit contributions
alog-0.131
 chars-0.129
 Guilty-0.117
aukee-0.116
 illum-0.111
Token brought
Token ID3181
Feature activation+5.56e-01

Pos logit contributions
                +0.150
hai+0.150
kef+0.149
ynski+0.134
erb+0.134
Neg logit contributions
alog-0.142
 chars-0.140
 Guilty-0.128
aukee-0.127
 illum-0.122
Token Joe
Token ID5689
Feature activation+9.07e-02

Pos logit contributions
alog+0.378
 chars+0.371
 Guilty+0.337
aukee+0.337
 illum+0.323
Neg logit contributions
                -0.434
hai-0.432
kef-0.429
ynski-0.390
erb-0.386
Token N
Token ID399
Feature activation-6.79e-02

Pos logit contributions
alog+0.058
 chars+0.057
 Guilty+0.051
aukee+0.051
 illum+0.049
Neg logit contributions
                -0.075
hai-0.074
kef-0.074
ynski-0.067
erb-0.067
Tokenieu
Token ID22304
Feature activation+1.88e+00

Pos logit contributions
                +0.043
hai+0.043
kef+0.043
ynski+0.038
erb+0.038
Neg logit contributions
alog-0.056
 chars-0.055
 Guilty-0.051
aukee-0.051
 illum-0.049
Tokenw
Token ID86
Feature activation-4.53e-01

Pos logit contributions
alog+1.374
 chars+1.357
aukee+1.252
 Guilty+1.238
 illum+1.202
Neg logit contributions
                -1.384
hai-1.364
kef-1.348
erb-1.227
ynski-1.227
Tokenend
Token ID437
Feature activation+6.78e-01

Pos logit contributions
                +0.349
hai+0.349
kef+0.346
ynski+0.313
erb+0.311
Neg logit contributions
alog-0.312
 chars-0.306
aukee-0.279
 Guilty-0.279
 illum-0.267
Tokenyk
Token ID48361
Feature activation+3.87e-01

Pos logit contributions
alog+0.316
 chars+0.310
 Guilty+0.269
aukee+0.267
 illum+0.251
Neg logit contributions
                -0.673
hai-0.673
kef-0.667
ynski-0.623
erb-0.620
Token to
Token ID284
Feature activation+1.34e-01

Pos logit contributions
alog+0.269
 chars+0.264
 Guilty+0.241
aukee+0.240
 illum+0.231
Neg logit contributions
                -0.297
hai-0.296
kef-0.293
ynski-0.266
erb-0.264
Token the
Token ID262
Feature activation+3.14e-02

Pos logit contributions
alog+0.099
 chars+0.097
 Guilty+0.089
aukee+0.089
 illum+0.086
Neg logit contributions
                -0.096
hai-0.096
kef-0.095
ynski-0.085
erb-0.085
Token Width
Token ID38807
Feature activation+3.88e-03

Pos logit contributions
                +0.009
hai+0.009
kef+0.009
ynski+0.008
erb+0.008
Neg logit contributions
alog-0.008
 chars-0.008
 Guilty-0.007
aukee-0.007
 illum-0.007
Token,
Token ID11
Feature activation+7.68e-02

Pos logit contributions
alog+0.003
 chars+0.003
 Guilty+0.002
aukee+0.002
 illum+0.002
Neg logit contributions
                -0.003
hai-0.003
kef-0.003
ynski-0.003
erb-0.003
TokenHeight
Token ID23106
Feature activation-1.24e-01

Pos logit contributions
alog+0.056
 chars+0.055
 Guilty+0.050
aukee+0.050
 illum+0.048
Neg logit contributions
                -0.056
hai-0.056
kef-0.056
ynski-0.050
erb-0.050
Token Write
Token ID19430
Feature activation+3.77e-01

Pos logit contributions
                +0.090
hai+0.090
kef+0.089
ynski+0.080
erb+0.080
Neg logit contributions
alog-0.092
 chars-0.090
 Guilty-0.083
aukee-0.082
 illum-0.079
Token-
Token ID12
Feature activation+1.55e-01

Pos logit contributions
alog+0.254
 chars+0.249
aukee+0.227
 Guilty+0.227
 illum+0.216
Neg logit contributions
                -0.297
hai-0.296
kef-0.294
ynski-0.267
erb-0.265
TokenVer
Token ID13414
Feature activation+1.88e+00

Pos logit contributions
alog+0.142
 chars+0.140
 Guilty+0.130
aukee+0.130
 illum+0.126
Neg logit contributions
                -0.084
hai-0.084
kef-0.083
ynski-0.072
erb-0.071
Tokenb
Token ID65
Feature activation+2.09e-01

Pos logit contributions
alog+1.098
 chars+1.067
aukee+0.976
 Guilty+0.966
 illum+0.907
Neg logit contributions
                -1.647
hai-1.637
kef-1.623
erb-1.508
ynski-1.472
Tokenose
Token ID577
Feature activation+1.56e-01

Pos logit contributions
alog+0.164
 chars+0.161
aukee+0.149
 Guilty+0.148
 illum+0.143
Neg logit contributions
                -0.141
hai-0.141
kef-0.139
ynski-0.124
erb-0.124
Token -
Token ID532
Feature activation+1.96e-01

Pos logit contributions
alog+0.108
 chars+0.105
aukee+0.096
 Guilty+0.096
 illum+0.092
Neg logit contributions
                -0.120
hai-0.120
kef-0.119
ynski-0.107
erb-0.107
TokenMessage
Token ID12837
Feature activation+4.32e-01

Pos logit contributions
alog+0.159
 chars+0.156
aukee+0.145
 Guilty+0.145
 illum+0.139
Neg logit contributions
                -0.127
hai-0.126
kef-0.125
ynski-0.111
erb-0.110
Token "
Token ID366
Feature activation-1.05e-01

Pos logit contributions
alog+0.300
 chars+0.293
aukee+0.269
 Guilty+0.268
 illum+0.257
Neg logit contributions
                -0.331
hai-0.330
kef-0.327
ynski-0.295
erb-0.295
Token Republican
Token ID3415
Feature activation+2.68e-01

Pos logit contributions
alog+0.191
 chars+0.186
aukee+0.168
 Guilty+0.168
 illum+0.160
Neg logit contributions
                -0.259
hai-0.258
kef-0.257
ynski-0.234
erb-0.233
Token nominee
Token ID10429
Feature activation+1.71e-01

Pos logit contributions
alog+0.166
 chars+0.162
 Guilty+0.146
aukee+0.146
 illum+0.139
Neg logit contributions
                -0.225
hai-0.225
kef-0.223
ynski-0.204
erb-0.202
Token Donald
Token ID3759
Feature activation+5.80e-01

Pos logit contributions
alog+0.104
 chars+0.102
 Guilty+0.092
aukee+0.092
 illum+0.087
Neg logit contributions
                -0.145
hai-0.144
kef-0.143
ynski-0.131
erb-0.130
Token Trump
Token ID1301
Feature activation+4.12e-01

Pos logit contributions
alog+0.386
 chars+0.378
aukee+0.346
 Guilty+0.344
 illum+0.330
Neg logit contributions
                -0.461
hai-0.460
kef-0.456
ynski-0.415
erb-0.412
Token Donald
Token ID3759
Feature activation+8.09e-01

Pos logit contributions
alog+0.254
 chars+0.248
 Guilty+0.224
aukee+0.223
 illum+0.213
Neg logit contributions
                -0.347
hai-0.347
kef-0.344
ynski-0.315
erb-0.313
Token John
Token ID1757
Feature activation+1.85e+00

Pos logit contributions
alog+0.632
 chars+0.621
aukee+0.576
 Guilty+0.571
 illum+0.553
Neg logit contributions
                -0.548
hai-0.545
kef-0.540
ynski-0.484
erb-0.482
Token Trump
Token ID1301
Feature activation+6.08e-01

Pos logit contributions
alog+1.280
 chars+1.246
aukee+1.159
 Guilty+1.139
 illum+1.111
Neg logit contributions
                -1.427
hai-1.420
kef-1.409
ynski-1.283
erb-1.260
TokenHouse
Token ID18102
Feature activation+6.00e-02

Pos logit contributions
alog+0.412
 chars+0.404
 Guilty+0.367
aukee+0.367
 illum+0.352
Neg logit contributions
hai-0.476
                -0.476
kef-0.472
erb-0.426
ynski-0.426
Token committee
Token ID5583
Feature activation+1.33e+00

Pos logit contributions
alog+0.035
 chars+0.034
 Guilty+0.030
aukee+0.030
 illum+0.029
Neg logit contributions
                -0.053
hai-0.053
kef-0.053
ynski-0.048
erb-0.048
Token believes
Token ID5804
Feature activation+3.96e-01

Pos logit contributions
alog+1.137
 chars+1.111
aukee+1.037
 Guilty+1.037
 illum+0.999
Neg logit contributions
hai-0.807
                -0.804
kef-0.798
ynski-0.700
erb-0.699
Token it
Token ID340
Feature activation-2.47e-02

Pos logit contributions
alog+0.300
 chars+0.293
aukee+0.270
 Guilty+0.270
 illum+0.261
Neg logit contributions
hai-0.280
                -0.280
kef-0.277
ynski-0.248
erb-0.247
Tokenahl
Token ID15668
Feature activation-1.77e-02

Pos logit contributions
                +0.271
hai+0.271
kef+0.269
ynski+0.242
erb+0.240
Neg logit contributions
alog-0.266
 chars-0.261
 Guilty-0.239
aukee-0.238
 illum-0.228
Tokenia
Token ID544
Feature activation-6.76e-01

Pos logit contributions
                +0.015
hai+0.015
kef+0.015
ynski+0.013
erb+0.013
Neg logit contributions
alog-0.011
 chars-0.011
 Guilty-0.010
aukee-0.010
 illum-0.009
Token Lith
Token ID19730
Feature activation-2.94e-01

Pos logit contributions
                +0.562
hai+0.561
kef+0.558
ynski+0.507
erb+0.506
Neg logit contributions
alog-0.426
 chars-0.418
 Guilty-0.377
aukee-0.376
 illum-0.361
Tokenwick
Token ID16239
Feature activation-5.97e-01

Pos logit contributions
                +0.305
hai+0.304
kef+0.303
ynski+0.281
erb+0.280
Neg logit contributions
alog-0.125
 chars-0.121
 Guilty-0.103
aukee-0.103
 illum-0.095
Token has
Token ID468
Feature activation-4.80e-01

Pos logit contributions
hai+0.397
                +0.394
kef+0.394
ynski+0.348
erb+0.347
Neg logit contributions
alog-0.474
 chars-0.468
 Guilty-0.433
aukee-0.431
 illum-0.417
Token a
Token ID257
Feature activation+1.84e+00

Pos logit contributions
                +0.383
hai+0.382
kef+0.379
ynski+0.345
erb+0.343
Neg logit contributions
alog-0.318
 chars-0.313
 Guilty-0.284
aukee-0.282
 illum-0.271
Token rundown
Token ID45602
Feature activation+3.75e-01

Pos logit contributions
alog+1.316
 chars+1.276
aukee+1.180
 Guilty+1.160
 illum+1.116
Neg logit contributions
hai-1.398
                -1.376
kef-1.372
erb-1.244
ynski-1.234
Token of
Token ID286
Feature activation+9.18e-01

Pos logit contributions
alog+0.267
 chars+0.262
 Guilty+0.240
aukee+0.240
 illum+0.230
Neg logit contributions
                -0.281
hai-0.280
kef-0.279
ynski-0.251
erb-0.250
Token the
Token ID262
Feature activation+5.32e-01

Pos logit contributions
alog+0.683
 chars+0.673
aukee+0.621
 Guilty+0.616
 illum+0.592
Neg logit contributions
                -0.654
hai-0.652
kef-0.649
ynski-0.590
erb-0.580
Token theory
Token ID4583
Feature activation-2.31e-01

Pos logit contributions
alog+0.348
 chars+0.340
aukee+0.311
 Guilty+0.308
 illum+0.295
Neg logit contributions
                -0.429
hai-0.428
kef-0.424
ynski-0.388
erb-0.386
Token,
Token ID11
Feature activation-3.71e-01

Pos logit contributions
                +0.185
hai+0.185
kef+0.183
ynski+0.167
erb+0.166
Neg logit contributions
alog-0.152
 chars-0.149
 Guilty-0.135
aukee-0.135
 illum-0.129
Token of
Token ID286
Feature activation-7.42e-01

Pos logit contributions
                +1.210
kef+1.199
hai+1.194
erb+1.098
ynski+1.093
Neg logit contributions
alog-1.005
 chars-0.991
 Guilty-0.904
aukee-0.894
 illum-0.847
Token taxes
Token ID5704
Feature activation-7.67e-01

Pos logit contributions
                +0.541
hai+0.538
kef+0.533
ynski+0.483
erb+0.479
Neg logit contributions
alog-0.543
 chars-0.535
 Guilty-0.490
aukee-0.489
 illum-0.470
Token'
Token ID6
Feature activation-8.15e-01

Pos logit contributions
                +0.600
hai+0.597
kef+0.595
ynski+0.539
erb+0.537
Neg logit contributions
alog-0.522
 chars-0.511
 Guilty-0.466
aukee-0.463
 illum-0.443
Token Former
Token ID14466
Feature activation-6.52e-01

Pos logit contributions
                +0.507
kef+0.504
hai+0.503
ynski+0.444
erb+0.440
Neg logit contributions
alog-0.684
 chars-0.673
 Guilty-0.629
aukee-0.624
 illum-0.603
Token Sanders
Token ID5831
Feature activation+2.63e-01

Pos logit contributions
                +0.507
hai+0.506
kef+0.502
ynski+0.456
erb+0.451
Neg logit contributions
alog-0.445
 chars-0.438
aukee-0.399
 Guilty-0.398
 illum-0.381
Token campaign
Token ID1923
Feature activation+1.83e+00

Pos logit contributions
alog+0.199
 chars+0.196
 Guilty+0.180
aukee+0.180
 illum+0.173
Neg logit contributions
                -0.185
hai-0.185
kef-0.183
ynski-0.164
erb-0.163
Token spokesman
Token ID6523
Feature activation-4.85e-01

Pos logit contributions
alog+1.371
 chars+1.341
 Guilty+1.248
aukee+1.227
 illum+1.197
Neg logit contributions
hai-1.296
                -1.280
kef-1.278
ynski-1.139
erb-1.128
Token:
Token ID25
Feature activation-1.84e+00

Pos logit contributions
                +0.392
hai+0.392
kef+0.389
erb+0.353
ynski+0.353
Neg logit contributions
alog-0.316
 chars-0.310
 Guilty-0.281
aukee-0.280
 illum-0.267
Token Clinton
Token ID2605
Feature activation+5.09e-01

Pos logit contributions
hai+1.439
kef+1.426
                +1.416
ynski+1.305
erb+1.275
Neg logit contributions
alog-1.247
 chars-1.221
 Guilty-1.153
aukee-1.118
 illum-1.041
Token staff
Token ID3085
Feature activation-6.34e-01

Pos logit contributions
alog+0.342
 chars+0.332
aukee+0.305
 Guilty+0.302
 illum+0.290
Neg logit contributions
hai-0.402
                -0.401
kef-0.398
ynski-0.362
erb-0.359
Token are
Token ID389
Feature activation-1.00e+00

Pos logit contributions
                +0.490
hai+0.487
kef+0.484
ynski+0.437
erb+0.435
Neg logit contributions
alog-0.434
 chars-0.429
 Guilty-0.392
aukee-0.390
 illum-0.372
Token more
Token ID517
Feature activation-2.89e-01

Pos logit contributions
alog+0.023
 chars+0.023
 Guilty+0.021
aukee+0.021
 illum+0.020
Neg logit contributions
                -0.028
hai-0.028
kef-0.028
ynski-0.025
erb-0.025
Token to
Token ID284
Feature activation+5.76e-01

Pos logit contributions
                +0.223
hai+0.223
kef+0.221
ynski+0.200
erb+0.199
Neg logit contributions
alog-0.199
 chars-0.195
 Guilty-0.178
aukee-0.177
 illum-0.170
Token help
Token ID1037
Feature activation+5.23e-02

Pos logit contributions
alog+0.482
 chars+0.471
aukee+0.442
 Guilty+0.439
 illum+0.422
Neg logit contributions
                -0.361
hai-0.357
kef-0.356
ynski-0.315
erb-0.313
Token us
Token ID514
Feature activation+1.05e-02

Pos logit contributions
alog+0.037
 chars+0.036
aukee+0.033
 Guilty+0.033
 illum+0.032
Neg logit contributions
                -0.039
hai-0.039
kef-0.039
ynski-0.035
erb-0.035
Token continue
Token ID2555
Feature activation+1.10e+00

Pos logit contributions
alog+0.007
 chars+0.007
 Guilty+0.006
aukee+0.006
 illum+0.006
Neg logit contributions
                -0.008
hai-0.008
kef-0.008
ynski-0.007
erb-0.007
Token to
Token ID284
Feature activation+1.81e+00

Pos logit contributions
alog+0.794
 chars+0.766
aukee+0.714
 Guilty+0.707
 illum+0.678
Neg logit contributions
                -0.815
hai-0.812
kef-0.804
ynski-0.725
erb-0.723
Token write
Token ID3551
Feature activation-2.18e-02

Pos logit contributions
alog+1.419
 chars+1.367
 Guilty+1.291
aukee+1.289
 illum+1.217
Neg logit contributions
                -1.235
hai-1.226
kef-1.217
ynski-1.109
erb-1.090
Token about
Token ID546
Feature activation-2.91e-01

Pos logit contributions
                +0.018
hai+0.018
kef+0.017
ynski+0.016
erb+0.016
Neg logit contributions
alog-0.014
 chars-0.014
 Guilty-0.013
aukee-0.013
 illum-0.012
Token the
Token ID262
Feature activation+6.71e-01

Pos logit contributions
                +0.212
hai+0.212
kef+0.210
ynski+0.189
erb+0.188
Neg logit contributions
alog-0.213
 chars-0.209
 Guilty-0.192
aukee-0.191
 illum-0.184
Token issues
Token ID2428
Feature activation+4.67e-02

Pos logit contributions
alog+0.439
 chars+0.426
aukee+0.391
 Guilty+0.388
 illum+0.370
Neg logit contributions
                -0.544
hai-0.540
kef-0.534
erb-0.488
ynski-0.488
Token that
Token ID326
Feature activation+1.11e+00

Pos logit contributions
alog+0.034
 chars+0.033
 Guilty+0.031
aukee+0.031
 illum+0.029
Neg logit contributions
                -0.034
hai-0.034
kef-0.034
ynski-0.031
erb-0.030
Token Sync
Token ID35908
Feature activation+2.80e-01

Pos logit contributions
alog+0.069
 chars+0.068
 Guilty+0.062
aukee+0.062
 illum+0.059
Neg logit contributions
                -0.074
hai-0.074
kef-0.074
ynski-0.067
erb-0.066
Token
Token ID198
Feature activation+3.58e-02

Pos logit contributions
alog+0.197
 chars+0.194
 Guilty+0.177
aukee+0.177
 illum+0.170
Neg logit contributions
                -0.211
hai-0.211
kef-0.209
ynski-0.189
erb-0.188
Token
Token ID198
Feature activation+1.56e-01

Pos logit contributions
alog+0.024
 chars+0.024
 Guilty+0.021
aukee+0.021
 illum+0.021
Neg logit contributions
                -0.028
hai-0.028
kef-0.028
ynski-0.025
erb-0.025
TokenThis
Token ID1212
Feature activation+2.56e-01

Pos logit contributions
alog+0.114
 chars+0.112
 Guilty+0.103
aukee+0.103
 illum+0.099
Neg logit contributions
                -0.114
hai-0.114
kef-0.113
ynski-0.101
erb-0.101
Token slideshow
Token ID48828
Feature activation+4.42e-01

Pos logit contributions
alog+0.171
 chars+0.168
 Guilty+0.153
aukee+0.153
 illum+0.146
Neg logit contributions
                -0.203
hai-0.202
kef-0.201
ynski-0.182
erb-0.181
Token requires
Token ID4433
Feature activation+1.79e+00

Pos logit contributions
alog+0.303
 chars+0.296
aukee+0.272
 Guilty+0.271
 illum+0.260
Neg logit contributions
                -0.342
hai-0.340
kef-0.338
ynski-0.307
erb-0.305
Token JavaScript
Token ID11933
Feature activation+9.48e-01

Pos logit contributions
alog+1.419
 chars+1.373
aukee+1.286
 Guilty+1.284
 illum+1.247
Neg logit contributions
                -1.189
kef-1.189
hai-1.185
ynski-1.057
erb-1.039
Token.
Token ID13
Feature activation+9.65e-02

Pos logit contributions
alog+0.624
 chars+0.605
aukee+0.561
 Guilty+0.554
 illum+0.532
Neg logit contributions
                -0.761
hai-0.757
kef-0.749
ynski-0.685
erb-0.679
Token
Token ID198
Feature activation+1.45e-01

Pos logit contributions
alog+0.062
 chars+0.060
aukee+0.055
 Guilty+0.055
 illum+0.052
Neg logit contributions
                -0.079
hai-0.079
kef-0.078
ynski-0.072
erb-0.071
Token
Token ID198
Feature activation+2.37e-01

Pos logit contributions
alog+0.098
 chars+0.096
aukee+0.087
 Guilty+0.087
 illum+0.083
Neg logit contributions
                -0.114
hai-0.113
kef-0.113
ynski-0.102
erb-0.102
TokenOut
Token ID7975
Feature activation-1.44e-01

Pos logit contributions
alog+0.173
 chars+0.170
 Guilty+0.156
aukee+0.156
 illum+0.150
Neg logit contributions
                -0.173
hai-0.173
kef-0.171
ynski-0.154
erb-0.153
Token DNC
Token ID17824
Feature activation+1.52e-01

Pos logit contributions
alog+0.425
 chars+0.414
aukee+0.378
 Guilty+0.376
 illum+0.359
Neg logit contributions
                -0.526
hai-0.525
kef-0.521
ynski-0.477
erb-0.468
Token in
Token ID287
Feature activation-4.56e-01

Pos logit contributions
alog+0.095
 chars+0.092
aukee+0.083
 Guilty+0.083
 illum+0.079
Neg logit contributions
                -0.127
hai-0.127
kef-0.126
ynski-0.115
erb-0.114
Token 2020
Token ID12131
Feature activation-1.83e+00

Pos logit contributions
                +0.300
hai+0.300
kef+0.297
ynski+0.265
erb+0.261
Neg logit contributions
alog-0.365
 chars-0.361
 Guilty-0.334
aukee-0.333
 illum-0.320
Token after
Token ID706
Feature activation+2.13e-01

Pos logit contributions
                +1.558
kef+1.544
hai+1.539
erb+1.416
ynski+1.396
Neg logit contributions
 chars-1.130
alog-1.116
 Guilty-0.997
aukee-0.988
 illum-0.946
Token '
Token ID705
Feature activation+2.21e-02

Pos logit contributions
alog+0.164
 chars+0.161
 Guilty+0.148
aukee+0.148
 illum+0.143
Neg logit contributions
                -0.147
hai-0.147
kef-0.145
ynski-0.130
erb-0.129
Tokennot
Token ID1662
Feature activation+1.77e+00

Pos logit contributions
alog+0.017
 chars+0.017
 Guilty+0.016
aukee+0.016
 illum+0.015
Neg logit contributions
                -0.015
hai-0.015
kef-0.015
ynski-0.013
erb-0.013
Token quite
Token ID2407
Feature activation+4.49e-01

Pos logit contributions
alog+1.351
 chars+1.329
aukee+1.226
 Guilty+1.215
 illum+1.178
Neg logit contributions
                -1.238
hai-1.223
kef-1.209
erb-1.089
ynski-1.071
Token even
Token ID772
Feature activation-3.82e-01

Pos logit contributions
alog+0.332
 chars+0.326
aukee+0.300
 Guilty+0.300
 illum+0.287
Neg logit contributions
                -0.323
hai-0.323
kef-0.319
ynski-0.288
erb-0.286
Token handed
Token ID10158
Feature activation-7.97e-01

Pos logit contributions
                +0.295
hai+0.294
kef+0.292
ynski+0.264
erb+0.263
Neg logit contributions
alog-0.263
 chars-0.259
 Guilty-0.236
aukee-0.235
 illum-0.226
Token'
Token ID6
Feature activation-4.45e-01

Pos logit contributions
hai+0.623
                +0.619
kef+0.616
ynski+0.556
erb+0.553
Neg logit contributions
alog-0.544
 chars-0.534
aukee-0.486
 Guilty-0.483
 illum-0.462
Token 2016
Token ID1584
Feature activation-5.22e-01

Pos logit contributions
                +0.315
hai+0.314
kef+0.312
ynski+0.280
erb+0.277
Neg logit contributions
alog-0.335
 chars-0.330
 Guilty-0.303
aukee-0.302
 illum-0.291
Token former
Token ID1966
Feature activation+1.92e-01

Pos logit contributions
                +0.099
hai+0.098
kef+0.098
ynski+0.088
erb+0.088
Neg logit contributions
alog-0.087
 chars-0.085
 Guilty-0.078
aukee-0.078
 illum-0.074
Token executive
Token ID4640
Feature activation-3.50e-02

Pos logit contributions
alog+0.118
 chars+0.116
 Guilty+0.104
aukee+0.104
 illum+0.099
Neg logit contributions
                -0.162
hai-0.161
kef-0.160
ynski-0.146
erb-0.146
Token at
Token ID379
Feature activation+1.56e-01

Pos logit contributions
                +0.025
hai+0.025
kef+0.025
ynski+0.023
erb+0.023
Neg logit contributions
alog-0.026
 chars-0.025
 Guilty-0.023
aukee-0.023
 illum-0.022
Token Bear
Token ID14732
Feature activation+1.16e+00

Pos logit contributions
alog+0.094
 chars+0.092
aukee+0.083
 Guilty+0.083
 illum+0.079
Neg logit contributions
                -0.134
hai-0.134
kef-0.133
ynski-0.121
erb-0.121
Token St
Token ID520
Feature activation-1.26e-01

Pos logit contributions
alog+0.688
 chars+0.678
aukee+0.606
 Guilty+0.603
 illum+0.579
Neg logit contributions
                -1.011
hai-1.006
kef-0.996
ynski-0.915
erb-0.911
Tokenear
Token ID451
Feature activation+1.76e+00

Pos logit contributions
                +0.083
hai+0.082
kef+0.082
ynski+0.073
erb+0.072
Neg logit contributions
alog-0.101
 chars-0.100
 Guilty-0.092
aukee-0.092
 illum-0.089
Tokenns
Token ID5907
Feature activation-5.97e-02

Pos logit contributions
alog+1.031
 chars+1.010
aukee+0.906
 Guilty+0.902
 illum+0.860
Neg logit contributions
                -1.536
kef-1.526
hai-1.524
ynski-1.390
erb-1.387
Token,
Token ID11
Feature activation+1.29e-01

Pos logit contributions
                +0.045
hai+0.045
kef+0.045
ynski+0.041
erb+0.040
Neg logit contributions
alog-0.042
 chars-0.041
 Guilty-0.037
aukee-0.037
 illum-0.036
Token a
Token ID257
Feature activation-5.41e-02

Pos logit contributions
alog+0.087
 chars+0.086
aukee+0.078
 Guilty+0.078
 illum+0.074
Neg logit contributions
                -0.101
hai-0.101
kef-0.100
ynski-0.091
erb-0.090
Token Goldman
Token ID18740
Feature activation-3.96e-02

Pos logit contributions
                +0.042
hai+0.042
kef+0.042
ynski+0.038
erb+0.037
Neg logit contributions
alog-0.037
 chars-0.036
 Guilty-0.033
aukee-0.033
 illum-0.032
Token competitor
Token ID20319
Feature activation-9.87e-02

Pos logit contributions
                +0.029
hai+0.029
kef+0.028
ynski+0.026
erb+0.025
Neg logit contributions
alog-0.029
 chars-0.029
 Guilty-0.026
aukee-0.026
 illum-0.025
TokenI
Token ID40
Feature activation+7.93e-01

Pos logit contributions
                +0.385
hai+0.384
kef+0.382
ynski+0.343
erb+0.340
Neg logit contributions
alog-0.389
 chars-0.381
 Guilty-0.350
aukee-0.349
 illum-0.333
Token
Token ID447
Feature activation+2.64e-01

Pos logit contributions
alog+0.408
 chars+0.393
aukee+0.350
 Guilty+0.349
 illum+0.326
Neg logit contributions
                -0.752
hai-0.750
kef-0.746
ynski-0.695
erb-0.685
Token
Token ID247
Feature activation+3.37e-01

Pos logit contributions
alog+0.165
 chars+0.162
aukee+0.146
 Guilty+0.146
 illum+0.139
Neg logit contributions
                -0.220
hai-0.219
kef-0.217
ynski-0.199
erb-0.197
Tokenve
Token ID303
Feature activation-5.99e-01

Pos logit contributions
alog+0.226
 chars+0.222
 Guilty+0.202
aukee+0.202
 illum+0.193
Neg logit contributions
                -0.267
hai-0.266
kef-0.264
ynski-0.240
erb-0.238
Token written
Token ID3194
Feature activation+5.64e-01

Pos logit contributions
                +0.407
hai+0.405
kef+0.404
ynski+0.360
erb+0.355
Neg logit contributions
alog-0.468
 chars-0.462
 Guilty-0.425
aukee-0.423
 illum-0.409
Token a
Token ID257
Feature activation+1.75e+00

Pos logit contributions
alog+0.409
 chars+0.401
 Guilty+0.370
aukee+0.368
 illum+0.354
Neg logit contributions
                -0.414
hai-0.412
kef-0.412
ynski-0.368
erb-0.367
Token couple
Token ID3155
Feature activation+9.67e-01

Pos logit contributions
alog+1.348
 chars+1.314
 Guilty+1.226
aukee+1.214
hari+1.181
Neg logit contributions
                -1.196
hai-1.193
kef-1.173
erb-1.044
ynski-1.040
Token of
Token ID286
Feature activation+1.20e+00

Pos logit contributions
alog+0.699
 chars+0.681
aukee+0.634
 Guilty+0.632
 illum+0.609
Neg logit contributions
                -0.714
hai-0.710
kef-0.700
ynski-0.632
erb-0.627
Token books
Token ID3835
Feature activation+5.67e-01

Pos logit contributions
alog+0.632
 chars+0.607
 Guilty+0.548
aukee+0.546
 illum+0.510
Neg logit contributions
hai-1.122
                -1.120
kef-1.108
ynski-1.020
erb-1.011
Token,
Token ID11
Feature activation-1.71e-01

Pos logit contributions
alog+0.396
 chars+0.387
 Guilty+0.356
aukee+0.354
 illum+0.339
Neg logit contributions
                -0.433
hai-0.431
kef-0.428
ynski-0.387
erb-0.383
Token which
Token ID543
Feature activation+7.39e-01

Pos logit contributions
                +0.139
hai+0.139
kef+0.138
ynski+0.126
erb+0.125
Neg logit contributions
alog-0.111
 chars-0.109
 Guilty-0.098
aukee-0.098
 illum-0.094
Token Send
Token ID16290
Feature activation+1.40e+00

Pos logit contributions
alog+0.031
 chars+0.030
aukee+0.027
 Guilty+0.027
 illum+0.026
Neg logit contributions
                -0.038
hai-0.038
kef-0.037
ynski-0.034
erb-0.034
Token email
Token ID3053
Feature activation+2.68e-01

Pos logit contributions
alog+1.001
 chars+0.969
 Guilty+0.897
aukee+0.891
 illum+0.867
Neg logit contributions
hai-1.052
                -1.051
kef-1.034
erb-0.937
ynski-0.935
Token to
Token ID284
Feature activation+1.08e+00

Pos logit contributions
alog+0.188
 chars+0.184
aukee+0.168
 Guilty+0.168
 illum+0.161
Neg logit contributions
                -0.204
hai-0.204
kef-0.202
ynski-0.182
erb-0.182
Token this
Token ID428
Feature activation+1.02e+00

Pos logit contributions
alog+0.790
 chars+0.773
aukee+0.719
 Guilty+0.714
 illum+0.687
Neg logit contributions
                -0.779
hai-0.777
kef-0.768
erb-0.694
ynski-0.691
Token address
Token ID2209
Feature activation+2.39e-01

Pos logit contributions
alog+0.710
 chars+0.691
 Guilty+0.638
aukee+0.636
 illum+0.605
Neg logit contributions
hai-0.786
                -0.785
kef-0.773
erb-0.705
ynski-0.701
Token Enter
Token ID6062
Feature activation+1.73e+00

Pos logit contributions
alog+0.187
 chars+0.183
aukee+0.170
 Guilty+0.169
 illum+0.163
Neg logit contributions
                -0.162
hai-0.161
kef-0.160
ynski-0.143
erb-0.142
Token Your
Token ID3406
Feature activation+8.75e-01

Pos logit contributions
alog+1.260
 chars+1.200
aukee+1.139
 Guilty+1.116
 illum+1.080
Neg logit contributions
                -1.281
hai-1.269
kef-1.246
ynski-1.142
erb-1.126
Token Name
Token ID6530
Feature activation+1.17e+00

Pos logit contributions
alog+0.636
 chars+0.622
aukee+0.578
 Guilty+0.571
 illum+0.550
Neg logit contributions
                -0.645
hai-0.638
kef-0.633
ynski-0.569
erb-0.567
Token Add
Token ID3060
Feature activation+5.08e-01

Pos logit contributions
alog+0.759
 chars+0.746
aukee+0.674
 Guilty+0.666
 illum+0.645
Neg logit contributions
                -0.948
hai-0.934
kef-0.931
ynski-0.853
erb-0.853
Token a
Token ID257
Feature activation+3.42e-01

Pos logit contributions
alog+0.369
 chars+0.361
aukee+0.331
 Guilty+0.331
 illum+0.319
Neg logit contributions
                -0.373
hai-0.371
kef-0.368
ynski-0.332
erb-0.331
Token comment
Token ID2912
Feature activation+1.11e+00

Pos logit contributions
alog+0.185
 chars+0.180
aukee+0.161
 Guilty+0.160
 illum+0.151
Neg logit contributions
                -0.315
hai-0.313
kef-0.312
ynski-0.287
erb-0.286
Token he
Token ID339
Feature activation+2.03e-01

Pos logit contributions
alog+0.388
 chars+0.379
aukee+0.348
 Guilty+0.348
 illum+0.333
Neg logit contributions
                -0.396
hai-0.395
kef-0.392
ynski-0.354
erb-0.352
Token was
Token ID373
Feature activation+1.44e-01

Pos logit contributions
alog+0.136
 chars+0.133
 Guilty+0.121
aukee+0.121
 illum+0.115
Neg logit contributions
                -0.161
hai-0.161
kef-0.160
ynski-0.145
erb-0.144
Token heard
Token ID2982
Feature activation-1.49e-01

Pos logit contributions
alog+0.096
 chars+0.094
 Guilty+0.085
aukee+0.085
 illum+0.081
Neg logit contributions
                -0.115
hai-0.114
kef-0.114
ynski-0.103
erb-0.103
Token over
Token ID625
Feature activation+5.69e-02

Pos logit contributions
                +0.114
hai+0.114
kef+0.113
ynski+0.102
erb+0.102
Neg logit contributions
alog-0.103
 chars-0.101
 Guilty-0.093
aukee-0.092
 illum-0.088
Token a
Token ID257
Feature activation+3.28e-01

Pos logit contributions
alog+0.041
 chars+0.040
 Guilty+0.037
aukee+0.037
 illum+0.036
Neg logit contributions
                -0.042
hai-0.042
kef-0.041
ynski-0.037
erb-0.037
Token hot
Token ID3024
Feature activation+1.73e+00

Pos logit contributions
alog+0.221
 chars+0.216
aukee+0.197
 Guilty+0.197
 illum+0.188
Neg logit contributions
                -0.258
hai-0.257
kef-0.256
ynski-0.232
erb-0.231
Token mic
Token ID12314
Feature activation-1.49e-01

Pos logit contributions
alog+1.104
 chars+1.074
aukee+0.984
 Guilty+0.983
 illum+0.930
Neg logit contributions
                -1.412
hai-1.411
kef-1.401
ynski-1.274
erb-1.265
Token saying
Token ID2282
Feature activation+3.38e-01

Pos logit contributions
                +0.119
hai+0.119
kef+0.118
ynski+0.107
erb+0.107
Neg logit contributions
alog-0.099
 chars-0.097
 Guilty-0.088
aukee-0.088
 illum-0.084
Token the
Token ID262
Feature activation+2.11e-01

Pos logit contributions
alog+0.235
 chars+0.230
 Guilty+0.210
aukee+0.210
 illum+0.201
Neg logit contributions
                -0.259
hai-0.258
kef-0.257
ynski-0.232
erb-0.231
Token Republican
Token ID3415
Feature activation+1.26e-01

Pos logit contributions
alog+0.130
 chars+0.127
 Guilty+0.115
aukee+0.115
 illum+0.109
Neg logit contributions
                -0.179
hai-0.178
kef-0.177
ynski-0.162
erb-0.161
Token Party
Token ID3615
Feature activation-2.27e-01

Pos logit contributions
alog+0.082
 chars+0.080
 Guilty+0.073
aukee+0.072
 illum+0.069
Neg logit contributions
                -0.103
hai-0.102
kef-0.102
ynski-0.093
erb-0.092
Token Park
Token ID3250
Feature activation-3.73e-02

Pos logit contributions
alog+0.390
 chars+0.381
aukee+0.345
 Guilty+0.345
 illum+0.329
Neg logit contributions
                -0.491
hai-0.490
kef-0.486
ynski-0.441
erb-0.440
Token under
Token ID739
Feature activation-3.00e-01

Pos logit contributions
                +0.029
hai+0.029
kef+0.029
ynski+0.026
erb+0.026
Neg logit contributions
alog-0.026
 chars-0.025
 Guilty-0.023
aukee-0.023
 illum-0.022
Token frost
Token ID21682
Feature activation+1.40e-01

Pos logit contributions
                +0.229
hai+0.228
kef+0.227
ynski+0.205
erb+0.204
Neg logit contributions
alog-0.209
 chars-0.206
 Guilty-0.188
aukee-0.187
 illum-0.180
Tokened
Token ID276
Feature activation-5.08e-01

Pos logit contributions
alog+0.095
 chars+0.093
 Guilty+0.085
aukee+0.084
 illum+0.081
Neg logit contributions
                -0.110
hai-0.110
kef-0.109
ynski-0.099
erb-0.098
Token sequ
Token ID4726
Feature activation+7.47e-01

Pos logit contributions
                +0.402
hai+0.402
kef+0.399
ynski+0.362
erb+0.360
Neg logit contributions
alog-0.339
 chars-0.333
 Guilty-0.303
aukee-0.302
 illum-0.289
Tokeno
Token ID78
Feature activation-2.24e+00

Pos logit contributions
alog+0.567
 chars+0.561
 Guilty+0.517
aukee+0.515
 illum+0.497
Neg logit contributions
hai-0.522
                -0.519
kef-0.514
ynski-0.462
erb-0.461
Tokenias
Token ID4448
Feature activation-1.30e-01

Pos logit contributions
                +1.610
kef+1.608
hai+1.595
ynski+1.458
erb+1.399
Neg logit contributions
alog-1.679
 chars-1.652
 Guilty-1.509
aukee-1.489
 illum-1.450
Token and
Token ID290
Feature activation-4.62e-02

Pos logit contributions
                +0.101
hai+0.101
kef+0.100
ynski+0.090
erb+0.090
Neg logit contributions
alog-0.090
 chars-0.088
 Guilty-0.080
aukee-0.080
 illum-0.077
Token frozen
Token ID12912
Feature activation-3.76e-02

Pos logit contributions
                +0.036
hai+0.036
kef+0.035
ynski+0.032
erb+0.032
Neg logit contributions
alog-0.032
 chars-0.031
 Guilty-0.028
aukee-0.028
 illum-0.027
Token water
Token ID1660
Feature activation+9.65e-02

Pos logit contributions
                +0.029
hai+0.029
kef+0.029
ynski+0.026
erb+0.026
Neg logit contributions
alog-0.026
 chars-0.025
 Guilty-0.023
aukee-0.023
 illum-0.022
Tokenfalls
Token ID23348
Feature activation-1.85e-02

Pos logit contributions
alog+0.070
 chars+0.069
 Guilty+0.063
aukee+0.063
 illum+0.060
Neg logit contributions
                -0.071
hai-0.071
kef-0.070
ynski-0.063
erb-0.063
Token to
Token ID284
Feature activation+1.78e-01

Pos logit contributions
                +0.035
hai+0.035
kef+0.035
ynski+0.032
erb+0.031
Neg logit contributions
alog-0.032
 chars-0.031
 Guilty-0.028
aukee-0.028
 illum-0.027
Token look
Token ID804
Feature activation-1.06e-01

Pos logit contributions
alog+0.121
 chars+0.118
aukee+0.108
 Guilty+0.108
 illum+0.103
Neg logit contributions
                -0.140
hai-0.139
kef-0.138
ynski-0.125
erb-0.124
Token to
Token ID284
Feature activation+3.62e-01

Pos logit contributions
                +0.082
hai+0.082
kef+0.081
ynski+0.074
erb+0.073
Neg logit contributions
alog-0.073
 chars-0.071
 Guilty-0.065
aukee-0.065
 illum-0.062
Token Ant
Token ID3738
Feature activation+4.64e-01

Pos logit contributions
alog+0.255
 chars+0.250
aukee+0.229
 Guilty+0.228
 illum+0.218
Neg logit contributions
                -0.274
hai-0.273
kef-0.271
ynski-0.245
erb-0.243
Tokenwan
Token ID8149
Feature activation-3.71e-01

Pos logit contributions
alog+0.336
 chars+0.330
aukee+0.302
 Guilty+0.302
 illum+0.290
Neg logit contributions
                -0.342
hai-0.341
kef-0.338
ynski-0.304
erb-0.303
Token Good
Token ID4599
Feature activation-2.18e+00

Pos logit contributions
                +0.255
hai+0.254
kef+0.252
ynski+0.226
erb+0.224
Neg logit contributions
alog-0.288
 chars-0.282
 Guilty-0.261
aukee-0.260
 illum-0.250
Tokenley
Token ID1636
Feature activation+9.10e-02

Pos logit contributions
                +1.778
kef+1.768
hai+1.750
ynski+1.572
erb+1.567
Neg logit contributions
alog-1.424
 chars-1.386
 Guilty-1.251
aukee-1.238
hari-1.201
Token in
Token ID287
Feature activation+1.06e-02

Pos logit contributions
alog+0.063
 chars+0.061
 Guilty+0.056
aukee+0.056
 illum+0.054
Neg logit contributions
                -0.070
hai-0.070
kef-0.070
ynski-0.063
erb-0.063
Token key
Token ID1994
Feature activation+8.62e-02

Pos logit contributions
alog+0.008
 chars+0.008
 Guilty+0.007
aukee+0.007
 illum+0.007
Neg logit contributions
                -0.008
hai-0.008
kef-0.008
ynski-0.007
erb-0.007
Token moments
Token ID7188
Feature activation-2.18e-02

Pos logit contributions
alog+0.056
 chars+0.055
 Guilty+0.050
aukee+0.050
 illum+0.047
Neg logit contributions
                -0.070
hai-0.070
kef-0.069
ynski-0.063
erb-0.063
Token,
Token ID11
Feature activation-5.97e-03

Pos logit contributions
                +0.017
hai+0.017
kef+0.017
ynski+0.015
erb+0.015
Neg logit contributions
alog-0.015
 chars-0.015
 Guilty-0.013
aukee-0.013
 illum-0.013
Token was
Token ID373
Feature activation-2.07e-01

Pos logit contributions
                +0.104
hai+0.104
kef+0.103
ynski+0.094
erb+0.093
Neg logit contributions
alog-0.089
 chars-0.087
 Guilty-0.079
aukee-0.079
 illum-0.076
Token spun
Token ID26843
Feature activation-9.94e-02

Pos logit contributions
                +0.159
hai+0.159
kef+0.158
ynski+0.143
erb+0.142
Neg logit contributions
alog-0.143
 chars-0.141
 Guilty-0.128
aukee-0.128
 illum-0.123
Token off
Token ID572
Feature activation+1.01e-01

Pos logit contributions
                +0.070
hai+0.070
kef+0.069
ynski+0.062
erb+0.062
Neg logit contributions
alog-0.075
 chars-0.074
 Guilty-0.068
aukee-0.068
 illum-0.065
Token from
Token ID422
Feature activation+8.75e-02

Pos logit contributions
alog+0.072
 chars+0.071
 Guilty+0.065
aukee+0.065
 illum+0.062
Neg logit contributions
                -0.076
hai-0.076
kef-0.075
ynski-0.068
erb-0.067
Token Cond
Token ID9724
Feature activation-1.37e-01

Pos logit contributions
alog+0.059
 chars+0.058
aukee+0.052
 Guilty+0.052
 illum+0.050
Neg logit contributions
                -0.069
hai-0.069
kef-0.068
ynski-0.062
erb-0.062
Tokene
Token ID68
Feature activation-2.18e+00

Pos logit contributions
                +0.094
hai+0.093
kef+0.093
ynski+0.083
erb+0.082
Neg logit contributions
alog-0.107
 chars-0.105
 Guilty-0.097
aukee-0.097
 illum-0.093
Token N
Token ID399
Feature activation-1.10e+00

Pos logit contributions
hai+1.835
kef+1.830
                +1.819
ynski+1.665
erb+1.638
Neg logit contributions
alog-1.361
 chars-1.342
 Guilty-1.202
aukee-1.174
 illum-1.130
Tokenast
Token ID459
Feature activation+5.84e-02

Pos logit contributions
                +0.733
hai+0.730
kef+0.720
ynski+0.645
erb+0.632
Neg logit contributions
alog-0.888
 chars-0.865
 Guilty-0.803
aukee-0.799
 illum-0.773
Token's
Token ID338
Feature activation-1.41e-01

Pos logit contributions
alog+0.039
 chars+0.038
 Guilty+0.035
aukee+0.035
 illum+0.033
Neg logit contributions
                -0.046
hai-0.046
kef-0.046
ynski-0.042
erb-0.041
Token Wired
Token ID39721
Feature activation-2.45e-01

Pos logit contributions
                +0.102
hai+0.102
kef+0.101
ynski+0.091
erb+0.091
Neg logit contributions
alog-0.104
 chars-0.102
 Guilty-0.093
aukee-0.093
 illum-0.089
Token Digital
Token ID10231
Feature activation-5.02e-01

Pos logit contributions
                +0.187
hai+0.187
kef+0.186
ynski+0.168
erb+0.167
Neg logit contributions
alog-0.170
 chars-0.167
 Guilty-0.152
aukee-0.152
 illum-0.146
Token'
Token ID6
Feature activation-1.08e+00

Pos logit contributions
                +0.730
hai+0.726
kef+0.724
ynski+0.657
erb+0.655
Neg logit contributions
alog-0.629
 chars-0.615
 Guilty-0.560
aukee-0.556
 illum-0.532
Token Former
Token ID14466
Feature activation-7.88e-01

Pos logit contributions
                +0.671
kef+0.667
hai+0.665
ynski+0.589
erb+0.584
Neg logit contributions
alog-0.905
 chars-0.891
 Guilty-0.836
aukee-0.827
 illum-0.799
Token Sanders
Token ID5831
Feature activation+1.15e-01

Pos logit contributions
                +0.618
hai+0.615
kef+0.612
ynski+0.556
erb+0.550
Neg logit contributions
alog-0.533
 chars-0.525
aukee-0.478
 Guilty-0.477
 illum-0.456
Token campaign
Token ID1923
Feature activation+1.44e+00

Pos logit contributions
alog+0.087
 chars+0.086
 Guilty+0.079
aukee+0.079
 illum+0.076
Neg logit contributions
                -0.081
hai-0.081
kef-0.080
ynski-0.072
erb-0.071
Token spokesman
Token ID6523
Feature activation-5.72e-01

Pos logit contributions
alog+1.055
 chars+1.030
 Guilty+0.956
aukee+0.942
 illum+0.917
Neg logit contributions
hai-1.040
                -1.027
kef-1.026
ynski-0.918
erb-0.910
Token:
Token ID25
Feature activation-2.17e+00

Pos logit contributions
                +0.466
hai+0.465
kef+0.463
erb+0.420
ynski+0.419
Neg logit contributions
alog-0.370
 chars-0.363
 Guilty-0.329
aukee-0.328
 illum-0.313
Token Clinton
Token ID2605
Feature activation+4.91e-01

Pos logit contributions
hai+1.710
kef+1.697
                +1.683
ynski+1.554
erb+1.517
Neg logit contributions
alog-1.454
 chars-1.422
 Guilty-1.350
aukee-1.303
 illum-1.208
Token staff
Token ID3085
Feature activation-8.01e-01

Pos logit contributions
alog+0.334
 chars+0.325
aukee+0.299
 Guilty+0.297
 illum+0.285
Neg logit contributions
hai-0.383
                -0.382
kef-0.379
ynski-0.345
erb-0.342
Token are
Token ID389
Feature activation-1.12e+00

Pos logit contributions
                +0.616
hai+0.612
kef+0.608
ynski+0.548
erb+0.546
Neg logit contributions
alog-0.553
 chars-0.547
 Guilty-0.501
aukee-0.497
 illum-0.474
Token '
Token ID705
Feature activation-6.22e-01

Pos logit contributions
                +0.846
hai+0.846
kef+0.835
erb+0.756
ynski+0.755
Neg logit contributions
alog-0.790
 chars-0.785
 Guilty-0.716
aukee-0.710
 illum-0.680
Tokenbig
Token ID14261
Feature activation-1.41e+00

Pos logit contributions
                +0.484
hai+0.481
kef+0.476
ynski+0.434
erb+0.429
Neg logit contributions
alog-0.424
 chars-0.417
 Guilty-0.381
aukee-0.377
 illum-0.362
Tokenman
Token ID805
Feature activation-8.02e-02

Pos logit contributions
                +0.850
hai+0.845
kef+0.845
ynski+0.782
erb+0.780
Neg logit contributions
alog-0.351
 chars-0.338
 Guilty-0.289
aukee-0.286
 illum-0.266
Token,
Token ID11
Feature activation-3.16e-01

Pos logit contributions
                +0.062
hai+0.062
kef+0.062
ynski+0.056
erb+0.056
Neg logit contributions
alog-0.055
 chars-0.054
 Guilty-0.049
aukee-0.049
 illum-0.047
Token and
Token ID290
Feature activation-2.42e-01

Pos logit contributions
                +0.245
hai+0.244
kef+0.243
ynski+0.220
erb+0.218
Neg logit contributions
alog-0.217
 chars-0.213
 Guilty-0.194
aukee-0.193
 illum-0.185
Token F
Token ID376
Feature activation-6.18e-01

Pos logit contributions
                +0.188
hai+0.188
kef+0.187
ynski+0.169
erb+0.168
Neg logit contributions
alog-0.165
 chars-0.162
 Guilty-0.148
aukee-0.147
 illum-0.141
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
                +0.498
hai+0.495
kef+0.493
ynski+0.449
erb+0.445
Neg logit contributions
alog-0.405
 chars-0.396
aukee-0.360
 Guilty-0.359
 illum-0.342
Token Scott
Token ID4746
Feature activation-2.05e+00

Pos logit contributions
                +0.663
kef+0.663
hai+0.662
ynski+0.587
erb+0.581
Neg logit contributions
alog-0.805
 chars-0.795
 Guilty-0.738
aukee-0.732
 illum-0.709
Token Fitzgerald
Token ID28397
Feature activation-1.50e-01

Pos logit contributions
                +1.105
hai+1.099
kef+1.094
ynski+0.962
erb+0.940
Neg logit contributions
alog-1.905
 chars-1.876
aukee-1.749
 Guilty-1.746
 illum-1.684
Token.
Token ID13
Feature activation+7.43e-03

Pos logit contributions
                +0.122
hai+0.122
kef+0.121
ynski+0.110
erb+0.109
Neg logit contributions
alog-0.097
 chars-0.095
 Guilty-0.086
aukee-0.086
 illum-0.082
Token
Token ID198
Feature activation+5.77e-02

Pos logit contributions
alog+0.005
 chars+0.005
 Guilty+0.004
aukee+0.004
 illum+0.004
Neg logit contributions
                -0.006
hai-0.006
kef-0.006
ynski-0.005
erb-0.005
Token
Token ID198
Feature activation-3.40e-02

Pos logit contributions
alog+0.039
 chars+0.038
aukee+0.035
 Guilty+0.035
 illum+0.033
Neg logit contributions
                -0.046
hai-0.045
kef-0.045
ynski-0.041
erb-0.041
TokenIf
Token ID1532
Feature activation-2.18e-01

Pos logit contributions
                +0.024
hai+0.024
kef+0.024
ynski+0.021
erb+0.021
Neg logit contributions
alog-0.026
 chars-0.025
 Guilty-0.023
aukee-0.023
 illum-0.022
Token fair
Token ID3148
Feature activation-8.27e-01

Pos logit contributions
                +0.096
hai+0.096
kef+0.095
ynski+0.088
erb+0.087
Neg logit contributions
alog-0.054
 chars-0.053
 Guilty-0.047
aukee-0.047
 illum-0.044
Token treatment
Token ID3513
Feature activation+4.35e-01

Pos logit contributions
                +0.655
hai+0.651
kef+0.650
erb+0.586
ynski+0.584
Neg logit contributions
alog-0.550
 chars-0.548
 Guilty-0.496
aukee-0.493
 illum-0.472
Token by
Token ID416
Feature activation+6.09e-01

Pos logit contributions
alog+0.306
 chars+0.300
aukee+0.274
 Guilty+0.274
 illum+0.262
Neg logit contributions
                -0.329
hai-0.329
kef-0.326
ynski-0.294
erb-0.292
Token DNC
Token ID17824
Feature activation+2.63e-01

Pos logit contributions
alog+0.399
 chars+0.389
aukee+0.355
 Guilty+0.354
 illum+0.338
Neg logit contributions
                -0.490
hai-0.490
kef-0.486
ynski-0.444
erb-0.437
Token in
Token ID287
Feature activation-6.89e-01

Pos logit contributions
alog+0.164
 chars+0.160
aukee+0.145
 Guilty+0.145
 illum+0.138
Neg logit contributions
hai-0.219
                -0.219
kef-0.217
ynski-0.198
erb-0.197
Token 2020
Token ID12131
Feature activation-1.92e+00

Pos logit contributions
hai+0.461
                +0.461
kef+0.456
ynski+0.407
erb+0.401
Neg logit contributions
alog-0.544
 chars-0.539
 Guilty-0.497
aukee-0.497
 illum-0.476
Token after
Token ID706
Feature activation+3.61e-02

Pos logit contributions
                +1.613
kef+1.598
hai+1.595
erb+1.463
ynski+1.443
Neg logit contributions
 chars-1.201
alog-1.189
 Guilty-1.070
aukee-1.049
 illum-1.006
Token '
Token ID705
Feature activation+3.04e-02

Pos logit contributions
alog+0.028
 chars+0.028
 Guilty+0.025
aukee+0.025
 illum+0.025
Neg logit contributions
                -0.025
hai-0.025
kef-0.024
ynski-0.022
erb-0.022
Tokennot
Token ID1662
Feature activation+1.70e+00

Pos logit contributions
alog+0.024
 chars+0.023
 Guilty+0.022
aukee+0.021
 illum+0.021
Neg logit contributions
                -0.021
hai-0.021
kef-0.020
ynski-0.018
erb-0.018
Token quite
Token ID2407
Feature activation+4.82e-01

Pos logit contributions
alog+1.287
 chars+1.265
aukee+1.162
 Guilty+1.153
 illum+1.123
Neg logit contributions
                -1.205
hai-1.186
kef-1.174
erb-1.058
ynski-1.045
Token even
Token ID772
Feature activation-4.09e-01

Pos logit contributions
alog+0.354
 chars+0.347
aukee+0.319
 Guilty+0.319
 illum+0.305
Neg logit contributions
                -0.349
hai-0.349
kef-0.346
ynski-0.312
erb-0.309
Token'
Token ID6
Feature activation-8.50e-01

Pos logit contributions
                +0.553
hai+0.550
kef+0.548
ynski+0.497
erb+0.495
Neg logit contributions
alog-0.482
 chars-0.472
 Guilty-0.430
aukee-0.428
 illum-0.409
Token Former
Token ID14466
Feature activation-8.56e-01

Pos logit contributions
                +0.531
kef+0.527
hai+0.527
ynski+0.466
erb+0.462
Neg logit contributions
alog-0.710
 chars-0.699
 Guilty-0.653
aukee-0.648
 illum-0.626
Token Sanders
Token ID5831
Feature activation+7.49e-02

Pos logit contributions
                +0.676
hai+0.672
kef+0.669
ynski+0.608
erb+0.601
Neg logit contributions
alog-0.574
 chars-0.566
aukee-0.515
 Guilty-0.513
 illum-0.492
Token campaign
Token ID1923
Feature activation+1.50e+00

Pos logit contributions
alog+0.056
 chars+0.055
 Guilty+0.051
aukee+0.051
 illum+0.049
Neg logit contributions
                -0.053
hai-0.053
kef-0.052
ynski-0.047
erb-0.047
Token spokesman
Token ID6523
Feature activation-4.71e-01

Pos logit contributions
alog+1.100
 chars+1.076
 Guilty+0.998
aukee+0.981
 illum+0.956
Neg logit contributions
hai-1.091
kef-1.080
                -1.078
ynski-0.964
erb-0.956
Token:
Token ID25
Feature activation-1.92e+00

Pos logit contributions
                +0.384
hai+0.383
kef+0.381
erb+0.346
ynski+0.346
Neg logit contributions
alog-0.303
 chars-0.298
 Guilty-0.270
aukee-0.269
 illum-0.256
Token Clinton
Token ID2605
Feature activation+5.28e-01

Pos logit contributions
hai+1.504
kef+1.494
                +1.483
ynski+1.368
erb+1.333
Neg logit contributions
alog-1.299
 chars-1.271
 Guilty-1.200
aukee-1.161
 illum-1.084
Token staff
Token ID3085
Feature activation-5.92e-01

Pos logit contributions
alog+0.360
 chars+0.350
aukee+0.321
 Guilty+0.319
 illum+0.306
Neg logit contributions
hai-0.412
                -0.411
kef-0.407
ynski-0.371
erb-0.368
Token are
Token ID389
Feature activation-8.16e-01

Pos logit contributions
                +0.457
hai+0.454
kef+0.451
ynski+0.407
erb+0.406
Neg logit contributions
alog-0.408
 chars-0.402
 Guilty-0.368
aukee-0.366
 illum-0.350
Token '
Token ID705
Feature activation-5.39e-01

Pos logit contributions
                +0.609
hai+0.608
kef+0.602
ynski+0.543
erb+0.542
Neg logit contributions
alog-0.582
 chars-0.576
 Guilty-0.526
aukee-0.523
 illum-0.502
Tokenbig
Token ID14261
Feature activation-1.31e+00

Pos logit contributions
                +0.420
hai+0.417
kef+0.413
ynski+0.376
erb+0.372
Neg logit contributions
alog-0.369
 chars-0.363
 Guilty-0.331
aukee-0.328
 illum-0.315
Token Fore
Token ID4558
Feature activation+5.72e-01

Pos logit contributions
hai+0.583
                +0.582
kef+0.578
ynski+0.520
erb+0.514
Neg logit contributions
alog-0.578
 chars-0.568
 Guilty-0.523
aukee-0.517
 illum-0.502
Tokenword
Token ID4775
Feature activation-6.49e-01

Pos logit contributions
alog+0.319
 chars+0.313
aukee+0.280
 Guilty+0.278
 illum+0.262
Neg logit contributions
hai-0.517
                -0.515
kef-0.512
erb-0.470
ynski-0.469
Token by
Token ID416
Feature activation+1.99e-01

Pos logit contributions
                +0.478
hai+0.477
kef+0.474
ynski+0.427
erb+0.424
Neg logit contributions
alog-0.470
 chars-0.462
 Guilty-0.425
aukee-0.422
 illum-0.407
Token Patrick
Token ID9925
Feature activation-8.48e-03

Pos logit contributions
alog+0.149
 chars+0.146
 Guilty+0.135
aukee+0.135
 illum+0.129
Neg logit contributions
                -0.142
hai-0.141
kef-0.140
ynski-0.126
erb-0.125
Token J
Token ID449
Feature activation-6.69e-01

Pos logit contributions
                +0.006
hai+0.006
kef+0.006
ynski+0.006
erb+0.006
Neg logit contributions
alog-0.006
 chars-0.006
aukee-0.005
 Guilty-0.005
 illum-0.005
Token.
Token ID13
Feature activation-1.88e+00

Pos logit contributions
                +0.563
hai+0.560
kef+0.556
ynski+0.507
erb+0.502
Neg logit contributions
alog-0.418
 chars-0.404
 Guilty-0.368
aukee-0.368
 illum-0.348
Token Buchanan
Token ID36358
Feature activation-1.43e+00

Pos logit contributions
                +1.858
hai+1.839
kef+1.839
ynski+1.700
erb+1.682
Neg logit contributions
alog-0.897
 chars-0.869
 Guilty-0.784
aukee-0.755
 illum-0.721
Token,
Token ID11
Feature activation-8.31e-01

Pos logit contributions
                +1.154
hai+1.151
kef+1.145
ynski+1.043
erb+1.032
Neg logit contributions
alog-0.934
 chars-0.921
 Guilty-0.837
aukee-0.827
 illum-0.790
Token and
Token ID290
Feature activation-5.70e-01

Pos logit contributions
hai+0.652
                +0.651
kef+0.649
ynski+0.585
erb+0.581
Neg logit contributions
alog-0.561
 chars-0.552
 Guilty-0.503
aukee-0.499
 illum-0.482
Token critical
Token ID4688
Feature activation-1.46e+00

Pos logit contributions
hai+0.469
                +0.469
kef+0.464
ynski+0.424
erb+0.420
Neg logit contributions
alog-0.364
 chars-0.357
 Guilty-0.324
aukee-0.322
 illum-0.307
Token essays
Token ID27126
Feature activation-6.24e-01

Pos logit contributions
                +1.060
kef+1.053
hai+1.045
ynski+0.936
erb+0.928
Neg logit contributions
alog-1.077
 chars-1.064
 Guilty-0.967
aukee-0.959
 illum-0.945
Token'
Token ID6
Feature activation-8.15e-01

Pos logit contributions
                +0.600
hai+0.597
kef+0.595
ynski+0.539
erb+0.537
Neg logit contributions
alog-0.522
 chars-0.511
 Guilty-0.466
aukee-0.463
 illum-0.443
Token Former
Token ID14466
Feature activation-6.52e-01

Pos logit contributions
                +0.507
kef+0.504
hai+0.503
ynski+0.444
erb+0.440
Neg logit contributions
alog-0.684
 chars-0.673
 Guilty-0.629
aukee-0.624
 illum-0.603
Token Sanders
Token ID5831
Feature activation+2.63e-01

Pos logit contributions
                +0.507
hai+0.506
kef+0.502
ynski+0.456
erb+0.451
Neg logit contributions
alog-0.445
 chars-0.438
aukee-0.399
 Guilty-0.398
 illum-0.381
Token campaign
Token ID1923
Feature activation+1.83e+00

Pos logit contributions
alog+0.199
 chars+0.196
 Guilty+0.180
aukee+0.180
 illum+0.173
Neg logit contributions
                -0.185
hai-0.185
kef-0.183
ynski-0.164
erb-0.163
Token spokesman
Token ID6523
Feature activation-4.85e-01

Pos logit contributions
alog+1.371
 chars+1.341
 Guilty+1.248
aukee+1.227
 illum+1.197
Neg logit contributions
hai-1.296
                -1.280
kef-1.278
ynski-1.139
erb-1.128
Token:
Token ID25
Feature activation-1.84e+00

Pos logit contributions
                +0.392
hai+0.392
kef+0.389
erb+0.353
ynski+0.353
Neg logit contributions
alog-0.316
 chars-0.310
 Guilty-0.281
aukee-0.280
 illum-0.267
Token Clinton
Token ID2605
Feature activation+5.09e-01

Pos logit contributions
hai+1.439
kef+1.426
                +1.416
ynski+1.305
erb+1.275
Neg logit contributions
alog-1.247
 chars-1.221
 Guilty-1.153
aukee-1.118
 illum-1.041
Token staff
Token ID3085
Feature activation-6.34e-01

Pos logit contributions
alog+0.342
 chars+0.332
aukee+0.305
 Guilty+0.302
 illum+0.290
Neg logit contributions
hai-0.402
                -0.401
kef-0.398
ynski-0.362
erb-0.359
Token are
Token ID389
Feature activation-1.00e+00

Pos logit contributions
                +0.490
hai+0.487
kef+0.484
ynski+0.437
erb+0.435
Neg logit contributions
alog-0.434
 chars-0.429
 Guilty-0.392
aukee-0.390
 illum-0.372
Token '
Token ID705
Feature activation-3.09e-01

Pos logit contributions
                +0.743
hai+0.742
kef+0.734
erb+0.662
ynski+0.662
Neg logit contributions
alog-0.717
 chars-0.711
 Guilty-0.650
aukee-0.644
 illum-0.618
Tokenbig
Token ID14261
Feature activation-1.07e+00

Pos logit contributions
                +0.238
hai+0.237
kef+0.235
ynski+0.213
erb+0.211
Neg logit contributions
alog-0.214
 chars-0.210
 Guilty-0.192
aukee-0.191
 illum-0.183
Token fair
Token ID3148
Feature activation-9.25e-01

Pos logit contributions
                +0.364
hai+0.362
kef+0.361
ynski+0.332
erb+0.330
Neg logit contributions
alog-0.214
 chars-0.210
 Guilty-0.187
aukee-0.186
 illum-0.175
Token treatment
Token ID3513
Feature activation+5.83e-01

Pos logit contributions
                +0.718
kef+0.710
hai+0.710
erb+0.639
ynski+0.635
Neg logit contributions
alog-0.633
 chars-0.633
 Guilty-0.572
aukee-0.569
 illum-0.545
Token by
Token ID416
Feature activation+6.51e-01

Pos logit contributions
alog+0.416
 chars+0.407
aukee+0.373
 Guilty+0.372
 illum+0.356
Neg logit contributions
                -0.436
hai-0.435
kef-0.432
ynski-0.389
erb-0.385
Token DNC
Token ID17824
Feature activation+1.52e-01

Pos logit contributions
alog+0.425
 chars+0.414
aukee+0.378
 Guilty+0.376
 illum+0.359
Neg logit contributions
                -0.526
hai-0.525
kef-0.521
ynski-0.477
erb-0.468
Token in
Token ID287
Feature activation-4.56e-01

Pos logit contributions
alog+0.095
 chars+0.092
aukee+0.083
 Guilty+0.083
 illum+0.079
Neg logit contributions
                -0.127
hai-0.127
kef-0.126
ynski-0.115
erb-0.114
Token 2020
Token ID12131
Feature activation-1.83e+00

Pos logit contributions
                +0.300
hai+0.300
kef+0.297
ynski+0.265
erb+0.261
Neg logit contributions
alog-0.365
 chars-0.361
 Guilty-0.334
aukee-0.333
 illum-0.320
Token after
Token ID706
Feature activation+2.13e-01

Pos logit contributions
                +1.558
kef+1.544
hai+1.539
erb+1.416
ynski+1.396
Neg logit contributions
 chars-1.130
alog-1.116
 Guilty-0.997
aukee-0.988
 illum-0.946
Token '
Token ID705
Feature activation+2.21e-02

Pos logit contributions
alog+0.164
 chars+0.161
 Guilty+0.148
aukee+0.148
 illum+0.143
Neg logit contributions
                -0.147
hai-0.147
kef-0.145
ynski-0.130
erb-0.129
Tokennot
Token ID1662
Feature activation+1.77e+00

Pos logit contributions
alog+0.017
 chars+0.017
 Guilty+0.016
aukee+0.016
 illum+0.015
Neg logit contributions
                -0.015
hai-0.015
kef-0.015
ynski-0.013
erb-0.013
Token quite
Token ID2407
Feature activation+4.49e-01

Pos logit contributions
alog+1.351
 chars+1.329
aukee+1.226
 Guilty+1.215
 illum+1.178
Neg logit contributions
                -1.238
hai-1.223
kef-1.209
erb-1.089
ynski-1.071
Token even
Token ID772
Feature activation-3.82e-01

Pos logit contributions
alog+0.332
 chars+0.326
aukee+0.300
 Guilty+0.300
 illum+0.287
Neg logit contributions
                -0.323
hai-0.323
kef-0.319
ynski-0.288
erb-0.286
Token fair
Token ID3148
Feature activation-7.79e-01

Pos logit contributions
                +0.021
hai+0.021
kef+0.021
ynski+0.019
erb+0.019
Neg logit contributions
alog-0.015
 chars-0.014
 Guilty-0.013
aukee-0.013
 illum-0.012
Token treatment
Token ID3513
Feature activation+6.05e-01

Pos logit contributions
                +0.599
hai+0.595
kef+0.594
erb+0.534
ynski+0.532
Neg logit contributions
alog-0.537
 chars-0.535
 Guilty-0.485
aukee-0.483
 illum-0.462
Token by
Token ID416
Feature activation+9.86e-01

Pos logit contributions
alog+0.431
 chars+0.421
aukee+0.386
 Guilty+0.385
 illum+0.368
Neg logit contributions
                -0.453
hai-0.452
kef-0.449
ynski-0.404
erb-0.401
Token DNC
Token ID17824
Feature activation+5.12e-01

Pos logit contributions
alog+0.676
 chars+0.658
aukee+0.605
 Guilty+0.601
 illum+0.576
Neg logit contributions
                -0.765
hai-0.763
kef-0.757
ynski-0.690
erb-0.676
Token in
Token ID287
Feature activation-5.45e-01

Pos logit contributions
alog+0.325
 chars+0.317
aukee+0.288
 Guilty+0.286
 illum+0.274
Neg logit contributions
hai-0.422
                -0.422
kef-0.418
ynski-0.381
erb-0.380
Token 2020
Token ID12131
Feature activation-1.82e+00

Pos logit contributions
                +0.350
hai+0.350
kef+0.346
ynski+0.307
erb+0.303
Neg logit contributions
alog-0.444
 chars-0.440
 Guilty-0.407
aukee-0.407
 illum-0.391
Token after
Token ID706
Feature activation+1.19e-01

Pos logit contributions
                +1.548
hai+1.536
kef+1.534
erb+1.408
ynski+1.388
Neg logit contributions
 chars-1.119
alog-1.112
 Guilty-0.995
aukee-0.978
 illum-0.938
Token '
Token ID705
Feature activation-9.11e-02

Pos logit contributions
alog+0.091
 chars+0.089
 Guilty+0.082
aukee+0.082
 illum+0.079
Neg logit contributions
                -0.082
hai-0.082
kef-0.081
ynski-0.073
erb-0.072
Tokennot
Token ID1662
Feature activation+1.67e+00

Pos logit contributions
                +0.062
hai+0.062
kef+0.061
ynski+0.055
erb+0.054
Neg logit contributions
alog-0.071
 chars-0.070
 Guilty-0.064
aukee-0.064
 illum-0.062
Token quite
Token ID2407
Feature activation+6.71e-01

Pos logit contributions
alog+1.275
 chars+1.253
aukee+1.154
 Guilty+1.146
 illum+1.111
Neg logit contributions
                -1.173
hai-1.158
kef-1.148
erb-1.031
ynski-1.015
Token even
Token ID772
Feature activation-5.41e-01

Pos logit contributions
alog+0.492
 chars+0.482
aukee+0.445
 Guilty+0.444
 illum+0.424
Neg logit contributions
                -0.487
hai-0.486
kef-0.481
ynski-0.435
erb-0.431
Token of
Token ID286
Feature activation-7.49e-01

Pos logit contributions
hai+0.458
                +0.456
kef+0.453
ynski+0.410
erb+0.407
Neg logit contributions
alog-0.409
 chars-0.399
 Guilty-0.365
aukee-0.362
 illum-0.349
Token Murray
Token ID12164
Feature activation-7.82e-01

Pos logit contributions
hai+0.572
                +0.569
kef+0.566
ynski+0.512
erb+0.509
Neg logit contributions
alog-0.521
 chars-0.511
 Guilty-0.470
aukee-0.468
 illum-0.448
Token N
Token ID399
Feature activation-2.72e-01

Pos logit contributions
                +0.632
hai+0.630
kef+0.628
ynski+0.572
erb+0.566
Neg logit contributions
alog-0.510
 chars-0.501
 Guilty-0.457
aukee-0.452
 illum-0.432
Token.
Token ID13
Feature activation+4.48e-01

Pos logit contributions
                +0.226
hai+0.225
kef+0.224
ynski+0.204
erb+0.203
Neg logit contributions
alog-0.172
 chars-0.168
 Guilty-0.152
aukee-0.152
 illum-0.144
Token Roth
Token ID16131
Feature activation-1.31e+00

Pos logit contributions
alog+0.353
 chars+0.346
aukee+0.319
 Guilty+0.318
 illum+0.308
Neg logit contributions
                -0.304
hai-0.303
kef-0.300
ynski-0.267
erb-0.265
Tokenbard
Token ID23024
Feature activation-1.81e+00

Pos logit contributions
                +0.738
hai+0.736
kef+0.728
ynski+0.635
erb+0.629
Neg logit contributions
alog-1.175
 chars-1.152
 Guilty-1.070
aukee-1.059
 illum-1.033
Token (
Token ID357
Feature activation-7.36e-01

Pos logit contributions
hai+1.540
kef+1.514
                +1.510
ynski+1.392
erb+1.384
Neg logit contributions
alog-1.112
 chars-1.083
 Guilty-0.988
aukee-0.956
 illum-0.921
TokenProm
Token ID24129
Feature activation-7.52e-01

Pos logit contributions
                +0.678
hai+0.677
kef+0.673
ynski+0.621
erb+0.614
Neg logit contributions
alog-0.397
 chars-0.389
 Guilty-0.345
aukee-0.342
 illum-0.323
Tokenetheus
Token ID36916
Feature activation-6.58e-01

Pos logit contributions
                +0.533
hai+0.530
kef+0.530
ynski+0.469
erb+0.467
Neg logit contributions
alog-0.568
 chars-0.557
 Guilty-0.512
aukee-0.512
 illum-0.491
Token Books
Token ID13661
Feature activation-5.73e-01

Pos logit contributions
hai+0.503
                +0.502
kef+0.499
ynski+0.453
erb+0.447
Neg logit contributions
alog-0.459
 chars-0.451
 Guilty-0.412
aukee-0.409
 illum-0.391
Token,
Token ID11
Feature activation+2.33e-02

Pos logit contributions
hai+0.451
                +0.451
kef+0.449
ynski+0.405
erb+0.404
Neg logit contributions
alog-0.384
 chars-0.377
 Guilty-0.344
aukee-0.341
 illum-0.327
Token than
Token ID621
Feature activation+1.38e-01

Pos logit contributions
                +0.145
hai+0.145
kef+0.144
ynski+0.129
erb+0.129
Neg logit contributions
alog-0.139
 chars-0.136
 Guilty-0.125
aukee-0.124
 illum-0.119
Token the
Token ID262
Feature activation-1.90e-02

Pos logit contributions
alog+0.098
 chars+0.096
 Guilty+0.088
aukee+0.088
 illum+0.084
Neg logit contributions
                -0.103
hai-0.103
kef-0.102
ynski-0.092
erb-0.091
Token previously
Token ID4271
Feature activation+1.15e-01

Pos logit contributions
                +0.015
hai+0.015
kef+0.015
ynski+0.013
erb+0.013
Neg logit contributions
alog-0.013
 chars-0.013
 Guilty-0.012
aukee-0.012
 illum-0.011
Token planned
Token ID6027
Feature activation-2.37e-01

Pos logit contributions
alog+0.076
 chars+0.075
 Guilty+0.068
aukee+0.068
 illum+0.065
Neg logit contributions
                -0.091
hai-0.091
kef-0.090
ynski-0.082
erb-0.081
Token Russian
Token ID3394
Feature activation+1.37e-01

Pos logit contributions
                +0.188
hai+0.188
kef+0.187
ynski+0.170
erb+0.169
Neg logit contributions
alog-0.157
 chars-0.154
 Guilty-0.140
aukee-0.140
 illum-0.134
Token Black
Token ID2619
Feature activation-1.80e+00

Pos logit contributions
alog+0.093
 chars+0.091
 Guilty+0.083
aukee+0.083
 illum+0.079
Neg logit contributions
                -0.108
hai-0.107
kef-0.107
ynski-0.097
erb-0.096
Token Sea
Token ID6896
Feature activation+8.11e-03

Pos logit contributions
kef+1.249
                +1.247
hai+1.246
ynski+1.120
erb+1.102
Neg logit contributions
alog-1.383
 chars-1.334
 Guilty-1.247
aukee-1.237
 illum-1.186
Token resort
Token ID12600
Feature activation-3.71e-01

Pos logit contributions
alog+0.006
 chars+0.006
 Guilty+0.005
aukee+0.005
 illum+0.005
Neg logit contributions
                -0.006
hai-0.006
kef-0.006
ynski-0.006
erb-0.006
Token of
Token ID286
Feature activation-1.11e-01

Pos logit contributions
                +0.284
hai+0.283
kef+0.281
ynski+0.254
erb+0.252
Neg logit contributions
alog-0.258
 chars-0.253
 Guilty-0.231
aukee-0.231
 illum-0.221
Token Sochi
Token ID37126
Feature activation-4.23e-01

Pos logit contributions
                +0.073
hai+0.073
kef+0.072
ynski+0.064
erb+0.064
Neg logit contributions
alog-0.089
 chars-0.087
 Guilty-0.081
aukee-0.080
 illum-0.078
Token,
Token ID11
Feature activation-1.42e-01

Pos logit contributions
                +0.331
hai+0.330
kef+0.328
ynski+0.297
erb+0.296
Neg logit contributions
alog-0.287
 chars-0.282
 Guilty-0.257
aukee-0.256
 illum-0.246
TokenProm
Token ID24129
Feature activation-7.52e-01

Pos logit contributions
                +0.678
hai+0.677
kef+0.673
ynski+0.621
erb+0.614
Neg logit contributions
alog-0.397
 chars-0.389
 Guilty-0.345
aukee-0.342
 illum-0.323
Tokenetheus
Token ID36916
Feature activation-6.58e-01

Pos logit contributions
                +0.533
hai+0.530
kef+0.530
ynski+0.469
erb+0.467
Neg logit contributions
alog-0.568
 chars-0.557
 Guilty-0.512
aukee-0.512
 illum-0.491
Token Books
Token ID13661
Feature activation-5.73e-01

Pos logit contributions
hai+0.503
                +0.502
kef+0.499
ynski+0.453
erb+0.447
Neg logit contributions
alog-0.459
 chars-0.451
 Guilty-0.412
aukee-0.409
 illum-0.391
Token,
Token ID11
Feature activation+2.33e-02

Pos logit contributions
hai+0.451
                +0.451
kef+0.449
ynski+0.405
erb+0.404
Neg logit contributions
alog-0.384
 chars-0.377
 Guilty-0.344
aukee-0.341
 illum-0.327
Token 2000
Token ID4751
Feature activation-7.41e-02

Pos logit contributions
alog+0.013
 chars+0.013
aukee+0.012
 Guilty+0.012
 illum+0.011
Neg logit contributions
                -0.021
hai-0.020
kef-0.020
ynski-0.019
erb-0.019
Token),
Token ID828
Feature activation-1.78e+00

Pos logit contributions
                +0.066
hai+0.065
kef+0.065
ynski+0.060
erb+0.059
Neg logit contributions
alog-0.043
 chars-0.042
 Guilty-0.037
aukee-0.037
 illum-0.035
Token my
Token ID616
Feature activation-4.96e-01

Pos logit contributions
hai+1.330
kef+1.319
                +1.318
ynski+1.179
erb+1.166
Neg logit contributions
alog-1.288
 chars-1.277
 Guilty-1.162
aukee-1.137
 illum-1.126
Token biography
Token ID26444
Feature activation-6.06e-01

Pos logit contributions
                +0.338
hai+0.336
kef+0.335
ynski+0.299
erb+0.295
Neg logit contributions
alog-0.388
 chars-0.382
 Guilty-0.351
aukee-0.350
 illum-0.340
Token of
Token ID286
Feature activation-3.05e-01

Pos logit contributions
hai+0.467
                +0.467
kef+0.463
ynski+0.419
erb+0.418
Neg logit contributions
alog-0.417
 chars-0.410
 Guilty-0.373
aukee-0.373
 illum-0.358
Token the
Token ID262
Feature activation-7.97e-01

Pos logit contributions
hai+0.223
                +0.223
kef+0.221
ynski+0.199
erb+0.197
Neg logit contributions
alog-0.223
 chars-0.219
 Guilty-0.201
aukee-0.200
 illum-0.193
Token great
Token ID1049
Feature activation-8.98e-01

Pos logit contributions
hai+0.578
                +0.573
kef+0.569
ynski+0.517
erb+0.511
Neg logit contributions
alog-0.587
 chars-0.573
 Guilty-0.529
aukee-0.525
 illum-0.505
Token in
Token ID287
Feature activation-8.95e-02

Pos logit contributions
alog+0.043
 chars+0.043
 Guilty+0.039
aukee+0.039
 illum+0.037
Neg logit contributions
                -0.048
hai-0.048
kef-0.047
ynski-0.043
erb-0.043
Token the
Token ID262
Feature activation+1.48e-01

Pos logit contributions
                +0.067
hai+0.067
kef+0.066
ynski+0.060
erb+0.059
Neg logit contributions
alog-0.064
 chars-0.063
 Guilty-0.058
aukee-0.057
 illum-0.055
Token use
Token ID779
Feature activation+9.94e-02

Pos logit contributions
alog+0.101
 chars+0.099
aukee+0.091
 Guilty+0.091
 illum+0.087
Neg logit contributions
                -0.115
hai-0.115
kef-0.114
ynski-0.103
erb-0.103
Token of
Token ID286
Feature activation-3.19e-02

Pos logit contributions
alog+0.070
 chars+0.069
 Guilty+0.063
aukee+0.063
 illum+0.060
Neg logit contributions
                -0.075
hai-0.075
kef-0.075
ynski-0.067
erb-0.067
Token cart
Token ID6383
Feature activation-3.00e-01

Pos logit contributions
                +0.025
hai+0.025
kef+0.024
ynski+0.022
erb+0.022
Neg logit contributions
alog-0.022
 chars-0.021
 Guilty-0.020
aukee-0.020
 illum-0.019
Tokeno
Token ID78
Feature activation-1.77e+00

Pos logit contributions
                +0.230
hai+0.229
kef+0.228
ynski+0.206
erb+0.204
Neg logit contributions
alog-0.209
 chars-0.205
 Guilty-0.187
aukee-0.187
 illum-0.179
Tokenony
Token ID1647
Feature activation-1.11e-02

Pos logit contributions
hai+1.174
                +1.169
kef+1.166
ynski+1.039
erb+1.020
Neg logit contributions
alog-1.424
 chars-1.402
 Guilty-1.287
aukee-1.280
 illum-1.243
Token expressions
Token ID14700
Feature activation+9.87e-02

Pos logit contributions
                +0.009
hai+0.009
kef+0.009
ynski+0.008
erb+0.008
Neg logit contributions
alog-0.008
 chars-0.007
 Guilty-0.007
aukee-0.007
 illum-0.006
Token and
Token ID290
Feature activation-1.43e-01

Pos logit contributions
alog+0.067
 chars+0.066
 Guilty+0.060
aukee+0.060
 illum+0.057
Neg logit contributions
                -0.077
hai-0.077
kef-0.076
ynski-0.069
erb-0.069
Token slap
Token ID23905
Feature activation-1.86e-02

Pos logit contributions
                +0.110
hai+0.110
kef+0.109
ynski+0.099
erb+0.098
Neg logit contributions
alog-0.098
 chars-0.096
 Guilty-0.087
aukee-0.087
 illum-0.084
Tokenstick
Token ID13915
Feature activation-7.95e-02

Pos logit contributions
                +0.016
hai+0.015
kef+0.015
ynski+0.014
erb+0.014
Neg logit contributions
alog-0.012
 chars-0.011
 Guilty-0.010
aukee-0.010
 illum-0.010
Token The
Token ID383
Feature activation-1.05e-01

Pos logit contributions
alog+0.016
 chars+0.016
 Guilty+0.014
aukee+0.014
 illum+0.014
Neg logit contributions
                -0.018
hai-0.018
kef-0.018
ynski-0.017
erb-0.017
Token lesson
Token ID11483
Feature activation+1.28e-01

Pos logit contributions
                +0.082
hai+0.082
kef+0.081
ynski+0.074
erb+0.073
Neg logit contributions
alog-0.072
 chars-0.071
 Guilty-0.064
aukee-0.064
 illum-0.061
Token of
Token ID286
Feature activation-2.31e-01

Pos logit contributions
alog+0.087
 chars+0.086
 Guilty+0.078
aukee+0.078
 illum+0.075
Neg logit contributions
                -0.100
hai-0.099
kef-0.099
ynski-0.089
erb-0.089
Token the
Token ID262
Feature activation-2.78e-01

Pos logit contributions
                +0.177
hai+0.176
kef+0.175
ynski+0.158
erb+0.157
Neg logit contributions
alog-0.161
 chars-0.158
 Guilty-0.145
aukee-0.144
 illum-0.138
Token Gang
Token ID19228
Feature activation-5.15e-01

Pos logit contributions
                +0.223
hai+0.223
kef+0.221
ynski+0.201
erb+0.200
Neg logit contributions
alog-0.183
 chars-0.179
 Guilty-0.163
aukee-0.163
 illum-0.155
Token of
Token ID286
Feature activation-1.72e+00

Pos logit contributions
                +0.418
hai+0.417
kef+0.414
ynski+0.376
erb+0.374
Neg logit contributions
alog-0.335
 chars-0.327
 Guilty-0.297
aukee-0.295
 illum-0.282
Token Four
Token ID6675
Feature activation-7.88e-02

Pos logit contributions
                +1.282
hai+1.273
kef+1.270
ynski+1.149
erb+1.133
Neg logit contributions
alog-1.244
 chars-1.215
 Guilty-1.120
aukee-1.098
 illum-1.058
Token is
Token ID318
Feature activation-1.51e-01

Pos logit contributions
                +0.062
hai+0.062
kef+0.062
ynski+0.056
erb+0.056
Neg logit contributions
alog-0.053
 chars-0.052
 Guilty-0.047
aukee-0.047
 illum-0.045
Token you
Token ID345
Feature activation-3.16e-01

Pos logit contributions
                +0.113
hai+0.113
kef+0.112
ynski+0.101
erb+0.101
Neg logit contributions
alog-0.107
 chars-0.105
 Guilty-0.096
aukee-0.096
 illum-0.092
Token don
Token ID836
Feature activation+6.17e-02

Pos logit contributions
                +0.258
hai+0.258
kef+0.256
ynski+0.233
erb+0.232
Neg logit contributions
alog-0.203
 chars-0.199
 Guilty-0.180
aukee-0.179
 illum-0.172
Token't
Token ID470
Feature activation-9.38e-02

Pos logit contributions
alog+0.048
 chars+0.047
 Guilty+0.044
aukee+0.044
 illum+0.042
Neg logit contributions
                -0.042
hai-0.042
kef-0.042
ynski-0.037
erb-0.037
Token one
Token ID530
Feature activation+9.79e-02

Pos logit contributions
                +0.281
hai+0.281
kef+0.278
ynski+0.252
erb+0.250
Neg logit contributions
alog-0.250
 chars-0.245
 Guilty-0.224
aukee-0.223
 illum-0.214
Token source
Token ID2723
Feature activation-3.94e-01

Pos logit contributions
alog+0.063
 chars+0.061
 Guilty+0.055
aukee+0.055
 illum+0.053
Neg logit contributions
                -0.080
hai-0.080
kef-0.080
ynski-0.073
erb-0.072
Token,
Token ID11
Feature activation-1.05e-01

Pos logit contributions
                +0.305
hai+0.304
kef+0.302
ynski+0.273
erb+0.272
Neg logit contributions
alog-0.270
 chars-0.265
 Guilty-0.242
aukee-0.242
 illum-0.231
Token Penguins
Token ID29755
Feature activation+2.03e-01

Pos logit contributions
                +0.070
hai+0.070
kef+0.069
ynski+0.062
erb+0.061
Neg logit contributions
alog-0.083
 chars-0.082
 Guilty-0.076
aukee-0.076
 illum-0.073
Token GM
Token ID6951
Feature activation-5.86e-01

Pos logit contributions
alog+0.144
 chars+0.142
 Guilty+0.130
aukee+0.129
 illum+0.124
Neg logit contributions
                -0.152
hai-0.151
kef-0.150
ynski-0.135
erb-0.134
Token Jim
Token ID5395
Feature activation-1.71e+00

Pos logit contributions
                +0.433
hai+0.433
kef+0.431
ynski+0.387
erb+0.385
Neg logit contributions
alog-0.421
 chars-0.415
 Guilty-0.380
aukee-0.377
 illum-0.363
Token Rutherford
Token ID49767
Feature activation-2.05e-01

Pos logit contributions
kef+1.438
hai+1.436
                +1.436
erb+1.312
ynski+1.291
Neg logit contributions
alog-1.055
 chars-1.027
 Guilty-0.938
aukee-0.919
 illum-0.876
Token approached
Token ID10448
Feature activation-1.35e-01

Pos logit contributions
                +0.163
hai+0.162
kef+0.161
ynski+0.146
erb+0.146
Neg logit contributions
alog-0.136
 chars-0.134
 Guilty-0.122
aukee-0.121
 illum-0.116
Token Fle
Token ID12005
Feature activation+2.16e-01

Pos logit contributions
                +0.088
hai+0.088
kef+0.087
ynski+0.077
erb+0.076
Neg logit contributions
alog-0.109
 chars-0.107
 Guilty-0.099
aukee-0.099
 illum-0.096
Tokenury
Token ID1601
Feature activation-1.87e-01

Pos logit contributions
alog+0.153
 chars+0.150
 Guilty+0.137
aukee+0.137
 illum+0.131
Neg logit contributions
                -0.163
hai-0.163
kef-0.161
ynski-0.146
erb-0.145
Token leading
Token ID3756
Feature activation+1.05e-01

Pos logit contributions
                +0.145
hai+0.145
kef+0.144
ynski+0.130
erb+0.129
Neg logit contributions
alog-0.128
 chars-0.125
 Guilty-0.114
aukee-0.114
 illum-0.109
Tokenenty
Token ID3787
Feature activation-8.34e-01

Pos logit contributions
                +0.620
hai+0.615
kef+0.613
ynski+0.552
erb+0.551
Neg logit contributions
alog-0.513
 chars-0.500
 Guilty-0.455
aukee-0.455
 illum-0.430
Tokenev
Token ID1990
Feature activation-3.48e-01

Pos logit contributions
                +0.777
hai+0.775
kef+0.772
ynski+0.707
erb+0.704
Neg logit contributions
alog-0.446
 chars-0.431
 Guilty-0.381
aukee-0.379
 illum-0.358
Token/
Token ID14
Feature activation-9.07e-02

Pos logit contributions
                +0.279
hai+0.279
kef+0.277
ynski+0.251
erb+0.250
Neg logit contributions
alog-0.229
 chars-0.224
 Guilty-0.204
aukee-0.203
 illum-0.194
TokenR
Token ID49
Feature activation-9.64e-02

Pos logit contributions
                +0.063
hai+0.063
kef+0.063
ynski+0.056
erb+0.056
Neg logit contributions
alog-0.069
 chars-0.068
 Guilty-0.063
aukee-0.062
 illum-0.060
TokenIA
Token ID3539
Feature activation-7.75e-01

Pos logit contributions
                +0.068
hai+0.067
kef+0.067
ynski+0.060
erb+0.060
Neg logit contributions
alog-0.073
 chars-0.072
 Guilty-0.066
aukee-0.066
 illum-0.064
Token Nov
Token ID5267
Feature activation-1.71e+00

Pos logit contributions
hai+0.653
                +0.652
kef+0.648
ynski+0.591
erb+0.589
Neg logit contributions
alog-0.479
 chars-0.469
 Guilty-0.421
aukee-0.420
 illum-0.402
Tokenost
Token ID455
Feature activation-2.45e-01

Pos logit contributions
                +1.435
hai+1.427
kef+1.414
ynski+1.289
erb+1.280
Neg logit contributions
alog-1.081
 chars-1.029
 Guilty-0.922
aukee-0.915
 illum-0.873
Tokeni
Token ID72
Feature activation-3.31e-01

Pos logit contributions
                +0.199
hai+0.198
kef+0.197
ynski+0.179
erb+0.178
Neg logit contributions
alog-0.159
 chars-0.156
 Guilty-0.141
aukee-0.141
 illum-0.135
Token/
Token ID14
Feature activation+1.02e-01

Pos logit contributions
                +0.266
hai+0.266
kef+0.264
ynski+0.240
erb+0.238
Neg logit contributions
alog-0.217
 chars-0.212
 Guilty-0.193
aukee-0.192
 illum-0.184
TokenK
Token ID42
Feature activation+7.72e-02

Pos logit contributions
alog+0.070
 chars+0.069
 Guilty+0.063
aukee+0.063
 illum+0.060
Neg logit contributions
                -0.078
hai-0.078
kef-0.078
ynski-0.070
erb-0.070
Tokenremlin
Token ID17244
Feature activation-7.61e-01

Pos logit contributions
alog+0.057
 chars+0.057
 Guilty+0.052
aukee+0.052
 illum+0.050
Neg logit contributions
                -0.055
hai-0.055
kef-0.055
ynski-0.049
erb-0.049
Token I
Token ID314
Feature activation+3.30e-01

Pos logit contributions
                +0.047
hai+0.047
kef+0.047
ynski+0.042
erb+0.042
Neg logit contributions
alog-0.045
 chars-0.044
 Guilty-0.041
aukee-0.040
 illum-0.039
TokenTA
Token ID5603
Feature activation-2.06e-01

Pos logit contributions
alog+0.235
 chars+0.230
 Guilty+0.211
aukee+0.211
 illum+0.202
Neg logit contributions
                -0.248
hai-0.247
kef-0.246
ynski-0.222
erb-0.220
Token 6
Token ID718
Feature activation+2.96e-01

Pos logit contributions
                +0.158
hai+0.158
kef+0.156
ynski+0.141
erb+0.141
Neg logit contributions
alog-0.144
 chars-0.141
 Guilty-0.129
aukee-0.129
 illum-0.123
Token Jac
Token ID8445
Feature activation+5.20e-01

Pos logit contributions
alog+0.197
 chars+0.193
aukee+0.176
 Guilty+0.176
 illum+0.168
Neg logit contributions
                -0.236
hai-0.235
kef-0.233
ynski-0.212
erb-0.211
Tokenop
Token ID404
Feature activation-7.96e-01

Pos logit contributions
alog+0.340
 chars+0.334
 Guilty+0.303
aukee+0.302
 illum+0.289
Neg logit contributions
                -0.419
hai-0.419
kef-0.417
ynski-0.379
erb-0.376
Tokeno
Token ID78
Feature activation-1.69e+00

Pos logit contributions
                +0.636
hai+0.636
kef+0.633
ynski+0.573
erb+0.571
Neg logit contributions
alog-0.525
 chars-0.516
 Guilty-0.468
aukee-0.468
 illum-0.448
Token Gu
Token ID1962
Feature activation-3.57e-01

Pos logit contributions
hai+1.251
kef+1.241
                +1.233
erb+1.102
ynski+1.101
Neg logit contributions
alog-1.223
 chars-1.213
 Guilty-1.118
aukee-1.099
 illum-1.075
Tokenarn
Token ID1501
Feature activation-5.46e-01

Pos logit contributions
                +0.314
hai+0.313
kef+0.311
ynski+0.285
erb+0.283
Neg logit contributions
alog-0.208
 chars-0.202
 Guilty-0.182
aukee-0.181
 illum-0.172
Tokenieri
Token ID29864
Feature activation-3.34e-01

Pos logit contributions
                +0.402
hai+0.400
kef+0.398
ynski+0.359
erb+0.356
Neg logit contributions
alog-0.396
 chars-0.388
aukee-0.355
 Guilty-0.355
 illum-0.340
Token (
Token ID357
Feature activation+9.93e-02

Pos logit contributions
                +0.269
hai+0.268
kef+0.267
ynski+0.242
erb+0.241
Neg logit contributions
alog-0.219
 chars-0.215
 Guilty-0.195
aukee-0.195
 illum-0.186
TokenAst
Token ID33751
Feature activation-1.18e-02

Pos logit contributions
alog+0.077
 chars+0.076
aukee+0.070
 Guilty+0.070
 illum+0.067
Neg logit contributions
                -0.068
hai-0.068
kef-0.067
ynski-0.060
erb-0.060
Token-
Token ID12
Feature activation+4.94e-01

Pos logit contributions
                +0.176
hai+0.176
kef+0.175
ynski+0.158
erb+0.157
Neg logit contributions
alog-0.162
 chars-0.159
 Guilty-0.145
aukee-0.145
 illum-0.139
Tokenrent
Token ID1156
Feature activation+4.35e-01

Pos logit contributions
alog+0.348
 chars+0.341
aukee+0.313
 Guilty+0.312
 illum+0.299
Neg logit contributions
                -0.375
hai-0.375
kef-0.370
ynski-0.335
erb-0.333
Tokenal
Token ID282
Feature activation+2.08e-02

Pos logit contributions
alog+0.285
 chars+0.280
 Guilty+0.255
aukee+0.254
 illum+0.243
Neg logit contributions
                -0.349
hai-0.349
kef-0.346
ynski-0.315
erb-0.313
Token service
Token ID2139
Feature activation+6.26e-02

Pos logit contributions
alog+0.014
 chars+0.014
 Guilty+0.013
aukee+0.013
 illum+0.012
Neg logit contributions
                -0.016
hai-0.016
kef-0.016
ynski-0.014
erb-0.014
Token Air
Token ID3701
Feature activation-1.68e-01

Pos logit contributions
alog+0.040
 chars+0.039
aukee+0.035
 Guilty+0.035
 illum+0.034
Neg logit contributions
                -0.051
hai-0.051
kef-0.051
ynski-0.046
erb-0.046
TokenB
Token ID33
Feature activation-1.69e+00

Pos logit contributions
                +0.135
hai+0.135
kef+0.134
ynski+0.122
erb+0.121
Neg logit contributions
alog-0.111
 chars-0.108
 Guilty-0.098
aukee-0.098
 illum-0.094
Tokenn
Token ID77
Feature activation+3.03e-01

Pos logit contributions
kef+1.268
                +1.266
hai+1.256
ynski+1.128
erb+1.121
Neg logit contributions
alog-1.207
 chars-1.181
 Guilty-1.075
aukee-1.071
 illum-1.035
TokenB
Token ID33
Feature activation-1.86e-01

Pos logit contributions
alog+0.183
 chars+0.180
 Guilty+0.163
aukee+0.162
 illum+0.154
Neg logit contributions
hai-0.258
                -0.258
kef-0.256
ynski-0.234
erb-0.232
Token and
Token ID290
Feature activation-4.59e-02

Pos logit contributions
                +0.146
hai+0.146
kef+0.145
ynski+0.131
erb+0.131
Neg logit contributions
alog-0.126
 chars-0.123
 Guilty-0.112
aukee-0.112
 illum-0.107
Token discount
Token ID9780
Feature activation-2.53e-01

Pos logit contributions
                +0.035
hai+0.035
kef+0.035
ynski+0.032
erb+0.032
Neg logit contributions
alog-0.032
 chars-0.031
 Guilty-0.028
aukee-0.028
 illum-0.027
Token shopping
Token ID9735
Feature activation-9.85e-02

Pos logit contributions
                +0.190
hai+0.190
kef+0.188
ynski+0.170
erb+0.169
Neg logit contributions
alog-0.180
 chars-0.177
 Guilty-0.162
aukee-0.161
 illum-0.155
Token am
Token ID716
Feature activation+1.80e-01

Pos logit contributions
                +0.044
hai+0.043
kef+0.043
ynski+0.039
erb+0.039
Neg logit contributions
alog-0.040
 chars-0.039
 Guilty-0.036
aukee-0.035
 illum-0.034
Tokenok
Token ID482
Feature activation-2.07e-01

Pos logit contributions
alog+0.092
 chars+0.090
 Guilty+0.079
aukee+0.079
 illum+0.074
Neg logit contributions
                -0.171
hai-0.171
kef-0.170
ynski-0.157
erb-0.156
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
                +0.168
hai+0.167
kef+0.166
ynski+0.151
erb+0.150
Neg logit contributions
alog-0.134
 chars-0.131
 Guilty-0.119
aukee-0.119
 illum-0.114
Token CNN
Token ID8100
Feature activation+1.47e-02

Pos logit contributions
alog+0.102
 chars+0.100
aukee+0.091
 Guilty+0.091
 illum+0.087
Neg logit contributions
                -0.117
hai-0.116
kef-0.116
ynski-0.105
erb-0.104
Token president
Token ID1893
Feature activation-7.74e-01

Pos logit contributions
alog+0.010
 chars+0.010
 Guilty+0.009
aukee+0.009
 illum+0.008
Neg logit contributions
                -0.012
hai-0.012
kef-0.011
ynski-0.010
erb-0.010
Token Jeff
Token ID5502
Feature activation+1.33e+00

Pos logit contributions
                +0.573
hai+0.570
kef+0.568
ynski+0.511
erb+0.507
Neg logit contributions
alog-0.558
 chars-0.550
 Guilty-0.504
aukee-0.500
 illum-0.483
Token Zucker
Token ID27629
Feature activation-1.15e-01

Pos logit contributions
alog+1.205
 chars+1.185
aukee+1.114
 Guilty+1.106
 illum+1.078
Neg logit contributions
                -0.736
hai-0.733
kef-0.728
erb-0.623
ynski-0.622
Token
Token ID447
Feature activation-3.59e-01

Pos logit contributions
                +0.091
hai+0.090
kef+0.090
ynski+0.081
erb+0.081
Neg logit contributions
alog-0.077
 chars-0.076
 Guilty-0.069
aukee-0.069
 illum-0.066
Token
Token ID247
Feature activation-2.87e-02

Pos logit contributions
                +0.297
hai+0.297
kef+0.295
ynski+0.269
erb+0.267
Neg logit contributions
alog-0.227
 chars-0.222
 Guilty-0.201
aukee-0.200
 illum-0.191
Tokens
Token ID82
Feature activation-6.43e-02

Pos logit contributions
                +0.024
hai+0.024
kef+0.024
ynski+0.022
erb+0.022
Neg logit contributions
alog-0.018
 chars-0.017
 Guilty-0.016
aukee-0.016
 illum-0.015
Token then
Token ID788
Feature activation+3.42e-01

Pos logit contributions
                +0.049
hai+0.049
kef+0.049
ynski+0.044
erb+0.044
Neg logit contributions
alog-0.045
 chars-0.044
 Guilty-0.040
aukee-0.040
 illum-0.038
Token.
Token ID13
Feature activation-1.24e-01

Pos logit contributions
                +0.229
hai+0.229
kef+0.227
ynski+0.205
erb+0.204
Neg logit contributions
alog-0.212
 chars-0.208
 Guilty-0.191
aukee-0.190
 illum-0.182
Token
Token ID447
Feature activation-1.24e-01

Pos logit contributions
                +0.093
hai+0.093
kef+0.092
ynski+0.083
erb+0.083
Neg logit contributions
alog-0.088
 chars-0.087
 Guilty-0.079
aukee-0.079
 illum-0.076
Token
Token ID251
Feature activation+6.18e-02

Pos logit contributions
                +0.114
hai+0.114
kef+0.114
ynski+0.105
erb+0.104
Neg logit contributions
alog-0.066
 chars-0.064
 Guilty-0.057
aukee-0.057
 illum-0.054
Token
Token ID198
Feature activation-1.31e-02

Pos logit contributions
alog+0.041
 chars+0.040
 Guilty+0.036
aukee+0.036
 illum+0.035
Neg logit contributions
                -0.049
hai-0.049
kef-0.049
ynski-0.044
erb-0.044
Token
Token ID198
Feature activation+7.46e-02

Pos logit contributions
                +0.010
hai+0.010
kef+0.010
ynski+0.009
erb+0.009
Neg logit contributions
alog-0.009
 chars-0.009
 Guilty-0.008
aukee-0.008
 illum-0.007
TokenTed
Token ID38972
Feature activation+1.33e+00

Pos logit contributions
alog+0.058
 chars+0.057
 Guilty+0.052
aukee+0.052
 illum+0.050
Neg logit contributions
                -0.051
hai-0.051
kef-0.051
ynski-0.045
erb-0.045
Token Cruz
Token ID8742
Feature activation+2.65e-01

Pos logit contributions
alog+0.831
 chars+0.813
aukee+0.740
 Guilty+0.738
 illum+0.701
Neg logit contributions
                -1.107
hai-1.106
kef-1.098
erb-0.996
ynski-0.996
Token speaks
Token ID9209
Feature activation+3.58e-02

Pos logit contributions
alog+0.179
 chars+0.175
 Guilty+0.160
aukee+0.159
 illum+0.153
Neg logit contributions
                -0.208
hai-0.207
kef-0.206
ynski-0.187
erb-0.186
Token to
Token ID284
Feature activation+1.29e-01

Pos logit contributions
alog+0.024
 chars+0.024
 Guilty+0.022
aukee+0.022
 illum+0.021
Neg logit contributions
                -0.028
hai-0.028
kef-0.028
ynski-0.025
erb-0.025
Token potential
Token ID2785
Feature activation+1.33e-01

Pos logit contributions
alog+0.094
 chars+0.092
 Guilty+0.084
aukee+0.084
 illum+0.081
Neg logit contributions
                -0.095
hai-0.095
kef-0.094
ynski-0.085
erb-0.084
Token supporters
Token ID5941
Feature activation+2.51e-01

Pos logit contributions
alog+0.086
 chars+0.085
 Guilty+0.077
aukee+0.077
 illum+0.073
Neg logit contributions
                -0.108
hai-0.108
kef-0.107
ynski-0.097
erb-0.097
TokenSign
Token ID11712
Feature activation+3.34e-02

Pos logit contributions
                +0.038
hai+0.038
kef+0.038
ynski+0.034
erb+0.034
Neg logit contributions
alog-0.028
 chars-0.028
 Guilty-0.025
aukee-0.025
 illum-0.024
Token up
Token ID510
Feature activation+2.51e-01

Pos logit contributions
alog+0.023
 chars+0.023
 Guilty+0.021
aukee+0.021
 illum+0.020
Neg logit contributions
                -0.026
hai-0.026
kef-0.025
ynski-0.023
erb-0.023
Token for
Token ID329
Feature activation+1.17e+00

Pos logit contributions
alog+0.181
 chars+0.178
aukee+0.163
 Guilty+0.163
 illum+0.156
Neg logit contributions
                -0.185
hai-0.184
kef-0.183
ynski-0.165
erb-0.164
Token our
Token ID674
Feature activation+1.00e+00

Pos logit contributions
alog+0.937
 chars+0.921
aukee+0.861
 Guilty+0.843
 illum+0.815
Neg logit contributions
                -0.784
hai-0.779
kef-0.763
erb-0.684
ynski-0.682
Token Wine
Token ID20447
Feature activation+1.31e+00

Pos logit contributions
alog+0.739
 chars+0.732
aukee+0.668
 Guilty+0.664
 illum+0.638
Neg logit contributions
hai-0.728
                -0.725
kef-0.714
ynski-0.645
erb-0.644
Token Club
Token ID6289
Feature activation+1.71e+00

Pos logit contributions
alog+0.601
 chars+0.590
aukee+0.523
 Guilty+0.496
 illum+0.482
Neg logit contributions
hai-1.310
                -1.301
kef-1.300
erb-1.210
ynski-1.205
Token today
Token ID1909
Feature activation-1.24e-01

Pos logit contributions
alog+1.317
 chars+1.289
aukee+1.198
 Guilty+1.182
 illum+1.156
Neg logit contributions
                -1.176
hai-1.169
kef-1.162
erb-1.045
ynski-1.043
Token.
Token ID13
Feature activation+7.06e-01

Pos logit contributions
                +0.102
hai+0.102
kef+0.102
ynski+0.093
erb+0.092
Neg logit contributions
alog-0.079
 chars-0.077
 Guilty-0.070
aukee-0.070
 illum-0.066
Token Did
Token ID7731
Feature activation-1.26e-01

Pos logit contributions
alog+0.423
 chars+0.410
aukee+0.371
 Guilty+0.367
 illum+0.353
Neg logit contributions
                -0.612
hai-0.606
kef-0.600
ynski-0.553
erb-0.551
Token you
Token ID345
Feature activation-5.84e-01

Pos logit contributions
                +0.101
hai+0.101
kef+0.100
ynski+0.091
erb+0.091
Neg logit contributions
alog-0.083
 chars-0.081
 Guilty-0.073
aukee-0.073
 illum-0.070
Token know
Token ID760
Feature activation-9.91e-01

Pos logit contributions
                +0.422
hai+0.422
kef+0.420
ynski+0.376
erb+0.372
Neg logit contributions
alog-0.430
 chars-0.425
 Guilty-0.388
aukee-0.386
 illum-0.373
Token become
Token ID1716
Feature activation+8.39e-01

Pos logit contributions
alog+0.177
 chars+0.174
 Guilty+0.159
aukee+0.159
 illum+0.152
Neg logit contributions
                -0.188
hai-0.187
kef-0.186
ynski-0.167
erb-0.167
Token an
Token ID281
Feature activation+1.16e-01

Pos logit contributions
alog+0.642
 chars+0.625
aukee+0.582
 Guilty+0.575
 illum+0.555
Neg logit contributions
                -0.590
kef-0.578
hai-0.577
ynski-0.523
erb-0.518
Token In
Token ID554
Feature activation+5.43e-01

Pos logit contributions
alog+0.079
 chars+0.078
 Guilty+0.071
aukee+0.071
 illum+0.068
Neg logit contributions
                -0.090
hai-0.089
kef-0.089
ynski-0.080
erb-0.080
TokenDaily
Token ID28545
Feature activation+3.71e-01

Pos logit contributions
alog+0.541
 chars+0.533
aukee+0.502
 Guilty+0.501
 illum+0.487
Neg logit contributions
                -0.254
hai-0.251
kef-0.248
ynski-0.210
erb-0.208
Token supporter
Token ID15525
Feature activation-1.17e+00

Pos logit contributions
alog+0.300
 chars+0.295
aukee+0.273
 Guilty+0.271
 illum+0.263
Neg logit contributions
                -0.242
hai-0.241
kef-0.239
erb-0.212
ynski-0.212
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
                +0.992
hai+0.990
kef+0.988
ynski+0.906
erb+0.891
Neg logit contributions
alog-0.707
 chars-0.704
 Guilty-0.636
aukee-0.616
 illum-0.602
Token Powered
Token ID45090
Feature activation-4.62e-01

Pos logit contributions
alog+0.817
 chars+0.783
aukee+0.710
 Guilty+0.705
 illum+0.671
Neg logit contributions
                -1.173
hai-1.153
kef-1.149
ynski-1.060
erb-1.058
Token by
Token ID416
Feature activation-6.03e-02

Pos logit contributions
                +0.337
hai+0.336
kef+0.334
ynski+0.300
erb+0.298
Neg logit contributions
alog-0.338
 chars-0.333
 Guilty-0.306
aukee-0.304
 illum-0.292
Token Press
Token ID4332
Feature activation+3.11e-01

Pos logit contributions
                +0.040
hai+0.040
kef+0.039
ynski+0.035
erb+0.035
Neg logit contributions
alog-0.048
 chars-0.047
 Guilty-0.044
aukee-0.044
 illum-0.042
TokenPat
Token ID12130
Feature activation+1.60e+00

Pos logit contributions
alog+0.226
 chars+0.222
aukee+0.204
 Guilty+0.203
 illum+0.195
Neg logit contributions
                -0.227
hai-0.227
kef-0.225
ynski-0.202
erb-0.201
Tokenron
Token ID1313
Feature activation-1.87e-01

Pos logit contributions
alog+0.737
 chars+0.725
aukee+0.643
 Guilty+0.628
 illum+0.597
Neg logit contributions
hai-1.585
                -1.581
kef-1.567
ynski-1.453
erb-1.449
Token Democratic
Token ID4390
Feature activation+2.57e-01

Pos logit contributions
alog+0.009
 chars+0.009
 Guilty+0.008
aukee+0.008
 illum+0.008
Neg logit contributions
                -0.012
hai-0.012
kef-0.011
ynski-0.010
erb-0.010
Token Party
Token ID3615
Feature activation+3.09e-01

Pos logit contributions
alog+0.139
 chars+0.135
aukee+0.120
 Guilty+0.120
 illum+0.113
Neg logit contributions
                -0.237
hai-0.236
kef-0.234
ynski-0.216
erb-0.215
Token.
Token ID13
Feature activation+7.16e-02

Pos logit contributions
alog+0.206
 chars+0.201
 Guilty+0.183
aukee+0.183
 illum+0.175
Neg logit contributions
                -0.247
hai-0.246
kef-0.244
ynski-0.222
erb-0.220
Token
Token ID198
Feature activation-7.98e-02

Pos logit contributions
alog+0.048
 chars+0.047
aukee+0.043
 Guilty+0.043
 illum+0.041
Neg logit contributions
                -0.057
hai-0.056
kef-0.056
ynski-0.051
erb-0.050
Token
Token ID198
Feature activation+7.31e-02

Pos logit contributions
                +0.063
hai+0.063
kef+0.063
ynski+0.057
erb+0.056
Neg logit contributions
alog-0.053
 chars-0.052
 Guilty-0.048
aukee-0.048
 illum-0.045
TokenLat
Token ID24220
Feature activation+1.44e+00

Pos logit contributions
alog+0.053
 chars+0.052
 Guilty+0.048
aukee+0.048
 illum+0.046
Neg logit contributions
                -0.053
hai-0.053
kef-0.053
ynski-0.047
erb-0.047
Tokenino
Token ID2879
Feature activation+1.77e-01

Pos logit contributions
alog+0.490
 chars+0.481
aukee+0.397
 Guilty+0.396
 illum+0.358
Neg logit contributions
                -1.603
hai-1.596
kef-1.589
ynski-1.485
erb-1.482
Token voters
Token ID4446
Feature activation-2.95e-01

Pos logit contributions
alog+0.129
 chars+0.127
 Guilty+0.116
aukee+0.116
 illum+0.111
Neg logit contributions
                -0.130
hai-0.129
kef-0.128
ynski-0.115
erb-0.115
Token have
Token ID423
Feature activation-2.00e-01

Pos logit contributions
                +0.235
hai+0.235
kef+0.233
ynski+0.212
erb+0.211
Neg logit contributions
alog-0.196
 chars-0.192
 Guilty-0.175
aukee-0.174
 illum-0.167
Token leaned
Token ID23831
Feature activation-5.08e-01

Pos logit contributions
                +0.155
hai+0.155
kef+0.154
ynski+0.139
erb+0.139
Neg logit contributions
alog-0.137
 chars-0.135
 Guilty-0.123
aukee-0.123
 illum-0.117
Token more
Token ID517
Feature activation-3.46e-01

Pos logit contributions
                +0.404
hai+0.404
kef+0.401
ynski+0.364
erb+0.362
Neg logit contributions
alog-0.337
 chars-0.330
 Guilty-0.301
aukee-0.300
 illum-0.286
Token filmed
Token ID18976
Feature activation+2.52e-01

Pos logit contributions
alog+0.299
 chars+0.293
 Guilty+0.270
aukee+0.270
 illum+0.259
Neg logit contributions
                -0.278
hai-0.277
kef-0.275
ynski-0.246
erb-0.245
Token.
Token ID13
Feature activation-7.96e-02

Pos logit contributions
alog+0.170
 chars+0.167
 Guilty+0.152
aukee+0.152
 illum+0.145
Neg logit contributions
                -0.199
hai-0.198
kef-0.197
ynski-0.178
erb-0.177
Token It
Token ID632
Feature activation-2.87e-03

Pos logit contributions
                +0.063
hai+0.063
kef+0.062
ynski+0.056
erb+0.056
Neg logit contributions
alog-0.054
 chars-0.053
 Guilty-0.048
aukee-0.048
 illum-0.046
Token is
Token ID318
Feature activation+9.57e-02

Pos logit contributions
                +0.002
hai+0.002
kef+0.002
ynski+0.002
erb+0.002
Neg logit contributions
alog-0.002
 chars-0.002
 Guilty-0.002
aukee-0.002
 illum-0.002
Token our
Token ID674
Feature activation-8.48e-02

Pos logit contributions
alog+0.066
 chars+0.065
 Guilty+0.059
aukee+0.059
 illum+0.057
Neg logit contributions
                -0.074
hai-0.073
kef-0.073
ynski-0.066
erb-0.065
Token collaboration
Token ID12438
Feature activation+2.52e-01

Pos logit contributions
                +0.063
hai+0.063
kef+0.063
ynski+0.057
erb+0.056
Neg logit contributions
alog-0.061
 chars-0.059
 Guilty-0.054
aukee-0.054
 illum-0.052
Token which
Token ID543
Feature activation+3.18e-01

Pos logit contributions
alog+0.181
 chars+0.178
aukee+0.163
 Guilty+0.163
 illum+0.156
Neg logit contributions
                -0.186
hai-0.186
kef-0.184
ynski-0.166
erb-0.165
Token makes
Token ID1838
Feature activation-1.20e-01

Pos logit contributions
alog+0.215
 chars+0.211
aukee+0.192
 Guilty+0.192
 illum+0.183
Neg logit contributions
                -0.249
hai-0.249
kef-0.247
ynski-0.224
erb-0.222
Token this
Token ID428
Feature activation-7.57e-02

Pos logit contributions
                +0.090
hai+0.090
kef+0.089
ynski+0.081
erb+0.080
Neg logit contributions
alog-0.085
 chars-0.083
 Guilty-0.076
aukee-0.076
 illum-0.073
Token kind
Token ID1611
Feature activation-2.14e-03

Pos logit contributions
                +0.057
hai+0.057
kef+0.056
ynski+0.051
erb+0.051
Neg logit contributions
alog-0.054
 chars-0.053
 Guilty-0.048
aukee-0.048
 illum-0.046
Token of
Token ID286
Feature activation-3.53e-01

Pos logit contributions
                +0.002
hai+0.002
kef+0.002
ynski+0.001
erb+0.001
Neg logit contributions
alog-0.002
 chars-0.001
 Guilty-0.001
aukee-0.001
 illum-0.001
Token materials
Token ID5696
Feature activation+1.35e-01

Pos logit contributions
alog+0.003
 chars+0.003
 Guilty+0.002
aukee+0.002
 illum+0.002
Neg logit contributions
                -0.003
hai-0.003
kef-0.003
ynski-0.003
erb-0.003
Token such
Token ID884
Feature activation+7.74e-02

Pos logit contributions
alog+0.095
 chars+0.093
 Guilty+0.085
aukee+0.085
 illum+0.081
Neg logit contributions
                -0.103
hai-0.103
kef-0.102
ynski-0.092
erb-0.091
Token as
Token ID355
Feature activation-2.42e-01

Pos logit contributions
alog+0.054
 chars+0.053
 Guilty+0.049
aukee+0.049
 illum+0.046
Neg logit contributions
                -0.059
hai-0.059
kef-0.058
ynski-0.053
erb-0.052
Token earth
Token ID4534
Feature activation+1.53e-01

Pos logit contributions
                +0.186
hai+0.185
kef+0.184
ynski+0.166
erb+0.165
Neg logit contributions
alog-0.168
 chars-0.165
 Guilty-0.150
aukee-0.150
 illum-0.144
Token and
Token ID290
Feature activation-1.87e-01

Pos logit contributions
alog+0.107
 chars+0.105
 Guilty+0.096
aukee+0.096
 illum+0.091
Neg logit contributions
                -0.117
hai-0.116
kef-0.115
ynski-0.104
erb-0.104
Token builders
Token ID31606
Feature activation+7.59e-02

Pos logit contributions
                +0.147
hai+0.146
kef+0.145
ynski+0.132
erb+0.131
Neg logit contributions
alog-0.126
 chars-0.124
 Guilty-0.113
aukee-0.113
 illum-0.108
Token'
Token ID6
Feature activation+3.04e-01

Pos logit contributions
alog+0.052
 chars+0.051
 Guilty+0.046
aukee+0.046
 illum+0.044
Neg logit contributions
                -0.059
hai-0.059
kef-0.058
ynski-0.053
erb-0.053
Token rubble
Token ID29632
Feature activation+4.40e-02

Pos logit contributions
alog+0.207
 chars+0.202
 Guilty+0.185
aukee+0.184
 illum+0.176
Neg logit contributions
                -0.237
hai-0.237
kef-0.235
ynski-0.213
erb-0.212
Token is
Token ID318
Feature activation+2.01e-01

Pos logit contributions
alog+0.030
 chars+0.029
 Guilty+0.026
aukee+0.026
 illum+0.025
Neg logit contributions
                -0.035
hai-0.035
kef-0.034
ynski-0.031
erb-0.031
Token likely
Token ID1884
Feature activation+3.23e-01

Pos logit contributions
alog+0.136
 chars+0.133
 Guilty+0.122
aukee+0.122
 illum+0.116
Neg logit contributions
                -0.157
hai-0.156
kef-0.155
ynski-0.141
erb-0.140
Token to
Token ID284
Feature activation-8.09e-02

Pos logit contributions
alog+0.226
 chars+0.221
aukee+0.202
 Guilty+0.202
 illum+0.193
Neg logit contributions
                -0.247
hai-0.246
kef-0.244
ynski-0.221
erb-0.220
Token,
Token ID11
Feature activation+5.07e-02

Pos logit contributions
alog+0.153
 chars+0.150
 Guilty+0.137
aukee+0.137
 illum+0.131
Neg logit contributions
                -0.178
hai-0.177
kef-0.176
ynski-0.160
erb-0.159
Token who
Token ID508
Feature activation+1.48e-01

Pos logit contributions
alog+0.034
 chars+0.033
 Guilty+0.030
aukee+0.030
 illum+0.029
Neg logit contributions
                -0.040
hai-0.040
kef-0.040
ynski-0.036
erb-0.036
Token is
Token ID318
Feature activation+1.05e-01

Pos logit contributions
alog+0.099
 chars+0.097
 Guilty+0.088
aukee+0.088
 illum+0.084
Neg logit contributions
                -0.117
hai-0.117
kef-0.116
ynski-0.106
erb-0.105
Token one
Token ID530
Feature activation+6.21e-02

Pos logit contributions
alog+0.072
 chars+0.071
 Guilty+0.065
aukee+0.065
 illum+0.062
Neg logit contributions
                -0.080
hai-0.080
kef-0.079
ynski-0.072
erb-0.071
Token of
Token ID286
Feature activation+3.20e-01

Pos logit contributions
alog+0.044
 chars+0.043
 Guilty+0.039
aukee+0.039
 illum+0.037
Neg logit contributions
                -0.047
hai-0.047
kef-0.047
ynski-0.042
erb-0.042
Token the
Token ID262
Feature activation+2.72e-01

Pos logit contributions
alog+0.232
 chars+0.228
aukee+0.209
 Guilty+0.209
 illum+0.200
Neg logit contributions
                -0.234
hai-0.234
kef-0.232
ynski-0.209
erb-0.208
Token finest
Token ID18822
Feature activation-1.02e-01

Pos logit contributions
alog+0.183
 chars+0.179
aukee+0.163
 Guilty+0.163
 illum+0.156
Neg logit contributions
                -0.215
hai-0.214
kef-0.212
ynski-0.193
erb-0.192
Token intuitive
Token ID19933
Feature activation+3.44e-01

Pos logit contributions
                +0.078
hai+0.078
kef+0.078
ynski+0.070
erb+0.070
Neg logit contributions
alog-0.071
 chars-0.069
 Guilty-0.063
aukee-0.063
 illum-0.061
Token mathematic
Token ID33161
Feature activation+3.11e-01

Pos logit contributions
alog+0.241
 chars+0.237
aukee+0.216
 Guilty+0.216
 illum+0.207
Neg logit contributions
                -0.260
hai-0.260
kef-0.257
ynski-0.233
erb-0.231
Tokenians
Token ID1547
Feature activation+2.63e-01

Pos logit contributions
alog+0.250
 chars+0.246
aukee+0.228
 Guilty+0.228
 illum+0.220
Neg logit contributions
                -0.203
hai-0.203
kef-0.201
ynski-0.179
erb-0.178
Token I
Token ID314
Feature activation+1.35e-01

Pos logit contributions
alog+0.182
 chars+0.179
aukee+0.163
 Guilty+0.163
 illum+0.156
Neg logit contributions
                -0.202
hai-0.201
kef-0.200
ynski-0.181
erb-0.180
Token white
Token ID2330
Feature activation-1.30e-01

Pos logit contributions
                +0.291
hai+0.291
kef+0.289
ynski+0.261
erb+0.259
Neg logit contributions
alog-0.262
 chars-0.257
 Guilty-0.235
aukee-0.234
 illum-0.224
Token male
Token ID4257
Feature activation-9.22e-02

Pos logit contributions
                +0.104
hai+0.104
kef+0.103
ynski+0.094
erb+0.093
Neg logit contributions
alog-0.086
 chars-0.084
 Guilty-0.077
aukee-0.077
 illum-0.073
Token perspective
Token ID6650
Feature activation-6.62e-02

Pos logit contributions
                +0.076
hai+0.076
kef+0.075
ynski+0.069
erb+0.068
Neg logit contributions
alog-0.059
 chars-0.057
 Guilty-0.052
aukee-0.052
 illum-0.049
Token we
Token ID356
Feature activation+5.21e-02

Pos logit contributions
                +0.052
hai+0.052
kef+0.051
ynski+0.047
erb+0.046
Neg logit contributions
alog-0.045
 chars-0.044
 Guilty-0.040
aukee-0.040
 illum-0.038
Token are
Token ID389
Feature activation+9.87e-02

Pos logit contributions
alog+0.034
 chars+0.033
 Guilty+0.030
aukee+0.030
 illum+0.029
Neg logit contributions
                -0.042
hai-0.042
kef-0.042
ynski-0.038
erb-0.038
Token most
Token ID749
Feature activation+1.21e-01

Pos logit contributions
alog+0.063
 chars+0.062
 Guilty+0.056
aukee+0.056
 illum+0.053
Neg logit contributions
                -0.081
hai-0.081
kef-0.080
ynski-0.073
erb-0.073
Token used
Token ID973
Feature activation-1.38e-01

Pos logit contributions
alog+0.077
 chars+0.075
 Guilty+0.068
aukee+0.068
 illum+0.064
Neg logit contributions
                -0.101
hai-0.100
kef-0.100
ynski-0.091
erb-0.090
Token to
Token ID284
Feature activation+1.18e-01

Pos logit contributions
                +0.106
hai+0.106
kef+0.105
ynski+0.095
erb+0.095
Neg logit contributions
alog-0.095
 chars-0.094
 Guilty-0.085
aukee-0.085
 illum-0.082
Token,
Token ID11
Feature activation+2.36e-01

Pos logit contributions
alog+0.091
 chars+0.089
 Guilty+0.082
aukee+0.082
 illum+0.079
Neg logit contributions
                -0.082
hai-0.082
kef-0.081
ynski-0.073
erb-0.072
Token but
Token ID475
Feature activation-1.76e-03

Pos logit contributions
alog+0.162
 chars+0.159
aukee+0.145
 Guilty+0.145
 illum+0.138
Neg logit contributions
                -0.182
hai-0.182
kef-0.181
ynski-0.164
erb-0.163
Token rather
Token ID2138
Feature activation-4.15e-02

Pos logit contributions
                +0.001
hai+0.001
kef+0.001
ynski+0.001
erb+0.001
Neg logit contributions
alog-0.001
 chars-0.001
 Guilty-0.001
aukee-0.001
 illum-0.001
Token like
Token ID588
Feature activation+2.05e-01

Pos logit contributions
alog+0.257
 chars+0.252
aukee+0.229
 Guilty+0.229
 illum+0.219
Neg logit contributions
                -0.300
hai-0.299
kef-0.297
ynski-0.270
erb-0.267
Token Carl
Token ID8124
Feature activation-1.91e-01

Pos logit contributions
alog+0.141
 chars+0.138
 Guilty+0.126
aukee+0.125
 illum+0.120
Neg logit contributions
                -0.159
hai-0.158
kef-0.157
ynski-0.142
erb-0.141
Token Hag
Token ID21375
Feature activation+6.84e-01

Pos logit contributions
                +0.168
hai+0.168
kef+0.167
ynski+0.153
erb+0.152
Neg logit contributions
alog-0.110
 chars-0.108
 Guilty-0.096
aukee-0.096
 illum-0.091
Tokenelin
Token ID27176
Feature activation+1.74e-01

Pos logit contributions
alog+0.473
 chars+0.466
 Guilty+0.426
aukee+0.423
 illum+0.405
Neg logit contributions
hai-0.526
                -0.523
kef-0.520
ynski-0.470
erb-0.470
Token and
Token ID290
Feature activation+6.06e-02

Pos logit contributions
alog+0.117
 chars+0.114
 Guilty+0.104
aukee+0.104
 illum+0.099
Neg logit contributions
                -0.138
hai-0.137
kef-0.136
ynski-0.124
erb-0.123
Token Nick
Token ID8047
Feature activation+5.86e-01

Pos logit contributions
alog+0.040
 chars+0.039
 Guilty+0.036
aukee+0.035
 illum+0.034
Neg logit contributions
                -0.049
hai-0.048
kef-0.048
ynski-0.044
erb-0.043
Token Bon
Token ID7979
Feature activation+1.39e+00

Pos logit contributions
alog+0.393
 chars+0.387
aukee+0.352
 Guilty+0.351
 illum+0.335
Neg logit contributions
hai-0.462
                -0.462
kef-0.456
ynski-0.416
erb-0.413
Tokenino
Token ID2879
Feature activation+2.35e-01

Pos logit contributions
alog+0.480
 chars+0.475
 Guilty+0.398
aukee+0.395
 illum+0.356
Neg logit contributions
hai-1.544
                -1.531
kef-1.519
ynski-1.444
erb-1.422
Token.
Token ID13
Feature activation+3.56e-01

Pos logit contributions
alog+0.156
 chars+0.153
 Guilty+0.139
aukee+0.139
 illum+0.133
Neg logit contributions
                -0.187
hai-0.187
kef-0.186
ynski-0.169
erb-0.168
Token No
Token ID1400
Feature activation-2.40e-01

Pos logit contributions
alog+0.239
 chars+0.234
aukee+0.213
 Guilty+0.212
 illum+0.203
Neg logit contributions
                -0.281
hai-0.280
kef-0.278
ynski-0.252
erb-0.251
Token mention
Token ID3068
Feature activation+2.17e-01

Pos logit contributions
                +0.187
hai+0.187
kef+0.186
ynski+0.168
erb+0.167
Neg logit contributions
alog-0.163
 chars-0.159
 Guilty-0.145
aukee-0.145
 illum-0.139
Token to
Token ID284
Feature activation-2.63e-01

Pos logit contributions
                +0.202
hai+0.201
kef+0.200
ynski+0.181
erb+0.180
Neg logit contributions
alog-0.182
 chars-0.178
 Guilty-0.163
aukee-0.162
 illum-0.155
Token several
Token ID1811
Feature activation-1.06e-02

Pos logit contributions
                +0.201
hai+0.200
kef+0.199
ynski+0.180
erb+0.179
Neg logit contributions
alog-0.184
 chars-0.180
 Guilty-0.165
aukee-0.164
 illum-0.158
Token media
Token ID2056
Feature activation-4.42e-01

Pos logit contributions
                +0.008
hai+0.008
kef+0.007
ynski+0.007
erb+0.007
Neg logit contributions
alog-0.008
 chars-0.008
 Guilty-0.007
aukee-0.007
 illum-0.007
Token outlets
Token ID12527
Feature activation-4.99e-01

Pos logit contributions
                +0.284
hai+0.283
kef+0.281
ynski+0.249
erb+0.247
Neg logit contributions
alog-0.362
 chars-0.356
 Guilty-0.329
aukee-0.329
 illum-0.318
Token,
Token ID11
Feature activation-1.82e-01

Pos logit contributions
                +0.409
hai+0.408
kef+0.406
ynski+0.369
erb+0.367
Neg logit contributions
alog-0.320
 chars-0.314
 Guilty-0.284
aukee-0.284
 illum-0.271
Token including
Token ID1390
Feature activation-7.16e-02

Pos logit contributions
                +0.148
hai+0.147
kef+0.146
ynski+0.133
erb+0.132
Neg logit contributions
alog-0.118
 chars-0.115
 Guilty-0.105
aukee-0.104
 illum-0.100
Token The
Token ID383
Feature activation-2.95e-02

Pos logit contributions
                +0.055
hai+0.055
kef+0.054
ynski+0.049
erb+0.049
Neg logit contributions
alog-0.050
 chars-0.049
 Guilty-0.044
aukee-0.044
 illum-0.043
Token Herald
Token ID18277
Feature activation-1.08e-01

Pos logit contributions
                +0.023
hai+0.023
kef+0.023
ynski+0.020
erb+0.020
Neg logit contributions
alog-0.020
 chars-0.020
 Guilty-0.018
aukee-0.018
 illum-0.017
Token.
Token ID13
Feature activation-2.93e-01

Pos logit contributions
                +0.087
hai+0.087
kef+0.087
ynski+0.079
erb+0.078
Neg logit contributions
alog-0.070
 chars-0.069
 Guilty-0.063
aukee-0.062
 illum-0.060
Token
Token ID564
Feature activation+1.35e-01

Pos logit contributions
                +0.231
hai+0.230
kef+0.229
ynski+0.208
erb+0.206
Neg logit contributions
alog-0.197
 chars-0.193
 Guilty-0.175
aukee-0.175
 illum-0.167
Token
Token ID250
Feature activation-4.76e-02

Pos logit contributions
alog+0.078
 chars+0.077
 Guilty+0.069
aukee+0.068
 illum+0.065
Neg logit contributions
                -0.119
hai-0.119
kef-0.118
ynski-0.108
erb-0.108
Token says
Token ID1139
Feature activation-3.57e-01

Pos logit contributions
alog+0.038
 chars+0.037
 Guilty+0.033
aukee+0.033
 illum+0.032
Neg logit contributions
                -0.057
hai-0.056
kef-0.056
ynski-0.051
erb-0.051
Token F
Token ID376
Feature activation-3.06e-01

Pos logit contributions
                +0.272
hai+0.271
kef+0.269
ynski+0.243
erb+0.241
Neg logit contributions
alog-0.250
 chars-0.245
 Guilty-0.224
aukee-0.223
 illum-0.214
TokenAS
Token ID1921
Feature activation-3.39e-01

Pos logit contributions
                +0.234
hai+0.233
kef+0.231
ynski+0.209
erb+0.208
Neg logit contributions
alog-0.213
 chars-0.209
 Guilty-0.191
aukee-0.191
 illum-0.182
Token affects
Token ID10975
Feature activation-2.34e-01

Pos logit contributions
                +0.265
hai+0.264
kef+0.262
ynski+0.237
erb+0.236
Neg logit contributions
alog-0.231
 chars-0.226
 Guilty-0.206
aukee-0.206
 illum-0.197
Token only
Token ID691
Feature activation-2.57e-01

Pos logit contributions
                +0.186
hai+0.186
kef+0.185
ynski+0.168
erb+0.167
Neg logit contributions
alog-0.155
 chars-0.152
 Guilty-0.138
aukee-0.138
 illum-0.132
Token a
Token ID257
Feature activation-2.13e-01

Pos logit contributions
                +0.203
hai+0.203
kef+0.201
ynski+0.182
erb+0.181
Neg logit contributions
alog-0.173
 chars-0.170
 Guilty-0.155
aukee-0.154
 illum-0.148
Token small
Token ID1402
Feature activation-3.52e-01

Pos logit contributions
                +0.168
hai+0.168
kef+0.167
ynski+0.151
erb+0.150
Neg logit contributions
alog-0.143
 chars-0.140
 Guilty-0.127
aukee-0.127
 illum-0.121
Token area
Token ID1989
Feature activation-6.99e-02

Pos logit contributions
                +0.260
hai+0.260
kef+0.258
ynski+0.232
erb+0.231
Neg logit contributions
alog-0.253
 chars-0.249
 Guilty-0.228
aukee-0.227
 illum-0.218
Token of
Token ID286
Feature activation-2.38e-01

Pos logit contributions
                +0.053
hai+0.053
kef+0.053
ynski+0.048
erb+0.047
Neg logit contributions
alog-0.049
 chars-0.048
 Guilty-0.044
aukee-0.044
 illum-0.042
Token speech
Token ID4046
Feature activation+1.82e-02

Pos logit contributions
                +0.175
hai+0.174
kef+0.173
ynski+0.155
erb+0.154
Neg logit contributions
alog-0.174
 chars-0.171
 Guilty-0.156
aukee-0.156
 illum-0.150
Token
Token ID851
Feature activation-3.17e-01

Pos logit contributions
alog+0.012
 chars+0.012
 Guilty+0.011
aukee+0.011
 illum+0.011
Neg logit contributions
                -0.014
hai-0.014
kef-0.014
ynski-0.013
erb-0.013
Token Protection
Token ID9985
Feature activation-5.49e-01

Pos logit contributions
alog+0.241
 chars+0.237
 Guilty+0.214
aukee+0.214
 illum+0.204
Neg logit contributions
                -0.319
hai-0.319
kef-0.316
ynski-0.288
erb-0.287
Token Agency
Token ID7732
Feature activation+2.18e-02

Pos logit contributions
                +0.349
hai+0.349
kef+0.346
ynski+0.307
erb+0.303
Neg logit contributions
alog-0.453
 chars-0.445
 Guilty-0.413
aukee-0.412
 illum-0.398
Token has
Token ID468
Feature activation+3.72e-02

Pos logit contributions
alog+0.015
 chars+0.015
 Guilty+0.013
aukee+0.013
 illum+0.013
Neg logit contributions
                -0.017
hai-0.017
kef-0.017
ynski-0.015
erb-0.015
Token retained
Token ID17383
Feature activation+5.19e-02

Pos logit contributions
alog+0.025
 chars+0.025
 Guilty+0.023
aukee+0.023
 illum+0.022
Neg logit contributions
                -0.029
hai-0.029
kef-0.029
ynski-0.026
erb-0.026
Token some
Token ID617
Feature activation+4.70e-02

Pos logit contributions
alog+0.036
 chars+0.036
 Guilty+0.033
aukee+0.033
 illum+0.031
Neg logit contributions
                -0.039
hai-0.039
kef-0.039
ynski-0.035
erb-0.035
Token staff
Token ID3085
Feature activation-1.15e-01

Pos logit contributions
alog+0.032
 chars+0.031
 Guilty+0.028
aukee+0.028
 illum+0.027
Neg logit contributions
                -0.037
hai-0.037
kef-0.037
ynski-0.033
erb-0.033
Token to
Token ID284
Feature activation+3.51e-02

Pos logit contributions
                +0.088
hai+0.088
kef+0.087
ynski+0.079
erb+0.078
Neg logit contributions
alog-0.080
 chars-0.078
 Guilty-0.071
aukee-0.071
 illum-0.068
Token look
Token ID804
Feature activation-9.46e-02

Pos logit contributions
alog+0.024
 chars+0.024
 Guilty+0.022
aukee+0.022
 illum+0.021
Neg logit contributions
                -0.027
hai-0.027
kef-0.027
ynski-0.024
erb-0.024
Token after
Token ID706
Feature activation-3.88e-01

Pos logit contributions
                +0.071
hai+0.071
kef+0.071
ynski+0.064
erb+0.063
Neg logit contributions
alog-0.067
 chars-0.066
 Guilty-0.060
aukee-0.060
 illum-0.057
Token its
Token ID663
Feature activation-2.66e-01

Pos logit contributions
                +0.296
hai+0.296
kef+0.294
ynski+0.266
erb+0.264
Neg logit contributions
alog-0.270
 chars-0.265
 Guilty-0.241
aukee-0.241
 illum-0.231
Token environmental
Token ID6142
Feature activation+1.07e-01

Pos logit contributions
                +0.203
hai+0.202
kef+0.201
ynski+0.182
erb+0.181
Neg logit contributions
alog-0.185
 chars-0.182
 Guilty-0.166
aukee-0.166
 illum-0.159
Token I
Token ID314
Feature activation+2.74e-01

Pos logit contributions
hai+0.338
                +0.338
kef+0.335
ynski+0.304
erb+0.302
Neg logit contributions
alog-0.298
 chars-0.293
 Guilty-0.267
aukee-0.266
 illum-0.255
Token can
Token ID460
Feature activation-1.06e-01

Pos logit contributions
alog+0.180
 chars+0.176
 Guilty+0.160
aukee+0.160
 illum+0.152
Neg logit contributions
                -0.221
hai-0.220
kef-0.219
ynski-0.199
erb-0.198
Token to
Token ID284
Feature activation-2.29e-01

Pos logit contributions
                +0.081
hai+0.081
kef+0.081
ynski+0.073
erb+0.073
Neg logit contributions
alog-0.073
 chars-0.072
 Guilty-0.066
aukee-0.065
 illum-0.063
Token lower
Token ID2793
Feature activation-9.84e-02

Pos logit contributions
                +0.172
hai+0.172
kef+0.171
ynski+0.154
erb+0.153
Neg logit contributions
alog-0.163
 chars-0.160
 Guilty-0.146
aukee-0.146
 illum-0.140
Token outrage
Token ID11616
Feature activation+1.11e-03

Pos logit contributions
                +0.072
hai+0.071
kef+0.071
ynski+0.064
erb+0.063
Neg logit contributions
alog-0.072
 chars-0.071
 Guilty-0.065
aukee-0.065
 illum-0.062
Tokenously
Token ID3481
Feature activation+6.64e-02

Pos logit contributions
alog+0.001
 chars+0.001
 Guilty+0.001
aukee+0.001
 illum+0.001
Neg logit contributions
                -0.001
hai-0.001
kef-0.001
ynski-0.001
erb-0.001
Token high
Token ID1029
Feature activation+7.92e-02

Pos logit contributions
alog+0.051
 chars+0.050
 Guilty+0.046
aukee+0.046
 illum+0.044
Neg logit contributions
                -0.046
hai-0.046
kef-0.046
ynski-0.041
erb-0.041
Token gasoline
Token ID21408
Feature activation-3.94e-01

Pos logit contributions
alog+0.052
 chars+0.051
 Guilty+0.046
aukee+0.046
 illum+0.044
Neg logit contributions
                -0.064
hai-0.064
kef-0.063
ynski-0.057
erb-0.057
Token and
Token ID290
Feature activation-2.65e-01

Pos logit contributions
                +0.290
hai+0.289
kef+0.287
ynski+0.259
erb+0.257
Neg logit contributions
alog-0.285
 chars-0.280
 Guilty-0.257
aukee-0.256
 illum-0.246
Token heating
Token ID16930
Feature activation+1.58e-01

Pos logit contributions
                +0.188
hai+0.188
kef+0.186
ynski+0.167
erb+0.166
Neg logit contributions
alog-0.200
 chars-0.196
 Guilty-0.180
aukee-0.180
 illum-0.173
Token-
Token ID12
Feature activation+2.05e-01

Pos logit contributions
alog+0.099
 chars+0.097
 Guilty+0.087
aukee+0.087
 illum+0.083
Neg logit contributions
                -0.132
hai-0.132
kef-0.131
ynski-0.120
erb-0.119
Token (
Token ID357
Feature activation-2.26e-01

Pos logit contributions
                +0.148
hai+0.148
kef+0.147
ynski+0.133
erb+0.132
Neg logit contributions
alog-0.129
 chars-0.127
 Guilty-0.116
aukee-0.115
 illum-0.110
TokenThe
Token ID464
Feature activation-3.22e-01

Pos logit contributions
                +0.172
hai+0.172
kef+0.171
ynski+0.154
erb+0.153
Neg logit contributions
alog-0.158
 chars-0.155
 Guilty-0.141
aukee-0.141
 illum-0.135
Token SF
Token ID14362
Feature activation-3.77e-01

Pos logit contributions
                +0.243
hai+0.243
kef+0.241
ynski+0.218
erb+0.216
Neg logit contributions
alog-0.227
 chars-0.222
 Guilty-0.203
aukee-0.203
 illum-0.194
TokenPD
Token ID5760
Feature activation-1.65e-01

Pos logit contributions
                +0.314
hai+0.313
kef+0.311
ynski+0.284
erb+0.282
Neg logit contributions
alog-0.237
 chars-0.232
 Guilty-0.210
aukee-0.209
 illum-0.199
Token en
Token ID551
Feature activation-2.25e-01

Pos logit contributions
                +0.132
hai+0.132
kef+0.131
ynski+0.119
erb+0.119
Neg logit contributions
alog-0.108
 chars-0.106
 Guilty-0.096
aukee-0.096
 illum-0.092
Tokenforces
Token ID27087
Feature activation-4.06e-01

Pos logit contributions
                +0.149
hai+0.149
kef+0.148
ynski+0.132
erb+0.130
Neg logit contributions
alog-0.180
 chars-0.177
 Guilty-0.163
aukee-0.163
 illum-0.157
Token moving
Token ID3867
Feature activation-2.60e-01

Pos logit contributions
                +0.309
hai+0.309
kef+0.307
ynski+0.277
erb+0.275
Neg logit contributions
alog-0.285
 chars-0.280
 Guilty-0.256
aukee-0.255
 illum-0.244
Token violations
Token ID11734
Feature activation-2.00e-01

Pos logit contributions
                +0.193
hai+0.192
kef+0.191
ynski+0.172
erb+0.171
Neg logit contributions
alog-0.187
 chars-0.184
 Guilty-0.169
aukee-0.168
 illum-0.162
Token,
Token ID11
Feature activation-1.82e-01

Pos logit contributions
                +0.153
hai+0.153
kef+0.152
ynski+0.137
erb+0.136
Neg logit contributions
alog-0.139
 chars-0.136
 Guilty-0.125
aukee-0.124
 illum-0.119
Token while
Token ID981
Feature activation-2.96e-01

Pos logit contributions
                +0.145
hai+0.144
kef+0.143
ynski+0.130
erb+0.129
Neg logit contributions
alog-0.121
 chars-0.119
 Guilty-0.108
aukee-0.108
 illum-0.103
Token SF
Token ID14362
Feature activation-3.94e-01

Pos logit contributions
                +0.222
hai+0.222
kef+0.220
ynski+0.199
erb+0.197
Neg logit contributions
alog-0.210
 chars-0.207
 Guilty-0.189
aukee-0.189
 illum-0.181
Token headed
Token ID9153
Feature activation-4.01e-01

Pos logit contributions
                +0.170
hai+0.170
kef+0.169
ynski+0.152
erb+0.151
Neg logit contributions
alog-0.161
 chars-0.158
 Guilty-0.144
aukee-0.144
 illum-0.138
Token chance
Token ID2863
Feature activation-2.61e-01

Pos logit contributions
                +0.286
hai+0.286
kef+0.283
ynski+0.254
erb+0.252
Neg logit contributions
alog-0.301
 chars-0.295
 Guilty-0.271
aukee-0.271
 illum-0.260
Token and
Token ID290
Feature activation-4.45e-02

Pos logit contributions
                +0.202
hai+0.201
kef+0.200
ynski+0.181
erb+0.180
Neg logit contributions
alog-0.180
 chars-0.176
 Guilty-0.161
aukee-0.160
 illum-0.154
Token forcing
Token ID10833
Feature activation-4.59e-02

Pos logit contributions
                +0.034
hai+0.034
kef+0.034
ynski+0.030
erb+0.030
Neg logit contributions
alog-0.031
 chars-0.030
 Guilty-0.028
aukee-0.028
 illum-0.027
Token Gu
Token ID1962
Feature activation-5.48e-01

Pos logit contributions
                +0.035
hai+0.035
kef+0.035
ynski+0.032
erb+0.032
Neg logit contributions
alog-0.032
 chars-0.031
 Guilty-0.028
aukee-0.028
 illum-0.027
Tokeniller
Token ID4665
Feature activation-1.35e+00

Pos logit contributions
                +0.311
hai+0.309
kef+0.307
ynski+0.267
erb+0.263
Neg logit contributions
alog-0.491
 chars-0.483
 Guilty-0.451
aukee-0.450
 illum-0.436
Tokenmo
Token ID5908
Feature activation-1.35e+00

Pos logit contributions
                +0.805
hai+0.801
kef+0.798
ynski+0.700
erb+0.694
Neg logit contributions
alog-1.167
 chars-1.142
aukee-1.073
 Guilty-1.070
 illum-1.031
Token O
Token ID440
Feature activation-1.45e+00

Pos logit contributions
hai+1.118
                +1.114
kef+1.111
ynski+1.015
erb+1.006
Neg logit contributions
alog-0.860
 chars-0.840
 Guilty-0.770
aukee-0.754
 illum-0.731
Tokencho
Token ID6679
Feature activation+6.33e-01

Pos logit contributions
hai+1.013
                +1.008
kef+1.001
ynski+0.891
erb+0.880
Neg logit contributions
alog-1.108
 chars-1.083
 Guilty-0.998
aukee-0.993
 illum-0.958
Tokena
Token ID64
Feature activation-1.15e-01

Pos logit contributions
alog+0.424
 chars+0.414
aukee+0.379
 Guilty+0.376
 illum+0.360
Neg logit contributions
                -0.500
hai-0.500
kef-0.496
ynski-0.450
erb-0.448
Token into
Token ID656
Feature activation-7.11e-02

Pos logit contributions
                +0.088
hai+0.088
kef+0.087
ynski+0.079
erb+0.078
Neg logit contributions
alog-0.080
 chars-0.078
 Guilty-0.072
aukee-0.071
 illum-0.068
Token odd
Token ID5629
Feature activation-1.28e-01

Pos logit contributions
alog+0.021
 chars+0.020
 Guilty+0.019
aukee+0.018
 illum+0.018
Neg logit contributions
                -0.024
hai-0.024
kef-0.024
ynski-0.022
erb-0.022
Token echoes
Token ID30346
Feature activation-1.07e-02

Pos logit contributions
                +0.099
hai+0.099
kef+0.098
ynski+0.089
erb+0.089
Neg logit contributions
alog-0.087
 chars-0.085
 Guilty-0.078
aukee-0.078
 illum-0.074
Token of
Token ID286
Feature activation-1.57e-01

Pos logit contributions
                +0.008
hai+0.008
kef+0.008
ynski+0.007
erb+0.007
Neg logit contributions
alog-0.007
 chars-0.007
 Guilty-0.007
aukee-0.007
 illum-0.006
Token M
Token ID337
Feature activation-1.40e-01

Pos logit contributions
                +0.120
hai+0.119
kef+0.119
ynski+0.107
erb+0.107
Neg logit contributions
alog-0.109
 chars-0.107
 Guilty-0.098
aukee-0.098
 illum-0.093
Token.
Token ID13
Feature activation-2.42e-01

Pos logit contributions
                +0.109
hai+0.109
kef+0.108
ynski+0.098
erb+0.097
Neg logit contributions
alog-0.096
 chars-0.094
 Guilty-0.086
aukee-0.086
 illum-0.082
Token Night
Token ID5265
Feature activation-1.37e+00

Pos logit contributions
                +0.191
hai+0.191
kef+0.190
ynski+0.172
erb+0.171
Neg logit contributions
alog-0.162
 chars-0.159
 Guilty-0.145
aukee-0.144
 illum-0.138
Token Shy
Token ID36838
Feature activation-1.28e-02

Pos logit contributions
                +0.850
kef+0.846
hai+0.842
ynski+0.738
erb+0.731
Neg logit contributions
alog-1.151
 chars-1.124
 Guilty-1.049
aukee-1.046
 illum-1.014
Tokenam
Token ID321
Feature activation-2.63e-01

Pos logit contributions
                +0.008
hai+0.008
kef+0.008
ynski+0.007
erb+0.007
Neg logit contributions
alog-0.011
 chars-0.011
 Guilty-0.010
aukee-0.010
 illum-0.010
Tokenalan
Token ID25786
Feature activation-3.05e-02

Pos logit contributions
                +0.213
hai+0.212
kef+0.211
ynski+0.192
erb+0.191
Neg logit contributions
alog-0.171
 chars-0.168
 Guilty-0.152
aukee-0.152
 illum-0.145
Token
Token ID447
Feature activation-4.53e-02

Pos logit contributions
                +0.024
hai+0.023
kef+0.023
ynski+0.021
erb+0.021
Neg logit contributions
alog-0.021
 chars-0.021
 Guilty-0.019
aukee-0.019
 illum-0.018
Token
Token ID247
Feature activation+1.17e-01

Pos logit contributions
                +0.038
hai+0.038
kef+0.037
ynski+0.034
erb+0.034
Neg logit contributions
alog-0.028
 chars-0.028
 Guilty-0.025
aukee-0.025
 illum-0.024
Token previously
Token ID4271
Feature activation+1.38e-01

Pos logit contributions
                +0.067
hai+0.067
kef+0.066
ynski+0.060
erb+0.059
Neg logit contributions
alog-0.060
 chars-0.059
 Guilty-0.054
aukee-0.054
 illum-0.052
Token,
Token ID11
Feature activation-4.18e-02

Pos logit contributions
alog+0.094
 chars+0.093
 Guilty+0.084
aukee+0.084
 illum+0.081
Neg logit contributions
                -0.107
hai-0.106
kef-0.106
ynski-0.096
erb-0.095
Token the
Token ID262
Feature activation-7.00e-02

Pos logit contributions
                +0.032
hai+0.032
kef+0.032
ynski+0.029
erb+0.029
Neg logit contributions
alog-0.029
 chars-0.028
 Guilty-0.026
aukee-0.026
 illum-0.025
Token "
Token ID366
Feature activation-8.99e-03

Pos logit contributions
                +0.054
hai+0.054
kef+0.054
ynski+0.049
erb+0.049
Neg logit contributions
alog-0.048
 chars-0.047
 Guilty-0.043
aukee-0.043
 illum-0.041
TokenR
Token ID49
Feature activation-1.11e-01

Pos logit contributions
                +0.007
hai+0.007
kef+0.007
ynski+0.006
erb+0.006
Neg logit contributions
alog-0.006
 chars-0.006
 Guilty-0.006
aukee-0.006
 illum-0.005
Tokeneth
Token ID2788
Feature activation-1.18e+00

Pos logit contributions
                +0.085
hai+0.085
kef+0.084
ynski+0.076
erb+0.076
Neg logit contributions
alog-0.077
 chars-0.076
 Guilty-0.069
aukee-0.069
 illum-0.066
Tokeninking
Token ID8040
Feature activation-1.38e-01

Pos logit contributions
                +0.732
hai+0.728
kef+0.721
ynski+0.633
erb+0.626
Neg logit contributions
alog-1.000
 chars-0.971
 Guilty-0.907
aukee-0.906
 illum-0.877
Token Mathematics
Token ID39448
Feature activation-6.26e-01

Pos logit contributions
                +0.107
hai+0.106
kef+0.106
ynski+0.096
erb+0.095
Neg logit contributions
alog-0.095
 chars-0.093
 Guilty-0.085
aukee-0.085
 illum-0.081
Token:
Token ID25
Feature activation-2.74e-01

Pos logit contributions
                +0.464
hai+0.464
kef+0.461
ynski+0.415
erb+0.412
Neg logit contributions
alog-0.449
 chars-0.441
 Guilty-0.406
aukee-0.403
 illum-0.387
Token Teaching
Token ID38094
Feature activation-2.57e-02

Pos logit contributions
                +0.204
hai+0.204
kef+0.203
ynski+0.183
erb+0.182
Neg logit contributions
alog-0.196
 chars-0.192
 Guilty-0.176
aukee-0.176
 illum-0.169
Token Social
Token ID5483
Feature activation-2.99e-01

Pos logit contributions
                +0.020
hai+0.020
kef+0.019
ynski+0.018
erb+0.017
Neg logit contributions
alog-0.018
 chars-0.018
 Guilty-0.016
aukee-0.016
 illum-0.015
Token to
Token ID284
Feature activation-2.70e-01

Pos logit contributions
                +0.128
hai+0.128
kef+0.127
ynski+0.114
erb+0.114
Neg logit contributions
alog-0.122
 chars-0.120
 Guilty-0.110
aukee-0.110
 illum-0.105
Token some
Token ID617
Feature activation-1.98e-01

Pos logit contributions
                +0.205
hai+0.204
kef+0.203
ynski+0.183
erb+0.182
Neg logit contributions
alog-0.189
 chars-0.186
 Guilty-0.170
aukee-0.169
 illum-0.162
Token extent
Token ID6287
Feature activation-2.51e-01

Pos logit contributions
                +0.128
hai+0.128
kef+0.127
ynski+0.112
erb+0.111
Neg logit contributions
alog-0.162
 chars-0.159
 Guilty-0.147
aukee-0.147
 illum-0.142
Token between
Token ID1022
Feature activation-1.36e-01

Pos logit contributions
                +0.190
hai+0.190
kef+0.189
ynski+0.171
erb+0.169
Neg logit contributions
alog-0.176
 chars-0.173
 Guilty-0.158
aukee-0.157
 illum-0.151
Token wet
Token ID9583
Feature activation-1.77e-02

Pos logit contributions
                +0.102
hai+0.102
kef+0.101
ynski+0.091
erb+0.090
Neg logit contributions
alog-0.097
 chars-0.095
 Guilty-0.087
aukee-0.087
 illum-0.083
Token and
Token ID290
Feature activation-1.16e+00

Pos logit contributions
                +0.014
hai+0.014
kef+0.014
ynski+0.013
erb+0.013
Neg logit contributions
alog-0.012
 chars-0.012
 Guilty-0.011
aukee-0.011
 illum-0.010
Token dry
Token ID5894
Feature activation-3.24e-01

Pos logit contributions
                +0.838
hai+0.836
kef+0.828
ynski+0.750
erb+0.743
Neg logit contributions
alog-0.854
 chars-0.841
aukee-0.770
 Guilty-0.769
 illum-0.742
Token seasons
Token ID7028
Feature activation+9.34e-02

Pos logit contributions
                +0.243
hai+0.242
kef+0.240
ynski+0.217
erb+0.215
Neg logit contributions
alog-0.231
 chars-0.226
 Guilty-0.207
aukee-0.207
 illum-0.198
Token and
Token ID290
Feature activation-3.04e-01

Pos logit contributions
alog+0.063
 chars+0.062
 Guilty+0.056
aukee+0.056
 illum+0.054
Neg logit contributions
                -0.074
hai-0.073
kef-0.073
ynski-0.066
erb-0.066
Token the
Token ID262
Feature activation+6.58e-02

Pos logit contributions
                +0.237
hai+0.237
kef+0.235
ynski+0.213
erb+0.212
Neg logit contributions
alog-0.207
 chars-0.203
 Guilty-0.185
aukee-0.184
 illum-0.177
Token situation
Token ID3074
Feature activation-1.08e-01

Pos logit contributions
alog+0.047
 chars+0.046
 Guilty+0.042
aukee+0.042
 illum+0.040
Neg logit contributions
                -0.049
hai-0.049
kef-0.049
ynski-0.044
erb-0.044
Token.
Token ID13
Feature activation-1.61e-01

Pos logit contributions
alog+0.033
 chars+0.033
 Guilty+0.030
aukee+0.030
 illum+0.028
Neg logit contributions
                -0.043
hai-0.042
kef-0.042
ynski-0.038
erb-0.038
Token
Token ID198
Feature activation-1.12e-01

Pos logit contributions
                +0.129
hai+0.129
kef+0.128
ynski+0.117
erb+0.116
Neg logit contributions
alog-0.106
 chars-0.104
 Guilty-0.094
aukee-0.094
 illum-0.090
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
                +0.089
hai+0.088
kef+0.088
ynski+0.080
erb+0.079
Neg logit contributions
alog-0.075
 chars-0.073
 Guilty-0.067
aukee-0.067
 illum-0.064
TokenCar
Token ID9914
Feature activation-5.46e-01

Pos logit contributions
                +0.079
hai+0.079
kef+0.078
ynski+0.070
erb+0.070
Neg logit contributions
alog-0.078
 chars-0.077
 Guilty-0.070
aukee-0.070
 illum-0.067
Tokenla
Token ID5031
Feature activation-2.28e-01

Pos logit contributions
                +0.416
hai+0.414
kef+0.412
ynski+0.372
erb+0.368
Neg logit contributions
alog-0.383
 chars-0.375
 Guilty-0.343
aukee-0.342
 illum-0.328
Token del
Token ID1619
Feature activation-1.13e+00

Pos logit contributions
                +0.180
hai+0.179
kef+0.178
ynski+0.162
erb+0.161
Neg logit contributions
alog-0.154
 chars-0.151
 Guilty-0.137
aukee-0.137
 illum-0.131
Token P
Token ID350
Feature activation+3.31e-01

Pos logit contributions
                +0.876
hai+0.868
kef+0.866
ynski+0.786
erb+0.774
Neg logit contributions
alog-0.775
 chars-0.756
 Guilty-0.693
aukee-0.685
 illum-0.659
Tokenonte
Token ID38599
Feature activation-2.09e-01

Pos logit contributions
alog+0.270
 chars+0.267
 Guilty+0.247
aukee+0.246
 illum+0.238
Neg logit contributions
                -0.213
hai-0.213
kef-0.212
ynski-0.188
erb-0.187
Token,
Token ID11
Feature activation-3.10e-02

Pos logit contributions
                +0.163
hai+0.163
kef+0.162
ynski+0.146
erb+0.145
Neg logit contributions
alog-0.142
 chars-0.139
 Guilty-0.127
aukee-0.126
 illum-0.121
Token a
Token ID257
Feature activation-1.56e-01

Pos logit contributions
                +0.023
hai+0.023
kef+0.023
ynski+0.021
erb+0.021
Neg logit contributions
alog-0.022
 chars-0.022
 Guilty-0.020
aukee-0.020
 illum-0.019
Token member
Token ID2888
Feature activation+1.83e-01

Pos logit contributions
                +0.123
hai+0.123
kef+0.122
ynski+0.111
erb+0.110
Neg logit contributions
alog-0.104
 chars-0.102
 Guilty-0.093
aukee-0.092
 illum-0.088