TOP ACTIVATIONS
MAX = 1.354

Russia . Image copyright AFP Image caption The Kurds
Serve hot ! <|endoftext|> Image copyright ESA Image caption Go ce
million years . <|endoftext|> Image copyright AFP The Bank
. <|endoftext|> Image copyright publicity Image caption The Greeks
. <|endoftext|> Image copyright Deadline Police have
ported License . <|endoftext|> Image copyright AP Image caption Peace talks
. Facebook Twitter Pinterest Lake Ur mia , Iran
Facebook Twitter Pinterest Liverpool s Adam
unsupported on your device Media caption Nigel Farage : " Lord
unsupported on your device Media caption A drone - camera was
Utah Date : 18 75 Author : E . S .
ACE ). Image caption Council staff s ifted through
Image copyright AFP Image caption The Kurds ' push has
media reported . <|endoftext|> Image caption Some talked of " b
Heritage Foundation , told the Ou ach ita Citizen that the
<|endoftext|> Image copyright ESA Image caption Go ce uses an electric
000 . Image caption Council staff found paperwork with
ISI Sh ak oor , S ., J aff
super - bad if you hit your own minion . It
Brazil ian President Dil ma Rousse ff and Cuban President R

MIN ACTIVATIONS
MIN = -1.177

), Robin and even Black Canary may be involved , with
binary is moved from one partition to another The way I
to deport . He bl ithe ly called for torture but
t quite br anc hed out to many mainstream
, it is a bl under . If policymakers listened to
t apply any lens profiles since there aren
…. How bad is it ?」 " I think
Ste lla - san collapsed ?」 " Yes .
at left guard and got snaps at right guard as needed
methods to measure weak gravitational lens ing , another way in
for the init binary in partition # 25 instead of #
, and use the extra lens size to e ke out
worthy and flu id with the truth .
ua Fighter or games like Gundam and stuff like that are
, partly when the plac id Qu ij ano failed to
BIOS default . With two cards in close proximity we want
init from the 25 th partition (/ dev / mm c
just after mounting the boot partition , forcing a reboot .
perspective from a very small lens . Image via Pan or
less mirror ing the Rab id Pupp ies slate . Essentially

INTERVAL 0.721 - 1.354
CONTAINS 0.064%

my life , and I hope I 've made you proud
pro - cor porate arbit rators . A separate
last Zelda O ST that Ko ji K ondo worked on
fantasy sequence with the M upp ets as babies was so
feel threatened and just m ose y ed on . We

INTERVAL 0.088 - 0.721
CONTAINS 25.018%

and for all Out er space must only be used
700 million . How much does of that actually
repr isal attacks from the P YD . He
of V io Me factory , who
examination ( including at S DS U ), however .

INTERVAL -0.544 - 0.088
CONTAINS 74.461%

( Low Carb High Fat ) diet improves the body
a national unity crisis to save his government .
is wrong . I lost all this money ." Everything was
your goal ? Do you go out of your way to
respond to this beautiful crying stranger who s sitting

INTERVAL -1.177 - -0.544
CONTAINS 0.456%

is open source - the code applicable to this article can
sit out during the Spring Split due to visa issues .
peer - reviewed journal Clim atic Change , provides policymakers with
from the Harvard - Smith sonian Center for Ast roph ysics
, unsustainable .' ( Canadian Federation of Independent Business ) The
Token Russia
Token ID3284
Feature activation-1.28e-02

Pos logit contributions
 Sinn+0.146
fman+0.146
ogue+0.146
iferation+0.145
 hers+0.141
Neg logit contributions
MENTS-0.180
SPONSORED-0.172
CAR-0.171
ERY-0.170
MENT-0.169
Token.
Token ID13
Feature activation+2.12e-02

Pos logit contributions
MENTS+0.027
ERY+0.026
SPONSORED+0.026
CAR+0.026
MENT+0.026
Neg logit contributions
iferation-0.024
fman-0.024
 Sinn-0.024
ogue-0.024
lus-0.024
Token
Token ID198
Feature activation-1.72e-01

Pos logit contributions
 Sinn+0.038
iferation+0.038
fman+0.038
ogue+0.038
lus+0.037
Neg logit contributions
MENTS-0.048
SPONSORED-0.046
ERY-0.045
 disappointment-0.045
CAR-0.045
Token
Token ID198
Feature activation-1.20e-01

Pos logit contributions
MENTS+0.361
SPONSORED+0.347
CAR+0.343
ERY+0.342
MENT+0.340
Neg logit contributions
iferation-0.339
fman-0.337
ogue-0.334
��-0.333
 Sinn-0.330
TokenImage
Token ID5159
Feature activation+6.28e-02

Pos logit contributions
MENTS+0.209
ERY+0.196
SPONSORED+0.195
 disappointment+0.195
CAR+0.193
Neg logit contributions
fman-0.277
ogue-0.276
iferation-0.275
 Sinn-0.274
lus-0.269
Token copyright
Token ID6634
Feature activation+1.35e+00

Pos logit contributions
 Sinn+0.131
iferation+0.131
ogue+0.130
fman+0.130
lus+0.127
Neg logit contributions
MENTS-0.122
SPONSORED-0.120
ERY-0.117
CAR-0.116
MENT-0.115
Token AFP
Token ID8916
Feature activation+5.66e-01

Pos logit contributions
 Sinn+2.442
fman+2.363
iferation+2.329
ogue+2.323
 hers+2.308
Neg logit contributions
MENTS-3.056
SPONSORED-2.988
ERY-2.915
MENT-2.911
CAR-2.896
Token Image
Token ID7412
Feature activation+3.64e-02

Pos logit contributions
fman+0.985
iferation+0.984
ogue+0.978
 Sinn+0.974
lus+0.941
Neg logit contributions
MENTS-1.303
SPONSORED-1.247
CAR-1.245
ERY-1.243
MENT-1.235
Token caption
Token ID8305
Feature activation+1.05e+00

Pos logit contributions
fman+0.093
ogue+0.093
iferation+0.093
 Sinn+0.092
 hers+0.091
Neg logit contributions
MENTS-0.054
SPONSORED-0.051
ERY-0.050
CAR-0.050
 disappointment-0.050
Token The
Token ID383
Feature activation+1.81e-02

Pos logit contributions
 Sinn+1.727
iferation+1.632
fman+1.630
ogue+1.630
 Nurse+1.607
Neg logit contributions
MENTS-2.565
SPONSORED-2.499
ERY-2.461
 disappointment-2.458
MENT-2.444
Token Kurds
Token ID23880
Feature activation-3.25e-01

Pos logit contributions
fman+0.029
ogue+0.029
iferation+0.029
 Sinn+0.028
lus+0.028
Neg logit contributions
MENTS-0.045
SPONSORED-0.043
ERY-0.043
CAR-0.043
MENT-0.043
Token Serve
Token ID35557
Feature activation+1.28e-01

Pos logit contributions
iferation+0.248
fman+0.243
��+0.241
ogue+0.239
ouf+0.232
Neg logit contributions
MENTS-0.322
SPONSORED-0.311
ERY-0.308
MENT-0.305
CAR-0.304
Token hot
Token ID3024
Feature activation+1.01e-01

Pos logit contributions
fman+0.323
 Sinn+0.321
ogue+0.320
 hers+0.319
iferation+0.317
Neg logit contributions
MENTS-0.196
SPONSORED-0.182
ERY-0.182
CAR-0.180
MENT-0.178
Token!
Token ID0
Feature activation+8.15e-02

Pos logit contributions
fman+0.207
iferation+0.207
ogue+0.206
 Sinn+0.204
��+0.201
Neg logit contributions
MENTS-0.201
SPONSORED-0.192
ERY-0.191
CAR-0.190
MENT-0.189
Token<|endoftext|>
Token ID50256
Feature activation+3.20e-02

Pos logit contributions
fman+0.151
ogue+0.150
iferation+0.150
 Sinn+0.150
lus+0.146
Neg logit contributions
MENTS-0.179
SPONSORED-0.171
ERY-0.170
CAR-0.169
 disappointment-0.169
TokenImage
Token ID5159
Feature activation+1.37e-01

Pos logit contributions
fman+0.074
ogue+0.074
iferation+0.073
 Sinn+0.072
lus+0.072
Neg logit contributions
MENTS-0.056
 disappointment-0.052
ERY-0.052
SPONSORED-0.052
MENT-0.051
Token copyright
Token ID6634
Feature activation+1.31e+00

Pos logit contributions
fman+0.292
ogue+0.292
iferation+0.291
 Sinn+0.291
lus+0.283
Neg logit contributions
MENTS-0.260
SPONSORED-0.250
ERY-0.246
CAR-0.245
MENT-0.244
Token ESA
Token ID42212
Feature activation-1.22e-01

Pos logit contributions
 Sinn+2.351
fman+2.271
ogue+2.247
iferation+2.227
 hers+2.214
Neg logit contributions
MENTS-2.966
SPONSORED-2.897
ERY-2.825
MENT-2.823
CAR-2.808
Token Image
Token ID7412
Feature activation-1.10e-01

Pos logit contributions
MENTS+0.282
SPONSORED+0.271
ERY+0.269
CAR+0.269
MENT+0.267
Neg logit contributions
fman-0.213
iferation-0.212
ogue-0.212
 Sinn-0.212
lus-0.205
Token caption
Token ID8305
Feature activation+1.03e+00

Pos logit contributions
MENTS+0.161
SPONSORED+0.151
ERY+0.150
CAR+0.149
MENT+0.148
Neg logit contributions
fman-0.284
ogue-0.283
iferation-0.283
 Sinn-0.281
lus-0.276
Token Go
Token ID1514
Feature activation+2.14e-01

Pos logit contributions
 Sinn+1.737
fman+1.646
iferation+1.640
ogue+1.633
 Nurse+1.629
Neg logit contributions
MENTS-2.461
SPONSORED-2.422
ERY-2.366
MENT-2.358
 disappointment-2.350
Tokence
Token ID344
Feature activation-1.76e-01

Pos logit contributions
fman+0.420
ogue+0.416
 Sinn+0.407
iferation+0.406
onder+0.396
Neg logit contributions
MENTS-0.456
SPONSORED-0.443
ERY-0.438
MENT-0.431
CAR-0.428
Token million
Token ID1510
Feature activation+3.32e-02

Pos logit contributions
MENTS+0.619
SPONSORED+0.593
ERY+0.589
CAR+0.587
MENT+0.584
Neg logit contributions
fman-0.527
ogue-0.526
iferation-0.525
 Sinn-0.521
lus-0.508
Token years
Token ID812
Feature activation-1.42e-01

Pos logit contributions
iferation+0.062
ogue+0.062
 Sinn+0.062
fman+0.062
 hers+0.060
Neg logit contributions
MENTS-0.072
SPONSORED-0.069
ERY-0.068
CAR-0.068
MENT-0.068
Token.
Token ID13
Feature activation+5.89e-02

Pos logit contributions
MENTS+0.270
SPONSORED+0.258
ERY+0.256
CAR+0.254
MENT+0.252
Neg logit contributions
ogue-0.305
fman-0.305
iferation-0.304
 Sinn-0.302
 hers-0.295
Token<|endoftext|>
Token ID50256
Feature activation-8.22e-02

Pos logit contributions
 Sinn+0.104
ogue+0.104
fman+0.103
iferation+0.103
lus+0.100
Neg logit contributions
MENTS-0.134
SPONSORED-0.129
ERY-0.128
CAR-0.128
 disappointment-0.127
TokenImage
Token ID5159
Feature activation-2.38e-02

Pos logit contributions
MENTS+0.143
 disappointment+0.135
SPONSORED+0.134
ERY+0.134
MENT+0.132
Neg logit contributions
fman-0.189
ogue-0.189
iferation-0.188
 Sinn-0.186
lus-0.184
Token copyright
Token ID6634
Feature activation+1.29e+00

Pos logit contributions
MENTS+0.045
SPONSORED+0.043
ERY+0.043
CAR+0.042
MENT+0.042
Neg logit contributions
fman-0.051
ogue-0.051
iferation-0.051
 Sinn-0.051
lus-0.050
Token AFP
Token ID8916
Feature activation+4.39e-01

Pos logit contributions
 Sinn+2.322
fman+2.244
ogue+2.218
iferation+2.203
 hers+2.186
Neg logit contributions
MENTS-2.928
SPONSORED-2.859
ERY-2.787
MENT-2.786
CAR-2.773
Token
Token ID198
Feature activation+3.52e-02

Pos logit contributions
iferation+0.757
fman+0.757
ogue+0.755
 Sinn+0.753
lus+0.727
Neg logit contributions
MENTS-1.012
SPONSORED-0.967
CAR-0.967
ERY-0.964
 disappointment-0.961
Token
Token ID198
Feature activation+3.67e-02

Pos logit contributions
fman+0.067
iferation+0.067
ogue+0.067
 Sinn+0.066
��+0.066
Neg logit contributions
MENTS-0.075
SPONSORED-0.072
CAR-0.072
ERY-0.072
MENT-0.071
TokenThe
Token ID464
Feature activation-1.21e-01

Pos logit contributions
fman+0.083
ogue+0.083
iferation+0.083
 Sinn+0.083
lus+0.081
Neg logit contributions
MENTS-0.065
 disappointment-0.061
ERY-0.061
SPONSORED-0.061
MENT-0.060
Token Bank
Token ID5018
Feature activation-3.21e-01

Pos logit contributions
MENTS+0.280
SPONSORED+0.268
ERY+0.267
CAR+0.266
 disappointment+0.265
Neg logit contributions
fman-0.211
ogue-0.210
iferation-0.210
 Sinn-0.207
��-0.203
Token.
Token ID13
Feature activation+7.14e-03

Pos logit contributions
MENTS+0.119
SPONSORED+0.113
ERY+0.113
CAR+0.112
MENT+0.111
Neg logit contributions
fman-0.124
ogue-0.124
 Sinn-0.123
iferation-0.123
 hers-0.120
Token
Token ID447
Feature activation-1.13e-01

Pos logit contributions
 Sinn+0.015
ogue+0.015
fman+0.015
iferation+0.015
lus+0.015
Neg logit contributions
MENTS-0.014
SPONSORED-0.013
ERY-0.013
CAR-0.013
 disappointment-0.013
Token
Token ID251
Feature activation+8.41e-02

Pos logit contributions
MENTS+0.255
SPONSORED+0.244
ERY+0.243
CAR+0.242
MENT+0.241
Neg logit contributions
fman-0.202
ogue-0.202
iferation-0.202
 Sinn-0.200
lus-0.195
Token<|endoftext|>
Token ID50256
Feature activation-5.10e-02

Pos logit contributions
 Sinn+0.152
fman+0.152
ogue+0.152
iferation+0.152
lus+0.148
Neg logit contributions
MENTS-0.187
SPONSORED-0.180
ERY-0.178
CAR-0.178
 disappointment-0.177
TokenImage
Token ID5159
Feature activation+5.34e-02

Pos logit contributions
MENTS+0.089
 disappointment+0.083
SPONSORED+0.083
ERY+0.083
MENT+0.082
Neg logit contributions
fman-0.118
ogue-0.117
iferation-0.116
 Sinn-0.116
lus-0.114
Token copyright
Token ID6634
Feature activation+1.26e+00

Pos logit contributions
fman+0.115
ogue+0.115
iferation+0.114
 Sinn+0.114
lus+0.111
Neg logit contributions
MENTS-0.101
SPONSORED-0.097
ERY-0.096
CAR-0.095
MENT-0.095
Token publicity
Token ID19403
Feature activation-2.65e-02

Pos logit contributions
 Sinn+2.267
fman+2.192
ogue+2.170
iferation+2.153
 hers+2.136
Neg logit contributions
MENTS-2.862
SPONSORED-2.792
ERY-2.724
MENT-2.724
CAR-2.712
Token Image
Token ID7412
Feature activation-7.04e-02

Pos logit contributions
MENTS+0.058
ERY+0.055
SPONSORED+0.055
MENT+0.055
CAR+0.055
Neg logit contributions
ogue-0.049
fman-0.049
 Sinn-0.049
iferation-0.048
lus-0.047
Token caption
Token ID8305
Feature activation+9.38e-01

Pos logit contributions
MENTS+0.105
SPONSORED+0.098
ERY+0.097
CAR+0.097
MENT+0.096
Neg logit contributions
fman-0.181
ogue-0.180
iferation-0.180
 Sinn-0.179
 hers-0.176
Token The
Token ID383
Feature activation-5.99e-02

Pos logit contributions
 Sinn+1.637
ogue+1.579
fman+1.573
iferation+1.563
 Nurse+1.536
Neg logit contributions
MENTS-2.186
SPONSORED-2.124
ERY-2.086
CAR-2.080
MENT-2.080
Token Greeks
Token ID25059
Feature activation+7.56e-02

Pos logit contributions
MENTS+0.129
SPONSORED+0.123
ERY+0.123
CAR+0.122
MENT+0.121
Neg logit contributions
fman-0.114
ogue-0.114
iferation-0.114
 Sinn-0.113
lus-0.110
Token.
Token ID13
Feature activation-3.65e-02

Pos logit contributions
MENTS+0.514
SPONSORED+0.492
ERY+0.490
CAR+0.487
MENT+0.484
Neg logit contributions
ogue-0.465
fman-0.464
iferation-0.462
 Sinn-0.459
lus-0.447
Token
Token ID447
Feature activation-1.30e-01

Pos logit contributions
MENTS+0.078
SPONSORED+0.075
ERY+0.075
CAR+0.074
 disappointment+0.074
Neg logit contributions
fman-0.069
ogue-0.069
iferation-0.069
 Sinn-0.069
lus-0.067
Token
Token ID251
Feature activation+1.88e-02

Pos logit contributions
MENTS+0.295
SPONSORED+0.283
ERY+0.282
CAR+0.281
MENT+0.279
Neg logit contributions
fman-0.230
ogue-0.230
iferation-0.229
 Sinn-0.227
��-0.224
Token<|endoftext|>
Token ID50256
Feature activation-8.60e-02

Pos logit contributions
fman+0.034
ogue+0.034
iferation+0.034
 Sinn+0.034
lus+0.033
Neg logit contributions
MENTS-0.042
SPONSORED-0.040
ERY-0.040
CAR-0.040
 disappointment-0.040
TokenImage
Token ID5159
Feature activation+3.83e-02

Pos logit contributions
MENTS+0.150
 disappointment+0.141
SPONSORED+0.140
ERY+0.140
MENT+0.138
Neg logit contributions
fman-0.198
ogue-0.198
iferation-0.197
 Sinn-0.195
lus-0.193
Token copyright
Token ID6634
Feature activation+1.25e+00

Pos logit contributions
fman+0.082
ogue+0.082
iferation+0.082
 Sinn+0.082
lus+0.080
Neg logit contributions
MENTS-0.072
SPONSORED-0.069
ERY-0.069
CAR-0.068
MENT-0.068
Token Deadline
Token ID35954
Feature activation-4.59e-02

Pos logit contributions
 Sinn+2.249
fman+2.178
ogue+2.155
iferation+2.138
 hers+2.121
Neg logit contributions
MENTS-2.839
SPONSORED-2.769
ERY-2.704
MENT-2.702
CAR-2.690
Token
Token ID198
Feature activation+1.42e-01

Pos logit contributions
MENTS+0.102
SPONSORED+0.098
ERY+0.097
CAR+0.097
MENT+0.096
Neg logit contributions
fman-0.084
ogue-0.084
iferation-0.084
 Sinn-0.083
lus-0.081
Token
Token ID198
Feature activation+1.41e-01

Pos logit contributions
fman+0.269
iferation+0.268
ogue+0.268
 Sinn+0.265
��+0.260
Neg logit contributions
MENTS-0.308
SPONSORED-0.295
ERY-0.293
CAR-0.293
MENT-0.291
TokenPolice
Token ID9039
Feature activation+1.69e-01

Pos logit contributions
ogue+0.314
fman+0.313
 Sinn+0.312
iferation+0.311
lus+0.307
Neg logit contributions
MENTS-0.253
 disappointment-0.240
ERY-0.238
SPONSORED-0.238
MENT-0.234
Token have
Token ID423
Feature activation-8.51e-02

Pos logit contributions
 Sinn+0.304
ogue+0.301
iferation+0.299
fman+0.298
 hers+0.290
Neg logit contributions
MENTS-0.384
SPONSORED-0.373
ERY-0.367
MENT-0.363
CAR-0.362
Tokenported
Token ID9213
Feature activation+2.01e-01

Pos logit contributions
iferation+0.622
ogue+0.622
fman+0.613
 Sinn+0.608
ouf+0.595
Neg logit contributions
MENTS-0.973
ERY-0.932
 disappointment-0.931
CAR-0.931
SPONSORED-0.927
Token License
Token ID13789
Feature activation+2.74e-01

Pos logit contributions
iferation+0.304
 Sinn+0.303
ogue+0.302
fman+0.300
lus+0.292
Neg logit contributions
MENTS-0.510
SPONSORED-0.488
CAR-0.487
ERY-0.485
MENT-0.483
Token.
Token ID13
Feature activation-6.26e-02

Pos logit contributions
fman+0.575
iferation+0.572
ogue+0.572
 Sinn+0.571
 hers+0.555
Neg logit contributions
MENTS-0.534
ERY-0.509
SPONSORED-0.506
CAR-0.501
 disappointment-0.498
Token<|endoftext|>
Token ID50256
Feature activation-3.64e-02

Pos logit contributions
MENTS+0.144
SPONSORED+0.138
ERY+0.138
CAR+0.137
 disappointment+0.137
Neg logit contributions
 Sinn-0.108
fman-0.108
iferation-0.108
ogue-0.108
ouf-0.105
TokenImage
Token ID5159
Feature activation-6.86e-02

Pos logit contributions
MENTS+0.063
 disappointment+0.060
SPONSORED+0.059
ERY+0.059
MENT+0.059
Neg logit contributions
fman-0.084
ogue-0.084
iferation-0.083
 Sinn-0.082
lus-0.082
Token copyright
Token ID6634
Feature activation+1.20e+00

Pos logit contributions
MENTS+0.129
SPONSORED+0.124
ERY+0.122
CAR+0.122
MENT+0.121
Neg logit contributions
fman-0.148
ogue-0.148
iferation-0.148
 Sinn-0.147
lus-0.144
Token AP
Token ID3486
Feature activation+4.93e-01

Pos logit contributions
 Sinn+2.156
fman+2.092
ogue+2.068
iferation+2.055
 hers+2.037
Neg logit contributions
MENTS-2.715
SPONSORED-2.647
ERY-2.585
MENT-2.583
CAR-2.572
Token Image
Token ID7412
Feature activation-1.94e-01

Pos logit contributions
iferation+0.846
ogue+0.843
 Sinn+0.843
fman+0.842
lus+0.813
Neg logit contributions
MENTS-1.138
SPONSORED-1.090
 disappointment-1.088
ERY-1.085
CAR-1.084
Token caption
Token ID8305
Feature activation+9.86e-01

Pos logit contributions
MENTS+0.282
SPONSORED+0.264
ERY+0.262
CAR+0.261
MENT+0.258
Neg logit contributions
fman-0.506
iferation-0.505
ogue-0.505
 Sinn-0.501
��-0.493
Token Peace
Token ID12689
Feature activation+2.49e-01

Pos logit contributions
 Sinn+1.709
fman+1.633
iferation+1.632
ogue+1.629
 Nurse+1.599
Neg logit contributions
MENTS-2.315
SPONSORED-2.243
ERY-2.213
MENT-2.205
 disappointment-2.205
Token talks
Token ID6130
Feature activation+1.92e-01

Pos logit contributions
ogue+0.489
 Sinn+0.487
iferation+0.484
fman+0.484
ouf+0.471
Neg logit contributions
MENTS-0.523
SPONSORED-0.500
ERY-0.499
CAR-0.495
MENT-0.491
Token.
Token ID13
Feature activation-5.10e-03

Pos logit contributions
ogue+0.246
fman+0.246
iferation+0.245
 Sinn+0.244
ouf+0.238
Neg logit contributions
MENTS-0.248
SPONSORED-0.235
ERY-0.235
CAR-0.233
MENT-0.232
Token
Token ID198
Feature activation+3.70e-02

Pos logit contributions
MENTS+0.012
SPONSORED+0.011
ERY+0.011
CAR+0.011
 disappointment+0.011
Neg logit contributions
ogue-0.009
fman-0.009
iferation-0.009
 Sinn-0.009
lus-0.009
Token
Token ID198
Feature activation+7.46e-02

Pos logit contributions
iferation+0.072
fman+0.072
ogue+0.071
 Sinn+0.070
��+0.070
Neg logit contributions
MENTS-0.079
SPONSORED-0.076
CAR-0.075
ERY-0.075
MENT-0.074
TokenFacebook
Token ID12025
Feature activation+1.80e-01

Pos logit contributions
fman+0.168
ogue+0.168
iferation+0.167
 Sinn+0.166
lus+0.164
Neg logit contributions
MENTS-0.133
 disappointment-0.125
ERY-0.125
SPONSORED-0.124
CAR-0.124
Token Twitter
Token ID3009
Feature activation+4.55e-01

Pos logit contributions
ogue+0.393
iferation+0.392
fman+0.391
 Sinn+0.389
 hers+0.379
Neg logit contributions
MENTS-0.336
SPONSORED-0.325
CAR-0.317
ERY-0.317
MENT-0.315
Token Pinterest
Token ID17334
Feature activation+1.15e+00

Pos logit contributions
fman+0.965
ogue+0.965
 Sinn+0.954
iferation+0.954
lus+0.929
Neg logit contributions
MENTS-0.880
SPONSORED-0.836
CAR-0.834
ERY-0.833
MENT-0.825
Token Lake
Token ID6233
Feature activation+2.89e-01

Pos logit contributions
 Sinn+1.969
fman+1.901
iferation+1.893
ogue+1.888
ouf+1.858
Neg logit contributions
MENTS-2.669
SPONSORED-2.607
CAR-2.561
ERY-2.545
MENT-2.537
Token Ur
Token ID8799
Feature activation+2.35e-01

Pos logit contributions
ogue+0.515
 Sinn+0.515
fman+0.514
iferation+0.510
ouf+0.496
Neg logit contributions
MENTS-0.655
SPONSORED-0.631
CAR-0.626
ERY-0.622
MENT-0.618
Tokenmia
Token ID20730
Feature activation+7.17e-03

Pos logit contributions
ogue+0.586
fman+0.585
iferation+0.580
 Sinn+0.575
lus+0.569
Neg logit contributions
MENTS-0.372
SPONSORED-0.349
ERY-0.347
CAR-0.346
 disappointment-0.344
Token,
Token ID11
Feature activation+1.26e-01

Pos logit contributions
 Sinn+0.014
ogue+0.014
fman+0.014
iferation+0.014
lus+0.013
Neg logit contributions
MENTS-0.015
SPONSORED-0.015
ERY-0.015
CAR-0.015
MENT-0.015
Token Iran
Token ID4068
Feature activation-1.04e-01

Pos logit contributions
fman+0.248
 Sinn+0.247
ogue+0.247
iferation+0.246
 hers+0.240
Neg logit contributions
MENTS-0.263
SPONSORED-0.250
ERY-0.249
CAR-0.249
MENT-0.247
Token
Token ID251
Feature activation+1.78e-01

Pos logit contributions
ogue+0.005
iferation+0.005
fman+0.005
 Sinn+0.005
��+0.005
Neg logit contributions
MENTS-0.005
SPONSORED-0.005
ERY-0.005
CAR-0.005
MENT-0.005
Token
Token ID198
Feature activation-9.28e-02

Pos logit contributions
ogue+0.332
fman+0.331
 Sinn+0.331
iferation+0.328
lus+0.322
Neg logit contributions
MENTS-0.387
SPONSORED-0.372
ERY-0.369
CAR-0.366
MENT-0.363
Token
Token ID198
Feature activation-4.94e-02

Pos logit contributions
MENTS+0.209
SPONSORED+0.200
 disappointment+0.200
ERY+0.199
CAR+0.197
Neg logit contributions
fman-0.165
ogue-0.164
 Sinn-0.163
iferation-0.163
lus-0.159
TokenFacebook
Token ID12025
Feature activation+1.28e-01

Pos logit contributions
MENTS+0.086
SPONSORED+0.081
ERY+0.081
 disappointment+0.081
CAR+0.080
Neg logit contributions
fman-0.113
ogue-0.113
iferation-0.112
 Sinn-0.112
lus-0.110
Token Twitter
Token ID3009
Feature activation+4.60e-01

Pos logit contributions
ogue+0.279
 Sinn+0.278
fman+0.277
iferation+0.277
lus+0.270
Neg logit contributions
MENTS-0.240
SPONSORED-0.231
ERY-0.227
CAR-0.225
MENT-0.225
Token Pinterest
Token ID17334
Feature activation+1.13e+00

Pos logit contributions
fman+0.878
ogue+0.875
 Sinn+0.869
iferation+0.865
lus+0.841
Neg logit contributions
MENTS-0.988
SPONSORED-0.945
ERY-0.941
CAR-0.940
MENT-0.932
Token Liverpool
Token ID11761
Feature activation-1.73e-01

Pos logit contributions
 Sinn+2.075
ogue+2.001
fman+1.973
iferation+1.959
 Nurse+1.959
Neg logit contributions
MENTS-2.512
SPONSORED-2.475
ERY-2.379
CAR-2.374
MENT-2.369
Token
Token ID447
Feature activation+2.30e-01

Pos logit contributions
MENTS+0.365
SPONSORED+0.349
ERY+0.347
CAR+0.343
MENT+0.342
Neg logit contributions
ogue-0.340
fman-0.339
 Sinn-0.337
iferation-0.336
lus-0.328
Token
Token ID247
Feature activation+2.14e-01

Pos logit contributions
ogue+0.444
fman+0.443
iferation+0.442
 Sinn+0.438
��+0.436
Neg logit contributions
MENTS-0.484
SPONSORED-0.463
ERY-0.460
CAR-0.459
 disappointment-0.457
Tokens
Token ID82
Feature activation+3.54e-02

Pos logit contributions
ogue+0.413
fman+0.412
iferation+0.411
 Sinn+0.408
lus+0.406
Neg logit contributions
MENTS-0.452
SPONSORED-0.435
ERY-0.427
CAR-0.424
MENT-0.424
Token Adam
Token ID7244
Feature activation+1.21e-01

Pos logit contributions
ogue+0.068
fman+0.068
 Sinn+0.068
iferation+0.068
ouf+0.066
Neg logit contributions
MENTS-0.075
SPONSORED-0.072
ERY-0.071
CAR-0.071
MENT-0.071
Token unsupported
Token ID24222
Feature activation+1.97e-01

Pos logit contributions
fman+0.070
 Sinn+0.069
ogue+0.069
iferation+0.069
lus+0.067
Neg logit contributions
MENTS-0.117
SPONSORED-0.113
ERY-0.112
CAR-0.112
MENT-0.111
Token on
Token ID319
Feature activation-3.24e-02

Pos logit contributions
fman+0.338
ogue+0.337
iferation+0.336
 Sinn+0.334
lus+0.325
Neg logit contributions
MENTS-0.461
SPONSORED-0.443
ERY-0.440
CAR-0.439
MENT-0.437
Token your
Token ID534
Feature activation+8.33e-02

Pos logit contributions
MENTS+0.074
SPONSORED+0.071
ERY+0.071
CAR+0.071
MENT+0.070
Neg logit contributions
fman-0.057
iferation-0.057
ogue-0.057
 Sinn-0.057
��-0.055
Token device
Token ID3335
Feature activation-2.95e-03

Pos logit contributions
ogue+0.205
fman+0.204
iferation+0.204
 Sinn+0.203
lus+0.200
Neg logit contributions
MENTS-0.133
SPONSORED-0.126
ERY-0.124
CAR-0.124
MENT-0.122
Token Media
Token ID6343
Feature activation-1.10e-02

Pos logit contributions
MENTS+0.006
SPONSORED+0.006
CAR+0.006
ERY+0.006
 disappointment+0.006
Neg logit contributions
 Sinn-0.006
ogue-0.006
fman-0.006
iferation-0.006
lus-0.006
Token caption
Token ID8305
Feature activation+1.11e+00

Pos logit contributions
MENTS+0.016
SPONSORED+0.015
ERY+0.015
CAR+0.015
MENT+0.015
Neg logit contributions
fman-0.028
ogue-0.028
iferation-0.028
 Sinn-0.028
lus-0.028
Token Nigel
Token ID28772
Feature activation+1.87e-01

Pos logit contributions
 Sinn+1.969
ogue+1.861
fman+1.858
iferation+1.855
 Nurse+1.848
Neg logit contributions
MENTS-2.555
SPONSORED-2.486
ERY-2.433
MENT-2.430
 disappointment-2.423
Token Farage
Token ID33904
Feature activation+9.71e-02

Pos logit contributions
ogue+0.183
iferation+0.183
fman+0.182
 Sinn+0.181
lus+0.171
Neg logit contributions
MENTS-0.575
SPONSORED-0.558
ERY-0.555
CAR-0.554
MENT-0.552
Token:
Token ID25
Feature activation+1.84e-01

Pos logit contributions
ogue+0.210
fman+0.209
 Sinn+0.209
iferation+0.208
lus+0.204
Neg logit contributions
MENTS-0.185
SPONSORED-0.175
ERY-0.174
CAR-0.173
MENT-0.172
Token "
Token ID366
Feature activation+7.34e-02

Pos logit contributions
 Sinn+0.320
iferation+0.318
ogue+0.316
fman+0.316
lus+0.306
Neg logit contributions
MENTS-0.426
SPONSORED-0.412
ERY-0.406
CAR-0.405
MENT-0.404
TokenLord
Token ID22438
Feature activation+1.06e-01

Pos logit contributions
fman+0.166
ogue+0.166
iferation+0.165
 Sinn+0.165
lus+0.162
Neg logit contributions
MENTS-0.131
SPONSORED-0.123
ERY-0.123
 disappointment-0.122
CAR-0.121
Token unsupported
Token ID24222
Feature activation+2.83e-01

Pos logit contributions
fman+0.013
ogue+0.013
iferation+0.013
 Sinn+0.013
lus+0.013
Neg logit contributions
MENTS-0.022
SPONSORED-0.022
ERY-0.021
CAR-0.021
MENT-0.021
Token on
Token ID319
Feature activation-7.25e-02

Pos logit contributions
fman+0.485
ogue+0.483
iferation+0.482
 Sinn+0.478
lus+0.466
Neg logit contributions
MENTS-0.663
SPONSORED-0.637
ERY-0.633
CAR-0.632
MENT-0.628
Token your
Token ID534
Feature activation+2.80e-02

Pos logit contributions
MENTS+0.165
SPONSORED+0.158
ERY+0.157
CAR+0.157
MENT+0.156
Neg logit contributions
fman-0.129
iferation-0.129
ogue-0.128
 Sinn-0.127
��-0.125
Token device
Token ID3335
Feature activation-1.05e-01

Pos logit contributions
ogue+0.070
fman+0.070
iferation+0.069
 Sinn+0.069
lus+0.068
Neg logit contributions
MENTS-0.044
SPONSORED-0.041
ERY-0.041
CAR-0.041
MENT-0.040
Token Media
Token ID6343
Feature activation-6.17e-02

Pos logit contributions
MENTS+0.216
SPONSORED+0.207
CAR+0.206
ERY+0.205
 disappointment+0.203
Neg logit contributions
 Sinn-0.210
ogue-0.210
fman-0.209
iferation-0.208
lus-0.204
Token caption
Token ID8305
Feature activation+1.09e+00

Pos logit contributions
MENTS+0.091
SPONSORED+0.085
ERY+0.084
CAR+0.084
MENT+0.083
Neg logit contributions
fman-0.159
ogue-0.159
iferation-0.159
 Sinn-0.158
lus-0.155
Token A
Token ID317
Feature activation-1.73e-01

Pos logit contributions
 Sinn+1.948
ogue+1.841
fman+1.836
iferation+1.833
 Nurse+1.831
Neg logit contributions
MENTS-2.531
SPONSORED-2.468
ERY-2.410
MENT-2.407
 disappointment-2.398
Token drone
Token ID12170
Feature activation-5.27e-01

Pos logit contributions
MENTS+0.374
SPONSORED+0.360
ERY+0.355
CAR+0.354
MENT+0.353
Neg logit contributions
fman-0.329
ogue-0.328
 Sinn-0.327
iferation-0.327
lus-0.317
Token-
Token ID12
Feature activation-8.16e-02

Pos logit contributions
MENTS+1.105
SPONSORED+1.054
ERY+1.048
CAR+1.046
MENT+1.040
Neg logit contributions
fman-1.030
ogue-1.030
iferation-1.027
 Sinn-1.020
lus-0.995
Tokencamera
Token ID25695
Feature activation+8.31e-02

Pos logit contributions
MENTS+0.177
SPONSORED+0.169
ERY+0.169
CAR+0.168
 disappointment+0.167
Neg logit contributions
fman-0.153
ogue-0.153
iferation-0.153
 Sinn-0.151
lus-0.149
Token was
Token ID373
Feature activation-1.23e-01

Pos logit contributions
ogue+0.173
 Sinn+0.172
fman+0.171
iferation+0.170
 hers+0.167
Neg logit contributions
MENTS-0.166
SPONSORED-0.157
CAR-0.157
ERY-0.156
MENT-0.155
Token Utah
Token ID10202
Feature activation+1.76e-01

Pos logit contributions
ogue+0.031
 Sinn+0.031
fman+0.031
iferation+0.031
 hers+0.031
Neg logit contributions
MENTS-0.024
CAR-0.023
SPONSORED-0.023
ERY-0.023
MENT-0.022
Token Date
Token ID7536
Feature activation+1.14e-01

Pos logit contributions
ogue+0.338
fman+0.337
 Sinn+0.337
iferation+0.334
 hers+0.326
Neg logit contributions
MENTS-0.371
SPONSORED-0.355
ERY-0.353
CAR-0.352
MENT-0.350
Token:
Token ID25
Feature activation+1.49e-01

Pos logit contributions
fman+0.231
ogue+0.230
 Sinn+0.229
iferation+0.229
lus+0.223
Neg logit contributions
MENTS-0.232
SPONSORED-0.223
ERY-0.221
CAR-0.219
MENT-0.218
Token 18
Token ID1248
Feature activation-2.95e-03

Pos logit contributions
fman+0.266
ogue+0.266
iferation+0.265
 Sinn+0.264
lus+0.256
Neg logit contributions
MENTS-0.335
SPONSORED-0.322
ERY-0.320
CAR-0.319
MENT-0.317
Token75
Token ID2425
Feature activation+2.95e-01

Pos logit contributions
MENTS+0.005
SPONSORED+0.005
ERY+0.005
CAR+0.005
 disappointment+0.005
Neg logit contributions
fman-0.007
ogue-0.007
iferation-0.007
 Sinn-0.007
lus-0.007
Token Author
Token ID6434
Feature activation+1.09e+00

Pos logit contributions
ogue+0.534
fman+0.532
iferation+0.530
 Sinn+0.530
lus+0.514
Neg logit contributions
MENTS-0.661
SPONSORED-0.634
ERY-0.631
CAR-0.628
MENT-0.625
Token:
Token ID25
Feature activation+1.00e-01

Pos logit contributions
ogue+2.511
 Sinn+2.480
iferation+2.462
fman+2.447
 Nurse+2.386
Neg logit contributions
MENTS-1.891
SPONSORED-1.831
CAR-1.769
 disappointment-1.762
ERY-1.760
Token E
Token ID412
Feature activation+2.27e-01

Pos logit contributions
ogue+0.179
fman+0.178
 Sinn+0.178
iferation+0.178
 hers+0.172
Neg logit contributions
MENTS-0.226
SPONSORED-0.218
ERY-0.215
CAR-0.215
MENT-0.214
Token.
Token ID13
Feature activation-5.17e-02

Pos logit contributions
 Sinn+0.508
ogue+0.507
fman+0.504
iferation+0.501
lus+0.491
Neg logit contributions
MENTS-0.416
SPONSORED-0.398
CAR-0.389
 disappointment-0.389
ERY-0.387
TokenS
Token ID50
Feature activation+2.01e-01

Pos logit contributions
MENTS+0.123
SPONSORED+0.118
ERY+0.117
 disappointment+0.117
CAR+0.116
Neg logit contributions
 Sinn-0.087
ogue-0.086
fman-0.086
iferation-0.085
lus-0.083
Token.
Token ID13
Feature activation-9.09e-02

Pos logit contributions
ogue+0.447
 Sinn+0.443
fman+0.441
iferation+0.437
lus+0.431
Neg logit contributions
MENTS-0.371
SPONSORED-0.354
CAR-0.351
 disappointment-0.349
ERY-0.346
TokenACE
Token ID11598
Feature activation-2.36e-01

Pos logit contributions
ogue+0.507
iferation+0.496
fman+0.494
lus+0.489
ouf+0.488
Neg logit contributions
MENTS-0.392
SPONSORED-0.367
 disappointment-0.361
MENT-0.359
ERY-0.355
Token).
Token ID737
Feature activation-2.34e-02

Pos logit contributions
MENTS+0.492
SPONSORED+0.471
ERY+0.467
 disappointment+0.464
CAR+0.464
Neg logit contributions
iferation-0.462
fman-0.461
ogue-0.461
 Sinn-0.456
lus-0.448
Token
Token ID198
Feature activation-9.66e-02

Pos logit contributions
MENTS+0.052
SPONSORED+0.050
ERY+0.050
CAR+0.050
 disappointment+0.049
Neg logit contributions
ogue-0.043
iferation-0.043
 Sinn-0.043
fman-0.042
lus-0.041
Token
Token ID198
Feature activation-8.85e-02

Pos logit contributions
MENTS+0.204
SPONSORED+0.196
CAR+0.194
ERY+0.194
MENT+0.193
Neg logit contributions
iferation-0.189
fman-0.189
ogue-0.187
��-0.186
 Sinn-0.185
TokenImage
Token ID5159
Feature activation+1.12e-02

Pos logit contributions
MENTS+0.157
 disappointment+0.148
ERY+0.148
SPONSORED+0.147
CAR+0.146
Neg logit contributions
ogue-0.200
fman-0.200
iferation-0.199
 Sinn-0.198
lus-0.195
Token caption
Token ID8305
Feature activation+1.06e+00

Pos logit contributions
iferation+0.024
 Sinn+0.024
ogue+0.024
fman+0.024
lus+0.023
Neg logit contributions
MENTS-0.021
SPONSORED-0.020
ERY-0.020
CAR-0.020
MENT-0.020
Token Council
Token ID4281
Feature activation-1.79e-01

Pos logit contributions
 Sinn+1.841
ogue+1.762
 Nurse+1.744
fman+1.743
iferation+1.738
Neg logit contributions
MENTS-2.486
SPONSORED-2.421
CAR-2.360
MENT-2.360
 disappointment-2.358
Token staff
Token ID3085
Feature activation+3.65e-02

Pos logit contributions
MENTS+0.377
SPONSORED+0.360
ERY+0.357
CAR+0.356
MENT+0.355
Neg logit contributions
fman-0.350
ogue-0.349
iferation-0.349
 Sinn-0.347
lus-0.338
Token s
Token ID264
Feature activation+2.33e-02

Pos logit contributions
fman+0.068
ogue+0.068
iferation+0.068
 Sinn+0.067
lus+0.066
Neg logit contributions
MENTS-0.080
SPONSORED-0.077
ERY-0.076
CAR-0.076
MENT-0.075
Tokenifted
Token ID21715
Feature activation-2.58e-01

Pos logit contributions
fman+0.041
ogue+0.041
iferation+0.041
 Sinn+0.041
lus+0.040
Neg logit contributions
MENTS-0.054
SPONSORED-0.051
ERY-0.051
CAR-0.051
MENT-0.051
Token through
Token ID832
Feature activation+7.65e-02

Pos logit contributions
MENTS+0.506
SPONSORED+0.484
ERY+0.479
CAR+0.478
MENT+0.474
Neg logit contributions
ogue-0.540
fman-0.538
 Sinn-0.536
iferation-0.535
 hers-0.522
Token
Token ID198
Feature activation-1.20e-01

Pos logit contributions
MENTS+0.361
SPONSORED+0.347
CAR+0.343
ERY+0.342
MENT+0.340
Neg logit contributions
iferation-0.339
fman-0.337
ogue-0.334
��-0.333
 Sinn-0.330
TokenImage
Token ID5159
Feature activation+6.28e-02

Pos logit contributions
MENTS+0.209
ERY+0.196
SPONSORED+0.195
 disappointment+0.195
CAR+0.193
Neg logit contributions
fman-0.277
ogue-0.276
iferation-0.275
 Sinn-0.274
lus-0.269
Token copyright
Token ID6634
Feature activation+1.35e+00

Pos logit contributions
 Sinn+0.131
iferation+0.131
ogue+0.130
fman+0.130
lus+0.127
Neg logit contributions
MENTS-0.122
SPONSORED-0.120
ERY-0.117
CAR-0.116
MENT-0.115
Token AFP
Token ID8916
Feature activation+5.66e-01

Pos logit contributions
 Sinn+2.442
fman+2.363
iferation+2.329
ogue+2.323
 hers+2.308
Neg logit contributions
MENTS-3.056
SPONSORED-2.988
ERY-2.915
MENT-2.911
CAR-2.896
Token Image
Token ID7412
Feature activation+3.64e-02

Pos logit contributions
fman+0.985
iferation+0.984
ogue+0.978
 Sinn+0.974
lus+0.941
Neg logit contributions
MENTS-1.303
SPONSORED-1.247
CAR-1.245
ERY-1.243
MENT-1.235
Token caption
Token ID8305
Feature activation+1.05e+00

Pos logit contributions
fman+0.093
ogue+0.093
iferation+0.093
 Sinn+0.092
 hers+0.091
Neg logit contributions
MENTS-0.054
SPONSORED-0.051
ERY-0.050
CAR-0.050
 disappointment-0.050
Token The
Token ID383
Feature activation+1.81e-02

Pos logit contributions
 Sinn+1.727
iferation+1.632
fman+1.630
ogue+1.630
 Nurse+1.607
Neg logit contributions
MENTS-2.565
SPONSORED-2.499
ERY-2.461
 disappointment-2.458
MENT-2.444
Token Kurds
Token ID23880
Feature activation-3.25e-01

Pos logit contributions
fman+0.029
ogue+0.029
iferation+0.029
 Sinn+0.028
lus+0.028
Neg logit contributions
MENTS-0.045
SPONSORED-0.043
ERY-0.043
CAR-0.043
MENT-0.043
Token'
Token ID6
Feature activation-1.40e-01

Pos logit contributions
MENTS+0.714
SPONSORED+0.684
CAR+0.681
ERY+0.681
 disappointment+0.676
Neg logit contributions
fman-0.600
iferation-0.598
ogue-0.597
 Sinn-0.589
��-0.578
Token push
Token ID4574
Feature activation+4.50e-02

Pos logit contributions
MENTS+0.289
SPONSORED+0.277
ERY+0.274
CAR+0.273
MENT+0.272
Neg logit contributions
fman-0.278
ogue-0.278
iferation-0.278
 Sinn-0.277
lus-0.270
Token has
Token ID468
Feature activation+5.39e-03

Pos logit contributions
ogue+0.085
fman+0.085
iferation+0.084
 Sinn+0.084
lus+0.082
Neg logit contributions
MENTS-0.098
SPONSORED-0.093
ERY-0.093
CAR-0.093
MENT-0.092
Token media
Token ID2056
Feature activation-3.96e-01

Pos logit contributions
MENTS+0.553
SPONSORED+0.528
ERY+0.525
CAR+0.524
MENT+0.518
Neg logit contributions
ogue-0.527
fman-0.525
iferation-0.523
 Sinn-0.522
lus-0.509
Token reported
Token ID2098
Feature activation+2.37e-02

Pos logit contributions
MENTS+0.866
SPONSORED+0.829
ERY+0.824
CAR+0.822
MENT+0.818
Neg logit contributions
fman-0.738
iferation-0.736
ogue-0.735
 Sinn-0.728
��-0.712
Token.
Token ID13
Feature activation-6.73e-02

Pos logit contributions
fman+0.052
ogue+0.051
 Sinn+0.051
iferation+0.051
lus+0.050
Neg logit contributions
MENTS-0.045
SPONSORED-0.042
ERY-0.042
CAR-0.042
MENT-0.042
Token<|endoftext|>
Token ID50256
Feature activation+1.64e-02

Pos logit contributions
MENTS+0.153
SPONSORED+0.147
CAR+0.146
ERY+0.146
 disappointment+0.146
Neg logit contributions
fman-0.119
 Sinn-0.119
ogue-0.118
iferation-0.118
ouf-0.115
TokenImage
Token ID5159
Feature activation+3.09e-02

Pos logit contributions
fman+0.038
ogue+0.038
iferation+0.037
 Sinn+0.037
lus+0.037
Neg logit contributions
MENTS-0.028
 disappointment-0.027
SPONSORED-0.027
ERY-0.027
MENT-0.026
Token caption
Token ID8305
Feature activation+1.05e+00

Pos logit contributions
fman+0.066
ogue+0.066
iferation+0.066
 Sinn+0.066
lus+0.064
Neg logit contributions
MENTS-0.059
SPONSORED-0.056
ERY-0.056
CAR-0.055
MENT-0.055
Token Some
Token ID2773
Feature activation-4.82e-02

Pos logit contributions
 Sinn+1.834
fman+1.756
ogue+1.749
iferation+1.748
 Nurse+1.720
Neg logit contributions
MENTS-2.445
SPONSORED-2.381
ERY-2.335
MENT-2.328
CAR-2.324
Token talked
Token ID6619
Feature activation+2.61e-01

Pos logit contributions
MENTS+0.104
SPONSORED+0.100
ERY+0.099
CAR+0.099
MENT+0.098
Neg logit contributions
ogue-0.091
fman-0.091
 Sinn-0.091
iferation-0.091
lus-0.088
Token of
Token ID286
Feature activation-6.00e-02

Pos logit contributions
ogue+0.473
fman+0.473
iferation+0.472
 Sinn+0.469
lus+0.456
Neg logit contributions
MENTS-0.585
SPONSORED-0.559
ERY-0.556
CAR-0.556
MENT-0.552
Token "
Token ID366
Feature activation+2.27e-03

Pos logit contributions
MENTS+0.126
SPONSORED+0.121
ERY+0.120
CAR+0.120
MENT+0.119
Neg logit contributions
 Sinn-0.117
fman-0.117
ogue-0.117
iferation-0.116
 hers-0.113
Tokenb
Token ID65
Feature activation-8.65e-02

Pos logit contributions
fman+0.004
ogue+0.004
iferation+0.004
 Sinn+0.004
lus+0.004
Neg logit contributions
MENTS-0.005
SPONSORED-0.005
ERY-0.005
CAR-0.005
 disappointment-0.005
Token Heritage
Token ID18518
Feature activation+1.79e-01

Pos logit contributions
MENTS+0.228
SPONSORED+0.218
ERY+0.217
CAR+0.216
MENT+0.216
Neg logit contributions
fman-0.186
ogue-0.186
iferation-0.186
 Sinn-0.185
lus-0.180
Token Foundation
Token ID5693
Feature activation-1.80e-01

Pos logit contributions
ogue+0.284
iferation+0.283
fman+0.282
 Sinn+0.280
lus+0.270
Neg logit contributions
MENTS-0.445
SPONSORED-0.427
ERY-0.425
CAR-0.422
MENT-0.421
Token,
Token ID11
Feature activation-6.37e-02

Pos logit contributions
MENTS+0.408
SPONSORED+0.390
ERY+0.390
CAR+0.387
MENT+0.385
Neg logit contributions
fman-0.325
iferation-0.323
ogue-0.322
 Sinn-0.321
lus-0.311
Token told
Token ID1297
Feature activation-3.88e-01

Pos logit contributions
MENTS+0.131
SPONSORED+0.125
ERY+0.124
CAR+0.123
MENT+0.122
Neg logit contributions
fman-0.128
ogue-0.127
 Sinn-0.127
iferation-0.126
lus-0.123
Token the
Token ID262
Feature activation-5.51e-02

Pos logit contributions
MENTS+0.909
SPONSORED+0.872
ERY+0.869
CAR+0.867
 disappointment+0.861
Neg logit contributions
fman-0.665
iferation-0.663
ogue-0.661
 Sinn-0.654
��-0.639
Token Ou
Token ID42201
Feature activation+1.04e+00

Pos logit contributions
MENTS+0.120
SPONSORED+0.115
ERY+0.114
CAR+0.114
MENT+0.113
Neg logit contributions
ogue-0.104
fman-0.103
iferation-0.103
 Sinn-0.102
lus-0.100
Tokenach
Token ID620
Feature activation+3.44e-01

Pos logit contributions
ogue+1.033
fman+1.016
 Sinn+0.980
ouf+0.968
 hers+0.938
Neg logit contributions
MENTS-3.157
SPONSORED-3.115
 disappointment-3.040
MENT-3.022
ERY-3.016
Tokenita
Token ID5350
Feature activation+1.93e-01

Pos logit contributions
fman+0.525
iferation+0.521
ogue+0.517
 Sinn+0.513
��+0.505
Neg logit contributions
MENTS-0.872
SPONSORED-0.838
CAR-0.834
ERY-0.833
MENT-0.830
Token Citizen
Token ID22307
Feature activation-1.06e-01

Pos logit contributions
ogue+0.323
fman+0.321
iferation+0.319
 Sinn+0.318
ouf+0.309
Neg logit contributions
MENTS-0.460
SPONSORED-0.441
ERY-0.439
CAR-0.438
MENT-0.435
Token that
Token ID326
Feature activation+1.00e-01

Pos logit contributions
MENTS+0.225
SPONSORED+0.215
ERY+0.214
CAR+0.213
MENT+0.212
Neg logit contributions
fman-0.206
ogue-0.206
iferation-0.206
 Sinn-0.204
lus-0.199
Token the
Token ID262
Feature activation+6.62e-03

Pos logit contributions
fman+0.185
ogue+0.185
iferation+0.184
 Sinn+0.184
 hers+0.178
Neg logit contributions
MENTS-0.221
SPONSORED-0.211
ERY-0.210
CAR-0.209
MENT-0.208
Token<|endoftext|>
Token ID50256
Feature activation+3.20e-02

Pos logit contributions
fman+0.151
ogue+0.150
iferation+0.150
 Sinn+0.150
lus+0.146
Neg logit contributions
MENTS-0.179
SPONSORED-0.171
ERY-0.170
CAR-0.169
 disappointment-0.169
TokenImage
Token ID5159
Feature activation+1.37e-01

Pos logit contributions
fman+0.074
ogue+0.074
iferation+0.073
 Sinn+0.072
lus+0.072
Neg logit contributions
MENTS-0.056
 disappointment-0.052
ERY-0.052
SPONSORED-0.052
MENT-0.051
Token copyright
Token ID6634
Feature activation+1.31e+00

Pos logit contributions
fman+0.292
ogue+0.292
iferation+0.291
 Sinn+0.291
lus+0.283
Neg logit contributions
MENTS-0.260
SPONSORED-0.250
ERY-0.246
CAR-0.245
MENT-0.244
Token ESA
Token ID42212
Feature activation-1.22e-01

Pos logit contributions
 Sinn+2.351
fman+2.271
ogue+2.247
iferation+2.227
 hers+2.214
Neg logit contributions
MENTS-2.966
SPONSORED-2.897
ERY-2.825
MENT-2.823
CAR-2.808
Token Image
Token ID7412
Feature activation-1.10e-01

Pos logit contributions
MENTS+0.282
SPONSORED+0.271
ERY+0.269
CAR+0.269
MENT+0.267
Neg logit contributions
fman-0.213
iferation-0.212
ogue-0.212
 Sinn-0.212
lus-0.205
Token caption
Token ID8305
Feature activation+1.03e+00

Pos logit contributions
MENTS+0.161
SPONSORED+0.151
ERY+0.150
CAR+0.149
MENT+0.148
Neg logit contributions
fman-0.284
ogue-0.283
iferation-0.283
 Sinn-0.281
lus-0.276
Token Go
Token ID1514
Feature activation+2.14e-01

Pos logit contributions
 Sinn+1.737
fman+1.646
iferation+1.640
ogue+1.633
 Nurse+1.629
Neg logit contributions
MENTS-2.461
SPONSORED-2.422
ERY-2.366
MENT-2.358
 disappointment-2.350
Tokence
Token ID344
Feature activation-1.76e-01

Pos logit contributions
fman+0.420
ogue+0.416
 Sinn+0.407
iferation+0.406
onder+0.396
Neg logit contributions
MENTS-0.456
SPONSORED-0.443
ERY-0.438
MENT-0.431
CAR-0.428
Token uses
Token ID3544
Feature activation+3.96e-01

Pos logit contributions
MENTS+0.376
SPONSORED+0.361
ERY+0.358
CAR+0.357
MENT+0.355
Neg logit contributions
fman-0.337
ogue-0.336
iferation-0.335
 Sinn-0.333
lus-0.325
Token an
Token ID281
Feature activation+1.95e-01

Pos logit contributions
fman+0.742
 Sinn+0.741
ogue+0.735
iferation+0.731
 hers+0.720
Neg logit contributions
MENTS-0.861
SPONSORED-0.832
CAR-0.821
ERY-0.820
MENT-0.813
Token electric
Token ID5186
Feature activation+1.67e-01

Pos logit contributions
fman+0.385
iferation+0.384
ogue+0.384
 Sinn+0.383
 hers+0.373
Neg logit contributions
MENTS-0.405
SPONSORED-0.387
ERY-0.385
CAR-0.384
MENT-0.381
Token000
Token ID830
Feature activation+3.70e-02

Pos logit contributions
iferation+0.247
fman+0.247
ogue+0.246
 Sinn+0.244
��+0.240
Neg logit contributions
MENTS-0.257
SPONSORED-0.245
ERY-0.244
CAR-0.243
MENT-0.242
Token.
Token ID13
Feature activation-4.02e-02

Pos logit contributions
ogue+0.077
fman+0.077
 Sinn+0.076
iferation+0.076
ouf+0.074
Neg logit contributions
MENTS-0.073
SPONSORED-0.070
ERY-0.069
CAR-0.069
MENT-0.069
Token
Token ID198
Feature activation-7.16e-02

Pos logit contributions
MENTS+0.090
SPONSORED+0.087
CAR+0.086
ERY+0.086
 disappointment+0.085
Neg logit contributions
 Sinn-0.073
ogue-0.072
fman-0.072
iferation-0.072
lus-0.070
Token
Token ID198
Feature activation-4.23e-02

Pos logit contributions
MENTS+0.151
SPONSORED+0.145
CAR+0.144
ERY+0.144
MENT+0.143
Neg logit contributions
iferation-0.140
fman-0.139
ogue-0.138
��-0.137
 Sinn-0.137
TokenImage
Token ID5159
Feature activation+5.46e-03

Pos logit contributions
MENTS+0.076
 disappointment+0.071
ERY+0.071
SPONSORED+0.071
CAR+0.070
Neg logit contributions
ogue-0.095
fman-0.095
iferation-0.094
 Sinn-0.094
lus-0.092
Token caption
Token ID8305
Feature activation+1.01e+00

Pos logit contributions
fman+0.013
ogue+0.013
iferation+0.013
 Sinn+0.013
lus+0.013
Neg logit contributions
MENTS-0.009
SPONSORED-0.008
ERY-0.008
CAR-0.008
MENT-0.008
Token Council
Token ID4281
Feature activation-1.93e-01

Pos logit contributions
 Sinn+1.729
ogue+1.660
fman+1.645
 Nurse+1.640
iferation+1.640
Neg logit contributions
MENTS-2.376
SPONSORED-2.319
MENT-2.259
CAR-2.258
 disappointment-2.249
Token staff
Token ID3085
Feature activation-4.03e-02

Pos logit contributions
MENTS+0.447
SPONSORED+0.429
ERY+0.425
CAR+0.425
MENT+0.423
Neg logit contributions
fman-0.336
ogue-0.335
iferation-0.335
 Sinn-0.333
lus-0.324
Token found
Token ID1043
Feature activation+1.17e-02

Pos logit contributions
MENTS+0.089
SPONSORED+0.085
ERY+0.085
CAR+0.085
MENT+0.084
Neg logit contributions
fman-0.074
ogue-0.074
iferation-0.074
 Sinn-0.073
lus-0.072
Token paperwork
Token ID23143
Feature activation-4.98e-01

Pos logit contributions
 Sinn+0.022
ogue+0.022
fman+0.022
iferation+0.022
 hers+0.022
Neg logit contributions
MENTS-0.025
SPONSORED-0.024
CAR-0.024
ERY-0.024
MENT-0.023
Token with
Token ID351
Feature activation+2.61e-02

Pos logit contributions
MENTS+1.031
SPONSORED+0.985
ERY+0.979
 disappointment+0.978
CAR+0.977
Neg logit contributions
fman-0.987
iferation-0.984
ogue-0.977
 Sinn-0.963
��-0.958
Token ISI
Token ID22048
Feature activation-1.33e-01

Pos logit contributions
iferation+0.513
fman+0.512
ogue+0.508
 Sinn+0.501
��+0.495
Neg logit contributions
MENTS-0.773
SPONSORED-0.744
ERY-0.738
CAR-0.736
MENT-0.734
Token
Token ID198
Feature activation+1.14e-01

Pos logit contributions
MENTS+0.277
SPONSORED+0.266
CAR+0.263
ERY+0.261
MENT+0.261
Neg logit contributions
iferation-0.264
fman-0.264
ogue-0.261
��-0.261
 Sinn-0.256
Token
Token ID198
Feature activation+1.09e-01

Pos logit contributions
iferation+0.224
fman+0.223
ogue+0.221
��+0.220
 Sinn+0.218
Neg logit contributions
MENTS-0.240
SPONSORED-0.231
CAR-0.228
ERY-0.227
MENT-0.226
TokenSh
Token ID2484
Feature activation-2.58e-01

Pos logit contributions
fman+0.232
ogue+0.231
iferation+0.230
 Sinn+0.230
lus+0.225
Neg logit contributions
MENTS-0.209
SPONSORED-0.198
ERY-0.198
CAR-0.197
 disappointment-0.197
Tokenak
Token ID461
Feature activation+1.21e-01

Pos logit contributions
MENTS+0.567
SPONSORED+0.543
ERY+0.540
CAR+0.539
MENT+0.536
Neg logit contributions
fman-0.480
ogue-0.478
iferation-0.478
 Sinn-0.473
lus-0.462
Tokenoor
Token ID2675
Feature activation+1.00e+00

Pos logit contributions
fman+0.238
ogue+0.238
iferation+0.234
 Sinn+0.234
ouf+0.231
Neg logit contributions
MENTS-0.255
SPONSORED-0.244
ERY-0.242
CAR-0.240
MENT-0.240
Token,
Token ID11
Feature activation+6.52e-02

Pos logit contributions
fman+1.783
ogue+1.766
iferation+1.754
 Sinn+1.752
ouf+1.702
Neg logit contributions
MENTS-2.305
SPONSORED-2.217
ERY-2.193
CAR-2.182
MENT-2.173
Token S
Token ID311
Feature activation+1.32e-02

Pos logit contributions
fman+0.119
iferation+0.119
ogue+0.118
 Sinn+0.117
��+0.115
Neg logit contributions
MENTS-0.145
SPONSORED-0.139
ERY-0.138
CAR-0.138
MENT-0.137
Token.,
Token ID1539
Feature activation-1.54e-01

Pos logit contributions
ogue+0.031
fman+0.031
iferation+0.031
 Sinn+0.031
lus+0.030
Neg logit contributions
MENTS-0.023
SPONSORED-0.021
CAR-0.021
ERY-0.021
 disappointment-0.021
Token J
Token ID449
Feature activation-1.26e-02

Pos logit contributions
MENTS+0.358
SPONSORED+0.344
ERY+0.342
CAR+0.341
MENT+0.340
Neg logit contributions
fman-0.266
iferation-0.265
ogue-0.265
 Sinn-0.262
��-0.256
Tokenaff
Token ID2001
Feature activation-1.55e-01

Pos logit contributions
MENTS+0.026
SPONSORED+0.025
ERY+0.025
CAR+0.025
 disappointment+0.025
Neg logit contributions
ogue-0.025
fman-0.025
iferation-0.025
 Sinn-0.025
ouf-0.024
Token super
Token ID2208
Feature activation+1.74e-01

Pos logit contributions
fman+0.265
iferation+0.264
ogue+0.264
 Sinn+0.261
��+0.255
Neg logit contributions
MENTS-0.299
SPONSORED-0.286
ERY-0.284
CAR-0.283
 disappointment-0.282
Token-
Token ID12
Feature activation+1.69e-01

Pos logit contributions
fman+0.317
ogue+0.317
iferation+0.316
 Sinn+0.314
lus+0.307
Neg logit contributions
MENTS-0.386
SPONSORED-0.370
ERY-0.368
CAR-0.367
MENT-0.364
Tokenbad
Token ID14774
Feature activation+2.19e-01

Pos logit contributions
fman+0.338
ogue+0.337
iferation+0.337
 Sinn+0.334
lus+0.327
Neg logit contributions
MENTS-0.347
SPONSORED-0.330
ERY-0.329
CAR-0.327
MENT-0.326
Token if
Token ID611
Feature activation+9.64e-02

Pos logit contributions
fman+0.461
ogue+0.460
iferation+0.459
 Sinn+0.456
lus+0.447
Neg logit contributions
MENTS-0.425
SPONSORED-0.405
ERY-0.402
CAR-0.401
MENT-0.398
Token you
Token ID345
Feature activation+1.35e-01

Pos logit contributions
fman+0.167
ogue+0.167
iferation+0.166
 Sinn+0.166
��+0.162
Neg logit contributions
MENTS-0.223
SPONSORED-0.214
ERY-0.213
CAR-0.212
MENT-0.211
Token hit
Token ID2277
Feature activation+1.00e+00

Pos logit contributions
fman+0.289
ogue+0.288
iferation+0.288
 Sinn+0.286
lus+0.280
Neg logit contributions
MENTS-0.260
SPONSORED-0.247
ERY-0.246
CAR-0.245
MENT-0.243
Token your
Token ID534
Feature activation+2.54e-01

Pos logit contributions
 Sinn+1.689
 hers+1.680
ogue+1.680
fman+1.674
iferation+1.645
Neg logit contributions
MENTS-2.345
SPONSORED-2.272
ERY-2.248
CAR-2.242
MENT-2.219
Token own
Token ID898
Feature activation+5.20e-02

Pos logit contributions
fman+0.437
ogue+0.435
 Sinn+0.433
iferation+0.433
 hers+0.423
Neg logit contributions
MENTS-0.593
SPONSORED-0.570
ERY-0.567
CAR-0.565
MENT-0.561
Token minion
Token ID28365
Feature activation+6.10e-01

Pos logit contributions
fman+0.079
ogue+0.079
iferation+0.078
 Sinn+0.078
 hers+0.076
Neg logit contributions
MENTS-0.132
SPONSORED-0.127
ERY-0.127
CAR-0.126
MENT-0.125
Token.
Token ID13
Feature activation+3.10e-01

Pos logit contributions
ogue+1.279
fman+1.275
 Sinn+1.271
iferation+1.265
 hers+1.245
Neg logit contributions
MENTS-1.188
SPONSORED-1.139
CAR-1.123
ERY-1.122
MENT-1.109
Token It
Token ID632
Feature activation+1.82e-01

Pos logit contributions
fman+0.555
ogue+0.555
iferation+0.552
 Sinn+0.552
lus+0.537
Neg logit contributions
MENTS-0.697
SPONSORED-0.669
ERY-0.665
CAR-0.663
MENT-0.659
TokenBrazil
Token ID39190
Feature activation-2.75e-01

Pos logit contributions
MENTS+0.093
ERY+0.087
 disappointment+0.087
SPONSORED+0.087
MENT+0.086
Neg logit contributions
ogue-0.125
fman-0.125
iferation-0.125
 Sinn-0.124
lus-0.122
Tokenian
Token ID666
Feature activation-1.48e-01

Pos logit contributions
MENTS+0.563
SPONSORED+0.537
ERY+0.534
CAR+0.533
MENT+0.530
Neg logit contributions
fman-0.550
iferation-0.548
ogue-0.548
 Sinn-0.543
��-0.531
Token President
Token ID1992
Feature activation-1.78e-02

Pos logit contributions
MENTS+0.314
SPONSORED+0.300
ERY+0.299
CAR+0.297
MENT+0.296
Neg logit contributions
ogue-0.287
fman-0.287
iferation-0.286
 Sinn-0.285
lus-0.279
Token Dil
Token ID29328
Feature activation+5.58e-02

Pos logit contributions
MENTS+0.045
SPONSORED+0.044
ERY+0.043
CAR+0.043
MENT+0.043
Neg logit contributions
ogue-0.027
fman-0.027
iferation-0.026
 Sinn-0.026
lus-0.026
Tokenma
Token ID2611
Feature activation-9.66e-02

Pos logit contributions
ogue+0.108
fman+0.107
iferation+0.107
 Sinn+0.106
ouf+0.105
Neg logit contributions
MENTS-0.119
SPONSORED-0.114
ERY-0.114
MENT-0.112
CAR-0.112
Token Rousse
Token ID42849
Feature activation+1.00e+00

Pos logit contributions
MENTS+0.251
SPONSORED+0.242
ERY+0.240
CAR+0.240
MENT+0.239
Neg logit contributions
fman-0.141
ogue-0.140
iferation-0.140
 Sinn-0.139
lus-0.134
Tokenff
Token ID487
Feature activation-7.65e-02

Pos logit contributions
fman+1.253
ogue+1.227
iferation+1.225
 Sinn+1.220
lus+1.192
Neg logit contributions
MENTS-2.801
ERY-2.712
SPONSORED-2.711
CAR-2.683
MENT-2.676
Token and
Token ID290
Feature activation-8.50e-02

Pos logit contributions
MENTS+0.161
SPONSORED+0.154
ERY+0.153
CAR+0.153
MENT+0.152
Neg logit contributions
fman-0.149
ogue-0.148
iferation-0.148
 Sinn-0.147
lus-0.144
Token Cuban
Token ID18147
Feature activation+2.63e-01

Pos logit contributions
MENTS+0.187
SPONSORED+0.179
ERY+0.178
CAR+0.178
 disappointment+0.177
Neg logit contributions
fman-0.158
iferation-0.157
ogue-0.157
 Sinn-0.155
��-0.152
Token President
Token ID1992
Feature activation+1.19e-01

Pos logit contributions
ogue+0.493
fman+0.491
iferation+0.491
 Sinn+0.490
lus+0.478
Neg logit contributions
MENTS-0.572
SPONSORED-0.547
ERY-0.544
CAR-0.541
MENT-0.539
Token R
Token ID371
Feature activation-5.33e-02

Pos logit contributions
 Sinn+0.203
ogue+0.203
iferation+0.203
fman+0.202
 hers+0.198
Neg logit contributions
MENTS-0.279
SPONSORED-0.269
ERY-0.267
CAR-0.265
 disappointment-0.263
Token),
Token ID828
Feature activation+7.32e-02

Pos logit contributions
MENTS+0.395
SPONSORED+0.378
ERY+0.374
CAR+0.374
MENT+0.371
Neg logit contributions
ogue-0.349
fman-0.348
iferation-0.346
 Sinn-0.345
lus-0.336
Token Robin
Token ID12325
Feature activation-2.97e-01

Pos logit contributions
fman+0.141
ogue+0.140
 Sinn+0.140
iferation+0.140
 hers+0.136
Neg logit contributions
MENTS-0.156
SPONSORED-0.150
CAR-0.148
ERY-0.148
MENT-0.147
Token and
Token ID290
Feature activation-6.23e-02

Pos logit contributions
MENTS+0.700
SPONSORED+0.676
CAR+0.669
ERY+0.668
MENT+0.661
Neg logit contributions
fman-0.510
ogue-0.507
iferation-0.505
 Sinn-0.503
lus-0.489
Token even
Token ID772
Feature activation+5.21e-01

Pos logit contributions
MENTS+0.141
SPONSORED+0.135
CAR+0.134
ERY+0.134
MENT+0.133
Neg logit contributions
fman-0.112
ogue-0.112
 Sinn-0.111
iferation-0.111
 hers-0.108
Token Black
Token ID2619
Feature activation-3.07e-01

Pos logit contributions
 Sinn+0.919
ogue+0.908
fman+0.903
 hers+0.897
iferation+0.896
Neg logit contributions
MENTS-1.195
SPONSORED-1.145
CAR-1.137
ERY-1.132
MENT-1.123
Token Canary
Token ID50206
Feature activation-1.18e+00

Pos logit contributions
MENTS+0.618
SPONSORED+0.595
ERY+0.584
CAR+0.582
MENT+0.578
Neg logit contributions
fman-0.629
ogue-0.626
iferation-0.625
 Sinn-0.623
lus-0.612
Token may
Token ID743
Feature activation+5.01e-02

Pos logit contributions
MENTS+2.563
ERY+2.447
CAR+2.443
SPONSORED+2.440
 disappointment+2.439
Neg logit contributions
iferation-2.196
fman-2.178
ogue-2.169
 Sinn-2.119
��-2.103
Token be
Token ID307
Feature activation-4.84e-02

Pos logit contributions
fman+0.101
ogue+0.101
iferation+0.101
 Sinn+0.100
lus+0.098
Neg logit contributions
MENTS-0.102
SPONSORED-0.097
ERY-0.096
CAR-0.096
MENT-0.096
Token involved
Token ID2950
Feature activation+4.00e-01

Pos logit contributions
MENTS+0.097
SPONSORED+0.092
ERY+0.092
CAR+0.092
MENT+0.091
Neg logit contributions
fman-0.099
ogue-0.099
iferation-0.099
 Sinn-0.098
lus-0.096
Token,
Token ID11
Feature activation-3.07e-02

Pos logit contributions
fman+0.756
ogue+0.754
 Sinn+0.751
iferation+0.744
 hers+0.730
Neg logit contributions
MENTS-0.863
SPONSORED-0.832
ERY-0.823
CAR-0.819
 disappointment-0.811
Token with
Token ID351
Feature activation+4.57e-03

Pos logit contributions
MENTS+0.063
SPONSORED+0.060
ERY+0.060
CAR+0.060
MENT+0.060
Neg logit contributions
fman-0.061
ogue-0.061
iferation-0.061
 Sinn-0.060
lus-0.059
Token binary
Token ID13934
Feature activation-2.76e-01

Pos logit contributions
ogue+0.173
fman+0.173
 Sinn+0.170
iferation+0.170
lus+0.169
Neg logit contributions
MENTS-0.193
SPONSORED-0.186
ERY-0.183
MENT-0.182
CAR-0.181
Token is
Token ID318
Feature activation-1.22e-01

Pos logit contributions
MENTS+0.575
SPONSORED+0.549
ERY+0.545
CAR+0.543
MENT+0.540
Neg logit contributions
fman-0.545
ogue-0.544
iferation-0.542
 Sinn-0.539
lus-0.526
Token moved
Token ID3888
Feature activation-2.04e-02

Pos logit contributions
MENTS+0.258
SPONSORED+0.247
ERY+0.245
CAR+0.245
MENT+0.243
Neg logit contributions
fman-0.238
iferation-0.238
ogue-0.237
 Sinn-0.235
��-0.230
Token from
Token ID422
Feature activation-9.43e-02

Pos logit contributions
MENTS+0.043
SPONSORED+0.041
ERY+0.041
CAR+0.041
MENT+0.041
Neg logit contributions
fman-0.039
ogue-0.039
iferation-0.039
 Sinn-0.039
lus-0.038
Token one
Token ID530
Feature activation-6.91e-02

Pos logit contributions
MENTS+0.208
SPONSORED+0.199
ERY+0.198
CAR+0.197
MENT+0.196
Neg logit contributions
fman-0.174
iferation-0.174
ogue-0.174
 Sinn-0.172
��-0.168
Token partition
Token ID18398
Feature activation-1.09e+00

Pos logit contributions
MENTS+0.137
SPONSORED+0.131
ERY+0.130
CAR+0.129
MENT+0.129
Neg logit contributions
fman-0.143
ogue-0.143
iferation-0.142
 Sinn-0.141
lus-0.138
Token to
Token ID284
Feature activation-1.83e-01

Pos logit contributions
MENTS+2.256
SPONSORED+2.176
ERY+2.165
CAR+2.158
 disappointment+2.142
Neg logit contributions
iferation-2.122
ogue-2.087
fman-2.079
 Sinn-2.066
��-2.055
Token another
Token ID1194
Feature activation-1.89e-02

Pos logit contributions
MENTS+0.395
SPONSORED+0.378
ERY+0.376
CAR+0.375
MENT+0.373
Neg logit contributions
fman-0.346
ogue-0.345
iferation-0.344
 Sinn-0.341
lus-0.333
Token The
Token ID383
Feature activation-2.63e-01

Pos logit contributions
MENTS+0.039
SPONSORED+0.037
ERY+0.037
CAR+0.037
MENT+0.036
Neg logit contributions
fman-0.038
ogue-0.038
iferation-0.038
 Sinn-0.038
lus-0.037
Token way
Token ID835
Feature activation+5.58e-02

Pos logit contributions
MENTS+0.483
SPONSORED+0.458
ERY+0.455
CAR+0.454
MENT+0.450
Neg logit contributions
fman-0.584
iferation-0.583
ogue-0.583
 Sinn-0.578
��-0.567
Token I
Token ID314
Feature activation-1.45e-01

Pos logit contributions
fman+0.106
ogue+0.105
iferation+0.105
 Sinn+0.105
lus+0.102
Neg logit contributions
MENTS-0.121
SPONSORED-0.116
ERY-0.115
CAR-0.114
MENT-0.114
Token to
Token ID284
Feature activation+8.25e-02

Pos logit contributions
fman+0.749
ogue+0.748
iferation+0.747
 Sinn+0.743
 hers+0.726
Neg logit contributions
MENTS-0.739
SPONSORED-0.703
ERY-0.701
CAR-0.697
MENT-0.693
Token deport
Token ID29489
Feature activation-2.06e-01

Pos logit contributions
fman+0.150
ogue+0.149
iferation+0.149
 Sinn+0.148
 hers+0.145
Neg logit contributions
MENTS-0.185
SPONSORED-0.177
ERY-0.176
CAR-0.175
MENT-0.174
Token.
Token ID13
Feature activation+4.07e-02

Pos logit contributions
MENTS+0.427
SPONSORED+0.408
ERY+0.406
CAR+0.405
MENT+0.402
Neg logit contributions
fman-0.406
ogue-0.405
iferation-0.405
 Sinn-0.401
lus-0.392
Token He
Token ID679
Feature activation+1.59e-01

Pos logit contributions
fman+0.071
ogue+0.071
iferation+0.071
 Sinn+0.070
lus+0.068
Neg logit contributions
MENTS-0.094
SPONSORED-0.090
ERY-0.090
CAR-0.089
MENT-0.089
Token bl
Token ID698
Feature activation-1.63e-01

Pos logit contributions
fman+0.308
ogue+0.305
iferation+0.305
 Sinn+0.304
lus+0.296
Neg logit contributions
MENTS-0.339
SPONSORED-0.323
ERY-0.322
CAR-0.320
MENT-0.318
Tokenithe
Token ID31470
Feature activation-1.08e+00

Pos logit contributions
MENTS+0.344
SPONSORED+0.328
ERY+0.326
CAR+0.325
MENT+0.324
Neg logit contributions
fman-0.319
iferation-0.318
ogue-0.318
 Sinn-0.315
lus-0.308
Tokenly
Token ID306
Feature activation-2.03e-01

Pos logit contributions
MENTS+1.994
SPONSORED+1.897
ERY+1.880
CAR+1.872
 disappointment+1.870
Neg logit contributions
iferation-2.379
fman-2.372
ogue-2.371
 Sinn-2.344
��-2.320
Token called
Token ID1444
Feature activation+4.46e-02

Pos logit contributions
MENTS+0.430
SPONSORED+0.409
ERY+0.409
CAR+0.407
MENT+0.404
Neg logit contributions
fman-0.392
ogue-0.391
iferation-0.390
 Sinn-0.388
lus-0.379
Token for
Token ID329
Feature activation+5.70e-02

Pos logit contributions
ogue+0.076
fman+0.076
 Sinn+0.076
iferation+0.076
 hers+0.074
Neg logit contributions
MENTS-0.104
SPONSORED-0.100
ERY-0.099
CAR-0.099
MENT-0.099
Token torture
Token ID11543
Feature activation+1.16e-01

Pos logit contributions
fman+0.106
ogue+0.106
 Sinn+0.106
iferation+0.106
 hers+0.103
Neg logit contributions
MENTS-0.125
SPONSORED-0.120
CAR-0.118
ERY-0.118
MENT-0.118
Token but
Token ID475
Feature activation+2.07e-02

Pos logit contributions
ogue+0.223
fman+0.222
iferation+0.221
 Sinn+0.221
lus+0.215
Neg logit contributions
MENTS-0.246
SPONSORED-0.236
ERY-0.235
CAR-0.233
MENT-0.231
Token
Token ID447
Feature activation+9.53e-02

Pos logit contributions
MENTS+0.232
SPONSORED+0.217
CAR+0.214
ERY+0.214
MENT+0.212
Neg logit contributions
fman-0.443
iferation-0.442
ogue-0.441
 Sinn-0.438
��-0.434
Token
Token ID247
Feature activation-1.83e-02

Pos logit contributions
fman+0.214
ogue+0.213
iferation+0.213
 Sinn+0.211
��+0.209
Neg logit contributions
MENTS-0.172
SPONSORED-0.162
ERY-0.161
CAR-0.161
 disappointment-0.160
Tokent
Token ID83
Feature activation-1.55e-01

Pos logit contributions
MENTS+0.041
SPONSORED+0.039
ERY+0.039
CAR+0.039
MENT+0.038
Neg logit contributions
fman-0.033
ogue-0.033
iferation-0.033
 Sinn-0.033
��-0.032
Token quite
Token ID2407
Feature activation-1.27e-01

Pos logit contributions
MENTS+0.310
ERY+0.296
SPONSORED+0.296
CAR+0.295
MENT+0.291
Neg logit contributions
fman-0.319
ogue-0.317
 Sinn-0.316
iferation-0.315
 hers-0.309
Token br
Token ID865
Feature activation-2.96e-01

Pos logit contributions
MENTS+0.264
ERY+0.253
SPONSORED+0.252
CAR+0.252
MENT+0.248
Neg logit contributions
fman-0.252
ogue-0.251
 Sinn-0.250
iferation-0.250
 hers-0.245
Tokenanc
Token ID1192
Feature activation-1.04e+00

Pos logit contributions
MENTS+0.834
SPONSORED+0.806
ERY+0.802
CAR+0.801
MENT+0.797
Neg logit contributions
fman-0.367
ogue-0.365
iferation-0.365
 Sinn-0.360
lus-0.347
Tokenhed
Token ID704
Feature activation-2.26e-01

Pos logit contributions
MENTS+2.120
SPONSORED+2.038
CAR+2.021
ERY+2.019
 disappointment+2.000
Neg logit contributions
fman-2.081
iferation-2.078
ogue-2.076
 Sinn-2.057
��-2.021
Token out
Token ID503
Feature activation-1.60e-01

Pos logit contributions
MENTS+0.459
SPONSORED+0.437
ERY+0.435
CAR+0.434
MENT+0.430
Neg logit contributions
ogue-0.459
fman-0.459
 Sinn-0.457
iferation-0.456
 hers-0.448
Token to
Token ID284
Feature activation+2.42e-02

Pos logit contributions
MENTS+0.311
SPONSORED+0.296
ERY+0.294
CAR+0.293
MENT+0.291
Neg logit contributions
fman-0.337
ogue-0.336
iferation-0.335
 Sinn-0.334
lus-0.326
Token many
Token ID867
Feature activation-1.80e-02

Pos logit contributions
fman+0.047
ogue+0.046
 Sinn+0.046
iferation+0.046
lus+0.045
Neg logit contributions
MENTS-0.052
SPONSORED-0.050
ERY-0.049
CAR-0.049
MENT-0.049
Token mainstream
Token ID8661
Feature activation-3.33e-01

Pos logit contributions
MENTS+0.037
SPONSORED+0.035
CAR+0.035
ERY+0.035
MENT+0.035
Neg logit contributions
fman-0.036
ogue-0.036
 Sinn-0.036
iferation-0.036
lus-0.035
Token,
Token ID11
Feature activation+5.15e-04

Pos logit contributions
ogue+1.499
 Sinn+1.485
iferation+1.481
fman+1.480
ouf+1.437
Neg logit contributions
MENTS-1.532
SPONSORED-1.442
CAR-1.438
ERY-1.434
MENT-1.425
Token it
Token ID340
Feature activation-1.17e-02

Pos logit contributions
fman+0.001
ogue+0.001
iferation+0.001
 Sinn+0.001
lus+0.001
Neg logit contributions
MENTS-0.001
SPONSORED-0.001
ERY-0.001
CAR-0.001
MENT-0.001
Token is
Token ID318
Feature activation+6.99e-02

Pos logit contributions
MENTS+0.024
ERY+0.023
SPONSORED+0.023
CAR+0.023
MENT+0.023
Neg logit contributions
ogue-0.023
fman-0.023
iferation-0.023
 Sinn-0.023
 hers-0.022
Token a
Token ID257
Feature activation+1.31e-02

Pos logit contributions
ogue+0.132
fman+0.132
iferation+0.131
 Sinn+0.131
lus+0.127
Neg logit contributions
MENTS-0.152
SPONSORED-0.144
ERY-0.144
CAR-0.144
MENT-0.143
Token bl
Token ID698
Feature activation-1.26e-02

Pos logit contributions
fman+0.028
ogue+0.028
 Sinn+0.028
iferation+0.028
 hers+0.028
Neg logit contributions
MENTS-0.025
SPONSORED-0.023
CAR-0.023
ERY-0.023
MENT-0.023
Tokenunder
Token ID4625
Feature activation-9.96e-01

Pos logit contributions
MENTS+0.019
SPONSORED+0.018
CAR+0.017
ERY+0.017
MENT+0.017
Neg logit contributions
fman-0.033
iferation-0.033
ogue-0.033
 Sinn-0.032
lus-0.032
Token.
Token ID13
Feature activation-8.89e-02

Pos logit contributions
MENTS+1.960
SPONSORED+1.872
ERY+1.860
CAR+1.850
 disappointment+1.846
Neg logit contributions
iferation-2.067
fman-2.065
ogue-2.057
 Sinn-2.040
��-2.009
Token If
Token ID1002
Feature activation+1.32e-01

Pos logit contributions
MENTS+0.204
SPONSORED+0.195
ERY+0.194
CAR+0.194
MENT+0.193
Neg logit contributions
fman-0.157
ogue-0.156
iferation-0.156
 Sinn-0.155
lus-0.151
Token policymakers
Token ID29484
Feature activation+3.23e-02

Pos logit contributions
fman+0.251
ogue+0.250
iferation+0.250
 Sinn+0.248
lus+0.242
Neg logit contributions
MENTS-0.284
SPONSORED-0.271
ERY-0.270
CAR-0.269
MENT-0.268
Token listened
Token ID16399
Feature activation+5.94e-01

Pos logit contributions
fman+0.062
iferation+0.062
ogue+0.062
 Sinn+0.061
��+0.060
Neg logit contributions
MENTS-0.069
SPONSORED-0.066
ERY-0.065
CAR-0.065
MENT-0.065
Token to
Token ID284
Feature activation+3.10e-02

Pos logit contributions
ogue+1.052
 Sinn+1.051
iferation+1.047
fman+1.045
lus+1.016
Neg logit contributions
MENTS-1.356
SPONSORED-1.292
ERY-1.289
CAR-1.287
MENT-1.279
Token
Token ID447
Feature activation-1.14e-01

Pos logit contributions
MENTS+0.198
SPONSORED+0.186
ERY+0.183
CAR+0.183
MENT+0.182
Neg logit contributions
fman-0.375
iferation-0.374
ogue-0.373
 Sinn-0.371
��-0.367
Token
Token ID247
Feature activation-5.23e-02

Pos logit contributions
MENTS+0.219
SPONSORED+0.209
ERY+0.208
CAR+0.207
MENT+0.206
Neg logit contributions
fman-0.243
iferation-0.243
ogue-0.242
 Sinn-0.241
lus-0.235
Tokent
Token ID83
Feature activation-3.15e-01

Pos logit contributions
MENTS+0.106
SPONSORED+0.102
ERY+0.101
CAR+0.101
MENT+0.100
Neg logit contributions
fman-0.105
iferation-0.105
ogue-0.105
 Sinn-0.104
��-0.102
Token apply
Token ID4174
Feature activation-3.39e-01

Pos logit contributions
MENTS+0.647
SPONSORED+0.621
ERY+0.615
CAR+0.615
 disappointment+0.613
Neg logit contributions
iferation-0.630
fman-0.625
ogue-0.623
 Sinn-0.618
��-0.611
Token any
Token ID597
Feature activation-3.96e-01

Pos logit contributions
MENTS+0.713
SPONSORED+0.679
ERY+0.676
CAR+0.673
MENT+0.672
Neg logit contributions
iferation-0.663
fman-0.661
ogue-0.659
 Sinn-0.651
��-0.641
Token lens
Token ID10317
Feature activation-9.90e-01

Pos logit contributions
MENTS+0.822
SPONSORED+0.784
ERY+0.781
CAR+0.778
 disappointment+0.775
Neg logit contributions
iferation-0.785
fman-0.783
ogue-0.779
 Sinn-0.771
��-0.761
Token profiles
Token ID16545
Feature activation-3.00e-01

Pos logit contributions
MENTS+2.470
SPONSORED+2.368
ERY+2.366
CAR+2.358
MENT+2.352
Neg logit contributions
fman-1.538
iferation-1.537
ogue-1.525
 Sinn-1.511
��-1.489
Token
Token ID784
Feature activation-1.73e-01

Pos logit contributions
MENTS+0.623
SPONSORED+0.595
ERY+0.591
CAR+0.590
MENT+0.586
Neg logit contributions
fman-0.591
ogue-0.590
iferation-0.589
 Sinn-0.584
lus-0.571
Token since
Token ID1201
Feature activation-1.21e-01

Pos logit contributions
MENTS+0.366
SPONSORED+0.350
ERY+0.349
CAR+0.348
 disappointment+0.346
Neg logit contributions
iferation-0.336
fman-0.336
ogue-0.334
 Sinn-0.331
��-0.325
Token there
Token ID612
Feature activation+3.77e-02

Pos logit contributions
MENTS+0.268
SPONSORED+0.257
ERY+0.256
CAR+0.255
MENT+0.254
Neg logit contributions
fman-0.222
iferation-0.222
ogue-0.222
 Sinn-0.219
��-0.214
Token aren
Token ID3588
Feature activation+2.48e-01

Pos logit contributions
fman+0.073
ogue+0.073
iferation+0.073
 Sinn+0.072
lus+0.071
Neg logit contributions
MENTS-0.080
SPONSORED-0.076
ERY-0.076
CAR-0.076
MENT-0.075
Token….
Token ID11580
Feature activation+6.43e-02

Pos logit contributions
MENTS+0.122
SPONSORED+0.116
CAR+0.115
ERY+0.115
MENT+0.114
Neg logit contributions
ogue-0.114
fman-0.113
 Sinn-0.113
iferation-0.113
lus-0.110
Token How
Token ID1374
Feature activation+3.18e-02

Pos logit contributions
fman+0.128
ogue+0.127
 Sinn+0.127
iferation+0.127
lus+0.124
Neg logit contributions
MENTS-0.133
SPONSORED-0.127
ERY-0.125
CAR-0.125
MENT-0.124
Token bad
Token ID2089
Feature activation-5.06e-02

Pos logit contributions
fman+0.063
ogue+0.063
iferation+0.063
 Sinn+0.063
lus+0.061
Neg logit contributions
MENTS-0.065
SPONSORED-0.062
ERY-0.062
CAR-0.062
MENT-0.062
Token is
Token ID318
Feature activation+9.00e-03

Pos logit contributions
MENTS+0.107
SPONSORED+0.102
CAR+0.101
ERY+0.101
MENT+0.100
Neg logit contributions
ogue-0.098
fman-0.098
iferation-0.098
 Sinn-0.098
lus-0.095
Token it
Token ID340
Feature activation-5.10e-02

Pos logit contributions
ogue+0.017
fman+0.017
 Sinn+0.017
iferation+0.017
 hers+0.016
Neg logit contributions
MENTS-0.020
SPONSORED-0.019
CAR-0.019
ERY-0.019
MENT-0.019
Token?」
Token ID42943
Feature activation-9.86e-01

Pos logit contributions
MENTS+0.108
SPONSORED+0.103
ERY+0.103
CAR+0.102
MENT+0.102
Neg logit contributions
ogue-0.098
fman-0.098
iferation-0.098
 Sinn-0.098
 hers-0.095
Token
Token ID198
Feature activation-2.36e-01

Pos logit contributions
MENTS+2.097
SPONSORED+2.008
ERY+1.993
CAR+1.989
MENT+1.980
Neg logit contributions
fman-1.896
iferation-1.888
ogue-1.887
 Sinn-1.859
��-1.832
Token
Token ID198
Feature activation-2.32e-01

Pos logit contributions
MENTS+0.491
SPONSORED+0.472
CAR+0.466
ERY+0.465
MENT+0.464
Neg logit contributions
iferation-0.471
fman-0.469
��-0.466
ogue-0.464
 Sinn-0.459
Token"
Token ID1
Feature activation+1.84e-01

Pos logit contributions
MENTS+0.412
SPONSORED+0.390
ERY+0.388
CAR+0.386
 disappointment+0.384
Neg logit contributions
fman-0.529
ogue-0.527
iferation-0.526
 Sinn-0.523
lus-0.514
TokenI
Token ID40
Feature activation-4.65e-02

Pos logit contributions
ogue+0.421
fman+0.420
 Sinn+0.419
iferation+0.418
lus+0.411
Neg logit contributions
MENTS-0.323
SPONSORED-0.304
ERY-0.303
 disappointment-0.301
CAR-0.300
Token think
Token ID892
Feature activation-3.63e-02

Pos logit contributions
MENTS+0.100
SPONSORED+0.096
ERY+0.096
CAR+0.095
MENT+0.095
Neg logit contributions
fman-0.088
iferation-0.088
ogue-0.088
 Sinn-0.087
��-0.085
TokenSte
Token ID7447
Feature activation-9.42e-02

Pos logit contributions
MENTS+0.421
SPONSORED+0.399
ERY+0.396
CAR+0.395
MENT+0.392
Neg logit contributions
fman-0.528
iferation-0.527
ogue-0.527
 Sinn-0.522
lus-0.512
Tokenlla
Token ID8466
Feature activation+2.35e-01

Pos logit contributions
MENTS+0.228
CAR+0.218
ERY+0.217
SPONSORED+0.217
MENT+0.217
Neg logit contributions
iferation-0.154
ogue-0.152
��-0.152
fman-0.151
 Sinn-0.151
Token-
Token ID12
Feature activation+1.59e-02

Pos logit contributions
ogue+0.441
 Sinn+0.441
fman+0.441
iferation+0.438
lus+0.427
Neg logit contributions
MENTS-0.512
SPONSORED-0.494
CAR-0.485
ERY-0.484
MENT-0.480
Tokensan
Token ID12807
Feature activation-4.74e-01

Pos logit contributions
fman+0.033
ogue+0.033
 Sinn+0.033
iferation+0.032
lus+0.032
Neg logit contributions
MENTS-0.032
SPONSORED-0.030
CAR-0.030
ERY-0.030
 disappointment-0.030
Token collapsed
Token ID14707
Feature activation+9.41e-02

Pos logit contributions
MENTS+1.014
SPONSORED+0.970
ERY+0.962
CAR+0.961
MENT+0.953
Neg logit contributions
fman-0.908
ogue-0.907
iferation-0.905
 Sinn-0.900
lus-0.878
Token?」
Token ID42943
Feature activation-9.76e-01

Pos logit contributions
fman+0.188
ogue+0.188
iferation+0.188
 Sinn+0.186
lus+0.182
Neg logit contributions
MENTS-0.193
SPONSORED-0.184
ERY-0.183
CAR-0.183
MENT-0.181
Token
Token ID198
Feature activation-1.80e-01

Pos logit contributions
MENTS+2.093
SPONSORED+2.002
ERY+1.989
CAR+1.985
MENT+1.976
Neg logit contributions
fman-1.861
iferation-1.853
ogue-1.850
 Sinn-1.824
��-1.795
Token
Token ID198
Feature activation-1.77e-01

Pos logit contributions
MENTS+0.374
SPONSORED+0.360
CAR+0.356
ERY+0.355
MENT+0.354
Neg logit contributions
iferation-0.358
fman-0.356
��-0.353
ogue-0.352
 Sinn-0.348
Token"
Token ID1
Feature activation+2.98e-01

Pos logit contributions
MENTS+0.299
SPONSORED+0.282
ERY+0.280
 disappointment+0.279
CAR+0.278
Neg logit contributions
fman-0.415
ogue-0.415
iferation-0.413
 Sinn-0.411
lus-0.405
TokenYes
Token ID5297
Feature activation-3.37e-01

Pos logit contributions
fman+0.688
ogue+0.687
 Sinn+0.686
iferation+0.682
lus+0.673
Neg logit contributions
MENTS-0.513
SPONSORED-0.485
 disappointment-0.480
ERY-0.480
CAR-0.476
Token.
Token ID13
Feature activation-1.28e-01

Pos logit contributions
MENTS+0.678
SPONSORED+0.646
ERY+0.642
CAR+0.641
MENT+0.636
Neg logit contributions
fman-0.687
ogue-0.687
iferation-0.685
 Sinn-0.681
lus-0.666
Token at
Token ID379
Feature activation-6.88e-02

Pos logit contributions
MENTS+0.722
SPONSORED+0.692
ERY+0.688
CAR+0.686
MENT+0.681
Neg logit contributions
ogue-0.577
fman-0.577
iferation-0.575
 Sinn-0.572
lus-0.557
Token left
Token ID1364
Feature activation-8.87e-02

Pos logit contributions
MENTS+0.137
SPONSORED+0.133
ERY+0.131
CAR+0.130
MENT+0.129
Neg logit contributions
ogue-0.142
fman-0.141
 Sinn-0.141
iferation-0.140
lus-0.138
Token guard
Token ID4860
Feature activation-1.02e-01

Pos logit contributions
MENTS+0.175
SPONSORED+0.167
CAR+0.164
ERY+0.164
MENT+0.162
Neg logit contributions
ogue-0.188
 Sinn-0.184
fman-0.184
iferation-0.184
lus-0.181
Token and
Token ID290
Feature activation-1.13e-01

Pos logit contributions
MENTS+0.225
SPONSORED+0.215
ERY+0.214
CAR+0.213
MENT+0.212
Neg logit contributions
ogue-0.189
fman-0.189
iferation-0.188
 Sinn-0.187
lus-0.183
Token got
Token ID1392
Feature activation+2.55e-02

Pos logit contributions
MENTS+0.237
SPONSORED+0.226
ERY+0.225
CAR+0.225
MENT+0.223
Neg logit contributions
fman-0.221
ogue-0.221
iferation-0.220
 Sinn-0.220
lus-0.214
Token snaps
Token ID23429
Feature activation-9.70e-01

Pos logit contributions
fman+0.048
ogue+0.048
 Sinn+0.047
iferation+0.047
 hers+0.046
Neg logit contributions
MENTS-0.056
SPONSORED-0.053
ERY-0.053
CAR-0.053
MENT-0.053
Token at
Token ID379
Feature activation-7.08e-02

Pos logit contributions
MENTS+1.988
SPONSORED+1.904
ERY+1.889
 disappointment+1.889
CAR+1.889
Neg logit contributions
fman-1.934
iferation-1.929
ogue-1.923
 Sinn-1.903
��-1.877
Token right
Token ID826
Feature activation-1.55e-01

Pos logit contributions
MENTS+0.149
SPONSORED+0.144
ERY+0.142
CAR+0.142
MENT+0.140
Neg logit contributions
ogue-0.138
 Sinn-0.138
fman-0.137
iferation-0.136
lus-0.135
Token guard
Token ID4860
Feature activation-1.70e-01

Pos logit contributions
MENTS+0.300
SPONSORED+0.286
CAR+0.282
ERY+0.282
MENT+0.280
Neg logit contributions
ogue-0.334
fman-0.329
 Sinn-0.328
iferation-0.327
 hers-0.321
Token as
Token ID355
Feature activation-1.91e-01

Pos logit contributions
MENTS+0.348
SPONSORED+0.332
ERY+0.330
CAR+0.329
MENT+0.327
Neg logit contributions
fman-0.341
ogue-0.340
iferation-0.339
 Sinn-0.337
lus-0.330
Token needed
Token ID2622
Feature activation-2.48e-01

Pos logit contributions
MENTS+0.402
SPONSORED+0.384
ERY+0.382
CAR+0.380
MENT+0.378
Neg logit contributions
fman-0.372
ogue-0.371
iferation-0.371
 Sinn-0.369
lus-0.361
Token methods
Token ID5050
Feature activation+8.48e-03

Pos logit contributions
ogue+0.002
fman+0.002
 Sinn+0.002
iferation+0.002
 hers+0.002
Neg logit contributions
MENTS-0.002
SPONSORED-0.002
ERY-0.002
CAR-0.002
MENT-0.002
Token to
Token ID284
Feature activation-2.33e-01

Pos logit contributions
fman+0.016
ogue+0.016
iferation+0.016
 Sinn+0.016
��+0.015
Neg logit contributions
MENTS-0.018
SPONSORED-0.018
ERY-0.017
CAR-0.017
MENT-0.017
Token measure
Token ID3953
Feature activation-1.40e-01

Pos logit contributions
MENTS+0.497
SPONSORED+0.476
ERY+0.473
CAR+0.472
MENT+0.469
Neg logit contributions
fman-0.447
iferation-0.446
ogue-0.446
 Sinn-0.441
��-0.432
Token weak
Token ID4939
Feature activation-1.98e-01

Pos logit contributions
MENTS+0.302
SPONSORED+0.289
ERY+0.288
CAR+0.287
MENT+0.285
Neg logit contributions
fman-0.264
ogue-0.264
iferation-0.263
 Sinn-0.263
 hers-0.256
Token gravitational
Token ID29973
Feature activation-8.52e-02

Pos logit contributions
MENTS+0.403
SPONSORED+0.384
ERY+0.382
CAR+0.381
MENT+0.378
Neg logit contributions
fman-0.399
ogue-0.398
iferation-0.398
 Sinn-0.394
lus-0.386
Token lens
Token ID10317
Feature activation-9.61e-01

Pos logit contributions
MENTS+0.220
SPONSORED+0.213
ERY+0.212
CAR+0.211
MENT+0.210
Neg logit contributions
iferation-0.125
 Sinn-0.124
ogue-0.123
fman-0.123
 hers-0.122
Tokening
Token ID278
Feature activation-9.93e-02

Pos logit contributions
MENTS+1.758
SPONSORED+1.665
CAR+1.657
ERY+1.655
 disappointment+1.647
Neg logit contributions
fman-2.129
ogue-2.126
iferation-2.116
 Sinn-2.102
��-2.094
Token,
Token ID11
Feature activation-2.26e-01

Pos logit contributions
MENTS+0.205
SPONSORED+0.195
ERY+0.195
CAR+0.193
MENT+0.192
Neg logit contributions
fman-0.197
iferation-0.197
ogue-0.197
 Sinn-0.196
 hers-0.191
Token another
Token ID1194
Feature activation-1.48e-01

Pos logit contributions
MENTS+0.456
SPONSORED+0.435
ERY+0.433
CAR+0.431
MENT+0.429
Neg logit contributions
fman-0.459
ogue-0.457
iferation-0.457
 Sinn-0.453
lus-0.443
Token way
Token ID835
Feature activation+1.34e-01

Pos logit contributions
MENTS+0.300
SPONSORED+0.286
ERY+0.283
CAR+0.283
MENT+0.281
Neg logit contributions
fman-0.302
ogue-0.301
iferation-0.300
 Sinn-0.299
 hers-0.291
Token in
Token ID287
Feature activation-1.10e-01

Pos logit contributions
ogue+0.245
fman+0.245
 Sinn+0.244
iferation+0.242
lus+0.236
Neg logit contributions
MENTS-0.297
ERY-0.285
SPONSORED-0.285
CAR-0.283
MENT-0.280
Token for
Token ID329
Feature activation-2.40e-01

Pos logit contributions
MENTS+0.912
SPONSORED+0.871
ERY+0.868
CAR+0.866
 disappointment+0.861
Neg logit contributions
fman-0.810
iferation-0.810
ogue-0.808
 Sinn-0.801
��-0.784
Token the
Token ID262
Feature activation-1.43e-01

Pos logit contributions
MENTS+0.599
SPONSORED+0.577
ERY+0.574
CAR+0.572
MENT+0.569
Neg logit contributions
fman-0.376
ogue-0.374
iferation-0.372
 Sinn-0.370
lus-0.359
Token init
Token ID2315
Feature activation+8.95e-02

Pos logit contributions
MENTS+0.296
SPONSORED+0.283
ERY+0.281
CAR+0.280
MENT+0.278
Neg logit contributions
fman-0.286
ogue-0.285
iferation-0.284
 Sinn-0.283
lus-0.276
Token binary
Token ID13934
Feature activation-2.85e-01

Pos logit contributions
ogue+0.179
fman+0.179
 Sinn+0.177
iferation+0.176
lus+0.174
Neg logit contributions
MENTS-0.185
SPONSORED-0.177
ERY-0.174
CAR-0.173
MENT-0.173
Token in
Token ID287
Feature activation-1.09e-01

Pos logit contributions
MENTS+0.650
SPONSORED+0.623
ERY+0.621
CAR+0.620
 disappointment+0.616
Neg logit contributions
iferation-0.502
fman-0.502
ogue-0.501
 Sinn-0.495
��-0.485
Token partition
Token ID18398
Feature activation-9.60e-01

Pos logit contributions
MENTS+0.325
SPONSORED+0.315
ERY+0.314
CAR+0.313
MENT+0.312
Neg logit contributions
fman-0.118
iferation-0.117
ogue-0.117
 Sinn-0.115
��-0.111
Token #
Token ID1303
Feature activation-4.37e-01

Pos logit contributions
MENTS+2.421
ERY+2.330
SPONSORED+2.326
CAR+2.325
MENT+2.306
Neg logit contributions
iferation-1.461
fman-1.449
ogue-1.446
 Sinn-1.428
��-1.393
Token25
Token ID1495
Feature activation-1.31e-01

Pos logit contributions
MENTS+0.791
SPONSORED+0.750
ERY+0.745
CAR+0.743
MENT+0.738
Neg logit contributions
fman-0.979
iferation-0.977
ogue-0.976
 Sinn-0.969
��-0.949
Token instead
Token ID2427
Feature activation-1.18e-01

Pos logit contributions
MENTS+0.276
SPONSORED+0.264
ERY+0.263
CAR+0.262
MENT+0.260
Neg logit contributions
fman-0.254
ogue-0.253
iferation-0.253
 Sinn-0.251
lus-0.245
Token of
Token ID286
Feature activation-1.41e-01

Pos logit contributions
MENTS+0.252
SPONSORED+0.240
ERY+0.239
CAR+0.238
MENT+0.237
Neg logit contributions
fman-0.228
ogue-0.227
iferation-0.227
 Sinn-0.225
lus-0.220
Token #
Token ID1303
Feature activation-4.69e-01

Pos logit contributions
MENTS+0.328
SPONSORED+0.315
ERY+0.313
CAR+0.312
MENT+0.310
Neg logit contributions
fman-0.245
ogue-0.244
iferation-0.243
 Sinn-0.242
lus-0.235
Token,
Token ID11
Feature activation+6.43e-02

Pos logit contributions
MENTS+0.561
SPONSORED+0.537
ERY+0.533
CAR+0.532
MENT+0.529
Neg logit contributions
fman-0.490
ogue-0.488
iferation-0.488
 Sinn-0.483
lus-0.472
Token and
Token ID290
Feature activation+3.30e-03

Pos logit contributions
fman+0.132
ogue+0.132
iferation+0.131
 Sinn+0.131
lus+0.128
Neg logit contributions
MENTS-0.129
SPONSORED-0.123
ERY-0.122
CAR-0.122
MENT-0.121
Token use
Token ID779
Feature activation+1.59e-01

Pos logit contributions
fman+0.006
ogue+0.006
iferation+0.006
 Sinn+0.006
lus+0.006
Neg logit contributions
MENTS-0.007
SPONSORED-0.007
ERY-0.007
CAR-0.007
MENT-0.007
Token the
Token ID262
Feature activation-2.37e-03

Pos logit contributions
fman+0.290
ogue+0.289
iferation+0.287
 Sinn+0.287
lus+0.280
Neg logit contributions
MENTS-0.355
SPONSORED-0.342
ERY-0.338
CAR-0.338
MENT-0.335
Token extra
Token ID3131
Feature activation+3.18e-01

Pos logit contributions
MENTS+0.005
SPONSORED+0.005
ERY+0.005
CAR+0.005
MENT+0.005
Neg logit contributions
fman-0.005
ogue-0.005
iferation-0.005
 Sinn-0.004
lus-0.004
Token lens
Token ID10317
Feature activation-9.57e-01

Pos logit contributions
ogue+0.618
fman+0.618
 Sinn+0.611
iferation+0.611
lus+0.598
Neg logit contributions
MENTS-0.669
SPONSORED-0.643
CAR-0.636
ERY-0.636
MENT-0.629
Token size
Token ID2546
Feature activation+2.91e-01

Pos logit contributions
MENTS+1.998
ERY+1.894
SPONSORED+1.884
MENT+1.884
CAR+1.884
Neg logit contributions
iferation-1.880
fman-1.870
ogue-1.865
 Sinn-1.847
��-1.822
Token to
Token ID284
Feature activation+3.38e-02

Pos logit contributions
fman+0.552
ogue+0.551
 Sinn+0.547
iferation+0.545
lus+0.534
Neg logit contributions
MENTS-0.626
SPONSORED-0.599
ERY-0.596
CAR-0.595
MENT-0.589
Token e
Token ID304
Feature activation+7.02e-02

Pos logit contributions
fman+0.067
ogue+0.067
iferation+0.066
 Sinn+0.066
lus+0.065
Neg logit contributions
MENTS-0.070
SPONSORED-0.067
ERY-0.067
CAR-0.066
MENT-0.066
Tokenke
Token ID365
Feature activation+4.54e-01

Pos logit contributions
fman+0.173
ogue+0.173
iferation+0.172
 Sinn+0.171
lus+0.169
Neg logit contributions
MENTS-0.112
SPONSORED-0.105
ERY-0.105
CAR-0.104
MENT-0.103
Token out
Token ID503
Feature activation+2.85e-02

Pos logit contributions
ogue+0.899
fman+0.896
 Sinn+0.892
iferation+0.885
 hers+0.871
Neg logit contributions
MENTS-0.933
SPONSORED-0.899
ERY-0.889
CAR-0.888
MENT-0.878
Tokenworthy
Token ID18275
Feature activation+2.37e-01

Pos logit contributions
fman+0.118
ogue+0.116
 Sinn+0.115
iferation+0.115
lus+0.112
Neg logit contributions
MENTS-0.245
SPONSORED-0.239
ERY-0.237
CAR-0.237
 disappointment-0.235
Token and
Token ID290
Feature activation-4.40e-02

Pos logit contributions
fman+0.496
ogue+0.494
iferation+0.494
 Sinn+0.493
lus+0.479
Neg logit contributions
MENTS-0.464
SPONSORED-0.439
ERY-0.437
CAR-0.434
 disappointment-0.431
Token
Token ID564
Feature activation-2.86e-02

Pos logit contributions
MENTS+0.091
SPONSORED+0.086
ERY+0.086
CAR+0.086
MENT+0.085
Neg logit contributions
fman-0.088
iferation-0.087
ogue-0.087
 Sinn-0.087
 hers-0.085
Token
Token ID250
Feature activation-1.23e-02

Pos logit contributions
MENTS+0.060
ERY+0.057
SPONSORED+0.057
CAR+0.057
MENT+0.057
Neg logit contributions
fman-0.055
ogue-0.055
iferation-0.055
 Sinn-0.055
��-0.054
Tokenflu
Token ID35522
Feature activation-3.45e-01

Pos logit contributions
MENTS+0.025
SPONSORED+0.024
ERY+0.024
CAR+0.023
 disappointment+0.023
Neg logit contributions
fman-0.025
ogue-0.025
iferation-0.025
 Sinn-0.025
lus-0.024
Tokenid
Token ID312
Feature activation-9.56e-01

Pos logit contributions
MENTS+0.765
SPONSORED+0.728
CAR+0.726
ERY+0.723
MENT+0.720
Neg logit contributions
ogue-0.637
fman-0.634
iferation-0.631
 Sinn-0.630
lus-0.615
Token with
Token ID351
Feature activation-6.98e-02

Pos logit contributions
MENTS+1.712
SPONSORED+1.627
ERY+1.611
CAR+1.608
MENT+1.598
Neg logit contributions
fman-2.159
ogue-2.152
iferation-2.151
 Sinn-2.134
��-2.098
Token the
Token ID262
Feature activation-7.61e-02

Pos logit contributions
MENTS+0.160
SPONSORED+0.154
ERY+0.152
CAR+0.152
MENT+0.151
Neg logit contributions
fman-0.123
iferation-0.123
ogue-0.123
 Sinn-0.123
 hers-0.119
Token truth
Token ID3872
Feature activation-8.23e-02

Pos logit contributions
MENTS+0.178
SPONSORED+0.171
ERY+0.170
CAR+0.170
MENT+0.169
Neg logit contributions
fman-0.130
ogue-0.130
iferation-0.130
 Sinn-0.128
��-0.125
Token.
Token ID13
Feature activation-1.09e-01

Pos logit contributions
MENTS+0.155
SPONSORED+0.147
ERY+0.147
CAR+0.146
 disappointment+0.145
Neg logit contributions
fman-0.179
iferation-0.178
ogue-0.178
 Sinn-0.177
lus-0.173
Token
Token ID447
Feature activation-1.17e-01

Pos logit contributions
MENTS+0.225
SPONSORED+0.214
ERY+0.213
CAR+0.212
 disappointment+0.211
Neg logit contributions
 Sinn-0.217
fman-0.215
ogue-0.214
iferation-0.214
lus-0.209
Tokenua
Token ID6413
Feature activation+9.19e-03

Pos logit contributions
MENTS+1.040
SPONSORED+0.999
ERY+0.991
CAR+0.987
MENT+0.983
Neg logit contributions
iferation-0.764
ogue-0.762
fman-0.761
 Sinn-0.749
lus-0.739
Token Fighter
Token ID15411
Feature activation+9.50e-02

Pos logit contributions
 Sinn+0.014
fman+0.014
ogue+0.014
iferation+0.014
 hers+0.013
Neg logit contributions
MENTS-0.024
SPONSORED-0.022
ERY-0.022
MENT-0.022
 disappointment-0.022
Token or
Token ID393
Feature activation+8.41e-02

Pos logit contributions
ogue+0.176
fman+0.175
 Sinn+0.175
iferation+0.175
lus+0.170
Neg logit contributions
MENTS-0.210
SPONSORED-0.201
ERY-0.199
CAR-0.198
MENT-0.198
Token games
Token ID1830
Feature activation-3.31e-01

Pos logit contributions
fman+0.148
 Sinn+0.147
ogue+0.147
iferation+0.147
 hers+0.143
Neg logit contributions
MENTS-0.193
SPONSORED-0.186
ERY-0.184
CAR-0.183
MENT-0.182
Token like
Token ID588
Feature activation+9.94e-02

Pos logit contributions
MENTS+0.669
SPONSORED+0.638
ERY+0.634
CAR+0.632
MENT+0.628
Neg logit contributions
fman-0.674
iferation-0.672
ogue-0.672
 Sinn-0.666
lus-0.651
Token Gundam
Token ID32467
Feature activation-9.54e-01

Pos logit contributions
fman+0.176
 Sinn+0.175
ogue+0.175
iferation+0.174
 hers+0.170
Neg logit contributions
MENTS-0.226
SPONSORED-0.220
ERY-0.217
CAR-0.216
MENT-0.214
Token and
Token ID290
Feature activation+6.89e-02

Pos logit contributions
MENTS+2.076
CAR+1.994
ERY+1.989
 disappointment+1.966
SPONSORED+1.961
Neg logit contributions
fman-1.779
iferation-1.771
ogue-1.749
 Sinn-1.718
��-1.706
Token stuff
Token ID3404
Feature activation-3.17e-01

Pos logit contributions
fman+0.127
ogue+0.126
 Sinn+0.126
iferation+0.126
lus+0.122
Neg logit contributions
MENTS-0.152
SPONSORED-0.146
ERY-0.145
CAR-0.144
MENT-0.144
Token like
Token ID588
Feature activation+3.27e-02

Pos logit contributions
MENTS+0.638
SPONSORED+0.609
ERY+0.605
CAR+0.603
MENT+0.599
Neg logit contributions
fman-0.645
iferation-0.643
ogue-0.643
 Sinn-0.637
��-0.624
Token that
Token ID326
Feature activation-8.98e-02

Pos logit contributions
 Sinn+0.066
fman+0.066
ogue+0.066
iferation+0.066
 hers+0.064
Neg logit contributions
MENTS-0.066
SPONSORED-0.064
ERY-0.063
CAR-0.063
MENT-0.062
Token are
Token ID389
Feature activation+3.81e-02

Pos logit contributions
MENTS+0.181
SPONSORED+0.172
ERY+0.172
CAR+0.171
MENT+0.170
Neg logit contributions
fman-0.183
ogue-0.183
iferation-0.182
 Sinn-0.181
lus-0.177
Token,
Token ID11
Feature activation-6.50e-02

Pos logit contributions
MENTS+0.134
SPONSORED+0.128
ERY+0.128
CAR+0.127
MENT+0.127
Neg logit contributions
fman-0.116
ogue-0.116
iferation-0.115
 Sinn-0.115
 hers-0.112
Token partly
Token ID11476
Feature activation-7.39e-02

Pos logit contributions
MENTS+0.117
SPONSORED+0.111
ERY+0.110
CAR+0.110
MENT+0.109
Neg logit contributions
fman-0.147
ogue-0.146
iferation-0.146
 Sinn-0.145
lus-0.142
Token when
Token ID618
Feature activation-3.12e-01

Pos logit contributions
MENTS+0.168
SPONSORED+0.161
ERY+0.161
CAR+0.160
MENT+0.159
Neg logit contributions
fman-0.131
ogue-0.131
iferation-0.130
 Sinn-0.129
lus-0.126
Token the
Token ID262
Feature activation-1.75e-02

Pos logit contributions
MENTS+0.707
SPONSORED+0.678
ERY+0.675
CAR+0.673
MENT+0.669
Neg logit contributions
fman-0.558
ogue-0.556
iferation-0.556
 Sinn-0.550
lus-0.536
Token plac
Token ID21957
Feature activation+1.80e-02

Pos logit contributions
MENTS+0.036
SPONSORED+0.035
ERY+0.035
CAR+0.035
MENT+0.034
Neg logit contributions
fman-0.035
ogue-0.034
iferation-0.034
 Sinn-0.034
lus-0.033
Tokenid
Token ID312
Feature activation-9.49e-01

Pos logit contributions
ogue+0.035
fman+0.035
iferation+0.034
 Sinn+0.034
lus+0.034
Neg logit contributions
MENTS-0.039
SPONSORED-0.037
ERY-0.036
CAR-0.036
 disappointment-0.036
Token Qu
Token ID2264
Feature activation-1.24e-01

Pos logit contributions
MENTS+2.053
SPONSORED+1.966
 disappointment+1.958
ERY+1.955
CAR+1.943
Neg logit contributions
fman-1.787
iferation-1.777
ogue-1.763
��-1.739
 Sinn-1.738
Tokenij
Token ID2926
Feature activation+1.87e-02

Pos logit contributions
MENTS+0.271
SPONSORED+0.260
ERY+0.258
CAR+0.258
MENT+0.256
Neg logit contributions
fman-0.230
iferation-0.230
ogue-0.229
 Sinn-0.228
��-0.223
Tokenano
Token ID5733
Feature activation+2.58e-02

Pos logit contributions
iferation+0.048
 Sinn+0.048
fman+0.048
ogue+0.048
��+0.047
Neg logit contributions
MENTS-0.027
SPONSORED-0.026
ERY-0.025
CAR-0.025
MENT-0.025
Token failed
Token ID4054
Feature activation+2.19e-01

Pos logit contributions
fman+0.051
ogue+0.051
iferation+0.050
 Sinn+0.050
lus+0.049
Neg logit contributions
MENTS-0.054
SPONSORED-0.051
ERY-0.051
CAR-0.051
MENT-0.051
Token to
Token ID284
Feature activation-2.71e-02

Pos logit contributions
fman+0.387
ogue+0.383
 Sinn+0.383
iferation+0.381
 hers+0.375
Neg logit contributions
MENTS-0.501
SPONSORED-0.479
ERY-0.477
CAR-0.477
MENT-0.472
Token BIOS
Token ID27553
Feature activation-2.98e-01

Pos logit contributions
MENTS+0.401
SPONSORED+0.383
ERY+0.380
CAR+0.379
MENT+0.377
Neg logit contributions
fman-0.400
ogue-0.399
iferation-0.398
 Sinn-0.396
lus-0.387
Token default
Token ID4277
Feature activation-1.74e-02

Pos logit contributions
MENTS+0.568
SPONSORED+0.538
ERY+0.534
CAR+0.531
MENT+0.528
Neg logit contributions
ogue-0.644
fman-0.643
 Sinn-0.639
iferation-0.636
lus-0.624
Token.
Token ID13
Feature activation-2.30e-01

Pos logit contributions
MENTS+0.034
SPONSORED+0.032
ERY+0.032
CAR+0.032
MENT+0.031
Neg logit contributions
ogue-0.037
fman-0.037
 Sinn-0.037
iferation-0.037
lus-0.036
Token With
Token ID2080
Feature activation-2.15e-01

Pos logit contributions
MENTS+0.518
SPONSORED+0.496
ERY+0.494
CAR+0.492
MENT+0.489
Neg logit contributions
fman-0.415
ogue-0.413
iferation-0.413
 Sinn-0.410
lus-0.399
Token two
Token ID734
Feature activation-4.72e-01

Pos logit contributions
MENTS+0.458
SPONSORED+0.439
ERY+0.436
CAR+0.435
MENT+0.432
Neg logit contributions
fman-0.414
ogue-0.413
iferation-0.412
 Sinn-0.410
lus-0.401
Token cards
Token ID4116
Feature activation-9.45e-01

Pos logit contributions
MENTS+0.948
SPONSORED+0.903
ERY+0.898
CAR+0.896
MENT+0.890
Neg logit contributions
fman-0.967
iferation-0.965
ogue-0.963
 Sinn-0.954
��-0.935
Token in
Token ID287
Feature activation-2.15e-01

Pos logit contributions
MENTS+1.939
SPONSORED+1.857
ERY+1.842
CAR+1.840
MENT+1.828
Neg logit contributions
iferation-1.887
fman-1.883
ogue-1.873
 Sinn-1.851
��-1.828
Token close
Token ID1969
Feature activation-3.12e-01

Pos logit contributions
MENTS+0.424
SPONSORED+0.404
ERY+0.401
CAR+0.400
MENT+0.397
Neg logit contributions
fman-0.447
ogue-0.445
iferation-0.445
 Sinn-0.441
lus-0.432
Token proximity
Token ID20387
Feature activation+1.20e-01

Pos logit contributions
MENTS+0.636
SPONSORED+0.607
ERY+0.603
CAR+0.602
MENT+0.598
Neg logit contributions
fman-0.628
ogue-0.627
iferation-0.626
 Sinn-0.621
lus-0.607
Token we
Token ID356
Feature activation-1.70e-01

Pos logit contributions
fman+0.225
ogue+0.225
iferation+0.223
 Sinn+0.223
lus+0.217
Neg logit contributions
MENTS-0.261
SPONSORED-0.250
ERY-0.248
CAR-0.248
MENT-0.246
Token want
Token ID765
Feature activation+3.60e-02

Pos logit contributions
MENTS+0.357
SPONSORED+0.341
ERY+0.340
CAR+0.338
MENT+0.337
Neg logit contributions
fman-0.331
ogue-0.331
iferation-0.331
 Sinn-0.327
��-0.321
Tokeninit
Token ID15003
Feature activation-2.87e-02

Pos logit contributions
MENTS+0.148
SPONSORED+0.142
ERY+0.141
CAR+0.140
MENT+0.139
Neg logit contributions
fman-0.139
ogue-0.138
iferation-0.137
 Sinn-0.136
lus-0.134
Token from
Token ID422
Feature activation-9.26e-02

Pos logit contributions
MENTS+0.060
SPONSORED+0.057
ERY+0.057
CAR+0.057
MENT+0.056
Neg logit contributions
ogue-0.057
fman-0.057
iferation-0.056
 Sinn-0.056
lus-0.055
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
MENTS+0.265
SPONSORED+0.256
ERY+0.255
CAR+0.254
MENT+0.253
Neg logit contributions
fman-0.111
ogue-0.110
iferation-0.110
 Sinn-0.108
lus-0.104
Token 25
Token ID1679
Feature activation-6.67e-02

Pos logit contributions
MENTS+0.237
SPONSORED+0.226
ERY+0.225
CAR+0.224
MENT+0.223
Neg logit contributions
fman-0.242
iferation-0.242
ogue-0.242
 Sinn-0.239
��-0.234
Tokenth
Token ID400
Feature activation-9.38e-02

Pos logit contributions
MENTS+0.148
SPONSORED+0.142
ERY+0.141
CAR+0.140
MENT+0.140
Neg logit contributions
fman-0.123
ogue-0.122
iferation-0.122
 Sinn-0.121
lus-0.118
Token partition
Token ID18398
Feature activation-9.43e-01

Pos logit contributions
MENTS+0.197
SPONSORED+0.188
ERY+0.187
CAR+0.186
MENT+0.185
Neg logit contributions
fman-0.184
ogue-0.183
iferation-0.183
 Sinn-0.182
lus-0.177
Token (/
Token ID50247
Feature activation+2.98e-01

Pos logit contributions
MENTS+1.956
SPONSORED+1.880
ERY+1.866
CAR+1.864
MENT+1.854
Neg logit contributions
iferation-1.853
fman-1.830
ogue-1.829
 Sinn-1.807
��-1.794
Tokendev
Token ID7959
Feature activation-2.48e-01

Pos logit contributions
fman+0.495
ogue+0.490
 Sinn+0.487
iferation+0.487
lus+0.478
Neg logit contributions
MENTS-0.711
ERY-0.681
SPONSORED-0.681
 disappointment-0.676
CAR-0.674
Token/
Token ID14
Feature activation+8.94e-02

Pos logit contributions
MENTS+0.605
SPONSORED+0.581
ERY+0.578
CAR+0.576
MENT+0.574
Neg logit contributions
fman-0.403
ogue-0.401
iferation-0.398
 Sinn-0.396
lus-0.385
Tokenmm
Token ID3020
Feature activation-2.70e-01

Pos logit contributions
fman+0.118
ogue+0.115
 Sinn+0.114
iferation+0.114
lus+0.111
Neg logit contributions
MENTS-0.247
SPONSORED-0.238
ERY-0.237
 disappointment-0.236
MENT-0.235
Tokenc
Token ID66
Feature activation-3.41e-02

Pos logit contributions
MENTS+0.496
CAR+0.470
SPONSORED+0.469
ERY+0.469
 disappointment+0.465
Neg logit contributions
iferation-0.598
fman-0.597
ogue-0.593
 Sinn-0.588
��-0.579
Token just
Token ID655
Feature activation-5.23e-02

Pos logit contributions
MENTS+0.257
SPONSORED+0.246
ERY+0.244
CAR+0.243
MENT+0.242
Neg logit contributions
fman-0.254
ogue-0.253
iferation-0.252
 Sinn-0.250
lus-0.245
Token after
Token ID706
Feature activation-4.27e-02

Pos logit contributions
MENTS+0.111
SPONSORED+0.106
ERY+0.105
CAR+0.105
 disappointment+0.105
Neg logit contributions
fman-0.101
iferation-0.101
ogue-0.101
 Sinn-0.100
��-0.098
Token mounting
Token ID17260
Feature activation+9.37e-02

Pos logit contributions
MENTS+0.092
SPONSORED+0.088
ERY+0.087
CAR+0.087
MENT+0.087
Neg logit contributions
fman-0.082
ogue-0.081
 Sinn-0.081
iferation-0.080
lus-0.079
Token the
Token ID262
Feature activation-7.52e-02

Pos logit contributions
fman+0.167
ogue+0.166
 Sinn+0.164
iferation+0.164
lus+0.160
Neg logit contributions
MENTS-0.213
SPONSORED-0.205
ERY-0.204
CAR-0.203
MENT-0.202
Token boot
Token ID6297
Feature activation-4.23e-01

Pos logit contributions
MENTS+0.147
SPONSORED+0.141
ERY+0.140
CAR+0.139
MENT+0.138
Neg logit contributions
fman-0.158
ogue-0.157
 Sinn-0.156
iferation-0.156
lus-0.152
Token partition
Token ID18398
Feature activation-9.37e-01

Pos logit contributions
MENTS+0.854
SPONSORED+0.816
ERY+0.810
CAR+0.807
MENT+0.802
Neg logit contributions
fman-0.863
ogue-0.861
iferation-0.857
 Sinn-0.852
lus-0.834
Token,
Token ID11
Feature activation-5.10e-02

Pos logit contributions
MENTS+1.885
ERY+1.794
SPONSORED+1.792
CAR+1.786
 disappointment+1.777
Neg logit contributions
iferation-1.908
ogue-1.886
fman-1.885
 Sinn-1.868
��-1.845
Token forcing
Token ID10833
Feature activation-2.52e-01

Pos logit contributions
MENTS+0.104
SPONSORED+0.099
ERY+0.099
CAR+0.098
MENT+0.098
Neg logit contributions
fman-0.103
iferation-0.103
ogue-0.103
 Sinn-0.102
��-0.100
Token a
Token ID257
Feature activation-1.98e-01

Pos logit contributions
MENTS+0.587
SPONSORED+0.563
ERY+0.560
CAR+0.559
MENT+0.556
Neg logit contributions
fman-0.434
ogue-0.432
iferation-0.432
 Sinn-0.427
lus-0.416
Token reboot
Token ID20149
Feature activation-1.70e-01

Pos logit contributions
MENTS+0.490
SPONSORED+0.471
ERY+0.469
CAR+0.468
MENT+0.466
Neg logit contributions
fman-0.312
iferation-0.311
ogue-0.310
 Sinn-0.307
��-0.299
Token.
Token ID13
Feature activation-9.86e-02

Pos logit contributions
MENTS+0.330
SPONSORED+0.314
ERY+0.312
CAR+0.311
MENT+0.309
Neg logit contributions
fman-0.359
ogue-0.358
iferation-0.357
 Sinn-0.354
lus-0.347
Token perspective
Token ID6650
Feature activation-5.06e-01

Pos logit contributions
MENTS+0.513
SPONSORED+0.492
ERY+0.491
CAR+0.489
 disappointment+0.486
Neg logit contributions
fman-0.391
iferation-0.390
ogue-0.388
 Sinn-0.385
��-0.376
Token from
Token ID422
Feature activation-2.15e-02

Pos logit contributions
MENTS+1.020
SPONSORED+0.973
ERY+0.968
CAR+0.966
 disappointment+0.961
Neg logit contributions
fman-1.025
iferation-1.024
ogue-1.019
 Sinn-1.013
��-0.996
Token a
Token ID257
Feature activation-1.02e-01

Pos logit contributions
MENTS+0.045
SPONSORED+0.043
ERY+0.042
CAR+0.042
MENT+0.042
Neg logit contributions
ogue-0.043
fman-0.043
 Sinn-0.042
iferation-0.042
 hers-0.041
Token very
Token ID845
Feature activation+1.08e-01

Pos logit contributions
MENTS+0.211
SPONSORED+0.202
ERY+0.200
CAR+0.199
MENT+0.198
Neg logit contributions
ogue-0.204
fman-0.204
 Sinn-0.203
iferation-0.202
lus-0.198
Token small
Token ID1402
Feature activation+8.79e-02

Pos logit contributions
fman+0.207
ogue+0.207
iferation+0.205
 Sinn+0.205
lus+0.200
Neg logit contributions
MENTS-0.229
SPONSORED-0.219
ERY-0.218
CAR-0.217
MENT-0.216
Token lens
Token ID10317
Feature activation-9.35e-01

Pos logit contributions
ogue+0.184
fman+0.183
iferation+0.182
 Sinn+0.181
lus+0.178
Neg logit contributions
MENTS-0.173
SPONSORED-0.165
ERY-0.163
CAR-0.163
MENT-0.162
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
MENTS+1.805
SPONSORED+1.716
ERY+1.709
CAR+1.706
 disappointment+1.705
Neg logit contributions
iferation-1.976
fman-1.969
ogue-1.957
 Sinn-1.949
��-1.921
Token Image
Token ID7412
Feature activation-3.22e-01

Pos logit contributions
MENTS+0.248
SPONSORED+0.237
ERY+0.236
CAR+0.235
MENT+0.234
Neg logit contributions
fman-0.210
iferation-0.209
ogue-0.209
 Sinn-0.207
��-0.202
Token via
Token ID2884
Feature activation+1.15e-01

Pos logit contributions
MENTS+0.590
SPONSORED+0.562
ERY+0.556
CAR+0.554
MENT+0.550
Neg logit contributions
fman-0.715
ogue-0.713
iferation-0.712
 Sinn-0.708
lus-0.693
Token Pan
Token ID5961
Feature activation-4.21e-01

Pos logit contributions
ogue+0.210
fman+0.210
 Sinn+0.210
iferation+0.208
 hers+0.204
Neg logit contributions
MENTS-0.254
SPONSORED-0.245
ERY-0.242
CAR-0.241
MENT-0.239
Tokenor
Token ID273
Feature activation-1.31e-01

Pos logit contributions
MENTS+0.767
SPONSORED+0.726
ERY+0.719
CAR+0.717
MENT+0.713
Neg logit contributions
ogue-0.944
fman-0.942
iferation-0.935
 Sinn-0.930
lus-0.914
Token less
Token ID1342
Feature activation-4.80e-01

Pos logit contributions
fman+0.044
iferation+0.044
ogue+0.044
��+0.044
 Sinn+0.043
Neg logit contributions
MENTS-0.052
SPONSORED-0.050
 disappointment-0.050
MENT-0.050
ERY-0.050
Token mirror
Token ID10162
Feature activation-7.78e-02

Pos logit contributions
MENTS+0.989
SPONSORED+0.947
ERY+0.939
CAR+0.937
 disappointment+0.933
Neg logit contributions
fman-0.954
iferation-0.953
ogue-0.949
 Sinn-0.938
��-0.922
Tokening
Token ID278
Feature activation+4.95e-02

Pos logit contributions
MENTS+0.136
SPONSORED+0.128
ERY+0.128
CAR+0.126
MENT+0.126
Neg logit contributions
fman-0.181
ogue-0.180
 Sinn-0.179
iferation-0.178
 hers-0.175
Token the
Token ID262
Feature activation+3.01e-02

Pos logit contributions
fman+0.088
ogue+0.088
 Sinn+0.087
iferation+0.087
 hers+0.085
Neg logit contributions
MENTS-0.112
ERY-0.108
SPONSORED-0.108
CAR-0.107
MENT-0.106
Token Rab
Token ID28559
Feature activation+4.11e-01

Pos logit contributions
fman+0.060
ogue+0.059
iferation+0.059
 Sinn+0.059
lus+0.057
Neg logit contributions
MENTS-0.063
SPONSORED-0.060
ERY-0.060
CAR-0.059
MENT-0.059
Tokenid
Token ID312
Feature activation-9.34e-01

Pos logit contributions
ogue+0.741
fman+0.732
 Sinn+0.729
iferation+0.728
ouf+0.715
Neg logit contributions
MENTS-0.937
SPONSORED-0.897
ERY-0.890
MENT-0.885
CAR-0.881
Token Pupp
Token ID20926
Feature activation+1.56e-01

Pos logit contributions
MENTS+2.449
CAR+2.373
SPONSORED+2.357
 disappointment+2.350
MENT+2.342
Neg logit contributions
fman-1.326
iferation-1.310
ogue-1.287
 Sinn-1.283
��-1.268
Tokenies
Token ID444
Feature activation+3.72e-02

Pos logit contributions
fman+0.277
ogue+0.276
iferation+0.275
 Sinn+0.272
lus+0.266
Neg logit contributions
MENTS-0.357
SPONSORED-0.343
ERY-0.341
CAR-0.340
MENT-0.337
Token slate
Token ID27589
Feature activation-5.35e-02

Pos logit contributions
fman+0.076
ogue+0.076
iferation+0.076
 Sinn+0.076
lus+0.073
Neg logit contributions
MENTS-0.075
SPONSORED-0.071
ERY-0.071
CAR-0.070
MENT-0.070
Token.
Token ID13
Feature activation+1.49e-01

Pos logit contributions
MENTS+0.111
SPONSORED+0.106
ERY+0.106
CAR+0.105
MENT+0.105
Neg logit contributions
ogue-0.105
fman-0.105
 Sinn-0.105
iferation-0.105
 hers-0.102
Token Essentially
Token ID34039
Feature activation+1.08e-01

Pos logit contributions
fman+0.272
ogue+0.271
iferation+0.270
 Sinn+0.270
lus+0.262
Neg logit contributions
MENTS-0.331
SPONSORED-0.317
ERY-0.316
CAR-0.314
MENT-0.313
Token my
Token ID616
Feature activation+1.92e-01

Pos logit contributions
fman+0.250
ogue+0.249
iferation+0.249
 Sinn+0.247
lus+0.240
Neg logit contributions
MENTS-0.372
SPONSORED-0.358
ERY-0.356
CAR-0.355
MENT-0.353
Token life
Token ID1204
Feature activation+1.46e-01

Pos logit contributions
fman+0.311
iferation+0.310
ogue+0.306
 Sinn+0.304
��+0.302
Neg logit contributions
MENTS-0.469
SPONSORED-0.451
ERY-0.450
 disappointment-0.450
CAR-0.449
Token,
Token ID11
Feature activation+6.56e-02

Pos logit contributions
ogue+0.339
fman+0.338
iferation+0.337
 Sinn+0.336
 hers+0.330
Neg logit contributions
MENTS-0.257
SPONSORED-0.243
ERY-0.241
CAR-0.239
MENT-0.237
Token and
Token ID290
Feature activation+5.70e-02

Pos logit contributions
fman+0.129
ogue+0.129
iferation+0.128
 Sinn+0.128
 hers+0.125
Neg logit contributions
MENTS-0.137
SPONSORED-0.131
ERY-0.130
CAR-0.129
MENT-0.128
Token I
Token ID314
Feature activation+1.40e-02

Pos logit contributions
fman+0.102
ogue+0.102
iferation+0.102
 Sinn+0.101
 hers+0.098
Neg logit contributions
MENTS-0.129
SPONSORED-0.123
ERY-0.123
CAR-0.123
MENT-0.122
Token hope
Token ID2911
Feature activation+7.54e-01

Pos logit contributions
ogue+0.025
iferation+0.025
fman+0.025
 Sinn+0.025
��+0.025
Neg logit contributions
MENTS-0.031
SPONSORED-0.030
ERY-0.030
CAR-0.030
 disappointment-0.029
Token I
Token ID314
Feature activation+1.28e-02

Pos logit contributions
ogue+1.378
 Sinn+1.372
fman+1.361
iferation+1.353
 hers+1.352
Neg logit contributions
MENTS-1.680
SPONSORED-1.606
CAR-1.591
ERY-1.587
MENT-1.579
Token've
Token ID1053
Feature activation-1.61e-02

Pos logit contributions
fman+0.024
ogue+0.024
iferation+0.024
 Sinn+0.023
��+0.023
Neg logit contributions
MENTS-0.028
SPONSORED-0.027
ERY-0.027
CAR-0.027
MENT-0.027
Token made
Token ID925
Feature activation-1.89e-02

Pos logit contributions
MENTS+0.032
SPONSORED+0.031
ERY+0.031
CAR+0.030
MENT+0.030
Neg logit contributions
fman-0.033
iferation-0.033
ogue-0.033
 Sinn-0.032
��-0.032
Token you
Token ID345
Feature activation+1.03e-01

Pos logit contributions
MENTS+0.040
SPONSORED+0.038
ERY+0.038
CAR+0.038
 disappointment+0.038
Neg logit contributions
fman-0.036
iferation-0.036
ogue-0.036
 Sinn-0.036
��-0.035
Token proud
Token ID6613
Feature activation-2.54e-02

Pos logit contributions
fman+0.182
iferation+0.182
ogue+0.182
 Sinn+0.180
��+0.176
Neg logit contributions
MENTS-0.237
SPONSORED-0.227
ERY-0.226
CAR-0.226
MENT-0.224
Token pro
Token ID386
Feature activation+1.47e-01

Pos logit contributions
fman+0.044
 Sinn+0.044
ogue+0.044
iferation+0.043
lus+0.042
Neg logit contributions
MENTS-0.052
SPONSORED-0.050
ERY-0.050
CAR-0.049
MENT-0.049
Token-
Token ID12
Feature activation+8.25e-03

Pos logit contributions
fman+0.271
ogue+0.269
iferation+0.269
 Sinn+0.267
lus+0.261
Neg logit contributions
MENTS-0.325
SPONSORED-0.312
ERY-0.310
CAR-0.308
MENT-0.307
Tokencor
Token ID10215
Feature activation+8.63e-02

Pos logit contributions
fman+0.017
ogue+0.017
iferation+0.017
 Sinn+0.017
lus+0.016
Neg logit contributions
MENTS-0.017
SPONSORED-0.016
ERY-0.016
CAR-0.016
 disappointment-0.016
Tokenporate
Token ID38133
Feature activation+1.40e-01

Pos logit contributions
fman+0.168
iferation+0.168
ogue+0.168
 Sinn+0.166
lus+0.162
Neg logit contributions
MENTS-0.182
SPONSORED-0.174
ERY-0.173
CAR-0.173
 disappointment-0.171
Token arbit
Token ID9277
Feature activation-4.98e-03

Pos logit contributions
fman+0.286
 Sinn+0.286
ogue+0.285
iferation+0.284
lus+0.277
Neg logit contributions
MENTS-0.281
ERY-0.267
SPONSORED-0.267
CAR-0.266
MENT-0.263
Tokenrators
Token ID18942
Feature activation+8.57e-01

Pos logit contributions
MENTS+0.011
SPONSORED+0.011
ERY+0.011
CAR+0.011
 disappointment+0.010
Neg logit contributions
fman-0.009
ogue-0.009
 Sinn-0.009
iferation-0.009
lus-0.009
Token.
Token ID13
Feature activation-5.06e-03

Pos logit contributions
 Sinn+1.761
fman+1.744
ogue+1.727
iferation+1.710
 hers+1.687
Neg logit contributions
MENTS-1.704
ERY-1.619
SPONSORED-1.596
 disappointment-1.594
MENT-1.594
Token
Token ID198
Feature activation-6.92e-03

Pos logit contributions
MENTS+0.011
SPONSORED+0.011
ERY+0.011
CAR+0.011
 disappointment+0.011
Neg logit contributions
fman-0.009
 Sinn-0.009
ogue-0.009
iferation-0.009
lus-0.009
Token
Token ID198
Feature activation+4.65e-02

Pos logit contributions
MENTS+0.015
SPONSORED+0.014
CAR+0.014
ERY+0.014
MENT+0.014
Neg logit contributions
iferation-0.013
fman-0.013
ogue-0.013
 Sinn-0.013
��-0.013
TokenA
Token ID32
Feature activation+1.42e-01

Pos logit contributions
fman+0.108
ogue+0.108
iferation+0.107
 Sinn+0.106
lus+0.105
Neg logit contributions
MENTS-0.080
 disappointment-0.075
ERY-0.075
SPONSORED-0.075
CAR-0.074
Token separate
Token ID4553
Feature activation+1.75e-01

Pos logit contributions
fman+0.283
ogue+0.282
iferation+0.280
 Sinn+0.279
lus+0.272
Neg logit contributions
MENTS-0.295
SPONSORED-0.283
ERY-0.279
CAR-0.278
MENT-0.277
Token last
Token ID938
Feature activation-2.25e-01

Pos logit contributions
MENTS+0.019
SPONSORED+0.018
ERY+0.018
CAR+0.018
MENT+0.018
Neg logit contributions
fman-0.017
iferation-0.017
ogue-0.017
 Sinn-0.016
��-0.016
Token Zelda
Token ID22166
Feature activation+1.08e-01

Pos logit contributions
MENTS+0.498
SPONSORED+0.477
ERY+0.474
CAR+0.473
MENT+0.470
Neg logit contributions
fman-0.413
ogue-0.412
iferation-0.410
 Sinn-0.409
 hers-0.397
Token O
Token ID440
Feature activation+9.44e-02

Pos logit contributions
fman+0.232
ogue+0.232
 Sinn+0.230
iferation+0.229
lus+0.224
Neg logit contributions
MENTS-0.208
SPONSORED-0.199
ERY-0.197
CAR-0.196
MENT-0.194
TokenST
Token ID2257
Feature activation-2.02e-01

Pos logit contributions
ogue+0.240
fman+0.240
 Sinn+0.237
iferation+0.237
lus+0.234
Neg logit contributions
MENTS-0.142
SPONSORED-0.134
ERY-0.132
 disappointment-0.129
MENT-0.126
Token that
Token ID326
Feature activation-6.59e-02

Pos logit contributions
MENTS+0.420
SPONSORED+0.401
ERY+0.399
CAR+0.398
 disappointment+0.397
Neg logit contributions
iferation-0.398
fman-0.398
ogue-0.396
 Sinn-0.393
��-0.386
Token Ko
Token ID17634
Feature activation+7.35e-01

Pos logit contributions
MENTS+0.138
SPONSORED+0.132
ERY+0.131
CAR+0.131
MENT+0.130
Neg logit contributions
fman-0.129
ogue-0.129
iferation-0.128
 Sinn-0.128
lus-0.125
Tokenji
Token ID7285
Feature activation-8.28e-02

Pos logit contributions
fman+1.649
ogue+1.644
 Sinn+1.631
iferation+1.628
ouf+1.603
Neg logit contributions
MENTS-1.350
SPONSORED-1.270
CAR-1.255
ERY-1.253
MENT-1.252
Token K
Token ID509
Feature activation+2.05e-01

Pos logit contributions
MENTS+0.193
SPONSORED+0.185
ERY+0.185
CAR+0.184
MENT+0.183
Neg logit contributions
fman-0.142
ogue-0.142
iferation-0.141
 Sinn-0.141
lus-0.137
Tokenondo
Token ID22311
Feature activation-2.10e-01

Pos logit contributions
ogue+0.486
fman+0.485
iferation+0.482
 Sinn+0.478
ouf+0.472
Neg logit contributions
MENTS-0.347
SPONSORED-0.328
ERY-0.326
CAR-0.325
MENT-0.321
Token worked
Token ID3111
Feature activation+3.74e-01

Pos logit contributions
MENTS+0.406
SPONSORED+0.387
ERY+0.384
CAR+0.383
MENT+0.380
Neg logit contributions
fman-0.445
ogue-0.443
iferation-0.443
 Sinn-0.440
lus-0.431
Token on
Token ID319
Feature activation+4.21e-02

Pos logit contributions
fman+0.633
ogue+0.630
 Sinn+0.626
iferation+0.621
 hers+0.611
Neg logit contributions
MENTS-0.884
SPONSORED-0.853
ERY-0.845
CAR-0.838
MENT-0.834
Token fantasy
Token ID8842
Feature activation-3.08e-01

Pos logit contributions
MENTS+0.109
SPONSORED+0.103
ERY+0.103
CAR+0.103
MENT+0.102
Neg logit contributions
ogue-0.104
fman-0.103
 Sinn-0.103
iferation-0.103
 hers-0.100
Token sequence
Token ID8379
Feature activation-2.21e-01

Pos logit contributions
MENTS+0.632
SPONSORED+0.604
CAR+0.599
ERY+0.597
MENT+0.593
Neg logit contributions
ogue-0.616
fman-0.614
 Sinn-0.610
iferation-0.610
 hers-0.595
Token with
Token ID351
Feature activation-1.78e-02

Pos logit contributions
MENTS+0.469
SPONSORED+0.446
ERY+0.446
CAR+0.445
MENT+0.441
Neg logit contributions
ogue-0.427
fman-0.425
iferation-0.423
 Sinn-0.423
 hers-0.411
Token the
Token ID262
Feature activation-1.51e-02

Pos logit contributions
MENTS+0.040
SPONSORED+0.038
ERY+0.038
CAR+0.038
MENT+0.038
Neg logit contributions
fman-0.032
 Sinn-0.032
ogue-0.032
iferation-0.032
 hers-0.031
Token M
Token ID337
Feature activation-1.98e-01

Pos logit contributions
MENTS+0.032
SPONSORED+0.031
ERY+0.031
CAR+0.031
MENT+0.030
Neg logit contributions
fman-0.029
 Sinn-0.029
ogue-0.029
iferation-0.029
 hers-0.028
Tokenupp
Token ID7211
Feature activation+9.46e-01

Pos logit contributions
MENTS+0.450
SPONSORED+0.436
ERY+0.428
CAR+0.428
 disappointment+0.427
Neg logit contributions
ogue-0.354
fman-0.351
iferation-0.350
 Sinn-0.343
ouf-0.341
Tokenets
Token ID1039
Feature activation+5.46e-01

Pos logit contributions
ogue+1.687
fman+1.675
iferation+1.658
ouf+1.618
 Sinn+1.608
Neg logit contributions
MENTS-2.170
SPONSORED-2.118
 disappointment-2.072
CAR-2.063
MENT-2.044
Token as
Token ID355
Feature activation-3.56e-02

Pos logit contributions
ogue+1.083
fman+1.073
 Sinn+1.072
iferation+1.051
lus+1.040
Neg logit contributions
MENTS-1.124
ERY-1.069
SPONSORED-1.066
CAR-1.053
MENT-1.049
Token babies
Token ID11903
Feature activation+1.89e-01

Pos logit contributions
MENTS+0.078
SPONSORED+0.074
ERY+0.074
CAR+0.074
MENT+0.073
Neg logit contributions
fman-0.067
ogue-0.066
 Sinn-0.066
iferation-0.066
lus-0.065
Token was
Token ID373
Feature activation+5.23e-03

Pos logit contributions
ogue+0.366
fman+0.365
 Sinn+0.363
iferation+0.361
 hers+0.354
Neg logit contributions
MENTS-0.401
ERY-0.380
SPONSORED-0.379
CAR-0.377
MENT-0.376
Token so
Token ID523
Feature activation+2.21e-01

Pos logit contributions
fman+0.010
ogue+0.010
iferation+0.010
 Sinn+0.010
lus+0.010
Neg logit contributions
MENTS-0.011
SPONSORED-0.011
ERY-0.011
CAR-0.011
MENT-0.011
Token feel
Token ID1254
Feature activation+2.31e-01

Pos logit contributions
fman+0.109
ogue+0.109
iferation+0.108
 Sinn+0.108
 hers+0.105
Neg logit contributions
MENTS-0.111
SPONSORED-0.106
ERY-0.105
CAR-0.105
MENT-0.104
Token threatened
Token ID8556
Feature activation-7.17e-02

Pos logit contributions
 Sinn+0.451
ogue+0.451
fman+0.451
iferation+0.446
 hers+0.443
Neg logit contributions
MENTS-0.485
SPONSORED-0.466
ERY-0.462
CAR-0.459
MENT-0.455
Token and
Token ID290
Feature activation+7.97e-02

Pos logit contributions
MENTS+0.139
SPONSORED+0.132
ERY+0.131
CAR+0.131
MENT+0.129
Neg logit contributions
ogue-0.152
fman-0.152
iferation-0.151
 Sinn-0.151
 hers-0.148
Token just
Token ID655
Feature activation+1.13e-01

Pos logit contributions
fman+0.157
ogue+0.157
iferation+0.156
 Sinn+0.156
 hers+0.152
Neg logit contributions
MENTS-0.166
SPONSORED-0.158
ERY-0.157
CAR-0.157
MENT-0.156
Token m
Token ID285
Feature activation-1.23e-01

Pos logit contributions
fman+0.222
ogue+0.221
iferation+0.220
 Sinn+0.220
 hers+0.215
Neg logit contributions
MENTS-0.236
SPONSORED-0.225
ERY-0.225
CAR-0.223
MENT-0.222
Tokenose
Token ID577
Feature activation+7.35e-01

Pos logit contributions
MENTS+0.279
SPONSORED+0.268
ERY+0.267
CAR+0.267
MENT+0.264
Neg logit contributions
fman-0.217
iferation-0.216
 Sinn-0.215
ogue-0.215
��-0.210
Tokeny
Token ID88
Feature activation+3.97e-01

Pos logit contributions
ogue+1.177
fman+1.155
 Sinn+1.139
iferation+1.130
lus+1.125
Neg logit contributions
MENTS-1.818
SPONSORED-1.759
CAR-1.730
MENT-1.720
ERY-1.719
Tokened
Token ID276
Feature activation-8.69e-02

Pos logit contributions
ogue+0.813
fman+0.810
 Sinn+0.804
iferation+0.804
 hers+0.789
Neg logit contributions
MENTS-0.797
SPONSORED-0.763
ERY-0.755
CAR-0.754
MENT-0.748
Token on
Token ID319
Feature activation+6.61e-03

Pos logit contributions
MENTS+0.172
SPONSORED+0.164
ERY+0.163
CAR+0.162
 disappointment+0.161
Neg logit contributions
fman-0.180
iferation-0.180
ogue-0.179
 Sinn-0.178
��-0.175
Token.
Token ID13
Feature activation+7.83e-02

Pos logit contributions
ogue+0.014
fman+0.014
iferation+0.014
 Sinn+0.014
lus+0.013
Neg logit contributions
MENTS-0.013
SPONSORED-0.012
ERY-0.012
CAR-0.012
MENT-0.012
Token We
Token ID775
Feature activation-2.19e-02

Pos logit contributions
ogue+0.141
fman+0.141
 Sinn+0.140
iferation+0.140
lus+0.136
Neg logit contributions
MENTS-0.176
SPONSORED-0.169
ERY-0.168
CAR-0.167
MENT-0.166
Token and
Token ID290
Feature activation+1.81e-02

Pos logit contributions
fman+0.111
iferation+0.111
ogue+0.111
 Sinn+0.110
lus+0.107
Neg logit contributions
MENTS-0.124
SPONSORED-0.119
ERY-0.118
CAR-0.118
MENT-0.117
Token for
Token ID329
Feature activation+2.74e-02

Pos logit contributions
ogue+0.030
iferation+0.029
fman+0.029
 Sinn+0.029
lus+0.028
Neg logit contributions
MENTS-0.044
SPONSORED-0.042
ERY-0.042
CAR-0.042
MENT-0.042
Token all
Token ID477
Feature activation+8.35e-02

Pos logit contributions
fman+0.055
iferation+0.055
ogue+0.055
 Sinn+0.054
��+0.054
Neg logit contributions
MENTS-0.056
SPONSORED-0.053
ERY-0.053
CAR-0.053
 disappointment-0.053
Token
Token ID1399
Feature activation-1.65e-01

Pos logit contributions
ogue+0.176
fman+0.175
iferation+0.175
 Sinn+0.174
lus+0.170
Neg logit contributions
MENTS-0.163
ERY-0.155
SPONSORED-0.154
CAR-0.154
 disappointment-0.152
TokenOut
Token ID7975
Feature activation-8.18e-02

Pos logit contributions
MENTS+0.302
SPONSORED+0.287
ERY+0.285
CAR+0.284
MENT+0.282
Neg logit contributions
fman-0.366
ogue-0.365
iferation-0.364
 Sinn-0.362
lus-0.354
Tokener
Token ID263
Feature activation+5.43e-01

Pos logit contributions
MENTS+0.177
SPONSORED+0.168
ERY+0.168
CAR+0.167
MENT+0.167
Neg logit contributions
fman-0.155
iferation-0.154
ogue-0.154
 Sinn-0.154
lus-0.150
Token space
Token ID2272
Feature activation+6.56e-02

Pos logit contributions
iferation+1.112
 Sinn+1.111
ogue+1.103
fman+1.097
 hers+1.074
Neg logit contributions
MENTS-1.086
ERY-1.047
SPONSORED-1.035
CAR-1.024
MENT-1.009
Token must
Token ID1276
Feature activation-2.46e-01

Pos logit contributions
fman+0.123
ogue+0.123
iferation+0.123
 Sinn+0.123
lus+0.119
Neg logit contributions
MENTS-0.142
SPONSORED-0.136
ERY-0.136
CAR-0.135
MENT-0.134
Token only
Token ID691
Feature activation-9.88e-02

Pos logit contributions
MENTS+0.487
SPONSORED+0.470
 disappointment+0.463
ERY+0.462
CAR+0.462
Neg logit contributions
ogue-0.501
fman-0.501
iferation-0.500
 Sinn-0.494
��-0.489
Token be
Token ID307
Feature activation-1.05e-01

Pos logit contributions
MENTS+0.206
SPONSORED+0.199
ERY+0.197
 disappointment+0.197
CAR+0.196
Neg logit contributions
fman-0.192
ogue-0.192
iferation-0.191
 Sinn-0.189
��-0.187
Token used
Token ID973
Feature activation+1.92e-01

Pos logit contributions
MENTS+0.234
SPONSORED+0.225
ERY+0.223
CAR+0.223
 disappointment+0.222
Neg logit contributions
fman-0.192
iferation-0.192
ogue-0.192
 Sinn-0.190
��-0.186
Token700
Token ID9879
Feature activation+1.47e-01

Pos logit contributions
MENTS+0.431
SPONSORED+0.410
ERY+0.407
CAR+0.406
MENT+0.403
Neg logit contributions
fman-0.479
iferation-0.478
ogue-0.477
 Sinn-0.474
��-0.464
Token million
Token ID1510
Feature activation+4.42e-02

Pos logit contributions
ogue+0.268
iferation+0.268
fman+0.268
 Sinn+0.265
 hers+0.258
Neg logit contributions
MENTS-0.329
SPONSORED-0.313
ERY-0.312
CAR-0.312
MENT-0.310
Token.
Token ID13
Feature activation+3.27e-02

Pos logit contributions
fman+0.084
ogue+0.084
iferation+0.084
 Sinn+0.084
 hers+0.082
Neg logit contributions
MENTS-0.095
SPONSORED-0.091
ERY-0.090
CAR-0.090
MENT-0.089
Token
Token ID198
Feature activation+4.75e-02

Pos logit contributions
fman+0.060
ogue+0.060
iferation+0.060
 Sinn+0.060
lus+0.058
Neg logit contributions
MENTS-0.072
SPONSORED-0.069
ERY-0.069
CAR-0.068
MENT-0.068
Token
Token ID198
Feature activation+7.08e-02

Pos logit contributions
iferation+0.091
fman+0.091
ogue+0.091
 Sinn+0.090
��+0.089
Neg logit contributions
MENTS-0.101
SPONSORED-0.097
CAR-0.096
ERY-0.096
MENT-0.096
TokenHow
Token ID2437
Feature activation+2.51e-01

Pos logit contributions
fman+0.165
ogue+0.164
iferation+0.163
 Sinn+0.163
lus+0.160
Neg logit contributions
MENTS-0.122
SPONSORED-0.114
ERY-0.114
 disappointment-0.114
CAR-0.113
Token much
Token ID881
Feature activation+2.63e-01

Pos logit contributions
fman+0.496
ogue+0.494
iferation+0.493
 Sinn+0.490
lus+0.479
Neg logit contributions
MENTS-0.524
SPONSORED-0.500
ERY-0.497
CAR-0.495
MENT-0.493
Token does
Token ID857
Feature activation-8.45e-02

Pos logit contributions
fman+0.539
ogue+0.537
 Sinn+0.535
iferation+0.531
 hers+0.524
Neg logit contributions
MENTS-0.528
SPONSORED-0.502
ERY-0.497
CAR-0.496
MENT-0.493
Token of
Token ID286
Feature activation-6.95e-02

Pos logit contributions
MENTS+0.186
SPONSORED+0.178
ERY+0.177
CAR+0.177
MENT+0.176
Neg logit contributions
fman-0.156
ogue-0.156
iferation-0.156
 Sinn-0.155
lus-0.151
Token that
Token ID326
Feature activation-7.63e-02

Pos logit contributions
MENTS+0.154
SPONSORED+0.147
ERY+0.146
CAR+0.146
MENT+0.145
Neg logit contributions
fman-0.128
ogue-0.128
iferation-0.127
 Sinn-0.127
 hers-0.124
Token actually
Token ID1682
Feature activation-3.12e-01

Pos logit contributions
MENTS+0.156
SPONSORED+0.149
ERY+0.148
CAR+0.148
MENT+0.147
Neg logit contributions
fman-0.153
ogue-0.153
iferation-0.153
 Sinn-0.152
lus-0.148
Token repr
Token ID41575
Feature activation+2.83e-01

Pos logit contributions
MENTS+0.086
SPONSORED+0.082
ERY+0.082
CAR+0.081
MENT+0.081
Neg logit contributions
fman-0.078
ogue-0.078
iferation-0.078
 Sinn-0.077
lus-0.075
Tokenisal
Token ID28456
Feature activation-5.21e-01

Pos logit contributions
iferation+0.481
ogue+0.480
fman+0.480
 Sinn+0.470
lus+0.458
Neg logit contributions
MENTS-0.672
SPONSORED-0.647
ERY-0.639
CAR-0.638
MENT-0.637
Token attacks
Token ID3434
Feature activation-1.52e-01

Pos logit contributions
MENTS+1.052
SPONSORED+1.002
ERY+0.997
CAR+0.995
 disappointment+0.990
Neg logit contributions
fman-1.059
iferation-1.056
ogue-1.054
 Sinn-1.043
��-1.027
Token from
Token ID422
Feature activation+3.71e-02

Pos logit contributions
MENTS+0.305
SPONSORED+0.290
ERY+0.289
CAR+0.288
MENT+0.286
Neg logit contributions
fman-0.313
ogue-0.312
iferation-0.312
 Sinn-0.310
lus-0.303
Token the
Token ID262
Feature activation-9.57e-02

Pos logit contributions
 Sinn+0.067
fman+0.067
ogue+0.066
iferation+0.066
 hers+0.065
Neg logit contributions
MENTS-0.084
SPONSORED-0.080
ERY-0.080
CAR-0.079
MENT-0.079
Token P
Token ID350
Feature activation+2.01e-01

Pos logit contributions
MENTS+0.249
SPONSORED+0.240
ERY+0.239
CAR+0.238
MENT+0.237
Neg logit contributions
fman-0.139
ogue-0.138
 Sinn-0.138
iferation-0.138
lus-0.133
TokenYD
Token ID35755
Feature activation+6.97e-02

Pos logit contributions
fman+0.410
ogue+0.410
iferation+0.405
 Sinn+0.402
ouf+0.397
Neg logit contributions
MENTS-0.406
SPONSORED-0.386
ERY-0.384
CAR-0.382
 disappointment-0.381
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
 Sinn+0.148
fman+0.148
iferation+0.147
ogue+0.147
 hers+0.145
Neg logit contributions
MENTS-0.134
ERY-0.127
SPONSORED-0.127
CAR-0.126
MENT-0.125
Token
Token ID198
Feature activation-1.36e-01

Pos logit contributions
MENTS+0.252
SPONSORED+0.241
ERY+0.239
CAR+0.239
MENT+0.238
Neg logit contributions
fman-0.207
iferation-0.207
 Sinn-0.206
ogue-0.206
ouf-0.199
Token
Token ID198
Feature activation-1.10e-01

Pos logit contributions
MENTS+0.283
SPONSORED+0.273
CAR+0.269
ERY+0.268
MENT+0.267
Neg logit contributions
iferation-0.271
fman-0.269
��-0.269
ogue-0.267
 Sinn-0.264
TokenHe
Token ID1544
Feature activation+6.36e-02

Pos logit contributions
MENTS+0.207
 disappointment+0.196
ERY+0.195
SPONSORED+0.194
CAR+0.192
Neg logit contributions
fman-0.238
ogue-0.237
iferation-0.236
 Sinn-0.235
ouf-0.231
Token of
Token ID286
Feature activation-8.14e-02

Pos logit contributions
MENTS+0.053
SPONSORED+0.050
ERY+0.050
MENT+0.049
CAR+0.049
Neg logit contributions
ogue-0.050
fman-0.050
 Sinn-0.050
iferation-0.050
ouf-0.049
Token
Token ID564
Feature activation+5.17e-02

Pos logit contributions
MENTS+0.184
SPONSORED+0.177
ERY+0.176
CAR+0.175
MENT+0.174
Neg logit contributions
fman-0.146
ogue-0.145
iferation-0.145
 Sinn-0.145
lus-0.141
Token
Token ID250
Feature activation+8.05e-02

Pos logit contributions
ogue+0.093
fman+0.093
iferation+0.093
 Sinn+0.092
��+0.092
Neg logit contributions
MENTS-0.116
SPONSORED-0.110
CAR-0.110
ERY-0.110
 disappointment-0.109
TokenV
Token ID53
Feature activation+1.43e-01

Pos logit contributions
fman+0.162
ogue+0.162
iferation+0.161
 Sinn+0.160
lus+0.157
Neg logit contributions
MENTS-0.164
SPONSORED-0.156
ERY-0.156
CAR-0.155
 disappointment-0.154
Tokenio
Token ID952
Feature activation+1.27e-01

Pos logit contributions
ogue+0.313
fman+0.309
iferation+0.308
 Sinn+0.306
lus+0.302
Neg logit contributions
MENTS-0.269
SPONSORED-0.255
ERY-0.252
 disappointment-0.251
CAR-0.250
TokenMe
Token ID5308
Feature activation+2.02e-01

Pos logit contributions
fman+0.222
ogue+0.221
iferation+0.220
 Sinn+0.219
lus+0.213
Neg logit contributions
MENTS-0.293
SPONSORED-0.280
ERY-0.279
MENT-0.277
CAR-0.277
Token
Token ID447
Feature activation-4.80e-03

Pos logit contributions
fman+0.385
 Sinn+0.385
ogue+0.384
iferation+0.382
lus+0.374
Neg logit contributions
MENTS-0.434
SPONSORED-0.415
ERY-0.415
CAR-0.410
MENT-0.410
Token
Token ID251
Feature activation+1.11e-01

Pos logit contributions
MENTS+0.010
SPONSORED+0.010
CAR+0.010
ERY+0.010
 disappointment+0.009
Neg logit contributions
fman-0.009
ogue-0.009
iferation-0.009
 Sinn-0.009
��-0.009
Token factory
Token ID8860
Feature activation+6.54e-02

Pos logit contributions
ogue+0.215
fman+0.215
 Sinn+0.214
iferation+0.213
lus+0.210
Neg logit contributions
MENTS-0.234
SPONSORED-0.225
ERY-0.223
CAR-0.221
 disappointment-0.220
Token,
Token ID11
Feature activation-7.04e-03

Pos logit contributions
ogue+0.125
fman+0.124
 Sinn+0.124
iferation+0.123
ouf+0.120
Neg logit contributions
MENTS-0.141
SPONSORED-0.135
ERY-0.134
CAR-0.133
MENT-0.133
Token who
Token ID508
Feature activation+2.46e-02

Pos logit contributions
MENTS+0.015
SPONSORED+0.015
ERY+0.015
CAR+0.015
MENT+0.015
Neg logit contributions
fman-0.013
ogue-0.013
iferation-0.013
 Sinn-0.013
lus-0.013
Token examination
Token ID12452
Feature activation+1.75e-02

Pos logit contributions
ogue+0.143
iferation+0.143
fman+0.143
 Sinn+0.142
lus+0.138
Neg logit contributions
MENTS-0.174
SPONSORED-0.167
ERY-0.166
CAR-0.165
MENT-0.164
Token (
Token ID357
Feature activation+6.44e-03

Pos logit contributions
ogue+0.035
fman+0.035
 Sinn+0.034
iferation+0.034
lus+0.033
Neg logit contributions
MENTS-0.036
SPONSORED-0.035
ERY-0.035
MENT-0.034
CAR-0.034
Tokenincluding
Token ID8201
Feature activation-1.05e-01

Pos logit contributions
fman+0.014
ogue+0.014
iferation+0.014
 Sinn+0.014
lus+0.013
Neg logit contributions
MENTS-0.012
SPONSORED-0.012
ERY-0.012
 disappointment-0.012
CAR-0.011
Token at
Token ID379
Feature activation-1.25e-02

Pos logit contributions
MENTS+0.232
SPONSORED+0.223
ERY+0.221
CAR+0.220
MENT+0.219
Neg logit contributions
fman-0.195
ogue-0.194
iferation-0.194
 Sinn-0.193
 hers-0.187
Token S
Token ID311
Feature activation+3.33e-02

Pos logit contributions
MENTS+0.026
SPONSORED+0.025
ERY+0.025
CAR+0.025
MENT+0.025
Neg logit contributions
ogue-0.025
iferation-0.024
fman-0.024
 Sinn-0.024
lus-0.024
TokenDS
Token ID5258
Feature activation+2.11e-01

Pos logit contributions
ogue+0.073
fman+0.072
iferation+0.072
 Sinn+0.071
ouf+0.070
Neg logit contributions
MENTS-0.063
SPONSORED-0.060
 disappointment-0.059
ERY-0.059
MENT-0.059
TokenU
Token ID52
Feature activation+1.38e-02

Pos logit contributions
ogue+0.430
iferation+0.430
fman+0.429
 Sinn+0.426
lus+0.418
Neg logit contributions
MENTS-0.425
SPONSORED-0.408
ERY-0.402
 disappointment-0.399
CAR-0.399
Token),
Token ID828
Feature activation+1.38e-01

Pos logit contributions
fman+0.027
ogue+0.027
iferation+0.027
 Sinn+0.027
lus+0.026
Neg logit contributions
MENTS-0.029
SPONSORED-0.028
ERY-0.028
CAR-0.028
 disappointment-0.028
Token however
Token ID2158
Feature activation+3.27e-01

Pos logit contributions
fman+0.289
ogue+0.288
iferation+0.288
 Sinn+0.287
 hers+0.279
Neg logit contributions
MENTS-0.270
SPONSORED-0.258
ERY-0.255
CAR-0.254
MENT-0.252
Token.
Token ID13
Feature activation-4.84e-02

Pos logit contributions
ogue+0.621
iferation+0.618
fman+0.618
 Sinn+0.618
 hers+0.601
Neg logit contributions
MENTS-0.704
SPONSORED-0.673
ERY-0.670
CAR-0.665
MENT-0.661
Token
Token ID198
Feature activation+1.05e-02

Pos logit contributions
MENTS+0.109
SPONSORED+0.105
ERY+0.104
CAR+0.104
MENT+0.103
Neg logit contributions
fman-0.087
ogue-0.086
iferation-0.086
 Sinn-0.086
lus-0.083
Token (
Token ID357
Feature activation-1.26e-01

Pos logit contributions
fman+0.061
ogue+0.061
iferation+0.061
 Sinn+0.061
lus+0.059
Neg logit contributions
MENTS-0.068
SPONSORED-0.065
ERY-0.065
CAR-0.064
MENT-0.064
TokenLow
Token ID20535
Feature activation-2.20e-01

Pos logit contributions
MENTS+0.246
SPONSORED+0.234
ERY+0.233
CAR+0.231
 disappointment+0.230
Neg logit contributions
fman-0.264
ogue-0.263
iferation-0.263
 Sinn-0.261
lus-0.256
Token Carb
Token ID20453
Feature activation+1.70e-02

Pos logit contributions
MENTS+0.464
SPONSORED+0.443
ERY+0.440
CAR+0.439
MENT+0.436
Neg logit contributions
fman-0.430
ogue-0.429
iferation-0.429
 Sinn-0.425
lus-0.415
Token High
Token ID3334
Feature activation-3.08e-01

Pos logit contributions
fman+0.032
iferation+0.032
ogue+0.032
 Sinn+0.031
��+0.031
Neg logit contributions
MENTS-0.037
SPONSORED-0.036
ERY-0.035
CAR-0.035
MENT-0.035
Token Fat
Token ID12301
Feature activation-7.32e-02

Pos logit contributions
MENTS+0.744
CAR+0.716
SPONSORED+0.716
ERY+0.715
 disappointment+0.709
Neg logit contributions
fman-0.502
ogue-0.497
iferation-0.495
 Sinn-0.488
��-0.478
Token)
Token ID8
Feature activation-6.27e-02

Pos logit contributions
MENTS+0.159
ERY+0.152
SPONSORED+0.152
CAR+0.152
MENT+0.150
Neg logit contributions
fman-0.137
iferation-0.136
ogue-0.136
 Sinn-0.134
��-0.132
Token diet
Token ID5496
Feature activation-1.44e-01

Pos logit contributions
MENTS+0.128
SPONSORED+0.122
ERY+0.121
CAR+0.121
MENT+0.120
Neg logit contributions
fman-0.126
ogue-0.126
iferation-0.126
 Sinn-0.125
��-0.122
Token improves
Token ID19575
Feature activation-2.84e-01

Pos logit contributions
MENTS+0.303
SPONSORED+0.288
ERY+0.288
CAR+0.285
MENT+0.284
Neg logit contributions
fman-0.283
ogue-0.281
iferation-0.280
 Sinn-0.279
lus-0.272
Token the
Token ID262
Feature activation+1.36e-01

Pos logit contributions
MENTS+0.658
SPONSORED+0.632
ERY+0.629
CAR+0.627
MENT+0.623
Neg logit contributions
fman-0.492
ogue-0.490
iferation-0.490
 Sinn-0.486
lus-0.472
Token body
Token ID1767
Feature activation+1.09e-01

Pos logit contributions
fman+0.256
ogue+0.256
iferation+0.256
 Sinn+0.254
lus+0.247
Neg logit contributions
MENTS-0.295
SPONSORED-0.282
ERY-0.281
CAR-0.279
MENT-0.278
Token
Token ID447
Feature activation+2.43e-02

Pos logit contributions
ogue+0.233
iferation+0.233
fman+0.232
 Sinn+0.230
 hers+0.226
Neg logit contributions
MENTS-0.208
SPONSORED-0.199
ERY-0.196
CAR-0.194
 disappointment-0.192
Token a
Token ID257
Feature activation-2.44e-02

Pos logit contributions
MENTS+0.754
SPONSORED+0.722
ERY+0.719
CAR+0.717
MENT+0.712
Neg logit contributions
fman-0.620
ogue-0.618
iferation-0.617
 Sinn-0.613
lus-0.596
Token national
Token ID2260
Feature activation+1.21e-01

Pos logit contributions
MENTS+0.051
SPONSORED+0.049
ERY+0.049
CAR+0.049
MENT+0.048
Neg logit contributions
fman-0.048
ogue-0.048
 Sinn-0.047
iferation-0.047
 hers-0.046
Token unity
Token ID14111
Feature activation+4.87e-01

Pos logit contributions
ogue+0.284
 Sinn+0.281
iferation+0.281
fman+0.280
 hers+0.272
Neg logit contributions
MENTS-0.212
SPONSORED-0.199
CAR-0.198
ERY-0.197
MENT-0.195
Token crisis
Token ID4902
Feature activation-7.93e-03

Pos logit contributions
ogue+0.976
 Sinn+0.973
fman+0.969
iferation+0.957
 hers+0.940
Neg logit contributions
MENTS-1.019
SPONSORED-0.953
ERY-0.953
CAR-0.948
MENT-0.947
Token
Token ID447
Feature activation-1.05e-01

Pos logit contributions
MENTS+0.016
SPONSORED+0.015
ERY+0.015
CAR+0.015
MENT+0.015
Neg logit contributions
fman-0.016
ogue-0.016
iferation-0.016
 Sinn-0.016
 hers-0.016
Token
Token ID251
Feature activation+5.71e-02

Pos logit contributions
MENTS+0.222
SPONSORED+0.213
CAR+0.211
ERY+0.211
 disappointment+0.209
Neg logit contributions
ogue-0.203
fman-0.203
iferation-0.202
 Sinn-0.200
��-0.200
Token to
Token ID284
Feature activation+8.04e-03

Pos logit contributions
fman+0.112
ogue+0.112
 Sinn+0.112
iferation+0.111
lus+0.109
Neg logit contributions
MENTS-0.119
SPONSORED-0.114
ERY-0.113
CAR-0.113
MENT-0.112
Token save
Token ID3613
Feature activation+1.62e-01

Pos logit contributions
fman+0.015
ogue+0.015
 Sinn+0.015
iferation+0.015
 hers+0.015
Neg logit contributions
MENTS-0.017
SPONSORED-0.016
ERY-0.016
CAR-0.016
MENT-0.016
Token his
Token ID465
Feature activation-1.95e-02

Pos logit contributions
fman+0.279
ogue+0.279
 Sinn+0.277
iferation+0.277
 hers+0.269
Neg logit contributions
MENTS-0.377
SPONSORED-0.363
ERY-0.360
CAR-0.359
MENT-0.357
Token government
Token ID1230
Feature activation-1.88e-01

Pos logit contributions
MENTS+0.043
SPONSORED+0.041
ERY+0.041
CAR+0.041
MENT+0.041
Neg logit contributions
ogue-0.036
fman-0.036
 Sinn-0.036
iferation-0.036
lus-0.035
Token.
Token ID13
Feature activation+7.84e-02

Pos logit contributions
MENTS+0.378
SPONSORED+0.363
ERY+0.359
CAR+0.358
MENT+0.354
Neg logit contributions
ogue-0.384
fman-0.384
iferation-0.383
 Sinn-0.382
 hers-0.372
Token is
Token ID318
Feature activation-3.42e-02

Pos logit contributions
MENTS+0.949
SPONSORED+0.907
ERY+0.902
CAR+0.899
MENT+0.893
Neg logit contributions
fman-0.899
ogue-0.897
iferation-0.896
 Sinn-0.889
lus-0.868
Token wrong
Token ID2642
Feature activation-7.20e-02

Pos logit contributions
MENTS+0.072
SPONSORED+0.069
ERY+0.069
CAR+0.068
MENT+0.068
Neg logit contributions
ogue-0.066
fman-0.066
iferation-0.066
 Sinn-0.066
 hers-0.064
Token.
Token ID13
Feature activation-1.09e-01

Pos logit contributions
MENTS+0.142
SPONSORED+0.135
ERY+0.134
CAR+0.134
MENT+0.133
Neg logit contributions
fman-0.151
ogue-0.150
iferation-0.149
 Sinn-0.149
lus-0.146
Token I
Token ID314
Feature activation-8.74e-02

Pos logit contributions
MENTS+0.255
SPONSORED+0.244
ERY+0.243
CAR+0.243
MENT+0.241
Neg logit contributions
fman-0.186
ogue-0.186
iferation-0.185
 Sinn-0.183
lus-0.179
Token lost
Token ID2626
Feature activation-4.79e-02

Pos logit contributions
MENTS+0.195
SPONSORED+0.187
ERY+0.186
CAR+0.186
MENT+0.184
Neg logit contributions
fman-0.159
ogue-0.158
iferation-0.158
 Sinn-0.157
lus-0.153
Token all
Token ID477
Feature activation-6.04e-02

Pos logit contributions
MENTS+0.113
SPONSORED+0.109
CAR+0.107
ERY+0.107
MENT+0.107
Neg logit contributions
ogue-0.081
fman-0.081
 Sinn-0.081
iferation-0.080
 hers-0.079
Token this
Token ID428
Feature activation-1.45e-01

Pos logit contributions
MENTS+0.144
SPONSORED+0.138
ERY+0.137
CAR+0.137
MENT+0.136
Neg logit contributions
ogue-0.101
 Sinn-0.100
fman-0.100
iferation-0.099
 hers-0.097
Token money
Token ID1637
Feature activation-8.68e-02

Pos logit contributions
MENTS+0.242
SPONSORED+0.230
ERY+0.226
CAR+0.226
MENT+0.223
Neg logit contributions
fman-0.347
ogue-0.346
 Sinn-0.344
iferation-0.342
 hers-0.337
Token."
Token ID526
Feature activation-2.40e-02

Pos logit contributions
MENTS+0.176
SPONSORED+0.168
ERY+0.167
CAR+0.167
MENT+0.165
Neg logit contributions
fman-0.176
ogue-0.175
iferation-0.174
 Sinn-0.174
lus-0.169
Token Everything
Token ID11391
Feature activation-1.81e-01

Pos logit contributions
MENTS+0.054
SPONSORED+0.052
ERY+0.052
CAR+0.052
MENT+0.051
Neg logit contributions
ogue-0.043
fman-0.043
 Sinn-0.042
iferation-0.042
lus-0.041
Token was
Token ID373
Feature activation-3.93e-02

Pos logit contributions
MENTS+0.386
SPONSORED+0.369
ERY+0.367
CAR+0.366
MENT+0.364
Neg logit contributions
fman-0.348
ogue-0.347
iferation-0.347
 Sinn-0.343
lus-0.335
Token your
Token ID534
Feature activation+6.05e-02

Pos logit contributions
MENTS+0.619
SPONSORED+0.593
ERY+0.590
CAR+0.588
MENT+0.585
Neg logit contributions
fman-0.517
ogue-0.515
iferation-0.515
 Sinn-0.510
lus-0.498
Token goal
Token ID3061
Feature activation+1.65e-01

Pos logit contributions
fman+0.124
ogue+0.124
iferation+0.123
 Sinn+0.123
lus+0.120
Neg logit contributions
MENTS-0.121
SPONSORED-0.116
ERY-0.115
CAR-0.115
MENT-0.114
Token?
Token ID30
Feature activation-7.71e-02

Pos logit contributions
fman+0.290
ogue+0.290
iferation+0.288
 Sinn+0.288
lus+0.279
Neg logit contributions
MENTS-0.377
SPONSORED-0.363
ERY-0.359
CAR-0.357
MENT-0.357
Token Do
Token ID2141
Feature activation+3.13e-01

Pos logit contributions
MENTS+0.173
SPONSORED+0.166
ERY+0.165
CAR+0.164
MENT+0.163
Neg logit contributions
fman-0.140
ogue-0.139
iferation-0.139
 Sinn-0.138
lus-0.135
Token you
Token ID345
Feature activation-1.04e-02

Pos logit contributions
ogue+0.511
fman+0.511
iferation+0.506
 Sinn+0.505
lus+0.492
Neg logit contributions
MENTS-0.758
SPONSORED-0.732
ERY-0.723
CAR-0.722
MENT-0.719
Token go
Token ID467
Feature activation-2.21e-01

Pos logit contributions
MENTS+0.021
SPONSORED+0.020
ERY+0.020
CAR+0.020
MENT+0.020
Neg logit contributions
fman-0.021
ogue-0.021
iferation-0.021
 Sinn-0.021
lus-0.020
Token out
Token ID503
Feature activation-4.31e-03

Pos logit contributions
MENTS+0.463
SPONSORED+0.444
ERY+0.440
CAR+0.439
MENT+0.436
Neg logit contributions
fman-0.433
ogue-0.431
iferation-0.428
 Sinn-0.427
lus-0.417
Token of
Token ID286
Feature activation+1.94e-01

Pos logit contributions
MENTS+0.009
SPONSORED+0.009
ERY+0.009
CAR+0.009
MENT+0.009
Neg logit contributions
fman-0.008
ogue-0.008
iferation-0.008
 Sinn-0.008
lus-0.008
Token your
Token ID534
Feature activation+2.93e-02

Pos logit contributions
fman+0.338
ogue+0.338
iferation+0.335
 Sinn+0.335
lus+0.324
Neg logit contributions
MENTS-0.449
SPONSORED-0.431
ERY-0.428
CAR-0.427
MENT-0.425
Token way
Token ID835
Feature activation+9.19e-02

Pos logit contributions
fman+0.054
iferation+0.054
ogue+0.054
 Sinn+0.053
��+0.052
Neg logit contributions
MENTS-0.065
SPONSORED-0.062
ERY-0.062
CAR-0.062
 disappointment-0.061
Token to
Token ID284
Feature activation+4.50e-03

Pos logit contributions
fman+0.167
ogue+0.167
iferation+0.167
 Sinn+0.165
lus+0.161
Neg logit contributions
MENTS-0.205
SPONSORED-0.196
ERY-0.195
CAR-0.195
MENT-0.194
Token respond
Token ID3031
Feature activation+2.81e-01

Pos logit contributions
fman+0.116
iferation+0.116
ogue+0.116
 Sinn+0.114
��+0.112
Neg logit contributions
MENTS-0.131
SPONSORED-0.125
ERY-0.124
CAR-0.124
MENT-0.123
Token to
Token ID284
Feature activation+6.95e-03

Pos logit contributions
ogue+0.569
fman+0.569
 Sinn+0.566
iferation+0.563
 hers+0.552
Neg logit contributions
MENTS-0.569
SPONSORED-0.540
ERY-0.538
CAR-0.536
MENT-0.533
Token this
Token ID428
Feature activation+3.62e-02

Pos logit contributions
ogue+0.012
 Sinn+0.012
fman+0.012
iferation+0.012
 hers+0.012
Neg logit contributions
MENTS-0.016
SPONSORED-0.015
ERY-0.015
CAR-0.015
MENT-0.015
Token beautiful
Token ID4950
Feature activation+1.36e-01

Pos logit contributions
ogue+0.076
 Sinn+0.075
fman+0.075
iferation+0.075
 hers+0.073
Neg logit contributions
MENTS-0.071
SPONSORED-0.067
ERY-0.067
CAR-0.067
MENT-0.066
Token crying
Token ID13774
Feature activation+2.57e-01

Pos logit contributions
ogue+0.260
 Sinn+0.258
fman+0.257
iferation+0.255
 hers+0.250
Neg logit contributions
MENTS-0.294
SPONSORED-0.281
ERY-0.280
CAR-0.278
MENT-0.276
Token stranger
Token ID16195
Feature activation+4.60e-02

Pos logit contributions
ogue+0.461
fman+0.457
 Sinn+0.456
iferation+0.451
 hers+0.448
Neg logit contributions
MENTS-0.584
SPONSORED-0.562
ERY-0.556
CAR-0.554
MENT-0.550
Token who
Token ID508
Feature activation+1.67e-01

Pos logit contributions
ogue+0.096
fman+0.095
 Sinn+0.095
iferation+0.094
 hers+0.094
Neg logit contributions
MENTS-0.090
SPONSORED-0.086
ERY-0.085
CAR-0.085
MENT-0.084
Token
Token ID447
Feature activation+1.43e-01

Pos logit contributions
fman+0.333
 Sinn+0.333
ogue+0.332
iferation+0.329
 hers+0.327
Neg logit contributions
MENTS-0.343
SPONSORED-0.325
ERY-0.324
CAR-0.322
MENT-0.321
Token
Token ID247
Feature activation-4.51e-03

Pos logit contributions
ogue+0.285
fman+0.283
iferation+0.283
��+0.283
 Sinn+0.282
Neg logit contributions
MENTS-0.293
SPONSORED-0.277
CAR-0.277
ERY-0.277
MENT-0.275
Tokens
Token ID82
Feature activation+4.58e-02

Pos logit contributions
MENTS+0.011
SPONSORED+0.011
ERY+0.011
CAR+0.011
MENT+0.011
Neg logit contributions
fman-0.007
ogue-0.007
iferation-0.007
 Sinn-0.007
lus-0.007
Token sitting
Token ID5586
Feature activation+4.85e-01

Pos logit contributions
ogue+0.094
fman+0.093
iferation+0.093
 Sinn+0.093
 hers+0.092
Neg logit contributions
MENTS-0.092
SPONSORED-0.088
ERY-0.087
CAR-0.086
MENT-0.086
Token is
Token ID318
Feature activation+2.71e-02

Pos logit contributions
MENTS+0.607
SPONSORED+0.581
ERY+0.576
CAR+0.574
MENT+0.570
Neg logit contributions
fman-0.578
ogue-0.577
iferation-0.576
 Sinn-0.574
lus-0.559
Token open
Token ID1280
Feature activation-2.62e-01

Pos logit contributions
fman+0.051
ogue+0.051
iferation+0.051
 Sinn+0.050
lus+0.049
Neg logit contributions
MENTS-0.059
SPONSORED-0.057
ERY-0.056
CAR-0.056
MENT-0.056
Token source
Token ID2723
Feature activation+2.52e-01

Pos logit contributions
MENTS+0.343
SPONSORED+0.315
ERY+0.312
CAR+0.308
 disappointment+0.307
Neg logit contributions
fman-0.722
 Sinn-0.719
iferation-0.717
ogue-0.716
 hers-0.701
Token -
Token ID532
Feature activation+1.72e-01

Pos logit contributions
fman+0.496
ogue+0.493
iferation+0.490
 Sinn+0.489
 hers+0.475
Neg logit contributions
MENTS-0.530
SPONSORED-0.506
ERY-0.499
CAR-0.496
MENT-0.496
Token the
Token ID262
Feature activation+1.15e-01

Pos logit contributions
fman+0.328
ogue+0.327
iferation+0.326
 Sinn+0.325
 hers+0.317
Neg logit contributions
MENTS-0.369
SPONSORED-0.354
ERY-0.350
CAR-0.350
MENT-0.347
Token code
Token ID2438
Feature activation-5.97e-01

Pos logit contributions
fman+0.264
iferation+0.263
ogue+0.263
 Sinn+0.263
ouf+0.258
Neg logit contributions
MENTS-0.203
SPONSORED-0.193
ERY-0.190
CAR-0.190
MENT-0.189
Token applicable
Token ID9723
Feature activation-2.28e-01

Pos logit contributions
MENTS+1.263
ERY+1.201
CAR+1.200
SPONSORED+1.199
 disappointment+1.190
Neg logit contributions
fman-1.156
iferation-1.153
ogue-1.148
 Sinn-1.139
��-1.121
Token to
Token ID284
Feature activation+5.45e-02

Pos logit contributions
MENTS+0.505
SPONSORED+0.484
ERY+0.481
CAR+0.480
MENT+0.477
Neg logit contributions
fman-0.418
ogue-0.416
iferation-0.416
 Sinn-0.413
lus-0.402
Token this
Token ID428
Feature activation+6.16e-02

Pos logit contributions
 Sinn+0.099
ogue+0.099
fman+0.098
iferation+0.098
 hers+0.097
Neg logit contributions
MENTS-0.121
SPONSORED-0.117
CAR-0.116
ERY-0.116
MENT-0.115
Token article
Token ID2708
Feature activation+3.00e-02

Pos logit contributions
ogue+0.130
fman+0.129
 Sinn+0.129
iferation+0.128
 hers+0.125
Neg logit contributions
MENTS-0.121
SPONSORED-0.114
CAR-0.113
MENT-0.113
ERY-0.113
Token can
Token ID460
Feature activation-7.95e-02

Pos logit contributions
fman+0.058
ogue+0.058
iferation+0.057
 Sinn+0.057
lus+0.056
Neg logit contributions
MENTS-0.064
SPONSORED-0.061
ERY-0.061
CAR-0.061
MENT-0.060
Token sit
Token ID1650
Feature activation+5.34e-01

Pos logit contributions
fman+0.046
ogue+0.046
 Sinn+0.046
iferation+0.046
lus+0.045
Neg logit contributions
MENTS-0.049
SPONSORED-0.047
ERY-0.046
CAR-0.046
MENT-0.046
Token out
Token ID503
Feature activation-2.39e-02

Pos logit contributions
ogue+1.038
fman+1.035
 Sinn+1.031
iferation+1.029
 hers+1.006
Neg logit contributions
MENTS-1.123
SPONSORED-1.078
ERY-1.067
CAR-1.061
MENT-1.057
Token during
Token ID1141
Feature activation-1.20e-02

Pos logit contributions
MENTS+0.053
SPONSORED+0.051
ERY+0.050
CAR+0.050
MENT+0.050
Neg logit contributions
ogue-0.044
fman-0.044
iferation-0.044
 Sinn-0.044
lus-0.043
Token the
Token ID262
Feature activation+3.33e-02

Pos logit contributions
MENTS+0.027
SPONSORED+0.026
ERY+0.025
CAR+0.025
MENT+0.025
Neg logit contributions
ogue-0.022
fman-0.022
 Sinn-0.022
iferation-0.022
lus-0.021
Token Spring
Token ID8225
Feature activation-3.83e-02

Pos logit contributions
ogue+0.065
 Sinn+0.065
iferation+0.065
fman+0.065
lus+0.063
Neg logit contributions
MENTS-0.070
SPONSORED-0.067
ERY-0.066
CAR-0.066
MENT-0.065
Token Split
Token ID27758
Feature activation-6.22e-01

Pos logit contributions
MENTS+0.084
SPONSORED+0.080
ERY+0.079
CAR+0.079
MENT+0.079
Neg logit contributions
ogue-0.072
fman-0.072
 Sinn-0.071
iferation-0.071
lus-0.069
Token due
Token ID2233
Feature activation-2.53e-01

Pos logit contributions
MENTS+1.316
SPONSORED+1.258
ERY+1.251
CAR+1.250
 disappointment+1.246
Neg logit contributions
fman-1.202
iferation-1.198
ogue-1.196
 Sinn-1.181
��-1.160
Token to
Token ID284
Feature activation-4.26e-03

Pos logit contributions
MENTS+0.559
SPONSORED+0.535
ERY+0.532
CAR+0.531
MENT+0.529
Neg logit contributions
fman-0.465
iferation-0.464
ogue-0.463
 Sinn-0.458
��-0.449
Token visa
Token ID14552
Feature activation-2.20e-01

Pos logit contributions
MENTS+0.009
SPONSORED+0.009
ERY+0.009
CAR+0.008
MENT+0.008
Neg logit contributions
 Sinn-0.008
ogue-0.008
iferation-0.008
fman-0.008
 hers-0.008
Token issues
Token ID2428
Feature activation+1.86e-01

Pos logit contributions
MENTS+0.326
SPONSORED+0.308
ERY+0.304
CAR+0.301
MENT+0.299
Neg logit contributions
ogue-0.564
iferation-0.562
 Sinn-0.562
fman-0.560
lus-0.551
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
fman+0.369
ogue+0.369
iferation+0.368
 Sinn+0.368
lus+0.358
Neg logit contributions
MENTS-0.385
SPONSORED-0.368
ERY-0.365
CAR-0.363
MENT-0.360
Token peer
Token ID12720
Feature activation-3.65e-01

Pos logit contributions
ogue+0.011
fman+0.011
iferation+0.011
 Sinn+0.010
lus+0.010
Neg logit contributions
MENTS-0.009
SPONSORED-0.008
ERY-0.008
CAR-0.008
MENT-0.008
Token-
Token ID12
Feature activation-7.20e-02

Pos logit contributions
MENTS+0.812
SPONSORED+0.777
ERY+0.773
CAR+0.771
MENT+0.766
Neg logit contributions
fman-0.666
ogue-0.665
iferation-0.664
 Sinn-0.658
lus-0.642
Tokenreviewed
Token ID32974
Feature activation-3.29e-01

Pos logit contributions
MENTS+0.183
SPONSORED+0.176
ERY+0.176
CAR+0.175
MENT+0.174
Neg logit contributions
fman-0.109
ogue-0.109
iferation-0.108
 Sinn-0.107
lus-0.104
Token journal
Token ID3989
Feature activation-3.71e-01

Pos logit contributions
MENTS+0.596
SPONSORED+0.565
ERY+0.561
CAR+0.559
MENT+0.555
Neg logit contributions
ogue-0.737
fman-0.737
iferation-0.735
 Sinn-0.730
lus-0.713
Token Clim
Token ID28573
Feature activation-1.98e-01

Pos logit contributions
MENTS+0.920
SPONSORED+0.885
ERY+0.881
CAR+0.878
MENT+0.874
Neg logit contributions
fman-0.585
ogue-0.584
iferation-0.582
 Sinn-0.578
lus-0.560
Tokenatic
Token ID1512
Feature activation-5.62e-01

Pos logit contributions
MENTS+0.366
SPONSORED+0.345
ERY+0.344
CAR+0.343
 disappointment+0.340
Neg logit contributions
ogue-0.442
iferation-0.440
fman-0.439
 Sinn-0.434
lus-0.427
Token Change
Token ID9794
Feature activation+1.39e-02

Pos logit contributions
MENTS+1.764
SPONSORED+1.717
CAR+1.705
ERY+1.704
 disappointment+1.699
Neg logit contributions
fman-0.514
iferation-0.504
ogue-0.501
 Sinn-0.490
��-0.473
Token,
Token ID11
Feature activation-5.18e-02

Pos logit contributions
ogue+0.025
iferation+0.025
 Sinn+0.024
fman+0.024
��+0.023
Neg logit contributions
MENTS-0.032
SPONSORED-0.030
ERY-0.030
CAR-0.030
MENT-0.030
Token provides
Token ID3769
Feature activation+1.75e-01

Pos logit contributions
MENTS+0.108
SPONSORED+0.103
ERY+0.102
CAR+0.102
MENT+0.102
Neg logit contributions
fman-0.102
ogue-0.102
iferation-0.102
 Sinn-0.101
lus-0.098
Token policymakers
Token ID29484
Feature activation-2.47e-03

Pos logit contributions
fman+0.338
ogue+0.337
iferation+0.335
 Sinn+0.334
 hers+0.325
Neg logit contributions
MENTS-0.371
SPONSORED-0.354
CAR-0.352
ERY-0.351
MENT-0.349
Token with
Token ID351
Feature activation+2.75e-02

Pos logit contributions
MENTS+0.005
SPONSORED+0.005
ERY+0.005
CAR+0.005
MENT+0.005
Neg logit contributions
ogue-0.005
fman-0.005
iferation-0.005
 Sinn-0.005
 hers-0.005
Token from
Token ID422
Feature activation+1.11e-01

Pos logit contributions
MENTS+0.106
SPONSORED+0.101
ERY+0.101
CAR+0.100
MENT+0.100
Neg logit contributions
fman-0.091
ogue-0.091
iferation-0.091
 Sinn-0.090
lus-0.088
Token the
Token ID262
Feature activation-3.59e-02

Pos logit contributions
 Sinn+0.190
fman+0.189
ogue+0.189
iferation+0.189
 hers+0.182
Neg logit contributions
MENTS-0.262
SPONSORED-0.252
ERY-0.249
MENT-0.249
CAR-0.248
Token Harvard
Token ID11131
Feature activation+2.60e-02

Pos logit contributions
MENTS+0.085
SPONSORED+0.082
ERY+0.081
CAR+0.081
MENT+0.081
Neg logit contributions
fman-0.060
ogue-0.060
 Sinn-0.060
iferation-0.060
ouf-0.058
Token-
Token ID12
Feature activation-6.05e-02

Pos logit contributions
ogue+0.045
 Sinn+0.045
iferation+0.045
fman+0.045
 hers+0.043
Neg logit contributions
MENTS-0.060
SPONSORED-0.058
ERY-0.058
CAR-0.057
MENT-0.057
TokenSmith
Token ID17919
Feature activation+4.20e-01

Pos logit contributions
MENTS+0.150
SPONSORED+0.144
ERY+0.144
CAR+0.143
 disappointment+0.143
Neg logit contributions
fman-0.095
ogue-0.095
iferation-0.095
 Sinn-0.094
 hers-0.091
Tokensonian
Token ID35202
Feature activation-6.49e-01

Pos logit contributions
fman+0.339
ogue+0.337
 Sinn+0.335
iferation+0.333
ouf+0.308
Neg logit contributions
MENTS-1.364
SPONSORED-1.320
ERY-1.317
CAR-1.314
MENT-1.310
Token Center
Token ID3337
Feature activation-5.90e-02

Pos logit contributions
MENTS+1.609
SPONSORED+1.546
CAR+1.537
ERY+1.536
 disappointment+1.528
Neg logit contributions
fman-1.027
ogue-1.021
iferation-1.020
 Sinn-1.014
��-0.980
Token for
Token ID329
Feature activation-3.35e-02

Pos logit contributions
MENTS+0.141
SPONSORED+0.135
ERY+0.134
CAR+0.134
MENT+0.133
Neg logit contributions
fman-0.098
ogue-0.098
iferation-0.098
 Sinn-0.097
lus-0.094
Token Ast
Token ID8304
Feature activation+1.52e-01

Pos logit contributions
MENTS+0.088
ERY+0.085
SPONSORED+0.085
CAR+0.084
 disappointment+0.084
Neg logit contributions
iferation-0.047
fman-0.047
ogue-0.047
 Sinn-0.047
lus-0.045
Tokenroph
Token ID10051
Feature activation+5.77e-01

Pos logit contributions
fman+0.357
ogue+0.351
iferation+0.350
 Sinn+0.344
ouf+0.343
Neg logit contributions
MENTS-0.269
SPONSORED-0.253
CAR-0.249
ERY-0.249
 disappointment-0.248
Tokenysics
Token ID23154
Feature activation+4.49e-01

Pos logit contributions
ogue+1.216
iferation+1.208
fman+1.206
 Sinn+1.182
lus+1.164
Neg logit contributions
MENTS-1.143
SPONSORED-1.087
 disappointment-1.072
CAR-1.071
MENT-1.064
Token,
Token ID11
Feature activation-1.18e-01

Pos logit contributions
MENTS+0.017
SPONSORED+0.015
ERY+0.015
CAR+0.015
MENT+0.015
Neg logit contributions
ogue-0.030
fman-0.030
iferation-0.030
 Sinn-0.030
lus-0.029
Token unsustainable
Token ID39554
Feature activation-3.19e-01

Pos logit contributions
MENTS+0.249
SPONSORED+0.237
ERY+0.236
CAR+0.235
MENT+0.234
Neg logit contributions
ogue-0.229
fman-0.229
 Sinn-0.227
iferation-0.227
 hers-0.222
Token.'
Token ID2637
Feature activation-1.04e-01

Pos logit contributions
MENTS+0.586
SPONSORED+0.556
CAR+0.551
ERY+0.551
MENT+0.545
Neg logit contributions
ogue-0.706
fman-0.705
iferation-0.702
 Sinn-0.701
ouf-0.684
Token (
Token ID357
Feature activation-4.10e-02

Pos logit contributions
MENTS+0.227
SPONSORED+0.218
ERY+0.216
CAR+0.215
MENT+0.214
Neg logit contributions
fman-0.196
ogue-0.196
iferation-0.195
 Sinn-0.194
lus-0.190
TokenCanadian
Token ID28203
Feature activation-4.00e-02

Pos logit contributions
MENTS+0.106
SPONSORED+0.102
 disappointment+0.101
ERY+0.101
MENT+0.100
Neg logit contributions
fman-0.060
ogue-0.060
iferation-0.059
 Sinn-0.059
lus-0.057
Token Federation
Token ID11937
Feature activation-6.55e-01

Pos logit contributions
MENTS+0.116
SPONSORED+0.112
ERY+0.111
CAR+0.111
MENT+0.111
Neg logit contributions
fman-0.047
ogue-0.046
iferation-0.046
 Sinn-0.046
lus-0.044
Token of
Token ID286
Feature activation-2.07e-02

Pos logit contributions
MENTS+1.337
SPONSORED+1.280
CAR+1.277
ERY+1.267
 disappointment+1.264
Neg logit contributions
fman-1.307
iferation-1.301
ogue-1.293
 Sinn-1.283
��-1.268
Token Independent
Token ID13362
Feature activation-2.33e-01

Pos logit contributions
MENTS+0.043
SPONSORED+0.041
CAR+0.041
ERY+0.040
MENT+0.040
Neg logit contributions
fman-0.042
iferation-0.041
ogue-0.041
��-0.041
 Sinn-0.040
Token Business
Token ID7320
Feature activation-2.50e-01

Pos logit contributions
MENTS+0.559
SPONSORED+0.542
CAR+0.541
MENT+0.532
ERY+0.529
Neg logit contributions
��-0.392
fman-0.392
ogue-0.384
iferation-0.383
onder-0.373
Token)
Token ID8
Feature activation+1.60e-02

Pos logit contributions
MENTS+0.542
SPONSORED+0.519
ERY+0.516
CAR+0.514
MENT+0.511
Neg logit contributions
fman-0.472
ogue-0.470
iferation-0.470
 Sinn-0.466
lus-0.455
Token The
Token ID383
Feature activation-8.79e-02

Pos logit contributions
 Sinn+0.028
fman+0.028
ogue+0.028
iferation+0.028
ouf+0.027
Neg logit contributions
MENTS-0.036
SPONSORED-0.035
ERY-0.035
CAR-0.035
 disappointment-0.034