TOP ACTIVATIONS
MAX = 0.859

had a lot of interrupt ions , he said
room . P PT PowerPoint slide Power Point slide
slide Power Point slide PNG larger image
, a pair of Silver Phant oms should acqu it themselves
, procedural maps were already bog ging us in two factors
at the Mal m ö Opera and Music Theatre in 2011
. This caused continual interrupt ions in its scientific use during
, I preferred the Silver Phant oms to the Myth os
little wonders . The Silver Phant oms can t
gaps . Net works Const antly Changing
andre cre ational safety / falls / pub s . html
superb neutrality , the Silver Phant oms more bass slam and
s of the u n ions , and even more so
equal or exceed the Silver Phant oms imaging and
no one knows whether ax ions even exist . It 's
of Dev ial et Silver Phant oms , with Dial og
/ Deb ate Op in ions expressed in USA TODAY 's
organizations to offer HR intern ships , fund studies abroad ,
over which to sound alarm bells . This is less about
resident as opposed to the revolving door of previous restaurants .

MIN ACTIVATIONS
MIN = -0.894

in the mission , material ejected during launch vehicle separation caused
after he was caught shop lifting a box of boots and
likely to engage in shop lifting or other petty crimes .
fluffy creatures . Hug your friends .
sent a cor onal mass eject ion ( charged particles )
and active bystand er training .
financial crisis . Hug hes said the
seats on the Newcastle United Fans Forum below :
и я а о н
active part in the new Fans ' Forum . It is
starts at Sh arm el Sheikh , Egypt , and cru
Qu ito to fight his eject ion , should he be
с с е , е к о т о
." Lu ol Deng was traded to the Cavs
United to launch their new Fans ' Forum , and will
н ы а м н
the court . Smith was ejected and suspended for two games
in public forums . Says Neh ring , All
items , via a revised Fans section on www . n
150 th birthday of Frank Lloyd Wright , America

INTERVAL 0.421 - 0.859
CONTAINS 0.370%

full bailout . Stat ements made by Prime Minister
ais took aim at stars such as Robert Down ey Jr
trolls and ultimately benefit only trolls . Look into
example , a physician would employ blood let ting , using
former Wat erville Hotel ( now the Wat erville Tavern )

INTERVAL -0.017 - 0.421
CONTAINS 55.738%

his grandmother motivated him to do his best and
hand , emphasized the use of plant - based remedies instead
of K ins ba ile G iant
). Even at this rapid speed , the solar system would
kilometers ( 1 , 200 miles ), the F chain

INTERVAL -0.456 - -0.017
CONTAINS 43.592%

regions are responsible for this . Nobody is really
have given Brah ms the bow in person but their collected
t mean you can t look beyond
excitement and the immersion it is your job to provide
million airport opened in late 2007 . Local officials were so

INTERVAL -0.894 - -0.456
CONTAINS 0.300%

, IGN 's Halo super fans Ryan McC aff rey and
relative len iency . Although tear gas was fired at some
H enson 's post hum ous induction as a Disney Legend
's lots of really positive interactions to start your day ."
WC has been at yp ically open about his willingness to
Token had
Token ID550
Feature activation-3.38e-02

Pos logit contributions
 Caption+0.016
hement+0.015
odynamic+0.015
enegger+0.015
acterial+0.015
Neg logit contributions
 har-0.022
 whence-0.022
 Phant-0.022
 gib-0.022
nos-0.022
Token a
Token ID257
Feature activation+1.17e-03

Pos logit contributions
 Caption+0.025
hement+0.024
odynamic+0.024
enegger+0.024
acterial+0.024
Neg logit contributions
 whence-0.019
 Phant-0.019
 har-0.019
 gib-0.019
nos-0.018
Token lot
Token ID1256
Feature activation+5.45e-02

Pos logit contributions
 har+0.001
 Phant+0.001
 whence+0.001
 gib+0.001
 hay+0.001
Neg logit contributions
 Caption-0.001
hement-0.001
odynamic-0.001
enegger-0.001
acterial-0.001
Token of
Token ID286
Feature activation+1.11e-01

Pos logit contributions
 Phant+0.025
 whence+0.025
 gib+0.024
 har+0.024
nos+0.024
Neg logit contributions
 Caption-0.047
hement-0.045
odynamic-0.044
acterial-0.044
enegger-0.044
Token interrupt
Token ID11313
Feature activation+1.68e-01

Pos logit contributions
 har+0.069
 Phant+0.069
 whence+0.069
 gib+0.069
 hay+0.067
Neg logit contributions
 Caption-0.077
hement-0.073
odynamic-0.072
enegger-0.072
acterial-0.071
Tokenions
Token ID507
Feature activation+8.59e-01

Pos logit contributions
 whence+0.023
 Phant+0.023
 har+0.022
 gib+0.021
nos+0.019
Neg logit contributions
 Caption-0.199
hement-0.192
odynamic-0.191
enegger-0.189
acterial-0.189
Token,
Token ID11
Feature activation-3.78e-03

Pos logit contributions
 whence+0.567
 har+0.566
 Phant+0.558
 gib+0.554
 hay+0.543
Neg logit contributions
 Caption-0.568
hement-0.538
odynamic-0.529
enegger-0.529
acterial-0.521
Token
Token ID447
Feature activation-2.76e-02

Pos logit contributions
 Caption+0.003
hement+0.003
odynamic+0.003
enegger+0.003
acterial+0.003
Neg logit contributions
 whence-0.002
 Phant-0.002
 har-0.002
 gib-0.002
nos-0.002
Token
Token ID251
Feature activation-3.40e-01

Pos logit contributions
 Caption+0.018
hement+0.017
odynamic+0.017
acterial+0.016
enegger+0.016
Neg logit contributions
 Phant-0.018
 whence-0.018
 har-0.018
 gib-0.018
nos-0.018
Token he
Token ID339
Feature activation+1.59e-02

Pos logit contributions
 Caption+0.252
hement+0.238
odynamic+0.236
enegger+0.233
acterial+0.232
Neg logit contributions
 whence-0.196
 Phant-0.196
 har-0.194
 gib-0.193
nos-0.188
Token said
Token ID531
Feature activation+7.61e-02

Pos logit contributions
 whence+0.007
 Phant+0.007
 har+0.007
 gib+0.007
nos+0.007
Neg logit contributions
 Caption-0.014
hement-0.013
odynamic-0.013
enegger-0.013
acterial-0.013
Token room
Token ID2119
Feature activation+2.15e-01

Pos logit contributions
 Caption+0.007
hement+0.006
odynamic+0.006
enegger+0.006
acterial+0.006
Neg logit contributions
 whence-0.004
 Phant-0.004
 har-0.004
 gib-0.004
nos-0.004
Token.
Token ID13
Feature activation+5.06e-03

Pos logit contributions
 whence+0.139
 Phant+0.138
 har+0.138
 gib+0.136
nos+0.133
Neg logit contributions
 Caption-0.145
hement-0.137
odynamic-0.135
enegger-0.134
acterial-0.133
Token P
Token ID350
Feature activation+7.44e-02

Pos logit contributions
 Phant+0.003
 whence+0.003
 har+0.003
 gib+0.003
nos+0.003
Neg logit contributions
 Caption-0.003
hement-0.003
odynamic-0.003
enegger-0.003
acterial-0.003
TokenPT
Token ID11571
Feature activation+1.03e-01

Pos logit contributions
 har+0.051
 Phant+0.051
 whence+0.051
 gib+0.050
nos+0.050
Neg logit contributions
 Caption-0.047
hement-0.044
odynamic-0.043
enegger-0.043
acterial-0.042
Token PowerPoint
Token ID43563
Feature activation+4.86e-01

Pos logit contributions
 whence+0.028
 Phant+0.028
 har+0.027
 gib+0.027
nos+0.026
Neg logit contributions
 Caption-0.108
hement-0.104
odynamic-0.103
enegger-0.102
acterial-0.102
Token slide
Token ID10649
Feature activation+8.45e-01

Pos logit contributions
 whence+0.171
 har+0.163
 Phant+0.162
 gib+0.161
nos+0.155
Neg logit contributions
 Caption-0.480
hement-0.451
odynamic-0.448
acterial-0.444
utherland-0.443
Token
Token ID198
Feature activation+2.38e-01

Pos logit contributions
 Phant+0.221
 har+0.216
 whence+0.206
nos+0.199
 gib+0.196
Neg logit contributions
 Caption-0.888
hement-0.879
enegger-0.866
odynamic-0.863
acterial-0.858
Token
Token ID198
Feature activation+2.18e-01

Pos logit contributions
 whence+0.066
 har+0.066
 Phant+0.065
 gib+0.063
nos+0.061
Neg logit contributions
 Caption-0.247
hement-0.240
odynamic-0.238
enegger-0.236
acterial-0.235
TokenPower
Token ID13434
Feature activation+5.17e-01

Pos logit contributions
 Phant+0.152
 har+0.151
 whence+0.150
 gib+0.149
nos+0.148
Neg logit contributions
 Caption-0.134
hement-0.127
enegger-0.126
odynamic-0.125
acterial-0.124
TokenPoint
Token ID12727
Feature activation+2.36e-01

Pos logit contributions
 whence+0.285
 Phant+0.283
 har+0.279
 gib+0.277
nos+0.275
Neg logit contributions
 Caption-0.397
hement-0.376
odynamic-0.372
acterial-0.366
enegger-0.364
Token slide
Token ID10649
Feature activation+8.34e-01

Pos logit contributions
 har+0.063
 Phant+0.062
 whence+0.062
 gib+0.061
 hay+0.058
Neg logit contributions
 Caption-0.246
enegger-0.241
hement-0.241
odynamic-0.239
utherland-0.236
Token slide
Token ID10649
Feature activation+8.45e-01

Pos logit contributions
 whence+0.171
 har+0.163
 Phant+0.162
 gib+0.161
nos+0.155
Neg logit contributions
 Caption-0.480
hement-0.451
odynamic-0.448
acterial-0.444
utherland-0.443
Token
Token ID198
Feature activation+2.38e-01

Pos logit contributions
 Phant+0.221
 har+0.216
 whence+0.206
nos+0.199
 gib+0.196
Neg logit contributions
 Caption-0.888
hement-0.879
enegger-0.866
odynamic-0.863
acterial-0.858
Token
Token ID198
Feature activation+2.18e-01

Pos logit contributions
 whence+0.066
 har+0.066
 Phant+0.065
 gib+0.063
nos+0.061
Neg logit contributions
 Caption-0.247
hement-0.240
odynamic-0.238
enegger-0.236
acterial-0.235
TokenPower
Token ID13434
Feature activation+5.17e-01

Pos logit contributions
 Phant+0.152
 har+0.151
 whence+0.150
 gib+0.149
nos+0.148
Neg logit contributions
 Caption-0.134
hement-0.127
enegger-0.126
odynamic-0.125
acterial-0.124
TokenPoint
Token ID12727
Feature activation+2.36e-01

Pos logit contributions
 whence+0.285
 Phant+0.283
 har+0.279
 gib+0.277
nos+0.275
Neg logit contributions
 Caption-0.397
hement-0.376
odynamic-0.372
acterial-0.366
enegger-0.364
Token slide
Token ID10649
Feature activation+8.34e-01

Pos logit contributions
 har+0.063
 Phant+0.062
 whence+0.062
 gib+0.061
 hay+0.058
Neg logit contributions
 Caption-0.246
enegger-0.241
hement-0.241
odynamic-0.239
utherland-0.236
Token PNG
Token ID36182
Feature activation+3.71e-01

Pos logit contributions
 Phant+0.386
 har+0.379
 whence+0.372
nos+0.368
 gib+0.365
Neg logit contributions
 Caption-0.703
hement-0.692
enegger-0.690
odynamic-0.680
utherland-0.680
Token larger
Token ID4025
Feature activation+8.88e-02

Pos logit contributions
 har+0.228
 whence+0.225
 gib+0.225
 Phant+0.224
ancies+0.219
Neg logit contributions
 Caption-0.256
odynamic-0.247
hement-0.247
acterial-0.245
enegger-0.244
Token image
Token ID2939
Feature activation+1.73e-01

Pos logit contributions
 whence+0.060
 Phant+0.059
 har+0.059
 gib+0.059
nos+0.057
Neg logit contributions
 Caption-0.057
hement-0.054
odynamic-0.053
enegger-0.053
acterial-0.052
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
 Phant+0.047
 har+0.047
 whence+0.047
 gib+0.044
nos+0.044
Neg logit contributions
 Caption-0.180
hement-0.176
odynamic-0.174
enegger-0.174
acterial-0.173
Token
Token ID198
Feature activation+1.17e-01

Pos logit contributions
 whence+0.030
 har+0.029
 Phant+0.029
 gib+0.028
nos+0.027
Neg logit contributions
 Caption-0.108
hement-0.105
odynamic-0.104
enegger-0.103
acterial-0.103
Token,
Token ID11
Feature activation+7.91e-04

Pos logit contributions
 whence+0.145
 Phant+0.145
 har+0.144
 gib+0.143
nos+0.139
Neg logit contributions
 Caption-0.169
hement-0.160
odynamic-0.158
enegger-0.157
acterial-0.156
Token a
Token ID257
Feature activation+5.15e-02

Pos logit contributions
 whence+0.000
 Phant+0.000
 har+0.000
 gib+0.000
nos+0.000
Neg logit contributions
 Caption-0.001
hement-0.001
odynamic-0.001
enegger-0.001
acterial-0.001
Token pair
Token ID5166
Feature activation-2.07e-01

Pos logit contributions
 Phant+0.033
 har+0.033
 whence+0.032
 gib+0.032
 hay+0.031
Neg logit contributions
 Caption-0.036
hement-0.034
odynamic-0.033
enegger-0.033
acterial-0.033
Token of
Token ID286
Feature activation+8.04e-02

Pos logit contributions
 Caption+0.180
hement+0.171
odynamic+0.170
enegger+0.168
acterial+0.168
Neg logit contributions
 whence-0.093
 Phant-0.093
 har-0.092
 gib-0.091
nos-0.089
Token Silver
Token ID7698
Feature activation+2.08e-01

Pos logit contributions
 Phant+0.047
 whence+0.047
 har+0.047
 gib+0.046
nos+0.045
Neg logit contributions
 Caption-0.059
hement-0.056
odynamic-0.055
enegger-0.055
acterial-0.054
Token Phant
Token ID38699
Feature activation+8.08e-01

Pos logit contributions
 Phant+0.043
 whence+0.042
 har+0.041
 gib+0.040
nos+0.038
Neg logit contributions
 Caption-0.233
hement-0.224
odynamic-0.223
enegger-0.221
acterial-0.221
Tokenoms
Token ID3150
Feature activation-3.37e-02

Pos logit contributions
 Phant+0.571
 har+0.565
 whence+0.558
ancies+0.557
nos+0.555
Neg logit contributions
 Caption-0.502
hement-0.477
enegger-0.470
odynamic-0.458
utherland-0.453
Token should
Token ID815
Feature activation+4.44e-02

Pos logit contributions
 Caption+0.021
hement+0.020
odynamic+0.019
enegger+0.019
acterial+0.019
Neg logit contributions
 har-0.024
 whence-0.024
 Phant-0.024
 gib-0.023
 hay-0.023
Token acqu
Token ID4078
Feature activation+1.38e-01

Pos logit contributions
 whence+0.024
 har+0.024
 Phant+0.024
 gib+0.023
nos+0.023
Neg logit contributions
 Caption-0.035
hement-0.033
odynamic-0.033
enegger-0.033
acterial-0.032
Tokenit
Token ID270
Feature activation+2.12e-01

Pos logit contributions
 har+0.083
 whence+0.082
 Phant+0.081
nos+0.081
 gib+0.080
Neg logit contributions
 Caption-0.101
enegger-0.095
hement-0.095
-0.093
odynamic-0.093
Token themselves
Token ID2405
Feature activation+2.53e-01

Pos logit contributions
 Phant+0.125
 whence+0.124
 har+0.124
 gib+0.122
 hay+0.119
Neg logit contributions
 Caption-0.155
hement-0.146
odynamic-0.145
enegger-0.144
acterial-0.143
Token,
Token ID11
Feature activation-1.20e-01

Pos logit contributions
 Caption+0.014
hement+0.013
odynamic+0.013
enegger+0.013
acterial+0.013
Neg logit contributions
 whence-0.012
 Phant-0.012
 har-0.012
 gib-0.011
nos-0.011
Token procedural
Token ID27931
Feature activation+9.46e-02

Pos logit contributions
 Caption+0.089
hement+0.085
odynamic+0.084
enegger+0.083
acterial+0.083
Neg logit contributions
 whence-0.069
 Phant-0.069
 har-0.069
 gib-0.068
nos-0.066
Token maps
Token ID8739
Feature activation+2.28e-01

Pos logit contributions
 whence+0.046
 Phant+0.046
 har+0.045
 gib+0.045
nos+0.044
Neg logit contributions
 Caption-0.079
hement-0.075
odynamic-0.074
enegger-0.073
acterial-0.073
Token were
Token ID547
Feature activation-1.86e-01

Pos logit contributions
 har+0.173
 whence+0.173
 Phant+0.172
 gib+0.171
nos+0.167
Neg logit contributions
 Caption-0.129
hement-0.120
odynamic-0.118
enegger-0.117
acterial-0.116
Token already
Token ID1541
Feature activation-1.39e-01

Pos logit contributions
 Caption+0.140
hement+0.132
odynamic+0.131
enegger+0.129
acterial+0.129
Neg logit contributions
 whence-0.105
 Phant-0.105
 har-0.104
 gib-0.103
nos-0.101
Token bog
Token ID22922
Feature activation+8.03e-01

Pos logit contributions
 Caption+0.097
hement+0.092
odynamic+0.091
enegger+0.090
acterial+0.089
Neg logit contributions
 whence-0.085
 Phant-0.085
 har-0.085
 gib-0.084
nos-0.082
Tokenging
Token ID2667
Feature activation+1.01e-01

Pos logit contributions
 whence+0.504
 Phant+0.503
 har+0.500
 gib+0.495
nos+0.486
Neg logit contributions
 Caption-0.554
hement-0.521
odynamic-0.517
enegger-0.509
acterial-0.508
Token us
Token ID514
Feature activation+8.87e-02

Pos logit contributions
 whence+0.064
 Phant+0.064
 har+0.063
 gib+0.063
 hay+0.061
Neg logit contributions
 Caption-0.070
hement-0.066
odynamic-0.065
enegger-0.065
acterial-0.064
Token in
Token ID287
Feature activation-1.25e-01

Pos logit contributions
 har+0.054
 whence+0.054
 Phant+0.054
 gib+0.054
 hay+0.053
Neg logit contributions
 Caption-0.062
hement-0.059
odynamic-0.058
enegger-0.058
acterial-0.058
Token two
Token ID734
Feature activation-3.14e-01

Pos logit contributions
 Caption+0.089
hement+0.083
odynamic+0.083
enegger+0.082
acterial+0.081
Neg logit contributions
 whence-0.077
 Phant-0.077
 har-0.076
 gib-0.076
nos-0.074
Token factors
Token ID5087
Feature activation+6.06e-02

Pos logit contributions
 Caption+0.211
hement+0.198
odynamic+0.196
enegger+0.193
acterial+0.193
Neg logit contributions
 whence-0.204
 Phant-0.203
 har-0.203
 gib-0.201
nos-0.197
Token at
Token ID379
Feature activation+9.57e-02

Pos logit contributions
 Caption+0.051
hement+0.048
odynamic+0.047
enegger+0.047
acterial+0.047
Neg logit contributions
 whence-0.040
 Phant-0.040
 har-0.040
 gib-0.040
nos-0.039
Token the
Token ID262
Feature activation+1.34e-01

Pos logit contributions
 whence+0.057
 Phant+0.057
 har+0.057
 gib+0.056
nos+0.055
Neg logit contributions
 Caption-0.069
hement-0.066
odynamic-0.065
enegger-0.064
acterial-0.064
Token Mal
Token ID4434
Feature activation+2.74e-01

Pos logit contributions
 Phant+0.081
 har+0.081
 whence+0.080
 gib+0.079
nos+0.077
Neg logit contributions
 Caption-0.096
hement-0.091
odynamic-0.090
enegger-0.089
acterial-0.089
Tokenm
Token ID76
Feature activation+2.61e-01

Pos logit contributions
 whence+0.140
 Phant+0.139
 har+0.139
 gib+0.137
nos+0.134
Neg logit contributions
 Caption-0.222
hement-0.211
odynamic-0.209
enegger-0.207
acterial-0.206
Tokenö
Token ID9101
Feature activation+3.80e-01

Pos logit contributions
 whence+0.187
 Phant+0.186
 har+0.185
 gib+0.184
nos+0.181
Neg logit contributions
 Caption-0.158
hement-0.147
odynamic-0.146
enegger-0.144
acterial-0.143
Token Opera
Token ID26049
Feature activation+7.93e-01

Pos logit contributions
 whence+0.198
 Phant+0.197
 har+0.196
 gib+0.194
nos+0.190
Neg logit contributions
 Caption-0.303
hement-0.287
odynamic-0.285
enegger-0.282
acterial-0.281
Token and
Token ID290
Feature activation+9.23e-02

Pos logit contributions
 whence+0.523
 Phant+0.522
 har+0.519
 gib+0.514
nos+0.505
Neg logit contributions
 Caption-0.523
hement-0.490
odynamic-0.486
enegger-0.479
acterial-0.478
Token Music
Token ID7849
Feature activation+1.56e-01

Pos logit contributions
 Phant+0.057
 whence+0.057
 har+0.056
 gib+0.056
nos+0.054
Neg logit contributions
 Caption-0.065
hement-0.062
odynamic-0.061
enegger-0.060
acterial-0.060
Token Theatre
Token ID15752
Feature activation+2.79e-01

Pos logit contributions
 whence+0.086
 Phant+0.086
 har+0.086
 gib+0.085
nos+0.083
Neg logit contributions
 Caption-0.119
hement-0.112
odynamic-0.111
enegger-0.110
acterial-0.110
Token in
Token ID287
Feature activation+3.45e-02

Pos logit contributions
 whence+0.192
 Phant+0.192
 har+0.191
 gib+0.190
nos+0.186
Neg logit contributions
 Caption-0.176
hement-0.164
odynamic-0.163
enegger-0.160
acterial-0.160
Token 2011
Token ID2813
Feature activation+1.72e-01

Pos logit contributions
 whence+0.019
 Phant+0.019
 har+0.019
 gib+0.019
nos+0.019
Neg logit contributions
 Caption-0.026
hement-0.025
odynamic-0.025
enegger-0.024
acterial-0.024
Token.
Token ID13
Feature activation-7.13e-02

Pos logit contributions
 whence+0.007
 Phant+0.007
 har+0.007
 gib+0.007
nos+0.007
Neg logit contributions
 Caption-0.007
hement-0.007
odynamic-0.007
enegger-0.007
acterial-0.007
Token This
Token ID770
Feature activation-5.62e-02

Pos logit contributions
 Caption+0.058
hement+0.056
odynamic+0.055
enegger+0.055
acterial+0.054
Neg logit contributions
 Phant-0.036
 whence-0.036
 har-0.035
 gib-0.035
nos-0.034
Token caused
Token ID4073
Feature activation-7.10e-02

Pos logit contributions
 Caption+0.036
hement+0.034
enegger+0.034
odynamic+0.034
acterial+0.034
Neg logit contributions
 har-0.038
 Phant-0.038
 whence-0.037
 gib-0.037
 hay-0.036
Token continual
Token ID37639
Feature activation-9.21e-02

Pos logit contributions
 Caption+0.052
hement+0.049
odynamic+0.049
enegger+0.049
acterial+0.048
Neg logit contributions
 har-0.042
 Phant-0.042
 whence-0.041
 gib-0.041
 hay-0.040
Token interrupt
Token ID11313
Feature activation+1.19e-01

Pos logit contributions
 Caption+0.063
hement+0.060
enegger+0.059
odynamic+0.059
acterial+0.058
Neg logit contributions
 har-0.059
 Phant-0.058
 gib-0.058
 whence-0.058
 hay-0.057
Tokenions
Token ID507
Feature activation+7.85e-01

Pos logit contributions
 whence+0.016
 Phant+0.016
 har+0.016
 gib+0.015
nos+0.014
Neg logit contributions
 Caption-0.141
hement-0.136
odynamic-0.135
enegger-0.134
acterial-0.134
Token in
Token ID287
Feature activation-8.58e-02

Pos logit contributions
 whence+0.534
 har+0.531
 Phant+0.529
 gib+0.524
 hay+0.513
Neg logit contributions
 Caption-0.502
hement-0.474
odynamic-0.468
enegger-0.468
acterial-0.461
Token its
Token ID663
Feature activation+2.07e-01

Pos logit contributions
 Caption+0.062
hement+0.059
odynamic+0.058
enegger+0.057
acterial+0.057
Neg logit contributions
 whence-0.051
 Phant-0.051
 har-0.051
 gib-0.050
nos-0.049
Token scientific
Token ID5654
Feature activation+1.88e-01

Pos logit contributions
 har+0.127
 Phant+0.126
 whence+0.126
 gib+0.124
 hay+0.122
Neg logit contributions
 Caption-0.147
hement-0.139
odynamic-0.138
enegger-0.138
acterial-0.136
Token use
Token ID779
Feature activation+2.32e-01

Pos logit contributions
 har+0.105
 whence+0.105
 Phant+0.104
 gib+0.104
 hay+0.101
Neg logit contributions
 Caption-0.143
hement-0.136
odynamic-0.135
enegger-0.134
acterial-0.133
Token during
Token ID1141
Feature activation-1.72e-01

Pos logit contributions
 whence+0.151
 Phant+0.150
 har+0.150
 gib+0.148
nos+0.146
Neg logit contributions
 Caption-0.155
hement-0.145
odynamic-0.144
enegger-0.142
acterial-0.141
Token,
Token ID11
Feature activation+5.14e-02

Pos logit contributions
 whence+0.136
 Phant+0.136
 har+0.135
 gib+0.134
nos+0.132
Neg logit contributions
 Caption-0.143
hement-0.134
odynamic-0.133
enegger-0.131
acterial-0.131
Token I
Token ID314
Feature activation-1.64e-02

Pos logit contributions
 Phant+0.030
 whence+0.030
 har+0.029
 gib+0.029
nos+0.028
Neg logit contributions
 Caption-0.038
hement-0.036
odynamic-0.036
enegger-0.036
acterial-0.036
Token preferred
Token ID9871
Feature activation+1.96e-01

Pos logit contributions
 Caption+0.011
hement+0.011
odynamic+0.011
enegger+0.011
acterial+0.010
Neg logit contributions
 har-0.010
 Phant-0.010
 whence-0.010
 gib-0.010
 hay-0.010
Token the
Token ID262
Feature activation+2.80e-02

Pos logit contributions
 Phant+0.111
 whence+0.111
 har+0.111
 gib+0.109
nos+0.107
Neg logit contributions
 Caption-0.148
hement-0.139
odynamic-0.139
enegger-0.137
acterial-0.137
Token Silver
Token ID7698
Feature activation+1.83e-01

Pos logit contributions
 Phant+0.017
 har+0.017
 whence+0.017
 gib+0.017
 hay+0.016
Neg logit contributions
 Caption-0.020
hement-0.019
odynamic-0.019
enegger-0.019
acterial-0.019
Token Phant
Token ID38699
Feature activation+7.61e-01

Pos logit contributions
 Phant+0.018
 whence+0.018
 har+0.017
 gib+0.016
nos+0.014
Neg logit contributions
 Caption-0.223
hement-0.216
odynamic-0.215
enegger-0.213
acterial-0.213
Tokenoms
Token ID3150
Feature activation-8.65e-02

Pos logit contributions
 Phant+0.541
 har+0.534
 whence+0.530
nos+0.524
ancies+0.521
Neg logit contributions
 Caption-0.465
hement-0.448
enegger-0.438
odynamic-0.429
utherland-0.426
Token to
Token ID284
Feature activation+8.18e-02

Pos logit contributions
 Caption+0.056
hement+0.053
odynamic+0.053
enegger+0.052
acterial+0.052
Neg logit contributions
 Phant-0.058
 whence-0.058
 har-0.058
 gib-0.057
nos-0.056
Token the
Token ID262
Feature activation+3.63e-02

Pos logit contributions
 Phant+0.047
 whence+0.047
 har+0.047
 gib+0.046
nos+0.045
Neg logit contributions
 Caption-0.061
hement-0.058
odynamic-0.057
enegger-0.057
acterial-0.056
Token Myth
Token ID18900
Feature activation+3.54e-01

Pos logit contributions
 Phant+0.023
 whence+0.022
 har+0.022
 gib+0.022
nos+0.021
Neg logit contributions
 Caption-0.025
hement-0.024
odynamic-0.024
enegger-0.024
acterial-0.024
Tokenos
Token ID418
Feature activation-2.84e-02

Pos logit contributions
 whence+0.231
nos+0.231
 Phant+0.230
 har+0.230
 gib+0.222
Neg logit contributions
 Caption-0.234
hement-0.227
enegger-0.219
acterial-0.217
odynamic-0.216
Token little
Token ID1310
Feature activation+4.18e-02

Pos logit contributions
 Phant+0.156
 har+0.155
 whence+0.155
 gib+0.152
 hay+0.148
Neg logit contributions
 Caption-0.184
hement-0.174
odynamic-0.173
enegger-0.172
acterial-0.170
Token wonders
Token ID19294
Feature activation+4.09e-01

Pos logit contributions
 whence+0.024
 Phant+0.024
 har+0.024
 gib+0.024
nos+0.023
Neg logit contributions
 Caption-0.031
hement-0.029
odynamic-0.029
enegger-0.029
acterial-0.028
Token.
Token ID13
Feature activation+7.99e-02

Pos logit contributions
 whence+0.250
 Phant+0.249
 har+0.248
 gib+0.245
nos+0.240
Neg logit contributions
 Caption-0.290
hement-0.273
odynamic-0.270
enegger-0.268
acterial-0.267
Token The
Token ID383
Feature activation-1.89e-02

Pos logit contributions
 whence+0.038
 Phant+0.038
 har+0.038
 gib+0.037
nos+0.036
Neg logit contributions
 Caption-0.067
hement-0.064
odynamic-0.064
enegger-0.063
acterial-0.063
Token Silver
Token ID7698
Feature activation+1.77e-01

Pos logit contributions
 Caption+0.013
hement+0.012
odynamic+0.012
enegger+0.012
acterial+0.012
Neg logit contributions
 Phant-0.012
 har-0.012
 whence-0.012
 gib-0.012
 hay-0.011
Token Phant
Token ID38699
Feature activation+7.42e-01

Pos logit contributions
 whence+0.010
 Phant+0.009
 har+0.009
 gib+0.009
nos+0.006
Neg logit contributions
 Caption-0.223
hement-0.216
odynamic-0.215
enegger-0.213
acterial-0.213
Tokenoms
Token ID3150
Feature activation-5.37e-02

Pos logit contributions
 Phant+0.526
 har+0.522
 whence+0.517
nos+0.510
ancies+0.510
Neg logit contributions
 Caption-0.454
hement-0.438
enegger-0.428
odynamic-0.418
utherland-0.416
Token can
Token ID460
Feature activation-2.12e-01

Pos logit contributions
 Caption+0.033
hement+0.031
odynamic+0.030
enegger+0.030
acterial+0.030
Neg logit contributions
 Phant-0.038
 whence-0.038
 har-0.038
 gib-0.038
nos-0.037
Token
Token ID447
Feature activation-2.78e-02

Pos logit contributions
 Caption+0.170
hement+0.161
odynamic+0.160
enegger+0.158
acterial+0.157
Neg logit contributions
 whence-0.110
 Phant-0.110
 har-0.109
 gib-0.108
nos-0.105
Token
Token ID247
Feature activation+1.51e-01

Pos logit contributions
 Caption+0.021
enegger+0.021
hement+0.021
odynamic+0.020
utherland+0.020
Neg logit contributions
 har-0.015
 whence-0.015
 Phant-0.015
 gib-0.014
 hay-0.014
Tokent
Token ID83
Feature activation+4.11e-02

Pos logit contributions
 whence+0.058
 Phant+0.058
 har+0.057
 gib+0.056
nos+0.054
Neg logit contributions
 Caption-0.142
hement-0.135
odynamic-0.135
enegger-0.134
acterial-0.133
Token gaps
Token ID17332
Feature activation+3.09e-01

Pos logit contributions
 Caption+0.027
hement+0.026
odynamic+0.025
enegger+0.025
acterial+0.025
Neg logit contributions
 whence-0.024
 Phant-0.023
 har-0.023
 gib-0.023
nos-0.023
Token.
Token ID13
Feature activation+5.45e-02

Pos logit contributions
 whence+0.196
 Phant+0.195
 har+0.195
 gib+0.192
nos+0.189
Neg logit contributions
 Caption-0.212
hement-0.199
odynamic-0.197
enegger-0.194
acterial-0.194
Token
Token ID198
Feature activation+8.57e-02

Pos logit contributions
 whence+0.030
 Phant+0.030
 har+0.029
 gib+0.029
nos+0.028
Neg logit contributions
 Caption-0.042
hement-0.040
odynamic-0.040
enegger-0.039
acterial-0.039
Token
Token ID198
Feature activation+7.30e-02

Pos logit contributions
 whence+0.025
 Phant+0.025
 har+0.024
 gib+0.024
nos+0.023
Neg logit contributions
 Caption-0.088
hement-0.085
odynamic-0.084
enegger-0.084
acterial-0.083
TokenNet
Token ID7934
Feature activation-1.28e-01

Pos logit contributions
 whence+0.054
 Phant+0.054
 har+0.054
 gib+0.053
nos+0.053
Neg logit contributions
 Caption-0.042
hement-0.039
odynamic-0.039
enegger-0.038
acterial-0.038
Tokenworks
Token ID5225
Feature activation+7.37e-01

Pos logit contributions
 Caption+0.105
hement+0.099
odynamic+0.099
enegger+0.098
acterial+0.097
Neg logit contributions
 whence-0.064
 Phant-0.064
 har-0.063
 gib-0.063
nos-0.061
Token Const
Token ID4757
Feature activation-1.11e-01

Pos logit contributions
 Phant+0.398
 har+0.396
 whence+0.395
 gib+0.388
 hay+0.379
Neg logit contributions
 Caption-0.573
hement-0.545
odynamic-0.540
enegger-0.538
acterial-0.536
Tokenantly
Token ID3875
Feature activation-4.42e-01

Pos logit contributions
 Caption+0.067
hement+0.062
odynamic+0.062
enegger+0.061
acterial+0.060
Neg logit contributions
 whence-0.079
 Phant-0.079
 har-0.079
 gib-0.078
nos-0.077
Token Changing
Token ID33680
Feature activation+2.08e-02

Pos logit contributions
 Caption+0.315
hement+0.296
odynamic+0.294
enegger+0.290
acterial+0.289
Neg logit contributions
 whence-0.268
 Phant-0.267
 har-0.265
 gib-0.263
nos-0.258
Token
Token ID198
Feature activation+8.63e-02

Pos logit contributions
 whence+0.009
 Phant+0.009
 har+0.009
 gib+0.008
nos+0.008
Neg logit contributions
 Caption-0.019
hement-0.018
odynamic-0.018
enegger-0.018
acterial-0.018
Token
Token ID198
Feature activation+8.63e-02

Pos logit contributions
 whence+0.025
 Phant+0.025
 har+0.024
 gib+0.024
nos+0.023
Neg logit contributions
 Caption-0.089
hement-0.086
odynamic-0.085
enegger-0.084
acterial-0.084
Tokenandre
Token ID49078
Feature activation-1.14e-02

Pos logit contributions
 Phant+0.132
 whence+0.132
 har+0.130
 gib+0.130
nos+0.127
Neg logit contributions
 Caption-0.113
odynamic-0.104
hement-0.104
acterial-0.102
enegger-0.101
Tokencre
Token ID7513
Feature activation+6.98e-02

Pos logit contributions
 Caption+0.007
hement+0.007
odynamic+0.007
enegger+0.007
acterial+0.007
Neg logit contributions
 whence-0.008
 Phant-0.007
 har-0.007
 gib-0.007
nos-0.007
Tokenational
Token ID864
Feature activation-6.77e-02

Pos logit contributions
 whence+0.043
 Phant+0.043
 har+0.043
 gib+0.042
nos+0.041
Neg logit contributions
 Caption-0.049
hement-0.046
odynamic-0.046
enegger-0.045
acterial-0.045
Tokensafety
Token ID44708
Feature activation+6.24e-02

Pos logit contributions
 Caption+0.044
hement+0.041
odynamic+0.041
enegger+0.040
acterial+0.040
Neg logit contributions
 whence-0.045
 Phant-0.045
 har-0.045
 gib-0.044
nos-0.044
Token/
Token ID14
Feature activation-7.74e-02

Pos logit contributions
 whence+0.044
 Phant+0.044
 har+0.044
 gib+0.043
nos+0.043
Neg logit contributions
 Caption-0.038
hement-0.036
odynamic-0.035
enegger-0.035
acterial-0.035
Tokenfalls
Token ID23348
Feature activation+7.32e-01

Pos logit contributions
 Caption+0.057
hement+0.054
odynamic+0.053
enegger+0.053
acterial+0.053
Neg logit contributions
 whence-0.045
 Phant-0.045
 har-0.045
 gib-0.044
nos-0.043
Token/
Token ID14
Feature activation-7.70e-02

Pos logit contributions
 whence+0.450
 har+0.449
 Phant+0.448
 gib+0.445
nos+0.437
Neg logit contributions
 Caption-0.514
hement-0.483
odynamic-0.477
enegger-0.474
acterial-0.471
Tokenpub
Token ID12984
Feature activation+3.09e-02

Pos logit contributions
 Caption+0.058
hement+0.055
odynamic+0.054
enegger+0.054
acterial+0.054
Neg logit contributions
 whence-0.044
 Phant-0.044
 har-0.043
 gib-0.043
nos-0.042
Tokens
Token ID82
Feature activation+1.87e-01

Pos logit contributions
 whence+0.012
 Phant+0.012
 har+0.012
 gib+0.011
nos+0.011
Neg logit contributions
 Caption-0.029
hement-0.028
odynamic-0.028
acterial-0.027
enegger-0.027
Token.
Token ID13
Feature activation-8.94e-02

Pos logit contributions
 whence+0.134
 Phant+0.134
 har+0.133
 gib+0.132
nos+0.129
Neg logit contributions
 Caption-0.113
hement-0.105
odynamic-0.104
enegger-0.102
acterial-0.102
Tokenhtml
Token ID6494
Feature activation-2.95e-01

Pos logit contributions
 Caption+0.057
hement+0.053
odynamic+0.052
acterial+0.051
enegger+0.051
Neg logit contributions
 whence-0.062
 Phant-0.061
 har-0.061
 gib-0.060
nos-0.059
Token superb
Token ID21840
Feature activation-6.59e-02

Pos logit contributions
 whence+0.022
 Phant+0.022
 har+0.022
 gib+0.022
 hay+0.021
Neg logit contributions
 Caption-0.026
hement-0.024
odynamic-0.024
enegger-0.024
acterial-0.024
Token neutrality
Token ID20723
Feature activation-1.85e-02

Pos logit contributions
 Caption+0.047
hement+0.044
odynamic+0.044
enegger+0.044
acterial+0.043
Neg logit contributions
 Phant-0.040
 har-0.040
 whence-0.040
 gib-0.039
 hay-0.038
Token,
Token ID11
Feature activation+5.49e-02

Pos logit contributions
 Caption+0.013
hement+0.012
odynamic+0.012
enegger+0.012
acterial+0.012
Neg logit contributions
 har-0.011
 whence-0.011
 Phant-0.011
 gib-0.011
 hay-0.011
Token the
Token ID262
Feature activation-8.55e-03

Pos logit contributions
 whence+0.034
 Phant+0.034
 har+0.034
 gib+0.034
nos+0.033
Neg logit contributions
 Caption-0.038
hement-0.036
odynamic-0.036
enegger-0.035
acterial-0.035
Token Silver
Token ID7698
Feature activation+1.67e-01

Pos logit contributions
 Caption+0.006
hement+0.006
odynamic+0.006
enegger+0.006
acterial+0.006
Neg logit contributions
 Phant-0.005
 har-0.005
 whence-0.005
 gib-0.005
nos-0.005
Token Phant
Token ID38699
Feature activation+7.29e-01

Pos logit contributions
 Phant+0.102
 whence+0.101
 har+0.101
 gib+0.100
nos+0.098
Neg logit contributions
 Caption-0.119
hement-0.112
odynamic-0.111
enegger-0.110
acterial-0.110
Tokenoms
Token ID3150
Feature activation-7.07e-02

Pos logit contributions
 Phant+0.512
 har+0.503
 whence+0.501
 gib+0.495
nos+0.492
Neg logit contributions
 Caption-0.461
hement-0.432
enegger-0.430
odynamic-0.422
utherland-0.415
Token more
Token ID517
Feature activation-5.21e-02

Pos logit contributions
 Caption+0.052
hement+0.049
odynamic+0.049
enegger+0.048
acterial+0.048
Neg logit contributions
 whence-0.041
 Phant-0.041
 har-0.041
 gib-0.041
nos-0.040
Token bass
Token ID12702
Feature activation+1.41e-01

Pos logit contributions
 Caption+0.036
hement+0.034
odynamic+0.034
enegger+0.033
acterial+0.033
Neg logit contributions
 whence-0.032
 Phant-0.032
 har-0.032
 gib-0.032
nos-0.031
Token slam
Token ID21158
Feature activation+3.54e-01

Pos logit contributions
 whence+0.086
 har+0.085
 Phant+0.085
 gib+0.084
 hay+0.082
Neg logit contributions
 Caption-0.101
hement-0.094
odynamic-0.093
enegger-0.093
acterial-0.092
Token and
Token ID290
Feature activation-7.20e-03

Pos logit contributions
 whence+0.223
 Phant+0.222
 har+0.222
 gib+0.219
nos+0.215
Neg logit contributions
 Caption-0.245
hement-0.230
odynamic-0.228
enegger-0.225
acterial-0.225
Token s
Token ID264
Feature activation+1.87e-01

Pos logit contributions
 Caption+0.088
hement+0.083
odynamic+0.082
enegger+0.081
acterial+0.081
Neg logit contributions
 whence-0.066
 Phant-0.066
 har-0.066
 gib-0.065
nos-0.063
Token of
Token ID286
Feature activation+2.31e-01

Pos logit contributions
 whence+0.113
 har+0.112
 Phant+0.112
 gib+0.111
nos+0.109
Neg logit contributions
 Caption-0.134
hement-0.126
odynamic-0.125
enegger-0.124
acterial-0.123
Token the
Token ID262
Feature activation+8.43e-02

Pos logit contributions
 whence+0.135
 Phant+0.134
 har+0.134
 gib+0.132
nos+0.129
Neg logit contributions
 Caption-0.170
hement-0.160
odynamic-0.159
enegger-0.157
acterial-0.156
Token u
Token ID334
Feature activation+7.94e-02

Pos logit contributions
 whence+0.053
 Phant+0.052
 har+0.052
 gib+0.052
nos+0.051
Neg logit contributions
 Caption-0.059
hement-0.055
odynamic-0.055
enegger-0.054
acterial-0.054
Token n
Token ID299
Feature activation+3.37e-01

Pos logit contributions
 whence+0.045
 Phant+0.045
 har+0.044
 gib+0.044
nos+0.043
Neg logit contributions
 Caption-0.060
hement-0.057
odynamic-0.056
enegger-0.055
acterial-0.055
Tokenions
Token ID507
Feature activation+7.22e-01

Pos logit contributions
 whence+0.206
 Phant+0.206
 har+0.205
 gib+0.203
nos+0.199
Neg logit contributions
 Caption-0.239
hement-0.225
odynamic-0.223
enegger-0.220
acterial-0.219
Token,
Token ID11
Feature activation+8.80e-02

Pos logit contributions
 whence+0.440
 har+0.438
 Phant+0.437
 gib+0.432
nos+0.422
Neg logit contributions
 Caption-0.514
hement-0.483
odynamic-0.479
enegger-0.476
acterial-0.473
Tokenand
Token ID392
Feature activation-1.31e-03

Pos logit contributions
 whence+0.052
 Phant+0.052
 har+0.051
 gib+0.051
nos+0.050
Neg logit contributions
 Caption-0.064
hement-0.061
odynamic-0.060
enegger-0.060
acterial-0.059
Token even
Token ID772
Feature activation+1.03e-01

Pos logit contributions
 Caption+0.001
hement+0.001
odynamic+0.001
enegger+0.001
acterial+0.001
Neg logit contributions
 whence-0.001
 Phant-0.001
 har-0.001
 gib-0.001
nos-0.001
Token more
Token ID517
Feature activation+6.08e-03

Pos logit contributions
 whence+0.064
 Phant+0.064
 har+0.064
 gib+0.063
nos+0.062
Neg logit contributions
 Caption-0.072
hement-0.067
odynamic-0.067
enegger-0.066
acterial-0.066
Token so
Token ID523
Feature activation+1.40e-01

Pos logit contributions
 whence+0.003
 Phant+0.003
 har+0.003
 gib+0.003
nos+0.003
Neg logit contributions
 Caption-0.005
hement-0.004
odynamic-0.004
enegger-0.004
acterial-0.004
Token equal
Token ID4961
Feature activation+3.06e-01

Pos logit contributions
 Caption+0.244
hement+0.232
odynamic+0.230
enegger+0.227
acterial+0.226
Neg logit contributions
 whence-0.164
 Phant-0.164
 har-0.163
 gib-0.161
nos-0.157
Token or
Token ID393
Feature activation+9.58e-03

Pos logit contributions
 Phant+0.188
 whence+0.187
 har+0.187
 gib+0.184
 hay+0.179
Neg logit contributions
 Caption-0.217
hement-0.205
odynamic-0.202
enegger-0.201
acterial-0.200
Token exceed
Token ID7074
Feature activation+2.05e-01

Pos logit contributions
 whence+0.004
 Phant+0.004
 har+0.004
 gib+0.004
nos+0.004
Neg logit contributions
 Caption-0.009
hement-0.008
odynamic-0.008
enegger-0.008
acterial-0.008
Token the
Token ID262
Feature activation-5.75e-02

Pos logit contributions
 Phant+0.114
 whence+0.114
 har+0.113
 gib+0.112
 hay+0.108
Neg logit contributions
 Caption-0.156
hement-0.149
odynamic-0.147
enegger-0.146
acterial-0.146
Token Silver
Token ID7698
Feature activation+1.63e-01

Pos logit contributions
 Caption+0.041
hement+0.039
odynamic+0.039
enegger+0.039
acterial+0.038
Neg logit contributions
 Phant-0.035
 har-0.034
 whence-0.034
 gib-0.034
 hay-0.033
Token Phant
Token ID38699
Feature activation+7.22e-01

Pos logit contributions
 whence+0.012
 har+0.011
 Phant+0.011
 gib+0.010
nos+0.008
Neg logit contributions
 Caption-0.203
hement-0.195
odynamic-0.195
enegger-0.193
acterial-0.193
Tokenoms
Token ID3150
Feature activation-6.41e-02

Pos logit contributions
 Phant+0.508
 har+0.502
 whence+0.498
ancies+0.492
nos+0.491
Neg logit contributions
 Caption-0.444
hement-0.434
enegger-0.422
utherland-0.410
odynamic-0.410
Token
Token ID447
Feature activation-9.81e-02

Pos logit contributions
 Caption+0.040
hement+0.037
odynamic+0.037
enegger+0.037
acterial+0.036
Neg logit contributions
 Phant-0.045
 har-0.045
 whence-0.045
 gib-0.044
nos-0.043
Token
Token ID247
Feature activation+8.88e-02

Pos logit contributions
 Caption+0.080
hement+0.078
enegger+0.077
odynamic+0.077
utherland+0.076
Neg logit contributions
 har-0.048
 whence-0.048
 Phant-0.047
 gib-0.047
nos-0.046
Token imaging
Token ID19560
Feature activation-2.52e-01

Pos logit contributions
 whence+0.015
 Phant+0.015
 har+0.015
 gib+0.015
nos+0.014
Neg logit contributions
 Caption-0.102
hement-0.098
odynamic-0.097
enegger-0.097
acterial-0.096
Token and
Token ID290
Feature activation-9.45e-02

Pos logit contributions
 Caption+0.178
hement+0.169
odynamic+0.166
enegger+0.166
acterial+0.164
Neg logit contributions
 har-0.155
 Phant-0.155
 whence-0.155
 gib-0.153
 hay-0.149
Token no
Token ID645
Feature activation+2.53e-01

Pos logit contributions
 Caption+0.009
hement+0.009
odynamic+0.009
enegger+0.009
acterial+0.009
Neg logit contributions
 whence-0.008
 Phant-0.008
 har-0.008
 gib-0.008
nos-0.008
Token one
Token ID530
Feature activation+4.87e-02

Pos logit contributions
 har+0.142
 Phant+0.142
 whence+0.142
 gib+0.141
 hay+0.136
Neg logit contributions
 Caption-0.191
hement-0.182
odynamic-0.179
enegger-0.178
acterial-0.177
Token knows
Token ID4206
Feature activation+9.44e-02

Pos logit contributions
 har+0.034
 whence+0.034
 Phant+0.033
 gib+0.033
 hay+0.033
Neg logit contributions
 Caption-0.031
hement-0.029
odynamic-0.028
enegger-0.028
acterial-0.028
Token whether
Token ID1771
Feature activation+2.04e-01

Pos logit contributions
 whence+0.044
 har+0.043
 Phant+0.043
 gib+0.043
 hay+0.041
Neg logit contributions
 Caption-0.081
hement-0.077
odynamic-0.077
enegger-0.076
acterial-0.076
Token ax
Token ID7877
Feature activation+4.25e-03

Pos logit contributions
 whence+0.111
 Phant+0.111
 har+0.111
 gib+0.109
nos+0.107
Neg logit contributions
 Caption-0.158
hement-0.150
odynamic-0.149
enegger-0.147
acterial-0.147
Tokenions
Token ID507
Feature activation+7.01e-01

Pos logit contributions
 har+0.001
 whence+0.001
 gib+0.001
nos+0.001
 Phant+0.001
Neg logit contributions
 Caption-0.005
hement-0.004
enegger-0.004
-0.004
odynamic-0.004
Token even
Token ID772
Feature activation+1.94e-01

Pos logit contributions
 har+0.526
 whence+0.526
 Phant+0.519
 gib+0.517
 hay+0.507
Neg logit contributions
 Caption-0.401
hement-0.377
enegger-0.367
odynamic-0.366
acterial-0.364
Token exist
Token ID2152
Feature activation+3.54e-01

Pos logit contributions
 whence+0.096
 har+0.096
 Phant+0.096
 gib+0.095
 hay+0.092
Neg logit contributions
 Caption-0.159
hement-0.152
odynamic-0.150
enegger-0.149
acterial-0.148
Token.
Token ID13
Feature activation+1.83e-02

Pos logit contributions
 whence+0.235
 har+0.234
 Phant+0.233
 gib+0.232
 hay+0.225
Neg logit contributions
 Caption-0.233
hement-0.220
odynamic-0.215
acterial-0.213
enegger-0.213
Token It
Token ID632
Feature activation-3.36e-02

Pos logit contributions
 Phant+0.008
 whence+0.008
 har+0.008
 gib+0.008
nos+0.008
Neg logit contributions
 Caption-0.016
hement-0.015
odynamic-0.015
enegger-0.015
acterial-0.015
Token's
Token ID338
Feature activation+1.03e-01

Pos logit contributions
 Caption+0.023
hement+0.022
odynamic+0.022
enegger+0.021
acterial+0.021
Neg logit contributions
 har-0.021
 whence-0.021
 Phant-0.021
 gib-0.021
nos-0.020
Token of
Token ID286
Feature activation+6.48e-02

Pos logit contributions
 Caption+0.258
hement+0.246
odynamic+0.244
enegger+0.241
acterial+0.240
Neg logit contributions
 whence-0.134
 Phant-0.132
 har-0.132
 gib-0.131
nos-0.129
Token Dev
Token ID6245
Feature activation+4.57e-02

Pos logit contributions
 Phant+0.039
 whence+0.039
 har+0.039
 gib+0.038
nos+0.038
Neg logit contributions
 Caption-0.046
hement-0.044
odynamic-0.043
enegger-0.043
acterial-0.043
Tokenial
Token ID498
Feature activation+1.46e-01

Pos logit contributions
 har+0.024
 Phant+0.024
 whence+0.024
 gib+0.023
nos+0.023
Neg logit contributions
 Caption-0.036
hement-0.035
odynamic-0.034
enegger-0.034
-0.034
Tokenet
Token ID316
Feature activation-6.34e-02

Pos logit contributions
 whence+0.074
 Phant+0.073
 har+0.073
 gib+0.072
nos+0.070
Neg logit contributions
 Caption-0.118
hement-0.112
odynamic-0.112
enegger-0.110
acterial-0.110
Token Silver
Token ID7698
Feature activation+1.52e-01

Pos logit contributions
 Caption+0.041
hement+0.038
odynamic+0.038
enegger+0.037
acterial+0.037
Neg logit contributions
 Phant-0.043
 whence-0.043
 har-0.043
 gib-0.042
nos-0.042
Token Phant
Token ID38699
Feature activation+6.98e-01

Pos logit contributions
 whence+0.014
 Phant+0.014
 har+0.013
 gib+0.012
nos+0.011
Neg logit contributions
 Caption-0.186
hement-0.180
odynamic-0.179
enegger-0.178
acterial-0.178
Tokenoms
Token ID3150
Feature activation-1.29e-01

Pos logit contributions
 Phant+0.495
 har+0.491
 whence+0.485
nos+0.481
ancies+0.481
Neg logit contributions
 Caption-0.428
hement-0.411
enegger-0.402
odynamic-0.392
utherland-0.391
Token,
Token ID11
Feature activation-2.87e-02

Pos logit contributions
 Caption+0.079
hement+0.075
odynamic+0.074
enegger+0.072
acterial+0.072
Neg logit contributions
 har-0.091
 Phant-0.091
 whence-0.091
 gib-0.090
 hay-0.088
Token with
Token ID351
Feature activation-9.40e-02

Pos logit contributions
 Caption+0.021
hement+0.020
odynamic+0.020
enegger+0.020
acterial+0.020
Neg logit contributions
 Phant-0.017
 whence-0.017
 har-0.017
 gib-0.016
nos-0.016
Token Dial
Token ID21269
Feature activation+1.53e-01

Pos logit contributions
 Caption+0.073
hement+0.069
odynamic+0.069
enegger+0.068
acterial+0.068
Neg logit contributions
 whence-0.051
 Phant-0.051
 har-0.050
 gib-0.050
nos-0.049
Tokenog
Token ID519
Feature activation+2.67e-01

Pos logit contributions
 Phant+0.095
 whence+0.094
 har+0.094
 gib+0.093
nos+0.091
Neg logit contributions
 Caption-0.106
hement-0.101
odynamic-0.099
enegger-0.099
acterial-0.098
Token/
Token ID14
Feature activation-1.63e-01

Pos logit contributions
 whence+0.175
 Phant+0.175
 har+0.174
 gib+0.172
nos+0.169
Neg logit contributions
 Caption-0.208
hement-0.198
odynamic-0.195
enegger-0.194
acterial-0.192
TokenDeb
Token ID16587
Feature activation+1.56e-01

Pos logit contributions
 Caption+0.103
hement+0.097
odynamic+0.096
enegger+0.095
acterial+0.094
Neg logit contributions
 whence-0.111
 Phant-0.111
 har-0.111
 gib-0.109
nos-0.108
Tokenate
Token ID378
Feature activation-1.29e-01

Pos logit contributions
 whence+0.105
 Phant+0.104
 har+0.104
 gib+0.103
nos+0.102
Neg logit contributions
 Caption-0.101
hement-0.095
odynamic-0.094
enegger-0.093
acterial-0.092
Token Op
Token ID8670
Feature activation+1.43e-01

Pos logit contributions
 Caption+0.097
hement+0.093
odynamic+0.092
enegger+0.091
acterial+0.090
Neg logit contributions
 Phant-0.072
 whence-0.072
 har-0.072
 gib-0.071
nos-0.069
Tokenin
Token ID259
Feature activation-2.36e-01

Pos logit contributions
 Phant+0.123
 whence+0.123
 har+0.123
 gib+0.121
nos+0.121
Neg logit contributions
 Caption-0.065
hement-0.060
odynamic-0.059
enegger-0.058
acterial-0.057
Tokenions
Token ID507
Feature activation+6.93e-01

Pos logit contributions
 Caption+0.282
hement+0.272
odynamic+0.270
enegger+0.269
acterial+0.268
Neg logit contributions
 whence-0.029
 Phant-0.029
 har-0.029
 gib-0.027
nos-0.025
Token expressed
Token ID6241
Feature activation+7.33e-03

Pos logit contributions
 whence+0.396
 Phant+0.396
 har+0.394
 gib+0.390
nos+0.381
Neg logit contributions
 Caption-0.516
hement-0.490
odynamic-0.485
enegger-0.483
acterial-0.478
Token in
Token ID287
Feature activation-1.45e-01

Pos logit contributions
 har+0.005
 whence+0.005
 Phant+0.005
 gib+0.005
nos+0.005
Neg logit contributions
 Caption-0.005
hement-0.004
odynamic-0.004
enegger-0.004
acterial-0.004
Token USA
Token ID4916
Feature activation-5.81e-03

Pos logit contributions
 Caption+0.105
hement+0.100
odynamic+0.099
enegger+0.098
acterial+0.097
Neg logit contributions
 Phant-0.085
 whence-0.085
 har-0.084
 gib-0.084
nos-0.082
Token TODAY
Token ID20092
Feature activation+1.49e-01

Pos logit contributions
 Caption+0.004
hement+0.004
enegger+0.004
odynamic+0.004
acterial+0.004
Neg logit contributions
 whence-0.004
 Phant-0.004
 har-0.004
 gib-0.004
nos-0.003
Token's
Token ID338
Feature activation+3.83e-02

Pos logit contributions
 whence+0.088
 Phant+0.088
 har+0.088
 gib+0.087
nos+0.085
Neg logit contributions
 Caption-0.108
hement-0.101
odynamic-0.101
enegger-0.099
acterial-0.099
Token organizations
Token ID5745
Feature activation+1.54e-01

Pos logit contributions
 Caption+0.058
hement+0.054
odynamic+0.054
enegger+0.053
acterial+0.053
Neg logit contributions
 Phant-0.054
 har-0.054
 whence-0.054
 gib-0.053
nos-0.052
Token to
Token ID284
Feature activation+1.17e-02

Pos logit contributions
 whence+0.091
 Phant+0.091
 har+0.091
 gib+0.089
nos+0.087
Neg logit contributions
 Caption-0.112
hement-0.106
odynamic-0.105
enegger-0.104
acterial-0.104
Token offer
Token ID2897
Feature activation-3.27e-02

Pos logit contributions
 whence+0.007
 har+0.007
 Phant+0.007
 gib+0.007
nos+0.006
Neg logit contributions
 Caption-0.009
hement-0.008
odynamic-0.008
enegger-0.008
acterial-0.008
Token HR
Token ID15172
Feature activation-5.75e-02

Pos logit contributions
 Caption+0.024
hement+0.023
odynamic+0.023
enegger+0.023
acterial+0.023
Neg logit contributions
 whence-0.019
 Phant-0.019
 har-0.019
 gib-0.019
nos-0.018
Token intern
Token ID1788
Feature activation+5.37e-02

Pos logit contributions
 Caption+0.041
hement+0.039
odynamic+0.038
enegger+0.038
acterial+0.038
Neg logit contributions
 har-0.035
 Phant-0.035
 whence-0.035
 gib-0.034
nos-0.034
Tokenships
Token ID26313
Feature activation+6.85e-01

Pos logit contributions
 whence+0.014
 Phant+0.014
 har+0.014
 gib+0.013
nos+0.013
Neg logit contributions
 Caption-0.057
hement-0.055
odynamic-0.054
enegger-0.054
acterial-0.054
Token,
Token ID11
Feature activation+5.42e-03

Pos logit contributions
 whence+0.439
 har+0.437
 Phant+0.436
 gib+0.430
 hay+0.419
Neg logit contributions
 Caption-0.464
hement-0.440
odynamic-0.433
enegger-0.428
acterial-0.428
Token fund
Token ID1814
Feature activation+1.37e-01

Pos logit contributions
 whence+0.003
 Phant+0.003
 har+0.003
 gib+0.003
nos+0.003
Neg logit contributions
 Caption-0.004
hement-0.004
odynamic-0.004
enegger-0.004
acterial-0.004
Token studies
Token ID3640
Feature activation+1.66e-01

Pos logit contributions
 whence+0.082
 har+0.082
 Phant+0.082
 gib+0.080
nos+0.079
Neg logit contributions
 Caption-0.098
hement-0.092
odynamic-0.092
enegger-0.091
acterial-0.090
Token abroad
Token ID10522
Feature activation+3.43e-01

Pos logit contributions
 whence+0.106
 Phant+0.106
 har+0.106
 gib+0.105
nos+0.103
Neg logit contributions
 Caption-0.112
hement-0.105
odynamic-0.104
enegger-0.103
acterial-0.103
Token,
Token ID11
Feature activation+1.92e-02

Pos logit contributions
 whence+0.217
 Phant+0.216
 har+0.215
 gib+0.213
nos+0.208
Neg logit contributions
 Caption-0.237
hement-0.223
odynamic-0.220
enegger-0.218
acterial-0.217
Token over
Token ID625
Feature activation-1.28e-01

Pos logit contributions
 Phant+0.094
 har+0.094
 whence+0.094
 gib+0.092
nos+0.090
Neg logit contributions
 Caption-0.118
hement-0.112
odynamic-0.111
enegger-0.109
acterial-0.109
Token which
Token ID543
Feature activation+1.93e-01

Pos logit contributions
 Caption+0.096
hement+0.090
odynamic+0.089
enegger+0.089
acterial+0.088
Neg logit contributions
 Phant-0.072
 whence-0.072
 har-0.071
 gib-0.071
nos-0.070
Token to
Token ID284
Feature activation+3.15e-02

Pos logit contributions
 Phant+0.117
 har+0.117
 whence+0.117
 gib+0.115
nos+0.112
Neg logit contributions
 Caption-0.137
hement-0.130
odynamic-0.128
enegger-0.128
acterial-0.126
Token sound
Token ID2128
Feature activation+1.61e-01

Pos logit contributions
 whence+0.017
 Phant+0.017
 har+0.016
 gib+0.016
nos+0.016
Neg logit contributions
 Caption-0.025
hement-0.024
odynamic-0.023
enegger-0.023
acterial-0.023
Token alarm
Token ID10436
Feature activation+3.27e-01

Pos logit contributions
 har+0.102
 whence+0.101
 Phant+0.101
 gib+0.100
 hay+0.098
Neg logit contributions
 Caption-0.111
hement-0.105
odynamic-0.104
enegger-0.103
acterial-0.102
Token bells
Token ID30987
Feature activation+6.83e-01

Pos logit contributions
 whence+0.110
 Phant+0.109
 har+0.109
 gib+0.107
nos+0.102
Neg logit contributions
 Caption-0.322
hement-0.309
odynamic-0.307
enegger-0.304
acterial-0.303
Token.
Token ID13
Feature activation+4.22e-02

Pos logit contributions
 whence+0.428
 har+0.427
 Phant+0.425
 gib+0.422
 hay+0.411
Neg logit contributions
 Caption-0.474
hement-0.446
odynamic-0.442
enegger-0.436
acterial-0.435
Token This
Token ID770
Feature activation+6.83e-02

Pos logit contributions
 whence+0.018
 Phant+0.018
 har+0.018
 gib+0.018
nos+0.018
Neg logit contributions
 Caption-0.037
hement-0.035
odynamic-0.035
enegger-0.035
acterial-0.035
Token is
Token ID318
Feature activation-1.39e-02

Pos logit contributions
 Phant+0.049
 har+0.049
 whence+0.049
 gib+0.048
 hay+0.047
Neg logit contributions
 Caption-0.041
hement-0.039
odynamic-0.038
enegger-0.038
acterial-0.038
Token less
Token ID1342
Feature activation+1.17e-01

Pos logit contributions
 Caption+0.011
hement+0.011
odynamic+0.010
enegger+0.010
acterial+0.010
Neg logit contributions
 whence-0.007
 Phant-0.007
 har-0.007
 gib-0.007
nos-0.007
Token about
Token ID546
Feature activation+1.23e-01

Pos logit contributions
 whence+0.061
 Phant+0.060
 har+0.060
 gib+0.059
nos+0.058
Neg logit contributions
 Caption-0.094
hement-0.088
odynamic-0.088
acterial-0.086
enegger-0.086
Token resident
Token ID6623
Feature activation-5.09e-02

Pos logit contributions
 whence+0.096
 Phant+0.096
 har+0.096
 gib+0.095
nos+0.092
Neg logit contributions
 Caption-0.119
hement-0.112
odynamic-0.112
enegger-0.110
acterial-0.110
Token as
Token ID355
Feature activation-3.40e-02

Pos logit contributions
 Caption+0.036
hement+0.034
odynamic+0.033
enegger+0.033
acterial+0.033
Neg logit contributions
 whence-0.031
 Phant-0.031
 har-0.031
 gib-0.031
nos-0.030
Token opposed
Token ID6886
Feature activation+2.12e-01

Pos logit contributions
 Caption+0.026
hement+0.025
odynamic+0.025
enegger+0.025
acterial+0.025
Neg logit contributions
 whence-0.018
 Phant-0.018
 har-0.018
 gib-0.018
nos-0.018
Token to
Token ID284
Feature activation+9.52e-02

Pos logit contributions
 har+0.123
 whence+0.123
 Phant+0.122
 gib+0.120
nos+0.118
Neg logit contributions
 Caption-0.157
hement-0.151
odynamic-0.149
enegger-0.148
acterial-0.147
Token the
Token ID262
Feature activation+4.92e-02

Pos logit contributions
 whence+0.051
 Phant+0.051
 har+0.051
 gib+0.050
nos+0.049
Neg logit contributions
 Caption-0.074
hement-0.070
odynamic-0.070
enegger-0.069
acterial-0.069
Token revolving
Token ID39350
Feature activation+6.80e-01

Pos logit contributions
 whence+0.029
 Phant+0.029
 har+0.029
 gib+0.029
 hay+0.028
Neg logit contributions
 Caption-0.036
hement-0.034
odynamic-0.033
enegger-0.033
acterial-0.033
Token door
Token ID3420
Feature activation+2.70e-01

Pos logit contributions
 whence+0.364
 Phant+0.364
 har+0.359
 gib+0.356
nos+0.351
Neg logit contributions
 Caption-0.533
hement-0.502
odynamic-0.499
enegger-0.493
acterial-0.492
Token of
Token ID286
Feature activation+2.40e-01

Pos logit contributions
 whence+0.132
 Phant+0.132
 har+0.131
 gib+0.129
nos+0.126
Neg logit contributions
 Caption-0.224
hement-0.213
odynamic-0.211
enegger-0.209
acterial-0.208
Token previous
Token ID2180
Feature activation-7.71e-03

Pos logit contributions
 whence+0.141
 Phant+0.140
 har+0.140
 gib+0.138
nos+0.135
Neg logit contributions
 Caption-0.175
hement-0.166
odynamic-0.164
enegger-0.162
acterial-0.162
Token restaurants
Token ID10808
Feature activation+1.96e-01

Pos logit contributions
 Caption+0.005
hement+0.005
odynamic+0.005
enegger+0.005
acterial+0.005
Neg logit contributions
 whence-0.005
 Phant-0.005
 har-0.005
 gib-0.005
nos-0.005
Token.
Token ID13
Feature activation+4.09e-02

Pos logit contributions
 whence+0.118
 Phant+0.117
 har+0.117
 gib+0.115
nos+0.112
Neg logit contributions
 Caption-0.142
hement-0.134
odynamic-0.133
enegger-0.132
acterial-0.131
Token in
Token ID287
Feature activation-2.50e-01

Pos logit contributions
 Caption+0.030
hement+0.029
odynamic+0.029
enegger+0.028
acterial+0.028
Neg logit contributions
 Phant-0.024
 har-0.024
 whence-0.024
 gib-0.024
 hay-0.023
Token the
Token ID262
Feature activation-2.54e-02

Pos logit contributions
 Caption+0.178
hement+0.167
odynamic+0.166
enegger+0.164
acterial+0.164
Neg logit contributions
 whence-0.151
 Phant-0.151
 har-0.150
 gib-0.149
nos-0.146
Token mission
Token ID4365
Feature activation-1.68e-01

Pos logit contributions
 Caption+0.017
hement+0.016
odynamic+0.015
enegger+0.015
utherland+0.015
Neg logit contributions
 Phant-0.017
 har-0.017
 whence-0.017
 gib-0.017
 hay-0.016
Token,
Token ID11
Feature activation-5.93e-02

Pos logit contributions
 Caption+0.116
hement+0.109
odynamic+0.108
enegger+0.107
acterial+0.106
Neg logit contributions
 whence-0.105
 Phant-0.105
 har-0.105
 gib-0.104
nos-0.101
Token material
Token ID2587
Feature activation-1.99e-01

Pos logit contributions
 Caption+0.043
hement+0.041
enegger+0.040
odynamic+0.040
acterial+0.040
Neg logit contributions
 Phant-0.035
 har-0.035
 whence-0.035
 gib-0.035
nos-0.034
Token ejected
Token ID38632
Feature activation-8.94e-01

Pos logit contributions
 Caption+0.135
hement+0.127
odynamic+0.126
enegger+0.125
acterial+0.124
Neg logit contributions
 whence-0.127
 har-0.127
 Phant-0.127
 gib-0.126
nos-0.123
Token during
Token ID1141
Feature activation-3.02e-01

Pos logit contributions
 Caption+0.623
hement+0.592
odynamic+0.586
enegger+0.582
acterial+0.577
Neg logit contributions
 whence-0.558
 har-0.555
 Phant-0.554
 gib-0.549
 hay-0.536
Token launch
Token ID4219
Feature activation-2.53e-01

Pos logit contributions
 Caption+0.218
hement+0.206
odynamic+0.205
enegger+0.204
acterial+0.201
Neg logit contributions
 Phant-0.181
 har-0.180
 whence-0.180
 gib-0.178
 hay-0.173
Token vehicle
Token ID4038
Feature activation+1.05e-02

Pos logit contributions
 Caption+0.161
hement+0.151
odynamic+0.150
enegger+0.149
acterial+0.147
Neg logit contributions
 whence-0.172
 har-0.172
 Phant-0.172
 gib-0.170
 hay-0.166
Token separation
Token ID14139
Feature activation-3.77e-01

Pos logit contributions
 har+0.007
 whence+0.007
 Phant+0.007
 gib+0.007
 hay+0.007
Neg logit contributions
 Caption-0.007
enegger-0.007
hement-0.007
odynamic-0.007
acterial-0.006
Token caused
Token ID4073
Feature activation-1.57e-01

Pos logit contributions
 Caption+0.243
hement+0.229
odynamic+0.227
enegger+0.226
acterial+0.224
Neg logit contributions
 whence-0.254
 har-0.253
 Phant-0.253
 gib-0.251
 hay-0.245
Token after
Token ID706
Feature activation+1.40e-02

Pos logit contributions
 whence+0.046
 har+0.046
 Phant+0.046
 gib+0.045
nos+0.044
Neg logit contributions
 Caption-0.046
hement-0.043
odynamic-0.042
enegger-0.042
acterial-0.042
Token he
Token ID339
Feature activation-1.35e-02

Pos logit contributions
 whence+0.008
 Phant+0.007
 har+0.007
 gib+0.007
nos+0.007
Neg logit contributions
 Caption-0.011
hement-0.010
odynamic-0.010
enegger-0.010
acterial-0.010
Token was
Token ID373
Feature activation-2.36e-02

Pos logit contributions
 Caption+0.009
hement+0.008
odynamic+0.008
enegger+0.008
acterial+0.008
Neg logit contributions
 whence-0.009
 har-0.009
 Phant-0.009
 gib-0.009
nos-0.009
Token caught
Token ID4978
Feature activation-2.53e-01

Pos logit contributions
 Caption+0.017
hement+0.016
odynamic+0.016
enegger+0.015
acterial+0.015
Neg logit contributions
 whence-0.014
 har-0.014
 Phant-0.014
 gib-0.014
nos-0.014
Token shop
Token ID6128
Feature activation-1.18e-01

Pos logit contributions
 Caption+0.177
hement+0.169
odynamic+0.167
enegger+0.165
utherland+0.164
Neg logit contributions
 har-0.158
 whence-0.157
 Phant-0.156
 gib-0.155
 hay-0.152
Tokenlifting
Token ID30510
Feature activation-8.82e-01

Pos logit contributions
 Caption+0.032
hement+0.027
odynamic+0.026
enegger+0.025
acterial+0.025
Neg logit contributions
 whence-0.124
 har-0.124
 Phant-0.124
 gib-0.123
nos-0.122
Token a
Token ID257
Feature activation-9.36e-02

Pos logit contributions
 Caption+0.601
hement+0.565
odynamic+0.560
enegger+0.552
acterial+0.551
Neg logit contributions
 whence-0.561
 Phant-0.560
 har-0.557
 gib-0.552
nos-0.542
Token box
Token ID3091
Feature activation-8.83e-02

Pos logit contributions
 Caption+0.069
hement+0.065
odynamic+0.064
enegger+0.063
acterial+0.063
Neg logit contributions
 whence-0.055
 Phant-0.055
 har-0.054
 gib-0.054
nos-0.053
Token of
Token ID286
Feature activation-4.65e-02

Pos logit contributions
 Caption+0.076
hement+0.072
odynamic+0.072
enegger+0.071
acterial+0.071
Neg logit contributions
 whence-0.041
 Phant-0.041
 har-0.040
 gib-0.040
nos-0.039
Token boots
Token ID14412
Feature activation-1.35e-01

Pos logit contributions
 Caption+0.035
hement+0.033
odynamic+0.033
enegger+0.033
acterial+0.033
Neg logit contributions
 whence-0.026
 Phant-0.026
 har-0.026
 gib-0.026
nos-0.025
Token and
Token ID290
Feature activation-1.58e-01

Pos logit contributions
 Caption+0.091
hement+0.086
odynamic+0.086
enegger+0.085
utherland+0.084
Neg logit contributions
 whence-0.086
 har-0.086
 Phant-0.086
 gib-0.085
 hay-0.083
Token likely
Token ID1884
Feature activation+7.38e-02

Pos logit contributions
 Caption+0.074
hement+0.069
odynamic+0.069
enegger+0.068
acterial+0.068
Neg logit contributions
 whence-0.067
 Phant-0.067
 har-0.067
 gib-0.066
nos-0.065
Token to
Token ID284
Feature activation-3.74e-02

Pos logit contributions
 whence+0.046
 Phant+0.046
 har+0.045
 gib+0.045
nos+0.044
Neg logit contributions
 Caption-0.051
odynamic-0.048
hement-0.048
enegger-0.047
acterial-0.047
Token engage
Token ID8209
Feature activation-2.78e-02

Pos logit contributions
 Caption+0.031
hement+0.029
odynamic+0.029
enegger+0.029
acterial+0.029
Neg logit contributions
 whence-0.019
 Phant-0.019
 har-0.019
 gib-0.018
nos-0.018
Token in
Token ID287
Feature activation-1.59e-01

Pos logit contributions
 Caption+0.015
hement+0.014
odynamic+0.014
acterial+0.013
enegger+0.013
Neg logit contributions
 har-0.022
 whence-0.022
 Phant-0.022
 gib-0.022
 hay-0.021
Token shop
Token ID6128
Feature activation-1.51e-01

Pos logit contributions
 Caption+0.110
hement+0.104
odynamic+0.103
enegger+0.102
acterial+0.101
Neg logit contributions
 har-0.100
 whence-0.100
 Phant-0.100
 gib-0.099
 hay-0.096
Tokenlifting
Token ID30510
Feature activation-8.77e-01

Pos logit contributions
 Caption+0.044
hement+0.039
odynamic+0.038
enegger+0.037
acterial+0.036
Neg logit contributions
 har-0.154
 Phant-0.154
 whence-0.154
 gib-0.153
nos-0.151
Token or
Token ID393
Feature activation+2.21e-03

Pos logit contributions
 Caption+0.648
hement+0.614
odynamic+0.608
enegger+0.600
acterial+0.599
Neg logit contributions
 whence-0.510
 har-0.508
 Phant-0.507
 gib-0.501
nos-0.487
Token other
Token ID584
Feature activation-2.22e-01

Pos logit contributions
 har+0.001
 whence+0.001
 Phant+0.001
 gib+0.001
 hay+0.001
Neg logit contributions
 Caption-0.002
hement-0.002
odynamic-0.002
enegger-0.002
acterial-0.001
Token petty
Token ID25229
Feature activation-2.32e-02

Pos logit contributions
 Caption+0.163
hement+0.155
odynamic+0.153
enegger+0.152
acterial+0.151
Neg logit contributions
 har-0.131
 whence-0.130
 Phant-0.130
 gib-0.129
nos-0.125
Token crimes
Token ID6741
Feature activation-4.53e-02

Pos logit contributions
 Caption+0.017
hement+0.016
odynamic+0.016
enegger+0.016
acterial+0.016
Neg logit contributions
 har-0.013
 Phant-0.013
 whence-0.013
 gib-0.013
 hay-0.013
Token.
Token ID13
Feature activation-1.10e-01

Pos logit contributions
 Caption+0.033
hement+0.031
odynamic+0.030
enegger+0.030
acterial+0.030
Neg logit contributions
 whence-0.027
 Phant-0.027
 har-0.027
 gib-0.027
nos-0.026
Token fluffy
Token ID39145
Feature activation-2.69e-01

Pos logit contributions
 Caption+0.041
hement+0.039
odynamic+0.039
enegger+0.038
acterial+0.038
Neg logit contributions
 whence-0.035
 Phant-0.035
 har-0.034
 gib-0.034
nos-0.033
Token creatures
Token ID8109
Feature activation+7.41e-02

Pos logit contributions
 Caption+0.207
hement+0.196
odynamic+0.194
enegger+0.192
acterial+0.191
Neg logit contributions
 whence-0.147
 Phant-0.147
 har-0.145
 gib-0.144
nos-0.141
Token.
Token ID13
Feature activation-1.60e-01

Pos logit contributions
 whence+0.048
 Phant+0.048
 har+0.048
 gib+0.048
nos+0.047
Neg logit contributions
 Caption-0.050
hement-0.046
odynamic-0.046
enegger-0.045
acterial-0.045
Token
Token ID198
Feature activation-7.21e-02

Pos logit contributions
 Caption+0.147
hement+0.139
odynamic+0.138
enegger+0.137
acterial+0.136
Neg logit contributions
 whence-0.065
 Phant-0.064
 har-0.064
 gib-0.063
nos-0.061
Token
Token ID198
Feature activation-6.25e-02

Pos logit contributions
 Caption+0.074
hement+0.071
odynamic+0.070
enegger+0.070
acterial+0.070
Neg logit contributions
 whence-0.021
 Phant-0.021
 har-0.021
 gib-0.020
nos-0.020
TokenHug
Token ID48098
Feature activation-8.53e-01

Pos logit contributions
 Caption+0.038
hement+0.035
odynamic+0.035
enegger+0.034
acterial+0.034
Neg logit contributions
 whence-0.045
 Phant-0.045
 har-0.045
 gib-0.044
nos-0.043
Token your
Token ID534
Feature activation+6.95e-02

Pos logit contributions
 Caption+0.671
hement+0.638
odynamic+0.632
enegger+0.626
acterial+0.625
Neg logit contributions
 whence-0.453
 har-0.452
 Phant-0.452
 gib-0.448
 hay-0.435
Token friends
Token ID2460
Feature activation-4.43e-02

Pos logit contributions
 har+0.041
 Phant+0.041
 whence+0.041
 gib+0.040
 hay+0.039
Neg logit contributions
 Caption-0.051
hement-0.048
odynamic-0.048
enegger-0.047
acterial-0.047
Token.
Token ID13
Feature activation-1.69e-01

Pos logit contributions
 Caption+0.030
hement+0.029
odynamic+0.028
enegger+0.028
acterial+0.028
Neg logit contributions
 whence-0.029
 har-0.028
 Phant-0.028
 gib-0.028
 hay-0.028
Token
Token ID198
Feature activation-7.61e-02

Pos logit contributions
 Caption+0.149
hement+0.141
odynamic+0.140
enegger+0.139
acterial+0.139
Neg logit contributions
 whence-0.074
 Phant-0.074
 har-0.074
 gib-0.073
nos-0.071
Token
Token ID198
Feature activation-6.27e-02

Pos logit contributions
 Caption+0.078
hement+0.075
odynamic+0.074
enegger+0.074
acterial+0.073
Neg logit contributions
 whence-0.022
 Phant-0.022
 har-0.022
 gib-0.022
nos-0.021
Token sent
Token ID1908
Feature activation-1.80e-01

Pos logit contributions
 whence+0.001
 Phant+0.001
 har+0.001
 gib+0.001
nos+0.001
Neg logit contributions
 Caption-0.001
hement-0.001
odynamic-0.001
enegger-0.001
acterial-0.001
Token a
Token ID257
Feature activation-4.00e-02

Pos logit contributions
 Caption+0.141
hement+0.134
odynamic+0.133
enegger+0.131
acterial+0.131
Neg logit contributions
 whence-0.096
 har-0.096
 Phant-0.096
 gib-0.095
nos-0.092
Token cor
Token ID1162
Feature activation-1.08e-01

Pos logit contributions
 Caption+0.028
hement+0.026
odynamic+0.026
enegger+0.026
acterial+0.026
Neg logit contributions
 har-0.025
 whence-0.025
 Phant-0.025
 gib-0.025
 hay-0.024
Tokenonal
Token ID20996
Feature activation-3.65e-01

Pos logit contributions
 Caption+0.062
hement+0.058
enegger+0.057
odynamic+0.056
acterial+0.056
Neg logit contributions
 whence-0.080
 har-0.079
 Phant-0.079
 gib-0.079
nos-0.078
Token mass
Token ID2347
Feature activation-4.40e-01

Pos logit contributions
 Caption+0.239
hement+0.226
enegger+0.225
odynamic+0.220
inical+0.218
Neg logit contributions
 har-0.246
 whence-0.244
 gib-0.242
 Phant-0.241
 hay-0.235
Token eject
Token ID22189
Feature activation-8.39e-01

Pos logit contributions
 Caption+0.117
hement+0.099
odynamic+0.095
enegger+0.094
acterial+0.092
Neg logit contributions
 har-0.467
 whence-0.466
 gib-0.463
 Phant-0.462
 hay-0.454
Tokenion
Token ID295
Feature activation+6.38e-03

Pos logit contributions
 Caption+0.681
hement+0.657
enegger+0.647
odynamic+0.644
acterial+0.636
Neg logit contributions
 whence-0.429
 har-0.428
 Phant-0.421
 gib-0.417
 hay-0.402
Token (
Token ID357
Feature activation-5.92e-02

Pos logit contributions
 whence+0.004
 har+0.004
 gib+0.004
 Phant+0.004
 hay+0.004
Neg logit contributions
 Caption-0.005
hement-0.004
odynamic-0.004
enegger-0.004
acterial-0.004
Tokencharged
Token ID17200
Feature activation+1.56e-01

Pos logit contributions
 Caption+0.030
enegger+0.028
hement+0.028
odynamic+0.027
utherland+0.027
Neg logit contributions
 whence-0.048
 har-0.047
 Phant-0.047
 gib-0.047
ding-0.046
Token particles
Token ID13166
Feature activation+1.25e-01

Pos logit contributions
 har+0.098
 whence+0.097
 gib+0.097
 Phant+0.096
 hay+0.094
Neg logit contributions
 Caption-0.108
hement-0.102
enegger-0.101
odynamic-0.101
inical-0.100
Token)
Token ID8
Feature activation-3.76e-01

Pos logit contributions
 whence+0.076
 har+0.076
 gib+0.075
 Phant+0.075
 hay+0.073
Neg logit contributions
 Caption-0.089
hement-0.084
odynamic-0.083
enegger-0.083
acterial-0.082
Token
Token ID251
Feature activation-3.60e-01

Pos logit contributions
 Caption+0.016
hement+0.014
odynamic+0.014
enegger+0.014
acterial+0.014
Neg logit contributions
 whence-0.047
 Phant-0.047
 har-0.047
 gib-0.047
nos-0.046
Token and
Token ID290
Feature activation+5.36e-02

Pos logit contributions
 Caption+0.242
hement+0.226
odynamic+0.225
enegger+0.221
acterial+0.221
Neg logit contributions
 whence-0.233
 Phant-0.233
 har-0.232
 gib-0.230
nos-0.225
Token
Token ID564
Feature activation+2.47e-01

Pos logit contributions
 whence+0.030
 Phant+0.030
 har+0.030
 gib+0.030
nos+0.029
Neg logit contributions
 Caption-0.040
hement-0.038
odynamic-0.038
enegger-0.037
acterial-0.037
Token
Token ID250
Feature activation+9.83e-03

Pos logit contributions
 whence+0.182
 Phant+0.182
 har+0.181
 gib+0.179
nos+0.177
Neg logit contributions
 Caption-0.143
hement-0.133
odynamic-0.132
enegger-0.129
acterial-0.129
Tokenactive
Token ID5275
Feature activation+1.79e-01

Pos logit contributions
 whence+0.006
 Phant+0.006
 har+0.006
 gib+0.006
nos+0.006
Neg logit contributions
 Caption-0.007
hement-0.007
odynamic-0.007
enegger-0.007
acterial-0.007
Token bystand
Token ID33326
Feature activation-8.15e-01

Pos logit contributions
 har+0.112
 Phant+0.112
 whence+0.112
 gib+0.110
nos+0.108
Neg logit contributions
 Caption-0.124
hement-0.117
odynamic-0.116
enegger-0.115
acterial-0.114
Tokener
Token ID263
Feature activation-4.76e-01

Pos logit contributions
 Caption+0.527
enegger+0.494
odynamic+0.490
hement+0.487
utherland+0.482
Neg logit contributions
 whence-0.550
 har-0.547
 Phant-0.542
 gib-0.535
nos-0.529
Token training
Token ID3047
Feature activation-8.66e-02

Pos logit contributions
 Caption+0.268
hement+0.250
odynamic+0.246
enegger+0.244
acterial+0.241
Neg logit contributions
 whence-0.359
 har-0.359
 Phant-0.358
 gib-0.355
nos-0.349
Token
Token ID447
Feature activation-4.78e-02

Pos logit contributions
 Caption+0.056
hement+0.053
odynamic+0.052
enegger+0.052
acterial+0.051
Neg logit contributions
 whence-0.058
 Phant-0.058
 har-0.057
 gib-0.057
nos-0.056
Token
Token ID251
Feature activation-3.70e-01

Pos logit contributions
 Caption+0.008
hement+0.006
odynamic+0.006
enegger+0.005
acterial+0.005
Neg logit contributions
 whence-0.055
 har-0.055
 Phant-0.055
 gib-0.055
 hay-0.054
Token.
Token ID13
Feature activation+8.11e-02

Pos logit contributions
 Caption+0.241
hement+0.225
odynamic+0.223
enegger+0.220
acterial+0.220
Neg logit contributions
 whence-0.247
 Phant-0.247
 har-0.245
 gib-0.243
nos-0.239
Token financial
Token ID3176
Feature activation-1.94e-01

Pos logit contributions
 har+0.008
 whence+0.008
 Phant+0.008
 gib+0.008
 hay+0.008
Neg logit contributions
 Caption-0.008
hement-0.008
odynamic-0.008
enegger-0.008
acterial-0.008
Token crisis
Token ID4902
Feature activation-3.17e-01

Pos logit contributions
 Caption+0.079
hement+0.070
odynamic+0.069
enegger+0.068
acterial+0.068
Neg logit contributions
 whence-0.177
 har-0.177
 Phant-0.176
 gib-0.175
 hay-0.172
Token.
Token ID13
Feature activation-1.74e-02

Pos logit contributions
 Caption+0.211
hement+0.199
odynamic+0.196
enegger+0.194
acterial+0.194
Neg logit contributions
 whence-0.207
 Phant-0.206
 har-0.206
 gib-0.204
nos-0.200
Token
Token ID198
Feature activation+6.02e-02

Pos logit contributions
 Caption+0.017
hement+0.016
odynamic+0.016
enegger+0.016
acterial+0.016
Neg logit contributions
 whence-0.006
 Phant-0.006
 gib-0.006
 har-0.006
nos-0.006
Token
Token ID198
Feature activation+5.99e-02

Pos logit contributions
 whence+0.017
 Phant+0.017
 har+0.017
 gib+0.017
nos+0.016
Neg logit contributions
 Caption-0.062
hement-0.060
odynamic-0.059
enegger-0.059
acterial-0.059
TokenHug
Token ID48098
Feature activation-8.14e-01

Pos logit contributions
 whence+0.046
 Phant+0.046
 har+0.046
 gib+0.045
nos+0.045
Neg logit contributions
 Caption-0.033
hement-0.031
odynamic-0.030
enegger-0.030
acterial-0.030
Tokenhes
Token ID956
Feature activation-7.32e-02

Pos logit contributions
 Caption+0.540
hement+0.505
odynamic+0.502
enegger+0.496
acterial+0.494
Neg logit contributions
 whence-0.537
 har-0.536
nos-0.530
 Phant-0.529
 gib-0.529
Token said
Token ID531
Feature activation+1.06e-01

Pos logit contributions
 Caption+0.058
hement+0.055
odynamic+0.054
enegger+0.054
acterial+0.053
Neg logit contributions
 whence-0.039
 Phant-0.039
 har-0.039
 gib-0.038
nos-0.037
Token the
Token ID262
Feature activation-2.54e-02

Pos logit contributions
 whence+0.061
 har+0.060
 Phant+0.060
 gib+0.060
 hay+0.058
Neg logit contributions
 Caption-0.079
hement-0.075
odynamic-0.075
enegger-0.074
acterial-0.074
Token
Token ID564
Feature activation+2.14e-01

Pos logit contributions
 Caption+0.018
hement+0.017
odynamic+0.017
enegger+0.017
acterial+0.017
Neg logit contributions
 whence-0.015
 har-0.015
 Phant-0.015
 gib-0.015
 hay-0.014
Token
Token ID250
Feature activation-1.59e-02

Pos logit contributions
 whence+0.161
 Phant+0.160
 har+0.160
 gib+0.158
nos+0.156
Neg logit contributions
 Caption-0.121
hement-0.112
odynamic-0.111
enegger-0.109
acterial-0.109
Token seats
Token ID8632
Feature activation-4.42e-01

Pos logit contributions
 Caption+0.299
hement+0.283
odynamic+0.280
enegger+0.277
acterial+0.276
Neg logit contributions
 har-0.242
 whence-0.242
 Phant-0.241
 gib-0.238
 hay-0.232
Token on
Token ID319
Feature activation-1.80e-01

Pos logit contributions
 Caption+0.330
hement+0.313
odynamic+0.310
enegger+0.308
acterial+0.306
Neg logit contributions
 whence-0.254
 har-0.253
 Phant-0.252
 gib-0.249
 hay-0.243
Token the
Token ID262
Feature activation-1.33e-01

Pos logit contributions
 Caption+0.143
hement+0.136
odynamic+0.135
enegger+0.133
acterial+0.133
Neg logit contributions
 whence-0.095
 Phant-0.094
 har-0.094
 gib-0.093
nos-0.091
Token Newcastle
Token ID22410
Feature activation-1.55e-01

Pos logit contributions
 Caption+0.078
hement+0.072
odynamic+0.072
enegger+0.071
acterial+0.070
Neg logit contributions
 whence-0.098
 Phant-0.098
 har-0.098
 gib-0.096
nos-0.095
Token United
Token ID1578
Feature activation-7.31e-02

Pos logit contributions
 Caption+0.109
hement+0.103
odynamic+0.102
enegger+0.101
acterial+0.101
Neg logit contributions
 whence-0.095
 Phant-0.095
 har-0.094
 gib-0.093
nos-0.091
Token Fans
Token ID24964
Feature activation-7.98e-01

Pos logit contributions
 Caption+0.025
hement+0.022
enegger+0.022
odynamic+0.022
acterial+0.021
Neg logit contributions
 Phant-0.072
 har-0.071
 whence-0.071
 gib-0.071
nos-0.070
Token Forum
Token ID14867
Feature activation-2.49e-01

Pos logit contributions
 Caption+0.501
hement+0.468
odynamic+0.463
enegger+0.456
acterial+0.455
Neg logit contributions
 whence-0.552
 Phant-0.551
 har-0.548
 gib-0.544
nos-0.534
Token below
Token ID2174
Feature activation-1.47e-01

Pos logit contributions
 Caption+0.187
hement+0.177
odynamic+0.175
enegger+0.173
acterial+0.172
Neg logit contributions
 whence-0.142
 Phant-0.141
 har-0.141
 gib-0.139
nos-0.136
Token:
Token ID25
Feature activation-1.65e-01

Pos logit contributions
 Caption+0.098
hement+0.092
odynamic+0.091
enegger+0.090
acterial+0.089
Neg logit contributions
 whence-0.096
 Phant-0.096
 har-0.095
 gib-0.094
nos-0.093
Token
Token ID198
Feature activation+4.99e-02

Pos logit contributions
 Caption+0.153
hement+0.145
odynamic+0.144
acterial+0.142
enegger+0.142
Neg logit contributions
 whence-0.065
 Phant-0.065
 har-0.064
 gib-0.064
nos-0.062
Token
Token ID198
Feature activation+5.14e-02

Pos logit contributions
 whence+0.014
 Phant+0.014
 har+0.014
 gib+0.014
nos+0.013
Neg logit contributions
 Caption-0.051
hement-0.049
odynamic-0.049
enegger-0.049
acterial-0.049
Tokenи
Token ID18849
Feature activation-1.65e-01

Pos logit contributions
 whence+0.005
 Phant+0.005
 har+0.005
 gib+0.005
nos+0.004
Neg logit contributions
 Caption-0.007
hement-0.006
odynamic-0.006
enegger-0.006
acterial-0.006
Tokenя
Token ID40623
Feature activation-2.45e-01

Pos logit contributions
 Caption+0.121
hement+0.114
odynamic+0.113
enegger+0.112
acterial+0.112
Neg logit contributions
 whence-0.097
 Phant-0.096
 har-0.096
 gib-0.095
nos-0.093
Token
Token ID141
Feature activation-2.34e-01

Pos logit contributions
 Caption+0.195
hement+0.185
odynamic+0.183
enegger+0.181
acterial+0.181
Neg logit contributions
 whence-0.129
 Phant-0.128
 har-0.127
 gib-0.126
nos-0.123
Token
Token ID227
Feature activation-4.21e-01

Pos logit contributions
 Caption+0.115
hement+0.105
odynamic+0.104
enegger+0.103
acterial+0.102
Neg logit contributions
 whence-0.194
 Phant-0.194
 har-0.193
 gib-0.191
nos-0.189
Token
Token ID12466
Feature activation+4.45e-02

Pos logit contributions
 Caption+0.342
hement+0.324
odynamic+0.322
enegger+0.318
acterial+0.317
Neg logit contributions
 whence-0.213
 Phant-0.213
 har-0.211
 gib-0.209
nos-0.204
Token
Token ID121
Feature activation-7.76e-01

Pos logit contributions
 whence+0.036
 Phant+0.036
 har+0.036
 gib+0.036
nos+0.035
Neg logit contributions
 Caption-0.023
hement-0.021
odynamic-0.021
enegger-0.020
acterial-0.020
Tokenа
Token ID16142
Feature activation-3.48e-02

Pos logit contributions
 Caption+0.575
hement+0.542
odynamic+0.538
acterial+0.530
enegger+0.530
Neg logit contributions
 whence-0.448
 Phant-0.448
 har-0.443
 gib-0.439
nos-0.431
Token
Token ID12466
Feature activation+8.40e-02

Pos logit contributions
 Caption+0.026
hement+0.025
odynamic+0.024
enegger+0.024
acterial+0.024
Neg logit contributions
 whence-0.020
 Phant-0.020
 har-0.020
 gib-0.020
nos-0.019
Token
Token ID242
Feature activation-2.62e-01

Pos logit contributions
 whence+0.067
 Phant+0.067
 har+0.067
 gib+0.066
nos+0.065
Neg logit contributions
 Caption-0.043
hement-0.040
odynamic-0.040
enegger-0.039
acterial-0.039
Tokenо
Token ID15166
Feature activation-1.61e-01

Pos logit contributions
 Caption+0.199
hement+0.188
odynamic+0.186
enegger+0.184
acterial+0.183
Neg logit contributions
 whence-0.148
 Phant-0.147
 har-0.146
 gib-0.145
nos-0.142
Tokenн
Token ID22177
Feature activation+1.13e-02

Pos logit contributions
 Caption+0.124
hement+0.117
odynamic+0.116
enegger+0.115
acterial+0.114
Neg logit contributions
 whence-0.088
 Phant-0.088
 har-0.087
 gib-0.086
nos-0.085
Token active
Token ID4075
Feature activation-5.92e-03

Pos logit contributions
 Caption+0.001
hement+0.001
enegger+0.001
odynamic+0.001
acterial+0.001
Neg logit contributions
 har-0.001
 whence-0.001
 Phant-0.001
 gib-0.001
nos-0.001
Token part
Token ID636
Feature activation-7.33e-02

Pos logit contributions
 Caption+0.004
hement+0.004
odynamic+0.004
enegger+0.004
acterial+0.003
Neg logit contributions
 har-0.004
 whence-0.004
 Phant-0.004
 gib-0.004
nos-0.004
Token in
Token ID287
Feature activation-1.64e-01

Pos logit contributions
 Caption+0.038
hement+0.035
odynamic+0.035
enegger+0.035
acterial+0.034
Neg logit contributions
 whence-0.059
 har-0.058
 Phant-0.058
 gib-0.058
nos-0.057
Token the
Token ID262
Feature activation-2.40e-02

Pos logit contributions
 Caption+0.121
hement+0.115
odynamic+0.114
enegger+0.113
acterial+0.112
Neg logit contributions
 whence-0.095
 har-0.094
 Phant-0.094
 gib-0.093
nos-0.090
Token new
Token ID649
Feature activation-1.75e-03

Pos logit contributions
 Caption+0.015
hement+0.014
odynamic+0.014
enegger+0.014
acterial+0.013
Neg logit contributions
 har-0.017
 whence-0.017
 Phant-0.017
 gib-0.017
 hay-0.016
Token Fans
Token ID24964
Feature activation-7.75e-01

Pos logit contributions
 Caption+0.001
hement+0.001
odynamic+0.001
enegger+0.001
acterial+0.001
Neg logit contributions
 har-0.001
 whence-0.001
 Phant-0.001
 gib-0.001
 hay-0.001
Token'
Token ID6
Feature activation+5.47e-02

Pos logit contributions
 Caption+0.513
hement+0.497
odynamic+0.486
enegger+0.486
acterial+0.485
Neg logit contributions
 har-0.505
 Phant-0.503
 whence-0.502
 gib-0.493
nos-0.484
Token Forum
Token ID14867
Feature activation-2.15e-01

Pos logit contributions
 whence+0.042
 Phant+0.042
 har+0.042
 gib+0.042
nos+0.041
Neg logit contributions
 Caption-0.030
hement-0.028
odynamic-0.027
enegger-0.027
acterial-0.027
Token.
Token ID13
Feature activation-5.47e-03

Pos logit contributions
 Caption+0.145
hement+0.137
odynamic+0.135
enegger+0.134
acterial+0.133
Neg logit contributions
 whence-0.139
 har-0.138
 Phant-0.138
 gib-0.136
nos-0.133
Token It
Token ID632
Feature activation-1.35e-01

Pos logit contributions
 Caption+0.004
hement+0.004
odynamic+0.004
acterial+0.004
enegger+0.004
Neg logit contributions
 Phant-0.003
 whence-0.003
 har-0.003
 gib-0.003
nos-0.003
Token is
Token ID318
Feature activation-7.47e-02

Pos logit contributions
 Caption+0.076
hement+0.071
odynamic+0.070
enegger+0.069
acterial+0.069
Neg logit contributions
 har-0.102
 whence-0.102
 Phant-0.101
 gib-0.100
 hay-0.098
Token starts
Token ID4940
Feature activation-1.28e-02

Pos logit contributions
 whence+0.031
 Phant+0.031
 har+0.031
 gib+0.030
nos+0.030
Neg logit contributions
 Caption-0.031
hement-0.029
odynamic-0.029
enegger-0.029
acterial-0.029
Token at
Token ID379
Feature activation+8.18e-03

Pos logit contributions
 Caption+0.009
hement+0.008
odynamic+0.008
enegger+0.008
acterial+0.008
Neg logit contributions
 whence-0.008
 Phant-0.008
 har-0.008
 gib-0.008
nos-0.008
Token Sh
Token ID911
Feature activation-2.52e-01

Pos logit contributions
 whence+0.004
 Phant+0.004
 har+0.004
 gib+0.004
nos+0.004
Neg logit contributions
 Caption-0.007
hement-0.007
odynamic-0.007
enegger-0.007
acterial-0.007
Tokenarm
Token ID1670
Feature activation-3.98e-02

Pos logit contributions
 Caption+0.162
hement+0.150
enegger+0.148
odynamic+0.148
acterial+0.145
Neg logit contributions
 whence-0.172
 har-0.171
 Phant-0.170
 gib-0.168
nos-0.168
Token el
Token ID1288
Feature activation+6.37e-02

Pos logit contributions
 Caption+0.029
hement+0.027
odynamic+0.027
enegger+0.027
acterial+0.027
Neg logit contributions
 whence-0.024
 har-0.024
 Phant-0.024
 gib-0.023
nos-0.023
Token Sheikh
Token ID30843
Feature activation-7.71e-01

Pos logit contributions
 whence+0.042
 Phant+0.042
 har+0.042
 gib+0.042
nos+0.041
Neg logit contributions
 Caption-0.042
hement-0.039
odynamic-0.039
enegger-0.038
acterial-0.038
Token,
Token ID11
Feature activation+3.85e-02

Pos logit contributions
 Caption+0.509
hement+0.475
odynamic+0.472
enegger+0.465
acterial+0.464
Neg logit contributions
 whence-0.507
 Phant-0.505
 har-0.502
 gib-0.498
nos-0.490
Token Egypt
Token ID6365
Feature activation+1.18e-01

Pos logit contributions
 Phant+0.022
 whence+0.022
 har+0.022
 gib+0.022
nos+0.022
Neg logit contributions
 Caption-0.028
hement-0.027
odynamic-0.026
enegger-0.026
acterial-0.026
Token,
Token ID11
Feature activation+1.24e-02

Pos logit contributions
 Phant+0.076
 whence+0.076
 har+0.075
 gib+0.075
nos+0.073
Neg logit contributions
 Caption-0.080
hement-0.075
odynamic-0.074
enegger-0.073
acterial-0.073
Token and
Token ID290
Feature activation-1.14e-01

Pos logit contributions
 Phant+0.008
 whence+0.008
 har+0.008
 gib+0.008
nos+0.007
Neg logit contributions
 Caption-0.009
hement-0.008
odynamic-0.008
enegger-0.008
acterial-0.008
Token cru
Token ID4630
Feature activation-1.31e-03

Pos logit contributions
 Caption+0.084
hement+0.079
odynamic+0.079
enegger+0.078
acterial+0.078
Neg logit contributions
 whence-0.067
 Phant-0.067
 har-0.066
 gib-0.066
nos-0.064
Token Qu
Token ID2264
Feature activation+1.17e-01

Pos logit contributions
 Phant+0.079
 whence+0.078
 har+0.078
 gib+0.077
nos+0.076
Neg logit contributions
 Caption-0.092
hement-0.087
odynamic-0.086
acterial-0.085
enegger-0.085
Tokenito
Token ID10094
Feature activation-1.39e-01

Pos logit contributions
 har+0.100
 whence+0.099
 gib+0.098
nos+0.098
 Phant+0.098
Neg logit contributions
 Caption-0.056
enegger-0.050
hement-0.050
odynamic-0.049
utherland-0.048
Token to
Token ID284
Feature activation-2.33e-02

Pos logit contributions
 Caption+0.099
hement+0.093
odynamic+0.092
enegger+0.091
acterial+0.091
Neg logit contributions
 Phant-0.084
 whence-0.084
 har-0.083
 gib-0.082
nos-0.081
Token fight
Token ID1907
Feature activation-3.50e-01

Pos logit contributions
 Caption+0.018
hement+0.017
odynamic+0.017
enegger+0.017
acterial+0.017
Neg logit contributions
 whence-0.013
 har-0.013
 Phant-0.013
 gib-0.013
nos-0.012
Token his
Token ID465
Feature activation+1.17e-02

Pos logit contributions
 Caption+0.258
hement+0.243
odynamic+0.242
enegger+0.239
acterial+0.238
Neg logit contributions
 whence-0.204
 har-0.203
 Phant-0.202
 gib-0.200
nos-0.195
Token eject
Token ID22189
Feature activation-7.68e-01

Pos logit contributions
 har+0.008
 whence+0.008
 Phant+0.008
 gib+0.008
 hay+0.008
Neg logit contributions
 Caption-0.007
hement-0.007
odynamic-0.007
enegger-0.007
acterial-0.007
Tokenion
Token ID295
Feature activation-9.76e-03

Pos logit contributions
 Caption+0.622
hement+0.590
odynamic+0.586
enegger+0.583
acterial+0.578
Neg logit contributions
 whence-0.391
 Phant-0.389
 har-0.389
 gib-0.383
nos-0.373
Token,
Token ID11
Feature activation-5.58e-02

Pos logit contributions
 Caption+0.007
hement+0.006
odynamic+0.006
enegger+0.006
acterial+0.006
Neg logit contributions
 whence-0.006
 har-0.006
 Phant-0.006
 gib-0.006
 hay-0.006
Token should
Token ID815
Feature activation-1.16e-01

Pos logit contributions
 Caption+0.044
hement+0.041
odynamic+0.041
enegger+0.041
acterial+0.040
Neg logit contributions
 whence-0.030
 Phant-0.030
 har-0.030
 gib-0.029
nos-0.029
Token he
Token ID339
Feature activation-1.04e-01

Pos logit contributions
 Caption+0.088
hement+0.083
odynamic+0.082
enegger+0.081
acterial+0.081
Neg logit contributions
 whence-0.066
 Phant-0.065
 har-0.065
 gib-0.064
nos-0.063
Token be
Token ID307
Feature activation+1.17e-01

Pos logit contributions
 Caption+0.080
hement+0.075
odynamic+0.075
enegger+0.074
acterial+0.074
Neg logit contributions
 whence-0.057
 har-0.057
 Phant-0.057
 gib-0.056
nos-0.055
Tokenс
Token ID21727
Feature activation-3.45e-02

Pos logit contributions
 Caption+0.018
hement+0.017
odynamic+0.016
enegger+0.016
acterial+0.016
Neg logit contributions
 whence-0.014
 Phant-0.014
 har-0.014
 gib-0.014
nos-0.014
Tokenс
Token ID21727
Feature activation-3.03e-02

Pos logit contributions
 Caption+0.025
hement+0.024
odynamic+0.024
enegger+0.024
acterial+0.023
Neg logit contributions
 whence-0.020
 Phant-0.020
 har-0.020
 gib-0.020
nos-0.019
Tokenе
Token ID16843
Feature activation-2.00e-01

Pos logit contributions
 Caption+0.024
hement+0.023
odynamic+0.023
enegger+0.023
acterial+0.023
Neg logit contributions
 whence-0.015
 Phant-0.015
 har-0.015
 gib-0.015
nos-0.015
Token,
Token ID11
Feature activation-9.91e-02

Pos logit contributions
 Caption+0.145
hement+0.137
odynamic+0.136
enegger+0.134
acterial+0.134
Neg logit contributions
 whence-0.119
 Phant-0.118
 har-0.118
 gib-0.117
nos-0.114
Token
Token ID12466
Feature activation+9.16e-02

Pos logit contributions
 Caption+0.082
hement+0.077
odynamic+0.077
enegger+0.076
acterial+0.076
Neg logit contributions
 whence-0.049
 Phant-0.049
 har-0.048
 gib-0.048
nos-0.047
Token
Token ID121
Feature activation-7.66e-01

Pos logit contributions
 whence+0.071
 Phant+0.071
 har+0.071
 gib+0.070
nos+0.069
Neg logit contributions
 Caption-0.050
hement-0.046
odynamic-0.045
enegger-0.045
acterial-0.045
Tokenе
Token ID16843
Feature activation-1.45e-01

Pos logit contributions
 Caption+0.570
hement+0.538
odynamic+0.534
acterial+0.526
enegger+0.525
Neg logit contributions
 whence-0.439
 Phant-0.439
 har-0.434
 gib-0.431
nos-0.422
Tokenк
Token ID31583
Feature activation+1.58e-01

Pos logit contributions
 Caption+0.116
hement+0.110
odynamic+0.109
enegger+0.108
acterial+0.108
Neg logit contributions
 whence-0.075
 Phant-0.075
 har-0.074
 gib-0.073
nos-0.072
Tokenо
Token ID15166
Feature activation-1.74e-01

Pos logit contributions
 whence+0.085
 Phant+0.084
 har+0.084
 gib+0.083
nos+0.081
Neg logit contributions
 Caption-0.124
hement-0.117
odynamic-0.116
enegger-0.115
acterial-0.115
Tokenт
Token ID20375
Feature activation-1.18e-01

Pos logit contributions
 Caption+0.121
hement+0.113
odynamic+0.112
enegger+0.111
acterial+0.110
Neg logit contributions
 whence-0.108
 Phant-0.108
 har-0.107
 gib-0.106
nos-0.105
Tokenо
Token ID15166
Feature activation-1.21e-01

Pos logit contributions
 Caption+0.094
hement+0.089
odynamic+0.088
acterial+0.087
enegger+0.087
Neg logit contributions
 whence-0.062
 Phant-0.061
 har-0.061
 gib-0.060
nos-0.059
Token."
Token ID526
Feature activation-4.29e-02

Pos logit contributions
 Caption+0.085
hement+0.081
odynamic+0.080
enegger+0.079
acterial+0.078
Neg logit contributions
 whence-0.077
 har-0.076
 Phant-0.076
 gib-0.076
nos-0.074
Token
Token ID198
Feature activation+2.03e-01

Pos logit contributions
 Caption+0.042
hement+0.040
odynamic+0.040
enegger+0.039
acterial+0.039
Neg logit contributions
 whence-0.015
 Phant-0.015
 har-0.015
 gib-0.015
ancies-0.014
Token
Token ID198
Feature activation+1.83e-01

Pos logit contributions
 whence+0.058
 har+0.057
 Phant+0.057
 gib+0.055
nos+0.053
Neg logit contributions
 Caption-0.210
hement-0.203
odynamic-0.202
enegger-0.200
acterial-0.200
TokenLu
Token ID25596
Feature activation-1.01e-01

Pos logit contributions
 whence+0.141
 har+0.141
 Phant+0.141
 gib+0.139
nos+0.138
Neg logit contributions
 Caption-0.101
hement-0.094
odynamic-0.092
enegger-0.092
acterial-0.091
Tokenol
Token ID349
Feature activation+2.05e-01

Pos logit contributions
 Caption+0.081
hement+0.078
enegger+0.077
odynamic+0.075
utherland+0.075
Neg logit contributions
 har-0.052
 whence-0.052
 Phant-0.051
 gib-0.050
nos-0.050
Token Deng
Token ID41985
Feature activation-7.54e-01

Pos logit contributions
 har+0.205
 whence+0.204
 Phant+0.203
 gib+0.203
 hay+0.199
Neg logit contributions
 Caption-0.066
hement-0.058
enegger-0.056
odynamic-0.056
acterial-0.055
Token was
Token ID373
Feature activation+6.13e-03

Pos logit contributions
 Caption+0.487
hement+0.456
odynamic+0.452
enegger+0.446
acterial+0.444
Neg logit contributions
 whence-0.508
 Phant-0.506
 har-0.505
 gib-0.500
nos-0.491
Token traded
Token ID14018
Feature activation+1.07e-02

Pos logit contributions
 whence+0.003
 Phant+0.003
 har+0.003
 gib+0.003
nos+0.003
Neg logit contributions
 Caption-0.005
hement-0.004
odynamic-0.004
enegger-0.004
acterial-0.004
Token to
Token ID284
Feature activation+6.98e-02

Pos logit contributions
 whence+0.007
 har+0.007
 Phant+0.007
 gib+0.006
 hay+0.006
Neg logit contributions
 Caption-0.007
hement-0.007
odynamic-0.007
enegger-0.007
acterial-0.007
Token the
Token ID262
Feature activation+5.46e-02

Pos logit contributions
 whence+0.040
 Phant+0.040
 har+0.040
 gib+0.039
nos+0.038
Neg logit contributions
 Caption-0.052
hement-0.050
odynamic-0.049
enegger-0.049
acterial-0.049
Token Cavs
Token ID38943
Feature activation-3.12e-01

Pos logit contributions
 Phant+0.034
 har+0.034
 whence+0.034
 gib+0.033
 hay+0.033
Neg logit contributions
 Caption-0.038
hement-0.036
odynamic-0.036
enegger-0.036
acterial-0.035
Token United
Token ID1578
Feature activation-5.42e-02

Pos logit contributions
 Caption+0.115
hement+0.108
odynamic+0.108
enegger+0.106
acterial+0.106
Neg logit contributions
 whence-0.098
 Phant-0.098
 har-0.097
 gib-0.097
nos-0.095
Token to
Token ID284
Feature activation-2.04e-02

Pos logit contributions
 Caption+0.036
hement+0.034
odynamic+0.033
enegger+0.033
acterial+0.033
Neg logit contributions
 whence-0.036
 Phant-0.036
 har-0.035
 gib-0.035
nos-0.034
Token launch
Token ID4219
Feature activation-1.88e-01

Pos logit contributions
 Caption+0.014
hement+0.013
odynamic+0.013
enegger+0.013
acterial+0.013
Neg logit contributions
 har-0.013
 whence-0.013
 Phant-0.013
 gib-0.013
 hay-0.012
Token their
Token ID511
Feature activation+1.30e-01

Pos logit contributions
 Caption+0.152
hement+0.145
odynamic+0.144
enegger+0.143
acterial+0.142
Neg logit contributions
 whence-0.095
 Phant-0.095
 har-0.094
 gib-0.093
nos-0.091
Token new
Token ID649
Feature activation-1.77e-02

Pos logit contributions
 har+0.102
 whence+0.102
 Phant+0.102
 gib+0.101
 hay+0.100
Neg logit contributions
 Caption-0.069
hement-0.064
enegger-0.064
odynamic-0.063
acterial-0.063
Token Fans
Token ID24964
Feature activation-7.54e-01

Pos logit contributions
 Caption+0.009
hement+0.008
enegger+0.008
odynamic+0.008
acterial+0.008
Neg logit contributions
 har-0.015
 whence-0.015
 Phant-0.015
 gib-0.015
 hay-0.014
Token'
Token ID6
Feature activation+2.00e-02

Pos logit contributions
 Caption+0.493
hement+0.471
enegger+0.463
odynamic+0.462
acterial+0.460
Neg logit contributions
 har-0.498
 Phant-0.496
 whence-0.496
 gib-0.489
nos-0.479
Token Forum
Token ID14867
Feature activation-2.29e-01

Pos logit contributions
 whence+0.015
 Phant+0.015
 har+0.015
 gib+0.015
nos+0.015
Neg logit contributions
 Caption-0.011
hement-0.010
odynamic-0.010
enegger-0.010
acterial-0.010
Token,
Token ID11
Feature activation-5.78e-02

Pos logit contributions
 Caption+0.155
hement+0.146
odynamic+0.145
enegger+0.143
acterial+0.143
Neg logit contributions
 whence-0.147
 har-0.147
 Phant-0.146
 gib-0.144
nos-0.141
Token and
Token ID290
Feature activation-1.24e-01

Pos logit contributions
 Caption+0.046
hement+0.044
odynamic+0.044
enegger+0.043
acterial+0.043
Neg logit contributions
 whence-0.030
 har-0.030
 Phant-0.030
 gib-0.029
nos-0.029
Token will
Token ID481
Feature activation-2.27e-01

Pos logit contributions
 Caption+0.088
enegger+0.083
odynamic+0.083
hement+0.082
acterial+0.082
Neg logit contributions
 har-0.076
 whence-0.076
 Phant-0.076
 gib-0.074
 hay-0.072
Tokenн
Token ID22177
Feature activation+4.35e-04

Pos logit contributions
 whence+0.011
 Phant+0.011
 har+0.011
 gib+0.011
nos+0.011
Neg logit contributions
 Caption-0.017
hement-0.016
odynamic-0.016
enegger-0.016
acterial-0.016
Tokenы
Token ID45035
Feature activation+1.16e-02

Pos logit contributions
 whence+0.000
 Phant+0.000
 har+0.000
 gib+0.000
nos+0.000
Neg logit contributions
 Caption-0.000
hement-0.000
odynamic-0.000
enegger-0.000
acterial-0.000
Token
Token ID141
Feature activation-1.60e-01

Pos logit contributions
 whence+0.007
 Phant+0.007
 har+0.007
 gib+0.007
nos+0.007
Neg logit contributions
 Caption-0.008
hement-0.008
odynamic-0.008
enegger-0.008
acterial-0.008
Token
Token ID227
Feature activation-4.10e-01

Pos logit contributions
 Caption+0.087
hement+0.080
odynamic+0.079
enegger+0.078
acterial+0.077
Neg logit contributions
 whence-0.124
 Phant-0.124
 har-0.124
 gib-0.123
nos-0.121
Token
Token ID12466
Feature activation+1.05e-01

Pos logit contributions
 Caption+0.340
hement+0.323
odynamic+0.320
enegger+0.316
acterial+0.316
Neg logit contributions
 whence-0.200
 Phant-0.200
 har-0.198
 gib-0.195
nos-0.191
Token
Token ID121
Feature activation-7.39e-01

Pos logit contributions
 whence+0.087
 Phant+0.087
 har+0.087
 gib+0.086
nos+0.085
Neg logit contributions
 Caption-0.052
hement-0.048
odynamic-0.047
enegger-0.046
acterial-0.046
Tokenа
Token ID16142
Feature activation+7.17e-03

Pos logit contributions
 Caption+0.549
hement+0.518
odynamic+0.514
acterial+0.506
enegger+0.506
Neg logit contributions
 whence-0.426
 Phant-0.425
 har-0.421
 gib-0.417
nos-0.409
Token
Token ID141
Feature activation-1.21e-01

Pos logit contributions
 whence+0.004
 Phant+0.004
 har+0.004
 gib+0.004
nos+0.004
Neg logit contributions
 Caption-0.005
hement-0.005
odynamic-0.005
enegger-0.005
acterial-0.005
Token
Token ID239
Feature activation-5.25e-01

Pos logit contributions
 Caption+0.063
hement+0.058
odynamic+0.057
enegger+0.056
acterial+0.055
Neg logit contributions
 whence-0.097
 Phant-0.097
 har-0.097
 gib-0.096
nos-0.094
Tokenм
Token ID43108
Feature activation-3.35e-01

Pos logit contributions
 Caption+0.324
hement+0.302
odynamic+0.299
enegger+0.294
acterial+0.294
Neg logit contributions
 whence-0.368
 Phant-0.368
 har-0.365
 gib-0.363
nos-0.357
Tokenн
Token ID22177
Feature activation-3.07e-02

Pos logit contributions
 Caption+0.258
hement+0.244
odynamic+0.243
acterial+0.239
enegger+0.239
Neg logit contributions
 whence-0.183
 Phant-0.183
 har-0.180
 gib-0.179
nos-0.176
Token the
Token ID262
Feature activation+8.82e-02

Pos logit contributions
 whence+0.045
 Phant+0.045
 har+0.045
 gib+0.044
 hay+0.043
Neg logit contributions
 Caption-0.066
odynamic-0.063
hement-0.063
enegger-0.063
utherland-0.062
Token court
Token ID2184
Feature activation+1.84e-01

Pos logit contributions
 Phant+0.054
 har+0.053
 gib+0.053
 whence+0.052
 hay+0.052
Neg logit contributions
 Caption-0.064
enegger-0.062
odynamic-0.061
hement-0.061
utherland-0.060
Token.
Token ID13
Feature activation+1.12e-01

Pos logit contributions
 har+0.129
 whence+0.129
 Phant+0.128
 gib+0.127
 hay+0.125
Neg logit contributions
 Caption-0.115
hement-0.108
odynamic-0.108
enegger-0.107
acterial-0.105
Token Smith
Token ID4176
Feature activation-1.10e-01

Pos logit contributions
 whence+0.059
 Phant+0.058
 har+0.058
 gib+0.057
nos+0.056
Neg logit contributions
 Caption-0.089
hement-0.084
odynamic-0.083
acterial-0.082
enegger-0.082
Token was
Token ID373
Feature activation+6.39e-02

Pos logit contributions
 Caption+0.075
odynamic+0.071
hement+0.071
acterial+0.070
utherland+0.069
Neg logit contributions
 har-0.072
 whence-0.071
 Phant-0.070
 gib-0.070
 hay-0.068
Token ejected
Token ID38632
Feature activation-7.34e-01

Pos logit contributions
 har+0.041
 Phant+0.041
 whence+0.041
 gib+0.041
 hay+0.040
Neg logit contributions
 Caption-0.043
hement-0.041
odynamic-0.041
enegger-0.041
acterial-0.040
Token and
Token ID290
Feature activation-2.22e-02

Pos logit contributions
 Caption+0.508
hement+0.480
odynamic+0.479
enegger+0.478
acterial+0.470
Neg logit contributions
 whence-0.463
 har-0.462
 Phant-0.459
 gib-0.453
 hay-0.445
Token suspended
Token ID9951
Feature activation+2.31e-01

Pos logit contributions
 Caption+0.015
odynamic+0.014
enegger+0.014
hement+0.014
acterial+0.014
Neg logit contributions
 har-0.015
 whence-0.014
 Phant-0.014
 gib-0.014
 hay-0.014
Token for
Token ID329
Feature activation+9.91e-02

Pos logit contributions
 whence+0.147
 har+0.146
 Phant+0.146
 gib+0.143
 hay+0.141
Neg logit contributions
 Caption-0.158
hement-0.150
odynamic-0.149
enegger-0.148
acterial-0.147
Token two
Token ID734
Feature activation-2.88e-01

Pos logit contributions
 har+0.059
 whence+0.058
 Phant+0.058
 gib+0.057
 hay+0.056
Neg logit contributions
 Caption-0.072
hement-0.069
odynamic-0.068
enegger-0.068
acterial-0.067
Token games
Token ID1830
Feature activation+1.44e-01

Pos logit contributions
 Caption+0.232
hement+0.219
enegger+0.219
odynamic+0.219
acterial+0.215
Neg logit contributions
 har-0.151
 Phant-0.150
 whence-0.149
 gib-0.147
 hay-0.143
Token in
Token ID287
Feature activation-4.36e-02

Pos logit contributions
 whence+0.084
 har+0.083
 Phant+0.083
 gib+0.082
nos+0.081
Neg logit contributions
 Caption-0.056
hement-0.052
odynamic-0.051
acterial-0.050
enegger-0.050
Token public
Token ID1171
Feature activation+1.05e-01

Pos logit contributions
 Caption+0.032
hement+0.030
odynamic+0.030
enegger+0.029
acterial+0.029
Neg logit contributions
 whence-0.026
 Phant-0.026
 har-0.026
 gib-0.026
nos-0.025
Token forums
Token ID14216
Feature activation-1.18e-01

Pos logit contributions
 whence+0.073
 har+0.072
 Phant+0.072
 gib+0.072
nos+0.070
Neg logit contributions
 Caption-0.066
hement-0.062
odynamic-0.062
enegger-0.061
acterial-0.061
Token.
Token ID13
Feature activation+1.81e-03

Pos logit contributions
 Caption+0.081
hement+0.076
odynamic+0.075
enegger+0.074
acterial+0.074
Neg logit contributions
 whence-0.075
 Phant-0.075
 har-0.075
 gib-0.074
nos-0.073
Token Says
Token ID28628
Feature activation-4.62e-02

Pos logit contributions
 Phant+0.001
 whence+0.001
 har+0.001
 gib+0.001
nos+0.001
Neg logit contributions
 Caption-0.001
hement-0.001
odynamic-0.001
acterial-0.001
enegger-0.001
Token Neh
Token ID44470
Feature activation-7.34e-01

Pos logit contributions
 Caption+0.023
hement+0.021
odynamic+0.021
enegger+0.020
acterial+0.020
Neg logit contributions
 Phant-0.038
 whence-0.038
 har-0.038
 gib-0.038
nos-0.037
Tokenring
Token ID1806
Feature activation-1.12e-01

Pos logit contributions
 Caption+0.512
hement+0.486
odynamic+0.477
enegger+0.474
utherland+0.471
Neg logit contributions
 har-0.460
 whence-0.456
 Phant-0.449
 gib-0.446
nos-0.443
Token,
Token ID11
Feature activation+5.16e-02

Pos logit contributions
 Caption+0.075
hement+0.071
odynamic+0.070
enegger+0.069
acterial+0.069
Neg logit contributions
 whence-0.072
 Phant-0.072
 har-0.072
 gib-0.071
nos-0.070
Token
Token ID198
Feature activation+1.43e-01

Pos logit contributions
 whence+0.029
 Phant+0.029
 har+0.029
 gib+0.029
nos+0.028
Neg logit contributions
 Caption-0.038
hement-0.036
odynamic-0.036
enegger-0.036
acterial-0.035
Token
Token ID198
Feature activation+1.38e-01

Pos logit contributions
 whence+0.040
 har+0.040
 Phant+0.040
 gib+0.038
nos+0.037
Neg logit contributions
 Caption-0.149
hement-0.144
odynamic-0.143
enegger-0.142
acterial-0.141
TokenAll
Token ID3237
Feature activation-1.26e-01

Pos logit contributions
 whence+0.104
 Phant+0.103
 har+0.103
 gib+0.102
nos+0.102
Neg logit contributions
 Caption-0.078
hement-0.073
odynamic-0.071
enegger-0.071
acterial-0.071
Token items
Token ID3709
Feature activation-2.41e-02

Pos logit contributions
 har+0.118
 whence+0.118
 Phant+0.117
 gib+0.117
 hay+0.114
Neg logit contributions
 Caption-0.131
hement-0.124
odynamic-0.122
enegger-0.122
acterial-0.121
Token,
Token ID11
Feature activation+3.03e-02

Pos logit contributions
 Caption+0.018
hement+0.018
odynamic+0.017
enegger+0.017
acterial+0.017
Neg logit contributions
 whence-0.014
 har-0.013
 Phant-0.013
 gib-0.013
 hay-0.013
Token via
Token ID2884
Feature activation-1.32e-01

Pos logit contributions
 whence+0.018
 har+0.018
 Phant+0.018
 gib+0.018
 hay+0.017
Neg logit contributions
 Caption-0.022
hement-0.021
odynamic-0.021
enegger-0.021
acterial-0.020
Token a
Token ID257
Feature activation+4.86e-03

Pos logit contributions
 Caption+0.103
hement+0.098
odynamic+0.097
enegger+0.097
acterial+0.096
Neg logit contributions
 whence-0.071
 har-0.071
 Phant-0.070
 gib-0.070
 hay-0.068
Token revised
Token ID15556
Feature activation+1.55e-01

Pos logit contributions
 har+0.003
 whence+0.003
 Phant+0.003
 gib+0.003
 hay+0.003
Neg logit contributions
 Caption-0.004
hement-0.003
odynamic-0.003
enegger-0.003
acterial-0.003
Token Fans
Token ID24964
Feature activation-7.26e-01

Pos logit contributions
 har+0.094
 whence+0.093
 Phant+0.093
 gib+0.093
 hay+0.091
Neg logit contributions
 Caption-0.110
hement-0.104
odynamic-0.104
enegger-0.103
acterial-0.102
Token section
Token ID2665
Feature activation+1.16e-01

Pos logit contributions
 Caption+0.458
hement+0.428
odynamic+0.424
enegger+0.416
acterial+0.415
Neg logit contributions
 whence-0.500
 Phant-0.499
 har-0.496
 gib-0.492
nos-0.484
Token on
Token ID319
Feature activation-6.62e-02

Pos logit contributions
 whence+0.072
 Phant+0.072
 har+0.071
 gib+0.071
nos+0.069
Neg logit contributions
 Caption-0.081
hement-0.076
odynamic-0.075
enegger-0.074
acterial-0.074
Token www
Token ID7324
Feature activation+1.69e-01

Pos logit contributions
 Caption+0.052
hement+0.049
odynamic+0.048
enegger+0.048
acterial+0.048
Neg logit contributions
 whence-0.036
 Phant-0.036
 har-0.036
 gib-0.035
nos-0.034
Token.
Token ID13
Feature activation-6.46e-03

Pos logit contributions
 har+0.108
 whence+0.107
 Phant+0.106
nos+0.105
 gib+0.104
Neg logit contributions
 Caption-0.114
hement-0.111
odynamic-0.108
acterial-0.107
enegger-0.107
Tokenn
Token ID77
Feature activation+3.65e-02

Pos logit contributions
 Caption+0.005
hement+0.005
odynamic+0.005
enegger+0.005
acterial+0.005
Neg logit contributions
 whence-0.004
 Phant-0.004
 har-0.004
 gib-0.003
nos-0.003
Token 150
Token ID6640
Feature activation-1.27e-01

Pos logit contributions
 Caption+0.086
hement+0.081
odynamic+0.081
enegger+0.080
acterial+0.080
Neg logit contributions
 Phant-0.079
 har-0.079
 whence-0.078
 gib-0.078
 hay-0.076
Tokenth
Token ID400
Feature activation+1.30e-01

Pos logit contributions
 Caption+0.109
hement+0.104
odynamic+0.103
enegger+0.102
acterial+0.102
Neg logit contributions
 whence-0.058
 Phant-0.058
 har-0.058
 gib-0.057
nos-0.056
Token birthday
Token ID10955
Feature activation+2.58e-01

Pos logit contributions
 Phant+0.068
 whence+0.067
 har+0.067
 gib+0.066
nos+0.065
Neg logit contributions
 Caption-0.104
hement-0.099
odynamic-0.098
enegger-0.097
acterial-0.097
Token of
Token ID286
Feature activation-5.92e-04

Pos logit contributions
 Phant+0.117
 whence+0.117
 har+0.116
 gib+0.115
nos+0.113
Neg logit contributions
 Caption-0.222
hement-0.211
odynamic-0.209
acterial-0.206
-0.206
Token Frank
Token ID5278
Feature activation-3.36e-02

Pos logit contributions
 Caption+0.000
hement+0.000
odynamic+0.000
enegger+0.000
acterial+0.000
Neg logit contributions
 Phant-0.000
 whence-0.000
 har-0.000
 gib-0.000
nos-0.000
Token Lloyd
Token ID22361
Feature activation-7.25e-01

Pos logit contributions
 Caption+0.021
hement+0.019
odynamic+0.019
enegger+0.019
acterial+0.019
Neg logit contributions
 Phant-0.024
 whence-0.024
 har-0.023
 gib-0.023
nos-0.023
Token Wright
Token ID12206
Feature activation-2.95e-01

Pos logit contributions
 Caption+0.348
hement+0.327
enegger+0.318
odynamic+0.316
acterial+0.307
Neg logit contributions
 har-0.613
 Phant-0.610
 whence-0.610
 gib-0.608
 hay-0.596
Token,
Token ID11
Feature activation-1.36e-01

Pos logit contributions
 Caption+0.203
hement+0.190
odynamic+0.188
enegger+0.187
acterial+0.185
Neg logit contributions
 whence-0.187
 Phant-0.186
 har-0.185
 gib-0.184
nos-0.181
Token America
Token ID2253
Feature activation-2.74e-02

Pos logit contributions
 Caption+0.102
hement+0.097
odynamic+0.096
enegger+0.095
acterial+0.094
Neg logit contributions
 whence-0.078
 Phant-0.077
 har-0.077
 gib-0.076
nos-0.074
Token
Token ID447
Feature activation-1.50e-01

Pos logit contributions
 Caption+0.022
hement+0.021
odynamic+0.020
enegger+0.020
acterial+0.020
Neg logit contributions
 Phant-0.015
 whence-0.015
 har-0.015
 gib-0.014
nos-0.014
Token
Token ID247
Feature activation-5.80e-02

Pos logit contributions
 Caption+0.069
hement+0.063
odynamic+0.062
enegger+0.061
acterial+0.061
Neg logit contributions
 whence-0.128
 Phant-0.128
 har-0.128
 gib-0.127
nos-0.125
Token full
Token ID1336
Feature activation-2.62e-02

Pos logit contributions
 whence+0.023
 har+0.023
 Phant+0.023
 gib+0.023
nos+0.023
Neg logit contributions
 Caption-0.025
hement-0.023
odynamic-0.023
enegger-0.023
acterial-0.022
Token bailout
Token ID29928
Feature activation+7.75e-03

Pos logit contributions
 Caption+0.017
hement+0.016
odynamic+0.015
enegger+0.015
acterial+0.015
Neg logit contributions
 Phant-0.018
 whence-0.018
 har-0.018
 gib-0.017
nos-0.017
Token.
Token ID13
Feature activation-5.84e-02

Pos logit contributions
 whence+0.005
 Phant+0.005
 har+0.005
 gib+0.005
nos+0.005
Neg logit contributions
 Caption-0.005
hement-0.005
odynamic-0.005
enegger-0.005
acterial-0.005
Token
Token ID198
Feature activation+3.70e-02

Pos logit contributions
 Caption+0.051
hement+0.049
odynamic+0.049
acterial+0.048
enegger+0.048
Neg logit contributions
 whence-0.026
 Phant-0.026
 har-0.026
 gib-0.025
nos-0.024
Token
Token ID198
Feature activation+2.17e-02

Pos logit contributions
 whence+0.011
 Phant+0.010
 har+0.010
 gib+0.010
nos+0.010
Neg logit contributions
 Caption-0.038
hement-0.037
odynamic-0.037
enegger-0.036
acterial-0.036
TokenStat
Token ID17126
Feature activation+4.54e-01

Pos logit contributions
 whence+0.017
 har+0.017
 Phant+0.017
nos+0.017
 gib+0.017
Neg logit contributions
 Caption-0.011
hement-0.010
odynamic-0.010
enegger-0.010
acterial-0.010
Tokenements
Token ID3196
Feature activation+2.45e-01

Pos logit contributions
 har+0.247
 whence+0.247
 Phant+0.245
 gib+0.242
nos+0.238
Neg logit contributions
 Caption-0.352
hement-0.331
odynamic-0.329
enegger-0.329
acterial-0.326
Token made
Token ID925
Feature activation+9.10e-02

Pos logit contributions
 har+0.150
 whence+0.149
 Phant+0.149
 gib+0.147
nos+0.145
Neg logit contributions
 Caption-0.173
hement-0.164
odynamic-0.163
enegger-0.162
acterial-0.161
Token by
Token ID416
Feature activation-7.81e-02

Pos logit contributions
 har+0.063
 whence+0.063
 Phant+0.063
 gib+0.062
 hay+0.061
Neg logit contributions
 Caption-0.057
hement-0.053
odynamic-0.053
enegger-0.052
acterial-0.052
Token Prime
Token ID5537
Feature activation+2.93e-02

Pos logit contributions
 Caption+0.052
hement+0.049
odynamic+0.048
acterial+0.048
enegger+0.048
Neg logit contributions
 har-0.051
 whence-0.051
 Phant-0.051
 gib-0.050
nos-0.049
Token Minister
Token ID4139
Feature activation-1.52e-01

Pos logit contributions
 Phant+0.026
 whence+0.026
 har+0.026
 gib+0.025
 hay+0.025
Neg logit contributions
 Caption-0.013
hement-0.012
odynamic-0.012
acterial-0.012
enegger-0.012
Tokenais
Token ID15152
Feature activation+1.34e-01

Pos logit contributions
 Caption+0.182
hement+0.174
odynamic+0.172
enegger+0.171
+0.170
Neg logit contributions
 har-0.081
 whence-0.080
 Phant-0.077
ancies-0.077
nos-0.077
Token took
Token ID1718
Feature activation-5.58e-02

Pos logit contributions
 har+0.091
 whence+0.091
 Phant+0.090
 gib+0.090
 hay+0.087
Neg logit contributions
 Caption-0.086
hement-0.081
odynamic-0.081
acterial-0.080
enegger-0.079
Token aim
Token ID4031
Feature activation+3.94e-02

Pos logit contributions
 Caption+0.041
hement+0.039
odynamic+0.038
enegger+0.038
acterial+0.038
Neg logit contributions
 har-0.033
 whence-0.033
 Phant-0.033
 gib-0.032
 hay-0.031
Token at
Token ID379
Feature activation+1.16e-01

Pos logit contributions
 whence+0.026
 Phant+0.026
 har+0.026
 gib+0.025
nos+0.025
Neg logit contributions
 Caption-0.026
hement-0.025
odynamic-0.024
enegger-0.024
acterial-0.024
Token stars
Token ID5788
Feature activation+2.99e-01

Pos logit contributions
 whence+0.071
 Phant+0.071
 har+0.071
 gib+0.070
nos+0.068
Neg logit contributions
 Caption-0.083
hement-0.078
odynamic-0.077
enegger-0.076
acterial-0.076
Token such
Token ID884
Feature activation+4.32e-01

Pos logit contributions
 whence+0.159
 har+0.158
 Phant+0.158
 gib+0.156
nos+0.151
Neg logit contributions
 Caption-0.236
hement-0.224
odynamic-0.222
acterial-0.219
enegger-0.219
Token as
Token ID355
Feature activation-6.93e-02

Pos logit contributions
 har+0.301
 whence+0.295
 Phant+0.295
 gib+0.290
 hay+0.287
Neg logit contributions
 Caption-0.270
hement-0.269
odynamic-0.261
acterial-0.255
enegger-0.254
Token Robert
Token ID5199
Feature activation+2.53e-01

Pos logit contributions
 Caption+0.044
hement+0.041
odynamic+0.041
enegger+0.040
acterial+0.040
Neg logit contributions
 Phant-0.047
 whence-0.047
 har-0.047
 gib-0.047
nos-0.046
Token Down
Token ID5588
Feature activation-7.94e-03

Pos logit contributions
 Phant+0.175
 har+0.174
 whence+0.173
 gib+0.172
 hay+0.167
Neg logit contributions
 Caption-0.158
odynamic-0.150
hement-0.150
acterial-0.146
enegger-0.145
Tokeney
Token ID2959
Feature activation+4.43e-02

Pos logit contributions
 Caption+0.006
hement+0.005
odynamic+0.005
enegger+0.005
acterial+0.005
Neg logit contributions
 whence-0.005
 Phant-0.005
 har-0.005
 gib-0.005
nos-0.005
Token Jr
Token ID7504
Feature activation-1.22e-01

Pos logit contributions
 whence+0.033
 Phant+0.033
 har+0.033
 gib+0.033
nos+0.032
Neg logit contributions
 Caption-0.025
hement-0.023
odynamic-0.023
enegger-0.023
acterial-0.023
Token trolls
Token ID24727
Feature activation+5.05e-01

Pos logit contributions
 whence+0.130
 Phant+0.130
 har+0.130
 gib+0.128
nos+0.125
Neg logit contributions
 Caption-0.163
hement-0.154
odynamic-0.153
enegger-0.151
acterial-0.150
Token and
Token ID290
Feature activation+5.73e-02

Pos logit contributions
 whence+0.303
 har+0.302
 Phant+0.302
 gib+0.298
nos+0.291
Neg logit contributions
 Caption-0.364
hement-0.343
odynamic-0.339
enegger-0.336
acterial-0.335
Token ultimately
Token ID6165
Feature activation-1.69e-01

Pos logit contributions
 whence+0.032
 Phant+0.032
 har+0.032
 gib+0.032
nos+0.031
Neg logit contributions
 Caption-0.044
hement-0.041
odynamic-0.041
enegger-0.040
acterial-0.040
Token benefit
Token ID4414
Feature activation+2.69e-01

Pos logit contributions
 Caption+0.124
hement+0.117
odynamic+0.116
enegger+0.115
acterial+0.114
Neg logit contributions
 whence-0.099
 Phant-0.099
 har-0.098
 gib-0.097
nos-0.095
Token only
Token ID691
Feature activation-4.31e-02

Pos logit contributions
 whence+0.159
 Phant+0.159
 har+0.158
 gib+0.156
nos+0.153
Neg logit contributions
 Caption-0.196
hement-0.185
odynamic-0.183
enegger-0.181
acterial-0.180
Token trolls
Token ID24727
Feature activation+4.50e-01

Pos logit contributions
 Caption+0.032
hement+0.031
odynamic+0.031
enegger+0.030
acterial+0.030
Neg logit contributions
 Phant-0.024
 whence-0.024
 har-0.024
 gib-0.024
 hay-0.023
Token.
Token ID13
Feature activation+5.53e-02

Pos logit contributions
 har+0.281
 whence+0.281
 Phant+0.279
 gib+0.276
nos+0.269
Neg logit contributions
 Caption-0.315
hement-0.296
odynamic-0.292
acterial-0.290
enegger-0.290
Token
Token ID198
Feature activation+8.97e-02

Pos logit contributions
 whence+0.026
 Phant+0.026
 har+0.025
 gib+0.025
nos+0.024
Neg logit contributions
 Caption-0.047
hement-0.045
odynamic-0.045
enegger-0.044
acterial-0.044
Token
Token ID198
Feature activation+8.39e-02

Pos logit contributions
 whence+0.026
 Phant+0.026
 har+0.026
 gib+0.025
nos+0.024
Neg logit contributions
 Caption-0.092
hement-0.089
odynamic-0.088
enegger-0.088
acterial-0.087
TokenLook
Token ID8567
Feature activation-7.19e-02

Pos logit contributions
 whence+0.062
 Phant+0.062
 har+0.062
 gib+0.061
nos+0.061
Neg logit contributions
 Caption-0.048
hement-0.045
odynamic-0.045
enegger-0.044
acterial-0.044
Token into
Token ID656
Feature activation+5.88e-02

Pos logit contributions
 Caption+0.050
hement+0.047
odynamic+0.046
acterial+0.046
enegger+0.046
Neg logit contributions
 whence-0.045
 Phant-0.045
 har-0.045
 gib-0.044
nos-0.043
Token example
Token ID1672
Feature activation+5.74e-02

Pos logit contributions
 Caption+0.044
hement+0.041
odynamic+0.041
enegger+0.040
acterial+0.040
Neg logit contributions
 whence-0.033
 Phant-0.032
 har-0.032
 gib-0.032
nos-0.031
Token,
Token ID11
Feature activation-3.60e-02

Pos logit contributions
 whence+0.037
 Phant+0.037
 har+0.037
 gib+0.036
nos+0.036
Neg logit contributions
 Caption-0.039
hement-0.036
odynamic-0.036
enegger-0.036
acterial-0.036
Token a
Token ID257
Feature activation-4.49e-02

Pos logit contributions
 Caption+0.025
hement+0.024
odynamic+0.024
enegger+0.024
acterial+0.023
Neg logit contributions
 har-0.022
 whence-0.022
 Phant-0.022
 gib-0.022
 hay-0.022
Token physician
Token ID14325
Feature activation-8.22e-02

Pos logit contributions
 Caption+0.029
hement+0.028
odynamic+0.027
enegger+0.027
utherland+0.027
Neg logit contributions
 har-0.030
 whence-0.030
 Phant-0.030
 gib-0.030
 hay-0.029
Token would
Token ID561
Feature activation-4.84e-02

Pos logit contributions
 Caption+0.052
hement+0.049
odynamic+0.048
enegger+0.048
acterial+0.047
Neg logit contributions
 har-0.057
 whence-0.057
 Phant-0.056
 gib-0.056
 hay-0.055
Token employ
Token ID1873
Feature activation+4.38e-01

Pos logit contributions
 Caption+0.035
hement+0.033
odynamic+0.032
enegger+0.032
utherland+0.032
Neg logit contributions
 har-0.030
 whence-0.029
 Phant-0.029
 gib-0.029
 hay-0.028
Token blood
Token ID2910
Feature activation-2.42e-01

Pos logit contributions
 har+0.245
 Phant+0.243
 whence+0.243
 gib+0.241
 hay+0.235
Neg logit contributions
 Caption-0.334
hement-0.317
odynamic-0.314
enegger-0.312
utherland-0.308
Tokenlet
Token ID1616
Feature activation-9.68e-02

Pos logit contributions
 Caption+0.148
hement+0.138
odynamic+0.135
enegger+0.135
acterial+0.133
Neg logit contributions
 har-0.172
 whence-0.172
 Phant-0.170
 gib-0.169
 hay-0.166
Tokenting
Token ID889
Feature activation+1.57e-01

Pos logit contributions
 Caption+0.106
hement+0.102
odynamic+0.101
enegger+0.100
acterial+0.100
Neg logit contributions
 har-0.023
 whence-0.022
 Phant-0.022
 gib-0.021
 hay-0.020
Token,
Token ID11
Feature activation-3.39e-02

Pos logit contributions
 whence+0.099
 har+0.099
 Phant+0.098
 gib+0.097
 hay+0.095
Neg logit contributions
 Caption-0.108
hement-0.102
odynamic-0.101
enegger-0.100
acterial-0.099
Token using
Token ID1262
Feature activation+2.46e-01

Pos logit contributions
 Caption+0.026
hement+0.025
odynamic+0.025
enegger+0.025
acterial+0.024
Neg logit contributions
 whence-0.018
 har-0.018
 Phant-0.018
 gib-0.018
nos-0.018
Token former
Token ID1966
Feature activation-4.27e-02

Pos logit contributions
 whence+0.030
 Phant+0.030
 har+0.030
 gib+0.030
 hay+0.029
Neg logit contributions
 Caption-0.039
hement-0.037
odynamic-0.037
acterial-0.036
enegger-0.036
Token Wat
Token ID12242
Feature activation-1.79e-01

Pos logit contributions
 Caption+0.031
hement+0.029
odynamic+0.029
enegger+0.028
acterial+0.028
Neg logit contributions
 whence-0.026
 Phant-0.026
 har-0.025
 gib-0.025
nos-0.025
Tokenerville
Token ID33487
Feature activation+3.39e-01

Pos logit contributions
 Caption+0.120
hement+0.113
odynamic+0.112
enegger+0.111
acterial+0.110
Neg logit contributions
 whence-0.116
 Phant-0.116
 har-0.116
 gib-0.115
nos-0.113
Token Hotel
Token ID12696
Feature activation+9.79e-02

Pos logit contributions
 whence+0.196
 Phant+0.196
 har+0.195
 gib+0.193
nos+0.189
Neg logit contributions
 Caption-0.250
hement-0.236
odynamic-0.234
enegger-0.232
acterial-0.231
Token (
Token ID357
Feature activation+3.75e-02

Pos logit contributions
 whence+0.063
 Phant+0.063
 har+0.062
 gib+0.062
nos+0.061
Neg logit contributions
 Caption-0.066
hement-0.062
odynamic-0.062
enegger-0.061
acterial-0.061
Tokennow
Token ID2197
Feature activation+4.72e-01

Pos logit contributions
 whence+0.022
 Phant+0.022
 har+0.022
 gib+0.022
nos+0.021
Neg logit contributions
 Caption-0.028
hement-0.026
odynamic-0.026
enegger-0.025
acterial-0.025
Token the
Token ID262
Feature activation+2.15e-02

Pos logit contributions
 Phant+0.273
 whence+0.272
 har+0.272
 gib+0.269
nos+0.261
Neg logit contributions
 Caption-0.349
hement-0.332
odynamic-0.328
enegger-0.325
acterial-0.324
Token Wat
Token ID12242
Feature activation-2.60e-01

Pos logit contributions
 Phant+0.013
 whence+0.013
 har+0.013
 gib+0.013
 hay+0.013
Neg logit contributions
 Caption-0.015
hement-0.014
odynamic-0.014
acterial-0.014
enegger-0.014
Tokenerville
Token ID33487
Feature activation+2.99e-01

Pos logit contributions
 Caption+0.227
hement+0.217
odynamic+0.215
enegger+0.213
acterial+0.212
Neg logit contributions
 whence-0.114
 Phant-0.114
 har-0.114
 gib-0.112
nos-0.108
Token Tavern
Token ID32693
Feature activation+2.62e-01

Pos logit contributions
 Phant+0.179
 whence+0.179
 har+0.178
 gib+0.176
nos+0.173
Neg logit contributions
 Caption-0.214
hement-0.203
odynamic-0.201
acterial-0.198
enegger-0.198
Token)
Token ID8
Feature activation-3.75e-01

Pos logit contributions
 whence+0.241
 Phant+0.240
 har+0.240
 gib+0.238
nos+0.235
Neg logit contributions
 Caption-0.105
hement-0.095
odynamic-0.093
acterial-0.092
enegger-0.090
Token his
Token ID465
Feature activation+8.57e-02

Pos logit contributions
 Caption+0.052
hement+0.050
odynamic+0.049
acterial+0.049
enegger+0.049
Neg logit contributions
 Phant-0.047
 har-0.047
 whence-0.047
 gib-0.047
nos-0.045
Token grandmother
Token ID18410
Feature activation+5.20e-02

Pos logit contributions
 Phant+0.058
 har+0.058
 whence+0.058
 gib+0.057
 hay+0.056
Neg logit contributions
 Caption-0.054
hement-0.052
odynamic-0.051
acterial-0.051
enegger-0.050
Token motivated
Token ID13338
Feature activation+7.56e-02

Pos logit contributions
 whence+0.035
 har+0.035
 Phant+0.035
 gib+0.034
 hay+0.034
Neg logit contributions
 Caption-0.033
hement-0.032
odynamic-0.031
acterial-0.031
enegger-0.031
Token him
Token ID683
Feature activation+1.44e-03

Pos logit contributions
 whence+0.045
 har+0.045
 Phant+0.045
 gib+0.044
nos+0.043
Neg logit contributions
 Caption-0.055
hement-0.052
odynamic-0.052
enegger-0.052
acterial-0.051
Token to
Token ID284
Feature activation+9.13e-02

Pos logit contributions
 whence+0.001
 Phant+0.001
 har+0.001
 gib+0.001
nos+0.001
Neg logit contributions
 Caption-0.001
hement-0.001
odynamic-0.001
enegger-0.001
acterial-0.001
Token do
Token ID466
Feature activation+1.78e-01

Pos logit contributions
 whence+0.048
 Phant+0.047
 har+0.047
 gib+0.047
nos+0.045
Neg logit contributions
 Caption-0.073
hement-0.069
odynamic-0.069
enegger-0.068
acterial-0.068
Token his
Token ID465
Feature activation+6.45e-02

Pos logit contributions
 whence+0.078
 Phant+0.077
 har+0.077
 gib+0.076
nos+0.074
Neg logit contributions
 Caption-0.156
hement-0.150
odynamic-0.148
enegger-0.147
acterial-0.147
Token best
Token ID1266
Feature activation+4.78e-02

Pos logit contributions
 har+0.044
 Phant+0.044
 whence+0.044
 gib+0.044
 hay+0.043
Neg logit contributions
 Caption-0.040
hement-0.038
odynamic-0.038
enegger-0.037
acterial-0.037
Token and
Token ID290
Feature activation-7.68e-02

Pos logit contributions
 whence+0.031
 Phant+0.031
 har+0.031
 gib+0.030
nos+0.030
Neg logit contributions
 Caption-0.032
hement-0.030
odynamic-0.030
enegger-0.030
acterial-0.030
Token
Token ID564
Feature activation+2.15e-01

Pos logit contributions
 Caption+0.064
hement+0.061
odynamic+0.060
enegger+0.060
acterial+0.060
Neg logit contributions
 whence-0.037
 Phant-0.037
 har-0.037
 gib-0.037
nos-0.036
Token
Token ID250
Feature activation-1.21e-04

Pos logit contributions
 har+0.166
 whence+0.165
 Phant+0.165
 gib+0.162
 hay+0.161
Neg logit contributions
 Caption-0.117
hement-0.111
odynamic-0.109
enegger-0.107
utherland-0.106
Token hand
Token ID1021
Feature activation-2.80e-01

Pos logit contributions
 Caption+0.032
hement+0.031
enegger+0.031
odynamic+0.030
acterial+0.030
Neg logit contributions
 har-0.037
 whence-0.037
 Phant-0.036
 gib-0.036
 hay-0.036
Token,
Token ID11
Feature activation+1.44e-01

Pos logit contributions
 Caption+0.202
hement+0.195
odynamic+0.190
acterial+0.188
enegger+0.188
Neg logit contributions
 har-0.169
 whence-0.167
 Phant-0.164
 gib-0.163
nos-0.160
Token emphasized
Token ID20047
Feature activation+2.38e-01

Pos logit contributions
 whence+0.090
 har+0.089
 Phant+0.089
 gib+0.088
nos+0.086
Neg logit contributions
 Caption-0.101
hement-0.094
odynamic-0.094
enegger-0.093
acterial-0.092
Token the
Token ID262
Feature activation+1.45e-01

Pos logit contributions
 har+0.141
 whence+0.141
 Phant+0.140
 gib+0.139
 hay+0.136
Neg logit contributions
 Caption-0.173
hement-0.163
odynamic-0.162
enegger-0.161
acterial-0.159
Token use
Token ID779
Feature activation+2.60e-01

Pos logit contributions
 har+0.091
 whence+0.090
 Phant+0.090
 gib+0.089
 hay+0.087
Neg logit contributions
 Caption-0.102
hement-0.096
odynamic-0.094
enegger-0.094
utherland-0.093
Token of
Token ID286
Feature activation+3.31e-01

Pos logit contributions
 har+0.102
 whence+0.102
 Phant+0.101
 gib+0.098
nos+0.096
Neg logit contributions
 Caption-0.241
hement-0.233
odynamic-0.229
enegger-0.228
acterial-0.227
Token plant
Token ID4618
Feature activation+3.64e-01

Pos logit contributions
 har+0.192
 whence+0.192
 Phant+0.191
 gib+0.189
 hay+0.185
Neg logit contributions
 Caption-0.245
hement-0.232
odynamic-0.229
enegger-0.228
acterial-0.226
Token-
Token ID12
Feature activation+1.77e-01

Pos logit contributions
 har+0.204
 whence+0.202
 Phant+0.200
 gib+0.199
 hay+0.194
Neg logit contributions
 Caption-0.278
hement-0.263
odynamic-0.262
enegger-0.258
utherland-0.257
Tokenbased
Token ID3106
Feature activation+4.66e-01

Pos logit contributions
 whence+0.066
 har+0.066
 Phant+0.066
 gib+0.064
nos+0.063
Neg logit contributions
 Caption-0.167
hement-0.160
odynamic-0.159
enegger-0.158
acterial-0.156
Token remedies
Token ID29604
Feature activation+3.20e-01

Pos logit contributions
 har+0.251
 whence+0.250
 Phant+0.248
 gib+0.246
 hay+0.240
Neg logit contributions
 Caption-0.366
hement-0.347
odynamic-0.344
enegger-0.342
acterial-0.339
Token instead
Token ID2427
Feature activation+1.87e-01

Pos logit contributions
 whence+0.191
 har+0.190
 Phant+0.189
 gib+0.187
nos+0.183
Neg logit contributions
 Caption-0.232
hement-0.220
odynamic-0.217
enegger-0.215
acterial-0.214
Token of
Token ID286
Feature activation+4.66e-03

Pos logit contributions
 Caption+0.282
hement+0.268
odynamic+0.266
acterial+0.263
enegger+0.262
Neg logit contributions
 whence-0.145
 har-0.143
 Phant-0.142
 gib-0.141
nos-0.138
Token K
Token ID509
Feature activation-5.74e-02

Pos logit contributions
 Phant+0.003
 whence+0.003
 har+0.003
 gib+0.003
nos+0.003
Neg logit contributions
 Caption-0.003
hement-0.003
odynamic-0.003
enegger-0.003
acterial-0.003
Tokenins
Token ID1040
Feature activation-2.86e-01

Pos logit contributions
 Caption+0.038
hement+0.036
odynamic+0.035
enegger+0.035
acterial+0.035
Neg logit contributions
 whence-0.037
 har-0.037
 Phant-0.037
 gib-0.037
nos-0.036
Tokenba
Token ID7012
Feature activation+1.10e-01

Pos logit contributions
 Caption+0.184
hement+0.172
odynamic+0.170
enegger+0.168
acterial+0.167
Neg logit contributions
 whence-0.194
 Phant-0.193
 har-0.193
 gib-0.191
nos-0.188
Tokenile
Token ID576
Feature activation+1.43e-01

Pos logit contributions
 whence+0.080
 Phant+0.080
 har+0.080
 gib+0.079
nos+0.078
Neg logit contributions
 Caption-0.065
hement-0.061
odynamic-0.060
enegger-0.059
acterial-0.059
Token
Token ID198
Feature activation+2.51e-02

Pos logit contributions
 whence+0.052
 har+0.051
 Phant+0.051
 gib+0.050
 hay+0.048
Neg logit contributions
 Caption-0.137
hement-0.131
odynamic-0.129
enegger-0.127
acterial-0.127
Token
Token ID198
Feature activation+2.06e-02

Pos logit contributions
 whence+0.007
 Phant+0.007
 har+0.007
 gib+0.007
nos+0.007
Neg logit contributions
 Caption-0.026
hement-0.025
odynamic-0.025
enegger-0.025
acterial-0.025
TokenG
Token ID38
Feature activation-1.83e-02

Pos logit contributions
 Phant+0.014
 whence+0.014
 har+0.014
 gib+0.014
nos+0.014
Neg logit contributions
 Caption-0.013
hement-0.012
odynamic-0.012
enegger-0.012
acterial-0.012
Tokeniant
Token ID3014
Feature activation+1.20e-01

Pos logit contributions
 Caption+0.013
hement+0.012
odynamic+0.012
enegger+0.012
acterial+0.012
Neg logit contributions
 whence-0.011
 Phant-0.011
 har-0.011
 gib-0.011
nos-0.011
Token
Token ID447
Feature activation-8.14e-02

Pos logit contributions
 Phant+0.074
 whence+0.074
 har+0.074
 gib+0.073
nos+0.071
Neg logit contributions
 Caption-0.084
hement-0.079
odynamic-0.078
enegger-0.077
acterial-0.077
Token
Token ID247
Feature activation+1.75e-02

Pos logit contributions
 Caption+0.052
hement+0.049
odynamic+0.048
enegger+0.048
acterial+0.047
Neg logit contributions
 whence-0.055
 Phant-0.055
 har-0.055
 gib-0.054
nos-0.053
Token).
Token ID737
Feature activation-1.98e-01

Pos logit contributions
 whence+0.106
 Phant+0.106
 har+0.106
 gib+0.105
nos+0.103
Neg logit contributions
 Caption-0.038
hement-0.034
odynamic-0.033
enegger-0.032
acterial-0.032
Token Even
Token ID3412
Feature activation+7.92e-02

Pos logit contributions
 Caption+0.174
hement+0.165
odynamic+0.164
enegger+0.162
acterial+0.161
Neg logit contributions
 whence-0.087
 Phant-0.086
 har-0.086
 gib-0.085
nos-0.083
Token at
Token ID379
Feature activation+1.22e-01

Pos logit contributions
 whence+0.046
 Phant+0.046
 har+0.045
 gib+0.045
nos+0.044
Neg logit contributions
 Caption-0.059
hement-0.056
odynamic-0.055
enegger-0.054
acterial-0.054
Token this
Token ID428
Feature activation+1.78e-01

Pos logit contributions
 whence+0.063
 Phant+0.062
 har+0.062
 gib+0.061
nos+0.060
Neg logit contributions
 Caption-0.098
hement-0.093
odynamic-0.092
enegger-0.091
acterial-0.091
Token rapid
Token ID5801
Feature activation-1.17e-01

Pos logit contributions
 whence+0.089
 har+0.089
 Phant+0.089
 gib+0.088
 hay+0.085
Neg logit contributions
 Caption-0.145
hement-0.138
odynamic-0.137
enegger-0.136
acterial-0.135
Token speed
Token ID2866
Feature activation+2.79e-01

Pos logit contributions
 Caption+0.096
hement+0.091
odynamic+0.090
acterial+0.089
enegger+0.089
Neg logit contributions
 whence-0.059
 Phant-0.059
 har-0.058
 gib-0.057
nos-0.057
Token,
Token ID11
Feature activation+1.26e-01

Pos logit contributions
 whence+0.175
 Phant+0.175
 har+0.174
 gib+0.172
nos+0.170
Neg logit contributions
 Caption-0.193
hement-0.181
odynamic-0.179
enegger-0.176
acterial-0.176
Token the
Token ID262
Feature activation+1.45e-01

Pos logit contributions
 whence+0.071
 Phant+0.071
 har+0.071
 gib+0.070
nos+0.068
Neg logit contributions
 Caption-0.095
hement-0.090
odynamic-0.089
enegger-0.088
acterial-0.088
Token solar
Token ID6591
Feature activation-1.13e-01

Pos logit contributions
 har+0.065
 whence+0.064
 gib+0.064
 Phant+0.064
 hay+0.061
Neg logit contributions
 Caption-0.127
hement-0.121
odynamic-0.120
enegger-0.119
inical-0.119
Token system
Token ID1080
Feature activation-3.82e-02

Pos logit contributions
 Caption+0.094
hement+0.090
odynamic+0.089
enegger+0.088
acterial+0.087
Neg logit contributions
 whence-0.056
 har-0.055
 Phant-0.055
 gib-0.055
nos-0.053
Token would
Token ID561
Feature activation-3.24e-02

Pos logit contributions
 Caption+0.024
hement+0.022
odynamic+0.022
enegger+0.022
acterial+0.022
Neg logit contributions
 har-0.027
 whence-0.027
 Phant-0.026
 gib-0.026
 hay-0.026
Token kilometers
Token ID18212
Feature activation+8.52e-02

Pos logit contributions
 whence+0.093
 Phant+0.092
 har+0.091
 gib+0.090
nos+0.088
Neg logit contributions
 Caption-0.177
hement-0.169
odynamic-0.167
acterial-0.165
enegger-0.165
Token (
Token ID357
Feature activation-4.84e-02

Pos logit contributions
 whence+0.054
 Phant+0.054
 har+0.053
 gib+0.053
nos+0.052
Neg logit contributions
 Caption-0.059
hement-0.055
odynamic-0.055
enegger-0.054
acterial-0.054
Token1
Token ID16
Feature activation+9.54e-02

Pos logit contributions
 Caption+0.039
hement+0.037
odynamic+0.037
enegger+0.036
acterial+0.036
Neg logit contributions
 whence-0.025
 Phant-0.025
 har-0.025
 gib-0.024
nos-0.024
Token,
Token ID11
Feature activation+4.84e-02

Pos logit contributions
 whence+0.061
 Phant+0.061
 har+0.061
 gib+0.060
nos+0.059
Neg logit contributions
 Caption-0.064
hement-0.060
odynamic-0.060
enegger-0.059
acterial-0.059
Token200
Token ID2167
Feature activation+2.55e-01

Pos logit contributions
 whence+0.020
 Phant+0.019
 har+0.019
 gib+0.019
nos+0.018
Neg logit contributions
 Caption-0.044
hement-0.042
odynamic-0.042
enegger-0.042
acterial-0.042
Token miles
Token ID4608
Feature activation+2.37e-01

Pos logit contributions
 Phant+0.066
 whence+0.066
 har+0.065
 gib+0.064
nos+0.062
Neg logit contributions
 Caption-0.269
hement-0.258
odynamic-0.256
acterial-0.254
enegger-0.253
Token),
Token ID828
Feature activation-2.78e-01

Pos logit contributions
 whence+0.237
 Phant+0.236
 har+0.236
 gib+0.234
nos+0.230
Neg logit contributions
 Caption-0.077
hement-0.067
odynamic-0.065
enegger-0.064
acterial-0.063
Token the
Token ID262
Feature activation+7.41e-03

Pos logit contributions
 Caption+0.204
hement+0.193
odynamic+0.191
enegger+0.190
acterial+0.189
Neg logit contributions
 whence-0.162
 Phant-0.162
 har-0.161
 gib-0.160
nos-0.156
Token F
Token ID376
Feature activation-8.17e-02

Pos logit contributions
 Phant+0.005
 har+0.005
 whence+0.005
 gib+0.004
nos+0.004
Neg logit contributions
 Caption-0.005
hement-0.005
odynamic-0.005
enegger-0.005
acterial-0.005
Token chain
Token ID6333
Feature activation-2.40e-01

Pos logit contributions
 Caption+0.057
hement+0.054
odynamic+0.053
enegger+0.052
acterial+0.052
Neg logit contributions
 whence-0.051
 Phant-0.051
 har-0.050
 gib-0.050
nos-0.049
Token
Token ID447
Feature activation-7.65e-02

Pos logit contributions
 Caption+0.165
hement+0.154
odynamic+0.153
enegger+0.150
acterial+0.150
Neg logit contributions
 whence-0.153
 Phant-0.152
 har-0.151
 gib-0.150
nos-0.147
Token regions
Token ID7652
Feature activation+5.35e-02

Pos logit contributions
 har+0.122
 whence+0.122
 Phant+0.121
 gib+0.120
 hay+0.117
Neg logit contributions
 Caption-0.138
hement-0.130
odynamic-0.128
enegger-0.128
acterial-0.127
Token are
Token ID389
Feature activation-1.85e-01

Pos logit contributions
 whence+0.038
 har+0.038
 Phant+0.038
 gib+0.038
nos+0.037
Neg logit contributions
 Caption-0.032
hement-0.031
odynamic-0.030
acterial-0.030
enegger-0.030
Token responsible
Token ID4497
Feature activation+1.49e-01

Pos logit contributions
 Caption+0.142
hement+0.135
odynamic+0.133
enegger+0.132
acterial+0.132
Neg logit contributions
 whence-0.102
 Phant-0.102
 har-0.102
 gib-0.101
nos-0.098
Token for
Token ID329
Feature activation-1.87e-02

Pos logit contributions
 whence+0.098
 Phant+0.097
 har+0.097
 gib+0.096
nos+0.094
Neg logit contributions
 Caption-0.099
hement-0.094
odynamic-0.092
enegger-0.091
acterial-0.091
Token this
Token ID428
Feature activation+3.93e-02

Pos logit contributions
 Caption+0.014
hement+0.014
odynamic+0.014
enegger+0.013
acterial+0.013
Neg logit contributions
 whence-0.010
 har-0.010
 Phant-0.010
 gib-0.010
 hay-0.010
Token.
Token ID13
Feature activation-2.99e-02

Pos logit contributions
 har+0.025
 whence+0.025
 Phant+0.025
 gib+0.025
 hay+0.025
Neg logit contributions
 Caption-0.027
hement-0.025
odynamic-0.025
enegger-0.025
acterial-0.024
Token
Token ID447
Feature activation-5.27e-02

Pos logit contributions
 Caption+0.020
hement+0.019
odynamic+0.018
enegger+0.018
acterial+0.018
Neg logit contributions
 whence-0.020
 Phant-0.020
 har-0.020
 gib-0.019
nos-0.019
Token
Token ID247
Feature activation+8.84e-02

Pos logit contributions
 Caption+0.020
hement+0.018
odynamic+0.017
+0.017
enegger+0.017
Neg logit contributions
 whence-0.049
 Phant-0.049
 har-0.049
 gib-0.049
nos-0.048
Token Nobody
Token ID15658
Feature activation-1.94e-02

Pos logit contributions
 whence+0.052
 Phant+0.052
 har+0.052
 gib+0.051
nos+0.050
Neg logit contributions
 Caption-0.064
hement-0.061
odynamic-0.061
acterial-0.060
enegger-0.060
Token is
Token ID318
Feature activation-4.56e-02

Pos logit contributions
 Caption+0.013
hement+0.012
odynamic+0.012
acterial+0.012
enegger+0.012
Neg logit contributions
 har-0.013
 whence-0.013
 Phant-0.013
 gib-0.012
nos-0.012
Token really
Token ID1107
Feature activation+7.38e-02

Pos logit contributions
 Caption+0.035
hement+0.034
odynamic+0.033
enegger+0.033
acterial+0.033
Neg logit contributions
 whence-0.025
 har-0.025
 Phant-0.025
 gib-0.024
nos-0.024
Token have
Token ID423
Feature activation-9.19e-02

Pos logit contributions
 whence+0.040
 Phant+0.040
 har+0.040
 gib+0.039
nos+0.038
Neg logit contributions
 Caption-0.040
hement-0.038
odynamic-0.037
enegger-0.037
acterial-0.037
Token given
Token ID1813
Feature activation+6.24e-02

Pos logit contributions
 Caption+0.072
hement+0.068
odynamic+0.067
enegger+0.067
acterial+0.066
Neg logit contributions
 whence-0.049
 Phant-0.049
 har-0.049
 gib-0.048
nos-0.047
Token Brah
Token ID36415
Feature activation+2.16e-01

Pos logit contributions
 whence+0.029
 Phant+0.029
 har+0.029
 gib+0.029
 hay+0.028
Neg logit contributions
 Caption-0.053
hement-0.051
odynamic-0.050
enegger-0.050
acterial-0.050
Tokenms
Token ID907
Feature activation+2.83e-01

Pos logit contributions
 whence+0.081
 har+0.081
 Phant+0.080
 gib+0.079
nos+0.077
Neg logit contributions
 Caption-0.205
hement-0.197
odynamic-0.194
enegger-0.193
acterial-0.192
Token the
Token ID262
Feature activation+1.68e-02

Pos logit contributions
 whence+0.153
 Phant+0.152
 har+0.152
 gib+0.150
nos+0.146
Neg logit contributions
 Caption-0.221
hement-0.209
odynamic-0.207
enegger-0.205
acterial-0.205
Token bow
Token ID9563
Feature activation-2.28e-02

Pos logit contributions
 whence+0.010
 Phant+0.010
 har+0.010
 gib+0.010
nos+0.010
Neg logit contributions
 Caption-0.012
hement-0.011
odynamic-0.011
enegger-0.011
acterial-0.011
Token in
Token ID287
Feature activation-1.03e-01

Pos logit contributions
 Caption+0.015
hement+0.014
odynamic+0.014
acterial+0.014
enegger+0.014
Neg logit contributions
 whence-0.015
 har-0.015
 Phant-0.015
 gib-0.014
 hay-0.014
Token person
Token ID1048
Feature activation-6.24e-02

Pos logit contributions
 Caption+0.071
hement+0.067
odynamic+0.066
enegger+0.065
acterial+0.065
Neg logit contributions
 whence-0.065
 Phant-0.065
 har-0.065
 gib-0.064
 hay-0.063
Token but
Token ID475
Feature activation+2.84e-01

Pos logit contributions
 Caption+0.044
hement+0.041
odynamic+0.041
enegger+0.040
acterial+0.040
Neg logit contributions
 whence-0.039
 har-0.039
 Phant-0.039
 gib-0.038
 hay-0.038
Token their
Token ID511
Feature activation+1.76e-01

Pos logit contributions
 whence+0.165
 Phant+0.165
 har+0.165
 gib+0.162
nos+0.158
Neg logit contributions
 Caption-0.211
hement-0.199
odynamic-0.198
enegger-0.196
acterial-0.195
Token collected
Token ID7723
Feature activation-2.52e-01

Pos logit contributions
 har+0.123
 whence+0.123
 Phant+0.122
 gib+0.122
 hay+0.119
Neg logit contributions
 Caption-0.110
hement-0.103
odynamic-0.102
enegger-0.101
acterial-0.101
Token
Token ID447
Feature activation-1.88e-01

Pos logit contributions
 Caption+0.042
hement+0.041
odynamic+0.039
enegger+0.039
acterial+0.039
Neg logit contributions
 har-0.061
 whence-0.060
 Phant-0.060
nos-0.059
 gib-0.059
Token
Token ID247
Feature activation+2.06e-02

Pos logit contributions
 Caption+0.150
enegger+0.147
hement+0.147
odynamic+0.145
utherland+0.143
Neg logit contributions
 har-0.096
 whence-0.096
 Phant-0.094
 gib-0.092
nos-0.091
Tokent
Token ID83
Feature activation-6.37e-02

Pos logit contributions
 whence+0.007
 har+0.007
 Phant+0.007
 gib+0.007
nos+0.007
Neg logit contributions
 Caption-0.020
hement-0.019
odynamic-0.019
enegger-0.019
acterial-0.018
Token mean
Token ID1612
Feature activation-1.32e-01

Pos logit contributions
 Caption+0.046
hement+0.043
odynamic+0.043
enegger+0.043
acterial+0.042
Neg logit contributions
 whence-0.038
 har-0.038
 Phant-0.038
 gib-0.038
nos-0.037
Token you
Token ID345
Feature activation-9.20e-02

Pos logit contributions
 Caption+0.100
hement+0.095
odynamic+0.094
enegger+0.093
acterial+0.093
Neg logit contributions
 whence-0.074
 har-0.074
 Phant-0.074
 gib-0.073
nos-0.071
Token can
Token ID460
Feature activation-3.15e-01

Pos logit contributions
 Caption+0.049
hement+0.045
odynamic+0.045
enegger+0.044
acterial+0.043
Neg logit contributions
 har-0.073
 Phant-0.072
 whence-0.072
 gib-0.072
nos-0.071
Token
Token ID447
Feature activation-1.48e-01

Pos logit contributions
 Caption+0.212
hement+0.200
odynamic+0.198
enegger+0.195
acterial+0.194
Neg logit contributions
 whence-0.203
 Phant-0.203
 har-0.202
 gib-0.200
nos-0.197
Token
Token ID247
Feature activation+1.76e-02

Pos logit contributions
 Caption+0.120
hement+0.117
enegger+0.117
odynamic+0.115
utherland+0.114
Neg logit contributions
 har-0.075
 whence-0.074
 Phant-0.073
 gib-0.071
nos-0.070
Tokent
Token ID83
Feature activation-3.34e-02

Pos logit contributions
 whence+0.007
 Phant+0.006
 har+0.006
 gib+0.006
nos+0.006
Neg logit contributions
 Caption-0.017
hement-0.016
odynamic-0.016
enegger-0.016
acterial-0.016
Token look
Token ID804
Feature activation-1.29e-01

Pos logit contributions
 Caption+0.027
hement+0.026
odynamic+0.026
enegger+0.025
acterial+0.025
Neg logit contributions
 whence-0.017
 Phant-0.017
 har-0.017
 gib-0.017
nos-0.016
Token beyond
Token ID3675
Feature activation+3.72e-02

Pos logit contributions
 Caption+0.086
hement+0.082
odynamic+0.080
enegger+0.079
acterial+0.079
Neg logit contributions
 whence-0.084
 Phant-0.084
 har-0.084
 gib-0.083
 hay-0.081
Token excitement
Token ID14067
Feature activation-2.32e-01

Pos logit contributions
 Caption+0.015
hement+0.014
odynamic+0.014
enegger+0.014
acterial+0.014
Neg logit contributions
 har-0.014
 Phant-0.014
 whence-0.014
 gib-0.014
 hay-0.013
Token and
Token ID290
Feature activation-8.19e-02

Pos logit contributions
 Caption+0.160
hement+0.150
odynamic+0.149
enegger+0.147
acterial+0.146
Neg logit contributions
 whence-0.146
 Phant-0.145
 har-0.145
 gib-0.143
nos-0.141
Token the
Token ID262
Feature activation-3.07e-02

Pos logit contributions
 Caption+0.061
hement+0.057
odynamic+0.057
enegger+0.056
acterial+0.056
Neg logit contributions
 whence-0.048
 har-0.048
 Phant-0.048
 gib-0.047
 hay-0.046
Token immersion
Token ID36375
Feature activation+2.35e-03

Pos logit contributions
 Caption+0.021
hement+0.019
odynamic+0.019
enegger+0.019
acterial+0.019
Neg logit contributions
 har-0.020
 Phant-0.020
 whence-0.020
 gib-0.020
 hay-0.019
Token
Token ID784
Feature activation-2.03e-02

Pos logit contributions
 whence+0.001
 Phant+0.001
 har+0.001
 gib+0.001
nos+0.001
Neg logit contributions
 Caption-0.002
hement-0.002
odynamic-0.002
enegger-0.002
acterial-0.002
Token it
Token ID340
Feature activation-4.86e-02

Pos logit contributions
 Caption+0.015
hement+0.014
odynamic+0.014
enegger+0.014
acterial+0.014
Neg logit contributions
 whence-0.012
 Phant-0.012
 har-0.012
 gib-0.011
nos-0.011
Token is
Token ID318
Feature activation-9.15e-02

Pos logit contributions
 Caption+0.028
hement+0.026
odynamic+0.026
enegger+0.025
acterial+0.025
Neg logit contributions
 har-0.036
 whence-0.036
 Phant-0.036
 gib-0.036
 hay-0.035
Token your
Token ID534
Feature activation+8.20e-02

Pos logit contributions
 Caption+0.060
hement+0.056
odynamic+0.056
enegger+0.055
acterial+0.055
Neg logit contributions
 har-0.061
 whence-0.061
 Phant-0.061
 gib-0.060
 hay-0.059
Token job
Token ID1693
Feature activation-1.56e-02

Pos logit contributions
 har+0.057
 Phant+0.056
 whence+0.056
 gib+0.056
 hay+0.054
Neg logit contributions
 Caption-0.052
hement-0.049
odynamic-0.048
enegger-0.048
acterial-0.048
Token to
Token ID284
Feature activation+7.88e-02

Pos logit contributions
 Caption+0.011
hement+0.010
odynamic+0.010
enegger+0.010
acterial+0.010
Neg logit contributions
 Phant-0.010
 whence-0.010
 har-0.010
 gib-0.009
nos-0.009
Token provide
Token ID2148
Feature activation+1.93e-03

Pos logit contributions
 whence+0.042
 Phant+0.042
 har+0.042
 gib+0.041
nos+0.040
Neg logit contributions
 Caption-0.062
hement-0.059
odynamic-0.058
enegger-0.058
acterial-0.058
Tokenmillion
Token ID14100
Feature activation+7.20e-02

Pos logit contributions
 Caption+0.053
hement+0.051
odynamic+0.050
acterial+0.050
enegger+0.050
Neg logit contributions
 whence-0.034
 Phant-0.033
 har-0.033
 gib-0.033
nos-0.032
Token airport
Token ID9003
Feature activation+4.98e-02

Pos logit contributions
 whence+0.043
 Phant+0.043
 har+0.043
 gib+0.042
nos+0.041
Neg logit contributions
 Caption-0.052
hement-0.049
odynamic-0.049
enegger-0.048
acterial-0.048
Token opened
Token ID4721
Feature activation-1.85e-01

Pos logit contributions
 whence+0.038
 Phant+0.038
 har+0.038
 gib+0.037
nos+0.037
Neg logit contributions
 Caption-0.028
hement-0.026
odynamic-0.026
enegger-0.025
acterial-0.025
Token in
Token ID287
Feature activation-1.53e-01

Pos logit contributions
 Caption+0.108
hement+0.102
odynamic+0.101
enegger+0.100
acterial+0.099
Neg logit contributions
 whence-0.135
 har-0.135
 Phant-0.134
 gib-0.133
 hay-0.130
Token late
Token ID2739
Feature activation-1.98e-01

Pos logit contributions
 Caption+0.098
hement+0.093
odynamic+0.092
enegger+0.091
acterial+0.090
Neg logit contributions
 whence-0.104
 Phant-0.104
 har-0.104
 gib-0.102
nos-0.100
Token 2007
Token ID4343
Feature activation-1.33e-01

Pos logit contributions
 Caption+0.138
hement+0.131
odynamic+0.130
enegger+0.128
acterial+0.127
Neg logit contributions
 whence-0.123
 Phant-0.123
 har-0.122
 gib-0.121
nos-0.119
Token.
Token ID13
Feature activation-9.54e-02

Pos logit contributions
 Caption+0.088
hement+0.083
odynamic+0.082
enegger+0.081
acterial+0.080
Neg logit contributions
 whence-0.088
 Phant-0.088
 har-0.088
 gib-0.086
nos-0.085
Token Local
Token ID10714
Feature activation-1.17e-01

Pos logit contributions
 Caption+0.071
hement+0.068
odynamic+0.067
acterial+0.066
enegger+0.066
Neg logit contributions
 Phant-0.055
 whence-0.055
 har-0.054
 gib-0.054
nos-0.053
Token officials
Token ID2828
Feature activation-2.80e-01

Pos logit contributions
 Caption+0.078
hement+0.073
odynamic+0.073
enegger+0.072
acterial+0.071
Neg logit contributions
 har-0.077
 whence-0.077
 Phant-0.077
 gib-0.076
nos-0.074
Token were
Token ID547
Feature activation-1.37e-01

Pos logit contributions
 Caption+0.202
hement+0.190
odynamic+0.189
enegger+0.186
acterial+0.186
Neg logit contributions
 whence-0.168
 har-0.167
 Phant-0.167
 gib-0.165
nos-0.161
Token so
Token ID523
Feature activation+2.32e-02

Pos logit contributions
 Caption+0.098
hement+0.092
odynamic+0.091
enegger+0.090
acterial+0.090
Neg logit contributions
 whence-0.083
 Phant-0.083
 har-0.083
 gib-0.082
nos-0.080
Token,
Token ID11
Feature activation-2.16e-01

Pos logit contributions
 Caption+0.164
hement+0.156
odynamic+0.155
enegger+0.154
acterial+0.152
Neg logit contributions
 Phant-0.147
 har-0.146
 whence-0.146
 gib-0.144
 hay-0.141
Token IGN
Token ID28730
Feature activation-1.41e-01

Pos logit contributions
 Caption+0.156
hement+0.148
enegger+0.147
odynamic+0.147
acterial+0.144
Neg logit contributions
 Phant-0.131
 whence-0.131
 har-0.130
 gib-0.127
nos-0.125
Token's
Token ID338
Feature activation-5.62e-03

Pos logit contributions
 Caption+0.084
hement+0.079
odynamic+0.078
enegger+0.077
acterial+0.077
Neg logit contributions
 Phant-0.102
 har-0.102
 whence-0.101
 gib-0.100
nos-0.098
Token Halo
Token ID18597
Feature activation-2.15e-01

Pos logit contributions
 Caption+0.004
enegger+0.004
hement+0.004
odynamic+0.004
acterial+0.003
Neg logit contributions
 Phant-0.004
 har-0.004
 whence-0.004
 gib-0.004
nos-0.003
Token super
Token ID2208
Feature activation-7.81e-02

Pos logit contributions
 Caption+0.144
hement+0.138
enegger+0.136
odynamic+0.135
acterial+0.133
Neg logit contributions
 Phant-0.138
 har-0.138
 whence-0.138
 gib-0.136
nos-0.134
Token fans
Token ID3296
Feature activation-6.05e-01

Pos logit contributions
 Caption+0.053
hement+0.050
odynamic+0.049
enegger+0.049
acterial+0.048
Neg logit contributions
 whence-0.050
 Phant-0.050
 har-0.050
 gib-0.049
nos-0.049
Token Ryan
Token ID6047
Feature activation-5.87e-02

Pos logit contributions
 Caption+0.326
hement+0.304
odynamic+0.299
enegger+0.298
acterial+0.293
Neg logit contributions
 har-0.477
 Phant-0.474
 whence-0.472
 gib-0.468
 hay-0.460
Token McC
Token ID5108
Feature activation+4.19e-02

Pos logit contributions
 Caption+0.039
hement+0.037
odynamic+0.036
enegger+0.036
acterial+0.036
Neg logit contributions
 whence-0.039
 Phant-0.039
 har-0.038
 gib-0.038
nos-0.037
Tokenaff
Token ID2001
Feature activation-3.06e-01

Pos logit contributions
 whence+0.029
 har+0.029
 Phant+0.029
 gib+0.029
nos+0.028
Neg logit contributions
 Caption-0.026
hement-0.024
enegger-0.024
odynamic-0.024
acterial-0.023
Tokenrey
Token ID4364
Feature activation-1.96e-01

Pos logit contributions
 Caption+0.226
hement+0.213
odynamic+0.211
enegger+0.209
acterial+0.208
Neg logit contributions
 whence-0.178
 Phant-0.178
 har-0.177
 gib-0.175
nos-0.172
Token and
Token ID290
Feature activation-1.34e-01

Pos logit contributions
 Caption+0.129
hement+0.121
odynamic+0.119
enegger+0.118
acterial+0.117
Neg logit contributions
 whence-0.129
 Phant-0.129
 har-0.129
 gib-0.127
nos-0.125
Token relative
Token ID3585
Feature activation+1.95e-01

Pos logit contributions
 Caption+0.053
hement+0.049
odynamic+0.049
enegger+0.048
acterial+0.048
Neg logit contributions
 whence-0.047
 Phant-0.047
 har-0.046
 gib-0.046
nos-0.045
Token len
Token ID18896
Feature activation-2.32e-01

Pos logit contributions
 whence+0.120
 Phant+0.120
 har+0.119
 gib+0.118
nos+0.116
Neg logit contributions
 Caption-0.137
hement-0.128
odynamic-0.127
enegger-0.126
acterial-0.125
Tokeniency
Token ID6160
Feature activation-2.56e-01

Pos logit contributions
 Caption+0.114
enegger+0.106
hement+0.104
odynamic+0.102
+0.101
Neg logit contributions
 har-0.191
 whence-0.190
nos-0.189
 gib-0.189
 Phant-0.188
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
 Caption+0.172
hement+0.163
odynamic+0.161
enegger+0.159
acterial+0.158
Neg logit contributions
 whence-0.165
 har-0.165
 Phant-0.164
 gib-0.162
nos-0.159
Token Although
Token ID4900
Feature activation-1.28e-01

Pos logit contributions
 Caption+0.011
hement+0.010
odynamic+0.010
acterial+0.010
enegger+0.010
Neg logit contributions
 Phant-0.008
 har-0.008
 whence-0.008
 gib-0.008
nos-0.008
Token tear
Token ID11626
Feature activation-4.76e-01

Pos logit contributions
 Caption+0.088
hement+0.083
odynamic+0.082
enegger+0.082
acterial+0.081
Neg logit contributions
 har-0.081
 whence-0.081
 Phant-0.081
 gib-0.080
nos-0.078
Token gas
Token ID3623
Feature activation-2.46e-01

Pos logit contributions
 Caption+0.345
hement+0.329
enegger+0.323
odynamic+0.323
acterial+0.320
Neg logit contributions
 har-0.282
 whence-0.280
 Phant-0.279
 gib-0.278
nos-0.271
Token was
Token ID373
Feature activation-6.74e-02

Pos logit contributions
 Caption+0.147
hement+0.140
odynamic+0.138
enegger+0.138
acterial+0.135
Neg logit contributions
 har-0.177
 whence-0.175
 Phant-0.175
 gib-0.174
 hay-0.171
Token fired
Token ID6294
Feature activation-2.08e-02

Pos logit contributions
 Caption+0.060
hement+0.057
odynamic+0.057
enegger+0.056
acterial+0.056
Neg logit contributions
 har-0.029
 Phant-0.029
 whence-0.029
 gib-0.029
 hay-0.028
Token at
Token ID379
Feature activation-3.24e-02

Pos logit contributions
 Caption+0.014
hement+0.013
odynamic+0.013
enegger+0.013
acterial+0.013
Neg logit contributions
 har-0.014
 whence-0.014
 Phant-0.014
 gib-0.014
 hay-0.013
Token some
Token ID617
Feature activation+6.94e-02

Pos logit contributions
 Caption+0.019
hement+0.018
odynamic+0.018
enegger+0.018
acterial+0.018
Neg logit contributions
 har-0.024
 whence-0.023
 Phant-0.023
 gib-0.023
 hay-0.023
Token H
Token ID367
Feature activation+4.67e-02

Pos logit contributions
 Phant+0.106
 har+0.106
 whence+0.106
 gib+0.105
nos+0.103
Neg logit contributions
 Caption-0.096
hement-0.090
odynamic-0.089
enegger-0.087
acterial-0.087
Tokenenson
Token ID19069
Feature activation-7.28e-02

Pos logit contributions
 har+0.041
 whence+0.041
 Phant+0.041
ancies+0.041
nos+0.040
Neg logit contributions
 Caption-0.020
hement-0.018
odynamic-0.018
-0.017
enegger-0.017
Token's
Token ID338
Feature activation+2.66e-01

Pos logit contributions
 Caption+0.051
hement+0.049
odynamic+0.048
acterial+0.048
enegger+0.047
Neg logit contributions
 Phant-0.045
 whence-0.045
 har-0.044
 gib-0.044
nos-0.043
Token post
Token ID1281
Feature activation+1.73e-01

Pos logit contributions
 Phant+0.164
 har+0.163
 whence+0.161
 gib+0.160
 hay+0.155
Neg logit contributions
 Caption-0.188
hement-0.181
odynamic-0.180
acterial-0.176
utherland-0.175
Tokenhum
Token ID17047
Feature activation-2.85e-01

Pos logit contributions
 Phant+0.090
 whence+0.090
 har+0.090
 gib+0.088
nos+0.086
Neg logit contributions
 Caption-0.138
hement-0.131
odynamic-0.130
enegger-0.129
acterial-0.128
Tokenous
Token ID516
Feature activation-5.32e-01

Pos logit contributions
 Caption+0.103
hement+0.094
odynamic+0.092
utherland+0.090
enegger+0.089
Neg logit contributions
 har-0.271
 Phant-0.270
 whence-0.269
nos-0.265
ancies-0.265
Token induction
Token ID28471
Feature activation+5.73e-02

Pos logit contributions
 Caption+0.366
hement+0.346
odynamic+0.343
acterial+0.337
enegger+0.337
Neg logit contributions
 Phant-0.336
 whence-0.335
 har-0.335
 gib-0.331
nos-0.323
Token as
Token ID355
Feature activation-8.38e-03

Pos logit contributions
 whence+0.037
 Phant+0.037
 har+0.037
 gib+0.036
nos+0.036
Neg logit contributions
 Caption-0.039
hement-0.036
odynamic-0.036
enegger-0.036
acterial-0.036
Token a
Token ID257
Feature activation+1.15e-01

Pos logit contributions
 Caption+0.006
hement+0.006
odynamic+0.006
enegger+0.006
acterial+0.006
Neg logit contributions
 whence-0.005
 Phant-0.005
 har-0.005
 gib-0.005
nos-0.005
Token Disney
Token ID8519
Feature activation+5.69e-02

Pos logit contributions
 Phant+0.075
 har+0.075
 whence+0.075
 gib+0.074
nos+0.072
Neg logit contributions
 Caption-0.077
hement-0.073
odynamic-0.072
enegger-0.071
acterial-0.071
Token Legend
Token ID9883
Feature activation+1.14e-01

Pos logit contributions
 whence+0.036
 Phant+0.036
 har+0.036
 gib+0.036
nos+0.035
Neg logit contributions
 Caption-0.039
hement-0.036
odynamic-0.036
enegger-0.035
acterial-0.035
Token's
Token ID338
Feature activation+7.47e-02

Pos logit contributions
 whence+0.040
 Phant+0.039
 har+0.039
 gib+0.039
nos+0.038
Neg logit contributions
 Caption-0.045
hement-0.042
odynamic-0.042
enegger-0.041
acterial-0.041
Token lots
Token ID6041
Feature activation+1.38e-01

Pos logit contributions
 whence+0.041
 Phant+0.041
 har+0.041
 gib+0.040
nos+0.039
Neg logit contributions
 Caption-0.058
hement-0.055
odynamic-0.054
enegger-0.054
acterial-0.054
Token of
Token ID286
Feature activation+3.81e-02

Pos logit contributions
 whence+0.063
 Phant+0.063
 har+0.061
 gib+0.061
nos+0.060
Neg logit contributions
 Caption-0.119
hement-0.114
odynamic-0.112
enegger-0.111
acterial-0.111
Token really
Token ID1107
Feature activation+1.74e-02

Pos logit contributions
 har+0.026
 whence+0.026
 Phant+0.026
 gib+0.025
 hay+0.025
Neg logit contributions
 Caption-0.025
hement-0.023
odynamic-0.023
enegger-0.023
acterial-0.023
Token positive
Token ID3967
Feature activation-3.75e-03

Pos logit contributions
 har+0.012
 whence+0.011
 Phant+0.011
 gib+0.011
 hay+0.011
Neg logit contributions
 Caption-0.011
hement-0.011
odynamic-0.011
enegger-0.011
acterial-0.011
Token interactions
Token ID12213
Feature activation-4.82e-01

Pos logit contributions
 Caption+0.003
hement+0.002
odynamic+0.002
enegger+0.002
acterial+0.002
Neg logit contributions
 har-0.002
 whence-0.002
 Phant-0.002
 gib-0.002
 hay-0.002
Token to
Token ID284
Feature activation-8.91e-02

Pos logit contributions
 Caption+0.332
hement+0.315
odynamic+0.310
enegger+0.307
acterial+0.305
Neg logit contributions
 whence-0.305
 har-0.305
 Phant-0.303
 gib-0.300
 hay-0.293
Token start
Token ID923
Feature activation-1.53e-01

Pos logit contributions
 Caption+0.072
hement+0.069
odynamic+0.068
enegger+0.067
acterial+0.067
Neg logit contributions
 whence-0.045
 Phant-0.045
 har-0.045
 gib-0.044
nos-0.043
Token your
Token ID534
Feature activation+8.07e-02

Pos logit contributions
 Caption+0.103
hement+0.098
odynamic+0.097
enegger+0.096
acterial+0.095
Neg logit contributions
 whence-0.099
 har-0.099
 Phant-0.098
 gib-0.097
 hay-0.095
Token day
Token ID1110
Feature activation+3.18e-02

Pos logit contributions
 whence+0.049
 Phant+0.049
 har+0.049
 gib+0.049
nos+0.047
Neg logit contributions
 Caption-0.057
hement-0.054
odynamic-0.054
enegger-0.053
acterial-0.053
Token."
Token ID526
Feature activation-6.23e-02

Pos logit contributions
 whence+0.023
 har+0.023
 Phant+0.022
 gib+0.022
 hay+0.022
Neg logit contributions
 Caption-0.019
hement-0.018
odynamic-0.018
enegger-0.018
acterial-0.018
TokenWC
Token ID27353
Feature activation-2.41e-01

Pos logit contributions
 Caption+0.094
hement+0.088
enegger+0.086
odynamic+0.086
acterial+0.085
Neg logit contributions
 whence-0.117
 har-0.117
 Phant-0.115
 gib-0.114
nos-0.114
Token has
Token ID468
Feature activation-5.54e-02

Pos logit contributions
 Caption+0.163
hement+0.153
odynamic+0.151
enegger+0.150
acterial+0.149
Neg logit contributions
 whence-0.155
 Phant-0.154
 har-0.154
 gib-0.152
nos-0.149
Token been
Token ID587
Feature activation+7.89e-02

Pos logit contributions
 Caption+0.041
hement+0.039
odynamic+0.039
enegger+0.038
acterial+0.038
Neg logit contributions
 whence-0.032
 Phant-0.032
 har-0.032
 gib-0.031
nos-0.031
Token at
Token ID379
Feature activation+6.05e-03

Pos logit contributions
 har+0.048
 Phant+0.047
 whence+0.047
 gib+0.047
 hay+0.046
Neg logit contributions
 Caption-0.057
hement-0.053
odynamic-0.053
enegger-0.053
acterial-0.053
Tokenyp
Token ID4464
Feature activation-1.03e-01

Pos logit contributions
 whence+0.004
 Phant+0.004
 har+0.004
 gib+0.003
nos+0.003
Neg logit contributions
 Caption-0.004
hement-0.004
odynamic-0.004
enegger-0.004
acterial-0.004
Tokenically
Token ID1146
Feature activation-4.98e-01

Pos logit contributions
 Caption+0.067
hement+0.062
enegger+0.062
odynamic+0.061
utherland+0.060
Neg logit contributions
 har-0.069
 whence-0.069
 Phant-0.068
 gib-0.068
nos-0.068
Token open
Token ID1280
Feature activation-2.21e-02

Pos logit contributions
 Caption+0.327
hement+0.307
odynamic+0.304
enegger+0.303
acterial+0.300
Neg logit contributions
 har-0.331
 whence-0.330
 Phant-0.329
 gib-0.326
 hay-0.319
Token about
Token ID546
Feature activation+6.98e-02

Pos logit contributions
 Caption+0.017
hement+0.016
odynamic+0.016
enegger+0.016
acterial+0.016
Neg logit contributions
 har-0.013
 whence-0.013
 Phant-0.012
 gib-0.012
 hay-0.012
Token his
Token ID465
Feature activation+8.56e-02

Pos logit contributions
 whence+0.035
 Phant+0.035
 har+0.035
 gib+0.034
nos+0.033
Neg logit contributions
 Caption-0.057
hement-0.055
odynamic-0.054
enegger-0.054
acterial-0.053
Token willingness
Token ID16826
Feature activation-1.70e-01

Pos logit contributions
 har+0.062
 Phant+0.062
 whence+0.062
 gib+0.061
 hay+0.060
Neg logit contributions
 Caption-0.051
hement-0.048
odynamic-0.047
enegger-0.047
acterial-0.047
Token to
Token ID284
Feature activation+5.03e-02

Pos logit contributions
 Caption+0.117
hement+0.110
odynamic+0.109
acterial+0.107
enegger+0.107
Neg logit contributions
 Phant-0.107
 whence-0.107
 har-0.105
 gib-0.104
nos-0.103