TOP ACTIVATIONS
MAX = 1.943

huge part of training ( tons of shadow , tons on
т а к е я
с и с к о е и
е к о т о р ы
с и с к и
м н и к о�
т о к а .
, л е к с е
е р о� с к а я е
а р а к т е р и
м н и к о� р а
( 1979 ) Grade : A +
н ы и к о� д а
а р а к т е р н о�
е с к р ы в а т
у к а ы в
« и н к а »
е л о� е к .
Though the decision on the Phi K aps part
: л е к с а н д р

MIN ACTIVATIONS
MIN = -2.052

HR W researcher La et itia B ader told the BBC
room . L acan got this . Like the
global influenza programme , Ke iji Fuk uda , said the
: File photo by J iji Press , AFP / Getty
told Merkel that picking Jun cker would " dest abil ize
Jean - Cla ude Jun cker , she hoped the dispute
she was pressing for Jun cker to take the reins at
is deeply wary of Jun cker , an EU federal ist
are also hostile to Jun cker . <|endoftext|> Ill ustration by
Del more Schwartz compared him favorably with Ye ats -- but
polls A G ravis poll shows that Mr .
point about how these gam eworld s function for players :
teenagers , their sleep - wake cycle shifts two hours later
In the wake of climate change , harmful
ation of Christ was protested vigorously . The difference is that
- 08 Election - T ravis & Protect ing Your Girl
from Mid town via Harris burg Start at the crack of
mar ina in Ke ans burg , New Jersey , following
defended Planned Parenthood in the wake of recently released videos that
. In the wake of the social media fascination

INTERVAL 0.945 to 1.943
CONTAINS 0.383%

, is president of the Toronto company and a director of
that . My release is totally fine , I
was to move to a provincial jail , he
public appearances . After his release , Ken y atta set
. Dist ant outer at olls are even more at risk

INTERVAL -0.054 to 0.945
CONTAINS 68.193%

, using steel cable traps and an animal carc ass as
me , but I was never really sure why .
cons cience clause , which became law
tered by the later Chron o Trigger and Earth bound ,
enough to accommodate the first Falling water tour , is superb

INTERVAL -1.053 to -0.054
CONTAINS 31.213%

pine and grape fruit and under g ird ed by che
ed . The issue of a refund is even
of respondents to the Lok Survey , with a sample size
The Secret Life of Walter M itty , photo journal ist
from Am oud a . He wants a democratic and united

INTERVAL -2.052 to -1.053
CONTAINS 0.211%

ellow eat ers in the spirit of July 4
man who was willing to pay my school fees as long
his classic delivery stride . Cricket nerd os could look at
pain and health care quality improvement . We can listen to
I fight very hard to pay as little tax as possible
Token huge
Token ID3236
Feature activation-1.65e-01

Pos logit contributions
arters+0.010
 balcon+0.008
 Phi+0.008
SourceFile+0.008
rics+0.008
Neg logit contributions
jl-0.009
itia-0.008
 favorably-0.008
wake-0.008
eln-0.008
Token part
Token ID636
Feature activation-3.25e-01

Pos logit contributions
jl+0.028
itia+0.028
 favorably+0.027
wake+0.027
eln+0.026
Neg logit contributions
arters-0.039
 balcon-0.031
 Phi-0.031
SourceFile-0.030
rics-0.029
Token of
Token ID286
Feature activation+5.56e-04

Pos logit contributions
jl+0.050
itia+0.050
 favorably+0.049
wake+0.048
eln+0.047
Neg logit contributions
arters-0.082
 balcon-0.065
 Phi-0.065
SourceFile-0.063
rics-0.063
Token training
Token ID3047
Feature activation+6.63e-02

Pos logit contributions
arters+0.000
 balcon+0.000
 Phi+0.000
SourceFile+0.000
rics+0.000
Neg logit contributions
jl-0.000
itia-0.000
 favorably-0.000
wake-0.000
eln-0.000
Token (
Token ID357
Feature activation+9.95e-02

Pos logit contributions
arters+0.016
 balcon+0.012
 Phi+0.012
SourceFile+0.012
rics+0.012
Neg logit contributions
jl-0.011
itia-0.011
 favorably-0.011
wake-0.011
eln-0.011
Tokentons
Token ID27288
Feature activation+1.84e+00

Pos logit contributions
arters+0.024
 balcon+0.019
 Phi+0.019
SourceFile+0.018
rics+0.018
Neg logit contributions
jl-0.016
itia-0.016
 favorably-0.016
wake-0.016
eln-0.016
Token of
Token ID286
Feature activation+4.53e-03

Pos logit contributions
arters+0.449
 balcon+0.356
 Phi+0.353
SourceFile+0.345
rics+0.342
Neg logit contributions
jl-0.297
itia-0.293
 favorably-0.292
wake-0.285
eln-0.280
Token shadow
Token ID9082
Feature activation-3.68e-01

Pos logit contributions
arters+0.001
 balcon+0.001
 Phi+0.001
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.001
itia-0.001
 favorably-0.001
wake-0.001
eln-0.001
Token,
Token ID11
Feature activation+2.78e-02

Pos logit contributions
jl+0.055
itia+0.054
 favorably+0.054
wake+0.052
eln+0.051
Neg logit contributions
arters-0.095
 balcon-0.076
 Phi-0.075
rics-0.074
SourceFile-0.073
Token tons
Token ID10860
Feature activation+2.64e-02

Pos logit contributions
arters+0.006
 balcon+0.005
 Phi+0.005
SourceFile+0.005
rics+0.005
Neg logit contributions
jl-0.005
itia-0.005
 favorably-0.005
wake-0.005
eln-0.005
Token on
Token ID319
Feature activation-2.00e-03

Pos logit contributions
arters+0.006
 balcon+0.005
 Phi+0.005
SourceFile+0.005
rics+0.005
Neg logit contributions
jl-0.004
itia-0.004
 favorably-0.004
wake-0.004
eln-0.004
Token
Token ID141
Feature activation+7.53e-01

Pos logit contributions
arters+0.008
 balcon+0.006
 Phi+0.006
SourceFile+0.005
rics+0.005
Neg logit contributions
jl-0.009
itia-0.009
 favorably-0.009
wake-0.009
eln-0.009
Token
Token ID236
Feature activation-6.58e-02

Pos logit contributions
arters+0.161
 balcon+0.124
 Phi+0.123
rics+0.118
SourceFile+0.118
Neg logit contributions
jl-0.144
 favorably-0.142
itia-0.142
wake-0.139
eln-0.137
Token
Token ID220
Feature activation-3.27e-01

Pos logit contributions
jl+0.013
itia+0.012
 favorably+0.012
wake+0.012
eln+0.012
Neg logit contributions
arters-0.014
 balcon-0.011
 Phi-0.011
SourceFile-0.010
rics-0.010
Tokenт
Token ID20375
Feature activation+8.05e-01

Pos logit contributions
jl+0.067
itia+0.066
 favorably+0.066
wake+0.064
eln+0.063
Neg logit contributions
arters-0.066
 balcon-0.050
 Phi-0.049
SourceFile-0.048
rics-0.047
Tokenа
Token ID16142
Feature activation-2.05e-01

Pos logit contributions
arters+0.170
 balcon+0.129
 Phi+0.128
SourceFile+0.123
rics+0.123
Neg logit contributions
jl-0.158
itia-0.155
 favorably-0.155
wake-0.152
eln-0.149
Tokenк
Token ID31583
Feature activation+1.82e+00

Pos logit contributions
jl+0.050
itia+0.049
 favorably+0.049
wake+0.048
eln+0.048
Neg logit contributions
arters-0.034
 balcon-0.023
 Phi-0.023
SourceFile-0.022
rics-0.022
Token
Token ID140
Feature activation+3.68e-01

Pos logit contributions
arters+0.402
 balcon+0.309
 Phi+0.308
SourceFile+0.297
rics+0.296
Neg logit contributions
jl-0.339
itia-0.334
 favorably-0.333
wake-0.326
eln-0.321
Token
Token ID114
Feature activation-6.67e-02

Pos logit contributions
arters+0.079
 balcon+0.061
 Phi+0.060
SourceFile+0.058
rics+0.058
Neg logit contributions
jl-0.071
itia-0.070
 favorably-0.069
wake-0.068
eln-0.067
Tokenе
Token ID16843
Feature activation+5.52e-01

Pos logit contributions
jl+0.014
itia+0.013
 favorably+0.013
wake+0.013
eln+0.013
Neg logit contributions
arters-0.014
 balcon-0.010
 Phi-0.010
SourceFile-0.010
rics-0.010
Token
Token ID220
Feature activation-3.26e-01

Pos logit contributions
arters+0.113
 balcon+0.085
 Phi+0.085
SourceFile+0.081
rics+0.081
Neg logit contributions
jl-0.111
itia-0.110
 favorably-0.109
wake-0.107
eln-0.106
Tokenя
Token ID40623
Feature activation+6.07e-01

Pos logit contributions
jl+0.063
itia+0.062
 favorably+0.062
wake+0.061
eln+0.060
Neg logit contributions
arters-0.069
 balcon-0.053
 Phi-0.052
SourceFile-0.051
rics-0.050
Tokenс
Token ID21727
Feature activation+5.91e-02

Pos logit contributions
arters+0.010
 balcon+0.008
 Phi+0.008
SourceFile+0.007
rics+0.007
Neg logit contributions
jl-0.009
itia-0.009
 favorably-0.009
wake-0.008
eln-0.008
Tokenи
Token ID18849
Feature activation+3.33e-01

Pos logit contributions
arters+0.012
 balcon+0.009
 Phi+0.009
SourceFile+0.008
rics+0.008
Neg logit contributions
jl-0.012
itia-0.012
 favorably-0.012
wake-0.012
eln-0.012
Token
Token ID140
Feature activation+3.78e-01

Pos logit contributions
arters+0.069
 balcon+0.052
 Phi+0.052
SourceFile+0.050
rics+0.049
Neg logit contributions
jl-0.067
itia-0.066
 favorably-0.066
wake-0.064
eln-0.063
Token
Token ID117
Feature activation+6.02e-01

Pos logit contributions
arters+0.062
 balcon+0.043
 Phi+0.043
SourceFile+0.041
rics+0.040
Neg logit contributions
jl-0.091
itia-0.090
 favorably-0.090
wake-0.089
eln-0.088
Tokenс
Token ID21727
Feature activation+7.35e-02

Pos logit contributions
arters+0.137
 balcon+0.107
 Phi+0.106
SourceFile+0.103
rics+0.102
Neg logit contributions
jl-0.108
itia-0.106
 favorably-0.106
wake-0.104
eln-0.102
Tokenк
Token ID31583
Feature activation+1.82e+00

Pos logit contributions
arters+0.006
 balcon+0.002
 Phi+0.002
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.024
itia-0.024
 favorably-0.024
wake-0.024
eln-0.023
Tokenо
Token ID15166
Feature activation+3.12e-01

Pos logit contributions
arters+0.368
 balcon+0.276
 Phi+0.273
SourceFile+0.263
rics+0.263
Neg logit contributions
jl-0.373
itia-0.367
 favorably-0.367
wake-0.360
eln-0.354
Tokenе
Token ID16843
Feature activation+5.53e-01

Pos logit contributions
arters+0.068
 balcon+0.052
 Phi+0.052
SourceFile+0.050
rics+0.050
Neg logit contributions
jl-0.059
itia-0.058
 favorably-0.058
wake-0.057
eln-0.056
Token
Token ID12466
Feature activation+2.08e-01

Pos logit contributions
arters+0.124
 balcon+0.096
 Phi+0.095
SourceFile+0.092
rics+0.092
Neg logit contributions
jl-0.101
itia-0.100
 favorably-0.099
wake-0.097
eln-0.096
Token
Token ID250
Feature activation-6.49e-02

Pos logit contributions
arters+0.048
 balcon+0.037
 Phi+0.037
SourceFile+0.036
rics+0.036
Neg logit contributions
jl-0.037
itia-0.036
 favorably-0.036
wake-0.035
eln-0.035
Tokenи
Token ID18849
Feature activation+3.18e-01

Pos logit contributions
jl+0.012
itia+0.012
 favorably+0.012
wake+0.012
eln+0.012
Neg logit contributions
arters-0.014
 balcon-0.011
 Phi-0.011
SourceFile-0.010
rics-0.010
Token
Token ID12466
Feature activation+2.02e-01

Pos logit contributions
arters+0.063
 balcon+0.049
 Phi+0.048
SourceFile+0.047
rics+0.047
Neg logit contributions
jl-0.052
itia-0.052
 favorably-0.052
wake-0.050
eln-0.050
Token
Token ID110
Feature activation+3.59e-01

Pos logit contributions
arters+0.047
 balcon+0.036
 Phi+0.036
SourceFile+0.035
rics+0.035
Neg logit contributions
jl-0.035
itia-0.035
 favorably-0.035
wake-0.034
eln-0.033
Token
Token ID12466
Feature activation+2.07e-01

Pos logit contributions
arters+0.078
 balcon+0.060
 Phi+0.060
SourceFile+0.058
rics+0.057
Neg logit contributions
jl-0.067
itia-0.066
 favorably-0.066
wake-0.065
eln-0.064
Token
Token ID121
Feature activation+6.32e-02

Pos logit contributions
arters+0.047
 balcon+0.037
 Phi+0.037
SourceFile+0.036
rics+0.035
Neg logit contributions
jl-0.037
itia-0.036
 favorably-0.036
wake-0.035
eln-0.035
Tokenе
Token ID16843
Feature activation+5.50e-01

Pos logit contributions
arters+0.013
 balcon+0.010
 Phi+0.010
SourceFile+0.010
rics+0.010
Neg logit contributions
jl-0.012
itia-0.012
 favorably-0.012
wake-0.012
eln-0.012
Tokenк
Token ID31583
Feature activation+1.82e+00

Pos logit contributions
arters+0.123
 balcon+0.096
 Phi+0.095
SourceFile+0.092
rics+0.091
Neg logit contributions
jl-0.100
itia-0.099
 favorably-0.099
wake-0.097
eln-0.095
Tokenо
Token ID15166
Feature activation+3.18e-01

Pos logit contributions
arters+0.383
 balcon+0.292
 Phi+0.289
SourceFile+0.278
rics+0.278
Neg logit contributions
jl-0.354
itia-0.350
 favorably-0.350
wake-0.342
eln-0.337
Tokenт
Token ID20375
Feature activation+8.29e-01

Pos logit contributions
arters+0.070
 balcon+0.054
 Phi+0.054
SourceFile+0.052
rics+0.052
Neg logit contributions
jl-0.059
itia-0.058
 favorably-0.058
wake-0.057
eln-0.056
Tokenо
Token ID15166
Feature activation+3.30e-01

Pos logit contributions
arters+0.179
 balcon+0.137
 Phi+0.136
SourceFile+0.131
rics+0.131
Neg logit contributions
jl-0.158
itia-0.155
 favorably-0.155
wake-0.152
eln-0.149
Tokenр
Token ID21169
Feature activation+6.54e-02

Pos logit contributions
arters+0.073
 balcon+0.057
 Phi+0.056
SourceFile+0.054
rics+0.054
Neg logit contributions
jl-0.061
 favorably-0.060
itia-0.060
wake-0.059
eln-0.058
Tokenы
Token ID45035
Feature activation-7.97e-02

Pos logit contributions
arters+0.014
 balcon+0.011
 Phi+0.011
SourceFile+0.010
rics+0.010
Neg logit contributions
jl-0.013
itia-0.012
 favorably-0.012
wake-0.012
eln-0.012
Tokenс
Token ID21727
Feature activation+4.88e-02

Pos logit contributions
arters+0.005
 balcon+0.004
 Phi+0.004
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.004
itia-0.004
 favorably-0.004
wake-0.004
eln-0.004
Tokenи
Token ID18849
Feature activation+3.43e-01

Pos logit contributions
arters+0.009
 balcon+0.007
 Phi+0.007
SourceFile+0.006
rics+0.006
Neg logit contributions
jl-0.011
itia-0.010
 favorably-0.010
wake-0.010
eln-0.010
Token
Token ID140
Feature activation+3.51e-01

Pos logit contributions
arters+0.069
 balcon+0.052
 Phi+0.051
SourceFile+0.049
rics+0.049
Neg logit contributions
jl-0.071
itia-0.070
 favorably-0.069
wake-0.068
eln-0.067
Token
Token ID117
Feature activation+6.20e-01

Pos logit contributions
arters+0.061
 balcon+0.043
 Phi+0.043
SourceFile+0.041
rics+0.041
Neg logit contributions
jl-0.082
itia-0.081
 favorably-0.080
wake-0.079
eln-0.078
Tokenс
Token ID21727
Feature activation+6.14e-02

Pos logit contributions
arters+0.140
 balcon+0.108
 Phi+0.108
SourceFile+0.104
rics+0.104
Neg logit contributions
jl-0.112
itia-0.111
 favorably-0.111
wake-0.108
eln-0.106
Tokenк
Token ID31583
Feature activation+1.81e+00

Pos logit contributions
arters+0.007
 balcon+0.004
 Phi+0.004
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.018
itia-0.018
 favorably-0.018
wake-0.017
eln-0.017
Tokenи
Token ID18849
Feature activation+3.14e-01

Pos logit contributions
arters+0.387
 balcon+0.295
 Phi+0.293
SourceFile+0.282
rics+0.282
Neg logit contributions
jl-0.351
itia-0.346
 favorably-0.345
wake-0.338
eln-0.333
Token
Token ID141
Feature activation+7.68e-01

Pos logit contributions
arters+0.066
 balcon+0.050
 Phi+0.049
SourceFile+0.048
rics+0.047
Neg logit contributions
jl-0.062
itia-0.061
 favorably-0.061
wake-0.060
eln-0.059
Token
Token ID227
Feature activation-7.66e-02

Pos logit contributions
arters+0.181
 balcon+0.142
 Phi+0.142
rics+0.137
SourceFile+0.137
Neg logit contributions
jl-0.131
 favorably-0.129
itia-0.128
wake-0.126
eln-0.123
Token
Token ID12466
Feature activation+2.28e-01

Pos logit contributions
jl+0.014
itia+0.013
 favorably+0.013
wake+0.013
eln+0.013
Neg logit contributions
arters-0.017
 balcon-0.014
 Phi-0.014
SourceFile-0.013
rics-0.013
Token
Token ID121
Feature activation+4.95e-02

Pos logit contributions
arters+0.052
 balcon+0.041
 Phi+0.041
SourceFile+0.039
rics+0.039
Neg logit contributions
jl-0.040
itia-0.040
 favorably-0.039
wake-0.039
eln-0.038
Token
Token ID141
Feature activation+7.66e-01

Pos logit contributions
jl+0.036
itia+0.035
 favorably+0.035
wake+0.034
eln+0.034
Neg logit contributions
arters-0.045
 balcon-0.035
 Phi-0.034
SourceFile-0.033
rics-0.033
Token
Token ID239
Feature activation+4.98e-01

Pos logit contributions
arters+0.172
 balcon+0.133
 Phi+0.132
SourceFile+0.128
rics+0.127
Neg logit contributions
jl-0.140
itia-0.138
 favorably-0.137
wake-0.134
eln-0.132
Tokenм
Token ID43108
Feature activation-5.37e-01

Pos logit contributions
arters+0.104
 balcon+0.079
 Phi+0.078
SourceFile+0.075
rics+0.075
Neg logit contributions
jl-0.099
itia-0.098
 favorably-0.097
wake-0.095
eln-0.094
Tokenн
Token ID22177
Feature activation+4.91e-01

Pos logit contributions
jl+0.106
itia+0.104
 favorably+0.104
wake+0.102
eln+0.100
Neg logit contributions
arters-0.113
 balcon-0.086
 Phi-0.085
SourceFile-0.082
rics-0.082
Tokenи
Token ID18849
Feature activation+3.45e-01

Pos logit contributions
arters+0.105
 balcon+0.080
 Phi+0.079
rics+0.076
SourceFile+0.076
Neg logit contributions
jl-0.095
itia-0.094
 favorably-0.093
wake-0.092
eln-0.090
Tokenк
Token ID31583
Feature activation+1.80e+00

Pos logit contributions
arters+0.063
 balcon+0.046
 Phi+0.046
SourceFile+0.044
rics+0.043
Neg logit contributions
jl-0.077
itia-0.076
 favorably-0.076
wake-0.075
eln-0.074
Tokenо�
Token ID25443
Feature activation+1.82e-02

Pos logit contributions
arters+0.403
 balcon+0.313
 Phi+0.310
rics+0.299
SourceFile+0.298
Neg logit contributions
jl-0.331
 favorably-0.326
itia-0.325
wake-0.319
eln-0.313
Token
Token ID110
Feature activation+3.48e-01

Pos logit contributions
arters+0.004
 balcon+0.003
 Phi+0.003
SourceFile+0.003
rics+0.003
Neg logit contributions
jl-0.004
itia-0.003
 favorably-0.003
wake-0.003
eln-0.003
Token
Token ID220
Feature activation-3.32e-01

Pos logit contributions
arters+0.081
 balcon+0.064
 Phi+0.063
SourceFile+0.061
rics+0.061
Neg logit contributions
jl-0.061
itia-0.059
 favorably-0.059
wake-0.058
eln-0.057
Token
Token ID141
Feature activation+7.55e-01

Pos logit contributions
jl+0.064
itia+0.063
 favorably+0.063
wake+0.061
eln+0.060
Neg logit contributions
arters-0.071
 balcon-0.054
 Phi-0.054
SourceFile-0.052
rics-0.052
Token
Token ID227
Feature activation-7.29e-02

Pos logit contributions
arters+0.169
 balcon+0.131
 Phi+0.130
SourceFile+0.126
rics+0.126
Neg logit contributions
jl-0.138
itia-0.136
 favorably-0.135
wake-0.132
eln-0.130
Token
Token ID220
Feature activation-3.61e-01

Pos logit contributions
jl+0.021
itia+0.021
 favorably+0.020
wake+0.020
eln+0.020
Neg logit contributions
arters-0.027
 balcon-0.021
 Phi-0.021
SourceFile-0.020
rics-0.020
Tokenт
Token ID20375
Feature activation+7.75e-01

Pos logit contributions
jl+0.070
itia+0.069
 favorably+0.069
wake+0.067
eln+0.066
Neg logit contributions
arters-0.077
 balcon-0.059
 Phi-0.059
SourceFile-0.057
rics-0.056
Tokenо
Token ID15166
Feature activation+2.59e-01

Pos logit contributions
arters+0.159
 balcon+0.120
 Phi+0.119
SourceFile+0.114
rics+0.114
Neg logit contributions
jl-0.156
itia-0.154
 favorably-0.154
wake-0.151
eln-0.148
Token
Token ID141
Feature activation+7.42e-01

Pos logit contributions
arters+0.059
 balcon+0.045
 Phi+0.045
SourceFile+0.044
rics+0.044
Neg logit contributions
jl-0.047
itia-0.046
 favorably-0.046
wake-0.045
eln-0.044
Token
Token ID229
Feature activation+2.46e-01

Pos logit contributions
arters+0.172
 balcon+0.134
 Phi+0.134
SourceFile+0.129
rics+0.129
Neg logit contributions
jl-0.130
itia-0.128
 favorably-0.128
wake-0.125
eln-0.122
Tokenк
Token ID31583
Feature activation+1.80e+00

Pos logit contributions
arters+0.047
 balcon+0.035
 Phi+0.035
SourceFile+0.033
rics+0.033
Neg logit contributions
jl-0.053
itia-0.052
 favorably-0.052
wake-0.051
eln-0.050
Tokenа
Token ID16142
Feature activation-2.32e-01

Pos logit contributions
arters+0.388
 balcon+0.297
 Phi+0.295
SourceFile+0.284
rics+0.284
Neg logit contributions
jl-0.344
itia-0.339
 favorably-0.339
wake-0.332
eln-0.326
Token
Token ID141
Feature activation+7.22e-01

Pos logit contributions
jl+0.046
itia+0.045
 favorably+0.045
wake+0.044
eln+0.043
Neg logit contributions
arters-0.049
 balcon-0.037
 Phi-0.037
SourceFile-0.035
rics-0.035
Token
Token ID227
Feature activation-8.41e-02

Pos logit contributions
arters+0.167
 balcon+0.131
 Phi+0.130
SourceFile+0.126
rics+0.125
Neg logit contributions
jl-0.126
itia-0.124
 favorably-0.124
wake-0.121
eln-0.119
Token.
Token ID13
Feature activation-2.79e-02

Pos logit contributions
jl+0.015
itia+0.015
 favorably+0.015
wake+0.015
eln+0.015
Neg logit contributions
arters-0.019
 balcon-0.015
 Phi-0.015
SourceFile-0.014
rics-0.014
Token
Token ID12466
Feature activation+1.94e-01

Pos logit contributions
jl+0.005
itia+0.005
 favorably+0.005
wake+0.005
eln+0.005
Neg logit contributions
arters-0.006
 balcon-0.005
 Phi-0.005
SourceFile-0.005
rics-0.005
Token,
Token ID11
Feature activation-1.03e-03

Pos logit contributions
arters+0.103
 balcon+0.079
 Phi+0.079
SourceFile+0.076
rics+0.076
Neg logit contributions
jl-0.088
itia-0.086
 favorably-0.086
wake-0.085
eln-0.083
Token
Token ID12466
Feature activation+1.37e-01

Pos logit contributions
jl+0.000
itia+0.000
 favorably+0.000
wake+0.000
eln+0.000
Neg logit contributions
arters-0.000
 balcon-0.000
 Phi-0.000
SourceFile-0.000
rics-0.000
Token
Token ID238
Feature activation+2.06e-01

Pos logit contributions
arters+0.031
 balcon+0.024
 Phi+0.024
SourceFile+0.023
rics+0.023
Neg logit contributions
jl-0.025
itia-0.025
 favorably-0.025
wake-0.024
eln-0.024
Tokenл
Token ID30143
Feature activation+4.89e-01

Pos logit contributions
arters+0.042
 balcon+0.032
 Phi+0.031
SourceFile+0.030
rics+0.030
Neg logit contributions
jl-0.042
itia-0.041
 favorably-0.041
wake-0.040
eln-0.040
Tokenе
Token ID16843
Feature activation+5.20e-01

Pos logit contributions
arters+0.095
 balcon+0.070
 Phi+0.070
SourceFile+0.067
rics+0.067
Neg logit contributions
jl-0.104
itia-0.102
 favorably-0.102
wake-0.100
eln-0.099
Tokenк
Token ID31583
Feature activation+1.80e+00

Pos logit contributions
arters+0.051
 balcon+0.024
 Phi+0.024
SourceFile+0.021
rics+0.021
Neg logit contributions
jl-0.161
itia-0.159
 favorably-0.159
wake-0.157
eln-0.156
Tokenс
Token ID21727
Feature activation+6.95e-02

Pos logit contributions
arters+0.399
 balcon+0.309
 Phi+0.306
SourceFile+0.296
rics+0.295
Neg logit contributions
jl-0.331
itia-0.327
 favorably-0.327
wake-0.320
eln-0.314
Tokenе
Token ID16843
Feature activation+5.24e-01

Pos logit contributions
arters+0.016
 balcon+0.012
 Phi+0.012
SourceFile+0.012
rics+0.012
Neg logit contributions
jl-0.012
itia-0.012
 favorably-0.012
wake-0.012
eln-0.012
Token
Token ID140
Feature activation+4.16e-01

Pos logit contributions
arters+0.114
 balcon+0.087
 Phi+0.087
SourceFile+0.084
rics+0.084
Neg logit contributions
jl-0.099
itia-0.098
 favorably-0.098
wake-0.095
eln-0.094
Token
Token ID117
Feature activation+6.55e-01

Pos logit contributions
arters+0.072
 balcon+0.051
 Phi+0.051
SourceFile+0.048
rics+0.048
Neg logit contributions
jl-0.097
itia-0.096
 favorably-0.096
wake-0.094
eln-0.093
Token
Token ID12466
Feature activation+2.95e-01

Pos logit contributions
arters+0.145
 balcon+0.112
 Phi+0.112
SourceFile+0.107
rics+0.107
Neg logit contributions
jl-0.121
itia-0.119
 favorably-0.119
wake-0.116
eln-0.114
Tokenе
Token ID16843
Feature activation+5.03e-01

Pos logit contributions
arters+0.108
 balcon+0.082
 Phi+0.081
rics+0.078
SourceFile+0.078
Neg logit contributions
jl-0.106
itia-0.105
 favorably-0.105
wake-0.103
eln-0.101
Tokenр
Token ID21169
Feature activation+5.81e-02

Pos logit contributions
arters+0.117
 balcon+0.092
 Phi+0.091
rics+0.088
SourceFile+0.088
Neg logit contributions
jl-0.087
itia-0.086
 favorably-0.085
wake-0.083
eln-0.082
Tokenо�
Token ID25443
Feature activation+2.71e-02

Pos logit contributions
arters+0.013
 balcon+0.010
 Phi+0.010
rics+0.009
SourceFile+0.009
Neg logit contributions
jl-0.011
itia-0.011
 favorably-0.011
wake-0.010
eln-0.010
Token
Token ID110
Feature activation+3.48e-01

Pos logit contributions
arters+0.006
 balcon+0.004
 Phi+0.004
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.005
itia-0.005
 favorably-0.005
wake-0.005
eln-0.005
Tokenс
Token ID21727
Feature activation+3.70e-02

Pos logit contributions
arters+0.065
 balcon+0.048
 Phi+0.047
rics+0.045
SourceFile+0.045
Neg logit contributions
jl-0.076
 favorably-0.075
itia-0.075
wake-0.074
eln-0.073
Tokenк
Token ID31583
Feature activation+1.79e+00

Pos logit contributions
arters+0.004
 balcon+0.002
 Phi+0.002
rics+0.002
SourceFile+0.002
Neg logit contributions
jl-0.011
itia-0.011
 favorably-0.011
wake-0.010
eln-0.010
Tokenа
Token ID16142
Feature activation-1.90e-01

Pos logit contributions
arters+0.367
 balcon+0.277
 Phi+0.274
rics+0.264
SourceFile+0.263
Neg logit contributions
jl-0.362
 favorably-0.356
itia-0.356
wake-0.349
eln-0.344
Tokenя
Token ID40623
Feature activation+6.14e-01

Pos logit contributions
jl+0.037
itia+0.037
 favorably+0.036
wake+0.036
eln+0.035
Neg logit contributions
arters-0.040
 balcon-0.031
 Phi-0.030
SourceFile-0.029
rics-0.029
Token
Token ID12466
Feature activation+1.91e-01

Pos logit contributions
arters+0.152
 balcon+0.121
 Phi+0.121
SourceFile+0.117
rics+0.117
Neg logit contributions
jl-0.097
itia-0.096
 favorably-0.095
wake-0.093
eln-0.091
Token
Token ID120
Feature activation-5.30e-01

Pos logit contributions
arters+0.044
 balcon+0.034
 Phi+0.034
SourceFile+0.033
rics+0.033
Neg logit contributions
jl-0.034
itia-0.034
 favorably-0.034
wake-0.033
eln-0.032
Tokenе
Token ID16843
Feature activation+5.08e-01

Pos logit contributions
jl+0.105
itia+0.103
 favorably+0.103
wake+0.101
eln+0.099
Neg logit contributions
arters-0.111
 balcon-0.084
 Phi-0.084
SourceFile-0.081
rics-0.080
Token
Token ID141
Feature activation+7.55e-01

Pos logit contributions
jl+0.064
itia+0.063
 favorably+0.063
wake+0.061
eln+0.060
Neg logit contributions
arters-0.071
 balcon-0.054
 Phi-0.054
SourceFile-0.052
rics-0.052
Token
Token ID227
Feature activation-7.29e-02

Pos logit contributions
arters+0.169
 balcon+0.131
 Phi+0.130
SourceFile+0.126
rics+0.126
Neg logit contributions
jl-0.138
itia-0.136
 favorably-0.135
wake-0.132
eln-0.130
Tokenа
Token ID16142
Feature activation-2.12e-01

Pos logit contributions
jl+0.014
itia+0.013
 favorably+0.013
wake+0.013
eln+0.013
Neg logit contributions
arters-0.016
 balcon-0.012
 Phi-0.012
SourceFile-0.012
rics-0.012
Tokenр
Token ID21169
Feature activation+4.74e-02

Pos logit contributions
jl+0.040
itia+0.039
 favorably+0.039
wake+0.038
eln+0.038
Neg logit contributions
arters-0.046
 balcon-0.036
 Phi-0.035
SourceFile-0.034
rics-0.034
Tokenа
Token ID16142
Feature activation-2.14e-01

Pos logit contributions
arters+0.010
 balcon+0.007
 Phi+0.007
SourceFile+0.007
rics+0.007
Neg logit contributions
jl-0.009
itia-0.009
 favorably-0.009
wake-0.009
eln-0.009
Tokenк
Token ID31583
Feature activation+1.79e+00

Pos logit contributions
jl+0.044
itia+0.043
 favorably+0.043
wake+0.042
eln+0.041
Neg logit contributions
arters-0.044
 balcon-0.033
 Phi-0.033
SourceFile-0.031
rics-0.031
Tokenт
Token ID20375
Feature activation+7.95e-01

Pos logit contributions
arters+0.389
 balcon+0.299
 Phi+0.297
SourceFile+0.286
rics+0.286
Neg logit contributions
jl-0.340
itia-0.335
 favorably-0.334
wake-0.327
eln-0.321
Tokenе
Token ID16843
Feature activation+5.13e-01

Pos logit contributions
arters+0.168
 balcon+0.127
 Phi+0.127
SourceFile+0.122
rics+0.122
Neg logit contributions
jl-0.156
itia-0.153
 favorably-0.153
wake-0.150
eln-0.148
Tokenр
Token ID21169
Feature activation+1.88e-02

Pos logit contributions
arters+0.108
 balcon+0.082
 Phi+0.082
SourceFile+0.078
rics+0.078
Neg logit contributions
jl-0.101
itia-0.099
 favorably-0.099
wake-0.097
eln-0.095
Tokenи
Token ID18849
Feature activation+3.16e-01

Pos logit contributions
arters+0.004
 balcon+0.003
 Phi+0.003
SourceFile+0.003
rics+0.003
Neg logit contributions
jl-0.004
itia-0.004
 favorably-0.004
wake-0.004
eln-0.004
Token
Token ID140
Feature activation+3.35e-01

Pos logit contributions
arters+0.064
 balcon+0.048
 Phi+0.048
SourceFile+0.046
rics+0.046
Neg logit contributions
jl-0.065
itia-0.064
 favorably-0.063
wake-0.062
eln-0.061
Token
Token ID141
Feature activation+7.76e-01

Pos logit contributions
jl+0.045
itia+0.044
 favorably+0.044
wake+0.043
eln+0.042
Neg logit contributions
arters-0.051
 balcon-0.039
 Phi-0.039
SourceFile-0.037
rics-0.037
Token
Token ID239
Feature activation+4.84e-01

Pos logit contributions
arters+0.156
 balcon+0.117
 Phi+0.116
SourceFile+0.112
rics+0.111
Neg logit contributions
jl-0.159
itia-0.157
 favorably-0.157
wake-0.154
eln-0.152
Tokenм
Token ID43108
Feature activation-5.38e-01

Pos logit contributions
arters+0.121
 balcon+0.096
 Phi+0.096
SourceFile+0.093
rics+0.093
Neg logit contributions
jl-0.076
itia-0.074
 favorably-0.074
wake-0.072
eln-0.071
Tokenн
Token ID22177
Feature activation+4.98e-01

Pos logit contributions
jl+0.123
itia+0.121
 favorably+0.121
wake+0.119
eln+0.117
Neg logit contributions
arters-0.096
 balcon-0.069
 Phi-0.069
SourceFile-0.066
rics-0.065
Tokenи
Token ID18849
Feature activation+3.41e-01

Pos logit contributions
arters+0.099
 balcon+0.074
 Phi+0.074
rics+0.071
SourceFile+0.071
Neg logit contributions
jl-0.103
itia-0.102
 favorably-0.102
wake-0.100
eln-0.098
Tokenк
Token ID31583
Feature activation+1.79e+00

Pos logit contributions
arters+0.028
 balcon+0.010
 Phi+0.010
SourceFile+0.008
rics+0.008
Neg logit contributions
jl-0.111
itia-0.110
 favorably-0.110
wake-0.109
eln-0.108
Tokenо�
Token ID25443
Feature activation+2.47e-02

Pos logit contributions
arters+0.415
 balcon+0.325
 Phi+0.322
rics+0.311
SourceFile+0.311
Neg logit contributions
jl-0.314
itia-0.309
 favorably-0.308
wake-0.301
eln-0.296
Token
Token ID110
Feature activation+3.69e-01

Pos logit contributions
arters+0.005
 balcon+0.004
 Phi+0.004
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.005
itia-0.005
 favorably-0.005
wake-0.005
eln-0.005
Token
Token ID220
Feature activation-3.40e-01

Pos logit contributions
arters+0.087
 balcon+0.068
 Phi+0.068
SourceFile+0.065
rics+0.065
Neg logit contributions
jl-0.063
itia-0.062
 favorably-0.062
wake-0.061
eln-0.060
Tokenр
Token ID21169
Feature activation+2.96e-02

Pos logit contributions
jl+0.069
 favorably+0.068
itia+0.068
wake+0.067
eln+0.066
Neg logit contributions
arters-0.069
 balcon-0.052
 Phi-0.052
SourceFile-0.050
rics-0.050
Tokenа
Token ID16142
Feature activation-1.93e-01

Pos logit contributions
arters+0.007
 balcon+0.005
 Phi+0.005
SourceFile+0.005
rics+0.005
Neg logit contributions
jl-0.005
itia-0.005
 favorably-0.005
wake-0.005
eln-0.005
Token (
Token ID357
Feature activation+1.07e-01

Pos logit contributions
arters+0.016
 balcon+0.012
 Phi+0.012
SourceFile+0.012
rics+0.012
Neg logit contributions
jl-0.012
itia-0.012
 favorably-0.012
wake-0.012
eln-0.011
Token1979
Token ID33581
Feature activation+2.39e-01

Pos logit contributions
arters+0.023
 balcon+0.017
 Phi+0.017
SourceFile+0.017
rics+0.017
Neg logit contributions
jl-0.021
itia-0.021
 favorably-0.020
wake-0.020
eln-0.020
Token)
Token ID8
Feature activation-1.65e-02

Pos logit contributions
arters+0.058
 balcon+0.046
 Phi+0.046
SourceFile+0.044
rics+0.044
Neg logit contributions
jl-0.039
itia-0.039
 favorably-0.038
wake-0.038
eln-0.037
Token
Token ID198
Feature activation+3.64e-02

Pos logit contributions
jl+0.003
itia+0.003
 favorably+0.003
wake+0.003
eln+0.003
Neg logit contributions
arters-0.004
 balcon-0.003
 Phi-0.003
SourceFile-0.003
rics-0.003
Token
Token ID198
Feature activation+4.94e-02

Pos logit contributions
arters+0.009
 balcon+0.007
 Phi+0.007
SourceFile+0.007
rics+0.007
Neg logit contributions
jl-0.006
itia-0.006
 favorably-0.006
wake-0.006
eln-0.006
TokenGrade
Token ID42233
Feature activation+1.79e+00

Pos logit contributions
arters+0.005
 balcon+0.002
 Phi+0.002
SourceFile+0.002
rics+0.002
Neg logit contributions
jl-0.016
itia-0.015
 favorably-0.015
wake-0.015
eln-0.015
Token:
Token ID25
Feature activation+1.66e-01

Pos logit contributions
arters+0.380
 balcon+0.290
 Phi+0.287
SourceFile+0.277
rics+0.276
Neg logit contributions
jl-0.348
itia-0.343
 favorably-0.342
wake-0.335
eln-0.329
Token A
Token ID317
Feature activation-1.56e-01

Pos logit contributions
arters+0.038
 balcon+0.029
 Phi+0.029
SourceFile+0.028
rics+0.028
Neg logit contributions
jl-0.030
itia-0.029
 favorably-0.029
wake-0.029
eln-0.028
Token+
Token ID10
Feature activation-3.76e-01

Pos logit contributions
jl+0.025
itia+0.025
 favorably+0.025
wake+0.024
eln+0.024
Neg logit contributions
arters-0.038
 balcon-0.030
 Phi-0.030
SourceFile-0.029
rics-0.029
Token
Token ID198
Feature activation+6.01e-02

Pos logit contributions
jl+0.062
itia+0.061
 favorably+0.061
wake+0.060
eln+0.059
Neg logit contributions
arters-0.090
 balcon-0.071
 Phi-0.071
SourceFile-0.069
rics-0.069
Token
Token ID198
Feature activation+5.18e-02

Pos logit contributions
arters+0.015
 balcon+0.011
 Phi+0.011
SourceFile+0.011
rics+0.011
Neg logit contributions
jl-0.010
itia-0.010
 favorably-0.010
wake-0.010
eln-0.009
Tokenн
Token ID22177
Feature activation+5.26e-01

Pos logit contributions
arters+0.054
 balcon+0.041
 Phi+0.041
SourceFile+0.039
rics+0.039
Neg logit contributions
jl-0.052
itia-0.051
 favorably-0.051
wake-0.050
eln-0.049
Tokenы
Token ID45035
Feature activation-9.82e-02

Pos logit contributions
arters+0.108
 balcon+0.082
 Phi+0.081
SourceFile+0.078
rics+0.078
Neg logit contributions
jl-0.106
itia-0.104
 favorably-0.104
wake-0.102
eln-0.100
Token
Token ID12466
Feature activation+2.03e-01

Pos logit contributions
jl+0.021
itia+0.021
 favorably+0.021
wake+0.021
eln+0.020
Neg logit contributions
arters-0.019
 balcon-0.014
 Phi-0.014
SourceFile-0.013
rics-0.013
Token
Token ID121
Feature activation+4.67e-02

Pos logit contributions
arters+0.047
 balcon+0.037
 Phi+0.037
SourceFile+0.035
rics+0.035
Neg logit contributions
jl-0.035
itia-0.035
 favorably-0.035
wake-0.034
eln-0.033
Tokenи
Token ID18849
Feature activation+3.38e-01

Pos logit contributions
arters+0.010
 balcon+0.007
 Phi+0.007
SourceFile+0.007
rics+0.007
Neg logit contributions
jl-0.009
itia-0.009
 favorably-0.009
wake-0.009
eln-0.009
Tokenк
Token ID31583
Feature activation+1.79e+00

Pos logit contributions
arters+0.073
 balcon+0.056
 Phi+0.056
SourceFile+0.054
rics+0.053
Neg logit contributions
jl-0.064
itia-0.063
 favorably-0.063
wake-0.062
eln-0.061
Tokenо�
Token ID25443
Feature activation+1.80e-02

Pos logit contributions
arters+0.395
 balcon+0.305
 Phi+0.303
SourceFile+0.292
rics+0.291
Neg logit contributions
jl-0.331
itia-0.327
 favorably-0.326
wake-0.319
eln-0.314
Token
Token ID111
Feature activation+2.55e-01

Pos logit contributions
arters+0.004
 balcon+0.003
 Phi+0.003
SourceFile+0.003
rics+0.003
Neg logit contributions
jl-0.003
itia-0.003
 favorably-0.003
wake-0.003
eln-0.003
Tokenд
Token ID43666
Feature activation+1.42e-01

Pos logit contributions
arters+0.053
 balcon+0.040
 Phi+0.040
SourceFile+0.038
rics+0.038
Neg logit contributions
jl-0.051
itia-0.050
 favorably-0.050
wake-0.049
eln-0.048
Tokenа
Token ID16142
Feature activation-2.25e-01

Pos logit contributions
arters+0.030
 balcon+0.023
 Phi+0.023
SourceFile+0.022
rics+0.022
Neg logit contributions
jl-0.028
itia-0.027
 favorably-0.027
wake-0.027
eln-0.026
Token
Token ID12466
Feature activation+1.80e-01

Pos logit contributions
jl+0.044
itia+0.043
 favorably+0.043
wake+0.042
eln+0.042
Neg logit contributions
arters-0.048
 balcon-0.036
 Phi-0.036
SourceFile-0.035
rics-0.035
Token
Token ID12466
Feature activation+1.94e-01

Pos logit contributions
jl+0.005
itia+0.005
 favorably+0.005
wake+0.005
eln+0.005
Neg logit contributions
arters-0.006
 balcon-0.005
 Phi-0.005
SourceFile-0.005
rics-0.005
Token
Token ID98
Feature activation+6.21e-01

Pos logit contributions
arters+0.041
 balcon+0.031
 Phi+0.031
SourceFile+0.030
rics+0.030
Neg logit contributions
jl-0.038
itia-0.037
 favorably-0.037
wake-0.036
eln-0.036
Tokenа
Token ID16142
Feature activation-2.12e-01

Pos logit contributions
arters+0.133
 balcon+0.101
 Phi+0.101
SourceFile+0.097
rics+0.097
Neg logit contributions
jl-0.120
itia-0.118
 favorably-0.118
wake-0.115
eln-0.113
Tokenр
Token ID21169
Feature activation+3.92e-02

Pos logit contributions
jl+0.045
itia+0.045
 favorably+0.045
wake+0.044
eln+0.043
Neg logit contributions
arters-0.041
 balcon-0.030
 Phi-0.030
SourceFile-0.029
rics-0.029
Tokenа
Token ID16142
Feature activation-2.38e-01

Pos logit contributions
arters+0.008
 balcon+0.006
 Phi+0.006
SourceFile+0.006
rics+0.006
Neg logit contributions
jl-0.008
itia-0.008
 favorably-0.008
wake-0.008
eln-0.008
Tokenк
Token ID31583
Feature activation+1.79e+00

Pos logit contributions
jl+0.049
itia+0.048
 favorably+0.048
wake+0.047
eln+0.046
Neg logit contributions
arters-0.048
 balcon-0.036
 Phi-0.036
SourceFile-0.035
rics-0.034
Tokenт
Token ID20375
Feature activation+7.80e-01

Pos logit contributions
arters+0.376
 balcon+0.286
 Phi+0.284
SourceFile+0.274
rics+0.273
Neg logit contributions
jl-0.350
itia-0.345
 favorably-0.344
wake-0.338
eln-0.332
Tokenе
Token ID16843
Feature activation+5.34e-01

Pos logit contributions
arters+0.154
 balcon+0.114
 Phi+0.114
SourceFile+0.109
rics+0.109
Neg logit contributions
jl-0.163
itia-0.161
 favorably-0.161
wake-0.158
eln-0.155
Tokenр
Token ID21169
Feature activation+1.22e-02

Pos logit contributions
arters+0.128
 balcon+0.101
 Phi+0.100
SourceFile+0.097
rics+0.097
Neg logit contributions
jl-0.090
itia-0.088
 favorably-0.088
wake-0.086
eln-0.084
Tokenн
Token ID22177
Feature activation+4.82e-01

Pos logit contributions
arters+0.003
 balcon+0.002
 Phi+0.002
SourceFile+0.002
rics+0.002
Neg logit contributions
jl-0.002
itia-0.002
 favorably-0.002
wake-0.002
eln-0.002
Tokenо�
Token ID25443
Feature activation-6.53e-03

Pos logit contributions
arters+0.106
 balcon+0.081
 Phi+0.081
SourceFile+0.078
rics+0.078
Neg logit contributions
jl-0.091
itia-0.089
 favorably-0.089
wake-0.087
eln-0.086
Token
Token ID121
Feature activation+5.40e-02

Pos logit contributions
arters+0.043
 balcon+0.034
 Phi+0.034
SourceFile+0.032
rics+0.032
Neg logit contributions
jl-0.033
itia-0.033
 favorably-0.033
wake-0.032
eln-0.032
Tokenе
Token ID16843
Feature activation+5.27e-01

Pos logit contributions
arters+0.011
 balcon+0.009
 Phi+0.009
SourceFile+0.008
rics+0.008
Neg logit contributions
jl-0.011
itia-0.011
 favorably-0.010
wake-0.010
eln-0.010
Token
Token ID12466
Feature activation+2.06e-01

Pos logit contributions
arters+0.117
 balcon+0.091
 Phi+0.090
SourceFile+0.087
rics+0.087
Neg logit contributions
jl-0.097
itia-0.096
 favorably-0.095
wake-0.093
eln-0.092
Token
Token ID110
Feature activation+3.62e-01

Pos logit contributions
arters+0.047
 balcon+0.037
 Phi+0.037
SourceFile+0.036
rics+0.035
Neg logit contributions
jl-0.036
itia-0.036
 favorably-0.036
wake-0.035
eln-0.034
Tokenс
Token ID21727
Feature activation+5.89e-02

Pos logit contributions
arters+0.080
 balcon+0.061
 Phi+0.061
SourceFile+0.059
rics+0.059
Neg logit contributions
jl-0.067
itia-0.066
 favorably-0.066
wake-0.065
eln-0.064
Tokenк
Token ID31583
Feature activation+1.79e+00

Pos logit contributions
arters+0.011
 balcon+0.008
 Phi+0.008
SourceFile+0.007
rics+0.007
Neg logit contributions
jl-0.013
itia-0.013
 favorably-0.013
wake-0.013
eln-0.013
Tokenр
Token ID21169
Feature activation+4.14e-02

Pos logit contributions
arters+0.382
 balcon+0.292
 Phi+0.289
SourceFile+0.279
rics+0.278
Neg logit contributions
jl-0.344
itia-0.339
 favorably-0.338
wake-0.331
eln-0.326
Tokenы
Token ID45035
Feature activation-1.03e-01

Pos logit contributions
arters+0.009
 balcon+0.007
 Phi+0.007
SourceFile+0.007
rics+0.007
Neg logit contributions
jl-0.008
itia-0.008
 favorably-0.007
wake-0.007
eln-0.007
Tokenв
Token ID38857
Feature activation-7.58e-02

Pos logit contributions
jl+0.020
itia+0.020
 favorably+0.020
wake+0.020
eln+0.019
Neg logit contributions
arters-0.021
 balcon-0.016
 Phi-0.016
SourceFile-0.015
rics-0.015
Tokenа
Token ID16142
Feature activation-2.21e-01

Pos logit contributions
jl+0.014
itia+0.014
 favorably+0.014
wake+0.014
eln+0.013
Neg logit contributions
arters-0.017
 balcon-0.013
 Phi-0.013
SourceFile-0.012
rics-0.012
Tokenт
Token ID20375
Feature activation+8.08e-01

Pos logit contributions
jl+0.046
itia+0.046
 favorably+0.045
wake+0.045
eln+0.044
Neg logit contributions
arters-0.044
 balcon-0.032
 Phi-0.032
SourceFile-0.031
rics-0.031
Tokenу
Token ID35072
Feature activation+2.60e-01

Pos logit contributions
arters+0.186
 balcon+0.146
 Phi+0.146
SourceFile+0.141
rics+0.140
Neg logit contributions
jl-0.136
itia-0.133
 favorably-0.133
wake-0.130
eln-0.128
Token
Token ID141
Feature activation+7.43e-01

Pos logit contributions
arters+0.051
 balcon+0.038
 Phi+0.038
SourceFile+0.036
rics+0.036
Neg logit contributions
jl-0.055
itia-0.054
 favorably-0.054
wake-0.053
eln-0.052
Token
Token ID236
Feature activation-6.47e-02

Pos logit contributions
arters+0.163
 balcon+0.126
 Phi+0.125
SourceFile+0.121
rics+0.120
Neg logit contributions
jl-0.139
itia-0.137
 favorably-0.137
wake-0.134
eln-0.132
Token
Token ID12466
Feature activation+1.87e-01

Pos logit contributions
jl+0.013
itia+0.012
 favorably+0.012
wake+0.012
eln+0.012
Neg logit contributions
arters-0.014
 balcon-0.010
 Phi-0.010
SourceFile-0.010
rics-0.010
Token
Token ID122
Feature activation-1.52e-01

Pos logit contributions
arters+0.043
 balcon+0.033
 Phi+0.033
SourceFile+0.032
rics+0.032
Neg logit contributions
jl-0.033
itia-0.033
 favorably-0.033
wake-0.032
eln-0.031
Tokenк
Token ID31583
Feature activation+1.78e+00

Pos logit contributions
jl+0.030
itia+0.030
 favorably+0.030
wake+0.029
eln+0.029
Neg logit contributions
arters-0.032
 balcon-0.024
 Phi-0.024
SourceFile-0.023
rics-0.023
Tokenа
Token ID16142
Feature activation-2.38e-01

Pos logit contributions
arters+0.377
 balcon+0.287
 Phi+0.284
SourceFile+0.274
rics+0.273
Neg logit contributions
jl-0.348
itia-0.344
 favorably-0.343
wake-0.336
eln-0.330
Token
Token ID140
Feature activation+3.55e-01

Pos logit contributions
jl+0.048
itia+0.048
 favorably+0.047
wake+0.047
eln+0.046
Neg logit contributions
arters-0.049
 balcon-0.036
 Phi-0.036
SourceFile-0.035
rics-0.035
Token
Token ID115
Feature activation+2.82e-01

Pos logit contributions
arters+0.072
 balcon+0.054
 Phi+0.054
SourceFile+0.051
rics+0.051
Neg logit contributions
jl-0.073
itia-0.072
 favorably-0.071
wake-0.070
eln-0.069
Tokenы
Token ID45035
Feature activation-1.12e-01

Pos logit contributions
arters+0.062
 balcon+0.047
 Phi+0.047
rics+0.045
SourceFile+0.045
Neg logit contributions
jl-0.053
itia-0.052
 favorably-0.052
wake-0.051
eln-0.050
Tokenв
Token ID38857
Feature activation-1.04e-01

Pos logit contributions
jl+0.025
itia+0.025
 favorably+0.025
wake+0.024
eln+0.024
Neg logit contributions
arters-0.020
 balcon-0.015
 Phi-0.015
rics-0.014
SourceFile-0.014
Token «
Token ID21110
Feature activation-2.32e-01

Pos logit contributions
jl+0.020
itia+0.019
 favorably+0.019
wake+0.019
eln+0.019
Neg logit contributions
arters-0.025
 balcon-0.019
 Phi-0.019
SourceFile-0.019
rics-0.019
Token
Token ID141
Feature activation+7.28e-01

Pos logit contributions
jl+0.044
itia+0.044
 favorably+0.044
wake+0.043
eln+0.042
Neg logit contributions
arters-0.050
 balcon-0.038
 Phi-0.038
SourceFile-0.037
rics-0.037
Token
Token ID228
Feature activation+2.36e-01

Pos logit contributions
arters+0.164
 balcon+0.128
 Phi+0.127
SourceFile+0.123
rics+0.122
Neg logit contributions
jl-0.132
itia-0.130
 favorably-0.129
wake-0.126
eln-0.124
Tokenи
Token ID18849
Feature activation+2.95e-01

Pos logit contributions
arters+0.046
 balcon+0.035
 Phi+0.034
SourceFile+0.033
rics+0.033
Neg logit contributions
jl-0.049
itia-0.049
 favorably-0.049
wake-0.048
eln-0.047
Tokenн
Token ID22177
Feature activation+4.64e-01

Pos logit contributions
arters+0.060
 balcon+0.045
 Phi+0.045
SourceFile+0.043
rics+0.043
Neg logit contributions
jl-0.060
itia-0.059
 favorably-0.059
wake-0.058
eln-0.057
Tokenк
Token ID31583
Feature activation+1.78e+00

Pos logit contributions
arters+0.096
 balcon+0.072
 Phi+0.072
SourceFile+0.069
rics+0.069
Neg logit contributions
jl-0.093
itia-0.092
 favorably-0.092
wake-0.090
eln-0.088
Tokenа
Token ID16142
Feature activation-2.26e-01

Pos logit contributions
arters+0.390
 balcon+0.300
 Phi+0.297
rics+0.286
SourceFile+0.286
Neg logit contributions
jl-0.335
 favorably-0.330
itia-0.330
wake-0.323
eln-0.317
Token
Token ID141
Feature activation+7.16e-01

Pos logit contributions
jl+0.043
itia+0.043
 favorably+0.043
wake+0.042
eln+0.041
Neg logit contributions
arters-0.049
 balcon-0.037
 Phi-0.037
SourceFile-0.036
rics-0.035
Token
Token ID227
Feature activation-8.36e-02

Pos logit contributions
arters+0.161
 balcon+0.124
 Phi+0.124
SourceFile+0.120
rics+0.119
Neg logit contributions
jl-0.130
itia-0.128
 favorably-0.128
wake-0.125
eln-0.123
Token»
Token ID17730
Feature activation-2.60e-01

Pos logit contributions
jl+0.014
itia+0.014
 favorably+0.014
wake+0.014
eln+0.013
Neg logit contributions
arters-0.020
 balcon-0.016
 Phi-0.016
SourceFile-0.015
rics-0.015
Token
Token ID220
Feature activation-3.60e-01

Pos logit contributions
jl+0.046
itia+0.045
 favorably+0.045
wake+0.044
eln+0.043
Neg logit contributions
arters-0.060
 balcon-0.047
 Phi-0.047
SourceFile-0.045
rics-0.045
Tokenе
Token ID16843
Feature activation+5.16e-01

Pos logit contributions
arters+0.049
 balcon+0.036
 Phi+0.036
SourceFile+0.034
rics+0.034
Neg logit contributions
jl-0.058
itia-0.057
 favorably-0.057
wake-0.056
eln-0.055
Tokenл
Token ID30143
Feature activation+4.79e-01

Pos logit contributions
arters+0.109
 balcon+0.083
 Phi+0.082
rics+0.079
SourceFile+0.079
Neg logit contributions
jl-0.101
itia-0.100
 favorably-0.099
wake-0.097
eln-0.096
Tokenо�
Token ID25443
Feature activation+1.10e-02

Pos logit contributions
arters+0.098
 balcon+0.074
 Phi+0.074
rics+0.071
SourceFile+0.071
Neg logit contributions
jl-0.096
itia-0.095
 favorably-0.095
wake-0.093
eln-0.091
Token
Token ID110
Feature activation+3.17e-01

Pos logit contributions
arters+0.002
 balcon+0.002
 Phi+0.002
SourceFile+0.002
rics+0.002
Neg logit contributions
jl-0.002
itia-0.002
 favorably-0.002
wake-0.002
eln-0.002
Tokenе
Token ID16843
Feature activation+4.82e-01

Pos logit contributions
arters+0.069
 balcon+0.054
 Phi+0.053
rics+0.051
SourceFile+0.051
Neg logit contributions
jl-0.060
 favorably-0.059
itia-0.059
wake-0.057
eln-0.057
Tokenк
Token ID31583
Feature activation+1.78e+00

Pos logit contributions
arters+0.097
 balcon+0.073
 Phi+0.073
rics+0.070
SourceFile+0.069
Neg logit contributions
jl-0.098
itia-0.097
 favorably-0.097
wake-0.095
eln-0.094
Token.
Token ID13
Feature activation-2.18e-02

Pos logit contributions
arters+0.374
 balcon+0.285
 Phi+0.283
rics+0.272
SourceFile+0.270
Neg logit contributions
jl-0.348
itia-0.344
 favorably-0.343
wake-0.337
eln-0.331
Token
Token ID198
Feature activation+1.33e-02

Pos logit contributions
jl+0.004
itia+0.004
 favorably+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.005
 balcon-0.004
 Phi-0.004
SourceFile-0.004
rics-0.004
Token
Token ID198
Feature activation+2.50e-03

Pos logit contributions
arters+0.003
 balcon+0.003
 Phi+0.002
SourceFile+0.002
rics+0.002
Neg logit contributions
jl-0.002
 favorably-0.002
itia-0.002
wake-0.002
eln-0.002
Token
Token ID140
Feature activation+3.43e-01

Pos logit contributions
arters+0.001
 balcon+0.000
 Phi+0.000
SourceFile+0.000
rics+0.000
Neg logit contributions
jl-0.000
itia-0.000
 favorably-0.000
wake-0.000
eln-0.000
Token
Token ID94
Feature activation+8.12e-03

Pos logit contributions
arters+0.072
 balcon+0.055
 Phi+0.055
SourceFile+0.053
rics+0.052
Neg logit contributions
jl-0.067
itia-0.066
 favorably-0.066
wake-0.065
eln-0.064
TokenThough
Token ID10915
Feature activation+2.26e-01

Pos logit contributions
arters+0.008
 balcon+0.006
 Phi+0.006
SourceFile+0.006
rics+0.006
Neg logit contributions
jl-0.006
itia-0.006
 favorably-0.006
wake-0.006
eln-0.006
Token the
Token ID262
Feature activation-1.01e-02

Pos logit contributions
arters+0.053
 balcon+0.042
 Phi+0.042
SourceFile+0.040
rics+0.040
Neg logit contributions
jl-0.039
itia-0.038
 favorably-0.038
wake-0.037
eln-0.037
Token decision
Token ID2551
Feature activation+4.69e-01

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.002
wake+0.002
eln+0.002
Neg logit contributions
arters-0.002
 balcon-0.001
 Phi-0.001
SourceFile-0.001
rics-0.001
Token on
Token ID319
Feature activation-1.21e-02

Pos logit contributions
arters+0.110
 balcon+0.087
 Phi+0.086
SourceFile+0.084
rics+0.083
Neg logit contributions
jl-0.080
itia-0.079
 favorably-0.079
wake-0.077
eln-0.075
Token the
Token ID262
Feature activation+1.94e-02

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.002
wake+0.002
eln+0.002
Neg logit contributions
arters-0.003
 Phi-0.002
 balcon-0.002
SourceFile-0.002
rics-0.002
Token Phi
Token ID47256
Feature activation+1.78e+00

Pos logit contributions
arters+0.004
 Phi+0.003
 balcon+0.003
SourceFile+0.003
rics+0.003
Neg logit contributions
jl-0.003
itia-0.003
 favorably-0.003
wake-0.003
eln-0.003
Token K
Token ID509
Feature activation+5.16e-01

Pos logit contributions
arters+0.318
 balcon+0.229
 Phi+0.223
SourceFile+0.218
rics+0.216
Neg logit contributions
jl-0.404
itia-0.399
 favorably-0.399
wake-0.391
iji-0.384
Tokenaps
Token ID1686
Feature activation-3.19e-01

Pos logit contributions
arters+0.135
 balcon+0.110
 Phi+0.109
SourceFile+0.106
rics+0.105
Neg logit contributions
jl-0.075
itia-0.073
 favorably-0.072
wake-0.071
eln-0.069
Token
Token ID447
Feature activation+3.86e-02

Pos logit contributions
jl+0.058
itia+0.057
 favorably+0.057
wake+0.056
eln+0.055
Neg logit contributions
arters-0.072
 balcon-0.056
 Phi-0.056
SourceFile-0.054
rics-0.054
Token
Token ID247
Feature activation-9.79e-03

Pos logit contributions
arters+0.010
 balcon+0.008
 Phi+0.008
SourceFile+0.008
rics+0.008
Neg logit contributions
jl-0.006
itia-0.006
 favorably-0.006
wake-0.005
eln-0.005
Token part
Token ID636
Feature activation-3.22e-01

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.002
wake+0.002
eln+0.002
Neg logit contributions
arters-0.002
 balcon-0.002
 Phi-0.002
rics-0.002
SourceFile-0.002
Token:
Token ID25
Feature activation+1.02e-01

Pos logit contributions
jl+0.046
itia+0.046
 favorably+0.046
wake+0.045
eln+0.044
Neg logit contributions
arters-0.061
 balcon-0.048
 Phi-0.047
SourceFile-0.046
rics-0.046
Token
Token ID12466
Feature activation+1.55e-01

Pos logit contributions
arters+0.024
 balcon+0.018
 Phi+0.018
SourceFile+0.018
rics+0.018
Neg logit contributions
jl-0.018
itia-0.018
 favorably-0.018
wake-0.017
eln-0.017
Token
Token ID238
Feature activation+1.68e-01

Pos logit contributions
arters+0.035
 balcon+0.027
 Phi+0.027
SourceFile+0.026
rics+0.026
Neg logit contributions
jl-0.028
itia-0.028
 favorably-0.028
wake-0.027
eln-0.027
Tokenл
Token ID30143
Feature activation+4.78e-01

Pos logit contributions
arters+0.035
 balcon+0.026
 Phi+0.026
SourceFile+0.025
rics+0.025
Neg logit contributions
jl-0.033
itia-0.033
 favorably-0.033
wake-0.032
eln-0.031
Tokenе
Token ID16843
Feature activation+4.60e-01

Pos logit contributions
arters+0.095
 balcon+0.071
 Phi+0.071
SourceFile+0.068
rics+0.068
Neg logit contributions
jl-0.099
itia-0.098
 favorably-0.098
wake-0.096
eln-0.094
Tokenк
Token ID31583
Feature activation+1.77e+00

Pos logit contributions
arters+0.088
 balcon+0.065
 Phi+0.065
SourceFile+0.062
rics+0.062
Neg logit contributions
jl-0.099
itia-0.097
 favorably-0.097
wake-0.095
eln-0.094
Tokenс
Token ID21727
Feature activation-3.24e-02

Pos logit contributions
arters+0.384
 balcon+0.295
 Phi+0.293
rics+0.282
SourceFile+0.281
Neg logit contributions
jl-0.336
itia-0.332
 favorably-0.332
wake-0.325
eln-0.319
Tokenа
Token ID16142
Feature activation-2.58e-01

Pos logit contributions
jl+0.007
itia+0.007
 favorably+0.007
wake+0.006
eln+0.006
Neg logit contributions
arters-0.006
 balcon-0.005
 Phi-0.005
SourceFile-0.005
rics-0.005
Tokenн
Token ID22177
Feature activation+4.39e-01

Pos logit contributions
jl+0.051
itia+0.050
 favorably+0.050
wake+0.049
eln+0.049
Neg logit contributions
arters-0.054
 balcon-0.041
 Phi-0.041
SourceFile-0.039
rics-0.039
Tokenд
Token ID43666
Feature activation+1.09e-01

Pos logit contributions
arters+0.096
 balcon+0.073
 Phi+0.073
SourceFile+0.071
rics+0.070
Neg logit contributions
jl-0.083
itia-0.082
 favorably-0.082
wake-0.080
eln-0.079
Tokenр
Token ID21169
Feature activation-4.20e-02

Pos logit contributions
arters+0.023
 balcon+0.017
 Phi+0.017
SourceFile+0.017
rics+0.017
Neg logit contributions
jl-0.021
itia-0.021
 favorably-0.021
wake-0.021
eln-0.020
Token HR
Token ID15172
Feature activation-5.34e-01

Pos logit contributions
jl+0.053
 favorably+0.052
itia+0.052
wake+0.051
eln+0.050
Neg logit contributions
arters-0.074
 balcon-0.058
 Phi-0.058
rics-0.056
SourceFile-0.055
TokenW
Token ID54
Feature activation-6.13e-01

Pos logit contributions
 favorably+0.075
jl+0.075
itia+0.074
eln+0.071
wake+0.071
Neg logit contributions
arters-0.141
 Phi-0.114
 balcon-0.113
Toronto-0.111
SourceFile-0.111
Token researcher
Token ID13453
Feature activation+1.82e-01

Pos logit contributions
jl+0.110
 favorably+0.108
itia+0.108
wake+0.106
eln+0.105
Neg logit contributions
arters-0.139
 balcon-0.108
 Phi-0.107
rics-0.104
SourceFile-0.103
Token La
Token ID4689
Feature activation+2.48e-01

Pos logit contributions
arters+0.041
 Phi+0.032
 balcon+0.032
rics+0.031
SourceFile+0.030
Neg logit contributions
jl-0.033
itia-0.032
 favorably-0.032
wake-0.032
eln-0.031
Tokenet
Token ID316
Feature activation+3.80e-01

Pos logit contributions
arters+0.055
 Phi+0.042
 balcon+0.042
rics+0.040
SourceFile+0.040
Neg logit contributions
jl-0.046
 favorably-0.046
itia-0.045
wake-0.044
eln-0.044
Tokenitia
Token ID36723
Feature activation-2.05e+00

Pos logit contributions
arters+0.139
 Phi+0.120
 balcon+0.119
rics+0.118
enders+0.116
Neg logit contributions
 favorably-0.015
jl-0.015
wake-0.013
itia-0.012
eln-0.011
Token B
Token ID347
Feature activation+9.91e-02

Pos logit contributions
jl+0.383
itia+0.377
 favorably+0.376
wake+0.368
eln+0.362
Neg logit contributions
arters-0.452
 balcon-0.348
 Phi-0.346
SourceFile-0.334
rics-0.333
Tokenader
Token ID5067
Feature activation+1.91e-01

Pos logit contributions
arters+0.023
 balcon+0.017
 Phi+0.017
rics+0.017
SourceFile+0.017
Neg logit contributions
jl-0.018
itia-0.017
 favorably-0.017
wake-0.017
eln-0.017
Token told
Token ID1297
Feature activation-1.13e-01

Pos logit contributions
arters+0.044
 balcon+0.035
 Phi+0.035
rics+0.033
SourceFile+0.033
Neg logit contributions
jl-0.033
 favorably-0.033
itia-0.032
wake-0.032
eln-0.031
Token the
Token ID262
Feature activation-1.51e-02

Pos logit contributions
jl+0.020
 favorably+0.020
itia+0.020
wake+0.020
eln+0.019
Neg logit contributions
arters-0.025
 balcon-0.020
 Phi-0.020
rics-0.019
SourceFile-0.019
Token BBC
Token ID7823
Feature activation-3.18e-01

Pos logit contributions
jl+0.003
 favorably+0.003
itia+0.003
wake+0.002
eln+0.002
Neg logit contributions
arters-0.004
 balcon-0.003
 Phi-0.003
SourceFile-0.003
rics-0.003
Token room
Token ID2119
Feature activation-3.06e-01

Pos logit contributions
jl+0.009
itia+0.008
 favorably+0.008
wake+0.008
eln+0.008
Neg logit contributions
arters-0.011
 balcon-0.008
 Phi-0.008
rics-0.008
SourceFile-0.008
Token.
Token ID13
Feature activation-4.76e-02

Pos logit contributions
jl+0.052
itia+0.051
 favorably+0.051
wake+0.050
eln+0.049
Neg logit contributions
arters-0.073
 balcon-0.057
 Phi-0.057
SourceFile-0.055
rics-0.055
Token
Token ID198
Feature activation+6.74e-03

Pos logit contributions
jl+0.008
 favorably+0.008
itia+0.008
wake+0.007
eln+0.007
Neg logit contributions
arters-0.012
 Phi-0.009
 balcon-0.009
SourceFile-0.009
rics-0.009
Token
Token ID198
Feature activation+3.24e-03

Pos logit contributions
arters+0.002
 balcon+0.001
 Phi+0.001
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.001
itia-0.001
 favorably-0.001
wake-0.001
eln-0.001
TokenL
Token ID43
Feature activation-4.97e-01

Pos logit contributions
arters+0.001
 balcon+0.001
 Phi+0.001
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.001
 favorably-0.001
itia-0.001
wake-0.001
eln-0.001
Tokenacan
Token ID50195
Feature activation-1.84e+00

Pos logit contributions
jl+0.032
 favorably+0.031
itia+0.030
wake+0.028
eln+0.026
Neg logit contributions
arters-0.171
 balcon-0.145
 Phi-0.144
rics-0.142
SourceFile-0.141
Token got
Token ID1392
Feature activation-2.41e-01

Pos logit contributions
jl+0.328
itia+0.323
 favorably+0.322
wake+0.315
eln+0.309
Neg logit contributions
arters-0.419
 balcon-0.326
 Phi-0.324
SourceFile-0.314
rics-0.312
Token this
Token ID428
Feature activation-2.95e-02

Pos logit contributions
jl+0.042
itia+0.042
 favorably+0.041
wake+0.041
eln+0.040
Neg logit contributions
arters-0.056
 balcon-0.043
 Phi-0.043
SourceFile-0.042
rics-0.042
Token.
Token ID13
Feature activation-4.36e-02

Pos logit contributions
jl+0.005
itia+0.005
 favorably+0.005
wake+0.005
eln+0.005
Neg logit contributions
arters-0.007
 balcon-0.006
 Phi-0.006
SourceFile-0.005
rics-0.005
Token Like
Token ID4525
Feature activation+1.10e-01

Pos logit contributions
jl+0.007
 favorably+0.007
itia+0.007
wake+0.007
eln+0.007
Neg logit contributions
arters-0.011
 Phi-0.008
 balcon-0.008
SourceFile-0.008
rics-0.008
Token the
Token ID262
Feature activation-7.40e-03

Pos logit contributions
arters+0.025
 balcon+0.020
 Phi+0.020
SourceFile+0.019
rics+0.019
Neg logit contributions
jl-0.019
itia-0.019
 favorably-0.019
wake-0.018
eln-0.018
Token global
Token ID3298
Feature activation+1.02e-01

Pos logit contributions
arters+0.073
 balcon+0.056
 Phi+0.056
SourceFile+0.054
rics+0.054
Neg logit contributions
jl-0.060
itia-0.059
 favorably-0.059
wake-0.057
eln-0.056
Token influenza
Token ID29491
Feature activation+5.58e-01

Pos logit contributions
arters+0.022
 balcon+0.017
 Phi+0.017
SourceFile+0.016
rics+0.016
Neg logit contributions
jl-0.019
itia-0.019
 favorably-0.019
wake-0.018
eln-0.018
Token programme
Token ID11383
Feature activation-2.80e-01

Pos logit contributions
arters+0.134
 balcon+0.105
 Phi+0.105
SourceFile+0.102
rics+0.101
Neg logit contributions
jl-0.093
itia-0.092
 favorably-0.091
wake-0.089
eln-0.088
Token,
Token ID11
Feature activation-3.28e-03

Pos logit contributions
jl+0.050
itia+0.049
 favorably+0.049
wake+0.048
eln+0.047
Neg logit contributions
arters-0.063
 Phi-0.050
 balcon-0.049
rics-0.048
SourceFile-0.047
Token Ke
Token ID3873
Feature activation-3.27e-01

Pos logit contributions
jl+0.001
itia+0.001
 favorably+0.001
wake+0.001
eln+0.001
Neg logit contributions
arters-0.001
 Phi-0.001
 balcon-0.001
rics-0.001
SourceFile-0.001
Tokeniji
Token ID20770
Feature activation-1.82e+00

Pos logit contributions
jl+0.060
itia+0.059
 favorably+0.059
wake+0.058
eln+0.057
Neg logit contributions
arters-0.073
 balcon-0.056
 Phi-0.056
SourceFile-0.054
rics-0.054
Token Fuk
Token ID22431
Feature activation+1.28e-01

Pos logit contributions
jl+0.330
itia+0.326
 favorably+0.324
wake+0.318
eln+0.312
Neg logit contributions
arters-0.412
 balcon-0.319
 Phi-0.317
SourceFile-0.308
rics-0.306
Tokenuda
Token ID15339
Feature activation-1.11e-01

Pos logit contributions
arters+0.031
 balcon+0.025
 Phi+0.024
rics+0.024
SourceFile+0.024
Neg logit contributions
jl-0.021
 favorably-0.021
itia-0.021
wake-0.020
eln-0.020
Token,
Token ID11
Feature activation+2.47e-03

Pos logit contributions
jl+0.020
itia+0.020
 favorably+0.020
wake+0.019
eln+0.019
Neg logit contributions
arters-0.025
 Phi-0.020
 balcon-0.020
rics-0.019
SourceFile-0.019
Token said
Token ID531
Feature activation+1.56e-02

Pos logit contributions
arters+0.001
 balcon+0.000
 Phi+0.000
rics+0.000
SourceFile+0.000
Neg logit contributions
jl-0.000
itia-0.000
 favorably-0.000
wake-0.000
eln-0.000
Token the
Token ID262
Feature activation+3.23e-03

Pos logit contributions
arters+0.004
 balcon+0.003
 Phi+0.003
SourceFile+0.003
rics+0.003
Neg logit contributions
jl-0.003
itia-0.003
 favorably-0.003
wake-0.003
eln-0.002
Token:
Token ID25
Feature activation+1.54e-01

Pos logit contributions
arters+0.110
 balcon+0.085
 Phi+0.085
SourceFile+0.082
rics+0.082
Neg logit contributions
jl-0.090
 favorably-0.089
itia-0.089
wake-0.087
eln-0.086
Token File
Token ID9220
Feature activation+1.81e-01

Pos logit contributions
arters+0.034
 balcon+0.027
 Phi+0.026
SourceFile+0.026
rics+0.025
Neg logit contributions
jl-0.028
itia-0.028
 favorably-0.028
wake-0.027
eln-0.027
Token photo
Token ID4590
Feature activation-5.29e-02

Pos logit contributions
arters+0.041
 Phi+0.032
 balcon+0.032
rics+0.030
SourceFile+0.030
Neg logit contributions
jl-0.033
 favorably-0.032
itia-0.032
wake-0.032
eln-0.031
Token by
Token ID416
Feature activation+2.56e-02

Pos logit contributions
jl+0.009
itia+0.009
 favorably+0.009
wake+0.009
eln+0.008
Neg logit contributions
arters-0.013
 balcon-0.010
 Phi-0.010
rics-0.010
SourceFile-0.009
Token J
Token ID449
Feature activation+2.64e-03

Pos logit contributions
arters+0.006
 balcon+0.004
 Phi+0.004
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.005
itia-0.005
 favorably-0.005
wake-0.004
eln-0.004
Tokeniji
Token ID20770
Feature activation-1.80e+00

Pos logit contributions
arters+0.001
 Phi+0.001
 balcon+0.001
rics+0.000
SourceFile+0.000
Neg logit contributions
jl-0.000
 favorably-0.000
itia-0.000
wake-0.000
eln-0.000
Token Press
Token ID4332
Feature activation-9.11e-01

Pos logit contributions
jl+0.330
itia+0.325
 favorably+0.324
wake+0.317
eln+0.312
Neg logit contributions
arters-0.403
 balcon-0.311
 Phi-0.310
SourceFile-0.299
rics-0.298
Token,
Token ID11
Feature activation+2.28e-02

Pos logit contributions
jl+0.164
itia+0.162
 favorably+0.161
wake+0.158
eln+0.155
Neg logit contributions
arters-0.206
 balcon-0.160
 Phi-0.159
SourceFile-0.154
rics-0.153
Token AFP
Token ID8916
Feature activation+7.63e-01

Pos logit contributions
arters+0.005
 balcon+0.004
 Phi+0.004
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.004
itia-0.004
 favorably-0.004
wake-0.004
eln-0.004
Token/
Token ID14
Feature activation+3.35e-01

Pos logit contributions
arters+0.158
 balcon+0.120
 Phi+0.119
rics+0.114
SourceFile+0.114
Neg logit contributions
jl-0.150
 favorably-0.149
itia-0.149
wake-0.146
eln-0.144
TokenGetty
Token ID6633
Feature activation+7.69e-01

Pos logit contributions
arters+0.055
 balcon+0.038
 Phi+0.037
SourceFile+0.036
rics+0.035
Neg logit contributions
jl-0.081
 favorably-0.080
itia-0.080
wake-0.079
eln-0.078
Token told
Token ID1297
Feature activation-1.06e-01

Pos logit contributions
jl+0.033
itia+0.032
 favorably+0.032
wake+0.032
eln+0.031
Neg logit contributions
arters-0.044
 balcon-0.035
 Phi-0.035
rics-0.034
SourceFile-0.033
Token Merkel
Token ID18263
Feature activation-1.67e-01

Pos logit contributions
jl+0.018
itia+0.017
 favorably+0.017
wake+0.017
eln+0.017
Neg logit contributions
arters-0.025
 balcon-0.020
 Phi-0.020
rics-0.019
SourceFile-0.019
Token that
Token ID326
Feature activation+6.93e-03

Pos logit contributions
jl+0.029
itia+0.028
 favorably+0.028
wake+0.027
eln+0.027
Neg logit contributions
arters-0.039
 balcon-0.031
 Phi-0.031
rics-0.030
SourceFile-0.030
Token picking
Token ID10868
Feature activation+6.17e-02

Pos logit contributions
arters+0.002
 balcon+0.001
 Phi+0.001
rics+0.001
SourceFile+0.001
Neg logit contributions
jl-0.001
itia-0.001
 favorably-0.001
wake-0.001
eln-0.001
Token Jun
Token ID7653
Feature activation-4.65e-01

Pos logit contributions
arters+0.015
 balcon+0.012
 Phi+0.012
rics+0.011
SourceFile+0.011
Neg logit contributions
jl-0.010
itia-0.010
 favorably-0.010
wake-0.010
eln-0.010
Tokencker
Token ID15280
Feature activation-1.77e+00

Pos logit contributions
jl+0.026
 favorably+0.025
itia+0.024
wake+0.022
eln+0.021
Neg logit contributions
arters-0.163
 Phi-0.139
 balcon-0.139
rics-0.136
SourceFile-0.136
Token would
Token ID561
Feature activation-7.31e-02

Pos logit contributions
jl+0.315
itia+0.310
 favorably+0.307
wake+0.303
eln+0.297
Neg logit contributions
arters-0.406
 balcon-0.317
 Phi-0.315
rics-0.303
SourceFile-0.303
Token "
Token ID366
Feature activation-3.11e-01

Pos logit contributions
jl+0.013
itia+0.013
 favorably+0.013
wake+0.013
eln+0.012
Neg logit contributions
arters-0.017
 balcon-0.013
 Phi-0.013
rics-0.012
SourceFile-0.012
Tokendest
Token ID16520
Feature activation+7.28e-01

Pos logit contributions
jl+0.052
itia+0.051
 favorably+0.051
wake+0.049
eln+0.048
Neg logit contributions
arters-0.075
 balcon-0.059
 Phi-0.059
SourceFile-0.057
rics-0.057
Tokenabil
Token ID14991
Feature activation+1.87e-01

Pos logit contributions
arters+0.159
 balcon+0.123
 Phi+0.122
SourceFile+0.119
rics+0.117
Neg logit contributions
jl-0.137
itia-0.135
 favorably-0.134
wake-0.132
eln-0.130
Tokenize
Token ID1096
Feature activation+4.17e-01

Pos logit contributions
arters+0.040
 balcon+0.031
 Phi+0.031
SourceFile+0.030
rics+0.029
Neg logit contributions
jl-0.036
itia-0.036
 favorably-0.035
wake-0.035
eln-0.034
Token Jean
Token ID11320
Feature activation+9.20e-02

Pos logit contributions
arters+0.026
 Phi+0.020
 balcon+0.020
rics+0.019
SourceFile+0.019
Neg logit contributions
jl-0.021
itia-0.021
 favorably-0.021
wake-0.021
eln-0.020
Token-
Token ID12
Feature activation-7.67e-02

Pos logit contributions
arters+0.022
 Phi+0.017
 balcon+0.017
rics+0.016
SourceFile+0.016
Neg logit contributions
jl-0.016
 favorably-0.015
itia-0.015
wake-0.015
eln-0.015
TokenCla
Token ID47404
Feature activation-5.02e-01

Pos logit contributions
jl+0.011
 favorably+0.011
itia+0.011
eln+0.010
wake+0.010
Neg logit contributions
arters-0.020
 balcon-0.016
 Phi-0.016
rics-0.016
Toronto-0.016
Tokenude
Token ID2507
Feature activation+8.44e-02

Pos logit contributions
jl+0.082
itia+0.081
 favorably+0.080
wake+0.078
eln+0.077
Neg logit contributions
arters-0.122
 balcon-0.097
 Phi-0.096
SourceFile-0.094
rics-0.093
Token Jun
Token ID7653
Feature activation-4.05e-01

Pos logit contributions
arters+0.023
 balcon+0.019
 Phi+0.019
SourceFile+0.018
rics+0.018
Neg logit contributions
jl-0.011
itia-0.011
 favorably-0.011
wake-0.011
eln-0.010
Tokencker
Token ID15280
Feature activation-1.77e+00

Pos logit contributions
jl+0.024
itia+0.023
 favorably+0.023
wake+0.021
iji+0.019
Neg logit contributions
arters-0.141
 Phi-0.120
 balcon-0.120
rics-0.117
SourceFile-0.116
Token,
Token ID11
Feature activation+1.68e-02

Pos logit contributions
jl+0.312
itia+0.306
 favorably+0.304
wake+0.300
eln+0.293
Neg logit contributions
arters-0.406
 balcon-0.318
 Phi-0.317
rics-0.306
SourceFile-0.303
Token she
Token ID673
Feature activation+1.01e-01

Pos logit contributions
arters+0.004
 balcon+0.003
 Phi+0.003
rics+0.003
SourceFile+0.003
Neg logit contributions
jl-0.003
itia-0.003
 favorably-0.003
wake-0.003
eln-0.003
Token hoped
Token ID10719
Feature activation+5.51e-02

Pos logit contributions
arters+0.023
 balcon+0.018
 Phi+0.018
rics+0.017
SourceFile+0.017
Neg logit contributions
jl-0.018
itia-0.018
 favorably-0.017
wake-0.017
eln-0.017
Token the
Token ID262
Feature activation-6.73e-03

Pos logit contributions
arters+0.013
 balcon+0.010
 Phi+0.010
rics+0.010
SourceFile+0.010
Neg logit contributions
jl-0.009
itia-0.009
 favorably-0.009
wake-0.009
eln-0.009
Token dispute
Token ID11047
Feature activation+9.48e-02

Pos logit contributions
jl+0.001
itia+0.001
 favorably+0.001
wake+0.001
iji+0.001
Neg logit contributions
arters-0.002
 balcon-0.001
 Phi-0.001
rics-0.001
SourceFile-0.001
Token she
Token ID673
Feature activation+1.23e-01

Pos logit contributions
arters+0.007
 balcon+0.006
 Phi+0.006
rics+0.006
SourceFile+0.005
Neg logit contributions
jl-0.005
itia-0.005
 favorably-0.005
wake-0.005
eln-0.005
Token was
Token ID373
Feature activation+1.27e-01

Pos logit contributions
arters+0.029
 balcon+0.022
 Phi+0.022
rics+0.022
SourceFile+0.021
Neg logit contributions
jl-0.021
itia-0.021
 favorably-0.021
wake-0.021
eln-0.020
Token pressing
Token ID12273
Feature activation-2.39e-01

Pos logit contributions
arters+0.031
 balcon+0.025
 Phi+0.025
rics+0.024
Toronto+0.024
Neg logit contributions
jl-0.020
itia-0.020
wake-0.019
 favorably-0.019
eln-0.019
Token for
Token ID329
Feature activation+7.51e-02

Pos logit contributions
jl+0.041
itia+0.040
 favorably+0.040
wake+0.039
eln+0.038
Neg logit contributions
arters-0.056
 balcon-0.044
 Phi-0.044
rics-0.043
SourceFile-0.042
Token Jun
Token ID7653
Feature activation-3.90e-01

Pos logit contributions
arters+0.018
 balcon+0.015
 Phi+0.014
rics+0.014
SourceFile+0.014
Neg logit contributions
jl-0.012
itia-0.012
 favorably-0.012
wake-0.012
eln-0.012
Tokencker
Token ID15280
Feature activation-1.75e+00

Pos logit contributions
jl+0.021
 favorably+0.020
itia+0.020
wake+0.019
eln+0.017
Neg logit contributions
arters-0.137
 Phi-0.117
 balcon-0.117
SourceFile-0.114
rics-0.114
Token to
Token ID284
Feature activation+4.35e-02

Pos logit contributions
jl+0.312
itia+0.306
 favorably+0.304
wake+0.299
eln+0.293
Neg logit contributions
arters-0.401
 balcon-0.313
 Phi-0.311
rics-0.300
SourceFile-0.300
Token take
Token ID1011
Feature activation-2.34e-01

Pos logit contributions
arters+0.010
 balcon+0.008
 Phi+0.008
rics+0.008
SourceFile+0.007
Neg logit contributions
jl-0.008
itia-0.008
 favorably-0.007
wake-0.007
eln-0.007
Token the
Token ID262
Feature activation-1.01e-02

Pos logit contributions
jl+0.041
itia+0.041
 favorably+0.040
wake+0.040
eln+0.039
Neg logit contributions
arters-0.054
 balcon-0.042
 Phi-0.042
rics-0.040
SourceFile-0.040
Token reins
Token ID35627
Feature activation-4.35e-01

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.002
wake+0.002
eln+0.002
Neg logit contributions
arters-0.002
 balcon-0.002
 Phi-0.002
rics-0.002
Toronto-0.002
Token at
Token ID379
Feature activation+5.33e-02

Pos logit contributions
jl+0.073
itia+0.072
 favorably+0.070
wake+0.070
eln+0.069
Neg logit contributions
arters-0.102
 balcon-0.081
 Phi-0.081
rics-0.078
SourceFile-0.077
Token is
Token ID318
Feature activation+4.15e-03

Pos logit contributions
jl+0.063
itia+0.062
 favorably+0.061
wake+0.060
eln+0.059
Neg logit contributions
arters-0.081
 balcon-0.063
 Phi-0.063
SourceFile-0.061
rics-0.061
Token deeply
Token ID7744
Feature activation-8.86e-01

Pos logit contributions
arters+0.001
 balcon+0.001
 Phi+0.001
rics+0.001
SourceFile+0.001
Neg logit contributions
jl-0.001
itia-0.001
 favorably-0.001
wake-0.001
eln-0.001
Token wary
Token ID23300
Feature activation-2.02e-01

Pos logit contributions
jl+0.157
itia+0.155
 favorably+0.153
wake+0.150
eln+0.148
Neg logit contributions
arters-0.203
 balcon-0.159
 Phi-0.158
rics-0.152
SourceFile-0.152
Token of
Token ID286
Feature activation-2.08e-02

Pos logit contributions
jl+0.032
itia+0.032
 favorably+0.032
wake+0.031
eln+0.030
Neg logit contributions
arters-0.050
 balcon-0.040
 Phi-0.039
rics-0.038
SourceFile-0.038
Token Jun
Token ID7653
Feature activation-3.91e-01

Pos logit contributions
jl+0.003
itia+0.003
 favorably+0.003
wake+0.003
eln+0.003
Neg logit contributions
arters-0.005
 balcon-0.004
 Phi-0.004
rics-0.004
SourceFile-0.004
Tokencker
Token ID15280
Feature activation-1.75e+00

Pos logit contributions
jl+0.021
 favorably+0.020
itia+0.020
wake+0.018
eln+0.017
Neg logit contributions
arters-0.138
 Phi-0.118
 balcon-0.118
SourceFile-0.115
rics-0.115
Token,
Token ID11
Feature activation+1.30e-02

Pos logit contributions
jl+0.331
itia+0.326
 favorably+0.325
wake+0.319
eln+0.314
Neg logit contributions
arters-0.380
 balcon-0.292
 Phi-0.290
SourceFile-0.280
rics-0.279
Token an
Token ID281
Feature activation+1.83e-02

Pos logit contributions
arters+0.003
 balcon+0.002
 Phi+0.002
rics+0.002
SourceFile+0.002
Neg logit contributions
jl-0.002
itia-0.002
 favorably-0.002
wake-0.002
eln-0.002
Token EU
Token ID4576
Feature activation-3.83e-01

Pos logit contributions
arters+0.004
 balcon+0.003
 Phi+0.003
SourceFile+0.003
rics+0.003
Neg logit contributions
jl-0.003
 favorably-0.003
itia-0.003
wake-0.003
eln-0.003
Token federal
Token ID2717
Feature activation-7.44e-02

Pos logit contributions
jl+0.063
itia+0.062
 favorably+0.062
wake+0.060
eln+0.059
Neg logit contributions
arters-0.092
 balcon-0.073
 Phi-0.073
rics-0.070
SourceFile-0.070
Tokenist
Token ID396
Feature activation+5.35e-02

Pos logit contributions
jl+0.014
itia+0.014
 favorably+0.014
wake+0.014
eln+0.014
Neg logit contributions
arters-0.016
 balcon-0.012
 Phi-0.012
rics-0.011
SourceFile-0.011
Token are
Token ID389
Feature activation+5.07e-02

Pos logit contributions
arters+0.009
 balcon+0.007
 Phi+0.007
rics+0.006
SourceFile+0.006
Neg logit contributions
jl-0.007
itia-0.007
 favorably-0.007
wake-0.007
eln-0.007
Token also
Token ID635
Feature activation+2.71e-01

Pos logit contributions
arters+0.012
 balcon+0.009
 Phi+0.009
rics+0.009
SourceFile+0.009
Neg logit contributions
jl-0.009
itia-0.009
wake-0.008
 favorably-0.008
eln-0.008
Token hostile
Token ID12524
Feature activation-3.92e-01

Pos logit contributions
arters+0.064
 balcon+0.051
 Phi+0.051
rics+0.049
SourceFile+0.048
Neg logit contributions
jl-0.046
itia-0.045
 favorably-0.044
wake-0.044
eln-0.043
Token to
Token ID284
Feature activation+1.94e-02

Pos logit contributions
jl+0.067
itia+0.066
 favorably+0.066
wake+0.065
eln+0.063
Neg logit contributions
arters-0.092
 balcon-0.072
 Phi-0.072
rics-0.069
SourceFile-0.069
Token Jun
Token ID7653
Feature activation-4.17e-01

Pos logit contributions
arters+0.005
 balcon+0.004
 Phi+0.004
rics+0.004
SourceFile+0.004
Neg logit contributions
jl-0.003
itia-0.003
 favorably-0.003
wake-0.003
eln-0.003
Tokencker
Token ID15280
Feature activation-1.74e+00

Pos logit contributions
jl+0.023
 favorably+0.022
itia+0.022
wake+0.020
eln+0.018
Neg logit contributions
arters-0.147
 Phi-0.125
 balcon-0.125
SourceFile-0.122
rics-0.122
Token.
Token ID13
Feature activation-4.07e-02

Pos logit contributions
jl+0.305
itia+0.301
 favorably+0.298
wake+0.293
eln+0.287
Neg logit contributions
arters-0.401
 balcon-0.314
 Phi-0.312
rics-0.301
SourceFile-0.301
Token<|endoftext|>
Token ID50256
Feature activation-9.90e-02

Pos logit contributions
jl+0.007
 favorably+0.007
itia+0.007
wake+0.007
eln+0.006
Neg logit contributions
arters-0.010
 Phi-0.008
 balcon-0.008
rics-0.007
SourceFile-0.007
TokenIll
Token ID21478
Feature activation-3.07e-01

Pos logit contributions
jl+0.018
itia+0.017
 favorably+0.017
wake+0.017
eln+0.017
Neg logit contributions
arters-0.023
 balcon-0.017
 Phi-0.017
SourceFile-0.017
rics-0.017
Tokenustration
Token ID44027
Feature activation+4.85e-01

Pos logit contributions
jl+0.059
itia+0.058
 favorably+0.058
wake+0.057
eln+0.056
Neg logit contributions
arters-0.065
 balcon-0.050
 Phi-0.050
rics-0.048
SourceFile-0.048
Token by
Token ID416
Feature activation+3.28e-02

Pos logit contributions
arters+0.113
 balcon+0.089
 Phi+0.089
SourceFile+0.085
rics+0.085
Neg logit contributions
jl-0.084
itia-0.083
 favorably-0.082
wake-0.080
eln-0.079
Token Del
Token ID4216
Feature activation+1.97e-01

Pos logit contributions
jl+0.016
itia+0.016
 favorably+0.015
wake+0.015
eln+0.015
Neg logit contributions
arters-0.022
 balcon-0.018
 Phi-0.017
SourceFile-0.017
rics-0.017
Tokenmore
Token ID3549
Feature activation+5.46e-01

Pos logit contributions
arters+0.044
 balcon+0.034
 Phi+0.034
SourceFile+0.032
rics+0.032
Neg logit contributions
jl-0.036
itia-0.036
 favorably-0.036
wake-0.035
eln-0.034
Token Schwartz
Token ID28672
Feature activation-3.29e-01

Pos logit contributions
arters+0.124
 balcon+0.096
 Phi+0.096
rics+0.093
SourceFile+0.092
Neg logit contributions
jl-0.098
itia-0.096
 favorably-0.096
wake-0.094
eln-0.093
Token compared
Token ID3688
Feature activation+1.77e-01

Pos logit contributions
jl+0.058
itia+0.057
 favorably+0.056
wake+0.055
eln+0.054
Neg logit contributions
arters-0.076
 balcon-0.059
 Phi-0.059
rics-0.057
SourceFile-0.057
Token him
Token ID683
Feature activation-1.11e-01

Pos logit contributions
arters+0.043
 balcon+0.034
 Phi+0.034
rics+0.033
SourceFile+0.033
Neg logit contributions
jl-0.029
itia-0.028
 favorably-0.028
wake-0.027
eln-0.027
Token favorably
Token ID40594
Feature activation-1.70e+00

Pos logit contributions
jl+0.019
itia+0.019
 favorably+0.018
wake+0.018
eln+0.018
Neg logit contributions
arters-0.026
 balcon-0.021
 Phi-0.020
rics-0.020
SourceFile-0.020
Token with
Token ID351
Feature activation-4.33e-02

Pos logit contributions
jl+0.293
itia+0.288
 favorably+0.283
wake+0.281
eln+0.275
Neg logit contributions
arters-0.397
 balcon-0.313
 Phi-0.312
rics-0.301
SourceFile-0.299
Token Ye
Token ID11609
Feature activation-7.55e-02

Pos logit contributions
jl+0.008
itia+0.008
 favorably+0.008
wake+0.008
eln+0.007
Neg logit contributions
arters-0.010
 balcon-0.008
 Phi-0.008
SourceFile-0.007
rics-0.007
Tokenats
Token ID1381
Feature activation-4.34e-01

Pos logit contributions
jl+0.011
itia+0.011
 favorably+0.011
wake+0.011
eln+0.010
Neg logit contributions
arters-0.019
 balcon-0.016
 Phi-0.016
rics-0.015
SourceFile-0.015
Token --
Token ID1377
Feature activation-6.57e-02

Pos logit contributions
jl+0.073
itia+0.072
 favorably+0.072
wake+0.070
eln+0.069
Neg logit contributions
arters-0.103
 balcon-0.081
 Phi-0.081
SourceFile-0.079
rics-0.078
Token but
Token ID475
Feature activation+4.81e-02

Pos logit contributions
jl+0.011
itia+0.011
 favorably+0.010
wake+0.010
eln+0.010
Neg logit contributions
arters-0.016
 balcon-0.013
 Phi-0.013
SourceFile-0.012
rics-0.012
Token polls
Token ID9231
Feature activation+1.04e-02

Pos logit contributions
arters+0.008
 balcon+0.006
 Phi+0.006
SourceFile+0.006
rics+0.006
Neg logit contributions
jl-0.006
itia-0.006
 favorably-0.006
wake-0.006
eln-0.006
Token\n
Token ID198
Feature activation-1.49e-02

Pos logit contributions
arters+0.002
 balcon+0.002
 Phi+0.002
SourceFile+0.002
rics+0.002
Neg logit contributions
jl-0.002
 favorably-0.002
itia-0.002
wake-0.002
eln-0.002
Token\n
Token ID198
Feature activation+1.59e-03

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.002
wake+0.002
eln+0.002
Neg logit contributions
arters-0.004
 balcon-0.003
 Phi-0.003
SourceFile-0.003
rics-0.003
TokenA
Token ID32
Feature activation-2.84e-01

Pos logit contributions
arters+0.000
 balcon+0.000
 Phi+0.000
SourceFile+0.000
rics+0.000
Neg logit contributions
jl-0.000
itia-0.000
 favorably-0.000
wake-0.000
eln-0.000
Token G
Token ID402
Feature activation+7.81e-01

Pos logit contributions
jl+0.050
itia+0.049
 favorably+0.049
wake+0.048
eln+0.047
Neg logit contributions
arters-0.065
 balcon-0.051
 Phi-0.051
SourceFile-0.049
rics-0.049
Tokenravis
Token ID16956
Feature activation-1.60e+00

Pos logit contributions
arters+0.248
 Phi+0.207
 balcon+0.206
rics+0.203
SourceFile+0.200
Neg logit contributions
jl-0.071
itia-0.069
 favorably-0.068
wake-0.065
iji-0.062
Token poll
Token ID3278
Feature activation+5.64e-01

Pos logit contributions
jl+0.319
itia+0.314
 favorably+0.311
wake+0.307
eln+0.303
Neg logit contributions
arters-0.333
 Phi-0.252
 balcon-0.252
rics-0.240
SourceFile-0.240
Token shows
Token ID2523
Feature activation-2.33e-01

Pos logit contributions
arters+0.113
 balcon+0.085
 Phi+0.085
rics+0.081
SourceFile+0.081
Neg logit contributions
jl-0.116
itia-0.114
 favorably-0.114
wake-0.112
eln-0.110
Token that
Token ID326
Feature activation-9.16e-04

Pos logit contributions
jl+0.041
itia+0.040
 favorably+0.040
wake+0.039
eln+0.039
Neg logit contributions
arters-0.054
 balcon-0.042
 Phi-0.042
SourceFile-0.040
rics-0.040
Token Mr
Token ID1770
Feature activation-1.09e-01

Pos logit contributions
jl+0.000
itia+0.000
 favorably+0.000
wake+0.000
eln+0.000
Neg logit contributions
arters-0.000
 balcon-0.000
 Phi-0.000
SourceFile-0.000
rics-0.000
Token.
Token ID13
Feature activation-4.39e-02

Pos logit contributions
jl+0.018
itia+0.018
 favorably+0.018
wake+0.017
eln+0.017
Neg logit contributions
arters-0.026
 balcon-0.021
 Phi-0.021
SourceFile-0.020
rics-0.020
Token point
Token ID966
Feature activation+2.97e-02

Pos logit contributions
arters+0.107
 balcon+0.086
 Phi+0.086
SourceFile+0.083
rics+0.082
Neg logit contributions
jl-0.072
itia-0.071
 favorably-0.070
wake-0.068
eln-0.068
Token about
Token ID546
Feature activation+1.22e-01

Pos logit contributions
arters+0.006
 balcon+0.005
 Phi+0.005
SourceFile+0.005
rics+0.005
Neg logit contributions
jl-0.006
itia-0.006
 favorably-0.006
wake-0.005
eln-0.005
Token how
Token ID703
Feature activation+1.30e-03

Pos logit contributions
arters+0.028
 balcon+0.022
 Phi+0.022
SourceFile+0.021
rics+0.021
Neg logit contributions
jl-0.021
itia-0.021
 favorably-0.021
wake-0.020
eln-0.020
Token these
Token ID777
Feature activation+2.07e-01

Pos logit contributions
arters+0.000
 balcon+0.000
 Phi+0.000
rics+0.000
SourceFile+0.000
Neg logit contributions
jl-0.000
itia-0.000
 favorably-0.000
wake-0.000
eln-0.000
Token gam
Token ID9106
Feature activation+3.80e-02

Pos logit contributions
arters+0.048
 balcon+0.038
 Phi+0.037
SourceFile+0.036
rics+0.036
Neg logit contributions
jl-0.036
itia-0.035
 favorably-0.035
wake-0.035
eln-0.034
Tokeneworld
Token ID38136
Feature activation-1.60e+00

Pos logit contributions
arters+0.009
 balcon+0.007
 Phi+0.007
rics+0.007
enders+0.007
Neg logit contributions
jl-0.006
 favorably-0.006
itia-0.006
wake-0.006
eln-0.006
Tokens
Token ID82
Feature activation-1.49e-02

Pos logit contributions
jl+0.231
itia+0.226
 favorably+0.225
wake+0.219
eln+0.214
Neg logit contributions
arters-0.420
 balcon-0.339
 Phi-0.338
rics-0.328
SourceFile-0.327
Token function
Token ID2163
Feature activation-3.34e-02

Pos logit contributions
jl+0.003
itia+0.003
 favorably+0.003
wake+0.003
eln+0.003
Neg logit contributions
arters-0.003
 balcon-0.003
 Phi-0.003
SourceFile-0.002
rics-0.002
Token for
Token ID329
Feature activation+7.32e-02

Pos logit contributions
jl+0.006
itia+0.006
 favorably+0.006
wake+0.006
eln+0.005
Neg logit contributions
arters-0.008
 balcon-0.006
 Phi-0.006
rics-0.006
SourceFile-0.006
Token players
Token ID1938
Feature activation+4.14e-01

Pos logit contributions
arters+0.017
 balcon+0.013
 Phi+0.013
SourceFile+0.012
rics+0.012
Neg logit contributions
jl-0.013
itia-0.013
 favorably-0.013
wake-0.013
eln-0.012
Token:
Token ID25
Feature activation+1.39e-01

Pos logit contributions
arters+0.094
 balcon+0.073
 Phi+0.073
rics+0.070
SourceFile+0.070
Neg logit contributions
jl-0.074
itia-0.073
 favorably-0.072
wake-0.071
eln-0.070
Token teenagers
Token ID17366
Feature activation-2.72e-01

Pos logit contributions
arters+0.025
 balcon+0.019
 Phi+0.019
SourceFile+0.019
rics+0.019
Neg logit contributions
jl-0.023
itia-0.022
 favorably-0.022
wake-0.022
eln-0.021
Token,
Token ID11
Feature activation-1.55e-02

Pos logit contributions
jl+0.047
itia+0.047
 favorably+0.046
wake+0.045
eln+0.045
Neg logit contributions
arters-0.063
 balcon-0.049
 Phi-0.049
rics-0.047
SourceFile-0.047
Token their
Token ID511
Feature activation+7.95e-02

Pos logit contributions
jl+0.003
itia+0.003
 favorably+0.003
wake+0.002
eln+0.002
Neg logit contributions
arters-0.004
 balcon-0.003
 Phi-0.003
rics-0.003
SourceFile-0.003
Token sleep
Token ID3993
Feature activation-9.00e-01

Pos logit contributions
arters+0.020
 balcon+0.016
 Phi+0.016
SourceFile+0.015
rics+0.015
Neg logit contributions
jl-0.012
itia-0.012
 favorably-0.012
wake-0.012
eln-0.012
Token-
Token ID12
Feature activation-7.48e-02

Pos logit contributions
jl+0.166
itia+0.163
 favorably+0.162
wake+0.158
eln+0.156
Neg logit contributions
arters-0.201
 balcon-0.155
 Phi-0.154
rics-0.149
SourceFile-0.149
Tokenwake
Token ID48530
Feature activation-1.59e+00

Pos logit contributions
jl+0.007
itia+0.007
 favorably+0.007
eln+0.007
iji+0.006
Neg logit contributions
arters-0.023
 balcon-0.019
 Phi-0.019
rics-0.019
Toronto-0.019
Token cycle
Token ID6772
Feature activation+1.41e-01

Pos logit contributions
jl+0.312
itia+0.307
 favorably+0.305
wake+0.298
eln+0.295
Neg logit contributions
arters-0.337
 balcon-0.256
 Phi-0.255
rics-0.245
SourceFile-0.245
Token shifts
Token ID15381
Feature activation+5.46e-01

Pos logit contributions
arters+0.033
 balcon+0.026
 Phi+0.026
rics+0.025
SourceFile+0.025
Neg logit contributions
jl-0.024
itia-0.024
 favorably-0.023
wake-0.023
eln-0.023
Token two
Token ID734
Feature activation-2.72e-01

Pos logit contributions
arters+0.132
 balcon+0.105
 Phi+0.104
rics+0.100
SourceFile+0.100
Neg logit contributions
jl-0.090
itia-0.089
 favorably-0.087
wake-0.086
eln-0.085
Token hours
Token ID2250
Feature activation+3.45e-01

Pos logit contributions
jl+0.054
itia+0.053
 favorably+0.053
wake+0.052
eln+0.051
Neg logit contributions
arters-0.057
 balcon-0.043
 Phi-0.043
rics-0.041
SourceFile-0.041
Token later
Token ID1568
Feature activation-7.64e-01

Pos logit contributions
arters+0.082
 balcon+0.065
 Phi+0.064
rics+0.062
SourceFile+0.062
Neg logit contributions
jl-0.058
itia-0.057
 favorably-0.057
wake-0.055
eln-0.055
Token
Token ID251
Feature activation+1.16e-02

Pos logit contributions
jl+0.005
itia+0.005
 favorably+0.005
wake+0.004
eln+0.004
Neg logit contributions
arters-0.005
 balcon-0.004
 Phi-0.004
SourceFile-0.004
rics-0.004
Token\n
Token ID198
Feature activation-6.95e-03

Pos logit contributions
arters+0.003
 balcon+0.002
 Phi+0.002
SourceFile+0.002
rics+0.002
Neg logit contributions
jl-0.002
itia-0.002
 favorably-0.002
wake-0.002
eln-0.002
Token\n
Token ID198
Feature activation-3.47e-02

Pos logit contributions
jl+0.001
itia+0.001
 favorably+0.001
wake+0.001
eln+0.001
Neg logit contributions
arters-0.002
 balcon-0.001
 Phi-0.001
SourceFile-0.001
rics-0.001
TokenIn
Token ID818
Feature activation-4.82e-02

Pos logit contributions
jl+0.006
itia+0.006
 favorably+0.006
wake+0.006
eln+0.006
Neg logit contributions
arters-0.008
 balcon-0.006
 Phi-0.006
SourceFile-0.006
rics-0.006
Token the
Token ID262
Feature activation-2.50e-02

Pos logit contributions
jl+0.008
itia+0.008
 favorably+0.008
wake+0.008
eln+0.008
Neg logit contributions
arters-0.011
 balcon-0.009
 Phi-0.009
SourceFile-0.009
rics-0.009
Token wake
Token ID7765
Feature activation-1.58e+00

Pos logit contributions
jl+0.004
itia+0.004
 favorably+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.006
 balcon-0.005
 Phi-0.005
SourceFile-0.005
rics-0.005
Token of
Token ID286
Feature activation-2.89e-02

Pos logit contributions
jl+0.242
itia+0.237
 favorably+0.236
wake+0.230
eln+0.225
Neg logit contributions
arters-0.402
 balcon-0.322
 Phi-0.321
SourceFile-0.311
rics-0.310
Token climate
Token ID4258
Feature activation-5.83e-01

Pos logit contributions
jl+0.005
itia+0.005
 favorably+0.005
wake+0.005
eln+0.005
Neg logit contributions
arters-0.007
 balcon-0.005
 Phi-0.005
SourceFile-0.005
rics-0.005
Token change
Token ID1487
Feature activation-4.79e-01

Pos logit contributions
jl+0.078
itia+0.076
 favorably+0.075
wake+0.073
eln+0.072
Neg logit contributions
arters-0.160
 balcon-0.130
 Phi-0.130
rics-0.126
SourceFile-0.126
Token,
Token ID11
Feature activation+3.93e-03

Pos logit contributions
jl+0.083
itia+0.082
 favorably+0.082
wake+0.080
eln+0.078
Neg logit contributions
arters-0.111
 balcon-0.087
 Phi-0.087
SourceFile-0.084
rics-0.084
Token harmful
Token ID13568
Feature activation-6.97e-01

Pos logit contributions
arters+0.001
 balcon+0.001
 Phi+0.001
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.001
itia-0.001
 favorably-0.001
wake-0.001
eln-0.001
Tokenation
Token ID341
Feature activation+1.71e-01

Pos logit contributions
arters+0.063
 balcon+0.050
 Phi+0.050
SourceFile+0.048
rics+0.048
Neg logit contributions
jl-0.045
itia-0.044
 favorably-0.044
wake-0.043
eln-0.042
Token of
Token ID286
Feature activation-7.22e-03

Pos logit contributions
arters+0.042
 balcon+0.033
 Phi+0.033
rics+0.032
SourceFile+0.032
Neg logit contributions
jl-0.028
itia-0.027
 favorably-0.027
wake-0.027
eln-0.026
Token Christ
Token ID1951
Feature activation-2.82e-01

Pos logit contributions
jl+0.001
itia+0.001
 favorably+0.001
wake+0.001
eln+0.001
Neg logit contributions
arters-0.002
 balcon-0.001
 Phi-0.001
SourceFile-0.001
rics-0.001
Token was
Token ID373
Feature activation+1.08e-01

Pos logit contributions
jl+0.050
itia+0.049
 favorably+0.049
wake+0.048
eln+0.047
Neg logit contributions
arters-0.065
 balcon-0.051
 Phi-0.051
SourceFile-0.049
rics-0.049
Token protested
Token ID27278
Feature activation-2.01e-02

Pos logit contributions
arters+0.026
 balcon+0.020
 Phi+0.020
SourceFile+0.019
rics+0.019
Neg logit contributions
jl-0.019
itia-0.018
 favorably-0.018
wake-0.018
eln-0.017
Token vigorously
Token ID31609
Feature activation-1.58e+00

Pos logit contributions
jl+0.003
itia+0.003
 favorably+0.003
wake+0.003
eln+0.003
Neg logit contributions
arters-0.005
 balcon-0.004
 Phi-0.004
SourceFile-0.004
rics-0.004
Token.
Token ID13
Feature activation-4.05e-02

Pos logit contributions
jl+0.263
itia+0.259
 favorably+0.258
wake+0.252
eln+0.247
Neg logit contributions
arters-0.380
 balcon-0.300
 Phi-0.298
SourceFile-0.289
rics-0.288
Token The
Token ID383
Feature activation+5.21e-02

Pos logit contributions
jl+0.007
itia+0.007
 favorably+0.007
wake+0.006
eln+0.006
Neg logit contributions
arters-0.010
 Phi-0.008
 balcon-0.008
SourceFile-0.008
rics-0.007
Token difference
Token ID3580
Feature activation-1.19e+00

Pos logit contributions
arters+0.012
 balcon+0.009
 Phi+0.009
SourceFile+0.009
rics+0.009
Neg logit contributions
jl-0.009
itia-0.009
 favorably-0.009
wake-0.009
eln-0.009
Token is
Token ID318
Feature activation-2.29e-02

Pos logit contributions
jl+0.194
itia+0.191
 favorably+0.190
wake+0.186
eln+0.182
Neg logit contributions
arters-0.291
 balcon-0.231
 Phi-0.230
SourceFile-0.223
rics-0.222
Token that
Token ID326
Feature activation+1.77e-02

Pos logit contributions
jl+0.004
itia+0.004
 favorably+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.005
 balcon-0.004
 Phi-0.004
SourceFile-0.004
rics-0.004
Token-
Token ID12
Feature activation-7.55e-02

Pos logit contributions
jl+0.061
 favorably+0.060
itia+0.060
wake+0.059
eln+0.058
Neg logit contributions
arters-0.088
 Phi-0.069
 balcon-0.069
rics-0.067
SourceFile-0.066
Token08
Token ID2919
Feature activation-3.44e-01

Pos logit contributions
jl+0.014
itia+0.013
 favorably+0.013
wake+0.013
eln+0.013
Neg logit contributions
arters-0.017
 balcon-0.013
 Phi-0.013
SourceFile-0.013
rics-0.013
Token Election
Token ID14219
Feature activation+8.74e-02

Pos logit contributions
jl+0.061
itia+0.060
 favorably+0.060
wake+0.059
eln+0.058
Neg logit contributions
arters-0.079
 Phi-0.061
 balcon-0.061
rics-0.059
SourceFile-0.059
Token-
Token ID12
Feature activation-8.07e-02

Pos logit contributions
arters+0.021
 Phi+0.017
 balcon+0.017
rics+0.016
SourceFile+0.016
Neg logit contributions
jl-0.014
itia-0.014
 favorably-0.014
wake-0.014
eln-0.014
TokenT
Token ID51
Feature activation-2.16e-01

Pos logit contributions
jl+0.014
itia+0.014
 favorably+0.014
wake+0.014
eln+0.014
Neg logit contributions
arters-0.018
 balcon-0.014
 Phi-0.014
SourceFile-0.014
rics-0.014
Tokenravis
Token ID16956
Feature activation-1.58e+00

Pos logit contributions
jl+0.038
 favorably+0.038
itia+0.038
wake+0.037
eln+0.036
Neg logit contributions
arters-0.049
 balcon-0.038
 Phi-0.038
rics-0.037
SourceFile-0.037
Token &
Token ID1222
Feature activation-1.50e-01

Pos logit contributions
jl+0.288
itia+0.283
 favorably+0.282
wake+0.277
eln+0.272
Neg logit contributions
arters-0.354
 balcon-0.274
 Phi-0.272
SourceFile-0.263
rics-0.262
Token Protect
Token ID21916
Feature activation+1.12e-01

Pos logit contributions
jl+0.028
itia+0.027
 favorably+0.027
wake+0.027
eln+0.026
Neg logit contributions
arters-0.033
 balcon-0.026
 Phi-0.026
SourceFile-0.025
rics-0.025
Tokening
Token ID278
Feature activation-8.79e-02

Pos logit contributions
arters+0.026
 Phi+0.020
 balcon+0.020
rics+0.020
SourceFile+0.020
Neg logit contributions
jl-0.019
 favorably-0.019
itia-0.019
wake-0.019
eln-0.018
Token Your
Token ID3406
Feature activation-1.94e-01

Pos logit contributions
jl+0.016
 favorably+0.016
itia+0.016
wake+0.015
eln+0.015
Neg logit contributions
arters-0.020
 Phi-0.016
 balcon-0.015
rics-0.015
SourceFile-0.015
Token Girl
Token ID7430
Feature activation-2.67e-01

Pos logit contributions
jl+0.033
 favorably+0.033
itia+0.033
wake+0.032
eln+0.031
Neg logit contributions
arters-0.046
 Phi-0.036
 balcon-0.036
SourceFile-0.035
rics-0.034
Token from
Token ID422
Feature activation-1.37e-01

Pos logit contributions
arters+0.078
 balcon+0.062
 Phi+0.061
SourceFile+0.059
rics+0.059
Neg logit contributions
jl-0.051
itia-0.050
 favorably-0.050
wake-0.049
eln-0.048
Token Mid
Token ID7215
Feature activation+5.40e-01

Pos logit contributions
jl+0.028
itia+0.028
 favorably+0.028
wake+0.027
eln+0.027
Neg logit contributions
arters-0.028
 balcon-0.021
 Phi-0.021
SourceFile-0.020
rics-0.020
Tokentown
Token ID12735
Feature activation-2.02e-01

Pos logit contributions
arters+0.135
 Phi+0.107
 balcon+0.105
Toronto+0.103
rics+0.102
Neg logit contributions
jl-0.085
 favorably-0.084
itia-0.083
wake-0.080
iji-0.079
Token via
Token ID2884
Feature activation+1.92e-01

Pos logit contributions
jl+0.038
itia+0.037
 favorably+0.037
wake+0.037
eln+0.036
Neg logit contributions
arters-0.044
 balcon-0.034
 Phi-0.034
SourceFile-0.033
rics-0.032
Token Harris
Token ID10026
Feature activation+3.26e-02

Pos logit contributions
arters+0.042
 balcon+0.032
 Phi+0.032
SourceFile+0.031
rics+0.031
Neg logit contributions
jl-0.036
itia-0.036
 favorably-0.036
wake-0.035
eln-0.034
Tokenburg
Token ID7423
Feature activation-1.56e+00

Pos logit contributions
arters+0.011
 balcon+0.009
 Phi+0.009
SourceFile+0.009
rics+0.009
Neg logit contributions
jl-0.002
itia-0.002
 favorably-0.002
wake-0.002
eln-0.002
Token Start
Token ID7253
Feature activation-6.88e-01

Pos logit contributions
jl+0.267
itia+0.263
 favorably+0.262
wake+0.256
eln+0.252
Neg logit contributions
arters-0.366
 balcon-0.287
 Phi-0.286
SourceFile-0.276
rics-0.275
Token at
Token ID379
Feature activation+2.47e-02

Pos logit contributions
jl+0.120
 favorably+0.118
itia+0.118
wake+0.116
iji+0.113
Neg logit contributions
arters-0.161
 Phi-0.125
 balcon-0.124
rics-0.119
SourceFile-0.119
Token the
Token ID262
Feature activation-1.56e-02

Pos logit contributions
arters+0.005
 balcon+0.004
 Phi+0.004
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.005
itia-0.005
 favorably-0.005
wake-0.004
eln-0.004
Token crack
Token ID8469
Feature activation+1.65e-01

Pos logit contributions
jl+0.003
itia+0.003
 favorably+0.003
wake+0.003
eln+0.003
Neg logit contributions
arters-0.004
 balcon-0.003
 Phi-0.003
SourceFile-0.003
rics-0.003
Token of
Token ID286
Feature activation-2.24e-02

Pos logit contributions
arters+0.041
 balcon+0.032
 Phi+0.032
rics+0.031
SourceFile+0.031
Neg logit contributions
jl-0.026
 favorably-0.026
itia-0.026
wake-0.025
eln-0.025
Token mar
Token ID1667
Feature activation-6.79e-01

Pos logit contributions
jl+0.001
itia+0.001
 favorably+0.001
wake+0.001
eln+0.001
Neg logit contributions
arters-0.001
 balcon-0.001
 Phi-0.001
SourceFile-0.001
rics-0.001
Tokenina
Token ID1437
Feature activation-7.42e-01

Pos logit contributions
jl+0.097
 favorably+0.097
itia+0.094
wake+0.092
eln+0.091
Neg logit contributions
arters-0.178
 Phi-0.141
 balcon-0.140
rics-0.138
SourceFile-0.137
Token in
Token ID287
Feature activation+3.73e-03

Pos logit contributions
jl+0.129
itia+0.127
 favorably+0.127
wake+0.124
eln+0.121
Neg logit contributions
arters-0.173
 balcon-0.135
 Phi-0.135
SourceFile-0.131
rics-0.130
Token Ke
Token ID3873
Feature activation-3.71e-01

Pos logit contributions
arters+0.001
 balcon+0.001
 Phi+0.001
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.001
itia-0.001
 favorably-0.001
wake-0.001
eln-0.001
Tokenans
Token ID504
Feature activation-3.76e-01

Pos logit contributions
jl+0.059
itia+0.058
 favorably+0.057
wake+0.056
eln+0.055
Neg logit contributions
arters-0.092
 balcon-0.074
 Phi-0.073
SourceFile-0.071
rics-0.071
Tokenburg
Token ID7423
Feature activation-1.56e+00

Pos logit contributions
jl+0.030
itia+0.029
 favorably+0.029
wake+0.027
eln+0.026
Neg logit contributions
arters-0.123
 Phi-0.104
 balcon-0.104
rics-0.101
SourceFile-0.101
Token,
Token ID11
Feature activation-5.08e-02

Pos logit contributions
jl+0.277
itia+0.273
 favorably+0.272
wake+0.266
eln+0.262
Neg logit contributions
arters-0.355
 balcon-0.277
 Phi-0.275
SourceFile-0.266
rics-0.265
Token New
Token ID968
Feature activation+6.30e-01

Pos logit contributions
jl+0.011
itia+0.011
 favorably+0.011
wake+0.011
eln+0.010
Neg logit contributions
arters-0.010
 balcon-0.007
 Phi-0.007
SourceFile-0.007
rics-0.007
Token Jersey
Token ID8221
Feature activation+7.85e-02

Pos logit contributions
arters+0.123
 Phi+0.091
 balcon+0.091
rics+0.087
SourceFile+0.086
Neg logit contributions
jl-0.132
 favorably-0.131
itia-0.130
wake-0.128
eln-0.127
Token,
Token ID11
Feature activation-1.01e-02

Pos logit contributions
arters+0.018
 balcon+0.014
 Phi+0.014
rics+0.013
SourceFile+0.013
Neg logit contributions
jl-0.014
itia-0.014
 favorably-0.014
wake-0.013
eln-0.013
Token following
Token ID1708
Feature activation-2.45e-01

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.002
wake+0.002
eln+0.002
Neg logit contributions
arters-0.002
 balcon-0.002
 Phi-0.002
rics-0.002
SourceFile-0.002
Token defended
Token ID13768
Feature activation+6.28e-02

Pos logit contributions
arters+0.100
 balcon+0.080
 Phi+0.079
SourceFile+0.077
rics+0.077
Neg logit contributions
jl-0.067
itia-0.066
 favorably-0.066
wake-0.064
eln-0.063
Token Planned
Token ID18451
Feature activation+2.02e-01

Pos logit contributions
arters+0.013
 balcon+0.010
 Phi+0.010
SourceFile+0.010
rics+0.010
Neg logit contributions
jl-0.012
itia-0.012
 favorably-0.012
wake-0.012
eln-0.011
Token Parenthood
Token ID18461
Feature activation+7.48e-01

Pos logit contributions
arters+0.032
 balcon+0.022
 Phi+0.022
SourceFile+0.020
rics+0.020
Neg logit contributions
jl-0.050
itia-0.050
 favorably-0.050
wake-0.049
eln-0.048
Token in
Token ID287
Feature activation+2.12e-02

Pos logit contributions
arters+0.175
 balcon+0.137
 Phi+0.136
SourceFile+0.132
rics+0.131
Neg logit contributions
jl-0.129
itia-0.127
 favorably-0.127
wake-0.124
eln-0.122
Token the
Token ID262
Feature activation-1.96e-02

Pos logit contributions
arters+0.005
 balcon+0.004
 Phi+0.004
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.004
itia-0.004
 favorably-0.004
wake-0.004
eln-0.004
Token wake
Token ID7765
Feature activation-1.55e+00

Pos logit contributions
jl+0.003
itia+0.003
 favorably+0.003
wake+0.003
eln+0.003
Neg logit contributions
arters-0.005
 balcon-0.004
 Phi-0.004
SourceFile-0.004
rics-0.004
Token of
Token ID286
Feature activation-2.51e-02

Pos logit contributions
jl+0.239
itia+0.235
 favorably+0.234
wake+0.228
eln+0.224
Neg logit contributions
arters-0.392
 balcon-0.313
 Phi-0.312
SourceFile-0.303
rics-0.302
Token recently
Token ID2904
Feature activation-4.14e-01

Pos logit contributions
jl+0.004
itia+0.004
 favorably+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.006
 balcon-0.005
 Phi-0.005
SourceFile-0.004
rics-0.004
Token released
Token ID2716
Feature activation+7.42e-01

Pos logit contributions
 favorably+0.085
jl+0.085
itia+0.085
wake+0.082
eln+0.081
Neg logit contributions
arters-0.084
 balcon-0.062
 Phi-0.061
SourceFile-0.060
rics-0.059
Token videos
Token ID5861
Feature activation-3.40e-02

Pos logit contributions
arters+0.163
 balcon+0.125
 Phi+0.124
SourceFile+0.120
rics+0.119
Neg logit contributions
jl-0.139
itia-0.137
 favorably-0.137
wake-0.134
eln-0.132
Token that
Token ID326
Feature activation+2.66e-02

Pos logit contributions
jl+0.005
itia+0.005
 favorably+0.005
wake+0.005
eln+0.005
Neg logit contributions
arters-0.008
 balcon-0.007
 Phi-0.007
SourceFile-0.006
rics-0.006
Token.
Token ID13
Feature activation-4.76e-02

Pos logit contributions
jl+0.078
itia+0.077
 favorably+0.077
wake+0.074
eln+0.074
Neg logit contributions
arters-0.104
 balcon-0.082
 Phi-0.081
rics-0.078
SourceFile-0.078
Token
Token ID198
Feature activation+6.74e-03

Pos logit contributions
jl+0.008
 favorably+0.008
itia+0.008
wake+0.007
eln+0.007
Neg logit contributions
arters-0.012
 Phi-0.009
 balcon-0.009
rics-0.009
SourceFile-0.009
Token
Token ID198
Feature activation+3.24e-03

Pos logit contributions
arters+0.002
 Phi+0.001
 balcon+0.001
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.001
 favorably-0.001
itia-0.001
wake-0.001
eln-0.001
TokenIn
Token ID818
Feature activation-3.59e-02

Pos logit contributions
arters+0.001
 Phi+0.001
 balcon+0.001
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.001
 favorably-0.001
itia-0.001
wake-0.001
eln-0.001
Token the
Token ID262
Feature activation-1.03e-02

Pos logit contributions
jl+0.006
itia+0.006
 favorably+0.006
wake+0.006
eln+0.006
Neg logit contributions
arters-0.008
 balcon-0.007
 Phi-0.007
rics-0.006
SourceFile-0.006
Token wake
Token ID7765
Feature activation-1.55e+00

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.002
wake+0.002
eln+0.002
Neg logit contributions
arters-0.002
 balcon-0.002
 Phi-0.002
rics-0.002
SourceFile-0.002
Token of
Token ID286
Feature activation-3.96e-02

Pos logit contributions
jl+0.239
itia+0.235
 favorably+0.234
wake+0.228
eln+0.224
Neg logit contributions
arters-0.389
 balcon-0.311
 Phi-0.310
SourceFile-0.300
rics-0.300
Token the
Token ID262
Feature activation-2.42e-02

Pos logit contributions
jl+0.007
itia+0.007
 favorably+0.007
wake+0.007
eln+0.007
Neg logit contributions
arters-0.009
 balcon-0.007
 Phi-0.007
SourceFile-0.007
rics-0.007
Token social
Token ID1919
Feature activation-5.29e-02

Pos logit contributions
jl+0.004
itia+0.004
 favorably+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.005
 balcon-0.004
 Phi-0.004
SourceFile-0.004
rics-0.004
Token media
Token ID2056
Feature activation+1.49e-01

Pos logit contributions
jl+0.011
itia+0.011
 favorably+0.010
wake+0.010
eln+0.010
Neg logit contributions
arters-0.011
 balcon-0.008
 Phi-0.008
rics-0.008
SourceFile-0.008
Token fascination
Token ID35556
Feature activation+2.60e-01

Pos logit contributions
arters+0.032
 balcon+0.024
 Phi+0.024
SourceFile+0.023
rics+0.023
Neg logit contributions
jl-0.029
itia-0.028
 favorably-0.028
wake-0.028
eln-0.027
Token,
Token ID11
Feature activation+4.02e-02

Pos logit contributions
arters+0.179
 balcon+0.141
 Phi+0.141
SourceFile+0.136
rics+0.136
Neg logit contributions
jl-0.122
itia-0.120
 favorably-0.120
wake-0.117
eln-0.115
Token is
Token ID318
Feature activation+2.86e-02

Pos logit contributions
arters+0.010
 balcon+0.008
 Phi+0.007
SourceFile+0.007
rics+0.007
Neg logit contributions
jl-0.007
itia-0.007
 favorably-0.007
wake-0.006
eln-0.006
Token president
Token ID1893
Feature activation+1.43e-01

Pos logit contributions
arters+0.006
 Phi+0.005
 balcon+0.005
SourceFile+0.005
rics+0.005
Neg logit contributions
jl-0.005
itia-0.005
 favorably-0.005
wake-0.005
eln-0.005
Token of
Token ID286
Feature activation+3.10e-02

Pos logit contributions
arters+0.035
 Phi+0.028
 balcon+0.028
rics+0.027
SourceFile+0.027
Neg logit contributions
jl-0.023
itia-0.023
 favorably-0.023
wake-0.022
eln-0.022
Token the
Token ID262
Feature activation+3.02e-02

Pos logit contributions
arters+0.007
 balcon+0.005
 Phi+0.005
SourceFile+0.005
rics+0.005
Neg logit contributions
jl-0.006
itia-0.006
 favorably-0.006
wake-0.006
eln-0.005
Token Toronto
Token ID6586
Feature activation+1.19e+00

Pos logit contributions
arters+0.006
 balcon+0.004
 Phi+0.004
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.006
itia-0.006
 favorably-0.006
wake-0.006
eln-0.006
Token company
Token ID1664
Feature activation-6.00e-01

Pos logit contributions
arters+0.288
 Phi+0.228
 balcon+0.228
SourceFile+0.219
rics+0.219
Neg logit contributions
jl-0.197
itia-0.194
 favorably-0.193
wake-0.189
eln-0.185
Token and
Token ID290
Feature activation+4.16e-02

Pos logit contributions
jl+0.102
itia+0.100
 favorably+0.100
wake+0.098
eln+0.096
Neg logit contributions
arters-0.142
 balcon-0.112
 Phi-0.111
SourceFile-0.108
rics-0.107
Token a
Token ID257
Feature activation+6.80e-02

Pos logit contributions
arters+0.010
 balcon+0.008
 Phi+0.008
SourceFile+0.007
rics+0.007
Neg logit contributions
jl-0.007
itia-0.007
 favorably-0.007
wake-0.007
eln-0.007
Token director
Token ID3437
Feature activation+3.17e-01

Pos logit contributions
arters+0.015
 balcon+0.012
 Phi+0.012
SourceFile+0.011
rics+0.011
Neg logit contributions
jl-0.013
itia-0.012
 favorably-0.012
wake-0.012
eln-0.012
Token of
Token ID286
Feature activation+3.93e-02

Pos logit contributions
arters+0.076
 Phi+0.061
 balcon+0.060
rics+0.058
SourceFile+0.058
Neg logit contributions
jl-0.052
itia-0.051
 favorably-0.051
wake-0.050
eln-0.049
Token that
Token ID326
Feature activation+3.04e-02

Pos logit contributions
arters+0.055
 balcon+0.043
 Phi+0.043
SourceFile+0.041
rics+0.041
Neg logit contributions
jl-0.040
itia-0.039
 favorably-0.038
wake-0.038
eln-0.037
Token.
Token ID13
Feature activation-2.62e-02

Pos logit contributions
arters+0.007
 balcon+0.006
 Phi+0.006
SourceFile+0.006
rics+0.006
Neg logit contributions
jl-0.005
itia-0.005
 favorably-0.005
wake-0.005
eln-0.005
Token
Token ID198
Feature activation+5.28e-03

Pos logit contributions
jl+0.004
 favorably+0.004
itia+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.006
 Phi-0.005
 balcon-0.005
SourceFile-0.005
rics-0.005
Token
Token ID198
Feature activation-2.49e-03

Pos logit contributions
arters+0.001
 balcon+0.001
 Phi+0.001
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.001
itia-0.001
 favorably-0.001
wake-0.001
eln-0.001
TokenMy
Token ID3666
Feature activation-4.51e-02

Pos logit contributions
jl+0.000
 favorably+0.000
itia+0.000
wake+0.000
eln+0.000
Neg logit contributions
arters-0.001
 balcon-0.000
 Phi-0.000
SourceFile-0.000
rics-0.000
Token release
Token ID2650
Feature activation+1.08e+00

Pos logit contributions
jl+0.008
itia+0.008
 favorably+0.008
wake+0.008
eln+0.008
Neg logit contributions
arters-0.010
 balcon-0.008
 Phi-0.008
SourceFile-0.008
rics-0.008
Token is
Token ID318
Feature activation-2.40e-02

Pos logit contributions
arters+0.221
 balcon+0.165
 Phi+0.165
SourceFile+0.159
rics+0.158
Neg logit contributions
jl-0.220
itia-0.217
 favorably-0.216
wake-0.212
eln-0.209
Token totally
Token ID6635
Feature activation-5.21e-01

Pos logit contributions
jl+0.004
 favorably+0.004
itia+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.006
 balcon-0.005
 Phi-0.005
SourceFile-0.004
rics-0.004
Token fine
Token ID3734
Feature activation-6.35e-01

Pos logit contributions
jl+0.089
 favorably+0.088
itia+0.088
wake+0.086
eln+0.084
Neg logit contributions
arters-0.123
 balcon-0.096
 Phi-0.096
SourceFile-0.093
rics-0.093
Token,
Token ID11
Feature activation+2.35e-02

Pos logit contributions
jl+0.107
itia+0.106
 favorably+0.105
wake+0.103
eln+0.102
Neg logit contributions
arters-0.150
 balcon-0.118
 Phi-0.118
SourceFile-0.114
rics-0.114
Token I
Token ID314
Feature activation-3.15e-02

Pos logit contributions
arters+0.006
 balcon+0.005
 Phi+0.005
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.004
itia-0.004
 favorably-0.004
wake-0.004
eln-0.004
Token was
Token ID373
Feature activation+8.34e-02

Pos logit contributions
jl+0.008
itia+0.007
 favorably+0.007
wake+0.007
eln+0.007
Neg logit contributions
arters-0.011
 balcon-0.009
 Phi-0.008
SourceFile-0.008
rics-0.008
Token to
Token ID284
Feature activation+4.11e-02

Pos logit contributions
arters+0.020
 balcon+0.015
 Phi+0.015
SourceFile+0.015
rics+0.015
Neg logit contributions
jl-0.014
 favorably-0.014
itia-0.014
wake-0.014
eln-0.013
Token move
Token ID1445
Feature activation-8.47e-02

Pos logit contributions
arters+0.010
 balcon+0.007
 Phi+0.007
SourceFile+0.007
rics+0.007
Neg logit contributions
jl-0.007
itia-0.007
 favorably-0.007
wake-0.007
eln-0.007
Token to
Token ID284
Feature activation+4.40e-02

Pos logit contributions
jl+0.014
itia+0.014
 favorably+0.014
wake+0.013
eln+0.013
Neg logit contributions
arters-0.020
 balcon-0.016
 Phi-0.016
SourceFile-0.016
rics-0.016
Token a
Token ID257
Feature activation+1.40e-02

Pos logit contributions
arters+0.007
 balcon+0.005
 Phi+0.005
SourceFile+0.005
rics+0.005
Neg logit contributions
jl-0.011
itia-0.011
 favorably-0.011
wake-0.010
eln-0.010
Token provincial
Token ID15818
Feature activation+1.04e+00

Pos logit contributions
arters+0.002
 balcon+0.002
 Phi+0.002
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.003
itia-0.003
 favorably-0.003
wake-0.003
eln-0.003
Token jail
Token ID7356
Feature activation-1.69e-01

Pos logit contributions
arters+0.238
 balcon+0.185
 Phi+0.185
SourceFile+0.178
rics+0.177
Neg logit contributions
jl-0.183
itia-0.181
 favorably-0.180
wake-0.176
eln-0.173
Token,
Token ID11
Feature activation-1.01e-02

Pos logit contributions
jl+0.029
itia+0.028
 favorably+0.028
wake+0.028
eln+0.027
Neg logit contributions
arters-0.040
 balcon-0.031
 Phi-0.031
SourceFile-0.030
rics-0.030
Token
Token ID564
Feature activation-2.79e-02

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.002
wake+0.002
eln+0.001
Neg logit contributions
arters-0.003
 balcon-0.002
 Phi-0.002
SourceFile-0.002
rics-0.002
Token
Token ID250
Feature activation-1.50e-01

Pos logit contributions
jl+0.005
itia+0.005
 favorably+0.005
wake+0.005
eln+0.005
Neg logit contributions
arters-0.006
 balcon-0.005
 Phi-0.005
SourceFile-0.005
rics-0.005
Tokenhe
Token ID258
Feature activation+7.19e-01

Pos logit contributions
jl+0.031
itia+0.031
 favorably+0.031
wake+0.030
eln+0.030
Neg logit contributions
arters-0.030
 balcon-0.022
 Phi-0.022
SourceFile-0.021
rics-0.021
Token public
Token ID1171
Feature activation+8.49e-01

Pos logit contributions
jl+0.021
 favorably+0.020
itia+0.020
wake+0.020
eln+0.020
Neg logit contributions
arters-0.025
 balcon-0.019
 Phi-0.019
SourceFile-0.019
rics-0.019
Token appearances
Token ID11057
Feature activation+6.96e-01

Pos logit contributions
arters+0.163
 balcon+0.120
 Phi+0.119
SourceFile+0.115
rics+0.114
Neg logit contributions
jl-0.182
itia-0.180
 favorably-0.180
wake-0.176
eln-0.174
Token.
Token ID13
Feature activation+7.04e-03

Pos logit contributions
arters+0.163
 balcon+0.128
 Phi+0.128
rics+0.123
SourceFile+0.122
Neg logit contributions
jl-0.120
itia-0.118
 favorably-0.116
wake-0.115
eln-0.113
Token After
Token ID2293
Feature activation-2.59e-01

Pos logit contributions
arters+0.002
 balcon+0.001
 Phi+0.001
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.001
itia-0.001
 favorably-0.001
wake-0.001
eln-0.001
Token his
Token ID465
Feature activation+1.37e-01

Pos logit contributions
jl+0.045
 favorably+0.044
itia+0.044
wake+0.043
eln+0.042
Neg logit contributions
arters-0.060
 balcon-0.047
 Phi-0.047
SourceFile-0.046
rics-0.045
Token release
Token ID2650
Feature activation+1.09e+00

Pos logit contributions
arters+0.028
 balcon+0.021
 Phi+0.021
SourceFile+0.021
rics+0.020
Neg logit contributions
jl-0.028
 favorably-0.027
itia-0.027
wake-0.027
eln-0.026
Token,
Token ID11
Feature activation+5.37e-02

Pos logit contributions
arters+0.260
 balcon+0.205
 Phi+0.204
SourceFile+0.197
rics+0.196
Neg logit contributions
jl-0.185
itia-0.182
 favorably-0.182
wake-0.178
eln-0.174
Token Ken
Token ID7148
Feature activation+3.32e-01

Pos logit contributions
arters+0.013
 balcon+0.010
 Phi+0.010
SourceFile+0.010
rics+0.009
Neg logit contributions
jl-0.009
itia-0.009
 favorably-0.009
wake-0.009
eln-0.009
Tokeny
Token ID88
Feature activation+7.71e-02

Pos logit contributions
arters+0.078
 balcon+0.062
 Phi+0.061
SourceFile+0.059
rics+0.059
Neg logit contributions
jl-0.057
itia-0.056
 favorably-0.055
wake-0.054
eln-0.053
Tokenatta
Token ID25014
Feature activation+1.39e-01

Pos logit contributions
arters+0.018
 balcon+0.014
 Phi+0.014
SourceFile+0.014
rics+0.014
Neg logit contributions
jl-0.013
itia-0.013
 favorably-0.013
wake-0.012
eln-0.012
Token set
Token ID900
Feature activation-1.95e-01

Pos logit contributions
arters+0.033
 balcon+0.026
 Phi+0.026
rics+0.025
SourceFile+0.025
Neg logit contributions
jl-0.024
itia-0.023
 favorably-0.023
wake-0.023
eln-0.022
Token.
Token ID13
Feature activation-2.51e-02

Pos logit contributions
arters+0.158
 balcon+0.124
 Phi+0.124
SourceFile+0.120
rics+0.120
Neg logit contributions
jl-0.111
itia-0.109
 favorably-0.109
wake-0.106
eln-0.105
Token Dist
Token ID4307
Feature activation-2.63e-01

Pos logit contributions
jl+0.004
 favorably+0.004
itia+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.006
 Phi-0.005
 balcon-0.005
SourceFile-0.005
rics-0.005
Tokenant
Token ID415
Feature activation-3.36e-01

Pos logit contributions
jl+0.047
itia+0.046
 favorably+0.046
wake+0.045
eln+0.044
Neg logit contributions
arters-0.060
 balcon-0.047
 Phi-0.047
SourceFile-0.045
rics-0.045
Token outer
Token ID12076
Feature activation+1.82e-01

Pos logit contributions
jl+0.059
itia+0.058
 favorably+0.058
wake+0.056
eln+0.055
Neg logit contributions
arters-0.078
 balcon-0.061
 Phi-0.061
SourceFile-0.059
rics-0.058
Token at
Token ID379
Feature activation+8.14e-02

Pos logit contributions
arters+0.045
 Phi+0.035
 balcon+0.035
SourceFile+0.035
rics+0.034
Neg logit contributions
jl-0.029
itia-0.029
 favorably-0.029
wake-0.028
eln-0.027
Tokenolls
Token ID33421
Feature activation+1.08e+00

Pos logit contributions
arters+0.008
 balcon+0.004
 Phi+0.004
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.025
itia-0.025
 favorably-0.024
wake-0.024
eln-0.024
Token are
Token ID389
Feature activation+4.51e-02

Pos logit contributions
arters+0.244
 balcon+0.190
 Phi+0.189
SourceFile+0.183
rics+0.182
Neg logit contributions
jl-0.193
itia-0.190
 favorably-0.189
wake-0.186
eln-0.183
Token even
Token ID772
Feature activation+5.56e-01

Pos logit contributions
arters+0.011
 balcon+0.009
 Phi+0.009
SourceFile+0.008
rics+0.008
Neg logit contributions
jl-0.007
itia-0.007
 favorably-0.007
wake-0.007
eln-0.007
Token more
Token ID517
Feature activation-2.89e-02

Pos logit contributions
arters+0.131
 balcon+0.103
 Phi+0.102
SourceFile+0.099
rics+0.098
Neg logit contributions
jl-0.096
itia-0.094
 favorably-0.093
wake-0.092
eln-0.090
Token at
Token ID379
Feature activation+5.16e-02

Pos logit contributions
jl+0.005
itia+0.005
 favorably+0.005
wake+0.005
eln+0.005
Neg logit contributions
arters-0.007
 balcon-0.005
 Phi-0.005
SourceFile-0.005
rics-0.005
Token risk
Token ID2526
Feature activation+1.90e-01

Pos logit contributions
arters+0.011
 balcon+0.009
 Phi+0.009
SourceFile+0.008
rics+0.008
Neg logit contributions
jl-0.010
itia-0.009
 favorably-0.009
wake-0.009
eln-0.009
Token,
Token ID11
Feature activation+4.28e-02

Pos logit contributions
arters+0.041
 balcon+0.032
 Phi+0.032
SourceFile+0.031
rics+0.031
Neg logit contributions
jl-0.030
itia-0.030
 favorably-0.030
wake-0.029
eln-0.029
Token using
Token ID1262
Feature activation-4.88e-01

Pos logit contributions
arters+0.010
 balcon+0.008
 Phi+0.008
SourceFile+0.008
rics+0.008
Neg logit contributions
jl-0.007
itia-0.007
 favorably-0.007
wake-0.007
eln-0.007
Token steel
Token ID7771
Feature activation+5.40e-03

Pos logit contributions
jl+0.088
itia+0.087
 favorably+0.087
wake+0.085
eln+0.084
Neg logit contributions
arters-0.110
 balcon-0.085
 Phi-0.085
SourceFile-0.082
rics-0.082
Token cable
Token ID7862
Feature activation-2.77e-01

Pos logit contributions
arters+0.001
 balcon+0.001
 Phi+0.001
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.001
itia-0.001
 favorably-0.001
wake-0.001
eln-0.001
Token traps
Token ID20348
Feature activation+2.60e-01

Pos logit contributions
jl+0.048
itia+0.047
 favorably+0.047
wake+0.046
eln+0.045
Neg logit contributions
arters-0.065
 balcon-0.051
 Phi-0.051
SourceFile-0.049
rics-0.049
Token and
Token ID290
Feature activation+2.16e-02

Pos logit contributions
arters+0.061
 balcon+0.048
 Phi+0.048
SourceFile+0.046
rics+0.046
Neg logit contributions
jl-0.044
itia-0.044
 favorably-0.043
wake-0.042
eln-0.042
Token an
Token ID281
Feature activation+7.88e-02

Pos logit contributions
arters+0.005
 balcon+0.004
 Phi+0.004
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.004
itia-0.004
 favorably-0.004
wake-0.004
eln-0.004
Token animal
Token ID5044
Feature activation+1.69e-02

Pos logit contributions
arters+0.018
 balcon+0.014
 Phi+0.014
SourceFile+0.013
rics+0.013
Neg logit contributions
jl-0.014
 favorably-0.014
itia-0.014
wake-0.014
eln-0.014
Token carc
Token ID36756
Feature activation-2.73e-01

Pos logit contributions
arters+0.004
 balcon+0.003
 Phi+0.003
SourceFile+0.003
rics+0.003
Neg logit contributions
jl-0.003
itia-0.003
 favorably-0.003
wake-0.003
eln-0.003
Tokenass
Token ID562
Feature activation-3.23e-02

Pos logit contributions
jl+0.044
itia+0.043
 favorably+0.043
wake+0.042
eln+0.041
Neg logit contributions
arters-0.067
 balcon-0.053
 Phi-0.053
SourceFile-0.052
rics-0.051
Token as
Token ID355
Feature activation-2.45e-02

Pos logit contributions
jl+0.006
itia+0.006
 favorably+0.006
wake+0.005
eln+0.005
Neg logit contributions
arters-0.007
 balcon-0.006
 Phi-0.006
rics-0.006
SourceFile-0.006
Token me
Token ID502
Feature activation+7.26e-02

Pos logit contributions
arters+0.094
 balcon+0.073
 Phi+0.072
SourceFile+0.070
rics+0.070
Neg logit contributions
jl-0.074
itia-0.073
 favorably-0.073
wake-0.071
eln-0.070
Token,
Token ID11
Feature activation+3.32e-02

Pos logit contributions
arters+0.017
 balcon+0.013
 Phi+0.013
SourceFile+0.013
rics+0.013
Neg logit contributions
jl-0.013
itia-0.012
 favorably-0.012
wake-0.012
eln-0.012
Token but
Token ID475
Feature activation+1.73e-02

Pos logit contributions
arters+0.008
 balcon+0.006
 Phi+0.006
SourceFile+0.006
rics+0.006
Neg logit contributions
jl-0.006
itia-0.006
 favorably-0.006
wake-0.005
eln-0.005
Token I
Token ID314
Feature activation-8.59e-03

Pos logit contributions
arters+0.004
 balcon+0.003
 Phi+0.003
SourceFile+0.003
rics+0.003
Neg logit contributions
jl-0.003
itia-0.003
 favorably-0.003
wake-0.003
eln-0.003
Token was
Token ID373
Feature activation+9.05e-02

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.001
wake+0.001
eln+0.001
Neg logit contributions
arters-0.002
 balcon-0.002
 Phi-0.002
SourceFile-0.001
rics-0.001
Token never
Token ID1239
Feature activation+3.49e-01

Pos logit contributions
arters+0.020
 balcon+0.016
 Phi+0.016
SourceFile+0.015
rics+0.015
Neg logit contributions
jl-0.016
itia-0.016
 favorably-0.016
wake-0.016
eln-0.015
Token really
Token ID1107
Feature activation+4.55e-01

Pos logit contributions
arters+0.080
 balcon+0.062
 Phi+0.061
SourceFile+0.060
rics+0.060
Neg logit contributions
jl-0.062
 favorably-0.062
itia-0.061
wake-0.060
eln-0.059
Token sure
Token ID1654
Feature activation-3.86e-01

Pos logit contributions
arters+0.111
 balcon+0.087
 Phi+0.087
SourceFile+0.085
rics+0.084
Neg logit contributions
jl-0.074
 favorably-0.073
itia-0.073
wake-0.071
eln-0.070
Token why
Token ID1521
Feature activation+9.08e-02

Pos logit contributions
jl+0.068
itia+0.067
 favorably+0.067
wake+0.065
eln+0.064
Neg logit contributions
arters-0.089
 balcon-0.069
 Phi-0.069
SourceFile-0.066
rics-0.066
Token.
Token ID13
Feature activation-3.17e-04

Pos logit contributions
arters+0.022
 Phi+0.017
 balcon+0.017
rics+0.016
SourceFile+0.016
Neg logit contributions
jl-0.015
itia-0.015
 favorably-0.015
wake-0.015
eln-0.014
Token
Token ID198
Feature activation+3.55e-03

Pos logit contributions
jl+0.000
 favorably+0.000
itia+0.000
wake+0.000
eln+0.000
Neg logit contributions
arters-0.000
 Phi-0.000
 balcon-0.000
SourceFile-0.000
rics-0.000
Token
Token ID564
Feature activation-9.01e-02

Pos logit contributions
jl+0.005
itia+0.005
 favorably+0.005
wake+0.004
eln+0.004
Neg logit contributions
arters-0.009
 balcon-0.007
 Phi-0.007
SourceFile-0.007
rics-0.007
Token
Token ID250
Feature activation-1.12e-01

Pos logit contributions
jl+0.016
itia+0.016
 favorably+0.016
wake+0.016
eln+0.016
Neg logit contributions
arters-0.020
 Phi-0.016
 balcon-0.016
SourceFile-0.015
rics-0.015
Tokencons
Token ID5936
Feature activation+3.70e-01

Pos logit contributions
jl+0.020
itia+0.020
 favorably+0.020
wake+0.019
eln+0.019
Neg logit contributions
arters-0.026
 balcon-0.020
 Phi-0.020
SourceFile-0.019
rics-0.019
Tokencience
Token ID4234
Feature activation-4.78e-02

Pos logit contributions
arters+0.091
 balcon+0.072
 Phi+0.072
rics+0.069
SourceFile+0.069
Neg logit contributions
jl-0.060
itia-0.058
 favorably-0.058
wake-0.057
eln-0.056
Token clause
Token ID13444
Feature activation+1.44e-01

Pos logit contributions
jl+0.008
itia+0.008
 favorably+0.008
wake+0.008
eln+0.008
Neg logit contributions
arters-0.011
 balcon-0.009
 Phi-0.009
rics-0.008
Toronto-0.008
Token,
Token ID11
Feature activation-2.57e-02

Pos logit contributions
arters+0.035
 balcon+0.027
 Phi+0.027
SourceFile+0.026
rics+0.026
Neg logit contributions
jl-0.024
itia-0.023
 favorably-0.023
wake-0.023
eln-0.022
Token
Token ID447
Feature activation-8.01e-03

Pos logit contributions
jl+0.004
itia+0.004
 favorably+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.006
 balcon-0.005
 Phi-0.005
SourceFile-0.005
rics-0.005
Token
Token ID251
Feature activation+7.21e-02

Pos logit contributions
jl+0.001
itia+0.001
 favorably+0.001
wake+0.001
eln+0.001
Neg logit contributions
arters-0.002
 balcon-0.001
 Phi-0.001
SourceFile-0.001
rics-0.001
Token which
Token ID543
Feature activation-9.92e-02

Pos logit contributions
arters+0.017
 balcon+0.014
 Phi+0.013
SourceFile+0.013
rics+0.013
Neg logit contributions
jl-0.012
itia-0.012
 favorably-0.012
wake-0.012
eln-0.011
Token became
Token ID2627
Feature activation+9.25e-02

Pos logit contributions
jl+0.017
itia+0.017
 favorably+0.017
wake+0.016
eln+0.016
Neg logit contributions
arters-0.023
 balcon-0.018
 Phi-0.018
SourceFile-0.018
rics-0.018
Token law
Token ID1099
Feature activation+7.32e-01

Pos logit contributions
arters+0.020
 balcon+0.015
 Phi+0.015
SourceFile+0.014
rics+0.014
Neg logit contributions
jl-0.018
itia-0.018
 favorably-0.018
wake-0.017
eln-0.017
Tokentered
Token ID4400
Feature activation+2.85e-01

Pos logit contributions
jl+0.056
itia+0.055
 favorably+0.055
wake+0.054
eln+0.053
Neg logit contributions
arters-0.071
 balcon-0.056
 Phi-0.055
SourceFile-0.053
rics-0.053
Token by
Token ID416
Feature activation+6.60e-02

Pos logit contributions
arters+0.068
 balcon+0.053
 Phi+0.053
SourceFile+0.052
rics+0.051
Neg logit contributions
jl-0.048
itia-0.047
 favorably-0.047
wake-0.046
eln-0.045
Token the
Token ID262
Feature activation+1.73e-02

Pos logit contributions
arters+0.015
 balcon+0.012
 Phi+0.012
SourceFile+0.011
rics+0.011
Neg logit contributions
jl-0.012
itia-0.011
 favorably-0.011
wake-0.011
eln-0.011
Token later
Token ID1568
Feature activation-7.41e-01

Pos logit contributions
arters+0.004
 balcon+0.003
 Phi+0.003
SourceFile+0.003
rics+0.003
Neg logit contributions
jl-0.003
itia-0.003
 favorably-0.003
wake-0.003
eln-0.003
Token Chron
Token ID8750
Feature activation-3.56e-01

Pos logit contributions
jl+0.135
itia+0.133
 favorably+0.133
wake+0.130
eln+0.128
Neg logit contributions
arters-0.166
 balcon-0.128
 Phi-0.128
SourceFile-0.123
rics-0.123
Tokeno
Token ID78
Feature activation+6.88e-01

Pos logit contributions
jl+0.082
 favorably+0.081
itia+0.080
wake+0.079
eln+0.078
Neg logit contributions
arters-0.063
 balcon-0.045
 Phi-0.045
rics-0.042
SourceFile-0.042
Token Trigger
Token ID24593
Feature activation-6.13e-01

Pos logit contributions
arters+0.187
 Phi+0.156
 balcon+0.153
rics+0.150
SourceFile+0.148
Neg logit contributions
jl-0.091
 favorably-0.087
itia-0.087
wake-0.085
quartered-0.082
Token and
Token ID290
Feature activation+1.09e-02

Pos logit contributions
jl+0.108
itia+0.106
 favorably+0.106
wake+0.103
eln+0.102
Neg logit contributions
arters-0.141
 balcon-0.110
 Phi-0.110
SourceFile-0.106
rics-0.106
Token Earth
Token ID3668
Feature activation+2.37e-01

Pos logit contributions
arters+0.003
 balcon+0.002
 Phi+0.002
SourceFile+0.002
rics+0.002
Neg logit contributions
jl-0.002
itia-0.002
 favorably-0.002
wake-0.002
eln-0.002
Tokenbound
Token ID7784
Feature activation+1.95e-02

Pos logit contributions
arters+0.052
 balcon+0.039
 Phi+0.039
rics+0.038
SourceFile+0.038
Neg logit contributions
jl-0.045
itia-0.044
 favorably-0.044
wake-0.043
iji-0.042
Token,
Token ID11
Feature activation+1.48e-02

Pos logit contributions
arters+0.004
 balcon+0.003
 Phi+0.003
SourceFile+0.003
rics+0.003
Neg logit contributions
jl-0.003
itia-0.003
 favorably-0.003
wake-0.003
eln-0.003
Token enough
Token ID1576
Feature activation+6.66e-01

Pos logit contributions
jl+0.001
itia+0.001
 favorably+0.001
wake+0.001
eln+0.001
Neg logit contributions
arters-0.002
 balcon-0.001
 Phi-0.001
rics-0.001
SourceFile-0.001
Token to
Token ID284
Feature activation+1.98e-03

Pos logit contributions
arters+0.156
 balcon+0.122
 Phi+0.121
SourceFile+0.117
rics+0.117
Neg logit contributions
jl-0.115
itia-0.113
 favorably-0.113
wake-0.111
eln-0.109
Token accommodate
Token ID15550
Feature activation-3.39e-01

Pos logit contributions
arters+0.000
 balcon+0.000
 Phi+0.000
SourceFile+0.000
rics+0.000
Neg logit contributions
jl-0.000
itia-0.000
 favorably-0.000
wake-0.000
eln-0.000
Token the
Token ID262
Feature activation+9.41e-03

Pos logit contributions
jl+0.061
itia+0.061
 favorably+0.060
wake+0.059
eln+0.058
Neg logit contributions
arters-0.076
 balcon-0.059
 Phi-0.059
SourceFile-0.057
rics-0.057
Token first
Token ID717
Feature activation+1.12e-01

Pos logit contributions
arters+0.002
 balcon+0.002
 Phi+0.002
SourceFile+0.002
rics+0.002
Neg logit contributions
jl-0.002
itia-0.002
 favorably-0.002
wake-0.002
eln-0.002
Token Falling
Token ID42914
Feature activation+3.60e-01

Pos logit contributions
arters+0.026
 balcon+0.020
 Phi+0.020
SourceFile+0.019
rics+0.019
Neg logit contributions
jl-0.020
itia-0.020
 favorably-0.020
wake-0.019
eln-0.019
Tokenwater
Token ID7050
Feature activation-6.19e-01

Pos logit contributions
arters+0.108
 balcon+0.089
 Phi+0.089
SourceFile+0.087
rics+0.087
Neg logit contributions
jl-0.039
itia-0.038
 favorably-0.038
wake-0.036
eln-0.035
Token tour
Token ID4205
Feature activation+3.29e-01

Pos logit contributions
jl+0.119
itia+0.117
 favorably+0.117
wake+0.115
eln+0.113
Neg logit contributions
arters-0.133
 balcon-0.101
 Phi-0.101
SourceFile-0.098
rics-0.097
Token,
Token ID11
Feature activation+2.37e-02

Pos logit contributions
arters+0.078
 balcon+0.061
 Phi+0.061
rics+0.058
SourceFile+0.058
Neg logit contributions
jl-0.056
itia-0.056
 favorably-0.055
wake-0.054
eln-0.053
Token is
Token ID318
Feature activation-1.65e-02

Pos logit contributions
arters+0.005
 balcon+0.004
 Phi+0.004
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.004
itia-0.004
 favorably-0.004
wake-0.004
eln-0.004
Token superb
Token ID21840
Feature activation-4.87e-01

Pos logit contributions
jl+0.003
 favorably+0.003
itia+0.003
wake+0.003
eln+0.003
Neg logit contributions
arters-0.004
 balcon-0.003
 Phi-0.003
SourceFile-0.003
rics-0.003
Token pine
Token ID20161
Feature activation-4.08e-01

Pos logit contributions
jl+0.017
itia+0.016
 favorably+0.016
wake+0.016
eln+0.016
Neg logit contributions
arters-0.022
 balcon-0.018
 Phi-0.018
SourceFile-0.017
rics-0.017
Token and
Token ID290
Feature activation+5.76e-03

Pos logit contributions
jl+0.069
itia+0.068
 favorably+0.068
wake+0.067
iji+0.065
Neg logit contributions
arters-0.096
 balcon-0.076
 Phi-0.075
rics-0.073
SourceFile-0.073
Token grape
Token ID30777
Feature activation-3.40e-01

Pos logit contributions
arters+0.001
 balcon+0.001
 Phi+0.001
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.001
itia-0.001
 favorably-0.001
wake-0.001
eln-0.001
Tokenfruit
Token ID34711
Feature activation+1.75e-01

Pos logit contributions
jl+0.051
itia+0.050
 favorably+0.050
wake+0.049
iji+0.048
Neg logit contributions
arters-0.087
 balcon-0.070
 Phi-0.069
rics-0.067
SourceFile-0.067
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
arters+0.040
 balcon+0.032
 Phi+0.032
rics+0.031
SourceFile+0.030
Neg logit contributions
jl-0.030
itia-0.030
 favorably-0.030
wake-0.029
eln-0.029
Token under
Token ID739
Feature activation-1.61e-01

Pos logit contributions
jl+0.004
itia+0.004
 favorably+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.006
 balcon-0.005
 Phi-0.005
SourceFile-0.004
rics-0.004
Tokeng
Token ID70
Feature activation+5.78e-01

Pos logit contributions
jl+0.028
itia+0.028
 favorably+0.028
wake+0.027
eln+0.027
Neg logit contributions
arters-0.037
 balcon-0.029
 Phi-0.029
SourceFile-0.028
rics-0.028
Tokenird
Token ID1447
Feature activation+3.18e-01

Pos logit contributions
arters+0.124
 balcon+0.094
 Phi+0.093
rics+0.090
SourceFile+0.090
Neg logit contributions
jl-0.112
itia-0.110
 favorably-0.110
wake-0.108
eln-0.106
Tokened
Token ID276
Feature activation-6.01e-01

Pos logit contributions
arters+0.099
 balcon+0.084
 Phi+0.083
rics+0.082
SourceFile+0.081
Neg logit contributions
jl-0.030
itia-0.029
 favorably-0.028
wake-0.028
iji-0.027
Token by
Token ID416
Feature activation+3.76e-02

Pos logit contributions
jl+0.091
itia+0.090
 favorably+0.089
wake+0.087
eln+0.085
Neg logit contributions
arters-0.153
 balcon-0.123
 Phi-0.122
SourceFile-0.119
rics-0.119
Token che
Token ID1125
Feature activation+5.53e-01

Pos logit contributions
arters+0.009
 balcon+0.007
 Phi+0.007
SourceFile+0.007
rics+0.007
Neg logit contributions
jl-0.007
itia-0.006
 favorably-0.006
wake-0.006
eln-0.006
Tokened
Token ID276
Feature activation-6.41e-01

Pos logit contributions
arters+0.290
 balcon+0.244
 Phi+0.243
rics+0.238
SourceFile+0.237
Neg logit contributions
jl-0.077
itia-0.074
 favorably-0.074
wake-0.071
eln-0.067
Token.
Token ID13
Feature activation-8.18e-02

Pos logit contributions
jl+0.109
itia+0.108
 favorably+0.107
wake+0.105
eln+0.103
Neg logit contributions
arters-0.151
 balcon-0.119
 Phi-0.119
rics-0.114
SourceFile-0.114
Token
Token ID198
Feature activation-1.51e-02

Pos logit contributions
jl+0.014
 favorably+0.013
itia+0.013
wake+0.013
eln+0.013
Neg logit contributions
arters-0.020
 balcon-0.016
 Phi-0.015
rics-0.015
SourceFile-0.015
Token\n
Token ID198
Feature activation-3.13e-02

Pos logit contributions
jl+0.003
itia+0.002
 favorably+0.002
wake+0.002
eln+0.002
Neg logit contributions
arters-0.004
 Phi-0.003
 balcon-0.003
SourceFile-0.003
rics-0.003
TokenThe
Token ID464
Feature activation+1.14e-01

Pos logit contributions
jl+0.005
itia+0.005
 favorably+0.005
wake+0.005
eln+0.005
Neg logit contributions
arters-0.007
 balcon-0.006
 Phi-0.006
SourceFile-0.005
rics-0.005
Token issue
Token ID2071
Feature activation-4.92e-01

Pos logit contributions
arters+0.028
 balcon+0.022
 Phi+0.022
SourceFile+0.021
rics+0.021
Neg logit contributions
jl-0.019
itia-0.019
 favorably-0.019
wake-0.018
eln-0.018
Token of
Token ID286
Feature activation-5.22e-02

Pos logit contributions
jl+0.080
itia+0.079
 favorably+0.078
wake+0.077
eln+0.075
Neg logit contributions
arters-0.120
 balcon-0.095
 Phi-0.095
SourceFile-0.092
rics-0.091
Token a
Token ID257
Feature activation-2.59e-03

Pos logit contributions
jl+0.009
itia+0.008
 favorably+0.008
wake+0.008
eln+0.008
Neg logit contributions
arters-0.013
 balcon-0.010
 Phi-0.010
SourceFile-0.010
rics-0.010
Token refund
Token ID12929
Feature activation-2.34e-01

Pos logit contributions
jl+0.000
itia+0.000
 favorably+0.000
wake+0.000
eln+0.000
Neg logit contributions
arters-0.001
 balcon-0.001
 Phi-0.001
SourceFile-0.001
rics-0.001
Token is
Token ID318
Feature activation-1.26e-02

Pos logit contributions
jl+0.040
itia+0.039
 favorably+0.039
wake+0.038
eln+0.037
Neg logit contributions
arters-0.055
 balcon-0.044
 Phi-0.044
rics-0.042
SourceFile-0.042
Token even
Token ID772
Feature activation+5.50e-01

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.002
wake+0.002
eln+0.002
Neg logit contributions
arters-0.003
 balcon-0.002
 Phi-0.002
SourceFile-0.002
rics-0.002
Token of
Token ID286
Feature activation+7.03e-03

Pos logit contributions
jl+0.036
itia+0.035
 favorably+0.035
wake+0.034
eln+0.033
Neg logit contributions
arters-0.058
 balcon-0.046
 Phi-0.046
SourceFile-0.044
rics-0.044
Token respondents
Token ID14502
Feature activation+1.09e-01

Pos logit contributions
arters+0.002
 balcon+0.001
 Phi+0.001
SourceFile+0.001
rics+0.001
Neg logit contributions
jl-0.001
 favorably-0.001
itia-0.001
wake-0.001
eln-0.001
Token to
Token ID284
Feature activation+1.58e-02

Pos logit contributions
arters+0.025
 balcon+0.019
 Phi+0.019
rics+0.018
SourceFile+0.018
Neg logit contributions
jl-0.020
itia-0.019
 favorably-0.019
wake-0.019
eln-0.019
Token the
Token ID262
Feature activation+1.10e-02

Pos logit contributions
arters+0.004
 balcon+0.003
 Phi+0.003
SourceFile+0.003
rics+0.003
Neg logit contributions
jl-0.003
itia-0.003
 favorably-0.003
wake-0.003
eln-0.003
Token Lok
Token ID36861
Feature activation+8.30e-01

Pos logit contributions
arters+0.003
 balcon+0.002
 Phi+0.002
SourceFile+0.002
rics+0.002
Neg logit contributions
jl-0.002
itia-0.002
 favorably-0.002
wake-0.002
eln-0.002
Token Survey
Token ID13084
Feature activation-5.14e-01

Pos logit contributions
arters+0.105
 balcon+0.063
 Phi+0.063
SourceFile+0.058
rics+0.057
Neg logit contributions
jl-0.232
 favorably-0.230
itia-0.229
wake-0.226
eln-0.225
Token,
Token ID11
Feature activation+2.98e-02

Pos logit contributions
jl+0.090
itia+0.089
 favorably+0.088
wake+0.087
eln+0.085
Neg logit contributions
arters-0.119
 balcon-0.093
 Phi-0.092
rics-0.089
SourceFile-0.089
Token with
Token ID351
Feature activation-6.62e-02

Pos logit contributions
arters+0.007
 balcon+0.005
 Phi+0.005
SourceFile+0.005
rics+0.005
Neg logit contributions
jl-0.005
itia-0.005
 favorably-0.005
wake-0.005
eln-0.005
Token a
Token ID257
Feature activation+6.05e-02

Pos logit contributions
jl+0.012
itia+0.012
 favorably+0.012
wake+0.012
eln+0.012
Neg logit contributions
arters-0.015
 balcon-0.011
 Phi-0.011
SourceFile-0.011
rics-0.011
Token sample
Token ID6291
Feature activation+1.13e-01

Pos logit contributions
arters+0.014
 Phi+0.011
 balcon+0.011
SourceFile+0.010
rics+0.010
Neg logit contributions
jl-0.011
itia-0.011
 favorably-0.011
wake-0.010
eln-0.010
Token size
Token ID2546
Feature activation+3.60e-01

Pos logit contributions
arters+0.024
 Phi+0.018
 balcon+0.018
rics+0.017
Toronto+0.017
Neg logit contributions
jl-0.022
itia-0.022
 favorably-0.022
wake-0.021
eln-0.021
Token The
Token ID383
Feature activation+6.84e-02

Pos logit contributions
jl+0.037
itia+0.036
 favorably+0.036
wake+0.035
eln+0.035
Neg logit contributions
arters-0.049
 balcon-0.039
 Phi-0.039
SourceFile-0.037
rics-0.037
Token Secret
Token ID3943
Feature activation-8.01e-01

Pos logit contributions
arters+0.016
 balcon+0.013
 Phi+0.013
SourceFile+0.012
rics+0.012
Neg logit contributions
jl-0.011
itia-0.011
 favorably-0.011
wake-0.011
eln-0.011
Token Life
Token ID5155
Feature activation-2.76e-02

Pos logit contributions
jl+0.126
itia+0.124
 favorably+0.123
wake+0.120
eln+0.118
Neg logit contributions
arters-0.200
 balcon-0.159
 Phi-0.158
SourceFile-0.154
rics-0.153
Token of
Token ID286
Feature activation+2.54e-02

Pos logit contributions
jl+0.004
itia+0.004
 favorably+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.007
 balcon-0.006
 Phi-0.006
SourceFile-0.005
rics-0.005
Token Walter
Token ID15819
Feature activation+1.05e-01

Pos logit contributions
arters+0.007
 balcon+0.005
 Phi+0.005
SourceFile+0.005
rics+0.005
Neg logit contributions
jl-0.004
itia-0.004
 favorably-0.003
wake-0.003
eln-0.003
Token M
Token ID337
Feature activation-1.27e-01

Pos logit contributions
arters+0.025
 balcon+0.020
 Phi+0.020
SourceFile+0.019
rics+0.019
Neg logit contributions
jl-0.017
itia-0.017
 favorably-0.017
wake-0.017
eln-0.016
Tokenitty
Token ID9760
Feature activation-5.49e-01

Pos logit contributions
jl+0.017
itia+0.016
 favorably+0.016
wake+0.016
eln+0.015
Neg logit contributions
arters-0.035
 Phi-0.029
 balcon-0.028
SourceFile-0.028
rics-0.028
Token,
Token ID11
Feature activation+5.19e-02

Pos logit contributions
jl+0.097
itia+0.095
 favorably+0.095
wake+0.093
eln+0.092
Neg logit contributions
arters-0.126
 balcon-0.098
 Phi-0.098
rics-0.095
SourceFile-0.095
Token photo
Token ID4590
Feature activation+6.64e-03

Pos logit contributions
arters+0.012
 balcon+0.009
 Phi+0.009
SourceFile+0.009
rics+0.009
Neg logit contributions
jl-0.009
itia-0.009
 favorably-0.009
wake-0.009
eln-0.009
Tokenjournal
Token ID24891
Feature activation+1.61e-02

Pos logit contributions
arters+0.002
 balcon+0.001
 Phi+0.001
rics+0.001
SourceFile+0.001
Neg logit contributions
jl-0.001
itia-0.001
 favorably-0.001
wake-0.001
eln-0.001
Tokenist
Token ID396
Feature activation+8.73e-02

Pos logit contributions
arters+0.003
 balcon+0.003
 Phi+0.003
SourceFile+0.002
rics+0.002
Neg logit contributions
jl-0.003
itia-0.003
 favorably-0.003
wake-0.003
eln-0.003
Token from
Token ID422
Feature activation-1.38e-01

Pos logit contributions
jl+0.009
itia+0.009
 favorably+0.009
wake+0.009
eln+0.009
Neg logit contributions
arters-0.012
 balcon-0.010
 Phi-0.010
rics-0.009
SourceFile-0.009
Token Am
Token ID1703
Feature activation-4.41e-01

Pos logit contributions
jl+0.025
itia+0.024
 favorably+0.024
wake+0.024
eln+0.023
Neg logit contributions
arters-0.031
 balcon-0.024
 Phi-0.024
SourceFile-0.023
rics-0.023
Tokenoud
Token ID2778
Feature activation-4.44e-01

Pos logit contributions
jl+0.059
itia+0.057
 favorably+0.057
wake+0.055
eln+0.054
Neg logit contributions
arters-0.121
 balcon-0.098
 Phi-0.098
SourceFile-0.096
rics-0.095
Tokena
Token ID64
Feature activation+4.47e-01

Pos logit contributions
jl+0.085
 favorably+0.083
itia+0.083
wake+0.082
iji+0.079
Neg logit contributions
arters-0.097
 balcon-0.074
 Phi-0.073
rics-0.071
SourceFile-0.069
Token.
Token ID13
Feature activation-2.68e-02

Pos logit contributions
arters+0.100
 balcon+0.077
 Phi+0.077
rics+0.074
SourceFile+0.073
Neg logit contributions
jl-0.082
 favorably-0.081
itia-0.080
wake-0.079
eln-0.077
Token He
Token ID679
Feature activation-1.59e-01

Pos logit contributions
 favorably+0.004
jl+0.004
itia+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.006
 Phi-0.005
 balcon-0.005
rics-0.005
SourceFile-0.005
Token wants
Token ID3382
Feature activation-9.05e-02

Pos logit contributions
jl+0.027
itia+0.027
 favorably+0.027
wake+0.026
eln+0.026
Neg logit contributions
arters-0.037
 balcon-0.029
 Phi-0.029
rics-0.028
SourceFile-0.028
Token a
Token ID257
Feature activation+1.82e-03

Pos logit contributions
jl+0.016
itia+0.016
 favorably+0.016
wake+0.015
eln+0.015
Neg logit contributions
arters-0.021
 balcon-0.016
 Phi-0.016
SourceFile-0.016
rics-0.016
Token democratic
Token ID10518
Feature activation-3.67e-01

Pos logit contributions
arters+0.000
 Phi+0.000
 balcon+0.000
SourceFile+0.000
rics+0.000
Neg logit contributions
jl-0.000
itia-0.000
 favorably-0.000
wake-0.000
eln-0.000
Token and
Token ID290
Feature activation-3.98e-02

Pos logit contributions
jl+0.062
itia+0.061
 favorably+0.061
wake+0.060
eln+0.059
Neg logit contributions
arters-0.086
 balcon-0.068
 Phi-0.068
rics-0.065
SourceFile-0.065
Token united
Token ID16503
Feature activation-4.82e-01

Pos logit contributions
jl+0.007
itia+0.007
 favorably+0.007
wake+0.007
eln+0.007
Neg logit contributions
arters-0.009
 balcon-0.007
 Phi-0.007
SourceFile-0.007
rics-0.007
Tokenellow
Token ID5037
Feature activation-1.52e-01

Pos logit contributions
arters+0.042
 balcon+0.033
 Phi+0.033
SourceFile+0.032
rics+0.032
Neg logit contributions
jl-0.028
itia-0.028
 favorably-0.028
wake-0.027
eln-0.026
Token eat
Token ID4483
Feature activation+2.22e-01

Pos logit contributions
jl+0.028
 favorably+0.027
itia+0.027
wake+0.027
eln+0.026
Neg logit contributions
arters-0.034
 balcon-0.026
 Phi-0.026
SourceFile-0.025
rics-0.025
Tokeners
Token ID364
Feature activation-8.03e-01

Pos logit contributions
arters+0.064
 balcon+0.053
 Phi+0.053
SourceFile+0.052
rics+0.052
Neg logit contributions
jl-0.026
itia-0.025
 favorably-0.025
wake-0.024
eln-0.024
Token in
Token ID287
Feature activation-2.12e-02

Pos logit contributions
jl+0.136
itia+0.134
 favorably+0.133
wake+0.130
eln+0.128
Neg logit contributions
arters-0.191
 balcon-0.150
 Phi-0.149
SourceFile-0.145
rics-0.144
Token the
Token ID262
Feature activation-3.11e-02

Pos logit contributions
jl+0.004
itia+0.004
 favorably+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.005
 balcon-0.004
 Phi-0.004
SourceFile-0.004
rics-0.003
Token spirit
Token ID4437
Feature activation-1.15e+00

Pos logit contributions
jl+0.005
itia+0.005
 favorably+0.005
wake+0.005
eln+0.005
Neg logit contributions
arters-0.007
 balcon-0.006
 Phi-0.006
SourceFile-0.005
rics-0.005
Token of
Token ID286
Feature activation-2.73e-02

Pos logit contributions
jl+0.181
itia+0.178
 favorably+0.177
wake+0.172
eln+0.169
Neg logit contributions
arters-0.287
 balcon-0.228
 Phi-0.227
SourceFile-0.221
rics-0.220
Token July
Token ID2901
Feature activation-8.45e-02

Pos logit contributions
jl+0.005
itia+0.005
 favorably+0.005
wake+0.005
eln+0.004
Neg logit contributions
arters-0.006
 balcon-0.005
 Phi-0.005
SourceFile-0.005
rics-0.005
Token\n
Token ID198
Feature activation+1.11e-02

Pos logit contributions
jl+0.016
itia+0.015
 favorably+0.015
wake+0.015
eln+0.015
Neg logit contributions
arters-0.019
 balcon-0.014
 Phi-0.014
SourceFile-0.014
rics-0.014
Token\n
Token ID198
Feature activation-8.80e-03

Pos logit contributions
arters+0.003
 balcon+0.002
 Phi+0.002
SourceFile+0.002
rics+0.002
Neg logit contributions
jl-0.002
itia-0.002
 favorably-0.002
wake-0.002
eln-0.002
Token4
Token ID19
Feature activation-2.32e-01

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.002
wake+0.001
eln+0.001
Neg logit contributions
arters-0.002
 balcon-0.002
 Phi-0.002
SourceFile-0.002
rics-0.002
Token man
Token ID582
Feature activation-1.76e-01

Pos logit contributions
jl+0.003
itia+0.003
 favorably+0.003
wake+0.003
eln+0.003
Neg logit contributions
arters-0.004
 balcon-0.003
 Phi-0.003
SourceFile-0.003
rics-0.003
Token who
Token ID508
Feature activation+2.27e-01

Pos logit contributions
jl+0.033
itia+0.033
 favorably+0.033
wake+0.032
eln+0.032
Neg logit contributions
arters-0.038
 Phi-0.029
 balcon-0.029
rics-0.028
SourceFile-0.028
Token was
Token ID373
Feature activation+8.21e-02

Pos logit contributions
arters+0.052
 balcon+0.041
 Phi+0.041
rics+0.040
SourceFile+0.039
Neg logit contributions
jl-0.040
itia-0.039
wake-0.039
 favorably-0.038
eln-0.038
Token willing
Token ID4684
Feature activation-3.24e-01

Pos logit contributions
arters+0.019
 Phi+0.015
 balcon+0.015
SourceFile+0.015
rics+0.015
Neg logit contributions
jl-0.014
itia-0.014
 favorably-0.014
wake-0.013
eln-0.013
Token to
Token ID284
Feature activation+3.29e-03

Pos logit contributions
jl+0.056
itia+0.055
 favorably+0.055
wake+0.054
eln+0.053
Neg logit contributions
arters-0.076
 balcon-0.060
 Phi-0.059
SourceFile-0.057
rics-0.057
Token pay
Token ID1414
Feature activation-1.24e+00

Pos logit contributions
arters+0.001
 balcon+0.001
 Phi+0.001
rics+0.001
SourceFile+0.001
Neg logit contributions
jl-0.001
itia-0.001
 favorably-0.001
wake-0.001
eln-0.000
Token my
Token ID616
Feature activation-5.07e-01

Pos logit contributions
jl+0.216
 favorably+0.213
itia+0.213
wake+0.207
eln+0.204
Neg logit contributions
arters-0.288
 balcon-0.225
 Phi-0.223
SourceFile-0.218
rics-0.216
Token school
Token ID1524
Feature activation+2.82e-01

Pos logit contributions
jl+0.104
 favorably+0.102
itia+0.102
wake+0.100
eln+0.099
Neg logit contributions
arters-0.103
 balcon-0.077
 Phi-0.076
SourceFile-0.074
rics-0.073
Token fees
Token ID6642
Feature activation-1.10e-01

Pos logit contributions
arters+0.065
 balcon+0.051
 Phi+0.051
SourceFile+0.049
rics+0.049
Neg logit contributions
jl-0.049
itia-0.049
 favorably-0.049
wake-0.047
eln-0.047
Token as
Token ID355
Feature activation-1.10e-01

Pos logit contributions
jl+0.019
itia+0.018
 favorably+0.018
wake+0.018
eln+0.018
Neg logit contributions
arters-0.026
 balcon-0.020
 Phi-0.020
rics-0.020
SourceFile-0.020
Token long
Token ID890
Feature activation-9.86e-02

Pos logit contributions
jl+0.018
itia+0.018
 favorably+0.018
wake+0.018
eln+0.017
Neg logit contributions
arters-0.026
 balcon-0.021
 Phi-0.021
SourceFile-0.020
rics-0.020
Token his
Token ID465
Feature activation+7.08e-02

Pos logit contributions
arters+0.006
 balcon+0.005
 Phi+0.005
SourceFile+0.004
rics+0.004
Neg logit contributions
jl-0.005
itia-0.005
 favorably-0.005
wake-0.004
eln-0.004
Token classic
Token ID6833
Feature activation-8.23e-02

Pos logit contributions
arters+0.016
 balcon+0.012
 Phi+0.012
SourceFile+0.012
rics+0.012
Neg logit contributions
jl-0.013
itia-0.013
 favorably-0.013
wake-0.012
eln-0.012
Token delivery
Token ID7585
Feature activation+4.31e-01

Pos logit contributions
jl+0.015
itia+0.014
 favorably+0.014
wake+0.014
eln+0.014
Neg logit contributions
arters-0.019
 balcon-0.015
 Phi-0.015
SourceFile-0.014
rics-0.014
Token stride
Token ID33769
Feature activation-5.42e-01

Pos logit contributions
arters+0.106
 balcon+0.084
 Phi+0.083
SourceFile+0.081
rics+0.080
Neg logit contributions
jl-0.070
itia-0.069
 favorably-0.068
wake-0.067
eln-0.066
Token.
Token ID13
Feature activation-2.45e-02

Pos logit contributions
jl+0.088
itia+0.087
 favorably+0.087
wake+0.085
eln+0.083
Neg logit contributions
arters-0.132
 balcon-0.105
 Phi-0.104
rics-0.101
SourceFile-0.101
Token Cricket
Token ID34761
Feature activation-1.09e+00

Pos logit contributions
jl+0.004
itia+0.004
 favorably+0.004
wake+0.004
eln+0.004
Neg logit contributions
arters-0.006
 balcon-0.005
 Phi-0.005
SourceFile-0.005
rics-0.005
Token nerd
Token ID34712
Feature activation+1.75e-01

Pos logit contributions
jl+0.193
itia+0.190
 favorably+0.189
wake+0.185
eln+0.182
Neg logit contributions
arters-0.251
 balcon-0.196
 Phi-0.195
SourceFile-0.189
rics-0.188
Tokenos
Token ID418
Feature activation-2.17e-01

Pos logit contributions
arters+0.040
 balcon+0.031
 Phi+0.031
SourceFile+0.030
rics+0.030
Neg logit contributions
jl-0.032
itia-0.031
 favorably-0.031
wake-0.030
eln-0.030
Token could
Token ID714
Feature activation-2.67e-01

Pos logit contributions
jl+0.038
itia+0.038
 favorably+0.037
wake+0.037
eln+0.036
Neg logit contributions
arters-0.050
 balcon-0.039
 Phi-0.039
SourceFile-0.038
rics-0.038
Token look
Token ID804
Feature activation+9.04e-02

Pos logit contributions
jl+0.047
itia+0.046
 favorably+0.046
wake+0.045
eln+0.044
Neg logit contributions
arters-0.062
 balcon-0.048
 Phi-0.048
SourceFile-0.046
rics-0.046
Token at
Token ID379
Feature activation+6.31e-02

Pos logit contributions
arters+0.020
 balcon+0.015
 Phi+0.015
SourceFile+0.015
rics+0.015
Neg logit contributions
jl-0.017
itia-0.017
 favorably-0.016
wake-0.016
eln-0.016
Token pain
Token ID2356
Feature activation+2.54e-01

Pos logit contributions
arters+0.002
 balcon+0.002
 Phi+0.002
SourceFile+0.002
rics+0.002
Neg logit contributions
jl-0.002
itia-0.002
 favorably-0.002
wake-0.002
eln-0.002
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
arters+0.058
 balcon+0.046
 Phi+0.045
SourceFile+0.044
rics+0.044
Neg logit contributions
jl-0.045
itia-0.044
 favorably-0.044
wake-0.043
eln-0.042
Token health
Token ID1535
Feature activation-9.05e-02

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.002
wake+0.002
eln+0.002
Neg logit contributions
arters-0.003
 balcon-0.002
 Phi-0.002
SourceFile-0.002
rics-0.002
Token care
Token ID1337
Feature activation-3.02e-01

Pos logit contributions
jl+0.015
itia+0.015
 favorably+0.015
wake+0.014
eln+0.014
Neg logit contributions
arters-0.022
 balcon-0.017
 Phi-0.017
SourceFile-0.017
rics-0.016
Token quality
Token ID3081
Feature activation+9.14e-01

Pos logit contributions
jl+0.054
itia+0.053
 favorably+0.052
wake+0.051
eln+0.050
Neg logit contributions
arters-0.069
 balcon-0.054
 Phi-0.054
SourceFile-0.052
rics-0.052
Token improvement
Token ID9025
Feature activation-1.13e+00

Pos logit contributions
arters+0.207
 balcon+0.162
 Phi+0.161
rics+0.155
SourceFile+0.154
Neg logit contributions
jl-0.164
itia-0.161
 favorably-0.160
wake-0.158
eln-0.156
Token.
Token ID13
Feature activation-2.17e-02

Pos logit contributions
jl+0.211
itia+0.208
 favorably+0.208
wake+0.203
eln+0.200
Neg logit contributions
arters-0.250
 balcon-0.192
 Phi-0.191
SourceFile-0.185
rics-0.184
Token We
Token ID775
Feature activation+5.33e-02

Pos logit contributions
jl+0.003
 favorably+0.003
itia+0.003
wake+0.003
eln+0.003
Neg logit contributions
arters-0.005
 Phi-0.004
 balcon-0.004
SourceFile-0.004
rics-0.004
Token can
Token ID460
Feature activation-2.41e-01

Pos logit contributions
arters+0.012
 balcon+0.010
 Phi+0.009
SourceFile+0.009
rics+0.009
Neg logit contributions
jl-0.009
itia-0.009
 favorably-0.009
wake-0.009
eln-0.009
Token listen
Token ID6004
Feature activation-9.95e-02

Pos logit contributions
jl+0.042
itia+0.042
 favorably+0.041
wake+0.041
eln+0.040
Neg logit contributions
arters-0.056
 balcon-0.043
 Phi-0.043
rics-0.042
SourceFile-0.042
Token to
Token ID284
Feature activation+2.98e-02

Pos logit contributions
jl+0.017
itia+0.017
 favorably+0.017
wake+0.017
eln+0.016
Neg logit contributions
arters-0.023
 balcon-0.018
 Phi-0.018
rics-0.017
SourceFile-0.017
TokenI
Token ID40
Feature activation+9.47e-02

Pos logit contributions
jl+0.022
itia+0.022
 favorably+0.022
wake+0.021
eln+0.021
Neg logit contributions
arters-0.028
 balcon-0.022
 Phi-0.022
SourceFile-0.021
rics-0.021
Token fight
Token ID1907
Feature activation-3.03e-01

Pos logit contributions
arters+0.021
 balcon+0.016
 Phi+0.016
SourceFile+0.016
rics+0.016
Neg logit contributions
jl-0.017
itia-0.017
 favorably-0.017
wake-0.017
eln-0.016
Token very
Token ID845
Feature activation-1.58e-01

Pos logit contributions
jl+0.052
itia+0.051
 favorably+0.051
wake+0.050
eln+0.049
Neg logit contributions
arters-0.071
 balcon-0.056
 Phi-0.056
rics-0.054
SourceFile-0.054
Token hard
Token ID1327
Feature activation+4.49e-01

Pos logit contributions
jl+0.035
itia+0.035
 favorably+0.035
wake+0.034
eln+0.034
Neg logit contributions
arters-0.029
 balcon-0.021
 Phi-0.021
SourceFile-0.020
rics-0.020
Token to
Token ID284
Feature activation-9.32e-03

Pos logit contributions
arters+0.104
 balcon+0.081
 Phi+0.081
SourceFile+0.078
rics+0.078
Neg logit contributions
jl-0.078
itia-0.077
 favorably-0.077
wake-0.075
eln-0.074
Token pay
Token ID1414
Feature activation-1.24e+00

Pos logit contributions
jl+0.002
itia+0.002
 favorably+0.002
wake+0.002
eln+0.002
Neg logit contributions
arters-0.002
 balcon-0.002
 Phi-0.002
SourceFile-0.002
rics-0.002
Token as
Token ID355
Feature activation-7.06e-02

Pos logit contributions
jl+0.186
itia+0.183
 favorably+0.182
wake+0.178
eln+0.174
Neg logit contributions
arters-0.319
 balcon-0.256
 Phi-0.255
SourceFile-0.248
rics-0.247
Token little
Token ID1310
Feature activation-6.12e-01

Pos logit contributions
jl+0.013
itia+0.013
 favorably+0.013
wake+0.012
eln+0.012
Neg logit contributions
arters-0.016
 balcon-0.012
 Phi-0.012
SourceFile-0.012
rics-0.012
Token tax
Token ID1687
Feature activation-6.60e-01

Pos logit contributions
jl+0.100
itia+0.098
 favorably+0.097
wake+0.096
eln+0.094
Neg logit contributions
arters-0.149
 balcon-0.118
 Phi-0.118
SourceFile-0.114
rics-0.114
Token as
Token ID355
Feature activation-6.29e-02

Pos logit contributions
jl+0.113
itia+0.111
 favorably+0.110
wake+0.109
eln+0.106
Neg logit contributions
arters-0.155
 balcon-0.121
 Phi-0.121
SourceFile-0.117
rics-0.117
Token possible
Token ID1744
Feature activation-6.48e-01

Pos logit contributions
jl+0.008
itia+0.008
 favorably+0.008
wake+0.008
eln+0.007
Neg logit contributions
arters-0.018
 Phi-0.015
 balcon-0.014
rics-0.014
Toronto-0.014