TOP ACTIVATIONS
MAX = 0.496

according to a judge in Winn Parish , is a "
Mike Richards Daniel Winn ik Nick las
Vol can on Lan zar ote costs £ 1 , 259
21 Jul ien Ver m ote ( O mega Pharma -
f ift ies against West Indies in a Test match in
han want ri . Dev ote es perform p uj as
At wood and Michael O nda at je . It
W ile E . Coy ote thing is to continuously come
superior . T ote up those points and the
use apps like E vern ote and Dropbox , and video
Test series against the West Indies . " The
got a 50 for West Indies in the second innings .
). Numbers and Ke yn ote will be receiving " improve
different flight ( because Exp edia informed her that her original
Faith . Hope . Charity . Pr udence . Justice
National Association of Basketball Co aches ,[ 122 ] and Associated
the p iv otal Supreme Court justices
theories of rationality since Des cart es . The purpose of
MON TON Omar Kh adr has agreed to remain in
by New Hampshire Senator Daniel Clark on March 22 ; Poll

MIN ACTIVATIONS
MIN = -0.514

to see the From Dusk Till Dawn maze coming this year
47 ] < en 7 _> lol Mang 0 king [
serving a mult icultural community .
FI is affected . Recon nect the radio to the power
on the edge of vit icultural viability , the math becomes
of N PH version of Ol af . Plus , I
rose on hate and master fully directed the hate consuming his
it plans to be " fully engaged throughout the process .
boy ishly and r ue fully . He is relaxing ,
A solo album titled Ol ' Brown E ars is
friend clarified pr ide fully and unp rompt ed
one of the most skill fully - l iked media centers
writes very honestly and truth fully about the human experience .
he says cheer fully . You just
" Mom ? Dad ? Ol af ?" I stood still
, " Okay , focus Ol af . If Brian comes
evil ness that is in Ol af I don
case financial situation if he opt s in .
. ( 1986 ). The Ol we us B ully /
caught me off guard . Ol af releases me and it

INTERVAL 0.244 - 0.496
CONTAINS 0.596%

the same nearby , Ky odo News Service reported . Authorities
water . According to H oss ain , the spill has
. Often , the protesters sly ly distorted the traditional rallying
of the E BS CO Discovery Services ( ED S )
hands In sports referees are often disliked - in

INTERVAL -0.009 - 0.244
CONTAINS 62.370%

We pay for your stories ! Do you have a story
to just send the notification , and then go doing something
people on the city council be making this decision ?
No State , territory , or possession of the United
that might deter future attacks . Indeed , I wish there

INTERVAL -0.262 - -0.009
CONTAINS 36.611%

to trip ? The most important question to answer is how
128 . 8 9 Yemen 128
Start of system () ' s stack frame | void *
anda intended to help minority communities , as well as local
board her flight in Malaysia , and the DOJ claimed ,

INTERVAL -0.514 - -0.262
CONTAINS 0.423%

vec e ux la mo iti é des Cors aires ay
producer He ik ki M ä en p ä ä .
where you stripped back the shield . Plug this loop into
. You might think of Swed es as upright social democr
to the website owned by Alex Jones that propag ates conspiracy
Token according
Token ID1864
Feature activation-4.17e-02

Pos logit contributions
=#+0.001
 Neon+0.001
 裏�+0.001
nect+0.001
orage+0.001
Neg logit contributions
tch-0.001
 Holo-0.001
 ruth-0.001
Clark-0.001
 Doyle-0.001
Token to
Token ID284
Feature activation+6.06e-03

Pos logit contributions
=#+0.027
 Neon+0.027
 裏�+0.025
nect+0.025
orage+0.025
Neg logit contributions
tch-0.028
 Holo-0.027
 ruth-0.027
Clark-0.026
 Doyle-0.026
Token a
Token ID257
Feature activation+2.27e-03

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.004
 Doyle+0.004
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.004
nect-0.004
orage-0.004
Token judge
Token ID5052
Feature activation+2.07e-01

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.001
nect-0.001
orage-0.001
Token in
Token ID287
Feature activation+1.02e-02

Pos logit contributions
tch+0.138
 Holo+0.135
 ruth+0.131
Clark+0.130
 Doyle+0.127
Neg logit contributions
=#-0.134
 Neon-0.133
 裏�-0.127
nect-0.126
orage-0.125
Token Winn
Token ID19454
Feature activation+4.96e-01

Pos logit contributions
tch+0.007
 Holo+0.006
 ruth+0.006
Clark+0.006
 Doyle+0.006
Neg logit contributions
=#-0.007
 Neon-0.007
 裏�-0.007
nect-0.006
orage-0.006
Token Parish
Token ID35240
Feature activation+6.40e-02

Pos logit contributions
tch+0.366
 Holo+0.360
 ruth+0.352
Clark+0.348
 Doyle+0.338
Neg logit contributions
 Neon-0.285
=#-0.283
 裏�-0.269
nect-0.267
icultural-0.267
Token,
Token ID11
Feature activation+1.79e-03

Pos logit contributions
tch+0.042
 Holo+0.042
 ruth+0.041
Clark+0.040
 Doyle+0.039
Neg logit contributions
=#-0.041
 Neon-0.041
 裏�-0.039
nect-0.039
orage-0.039
Token is
Token ID318
Feature activation-6.19e-04

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Token a
Token ID257
Feature activation-1.27e-03

Pos logit contributions
=#+0.000
 Neon+0.000
 裏�+0.000
nect+0.000
orage+0.000
Neg logit contributions
tch-0.000
 Holo-0.000
 ruth-0.000
Clark-0.000
 Doyle-0.000
Token "
Token ID366
Feature activation+1.77e-02

Pos logit contributions
 Neon+0.001
=#+0.001
 裏�+0.001
nect+0.001
icultural+0.001
Neg logit contributions
tch-0.001
 Holo-0.001
 ruth-0.001
Clark-0.001
 Doyle-0.001
TokenMike
Token ID16073
Feature activation-1.43e-01

Pos logit contributions
=#+0.006
 Neon+0.006
 裏�+0.006
icultural+0.006
nect+0.006
Neg logit contributions
tch-0.007
 Holo-0.007
Clark-0.007
 ruth-0.007
 Doyle-0.007
Token Richards
Token ID14743
Feature activation+2.63e-01

Pos logit contributions
=#+0.094
 Neon+0.093
 裏�+0.089
nect+0.088
orage+0.088
Neg logit contributions
tch-0.093
 Holo-0.092
 ruth-0.089
Clark-0.088
 Doyle-0.086
Token
Token ID198
Feature activation-1.26e-02

Pos logit contributions
tch+0.176
 Holo+0.173
 ruth+0.168
Clark+0.166
 Doyle+0.162
Neg logit contributions
=#-0.170
 Neon-0.168
 裏�-0.161
nect-0.160
orage-0.159
Token
Token ID198
Feature activation-6.08e-03

Pos logit contributions
=#+0.008
 Neon+0.008
 裏�+0.008
nect+0.008
orage+0.007
Neg logit contributions
tch-0.009
 Holo-0.008
 ruth-0.008
Clark-0.008
 Doyle-0.008
TokenDaniel
Token ID19962
Feature activation-1.65e-02

Pos logit contributions
=#+0.004
 Neon+0.004
 裏�+0.004
nect+0.004
icultural+0.004
Neg logit contributions
tch-0.004
 Holo-0.004
Clark-0.004
 ruth-0.004
 Doyle-0.004
Token Winn
Token ID19454
Feature activation+4.83e-01

Pos logit contributions
=#+0.010
 Neon+0.010
 裏�+0.010
nect+0.010
orage+0.010
Neg logit contributions
tch-0.011
 Holo-0.011
 ruth-0.011
Clark-0.011
 Doyle-0.010
Tokenik
Token ID1134
Feature activation-7.82e-02

Pos logit contributions
tch+0.397
 Holo+0.397
 ruth+0.389
Clark+0.385
 Doyle+0.373
Neg logit contributions
=#-0.231
 Neon-0.227
 裏�-0.215
nect-0.214
orage-0.212
Token
Token ID198
Feature activation-1.26e-02

Pos logit contributions
=#+0.050
 Neon+0.050
 裏�+0.048
nect+0.048
orage+0.047
Neg logit contributions
tch-0.052
 Holo-0.051
 ruth-0.050
Clark-0.049
 Doyle-0.048
Token
Token ID198
Feature activation-1.16e-02

Pos logit contributions
=#+0.008
 Neon+0.008
 裏�+0.008
nect+0.008
orage+0.007
Neg logit contributions
tch-0.009
 Holo-0.008
 ruth-0.008
Clark-0.008
 Doyle-0.008
TokenNick
Token ID23609
Feature activation-1.17e-01

Pos logit contributions
=#+0.007
 Neon+0.007
 裏�+0.007
icultural+0.007
nect+0.007
Neg logit contributions
tch-0.008
 Holo-0.008
Clark-0.008
 ruth-0.008
 Doyle-0.007
Tokenlas
Token ID21921
Feature activation+9.22e-02

Pos logit contributions
=#+0.079
 Neon+0.078
 裏�+0.076
nect+0.075
icultural+0.074
Neg logit contributions
tch-0.074
 Holo-0.073
 ruth-0.070
Clark-0.070
 Doyle-0.069
Token Vol
Token ID4709
Feature activation+1.92e-02

Pos logit contributions
=#+0.014
 Neon+0.014
 裏�+0.013
nect+0.013
orage+0.013
Neg logit contributions
tch-0.015
 Holo-0.015
 ruth-0.014
Clark-0.014
 Doyle-0.014
Tokencan
Token ID5171
Feature activation+2.08e-02

Pos logit contributions
tch+0.013
 Holo+0.013
 ruth+0.013
Clark+0.013
 Doyle+0.012
Neg logit contributions
=#-0.012
 Neon-0.012
 裏�-0.011
nect-0.011
orage-0.011
Token on
Token ID319
Feature activation-2.05e-02

Pos logit contributions
tch+0.015
 Holo+0.015
 ruth+0.015
Clark+0.014
 Doyle+0.014
Neg logit contributions
=#-0.012
 Neon-0.012
 裏�-0.011
nect-0.011
orage-0.011
Token Lan
Token ID14730
Feature activation+4.96e-02

Pos logit contributions
=#+0.013
 Neon+0.013
 裏�+0.012
nect+0.012
orage+0.012
Neg logit contributions
tch-0.014
 Holo-0.013
 ruth-0.013
Clark-0.013
 Doyle-0.013
Tokenzar
Token ID41046
Feature activation-1.30e-01

Pos logit contributions
tch+0.033
 Holo+0.032
 ruth+0.031
Clark+0.031
 Doyle+0.030
Neg logit contributions
=#-0.032
 Neon-0.032
 裏�-0.030
nect-0.030
orage-0.030
Tokenote
Token ID1258
Feature activation+4.83e-01

Pos logit contributions
=#+0.156
 Neon+0.154
 裏�+0.151
nect+0.151
orage+0.150
Neg logit contributions
tch-0.016
 Holo-0.014
 ruth-0.011
Clark-0.011
 Doyle-0.008
Token costs
Token ID3484
Feature activation-4.61e-02

Pos logit contributions
tch+0.317
 Holo+0.312
 ruth+0.303
Clark+0.299
 Doyle+0.292
Neg logit contributions
=#-0.315
 Neon-0.313
 裏�-0.301
nect-0.297
icultural-0.295
Token £
Token ID4248
Feature activation-5.74e-02

Pos logit contributions
=#+0.027
 Neon+0.027
 裏�+0.026
orage+0.025
nect+0.025
Neg logit contributions
tch-0.033
 Holo-0.033
 ruth-0.032
Clark-0.031
 Doyle-0.031
Token1
Token ID16
Feature activation+5.19e-02

Pos logit contributions
=#+0.039
 Neon+0.038
 裏�+0.037
nect+0.036
orage+0.036
Neg logit contributions
tch-0.037
 Holo-0.036
 ruth-0.035
Clark-0.034
 Doyle-0.034
Token,
Token ID11
Feature activation+2.04e-02

Pos logit contributions
tch+0.035
 Holo+0.034
 ruth+0.033
Clark+0.033
 Doyle+0.032
Neg logit contributions
=#-0.033
 Neon-0.033
 裏�-0.031
nect-0.031
orage-0.031
Token259
Token ID25191
Feature activation+1.14e-01

Pos logit contributions
tch+0.014
 Holo+0.013
 ruth+0.013
Clark+0.013
 Doyle+0.013
Neg logit contributions
=#-0.013
 Neon-0.013
 裏�-0.012
nect-0.012
orage-0.012
Token 21
Token ID2310
Feature activation+1.20e-02

Pos logit contributions
=#+0.002
 Neon+0.002
 裏�+0.002
nect+0.002
orage+0.002
Neg logit contributions
tch-0.002
 Holo-0.002
 ruth-0.002
Clark-0.002
 Doyle-0.002
Token Jul
Token ID5979
Feature activation+1.36e-01

Pos logit contributions
tch+0.008
 Holo+0.008
 ruth+0.008
Clark+0.008
 Doyle+0.008
Neg logit contributions
=#-0.007
 Neon-0.007
 裏�-0.007
nect-0.007
orage-0.007
Tokenien
Token ID2013
Feature activation+1.59e-01

Pos logit contributions
tch+0.074
 Holo+0.072
 ruth+0.070
Clark+0.069
 Doyle+0.066
Neg logit contributions
 Neon-0.105
=#-0.104
 裏�-0.101
nect-0.100
icultural-0.100
Token Ver
Token ID4643
Feature activation-5.33e-02

Pos logit contributions
tch+0.106
 Holo+0.102
 ruth+0.100
Clark+0.100
 Doyle+0.096
Neg logit contributions
 Neon-0.103
=#-0.102
 裏�-0.098
nect-0.097
orage-0.096
Tokenm
Token ID76
Feature activation+1.00e-01

Pos logit contributions
=#+0.036
 Neon+0.036
 裏�+0.034
nect+0.034
orage+0.034
Neg logit contributions
tch-0.033
 Holo-0.033
 ruth-0.032
Clark-0.032
 Doyle-0.031
Tokenote
Token ID1258
Feature activation+4.79e-01

Pos logit contributions
tch+0.013
 Holo+0.012
 ruth+0.010
Clark+0.010
 Doyle+0.008
Neg logit contributions
=#-0.118
 Neon-0.118
 裏�-0.115
nect-0.115
icultural-0.114
Token (
Token ID357
Feature activation-1.63e-02

Pos logit contributions
tch+0.326
 Holo+0.321
 ruth+0.311
Clark+0.308
 Doyle+0.302
Neg logit contributions
=#-0.303
 Neon-0.299
 裏�-0.287
nect-0.284
orage-0.282
TokenO
Token ID46
Feature activation-6.49e-02

Pos logit contributions
=#+0.011
 Neon+0.010
 裏�+0.010
nect+0.010
orage+0.010
Neg logit contributions
tch-0.011
 Holo-0.011
 ruth-0.010
Clark-0.010
 Doyle-0.010
Tokenmega
Token ID13731
Feature activation+2.58e-01

Pos logit contributions
=#+0.062
 Neon+0.061
 裏�+0.060
nect+0.059
orage+0.059
Neg logit contributions
tch-0.023
 Holo-0.022
 ruth-0.021
Clark-0.021
 Doyle-0.020
Token Pharma
Token ID41845
Feature activation+1.06e-01

Pos logit contributions
tch+0.157
 Holo+0.152
 ruth+0.149
Clark+0.148
 Doyle+0.141
Neg logit contributions
=#-0.182
 Neon-0.181
 裏�-0.172
nect-0.171
orage-0.170
Token -
Token ID532
Feature activation+1.25e-01

Pos logit contributions
tch+0.059
 Holo+0.057
 ruth+0.055
Clark+0.055
 Doyle+0.053
Neg logit contributions
 Neon-0.080
=#-0.080
 裏�-0.076
nect-0.076
orage-0.075
Token f
Token ID277
Feature activation+3.64e-02

Pos logit contributions
tch+0.013
 Holo+0.012
 ruth+0.012
Clark+0.012
 Doyle+0.012
Neg logit contributions
=#-0.011
 Neon-0.011
 裏�-0.010
nect-0.010
orage-0.010
Tokenift
Token ID2135
Feature activation+1.82e-02

Pos logit contributions
tch+0.025
 Holo+0.025
 ruth+0.024
Clark+0.024
 Doyle+0.023
Neg logit contributions
=#-0.023
 Neon-0.023
nect-0.021
icultural-0.021
orage-0.021
Tokenies
Token ID444
Feature activation-1.43e-01

Pos logit contributions
tch+0.016
 Holo+0.016
 ruth+0.015
Clark+0.015
 Doyle+0.015
Neg logit contributions
=#-0.008
 Neon-0.008
 裏�-0.007
nect-0.007
orage-0.007
Token against
Token ID1028
Feature activation+3.12e-02

Pos logit contributions
=#+0.094
 Neon+0.093
 裏�+0.090
nect+0.089
orage+0.088
Neg logit contributions
tch-0.094
 Holo-0.092
 ruth-0.089
Clark-0.088
 Doyle-0.086
Token West
Token ID2688
Feature activation+2.16e-01

Pos logit contributions
tch+0.018
 Holo+0.018
 ruth+0.017
Clark+0.017
 Doyle+0.017
Neg logit contributions
=#-0.023
 Neon-0.022
 裏�-0.022
nect-0.021
orage-0.021
Token Indies
Token ID42556
Feature activation+4.78e-01

Pos logit contributions
tch+0.021
 Holo+0.019
 ruth+0.014
Clark+0.013
 Doyle+0.010
Neg logit contributions
=#-0.263
 Neon-0.261
 裏�-0.256
nect-0.254
orage-0.253
Token in
Token ID287
Feature activation+1.40e-02

Pos logit contributions
tch+0.318
 Holo+0.312
 ruth+0.303
Clark+0.300
 Doyle+0.293
Neg logit contributions
=#-0.309
 Neon-0.307
 裏�-0.295
nect-0.291
orage-0.289
Token a
Token ID257
Feature activation+5.23e-03

Pos logit contributions
tch+0.009
 Holo+0.009
 ruth+0.009
Clark+0.009
 Doyle+0.009
Neg logit contributions
=#-0.009
 Neon-0.009
 裏�-0.009
nect-0.009
icultural-0.009
Token Test
Token ID6208
Feature activation-5.97e-02

Pos logit contributions
tch+0.004
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.003
nect-0.003
orage-0.003
Token match
Token ID2872
Feature activation-7.40e-05

Pos logit contributions
=#+0.037
 Neon+0.037
 裏�+0.036
nect+0.035
orage+0.035
Neg logit contributions
tch-0.041
 Holo-0.040
 ruth-0.039
Clark-0.039
 Doyle-0.038
Token in
Token ID287
Feature activation+2.05e-02

Pos logit contributions
=#+0.000
 Neon+0.000
 裏�+0.000
nect+0.000
orage+0.000
Neg logit contributions
tch-0.000
 Holo-0.000
 ruth-0.000
Clark-0.000
 Doyle-0.000
Tokenhan
Token ID7637
Feature activation+2.33e-01

Pos logit contributions
=#+0.101
 Neon+0.100
 裏�+0.097
nect+0.096
orage+0.095
Neg logit contributions
tch-0.071
 Holo-0.069
 ruth-0.067
Clark-0.066
 Doyle-0.064
Tokenwant
Token ID42949
Feature activation-9.38e-02

Pos logit contributions
tch+0.181
 Holo+0.178
 ruth+0.174
Clark+0.172
 Doyle+0.168
Neg logit contributions
=#-0.126
 Neon-0.124
 裏�-0.118
nect-0.116
orage-0.115
Tokenri
Token ID380
Feature activation-4.75e-02

Pos logit contributions
=#+0.053
 Neon+0.052
 裏�+0.050
nect+0.049
orage+0.049
Neg logit contributions
tch-0.070
 Holo-0.069
 ruth-0.067
Clark-0.067
 Doyle-0.066
Token.
Token ID13
Feature activation+3.20e-03

Pos logit contributions
=#+0.030
 Neon+0.030
 裏�+0.029
nect+0.028
orage+0.028
Neg logit contributions
tch-0.032
 Holo-0.032
 ruth-0.031
Clark-0.030
 Doyle-0.030
Token Dev
Token ID6245
Feature activation+1.80e-01

Pos logit contributions
tch+0.002
 Holo+0.002
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
icultural-0.002
Tokenote
Token ID1258
Feature activation+4.77e-01

Pos logit contributions
tch+0.097
 Holo+0.095
 ruth+0.092
Clark+0.091
 Doyle+0.088
Neg logit contributions
=#-0.139
 Neon-0.137
 裏�-0.133
nect-0.132
orage-0.131
Tokenes
Token ID274
Feature activation+5.97e-02

Pos logit contributions
tch+0.293
 Holo+0.288
 ruth+0.279
Clark+0.276
 Doyle+0.269
Neg logit contributions
=#-0.332
 Neon-0.329
 裏�-0.317
nect-0.314
orage-0.312
Token perform
Token ID1620
Feature activation+2.36e-01

Pos logit contributions
tch+0.040
 Holo+0.039
 ruth+0.038
Clark+0.038
 Doyle+0.037
Neg logit contributions
=#-0.038
 Neon-0.038
 裏�-0.037
nect-0.036
orage-0.036
Token p
Token ID279
Feature activation+1.38e-01

Pos logit contributions
tch+0.157
 Holo+0.155
 ruth+0.150
Clark+0.148
 Doyle+0.145
Neg logit contributions
=#-0.153
 Neon-0.151
 裏�-0.146
nect-0.144
orage-0.143
Tokenuj
Token ID23577
Feature activation+8.23e-02

Pos logit contributions
tch+0.081
 Holo+0.079
 ruth+0.076
Clark+0.076
 Doyle+0.074
Neg logit contributions
=#-0.100
 Neon-0.099
 裏�-0.096
nect-0.095
orage-0.095
Tokenas
Token ID292
Feature activation+1.69e-01

Pos logit contributions
tch+0.037
 Holo+0.036
 ruth+0.034
Clark+0.034
 Doyle+0.032
Neg logit contributions
=#-0.071
 Neon-0.071
 裏�-0.069
nect-0.068
orage-0.068
Token At
Token ID1629
Feature activation+1.02e-01

Pos logit contributions
 Neon+0.059
=#+0.059
 裏�+0.056
nect+0.055
orage+0.055
Neg logit contributions
tch-0.057
 Holo-0.056
 ruth-0.055
Clark-0.054
 Doyle-0.052
Tokenwood
Token ID3822
Feature activation+5.62e-02

Pos logit contributions
tch+0.065
 Holo+0.064
 ruth+0.062
Clark+0.061
 Doyle+0.060
Neg logit contributions
=#-0.068
 Neon-0.067
 裏�-0.064
icultural-0.064
nect-0.064
Token and
Token ID290
Feature activation+1.06e-02

Pos logit contributions
tch+0.037
 Holo+0.037
 ruth+0.035
Clark+0.035
 Doyle+0.034
Neg logit contributions
=#-0.036
 Neon-0.036
 裏�-0.035
nect-0.034
orage-0.034
Token Michael
Token ID3899
Feature activation-7.76e-02

Pos logit contributions
tch+0.007
 Holo+0.007
 ruth+0.007
Clark+0.007
 Doyle+0.006
Neg logit contributions
=#-0.007
 Neon-0.007
 裏�-0.007
nect-0.006
orage-0.006
Token O
Token ID440
Feature activation-9.17e-02

Pos logit contributions
=#+0.051
 Neon+0.050
 裏�+0.048
nect+0.048
orage+0.047
Neg logit contributions
tch-0.051
 Holo-0.050
 ruth-0.049
Clark-0.048
 Doyle-0.047
Tokennda
Token ID45658
Feature activation+4.75e-01

Pos logit contributions
=#+0.057
 Neon+0.056
 裏�+0.054
nect+0.053
icultural+0.053
Neg logit contributions
tch-0.063
 Holo-0.063
 ruth-0.061
Clark-0.060
 Doyle-0.058
Tokenat
Token ID265
Feature activation+1.39e-01

Pos logit contributions
tch+0.272
 Holo+0.258
 ruth+0.252
Clark+0.247
mite+0.232
Neg logit contributions
=#-0.358
 Neon-0.354
 裏�-0.341
nect-0.336
orage-0.336
Tokenje
Token ID18015
Feature activation-1.08e-01

Pos logit contributions
tch+0.119
 Holo+0.117
 ruth+0.114
Clark+0.113
 Doyle+0.110
Neg logit contributions
=#-0.063
 Neon-0.062
icultural-0.058
 裏�-0.058
nect-0.058
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
=#+0.069
 Neon+0.068
 裏�+0.066
nect+0.065
orage+0.064
Neg logit contributions
tch-0.073
 Holo-0.072
 ruth-0.070
Clark-0.069
 Doyle-0.067
Token It
Token ID632
Feature activation-1.70e-02

Pos logit contributions
tch+0.010
 Holo+0.010
 ruth+0.009
Clark+0.009
 Doyle+0.009
Neg logit contributions
=#-0.009
 Neon-0.009
 裏�-0.009
nect-0.009
orage-0.009
Token
Token ID447
Feature activation+9.30e-03

Pos logit contributions
=#+0.011
 Neon+0.011
 裏�+0.010
nect+0.010
orage+0.010
Neg logit contributions
tch-0.012
 Holo-0.011
 ruth-0.011
Clark-0.011
 Doyle-0.011
Token W
Token ID370
Feature activation+2.06e-02

Pos logit contributions
tch+0.005
 Holo+0.005
 ruth+0.005
Clark+0.004
 Doyle+0.004
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.004
nect-0.004
orage-0.004
Tokenile
Token ID576
Feature activation-8.55e-02

Pos logit contributions
tch+0.016
 Holo+0.016
 ruth+0.015
Clark+0.015
 Doyle+0.015
Neg logit contributions
=#-0.011
 Neon-0.011
 裏�-0.010
nect-0.010
orage-0.010
Token E
Token ID412
Feature activation-9.98e-02

Pos logit contributions
=#+0.044
 Neon+0.044
 裏�+0.042
nect+0.041
icultural+0.041
Neg logit contributions
tch-0.068
 Holo-0.067
 ruth-0.065
Clark-0.064
 Doyle-0.062
Token.
Token ID13
Feature activation+7.22e-03

Pos logit contributions
=#+0.064
 Neon+0.063
 裏�+0.061
nect+0.060
orage+0.060
Neg logit contributions
tch-0.067
 Holo-0.066
 ruth-0.064
Clark-0.063
 Doyle-0.062
Token Coy
Token ID26887
Feature activation+2.47e-01

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.002
 Doyle+0.002
Neg logit contributions
 Neon-0.007
=#-0.007
 裏�-0.006
nect-0.006
orage-0.006
Tokenote
Token ID1258
Feature activation+4.75e-01

Pos logit contributions
tch+0.022
 Holo+0.021
 ruth+0.016
Clark+0.014
 Doyle+0.009
Neg logit contributions
=#-0.301
 Neon-0.299
nect-0.292
 裏�-0.290
orage-0.290
Token thing
Token ID1517
Feature activation-5.56e-02

Pos logit contributions
tch+0.331
 Holo+0.326
 ruth+0.317
Clark+0.313
 Doyle+0.307
Neg logit contributions
=#-0.291
 Neon-0.288
 裏�-0.277
nect-0.274
orage-0.272
Token is
Token ID318
Feature activation+1.64e-03

Pos logit contributions
=#+0.036
 Neon+0.036
 裏�+0.034
nect+0.034
orage+0.034
Neg logit contributions
tch-0.037
 Holo-0.036
 ruth-0.035
Clark-0.035
 Doyle-0.034
Token to
Token ID284
Feature activation+6.83e-03

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Token continuously
Token ID17282
Feature activation+7.09e-02

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.004
 Doyle+0.004
Neg logit contributions
=#-0.005
 Neon-0.004
 裏�-0.004
nect-0.004
orage-0.004
Token come
Token ID1282
Feature activation-2.72e-02

Pos logit contributions
tch+0.046
 Holo+0.045
 ruth+0.044
Clark+0.043
 Doyle+0.042
Neg logit contributions
=#-0.047
 Neon-0.046
 裏�-0.045
nect-0.044
orage-0.044
Token superior
Token ID9098
Feature activation+1.84e-01

Pos logit contributions
tch+0.051
 Holo+0.050
Clark+0.048
 ruth+0.048
 Doyle+0.047
Neg logit contributions
=#-0.049
 Neon-0.049
 裏�-0.047
nect-0.046
orage-0.046
Token.
Token ID13
Feature activation+1.76e-03

Pos logit contributions
tch+0.121
 Holo+0.119
 ruth+0.116
Clark+0.115
 Doyle+0.112
Neg logit contributions
=#-0.119
 Neon-0.119
 裏�-0.114
nect-0.112
orage-0.112
Token
Token ID198
Feature activation+7.26e-03

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Token
Token ID198
Feature activation+3.88e-03

Pos logit contributions
tch+0.005
 Holo+0.005
 ruth+0.005
Clark+0.005
 Doyle+0.005
Neg logit contributions
=#-0.005
 Neon-0.005
 裏�-0.004
nect-0.004
orage-0.004
TokenT
Token ID51
Feature activation-2.46e-02

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
orage-0.002
Tokenote
Token ID1258
Feature activation+4.74e-01

Pos logit contributions
=#+0.015
 Neon+0.015
 裏�+0.014
nect+0.014
orage+0.014
Neg logit contributions
tch-0.017
 Holo-0.017
 ruth-0.017
Clark-0.016
 Doyle-0.016
Token up
Token ID510
Feature activation-8.67e-03

Pos logit contributions
tch+0.315
 Holo+0.309
 ruth+0.300
Clark+0.297
 Doyle+0.290
Neg logit contributions
=#-0.306
 Neon-0.304
 裏�-0.292
nect-0.288
orage-0.287
Token those
Token ID883
Feature activation+9.17e-02

Pos logit contributions
=#+0.006
 Neon+0.006
 裏�+0.005
nect+0.005
orage+0.005
Neg logit contributions
tch-0.006
 Holo-0.006
 ruth-0.005
Clark-0.005
 Doyle-0.005
Token points
Token ID2173
Feature activation-8.56e-02

Pos logit contributions
tch+0.061
 Holo+0.060
 ruth+0.058
Clark+0.058
 Doyle+0.056
Neg logit contributions
=#-0.059
 Neon-0.059
 裏�-0.056
nect-0.056
orage-0.055
Token and
Token ID290
Feature activation+1.02e-02

Pos logit contributions
=#+0.056
 Neon+0.055
 裏�+0.053
nect+0.053
orage+0.052
Neg logit contributions
tch-0.056
 Holo-0.055
 ruth-0.054
Clark-0.053
 Doyle-0.052
Token the
Token ID262
Feature activation+1.97e-02

Pos logit contributions
tch+0.007
 Holo+0.007
 ruth+0.006
Clark+0.006
 Doyle+0.006
Neg logit contributions
=#-0.007
 Neon-0.006
 裏�-0.006
nect-0.006
orage-0.006
Token use
Token ID779
Feature activation-8.88e-02

Pos logit contributions
=#+0.036
 Neon+0.036
 裏�+0.035
nect+0.034
orage+0.034
Neg logit contributions
tch-0.039
 Holo-0.038
 ruth-0.037
Clark-0.037
 Doyle-0.036
Token apps
Token ID6725
Feature activation+2.29e-03

Pos logit contributions
=#+0.057
 Neon+0.057
 裏�+0.054
nect+0.054
orage+0.053
Neg logit contributions
tch-0.059
 Holo-0.058
 ruth-0.057
Clark-0.056
 Doyle-0.055
Token like
Token ID588
Feature activation+4.13e-04

Pos logit contributions
tch+0.002
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Token E
Token ID412
Feature activation-9.66e-02

Pos logit contributions
tch+0.000
 Holo+0.000
 ruth+0.000
Clark+0.000
 Doyle+0.000
Neg logit contributions
 Neon-0.000
=#-0.000
 裏�-0.000
nect-0.000
orage-0.000
Tokenvern
Token ID933
Feature activation+1.97e-01

Pos logit contributions
=#+0.076
 Neon+0.075
 裏�+0.073
nect+0.072
orage+0.072
Neg logit contributions
tch-0.050
 Holo-0.050
 ruth-0.048
Clark-0.047
 Doyle-0.046
Tokenote
Token ID1258
Feature activation+4.74e-01

Pos logit contributions
tch+0.015
 Holo+0.013
 ruth+0.009
Clark+0.008
 Doyle+0.005
Neg logit contributions
=#-0.243
 Neon-0.242
 裏�-0.236
nect-0.235
orage-0.235
Token and
Token ID290
Feature activation+5.95e-03

Pos logit contributions
tch+0.313
 Holo+0.307
 ruth+0.298
Clark+0.295
 Doyle+0.288
Neg logit contributions
=#-0.308
 Neon-0.306
 裏�-0.294
nect-0.290
orage-0.289
Token Dropbox
Token ID38930
Feature activation+1.23e-01

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.004
 Doyle+0.004
Neg logit contributions
 Neon-0.004
=#-0.004
 裏�-0.004
nect-0.004
orage-0.004
Token,
Token ID11
Feature activation+3.45e-03

Pos logit contributions
tch+0.083
 Holo+0.081
 ruth+0.079
Clark+0.078
 Doyle+0.076
Neg logit contributions
=#-0.079
 Neon-0.079
 裏�-0.075
nect-0.075
orage-0.074
Token and
Token ID290
Feature activation+9.30e-04

Pos logit contributions
tch+0.002
 Holo+0.002
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
 Neon-0.002
=#-0.002
 裏�-0.002
nect-0.002
orage-0.002
Token video
Token ID2008
Feature activation-4.53e-02

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Token Test
Token ID6208
Feature activation-6.93e-02

Pos logit contributions
=#+0.058
 Neon+0.058
 裏�+0.056
nect+0.055
orage+0.055
Neg logit contributions
tch-0.063
 Holo-0.062
 ruth-0.060
Clark-0.059
 Doyle-0.058
Token series
Token ID2168
Feature activation-1.11e-01

Pos logit contributions
=#+0.042
 Neon+0.042
 裏�+0.040
nect+0.040
orage+0.039
Neg logit contributions
tch-0.049
 Holo-0.048
 ruth-0.047
Clark-0.046
 Doyle-0.045
Token against
Token ID1028
Feature activation+9.70e-03

Pos logit contributions
=#+0.075
 Neon+0.074
 裏�+0.071
nect+0.070
orage+0.070
Neg logit contributions
tch-0.072
 Holo-0.070
 ruth-0.068
Clark-0.067
 Doyle-0.066
Token the
Token ID262
Feature activation-2.81e-03

Pos logit contributions
tch+0.006
 Holo+0.006
 ruth+0.006
Clark+0.005
 Doyle+0.005
Neg logit contributions
=#-0.007
 Neon-0.007
 裏�-0.007
nect-0.006
orage-0.006
Token West
Token ID2688
Feature activation+2.14e-01

Pos logit contributions
=#+0.002
 Neon+0.002
 裏�+0.002
nect+0.002
icultural+0.002
Neg logit contributions
tch-0.002
 Holo-0.002
 ruth-0.002
Clark-0.002
 Doyle-0.002
Token Indies
Token ID42556
Feature activation+4.73e-01

Pos logit contributions
tch+0.020
 Holo+0.017
 ruth+0.013
Clark+0.012
 Doyle+0.008
Neg logit contributions
=#-0.261
 Neon-0.260
 裏�-0.254
nect-0.253
orage-0.252
Token.
Token ID13
Feature activation-6.08e-03

Pos logit contributions
tch+0.309
 Holo+0.304
 ruth+0.295
Clark+0.292
 Doyle+0.285
Neg logit contributions
=#-0.311
 Neon-0.308
 裏�-0.296
nect-0.293
orage-0.291
Token
Token ID198
Feature activation+5.60e-04

Pos logit contributions
=#+0.004
 Neon+0.004
 裏�+0.004
nect+0.004
orage+0.004
Neg logit contributions
tch-0.004
 Holo-0.004
 ruth-0.004
Clark-0.004
 Doyle-0.004
Token
Token ID198
Feature activation-7.01e-03

Pos logit contributions
tch+0.000
 Holo+0.000
 ruth+0.000
Clark+0.000
 Doyle+0.000
Neg logit contributions
=#-0.000
 Neon-0.000
 裏�-0.000
nect-0.000
orage-0.000
Token"
Token ID1
Feature activation-1.65e-02

Pos logit contributions
=#+0.005
 Neon+0.004
 裏�+0.004
nect+0.004
orage+0.004
Neg logit contributions
tch-0.005
 Holo-0.005
 ruth-0.004
Clark-0.004
 Doyle-0.004
TokenThe
Token ID464
Feature activation-2.16e-02

Pos logit contributions
=#+0.010
 Neon+0.010
 裏�+0.010
nect+0.010
icultural+0.010
Neg logit contributions
tch-0.011
 Holo-0.011
 ruth-0.011
Clark-0.011
 Doyle-0.010
Token got
Token ID1392
Feature activation-3.71e-02

Pos logit contributions
=#+0.003
 Neon+0.003
 裏�+0.003
nect+0.003
orage+0.003
Neg logit contributions
tch-0.003
 Holo-0.003
 ruth-0.003
Clark-0.003
 Doyle-0.003
Token a
Token ID257
Feature activation-1.16e-02

Pos logit contributions
=#+0.024
 Neon+0.024
 裏�+0.023
nect+0.023
orage+0.023
Neg logit contributions
tch-0.024
 Holo-0.024
 ruth-0.023
Clark-0.023
 Doyle-0.022
Token 50
Token ID2026
Feature activation+3.56e-02

Pos logit contributions
=#+0.008
 Neon+0.008
 裏�+0.008
nect+0.008
orage+0.008
Neg logit contributions
tch-0.007
 Holo-0.007
 ruth-0.007
Clark-0.007
 Doyle-0.006
Token for
Token ID329
Feature activation-1.73e-02

Pos logit contributions
tch+0.024
 Holo+0.023
 ruth+0.022
Clark+0.022
 Doyle+0.022
Neg logit contributions
=#-0.023
 Neon-0.023
 裏�-0.022
nect-0.022
orage-0.021
Token West
Token ID2688
Feature activation+2.12e-01

Pos logit contributions
=#+0.012
 Neon+0.011
 裏�+0.011
nect+0.011
orage+0.011
Neg logit contributions
tch-0.011
 Holo-0.011
 ruth-0.011
Clark-0.010
 Doyle-0.010
Token Indies
Token ID42556
Feature activation+4.72e-01

Pos logit contributions
tch+0.021
 Holo+0.019
 ruth+0.014
Clark+0.013
 Doyle+0.010
Neg logit contributions
=#-0.257
 Neon-0.256
 裏�-0.250
nect-0.249
orage-0.248
Token in
Token ID287
Feature activation+1.13e-02

Pos logit contributions
tch+0.317
 Holo+0.311
 ruth+0.302
Clark+0.299
 Doyle+0.292
Neg logit contributions
=#-0.303
 Neon-0.300
 裏�-0.289
nect-0.285
orage-0.283
Token the
Token ID262
Feature activation+2.89e-03

Pos logit contributions
tch+0.007
 Holo+0.007
 ruth+0.007
Clark+0.007
 Doyle+0.007
Neg logit contributions
=#-0.008
 Neon-0.007
 裏�-0.007
nect-0.007
orage-0.007
Token second
Token ID1218
Feature activation+7.95e-02

Pos logit contributions
tch+0.002
 Holo+0.002
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
orage-0.002
Token innings
Token ID16267
Feature activation-2.26e-01

Pos logit contributions
tch+0.063
 Holo+0.062
 ruth+0.060
Clark+0.060
 Doyle+0.059
Neg logit contributions
=#-0.041
 Neon-0.041
 裏�-0.039
nect-0.038
orage-0.038
Token.
Token ID13
Feature activation+3.56e-03

Pos logit contributions
=#+0.141
 Neon+0.140
 裏�+0.134
nect+0.132
orage+0.132
Neg logit contributions
tch-0.154
 Holo-0.152
 ruth-0.147
Clark-0.146
 Doyle-0.142
Token).
Token ID737
Feature activation+1.76e-02

Pos logit contributions
tch+0.016
 Holo+0.016
 ruth+0.015
Clark+0.015
 Doyle+0.015
Neg logit contributions
=#-0.015
 Neon-0.015
 裏�-0.014
nect-0.014
orage-0.014
Token Numbers
Token ID27797
Feature activation-2.79e-01

Pos logit contributions
tch+0.012
 Holo+0.012
 ruth+0.011
Clark+0.011
 Doyle+0.011
Neg logit contributions
=#-0.011
 Neon-0.011
 裏�-0.011
nect-0.011
orage-0.011
Token and
Token ID290
Feature activation+1.17e-02

Pos logit contributions
=#+0.179
 Neon+0.178
 裏�+0.170
nect+0.168
icultural+0.167
Neg logit contributions
tch-0.187
 Holo-0.183
 ruth-0.178
Clark-0.176
 Doyle-0.172
Token Ke
Token ID3873
Feature activation+1.70e-01

Pos logit contributions
tch+0.008
 Holo+0.008
 ruth+0.008
Clark+0.008
 Doyle+0.008
Neg logit contributions
=#-0.007
 Neon-0.007
 裏�-0.007
nect-0.006
orage-0.006
Tokenyn
Token ID2047
Feature activation+2.02e-01

Pos logit contributions
tch+0.084
 Holo+0.082
 ruth+0.079
Clark+0.078
 Doyle+0.076
Neg logit contributions
=#-0.138
 Neon-0.137
 裏�-0.133
nect-0.132
orage-0.131
Tokenote
Token ID1258
Feature activation+4.72e-01

Pos logit contributions
tch+0.020
 Holo+0.019
 ruth+0.014
Clark+0.013
 Doyle+0.010
Neg logit contributions
=#-0.245
 Neon-0.244
 裏�-0.238
nect-0.237
orage-0.237
Token will
Token ID481
Feature activation+2.36e-02

Pos logit contributions
tch+0.327
 Holo+0.322
 ruth+0.312
Clark+0.309
 Doyle+0.302
Neg logit contributions
=#-0.292
 Neon-0.289
 裏�-0.278
nect-0.274
orage-0.273
Token be
Token ID307
Feature activation+2.16e-02

Pos logit contributions
tch+0.015
 Holo+0.015
 ruth+0.015
Clark+0.015
 Doyle+0.014
Neg logit contributions
=#-0.015
 Neon-0.015
 裏�-0.015
nect-0.015
orage-0.014
Token receiving
Token ID6464
Feature activation+9.92e-03

Pos logit contributions
tch+0.015
 Holo+0.014
 ruth+0.014
Clark+0.014
 Doyle+0.013
Neg logit contributions
=#-0.014
 Neon-0.014
 裏�-0.013
nect-0.013
orage-0.013
Token "
Token ID366
Feature activation+2.50e-02

Pos logit contributions
tch+0.007
 Holo+0.006
 ruth+0.006
Clark+0.006
 Doyle+0.006
Neg logit contributions
=#-0.006
 Neon-0.006
 裏�-0.006
nect-0.006
orage-0.006
Tokenimprove
Token ID49453
Feature activation+1.55e-01

Pos logit contributions
tch+0.017
 Holo+0.016
 ruth+0.016
Clark+0.016
 Doyle+0.015
Neg logit contributions
=#-0.016
 Neon-0.016
 裏�-0.015
nect-0.015
orage-0.015
Token different
Token ID1180
Feature activation-1.01e-02

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
orage-0.001
nect-0.001
Token flight
Token ID5474
Feature activation-9.60e-03

Pos logit contributions
=#+0.007
 Neon+0.007
 裏�+0.006
nect+0.006
orage+0.006
Neg logit contributions
tch-0.007
 Holo-0.006
 ruth-0.006
Clark-0.006
 Doyle-0.006
Token (
Token ID357
Feature activation-2.42e-02

Pos logit contributions
=#+0.006
 Neon+0.006
 裏�+0.006
nect+0.006
icultural+0.006
Neg logit contributions
tch-0.006
 Holo-0.006
 ruth-0.006
Clark-0.006
 Doyle-0.006
Tokenbecause
Token ID13893
Feature activation-3.66e-02

Pos logit contributions
=#+0.015
 Neon+0.015
 裏�+0.015
nect+0.014
icultural+0.014
Neg logit contributions
tch-0.016
 Holo-0.016
 ruth-0.016
Clark-0.015
 Doyle-0.015
Token Exp
Token ID5518
Feature activation+2.60e-01

Pos logit contributions
=#+0.024
 Neon+0.024
 裏�+0.023
nect+0.023
orage+0.023
Neg logit contributions
tch-0.024
 Holo-0.023
 ruth-0.023
Clark-0.022
 Doyle-0.022
Tokenedia
Token ID5507
Feature activation+4.71e-01

Pos logit contributions
tch+0.035
 Holo+0.032
 ruth+0.027
Clark+0.025
 Doyle+0.022
Neg logit contributions
=#-0.305
 Neon-0.304
 裏�-0.297
nect-0.295
orage-0.295
Token informed
Token ID7981
Feature activation-4.37e-02

Pos logit contributions
tch+0.309
 Holo+0.304
 ruth+0.294
Clark+0.291
 Doyle+0.285
Neg logit contributions
=#-0.309
 Neon-0.306
 裏�-0.295
nect-0.291
orage-0.289
Token her
Token ID607
Feature activation+6.08e-03

Pos logit contributions
=#+0.029
 Neon+0.029
 裏�+0.028
nect+0.027
orage+0.027
Neg logit contributions
tch-0.029
 Holo-0.028
 ruth-0.027
Clark-0.027
 Doyle-0.026
Token that
Token ID326
Feature activation+2.06e-02

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.004
 Doyle+0.004
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.004
nect-0.004
orage-0.004
Token her
Token ID607
Feature activation+6.64e-03

Pos logit contributions
tch+0.014
 Holo+0.013
 ruth+0.013
Clark+0.013
 Doyle+0.012
Neg logit contributions
=#-0.014
 Neon-0.013
 裏�-0.013
nect-0.013
orage-0.013
Token original
Token ID2656
Feature activation+1.68e-01

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.004
 Doyle+0.004
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.004
nect-0.004
orage-0.004
Token
Token ID198
Feature activation+1.38e-04

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
TokenFaith
Token ID45536
Feature activation-8.12e-02

Pos logit contributions
tch+0.000
 Holo+0.000
 ruth+0.000
Clark+0.000
 Doyle+0.000
Neg logit contributions
=#-0.000
 Neon-0.000
 裏�-0.000
nect-0.000
orage-0.000
Token.
Token ID13
Feature activation+9.32e-03

Pos logit contributions
=#+0.050
 Neon+0.049
 裏�+0.047
nect+0.047
orage+0.047
Neg logit contributions
tch-0.057
 Holo-0.056
 ruth-0.054
Clark-0.053
 Doyle-0.052
Token Hope
Token ID13408
Feature activation+6.83e-02

Pos logit contributions
tch+0.006
 Holo+0.006
 ruth+0.006
Clark+0.006
 Doyle+0.006
Neg logit contributions
=#-0.006
 Neon-0.006
 裏�-0.006
nect-0.006
orage-0.005
Token.
Token ID13
Feature activation+3.97e-04

Pos logit contributions
tch+0.046
 Holo+0.045
 ruth+0.044
Clark+0.043
 Doyle+0.042
Neg logit contributions
=#-0.043
 Neon-0.043
 裏�-0.041
nect-0.041
orage-0.041
Token Charity
Token ID37575
Feature activation+4.71e-01

Pos logit contributions
tch+0.000
 Holo+0.000
 ruth+0.000
Clark+0.000
 Doyle+0.000
Neg logit contributions
=#-0.000
 Neon-0.000
 裏�-0.000
nect-0.000
orage-0.000
Token.
Token ID13
Feature activation+2.08e-03

Pos logit contributions
tch+0.315
 Holo+0.310
 ruth+0.301
Clark+0.298
 Doyle+0.291
Neg logit contributions
=#-0.302
 Neon-0.300
 裏�-0.288
nect-0.285
orage-0.283
Token Pr
Token ID1736
Feature activation-4.88e-02

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Tokenudence
Token ID42581
Feature activation+2.61e-01

Pos logit contributions
=#+0.042
 Neon+0.042
 裏�+0.041
nect+0.040
orage+0.040
Neg logit contributions
tch-0.022
 Holo-0.021
 ruth-0.020
Clark-0.020
 Doyle-0.019
Token.
Token ID13
Feature activation+3.48e-03

Pos logit contributions
tch+0.174
 Holo+0.171
 ruth+0.166
Clark+0.164
 Doyle+0.161
Neg logit contributions
=#-0.168
 Neon-0.167
 裏�-0.160
nect-0.158
orage-0.157
Token Justice
Token ID4796
Feature activation+1.09e-01

Pos logit contributions
tch+0.002
 Holo+0.002
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
orage-0.002
Token National
Token ID2351
Feature activation-2.32e-01

Pos logit contributions
tch+0.045
 Holo+0.044
 ruth+0.043
Clark+0.042
 Doyle+0.041
Neg logit contributions
=#-0.045
 Neon-0.045
 裏�-0.043
nect-0.043
orage-0.042
Token Association
Token ID5396
Feature activation-9.29e-02

Pos logit contributions
=#+0.157
 Neon+0.156
 裏�+0.150
nect+0.148
orage+0.147
Neg logit contributions
tch-0.148
 Holo-0.145
 ruth-0.141
Clark-0.139
 Doyle-0.135
Token of
Token ID286
Feature activation+5.90e-03

Pos logit contributions
=#+0.058
 Neon+0.058
 裏�+0.055
nect+0.055
icultural+0.054
Neg logit contributions
tch-0.063
 Holo-0.062
 ruth-0.061
Clark-0.060
 Doyle-0.058
Token Basketball
Token ID25911
Feature activation+1.67e-01

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
 Neon-0.005
=#-0.005
 裏�-0.004
nect-0.004
icultural-0.004
Token Co
Token ID1766
Feature activation-3.91e-02

Pos logit contributions
tch+0.102
 Holo+0.100
 ruth+0.097
Clark+0.096
 Doyle+0.093
Neg logit contributions
=#-0.116
 Neon-0.116
 裏�-0.111
nect-0.110
icultural-0.109
Tokenaches
Token ID3694
Feature activation+4.67e-01

Pos logit contributions
=#+0.044
 Neon+0.044
icultural+0.043
nect+0.043
orage+0.043
Neg logit contributions
tch-0.006
 Holo-0.006
 ruth-0.006
Clark-0.005
 Doyle-0.004
Token,[
Token ID17414
Feature activation-6.63e-02

Pos logit contributions
tch+0.317
 Holo+0.312
 ruth+0.303
Clark+0.300
 Doyle+0.293
Neg logit contributions
=#-0.295
 Neon-0.292
 裏�-0.280
nect-0.277
icultural-0.275
Token122
Token ID18376
Feature activation+1.44e-01

Pos logit contributions
=#+0.054
 Neon+0.054
 裏�+0.052
nect+0.052
orage+0.051
Neg logit contributions
tch-0.033
 Holo-0.032
 ruth-0.031
Clark-0.030
 Doyle-0.029
Token]
Token ID60
Feature activation+6.59e-02

Pos logit contributions
tch+0.086
 Holo+0.084
 ruth+0.082
Clark+0.081
 Doyle+0.078
Neg logit contributions
=#-0.102
 Neon-0.102
 裏�-0.098
nect-0.097
orage-0.096
Token and
Token ID290
Feature activation+5.02e-04

Pos logit contributions
tch+0.044
 Holo+0.043
 ruth+0.042
Clark+0.042
 Doyle+0.040
Neg logit contributions
=#-0.042
 Neon-0.042
 裏�-0.040
nect-0.040
icultural-0.040
Token Associated
Token ID10575
Feature activation-5.91e-02

Pos logit contributions
tch+0.000
 Holo+0.000
 ruth+0.000
Clark+0.000
 Doyle+0.000
Neg logit contributions
=#-0.000
 Neon-0.000
 裏�-0.000
nect-0.000
icultural-0.000
Token the
Token ID262
Feature activation+2.69e-02

Pos logit contributions
tch+0.020
 Holo+0.020
 ruth+0.019
Clark+0.019
 Doyle+0.019
Neg logit contributions
=#-0.022
 Neon-0.022
 裏�-0.021
nect-0.021
orage-0.020
Token
Token ID564
Feature activation+2.90e-02

Pos logit contributions
tch+0.018
 Holo+0.018
 ruth+0.017
Clark+0.017
 Doyle+0.017
Neg logit contributions
=#-0.017
 Neon-0.017
 裏�-0.016
nect-0.016
orage-0.016
Token
Token ID246
Feature activation+4.31e-02

Pos logit contributions
tch+0.019
 Holo+0.019
 ruth+0.018
Clark+0.018
 Doyle+0.017
Neg logit contributions
=#-0.019
 Neon-0.019
 裏�-0.018
nect-0.018
orage-0.018
Tokenp
Token ID79
Feature activation+4.45e-03

Pos logit contributions
tch+0.030
 Holo+0.029
 ruth+0.029
Clark+0.028
 Doyle+0.028
Neg logit contributions
=#-0.026
 Neon-0.026
 裏�-0.025
nect-0.025
orage-0.025
Tokeniv
Token ID452
Feature activation-7.21e-02

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.003
nect-0.003
orage-0.003
Tokenotal
Token ID4997
Feature activation+4.67e-01

Pos logit contributions
=#+0.069
 Neon+0.068
 裏�+0.067
nect+0.066
orage+0.066
Neg logit contributions
tch-0.026
 Holo-0.025
 ruth-0.024
Clark-0.023
 Doyle-0.022
Token
Token ID447
Feature activation+2.49e-02

Pos logit contributions
tch+0.314
 Holo+0.309
 ruth+0.300
Clark+0.297
 Doyle+0.290
Neg logit contributions
=#-0.297
 Neon-0.295
 裏�-0.283
nect-0.280
orage-0.278
Token
Token ID247
Feature activation+1.64e-02

Pos logit contributions
tch+0.017
 Holo+0.017
 ruth+0.016
Clark+0.016
 Doyle+0.016
Neg logit contributions
=#-0.016
 Neon-0.015
 裏�-0.015
nect-0.015
orage-0.015
Token Supreme
Token ID5617
Feature activation-6.92e-02

Pos logit contributions
tch+0.011
 Holo+0.011
 ruth+0.011
Clark+0.010
 Doyle+0.010
Neg logit contributions
=#-0.010
 Neon-0.010
 裏�-0.010
nect-0.010
orage-0.010
Token Court
Token ID3078
Feature activation-1.54e-02

Pos logit contributions
=#+0.040
 Neon+0.040
 裏�+0.038
nect+0.038
icultural+0.037
Neg logit contributions
tch-0.050
 Holo-0.049
 ruth-0.048
Clark-0.048
 Doyle-0.047
Token justices
Token ID27818
Feature activation+7.60e-02

Pos logit contributions
=#+0.010
 Neon+0.010
 裏�+0.010
nect+0.010
orage+0.010
Neg logit contributions
tch-0.010
 Holo-0.010
 ruth-0.010
Clark-0.009
 Doyle-0.009
Token theories
Token ID10946
Feature activation+5.29e-02

Pos logit contributions
=#+0.001
 Neon+0.001
 裏�+0.001
nect+0.001
orage+0.001
Neg logit contributions
tch-0.001
 Holo-0.001
 ruth-0.001
Clark-0.001
 Doyle-0.001
Token of
Token ID286
Feature activation+2.87e-02

Pos logit contributions
tch+0.035
 Holo+0.035
 ruth+0.033
Clark+0.033
 Doyle+0.032
Neg logit contributions
=#-0.034
 Neon-0.034
 裏�-0.033
nect-0.032
orage-0.032
Token rationality
Token ID45627
Feature activation+2.92e-01

Pos logit contributions
tch+0.012
 Holo+0.011
Clark+0.010
 ruth+0.010
 Doyle+0.010
Neg logit contributions
=#-0.026
 Neon-0.026
 裏�-0.025
nect-0.025
icultural-0.025
Token since
Token ID1201
Feature activation-1.23e-01

Pos logit contributions
tch+0.193
 Holo+0.190
 ruth+0.184
Clark+0.182
 Doyle+0.178
Neg logit contributions
=#-0.190
 Neon-0.188
 裏�-0.181
nect-0.178
orage-0.177
Token Des
Token ID2935
Feature activation+1.92e-01

Pos logit contributions
=#+0.080
 Neon+0.080
 裏�+0.076
nect+0.075
orage+0.075
Neg logit contributions
tch-0.081
 Holo-0.079
 ruth-0.077
Clark-0.076
 Doyle-0.074
Tokencart
Token ID26674
Feature activation+4.62e-01

Pos logit contributions
tch+0.029
 Holo+0.027
 ruth+0.023
Clark+0.022
 Doyle+0.019
Neg logit contributions
=#-0.223
 Neon-0.221
 裏�-0.216
nect-0.216
icultural-0.215
Tokenes
Token ID274
Feature activation+7.82e-02

Pos logit contributions
tch+0.282
 Holo+0.277
 ruth+0.268
Clark+0.265
 Doyle+0.258
Neg logit contributions
=#-0.323
 Neon-0.321
 裏�-0.309
nect-0.306
orage-0.304
Token.
Token ID13
Feature activation+1.64e-02

Pos logit contributions
tch+0.053
 Holo+0.052
 ruth+0.050
Clark+0.050
 Doyle+0.048
Neg logit contributions
=#-0.050
 Neon-0.049
 裏�-0.047
nect-0.047
orage-0.047
Token The
Token ID383
Feature activation-1.81e-02

Pos logit contributions
tch+0.011
 Holo+0.010
 ruth+0.010
Clark+0.010
 Doyle+0.010
Neg logit contributions
=#-0.011
 Neon-0.011
 裏�-0.010
nect-0.010
orage-0.010
Token purpose
Token ID4007
Feature activation-2.77e-02

Pos logit contributions
=#+0.012
 Neon+0.011
 裏�+0.011
nect+0.011
orage+0.011
Neg logit contributions
tch-0.012
 Holo-0.012
 ruth-0.012
Clark-0.012
 Doyle-0.011
Token of
Token ID286
Feature activation+2.24e-02

Pos logit contributions
=#+0.018
 Neon+0.018
 裏�+0.017
nect+0.017
orage+0.017
Neg logit contributions
tch-0.018
 Holo-0.018
 ruth-0.017
Clark-0.017
 Doyle-0.017
TokenMON
Token ID27857
Feature activation-7.51e-02

Pos logit contributions
tch+0.078
 Holo+0.077
 ruth+0.075
Clark+0.074
 Doyle+0.073
Neg logit contributions
=#-0.065
 Neon-0.064
 裏�-0.062
nect-0.061
orage-0.061
TokenTON
Token ID11357
Feature activation-1.20e-02

Pos logit contributions
=#+0.041
 Neon+0.041
 裏�+0.039
nect+0.038
orage+0.038
Neg logit contributions
tch-0.056
 Holo-0.056
 ruth-0.055
Clark-0.054
 Doyle-0.053
Token
Token ID784
Feature activation+1.16e-02

Pos logit contributions
=#+0.008
 Neon+0.008
 裏�+0.008
nect+0.007
icultural+0.007
Neg logit contributions
tch-0.008
 Holo-0.008
 ruth-0.007
Clark-0.007
 Doyle-0.007
Token Omar
Token ID24980
Feature activation-1.10e-02

Pos logit contributions
tch+0.008
 Holo+0.008
 ruth+0.007
Clark+0.007
 Doyle+0.007
Neg logit contributions
=#-0.007
 Neon-0.007
 裏�-0.007
nect-0.007
orage-0.007
Token Kh
Token ID5311
Feature activation-1.46e-02

Pos logit contributions
=#+0.007
 Neon+0.007
 裏�+0.006
nect+0.006
icultural+0.006
Neg logit contributions
tch-0.008
 Holo-0.008
 ruth-0.007
Clark-0.007
 Doyle-0.007
Tokenadr
Token ID41909
Feature activation+4.60e-01

Pos logit contributions
=#+0.017
 Neon+0.017
 裏�+0.017
nect+0.017
icultural+0.017
Neg logit contributions
tch-0.002
 Holo-0.002
 ruth-0.001
Clark-0.001
 Doyle-0.001
Token has
Token ID468
Feature activation+3.80e-02

Pos logit contributions
tch+0.308
 Holo+0.302
 ruth+0.294
Clark+0.291
 Doyle+0.284
Neg logit contributions
=#-0.296
 Neon-0.293
 裏�-0.282
nect-0.278
orage-0.277
Token agreed
Token ID4987
Feature activation-2.40e-02

Pos logit contributions
tch+0.025
 Holo+0.024
 ruth+0.024
Clark+0.024
 Doyle+0.023
Neg logit contributions
=#-0.025
 Neon-0.025
 裏�-0.024
nect-0.023
icultural-0.023
Token to
Token ID284
Feature activation+1.81e-02

Pos logit contributions
=#+0.016
 Neon+0.015
 裏�+0.015
nect+0.015
orage+0.015
Neg logit contributions
tch-0.016
 Holo-0.016
 ruth-0.015
Clark-0.015
 Doyle-0.015
Token remain
Token ID3520
Feature activation-1.80e-01

Pos logit contributions
tch+0.012
 Holo+0.011
 ruth+0.011
Clark+0.011
 Doyle+0.011
Neg logit contributions
 Neon-0.012
=#-0.012
nect-0.011
 裏�-0.011
orage-0.011
Token in
Token ID287
Feature activation+3.25e-02

Pos logit contributions
=#+0.117
 Neon+0.117
 裏�+0.112
nect+0.111
icultural+0.110
Neg logit contributions
tch-0.118
 Holo-0.115
 ruth-0.112
Clark-0.111
 Doyle-0.108
Token by
Token ID416
Feature activation+6.47e-03

Pos logit contributions
=#+0.090
 Neon+0.090
 裏�+0.086
nect+0.085
orage+0.084
Neg logit contributions
tch-0.095
 Holo-0.094
 ruth-0.090
Clark-0.090
 Doyle-0.087
Token New
Token ID968
Feature activation-1.47e-02

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.004
 Doyle+0.004
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.004
nect-0.004
orage-0.004
Token Hampshire
Token ID13910
Feature activation+1.01e-01

Pos logit contributions
=#+0.012
 Neon+0.012
 裏�+0.011
nect+0.011
orage+0.011
Neg logit contributions
tch-0.008
 Holo-0.007
 ruth-0.007
Clark-0.007
 Doyle-0.007
Token Senator
Token ID8962
Feature activation-4.14e-02

Pos logit contributions
tch+0.074
 Holo+0.072
 ruth+0.070
Clark+0.070
 Doyle+0.068
Neg logit contributions
=#-0.059
 Neon-0.059
 裏�-0.056
nect-0.055
orage-0.055
Token Daniel
Token ID7806
Feature activation+3.54e-02

Pos logit contributions
=#+0.025
 Neon+0.025
 裏�+0.024
nect+0.024
orage+0.023
Neg logit contributions
tch-0.029
 Holo-0.029
 ruth-0.028
Clark-0.027
 Doyle-0.027
Token Clark
Token ID11264
Feature activation+4.59e-01

Pos logit contributions
tch+0.024
 Holo+0.023
 ruth+0.023
Clark+0.023
 Doyle+0.022
Neg logit contributions
=#-0.023
 Neon-0.022
 裏�-0.022
nect-0.021
orage-0.021
Token on
Token ID319
Feature activation-1.78e-02

Pos logit contributions
tch+0.301
 Holo+0.296
 ruth+0.287
Clark+0.284
 Doyle+0.278
Neg logit contributions
=#-0.301
 Neon-0.298
 裏�-0.286
nect-0.283
icultural-0.281
Token March
Token ID2805
Feature activation-2.44e-02

Pos logit contributions
=#+0.010
 Neon+0.010
 裏�+0.010
nect+0.010
orage+0.009
Neg logit contributions
tch-0.013
 Holo-0.013
 ruth-0.013
Clark-0.012
 Doyle-0.012
Token 22
Token ID2534
Feature activation+5.49e-02

Pos logit contributions
=#+0.016
 Neon+0.016
 裏�+0.015
nect+0.015
orage+0.015
Neg logit contributions
tch-0.016
 Holo-0.016
 ruth-0.015
Clark-0.015
 Doyle-0.015
Token;
Token ID26
Feature activation+2.41e-02

Pos logit contributions
tch+0.037
 Holo+0.036
 ruth+0.035
Clark+0.035
 Doyle+0.034
Neg logit contributions
=#-0.035
 Neon-0.035
 裏�-0.034
nect-0.033
orage-0.033
Token Poll
Token ID12868
Feature activation-4.65e-02

Pos logit contributions
tch+0.016
 Holo+0.016
 ruth+0.015
Clark+0.015
 Doyle+0.015
Neg logit contributions
=#-0.016
 Neon-0.016
 裏�-0.015
nect-0.015
orage-0.015
Token to
Token ID284
Feature activation-2.49e-02

Pos logit contributions
tch+0.047
 Holo+0.046
 ruth+0.045
Clark+0.044
 Doyle+0.043
Neg logit contributions
=#-0.046
 Neon-0.046
 裏�-0.044
nect-0.043
orage-0.043
Token see
Token ID766
Feature activation-6.05e-02

Pos logit contributions
 Neon+0.016
=#+0.016
 裏�+0.015
nect+0.015
orage+0.015
Neg logit contributions
tch-0.016
 Holo-0.016
Clark-0.015
 ruth-0.015
 Doyle-0.015
Token the
Token ID262
Feature activation-2.70e-02

Pos logit contributions
=#+0.038
 Neon+0.038
 裏�+0.036
nect+0.036
orage+0.036
Neg logit contributions
tch-0.041
 Holo-0.040
 ruth-0.039
Clark-0.039
 Doyle-0.038
Token From
Token ID3574
Feature activation+4.57e-02

Pos logit contributions
=#+0.018
 Neon+0.018
 裏�+0.017
nect+0.017
orage+0.017
Neg logit contributions
tch-0.017
 Holo-0.017
 ruth-0.016
Clark-0.016
 Doyle-0.016
Token Dusk
Token ID42553
Feature activation+1.11e-01

Pos logit contributions
tch+0.029
 Holo+0.028
 ruth+0.028
Clark+0.028
 Doyle+0.027
Neg logit contributions
 Neon-0.031
=#-0.031
 裏�-0.029
orage-0.029
nect-0.029
Token Till
Token ID17888
Feature activation-4.91e-01

Pos logit contributions
tch+0.129
 Holo+0.127
 ruth+0.125
Clark+0.123
 Doyle+0.122
Neg logit contributions
=#-0.017
 Neon-0.015
nect-0.014
icultural-0.014
 裏�-0.013
Token Dawn
Token ID12258
Feature activation-3.25e-02

Pos logit contributions
=#+0.306
 Neon+0.304
 裏�+0.290
nect+0.287
orage+0.285
Neg logit contributions
tch-0.339
 Holo-0.332
 ruth-0.324
Clark-0.321
 Doyle-0.313
Token maze
Token ID31237
Feature activation+1.01e-01

Pos logit contributions
=#+0.022
 Neon+0.022
 裏�+0.021
nect+0.021
orage+0.021
Neg logit contributions
tch-0.021
 Holo-0.020
 ruth-0.020
Clark-0.019
 Doyle-0.019
Token coming
Token ID2406
Feature activation+5.31e-02

Pos logit contributions
tch+0.066
 Holo+0.065
 ruth+0.063
Clark+0.063
 Doyle+0.061
Neg logit contributions
 Neon-0.066
=#-0.066
 裏�-0.063
nect-0.062
icultural-0.062
Token this
Token ID428
Feature activation-3.98e-02

Pos logit contributions
tch+0.035
 Holo+0.035
 ruth+0.034
Clark+0.033
 Doyle+0.033
Neg logit contributions
=#-0.034
 Neon-0.034
 裏�-0.033
nect-0.032
orage-0.032
Token year
Token ID614
Feature activation+3.54e-02

Pos logit contributions
 Neon+0.025
=#+0.025
 裏�+0.024
nect+0.023
orage+0.023
Neg logit contributions
tch-0.027
 Holo-0.027
 ruth-0.026
Clark-0.026
 Doyle-0.025
Token47
Token ID2857
Feature activation+1.35e-01

Pos logit contributions
=#+0.009
 Neon+0.009
nect+0.008
 裏�+0.008
orage+0.008
Neg logit contributions
tch-0.005
 Holo-0.005
 ruth-0.005
Clark-0.005
 Doyle-0.005
Token]
Token ID60
Feature activation+7.24e-02

Pos logit contributions
tch+0.081
 Holo+0.079
 ruth+0.077
Clark+0.076
 Doyle+0.074
Neg logit contributions
=#-0.097
 Neon-0.096
 裏�-0.092
nect-0.091
orage-0.091
Token <
Token ID1279
Feature activation+1.78e-01

Pos logit contributions
tch+0.036
 Holo+0.035
 ruth+0.033
Clark+0.033
 Doyle+0.032
Neg logit contributions
=#-0.059
 Neon-0.059
 裏�-0.057
nect-0.057
orage-0.056
Tokenen
Token ID268
Feature activation+2.64e-01

Pos logit contributions
tch+0.123
 Holo+0.120
 ruth+0.117
Clark+0.116
 Doyle+0.113
Neg logit contributions
=#-0.111
 Neon-0.110
 裏�-0.106
nect-0.104
orage-0.104
Token7
Token ID22
Feature activation-7.08e-02

Pos logit contributions
tch+0.180
 Holo+0.177
 ruth+0.172
Clark+0.171
 Doyle+0.167
Neg logit contributions
=#-0.166
 Neon-0.164
 裏�-0.157
nect-0.156
orage-0.155
Token_>
Token ID49029
Feature activation-4.71e-01

Pos logit contributions
=#+0.046
 Neon+0.045
 裏�+0.043
nect+0.043
orage+0.043
Neg logit contributions
tch-0.047
 Holo-0.046
 ruth-0.045
Clark-0.045
 Doyle-0.044
Token lol
Token ID19462
Feature activation-4.29e-02

Pos logit contributions
=#+0.304
 Neon+0.302
 裏�+0.289
nect+0.286
orage+0.284
Neg logit contributions
tch-0.313
 Holo-0.307
 ruth-0.298
Clark-0.296
 Doyle-0.288
Token Mang
Token ID27609
Feature activation+1.55e-02

Pos logit contributions
=#+0.033
 Neon+0.033
 裏�+0.032
nect+0.031
orage+0.031
Neg logit contributions
tch-0.023
 Holo-0.023
 ruth-0.022
Clark-0.022
 Doyle-0.021
Token0
Token ID15
Feature activation-7.01e-02

Pos logit contributions
tch+0.010
 Holo+0.010
 ruth+0.010
Clark+0.010
 Doyle+0.010
Neg logit contributions
=#-0.010
 Neon-0.010
 裏�-0.009
nect-0.009
icultural-0.009
Tokenking
Token ID3364
Feature activation-8.73e-02

Pos logit contributions
=#+0.044
 Neon+0.043
 裏�+0.041
nect+0.041
icultural+0.041
Neg logit contributions
tch-0.048
 Holo-0.047
 ruth-0.046
Clark-0.045
 Doyle-0.044
Token [
Token ID685
Feature activation-4.60e-02

Pos logit contributions
=#+0.055
 Neon+0.054
 裏�+0.052
nect+0.051
orage+0.051
Neg logit contributions
tch-0.060
 Holo-0.059
 ruth-0.057
Clark-0.056
 Doyle-0.055
Token serving
Token ID7351
Feature activation+2.88e-02

Pos logit contributions
tch+0.071
 Holo+0.070
 ruth+0.068
Clark+0.067
 Doyle+0.066
Neg logit contributions
=#-0.069
 Neon-0.068
 裏�-0.066
nect-0.065
orage-0.064
Token a
Token ID257
Feature activation-1.21e-02

Pos logit contributions
tch+0.019
 Holo+0.019
Clark+0.018
 ruth+0.018
 Doyle+0.017
Neg logit contributions
=#-0.019
 Neon-0.019
 裏�-0.018
icultural-0.017
nect-0.017
Token
Token ID564
Feature activation-6.30e-03

Pos logit contributions
=#+0.008
 Neon+0.008
 裏�+0.008
nect+0.007
icultural+0.007
Neg logit contributions
tch-0.008
 Holo-0.008
 ruth-0.008
Clark-0.008
 Doyle-0.007
Token
Token ID250
Feature activation-1.02e-03

Pos logit contributions
=#+0.004
 Neon+0.004
 裏�+0.004
nect+0.004
orage+0.004
Neg logit contributions
tch-0.004
 Holo-0.004
 ruth-0.004
Clark-0.004
 Doyle-0.004
Tokenmult
Token ID16680
Feature activation-2.01e-02

Pos logit contributions
=#+0.001
 Neon+0.001
 裏�+0.001
nect+0.001
orage+0.001
Neg logit contributions
tch-0.001
 Holo-0.001
 ruth-0.001
Clark-0.001
 Doyle-0.001
Tokenicultural
Token ID26823
Feature activation-4.62e-01

Pos logit contributions
=#+0.004
 Neon+0.004
 裏�+0.003
nect+0.003
orage+0.003
Neg logit contributions
tch-0.022
 Holo-0.022
 ruth-0.022
Clark-0.022
 Doyle-0.021
Token community
Token ID2055
Feature activation-1.40e-01

Pos logit contributions
=#+0.297
 Neon+0.294
 裏�+0.283
nect+0.279
icultural+0.278
Neg logit contributions
tch-0.308
 Holo-0.303
 ruth-0.293
Clark-0.291
 Doyle-0.284
Token
Token ID447
Feature activation-2.24e-03

Pos logit contributions
=#+0.090
 Neon+0.089
 裏�+0.086
nect+0.085
orage+0.084
Neg logit contributions
tch-0.093
 Holo-0.092
 ruth-0.089
Clark-0.088
 Doyle-0.086
Token
Token ID251
Feature activation-2.63e-03

Pos logit contributions
=#+0.001
 Neon+0.001
 裏�+0.001
nect+0.001
orage+0.001
Neg logit contributions
tch-0.001
 Holo-0.001
 ruth-0.001
Clark-0.001
 Doyle-0.001
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
=#+0.002
 Neon+0.002
 裏�+0.002
nect+0.002
orage+0.002
Neg logit contributions
tch-0.002
 Holo-0.002
 ruth-0.002
Clark-0.002
 Doyle-0.002
Token
Token ID198
Feature activation-7.79e-03

Pos logit contributions
=#+0.007
 Neon+0.007
 裏�+0.006
nect+0.006
orage+0.006
Neg logit contributions
tch-0.007
 Holo-0.007
 ruth-0.006
Clark-0.006
 Doyle-0.006
TokenFI
Token ID11674
Feature activation-1.12e-01

Pos logit contributions
tch+0.147
 Holo+0.145
 ruth+0.142
Clark+0.141
 Doyle+0.138
Neg logit contributions
=#-0.085
 Neon-0.084
 裏�-0.079
nect-0.078
orage-0.078
Token is
Token ID318
Feature activation-5.27e-03

Pos logit contributions
=#+0.072
 Neon+0.072
 裏�+0.068
nect+0.068
orage+0.067
Neg logit contributions
tch-0.075
 Holo-0.074
 ruth-0.071
Clark-0.071
 Doyle-0.069
Token affected
Token ID5676
Feature activation-6.75e-02

Pos logit contributions
=#+0.003
 Neon+0.003
 裏�+0.003
nect+0.003
orage+0.003
Neg logit contributions
tch-0.003
 Holo-0.003
 ruth-0.003
Clark-0.003
 Doyle-0.003
Token.
Token ID13
Feature activation+1.80e-03

Pos logit contributions
=#+0.044
 Neon+0.044
 裏�+0.042
nect+0.042
orage+0.042
Neg logit contributions
tch-0.044
 Holo-0.043
 ruth-0.042
Clark-0.042
 Doyle-0.041
Token Recon
Token ID23419
Feature activation-2.34e-01

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Tokennect
Token ID1606
Feature activation-4.61e-01

Pos logit contributions
=#+0.024
 Neon+0.023
nect+0.019
 裏�+0.017
orage+0.015
Neg logit contributions
tch-0.281
 Holo-0.278
 ruth-0.274
Clark-0.273
 Doyle-0.269
Token the
Token ID262
Feature activation+5.03e-03

Pos logit contributions
=#+0.307
 Neon+0.304
 裏�+0.292
nect+0.289
orage+0.287
Neg logit contributions
tch-0.298
 Holo-0.293
 ruth-0.284
Clark-0.281
 Doyle-0.274
Token radio
Token ID5243
Feature activation+1.88e-01

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.003
nect-0.003
orage-0.003
Token to
Token ID284
Feature activation+1.80e-03

Pos logit contributions
tch+0.122
 Holo+0.120
 ruth+0.117
Clark+0.115
 Doyle+0.112
Neg logit contributions
=#-0.125
 Neon-0.124
 裏�-0.120
nect-0.117
orage-0.117
Token the
Token ID262
Feature activation-5.20e-04

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Token power
Token ID1176
Feature activation+7.39e-02

Pos logit contributions
=#+0.000
 Neon+0.000
 裏�+0.000
nect+0.000
orage+0.000
Neg logit contributions
tch-0.000
 Holo-0.000
 ruth-0.000
Clark-0.000
 Doyle-0.000
Token on
Token ID319
Feature activation-2.52e-02

Pos logit contributions
=#+0.004
 Neon+0.004
 裏�+0.004
nect+0.004
orage+0.004
Neg logit contributions
tch-0.004
 Holo-0.004
 ruth-0.004
Clark-0.004
 Doyle-0.004
Token the
Token ID262
Feature activation-5.38e-03

Pos logit contributions
=#+0.017
 Neon+0.017
 裏�+0.016
nect+0.016
icultural+0.016
Neg logit contributions
tch-0.016
 Holo-0.016
 ruth-0.016
Clark-0.015
 Doyle-0.015
Token edge
Token ID5743
Feature activation+2.10e-01

Pos logit contributions
 Neon+0.004
=#+0.004
 裏�+0.004
nect+0.003
icultural+0.003
Neg logit contributions
tch-0.003
 Holo-0.003
 ruth-0.003
Clark-0.003
 Doyle-0.003
Token of
Token ID286
Feature activation-8.72e-03

Pos logit contributions
tch+0.139
 Holo+0.136
 ruth+0.132
Clark+0.131
 Doyle+0.128
Neg logit contributions
=#-0.136
 Neon-0.135
 裏�-0.130
nect-0.128
orage-0.128
Token vit
Token ID9090
Feature activation-1.13e-01

Pos logit contributions
=#+0.006
 Neon+0.006
 裏�+0.005
nect+0.005
orage+0.005
Neg logit contributions
tch-0.006
 Holo-0.006
 ruth-0.006
Clark-0.005
 Doyle-0.005
Tokenicultural
Token ID26823
Feature activation-4.53e-01

Pos logit contributions
=#+0.039
 Neon+0.039
 裏�+0.036
nect+0.035
orage+0.035
Neg logit contributions
tch-0.108
 Holo-0.107
 ruth-0.105
Clark-0.104
 Doyle-0.102
Token viability
Token ID32141
Feature activation-6.81e-02

Pos logit contributions
=#+0.284
 Neon+0.282
 裏�+0.270
nect+0.266
orage+0.266
Neg logit contributions
tch-0.310
 Holo-0.305
 ruth-0.297
Clark-0.294
 Doyle-0.287
Token,
Token ID11
Feature activation-6.75e-03

Pos logit contributions
=#+0.045
 Neon+0.045
 裏�+0.043
nect+0.043
orage+0.042
Neg logit contributions
tch-0.044
 Holo-0.043
 ruth-0.042
Clark-0.042
 Doyle-0.041
Token the
Token ID262
Feature activation-5.75e-03

Pos logit contributions
=#+0.004
 Neon+0.004
 裏�+0.004
nect+0.004
orage+0.004
Neg logit contributions
tch-0.005
 Holo-0.005
 ruth-0.005
Clark-0.004
 Doyle-0.004
Token math
Token ID10688
Feature activation-3.68e-02

Pos logit contributions
=#+0.004
 Neon+0.004
 裏�+0.004
nect+0.004
icultural+0.004
Neg logit contributions
tch-0.004
 Holo-0.004
 ruth-0.004
Clark-0.003
 Doyle-0.003
Token becomes
Token ID4329
Feature activation+5.05e-02

Pos logit contributions
=#+0.024
 Neon+0.024
 裏�+0.023
nect+0.023
orage+0.022
Neg logit contributions
tch-0.024
 Holo-0.024
 ruth-0.023
Clark-0.023
 Doyle-0.022
Token of
Token ID286
Feature activation-9.14e-03

Pos logit contributions
=#+0.058
 Neon+0.058
 裏�+0.056
nect+0.055
orage+0.055
Neg logit contributions
tch-0.059
 Holo-0.058
 ruth-0.056
Clark-0.055
 Doyle-0.054
Token N
Token ID399
Feature activation-9.64e-02

Pos logit contributions
=#+0.006
 Neon+0.006
 裏�+0.005
nect+0.005
orage+0.005
Neg logit contributions
tch-0.006
 Holo-0.006
 ruth-0.006
Clark-0.006
 Doyle-0.006
TokenPH
Token ID11909
Feature activation+1.48e-01

Pos logit contributions
=#+0.063
 Neon+0.063
 裏�+0.061
nect+0.060
orage+0.060
Neg logit contributions
tch-0.063
 Holo-0.062
 ruth-0.060
Clark-0.059
 Doyle-0.058
Token version
Token ID2196
Feature activation+4.60e-02

Pos logit contributions
tch+0.099
 Holo+0.098
 ruth+0.095
Clark+0.094
 Doyle+0.092
Neg logit contributions
=#-0.095
 Neon-0.094
 裏�-0.090
nect-0.089
orage-0.089
Token of
Token ID286
Feature activation-1.26e-02

Pos logit contributions
tch+0.030
 Holo+0.030
 ruth+0.029
Clark+0.029
 Doyle+0.028
Neg logit contributions
=#-0.030
 Neon-0.030
 裏�-0.028
nect-0.028
orage-0.028
Token Ol
Token ID6544
Feature activation-4.49e-01

Pos logit contributions
=#+0.008
 Neon+0.008
 裏�+0.008
nect+0.008
orage+0.008
Neg logit contributions
tch-0.008
 Holo-0.008
 ruth-0.008
Clark-0.008
 Doyle-0.008
Tokenaf
Token ID1878
Feature activation-7.28e-02

Pos logit contributions
=#+0.268
 Neon+0.267
nect+0.253
 裏�+0.253
orage+0.250
Neg logit contributions
tch-0.317
 Holo-0.314
 ruth-0.304
Clark-0.301
 Doyle-0.294
Token.
Token ID13
Feature activation-1.49e-02

Pos logit contributions
=#+0.045
 Neon+0.045
 裏�+0.043
nect+0.043
orage+0.042
Neg logit contributions
tch-0.050
 Holo-0.049
 ruth-0.048
Clark-0.047
 Doyle-0.046
Token Plus
Token ID8227
Feature activation+3.42e-01

Pos logit contributions
=#+0.010
 Neon+0.010
 裏�+0.009
nect+0.009
orage+0.009
Neg logit contributions
tch-0.010
 Holo-0.010
 ruth-0.009
Clark-0.009
 Doyle-0.009
Token,
Token ID11
Feature activation-1.82e-02

Pos logit contributions
tch+0.225
 Holo+0.222
 ruth+0.215
Clark+0.213
 Doyle+0.208
Neg logit contributions
=#-0.223
 Neon-0.221
 裏�-0.213
nect-0.210
orage-0.209
Token I
Token ID314
Feature activation-6.62e-03

Pos logit contributions
=#+0.012
 Neon+0.012
 裏�+0.011
nect+0.011
orage+0.011
Neg logit contributions
tch-0.012
 Holo-0.012
 ruth-0.012
Clark-0.012
 Doyle-0.011
Token rose
Token ID8278
Feature activation+1.57e-01

Pos logit contributions
tch+0.028
 Holo+0.028
Clark+0.027
 ruth+0.027
 Doyle+0.026
Neg logit contributions
=#-0.028
 Neon-0.027
 裏�-0.026
nect-0.026
icultural-0.026
Token on
Token ID319
Feature activation-2.54e-02

Pos logit contributions
tch+0.102
 Holo+0.101
 ruth+0.097
Clark+0.097
 Doyle+0.094
Neg logit contributions
=#-0.103
 Neon-0.103
 裏�-0.099
nect-0.097
icultural-0.097
Token hate
Token ID5465
Feature activation+1.55e-01

Pos logit contributions
=#+0.017
 Neon+0.017
 裏�+0.016
nect+0.016
icultural+0.016
Neg logit contributions
tch-0.016
 Holo-0.016
 ruth-0.016
Clark-0.016
 Doyle-0.015
Token and
Token ID290
Feature activation-7.33e-03

Pos logit contributions
tch+0.103
 Holo+0.101
 ruth+0.098
Clark+0.097
 Doyle+0.094
Neg logit contributions
=#-0.101
 Neon-0.100
 裏�-0.096
nect-0.095
icultural-0.094
Token master
Token ID4958
Feature activation+5.26e-02

Pos logit contributions
=#+0.005
 Neon+0.005
 裏�+0.005
nect+0.004
icultural+0.004
Neg logit contributions
tch-0.005
 Holo-0.005
Clark-0.005
 ruth-0.005
 Doyle-0.004
Tokenfully
Token ID2759
Feature activation-4.46e-01

Pos logit contributions
tch+0.042
 Holo+0.041
 ruth+0.040
Clark+0.040
 Doyle+0.039
Neg logit contributions
=#-0.027
 Neon-0.027
 裏�-0.026
nect-0.025
orage-0.025
Token directed
Token ID7924
Feature activation+9.02e-02

Pos logit contributions
=#+0.296
 Neon+0.294
 裏�+0.280
nect+0.279
icultural+0.277
Neg logit contributions
tch-0.287
 Holo-0.282
Clark-0.272
 ruth-0.270
 Doyle-0.264
Token the
Token ID262
Feature activation-7.43e-03

Pos logit contributions
tch+0.059
 Holo+0.058
 ruth+0.056
Clark+0.055
 Doyle+0.054
Neg logit contributions
=#-0.060
 Neon-0.059
 裏�-0.057
nect-0.056
icultural-0.056
Token hate
Token ID5465
Feature activation+1.59e-01

Pos logit contributions
=#+0.005
 Neon+0.005
 裏�+0.005
nect+0.005
orage+0.005
Neg logit contributions
tch-0.005
 Holo-0.005
 ruth-0.005
Clark-0.004
 Doyle-0.004
Token consuming
Token ID18587
Feature activation-8.00e-02

Pos logit contributions
tch+0.104
 Holo+0.103
 ruth+0.099
Clark+0.098
 Doyle+0.096
Neg logit contributions
=#-0.104
 Neon-0.103
 裏�-0.099
nect-0.098
orage-0.097
Token his
Token ID465
Feature activation-1.95e-02

Pos logit contributions
=#+0.052
 Neon+0.052
 裏�+0.050
nect+0.049
orage+0.049
Neg logit contributions
tch-0.053
 Holo-0.052
 ruth-0.050
Clark-0.050
 Doyle-0.049
Token it
Token ID340
Feature activation-1.33e-02

Pos logit contributions
=#+0.003
 Neon+0.003
 裏�+0.003
nect+0.003
orage+0.003
Neg logit contributions
tch-0.003
 Holo-0.003
 ruth-0.003
Clark-0.003
 Doyle-0.003
Token plans
Token ID3352
Feature activation+7.33e-03

Pos logit contributions
=#+0.009
 Neon+0.009
 裏�+0.008
nect+0.008
orage+0.008
Neg logit contributions
tch-0.009
 Holo-0.009
 ruth-0.008
Clark-0.008
 Doyle-0.008
Token to
Token ID284
Feature activation-1.20e-02

Pos logit contributions
tch+0.005
 Holo+0.005
 ruth+0.005
Clark+0.005
 Doyle+0.004
Neg logit contributions
=#-0.005
 Neon-0.005
 裏�-0.005
nect-0.005
orage-0.004
Token be
Token ID307
Feature activation+1.05e-02

Pos logit contributions
=#+0.008
 Neon+0.008
 裏�+0.007
nect+0.007
orage+0.007
Neg logit contributions
tch-0.008
 Holo-0.008
Clark-0.008
 ruth-0.008
 Doyle-0.007
Token "
Token ID366
Feature activation+1.01e-02

Pos logit contributions
tch+0.007
 Holo+0.007
 ruth+0.007
Clark+0.006
 Doyle+0.006
Neg logit contributions
=#-0.007
 Neon-0.007
 裏�-0.007
nect-0.007
orage-0.007
Tokenfully
Token ID2759
Feature activation-4.41e-01

Pos logit contributions
tch+0.007
 Holo+0.007
 ruth+0.007
Clark+0.007
 Doyle+0.007
Neg logit contributions
=#-0.006
 Neon-0.006
 裏�-0.006
nect-0.006
orage-0.006
Token engaged
Token ID7953
Feature activation+9.37e-04

Pos logit contributions
=#+0.264
 Neon+0.261
 裏�+0.251
nect+0.247
orage+0.246
Neg logit contributions
tch-0.314
 Holo-0.310
 ruth-0.302
Clark-0.298
 Doyle-0.292
Token throughout
Token ID3690
Feature activation-2.18e-02

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
icultural-0.001
Token the
Token ID262
Feature activation-8.06e-03

Pos logit contributions
=#+0.014
 Neon+0.014
 裏�+0.014
nect+0.014
orage+0.014
Neg logit contributions
tch-0.014
 Holo-0.014
 ruth-0.014
Clark-0.013
 Doyle-0.013
Token process
Token ID1429
Feature activation-2.02e-01

Pos logit contributions
 Neon+0.005
=#+0.005
 裏�+0.005
nect+0.005
orage+0.005
Neg logit contributions
tch-0.006
 Holo-0.006
 ruth-0.005
Clark-0.005
 Doyle-0.005
Token.
Token ID13
Feature activation+1.90e-03

Pos logit contributions
=#+0.130
 Neon+0.129
 裏�+0.124
nect+0.123
orage+0.122
Neg logit contributions
tch-0.134
 Holo-0.132
 ruth-0.127
Clark-0.126
 Doyle-0.123
Token boy
Token ID2933
Feature activation-6.26e-02

Pos logit contributions
=#+0.001
 Neon+0.001
 裏�+0.001
nect+0.001
orage+0.001
Neg logit contributions
tch-0.002
 Holo-0.001
 ruth-0.001
Clark-0.001
 Doyle-0.001
Tokenishly
Token ID29735
Feature activation+6.83e-02

Pos logit contributions
=#+0.051
 Neon+0.051
 裏�+0.049
nect+0.049
orage+0.049
Neg logit contributions
tch-0.031
 Holo-0.030
 ruth-0.029
Clark-0.029
 Doyle-0.028
Token and
Token ID290
Feature activation+4.35e-03

Pos logit contributions
tch+0.046
 Holo+0.045
 ruth+0.044
Clark+0.044
 Doyle+0.042
Neg logit contributions
=#-0.043
 Neon-0.043
 裏�-0.041
nect-0.041
icultural-0.040
Token r
Token ID374
Feature activation+2.35e-01

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.003
nect-0.003
orage-0.003
Tokenue
Token ID518
Feature activation-6.41e-02

Pos logit contributions
tch+0.158
 Holo+0.155
 ruth+0.151
Clark+0.149
 Doyle+0.146
Neg logit contributions
=#-0.150
 Neon-0.149
 裏�-0.142
nect-0.141
orage-0.140
Tokenfully
Token ID2759
Feature activation-4.41e-01

Pos logit contributions
=#+0.008
 Neon+0.008
 裏�+0.006
nect+0.006
orage+0.006
Neg logit contributions
tch-0.076
 Holo-0.075
 ruth-0.074
Clark-0.073
 Doyle-0.072
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
=#+0.277
 Neon+0.276
 裏�+0.263
nect+0.260
icultural+0.259
Neg logit contributions
tch-0.298
 Holo-0.294
 ruth-0.283
Clark-0.283
 Doyle-0.275
Token He
Token ID679
Feature activation+4.10e-02

Pos logit contributions
 Neon+0.008
=#+0.008
 裏�+0.007
nect+0.007
orage+0.007
Neg logit contributions
tch-0.008
 Holo-0.008
 ruth-0.007
Clark-0.007
 Doyle-0.007
Token is
Token ID318
Feature activation-1.39e-02

Pos logit contributions
tch+0.027
 Holo+0.027
Clark+0.026
 ruth+0.026
 Doyle+0.025
Neg logit contributions
=#-0.026
 Neon-0.026
 裏�-0.025
nect-0.025
icultural-0.024
Token relaxing
Token ID28175
Feature activation+1.18e-01

Pos logit contributions
 Neon+0.009
=#+0.009
 裏�+0.009
nect+0.009
icultural+0.008
Neg logit contributions
tch-0.009
 Holo-0.009
Clark-0.009
 ruth-0.009
 Doyle-0.008
Token,
Token ID11
Feature activation-6.75e-03

Pos logit contributions
tch+0.077
 Holo+0.075
 ruth+0.073
Clark+0.072
 Doyle+0.071
Neg logit contributions
=#-0.077
 Neon-0.077
 裏�-0.074
nect-0.073
orage-0.073
Token
Token ID198
Feature activation-7.86e-03

Pos logit contributions
=#+0.004
 Neon+0.004
 裏�+0.004
nect+0.004
orage+0.004
Neg logit contributions
tch-0.004
 Holo-0.004
 ruth-0.004
Clark-0.004
 Doyle-0.004
TokenA
Token ID32
Feature activation+1.94e-02

Pos logit contributions
=#+0.005
 Neon+0.005
 裏�+0.005
nect+0.005
orage+0.005
Neg logit contributions
tch-0.005
 Holo-0.005
 ruth-0.005
Clark-0.005
 Doyle-0.005
Token solo
Token ID12199
Feature activation-1.89e-02

Pos logit contributions
tch+0.011
 Holo+0.011
 ruth+0.011
Clark+0.010
 Doyle+0.010
Neg logit contributions
=#-0.014
 Neon-0.014
 裏�-0.014
nect-0.013
icultural-0.013
Token album
Token ID5062
Feature activation-5.80e-02

Pos logit contributions
=#+0.012
 Neon+0.012
 裏�+0.012
nect+0.011
orage+0.011
Neg logit contributions
tch-0.013
 Holo-0.012
 ruth-0.012
Clark-0.012
 Doyle-0.012
Token titled
Token ID11946
Feature activation-2.41e-01

Pos logit contributions
=#+0.038
 Neon+0.037
 裏�+0.036
nect+0.035
orage+0.035
Neg logit contributions
tch-0.038
 Holo-0.038
 ruth-0.037
Clark-0.036
 Doyle-0.035
Token Ol
Token ID6544
Feature activation-4.39e-01

Pos logit contributions
=#+0.156
 Neon+0.155
 裏�+0.148
nect+0.147
orage+0.146
Neg logit contributions
tch-0.160
 Holo-0.157
 ruth-0.153
Clark-0.151
 Doyle-0.147
Token'
Token ID6
Feature activation-5.39e-02

Pos logit contributions
=#+0.261
 Neon+0.260
 裏�+0.247
nect+0.246
orage+0.244
Neg logit contributions
tch-0.312
 Holo-0.308
 ruth-0.300
Clark-0.297
 Doyle-0.290
Token Brown
Token ID4373
Feature activation+3.07e-02

Pos logit contributions
=#+0.035
 Neon+0.034
 裏�+0.033
nect+0.032
orage+0.032
Neg logit contributions
tch-0.036
 Holo-0.036
 ruth-0.034
Clark-0.034
 Doyle-0.033
Token E
Token ID412
Feature activation-1.03e-01

Pos logit contributions
tch+0.021
 Holo+0.020
 ruth+0.020
Clark+0.019
 Doyle+0.019
Neg logit contributions
=#-0.020
 Neon-0.020
 裏�-0.019
nect-0.019
orage-0.018
Tokenars
Token ID945
Feature activation-4.94e-02

Pos logit contributions
=#+0.065
 Neon+0.065
 裏�+0.062
nect+0.061
orage+0.061
Neg logit contributions
tch-0.069
 Holo-0.068
 ruth-0.066
Clark-0.065
 Doyle-0.064
Token is
Token ID318
Feature activation-1.42e-02

Pos logit contributions
=#+0.032
 Neon+0.032
 裏�+0.030
nect+0.030
orage+0.030
Neg logit contributions
tch-0.033
 Holo-0.032
 ruth-0.031
Clark-0.031
 Doyle-0.030
Token friend
Token ID1545
Feature activation-8.93e-02

Pos logit contributions
 Neon+0.002
=#+0.002
 裏�+0.002
nect+0.002
icultural+0.002
Neg logit contributions
tch-0.002
 Holo-0.002
 ruth-0.002
Clark-0.002
 Doyle-0.002
Token clarified
Token ID28464
Feature activation-1.66e-01

Pos logit contributions
=#+0.056
 Neon+0.056
 裏�+0.053
nect+0.053
icultural+0.052
Neg logit contributions
tch-0.061
 Holo-0.060
 ruth-0.058
Clark-0.058
 Doyle-0.056
Token
Token ID960
Feature activation+1.35e-01

Pos logit contributions
=#+0.111
 Neon+0.109
 裏�+0.105
nect+0.104
orage+0.104
Neg logit contributions
tch-0.107
 Holo-0.105
 ruth-0.102
Clark-0.101
 Doyle-0.098
Tokenpr
Token ID1050
Feature activation+1.63e-02

Pos logit contributions
tch+0.091
 Holo+0.089
 ruth+0.087
Clark+0.086
 Doyle+0.084
Neg logit contributions
=#-0.087
 Neon-0.086
 裏�-0.083
nect-0.082
orage-0.081
Tokenide
Token ID485
Feature activation+1.30e-01

Pos logit contributions
tch+0.011
 Holo+0.010
 ruth+0.010
Clark+0.010
 Doyle+0.010
Neg logit contributions
=#-0.011
 Neon-0.011
 裏�-0.010
nect-0.010
orage-0.010
Tokenfully
Token ID2759
Feature activation-4.36e-01

Pos logit contributions
tch+0.102
 Holo+0.100
 ruth+0.098
Clark+0.097
 Doyle+0.095
Neg logit contributions
=#-0.068
 Neon-0.068
 裏�-0.065
nect-0.064
orage-0.063
Token and
Token ID290
Feature activation-3.06e-03

Pos logit contributions
=#+0.330
 Neon+0.328
 裏�+0.317
nect+0.314
icultural+0.312
Neg logit contributions
tch-0.242
 Holo-0.236
 ruth-0.227
Clark-0.225
 Doyle-0.218
Token unp
Token ID8593
Feature activation+6.87e-03

Pos logit contributions
=#+0.002
 Neon+0.002
 裏�+0.002
nect+0.002
orage+0.002
Neg logit contributions
tch-0.002
 Holo-0.002
 ruth-0.002
Clark-0.002
 Doyle-0.002
Tokenrompt
Token ID45700
Feature activation-6.50e-02

Pos logit contributions
tch+0.005
 Holo+0.005
 ruth+0.005
Clark+0.005
 Doyle+0.005
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.004
nect-0.004
orage-0.004
Tokened
Token ID276
Feature activation+1.93e-01

Pos logit contributions
=#+0.059
 Neon+0.058
 裏�+0.057
nect+0.056
orage+0.056
Neg logit contributions
tch-0.026
 Holo-0.026
 ruth-0.024
Clark-0.024
 Doyle-0.023
Token
Token ID960
Feature activation+1.37e-01

Pos logit contributions
tch+0.101
 Holo+0.099
 ruth+0.095
Clark+0.094
 Doyle+0.091
Neg logit contributions
=#-0.151
 Neon-0.150
 裏�-0.145
nect-0.144
orage-0.143
Token one
Token ID530
Feature activation-1.24e-02

Pos logit contributions
=#+0.003
 Neon+0.003
 裏�+0.003
nect+0.003
orage+0.003
Neg logit contributions
tch-0.003
 Holo-0.003
 ruth-0.003
Clark-0.003
 Doyle-0.003
Token of
Token ID286
Feature activation+4.93e-03

Pos logit contributions
=#+0.008
 Neon+0.008
 裏�+0.007
nect+0.007
orage+0.007
Neg logit contributions
tch-0.008
 Holo-0.008
 ruth-0.008
Clark-0.008
 Doyle-0.008
Token the
Token ID262
Feature activation+7.32e-03

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.003
nect-0.003
orage-0.003
Token most
Token ID749
Feature activation-4.95e-02

Pos logit contributions
tch+0.005
 Holo+0.005
 ruth+0.005
Clark+0.005
 Doyle+0.005
Neg logit contributions
=#-0.005
 Neon-0.005
 裏�-0.004
nect-0.004
orage-0.004
Token skill
Token ID5032
Feature activation-6.05e-03

Pos logit contributions
=#+0.030
 Neon+0.030
 裏�+0.028
nect+0.028
orage+0.028
Neg logit contributions
tch-0.035
 Holo-0.034
 ruth-0.033
Clark-0.033
 Doyle-0.032
Tokenfully
Token ID2759
Feature activation-4.36e-01

Pos logit contributions
=#+0.002
 Neon+0.002
 裏�+0.002
nect+0.002
orage+0.002
Neg logit contributions
tch-0.006
 Holo-0.006
 ruth-0.006
Clark-0.006
 Doyle-0.005
Token-
Token ID12
Feature activation+6.92e-03

Pos logit contributions
=#+0.286
 Neon+0.285
 裏�+0.271
nect+0.270
orage+0.268
Neg logit contributions
tch-0.285
 Holo-0.279
 ruth-0.270
Clark-0.269
 Doyle-0.263
Tokenl
Token ID75
Feature activation+8.60e-03

Pos logit contributions
tch+0.005
 Holo+0.005
 ruth+0.005
Clark+0.005
 Doyle+0.005
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.004
nect-0.004
orage-0.004
Tokeniked
Token ID17951
Feature activation+3.15e-02

Pos logit contributions
tch+0.005
 Holo+0.005
 ruth+0.005
Clark+0.005
 Doyle+0.004
Neg logit contributions
=#-0.006
 Neon-0.006
 裏�-0.006
nect-0.006
orage-0.006
Token media
Token ID2056
Feature activation+3.18e-02

Pos logit contributions
tch+0.020
 Holo+0.019
 ruth+0.018
Clark+0.018
 Doyle+0.018
Neg logit contributions
=#-0.022
 Neon-0.022
 裏�-0.021
nect-0.021
orage-0.021
Token centers
Token ID10399
Feature activation-1.40e-01

Pos logit contributions
tch+0.022
 Holo+0.022
 ruth+0.021
Clark+0.021
 Doyle+0.020
Neg logit contributions
=#-0.020
 Neon-0.020
 裏�-0.019
nect-0.019
orage-0.019
Token writes
Token ID6797
Feature activation-3.69e-02

Pos logit contributions
tch+0.011
 Holo+0.011
 ruth+0.010
Clark+0.010
 Doyle+0.010
Neg logit contributions
=#-0.011
 Neon-0.011
 裏�-0.011
nect-0.011
orage-0.011
Token very
Token ID845
Feature activation+1.02e-01

Pos logit contributions
=#+0.024
 Neon+0.024
 裏�+0.023
nect+0.023
orage+0.023
Neg logit contributions
tch-0.024
 Holo-0.024
 ruth-0.023
Clark-0.023
 Doyle-0.022
Token honestly
Token ID12698
Feature activation-8.14e-02

Pos logit contributions
tch+0.066
 Holo+0.065
 ruth+0.063
Clark+0.062
 Doyle+0.061
Neg logit contributions
=#-0.068
 Neon-0.067
 裏�-0.065
nect-0.064
orage-0.064
Token and
Token ID290
Feature activation+7.57e-04

Pos logit contributions
=#+0.053
 Neon+0.052
 裏�+0.050
nect+0.049
orage+0.049
Neg logit contributions
tch-0.054
 Holo-0.053
 ruth-0.052
Clark-0.051
 Doyle-0.050
Token truth
Token ID3872
Feature activation+1.24e-01

Pos logit contributions
tch+0.000
 Holo+0.000
Clark+0.000
 ruth+0.000
 Doyle+0.000
Neg logit contributions
=#-0.000
 Neon-0.000
 裏�-0.000
nect-0.000
orage-0.000
Tokenfully
Token ID2759
Feature activation-4.34e-01

Pos logit contributions
tch+0.143
 Holo+0.142
 ruth+0.140
Clark+0.139
 Doyle+0.137
Neg logit contributions
=#-0.018
 Neon-0.018
 裏�-0.015
icultural-0.013
orage-0.013
Token about
Token ID546
Feature activation-1.37e-02

Pos logit contributions
=#+0.277
 Neon+0.275
 裏�+0.264
nect+0.261
orage+0.259
Neg logit contributions
tch-0.291
 Holo-0.286
 ruth-0.277
Clark-0.275
 Doyle-0.268
Token the
Token ID262
Feature activation+5.39e-03

Pos logit contributions
=#+0.009
 Neon+0.009
 裏�+0.009
nect+0.009
orage+0.008
Neg logit contributions
tch-0.009
 Holo-0.009
 ruth-0.008
Clark-0.008
 Doyle-0.008
Token human
Token ID1692
Feature activation+8.62e-02

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.004
 Doyle+0.004
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.003
nect-0.003
icultural-0.003
Token experience
Token ID1998
Feature activation-1.70e-01

Pos logit contributions
tch+0.061
 Holo+0.060
 ruth+0.059
Clark+0.057
 Doyle+0.057
Neg logit contributions
=#-0.052
 Neon-0.051
 裏�-0.050
icultural-0.049
nect-0.049
Token.
Token ID13
Feature activation+4.74e-03

Pos logit contributions
=#+0.111
 Neon+0.110
 裏�+0.105
nect+0.104
orage+0.104
Neg logit contributions
tch-0.113
 Holo-0.111
 ruth-0.107
Clark-0.106
 Doyle-0.104
Token
Token ID447
Feature activation+1.07e-02

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.003
nect-0.003
orage-0.003
Token
Token ID251
Feature activation+7.52e-03

Pos logit contributions
tch+0.007
 Holo+0.007
 ruth+0.007
Clark+0.007
 Doyle+0.007
Neg logit contributions
=#-0.007
 Neon-0.007
 裏�-0.006
nect-0.006
orage-0.006
Token he
Token ID339
Feature activation+3.26e-04

Pos logit contributions
tch+0.005
 Holo+0.005
 ruth+0.005
Clark+0.005
 Doyle+0.004
Neg logit contributions
=#-0.005
 Neon-0.005
 裏�-0.005
nect-0.005
icultural-0.005
Token says
Token ID1139
Feature activation+4.21e-03

Pos logit contributions
tch+0.000
 Holo+0.000
 ruth+0.000
Clark+0.000
 Doyle+0.000
Neg logit contributions
=#-0.000
 Neon-0.000
 裏�-0.000
nect-0.000
orage-0.000
Token cheer
Token ID14042
Feature activation+9.45e-03

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.003
nect-0.003
icultural-0.002
Tokenfully
Token ID2759
Feature activation-4.32e-01

Pos logit contributions
tch+0.011
 Holo+0.011
 ruth+0.010
Clark+0.010
 Doyle+0.010
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.001
nect-0.001
orage-0.001
Token.
Token ID13
Feature activation+1.17e-03

Pos logit contributions
=#+0.270
 Neon+0.269
 裏�+0.258
nect+0.254
icultural+0.253
Neg logit contributions
tch-0.295
 Holo-0.290
 ruth-0.280
Clark-0.279
 Doyle-0.272
Token
Token ID564
Feature activation+2.44e-03

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Token
Token ID250
Feature activation+8.20e-03

Pos logit contributions
tch+0.002
 Holo+0.002
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.001
nect-0.001
orage-0.001
TokenYou
Token ID1639
Feature activation-3.78e-02

Pos logit contributions
tch+0.006
 Holo+0.006
 ruth+0.005
Clark+0.005
 Doyle+0.005
Neg logit contributions
=#-0.005
 Neon-0.005
 裏�-0.005
nect-0.005
orage-0.005
Token just
Token ID655
Feature activation+1.09e-02

Pos logit contributions
=#+0.025
 Neon+0.025
 裏�+0.024
nect+0.024
orage+0.023
Neg logit contributions
tch-0.025
 Holo-0.024
 ruth-0.023
Clark-0.023
 Doyle-0.023
Token"
Token ID1
Feature activation-1.42e-02

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
orage-0.002
TokenMom
Token ID29252
Feature activation+5.35e-02

Pos logit contributions
=#+0.009
 Neon+0.009
 裏�+0.009
nect+0.009
icultural+0.008
Neg logit contributions
tch-0.010
 Holo-0.009
 ruth-0.009
Clark-0.009
 Doyle-0.009
Token?
Token ID30
Feature activation+2.55e-02

Pos logit contributions
tch+0.034
 Holo+0.033
 ruth+0.032
Clark+0.032
 Doyle+0.031
Neg logit contributions
=#-0.036
 Neon-0.036
 裏�-0.034
nect-0.034
orage-0.034
Token Dad
Token ID17415
Feature activation+1.46e-01

Pos logit contributions
tch+0.016
 Holo+0.016
 ruth+0.015
Clark+0.015
 Doyle+0.015
Neg logit contributions
=#-0.017
 Neon-0.017
 裏�-0.016
nect-0.016
orage-0.016
Token?
Token ID30
Feature activation+2.01e-02

Pos logit contributions
tch+0.091
 Holo+0.089
 ruth+0.086
Clark+0.085
 Doyle+0.083
Neg logit contributions
=#-0.101
 Neon-0.100
 裏�-0.096
nect-0.095
orage-0.095
Token Ol
Token ID6544
Feature activation-4.32e-01

Pos logit contributions
tch+0.013
 Holo+0.013
 ruth+0.012
Clark+0.012
 Doyle+0.012
Neg logit contributions
=#-0.013
 Neon-0.013
 裏�-0.013
nect-0.013
orage-0.013
Tokenaf
Token ID1878
Feature activation-5.87e-02

Pos logit contributions
=#+0.227
 Neon+0.225
nect+0.210
orage+0.209
 裏�+0.207
Neg logit contributions
tch-0.337
 Holo-0.331
 ruth-0.323
Clark-0.321
 Doyle-0.315
Token?"
Token ID1701
Feature activation+5.22e-02

Pos logit contributions
=#+0.040
 Neon+0.040
 裏�+0.038
nect+0.037
orage+0.037
Neg logit contributions
tch-0.037
 Holo-0.036
 ruth-0.035
Clark-0.035
 Doyle-0.034
Token I
Token ID314
Feature activation+1.32e-02

Pos logit contributions
tch+0.034
 Holo+0.034
 ruth+0.033
Clark+0.032
 Doyle+0.031
Neg logit contributions
=#-0.034
 Neon-0.034
 裏�-0.033
nect-0.032
orage-0.032
Token stood
Token ID6204
Feature activation-8.41e-02

Pos logit contributions
tch+0.008
 Holo+0.008
 ruth+0.008
Clark+0.008
 Doyle+0.008
Neg logit contributions
=#-0.009
 Neon-0.009
 裏�-0.008
nect-0.008
icultural-0.008
Token still
Token ID991
Feature activation+2.42e-03

Pos logit contributions
=#+0.053
 Neon+0.053
 裏�+0.051
nect+0.050
orage+0.050
Neg logit contributions
tch-0.057
 Holo-0.056
 ruth-0.054
Clark-0.054
 Doyle-0.052
Token,
Token ID11
Feature activation+3.68e-04

Pos logit contributions
=#+0.004
 Neon+0.004
 裏�+0.004
nect+0.004
orage+0.004
Neg logit contributions
tch-0.004
 Holo-0.004
 ruth-0.004
Clark-0.004
 Doyle-0.004
Token "
Token ID366
Feature activation+1.71e-02

Pos logit contributions
tch+0.000
 Holo+0.000
 ruth+0.000
Clark+0.000
 Doyle+0.000
Neg logit contributions
=#-0.000
 Neon-0.000
 裏�-0.000
nect-0.000
orage-0.000
TokenOkay
Token ID16454
Feature activation-3.69e-02

Pos logit contributions
tch+0.012
 Holo+0.012
 ruth+0.011
Clark+0.011
 Doyle+0.011
Neg logit contributions
=#-0.011
 Neon-0.010
 裏�-0.010
nect-0.010
orage-0.010
Token,
Token ID11
Feature activation-5.49e-04

Pos logit contributions
=#+0.024
 Neon+0.024
 裏�+0.023
nect+0.023
orage+0.022
Neg logit contributions
tch-0.025
 Holo-0.024
 ruth-0.023
Clark-0.023
 Doyle-0.023
Token focus
Token ID2962
Feature activation-2.26e-01

Pos logit contributions
=#+0.000
 Neon+0.000
 裏�+0.000
nect+0.000
orage+0.000
Neg logit contributions
tch-0.000
 Holo-0.000
 ruth-0.000
Clark-0.000
 Doyle-0.000
Token Ol
Token ID6544
Feature activation-4.32e-01

Pos logit contributions
=#+0.142
 Neon+0.140
 裏�+0.135
nect+0.133
orage+0.132
Neg logit contributions
tch-0.155
 Holo-0.152
 ruth-0.148
Clark-0.147
 Doyle-0.143
Tokenaf
Token ID1878
Feature activation-5.44e-02

Pos logit contributions
=#+0.227
 Neon+0.223
nect+0.213
icultural+0.211
 裏�+0.211
Neg logit contributions
tch-0.338
 Holo-0.334
 ruth-0.325
Clark-0.322
 Doyle-0.315
Token.
Token ID13
Feature activation-1.68e-03

Pos logit contributions
=#+0.035
 Neon+0.034
 裏�+0.033
nect+0.033
orage+0.033
Neg logit contributions
tch-0.037
 Holo-0.036
 ruth-0.035
Clark-0.035
 Doyle-0.034
Token If
Token ID1002
Feature activation+1.96e-02

Pos logit contributions
=#+0.001
 Neon+0.001
 裏�+0.001
nect+0.001
orage+0.001
Neg logit contributions
tch-0.001
 Holo-0.001
 ruth-0.001
Clark-0.001
 Doyle-0.001
Token Brian
Token ID8403
Feature activation-4.27e-02

Pos logit contributions
tch+0.013
 Holo+0.013
 ruth+0.012
Clark+0.012
 Doyle+0.012
Neg logit contributions
=#-0.013
 Neon-0.013
 裏�-0.012
nect-0.012
orage-0.012
Token comes
Token ID2058
Feature activation+4.38e-02

Pos logit contributions
=#+0.028
 Neon+0.028
 裏�+0.027
nect+0.026
orage+0.026
Neg logit contributions
tch-0.028
 Holo-0.027
 ruth-0.027
Clark-0.026
 Doyle-0.026
Token evil
Token ID6181
Feature activation-5.42e-02

Pos logit contributions
tch+0.005
 Holo+0.005
 ruth+0.005
Clark+0.005
 Doyle+0.005
Neg logit contributions
=#-0.005
 Neon-0.005
 裏�-0.005
nect-0.005
orage-0.005
Tokenness
Token ID1108
Feature activation+2.50e-01

Pos logit contributions
=#+0.036
 Neon+0.036
 裏�+0.035
nect+0.034
orage+0.034
Neg logit contributions
tch-0.035
 Holo-0.034
 ruth-0.033
Clark-0.033
 Doyle-0.032
Token that
Token ID326
Feature activation+1.03e-02

Pos logit contributions
tch+0.164
 Holo+0.161
 ruth+0.156
Clark+0.155
 Doyle+0.151
Neg logit contributions
=#-0.163
 Neon-0.162
 裏�-0.155
nect-0.153
orage-0.153
Token is
Token ID318
Feature activation-9.47e-03

Pos logit contributions
tch+0.007
 Holo+0.007
 ruth+0.006
Clark+0.006
 Doyle+0.006
Neg logit contributions
=#-0.007
 Neon-0.007
 裏�-0.006
nect-0.006
orage-0.006
Token in
Token ID287
Feature activation+1.58e-02

Pos logit contributions
=#+0.006
 Neon+0.006
 裏�+0.006
nect+0.006
orage+0.006
Neg logit contributions
tch-0.006
 Holo-0.006
 ruth-0.006
Clark-0.006
 Doyle-0.006
Token Ol
Token ID6544
Feature activation-4.31e-01

Pos logit contributions
tch+0.010
 Holo+0.010
 ruth+0.010
Clark+0.010
 Doyle+0.009
Neg logit contributions
=#-0.011
 Neon-0.010
 裏�-0.010
nect-0.010
orage-0.010
Tokenaf
Token ID1878
Feature activation-5.98e-02

Pos logit contributions
=#+0.221
 Neon+0.218
nect+0.207
icultural+0.207
orage+0.206
Neg logit contributions
tch-0.342
 Holo-0.339
 ruth-0.330
Clark-0.327
 Doyle-0.319
Token
Token ID960
Feature activation+1.37e-01

Pos logit contributions
=#+0.039
 Neon+0.039
 裏�+0.037
nect+0.036
orage+0.036
Neg logit contributions
tch-0.040
 Holo-0.039
 ruth-0.038
Clark-0.037
 Doyle-0.036
TokenI
Token ID40
Feature activation+1.46e-02

Pos logit contributions
tch+0.093
 Holo+0.092
 ruth+0.089
Clark+0.088
 Doyle+0.086
Neg logit contributions
=#-0.086
 Neon-0.085
 裏�-0.082
nect-0.081
orage-0.080
Token don
Token ID836
Feature activation+4.09e-02

Pos logit contributions
tch+0.010
 Holo+0.010
 ruth+0.010
Clark+0.010
 Doyle+0.009
Neg logit contributions
=#-0.009
 Neon-0.009
 裏�-0.009
nect-0.008
orage-0.008
Token
Token ID447
Feature activation-3.58e-04

Pos logit contributions
tch+0.024
 Holo+0.023
 ruth+0.023
Clark+0.022
 Doyle+0.022
Neg logit contributions
=#-0.030
 Neon-0.030
 裏�-0.029
nect-0.028
orage-0.028
Token case
Token ID1339
Feature activation+5.64e-03

Pos logit contributions
tch+0.058
 Holo+0.057
 ruth+0.055
Clark+0.055
 Doyle+0.053
Neg logit contributions
=#-0.060
 Neon-0.060
 裏�-0.057
nect-0.057
orage-0.056
Token financial
Token ID3176
Feature activation-1.16e-01

Pos logit contributions
tch+0.005
 Holo+0.005
 ruth+0.005
Clark+0.005
 Doyle+0.005
Neg logit contributions
=#-0.003
 Neon-0.002
 裏�-0.002
nect-0.002
orage-0.002
Token situation
Token ID3074
Feature activation-3.78e-02

Pos logit contributions
=#+0.073
 Neon+0.072
 裏�+0.069
nect+0.068
orage+0.068
Neg logit contributions
tch-0.079
 Holo-0.078
 ruth-0.075
Clark-0.075
 Doyle-0.073
Token if
Token ID611
Feature activation-4.10e-02

Pos logit contributions
=#+0.024
 Neon+0.024
 裏�+0.023
nect+0.023
orage+0.023
Neg logit contributions
tch-0.025
 Holo-0.025
 ruth-0.024
Clark-0.024
 Doyle-0.024
Token he
Token ID339
Feature activation-3.33e-02

Pos logit contributions
=#+0.026
 Neon+0.026
 裏�+0.025
nect+0.025
orage+0.025
Neg logit contributions
tch-0.027
 Holo-0.027
 ruth-0.026
Clark-0.026
 Doyle-0.025
Token opt
Token ID2172
Feature activation-4.31e-01

Pos logit contributions
=#+0.021
 Neon+0.021
 裏�+0.020
nect+0.020
orage+0.020
Neg logit contributions
tch-0.022
 Holo-0.022
 ruth-0.021
Clark-0.021
 Doyle-0.020
Tokens
Token ID82
Feature activation-1.41e-02

Pos logit contributions
=#+0.293
 Neon+0.290
 裏�+0.280
nect+0.276
orage+0.275
Neg logit contributions
tch-0.272
 Holo-0.267
 ruth-0.259
Clark-0.256
 Doyle-0.250
Token in
Token ID287
Feature activation-1.96e-02

Pos logit contributions
=#+0.009
 Neon+0.008
 裏�+0.008
nect+0.008
orage+0.008
Neg logit contributions
tch-0.010
 Holo-0.010
 ruth-0.009
Clark-0.009
 Doyle-0.009
Token.
Token ID13
Feature activation-3.28e-02

Pos logit contributions
=#+0.013
 Neon+0.013
 裏�+0.012
nect+0.012
orage+0.012
Neg logit contributions
tch-0.013
 Holo-0.013
 ruth-0.012
Clark-0.012
 Doyle-0.012
Token
Token ID198
Feature activation-2.86e-02

Pos logit contributions
=#+0.021
 Neon+0.021
 裏�+0.020
nect+0.020
orage+0.020
Neg logit contributions
tch-0.022
 Holo-0.021
 ruth-0.021
Clark-0.021
 Doyle-0.020
Token
Token ID198
Feature activation-1.99e-02

Pos logit contributions
=#+0.018
 Neon+0.018
 裏�+0.017
nect+0.017
orage+0.017
Neg logit contributions
tch-0.020
 Holo-0.019
 ruth-0.019
Clark-0.018
 Doyle-0.018
Token.
Token ID13
Feature activation+3.16e-03

Pos logit contributions
=#+0.079
 Neon+0.078
 裏�+0.075
nect+0.074
orage+0.074
Neg logit contributions
tch-0.090
 Holo-0.088
 ruth-0.086
Clark-0.085
 Doyle-0.083
Token (
Token ID357
Feature activation-2.37e-02

Pos logit contributions
tch+0.002
 Holo+0.002
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
orage-0.002
Token 1986
Token ID12113
Feature activation+4.05e-02

Pos logit contributions
=#+0.015
 Neon+0.015
 裏�+0.014
nect+0.014
orage+0.014
Neg logit contributions
tch-0.016
 Holo-0.016
 ruth-0.016
Clark-0.015
 Doyle-0.015
Token ).
Token ID6739
Feature activation-5.35e-02

Pos logit contributions
tch+0.030
 Holo+0.030
 ruth+0.029
Clark+0.029
 Doyle+0.028
Neg logit contributions
=#-0.023
 Neon-0.023
 裏�-0.022
nect-0.021
icultural-0.021
Token The
Token ID383
Feature activation-2.93e-02

Pos logit contributions
=#+0.033
 Neon+0.033
 裏�+0.032
nect+0.031
orage+0.031
Neg logit contributions
tch-0.037
 Holo-0.036
 ruth-0.035
Clark-0.035
 Doyle-0.034
Token Ol
Token ID6544
Feature activation-4.30e-01

Pos logit contributions
=#+0.019
 Neon+0.019
 裏�+0.018
nect+0.018
orage+0.018
Neg logit contributions
tch-0.019
 Holo-0.019
 ruth-0.018
Clark-0.018
 Doyle-0.018
Tokenwe
Token ID732
Feature activation-1.19e-01

Pos logit contributions
=#+0.214
 Neon+0.212
 裏�+0.201
nect+0.198
orage+0.196
Neg logit contributions
tch-0.350
 Holo-0.345
 ruth-0.336
Clark-0.334
 Doyle-0.327
Tokenus
Token ID385
Feature activation+8.52e-03

Pos logit contributions
=#+0.072
 Neon+0.072
 裏�+0.069
nect+0.068
orage+0.067
Neg logit contributions
tch-0.084
 Holo-0.082
 ruth-0.080
Clark-0.079
 Doyle-0.077
Token B
Token ID347
Feature activation-2.94e-03

Pos logit contributions
tch+0.006
 Holo+0.006
 ruth+0.005
Clark+0.005
 Doyle+0.005
Neg logit contributions
=#-0.006
 Neon-0.006
 裏�-0.005
nect-0.005
icultural-0.005
Tokenully
Token ID2132
Feature activation-3.93e-02

Pos logit contributions
=#+0.002
 Neon+0.002
 裏�+0.002
nect+0.002
icultural+0.002
Neg logit contributions
tch-0.002
 Holo-0.002
 ruth-0.002
Clark-0.002
 Doyle-0.002
Token/
Token ID14
Feature activation+3.57e-02

Pos logit contributions
=#+0.025
 Neon+0.025
 裏�+0.024
nect+0.024
icultural+0.024
Neg logit contributions
tch-0.026
 Holo-0.026
 ruth-0.025
Clark-0.025
 Doyle-0.024
Token caught
Token ID4978
Feature activation-4.85e-02

Pos logit contributions
tch+0.006
 Holo+0.006
 ruth+0.006
Clark+0.006
 Doyle+0.006
Neg logit contributions
=#-0.006
 Neon-0.006
 裏�-0.006
nect-0.005
orage-0.005
Token me
Token ID502
Feature activation+3.25e-02

Pos logit contributions
=#+0.034
 Neon+0.034
 裏�+0.033
nect+0.032
icultural+0.032
Neg logit contributions
tch-0.029
 Holo-0.029
 ruth-0.028
Clark-0.028
 Doyle-0.027
Token off
Token ID572
Feature activation-3.90e-02

Pos logit contributions
tch+0.022
 Holo+0.022
 ruth+0.021
Clark+0.021
 Doyle+0.021
Neg logit contributions
=#-0.020
 Neon-0.020
 裏�-0.019
nect-0.019
icultural-0.019
Token guard
Token ID4860
Feature activation-1.57e-01

Pos logit contributions
=#+0.018
 Neon+0.018
 裏�+0.017
nect+0.016
orage+0.016
Neg logit contributions
tch-0.033
 Holo-0.033
 ruth-0.032
Clark-0.032
 Doyle-0.031
Token.
Token ID13
Feature activation+3.57e-03

Pos logit contributions
=#+0.102
 Neon+0.101
 裏�+0.097
nect+0.096
orage+0.096
Neg logit contributions
tch-0.104
 Holo-0.102
 ruth-0.099
Clark-0.098
 Doyle-0.096
Token Ol
Token ID6544
Feature activation-4.28e-01

Pos logit contributions
tch+0.002
 Holo+0.002
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
orage-0.002
Tokenaf
Token ID1878
Feature activation-5.48e-02

Pos logit contributions
=#+0.227
 Neon+0.223
nect+0.211
orage+0.211
icultural+0.210
Neg logit contributions
tch-0.334
 Holo-0.330
 ruth-0.321
Clark-0.319
 Doyle-0.312
Token releases
Token ID10050
Feature activation+2.18e-01

Pos logit contributions
=#+0.035
 Neon+0.035
 裏�+0.033
nect+0.033
orage+0.033
Neg logit contributions
tch-0.037
 Holo-0.036
 ruth-0.035
Clark-0.035
 Doyle-0.034
Token me
Token ID502
Feature activation+3.13e-02

Pos logit contributions
tch+0.135
 Holo+0.133
 ruth+0.128
Clark+0.127
 Doyle+0.124
Neg logit contributions
=#-0.150
 Neon-0.149
 裏�-0.143
nect-0.142
orage-0.141
Token and
Token ID290
Feature activation+3.95e-03

Pos logit contributions
tch+0.020
 Holo+0.020
 ruth+0.019
Clark+0.019
 Doyle+0.019
Neg logit contributions
=#-0.021
 Neon-0.021
 裏�-0.020
nect-0.019
orage-0.019
Token it
Token ID340
Feature activation-9.14e-05

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.002
nect-0.002
orage-0.002
Token the
Token ID262
Feature activation+5.83e-03

Pos logit contributions
tch+0.034
 Holo+0.033
 ruth+0.032
Clark+0.032
 Doyle+0.031
Neg logit contributions
=#-0.035
 Neon-0.034
 裏�-0.033
nect-0.033
orage-0.032
Token same
Token ID976
Feature activation+3.19e-02

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.004
nect-0.004
icultural-0.004
Token nearby
Token ID6716
Feature activation-8.04e-02

Pos logit contributions
tch+0.021
 Holo+0.021
 ruth+0.020
Clark+0.020
 Doyle+0.019
Neg logit contributions
=#-0.021
 Neon-0.021
 裏�-0.020
nect-0.020
orage-0.020
Token,
Token ID11
Feature activation-6.03e-04

Pos logit contributions
=#+0.051
 Neon+0.051
 裏�+0.049
nect+0.048
orage+0.048
Neg logit contributions
tch-0.054
 Holo-0.053
 ruth-0.051
Clark-0.051
 Doyle-0.050
Token Ky
Token ID11118
Feature activation-6.86e-02

Pos logit contributions
 Neon+0.000
=#+0.000
 裏�+0.000
nect+0.000
orage+0.000
Neg logit contributions
tch-0.000
 Holo-0.000
 ruth-0.000
Clark-0.000
 Doyle-0.000
Tokenodo
Token ID24313
Feature activation+2.46e-01

Pos logit contributions
=#+0.052
 Neon+0.051
 裏�+0.050
nect+0.049
icultural+0.049
Neg logit contributions
tch-0.038
 Holo-0.038
 ruth-0.036
Clark-0.036
 Doyle-0.035
Token News
Token ID3000
Feature activation-3.42e-02

Pos logit contributions
tch+0.165
 Holo+0.163
 ruth+0.158
Clark+0.157
 Doyle+0.153
Neg logit contributions
=#-0.157
 Neon-0.155
 裏�-0.150
nect-0.148
orage-0.147
Token Service
Token ID4809
Feature activation+5.12e-02

Pos logit contributions
=#+0.024
 Neon+0.024
 裏�+0.023
nect+0.022
orage+0.022
Neg logit contributions
tch-0.021
 Holo-0.021
 ruth-0.020
Clark-0.020
 Doyle-0.019
Token reported
Token ID2098
Feature activation+3.59e-02

Pos logit contributions
tch+0.032
 Holo+0.031
 ruth+0.030
Clark+0.030
 Doyle+0.029
Neg logit contributions
=#-0.036
 Neon-0.035
 裏�-0.034
nect-0.034
orage-0.033
Token.
Token ID13
Feature activation+1.10e-03

Pos logit contributions
tch+0.024
 Holo+0.023
 ruth+0.023
Clark+0.022
 Doyle+0.022
Neg logit contributions
=#-0.023
 Neon-0.023
 裏�-0.022
nect-0.022
orage-0.022
Token Authorities
Token ID30756
Feature activation-5.80e-03

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Token water
Token ID1660
Feature activation+1.19e-01

Pos logit contributions
=#+0.094
 Neon+0.093
 裏�+0.090
nect+0.089
orage+0.088
Neg logit contributions
tch-0.096
 Holo-0.094
 ruth-0.091
Clark-0.090
 Doyle-0.088
Token.
Token ID13
Feature activation-3.63e-03

Pos logit contributions
tch+0.078
 Holo+0.077
 ruth+0.074
Clark+0.073
 Doyle+0.072
Neg logit contributions
=#-0.078
 Neon-0.077
 裏�-0.074
nect-0.073
orage-0.073
Token According
Token ID4784
Feature activation+9.95e-03

Pos logit contributions
=#+0.002
 Neon+0.002
 裏�+0.002
nect+0.002
icultural+0.002
Neg logit contributions
tch-0.002
 Holo-0.002
 ruth-0.002
Clark-0.002
 Doyle-0.002
Token to
Token ID284
Feature activation-7.72e-04

Pos logit contributions
tch+0.007
 Holo+0.007
 ruth+0.006
Clark+0.006
 Doyle+0.006
Neg logit contributions
=#-0.006
 Neon-0.006
 裏�-0.006
nect-0.006
orage-0.006
Token H
Token ID367
Feature activation+1.39e-01

Pos logit contributions
=#+0.001
 Neon+0.000
 裏�+0.000
nect+0.000
orage+0.000
Neg logit contributions
tch-0.001
 Holo-0.001
 ruth-0.000
Clark-0.000
 Doyle-0.000
Tokenoss
Token ID793
Feature activation+2.56e-01

Pos logit contributions
tch+0.051
 Holo+0.050
 ruth+0.047
Clark+0.046
 Doyle+0.044
Neg logit contributions
=#-0.132
 Neon-0.131
 裏�-0.127
nect-0.126
orage-0.126
Tokenain
Token ID391
Feature activation+3.29e-02

Pos logit contributions
tch+0.167
 Holo+0.165
 ruth+0.159
Clark+0.158
 Doyle+0.154
Neg logit contributions
=#-0.169
 Neon-0.167
 裏�-0.160
nect-0.159
orage-0.158
Token,
Token ID11
Feature activation-7.30e-03

Pos logit contributions
tch+0.021
 Holo+0.021
 ruth+0.020
Clark+0.020
 Doyle+0.020
Neg logit contributions
=#-0.022
 Neon-0.022
 裏�-0.021
nect-0.021
orage-0.021
Token the
Token ID262
Feature activation-2.60e-03

Pos logit contributions
=#+0.005
 Neon+0.005
 裏�+0.005
nect+0.005
icultural+0.005
Neg logit contributions
tch-0.005
 Holo-0.005
 ruth-0.005
Clark-0.004
 Doyle-0.004
Token spill
Token ID19431
Feature activation+2.34e-01

Pos logit contributions
=#+0.002
 Neon+0.002
 裏�+0.002
nect+0.002
orage+0.002
Neg logit contributions
tch-0.002
 Holo-0.002
 ruth-0.002
Clark-0.002
 Doyle-0.002
Token has
Token ID468
Feature activation+5.49e-03

Pos logit contributions
tch+0.155
 Holo+0.153
 ruth+0.148
Clark+0.147
 Doyle+0.143
Neg logit contributions
=#-0.151
 Neon-0.150
 裏�-0.144
nect-0.142
icultural-0.141
Token.
Token ID13
Feature activation+5.41e-03

Pos logit contributions
tch+0.101
 Holo+0.099
 ruth+0.097
Clark+0.095
 Doyle+0.093
Neg logit contributions
=#-0.102
 Neon-0.101
 裏�-0.097
nect-0.096
orage-0.096
Token Often
Token ID18023
Feature activation+1.55e-01

Pos logit contributions
tch+0.004
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.004
 Neon-0.003
 裏�-0.003
nect-0.003
icultural-0.003
Token,
Token ID11
Feature activation-1.71e-03

Pos logit contributions
tch+0.101
 Holo+0.100
 ruth+0.097
Clark+0.095
 Doyle+0.093
Neg logit contributions
=#-0.102
 Neon-0.101
 裏�-0.097
nect-0.096
orage-0.096
Token the
Token ID262
Feature activation+5.92e-03

Pos logit contributions
=#+0.001
 Neon+0.001
 裏�+0.001
nect+0.001
orage+0.001
Neg logit contributions
tch-0.001
 Holo-0.001
 ruth-0.001
Clark-0.001
 Doyle-0.001
Token protesters
Token ID9584
Feature activation-2.95e-02

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.004
 Doyle+0.004
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.003
nect-0.003
orage-0.003
Token sly
Token ID49822
Feature activation+2.60e-01

Pos logit contributions
=#+0.019
 Neon+0.019
 裏�+0.018
nect+0.018
orage+0.018
Neg logit contributions
tch-0.019
 Holo-0.019
 ruth-0.018
Clark-0.018
 Doyle-0.018
Tokenly
Token ID306
Feature activation+1.89e-02

Pos logit contributions
tch+0.171
 Holo+0.169
 ruth+0.163
Clark+0.162
 Doyle+0.158
Neg logit contributions
=#-0.169
 Neon-0.168
 裏�-0.161
nect-0.160
icultural-0.158
Token distorted
Token ID26987
Feature activation+1.20e-02

Pos logit contributions
tch+0.012
 Holo+0.012
Clark+0.012
 ruth+0.012
 Doyle+0.011
Neg logit contributions
=#-0.012
 Neon-0.012
 裏�-0.012
nect-0.012
icultural-0.011
Token the
Token ID262
Feature activation+5.77e-03

Pos logit contributions
tch+0.008
 Holo+0.008
 ruth+0.008
Clark+0.007
 Doyle+0.007
Neg logit contributions
=#-0.008
 Neon-0.008
 裏�-0.007
nect-0.007
orage-0.007
Token traditional
Token ID4569
Feature activation+1.52e-01

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.004
 Doyle+0.004
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.003
nect-0.003
orage-0.003
Token rallying
Token ID36467
Feature activation-1.24e-01

Pos logit contributions
tch+0.104
 Holo+0.103
 ruth+0.100
Clark+0.099
 Doyle+0.097
Neg logit contributions
=#-0.094
 Neon-0.093
 裏�-0.090
nect-0.089
orage-0.088
Token of
Token ID286
Feature activation+4.09e-03

Pos logit contributions
tch+0.079
 Holo+0.078
 ruth+0.076
Clark+0.075
 Doyle+0.073
Neg logit contributions
=#-0.078
 Neon-0.078
 裏�-0.075
nect-0.074
orage-0.073
Token the
Token ID262
Feature activation+7.21e-03

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.002
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.003
nect-0.003
orage-0.003
Token E
Token ID412
Feature activation-1.00e-01

Pos logit contributions
tch+0.005
 Holo+0.005
 ruth+0.005
Clark+0.005
 Doyle+0.004
Neg logit contributions
=#-0.005
 Neon-0.005
 裏�-0.004
nect-0.004
orage-0.004
TokenBS
Token ID4462
Feature activation-6.95e-02

Pos logit contributions
=#+0.057
 Neon+0.056
 裏�+0.054
nect+0.053
orage+0.053
Neg logit contributions
tch-0.075
 Holo-0.074
 ruth-0.072
Clark-0.071
 Doyle-0.070
TokenCO
Token ID8220
Feature activation-4.33e-02

Pos logit contributions
=#+0.040
 Neon+0.040
 裏�+0.038
nect+0.037
orage+0.037
Neg logit contributions
tch-0.051
 Holo-0.050
 ruth-0.049
Clark-0.048
 Doyle-0.047
Token Discovery
Token ID23455
Feature activation+4.11e-01

Pos logit contributions
=#+0.029
 Neon+0.028
 裏�+0.027
nect+0.027
orage+0.027
Neg logit contributions
tch-0.028
 Holo-0.028
 ruth-0.027
Clark-0.027
 Doyle-0.026
Token Services
Token ID6168
Feature activation+1.01e-01

Pos logit contributions
tch+0.267
 Holo+0.262
 ruth+0.254
Clark+0.252
 Doyle+0.246
Neg logit contributions
=#-0.271
 Neon-0.269
 裏�-0.259
nect-0.256
orage-0.254
Token (
Token ID357
Feature activation-2.32e-02

Pos logit contributions
tch+0.065
 Holo+0.063
 ruth+0.061
Clark+0.061
 Doyle+0.059
Neg logit contributions
=#-0.067
 Neon-0.067
 裏�-0.064
nect-0.064
orage-0.063
TokenED
Token ID1961
Feature activation+7.83e-02

Pos logit contributions
=#+0.016
 Neon+0.016
 裏�+0.015
nect+0.015
orage+0.015
Neg logit contributions
tch-0.015
 Holo-0.014
 ruth-0.014
Clark-0.014
 Doyle-0.014
TokenS
Token ID50
Feature activation+6.57e-02

Pos logit contributions
tch+0.048
 Holo+0.047
 ruth+0.045
Clark+0.045
 Doyle+0.044
Neg logit contributions
=#-0.055
 Neon-0.055
 裏�-0.053
nect-0.052
orage-0.052
Token)
Token ID8
Feature activation-2.04e-02

Pos logit contributions
tch+0.045
 Holo+0.044
 ruth+0.043
Clark+0.043
 Doyle+0.042
Neg logit contributions
=#-0.041
 Neon-0.041
 裏�-0.039
nect-0.039
orage-0.038
Token hands
Token ID2832
Feature activation+9.14e-02

Pos logit contributions
=#+0.008
 Neon+0.008
 裏�+0.008
nect+0.008
icultural+0.008
Neg logit contributions
tch-0.006
 Holo-0.006
 ruth-0.005
Clark-0.005
 Doyle-0.005
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
tch+0.059
 Holo+0.058
 ruth+0.056
Clark+0.056
 Doyle+0.054
Neg logit contributions
=#-0.061
 Neon-0.060
 裏�-0.058
nect-0.057
orage-0.057
Token
Token ID198
Feature activation-9.59e-03

Pos logit contributions
=#+0.007
 Neon+0.007
 裏�+0.007
nect+0.007
orage+0.007
Neg logit contributions
tch-0.008
 Holo-0.007
 ruth-0.007
Clark-0.007
 Doyle-0.007
TokenIn
Token ID818
Feature activation+4.19e-02

Pos logit contributions
=#+0.006
 Neon+0.006
 裏�+0.006
nect+0.006
orage+0.006
Neg logit contributions
tch-0.006
 Holo-0.006
 ruth-0.006
Clark-0.006
 Doyle-0.006
Token sports
Token ID5701
Feature activation-1.31e-02

Pos logit contributions
tch+0.027
 Holo+0.026
 ruth+0.025
Clark+0.025
 Doyle+0.025
Neg logit contributions
=#-0.028
 Neon-0.028
 裏�-0.027
nect-0.027
orage-0.026
Token referees
Token ID42692
Feature activation+2.74e-01

Pos logit contributions
=#+0.009
 Neon+0.009
 裏�+0.008
nect+0.008
orage+0.008
Neg logit contributions
tch-0.009
 Holo-0.008
 ruth-0.008
Clark-0.008
 Doyle-0.008
Token are
Token ID389
Feature activation-1.44e-02

Pos logit contributions
tch+0.181
 Holo+0.178
 ruth+0.173
Clark+0.171
 Doyle+0.167
Neg logit contributions
=#-0.178
 Neon-0.177
 裏�-0.170
nect-0.168
orage-0.167
Token often
Token ID1690
Feature activation+6.94e-02

Pos logit contributions
=#+0.010
 Neon+0.010
 裏�+0.009
nect+0.009
icultural+0.009
Neg logit contributions
tch-0.009
 Holo-0.009
Clark-0.009
 ruth-0.009
 Doyle-0.008
Token disliked
Token ID43252
Feature activation+1.60e-01

Pos logit contributions
tch+0.046
 Holo+0.045
 ruth+0.044
Clark+0.044
 Doyle+0.043
Neg logit contributions
=#-0.045
 Neon-0.044
 裏�-0.043
nect-0.042
orage-0.042
Token -
Token ID532
Feature activation+1.03e-01

Pos logit contributions
tch+0.104
 Holo+0.102
 ruth+0.099
Clark+0.098
 Doyle+0.096
Neg logit contributions
=#-0.106
 Neon-0.105
 裏�-0.101
nect-0.100
orage-0.099
Token in
Token ID287
Feature activation+3.24e-03

Pos logit contributions
tch+0.067
 Holo+0.066
 ruth+0.064
Clark+0.063
 Doyle+0.062
Neg logit contributions
=#-0.067
 Neon-0.067
 裏�-0.064
nect-0.063
icultural-0.063
TokenWe
Token ID1135
Feature activation-2.55e-02

Pos logit contributions
=#+0.009
 Neon+0.009
 裏�+0.009
nect+0.009
orage+0.009
Neg logit contributions
tch-0.010
 Holo-0.010
 ruth-0.009
Clark-0.009
 Doyle-0.009
Token pay
Token ID1414
Feature activation+6.01e-02

Pos logit contributions
=#+0.017
 Neon+0.016
 裏�+0.016
nect+0.016
orage+0.016
Neg logit contributions
tch-0.017
 Holo-0.017
 ruth-0.016
Clark-0.016
 Doyle-0.016
Token for
Token ID329
Feature activation-3.71e-02

Pos logit contributions
tch+0.041
 Holo+0.040
 ruth+0.039
Clark+0.038
 Doyle+0.038
Neg logit contributions
=#-0.038
 Neon-0.038
 裏�-0.036
nect-0.036
orage-0.036
Token your
Token ID534
Feature activation-5.85e-02

Pos logit contributions
=#+0.024
 Neon+0.024
 裏�+0.023
icultural+0.023
nect+0.023
Neg logit contributions
tch-0.024
 Holo-0.024
 ruth-0.024
Clark-0.023
 Doyle-0.023
Token stories
Token ID3923
Feature activation-5.44e-02

Pos logit contributions
=#+0.038
 Neon+0.038
 裏�+0.036
icultural+0.036
nect+0.036
Neg logit contributions
tch-0.038
 Holo-0.038
 ruth-0.037
Clark-0.036
 Doyle-0.036
Token!
Token ID0
Feature activation+5.95e-02

Pos logit contributions
=#+0.041
 Neon+0.040
 裏�+0.039
nect+0.038
orage+0.038
Neg logit contributions
tch-0.031
 Holo-0.030
 ruth-0.029
Clark-0.029
 Doyle-0.028
Token Do
Token ID2141
Feature activation+1.43e-01

Pos logit contributions
tch+0.024
 Holo+0.024
 ruth+0.023
Clark+0.023
 Doyle+0.022
Neg logit contributions
=#-0.053
 Neon-0.052
 裏�-0.051
nect-0.051
icultural-0.051
Token you
Token ID345
Feature activation-1.55e-02

Pos logit contributions
tch+0.081
 Holo+0.080
 ruth+0.077
Clark+0.076
 Doyle+0.075
Neg logit contributions
=#-0.104
 裏�-0.103
 Neon-0.103
nect-0.102
orage-0.100
Token have
Token ID423
Feature activation-5.19e-02

Pos logit contributions
=#+0.010
 Neon+0.010
 裏�+0.009
nect+0.009
icultural+0.009
Neg logit contributions
tch-0.010
 Holo-0.010
 ruth-0.010
Clark-0.010
 Doyle-0.010
Token a
Token ID257
Feature activation-3.27e-02

Pos logit contributions
=#+0.034
 Neon+0.033
 裏�+0.032
nect+0.032
icultural+0.031
Neg logit contributions
tch-0.034
 Holo-0.034
 ruth-0.032
Clark-0.032
 Doyle-0.032
Token story
Token ID1621
Feature activation-1.90e-01

Pos logit contributions
=#+0.016
 Neon+0.016
 裏�+0.015
nect+0.015
orage+0.015
Neg logit contributions
 Holo-0.026
tch-0.026
 ruth-0.026
Clark-0.025
 Doyle-0.025
Token to
Token ID284
Feature activation-3.37e-03

Pos logit contributions
=#+0.007
 Neon+0.007
 裏�+0.006
nect+0.006
orage+0.006
Neg logit contributions
tch-0.007
 Holo-0.007
 ruth-0.007
Clark-0.007
 Doyle-0.007
Token just
Token ID655
Feature activation+9.16e-03

Pos logit contributions
=#+0.002
 Neon+0.002
 裏�+0.002
nect+0.002
orage+0.002
Neg logit contributions
tch-0.002
 Holo-0.002
 ruth-0.002
Clark-0.002
 Doyle-0.002
Token send
Token ID3758
Feature activation-9.28e-02

Pos logit contributions
tch+0.007
 Holo+0.006
 ruth+0.006
Clark+0.006
 Doyle+0.006
Neg logit contributions
=#-0.006
 Neon-0.005
 裏�-0.005
nect-0.005
orage-0.005
Token the
Token ID262
Feature activation+1.78e-03

Pos logit contributions
=#+0.061
 Neon+0.061
 裏�+0.058
nect+0.058
orage+0.057
Neg logit contributions
tch-0.060
 Holo-0.059
 ruth-0.057
Clark-0.057
 Doyle-0.055
Token notification
Token ID14483
Feature activation+4.23e-03

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Token,
Token ID11
Feature activation-2.55e-03

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.003
nect-0.003
orage-0.003
Token and
Token ID290
Feature activation-2.25e-04

Pos logit contributions
=#+0.002
 Neon+0.002
 裏�+0.002
nect+0.002
orage+0.002
Neg logit contributions
tch-0.002
 Holo-0.002
 ruth-0.002
Clark-0.002
 Doyle-0.002
Token then
Token ID788
Feature activation-1.41e-01

Pos logit contributions
=#+0.000
 Neon+0.000
 裏�+0.000
nect+0.000
orage+0.000
Neg logit contributions
tch-0.000
 Holo-0.000
 ruth-0.000
Clark-0.000
 Doyle-0.000
Token go
Token ID467
Feature activation+2.18e-02

Pos logit contributions
=#+0.087
 Neon+0.086
 裏�+0.082
nect+0.081
orage+0.081
Neg logit contributions
tch-0.098
 Holo-0.096
 ruth-0.093
Clark-0.093
 Doyle-0.090
Token doing
Token ID1804
Feature activation+2.57e-02

Pos logit contributions
tch+0.015
 Holo+0.014
 ruth+0.014
Clark+0.014
 Doyle+0.013
Neg logit contributions
=#-0.014
 Neon-0.014
 裏�-0.013
nect-0.013
orage-0.013
Token something
Token ID1223
Feature activation+9.36e-02

Pos logit contributions
tch+0.016
 Holo+0.016
 ruth+0.015
Clark+0.015
 Doyle+0.015
Neg logit contributions
=#-0.018
 Neon-0.018
 裏�-0.017
nect-0.017
orage-0.017
Token people
Token ID661
Feature activation+3.76e-02

Pos logit contributions
tch+0.049
 Holo+0.048
 ruth+0.047
Clark+0.046
 Doyle+0.045
Neg logit contributions
=#-0.047
 Neon-0.046
 裏�-0.045
nect-0.044
orage-0.044
Token on
Token ID319
Feature activation-2.60e-02

Pos logit contributions
tch+0.025
 Holo+0.025
 ruth+0.024
Clark+0.024
 Doyle+0.023
Neg logit contributions
=#-0.024
 Neon-0.024
 裏�-0.023
nect-0.023
orage-0.022
Token the
Token ID262
Feature activation-4.09e-03

Pos logit contributions
=#+0.017
 Neon+0.017
 裏�+0.016
nect+0.016
orage+0.016
Neg logit contributions
tch-0.017
 Holo-0.016
 ruth-0.016
Clark-0.016
 Doyle-0.015
Token city
Token ID1748
Feature activation-3.37e-02

Pos logit contributions
=#+0.002
 Neon+0.002
 裏�+0.002
nect+0.002
icultural+0.002
Neg logit contributions
tch-0.003
 Holo-0.003
 ruth-0.003
Clark-0.003
 Doyle-0.003
Token council
Token ID6745
Feature activation+1.32e-01

Pos logit contributions
=#+0.024
 Neon+0.024
 裏�+0.023
nect+0.023
icultural+0.023
Neg logit contributions
tch-0.020
 Holo-0.020
 ruth-0.019
Clark-0.019
 Doyle-0.018
Token be
Token ID307
Feature activation+1.83e-02

Pos logit contributions
tch+0.088
 Holo+0.087
 ruth+0.084
Clark+0.084
 Doyle+0.082
Neg logit contributions
=#-0.084
 Neon-0.083
 裏�-0.080
nect-0.079
orage-0.079
Token making
Token ID1642
Feature activation-2.47e-02

Pos logit contributions
tch+0.012
 Holo+0.012
 ruth+0.011
Clark+0.011
 Doyle+0.011
Neg logit contributions
=#-0.012
 Neon-0.012
 裏�-0.011
nect-0.011
orage-0.011
Token this
Token ID428
Feature activation+1.30e-03

Pos logit contributions
=#+0.016
 Neon+0.016
 裏�+0.015
nect+0.015
orage+0.015
Neg logit contributions
tch-0.017
 Holo-0.016
 ruth-0.016
Clark-0.016
 Doyle-0.015
Token decision
Token ID2551
Feature activation-1.36e-02

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Token?
Token ID30
Feature activation+1.90e-02

Pos logit contributions
=#+0.009
 Neon+0.009
 裏�+0.008
nect+0.008
orage+0.008
Neg logit contributions
tch-0.009
 Holo-0.009
 ruth-0.009
Clark-0.009
 Doyle-0.008
Token
Token ID198
Feature activation-5.81e-03

Pos logit contributions
tch+0.012
 Holo+0.012
 ruth+0.012
Clark+0.012
 Doyle+0.011
Neg logit contributions
=#-0.012
 Neon-0.012
 裏�-0.012
nect-0.012
orage-0.012
Token
Token ID198
Feature activation+3.86e-03

Pos logit contributions
=#+0.001
 Neon+0.001
 裏�+0.001
nect+0.001
orage+0.001
Neg logit contributions
tch-0.001
 Holo-0.001
 ruth-0.001
Clark-0.001
 Doyle-0.001
TokenNo
Token ID2949
Feature activation-3.73e-02

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
orage-0.002
Token State
Token ID1812
Feature activation-1.91e-02

Pos logit contributions
=#+0.025
 Neon+0.025
 裏�+0.024
nect+0.024
orage+0.024
Neg logit contributions
tch-0.024
 Holo-0.023
 ruth-0.023
Clark-0.022
 Doyle-0.022
Token,
Token ID11
Feature activation+2.93e-03

Pos logit contributions
=#+0.011
 Neon+0.011
 裏�+0.011
nect+0.010
orage+0.010
Neg logit contributions
tch-0.014
 Holo-0.014
 ruth-0.013
Clark-0.013
 Doyle-0.013
Token territory
Token ID7674
Feature activation+7.85e-02

Pos logit contributions
tch+0.002
 Holo+0.002
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
orage-0.002
Token,
Token ID11
Feature activation+5.72e-03

Pos logit contributions
tch+0.050
 Holo+0.049
 ruth+0.048
Clark+0.047
 Doyle+0.046
Neg logit contributions
=#-0.053
 Neon-0.052
 裏�-0.050
nect-0.050
orage-0.049
Token or
Token ID393
Feature activation+6.67e-02

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.004
nect-0.004
orage-0.004
Token possession
Token ID7797
Feature activation+2.58e-02

Pos logit contributions
tch+0.039
 Holo+0.039
 ruth+0.038
Clark+0.037
 Doyle+0.036
Neg logit contributions
=#-0.048
 Neon-0.048
 裏�-0.046
nect-0.045
orage-0.045
Token of
Token ID286
Feature activation+1.45e-03

Pos logit contributions
tch+0.017
 Holo+0.017
 ruth+0.016
Clark+0.016
 Doyle+0.016
Neg logit contributions
=#-0.017
 Neon-0.017
 裏�-0.016
nect-0.016
orage-0.016
Token the
Token ID262
Feature activation+6.22e-03

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Token United
Token ID1578
Feature activation+2.04e-01

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
 Neon-0.005
=#-0.005
 裏�-0.005
nect-0.005
orage-0.005
Token that
Token ID326
Feature activation+1.16e-02

Pos logit contributions
=#+0.064
 Neon+0.064
 裏�+0.061
nect+0.060
icultural+0.060
Neg logit contributions
tch-0.067
 Holo-0.066
 ruth-0.064
Clark-0.063
 Doyle-0.062
Token might
Token ID1244
Feature activation-8.76e-02

Pos logit contributions
tch+0.008
 Holo+0.008
 ruth+0.008
Clark+0.007
 Doyle+0.007
Neg logit contributions
=#-0.007
 Neon-0.007
 裏�-0.007
nect-0.007
orage-0.007
Token deter
Token ID2206
Feature activation-1.14e-01

Pos logit contributions
=#+0.057
 Neon+0.056
 裏�+0.054
nect+0.053
orage+0.053
Neg logit contributions
tch-0.058
 Holo-0.057
 ruth-0.055
Clark-0.055
 Doyle-0.054
Token future
Token ID2003
Feature activation-1.45e-02

Pos logit contributions
=#+0.075
 Neon+0.074
 裏�+0.071
nect+0.070
icultural+0.070
Neg logit contributions
tch-0.075
 Holo-0.073
 ruth-0.071
Clark-0.071
 Doyle-0.069
Token attacks
Token ID3434
Feature activation+5.02e-02

Pos logit contributions
=#+0.010
 Neon+0.010
 裏�+0.009
nect+0.009
orage+0.009
Neg logit contributions
tch-0.009
 Holo-0.009
 ruth-0.009
Clark-0.009
 Doyle-0.008
Token.
Token ID13
Feature activation+3.61e-03

Pos logit contributions
tch+0.033
 Holo+0.033
 ruth+0.032
Clark+0.032
 Doyle+0.031
Neg logit contributions
=#-0.032
 Neon-0.032
 裏�-0.031
nect-0.030
orage-0.030
Token Indeed
Token ID9676
Feature activation+2.40e-02

Pos logit contributions
tch+0.002
 Holo+0.002
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
icultural-0.002
Token,
Token ID11
Feature activation+6.56e-03

Pos logit contributions
tch+0.016
 Holo+0.016
 ruth+0.015
Clark+0.015
 Doyle+0.015
Neg logit contributions
=#-0.016
 Neon-0.016
 裏�-0.015
nect-0.015
orage-0.015
Token I
Token ID314
Feature activation+1.88e-02

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.004
 Doyle+0.004
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.004
nect-0.004
orage-0.004
Token wish
Token ID4601
Feature activation+2.34e-01

Pos logit contributions
tch+0.013
 Holo+0.013
 ruth+0.012
Clark+0.012
 Doyle+0.012
Neg logit contributions
=#-0.012
 Neon-0.012
 裏�-0.011
nect-0.011
orage-0.011
Token there
Token ID612
Feature activation+5.93e-04

Pos logit contributions
tch+0.154
 Holo+0.151
 ruth+0.147
Clark+0.146
 Doyle+0.142
Neg logit contributions
=#-0.153
 Neon-0.151
 裏�-0.145
nect-0.143
orage-0.143
Token to
Token ID284
Feature activation+4.39e-03

Pos logit contributions
=#+0.031
 Neon+0.030
 裏�+0.029
nect+0.029
orage+0.029
Neg logit contributions
tch-0.031
 Holo-0.031
 ruth-0.030
Clark-0.029
 Doyle-0.029
Token trip
Token ID5296
Feature activation+1.21e-02

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.003
nect-0.003
orage-0.003
Token?
Token ID30
Feature activation+3.02e-02

Pos logit contributions
tch+0.008
 Holo+0.008
 ruth+0.008
Clark+0.007
 Doyle+0.007
Neg logit contributions
=#-0.008
 Neon-0.008
 裏�-0.008
nect-0.007
orage-0.007
Token The
Token ID383
Feature activation-2.44e-02

Pos logit contributions
tch+0.021
 Holo+0.020
 ruth+0.020
Clark+0.019
 Doyle+0.019
Neg logit contributions
=#-0.019
 Neon-0.019
 裏�-0.018
nect-0.018
orage-0.018
Token most
Token ID749
Feature activation-5.18e-02

Pos logit contributions
=#+0.016
 Neon+0.016
 裏�+0.015
nect+0.015
orage+0.015
Neg logit contributions
tch-0.016
 Holo-0.016
 ruth-0.015
Clark-0.015
 Doyle-0.015
Token important
Token ID1593
Feature activation-4.15e-02

Pos logit contributions
=#+0.032
 Neon+0.032
 裏�+0.030
nect+0.030
orage+0.030
Neg logit contributions
tch-0.036
 Holo-0.035
 ruth-0.034
Clark-0.034
 Doyle-0.033
Token question
Token ID1808
Feature activation-1.02e-01

Pos logit contributions
=#+0.026
 Neon+0.025
 裏�+0.024
nect+0.024
orage+0.024
Neg logit contributions
tch-0.029
 Holo-0.028
 ruth-0.028
Clark-0.027
 Doyle-0.027
Token to
Token ID284
Feature activation+1.15e-02

Pos logit contributions
=#+0.067
 Neon+0.066
 裏�+0.064
nect+0.063
orage+0.062
Neg logit contributions
tch-0.067
 Holo-0.066
 ruth-0.064
Clark-0.064
 Doyle-0.062
Token answer
Token ID3280
Feature activation-3.40e-02

Pos logit contributions
tch+0.008
 Holo+0.007
 ruth+0.007
Clark+0.007
 Doyle+0.007
Neg logit contributions
=#-0.008
 Neon-0.007
 裏�-0.007
nect-0.007
orage-0.007
Token is
Token ID318
Feature activation-4.28e-03

Pos logit contributions
=#+0.022
 Neon+0.022
 裏�+0.021
nect+0.021
orage+0.021
Neg logit contributions
tch-0.023
 Holo-0.022
 ruth-0.022
Clark-0.021
 Doyle-0.021
Token how
Token ID703
Feature activation-3.53e-02

Pos logit contributions
=#+0.003
 Neon+0.003
 裏�+0.003
nect+0.003
orage+0.003
Neg logit contributions
tch-0.003
 Holo-0.003
 ruth-0.003
Clark-0.003
 Doyle-0.003
Token
Token ID198
Feature activation-4.07e-03

Pos logit contributions
tch+0.002
 Holo+0.002
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
orage-0.002
Token128
Token ID12762
Feature activation+9.59e-03

Pos logit contributions
=#+0.003
 Neon+0.003
 裏�+0.003
nect+0.003
orage+0.003
Neg logit contributions
tch-0.002
 Holo-0.002
 ruth-0.002
Clark-0.002
 Doyle-0.002
Token.
Token ID13
Feature activation-9.56e-03

Pos logit contributions
tch+0.006
 Holo+0.006
 ruth+0.006
Clark+0.006
 Doyle+0.006
Neg logit contributions
=#-0.006
 Neon-0.006
 裏�-0.006
nect-0.006
orage-0.006
Token8
Token ID23
Feature activation-1.09e-01

Pos logit contributions
=#+0.006
 Neon+0.005
 裏�+0.005
nect+0.005
orage+0.005
Neg logit contributions
tch-0.007
 Holo-0.007
 ruth-0.007
Clark-0.007
 Doyle-0.006
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
=#+0.070
 Neon+0.069
 裏�+0.066
nect+0.066
orage+0.065
Neg logit contributions
tch-0.073
 Holo-0.071
 ruth-0.069
Clark-0.069
 Doyle-0.067
Token
Token ID198
Feature activation-9.55e-03

Pos logit contributions
=#+0.008
 Neon+0.008
 裏�+0.007
nect+0.007
orage+0.007
Neg logit contributions
tch-0.008
 Holo-0.008
 ruth-0.008
Clark-0.008
 Doyle-0.008
Token9
Token ID24
Feature activation-1.32e-01

Pos logit contributions
=#+0.006
 Neon+0.006
 裏�+0.005
nect+0.005
orage+0.005
Neg logit contributions
tch-0.007
 Holo-0.007
 ruth-0.007
Clark-0.006
 Doyle-0.006
Token Yemen
Token ID12492
Feature activation-5.57e-02

Pos logit contributions
=#+0.089
 Neon+0.089
 裏�+0.085
nect+0.084
orage+0.084
Neg logit contributions
tch-0.084
 Holo-0.082
 ruth-0.080
Clark-0.079
 Doyle-0.077
Token
Token ID198
Feature activation-1.56e-02

Pos logit contributions
=#+0.036
 Neon+0.036
 裏�+0.035
nect+0.034
orage+0.034
Neg logit contributions
tch-0.037
 Holo-0.036
 ruth-0.035
Clark-0.034
 Doyle-0.034
Token
Token ID198
Feature activation-1.87e-02

Pos logit contributions
=#+0.010
 Neon+0.010
 裏�+0.009
nect+0.009
orage+0.009
Neg logit contributions
tch-0.011
 Holo-0.010
 ruth-0.010
Clark-0.010
 Doyle-0.010
Token128
Token ID12762
Feature activation-5.30e-03

Pos logit contributions
=#+0.013
 Neon+0.013
 裏�+0.013
nect+0.013
orage+0.012
Neg logit contributions
tch-0.011
 Holo-0.011
 ruth-0.011
Clark-0.011
 Doyle-0.010
Token Start
Token ID7253
Feature activation+5.14e-03

Pos logit contributions
tch+0.035
 Holo+0.034
 ruth+0.033
Clark+0.033
 Doyle+0.032
Neg logit contributions
=#-0.035
 Neon-0.035
 裏�-0.033
nect-0.033
orage-0.033
Token of
Token ID286
Feature activation-5.57e-03

Pos logit contributions
tch+0.003
 Holo+0.003
 ruth+0.003
Clark+0.003
 Doyle+0.003
Neg logit contributions
=#-0.003
 Neon-0.003
 裏�-0.003
nect-0.003
orage-0.003
Token system
Token ID1080
Feature activation-6.76e-02

Pos logit contributions
=#+0.004
 Neon+0.004
 裏�+0.004
nect+0.004
orage+0.004
Neg logit contributions
tch-0.003
 Holo-0.003
 ruth-0.003
Clark-0.003
 Doyle-0.003
Token()
Token ID3419
Feature activation+7.11e-03

Pos logit contributions
=#+0.045
 Neon+0.045
 裏�+0.043
nect+0.042
orage+0.042
Neg logit contributions
tch-0.044
 Holo-0.043
 ruth-0.042
Clark-0.041
 Doyle-0.040
Token'
Token ID6
Feature activation-6.34e-02

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.004
 Doyle+0.004
Neg logit contributions
=#-0.005
 Neon-0.005
 裏�-0.005
nect-0.005
orage-0.005
Tokens
Token ID82
Feature activation-1.03e-02

Pos logit contributions
=#+0.044
 Neon+0.043
 裏�+0.041
nect+0.041
orage+0.041
Neg logit contributions
tch-0.039
 Holo-0.039
 ruth-0.038
Clark-0.037
 Doyle-0.036
Token stack
Token ID8931
Feature activation+5.01e-02

Pos logit contributions
=#+0.007
 Neon+0.007
 裏�+0.007
nect+0.007
orage+0.007
Neg logit contributions
tch-0.006
 Holo-0.006
 ruth-0.006
Clark-0.006
 Doyle-0.006
Token frame
Token ID5739
Feature activation-1.99e-02

Pos logit contributions
tch+0.032
 Holo+0.032
 ruth+0.031
Clark+0.031
 Doyle+0.030
Neg logit contributions
=#-0.033
 Neon-0.033
 裏�-0.032
nect-0.031
orage-0.031
Token |
Token ID930
Feature activation-1.13e-01

Pos logit contributions
=#+0.014
 Neon+0.014
 裏�+0.013
nect+0.013
orage+0.013
Neg logit contributions
tch-0.012
 Holo-0.012
 ruth-0.011
Clark-0.011
 Doyle-0.011
Token void
Token ID7951
Feature activation-1.02e-01

Pos logit contributions
=#+0.080
 Neon+0.079
 裏�+0.076
nect+0.075
orage+0.075
Neg logit contributions
tch-0.069
 Holo-0.068
 ruth-0.066
Clark-0.065
 Doyle-0.063
Token *
Token ID1635
Feature activation+7.50e-02

Pos logit contributions
=#+0.065
 Neon+0.065
 裏�+0.062
nect+0.061
orage+0.061
Neg logit contributions
tch-0.069
 Holo-0.068
 ruth-0.066
Clark-0.065
 Doyle-0.064
Tokenanda
Token ID5282
Feature activation+8.25e-02

Pos logit contributions
=#+0.039
 Neon+0.038
nect+0.037
orage+0.037
 裏�+0.037
Neg logit contributions
tch-0.030
 Holo-0.030
 ruth-0.029
Clark-0.028
 Doyle-0.028
Token intended
Token ID5292
Feature activation-2.55e-02

Pos logit contributions
tch+0.055
 Holo+0.054
 ruth+0.052
Clark+0.052
 Doyle+0.050
Neg logit contributions
=#-0.053
 Neon-0.053
 裏�-0.051
nect-0.050
orage-0.050
Token to
Token ID284
Feature activation-1.63e-03

Pos logit contributions
=#+0.017
 Neon+0.016
 裏�+0.016
nect+0.016
orage+0.016
Neg logit contributions
tch-0.017
 Holo-0.017
 ruth-0.016
Clark-0.016
 Doyle-0.016
Token help
Token ID1037
Feature activation+6.38e-02

Pos logit contributions
=#+0.001
 Neon+0.001
 裏�+0.001
nect+0.001
orage+0.001
Neg logit contributions
tch-0.001
 Holo-0.001
 ruth-0.001
Clark-0.001
 Doyle-0.001
Token minority
Token ID9137
Feature activation-6.65e-02

Pos logit contributions
tch+0.040
 Holo+0.039
 ruth+0.038
Clark+0.038
 Doyle+0.037
Neg logit contributions
=#-0.044
 Neon-0.043
 裏�-0.042
nect-0.041
orage-0.041
Token communities
Token ID5348
Feature activation-1.07e-01

Pos logit contributions
=#+0.040
 Neon+0.040
 裏�+0.038
nect+0.037
icultural+0.037
Neg logit contributions
tch-0.047
 Holo-0.046
 ruth-0.045
Clark-0.045
 Doyle-0.044
Token,
Token ID11
Feature activation-7.16e-03

Pos logit contributions
=#+0.068
 Neon+0.068
 裏�+0.065
nect+0.064
orage+0.064
Neg logit contributions
tch-0.071
 Holo-0.070
 ruth-0.068
Clark-0.067
 Doyle-0.066
Token as
Token ID355
Feature activation-2.28e-02

Pos logit contributions
=#+0.005
 Neon+0.005
 裏�+0.004
nect+0.004
orage+0.004
Neg logit contributions
tch-0.005
 Holo-0.005
 ruth-0.005
Clark-0.005
 Doyle-0.004
Token well
Token ID880
Feature activation-1.72e-02

Pos logit contributions
=#+0.014
 Neon+0.014
 裏�+0.014
nect+0.014
icultural+0.013
Neg logit contributions
tch-0.016
 Holo-0.015
 ruth-0.015
Clark-0.015
 Doyle-0.014
Token as
Token ID355
Feature activation-2.23e-02

Pos logit contributions
=#+0.011
 Neon+0.011
 裏�+0.010
nect+0.010
orage+0.010
Neg logit contributions
tch-0.012
 Holo-0.011
 ruth-0.011
Clark-0.011
 Doyle-0.011
Token local
Token ID1957
Feature activation-1.07e-02

Pos logit contributions
=#+0.014
 Neon+0.014
 裏�+0.014
nect+0.014
orage+0.013
Neg logit contributions
tch-0.015
 Holo-0.015
 ruth-0.014
Clark-0.014
 Doyle-0.014
Token board
Token ID3096
Feature activation+2.93e-01

Pos logit contributions
=#+0.003
 Neon+0.003
 裏�+0.003
nect+0.003
orage+0.003
Neg logit contributions
tch-0.003
 Holo-0.003
 ruth-0.003
Clark-0.003
 Doyle-0.003
Token her
Token ID607
Feature activation-1.55e-02

Pos logit contributions
tch+0.183
 Holo+0.180
 ruth+0.174
Clark+0.172
 Doyle+0.168
Neg logit contributions
=#-0.201
 Neon-0.199
 裏�-0.192
nect-0.190
icultural-0.189
Token flight
Token ID5474
Feature activation-1.99e-02

Pos logit contributions
=#+0.011
 Neon+0.011
 裏�+0.010
nect+0.010
icultural+0.010
Neg logit contributions
tch-0.010
 Holo-0.010
 ruth-0.009
Clark-0.009
 Doyle-0.009
Token in
Token ID287
Feature activation+2.69e-03

Pos logit contributions
=#+0.013
 Neon+0.013
 裏�+0.012
nect+0.012
orage+0.012
Neg logit contributions
tch-0.013
 Holo-0.013
 ruth-0.012
Clark-0.012
 Doyle-0.012
Token Malaysia
Token ID15336
Feature activation+8.90e-02

Pos logit contributions
tch+0.002
 Holo+0.002
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
icultural-0.002
Token,
Token ID11
Feature activation-1.30e-02

Pos logit contributions
tch+0.058
 Holo+0.057
 ruth+0.055
Clark+0.055
 Doyle+0.054
Neg logit contributions
=#-0.059
 Neon-0.058
 裏�-0.056
nect-0.055
orage-0.055
Token and
Token ID290
Feature activation-4.27e-03

Pos logit contributions
=#+0.009
 Neon+0.009
 裏�+0.008
nect+0.008
orage+0.008
Neg logit contributions
tch-0.008
 Holo-0.008
 ruth-0.008
Clark-0.008
 Doyle-0.008
Token the
Token ID262
Feature activation-8.23e-03

Pos logit contributions
=#+0.003
 Neon+0.003
 裏�+0.003
nect+0.003
orage+0.003
Neg logit contributions
tch-0.003
 Holo-0.003
 ruth-0.003
Clark-0.003
 Doyle-0.003
Token DOJ
Token ID27055
Feature activation+7.84e-02

Pos logit contributions
=#+0.006
 Neon+0.006
 裏�+0.005
nect+0.005
orage+0.005
Neg logit contributions
tch-0.005
 Holo-0.005
 ruth-0.005
Clark-0.005
 Doyle-0.005
Token claimed
Token ID4752
Feature activation+7.71e-02

Pos logit contributions
tch+0.050
 Holo+0.049
 ruth+0.048
Clark+0.047
 Doyle+0.046
Neg logit contributions
=#-0.053
 Neon-0.052
 裏�-0.050
nect-0.050
orage-0.050
Token,
Token ID11
Feature activation-1.30e-02

Pos logit contributions
tch+0.049
 Holo+0.048
 ruth+0.047
Clark+0.046
 Doyle+0.045
Neg logit contributions
=#-0.052
 Neon-0.051
 裏�-0.050
nect-0.049
icultural-0.049
Tokenvec
Token ID35138
Feature activation+2.04e-01

Pos logit contributions
=#+0.009
 Neon+0.009
 裏�+0.009
nect+0.009
icultural+0.009
Neg logit contributions
tch-0.007
 Holo-0.007
 ruth-0.007
Clark-0.007
 Doyle-0.007
Token e
Token ID304
Feature activation+3.19e-02

Pos logit contributions
tch+0.142
 Holo+0.139
 ruth+0.135
Clark+0.135
 Doyle+0.131
Neg logit contributions
=#-0.125
 Neon-0.125
 裏�-0.119
nect-0.118
orage-0.117
Tokenux
Token ID2821
Feature activation+1.56e-02

Pos logit contributions
tch+0.021
 Holo+0.021
 ruth+0.020
Clark+0.020
 Doyle+0.020
Neg logit contributions
=#-0.021
 Neon-0.020
 裏�-0.020
nect-0.019
orage-0.019
Token la
Token ID8591
Feature activation-1.06e-02

Pos logit contributions
tch+0.011
 Holo+0.010
 ruth+0.010
Clark+0.010
 Doyle+0.010
Neg logit contributions
=#-0.010
 Neon-0.010
 裏�-0.009
nect-0.009
orage-0.009
Token mo
Token ID6941
Feature activation+1.04e-01

Pos logit contributions
=#+0.007
 Neon+0.007
 裏�+0.007
nect+0.007
orage+0.007
Neg logit contributions
tch-0.007
 Holo-0.007
 ruth-0.007
Clark-0.007
 Doyle-0.006
Tokeniti
Token ID8846
Feature activation-2.72e-01

Pos logit contributions
tch+0.079
 Holo+0.078
 ruth+0.076
Clark+0.075
 Doyle+0.074
Neg logit contributions
=#-0.058
 Neon-0.057
 裏�-0.055
nect-0.054
orage-0.054
Tokené
Token ID2634
Feature activation+3.40e-02

Pos logit contributions
=#+0.186
 Neon+0.185
 裏�+0.178
nect+0.176
orage+0.176
Neg logit contributions
tch-0.170
 Holo-0.167
 ruth-0.161
Clark-0.160
 Doyle-0.155
Token des
Token ID748
Feature activation-1.57e-01

Pos logit contributions
tch+0.024
 Holo+0.024
 ruth+0.023
Clark+0.023
 Doyle+0.022
Neg logit contributions
=#-0.021
 Neon-0.020
 裏�-0.020
nect-0.019
icultural-0.019
Token Cors
Token ID26978
Feature activation+1.51e-01

Pos logit contributions
 Neon+0.101
=#+0.101
 裏�+0.097
nect+0.096
icultural+0.095
Neg logit contributions
tch-0.104
 Holo-0.102
Clark-0.099
 ruth-0.099
 Doyle-0.096
Tokenaires
Token ID17693
Feature activation+2.02e-01

Pos logit contributions
tch+0.065
 Holo+0.063
 ruth+0.060
Clark+0.060
 Doyle+0.057
Neg logit contributions
=#-0.133
 Neon-0.132
 裏�-0.129
nect-0.128
orage-0.127
Token ay
Token ID38762
Feature activation+8.42e-02

Pos logit contributions
tch+0.140
 Holo+0.138
 ruth+0.134
Clark+0.132
 Doyle+0.129
Neg logit contributions
=#-0.125
 Neon-0.124
 裏�-0.119
nect-0.117
orage-0.117
Token producer
Token ID9920
Feature activation-5.94e-02

Pos logit contributions
tch+0.002
 Holo+0.002
 ruth+0.002
Clark+0.002
 Doyle+0.002
Neg logit contributions
=#-0.002
 Neon-0.002
 裏�-0.002
nect-0.002
orage-0.002
Token He
Token ID679
Feature activation+5.72e-02

Pos logit contributions
=#+0.036
 Neon+0.035
 裏�+0.034
nect+0.034
orage+0.033
Neg logit contributions
tch-0.042
 Holo-0.041
 ruth-0.040
Clark-0.040
 Doyle-0.039
Tokenik
Token ID1134
Feature activation-6.61e-02

Pos logit contributions
tch+0.038
 Holo+0.037
 ruth+0.036
Clark+0.036
 Doyle+0.035
Neg logit contributions
=#-0.037
 Neon-0.037
 裏�-0.035
nect-0.035
orage-0.035
Tokenki
Token ID4106
Feature activation-5.32e-02

Pos logit contributions
=#+0.037
 Neon+0.036
 裏�+0.034
nect+0.034
orage+0.034
Neg logit contributions
tch-0.050
 Holo-0.049
 ruth-0.048
Clark-0.048
 Doyle-0.047
Token M
Token ID337
Feature activation+6.63e-02

Pos logit contributions
=#+0.033
 Neon+0.032
 裏�+0.031
nect+0.031
orage+0.030
Neg logit contributions
tch-0.037
 Holo-0.036
 ruth-0.035
Clark-0.035
 Doyle-0.034
Tokenä
Token ID11033
Feature activation-2.64e-01

Pos logit contributions
tch+0.053
 Holo+0.052
 ruth+0.051
Clark+0.051
 Doyle+0.050
Neg logit contributions
=#-0.034
 Neon-0.034
 裏�-0.032
nect-0.031
icultural-0.031
Tokenen
Token ID268
Feature activation+2.58e-01

Pos logit contributions
=#+0.245
 Neon+0.243
 裏�+0.237
nect+0.235
orage+0.234
Neg logit contributions
tch-0.101
 Holo-0.098
 ruth-0.093
Clark-0.091
 Doyle-0.087
Tokenp
Token ID79
Feature activation-1.63e-02

Pos logit contributions
tch+0.186
 Holo+0.183
 ruth+0.178
Clark+0.176
 Doyle+0.172
Neg logit contributions
=#-0.153
 Neon-0.151
 裏�-0.145
nect-0.143
orage-0.142
Tokenä
Token ID11033
Feature activation-2.65e-01

Pos logit contributions
=#+0.007
 Neon+0.007
 裏�+0.007
nect+0.006
orage+0.006
Neg logit contributions
tch-0.014
 Holo-0.014
 ruth-0.014
Clark-0.014
 Doyle-0.014
Tokenä
Token ID11033
Feature activation-2.67e-01

Pos logit contributions
=#+0.107
 Neon+0.106
 裏�+0.099
nect+0.098
orage+0.096
Neg logit contributions
tch-0.239
 Holo-0.237
 ruth-0.231
Clark-0.230
 Doyle-0.226
Token.
Token ID13
Feature activation+4.73e-03

Pos logit contributions
=#+0.154
 Neon+0.153
 裏�+0.146
nect+0.145
orage+0.143
Neg logit contributions
tch-0.194
 Holo-0.192
 ruth-0.187
Clark-0.185
 Doyle-0.181
Token where
Token ID810
Feature activation+4.46e-02

Pos logit contributions
=#+0.001
 Neon+0.001
 裏�+0.001
nect+0.001
orage+0.001
Neg logit contributions
tch-0.001
 Holo-0.001
 ruth-0.001
Clark-0.001
 Doyle-0.001
Token you
Token ID345
Feature activation+1.29e-02

Pos logit contributions
tch+0.029
 Holo+0.029
 ruth+0.028
Clark+0.028
 Doyle+0.027
Neg logit contributions
=#-0.029
 Neon-0.029
 裏�-0.028
nect-0.028
orage-0.027
Token stripped
Token ID18818
Feature activation-4.01e-02

Pos logit contributions
tch+0.009
 Holo+0.008
 ruth+0.008
Clark+0.008
 Doyle+0.008
Neg logit contributions
=#-0.008
 Neon-0.008
 裏�-0.008
nect-0.008
orage-0.008
Token back
Token ID736
Feature activation-6.75e-02

Pos logit contributions
=#+0.026
 Neon+0.026
 裏�+0.025
nect+0.024
orage+0.024
Neg logit contributions
tch-0.027
 Holo-0.026
 ruth-0.025
Clark-0.025
 Doyle-0.024
Token the
Token ID262
Feature activation+1.23e-02

Pos logit contributions
=#+0.044
 Neon+0.044
 裏�+0.042
nect+0.042
orage+0.041
Neg logit contributions
tch-0.044
 Holo-0.044
 ruth-0.042
Clark-0.042
 Doyle-0.041
Token shield
Token ID7614
Feature activation-2.79e-01

Pos logit contributions
tch+0.009
 Holo+0.009
 ruth+0.009
Clark+0.008
 Doyle+0.008
Neg logit contributions
=#-0.007
 Neon-0.007
 裏�-0.007
nect-0.007
orage-0.007
Token.
Token ID13
Feature activation+1.15e-03

Pos logit contributions
=#+0.180
 Neon+0.179
 裏�+0.171
nect+0.170
orage+0.169
Neg logit contributions
tch-0.185
 Holo-0.182
 ruth-0.176
Clark-0.175
 Doyle-0.170
Token Plug
Token ID22689
Feature activation-4.39e-02

Pos logit contributions
tch+0.001
 Holo+0.001
 ruth+0.001
Clark+0.001
 Doyle+0.001
Neg logit contributions
=#-0.001
 Neon-0.001
 裏�-0.001
nect-0.001
orage-0.001
Token this
Token ID428
Feature activation+2.83e-04

Pos logit contributions
=#+0.029
 Neon+0.029
 裏�+0.028
nect+0.027
orage+0.027
Neg logit contributions
tch-0.029
 Holo-0.028
 ruth-0.027
Clark-0.027
 Doyle-0.026
Token loop
Token ID9052
Feature activation+1.76e-01

Pos logit contributions
tch+0.000
 Holo+0.000
 ruth+0.000
Clark+0.000
 Doyle+0.000
Neg logit contributions
=#-0.000
 Neon-0.000
 裏�-0.000
nect-0.000
orage-0.000
Token into
Token ID656
Feature activation-6.46e-02

Pos logit contributions
tch+0.123
 Holo+0.121
 ruth+0.117
Clark+0.116
 Doyle+0.114
Neg logit contributions
=#-0.109
 Neon-0.108
 裏�-0.103
nect-0.102
orage-0.101
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
tch+0.106
 Holo+0.105
 ruth+0.103
Clark+0.100
 Doyle+0.099
Neg logit contributions
=#-0.109
 Neon-0.108
 裏�-0.104
nect-0.103
icultural-0.102
Token You
Token ID921
Feature activation+2.13e-02

Pos logit contributions
tch+0.008
 Holo+0.008
 ruth+0.008
Clark+0.008
 Doyle+0.007
Neg logit contributions
=#-0.008
 Neon-0.008
 裏�-0.007
nect-0.007
icultural-0.007
Token might
Token ID1244
Feature activation-9.19e-02

Pos logit contributions
tch+0.014
 Holo+0.014
 ruth+0.014
Clark+0.014
 Doyle+0.013
Neg logit contributions
=#-0.014
 Neon-0.013
 裏�-0.013
nect-0.013
orage-0.013
Token think
Token ID892
Feature activation-5.18e-02

Pos logit contributions
=#+0.059
 Neon+0.058
 裏�+0.056
nect+0.055
orage+0.055
Neg logit contributions
tch-0.062
 Holo-0.061
 ruth-0.059
Clark-0.059
 Doyle-0.057
Token of
Token ID286
Feature activation+6.99e-03

Pos logit contributions
=#+0.034
 Neon+0.033
 裏�+0.032
nect+0.032
orage+0.032
Neg logit contributions
tch-0.034
 Holo-0.034
 ruth-0.033
Clark-0.032
 Doyle-0.032
Token Swed
Token ID7289
Feature activation-3.40e-01

Pos logit contributions
tch+0.005
 Holo+0.005
 ruth+0.005
Clark+0.005
 Doyle+0.004
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.004
nect-0.004
orage-0.004
Tokenes
Token ID274
Feature activation+5.57e-02

Pos logit contributions
=#+0.240
 Neon+0.239
 裏�+0.230
nect+0.227
orage+0.226
Neg logit contributions
tch-0.206
 Holo-0.202
 ruth-0.195
Clark-0.193
 Doyle-0.188
Token as
Token ID355
Feature activation-1.42e-02

Pos logit contributions
tch+0.037
 Holo+0.037
 ruth+0.035
Clark+0.035
 Doyle+0.034
Neg logit contributions
=#-0.036
 Neon-0.035
 裏�-0.034
nect-0.034
orage-0.033
Token upright
Token ID24826
Feature activation-1.15e-01

Pos logit contributions
=#+0.009
 Neon+0.009
 裏�+0.009
nect+0.009
icultural+0.009
Neg logit contributions
tch-0.009
 Holo-0.009
Clark-0.009
 ruth-0.009
 Doyle-0.009
Token social
Token ID1919
Feature activation+1.61e-01

Pos logit contributions
=#+0.077
 Neon+0.076
 裏�+0.073
nect+0.072
icultural+0.072
Neg logit contributions
tch-0.075
 Holo-0.073
 ruth-0.071
Clark-0.071
 Doyle-0.069
Token democr
Token ID5084
Feature activation-6.91e-02

Pos logit contributions
tch+0.116
 Holo+0.114
 ruth+0.111
Clark+0.110
 Doyle+0.108
Neg logit contributions
=#-0.094
 Neon-0.093
 裏�-0.089
nect-0.088
orage-0.088
Token to
Token ID284
Feature activation+1.21e-02

Pos logit contributions
tch+0.142
 Holo+0.139
 ruth+0.135
Clark+0.134
 Doyle+0.130
Neg logit contributions
=#-0.135
 Neon-0.133
 裏�-0.127
nect-0.126
orage-0.125
Token the
Token ID262
Feature activation+6.53e-03

Pos logit contributions
tch+0.008
 Holo+0.008
 ruth+0.008
Clark+0.008
 Doyle+0.007
Neg logit contributions
=#-0.008
 Neon-0.008
 裏�-0.007
nect-0.007
orage-0.007
Token website
Token ID3052
Feature activation+1.88e-02

Pos logit contributions
tch+0.004
 Holo+0.004
 ruth+0.004
Clark+0.004
 Doyle+0.004
Neg logit contributions
=#-0.004
 Neon-0.004
 裏�-0.004
nect-0.004
orage-0.004
Token owned
Token ID6898
Feature activation+2.66e-02

Pos logit contributions
tch+0.014
 Holo+0.013
 ruth+0.013
Clark+0.013
 Doyle+0.013
Neg logit contributions
=#-0.011
 Neon-0.011
 裏�-0.010
nect-0.010
orage-0.010
Token by
Token ID416
Feature activation+9.79e-03

Pos logit contributions
tch+0.017
 Holo+0.017
 ruth+0.016
Clark+0.016
 Doyle+0.016
Neg logit contributions
=#-0.018
 Neon-0.018
 裏�-0.017
nect-0.017
orage-0.017
Token Alex
Token ID4422
Feature activation-2.75e-01

Pos logit contributions
tch+0.007
 Holo+0.007
 ruth+0.006
Clark+0.006
 Doyle+0.006
Neg logit contributions
=#-0.006
 Neon-0.006
 裏�-0.006
nect-0.006
orage-0.006
Token Jones
Token ID5437
Feature activation+5.68e-02

Pos logit contributions
=#+0.200
 Neon+0.199
 裏�+0.192
nect+0.190
orage+0.189
Neg logit contributions
tch-0.160
 Holo-0.157
 ruth-0.152
Clark-0.151
 Doyle-0.147
Token that
Token ID326
Feature activation+1.21e-02

Pos logit contributions
tch+0.038
 Holo+0.037
 ruth+0.036
Clark+0.036
 Doyle+0.035
Neg logit contributions
=#-0.036
 Neon-0.036
 裏�-0.035
nect-0.034
orage-0.034
Token propag
Token ID8928
Feature activation+1.33e-01

Pos logit contributions
tch+0.008
 Holo+0.008
 ruth+0.007
Clark+0.007
 Doyle+0.007
Neg logit contributions
=#-0.008
 Neon-0.008
 裏�-0.008
nect-0.007
orage-0.007
Tokenates
Token ID689
Feature activation+2.97e-02

Pos logit contributions
tch+0.091
 Holo+0.090
 ruth+0.088
Clark+0.087
 Doyle+0.085
Neg logit contributions
=#-0.082
 Neon-0.082
 裏�-0.078
nect-0.077
orage-0.077
Token conspiracy
Token ID10086
Feature activation-8.54e-02

Pos logit contributions
tch+0.021
 Holo+0.020
 ruth+0.020
Clark+0.020
 Doyle+0.019
Neg logit contributions
=#-0.018
 Neon-0.018
 裏�-0.017
nect-0.017
orage-0.017