TOP ACTIVATIONS
MAX = 2.124

of Anarchy star Charlie Hun nam is being eyed for the
. While Hun nam is the star of the
yet but it appears Hun nam may be the frontrunner for
its two leads . Hun nam joins a list of actors
, because my instinct , hon ed by a lifetime of
was the group 's biggest hon ore e , collecting the
the entire series and G phan ultimately ends up doing nothing
- a skill that was hon ed as a child in
iana Mas l any Or phan Black BBC America Matthew Mc
s Office of Or phan Products Development .
auer wrote that Tal ent hits a target no
founder of Cot é Or phan Consulting in Silver Spring and
to know . Tal os - 1 isn
the team s fortunes that spring , posting a
. Though shows like Or phan Black and The Vampire Di
on chains , Iran y anking them Check it
activities include running , pl anking and lung es . (
| Select - Exp and Property B ounds | Select Width
Wilson . R ANK 4 Xavier Rhodes , Vikings
iggs . R ANK 2 Richard Sherman , Seahawks

MIN ACTIVATIONS
MIN = -1.946

a thunder ous standing o vation . Long er an was
last season . An o vation of thanks from Jets fans
who got a standing o vation from the capacity crowd inside
200 . US ADA announced last week that Les
months without rain . Th irst y Marshall ese , many
lam . This second US ADA violation will certainly put additional
denied the rape and mol estation charges , saying it is
don 't have Q d oba there ." Ant
of and mock Child Mol estation and Rape speaks volumes about
( in true Moore doubles peak , everyone keeps referring to
under way . Post - peak : Pand emic activity appears
of rape and sexual mol estation . The 39 - year
control . ( Man it oba Chambers of Commerce ) "
- strength , military - grade , aluminum alloy that helps
of people travelling off - peak has almost doubled , with
each sip a bright clear wedge of lime rises up and
Help and # C hen nai Fl ood s have been
delay , The insol ence of office , and
the troops " to enc irc le and destroy [ the
cheap low - speed pulse jet engine suddenly starts

INTERVAL 1.106 - 2.124
CONTAINS 0.168%

open new store at Five Corn ers in Bur ien
ointed with the way the toilet is away from the bathroom
ep ā iw i l ā ua o
top pitching prospect in Matthew Wis ler , the No .
who are neither dumb nor blind than to their

INTERVAL 0.089 - 1.106
CONTAINS 27.321%

, had illegally collected personal location data without permission , according
and other messages via Mi iverse . There are
live in areas of high unemployment and low educational attainment and
can hardly think about anything else . This goes on for
was removed from the conference website by two chair men because

INTERVAL -0.929 - 0.089
CONTAINS 71.977%

pricing schemes will increase in the coming years .
years ago and I will understand it differently five years from
of Darwin and Wallace . ' The presence of
covert ly recorded und ressing in a changing room .
" Now , it seems like I 'm a little more

INTERVAL -1.946 - -0.929
CONTAINS 0.535%

High School building was renamed Higgins Science Institute for the shooting
city really reckoned with the cond ominium as its most significant
recent films such as Dead fall and The L edge but
Marxist takeover of V enez ue ... From Vox
examples : a 90 per cent reduction in HIV - related
Token of
Token ID286
Feature activation-1.52e-02

Pos logit contributions
 hon+0.065
nam+0.064
anked+0.063
phan+0.060
pora+0.059
Neg logit contributions
isle-0.056
 Horowitz-0.054
ADA-0.054
VL-0.052
APTER-0.052
Token Anarchy
Token ID38899
Feature activation+9.61e-01

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
Token star
Token ID3491
Feature activation-1.91e-01

Pos logit contributions
 hon+0.166
nam+0.162
anked+0.160
phan+0.153
pora+0.150
Neg logit contributions
isle-0.166
 Horowitz-0.161
ADA-0.160
VL-0.155
APTER-0.154
Token Charlie
Token ID11526
Feature activation+6.27e-01

Pos logit contributions
isle+0.032
 Horowitz+0.031
ADA+0.031
VL+0.030
APTER+0.029
Neg logit contributions
 hon-0.034
nam-0.033
anked-0.033
phan-0.031
pora-0.031
Token Hun
Token ID5900
Feature activation+7.27e-01

Pos logit contributions
 hon+0.119
nam+0.117
anked+0.115
phan+0.111
pora+0.109
Neg logit contributions
isle-0.098
 Horowitz-0.094
ADA-0.093
VL-0.090
APTER-0.089
Tokennam
Token ID7402
Feature activation+2.12e+00

Pos logit contributions
 hon+0.011
anked+0.006
pora+0.000
nam+0.000
phan-0.001
Neg logit contributions
isle-0.240
 Horowitz-0.235
ADA-0.232
hent-0.230
VL-0.229
Token is
Token ID318
Feature activation-6.14e-03

Pos logit contributions
 hon+0.355
nam+0.347
anked+0.341
phan+0.326
pora+0.320
Neg logit contributions
isle-0.379
 Horowitz-0.367
ADA-0.365
VL-0.355
APTER-0.352
Token being
Token ID852
Feature activation-8.66e-01

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
Token eyed
Token ID45320
Feature activation-2.39e-01

Pos logit contributions
isle+0.157
 Horowitz+0.153
ADA+0.152
VL+0.147
hent+0.146
Neg logit contributions
 hon-0.139
nam-0.137
anked-0.135
pora-0.128
phan-0.128
Token for
Token ID329
Feature activation-6.72e-02

Pos logit contributions
isle+0.041
 Horowitz+0.039
ADA+0.039
VL+0.038
APTER+0.037
Neg logit contributions
 hon-0.042
nam-0.041
anked-0.040
phan-0.039
pora-0.038
Token the
Token ID262
Feature activation+1.92e-02

Pos logit contributions
isle+0.012
 Horowitz+0.012
ADA+0.012
VL+0.011
APTER+0.011
Neg logit contributions
 hon-0.011
nam-0.011
anked-0.011
phan-0.010
pora-0.010
Token.
Token ID13
Feature activation+1.88e-02

Pos logit contributions
 hon+0.018
nam+0.018
anked+0.017
phan+0.017
pora+0.016
Neg logit contributions
isle-0.018
 Horowitz-0.017
ADA-0.017
VL-0.017
APTER-0.017
Token
Token ID198
Feature activation+4.49e-02

Pos logit contributions
 hon+0.003
nam+0.003
anked+0.003
phan+0.003
pora+0.003
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.003
APTER-0.003
Token
Token ID198
Feature activation+1.73e-02

Pos logit contributions
 hon+0.008
nam+0.008
anked+0.008
phan+0.007
pora+0.007
Neg logit contributions
isle-0.008
 Horowitz-0.007
ADA-0.007
VL-0.007
APTER-0.007
TokenWhile
Token ID3633
Feature activation+2.46e-01

Pos logit contributions
 hon+0.003
nam+0.003
anked+0.003
phan+0.003
pora+0.003
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.003
APTER-0.003
Token Hun
Token ID5900
Feature activation+6.90e-01

Pos logit contributions
 hon+0.041
nam+0.040
anked+0.039
phan+0.038
pora+0.037
Neg logit contributions
isle-0.044
 Horowitz-0.043
ADA-0.043
VL-0.041
APTER-0.041
Tokennam
Token ID7402
Feature activation+2.12e+00

Pos logit contributions
 hon+0.002
nam+0.000
anked-0.002
phan-0.007
pora-0.009
Neg logit contributions
isle-0.236
 Horowitz-0.233
ADA-0.232
VL-0.228
APTER-0.227
Token is
Token ID318
Feature activation+8.84e-03

Pos logit contributions
 hon+0.348
nam+0.339
anked+0.333
phan+0.318
pora+0.311
Neg logit contributions
isle-0.385
 Horowitz-0.373
ADA-0.371
VL-0.360
APTER-0.358
Token the
Token ID262
Feature activation+1.86e-02

Pos logit contributions
 hon+0.002
nam+0.001
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.002
 Horowitz-0.001
ADA-0.001
VL-0.001
APTER-0.001
Token star
Token ID3491
Feature activation-1.90e-01

Pos logit contributions
 hon+0.003
nam+0.003
anked+0.003
phan+0.003
pora+0.003
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.003
APTER-0.003
Token of
Token ID286
Feature activation-3.82e-02

Pos logit contributions
isle+0.032
 Horowitz+0.031
ADA+0.030
VL+0.029
APTER+0.029
Neg logit contributions
 hon-0.034
nam-0.033
anked-0.033
phan-0.031
pora-0.031
Token the
Token ID262
Feature activation+2.92e-02

Pos logit contributions
isle+0.007
 Horowitz+0.007
ADA+0.007
VL+0.006
APTER+0.006
Neg logit contributions
 hon-0.006
nam-0.006
anked-0.006
phan-0.006
pora-0.006
Token yet
Token ID1865
Feature activation+8.71e-01

Pos logit contributions
 hon+0.000
nam+0.000
anked+0.000
phan+0.000
pora+0.000
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
APTER-0.001
Token but
Token ID475
Feature activation+9.39e-02

Pos logit contributions
 hon+0.139
nam+0.136
anked+0.133
phan+0.127
pora+0.125
Neg logit contributions
isle-0.162
 Horowitz-0.157
ADA-0.156
VL-0.152
APTER-0.150
Token it
Token ID340
Feature activation-1.46e-01

Pos logit contributions
 hon+0.016
nam+0.016
anked+0.015
phan+0.015
pora+0.014
Neg logit contributions
isle-0.017
 Horowitz-0.016
ADA-0.016
VL-0.015
APTER-0.015
Token appears
Token ID3568
Feature activation+4.52e-01

Pos logit contributions
isle+0.027
 Horowitz+0.027
ADA+0.027
VL+0.026
APTER+0.026
Neg logit contributions
 hon-0.023
nam-0.023
anked-0.022
phan-0.021
pora-0.021
Token Hun
Token ID5900
Feature activation+7.18e-01

Pos logit contributions
 hon+0.072
nam+0.070
anked+0.069
phan+0.066
pora+0.064
Neg logit contributions
isle-0.084
 Horowitz-0.082
ADA-0.081
VL-0.079
APTER-0.078
Tokennam
Token ID7402
Feature activation+2.11e+00

Pos logit contributions
 hon+0.002
nam+0.000
anked-0.003
phan-0.007
pora-0.010
Neg logit contributions
isle-0.246
 Horowitz-0.242
ADA-0.241
VL-0.238
APTER-0.237
Token may
Token ID743
Feature activation+3.59e-02

Pos logit contributions
 hon+0.364
nam+0.355
anked+0.349
phan+0.334
pora+0.327
Neg logit contributions
isle-0.365
 Horowitz-0.353
ADA-0.351
VL-0.341
APTER-0.339
Token be
Token ID307
Feature activation-1.12e-02

Pos logit contributions
 hon+0.006
nam+0.005
anked+0.005
phan+0.005
pora+0.005
Neg logit contributions
isle-0.007
 Horowitz-0.007
ADA-0.007
VL-0.006
hent-0.006
Token the
Token ID262
Feature activation+2.90e-02

Pos logit contributions
isle+0.002
 Horowitz+0.002
ADA+0.002
VL+0.002
APTER+0.002
Neg logit contributions
 hon-0.002
nam-0.002
anked-0.002
phan-0.002
pora-0.002
Token frontrunner
Token ID49981
Feature activation+3.52e-01

Pos logit contributions
 hon+0.005
nam+0.004
anked+0.004
phan+0.004
pora+0.004
Neg logit contributions
isle-0.005
 Horowitz-0.005
ADA-0.005
VL-0.005
APTER-0.005
Token for
Token ID329
Feature activation-6.19e-02

Pos logit contributions
 hon+0.062
nam+0.060
anked+0.059
phan+0.057
pora+0.056
Neg logit contributions
isle-0.060
 Horowitz-0.058
ADA-0.058
VL-0.056
APTER-0.056
Token its
Token ID663
Feature activation+3.93e-01

Pos logit contributions
 hon+0.017
nam+0.016
anked+0.016
phan+0.015
pora+0.015
Neg logit contributions
isle-0.018
 Horowitz-0.017
ADA-0.017
VL-0.017
APTER-0.017
Token two
Token ID734
Feature activation+3.27e-01

Pos logit contributions
 hon+0.063
nam+0.061
anked+0.060
phan+0.058
pora+0.057
Neg logit contributions
isle-0.073
 Horowitz-0.071
ADA-0.070
VL-0.068
APTER-0.068
Token leads
Token ID5983
Feature activation-1.66e-01

Pos logit contributions
 hon+0.056
nam+0.055
anked+0.054
phan+0.051
pora+0.051
Neg logit contributions
isle-0.057
 Horowitz-0.055
ADA-0.055
VL-0.053
APTER-0.053
Token.
Token ID13
Feature activation-4.71e-03

Pos logit contributions
isle+0.029
 Horowitz+0.028
ADA+0.027
VL+0.026
APTER+0.026
Neg logit contributions
 hon-0.029
nam-0.028
anked-0.028
phan-0.026
pora-0.026
Token Hun
Token ID5900
Feature activation+6.83e-01

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
Tokennam
Token ID7402
Feature activation+2.07e+00

Pos logit contributions
 hon+0.001
nam+0.000
anked-0.003
phan-0.008
pora-0.010
Neg logit contributions
isle-0.235
 Horowitz-0.231
ADA-0.230
VL-0.227
APTER-0.226
Token joins
Token ID15449
Feature activation-2.29e-01

Pos logit contributions
 hon+0.340
nam+0.331
anked+0.325
phan+0.311
pora+0.304
Neg logit contributions
isle-0.378
 Horowitz-0.366
ADA-0.364
VL-0.354
APTER-0.352
Token a
Token ID257
Feature activation-6.54e-03

Pos logit contributions
isle+0.040
 Horowitz+0.039
ADA+0.038
VL+0.037
APTER+0.037
Neg logit contributions
 hon-0.039
nam-0.038
anked-0.038
phan-0.036
pora-0.036
Token list
Token ID1351
Feature activation-1.69e-01

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
Token of
Token ID286
Feature activation-5.34e-02

Pos logit contributions
isle+0.028
 Horowitz+0.027
ADA+0.027
VL+0.026
APTER+0.025
Neg logit contributions
 hon-0.031
nam-0.030
anked-0.030
phan-0.028
pora-0.028
Token actors
Token ID10544
Feature activation-6.40e-02

Pos logit contributions
isle+0.009
 Horowitz+0.009
ADA+0.009
VL+0.008
APTER+0.008
Neg logit contributions
 hon-0.009
nam-0.009
anked-0.009
phan-0.009
pora-0.009
Token,
Token ID11
Feature activation+2.44e-02

Pos logit contributions
 hon+0.086
nam+0.084
anked+0.083
phan+0.080
pora+0.079
Neg logit contributions
isle-0.082
 Horowitz-0.079
ADA-0.078
VL-0.076
APTER-0.075
Token because
Token ID780
Feature activation+1.15e-02

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.004
pora+0.003
Neg logit contributions
isle-0.005
 Horowitz-0.004
ADA-0.004
VL-0.004
APTER-0.004
Token my
Token ID616
Feature activation+2.25e-01

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token instinct
Token ID13311
Feature activation+3.44e-02

Pos logit contributions
 hon+0.038
nam+0.037
anked+0.036
phan+0.034
pora+0.034
Neg logit contributions
isle-0.040
 Horowitz-0.039
ADA-0.039
VL-0.038
APTER-0.037
Token,
Token ID11
Feature activation+5.63e-03

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.005
phan+0.005
pora+0.005
Neg logit contributions
isle-0.006
 Horowitz-0.006
ADA-0.006
VL-0.006
APTER-0.006
Token hon
Token ID3032
Feature activation+1.87e+00

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
APTER-0.001
Tokened
Token ID276
Feature activation+3.56e-01

Pos logit contributions
 hon+0.236
nam+0.229
anked+0.224
phan+0.212
pora+0.207
Neg logit contributions
isle-0.410
 Horowitz-0.399
ADA-0.396
VL-0.387
hent-0.384
Token by
Token ID416
Feature activation+1.10e-01

Pos logit contributions
 hon+0.055
nam+0.054
anked+0.054
phan+0.051
pora+0.050
Neg logit contributions
isle-0.067
 Horowitz-0.065
ADA-0.065
VL-0.063
APTER-0.062
Token a
Token ID257
Feature activation+2.72e-02

Pos logit contributions
 hon+0.018
nam+0.017
anked+0.017
phan+0.016
pora+0.016
Neg logit contributions
isle-0.020
 Horowitz-0.020
ADA-0.020
VL-0.019
APTER-0.019
Token lifetime
Token ID10869
Feature activation-1.31e-01

Pos logit contributions
 hon+0.005
nam+0.004
anked+0.004
phan+0.004
pora+0.004
Neg logit contributions
isle-0.005
 Horowitz-0.005
ADA-0.005
VL-0.005
APTER-0.005
Token of
Token ID286
Feature activation-3.52e-02

Pos logit contributions
isle+0.021
 Horowitz+0.020
ADA+0.020
VL+0.019
APTER+0.019
Neg logit contributions
 hon-0.024
nam-0.024
anked-0.023
phan-0.022
pora-0.022
Tokenwas
Token ID9776
Feature activation+2.48e-01

Pos logit contributions
isle+0.041
 Horowitz+0.039
ADA+0.039
VL+0.038
hent+0.037
Neg logit contributions
 hon-0.040
nam-0.039
anked-0.038
phan-0.037
pora-0.036
Token the
Token ID262
Feature activation+2.32e-02

Pos logit contributions
 hon+0.041
nam+0.040
anked+0.040
phan+0.038
pora+0.038
Neg logit contributions
isle-0.044
 Horowitz-0.043
ADA-0.043
VL-0.041
APTER-0.041
Token group
Token ID1448
Feature activation+1.80e-01

Pos logit contributions
 hon+0.004
nam+0.003
anked+0.003
phan+0.003
pora+0.003
Neg logit contributions
isle-0.004
 Horowitz-0.004
ADA-0.004
VL-0.004
hent-0.004
Token's
Token ID338
Feature activation-2.94e-01

Pos logit contributions
 hon+0.038
nam+0.037
anked+0.037
phan+0.036
pora+0.035
Neg logit contributions
isle-0.024
 Horowitz-0.023
ADA-0.023
VL-0.022
APTER-0.021
Token biggest
Token ID4094
Feature activation+5.60e-01

Pos logit contributions
isle+0.057
 Horowitz+0.056
ADA+0.055
VL+0.054
APTER+0.053
Neg logit contributions
 hon-0.044
nam-0.043
anked-0.043
phan-0.040
pora-0.040
Token hon
Token ID3032
Feature activation+1.87e+00

Pos logit contributions
 hon+0.089
nam+0.088
anked+0.087
phan+0.083
pora+0.082
Neg logit contributions
isle-0.103
 Horowitz-0.100
ADA-0.098
VL-0.096
hent-0.095
Tokenore
Token ID382
Feature activation+4.30e-02

Pos logit contributions
 hon+0.291
nam+0.285
anked+0.282
phan+0.268
pora+0.263
Neg logit contributions
isle-0.350
 Horowitz-0.338
ADA-0.334
VL-0.327
hent-0.325
Tokene
Token ID68
Feature activation+3.29e-01

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.006
phan+0.006
pora+0.005
Neg logit contributions
isle-0.009
 Horowitz-0.008
ADA-0.008
VL-0.008
APTER-0.008
Token,
Token ID11
Feature activation+4.36e-02

Pos logit contributions
 hon+0.055
nam+0.054
anked+0.054
phan+0.051
pora+0.051
Neg logit contributions
isle-0.058
 Horowitz-0.056
ADA-0.056
VL-0.054
hent-0.053
Token collecting
Token ID13157
Feature activation-5.10e-03

Pos logit contributions
 hon+0.007
nam+0.007
anked+0.007
phan+0.006
pora+0.006
Neg logit contributions
isle-0.008
 Horowitz-0.008
ADA-0.008
VL-0.007
hent-0.007
Token the
Token ID262
Feature activation+6.99e-03

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
Token the
Token ID262
Feature activation+2.50e-02

Pos logit contributions
 hon+0.034
nam+0.033
anked+0.032
phan+0.031
pora+0.030
Neg logit contributions
isle-0.037
 Horowitz-0.036
ADA-0.036
VL-0.035
APTER-0.034
Token entire
Token ID2104
Feature activation-3.66e-01

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.004
pora+0.004
Neg logit contributions
isle-0.005
 Horowitz-0.004
ADA-0.004
VL-0.004
hent-0.004
Token series
Token ID2168
Feature activation-1.53e-01

Pos logit contributions
isle+0.064
 Horowitz+0.062
ADA+0.061
VL+0.060
hent+0.059
Neg logit contributions
 hon-0.062
nam-0.061
anked-0.060
phan-0.057
pora-0.057
Token and
Token ID290
Feature activation+4.75e-02

Pos logit contributions
isle+0.028
 Horowitz+0.027
ADA+0.027
VL+0.026
hent+0.026
Neg logit contributions
 hon-0.025
nam-0.025
anked-0.024
phan-0.023
pora-0.023
Token G
Token ID402
Feature activation-2.61e-01

Pos logit contributions
 hon+0.008
nam+0.008
anked+0.008
phan+0.007
pora+0.007
Neg logit contributions
isle-0.008
 Horowitz-0.008
ADA-0.008
VL-0.008
APTER-0.008
Tokenphan
Token ID19080
Feature activation+1.82e+00

Pos logit contributions
isle+0.044
 Horowitz+0.042
ADA+0.042
VL+0.041
APTER+0.040
Neg logit contributions
 hon-0.046
nam-0.045
anked-0.045
phan-0.043
pora-0.042
Token ultimately
Token ID6165
Feature activation+2.43e-01

Pos logit contributions
 hon+0.308
nam+0.301
anked+0.296
phan+0.283
pora+0.279
Neg logit contributions
isle-0.321
 Horowitz-0.311
ADA-0.309
VL-0.299
APTER-0.297
Token ends
Token ID5645
Feature activation+7.95e-01

Pos logit contributions
 hon+0.038
nam+0.037
anked+0.037
phan+0.035
pora+0.034
Neg logit contributions
isle-0.046
 Horowitz-0.044
ADA-0.044
VL-0.043
hent-0.042
Token up
Token ID510
Feature activation+4.20e-01

Pos logit contributions
 hon+0.100
nam+0.097
anked+0.095
phan+0.090
pora+0.088
Neg logit contributions
isle-0.174
 Horowitz-0.170
ADA-0.169
VL-0.165
APTER-0.164
Token doing
Token ID1804
Feature activation+3.01e-01

Pos logit contributions
 hon+0.075
nam+0.073
anked+0.072
phan+0.069
pora+0.068
Neg logit contributions
isle-0.070
 Horowitz-0.068
ADA-0.068
VL-0.065
APTER-0.065
Token nothing
Token ID2147
Feature activation+1.67e-01

Pos logit contributions
 hon+0.051
nam+0.049
anked+0.049
phan+0.046
pora+0.046
Neg logit contributions
isle-0.054
 Horowitz-0.052
ADA-0.051
VL-0.050
APTER-0.050
Token -
Token ID532
Feature activation+3.14e-01

Pos logit contributions
 hon+0.016
nam+0.016
anked+0.016
phan+0.015
pora+0.015
Neg logit contributions
isle-0.018
 Horowitz-0.017
ADA-0.017
VL-0.016
APTER-0.016
Token a
Token ID257
Feature activation-5.25e-02

Pos logit contributions
 hon+0.051
nam+0.050
anked+0.049
phan+0.047
pora+0.046
Neg logit contributions
isle-0.057
 Horowitz-0.055
ADA-0.055
VL-0.053
hent-0.053
Token skill
Token ID5032
Feature activation+2.16e-01

Pos logit contributions
isle+0.010
 Horowitz+0.009
ADA+0.009
VL+0.009
hent+0.009
Neg logit contributions
 hon-0.009
nam-0.008
anked-0.008
phan-0.008
pora-0.008
Token that
Token ID326
Feature activation-2.43e-02

Pos logit contributions
 hon+0.036
nam+0.035
anked+0.035
phan+0.033
pora+0.033
Neg logit contributions
isle-0.039
 Horowitz-0.037
ADA-0.037
VL-0.036
APTER-0.036
Token was
Token ID373
Feature activation+7.51e-02

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.004
phan-0.004
pora-0.004
Token hon
Token ID3032
Feature activation+1.82e+00

Pos logit contributions
 hon+0.013
nam+0.012
anked+0.012
phan+0.012
pora+0.011
Neg logit contributions
isle-0.013
 Horowitz-0.013
ADA-0.013
VL-0.012
APTER-0.012
Tokened
Token ID276
Feature activation+2.78e-01

Pos logit contributions
 hon+0.228
nam+0.222
anked+0.217
phan+0.206
pora+0.203
Neg logit contributions
isle-0.396
 Horowitz-0.385
ADA-0.381
VL-0.374
hent-0.370
Token as
Token ID355
Feature activation+6.27e-02

Pos logit contributions
 hon+0.044
nam+0.044
anked+0.043
phan+0.041
pora+0.040
Neg logit contributions
isle-0.051
 Horowitz-0.050
ADA-0.049
VL-0.048
APTER-0.047
Token a
Token ID257
Feature activation-5.33e-02

Pos logit contributions
 hon+0.011
nam+0.011
anked+0.011
phan+0.010
pora+0.010
Neg logit contributions
isle-0.011
 Horowitz-0.010
ADA-0.010
VL-0.010
APTER-0.010
Token child
Token ID1200
Feature activation-1.10e-01

Pos logit contributions
isle+0.009
 Horowitz+0.009
ADA+0.009
VL+0.008
hent+0.008
Neg logit contributions
 hon-0.009
nam-0.009
anked-0.009
phan-0.009
pora-0.009
Token in
Token ID287
Feature activation-1.36e-02

Pos logit contributions
isle+0.020
 Horowitz+0.019
ADA+0.019
VL+0.018
APTER+0.018
Neg logit contributions
 hon-0.018
nam-0.018
anked-0.018
phan-0.017
pora-0.017
Tokeniana
Token ID7484
Feature activation-1.69e+00

Pos logit contributions
 hon+0.199
nam+0.198
anked+0.195
phan+0.190
pora+0.187
Neg logit contributions
isle-0.037
ADA-0.033
 Horowitz-0.032
VL-0.030
APTER-0.029
Token Mas
Token ID11066
Feature activation+4.63e-01

Pos logit contributions
isle+0.379
ADA+0.368
 Horowitz+0.363
VL+0.356
hent+0.355
Neg logit contributions
 hon-0.210
nam-0.203
anked-0.194
phan-0.182
pora-0.177
Tokenl
Token ID75
Feature activation+6.00e-01

Pos logit contributions
 hon+0.052
nam+0.050
anked+0.049
phan+0.046
pora+0.045
Neg logit contributions
isle-0.107
 Horowitz-0.105
ADA-0.104
VL-0.102
APTER-0.101
Tokenany
Token ID1092
Feature activation-2.21e-01

Pos logit contributions
 hon+0.109
nam+0.107
anked+0.105
phan+0.100
pora+0.099
Neg logit contributions
isle-0.099
 Horowitz-0.095
ADA-0.095
VL-0.091
hent-0.091
Token Or
Token ID1471
Feature activation-3.65e-01

Pos logit contributions
isle+0.038
 Horowitz+0.037
ADA+0.037
VL+0.036
hent+0.035
Neg logit contributions
 hon-0.038
nam-0.037
anked-0.037
phan-0.035
pora-0.035
Tokenphan
Token ID19080
Feature activation+1.81e+00

Pos logit contributions
isle+0.108
 Horowitz+0.106
ADA+0.105
VL+0.103
APTER+0.103
Neg logit contributions
 hon-0.018
nam-0.016
anked-0.016
pora-0.012
phan-0.011
Token Black
Token ID2619
Feature activation-4.83e-02

Pos logit contributions
 hon+0.329
nam+0.322
anked+0.318
phan+0.303
pora+0.300
Neg logit contributions
isle-0.298
 Horowitz-0.288
ADA-0.285
VL-0.275
APTER-0.273
Token BBC
Token ID7823
Feature activation+7.32e-01

Pos logit contributions
isle+0.009
 Horowitz+0.008
ADA+0.008
VL+0.008
APTER+0.008
Neg logit contributions
 hon-0.008
nam-0.008
anked-0.008
phan-0.007
pora-0.007
Token America
Token ID2253
Feature activation-2.99e-01

Pos logit contributions
 hon+0.127
nam+0.125
anked+0.123
phan+0.118
pora+0.115
Neg logit contributions
isle-0.126
 Horowitz-0.122
ADA-0.121
VL-0.117
APTER-0.116
Token Matthew
Token ID9308
Feature activation-2.33e-01

Pos logit contributions
isle+0.051
 Horowitz+0.049
ADA+0.049
VL+0.047
APTER+0.047
Neg logit contributions
 hon-0.053
nam-0.052
anked-0.051
phan-0.049
pora-0.048
Token Mc
Token ID1982
Feature activation-3.74e-01

Pos logit contributions
isle+0.037
 Horowitz+0.036
ADA+0.035
VL+0.034
APTER+0.033
Neg logit contributions
 hon-0.044
nam-0.043
anked-0.043
phan-0.041
pora-0.040
Token
Token ID247
Feature activation-1.26e-02

Pos logit contributions
isle+0.009
 Horowitz+0.009
ADA+0.009
VL+0.008
hent+0.008
Neg logit contributions
 hon-0.007
nam-0.007
anked-0.006
phan-0.006
pora-0.006
Tokens
Token ID82
Feature activation-1.94e-02

Pos logit contributions
isle+0.002
 Horowitz+0.002
ADA+0.002
VL+0.002
APTER+0.002
Neg logit contributions
 hon-0.002
nam-0.002
anked-0.002
phan-0.002
pora-0.002
Token Office
Token ID4452
Feature activation+6.85e-01

Pos logit contributions
isle+0.004
 Horowitz+0.003
ADA+0.003
VL+0.003
APTER+0.003
Neg logit contributions
 hon-0.003
nam-0.003
anked-0.003
phan-0.003
pora-0.003
Token of
Token ID286
Feature activation-7.63e-02

Pos logit contributions
 hon+0.121
nam+0.118
anked+0.115
phan+0.112
pora+0.109
Neg logit contributions
isle-0.117
 Horowitz-0.112
ADA-0.112
APTER-0.108
VL-0.108
Token Or
Token ID1471
Feature activation-3.81e-01

Pos logit contributions
isle+0.013
 Horowitz+0.013
ADA+0.013
VL+0.012
APTER+0.012
Neg logit contributions
 hon-0.013
nam-0.013
anked-0.013
phan-0.012
pora-0.012
Tokenphan
Token ID19080
Feature activation+1.81e+00

Pos logit contributions
isle+0.126
 Horowitz+0.124
ADA+0.123
VL+0.121
hent+0.121
Neg logit contributions
 hon-0.006
nam-0.004
anked-0.004
phan-0.000
pora+0.000
Token Products
Token ID18675
Feature activation-3.62e-01

Pos logit contributions
 hon+0.349
nam+0.342
anked+0.337
phan+0.325
pora+0.320
Neg logit contributions
isle-0.275
 Horowitz-0.265
ADA-0.263
VL-0.254
APTER-0.252
Token Development
Token ID7712
Feature activation+7.39e-01

Pos logit contributions
isle+0.061
 Horowitz+0.059
ADA+0.058
VL+0.056
APTER+0.056
Neg logit contributions
 hon-0.064
nam-0.063
anked-0.062
phan-0.060
pora-0.059
Token.
Token ID13
Feature activation-2.77e-02

Pos logit contributions
 hon+0.130
nam+0.127
anked+0.125
phan+0.120
pora+0.117
Neg logit contributions
isle-0.126
 Horowitz-0.122
ADA-0.121
VL-0.117
APTER-0.116
Token
Token ID198
Feature activation-7.11e-04

Pos logit contributions
isle+0.005
 Horowitz+0.005
ADA+0.005
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.005
nam-0.005
anked-0.005
phan-0.004
pora-0.004
Token
Token ID198
Feature activation+3.67e-03

Pos logit contributions
isle+0.000
 Horowitz+0.000
ADA+0.000
VL+0.000
APTER+0.000
Neg logit contributions
 hon-0.000
nam-0.000
anked-0.000
phan-0.000
pora-0.000
Tokenauer
Token ID16261
Feature activation-3.31e-01

Pos logit contributions
isle+0.028
 Horowitz+0.026
ADA+0.026
VL+0.025
APTER+0.025
Neg logit contributions
 hon-0.045
nam-0.044
anked-0.044
phan-0.042
pora-0.042
Token wrote
Token ID2630
Feature activation+5.24e-01

Pos logit contributions
isle+0.060
 Horowitz+0.058
ADA+0.058
VL+0.056
APTER+0.055
Neg logit contributions
 hon-0.055
nam-0.053
anked-0.053
phan-0.050
pora-0.049
Token that
Token ID326
Feature activation+8.19e-02

Pos logit contributions
 hon+0.085
nam+0.083
anked+0.081
phan+0.078
pora+0.076
Neg logit contributions
isle-0.097
 Horowitz-0.093
ADA-0.093
VL-0.090
hent-0.090
Token
Token ID564
Feature activation+2.38e-02

Pos logit contributions
 hon+0.014
nam+0.013
anked+0.013
phan+0.012
pora+0.012
Neg logit contributions
isle-0.015
 Horowitz-0.014
ADA-0.014
VL-0.014
APTER-0.014
Token
Token ID250
Feature activation-7.51e-02

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.004
pora+0.003
Neg logit contributions
isle-0.004
 Horowitz-0.004
ADA-0.004
VL-0.004
APTER-0.004
TokenTal
Token ID31466
Feature activation+1.81e+00

Pos logit contributions
isle+0.013
 Horowitz+0.012
ADA+0.012
VL+0.012
APTER+0.011
Neg logit contributions
 hon-0.013
nam-0.013
anked-0.013
phan-0.012
pora-0.012
Tokenent
Token ID298
Feature activation-2.22e-01

Pos logit contributions
 hon+0.311
nam+0.304
anked+0.299
phan+0.287
pora+0.282
Neg logit contributions
isle-0.313
 Horowitz-0.303
ADA-0.301
VL-0.292
APTER-0.289
Token hits
Token ID7127
Feature activation+2.22e-01

Pos logit contributions
isle+0.039
 Horowitz+0.038
ADA+0.037
VL+0.036
APTER+0.036
Neg logit contributions
 hon-0.038
nam-0.037
anked-0.036
phan-0.035
pora-0.034
Token a
Token ID257
Feature activation+3.44e-02

Pos logit contributions
 hon+0.037
nam+0.036
anked+0.035
phan+0.034
pora+0.033
Neg logit contributions
isle-0.040
 Horowitz-0.039
ADA-0.038
VL-0.037
APTER-0.037
Token target
Token ID2496
Feature activation+7.76e-03

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.006
phan+0.005
pora+0.005
Neg logit contributions
isle-0.006
 Horowitz-0.006
ADA-0.006
VL-0.006
hent-0.006
Token no
Token ID645
Feature activation-9.47e-03

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
APTER-0.001
Token founder
Token ID9119
Feature activation-6.83e-02

Pos logit contributions
isle+0.003
 Horowitz+0.003
ADA+0.003
VL+0.003
APTER+0.003
Neg logit contributions
 hon-0.003
nam-0.003
anked-0.003
phan-0.003
pora-0.003
Token of
Token ID286
Feature activation-5.98e-02

Pos logit contributions
isle+0.012
 Horowitz+0.011
ADA+0.011
VL+0.011
APTER+0.011
Neg logit contributions
 hon-0.012
nam-0.012
anked-0.012
phan-0.011
pora-0.011
Token Cot
Token ID42266
Feature activation-4.67e-01

Pos logit contributions
isle+0.010
 Horowitz+0.010
ADA+0.010
VL+0.010
APTER+0.010
Neg logit contributions
 hon-0.010
nam-0.010
anked-0.010
phan-0.010
pora-0.009
Tokené
Token ID2634
Feature activation+2.07e-01

Pos logit contributions
isle+0.083
 Horowitz+0.080
ADA+0.080
VL+0.077
hent+0.077
Neg logit contributions
 hon-0.078
nam-0.076
anked-0.075
phan-0.072
pora-0.071
Token Or
Token ID1471
Feature activation-3.78e-01

Pos logit contributions
 hon+0.039
nam+0.039
anked+0.038
phan+0.037
pora+0.036
Neg logit contributions
isle-0.032
 Horowitz-0.031
ADA-0.031
VL-0.030
APTER-0.030
Tokenphan
Token ID19080
Feature activation+1.75e+00

Pos logit contributions
isle+0.070
 Horowitz+0.068
ADA+0.068
VL+0.065
hent+0.065
Neg logit contributions
 hon-0.061
nam-0.058
anked-0.058
phan-0.054
pora-0.054
Token Consulting
Token ID41005
Feature activation-1.36e+00

Pos logit contributions
 hon+0.292
nam+0.285
anked+0.281
phan+0.267
pora+0.264
Neg logit contributions
isle-0.313
 Horowitz-0.303
ADA-0.301
VL-0.291
APTER-0.289
Token in
Token ID287
Feature activation-6.41e-03

Pos logit contributions
isle+0.243
 Horowitz+0.236
ADA+0.234
VL+0.227
APTER+0.225
Neg logit contributions
 hon-0.228
nam-0.222
anked-0.219
phan-0.209
pora-0.206
Token Silver
Token ID7698
Feature activation-1.21e+00

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
Token Spring
Token ID8225
Feature activation-5.23e-01

Pos logit contributions
isle+0.160
 Horowitz+0.154
ADA+0.153
VL+0.148
APTER+0.146
Neg logit contributions
 hon-0.257
nam-0.252
anked-0.248
phan-0.240
pora-0.237
Token and
Token ID290
Feature activation-2.04e-02

Pos logit contributions
isle+0.098
 Horowitz+0.095
ADA+0.095
VL+0.092
APTER+0.091
Neg logit contributions
 hon-0.082
nam-0.080
anked-0.079
phan-0.075
pora-0.074
Token to
Token ID284
Feature activation-1.34e-02

Pos logit contributions
isle+0.141
 Horowitz+0.137
ADA+0.135
VL+0.131
hent+0.130
Neg logit contributions
 hon-0.137
nam-0.134
anked-0.132
phan-0.126
pora-0.125
Token know
Token ID760
Feature activation-1.87e-01

Pos logit contributions
isle+0.002
 Horowitz+0.002
ADA+0.002
VL+0.002
APTER+0.002
Neg logit contributions
 hon-0.002
nam-0.002
anked-0.002
phan-0.002
pora-0.002
Token.
Token ID13
Feature activation-3.64e-02

Pos logit contributions
isle+0.033
 Horowitz+0.032
ADA+0.031
VL+0.030
APTER+0.030
Neg logit contributions
 hon-0.032
nam-0.031
anked-0.031
phan-0.030
pora-0.029
Token
Token ID198
Feature activation-3.29e-02

Pos logit contributions
isle+0.006
 Horowitz+0.006
ADA+0.006
VL+0.006
APTER+0.006
Neg logit contributions
 hon-0.006
nam-0.006
anked-0.006
phan-0.006
pora-0.006
Token
Token ID198
Feature activation-2.28e-02

Pos logit contributions
isle+0.006
 Horowitz+0.005
ADA+0.005
VL+0.005
APTER+0.005
Neg logit contributions
 hon-0.006
nam-0.006
anked-0.006
phan-0.005
pora-0.005
TokenTal
Token ID31466
Feature activation+1.75e+00

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.004
phan-0.004
pora-0.004
Tokenos
Token ID418
Feature activation+3.75e-02

Pos logit contributions
 hon+0.260
nam+0.254
anked+0.249
phan+0.238
pora+0.232
Neg logit contributions
isle-0.342
 Horowitz-0.334
ADA-0.332
VL-0.323
APTER-0.322
Token-
Token ID12
Feature activation-5.58e-02

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.006
phan+0.006
pora+0.006
Neg logit contributions
isle-0.007
 Horowitz-0.006
ADA-0.006
VL-0.006
APTER-0.006
Token1
Token ID16
Feature activation+2.82e-01

Pos logit contributions
isle+0.011
 Horowitz+0.011
ADA+0.011
VL+0.010
APTER+0.010
Neg logit contributions
 hon-0.008
nam-0.008
anked-0.008
phan-0.008
pora-0.008
Token isn
Token ID2125
Feature activation+4.23e-01

Pos logit contributions
 hon+0.045
nam+0.044
anked+0.043
phan+0.041
pora+0.040
Neg logit contributions
isle-0.052
 Horowitz-0.051
ADA-0.050
VL-0.049
hent-0.049
Token
Token ID447
Feature activation-6.11e-02

Pos logit contributions
 hon+0.071
nam+0.069
anked+0.068
phan+0.065
pora+0.064
Neg logit contributions
isle-0.075
 Horowitz-0.073
ADA-0.073
VL-0.070
APTER-0.070
Token the
Token ID262
Feature activation+3.09e-02

Pos logit contributions
isle+0.002
 Horowitz+0.002
ADA+0.002
VL+0.002
APTER+0.002
Neg logit contributions
 hon-0.002
nam-0.002
anked-0.002
phan-0.002
pora-0.002
Token team
Token ID1074
Feature activation+1.97e-01

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.005
phan+0.005
pora+0.005
Neg logit contributions
isle-0.005
 Horowitz-0.005
ADA-0.005
VL-0.005
APTER-0.005
Token
Token ID447
Feature activation+4.84e-03

Pos logit contributions
 hon+0.036
nam+0.035
anked+0.035
phan+0.033
pora+0.033
Neg logit contributions
isle-0.032
 Horowitz-0.031
ADA-0.031
VL-0.030
APTER-0.030
Token
Token ID247
Feature activation+3.10e-02

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
hent-0.001
Tokens
Token ID82
Feature activation+1.26e-02

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.006
phan+0.006
pora+0.006
Neg logit contributions
isle-0.005
 Horowitz-0.005
ADA-0.004
VL-0.004
APTER-0.004
Token fortunes
Token ID27806
Feature activation+1.74e+00

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token that
Token ID326
Feature activation+8.09e-02

Pos logit contributions
 hon+0.297
nam+0.290
anked+0.286
phan+0.274
pora+0.269
Neg logit contributions
isle-0.305
 Horowitz-0.295
ADA-0.293
VL-0.284
APTER-0.282
Token spring
Token ID6076
Feature activation-6.39e-02

Pos logit contributions
 hon+0.013
nam+0.013
anked+0.013
phan+0.012
pora+0.012
Neg logit contributions
isle-0.015
 Horowitz-0.014
ADA-0.014
VL-0.014
APTER-0.014
Token,
Token ID11
Feature activation+4.32e-02

Pos logit contributions
isle+0.012
 Horowitz+0.011
ADA+0.011
VL+0.011
hent+0.011
Neg logit contributions
 hon-0.010
nam-0.010
anked-0.010
phan-0.009
pora-0.009
Token posting
Token ID10754
Feature activation-8.89e-02

Pos logit contributions
 hon+0.007
nam+0.007
anked+0.007
phan+0.007
pora+0.007
Neg logit contributions
isle-0.008
 Horowitz-0.007
ADA-0.007
VL-0.007
APTER-0.007
Token a
Token ID257
Feature activation+2.54e-02

Pos logit contributions
isle+0.016
 Horowitz+0.016
ADA+0.016
VL+0.015
APTER+0.015
Neg logit contributions
 hon-0.015
nam-0.014
anked-0.014
phan-0.013
pora-0.013
Token.
Token ID13
Feature activation-3.65e-02

Pos logit contributions
 hon+0.079
nam+0.077
anked+0.075
phan+0.072
pora+0.070
Neg logit contributions
isle-0.085
 Horowitz-0.083
ADA-0.082
VL-0.080
��-0.079
Token Though
Token ID7486
Feature activation-6.33e-01

Pos logit contributions
isle+0.006
 Horowitz+0.006
ADA+0.006
VL+0.006
APTER+0.006
Neg logit contributions
 hon-0.007
nam-0.006
anked-0.006
phan-0.006
pora-0.006
Token shows
Token ID2523
Feature activation+1.83e-01

Pos logit contributions
isle+0.115
 Horowitz+0.112
ADA+0.111
VL+0.108
APTER+0.107
Neg logit contributions
 hon-0.103
nam-0.101
anked-0.100
phan-0.095
pora-0.093
Token like
Token ID588
Feature activation+3.50e-02

Pos logit contributions
 hon+0.028
nam+0.027
anked+0.027
phan+0.025
pora+0.025
Neg logit contributions
isle-0.035
 Horowitz-0.034
ADA-0.034
VL-0.033
hent-0.033
Token Or
Token ID1471
Feature activation-4.12e-01

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.006
phan+0.005
pora+0.005
Neg logit contributions
isle-0.006
 Horowitz-0.006
ADA-0.006
VL-0.006
hent-0.006
Tokenphan
Token ID19080
Feature activation+1.74e+00

Pos logit contributions
isle+0.135
 Horowitz+0.132
ADA+0.132
VL+0.129
APTER+0.129
Neg logit contributions
 hon-0.008
nam-0.006
anked-0.005
pora-0.001
phan-0.001
Token Black
Token ID2619
Feature activation-8.51e-02

Pos logit contributions
 hon+0.311
nam+0.304
anked+0.300
phan+0.287
pora+0.283
Neg logit contributions
isle-0.290
 Horowitz-0.280
ADA-0.278
VL-0.269
APTER-0.266
Token and
Token ID290
Feature activation+6.95e-03

Pos logit contributions
isle+0.016
 Horowitz+0.015
ADA+0.015
VL+0.015
APTER+0.015
Neg logit contributions
 hon-0.014
nam-0.013
anked-0.013
phan-0.012
pora-0.012
Token The
Token ID383
Feature activation-1.15e-01

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
hent-0.001
Token Vampire
Token ID21463
Feature activation+3.72e-01

Pos logit contributions
isle+0.019
 Horowitz+0.019
ADA+0.018
VL+0.018
hent+0.017
Neg logit contributions
 hon-0.021
nam-0.020
anked-0.020
phan-0.019
pora-0.019
Token Di
Token ID6031
Feature activation-9.71e-02

Pos logit contributions
 hon+0.064
nam+0.062
anked+0.061
phan+0.059
pora+0.058
Neg logit contributions
isle-0.064
 Horowitz-0.062
ADA-0.062
VL-0.060
hent-0.060
Token on
Token ID319
Feature activation+1.42e-02

Pos logit contributions
 hon+0.075
nam+0.073
anked+0.072
phan+0.068
pora+0.067
Neg logit contributions
isle-0.089
 Horowitz-0.086
ADA-0.086
VL-0.083
APTER-0.083
Token chains
Token ID14659
Feature activation+1.31e-01

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.003
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token,
Token ID11
Feature activation+5.97e-02

Pos logit contributions
 hon+0.022
nam+0.021
anked+0.021
phan+0.020
pora+0.020
Neg logit contributions
isle-0.023
 Horowitz-0.023
ADA-0.023
VL-0.022
hent-0.022
Token Iran
Token ID4068
Feature activation-1.44e-01

Pos logit contributions
 hon+0.010
nam+0.010
anked+0.010
phan+0.009
pora+0.009
Neg logit contributions
isle-0.011
 Horowitz-0.010
ADA-0.010
VL-0.010
APTER-0.010
Token y
Token ID331
Feature activation+4.05e-01

Pos logit contributions
isle+0.026
 Horowitz+0.025
ADA+0.025
VL+0.024
hent+0.024
Neg logit contributions
 hon-0.024
nam-0.023
anked-0.023
phan-0.022
pora-0.021
Tokenanking
Token ID15230
Feature activation+1.68e+00

Pos logit contributions
 hon+0.027
nam+0.025
anked+0.024
phan+0.022
pora+0.021
Neg logit contributions
isle-0.113
 Horowitz-0.111
ADA-0.110
VL-0.108
hent-0.107
Token them
Token ID606
Feature activation-9.21e-02

Pos logit contributions
 hon+0.284
nam+0.277
anked+0.273
phan+0.260
pora+0.255
Neg logit contributions
isle-0.299
 Horowitz-0.288
ADA-0.287
APTER-0.279
VL-0.278
Token
Token ID198
Feature activation+4.19e-02

Pos logit contributions
isle+0.017
 Horowitz+0.016
ADA+0.016
VL+0.016
APTER+0.016
Neg logit contributions
 hon-0.015
nam-0.015
anked-0.014
phan-0.014
pora-0.013
Token
Token ID198
Feature activation+3.84e-02

Pos logit contributions
 hon+0.007
nam+0.007
anked+0.007
phan+0.007
pora+0.007
Neg logit contributions
isle-0.007
 Horowitz-0.007
ADA-0.007
VL-0.007
APTER-0.007
TokenCheck
Token ID9787
Feature activation+6.62e-01

Pos logit contributions
 hon+0.007
nam+0.007
anked+0.006
phan+0.006
pora+0.006
Neg logit contributions
isle-0.007
 Horowitz-0.006
ADA-0.006
VL-0.006
APTER-0.006
Token it
Token ID340
Feature activation-1.31e-01

Pos logit contributions
 hon+0.115
nam+0.113
anked+0.111
phan+0.106
pora+0.104
Neg logit contributions
isle-0.114
 Horowitz-0.110
ADA-0.109
VL-0.106
APTER-0.105
Token activities
Token ID4568
Feature activation+1.15e-02

Pos logit contributions
isle+0.018
 Horowitz+0.017
ADA+0.017
VL+0.017
APTER+0.016
Neg logit contributions
 hon-0.016
nam-0.016
anked-0.016
phan-0.015
pora-0.015
Token include
Token ID2291
Feature activation-2.06e-01

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token running
Token ID2491
Feature activation-5.73e-01

Pos logit contributions
isle+0.036
 Horowitz+0.035
ADA+0.035
VL+0.034
APTER+0.033
Neg logit contributions
 hon-0.035
nam-0.034
anked-0.034
phan-0.032
pora-0.032
Token,
Token ID11
Feature activation+5.20e-02

Pos logit contributions
isle+0.107
 Horowitz+0.103
ADA+0.103
VL+0.100
APTER+0.099
Neg logit contributions
 hon-0.092
nam-0.089
anked-0.088
phan-0.084
pora-0.082
Token pl
Token ID458
Feature activation+7.29e-01

Pos logit contributions
 hon+0.009
nam+0.009
anked+0.009
phan+0.008
pora+0.008
Neg logit contributions
isle-0.009
 Horowitz-0.009
ADA-0.009
VL-0.008
APTER-0.008
Tokenanking
Token ID15230
Feature activation+1.68e+00

Pos logit contributions
 hon+0.149
nam+0.146
anked+0.146
phan+0.140
pora+0.138
Neg logit contributions
isle-0.102
 Horowitz-0.099
ADA-0.098
VL-0.095
��-0.095
Token and
Token ID290
Feature activation+6.70e-02

Pos logit contributions
 hon+0.263
nam+0.255
anked+0.250
phan+0.238
pora+0.233
Neg logit contributions
isle-0.319
 Horowitz-0.309
ADA-0.307
VL-0.300
APTER-0.299
Token lung
Token ID12317
Feature activation-6.39e-01

Pos logit contributions
 hon+0.011
nam+0.011
anked+0.011
phan+0.010
pora+0.010
Neg logit contributions
isle-0.012
 Horowitz-0.012
ADA-0.012
VL-0.011
APTER-0.011
Tokenes
Token ID274
Feature activation-1.55e-01

Pos logit contributions
isle+0.098
 Horowitz+0.095
ADA+0.094
VL+0.091
APTER+0.090
Neg logit contributions
 hon-0.123
nam-0.120
anked-0.118
phan-0.114
pora-0.112
Token.
Token ID13
Feature activation-5.17e-03

Pos logit contributions
isle+0.027
 Horowitz+0.026
ADA+0.026
VL+0.025
APTER+0.025
Neg logit contributions
 hon-0.026
nam-0.026
anked-0.025
phan-0.024
pora-0.024
Token (
Token ID357
Feature activation+3.67e-02

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
Token |
Token ID930
Feature activation+2.86e-02

Pos logit contributions
 hon+0.115
nam+0.111
anked+0.111
phan+0.106
pora+0.105
Neg logit contributions
isle-0.116
 Horowitz-0.113
ADA-0.112
VL-0.110
APTER-0.108
Token Select
Token ID9683
Feature activation-9.83e-03

Pos logit contributions
 hon+0.005
nam+0.005
anked+0.005
phan+0.005
pora+0.005
Neg logit contributions
isle-0.005
 Horowitz-0.005
ADA-0.005
VL-0.005
APTER-0.004
Token -
Token ID532
Feature activation+3.69e-01

Pos logit contributions
isle+0.002
 Horowitz+0.002
ADA+0.002
VL+0.002
APTER+0.002
Neg logit contributions
 hon-0.002
nam-0.002
anked-0.002
phan-0.002
pora-0.002
TokenExp
Token ID16870
Feature activation+1.09e+00

Pos logit contributions
 hon+0.053
nam+0.052
anked+0.051
phan+0.048
pora+0.048
Neg logit contributions
isle-0.074
 Horowitz-0.072
ADA-0.072
VL-0.070
APTER-0.069
Tokenand
Token ID392
Feature activation+1.75e-01

Pos logit contributions
 hon+0.153
nam+0.148
anked+0.146
phan+0.138
pora+0.136
Neg logit contributions
isle-0.223
 Horowitz-0.217
ADA-0.216
VL-0.210
hent-0.208
TokenProperty
Token ID21746
Feature activation+1.68e+00

Pos logit contributions
 hon+0.013
nam+0.012
anked+0.012
phan+0.011
pora+0.011
Neg logit contributions
isle-0.047
 Horowitz-0.046
ADA-0.046
VL-0.046
��-0.045
Token B
Token ID347
Feature activation-4.11e-03

Pos logit contributions
 hon+0.272
nam+0.266
anked+0.263
phan+0.252
pora+0.249
Neg logit contributions
isle-0.305
 Horowitz-0.297
ADA-0.293
VL-0.287
hent-0.282
Tokenounds
Token ID3733
Feature activation-1.93e-01

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
Token |
Token ID930
Feature activation+4.65e-02

Pos logit contributions
isle+0.035
 Horowitz+0.034
ADA+0.034
VL+0.034
��+0.033
Neg logit contributions
 hon-0.031
nam-0.030
anked-0.030
phan-0.029
pora-0.028
Token Select
Token ID9683
Feature activation+2.68e-02

Pos logit contributions
 hon+0.008
nam+0.008
anked+0.008
phan+0.007
pora+0.007
Neg logit contributions
isle-0.008
 Horowitz-0.008
ADA-0.008
VL-0.007
APTER-0.007
Token Width
Token ID38807
Feature activation+7.18e-01

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.004
pora+0.004
Neg logit contributions
isle-0.005
 Horowitz-0.005
ADA-0.005
VL-0.005
APTER-0.005
Token Wilson
Token ID8127
Feature activation+2.05e-01

Pos logit contributions
 hon+0.098
nam+0.094
anked+0.093
phan+0.088
pora+0.086
Neg logit contributions
isle-0.103
ADA-0.100
VL-0.099
 Horowitz-0.098
APTER-0.097
Token.
Token ID13
Feature activation+3.62e-02

Pos logit contributions
 hon+0.035
nam+0.034
anked+0.034
phan+0.032
pora+0.032
Neg logit contributions
isle-0.036
 Horowitz-0.034
ADA-0.034
VL-0.033
APTER-0.033
Token
Token ID198
Feature activation+5.23e-02

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.006
phan+0.006
pora+0.006
Neg logit contributions
isle-0.006
ADA-0.006
 Horowitz-0.006
VL-0.006
hent-0.006
Token
Token ID198
Feature activation+5.82e-02

Pos logit contributions
 hon+0.009
nam+0.009
anked+0.009
phan+0.008
pora+0.008
Neg logit contributions
isle-0.009
 Horowitz-0.009
ADA-0.009
VL-0.008
APTER-0.008
TokenR
Token ID49
Feature activation-2.31e-01

Pos logit contributions
 hon+0.010
nam+0.010
anked+0.010
phan+0.010
pora+0.009
Neg logit contributions
isle-0.010
 Horowitz-0.009
ADA-0.009
VL-0.009
APTER-0.009
TokenANK
Token ID15154
Feature activation+1.67e+00

Pos logit contributions
isle+0.066
ADA+0.064
 Horowitz+0.064
VL+0.063
APTER+0.063
Neg logit contributions
 hon-0.014
nam-0.013
anked-0.012
phan-0.011
pora-0.011
Token 4
Token ID604
Feature activation+1.74e-02

Pos logit contributions
 hon+0.273
nam+0.266
anked+0.264
phan+0.251
pora+0.249
Neg logit contributions
isle-0.303
 Horowitz-0.294
ADA-0.294
VL-0.286
APTER-0.283
Token Xavier
Token ID30825
Feature activation+7.86e-01

Pos logit contributions
 hon+0.003
nam+0.003
anked+0.003
phan+0.003
pora+0.003
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.003
APTER-0.003
Token Rhodes
Token ID25656
Feature activation+8.77e-02

Pos logit contributions
 hon+0.163
nam+0.161
anked+0.159
phan+0.153
pora+0.152
Neg logit contributions
isle-0.107
 Horowitz-0.105
ADA-0.101
VL-0.098
��-0.097
Token,
Token ID11
Feature activation+5.00e-02

Pos logit contributions
 hon+0.014
nam+0.014
anked+0.013
phan+0.013
pora+0.013
Neg logit contributions
isle-0.016
 Horowitz-0.016
ADA-0.016
VL-0.015
APTER-0.015
Token Vikings
Token ID16420
Feature activation+3.79e-01

Pos logit contributions
 hon+0.009
nam+0.009
anked+0.009
phan+0.008
pora+0.008
Neg logit contributions
isle-0.008
 Horowitz-0.008
ADA-0.008
VL-0.008
APTER-0.008
Tokeniggs
Token ID20340
Feature activation+3.75e-01

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
pora+0.001
phan+0.001
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
hent-0.001
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
 hon+0.061
nam+0.060
anked+0.059
phan+0.056
pora+0.055
Neg logit contributions
isle-0.068
 Horowitz-0.066
ADA-0.066
VL-0.064
hent-0.063
Token
Token ID198
Feature activation+2.94e-02

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token
Token ID198
Feature activation+3.40e-02

Pos logit contributions
 hon+0.005
nam+0.005
anked+0.005
phan+0.005
pora+0.005
Neg logit contributions
isle-0.005
 Horowitz-0.005
ADA-0.005
VL-0.005
APTER-0.005
TokenR
Token ID49
Feature activation-2.27e-01

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.006
phan+0.005
pora+0.005
Neg logit contributions
isle-0.006
 Horowitz-0.006
ADA-0.006
VL-0.005
APTER-0.005
TokenANK
Token ID15154
Feature activation+1.66e+00

Pos logit contributions
isle+0.058
 Horowitz+0.057
ADA+0.057
VL+0.056
APTER+0.055
Neg logit contributions
 hon-0.020
nam-0.019
anked-0.019
phan-0.017
pora-0.016
Token 2
Token ID362
Feature activation+1.33e-01

Pos logit contributions
 hon+0.243
nam+0.237
anked+0.232
phan+0.222
pora+0.218
Neg logit contributions
isle-0.331
 Horowitz-0.322
ADA-0.320
VL-0.312
APTER-0.309
Token Richard
Token ID6219
Feature activation-4.57e-01

Pos logit contributions
 hon+0.023
nam+0.023
anked+0.022
phan+0.022
pora+0.021
Neg logit contributions
isle-0.023
 Horowitz-0.022
ADA-0.022
VL-0.021
APTER-0.021
Token Sherman
Token ID23198
Feature activation-7.06e-01

Pos logit contributions
isle+0.039
 Horowitz+0.037
ADA+0.036
VL+0.034
APTER+0.034
Neg logit contributions
 hon-0.119
nam-0.117
anked-0.116
phan-0.112
pora-0.110
Token,
Token ID11
Feature activation+3.22e-02

Pos logit contributions
isle+0.136
 Horowitz+0.132
ADA+0.131
VL+0.127
APTER+0.127
Neg logit contributions
 hon-0.108
nam-0.106
anked-0.104
phan-0.099
pora-0.097
Token Seahawks
Token ID16732
Feature activation+2.41e-01

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.005
phan+0.005
pora+0.005
Neg logit contributions
isle-0.005
 Horowitz-0.005
ADA-0.005
VL-0.005
APTER-0.005
Token a
Token ID257
Feature activation-3.33e-02

Pos logit contributions
isle+0.007
 Horowitz+0.007
ADA+0.007
VL+0.007
APTER+0.007
Neg logit contributions
 hon-0.007
nam-0.006
anked-0.006
phan-0.006
pora-0.006
Token thunder
Token ID18355
Feature activation-9.37e-01

Pos logit contributions
isle+0.006
 Horowitz+0.006
ADA+0.006
VL+0.006
APTER+0.006
Neg logit contributions
 hon-0.005
nam-0.005
anked-0.005
phan-0.005
pora-0.005
Tokenous
Token ID516
Feature activation-7.62e-01

Pos logit contributions
isle+0.103
 Horowitz+0.098
ADA+0.097
VL+0.092
APTER+0.091
Neg logit contributions
 hon-0.222
nam-0.217
anked-0.215
phan-0.208
pora-0.205
Token standing
Token ID5055
Feature activation-5.49e-01

Pos logit contributions
isle+0.133
 Horowitz+0.129
ADA+0.127
VL+0.123
APTER+0.122
Neg logit contributions
 hon-0.130
nam-0.128
anked-0.126
phan-0.120
pora-0.118
Token o
Token ID267
Feature activation-7.59e-02

Pos logit contributions
isle+0.096
 Horowitz+0.093
ADA+0.092
VL+0.089
hent+0.088
Neg logit contributions
nam-0.092
 hon-0.092
anked-0.090
phan-0.087
pora-0.085
Tokenvation
Token ID10473
Feature activation-1.95e+00

Pos logit contributions
isle+0.005
 Horowitz+0.004
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.021
nam-0.021
anked-0.021
phan-0.020
pora-0.020
Token.
Token ID13
Feature activation-2.37e-02

Pos logit contributions
isle+0.350
 Horowitz+0.340
ADA+0.336
VL+0.327
APTER+0.323
Neg logit contributions
 hon-0.319
nam-0.315
anked-0.309
phan-0.296
pora-0.291
Token Long
Token ID5882
Feature activation-3.54e-01

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
hent+0.004
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.004
phan-0.004
pora-0.004
Tokener
Token ID263
Feature activation+5.69e-01

Pos logit contributions
isle+0.062
 Horowitz+0.061
ADA+0.060
VL+0.059
APTER+0.058
Neg logit contributions
 hon-0.060
nam-0.059
anked-0.058
phan-0.055
pora-0.054
Tokenan
Token ID272
Feature activation+1.12e-01

Pos logit contributions
 hon+0.102
nam+0.100
anked+0.098
phan+0.094
pora+0.092
Neg logit contributions
isle-0.095
 Horowitz-0.092
ADA-0.092
VL-0.089
APTER-0.088
Token was
Token ID373
Feature activation+1.34e-01

Pos logit contributions
 hon+0.019
nam+0.018
anked+0.018
phan+0.017
pora+0.017
Neg logit contributions
isle-0.020
 Horowitz-0.019
ADA-0.019
VL-0.019
APTER-0.019
Token last
Token ID938
Feature activation-1.23e-01

Pos logit contributions
isle+0.045
 Horowitz+0.044
ADA+0.043
VL+0.042
APTER+0.041
Neg logit contributions
 hon-0.039
nam-0.039
anked-0.038
phan-0.037
pora-0.036
Token season
Token ID1622
Feature activation+1.15e-01

Pos logit contributions
isle+0.023
 Horowitz+0.022
ADA+0.022
VL+0.021
APTER+0.021
Neg logit contributions
 hon-0.020
nam-0.019
anked-0.019
phan-0.018
pora-0.018
Token.
Token ID13
Feature activation-2.44e-02

Pos logit contributions
 hon+0.020
nam+0.020
anked+0.019
phan+0.019
pora+0.018
Neg logit contributions
isle-0.020
 Horowitz-0.019
ADA-0.019
VL-0.018
hent-0.018
Token An
Token ID1052
Feature activation-2.90e-01

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.004
phan-0.004
pora-0.004
Token o
Token ID267
Feature activation-5.65e-02

Pos logit contributions
isle+0.054
 Horowitz+0.052
ADA+0.052
VL+0.050
hent+0.050
Neg logit contributions
 hon-0.046
nam-0.045
anked-0.044
phan-0.042
pora-0.042
Tokenvation
Token ID10473
Feature activation-1.94e+00

Pos logit contributions
isle+0.009
 Horowitz+0.009
ADA+0.008
VL+0.008
APTER+0.008
Neg logit contributions
 hon-0.011
nam-0.010
anked-0.010
phan-0.010
pora-0.010
Token of
Token ID286
Feature activation-5.88e-02

Pos logit contributions
isle+0.373
 Horowitz+0.363
ADA+0.359
VL+0.349
APTER+0.346
Neg logit contributions
 hon-0.293
nam-0.288
anked-0.283
phan-0.270
pora-0.266
Token thanks
Token ID5176
Feature activation+7.58e-01

Pos logit contributions
isle+0.011
ADA+0.010
 Horowitz+0.010
VL+0.010
hent+0.010
Neg logit contributions
 hon-0.010
nam-0.010
anked-0.009
phan-0.009
pora-0.009
Token from
Token ID422
Feature activation+1.95e-01

Pos logit contributions
 hon+0.116
nam+0.113
anked+0.111
phan+0.106
pora+0.104
Neg logit contributions
isle-0.146
 Horowitz-0.141
ADA-0.140
VL-0.137
APTER-0.135
Token Jets
Token ID14728
Feature activation+5.15e-01

Pos logit contributions
 hon+0.034
nam+0.033
anked+0.032
phan+0.031
pora+0.030
Neg logit contributions
isle-0.034
 Horowitz-0.033
ADA-0.033
VL-0.032
APTER-0.032
Token fans
Token ID3296
Feature activation+5.50e-01

Pos logit contributions
 hon+0.087
nam+0.085
anked+0.083
phan+0.080
pora+0.078
Neg logit contributions
isle-0.091
 Horowitz-0.088
ADA-0.088
VL-0.085
APTER-0.085
Token who
Token ID508
Feature activation+7.19e-02

Pos logit contributions
 hon+0.010
nam+0.009
anked+0.009
phan+0.009
pora+0.009
Neg logit contributions
isle-0.011
 Horowitz-0.010
ADA-0.010
VL-0.010
APTER-0.010
Token got
Token ID1392
Feature activation+3.22e-01

Pos logit contributions
 hon+0.012
nam+0.011
anked+0.011
phan+0.011
pora+0.010
Neg logit contributions
isle-0.013
 Horowitz-0.013
ADA-0.013
VL-0.012
APTER-0.012
Token a
Token ID257
Feature activation+4.63e-02

Pos logit contributions
 hon+0.055
nam+0.054
anked+0.053
phan+0.051
pora+0.049
Neg logit contributions
isle-0.056
 Horowitz-0.054
ADA-0.054
VL-0.053
APTER-0.052
Token standing
Token ID5055
Feature activation-4.93e-01

Pos logit contributions
 hon+0.008
nam+0.007
anked+0.007
phan+0.007
pora+0.007
Neg logit contributions
isle-0.008
 Horowitz-0.008
ADA-0.008
VL-0.008
hent-0.008
Token o
Token ID267
Feature activation-2.78e-02

Pos logit contributions
isle+0.086
 Horowitz+0.084
ADA+0.083
VL+0.080
��+0.079
Neg logit contributions
nam-0.083
 hon-0.082
anked-0.081
phan-0.079
pora-0.077
Tokenvation
Token ID10473
Feature activation-1.92e+00

Pos logit contributions
isle+0.002
 Horowitz+0.002
ADA+0.002
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.008
nam-0.008
anked-0.008
phan-0.008
pora-0.007
Token from
Token ID422
Feature activation+2.16e-01

Pos logit contributions
isle+0.349
 Horowitz+0.339
ADA+0.335
VL+0.326
APTER+0.322
Neg logit contributions
 hon-0.311
nam-0.306
anked-0.301
phan-0.288
pora-0.284
Token the
Token ID262
Feature activation+2.36e-02

Pos logit contributions
 hon+0.039
nam+0.037
anked+0.037
phan+0.035
pora+0.035
Neg logit contributions
isle-0.037
ADA-0.035
 Horowitz-0.035
VL-0.034
hent-0.034
Token capacity
Token ID5339
Feature activation+2.88e-03

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.004
pora+0.004
Neg logit contributions
isle-0.004
ADA-0.004
 Horowitz-0.004
VL-0.004
APTER-0.004
Token crowd
Token ID4315
Feature activation+1.07e-01

Pos logit contributions
 hon+0.000
nam+0.000
anked+0.000
phan+0.000
pora+0.000
Neg logit contributions
isle-0.001
 Horowitz-0.000
ADA-0.000
VL-0.000
APTER-0.000
Token inside
Token ID2641
Feature activation-1.55e-01

Pos logit contributions
 hon+0.018
nam+0.018
anked+0.018
phan+0.017
pora+0.017
Neg logit contributions
isle-0.019
 Horowitz-0.018
ADA-0.018
VL-0.017
APTER-0.017
Token 200
Token ID939
Feature activation-1.75e-01

Pos logit contributions
isle+0.017
 Horowitz+0.017
ADA+0.017
VL+0.016
APTER+0.016
Neg logit contributions
 hon-0.017
nam-0.016
anked-0.016
phan-0.015
pora-0.015
Token.
Token ID13
Feature activation-6.42e-03

Pos logit contributions
isle+0.030
 Horowitz+0.029
ADA+0.029
VL+0.028
hent+0.028
Neg logit contributions
 hon-0.030
nam-0.030
anked-0.029
phan-0.028
pora-0.028
Token
Token ID198
Feature activation+2.26e-02

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
Token
Token ID198
Feature activation-7.65e-03

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.004
pora+0.004
Neg logit contributions
isle-0.004
 Horowitz-0.004
ADA-0.004
VL-0.004
APTER-0.004
TokenUS
Token ID2937
Feature activation-8.53e-02

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
TokenADA
Token ID26853
Feature activation-1.90e+00

Pos logit contributions
isle+0.001
 Horowitz+0.000
ADA+0.000
VL-0.000
APTER-0.000
Neg logit contributions
 hon-0.029
nam-0.028
anked-0.028
phan-0.028
pora-0.027
Token announced
Token ID3414
Feature activation+1.34e-01

Pos logit contributions
isle+0.344
 Horowitz+0.334
ADA+0.331
VL+0.322
APTER+0.319
Neg logit contributions
 hon-0.314
nam-0.306
anked-0.302
phan-0.288
pora-0.283
Token last
Token ID938
Feature activation-6.65e-02

Pos logit contributions
 hon+0.022
nam+0.021
anked+0.021
phan+0.020
pora+0.020
Neg logit contributions
isle-0.025
 Horowitz-0.024
ADA-0.024
VL-0.023
APTER-0.023
Token week
Token ID1285
Feature activation+3.83e-01

Pos logit contributions
isle+0.014
 Horowitz+0.014
ADA+0.014
VL+0.013
hent+0.013
Neg logit contributions
 hon-0.009
nam-0.009
anked-0.008
phan-0.008
pora-0.008
Token that
Token ID326
Feature activation+1.26e-02

Pos logit contributions
 hon+0.062
nam+0.060
anked+0.060
phan+0.057
pora+0.056
Neg logit contributions
isle-0.070
 Horowitz-0.068
ADA-0.067
VL-0.066
APTER-0.065
Token Les
Token ID11732
Feature activation-1.88e-01

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token months
Token ID1933
Feature activation-2.46e-01

Pos logit contributions
 hon+0.056
nam+0.055
anked+0.054
phan+0.052
pora+0.052
Neg logit contributions
isle-0.046
 Horowitz-0.045
ADA-0.044
VL-0.043
APTER-0.042
Token without
Token ID1231
Feature activation-6.16e-01

Pos logit contributions
isle+0.043
 Horowitz+0.042
ADA+0.041
VL+0.040
hent+0.040
Neg logit contributions
 hon-0.042
nam-0.041
anked-0.040
phan-0.039
pora-0.038
Token rain
Token ID6290
Feature activation-4.00e-01

Pos logit contributions
isle+0.106
 Horowitz+0.103
ADA+0.103
VL+0.099
APTER+0.098
Neg logit contributions
 hon-0.105
nam-0.104
anked-0.102
phan-0.098
pora-0.096
Token.
Token ID13
Feature activation+7.03e-04

Pos logit contributions
isle+0.070
 Horowitz+0.067
ADA+0.067
VL+0.065
hent+0.064
Neg logit contributions
 hon-0.068
nam-0.067
anked-0.066
phan-0.063
pora-0.062
Token Th
Token ID536
Feature activation+4.28e-01

Pos logit contributions
 hon+0.000
nam+0.000
anked+0.000
phan+0.000
pora+0.000
Neg logit contributions
isle-0.000
 Horowitz-0.000
ADA-0.000
VL-0.000
hent-0.000
Tokenirst
Token ID667
Feature activation-1.88e+00

Pos logit contributions
 hon+0.080
nam+0.079
anked+0.078
phan+0.075
pora+0.073
Neg logit contributions
isle-0.067
 Horowitz-0.065
ADA-0.065
VL-0.063
APTER-0.062
Tokeny
Token ID88
Feature activation-3.01e-01

Pos logit contributions
isle+0.301
 Horowitz+0.290
ADA+0.287
VL+0.278
APTER+0.275
Neg logit contributions
 hon-0.350
nam-0.343
anked-0.338
phan-0.325
pora-0.320
Token Marshall
Token ID13606
Feature activation+1.25e-02

Pos logit contributions
isle+0.048
 Horowitz+0.047
ADA+0.046
VL+0.045
APTER+0.044
Neg logit contributions
 hon-0.056
nam-0.055
anked-0.054
phan-0.052
pora-0.051
Tokenese
Token ID2771
Feature activation-7.08e-01

Pos logit contributions
 hon+0.003
nam+0.003
anked+0.003
phan+0.002
pora+0.002
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token,
Token ID11
Feature activation+4.99e-02

Pos logit contributions
isle+0.125
 Horowitz+0.121
ADA+0.121
VL+0.117
APTER+0.116
Neg logit contributions
 hon-0.119
nam-0.117
anked-0.115
phan-0.110
pora-0.108
Token many
Token ID867
Feature activation+1.36e-01

Pos logit contributions
 hon+0.008
nam+0.008
anked+0.008
phan+0.008
pora+0.007
Neg logit contributions
isle-0.009
 Horowitz-0.009
ADA-0.009
VL-0.008
hent-0.008
Tokenlam
Token ID2543
Feature activation-6.54e-01

Pos logit contributions
 hon+0.078
nam+0.076
anked+0.075
phan+0.073
pora+0.072
Neg logit contributions
isle-0.041
ADA-0.039
 Horowitz-0.038
VL-0.037
hent-0.036
Token.
Token ID13
Feature activation+3.46e-02

Pos logit contributions
isle+0.117
 Horowitz+0.113
ADA+0.113
VL+0.109
APTER+0.108
Neg logit contributions
 hon-0.109
nam-0.106
anked-0.105
phan-0.100
pora-0.099
Token This
Token ID770
Feature activation+1.62e-01

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.006
phan+0.006
pora+0.005
Neg logit contributions
isle-0.006
 Horowitz-0.006
ADA-0.006
VL-0.005
APTER-0.005
Token second
Token ID1218
Feature activation+2.96e-01

Pos logit contributions
 hon+0.026
nam+0.025
anked+0.025
phan+0.024
pora+0.023
Neg logit contributions
isle-0.030
 Horowitz-0.029
ADA-0.029
VL-0.028
APTER-0.028
Token US
Token ID1294
Feature activation+6.17e-01

Pos logit contributions
 hon+0.043
nam+0.042
anked+0.041
phan+0.039
pora+0.038
Neg logit contributions
isle-0.059
 Horowitz-0.058
ADA-0.057
VL-0.056
APTER-0.055
TokenADA
Token ID26853
Feature activation-1.87e+00

Pos logit contributions
 hon+0.208
nam+0.205
anked+0.204
phan+0.199
pora+0.198
Neg logit contributions
isle-0.006
 Horowitz-0.002
ADA-0.001
VL+0.002
APTER+0.003
Token violation
Token ID8747
Feature activation+2.92e-01

Pos logit contributions
isle+0.392
 Horowitz+0.382
ADA+0.380
VL+0.370
APTER+0.367
Neg logit contributions
 hon-0.254
nam-0.246
anked-0.242
phan-0.228
pora-0.224
Token will
Token ID481
Feature activation-3.21e-01

Pos logit contributions
 hon+0.046
nam+0.045
anked+0.044
phan+0.042
pora+0.042
Neg logit contributions
isle-0.055
 Horowitz-0.053
ADA-0.053
VL-0.051
hent-0.051
Token certainly
Token ID3729
Feature activation-4.29e-01

Pos logit contributions
isle+0.061
 Horowitz+0.060
ADA+0.059
VL+0.058
hent+0.057
Neg logit contributions
 hon-0.049
nam-0.048
anked-0.047
phan-0.045
pora-0.044
Token put
Token ID1234
Feature activation+6.76e-02

Pos logit contributions
isle+0.083
 Horowitz+0.081
ADA+0.081
VL+0.078
APTER+0.077
Neg logit contributions
 hon-0.065
nam-0.063
anked-0.062
phan-0.059
pora-0.058
Token additional
Token ID3224
Feature activation+2.89e-01

Pos logit contributions
 hon+0.012
nam+0.012
anked+0.012
phan+0.011
pora+0.011
Neg logit contributions
isle-0.011
 Horowitz-0.011
ADA-0.010
VL-0.010
APTER-0.010
Token denied
Token ID6699
Feature activation+3.86e-01

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.003
pora+0.003
Neg logit contributions
isle-0.004
 Horowitz-0.004
ADA-0.004
VL-0.004
hent-0.004
Token the
Token ID262
Feature activation+2.14e-02

Pos logit contributions
 hon+0.065
nam+0.064
anked+0.063
phan+0.061
pora+0.060
Neg logit contributions
isle-0.068
 Horowitz-0.066
ADA-0.065
VL-0.063
hent-0.063
Token rape
Token ID7262
Feature activation+1.99e-01

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.004
pora+0.004
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.003
hent-0.003
Token and
Token ID290
Feature activation+2.81e-02

Pos logit contributions
 hon+0.036
nam+0.035
anked+0.034
phan+0.033
pora+0.033
Neg logit contributions
isle-0.033
 Horowitz-0.032
ADA-0.032
VL-0.031
hent-0.030
Token mol
Token ID18605
Feature activation+2.08e-01

Pos logit contributions
 hon+0.004
anked+0.004
nam+0.004
phan+0.004
pora+0.004
Neg logit contributions
isle-0.005
 Horowitz-0.005
ADA-0.005
VL-0.005
hent-0.005
Tokenestation
Token ID27364
Feature activation-1.83e+00

Pos logit contributions
 hon+0.063
nam+0.062
anked+0.061
phan+0.060
pora+0.059
Neg logit contributions
isle-0.009
 Horowitz-0.008
ADA-0.008
VL-0.007
APTER-0.006
Token charges
Token ID4530
Feature activation-2.39e-01

Pos logit contributions
isle+0.291
 Horowitz+0.282
ADA+0.279
VL+0.269
hent+0.266
Neg logit contributions
 hon-0.340
nam-0.333
anked-0.329
phan-0.317
pora-0.313
Token,
Token ID11
Feature activation+1.52e-02

Pos logit contributions
isle+0.038
 Horowitz+0.037
ADA+0.037
VL+0.036
APTER+0.035
Neg logit contributions
 hon-0.044
nam-0.043
anked-0.043
phan-0.041
pora-0.040
Token saying
Token ID2282
Feature activation+3.58e-01

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.003
hent-0.003
Token it
Token ID340
Feature activation-1.99e-01

Pos logit contributions
 hon+0.060
nam+0.059
anked+0.058
phan+0.056
pora+0.055
Neg logit contributions
isle-0.063
 Horowitz-0.061
ADA-0.061
VL-0.059
APTER-0.059
Token is
Token ID318
Feature activation-5.98e-02

Pos logit contributions
isle+0.038
 Horowitz+0.037
ADA+0.037
VL+0.036
APTER+0.035
Neg logit contributions
 hon-0.031
nam-0.030
anked-0.029
phan-0.028
pora-0.027
Token don
Token ID836
Feature activation+1.64e-01

Pos logit contributions
 hon+0.011
nam+0.011
anked+0.011
phan+0.010
pora+0.010
Neg logit contributions
isle-0.014
 Horowitz-0.013
ADA-0.013
VL-0.013
APTER-0.013
Token't
Token ID470
Feature activation-1.26e-01

Pos logit contributions
 hon+0.027
nam+0.026
anked+0.026
phan+0.025
pora+0.024
Neg logit contributions
isle-0.030
 Horowitz-0.029
ADA-0.029
VL-0.028
APTER-0.027
Token have
Token ID423
Feature activation-1.70e-01

Pos logit contributions
isle+0.024
 Horowitz+0.023
ADA+0.023
VL+0.022
APTER+0.022
Neg logit contributions
 hon-0.019
nam-0.019
anked-0.019
phan-0.018
pora-0.018
Token Q
Token ID1195
Feature activation-4.22e-01

Pos logit contributions
isle+0.030
 Horowitz+0.029
ADA+0.029
VL+0.028
APTER+0.028
Neg logit contributions
 hon-0.029
nam-0.028
anked-0.028
phan-0.027
pora-0.026
Tokend
Token ID67
Feature activation-4.04e-01

Pos logit contributions
isle+0.068
 Horowitz+0.065
ADA+0.065
VL+0.063
APTER+0.062
Neg logit contributions
 hon-0.078
nam-0.077
anked-0.075
phan-0.072
pora-0.071
Tokenoba
Token ID19981
Feature activation-1.81e+00

Pos logit contributions
isle+0.019
 Horowitz+0.017
ADA+0.016
VL+0.014
APTER+0.013
Neg logit contributions
 hon-0.121
nam-0.119
anked-0.118
phan-0.115
pora-0.114
Token there
Token ID612
Feature activation-1.66e-01

Pos logit contributions
isle+0.318
 Horowitz+0.308
ADA+0.306
VL+0.296
APTER+0.294
Neg logit contributions
 hon-0.306
nam-0.299
anked-0.295
phan-0.282
pora-0.277
Token."
Token ID526
Feature activation-1.94e-02

Pos logit contributions
isle+0.029
 Horowitz+0.028
ADA+0.028
VL+0.027
APTER+0.026
Neg logit contributions
 hon-0.029
nam-0.028
anked-0.028
phan-0.026
pora-0.026
Token\n
Token ID198
Feature activation-4.73e-02

Pos logit contributions
isle+0.003
 Horowitz+0.003
ADA+0.003
VL+0.003
hent+0.003
Neg logit contributions
 hon-0.003
nam-0.003
anked-0.003
phan-0.003
pora-0.003
Token\n
Token ID198
Feature activation-6.56e-02

Pos logit contributions
isle+0.008
 Horowitz+0.008
ADA+0.008
VL+0.008
APTER+0.007
Neg logit contributions
 hon-0.008
nam-0.008
anked-0.008
phan-0.008
pora-0.008
TokenAnt
Token ID13217
Feature activation-5.48e-01

Pos logit contributions
isle+0.011
 Horowitz+0.011
ADA+0.011
VL+0.011
APTER+0.011
Neg logit contributions
 hon-0.011
nam-0.011
anked-0.011
phan-0.010
pora-0.010
Token of
Token ID286
Feature activation-5.12e-02

Pos logit contributions
isle+0.012
 Horowitz+0.011
ADA+0.011
VL+0.011
APTER+0.011
Neg logit contributions
 hon-0.013
nam-0.012
anked-0.012
phan-0.012
pora-0.012
Token and
Token ID290
Feature activation+1.77e-02

Pos logit contributions
isle+0.009
ADA+0.009
 Horowitz+0.009
VL+0.008
APTER+0.008
Neg logit contributions
 hon-0.009
nam-0.009
anked-0.008
phan-0.008
pora-0.008
Token mock
Token ID15290
Feature activation+1.54e-01

Pos logit contributions
 hon+0.003
nam+0.003
anked+0.003
phan+0.003
pora+0.003
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.003
APTER-0.003
Token Child
Token ID5932
Feature activation+2.09e-01

Pos logit contributions
 hon+0.027
nam+0.027
anked+0.026
phan+0.025
pora+0.024
Neg logit contributions
isle-0.026
 Horowitz-0.025
ADA-0.025
VL-0.024
APTER-0.024
Token Mol
Token ID17958
Feature activation-3.17e-02

Pos logit contributions
 hon+0.028
nam+0.027
anked+0.027
phan+0.025
pora+0.025
Neg logit contributions
isle-0.044
 Horowitz-0.043
ADA-0.043
VL-0.042
APTER-0.041
Tokenestation
Token ID27364
Feature activation-1.80e+00

Pos logit contributions
isle+0.003
 Horowitz+0.003
ADA+0.003
VL+0.003
APTER+0.003
Neg logit contributions
 hon-0.008
nam-0.008
anked-0.008
phan-0.007
pora-0.007
Token and
Token ID290
Feature activation+8.28e-03

Pos logit contributions
isle+0.323
 Horowitz+0.313
ADA+0.311
VL+0.302
APTER+0.300
Neg logit contributions
 hon-0.301
nam-0.294
anked-0.289
phan-0.276
pora-0.271
Token Rape
Token ID35286
Feature activation+3.52e-02

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
APTER-0.001
Token speaks
Token ID9209
Feature activation+3.76e-01

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.006
phan+0.005
pora+0.005
Neg logit contributions
isle-0.006
 Horowitz-0.006
ADA-0.006
VL-0.006
APTER-0.006
Token volumes
Token ID15343
Feature activation-3.56e-01

Pos logit contributions
 hon+0.077
nam+0.075
anked+0.074
phan+0.072
pora+0.071
Neg logit contributions
isle-0.053
 Horowitz-0.051
ADA-0.051
VL-0.049
APTER-0.048
Token about
Token ID546
Feature activation-2.33e-01

Pos logit contributions
isle+0.060
 Horowitz+0.058
ADA+0.058
VL+0.056
hent+0.056
Neg logit contributions
 hon-0.063
nam-0.062
anked-0.061
phan-0.058
pora-0.057
Token (
Token ID357
Feature activation-1.80e-02

Pos logit contributions
 hon+0.055
nam+0.054
anked+0.053
phan+0.051
pora+0.050
Neg logit contributions
isle-0.059
 Horowitz-0.057
ADA-0.057
VL-0.055
APTER-0.055
Tokenin
Token ID259
Feature activation-9.59e-02

Pos logit contributions
isle+0.003
 Horowitz+0.003
ADA+0.003
VL+0.003
APTER+0.003
Neg logit contributions
 hon-0.003
nam-0.003
anked-0.003
phan-0.003
pora-0.003
Token true
Token ID2081
Feature activation-8.46e-02

Pos logit contributions
isle+0.018
 Horowitz+0.017
ADA+0.017
VL+0.016
APTER+0.016
Neg logit contributions
 hon-0.016
nam-0.015
anked-0.015
phan-0.014
pora-0.014
Token Moore
Token ID8877
Feature activation-3.22e-01

Pos logit contributions
isle+0.015
 Horowitz+0.014
ADA+0.014
VL+0.014
hent+0.014
Neg logit contributions
 hon-0.014
nam-0.014
anked-0.014
phan-0.013
pora-0.013
Token doubles
Token ID21938
Feature activation-1.15e-02

Pos logit contributions
isle+0.049
 Horowitz+0.048
ADA+0.047
APTER+0.046
VL+0.046
Neg logit contributions
 hon-0.061
nam-0.060
anked-0.059
phan-0.057
pora-0.056
Tokenpeak
Token ID36729
Feature activation-1.79e+00

Pos logit contributions
isle+0.000
 Horowitz+0.000
ADA+0.000
VL+0.000
APTER+0.000
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.004
phan-0.003
pora-0.003
Token,
Token ID11
Feature activation-3.34e-02

Pos logit contributions
isle+0.328
 Horowitz+0.318
ADA+0.315
VL+0.307
hent+0.304
Neg logit contributions
 hon-0.291
nam-0.285
anked-0.280
phan-0.268
pora-0.263
Token everyone
Token ID2506
Feature activation-2.73e-01

Pos logit contributions
isle+0.006
 Horowitz+0.006
ADA+0.006
VL+0.006
hent+0.006
Neg logit contributions
 hon-0.006
nam-0.005
anked-0.005
phan-0.005
pora-0.005
Token keeps
Token ID7622
Feature activation+2.43e-01

Pos logit contributions
isle+0.051
 Horowitz+0.050
ADA+0.049
VL+0.048
APTER+0.048
Neg logit contributions
 hon-0.043
nam-0.042
anked-0.042
phan-0.040
pora-0.039
Token referring
Token ID9759
Feature activation-1.84e-01

Pos logit contributions
 hon+0.037
nam+0.036
anked+0.035
phan+0.033
pora+0.033
Neg logit contributions
isle-0.047
 Horowitz-0.046
ADA-0.046
VL-0.044
APTER-0.044
Token to
Token ID284
Feature activation-6.74e-02

Pos logit contributions
isle+0.033
 Horowitz+0.032
ADA+0.031
VL+0.030
APTER+0.030
Neg logit contributions
 hon-0.031
nam-0.030
anked-0.030
phan-0.028
pora-0.028
Token under
Token ID739
Feature activation-1.73e-01

Pos logit contributions
 hon+0.096
nam+0.093
anked+0.092
phan+0.088
pora+0.086
Neg logit contributions
isle-0.110
 Horowitz-0.106
ADA-0.106
VL-0.102
APTER-0.102
Token way
Token ID835
Feature activation+4.76e-01

Pos logit contributions
isle+0.035
 Horowitz+0.034
ADA+0.034
VL+0.033
APTER+0.033
Neg logit contributions
 hon-0.025
nam-0.024
anked-0.024
phan-0.023
pora-0.022
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
 hon+0.081
nam+0.079
anked+0.078
phan+0.074
pora+0.073
Neg logit contributions
isle-0.084
 Horowitz-0.081
ADA-0.081
VL-0.078
APTER-0.077
Token Post
Token ID2947
Feature activation-7.83e-02

Pos logit contributions
isle+0.003
 Horowitz+0.003
ADA+0.003
VL+0.002
hent+0.002
Neg logit contributions
 hon-0.003
nam-0.003
anked-0.003
phan-0.002
pora-0.002
Token-
Token ID12
Feature activation-5.01e-02

Pos logit contributions
isle+0.014
 Horowitz+0.013
ADA+0.013
VL+0.013
APTER+0.013
Neg logit contributions
 hon-0.013
nam-0.013
anked-0.013
phan-0.012
pora-0.012
Tokenpeak
Token ID36729
Feature activation-1.79e+00

Pos logit contributions
isle+0.008
 Horowitz+0.008
ADA+0.008
VL+0.008
APTER+0.008
Neg logit contributions
 hon-0.009
nam-0.009
anked-0.009
phan-0.008
pora-0.008
Token:
Token ID25
Feature activation+1.32e-01

Pos logit contributions
isle+0.326
 Horowitz+0.316
ADA+0.314
VL+0.306
APTER+0.302
Neg logit contributions
 hon-0.291
nam-0.284
anked-0.280
phan-0.267
pora-0.262
Token Pand
Token ID16492
Feature activation+2.69e-01

Pos logit contributions
 hon+0.022
nam+0.022
anked+0.022
phan+0.021
pora+0.020
Neg logit contributions
isle-0.023
 Horowitz-0.022
ADA-0.022
VL-0.021
APTER-0.021
Tokenemic
Token ID5314
Feature activation+5.71e-01

Pos logit contributions
 hon+0.028
nam+0.027
anked+0.026
phan+0.024
pora+0.023
Neg logit contributions
isle-0.065
 Horowitz-0.064
ADA-0.064
VL-0.062
APTER-0.062
Token activity
Token ID3842
Feature activation+1.17e-01

Pos logit contributions
 hon+0.093
nam+0.091
anked+0.089
phan+0.085
pora+0.084
Neg logit contributions
isle-0.104
 Horowitz-0.101
ADA-0.101
VL-0.097
APTER-0.097
Token appears
Token ID3568
Feature activation+3.95e-01

Pos logit contributions
 hon+0.019
nam+0.018
anked+0.018
phan+0.017
pora+0.017
Neg logit contributions
isle-0.022
 Horowitz-0.021
ADA-0.021
VL-0.020
APTER-0.020
Token of
Token ID286
Feature activation-1.03e-02

Pos logit contributions
isle+0.012
 Horowitz+0.012
ADA+0.012
VL+0.011
APTER+0.011
Neg logit contributions
 hon-0.012
nam-0.012
anked-0.012
phan-0.011
pora-0.011
Token rape
Token ID7262
Feature activation+2.22e-01

Pos logit contributions
isle+0.002
 Horowitz+0.002
ADA+0.002
VL+0.002
hent+0.002
Neg logit contributions
 hon-0.002
anked-0.002
nam-0.002
phan-0.002
pora-0.002
Token and
Token ID290
Feature activation+5.41e-02

Pos logit contributions
 hon+0.036
nam+0.035
anked+0.035
phan+0.033
pora+0.032
Neg logit contributions
isle-0.041
 Horowitz-0.039
ADA-0.039
VL-0.038
hent-0.038
Token sexual
Token ID3206
Feature activation+2.12e-01

Pos logit contributions
 hon+0.009
nam+0.009
anked+0.009
phan+0.008
pora+0.008
Neg logit contributions
isle-0.010
 Horowitz-0.010
ADA-0.009
VL-0.009
hent-0.009
Token mol
Token ID18605
Feature activation+2.58e-01

Pos logit contributions
 hon+0.040
nam+0.039
anked+0.039
phan+0.037
pora+0.037
Neg logit contributions
isle-0.033
 Horowitz-0.032
ADA-0.032
VL-0.031
APTER-0.031
Tokenestation
Token ID27364
Feature activation-1.78e+00

Pos logit contributions
 hon+0.077
nam+0.077
anked+0.076
phan+0.074
pora+0.073
Neg logit contributions
isle-0.012
 Horowitz-0.010
ADA-0.010
VL-0.009
APTER-0.008
Token.
Token ID13
Feature activation+3.76e-02

Pos logit contributions
isle+0.308
 Horowitz+0.298
ADA+0.296
VL+0.286
hent+0.284
Neg logit contributions
 hon-0.306
nam-0.301
anked-0.296
phan-0.285
pora-0.279
Token The
Token ID383
Feature activation-2.74e-02

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.006
phan+0.006
pora+0.006
Neg logit contributions
isle-0.007
 Horowitz-0.006
ADA-0.006
VL-0.006
APTER-0.006
Token 39
Token ID5014
Feature activation+1.57e-01

Pos logit contributions
isle+0.005
 Horowitz+0.005
ADA+0.005
VL+0.005
hent+0.005
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.004
phan-0.004
pora-0.004
Token-
Token ID12
Feature activation+2.04e-02

Pos logit contributions
 hon+0.028
nam+0.027
anked+0.027
phan+0.026
pora+0.025
Neg logit contributions
isle-0.026
 Horowitz-0.026
ADA-0.025
VL-0.025
APTER-0.024
Tokenyear
Token ID1941
Feature activation+3.43e-02

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.003
pora+0.003
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.003
APTER-0.003
Token control
Token ID1630
Feature activation+9.30e-02

Pos logit contributions
isle+0.032
 Horowitz+0.031
ADA+0.031
VL+0.030
APTER+0.030
Neg logit contributions
 hon-0.023
nam-0.022
anked-0.022
phan-0.020
pora-0.020
Token.
Token ID13
Feature activation-3.06e-02

Pos logit contributions
 hon+0.016
nam+0.016
anked+0.016
phan+0.015
pora+0.015
Neg logit contributions
isle-0.016
 Horowitz-0.015
ADA-0.015
VL-0.015
APTER-0.015
Token (
Token ID357
Feature activation-1.48e-03

Pos logit contributions
isle+0.005
 Horowitz+0.005
ADA+0.005
VL+0.005
APTER+0.005
Neg logit contributions
 hon-0.005
nam-0.005
anked-0.005
phan-0.005
pora-0.005
TokenMan
Token ID5124
Feature activation+3.67e-01

Pos logit contributions
isle+0.000
 Horowitz+0.000
ADA+0.000
VL+0.000
APTER+0.000
Neg logit contributions
 hon-0.000
nam-0.000
anked-0.000
phan-0.000
pora-0.000
Tokenit
Token ID270
Feature activation-3.67e-01

Pos logit contributions
 hon+0.077
nam+0.075
anked+0.075
phan+0.072
pora+0.071
Neg logit contributions
isle-0.051
 Horowitz-0.048
ADA-0.048
VL-0.046
hent-0.046
Tokenoba
Token ID19981
Feature activation-1.78e+00

Pos logit contributions
isle+0.017
 Horowitz+0.015
ADA+0.014
VL+0.012
APTER+0.012
Neg logit contributions
 hon-0.110
nam-0.109
anked-0.108
phan-0.105
pora-0.104
Token Chambers
Token ID30717
Feature activation+2.57e-01

Pos logit contributions
isle+0.313
 Horowitz+0.303
ADA+0.300
VL+0.291
hent+0.289
Neg logit contributions
 hon-0.303
nam-0.296
anked-0.291
phan-0.278
pora-0.273
Token of
Token ID286
Feature activation-7.46e-02

Pos logit contributions
 hon+0.046
nam+0.044
anked+0.044
phan+0.042
pora+0.041
Neg logit contributions
isle-0.044
 Horowitz-0.042
ADA-0.042
VL-0.041
APTER-0.040
Token Commerce
Token ID16127
Feature activation+1.89e-01

Pos logit contributions
isle+0.014
 Horowitz+0.013
ADA+0.013
VL+0.013
APTER+0.013
Neg logit contributions
 hon-0.012
nam-0.012
anked-0.012
phan-0.011
pora-0.011
Token)
Token ID8
Feature activation+1.44e-01

Pos logit contributions
 hon+0.028
nam+0.027
anked+0.026
phan+0.025
pora+0.025
Neg logit contributions
isle-0.037
 Horowitz-0.036
ADA-0.036
VL-0.035
APTER-0.035
Token "
Token ID366
Feature activation-1.17e-01

Pos logit contributions
 hon+0.025
nam+0.024
anked+0.024
phan+0.023
pora+0.022
Neg logit contributions
isle-0.025
 Horowitz-0.024
ADA-0.024
VL-0.023
hent-0.023
Token-
Token ID12
Feature activation-4.95e-02

Pos logit contributions
isle+0.006
 Horowitz+0.006
ADA+0.006
VL+0.006
APTER+0.006
Neg logit contributions
 hon-0.006
nam-0.006
anked-0.006
phan-0.006
pora-0.006
Tokenstrength
Token ID41402
Feature activation-4.46e-01

Pos logit contributions
isle+0.007
 Horowitz+0.007
ADA+0.007
VL+0.006
APTER+0.006
Neg logit contributions
 hon-0.010
nam-0.010
anked-0.010
phan-0.010
pora-0.009
Token,
Token ID11
Feature activation-2.47e-02

Pos logit contributions
isle+0.090
 Horowitz+0.088
ADA+0.087
VL+0.085
hent+0.084
Neg logit contributions
 hon-0.064
nam-0.062
anked-0.061
phan-0.058
pora-0.057
Token military
Token ID2422
Feature activation+8.54e-02

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.004
phan-0.004
pora-0.004
Token-
Token ID12
Feature activation-5.58e-02

Pos logit contributions
 hon+0.015
nam+0.015
anked+0.015
phan+0.014
pora+0.014
Neg logit contributions
isle-0.014
 Horowitz-0.014
ADA-0.014
VL-0.013
APTER-0.013
Tokengrade
Token ID9526
Feature activation-1.78e+00

Pos logit contributions
isle+0.002
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.018
nam-0.017
anked-0.017
phan-0.017
pora-0.017
Token,
Token ID11
Feature activation-3.70e-02

Pos logit contributions
isle+0.355
 Horowitz+0.345
ADA+0.343
VL+0.333
APTER+0.330
Neg logit contributions
 hon-0.257
nam-0.253
anked-0.248
phan-0.237
pora-0.232
Token aluminum
Token ID15955
Feature activation+3.41e-01

Pos logit contributions
isle+0.008
 Horowitz+0.007
ADA+0.007
VL+0.007
APTER+0.007
Neg logit contributions
 hon-0.005
nam-0.005
anked-0.005
phan-0.005
pora-0.005
Token alloy
Token ID36186
Feature activation-1.82e-01

Pos logit contributions
 hon+0.058
nam+0.057
anked+0.056
phan+0.053
pora+0.052
Neg logit contributions
isle-0.060
 Horowitz-0.058
ADA-0.058
VL-0.056
APTER-0.055
Token that
Token ID326
Feature activation-1.01e-03

Pos logit contributions
isle+0.034
 Horowitz+0.033
ADA+0.032
VL+0.032
hent+0.031
Neg logit contributions
 hon-0.029
nam-0.029
anked-0.028
phan-0.027
pora-0.026
Token helps
Token ID5419
Feature activation+9.24e-02

Pos logit contributions
isle+0.000
 Horowitz+0.000
ADA+0.000
VL+0.000
APTER+0.000
Neg logit contributions
 hon-0.000
nam-0.000
anked-0.000
phan-0.000
pora-0.000
Token of
Token ID286
Feature activation-7.47e-02

Pos logit contributions
 hon+0.160
nam+0.157
anked+0.155
phan+0.149
pora+0.146
Neg logit contributions
isle-0.143
 Horowitz-0.139
ADA-0.137
VL-0.133
APTER-0.132
Token people
Token ID661
Feature activation-1.26e-01

Pos logit contributions
isle+0.013
 Horowitz+0.013
ADA+0.013
VL+0.012
hent+0.012
Neg logit contributions
 hon-0.013
nam-0.012
anked-0.012
phan-0.012
pora-0.011
Token travelling
Token ID16574
Feature activation+2.91e-01

Pos logit contributions
isle+0.023
 Horowitz+0.023
ADA+0.022
VL+0.022
hent+0.021
Neg logit contributions
 hon-0.020
nam-0.020
anked-0.019
phan-0.018
pora-0.018
Token off
Token ID572
Feature activation+4.73e-01

Pos logit contributions
 hon+0.047
nam+0.046
anked+0.046
phan+0.044
pora+0.043
Neg logit contributions
isle-0.053
 Horowitz-0.052
ADA-0.051
VL-0.049
hent-0.049
Token-
Token ID12
Feature activation-5.29e-02

Pos logit contributions
 hon+0.081
nam+0.079
anked+0.078
phan+0.075
pora+0.073
Neg logit contributions
isle-0.083
 Horowitz-0.080
ADA-0.079
VL-0.077
APTER-0.076
Tokenpeak
Token ID36729
Feature activation-1.77e+00

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.014
nam-0.014
anked-0.014
phan-0.013
pora-0.013
Token has
Token ID468
Feature activation-4.61e-02

Pos logit contributions
isle+0.325
 Horowitz+0.314
ADA+0.310
VL+0.303
hent+0.297
Neg logit contributions
 hon-0.284
nam-0.282
anked-0.276
phan-0.265
pora-0.260
Token almost
Token ID2048
Feature activation+2.40e-01

Pos logit contributions
isle+0.009
 Horowitz+0.008
ADA+0.008
VL+0.008
hent+0.008
Neg logit contributions
 hon-0.007
nam-0.007
anked-0.007
phan-0.007
pora-0.007
Token doubled
Token ID15229
Feature activation+2.53e-01

Pos logit contributions
 hon+0.034
nam+0.033
anked+0.033
phan+0.031
pora+0.031
Neg logit contributions
isle-0.049
 Horowitz-0.047
ADA-0.047
VL-0.046
hent-0.045
Token,
Token ID11
Feature activation-3.75e-02

Pos logit contributions
 hon+0.043
nam+0.042
anked+0.042
phan+0.040
pora+0.040
Neg logit contributions
isle-0.044
 Horowitz-0.042
ADA-0.042
VL-0.041
APTER-0.040
Token with
Token ID351
Feature activation-3.79e-02

Pos logit contributions
isle+0.007
 Horowitz+0.007
ADA+0.007
VL+0.007
hent+0.006
Neg logit contributions
 hon-0.006
nam-0.006
anked-0.006
phan-0.006
pora-0.005
Token each
Token ID1123
Feature activation+4.43e-02

Pos logit contributions
isle+0.005
 Horowitz+0.005
ADA+0.005
VL+0.005
APTER+0.005
Neg logit contributions
 hon-0.005
nam-0.005
anked-0.005
phan-0.004
pora-0.004
Token sip
Token ID31145
Feature activation+1.48e-02

Pos logit contributions
 hon+0.007
nam+0.007
anked+0.006
phan+0.006
pora+0.006
Neg logit contributions
isle-0.009
 Horowitz-0.008
ADA-0.008
VL-0.008
hent-0.008
Token a
Token ID257
Feature activation+2.98e-02

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.003
hent-0.002
Token bright
Token ID6016
Feature activation+6.20e-01

Pos logit contributions
 hon+0.005
nam+0.005
anked+0.005
phan+0.005
pora+0.004
Neg logit contributions
isle-0.005
 Horowitz-0.005
ADA-0.005
VL-0.005
APTER-0.005
Token clear
Token ID1598
Feature activation+4.02e-01

Pos logit contributions
 hon+0.103
nam+0.101
anked+0.099
phan+0.095
pora+0.093
Neg logit contributions
isle-0.111
 Horowitz-0.108
ADA-0.107
VL-0.104
APTER-0.103
Token wedge
Token ID35901
Feature activation-1.77e+00

Pos logit contributions
 hon+0.067
nam+0.066
anked+0.065
phan+0.062
pora+0.061
Neg logit contributions
isle-0.072
 Horowitz-0.069
ADA-0.069
VL-0.067
hent-0.066
Token of
Token ID286
Feature activation-2.23e-02

Pos logit contributions
isle+0.295
 Horowitz+0.285
ADA+0.283
VL+0.274
hent+0.271
Neg logit contributions
 hon-0.314
nam-0.308
anked-0.304
phan-0.292
pora-0.287
Token lime
Token ID28738
Feature activation+6.74e-01

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
hent+0.004
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.003
phan-0.003
pora-0.003
Token rises
Token ID16736
Feature activation-1.49e-01

Pos logit contributions
 hon+0.108
nam+0.105
anked+0.103
phan+0.098
pora+0.097
Neg logit contributions
isle-0.125
 Horowitz-0.122
ADA-0.121
VL-0.117
APTER-0.116
Token up
Token ID510
Feature activation+4.03e-01

Pos logit contributions
isle+0.028
 Horowitz+0.027
ADA+0.027
VL+0.026
hent+0.026
Neg logit contributions
 hon-0.023
nam-0.023
anked-0.022
phan-0.022
pora-0.021
Token and
Token ID290
Feature activation+5.35e-02

Pos logit contributions
 hon+0.066
nam+0.064
anked+0.064
phan+0.062
pora+0.061
Neg logit contributions
isle-0.072
 Horowitz-0.070
ADA-0.069
VL-0.067
hent-0.067
TokenHelp
Token ID22087
Feature activation-8.51e-02

Pos logit contributions
isle+0.011
ADA+0.011
 Horowitz+0.011
VL+0.011
APTER+0.011
Neg logit contributions
 hon-0.010
nam-0.010
anked-0.010
phan-0.009
pora-0.009
Token and
Token ID290
Feature activation+4.27e-02

Pos logit contributions
isle+0.015
 Horowitz+0.015
ADA+0.015
VL+0.014
APTER+0.014
Neg logit contributions
 hon-0.014
nam-0.014
anked-0.013
phan-0.013
pora-0.013
Token #
Token ID1303
Feature activation+2.23e-01

Pos logit contributions
 hon+0.007
nam+0.006
anked+0.006
phan+0.006
pora+0.006
Neg logit contributions
isle-0.008
 Horowitz-0.008
ADA-0.008
VL-0.008
APTER-0.008
TokenC
Token ID34
Feature activation+6.65e-02

Pos logit contributions
 hon+0.039
nam+0.038
anked+0.037
phan+0.036
pora+0.035
Neg logit contributions
isle-0.038
 Horowitz-0.037
ADA-0.037
VL-0.036
APTER-0.035
Tokenhen
Token ID831
Feature activation+3.27e-01

Pos logit contributions
 hon+0.009
nam+0.009
anked+0.009
phan+0.008
pora+0.008
Neg logit contributions
isle-0.014
 Horowitz-0.013
ADA-0.013
VL-0.013
APTER-0.013
Tokennai
Token ID38600
Feature activation-1.76e+00

Pos logit contributions
 hon+0.100
nam+0.099
anked+0.098
phan+0.095
pora+0.095
Neg logit contributions
isle-0.013
 Horowitz-0.011
ADA-0.011
VL-0.009
APTER-0.009
TokenFl
Token ID7414
Feature activation-4.00e-02

Pos logit contributions
isle+0.295
 Horowitz+0.285
ADA+0.283
VL+0.274
APTER+0.272
Neg logit contributions
 hon-0.312
nam-0.306
anked-0.301
phan-0.289
pora-0.284
Tokenood
Token ID702
Feature activation+2.76e-01

Pos logit contributions
isle+0.006
 Horowitz+0.006
ADA+0.006
VL+0.006
hent+0.006
Neg logit contributions
 hon-0.008
nam-0.007
anked-0.007
phan-0.007
pora-0.007
Tokens
Token ID82
Feature activation+2.74e-02

Pos logit contributions
 hon+0.047
nam+0.046
anked+0.045
phan+0.043
pora+0.042
Neg logit contributions
isle-0.048
 Horowitz-0.047
ADA-0.047
VL-0.045
APTER-0.045
Token have
Token ID423
Feature activation-1.44e-01

Pos logit contributions
 hon+0.005
nam+0.005
anked+0.005
phan+0.004
pora+0.004
Neg logit contributions
isle-0.005
 Horowitz-0.004
ADA-0.004
VL-0.004
APTER-0.004
Token been
Token ID587
Feature activation-6.03e-02

Pos logit contributions
isle+0.025
 Horowitz+0.024
ADA+0.024
VL+0.023
APTER+0.023
Neg logit contributions
 hon-0.025
nam-0.024
anked-0.024
phan-0.023
pora-0.022
Token delay
Token ID5711
Feature activation+6.97e-01

Pos logit contributions
isle+0.056
 Horowitz+0.054
ADA+0.054
VL+0.052
APTER+0.052
Neg logit contributions
 hon-0.050
nam-0.048
anked-0.048
phan-0.045
pora-0.045
Token,
Token ID11
Feature activation+2.69e-02

Pos logit contributions
 hon+0.114
nam+0.111
anked+0.109
phan+0.104
pora+0.102
Neg logit contributions
isle-0.127
 Horowitz-0.124
ADA-0.123
VL-0.120
APTER-0.119
Token\n
Token ID198
Feature activation-3.46e-03

Pos logit contributions
 hon+0.005
nam+0.004
anked+0.004
phan+0.004
pora+0.004
Neg logit contributions
isle-0.005
 Horowitz-0.005
ADA-0.005
VL-0.004
hent-0.004
Token\n
Token ID198
Feature activation+2.43e-02

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
TokenThe
Token ID464
Feature activation+1.96e-01

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.004
pora+0.004
Neg logit contributions
isle-0.004
 Horowitz-0.004
ADA-0.004
VL-0.004
APTER-0.004
Token insol
Token ID35831
Feature activation-1.75e+00

Pos logit contributions
 hon+0.032
nam+0.031
anked+0.031
phan+0.029
pora+0.029
Neg logit contributions
isle-0.036
 Horowitz-0.035
ADA-0.034
VL-0.033
APTER-0.033
Tokenence
Token ID594
Feature activation-7.75e-03

Pos logit contributions
isle+0.294
 Horowitz+0.283
ADA+0.281
VL+0.272
hent+0.270
Neg logit contributions
 hon-0.311
nam-0.304
anked-0.300
phan-0.287
pora-0.283
Token of
Token ID286
Feature activation-2.15e-02

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
Token office
Token ID2607
Feature activation+4.60e-01

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.003
nam-0.003
anked-0.003
phan-0.003
pora-0.003
Token,
Token ID11
Feature activation+1.94e-02

Pos logit contributions
 hon+0.078
nam+0.076
anked+0.074
phan+0.071
pora+0.070
Neg logit contributions
isle-0.081
 Horowitz-0.079
ADA-0.078
VL-0.076
APTER-0.076
Token and
Token ID290
Feature activation+2.76e-02

Pos logit contributions
 hon+0.003
nam+0.003
anked+0.003
phan+0.003
pora+0.003
Neg logit contributions
isle-0.004
 Horowitz-0.003
ADA-0.003
VL-0.003
APTER-0.003
Token the
Token ID262
Feature activation-2.92e-02

Pos logit contributions
isle+0.140
 Horowitz+0.135
ADA+0.135
VL+0.131
APTER+0.130
Neg logit contributions
 hon-0.128
nam-0.125
anked-0.123
phan-0.117
pora-0.115
Token troops
Token ID6553
Feature activation-3.45e-01

Pos logit contributions
isle+0.005
 Horowitz+0.005
ADA+0.005
VL+0.005
APTER+0.005
Neg logit contributions
 hon-0.005
nam-0.005
anked-0.005
phan-0.005
pora-0.005
Token "
Token ID366
Feature activation-1.56e-01

Pos logit contributions
isle+0.061
 Horowitz+0.059
ADA+0.059
VL+0.057
APTER+0.057
Neg logit contributions
 hon-0.058
nam-0.057
anked-0.056
phan-0.053
pora-0.052
Tokento
Token ID1462
Feature activation-1.92e-01

Pos logit contributions
isle+0.026
 Horowitz+0.025
ADA+0.024
VL+0.024
APTER+0.023
Neg logit contributions
 hon-0.028
nam-0.028
anked-0.027
phan-0.026
pora-0.026
Token enc
Token ID2207
Feature activation+5.02e-01

Pos logit contributions
isle+0.036
 Horowitz+0.035
ADA+0.035
VL+0.034
APTER+0.033
Neg logit contributions
 hon-0.030
nam-0.029
anked-0.029
phan-0.028
pora-0.027
Tokenirc
Token ID1980
Feature activation-1.74e+00

Pos logit contributions
 hon+0.146
nam+0.144
anked+0.142
phan+0.140
pora+0.138
Neg logit contributions
isle-0.028
 Horowitz-0.025
ADA-0.024
VL-0.021
hent-0.021
Tokenle
Token ID293
Feature activation+7.05e-02

Pos logit contributions
isle+0.317
 Horowitz+0.303
ADA+0.301
hent+0.293
VL+0.291
Neg logit contributions
 hon-0.286
anked-0.278
nam-0.277
phan-0.262
pora-0.261
Token and
Token ID290
Feature activation+6.95e-03

Pos logit contributions
 hon+0.012
nam+0.011
anked+0.011
phan+0.011
pora+0.010
Neg logit contributions
isle-0.013
 Horowitz-0.012
ADA-0.012
VL-0.012
APTER-0.012
Token destroy
Token ID4117
Feature activation+3.29e-03

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
APTER-0.001
Token [
Token ID685
Feature activation+1.56e-01

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
APTER-0.001
Tokenthe
Token ID1169
Feature activation-3.10e-01

Pos logit contributions
 hon+0.030
nam+0.029
anked+0.029
phan+0.028
pora+0.027
Neg logit contributions
isle-0.024
 Horowitz-0.023
ADA-0.023
VL-0.022
APTER-0.022
Token cheap
Token ID7026
Feature activation-1.94e-01

Pos logit contributions
isle+0.003
 Horowitz+0.003
ADA+0.003
VL+0.003
hent+0.003
Neg logit contributions
 hon-0.003
nam-0.003
anked-0.003
phan-0.003
pora-0.003
Token low
Token ID1877
Feature activation-4.23e-01

Pos logit contributions
isle+0.035
 Horowitz+0.034
ADA+0.033
VL+0.033
hent+0.032
Neg logit contributions
 hon-0.032
nam-0.031
anked-0.031
phan-0.029
pora-0.029
Token-
Token ID12
Feature activation-6.66e-02

Pos logit contributions
isle+0.075
 Horowitz+0.072
ADA+0.072
VL+0.070
APTER+0.069
Neg logit contributions
 hon-0.072
nam-0.070
anked-0.069
phan-0.066
pora-0.065
Tokenspeed
Token ID12287
Feature activation+8.32e-01

Pos logit contributions
isle+0.012
 Horowitz+0.011
ADA+0.011
VL+0.011
APTER+0.011
Neg logit contributions
 hon-0.011
nam-0.011
anked-0.011
phan-0.011
pora-0.010
Token pulse
Token ID19445
Feature activation-3.23e-01

Pos logit contributions
 hon+0.136
nam+0.133
anked+0.131
phan+0.125
pora+0.123
Neg logit contributions
isle-0.151
 Horowitz-0.146
ADA-0.145
VL-0.141
APTER-0.140
Tokenjet
Token ID31173
Feature activation-1.74e+00

Pos logit contributions
isle+0.022
 Horowitz+0.020
ADA+0.020
VL+0.018
APTER+0.017
Neg logit contributions
 hon-0.090
nam-0.089
anked-0.088
phan-0.086
pora-0.085
Token engine
Token ID3113
Feature activation+4.82e-01

Pos logit contributions
isle+0.314
 Horowitz+0.306
ADA+0.303
VL+0.295
APTER+0.291
Neg logit contributions
 hon-0.284
nam-0.278
anked-0.274
phan-0.262
pora-0.258
Token suddenly
Token ID6451
Feature activation-5.32e-02

Pos logit contributions
 hon+0.081
nam+0.079
anked+0.078
phan+0.075
pora+0.073
Neg logit contributions
isle-0.086
 Horowitz-0.083
ADA-0.082
VL-0.080
APTER-0.079
Token starts
Token ID4940
Feature activation-1.92e-01

Pos logit contributions
isle+0.009
 Horowitz+0.009
ADA+0.009
VL+0.008
APTER+0.008
Neg logit contributions
 hon-0.009
nam-0.009
anked-0.009
phan-0.009
pora-0.009
Token
Token ID198
Feature activation-5.89e-02

Pos logit contributions
isle+0.034
 Horowitz+0.033
ADA+0.033
VL+0.032
APTER+0.031
Neg logit contributions
 hon-0.032
nam-0.032
anked-0.031
phan-0.030
pora-0.029
Token
Token ID198
Feature activation-3.89e-02

Pos logit contributions
isle+0.010
 Horowitz+0.010
ADA+0.010
VL+0.009
hent+0.009
Neg logit contributions
 hon-0.010
nam-0.010
anked-0.010
phan-0.010
pora-0.010
Token open
Token ID1280
Feature activation+1.58e-01

Pos logit contributions
isle+0.071
 Horowitz+0.069
ADA+0.069
VL+0.067
APTER+0.066
Neg logit contributions
 hon-0.051
nam-0.050
anked-0.049
phan-0.047
pora-0.046
Token new
Token ID649
Feature activation-1.46e-01

Pos logit contributions
 hon+0.027
nam+0.027
anked+0.027
phan+0.025
pora+0.025
Neg logit contributions
isle-0.027
 Horowitz-0.026
ADA-0.026
VL-0.025
APTER-0.025
Token store
Token ID3650
Feature activation-4.94e-01

Pos logit contributions
isle+0.022
 Horowitz+0.021
ADA+0.021
VL+0.020
APTER+0.020
Neg logit contributions
 hon-0.029
nam-0.028
anked-0.028
phan-0.027
pora-0.026
Token at
Token ID379
Feature activation+5.72e-02

Pos logit contributions
isle+0.099
 Horowitz+0.096
ADA+0.095
VL+0.093
APTER+0.092
Neg logit contributions
 hon-0.072
nam-0.070
anked-0.069
phan-0.065
pora-0.064
Token Five
Token ID10579
Feature activation-2.00e-01

Pos logit contributions
 hon+0.010
nam+0.010
anked+0.010
phan+0.009
pora+0.009
Neg logit contributions
isle-0.010
 Horowitz-0.009
ADA-0.009
VL-0.009
APTER-0.009
Token Corn
Token ID11424
Feature activation+1.41e+00

Pos logit contributions
isle+0.052
 Horowitz+0.051
ADA+0.051
VL+0.050
APTER+0.050
Neg logit contributions
 hon-0.017
nam-0.016
anked-0.016
phan-0.014
pora-0.014
Tokeners
Token ID364
Feature activation+1.42e-01

Pos logit contributions
 hon+0.219
nam+0.213
anked+0.210
phan+0.200
pora+0.197
Neg logit contributions
isle-0.267
 Horowitz-0.258
ADA-0.257
VL-0.250
APTER-0.248
Token in
Token ID287
Feature activation-3.18e-02

Pos logit contributions
 hon+0.023
nam+0.022
anked+0.022
phan+0.021
pora+0.021
Neg logit contributions
isle-0.026
 Horowitz-0.025
ADA-0.025
VL-0.025
APTER-0.024
Token Bur
Token ID5481
Feature activation+1.23e-01

Pos logit contributions
isle+0.006
 Horowitz+0.006
ADA+0.006
VL+0.006
APTER+0.006
Neg logit contributions
 hon-0.005
nam-0.005
anked-0.005
phan-0.005
pora-0.004
Tokenien
Token ID2013
Feature activation-6.84e-01

Pos logit contributions
 hon+0.028
nam+0.028
anked+0.027
phan+0.026
pora+0.026
Neg logit contributions
isle-0.014
 Horowitz-0.014
ADA-0.014
VL-0.013
��-0.013
Token
Token ID198
Feature activation-3.49e-02

Pos logit contributions
isle+0.127
 Horowitz+0.123
ADA+0.122
VL+0.119
APTER+0.118
Neg logit contributions
 hon-0.110
nam-0.107
anked-0.105
phan-0.100
pora-0.099
Tokenointed
Token ID20909
Feature activation+8.01e-01

Pos logit contributions
isle+0.098
 Horowitz+0.097
ADA+0.096
VL+0.094
APTER+0.094
Neg logit contributions
 hon-0.034
nam-0.033
anked-0.032
phan-0.029
pora-0.027
Token with
Token ID351
Feature activation+9.28e-03

Pos logit contributions
 hon+0.129
nam+0.126
anked+0.124
phan+0.119
pora+0.117
Neg logit contributions
isle-0.147
 Horowitz-0.143
ADA-0.142
VL-0.138
APTER-0.137
Token the
Token ID262
Feature activation-8.44e-03

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token way
Token ID835
Feature activation+4.95e-01

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
Token the
Token ID262
Feature activation+4.86e-03

Pos logit contributions
 hon+0.083
nam+0.081
anked+0.079
phan+0.076
pora+0.075
Neg logit contributions
isle-0.089
 Horowitz-0.086
ADA-0.085
VL-0.083
APTER-0.082
Token toilet
Token ID16146
Feature activation+1.22e+00

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
APTER-0.001
Token is
Token ID318
Feature activation-5.90e-02

Pos logit contributions
 hon+0.192
nam+0.187
anked+0.184
phan+0.175
pora+0.172
Neg logit contributions
isle-0.231
 Horowitz-0.224
ADA-0.223
VL-0.217
APTER-0.215
Token away
Token ID1497
Feature activation+2.07e-01

Pos logit contributions
isle+0.010
 Horowitz+0.010
ADA+0.010
VL+0.010
APTER+0.010
Neg logit contributions
 hon-0.010
nam-0.010
anked-0.010
phan-0.009
pora-0.009
Token from
Token ID422
Feature activation+2.01e-01

Pos logit contributions
 hon+0.031
nam+0.030
anked+0.030
phan+0.029
pora+0.028
Neg logit contributions
isle-0.040
 Horowitz-0.039
ADA-0.039
VL-0.038
APTER-0.037
Token the
Token ID262
Feature activation+8.84e-04

Pos logit contributions
 hon+0.033
nam+0.032
anked+0.032
phan+0.030
pora+0.030
Neg logit contributions
isle-0.036
 Horowitz-0.035
ADA-0.035
VL-0.034
APTER-0.034
Token bathroom
Token ID12436
Feature activation+3.91e-01

Pos logit contributions
 hon+0.000
nam+0.000
anked+0.000
phan+0.000
pora+0.000
Neg logit contributions
isle-0.000
 Horowitz-0.000
ADA-0.000
VL-0.000
hent-0.000
Tokenep
Token ID538
Feature activation-1.46e-01

Pos logit contributions
isle+0.007
 Horowitz+0.007
ADA+0.007
VL+0.007
APTER+0.006
Neg logit contributions
 hon-0.007
nam-0.007
anked-0.007
phan-0.006
pora-0.006
Tokenā
Token ID10235
Feature activation+1.21e+00

Pos logit contributions
isle+0.034
 Horowitz+0.033
ADA+0.033
VL+0.032
APTER+0.032
Neg logit contributions
 hon-0.017
nam-0.016
anked-0.016
phan-0.015
pora-0.014
Tokeniw
Token ID14246
Feature activation-5.35e-01

Pos logit contributions
 hon+0.261
nam+0.256
anked+0.253
phan+0.245
pora+0.242
Neg logit contributions
isle-0.158
 Horowitz-0.151
ADA-0.150
VL-0.144
APTER-0.142
Tokeni
Token ID72
Feature activation+3.03e-01

Pos logit contributions
isle+0.101
 Horowitz+0.098
ADA+0.097
VL+0.094
hent+0.094
Neg logit contributions
 hon-0.084
nam-0.081
anked-0.080
phan-0.076
pora-0.075
Token l
Token ID300
Feature activation-7.65e-02

Pos logit contributions
 hon+0.050
nam+0.049
anked+0.048
phan+0.046
pora+0.046
Neg logit contributions
isle-0.054
 Horowitz-0.053
ADA-0.052
VL-0.051
APTER-0.050
Tokenā
Token ID10235
Feature activation+1.24e+00

Pos logit contributions
isle+0.013
 Horowitz+0.012
ADA+0.012
VL+0.012
hent+0.012
Neg logit contributions
 hon-0.014
nam-0.013
anked-0.013
phan-0.013
pora-0.012
Tokenua
Token ID6413
Feature activation+1.85e-01

Pos logit contributions
 hon+0.212
nam+0.207
anked+0.204
phan+0.195
pora+0.192
Neg logit contributions
isle-0.217
 Horowitz-0.210
ADA-0.208
VL-0.202
APTER-0.200
Token
Token ID220
Feature activation+1.93e-01

Pos logit contributions
 hon+0.030
nam+0.030
anked+0.029
phan+0.028
pora+0.027
Neg logit contributions
isle-0.034
 Horowitz-0.033
ADA-0.032
VL-0.031
APTER-0.031
Token
Token ID134
Feature activation+4.26e-01

Pos logit contributions
 hon+0.024
nam+0.023
anked+0.023
phan+0.021
pora+0.021
Neg logit contributions
isle-0.043
 Horowitz-0.042
ADA-0.042
VL-0.041
APTER-0.040
Token
Token ID119
Feature activation-2.25e-02

Pos logit contributions
 hon+0.082
nam+0.081
anked+0.080
phan+0.077
pora+0.075
Neg logit contributions
isle-0.065
 Horowitz-0.063
ADA-0.062
VL-0.060
APTER-0.059
Tokeno
Token ID78
Feature activation+4.87e-01

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
hent+0.004
Neg logit contributions
 hon-0.004
nam-0.003
anked-0.003
phan-0.003
pora-0.003
Token top
Token ID1353
Feature activation+4.27e-01

Pos logit contributions
isle+0.014
 Horowitz+0.013
ADA+0.013
VL+0.013
APTER+0.013
Neg logit contributions
 hon-0.012
nam-0.012
anked-0.012
phan-0.011
pora-0.011
Token pitching
Token ID21088
Feature activation+7.34e-02

Pos logit contributions
 hon+0.066
nam+0.064
anked+0.063
phan+0.060
pora+0.059
Neg logit contributions
isle-0.081
 Horowitz-0.079
ADA-0.078
VL-0.076
APTER-0.076
Token prospect
Token ID6034
Feature activation+1.87e-01

Pos logit contributions
 hon+0.010
nam+0.010
anked+0.010
phan+0.009
pora+0.009
Neg logit contributions
isle-0.015
 Horowitz-0.014
ADA-0.014
VL-0.014
hent-0.014
Token in
Token ID287
Feature activation+5.53e-02

Pos logit contributions
 hon+0.031
nam+0.030
anked+0.030
phan+0.029
pora+0.028
Neg logit contributions
isle-0.033
 Horowitz-0.032
ADA-0.032
VL-0.031
hent-0.031
Token Matthew
Token ID9308
Feature activation-2.45e-01

Pos logit contributions
 hon+0.009
nam+0.009
anked+0.009
phan+0.009
pora+0.008
Neg logit contributions
isle-0.010
 Horowitz-0.009
ADA-0.009
VL-0.009
hent-0.009
Token Wis
Token ID24884
Feature activation+1.24e+00

Pos logit contributions
isle+0.042
ADA+0.041
 Horowitz+0.040
APTER+0.039
VL+0.039
Neg logit contributions
 hon-0.043
nam-0.042
anked-0.041
phan-0.039
pora-0.039
Tokenler
Token ID1754
Feature activation+3.71e-01

Pos logit contributions
 hon+0.211
nam+0.208
anked+0.201
phan+0.192
pora+0.191
Neg logit contributions
isle-0.217
 Horowitz-0.214
ADA-0.210
��-0.205
VL-0.205
Token,
Token ID11
Feature activation+1.40e-02

Pos logit contributions
 hon+0.061
nam+0.060
anked+0.059
phan+0.056
pora+0.055
Neg logit contributions
isle-0.067
 Horowitz-0.065
ADA-0.064
VL-0.062
APTER-0.062
Token the
Token ID262
Feature activation-2.50e-02

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.003
 Horowitz-0.002
ADA-0.002
VL-0.002
hent-0.002
Token No
Token ID1400
Feature activation+3.99e-01

Pos logit contributions
isle+0.005
 Horowitz+0.004
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.004
phan-0.004
pora-0.004
Token.
Token ID13
Feature activation-1.72e-02

Pos logit contributions
 hon+0.072
nam+0.071
anked+0.070
phan+0.067
pora+0.066
Neg logit contributions
isle-0.065
 Horowitz-0.063
ADA-0.063
VL-0.061
APTER-0.060
Tokenwho
Token ID8727
Feature activation-1.78e-01

Pos logit contributions
 hon+0.076
nam+0.075
anked+0.074
phan+0.071
pora+0.070
Neg logit contributions
isle-0.065
 Horowitz-0.063
ADA-0.063
VL-0.060
APTER-0.060
Token are
Token ID389
Feature activation+8.34e-02

Pos logit contributions
isle+0.033
 Horowitz+0.032
ADA+0.032
VL+0.031
APTER+0.031
Neg logit contributions
 hon-0.028
nam-0.028
anked-0.027
phan-0.026
pora-0.025
Token neither
Token ID6159
Feature activation-1.85e-01

Pos logit contributions
 hon+0.014
nam+0.013
anked+0.013
phan+0.012
pora+0.012
Neg logit contributions
isle-0.015
 Horowitz-0.015
ADA-0.015
VL-0.014
APTER-0.014
Token dumb
Token ID13526
Feature activation+1.66e-02

Pos logit contributions
isle+0.033
 Horowitz+0.031
ADA+0.031
VL+0.030
APTER+0.030
Neg logit contributions
 hon-0.031
nam-0.031
anked-0.030
phan-0.029
pora-0.028
Token nor
Token ID4249
Feature activation-4.66e-01

Pos logit contributions
 hon+0.003
nam+0.003
anked+0.003
phan+0.003
pora+0.003
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.002
APTER-0.002
Token blind
Token ID7770
Feature activation+1.15e+00

Pos logit contributions
isle+0.086
 Horowitz+0.083
ADA+0.083
VL+0.080
APTER+0.080
Neg logit contributions
 hon-0.075
nam-0.073
anked-0.072
phan-0.069
pora-0.068
Token
Token ID126
Feature activation+1.92e-01

Pos logit contributions
 hon+0.195
nam+0.190
anked+0.187
phan+0.179
pora+0.176
Neg logit contributions
isle-0.204
 Horowitz-0.198
ADA-0.196
VL-0.191
APTER-0.189
Token
Token ID245
Feature activation+4.10e-01

Pos logit contributions
 hon+0.029
nam+0.028
anked+0.028
phan+0.026
pora+0.026
Neg logit contributions
isle-0.037
 Horowitz-0.036
ADA-0.036
VL-0.035
hent-0.035
Tokenthan
Token ID14813
Feature activation-9.69e-02

Pos logit contributions
 hon+0.068
nam+0.066
anked+0.065
phan+0.062
pora+0.061
Neg logit contributions
isle-0.074
 Horowitz-0.072
ADA-0.071
VL-0.069
hent-0.068
Token to
Token ID284
Feature activation-8.25e-04

Pos logit contributions
isle+0.017
 Horowitz+0.017
ADA+0.017
VL+0.016
APTER+0.016
Neg logit contributions
 hon-0.016
nam-0.016
anked-0.016
phan-0.015
pora-0.015
Token their
Token ID511
Feature activation-5.79e-02

Pos logit contributions
isle+0.000
 Horowitz+0.000
ADA+0.000
VL+0.000
APTER+0.000
Neg logit contributions
 hon-0.000
nam-0.000
anked-0.000
phan-0.000
pora-0.000
Token,
Token ID11
Feature activation+6.95e-02

Pos logit contributions
 hon+0.065
nam+0.063
anked+0.063
phan+0.060
pora+0.059
Neg logit contributions
isle-0.076
 Horowitz-0.074
ADA-0.074
VL-0.071
hent-0.071
Token had
Token ID550
Feature activation-3.79e-02

Pos logit contributions
 hon+0.011
nam+0.011
anked+0.011
phan+0.010
pora+0.010
Neg logit contributions
isle-0.013
 Horowitz-0.012
ADA-0.012
VL-0.012
hent-0.012
Token illegally
Token ID15572
Feature activation+4.05e-01

Pos logit contributions
isle+0.007
 Horowitz+0.007
ADA+0.007
VL+0.007
hent+0.007
Neg logit contributions
 hon-0.006
nam-0.006
anked-0.006
phan-0.006
pora-0.005
Token collected
Token ID7723
Feature activation-3.50e-01

Pos logit contributions
 hon+0.066
nam+0.065
anked+0.064
phan+0.061
pora+0.060
Neg logit contributions
isle-0.074
 Horowitz-0.071
ADA-0.071
VL-0.069
hent-0.068
Token personal
Token ID2614
Feature activation-2.06e-01

Pos logit contributions
isle+0.061
 Horowitz+0.059
ADA+0.058
VL+0.057
hent+0.056
Neg logit contributions
 hon-0.060
nam-0.058
anked-0.058
phan-0.055
pora-0.054
Token location
Token ID4067
Feature activation+1.31e-01

Pos logit contributions
isle+0.032
 Horowitz+0.031
ADA+0.031
VL+0.029
hent+0.029
Neg logit contributions
 hon-0.038
anked-0.038
nam-0.038
phan-0.036
pora-0.036
Token data
Token ID1366
Feature activation-7.59e-01

Pos logit contributions
 hon+0.027
nam+0.027
anked+0.027
phan+0.026
pora+0.026
Neg logit contributions
isle-0.017
 Horowitz-0.017
ADA-0.016
VL-0.016
hent-0.016
Token without
Token ID1231
Feature activation-5.97e-01

Pos logit contributions
isle+0.136
 Horowitz+0.132
ADA+0.131
VL+0.127
hent+0.126
Neg logit contributions
 hon-0.125
nam-0.123
anked-0.121
phan-0.116
pora-0.114
Token permission
Token ID7170
Feature activation+6.29e-01

Pos logit contributions
isle+0.109
 Horowitz+0.106
ADA+0.105
VL+0.101
hent+0.100
Neg logit contributions
 hon-0.096
nam-0.095
anked-0.094
phan-0.090
pora-0.088
Token,
Token ID11
Feature activation-6.59e-04

Pos logit contributions
 hon+0.104
nam+0.101
anked+0.100
phan+0.095
pora+0.094
Neg logit contributions
isle-0.113
 Horowitz-0.110
ADA-0.109
VL-0.106
hent-0.105
Token according
Token ID1864
Feature activation+3.72e-01

Pos logit contributions
isle+0.000
 Horowitz+0.000
ADA+0.000
VL+0.000
APTER+0.000
Neg logit contributions
 hon-0.000
nam-0.000
anked-0.000
phan-0.000
pora-0.000
Token and
Token ID290
Feature activation+1.56e-03

Pos logit contributions
isle+0.010
 Horowitz+0.010
ADA+0.010
VL+0.010
APTER+0.010
Neg logit contributions
 hon-0.009
nam-0.009
anked-0.009
phan-0.008
pora-0.008
Token other
Token ID584
Feature activation-1.27e-01

Pos logit contributions
 hon+0.000
nam+0.000
anked+0.000
phan+0.000
pora+0.000
Neg logit contributions
isle-0.000
 Horowitz-0.000
ADA-0.000
VL-0.000
APTER-0.000
Token messages
Token ID6218
Feature activation-1.35e-01

Pos logit contributions
isle+0.022
 Horowitz+0.021
ADA+0.021
VL+0.021
APTER+0.020
Neg logit contributions
 hon-0.022
nam-0.021
anked-0.021
phan-0.020
pora-0.020
Token via
Token ID2884
Feature activation+5.07e-01

Pos logit contributions
isle+0.024
 Horowitz+0.023
ADA+0.023
VL+0.022
APTER+0.022
Neg logit contributions
 hon-0.023
nam-0.022
anked-0.022
phan-0.021
pora-0.021
Token Mi
Token ID13756
Feature activation-2.65e-01

Pos logit contributions
 hon+0.076
nam+0.074
anked+0.072
phan+0.069
pora+0.067
Neg logit contributions
isle-0.100
 Horowitz-0.097
ADA-0.096
VL-0.094
APTER-0.093
Tokeniverse
Token ID3997
Feature activation+5.79e-01

Pos logit contributions
isle+0.055
 Horowitz+0.054
ADA+0.053
APTER+0.052
VL+0.052
Neg logit contributions
 hon-0.037
nam-0.035
anked-0.035
phan-0.033
pora-0.032
Token.
Token ID13
Feature activation-6.34e-02

Pos logit contributions
 hon+0.097
nam+0.095
anked+0.093
phan+0.089
pora+0.088
Neg logit contributions
isle-0.103
 Horowitz-0.100
ADA-0.099
VL-0.096
APTER-0.096
Token
Token ID198
Feature activation-3.15e-02

Pos logit contributions
isle+0.011
 Horowitz+0.010
ADA+0.010
VL+0.010
APTER+0.010
Neg logit contributions
 hon-0.011
nam-0.011
anked-0.011
phan-0.010
pora-0.010
Token
Token ID198
Feature activation-7.53e-02

Pos logit contributions
isle+0.005
 Horowitz+0.005
ADA+0.005
VL+0.005
APTER+0.005
Neg logit contributions
 hon-0.006
nam-0.005
anked-0.005
phan-0.005
pora-0.005
TokenThere
Token ID1858
Feature activation+8.89e-03

Pos logit contributions
isle+0.014
 Horowitz+0.013
ADA+0.013
VL+0.013
APTER+0.013
Neg logit contributions
 hon-0.012
nam-0.012
anked-0.012
phan-0.011
pora-0.011
Token are
Token ID389
Feature activation+4.40e-02

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.001
Token live
Token ID2107
Feature activation-1.94e-01

Pos logit contributions
 hon+0.021
nam+0.020
anked+0.020
phan+0.019
pora+0.019
Neg logit contributions
isle-0.020
 Horowitz-0.019
ADA-0.019
VL-0.018
APTER-0.018
Token in
Token ID287
Feature activation+7.08e-02

Pos logit contributions
isle+0.036
 Horowitz+0.035
ADA+0.034
VL+0.033
APTER+0.033
Neg logit contributions
 hon-0.031
nam-0.031
anked-0.030
phan-0.029
pora-0.028
Token areas
Token ID3006
Feature activation+1.77e-01

Pos logit contributions
 hon+0.012
nam+0.012
anked+0.011
phan+0.011
pora+0.011
Neg logit contributions
isle-0.013
 Horowitz-0.012
ADA-0.012
VL-0.012
APTER-0.012
Token of
Token ID286
Feature activation-2.55e-02

Pos logit contributions
 hon+0.030
nam+0.029
anked+0.028
phan+0.027
pora+0.027
Neg logit contributions
isle-0.032
 Horowitz-0.031
ADA-0.031
VL-0.030
APTER-0.029
Token high
Token ID1029
Feature activation+1.17e-02

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.004
phan-0.004
pora-0.004
Token unemployment
Token ID10681
Feature activation+4.65e-01

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token and
Token ID290
Feature activation+3.97e-02

Pos logit contributions
 hon+0.076
nam+0.074
anked+0.073
phan+0.069
pora+0.068
Neg logit contributions
isle-0.085
 Horowitz-0.083
ADA-0.082
VL-0.080
APTER-0.079
Token low
Token ID1877
Feature activation-3.54e-01

Pos logit contributions
 hon+0.007
nam+0.007
anked+0.007
phan+0.006
pora+0.006
Neg logit contributions
isle-0.007
 Horowitz-0.007
ADA-0.007
VL-0.006
APTER-0.006
Token educational
Token ID9856
Feature activation+3.49e-01

Pos logit contributions
isle+0.063
 Horowitz+0.061
ADA+0.061
VL+0.058
APTER+0.058
Neg logit contributions
 hon-0.059
nam-0.058
anked-0.057
phan-0.055
pora-0.054
Token attainment
Token ID40375
Feature activation+1.36e+00

Pos logit contributions
 hon+0.034
nam+0.033
anked+0.032
phan+0.030
pora+0.029
Neg logit contributions
isle-0.086
 Horowitz-0.084
ADA-0.084
VL-0.082
APTER-0.082
Token and
Token ID290
Feature activation+3.73e-02

Pos logit contributions
 hon+0.234
nam+0.228
anked+0.225
phan+0.215
pora+0.211
Neg logit contributions
isle-0.236
 Horowitz-0.228
ADA-0.227
VL-0.220
APTER-0.218
Token can
Token ID460
Feature activation-2.39e-02

Pos logit contributions
 hon+0.033
nam+0.033
anked+0.032
phan+0.031
pora+0.030
Neg logit contributions
isle-0.040
 Horowitz-0.039
ADA-0.039
VL-0.038
APTER-0.037
Token hardly
Token ID8941
Feature activation-1.71e-01

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.004
phan-0.004
pora-0.004
Token think
Token ID892
Feature activation+2.52e-01

Pos logit contributions
isle+0.032
 Horowitz+0.031
ADA+0.031
VL+0.030
hent+0.030
Neg logit contributions
 hon-0.026
nam-0.026
anked-0.026
phan-0.024
pora-0.024
Token about
Token ID546
Feature activation-2.89e-01

Pos logit contributions
 hon+0.045
nam+0.045
anked+0.044
phan+0.042
pora+0.042
Neg logit contributions
isle-0.042
 Horowitz-0.040
ADA-0.040
VL-0.039
hent-0.038
Token anything
Token ID1997
Feature activation+2.61e-01

Pos logit contributions
isle+0.054
 Horowitz+0.053
ADA+0.052
VL+0.051
APTER+0.050
Neg logit contributions
 hon-0.045
nam-0.044
anked-0.044
phan-0.042
pora-0.041
Token else
Token ID2073
Feature activation+1.04e-01

Pos logit contributions
 hon+0.044
nam+0.043
anked+0.043
phan+0.041
pora+0.040
Neg logit contributions
isle-0.046
 Horowitz-0.044
ADA-0.044
VL-0.043
APTER-0.042
Token.
Token ID13
Feature activation-6.26e-02

Pos logit contributions
 hon+0.019
nam+0.018
anked+0.018
phan+0.017
pora+0.017
Neg logit contributions
isle-0.017
 Horowitz-0.017
ADA-0.017
VL-0.016
APTER-0.016
Token This
Token ID770
Feature activation+4.06e-02

Pos logit contributions
isle+0.011
 Horowitz+0.011
ADA+0.011
VL+0.011
APTER+0.010
Neg logit contributions
 hon-0.010
nam-0.010
anked-0.010
phan-0.010
pora-0.009
Token goes
Token ID2925
Feature activation+3.37e-01

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.006
phan+0.006
pora+0.006
Neg logit contributions
isle-0.008
 Horowitz-0.007
ADA-0.007
VL-0.007
APTER-0.007
Token on
Token ID319
Feature activation-5.93e-02

Pos logit contributions
 hon+0.057
nam+0.056
anked+0.055
phan+0.052
pora+0.051
Neg logit contributions
isle-0.060
 Horowitz-0.058
ADA-0.058
VL-0.056
hent-0.056
Token for
Token ID329
Feature activation-1.19e-01

Pos logit contributions
isle+0.010
 Horowitz+0.010
ADA+0.010
VL+0.009
APTER+0.009
Neg logit contributions
 hon-0.010
nam-0.010
anked-0.010
phan-0.010
pora-0.010
Token was
Token ID373
Feature activation+1.50e-01

Pos logit contributions
 hon+0.059
nam+0.058
anked+0.057
phan+0.055
pora+0.054
Neg logit contributions
isle-0.057
 Horowitz-0.055
ADA-0.054
VL-0.053
APTER-0.052
Token removed
Token ID4615
Feature activation+5.88e-01

Pos logit contributions
 hon+0.025
nam+0.024
anked+0.024
phan+0.023
pora+0.022
Neg logit contributions
isle-0.027
 Horowitz-0.026
ADA-0.026
VL-0.025
APTER-0.025
Token from
Token ID422
Feature activation+1.87e-01

Pos logit contributions
 hon+0.093
nam+0.091
anked+0.090
phan+0.086
pora+0.084
Neg logit contributions
isle-0.109
 Horowitz-0.106
ADA-0.105
VL-0.102
hent-0.102
Token the
Token ID262
Feature activation+7.86e-03

Pos logit contributions
 hon+0.031
nam+0.030
anked+0.030
phan+0.028
pora+0.028
Neg logit contributions
isle-0.034
 Horowitz-0.033
ADA-0.033
VL-0.032
APTER-0.032
Token conference
Token ID4495
Feature activation-3.46e-02

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
APTER-0.001
Token website
Token ID3052
Feature activation+2.82e-01

Pos logit contributions
isle+0.006
 Horowitz+0.006
ADA+0.006
VL+0.006
APTER+0.006
Neg logit contributions
 hon-0.006
nam-0.006
anked-0.006
phan-0.005
pora-0.005
Token by
Token ID416
Feature activation+1.25e-01

Pos logit contributions
 hon+0.048
nam+0.047
anked+0.046
phan+0.044
pora+0.043
Neg logit contributions
isle-0.050
 Horowitz-0.048
ADA-0.048
VL-0.046
hent-0.046
Token two
Token ID734
Feature activation+3.70e-01

Pos logit contributions
 hon+0.021
nam+0.021
anked+0.020
phan+0.019
pora+0.019
Neg logit contributions
isle-0.022
 Horowitz-0.022
ADA-0.021
VL-0.021
APTER-0.021
Token chair
Token ID5118
Feature activation-2.18e-01

Pos logit contributions
 hon+0.071
nam+0.070
anked+0.069
phan+0.066
pora+0.065
Neg logit contributions
isle-0.057
 Horowitz-0.055
ADA-0.054
VL-0.052
APTER-0.052
Tokenmen
Token ID3653
Feature activation-9.14e-01

Pos logit contributions
isle+0.024
 Horowitz+0.023
ADA+0.023
VL+0.021
APTER+0.021
Neg logit contributions
 hon-0.051
nam-0.050
anked-0.050
phan-0.048
pora-0.048
Token because
Token ID780
Feature activation+4.46e-02

Pos logit contributions
isle+0.162
 Horowitz+0.158
ADA+0.156
VL+0.151
hent+0.150
Neg logit contributions
 hon-0.153
nam-0.150
anked-0.148
phan-0.141
pora-0.138
Token pricing
Token ID13045
Feature activation-5.60e-01

Pos logit contributions
 hon+0.080
nam+0.078
anked+0.076
phan+0.074
pora+0.072
Neg logit contributions
isle-0.059
 Horowitz-0.057
ADA-0.057
VL-0.055
APTER-0.054
Token schemes
Token ID16546
Feature activation+2.13e-01

Pos logit contributions
isle+0.101
 Horowitz+0.098
ADA+0.097
VL+0.094
APTER+0.093
Neg logit contributions
 hon-0.093
nam-0.091
anked-0.089
phan-0.085
pora-0.084
Token will
Token ID481
Feature activation-1.85e-01

Pos logit contributions
 hon+0.035
nam+0.035
anked+0.034
phan+0.033
pora+0.032
Neg logit contributions
isle-0.038
 Horowitz-0.037
ADA-0.037
VL-0.036
APTER-0.035
Token increase
Token ID2620
Feature activation+5.63e-01

Pos logit contributions
isle+0.035
 Horowitz+0.034
ADA+0.034
VL+0.033
APTER+0.032
Neg logit contributions
 hon-0.029
nam-0.028
anked-0.028
phan-0.026
pora-0.026
Token in
Token ID287
Feature activation+1.66e-01

Pos logit contributions
 hon+0.094
nam+0.092
anked+0.090
phan+0.086
pora+0.085
Neg logit contributions
isle-0.101
 Horowitz-0.098
ADA-0.097
VL-0.094
APTER-0.093
Token the
Token ID262
Feature activation+5.27e-02

Pos logit contributions
 hon+0.025
nam+0.024
anked+0.024
phan+0.023
pora+0.022
Neg logit contributions
isle-0.033
 Horowitz-0.032
ADA-0.031
VL-0.031
APTER-0.030
Token coming
Token ID2406
Feature activation-3.22e-01

Pos logit contributions
 hon+0.009
nam+0.009
anked+0.009
phan+0.008
pora+0.008
Neg logit contributions
isle-0.009
 Horowitz-0.009
ADA-0.009
VL-0.009
hent-0.008
Token years
Token ID812
Feature activation-7.09e-02

Pos logit contributions
isle+0.054
 Horowitz+0.052
ADA+0.051
VL+0.050
APTER+0.049
Neg logit contributions
 hon-0.057
nam-0.056
anked-0.055
phan-0.053
pora-0.052
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
isle+0.013
 Horowitz+0.012
ADA+0.012
VL+0.012
APTER+0.012
Neg logit contributions
 hon-0.012
nam-0.012
anked-0.012
phan-0.011
pora-0.011
Token
Token ID564
Feature activation-1.40e-02

Pos logit contributions
isle+0.002
 Horowitz+0.002
ADA+0.002
VL+0.002
hent+0.002
Neg logit contributions
 hon-0.002
nam-0.002
anked-0.002
phan-0.002
pora-0.002
Token
Token ID250
Feature activation-1.15e-01

Pos logit contributions
isle+0.003
 Horowitz+0.003
ADA+0.003
VL+0.003
APTER+0.003
Neg logit contributions
 hon-0.002
nam-0.002
anked-0.002
phan-0.002
pora-0.002
Token years
Token ID812
Feature activation-7.56e-02

Pos logit contributions
 hon+0.011
nam+0.010
anked+0.010
phan+0.010
pora+0.010
Neg logit contributions
isle-0.011
 Horowitz-0.010
ADA-0.010
VL-0.010
APTER-0.010
Token ago
Token ID2084
Feature activation+9.63e-01

Pos logit contributions
isle+0.019
 Horowitz+0.019
ADA+0.019
VL+0.019
APTER+0.018
Neg logit contributions
 hon-0.007
nam-0.006
anked-0.006
phan-0.006
pora-0.005
Token and
Token ID290
Feature activation+5.98e-02

Pos logit contributions
 hon+0.165
nam+0.160
anked+0.158
phan+0.151
pora+0.149
Neg logit contributions
isle-0.169
 Horowitz-0.163
ADA-0.162
VL-0.157
APTER-0.156
Token I
Token ID314
Feature activation+9.09e-02

Pos logit contributions
 hon+0.010
nam+0.009
anked+0.009
phan+0.009
pora+0.009
Neg logit contributions
isle-0.011
 Horowitz-0.011
ADA-0.011
VL-0.010
APTER-0.010
Token will
Token ID481
Feature activation-2.94e-01

Pos logit contributions
 hon+0.014
nam+0.013
anked+0.013
phan+0.012
pora+0.012
Neg logit contributions
isle-0.018
 Horowitz-0.017
ADA-0.017
VL-0.017
APTER-0.017
Token understand
Token ID1833
Feature activation-6.09e-01

Pos logit contributions
isle+0.057
 Horowitz+0.055
ADA+0.055
VL+0.054
APTER+0.053
Neg logit contributions
 hon-0.044
nam-0.043
anked-0.043
phan-0.040
pora-0.040
Token it
Token ID340
Feature activation-1.52e-01

Pos logit contributions
isle+0.104
 Horowitz+0.100
ADA+0.100
VL+0.097
APTER+0.096
Neg logit contributions
 hon-0.107
nam-0.104
anked-0.103
phan-0.099
pora-0.097
Token differently
Token ID10338
Feature activation-4.83e-01

Pos logit contributions
isle+0.030
 Horowitz+0.029
ADA+0.029
VL+0.028
APTER+0.028
Neg logit contributions
 hon-0.022
nam-0.022
anked-0.021
phan-0.020
pora-0.020
Token five
Token ID1936
Feature activation+2.25e-02

Pos logit contributions
isle+0.092
 Horowitz+0.089
ADA+0.089
VL+0.086
APTER+0.085
Neg logit contributions
 hon-0.075
nam-0.073
anked-0.072
phan-0.069
pora-0.067
Token years
Token ID812
Feature activation-7.65e-02

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.004
pora+0.004
Neg logit contributions
isle-0.004
 Horowitz-0.004
ADA-0.004
VL-0.003
APTER-0.003
Token from
Token ID422
Feature activation+2.28e-01

Pos logit contributions
isle+0.015
 Horowitz+0.014
ADA+0.014
VL+0.014
APTER+0.014
Neg logit contributions
 hon-0.012
nam-0.011
anked-0.011
phan-0.011
pora-0.011
Token of
Token ID286
Feature activation-6.43e-02

Pos logit contributions
isle+0.077
 Horowitz+0.074
ADA+0.073
VL+0.071
APTER+0.070
Neg logit contributions
 hon-0.085
nam-0.083
anked-0.082
phan-0.079
pora-0.078
Token Darwin
Token ID21450
Feature activation+1.92e-01

Pos logit contributions
isle+0.011
 Horowitz+0.011
ADA+0.011
VL+0.011
APTER+0.010
Neg logit contributions
 hon-0.011
nam-0.011
anked-0.011
phan-0.010
pora-0.010
Token and
Token ID290
Feature activation+3.70e-03

Pos logit contributions
 hon+0.035
nam+0.034
anked+0.034
phan+0.032
pora+0.032
Neg logit contributions
isle-0.031
 Horowitz-0.030
ADA-0.030
VL-0.029
APTER-0.029
Token Wallace
Token ID17161
Feature activation-4.62e-01

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
phan+0.001
pora+0.001
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
APTER-0.001
Token.
Token ID13
Feature activation-2.77e-02

Pos logit contributions
isle+0.079
 Horowitz+0.076
ADA+0.076
VL+0.073
APTER+0.073
Neg logit contributions
 hon-0.081
nam-0.079
anked-0.078
phan-0.075
pora-0.074
Token
Token ID198
Feature activation-7.11e-04

Pos logit contributions
isle+0.005
 Horowitz+0.005
ADA+0.005
VL+0.005
hent+0.004
Neg logit contributions
 hon-0.005
nam-0.005
anked-0.005
phan-0.004
pora-0.004
Token
Token ID198
Feature activation+3.67e-03

Pos logit contributions
isle+0.000
 Horowitz+0.000
ADA+0.000
VL+0.000
APTER+0.000
Neg logit contributions
 hon-0.000
nam-0.000
anked-0.000
phan-0.000
pora-0.000
Token'
Token ID6
Feature activation+7.22e-01

Pos logit contributions
 hon+0.001
nam+0.001
anked+0.001
phan+0.000
pora+0.000
Neg logit contributions
isle-0.001
 Horowitz-0.001
ADA-0.001
VL-0.001
APTER-0.001
TokenThe
Token ID464
Feature activation+1.82e-01

Pos logit contributions
 hon+0.121
nam+0.118
anked+0.116
phan+0.111
pora+0.109
Neg logit contributions
isle-0.129
 Horowitz-0.125
ADA-0.123
VL-0.120
APTER-0.119
Token presence
Token ID4931
Feature activation+2.07e-02

Pos logit contributions
 hon+0.028
nam+0.027
anked+0.027
phan+0.026
pora+0.025
Neg logit contributions
isle-0.035
 Horowitz-0.034
ADA-0.033
VL-0.033
hent-0.032
Token of
Token ID286
Feature activation-1.58e-02

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.003
pora+0.003
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.003
APTER-0.003
Token covert
Token ID22624
Feature activation-6.87e-02

Pos logit contributions
 hon+0.028
nam+0.027
anked+0.027
phan+0.026
pora+0.025
Neg logit contributions
isle-0.025
 Horowitz-0.025
ADA-0.024
VL-0.024
APTER-0.023
Tokenly
Token ID306
Feature activation+8.12e-01

Pos logit contributions
isle+0.017
 Horowitz+0.017
ADA+0.017
VL+0.016
hent+0.016
Neg logit contributions
 hon-0.006
nam-0.006
anked-0.006
phan-0.006
pora-0.005
Token recorded
Token ID6264
Feature activation-4.62e-01

Pos logit contributions
 hon+0.137
nam+0.134
anked+0.131
phan+0.126
pora+0.123
Neg logit contributions
isle-0.144
 Horowitz-0.140
ADA-0.139
VL-0.135
APTER-0.134
Token und
Token ID3318
Feature activation+2.80e-01

Pos logit contributions
isle+0.089
 Horowitz+0.087
ADA+0.086
VL+0.084
hent+0.083
Neg logit contributions
 hon-0.070
nam-0.068
anked-0.067
phan-0.064
pora-0.063
Tokenressing
Token ID11697
Feature activation-5.57e-01

Pos logit contributions
 hon+0.058
nam+0.057
anked+0.056
phan+0.054
pora+0.053
Neg logit contributions
isle-0.039
 Horowitz-0.038
ADA-0.037
VL-0.036
APTER-0.035
Token in
Token ID287
Feature activation+7.35e-02

Pos logit contributions
isle+0.102
 Horowitz+0.099
ADA+0.098
VL+0.095
APTER+0.095
Neg logit contributions
 hon-0.090
nam-0.088
anked-0.087
phan-0.083
pora-0.082
Token a
Token ID257
Feature activation+3.65e-02

Pos logit contributions
 hon+0.012
nam+0.012
anked+0.011
phan+0.011
pora+0.011
Neg logit contributions
isle-0.014
 Horowitz-0.013
ADA-0.013
VL-0.013
APTER-0.013
Token changing
Token ID5609
Feature activation+2.15e-02

Pos logit contributions
 hon+0.006
nam+0.006
anked+0.006
phan+0.006
pora+0.006
Neg logit contributions
isle-0.006
 Horowitz-0.006
ADA-0.006
VL-0.006
APTER-0.006
Token room
Token ID2119
Feature activation-1.10e+00

Pos logit contributions
 hon+0.005
nam+0.005
anked+0.005
phan+0.005
pora+0.005
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token.
Token ID13
Feature activation+5.63e-02

Pos logit contributions
isle+0.196
 Horowitz+0.189
ADA+0.188
VL+0.182
hent+0.180
Neg logit contributions
 hon-0.185
nam-0.181
anked-0.179
phan-0.171
pora-0.169
Token
Token ID198
Feature activation+3.35e-02

Pos logit contributions
 hon+0.010
nam+0.009
anked+0.009
phan+0.009
pora+0.009
Neg logit contributions
isle-0.010
 Horowitz-0.009
ADA-0.009
VL-0.009
APTER-0.009
Token"
Token ID1
Feature activation+2.22e-01

Pos logit contributions
isle+0.005
 Horowitz+0.005
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.003
phan-0.003
pora-0.003
TokenNow
Token ID3844
Feature activation-1.40e-01

Pos logit contributions
 hon+0.039
nam+0.038
anked+0.037
phan+0.036
pora+0.035
Neg logit contributions
isle-0.038
 Horowitz-0.037
ADA-0.036
VL-0.035
APTER-0.035
Token,
Token ID11
Feature activation-3.34e-02

Pos logit contributions
isle+0.026
 Horowitz+0.025
ADA+0.025
VL+0.024
APTER+0.024
Neg logit contributions
 hon-0.022
nam-0.022
anked-0.021
phan-0.021
pora-0.020
Token it
Token ID340
Feature activation-2.22e-01

Pos logit contributions
isle+0.006
 Horowitz+0.006
ADA+0.006
VL+0.006
APTER+0.006
Neg logit contributions
 hon-0.005
nam-0.005
anked-0.005
phan-0.005
pora-0.005
Token seems
Token ID2331
Feature activation+1.64e-02

Pos logit contributions
isle+0.033
 Horowitz+0.032
ADA+0.031
VL+0.030
APTER+0.030
Neg logit contributions
 hon-0.044
nam-0.043
anked-0.043
phan-0.041
pora-0.040
Token like
Token ID588
Feature activation+1.90e-02

Pos logit contributions
 hon+0.003
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.003
APTER-0.003
Token I
Token ID314
Feature activation+1.38e-02

Pos logit contributions
 hon+0.003
nam+0.003
anked+0.003
phan+0.003
pora+0.003
Neg logit contributions
isle-0.003
 Horowitz-0.003
ADA-0.003
VL-0.003
APTER-0.003
Token'm
Token ID1101
Feature activation+3.26e-02

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token a
Token ID257
Feature activation-1.85e-02

Pos logit contributions
 hon+0.005
nam+0.005
anked+0.005
phan+0.005
pora+0.005
Neg logit contributions
isle-0.006
 Horowitz-0.006
ADA-0.006
VL-0.006
APTER-0.005
Token little
Token ID1310
Feature activation-3.13e-01

Pos logit contributions
isle+0.003
 Horowitz+0.003
ADA+0.003
VL+0.003
hent+0.003
Neg logit contributions
 hon-0.003
nam-0.003
anked-0.003
phan-0.003
pora-0.003
Token more
Token ID517
Feature activation-1.17e-01

Pos logit contributions
isle+0.051
 Horowitz+0.050
ADA+0.049
VL+0.048
APTER+0.047
Neg logit contributions
 hon-0.057
nam-0.056
anked-0.055
phan-0.053
pora-0.052
Token High
Token ID3334
Feature activation-6.64e-02

Pos logit contributions
 hon+0.018
nam+0.018
anked+0.017
phan+0.017
pora+0.016
Neg logit contributions
isle-0.015
 Horowitz-0.015
ADA-0.014
VL-0.014
APTER-0.014
Token School
Token ID3961
Feature activation+1.92e-01

Pos logit contributions
isle+0.011
 Horowitz+0.011
ADA+0.010
VL+0.010
APTER+0.010
Neg logit contributions
 hon-0.012
nam-0.012
anked-0.012
phan-0.011
pora-0.011
Token building
Token ID2615
Feature activation+1.20e-01

Pos logit contributions
 hon+0.032
nam+0.031
anked+0.030
phan+0.029
pora+0.029
Neg logit contributions
isle-0.035
 Horowitz-0.033
ADA-0.033
VL-0.032
hent-0.032
Token was
Token ID373
Feature activation+1.53e-01

Pos logit contributions
 hon+0.020
nam+0.019
anked+0.019
phan+0.018
pora+0.018
Neg logit contributions
isle-0.022
 Horowitz-0.021
ADA-0.021
VL-0.020
APTER-0.020
Token renamed
Token ID25121
Feature activation+5.57e-01

Pos logit contributions
 hon+0.024
nam+0.024
anked+0.023
phan+0.022
pora+0.022
Neg logit contributions
isle-0.029
 Horowitz-0.028
ADA-0.028
VL-0.027
APTER-0.027
Token Higgins
Token ID39043
Feature activation-1.03e+00

Pos logit contributions
 hon+0.095
nam+0.092
anked+0.091
phan+0.087
pora+0.086
Neg logit contributions
isle-0.098
 Horowitz-0.095
ADA-0.094
VL-0.091
APTER-0.091
Token Science
Token ID5800
Feature activation+1.33e-02

Pos logit contributions
isle+0.169
 Horowitz+0.163
ADA+0.162
VL+0.157
APTER+0.156
Neg logit contributions
 hon-0.185
nam-0.181
anked-0.179
phan-0.172
pora-0.169
Token Institute
Token ID5136
Feature activation-3.16e-01

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token for
Token ID329
Feature activation-8.47e-02

Pos logit contributions
isle+0.057
 Horowitz+0.055
ADA+0.054
VL+0.053
APTER+0.052
Neg logit contributions
 hon-0.052
nam-0.051
anked-0.050
phan-0.048
pora-0.047
Token the
Token ID262
Feature activation-1.81e-02

Pos logit contributions
isle+0.015
 Horowitz+0.014
ADA+0.014
VL+0.014
APTER+0.014
Neg logit contributions
 hon-0.015
nam-0.014
anked-0.014
phan-0.013
pora-0.013
Token shooting
Token ID4395
Feature activation+1.05e-01

Pos logit contributions
isle+0.003
 Horowitz+0.003
ADA+0.003
VL+0.003
APTER+0.003
Neg logit contributions
 hon-0.003
nam-0.003
anked-0.003
phan-0.003
pora-0.003
Token city
Token ID1748
Feature activation-2.84e-01

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.001
APTER+0.001
Neg logit contributions
 hon-0.001
nam-0.001
anked-0.001
phan-0.001
pora-0.001
Token really
Token ID1107
Feature activation+2.91e-01

Pos logit contributions
isle+0.048
 Horowitz+0.046
ADA+0.046
VL+0.044
APTER+0.044
Neg logit contributions
 hon-0.051
nam-0.049
anked-0.049
phan-0.047
pora-0.046
Token reckoned
Token ID46630
Feature activation+7.75e-02

Pos logit contributions
 hon+0.046
nam+0.045
anked+0.044
phan+0.042
pora+0.042
Neg logit contributions
isle-0.054
 Horowitz-0.053
ADA-0.052
VL-0.051
APTER-0.050
Token with
Token ID351
Feature activation-2.52e-03

Pos logit contributions
 hon+0.013
nam+0.012
anked+0.012
phan+0.012
pora+0.011
Neg logit contributions
isle-0.014
 Horowitz-0.014
ADA-0.014
VL-0.013
hent-0.013
Token the
Token ID262
Feature activation-1.34e-02

Pos logit contributions
isle+0.000
 Horowitz+0.000
ADA+0.000
VL+0.000
APTER+0.000
Neg logit contributions
 hon-0.000
nam-0.000
anked-0.000
phan-0.000
pora-0.000
Token cond
Token ID1779
Feature activation-1.13e+00

Pos logit contributions
isle+0.002
 Horowitz+0.002
ADA+0.002
VL+0.002
APTER+0.002
Neg logit contributions
 hon-0.002
nam-0.002
anked-0.002
phan-0.002
pora-0.002
Tokenominium
Token ID46134
Feature activation-4.08e-01

Pos logit contributions
isle+0.183
 Horowitz+0.177
ADA+0.176
VL+0.170
APTER+0.168
Neg logit contributions
 hon-0.208
nam-0.204
anked-0.201
phan-0.193
pora-0.190
Token as
Token ID355
Feature activation+1.47e-01

Pos logit contributions
isle+0.076
 Horowitz+0.073
ADA+0.073
VL+0.071
APTER+0.070
Neg logit contributions
 hon-0.065
nam-0.064
anked-0.063
phan-0.060
pora-0.059
Token its
Token ID663
Feature activation+4.07e-01

Pos logit contributions
 hon+0.025
nam+0.024
anked+0.024
phan+0.023
pora+0.022
Neg logit contributions
isle-0.026
 Horowitz-0.025
ADA-0.025
VL-0.024
APTER-0.024
Token most
Token ID749
Feature activation+1.31e-01

Pos logit contributions
 hon+0.068
nam+0.066
anked+0.065
phan+0.062
pora+0.061
Neg logit contributions
isle-0.073
 Horowitz-0.071
ADA-0.070
VL-0.068
APTER-0.068
Token significant
Token ID2383
Feature activation+3.40e-01

Pos logit contributions
 hon+0.023
nam+0.022
anked+0.022
phan+0.021
pora+0.021
Neg logit contributions
isle-0.023
 Horowitz-0.022
ADA-0.022
VL-0.021
APTER-0.021
Token recent
Token ID2274
Feature activation+1.64e-01

Pos logit contributions
 hon+0.002
nam+0.002
anked+0.002
phan+0.002
pora+0.002
Neg logit contributions
isle-0.002
 Horowitz-0.002
ADA-0.002
VL-0.002
APTER-0.002
Token films
Token ID7328
Feature activation+2.95e-01

Pos logit contributions
 hon+0.029
nam+0.028
anked+0.028
phan+0.027
pora+0.026
Neg logit contributions
isle-0.028
 Horowitz-0.027
ADA-0.027
VL-0.026
hent-0.026
Token such
Token ID884
Feature activation-1.05e-01

Pos logit contributions
 hon+0.049
nam+0.048
anked+0.047
phan+0.045
pora+0.044
Neg logit contributions
isle-0.053
 Horowitz-0.051
ADA-0.051
VL-0.049
hent-0.049
Token as
Token ID355
Feature activation+9.27e-02

Pos logit contributions
isle+0.020
 Horowitz+0.020
ADA+0.019
VL+0.019
APTER+0.019
Neg logit contributions
 hon-0.016
nam-0.016
anked-0.015
phan-0.015
pora-0.014
Token Dead
Token ID5542
Feature activation-6.14e-01

Pos logit contributions
 hon+0.016
nam+0.016
anked+0.016
phan+0.015
pora+0.015
Neg logit contributions
isle-0.016
 Horowitz-0.015
ADA-0.015
VL-0.015
APTER-0.015
Tokenfall
Token ID7207
Feature activation-1.15e+00

Pos logit contributions
isle+0.101
 Horowitz+0.097
ADA+0.096
VL+0.093
APTER+0.092
Neg logit contributions
 hon-0.112
nam-0.109
anked-0.108
phan-0.103
pora-0.102
Token and
Token ID290
Feature activation-1.08e-02

Pos logit contributions
isle+0.212
 Horowitz+0.205
ADA+0.204
VL+0.198
APTER+0.196
Neg logit contributions
 hon-0.185
nam-0.180
anked-0.177
phan-0.169
pora-0.166
Token The
Token ID383
Feature activation-1.33e-01

Pos logit contributions
isle+0.002
 Horowitz+0.002
ADA+0.002
VL+0.002
APTER+0.002
Neg logit contributions
 hon-0.002
nam-0.002
anked-0.002
phan-0.002
pora-0.002
Token L
Token ID406
Feature activation-1.76e-02

Pos logit contributions
isle+0.023
 Horowitz+0.022
ADA+0.022
VL+0.021
hent+0.021
Neg logit contributions
 hon-0.023
nam-0.023
anked-0.022
phan-0.021
pora-0.021
Tokenedge
Token ID14907
Feature activation-1.33e+00

Pos logit contributions
isle+0.003
 Horowitz+0.003
ADA+0.003
VL+0.003
[undecodable token]+0.003
Neg logit contributions
 hon-0.003
nam-0.003
anked-0.003
phan-0.003
pora-0.003
Token but
Token ID475
Feature activation+2.18e-02

Pos logit contributions
isle+0.229
 Horowitz+0.222
ADA+0.220
VL+0.214
APTER+0.212
Neg logit contributions
 hon-0.230
nam-0.225
anked-0.222
phan-0.212
pora-0.209
Token Marxist
Token ID26305
Feature activation+3.86e-01

Pos logit contributions
isle+0.001
 Horowitz+0.001
ADA+0.001
VL+0.000
APTER+0.000
Neg logit contributions
 hon-0.000
nam-0.000
anked-0.000
phan-0.000
pora-0.000
Token takeover
Token ID27011
Feature activation-7.31e-02

Pos logit contributions
 hon+0.066
nam+0.065
anked+0.064
phan+0.061
pora+0.060
Neg logit contributions
isle-0.067
 Horowitz-0.065
ADA-0.065
VL-0.063
APTER-0.062
Token of
Token ID286
Feature activation-4.53e-02

Pos logit contributions
isle+0.012
 Horowitz+0.012
ADA+0.012
VL+0.011
APTER+0.011
Neg logit contributions
 hon-0.013
nam-0.013
anked-0.013
phan-0.012
pora-0.012
Token V
Token ID569
Feature activation-5.22e-01

Pos logit contributions
isle+0.008
 Horowitz+0.008
ADA+0.008
VL+0.007
APTER+0.007
Neg logit contributions
 hon-0.008
nam-0.008
anked-0.008
phan-0.007
pora-0.007
Tokenenez
Token ID11437
Feature activation+5.98e-01

Pos logit contributions
isle+0.091
 Horowitz+0.088
ADA+0.088
VL+0.085
APTER+0.084
Neg logit contributions
 hon-0.089
nam-0.087
anked-0.086
phan-0.082
pora-0.081
Tokenue
Token ID518
Feature activation-1.04e+00

Pos logit contributions
 hon+0.164
nam+0.162
anked+0.160
phan+0.156
pora+0.154
Neg logit contributions
isle-0.043
 Horowitz-0.040
ADA-0.039
VL-0.036
APTER-0.035
Token...
Token ID986
Feature activation-2.55e-01

Pos logit contributions
isle+0.180
 Horowitz+0.174
ADA+0.173
VL+0.168
APTER+0.167
Neg logit contributions
 hon-0.179
nam-0.175
anked-0.172
phan-0.164
pora-0.161
Token
Token ID198
Feature activation-1.98e-02

Pos logit contributions
isle+0.045
 Horowitz+0.043
ADA+0.043
VL+0.042
APTER+0.041
Neg logit contributions
 hon-0.043
nam-0.042
anked-0.042
phan-0.040
pora-0.039
Token
Token ID198
Feature activation-2.30e-02

Pos logit contributions
isle+0.003
 Horowitz+0.003
ADA+0.003
VL+0.003
APTER+0.003
Neg logit contributions
 hon-0.003
nam-0.003
anked-0.003
phan-0.003
pora-0.003
TokenFrom
Token ID4863
Feature activation+3.36e-01

Pos logit contributions
isle+0.004
 Horowitz+0.004
ADA+0.004
VL+0.004
APTER+0.004
Neg logit contributions
 hon-0.004
nam-0.004
anked-0.004
phan-0.004
pora-0.003
Token Vox
Token ID28035
Feature activation+1.27e-02

Pos logit contributions
 hon+0.045
nam+0.044
anked+0.043
phan+0.041
pora+0.040
Neg logit contributions
isle-0.071
 Horowitz-0.069
ADA-0.069
VL-0.067
APTER-0.067
Token examples
Token ID6096
Feature activation-4.09e-01

Pos logit contributions
isle+0.000
 Horowitz+0.000
ADA+0.000
VL+0.000
APTER+0.000
Neg logit contributions
 hon-0.000
nam-0.000
anked-0.000
phan-0.000
pora-0.000
Token:
Token ID25
Feature activation+1.45e-01

Pos logit contributions
isle+0.075
 Horowitz+0.073
ADA+0.072
VL+0.070
hent+0.070
Neg logit contributions
 hon-0.066
nam-0.065
anked-0.063
phan-0.061
pora-0.060
Token a
Token ID257
Feature activation+2.63e-02

Pos logit contributions
 hon+0.025
nam+0.025
anked+0.024
phan+0.023
pora+0.023
Neg logit contributions
isle-0.025
 Horowitz-0.024
ADA-0.024
VL-0.023
APTER-0.023
Token 90
Token ID4101
Feature activation-2.00e-01

Pos logit contributions
 hon+0.004
nam+0.004
anked+0.004
phan+0.004
pora+0.004
Neg logit contributions
isle-0.005
 Horowitz-0.005
ADA-0.005
VL-0.005
APTER-0.004
Token per
Token ID583
Feature activation+4.34e-01

Pos logit contributions
isle+0.035
 Horowitz+0.034
ADA+0.034
VL+0.033
APTER+0.033
Neg logit contributions
 hon-0.034
nam-0.033
anked-0.032
phan-0.031
pora-0.030
Token cent
Token ID1247
Feature activation-9.62e-01

Pos logit contributions
 hon+0.106
nam+0.105
anked+0.103
phan+0.101
pora+0.099
Neg logit contributions
isle-0.044
 Horowitz-0.041
ADA-0.041
VL-0.039
APTER-0.038
Token reduction
Token ID7741
Feature activation+9.77e-02

Pos logit contributions
isle+0.180
 Horowitz+0.175
ADA+0.173
VL+0.169
hent+0.167
Neg logit contributions
 hon-0.151
nam-0.149
anked-0.146
phan-0.139
pora-0.136
Token in
Token ID287
Feature activation+3.28e-02

Pos logit contributions
 hon+0.016
nam+0.015
anked+0.015
phan+0.014
pora+0.014
Neg logit contributions
isle-0.018
 Horowitz-0.017
ADA-0.017
VL-0.017
APTER-0.017
Token HIV
Token ID10498
Feature activation+2.69e-01

Pos logit contributions
 hon+0.005
nam+0.005
anked+0.005
phan+0.005
pora+0.005
Neg logit contributions
isle-0.006
 Horowitz-0.006
ADA-0.006
VL-0.006
APTER-0.006
Token-
Token ID12
Feature activation-1.60e-02

Pos logit contributions
 hon+0.048
nam+0.047
anked+0.046
phan+0.044
pora+0.043
Neg logit contributions
isle-0.045
 Horowitz-0.044
ADA-0.043
VL-0.042
APTER-0.042
Tokenrelated
Token ID5363
Feature activation+5.75e-01

Pos logit contributions
isle+0.003
 Horowitz+0.003
ADA+0.003
VL+0.003
hent+0.003
Neg logit contributions
 hon-0.002
nam-0.002
anked-0.002
phan-0.002
pora-0.002