TOP ACTIVATIONS
MAX = 3.629

haven 't been par av irtual ized yet . I also
ancer s C asket , 18 56 .
and out - of - pocket costs , but lower premiums
song ) Legal tender is a medium of payment
featuring mature couples engaging in tender ness or sex beyond the
circuit , starting at the tender age of eight in 60
y base anch oring a tender , cake y dome .
Despite the fear that N dam ukong Su h
shirt or pants . A blind hem stitch is the best
At the tender age of 69 , my
people to contract cancer at tender ages , but not in
turn ing a blind eye to the shortcomings of
else , what minor le agu ers have . No one
In addition to turning a blind eye to women right ,
rest of alone to the tender merc ies of incomp ent
), surplus ( h ul agu ), [ 3 ] blue
Though deaf and blind , Helen Keller became a
. We all turn a blind eye to the ways that
the presidency . They are blind to the fact their uncon
and ongoing . It seemed blind ingly obvious that there is

MIN ACTIVATIONS
MIN = -2.564

a means to spot problems earlier and prevent officials
say they received notices from Tinder saying the were unable to
many as 100 people on Tinder . The married Iowa resident
of a sp iced pear gal ette ? Lots
the floor while another makes pelvic thrust s at her buttocks
1945 , Northeast Ohio to asted its world champion Cleveland Rams
months ago , Jo Song Gy u , director of the
Luxembourg premier Jean - Cla ude Jun cker , she hoped
( verified ) from the registry . So technically the director
qualifying applicants , enabling every Newcastle United supporter to stand a
N UFC Fans United and Newcastle United Disabled Supporters Association (
of 5 by Quake D ude from Explosive !!!!!!!! If you
the - counter supplement called gal ant amine , which increases
th Premier League goal at Newcastle on Saturday , becoming the
mb owl 96 - D ude Bro M uther h ucker
to established fan groups - Newcastle United Supporters Trust ( N
levels ) but with limited latitude for highlight recovery , or
City Hall . He lamb asted journalists , calling the media
We welcome the initiative by Newcastle United to launch their new
are connected to the state registry via official API , and

INTERVAL 2.081 - 3.629
CONTAINS 0.027%

), surplus ( h ul agu ), [ 3 ] blue
-- could you place your seal here ... huh ?
org ersen and other Sad Pupp y mod er
turn ing a blind eye to the shortcomings of
Guardian s Luke Harding , Stephanie Kir ch ga

INTERVAL 0.533 - 2.081
CONTAINS 8.360%

tentative inf ruct ue use de coup d ' E t
intentions , what can you tell us about the Mah one
the list . As the voluntary filtering regime comes
it was ready to loan him to Bel en enses ,
, a gallon of gasoline costs us about $ 3 .

INTERVAL -1.016 - 0.533
CONTAINS 89.846%

in their opposition to the United States , Haiti does not
556 x . x . x . 145 12 457 01
pomp and ceremony , nothing in the president s
brought in an unknown to rewrite it . David S elt
to your monitor or tv you 'll see the same thing

INTERVAL -2.564 - -1.016
CONTAINS 1.768%

situation to enter the Free Speech Zone outside the pre z
s Director of Parking , Transportation and Services Kevin
, and by prohibiting free assault on imperial citizens . Practices
re talking about . Like when I talk to a
in ik ain en @ il media . fi Translation to
Token haven
Token ID4398
Feature activation-3.47e-01

Pos logit contributions
vals+0.018
 Ple+0.017
 gal+0.017
GY+0.017
 AMA+0.017
Neg logit contributions
irtual-0.025
asket-0.021
��-0.019
IDENT-0.019
dam-0.018
Token't
Token ID470
Feature activation+8.70e-02

Pos logit contributions
vals+0.055
 Ple+0.053
GY+0.053
 gal+0.052
 AMA+0.052
Neg logit contributions
irtual-0.067
asket-0.056
��-0.052
IDENT-0.050
dam-0.048
Token been
Token ID587
Feature activation+3.72e-02

Pos logit contributions
irtual+0.018
asket+0.015
��+0.015
IDENT+0.014
dam+0.013
Neg logit contributions
vals-0.012
 Ple-0.012
 gal-0.012
GY-0.012
 AMA-0.012
Token par
Token ID1582
Feature activation+5.34e-01

Pos logit contributions
irtual+0.008
asket+0.006
��+0.006
IDENT+0.006
dam+0.006
Neg logit contributions
vals-0.005
 Ple-0.005
 gal-0.005
GY-0.005
 AMA-0.005
Tokenav
Token ID615
Feature activation-4.73e-01

Pos logit contributions
irtual+0.122
asket+0.104
��+0.099
IDENT+0.095
dam+0.093
Neg logit contributions
vals-0.066
 Ple-0.063
GY-0.062
 gal-0.062
 AMA-0.062
Tokenirtual
Token ID22341
Feature activation+3.63e+00

Pos logit contributions
vals+0.082
 Ple+0.080
GY+0.079
 gal+0.078
 AMA+0.078
Neg logit contributions
irtual-0.085
asket-0.069
��-0.063
IDENT-0.061
dam-0.059
Tokenized
Token ID1143
Feature activation+3.54e-01

Pos logit contributions
irtual+0.758
asket+0.636
��+0.596
IDENT+0.571
dam+0.558
Neg logit contributions
vals-0.521
 Ple-0.503
GY-0.497
 gal-0.491
 AMA-0.491
Token yet
Token ID1865
Feature activation+6.29e-02

Pos logit contributions
irtual+0.072
asket+0.061
��+0.057
IDENT+0.055
dam+0.053
Neg logit contributions
vals-0.052
 Ple-0.051
GY-0.050
 gal-0.050
 AMA-0.049
Token.
Token ID13
Feature activation-6.06e-03

Pos logit contributions
irtual+0.013
asket+0.011
��+0.011
IDENT+0.010
dam+0.010
Neg logit contributions
vals-0.009
 Ple-0.008
GY-0.008
 gal-0.008
 AMA-0.008
Token I
Token ID314
Feature activation+2.27e-02

Pos logit contributions
vals+0.001
 Ple+0.001
GY+0.001
 gal+0.001
 AMA+0.001
Neg logit contributions
irtual-0.001
asket-0.001
��-0.001
IDENT-0.001
dam-0.001
Token also
Token ID635
Feature activation+4.96e-01

Pos logit contributions
irtual+0.004
asket+0.004
��+0.003
IDENT+0.003
dam+0.003
Neg logit contributions
vals-0.004
 Ple-0.003
GY-0.003
 gal-0.003
 AMA-0.003
Tokenancer
Token ID8250
Feature activation-2.07e-01

Pos logit contributions
irtual+0.073
asket+0.062
��+0.058
IDENT+0.056
dam+0.054
Neg logit contributions
vals-0.046
 Ple-0.044
GY-0.043
 gal-0.043
 AMA-0.043
Token
Token ID447
Feature activation+2.62e-02

Pos logit contributions
vals+0.030
 Ple+0.029
GY+0.028
 gal+0.028
 AMA+0.028
Neg logit contributions
irtual-0.043
asket-0.036
��-0.034
IDENT-0.033
dam-0.032
Token
Token ID247
Feature activation-3.41e-02

Pos logit contributions
irtual+0.004
asket+0.003
��+0.003
IDENT+0.003
dam+0.003
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 gal-0.005
 AMA-0.005
Tokens
Token ID82
Feature activation+6.38e-03

Pos logit contributions
vals+0.005
 Ple+0.005
GY+0.005
 gal+0.005
 AMA+0.005
Neg logit contributions
irtual-0.007
asket-0.006
��-0.005
IDENT-0.005
dam-0.005
Token C
Token ID327
Feature activation-4.08e-01

Pos logit contributions
irtual+0.001
asket+0.001
��+0.001
IDENT+0.001
dam+0.001
Neg logit contributions
vals-0.001
GY-0.001
 Ple-0.001
 gal-0.001
 AMA-0.001
Tokenasket
Token ID11715
Feature activation+2.92e+00

Pos logit contributions
vals+0.064
 Ple+0.062
GY+0.061
 gal+0.061
 AMA+0.061
Neg logit contributions
irtual-0.079
asket-0.066
��-0.062
IDENT-0.059
dam-0.057
Token,
Token ID11
Feature activation+1.43e-02

Pos logit contributions
irtual+0.613
asket+0.520
��+0.486
IDENT+0.466
dam+0.455
Neg logit contributions
vals-0.414
 Ple-0.397
GY-0.395
 gal-0.389
 AMA-0.388
Token 18
Token ID1248
Feature activation+5.95e-02

Pos logit contributions
irtual+0.003
asket+0.002
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Token56
Token ID3980
Feature activation+7.97e-02

Pos logit contributions
irtual+0.011
asket+0.009
��+0.008
IDENT+0.008
dam+0.008
Neg logit contributions
vals-0.010
 Ple-0.010
GY-0.010
 gal-0.010
 AMA-0.010
Token.
Token ID13
Feature activation-3.74e-02

Pos logit contributions
irtual+0.017
asket+0.015
��+0.014
IDENT+0.013
dam+0.013
Neg logit contributions
vals-0.011
 Ple-0.010
GY-0.010
 gal-0.010
 AMA-0.010
Token
Token ID198
Feature activation+7.53e-03

Pos logit contributions
vals+0.005
 Ple+0.005
GY+0.005
 gal+0.005
 AMA+0.005
Neg logit contributions
irtual-0.008
asket-0.007
��-0.006
IDENT-0.006
dam-0.006
Token and
Token ID290
Feature activation-1.46e-02

Pos logit contributions
irtual+0.065
asket+0.055
��+0.051
IDENT+0.049
dam+0.048
Neg logit contributions
vals-0.042
 Ple-0.041
GY-0.040
 gal-0.040
 AMA-0.040
Token out
Token ID503
Feature activation-4.50e-02

Pos logit contributions
vals+0.002
 Ple+0.002
GY+0.002
 gal+0.002
 AMA+0.002
Neg logit contributions
irtual-0.003
asket-0.003
��-0.002
IDENT-0.002
dam-0.002
Token-
Token ID12
Feature activation+8.85e-02

Pos logit contributions
vals+0.006
 Ple+0.006
GY+0.006
 gal+0.006
 AMA+0.006
Neg logit contributions
irtual-0.010
asket-0.008
��-0.008
IDENT-0.007
dam-0.007
Tokenof
Token ID1659
Feature activation-5.90e-01

Pos logit contributions
irtual+0.021
asket+0.018
��+0.018
IDENT+0.017
dam+0.016
Neg logit contributions
vals-0.010
 Ple-0.009
GY-0.009
 gal-0.009
 AMA-0.009
Token-
Token ID12
Feature activation+1.27e-01

Pos logit contributions
vals+0.082
 Ple+0.079
GY+0.078
 gal+0.077
 AMA+0.077
Neg logit contributions
irtual-0.126
asket-0.106
��-0.100
IDENT-0.096
dam-0.094
Tokenpocket
Token ID31991
Feature activation+2.74e+00

Pos logit contributions
irtual+0.011
asket+0.006
��+0.005
IDENT+0.004
+0.004
Neg logit contributions
vals-0.034
 Ple-0.033
GY-0.033
 AMA-0.033
 gal-0.033
Token costs
Token ID3484
Feature activation+9.61e-01

Pos logit contributions
irtual+0.441
asket+0.355
��+0.322
IDENT+0.307
dam+0.296
Neg logit contributions
vals-0.523
 Ple-0.505
GY-0.500
 gal-0.492
 AMA-0.491
Token,
Token ID11
Feature activation+2.44e-01

Pos logit contributions
irtual+0.195
asket+0.163
��+0.153
IDENT+0.146
dam+0.142
Neg logit contributions
vals-0.144
 Ple-0.139
GY-0.137
 gal-0.136
 AMA-0.136
Token but
Token ID475
Feature activation-4.38e-01

Pos logit contributions
irtual+0.051
asket+0.043
��+0.041
IDENT+0.039
dam+0.038
Neg logit contributions
vals-0.034
 Ple-0.033
GY-0.033
 gal-0.032
 AMA-0.032
Token lower
Token ID2793
Feature activation+1.24e-01

Pos logit contributions
vals+0.070
 Ple+0.068
GY+0.067
 AMA+0.067
 gal+0.067
Neg logit contributions
irtual-0.084
asket-0.070
��-0.065
IDENT-0.062
dam-0.060
Token premiums
Token ID21884
Feature activation+2.66e-01

Pos logit contributions
irtual+0.023
asket+0.019
��+0.018
IDENT+0.017
dam+0.017
Neg logit contributions
vals-0.020
 Ple-0.020
GY-0.020
 gal-0.019
 AMA-0.019
Tokensong
Token ID34050
Feature activation-6.96e-02

Pos logit contributions
irtual+0.033
asket+0.028
��+0.026
IDENT+0.025
 Ukrain+0.024
Neg logit contributions
vals-0.023
 Ple-0.022
GY-0.022
 gal-0.022
 AMA-0.022
Token)
Token ID8
Feature activation-1.08e-01

Pos logit contributions
vals+0.011
 Ple+0.011
GY+0.011
 gal+0.010
 AMA+0.010
Neg logit contributions
irtual-0.014
asket-0.011
��-0.010
IDENT-0.010
dam-0.010
Token
Token ID198
Feature activation+3.89e-02

Pos logit contributions
vals+0.015
 Ple+0.015
GY+0.015
 gal+0.015
 AMA+0.015
Neg logit contributions
irtual-0.022
asket-0.019
��-0.018
IDENT-0.017
dam-0.016
Token
Token ID198
Feature activation+5.33e-02

Pos logit contributions
irtual+0.008
asket+0.007
��+0.006
IDENT+0.006
dam+0.006
Neg logit contributions
vals-0.006
 Ple-0.006
GY-0.006
 gal-0.006
 AMA-0.006
TokenLegal
Token ID38263
Feature activation-1.05e-01

Pos logit contributions
irtual+0.011
asket+0.009
��+0.009
IDENT+0.008
dam+0.008
Neg logit contributions
vals-0.008
 Ple-0.008
GY-0.007
 gal-0.007
 AMA-0.007
Token tender
Token ID15403
Feature activation+2.67e+00

Pos logit contributions
vals+0.015
 Ple+0.015
GY+0.014
 gal+0.014
 AMA+0.014
Neg logit contributions
irtual-0.022
asket-0.019
��-0.017
IDENT-0.017
dam-0.016
Token is
Token ID318
Feature activation+5.72e-02

Pos logit contributions
irtual+0.540
asket+0.452
��+0.424
IDENT+0.406
dam+0.393
Neg logit contributions
vals-0.400
 Ple-0.388
GY-0.382
 gal-0.379
 AMA-0.378
Token a
Token ID257
Feature activation+1.69e-01

Pos logit contributions
irtual+0.011
asket+0.010
��+0.009
IDENT+0.009
dam+0.008
Neg logit contributions
vals-0.009
 Ple-0.008
GY-0.008
 gal-0.008
 AMA-0.008
Token medium
Token ID7090
Feature activation+4.25e-01

Pos logit contributions
irtual+0.034
asket+0.029
��+0.027
IDENT+0.026
dam+0.025
Neg logit contributions
vals-0.025
 Ple-0.024
GY-0.024
 gal-0.024
 AMA-0.024
Token of
Token ID286
Feature activation+2.67e-01

Pos logit contributions
irtual+0.083
asket+0.068
��+0.064
IDENT+0.061
dam+0.059
Neg logit contributions
vals-0.067
 Ple-0.065
GY-0.064
 gal-0.064
 AMA-0.063
Token payment
Token ID6074
Feature activation+6.94e-02

Pos logit contributions
irtual+0.058
asket+0.050
��+0.047
IDENT+0.045
dam+0.044
Neg logit contributions
vals-0.036
 Ple-0.034
GY-0.034
 gal-0.033
 AMA-0.033
Token featuring
Token ID9593
Feature activation-1.82e-01

Pos logit contributions
irtual+0.070
asket+0.057
��+0.053
IDENT+0.050
dam+0.049
Neg logit contributions
vals-0.065
 Ple-0.063
GY-0.062
 gal-0.062
 AMA-0.062
Token mature
Token ID15345
Feature activation+2.84e-01

Pos logit contributions
vals+0.025
 Ple+0.024
GY+0.024
 gal+0.023
 AMA+0.023
Neg logit contributions
irtual-0.039
asket-0.033
��-0.031
IDENT-0.030
dam-0.029
Token couples
Token ID11886
Feature activation-6.87e-01

Pos logit contributions
irtual+0.060
asket+0.051
��+0.048
IDENT+0.046
dam+0.045
Neg logit contributions
vals-0.040
 Ple-0.038
GY-0.038
 gal-0.037
 AMA-0.037
Token engaging
Token ID11932
Feature activation+1.40e-02

Pos logit contributions
vals+0.102
 Ple+0.098
GY+0.097
 AMA+0.096
 gal+0.096
Neg logit contributions
irtual-0.140
asket-0.118
��-0.110
IDENT-0.106
dam-0.103
Token in
Token ID287
Feature activation-1.43e-01

Pos logit contributions
irtual+0.003
asket+0.002
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Token tender
Token ID15403
Feature activation+2.63e+00

Pos logit contributions
vals+0.018
 Ple+0.017
GY+0.017
 gal+0.017
 AMA+0.017
Neg logit contributions
irtual-0.032
asket-0.028
��-0.026
IDENT-0.025
dam-0.024
Tokenness
Token ID1108
Feature activation-4.06e-01

Pos logit contributions
irtual+0.584
asket+0.496
��+0.466
IDENT+0.449
dam+0.441
Neg logit contributions
vals-0.340
 Ple-0.325
GY-0.323
 AMA-0.318
 gal-0.318
Token or
Token ID393
Feature activation+2.13e-01

Pos logit contributions
vals+0.057
 Ple+0.055
GY+0.054
 gal+0.054
 AMA+0.054
Neg logit contributions
irtual-0.086
asket-0.072
��-0.068
IDENT-0.065
dam-0.064
Token sex
Token ID1714
Feature activation-1.41e+00

Pos logit contributions
irtual+0.044
asket+0.037
��+0.035
IDENT+0.034
dam+0.033
Neg logit contributions
vals-0.031
 Ple-0.029
GY-0.029
 AMA-0.029
 gal-0.029
Token beyond
Token ID3675
Feature activation-1.07e+00

Pos logit contributions
vals+0.207
 Ple+0.200
GY+0.197
 gal+0.196
 AMA+0.195
Neg logit contributions
irtual-0.290
asket-0.243
��-0.228
IDENT-0.219
dam-0.213
Token the
Token ID262
Feature activation-3.08e-02

Pos logit contributions
vals+0.157
 Ple+0.152
GY+0.150
 gal+0.148
 AMA+0.148
Neg logit contributions
irtual-0.220
asket-0.185
��-0.173
IDENT-0.166
dam-0.161
Token circuit
Token ID10349
Feature activation+1.10e+00

Pos logit contributions
vals+0.030
 Ple+0.030
 gal+0.029
GY+0.029
 AMA+0.029
Neg logit contributions
irtual-0.027
asket-0.021
��-0.020
IDENT-0.019
dam-0.018
Token,
Token ID11
Feature activation+3.64e-02

Pos logit contributions
irtual+0.229
asket+0.193
��+0.181
IDENT+0.174
dam+0.169
Neg logit contributions
vals-0.159
 Ple-0.154
GY-0.152
 gal-0.150
 AMA-0.150
Token starting
Token ID3599
Feature activation+3.09e-01

Pos logit contributions
irtual+0.007
asket+0.006
��+0.005
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.006
 Ple-0.006
GY-0.006
 gal-0.006
 AMA-0.006
Token at
Token ID379
Feature activation-1.09e-01

Pos logit contributions
irtual+0.062
asket+0.052
��+0.049
IDENT+0.047
dam+0.045
Neg logit contributions
vals-0.047
 Ple-0.045
 gal-0.044
 AMA-0.044
GY-0.044
Token the
Token ID262
Feature activation-5.73e-02

Pos logit contributions
vals+0.016
 Ple+0.016
GY+0.016
 gal+0.016
 AMA+0.016
Neg logit contributions
irtual-0.022
asket-0.018
��-0.017
IDENT-0.017
dam-0.016
Token tender
Token ID15403
Feature activation+2.61e+00

Pos logit contributions
vals+0.009
 Ple+0.009
GY+0.009
 AMA+0.009
 gal+0.009
Neg logit contributions
irtual-0.011
asket-0.009
��-0.009
IDENT-0.008
dam-0.008
Token age
Token ID2479
Feature activation+2.78e-01

Pos logit contributions
irtual+0.501
asket+0.415
��+0.387
IDENT+0.369
dam+0.357
Neg logit contributions
vals-0.417
 Ple-0.405
GY-0.399
 gal-0.397
 AMA-0.396
Token of
Token ID286
Feature activation+7.25e-02

Pos logit contributions
irtual+0.056
asket+0.047
��+0.044
IDENT+0.042
dam+0.041
Neg logit contributions
vals-0.042
 Ple-0.041
 gal-0.040
GY-0.040
 AMA-0.040
Token eight
Token ID3624
Feature activation+3.03e-01

Pos logit contributions
irtual+0.014
asket+0.011
��+0.011
IDENT+0.010
dam+0.010
Neg logit contributions
vals-0.012
 Ple-0.011
GY-0.011
 gal-0.011
 AMA-0.011
Token in
Token ID287
Feature activation-1.67e-01

Pos logit contributions
irtual+0.065
asket+0.055
��+0.052
IDENT+0.049
dam+0.048
Neg logit contributions
vals-0.042
 Ple-0.041
GY-0.040
 gal-0.040
 AMA-0.040
Token 60
Token ID3126
Feature activation-3.44e-01

Pos logit contributions
vals+0.026
 Ple+0.025
GY+0.025
 gal+0.025
 AMA+0.025
Neg logit contributions
irtual-0.033
asket-0.027
��-0.025
IDENT-0.024
dam-0.024
Tokeny
Token ID88
Feature activation+3.20e-01

Pos logit contributions
vals+0.063
 Ple+0.061
GY+0.060
 gal+0.060
 AMA+0.059
Neg logit contributions
irtual-0.076
asket-0.063
��-0.059
IDENT-0.056
dam-0.054
Token base
Token ID2779
Feature activation-9.57e-01

Pos logit contributions
irtual+0.061
asket+0.050
��+0.047
IDENT+0.045
dam+0.043
Neg logit contributions
vals-0.052
 Ple-0.050
GY-0.050
 gal-0.049
 AMA-0.049
Token anch
Token ID12619
Feature activation-7.32e-01

Pos logit contributions
vals+0.148
 Ple+0.143
GY+0.141
 gal+0.140
 AMA+0.140
Neg logit contributions
irtual-0.189
asket-0.158
��-0.147
IDENT-0.141
dam-0.137
Tokenoring
Token ID3255
Feature activation-5.31e-02

Pos logit contributions
vals+0.112
 Ple+0.108
GY+0.107
 gal+0.106
 AMA+0.106
Neg logit contributions
irtual-0.146
asket-0.121
��-0.113
IDENT-0.109
dam-0.105
Token a
Token ID257
Feature activation+6.30e-02

Pos logit contributions
vals+0.008
 Ple+0.008
GY+0.008
 gal+0.008
 AMA+0.008
Neg logit contributions
irtual-0.011
asket-0.009
��-0.008
IDENT-0.008
dam-0.008
Token tender
Token ID15403
Feature activation+2.61e+00

Pos logit contributions
irtual+0.012
asket+0.010
��+0.010
IDENT+0.009
dam+0.009
Neg logit contributions
vals-0.010
 Ple-0.009
GY-0.009
 AMA-0.009
 gal-0.009
Token,
Token ID11
Feature activation+3.67e-02

Pos logit contributions
irtual+0.512
asket+0.426
��+0.396
IDENT+0.380
dam+0.372
Neg logit contributions
vals-0.404
 Ple-0.388
GY-0.386
 AMA-0.380
 gal-0.377
Token cake
Token ID12187
Feature activation+9.77e-02

Pos logit contributions
irtual+0.007
asket+0.006
��+0.005
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.006
 Ple-0.006
GY-0.006
 gal-0.006
 AMA-0.006
Tokeny
Token ID88
Feature activation+2.98e-01

Pos logit contributions
irtual+0.019
asket+0.016
��+0.015
IDENT+0.014
dam+0.014
Neg logit contributions
vals-0.015
 Ple-0.015
GY-0.015
 gal-0.014
 AMA-0.014
Token dome
Token ID29500
Feature activation+8.80e-02

Pos logit contributions
irtual+0.057
asket+0.047
��+0.044
IDENT+0.042
dam+0.041
Neg logit contributions
vals-0.048
 Ple-0.046
GY-0.046
 AMA-0.045
 gal-0.045
Token.
Token ID13
Feature activation+7.82e-03

Pos logit contributions
irtual+0.018
asket+0.015
��+0.014
IDENT+0.013
dam+0.013
Neg logit contributions
vals-0.013
 Ple-0.013
GY-0.013
 AMA-0.013
 gal-0.013
Token Despite
Token ID7945
Feature activation-6.11e-01

Pos logit contributions
vals+0.096
 Ple+0.092
GY+0.091
 gal+0.090
 AMA+0.090
Neg logit contributions
irtual-0.128
asket-0.107
��-0.100
IDENT-0.096
dam-0.093
Token the
Token ID262
Feature activation-1.11e-01

Pos logit contributions
vals+0.096
 Ple+0.094
GY+0.092
 gal+0.092
 AMA+0.092
Neg logit contributions
irtual-0.119
asket-0.098
��-0.092
IDENT-0.088
dam-0.085
Token fear
Token ID3252
Feature activation+5.33e-01

Pos logit contributions
vals+0.018
 Ple+0.017
GY+0.017
 gal+0.017
 AMA+0.017
Neg logit contributions
irtual-0.021
asket-0.017
��-0.016
IDENT-0.015
dam-0.015
Token that
Token ID326
Feature activation+3.21e-01

Pos logit contributions
irtual+0.105
asket+0.088
��+0.082
IDENT+0.078
dam+0.076
Neg logit contributions
vals-0.083
 Ple-0.080
GY-0.079
 gal-0.078
 AMA-0.078
Token N
Token ID399
Feature activation-1.24e-01

Pos logit contributions
irtual+0.065
asket+0.055
��+0.051
IDENT+0.049
dam+0.048
Neg logit contributions
vals-0.048
 Ple-0.046
GY-0.045
 gal-0.045
 AMA-0.045
Tokendam
Token ID11043
Feature activation+2.60e+00

Pos logit contributions
vals+0.019
 Ple+0.018
GY+0.018
 gal+0.018
 AMA+0.018
Neg logit contributions
irtual-0.025
asket-0.021
��-0.019
IDENT-0.018
dam-0.018
Tokenukong
Token ID46654
Feature activation-8.37e-01

Pos logit contributions
irtual+0.647
asket+0.560
��+0.534
IDENT+0.516
dam+0.503
Neg logit contributions
vals-0.268
 Ple-0.256
GY-0.251
 gal-0.248
 AMA-0.246
Token Su
Token ID1778
Feature activation+1.06e-01

Pos logit contributions
vals+0.115
 Ple+0.111
GY+0.109
 gal+0.108
 AMA+0.108
Neg logit contributions
irtual-0.180
asket-0.152
��-0.143
IDENT-0.138
dam-0.134
Tokenh
Token ID71
Feature activation+6.11e-01

Pos logit contributions
irtual+0.017
asket+0.014
��+0.013
IDENT+0.012
dam+0.011
Neg logit contributions
vals-0.020
 Ple-0.019
GY-0.019
 gal-0.019
 AMA-0.019
Token
Token ID447
Feature activation-1.49e-02

Pos logit contributions
irtual+0.123
asket+0.103
��+0.096
IDENT+0.092
dam+0.089
Neg logit contributions
vals-0.092
 Ple-0.089
GY-0.088
 gal-0.087
 AMA-0.087
Token
Token ID247
Feature activation-2.65e-02

Pos logit contributions
vals+0.003
 Ple+0.003
GY+0.003
 gal+0.003
 AMA+0.003
Neg logit contributions
irtual-0.002
asket-0.002
��-0.002
IDENT-0.002
dam-0.002
Token shirt
Token ID10147
Feature activation-1.14e-01

Pos logit contributions
irtual+0.009
asket+0.007
��+0.007
IDENT+0.007
dam+0.007
Neg logit contributions
vals-0.006
 Ple-0.006
GY-0.006
 gal-0.006
 AMA-0.006
Token or
Token ID393
Feature activation+1.92e-01

Pos logit contributions
vals+0.016
 Ple+0.015
GY+0.015
 gal+0.015
 AMA+0.015
Neg logit contributions
irtual-0.024
asket-0.020
��-0.019
IDENT-0.018
dam-0.018
Token pants
Token ID12581
Feature activation-2.53e-01

Pos logit contributions
irtual+0.039
asket+0.033
��+0.031
IDENT+0.029
dam+0.029
Neg logit contributions
vals-0.029
 Ple-0.028
GY-0.027
 gal-0.027
 AMA-0.027
Token.
Token ID13
Feature activation-1.65e-03

Pos logit contributions
vals+0.036
 Ple+0.034
GY+0.034
 gal+0.034
 AMA+0.034
Neg logit contributions
irtual-0.054
asket-0.045
��-0.042
IDENT-0.041
dam-0.040
Token A
Token ID317
Feature activation+3.75e-01

Pos logit contributions
vals+0.000
 Ple+0.000
GY+0.000
 AMA+0.000
 gal+0.000
Neg logit contributions
irtual-0.000
asket-0.000
��-0.000
IDENT-0.000
dam-0.000
Token blind
Token ID7770
Feature activation+2.59e+00

Pos logit contributions
irtual+0.072
asket+0.060
��+0.056
IDENT+0.053
dam+0.051
Neg logit contributions
vals-0.060
 Ple-0.058
GY-0.057
 gal-0.057
 AMA-0.057
Token hem
Token ID16869
Feature activation+1.70e-01

Pos logit contributions
irtual+0.482
asket+0.396
��+0.368
IDENT+0.351
dam+0.339
Neg logit contributions
vals-0.431
 Ple-0.418
GY-0.413
 gal-0.410
 AMA-0.409
Token stitch
Token ID24695
Feature activation+1.08e+00

Pos logit contributions
irtual+0.034
asket+0.029
��+0.027
IDENT+0.026
 Ukrain+0.025
Neg logit contributions
vals-0.025
 Ple-0.025
 gal-0.024
GY-0.024
 AMA-0.024
Token is
Token ID318
Feature activation+7.16e-03

Pos logit contributions
irtual+0.222
asket+0.186
��+0.177
IDENT+0.167
+0.162
Neg logit contributions
vals-0.159
 Ple-0.155
 gal-0.153
 AMA-0.152
GY-0.152
Token the
Token ID262
Feature activation-3.29e-02

Pos logit contributions
irtual+0.001
asket+0.001
��+0.001
IDENT+0.001
dam+0.001
Neg logit contributions
vals-0.001
 Ple-0.001
 gal-0.001
GY-0.001
 AMA-0.001
Token best
Token ID1266
Feature activation-2.17e-01

Pos logit contributions
vals+0.005
 Ple+0.005
 gal+0.005
GY+0.005
 AMA+0.005
Neg logit contributions
irtual-0.006
asket-0.005
��-0.005
IDENT-0.005
dam-0.004
Token
Token ID198
Feature activation+9.14e-03

Pos logit contributions
irtual+0.003
asket+0.003
��+0.003
IDENT+0.003
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Token
Token ID447
Feature activation-2.12e-02

Pos logit contributions
irtual+0.002
asket+0.001
��+0.001
IDENT+0.001
dam+0.001
Neg logit contributions
vals-0.001
 Ple-0.001
GY-0.001
 gal-0.001
 AMA-0.001
Token
Token ID250
Feature activation-3.52e-02

Pos logit contributions
vals+0.004
 Ple+0.004
GY+0.004
 gal+0.004
 AMA+0.004
Neg logit contributions
irtual-0.004
asket-0.003
��-0.003
IDENT-0.003
dam-0.002
TokenAt
Token ID2953
Feature activation+1.79e-01

Pos logit contributions
vals+0.005
 Ple+0.005
GY+0.005
 gal+0.005
 AMA+0.005
Neg logit contributions
irtual-0.007
asket-0.006
��-0.005
IDENT-0.005
dam-0.005
Token the
Token ID262
Feature activation-8.04e-02

Pos logit contributions
irtual+0.034
asket+0.029
��+0.027
IDENT+0.025
dam+0.025
Neg logit contributions
vals-0.028
 Ple-0.028
GY-0.027
 gal-0.027
 AMA-0.027
Token tender
Token ID15403
Feature activation+2.59e+00

Pos logit contributions
vals+0.012
 Ple+0.011
GY+0.011
 gal+0.011
 AMA+0.011
Neg logit contributions
irtual-0.017
asket-0.014
��-0.013
IDENT-0.012
dam-0.012
Token age
Token ID2479
Feature activation+2.32e-01

Pos logit contributions
irtual+0.501
asket+0.414
��+0.388
IDENT+0.371
dam+0.358
Neg logit contributions
vals-0.410
 Ple-0.400
GY-0.393
 gal-0.391
 AMA-0.391
Token of
Token ID286
Feature activation+5.18e-02

Pos logit contributions
irtual+0.046
asket+0.039
��+0.036
IDENT+0.035
dam+0.034
Neg logit contributions
vals-0.035
 Ple-0.034
GY-0.034
 gal-0.034
 AMA-0.034
Token 69
Token ID8644
Feature activation-2.16e-01

Pos logit contributions
irtual+0.011
asket+0.009
��+0.009
IDENT+0.008
dam+0.008
Neg logit contributions
vals-0.007
 Ple-0.007
GY-0.007
 gal-0.007
 AMA-0.007
Token,
Token ID11
Feature activation+2.04e-02

Pos logit contributions
vals+0.031
 Ple+0.030
GY+0.029
 gal+0.029
 AMA+0.029
Neg logit contributions
irtual-0.045
asket-0.038
��-0.036
IDENT-0.034
dam-0.033
Token my
Token ID616
Feature activation+3.41e-01

Pos logit contributions
irtual+0.005
asket+0.005
��+0.004
IDENT+0.004
dam+0.004
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Token people
Token ID661
Feature activation-3.16e-01

Pos logit contributions
vals+0.019
 Ple+0.018
GY+0.018
 gal+0.018
 AMA+0.018
Neg logit contributions
irtual-0.025
asket-0.021
��-0.020
IDENT-0.019
dam-0.018
Token to
Token ID284
Feature activation+2.09e-02

Pos logit contributions
vals+0.049
 Ple+0.048
GY+0.047
 gal+0.047
 AMA+0.047
Neg logit contributions
irtual-0.062
asket-0.051
��-0.048
IDENT-0.046
dam-0.044
Token contract
Token ID2775
Feature activation+3.36e-01

Pos logit contributions
irtual+0.004
asket+0.003
��+0.003
IDENT+0.003
dam+0.003
Neg logit contributions
vals-0.003
 Ple-0.003
GY-0.003
 gal-0.003
 AMA-0.003
Token cancer
Token ID4890
Feature activation-3.95e-01

Pos logit contributions
irtual+0.064
asket+0.053
��+0.050
IDENT+0.047
dam+0.046
Neg logit contributions
vals-0.054
 Ple-0.053
GY-0.052
 gal-0.052
 AMA-0.051
Token at
Token ID379
Feature activation-1.75e-01

Pos logit contributions
vals+0.058
 Ple+0.056
GY+0.055
 gal+0.055
 AMA+0.055
Neg logit contributions
irtual-0.081
asket-0.068
��-0.064
IDENT-0.061
dam-0.059
Token tender
Token ID15403
Feature activation+2.58e+00

Pos logit contributions
vals+0.027
 Ple+0.026
GY+0.026
 AMA+0.025
 gal+0.025
Neg logit contributions
irtual-0.034
asket-0.029
��-0.027
IDENT-0.026
dam-0.025
Token ages
Token ID9337
Feature activation+1.32e+00

Pos logit contributions
irtual+0.487
asket+0.402
��+0.373
IDENT+0.356
dam+0.346
Neg logit contributions
vals-0.421
 Ple-0.409
GY-0.404
 gal-0.402
 AMA-0.401
Token,
Token ID11
Feature activation-2.21e-02

Pos logit contributions
irtual+0.278
asket+0.234
��+0.220
IDENT+0.211
dam+0.206
Neg logit contributions
vals-0.186
 Ple-0.179
GY-0.177
 gal-0.175
 AMA-0.175
Token but
Token ID475
Feature activation-6.87e-01

Pos logit contributions
vals+0.003
 Ple+0.003
GY+0.003
 gal+0.003
 AMA+0.003
Neg logit contributions
irtual-0.005
asket-0.004
��-0.004
IDENT-0.004
dam-0.003
Token not
Token ID407
Feature activation-7.65e-02

Pos logit contributions
vals+0.107
 Ple+0.104
GY+0.103
 gal+0.102
 AMA+0.102
Neg logit contributions
irtual-0.135
asket-0.112
��-0.105
IDENT-0.100
dam-0.097
Token in
Token ID287
Feature activation-2.15e-01

Pos logit contributions
vals+0.012
 Ple+0.011
GY+0.011
 gal+0.011
 AMA+0.011
Neg logit contributions
irtual-0.015
asket-0.013
��-0.012
IDENT-0.011
dam-0.011
Token
Token ID564
Feature activation-2.24e-03

Pos logit contributions
irtual+0.003
asket+0.002
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Token
Token ID250
Feature activation+1.16e-02

Pos logit contributions
vals+0.000
 Ple+0.000
GY+0.000
 gal+0.000
 AMA+0.000
Neg logit contributions
irtual-0.000
asket-0.000
��-0.000
IDENT-0.000
dam-0.000
Tokenturn
Token ID15344
Feature activation+6.32e-01

Pos logit contributions
irtual+0.002
asket+0.002
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Tokening
Token ID278
Feature activation-1.31e-01

Pos logit contributions
irtual+0.130
asket+0.109
��+0.102
IDENT+0.098
dam+0.095
Neg logit contributions
vals-0.093
 Ple-0.090
GY-0.089
 gal-0.088
 AMA-0.088
Token a
Token ID257
Feature activation+8.18e-02

Pos logit contributions
vals+0.019
 Ple+0.018
GY+0.018
 gal+0.018
 AMA+0.018
Neg logit contributions
irtual-0.027
asket-0.023
��-0.021
IDENT-0.020
dam-0.020
Token blind
Token ID7770
Feature activation+2.58e+00

Pos logit contributions
irtual+0.009
asket+0.006
��+0.005
IDENT+0.004
dam+0.004
Neg logit contributions
vals-0.020
 Ple-0.020
GY-0.020
 gal-0.020
 AMA-0.020
Token eye
Token ID4151
Feature activation+3.82e-01

Pos logit contributions
irtual+0.463
asket+0.382
��+0.349
IDENT+0.332
dam+0.326
Neg logit contributions
vals-0.440
 Ple-0.423
GY-0.422
 AMA-0.417
 gal-0.415
Token to
Token ID284
Feature activation+4.77e-02

Pos logit contributions
irtual+0.076
asket+0.063
��+0.059
IDENT+0.056
dam+0.055
Neg logit contributions
vals-0.059
 Ple-0.057
GY-0.056
 gal-0.056
 AMA-0.056
Token the
Token ID262
Feature activation-6.60e-02

Pos logit contributions
irtual+0.009
asket+0.008
��+0.007
IDENT+0.007
dam+0.007
Neg logit contributions
vals-0.007
 Ple-0.007
GY-0.007
 gal-0.007
 AMA-0.007
Token shortcomings
Token ID31731
Feature activation-2.68e-01

Pos logit contributions
vals+0.011
 Ple+0.010
GY+0.010
 gal+0.010
 AMA+0.010
Neg logit contributions
irtual-0.012
asket-0.010
��-0.010
IDENT-0.009
dam-0.009
Token of
Token ID286
Feature activation+9.09e-02

Pos logit contributions
vals+0.041
 Ple+0.040
GY+0.039
 gal+0.039
 AMA+0.039
Neg logit contributions
irtual-0.053
asket-0.045
��-0.042
IDENT-0.040
dam-0.039
Token else
Token ID2073
Feature activation-6.21e-01

Pos logit contributions
irtual+0.137
asket+0.117
��+0.111
IDENT+0.107
dam+0.104
Neg logit contributions
vals-0.075
 Ple-0.072
GY-0.071
 gal-0.070
 AMA-0.070
Token,
Token ID11
Feature activation+7.36e-02

Pos logit contributions
vals+0.088
 Ple+0.085
GY+0.084
 gal+0.084
 AMA+0.083
Neg logit contributions
irtual-0.130
asket-0.110
��-0.103
IDENT-0.099
dam-0.096
Token what
Token ID644
Feature activation-1.18e-03

Pos logit contributions
irtual+0.015
asket+0.012
��+0.011
IDENT+0.011
dam+0.011
Neg logit contributions
vals-0.011
 Ple-0.011
GY-0.011
 gal-0.010
 AMA-0.010
Token minor
Token ID4159
Feature activation+1.08e+00

Pos logit contributions
vals+0.000
 Ple+0.000
GY+0.000
 gal+0.000
 AMA+0.000
Neg logit contributions
irtual-0.000
asket-0.000
��-0.000
IDENT-0.000
dam-0.000
Token le
Token ID443
Feature activation+6.26e-02

Pos logit contributions
irtual+0.164
asket+0.128
��+0.117
IDENT+0.109
dam+0.104
Neg logit contributions
vals-0.216
 Ple-0.210
GY-0.208
 gal-0.207
 AMA-0.206
Tokenagu
Token ID11433
Feature activation+2.57e+00

Pos logit contributions
irtual+0.003
asket+0.001
��+0.001
IDENT+0.000
dam+0.000
Neg logit contributions
vals-0.019
 Ple-0.018
GY-0.018
 AMA-0.018
 gal-0.018
Tokeners
Token ID364
Feature activation-5.79e-01

Pos logit contributions
irtual+0.596
asket+0.513
��+0.483
IDENT+0.465
dam+0.458
Neg logit contributions
vals-0.304
 Ple-0.294
GY-0.290
 AMA-0.285
 gal-0.282
Token have
Token ID423
Feature activation+5.20e-02

Pos logit contributions
vals+0.087
 Ple+0.084
GY+0.083
 gal+0.083
 AMA+0.082
Neg logit contributions
irtual-0.117
asket-0.098
��-0.091
IDENT-0.087
dam-0.085
Token.
Token ID13
Feature activation-2.17e-02

Pos logit contributions
irtual+0.010
asket+0.009
��+0.008
IDENT+0.008
dam+0.008
Neg logit contributions
vals-0.008
 Ple-0.008
GY-0.007
 gal-0.007
 AMA-0.007
Token No
Token ID1400
Feature activation+4.20e-01

Pos logit contributions
vals+0.003
GY+0.003
 Ple+0.003
 gal+0.003
 AMA+0.003
Neg logit contributions
irtual-0.004
asket-0.004
��-0.003
IDENT-0.003
dam-0.003
Token one
Token ID530
Feature activation-1.44e-01

Pos logit contributions
irtual+0.085
asket+0.071
��+0.066
IDENT+0.064
dam+0.062
Neg logit contributions
vals-0.063
 Ple-0.061
GY-0.060
 AMA-0.059
 gal-0.059
Token In
Token ID554
Feature activation-1.84e-02

Pos logit contributions
vals+0.000
 Ple+0.000
GY+0.000
 gal+0.000
 AMA+0.000
Neg logit contributions
irtual-0.001
asket-0.001
��-0.000
IDENT-0.000
dam-0.000
Token addition
Token ID3090
Feature activation-3.51e-01

Pos logit contributions
vals+0.003
 Ple+0.003
GY+0.003
 gal+0.003
 AMA+0.003
Neg logit contributions
irtual-0.004
asket-0.003
��-0.003
IDENT-0.003
dam-0.003
Token to
Token ID284
Feature activation+3.45e-02

Pos logit contributions
vals+0.050
 Ple+0.049
GY+0.048
 gal+0.048
 AMA+0.047
Neg logit contributions
irtual-0.073
asket-0.062
��-0.058
IDENT-0.056
dam-0.054
Token turning
Token ID6225
Feature activation+1.21e-01

Pos logit contributions
irtual+0.007
asket+0.006
��+0.005
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 gal-0.005
 AMA-0.005
Token a
Token ID257
Feature activation+4.82e-02

Pos logit contributions
irtual+0.024
asket+0.020
��+0.019
IDENT+0.018
dam+0.017
Neg logit contributions
vals-0.019
 Ple-0.018
GY-0.018
 gal-0.018
 AMA-0.018
Token blind
Token ID7770
Feature activation+2.56e+00

Pos logit contributions
irtual+0.003
asket+0.002
��+0.001
IDENT+0.001
dam+0.001
Neg logit contributions
vals-0.014
 Ple-0.014
GY-0.013
 gal-0.013
 AMA-0.013
Token eye
Token ID4151
Feature activation+3.55e-01

Pos logit contributions
irtual+0.463
asket+0.382
��+0.350
IDENT+0.335
dam+0.327
Neg logit contributions
vals-0.435
 Ple-0.419
GY-0.417
 AMA-0.413
 gal-0.410
Token to
Token ID284
Feature activation+3.72e-02

Pos logit contributions
irtual+0.070
asket+0.059
��+0.055
IDENT+0.052
dam+0.051
Neg logit contributions
vals-0.054
 Ple-0.053
GY-0.052
 gal-0.052
 AMA-0.052
Token women
Token ID1466
Feature activation-6.66e-01

Pos logit contributions
irtual+0.007
asket+0.006
��+0.006
IDENT+0.006
dam+0.005
Neg logit contributions
vals-0.006
 Ple-0.005
GY-0.005
 gal-0.005
 AMA-0.005
Token right
Token ID826
Feature activation-4.29e-01

Pos logit contributions
vals+0.097
 Ple+0.093
GY+0.092
 AMA+0.091
 gal+0.091
Neg logit contributions
irtual-0.137
asket-0.115
��-0.108
IDENT-0.104
dam-0.102
Token,
Token ID11
Feature activation+3.38e-02

Pos logit contributions
vals+0.067
 Ple+0.064
GY+0.064
 gal+0.063
 AMA+0.063
Neg logit contributions
irtual-0.084
asket-0.070
��-0.066
IDENT-0.063
dam-0.061
Token rest
Token ID1334
Feature activation-1.27e-01

Pos logit contributions
vals+0.022
 Ple+0.021
GY+0.021
 gal+0.021
 AMA+0.021
Neg logit contributions
irtual-0.028
asket-0.023
��-0.022
IDENT-0.021
dam-0.020
Token of
Token ID286
Feature activation+1.43e-03

Pos logit contributions
vals+0.020
 Ple+0.019
GY+0.019
 gal+0.019
 AMA+0.019
Neg logit contributions
irtual-0.025
asket-0.021
��-0.019
IDENT-0.019
dam-0.018
Token alone
Token ID3436
Feature activation-6.13e-01

Pos logit contributions
irtual+0.000
asket+0.000
��+0.000
IDENT+0.000
dam+0.000
Neg logit contributions
vals-0.000
 Ple-0.000
GY-0.000
 gal-0.000
 AMA-0.000
Token to
Token ID284
Feature activation+1.63e-04

Pos logit contributions
vals+0.086
 Ple+0.083
GY+0.082
 gal+0.081
 AMA+0.081
Neg logit contributions
irtual-0.129
asket-0.109
��-0.103
IDENT-0.098
dam-0.096
Token the
Token ID262
Feature activation-1.19e-01

Pos logit contributions
irtual+0.000
asket+0.000
��+0.000
IDENT+0.000
dam+0.000
Neg logit contributions
vals-0.000
 Ple-0.000
GY-0.000
 gal-0.000
 AMA-0.000
Token tender
Token ID15403
Feature activation+2.56e+00

Pos logit contributions
vals+0.018
 Ple+0.018
GY+0.018
 gal+0.018
 AMA+0.017
Neg logit contributions
irtual-0.023
asket-0.020
��-0.018
IDENT-0.017
dam-0.017
Token merc
Token ID11991
Feature activation-7.04e-01

Pos logit contributions
irtual+0.519
asket+0.435
��+0.407
IDENT+0.390
dam+0.378
Neg logit contributions
vals-0.381
 Ple-0.369
GY-0.363
 gal-0.361
 AMA-0.360
Tokenies
Token ID444
Feature activation-3.71e-01

Pos logit contributions
vals+0.084
 Ple+0.081
GY+0.079
 gal+0.079
 AMA+0.078
Neg logit contributions
irtual-0.164
asket-0.141
��-0.134
IDENT-0.128
dam-0.125
Token of
Token ID286
Feature activation+3.66e-02

Pos logit contributions
vals+0.058
 Ple+0.056
GY+0.056
 gal+0.055
 AMA+0.055
Neg logit contributions
irtual-0.072
asket-0.060
��-0.056
IDENT-0.054
dam-0.052
Token incomp
Token ID13352
Feature activation-8.90e-01

Pos logit contributions
irtual+0.007
asket+0.006
��+0.006
IDENT+0.006
dam+0.005
Neg logit contributions
vals-0.006
 Ple-0.005
GY-0.005
 gal-0.005
 AMA-0.005
Tokenent
Token ID298
Feature activation+2.71e-01

Pos logit contributions
vals+0.113
 Ple+0.109
GY+0.107
 gal+0.106
 AMA+0.106
Neg logit contributions
irtual-0.200
asket-0.171
��-0.161
IDENT-0.155
dam-0.151
Token),
Token ID828
Feature activation+4.04e-02

Pos logit contributions
vals+0.018
 Ple+0.018
GY+0.018
 gal+0.017
 AMA+0.017
Neg logit contributions
irtual-0.024
asket-0.020
��-0.018
IDENT-0.018
dam-0.017
Token surplus
Token ID18201
Feature activation-1.03e+00

Pos logit contributions
irtual+0.008
asket+0.007
��+0.006
IDENT+0.006
dam+0.006
Neg logit contributions
vals-0.006
 Ple-0.006
GY-0.006
 gal-0.006
 AMA-0.006
Token (
Token ID357
Feature activation+1.73e-01

Pos logit contributions
vals+0.157
 Ple+0.152
GY+0.150
 gal+0.149
 AMA+0.149
Neg logit contributions
irtual-0.205
asket-0.171
��-0.160
IDENT-0.153
dam-0.148
Tokenh
Token ID71
Feature activation+6.24e-01

Pos logit contributions
irtual+0.036
asket+0.030
��+0.028
IDENT+0.027
dam+0.026
Neg logit contributions
vals-0.025
 Ple-0.025
GY-0.024
 gal-0.024
 AMA-0.024
Tokenul
Token ID377
Feature activation-2.76e-01

Pos logit contributions
irtual+0.126
asket+0.106
��+0.099
IDENT+0.094
dam+0.092
Neg logit contributions
vals-0.093
 Ple-0.090
GY-0.089
 gal-0.088
 AMA-0.088
Tokenagu
Token ID11433
Feature activation+2.55e+00

Pos logit contributions
vals+0.040
 Ple+0.039
GY+0.038
 gal+0.038
 AMA+0.038
Neg logit contributions
irtual-0.057
asket-0.048
��-0.045
IDENT-0.043
dam-0.042
Token),
Token ID828
Feature activation+3.59e-02

Pos logit contributions
irtual+0.507
asket+0.424
��+0.396
IDENT+0.377
dam+0.369
Neg logit contributions
vals-0.388
 Ple-0.375
GY-0.370
 AMA-0.367
 gal-0.366
Token[
Token ID58
Feature activation+8.21e-02

Pos logit contributions
irtual+0.007
asket+0.006
��+0.006
IDENT+0.006
dam+0.005
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 AMA-0.005
 gal-0.005
Token3
Token ID18
Feature activation-6.04e-02

Pos logit contributions
irtual+0.017
asket+0.014
��+0.013
IDENT+0.013
dam+0.012
Neg logit contributions
vals-0.012
 Ple-0.012
GY-0.012
 gal-0.011
 AMA-0.011
Token]
Token ID60
Feature activation+2.38e-01

Pos logit contributions
vals+0.011
 Ple+0.011
GY+0.010
 gal+0.010
 AMA+0.010
Neg logit contributions
irtual-0.010
asket-0.008
��-0.008
IDENT-0.007
dam-0.007
Token blue
Token ID4171
Feature activation+2.51e-01

Pos logit contributions
irtual+0.049
asket+0.041
��+0.038
IDENT+0.037
dam+0.036
Neg logit contributions
vals-0.035
 Ple-0.034
GY-0.033
 gal-0.033
 AMA-0.033
Token
Token ID198
Feature activation-2.51e-03

Pos logit contributions
vals+0.010
 Ple+0.010
GY+0.009
 AMA+0.009
 gal+0.009
Neg logit contributions
irtual-0.015
asket-0.012
��-0.012
IDENT-0.011
dam-0.011
Token
Token ID198
Feature activation-1.58e-02

Pos logit contributions
vals+0.000
 Ple+0.000
GY+0.000
 gal+0.000
 AMA+0.000
Neg logit contributions
irtual-0.001
asket-0.000
��-0.000
IDENT-0.000
dam-0.000
TokenThough
Token ID10915
Feature activation+9.45e-02

Pos logit contributions
vals+0.002
 Ple+0.002
GY+0.002
 gal+0.002
 AMA+0.002
Neg logit contributions
irtual-0.003
asket-0.003
��-0.002
IDENT-0.002
 Ukrain-0.002
Token deaf
Token ID21815
Feature activation+1.88e+00

Pos logit contributions
irtual+0.018
asket+0.015
��+0.014
IDENT+0.014
dam+0.013
Neg logit contributions
vals-0.015
 Ple-0.015
GY-0.014
 gal-0.014
 AMA-0.014
Token and
Token ID290
Feature activation-6.13e-02

Pos logit contributions
irtual+0.387
asket+0.324
��+0.305
IDENT+0.291
dam+0.283
Neg logit contributions
vals-0.274
 Ple-0.265
GY-0.261
 gal-0.260
 AMA-0.259
Token blind
Token ID7770
Feature activation+2.54e+00

Pos logit contributions
vals+0.012
 Ple+0.012
GY+0.012
 gal+0.012
 AMA+0.012
Neg logit contributions
irtual-0.009
asket-0.007
��-0.007
IDENT-0.006
dam-0.006
Token,
Token ID11
Feature activation-2.49e-02

Pos logit contributions
irtual+0.523
asket+0.438
��+0.411
IDENT+0.393
dam+0.382
Neg logit contributions
vals-0.373
 Ple-0.362
GY-0.355
 gal-0.355
 AMA-0.353
Token Helen
Token ID22695
Feature activation+1.80e+00

Pos logit contributions
vals+0.005
 Ple+0.005
 gal+0.005
 AMA+0.005
GY+0.005
Neg logit contributions
irtual-0.003
asket-0.002
��-0.002
IDENT-0.002
dam-0.002
Token Keller
Token ID35999
Feature activation+1.91e+00

Pos logit contributions
irtual+0.206
asket+0.145
��+0.125
IDENT+0.114
dam+0.106
Neg logit contributions
vals-0.427
 Ple-0.421
 gal-0.415
GY-0.415
 AMA-0.413
Token became
Token ID2627
Feature activation-1.97e-01

Pos logit contributions
irtual+0.390
asket+0.326
��+0.305
IDENT+0.292
dam+0.283
Neg logit contributions
vals-0.285
 Ple-0.277
 gal-0.272
GY-0.271
 AMA-0.270
Token a
Token ID257
Feature activation-5.97e-03

Pos logit contributions
vals+0.029
 Ple+0.028
 gal+0.028
GY+0.028
 AMA+0.028
Neg logit contributions
irtual-0.040
asket-0.034
��-0.032
IDENT-0.030
dam-0.029
Token.
Token ID13
Feature activation-2.44e-02

Pos logit contributions
irtual+0.031
asket+0.026
��+0.024
IDENT+0.023
dam+0.023
Neg logit contributions
vals-0.020
 Ple-0.020
GY-0.019
 gal-0.019
 AMA-0.019
Token We
Token ID775
Feature activation-4.62e-01

Pos logit contributions
vals+0.004
 Ple+0.003
GY+0.003
 gal+0.003
 AMA+0.003
Neg logit contributions
irtual-0.005
asket-0.004
��-0.004
IDENT-0.004
dam-0.004
Token all
Token ID477
Feature activation+5.87e-01

Pos logit contributions
vals+0.078
 Ple+0.076
GY+0.075
 gal+0.074
 AMA+0.074
Neg logit contributions
irtual-0.085
asket-0.069
��-0.064
IDENT-0.061
dam-0.059
Token turn
Token ID1210
Feature activation+1.74e-01

Pos logit contributions
irtual+0.115
asket+0.096
��+0.089
IDENT+0.086
dam+0.083
Neg logit contributions
vals-0.091
 Ple-0.088
GY-0.087
 gal-0.086
 AMA-0.086
Token a
Token ID257
Feature activation+6.80e-02

Pos logit contributions
irtual+0.035
asket+0.029
��+0.027
IDENT+0.026
dam+0.025
Neg logit contributions
vals-0.027
 Ple-0.026
GY-0.025
 gal-0.025
 AMA-0.025
Token blind
Token ID7770
Feature activation+2.54e+00

Pos logit contributions
irtual+0.007
asket+0.005
��+0.004
IDENT+0.004
dam+0.004
Neg logit contributions
vals-0.017
 Ple-0.016
GY-0.016
 AMA-0.016
 gal-0.016
Token eye
Token ID4151
Feature activation+3.63e-01

Pos logit contributions
irtual+0.460
asket+0.381
��+0.348
IDENT+0.333
dam+0.326
Neg logit contributions
vals-0.430
 Ple-0.415
GY-0.414
 AMA-0.410
 gal-0.406
Token to
Token ID284
Feature activation+5.78e-03

Pos logit contributions
irtual+0.072
asket+0.060
��+0.056
IDENT+0.053
dam+0.052
Neg logit contributions
vals-0.056
 Ple-0.054
GY-0.054
 gal-0.053
 AMA-0.053
Token the
Token ID262
Feature activation-8.04e-02

Pos logit contributions
irtual+0.001
asket+0.001
��+0.001
IDENT+0.001
dam+0.001
Neg logit contributions
vals-0.001
 Ple-0.001
GY-0.001
 gal-0.001
 AMA-0.001
Token ways
Token ID2842
Feature activation-2.55e-01

Pos logit contributions
vals+0.013
 Ple+0.012
GY+0.012
 gal+0.012
 AMA+0.012
Neg logit contributions
irtual-0.016
asket-0.013
��-0.012
IDENT-0.012
dam-0.011
Token that
Token ID326
Feature activation+3.38e-01

Pos logit contributions
vals+0.039
 Ple+0.038
GY+0.037
 AMA+0.037
 gal+0.037
Neg logit contributions
irtual-0.050
asket-0.042
��-0.039
IDENT-0.038
dam-0.037
Token the
Token ID262
Feature activation-9.98e-02

Pos logit contributions
irtual+0.049
asket+0.041
��+0.039
IDENT+0.037
dam+0.036
Neg logit contributions
vals-0.036
 Ple-0.035
GY-0.034
 gal-0.034
 AMA-0.034
Token presidency
Token ID12112
Feature activation+3.28e-01

Pos logit contributions
vals+0.016
 Ple+0.015
GY+0.015
 gal+0.015
 AMA+0.015
Neg logit contributions
irtual-0.019
asket-0.016
��-0.015
IDENT-0.014
dam-0.014
Token.
Token ID13
Feature activation-3.61e-02

Pos logit contributions
irtual+0.068
asket+0.057
��+0.053
IDENT+0.051
dam+0.049
Neg logit contributions
vals-0.048
 Ple-0.046
GY-0.046
 gal-0.046
 AMA-0.045
Token They
Token ID1119
Feature activation-1.24e-01

Pos logit contributions
vals+0.005
 Ple+0.005
GY+0.005
 gal+0.005
 AMA+0.005
Neg logit contributions
irtual-0.007
asket-0.006
��-0.006
IDENT-0.005
dam-0.005
Token are
Token ID389
Feature activation-8.39e-02

Pos logit contributions
vals+0.019
 Ple+0.019
GY+0.018
 gal+0.018
 AMA+0.018
Neg logit contributions
irtual-0.024
asket-0.020
��-0.019
IDENT-0.018
dam-0.018
Token blind
Token ID7770
Feature activation+2.54e+00

Pos logit contributions
vals+0.014
 Ple+0.013
GY+0.013
 gal+0.013
 AMA+0.013
Neg logit contributions
irtual-0.016
asket-0.013
��-0.012
IDENT-0.012
dam-0.011
Token to
Token ID284
Feature activation+1.93e-02

Pos logit contributions
irtual+0.498
asket+0.414
��+0.387
IDENT+0.370
dam+0.359
Neg logit contributions
vals-0.395
 Ple-0.383
GY-0.378
 gal-0.375
 AMA-0.374
Token the
Token ID262
Feature activation-8.79e-02

Pos logit contributions
irtual+0.004
asket+0.003
��+0.003
IDENT+0.003
dam+0.003
Neg logit contributions
vals-0.003
 Ple-0.003
GY-0.003
 gal-0.003
 AMA-0.003
Token fact
Token ID1109
Feature activation+3.01e-02

Pos logit contributions
vals+0.014
 Ple+0.014
GY+0.013
 gal+0.013
 AMA+0.013
Neg logit contributions
irtual-0.017
asket-0.014
��-0.013
IDENT-0.012
dam-0.012
Token their
Token ID511
Feature activation-1.35e-01

Pos logit contributions
irtual+0.005
asket+0.004
��+0.004
IDENT+0.003
dam+0.003
Neg logit contributions
vals-0.006
 Ple-0.006
GY-0.005
 gal-0.005
 AMA-0.005
Token uncon
Token ID21254
Feature activation-1.10e+00

Pos logit contributions
vals+0.020
 Ple+0.019
GY+0.019
 gal+0.019
 AMA+0.019
Neg logit contributions
irtual-0.027
asket-0.023
��-0.021
IDENT-0.021
dam-0.020
Token and
Token ID290
Feature activation-9.17e-02

Pos logit contributions
irtual+0.012
asket+0.010
��+0.010
IDENT+0.009
dam+0.009
Neg logit contributions
vals-0.008
 Ple-0.008
GY-0.008
 gal-0.008
 AMA-0.008
Token ongoing
Token ID7044
Feature activation+2.17e-01

Pos logit contributions
vals+0.014
 Ple+0.014
GY+0.014
 gal+0.013
 AMA+0.013
Neg logit contributions
irtual-0.018
asket-0.015
��-0.014
IDENT-0.013
dam-0.013
Token.
Token ID13
Feature activation-5.33e-02

Pos logit contributions
irtual+0.046
asket+0.039
��+0.036
IDENT+0.035
dam+0.034
Neg logit contributions
vals-0.031
 Ple-0.030
GY-0.029
 gal-0.029
 AMA-0.029
Token It
Token ID632
Feature activation-2.55e-01

Pos logit contributions
vals+0.008
 Ple+0.008
GY+0.008
 gal+0.008
 AMA+0.008
Neg logit contributions
irtual-0.011
asket-0.009
��-0.008
IDENT-0.008
dam-0.008
Token seemed
Token ID3947
Feature activation-3.29e-01

Pos logit contributions
vals+0.039
 Ple+0.038
GY+0.037
 gal+0.037
 AMA+0.037
Neg logit contributions
irtual-0.051
asket-0.042
��-0.040
IDENT-0.038
dam-0.037
Token blind
Token ID7770
Feature activation+2.52e+00

Pos logit contributions
vals+0.049
 Ple+0.048
GY+0.047
 gal+0.047
 AMA+0.047
Neg logit contributions
irtual-0.067
asket-0.056
��-0.052
IDENT-0.050
dam-0.049
Tokeningly
Token ID4420
Feature activation-8.28e-01

Pos logit contributions
irtual+0.590
asket+0.507
��+0.478
IDENT+0.462
dam+0.452
Neg logit contributions
vals-0.298
 Ple-0.285
GY-0.281
 gal-0.278
 AMA-0.277
Token obvious
Token ID3489
Feature activation+4.97e-01

Pos logit contributions
vals+0.144
 Ple+0.141
GY+0.139
 gal+0.139
 AMA+0.138
Neg logit contributions
irtual-0.148
asket-0.120
��-0.111
IDENT-0.105
dam-0.101
Token that
Token ID326
Feature activation+2.68e-01

Pos logit contributions
irtual+0.093
asket+0.076
��+0.071
IDENT+0.067
dam+0.065
Neg logit contributions
vals-0.082
 Ple-0.080
GY-0.079
 gal-0.078
 AMA-0.078
Token there
Token ID612
Feature activation+2.67e-01

Pos logit contributions
irtual+0.053
asket+0.044
��+0.041
IDENT+0.039
dam+0.038
Neg logit contributions
vals-0.041
 Ple-0.040
GY-0.040
 gal-0.039
 AMA-0.039
Token is
Token ID318
Feature activation-7.48e-02

Pos logit contributions
irtual+0.053
asket+0.044
��+0.041
IDENT+0.039
dam+0.038
Neg logit contributions
vals-0.041
 Ple-0.040
GY-0.039
 gal-0.039
 AMA-0.039
Token a
Token ID257
Feature activation+3.54e-02

Pos logit contributions
irtual+0.043
asket+0.036
��+0.033
IDENT+0.032
dam+0.031
Neg logit contributions
vals-0.032
 Ple-0.031
GY-0.031
 gal-0.030
 AMA-0.030
Token means
Token ID1724
Feature activation+7.17e-02

Pos logit contributions
irtual+0.007
asket+0.006
��+0.005
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.006
 Ple-0.006
GY-0.005
 gal-0.005
 AMA-0.005
Token to
Token ID284
Feature activation+1.18e-02

Pos logit contributions
irtual+0.014
asket+0.012
��+0.011
IDENT+0.010
dam+0.010
Neg logit contributions
vals-0.011
 Ple-0.011
GY-0.011
 gal-0.011
 AMA-0.011
Token
Token ID564
Feature activation-6.04e-02

Pos logit contributions
irtual+0.002
asket+0.002
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Token
Token ID250
Feature activation-5.68e-02

Pos logit contributions
vals+0.011
 Ple+0.011
GY+0.011
 AMA+0.011
 gal+0.011
Neg logit contributions
irtual-0.010
asket-0.008
��-0.008
IDENT-0.007
dam-0.007
Tokenspot
Token ID20485
Feature activation-2.56e+00

Pos logit contributions
vals+0.009
 Ple+0.009
GY+0.009
 gal+0.008
 AMA+0.008
Neg logit contributions
irtual-0.011
asket-0.009
��-0.009
IDENT-0.008
dam-0.008
Token problems
Token ID2761
Feature activation+9.08e-01

Pos logit contributions
vals+0.428
 Ple+0.417
GY+0.411
 gal+0.409
 AMA+0.408
Neg logit contributions
irtual-0.473
asket-0.389
��-0.362
IDENT-0.344
dam-0.332
Token earlier
Token ID2961
Feature activation+3.41e-01

Pos logit contributions
irtual+0.181
asket+0.152
��+0.142
IDENT+0.136
dam+0.132
Neg logit contributions
vals-0.138
 Ple-0.133
GY-0.132
 gal-0.130
 AMA-0.130
Token and
Token ID290
Feature activation-5.77e-02

Pos logit contributions
irtual+0.069
asket+0.058
��+0.054
IDENT+0.052
dam+0.051
Neg logit contributions
vals-0.051
 Ple-0.049
GY-0.048
 gal-0.048
 AMA-0.048
Token prevent
Token ID2948
Feature activation-2.32e-01

Pos logit contributions
vals+0.009
 Ple+0.009
GY+0.009
 gal+0.009
 AMA+0.009
Neg logit contributions
irtual-0.011
asket-0.009
��-0.009
IDENT-0.008
dam-0.008
Token officials
Token ID2828
Feature activation+4.10e-01

Pos logit contributions
vals+0.041
 Ple+0.040
GY+0.040
 gal+0.040
 AMA+0.039
Neg logit contributions
irtual-0.040
asket-0.033
��-0.030
IDENT-0.029
dam-0.028
Token say
Token ID910
Feature activation+2.61e-01

Pos logit contributions
vals+0.113
 Ple+0.109
GY+0.107
 gal+0.107
 AMA+0.106
Neg logit contributions
irtual-0.151
asket-0.126
��-0.118
IDENT-0.113
dam-0.110
Token they
Token ID484
Feature activation-1.58e-01

Pos logit contributions
irtual+0.053
asket+0.045
��+0.042
IDENT+0.040
dam+0.039
Neg logit contributions
vals-0.039
 Ple-0.037
GY-0.037
 gal-0.037
 AMA-0.036
Token received
Token ID2722
Feature activation+1.67e-01

Pos logit contributions
vals+0.024
 Ple+0.024
GY+0.023
 gal+0.023
 AMA+0.023
Neg logit contributions
irtual-0.031
asket-0.026
��-0.024
IDENT-0.023
dam-0.022
Token notices
Token ID19748
Feature activation+7.95e-01

Pos logit contributions
irtual+0.032
asket+0.026
��+0.024
IDENT+0.023
dam+0.023
Neg logit contributions
vals-0.027
 Ple-0.026
GY-0.026
 gal-0.025
 AMA-0.025
Token from
Token ID422
Feature activation-2.16e-01

Pos logit contributions
irtual+0.158
asket+0.132
��+0.123
IDENT+0.118
dam+0.115
Neg logit contributions
vals-0.122
 Ple-0.117
GY-0.116
 AMA-0.115
 gal-0.115
Token Tinder
Token ID44043
Feature activation-2.45e+00

Pos logit contributions
vals+0.024
 Ple+0.023
GY+0.023
 gal+0.022
 AMA+0.022
Neg logit contributions
irtual-0.052
asket-0.045
��-0.042
IDENT-0.041
dam-0.040
Token saying
Token ID2282
Feature activation+8.55e-02

Pos logit contributions
vals+0.388
 Ple+0.375
GY+0.372
 gal+0.367
 AMA+0.367
Neg logit contributions
irtual-0.474
asket-0.395
��-0.367
IDENT-0.351
dam-0.341
Token the
Token ID262
Feature activation-1.05e-01

Pos logit contributions
irtual+0.017
asket+0.014
��+0.013
IDENT+0.012
dam+0.012
Neg logit contributions
vals-0.013
 Ple-0.013
GY-0.013
 gal-0.013
 AMA-0.013
Token were
Token ID547
Feature activation-1.02e-01

Pos logit contributions
vals+0.016
 Ple+0.016
GY+0.016
 gal+0.015
 AMA+0.015
Neg logit contributions
irtual-0.020
asket-0.017
��-0.016
IDENT-0.015
dam-0.015
Token unable
Token ID5906
Feature activation-1.61e-01

Pos logit contributions
vals+0.017
 Ple+0.016
GY+0.016
 gal+0.016
 AMA+0.016
Neg logit contributions
irtual-0.019
asket-0.016
��-0.015
IDENT-0.014
dam-0.014
Token to
Token ID284
Feature activation+2.40e-03

Pos logit contributions
vals+0.025
 Ple+0.025
GY+0.024
 gal+0.024
 AMA+0.024
Neg logit contributions
irtual-0.031
asket-0.026
��-0.024
IDENT-0.023
dam-0.023
Token many
Token ID867
Feature activation+3.15e-01

Pos logit contributions
irtual+0.040
asket+0.033
��+0.030
IDENT+0.028
dam+0.027
Neg logit contributions
vals-0.043
 Ple-0.042
GY-0.041
 AMA-0.041
 gal-0.041
Token as
Token ID355
Feature activation+2.26e-01

Pos logit contributions
irtual+0.058
asket+0.048
��+0.044
IDENT+0.042
dam+0.041
Neg logit contributions
vals-0.053
 Ple-0.051
GY-0.051
 gal-0.050
 AMA-0.050
Token 100
Token ID1802
Feature activation-6.37e-01

Pos logit contributions
irtual+0.050
asket+0.042
��+0.040
IDENT+0.038
dam+0.037
Neg logit contributions
vals-0.030
 Ple-0.029
GY-0.029
 gal-0.028
 AMA-0.028
Token people
Token ID661
Feature activation-2.79e-01

Pos logit contributions
vals+0.097
 Ple+0.093
GY+0.092
 gal+0.091
 AMA+0.091
Neg logit contributions
irtual-0.127
asket-0.107
��-0.100
IDENT-0.095
dam-0.093
Token on
Token ID319
Feature activation+2.35e-01

Pos logit contributions
vals+0.042
 Ple+0.040
GY+0.040
 gal+0.040
 AMA+0.039
Neg logit contributions
irtual-0.056
asket-0.047
��-0.044
IDENT-0.042
dam-0.041
Token Tinder
Token ID44043
Feature activation-2.42e+00

Pos logit contributions
irtual+0.049
asket+0.042
��+0.039
IDENT+0.037
dam+0.036
Neg logit contributions
vals-0.033
 Ple-0.032
GY-0.032
 gal-0.031
 AMA-0.031
Token.
Token ID13
Feature activation-8.44e-03

Pos logit contributions
vals+0.366
 Ple+0.353
GY+0.349
 gal+0.345
 AMA+0.345
Neg logit contributions
irtual-0.485
asket-0.406
��-0.378
IDENT-0.364
dam-0.353
Token The
Token ID383
Feature activation+1.26e-01

Pos logit contributions
vals+0.001
 Ple+0.001
GY+0.001
 gal+0.001
 AMA+0.001
Neg logit contributions
irtual-0.002
asket-0.001
��-0.001
IDENT-0.001
dam-0.001
Token married
Token ID6405
Feature activation-3.64e-01

Pos logit contributions
irtual+0.026
asket+0.022
��+0.020
IDENT+0.020
dam+0.019
Neg logit contributions
vals-0.018
 Ple-0.018
GY-0.018
 gal-0.017
 AMA-0.017
Token Iowa
Token ID9406
Feature activation+5.60e-01

Pos logit contributions
vals+0.051
 Ple+0.049
GY+0.048
 gal+0.048
 AMA+0.048
Neg logit contributions
irtual-0.077
asket-0.065
��-0.061
IDENT-0.059
dam-0.057
Token resident
Token ID6623
Feature activation+4.12e-01

Pos logit contributions
irtual+0.116
asket+0.097
��+0.091
IDENT+0.088
dam+0.085
Neg logit contributions
vals-0.081
 Ple-0.079
GY-0.077
 gal-0.077
 AMA-0.077
Token of
Token ID286
Feature activation+6.15e-02

Pos logit contributions
vals+0.049
 Ple+0.048
 gal+0.047
GY+0.047
 AMA+0.046
Neg logit contributions
irtual-0.064
asket-0.053
��-0.050
IDENT-0.048
dam-0.046
Token a
Token ID257
Feature activation+9.53e-02

Pos logit contributions
irtual+0.012
asket+0.010
��+0.009
IDENT+0.009
dam+0.009
Neg logit contributions
vals-0.010
 Ple-0.009
GY-0.009
 gal-0.009
 AMA-0.009
Token sp
Token ID599
Feature activation-7.56e-01

Pos logit contributions
irtual+0.019
asket+0.016
��+0.015
IDENT+0.014
dam+0.014
Neg logit contributions
vals-0.014
 Ple-0.014
GY-0.014
 AMA-0.014
 gal-0.013
Tokeniced
Token ID3711
Feature activation+6.05e-01

Pos logit contributions
vals+0.119
 Ple+0.115
GY+0.114
 gal+0.113
 AMA+0.113
Neg logit contributions
irtual-0.146
asket-0.121
��-0.114
IDENT-0.109
dam-0.105
Token pear
Token ID25286
Feature activation-5.64e-01

Pos logit contributions
irtual+0.122
asket+0.102
��+0.095
IDENT+0.091
dam+0.089
Neg logit contributions
vals-0.091
 Ple-0.087
GY-0.086
 AMA-0.085
 gal-0.085
Token gal
Token ID9426
Feature activation-2.40e+00

Pos logit contributions
vals+0.086
 Ple+0.083
GY+0.082
 gal+0.082
 AMA+0.081
Neg logit contributions
irtual-0.112
asket-0.094
��-0.088
IDENT-0.084
dam-0.081
Tokenette
Token ID5857
Feature activation+6.06e-02

Pos logit contributions
vals+0.304
 Ple+0.293
GY+0.290
 AMA+0.285
 gal+0.284
Neg logit contributions
irtual-0.540
asket-0.463
��-0.435
IDENT-0.418
dam-0.409
Token?
Token ID30
Feature activation+9.17e-02

Pos logit contributions
irtual+0.013
asket+0.011
��+0.010
IDENT+0.010
dam+0.009
Neg logit contributions
vals-0.009
 Ple-0.008
GY-0.008
 gal-0.008
 AMA-0.008
Token
Token ID198
Feature activation+3.36e-02

Pos logit contributions
irtual+0.019
asket+0.016
��+0.015
IDENT+0.014
dam+0.014
Neg logit contributions
vals-0.014
 Ple-0.013
GY-0.013
 gal-0.013
 AMA-0.013
Token
Token ID198
Feature activation+7.24e-03

Pos logit contributions
irtual+0.007
asket+0.006
��+0.005
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 gal-0.005
 AMA-0.005
TokenLots
Token ID43643
Feature activation-5.49e-01

Pos logit contributions
irtual+0.001
asket+0.001
��+0.001
IDENT+0.001
dam+0.001
Neg logit contributions
vals-0.001
 Ple-0.001
GY-0.001
 gal-0.001
 AMA-0.001
Token the
Token ID262
Feature activation-9.80e-02

Pos logit contributions
irtual+0.039
asket+0.033
��+0.031
IDENT+0.029
dam+0.028
Neg logit contributions
vals-0.031
 Ple-0.030
GY-0.030
 gal-0.029
 AMA-0.029
Token floor
Token ID4314
Feature activation+1.99e-01

Pos logit contributions
vals+0.014
 Ple+0.013
GY+0.013
 AMA+0.013
 gal+0.013
Neg logit contributions
irtual-0.021
asket-0.018
��-0.016
IDENT-0.016
dam-0.015
Token while
Token ID981
Feature activation-3.38e-01

Pos logit contributions
irtual+0.040
asket+0.034
��+0.032
IDENT+0.030
dam+0.030
Neg logit contributions
vals-0.030
 Ple-0.029
GY-0.028
 gal-0.028
 AMA-0.028
Token another
Token ID1194
Feature activation-1.74e-01

Pos logit contributions
vals+0.050
 Ple+0.049
GY+0.048
 gal+0.048
 AMA+0.047
Neg logit contributions
irtual-0.069
asket-0.058
��-0.054
IDENT-0.052
dam-0.050
Token makes
Token ID1838
Feature activation-2.78e-01

Pos logit contributions
vals+0.026
 Ple+0.025
GY+0.025
 gal+0.025
 AMA+0.025
Neg logit contributions
irtual-0.035
asket-0.029
��-0.028
IDENT-0.026
dam-0.026
Token pelvic
Token ID43510
Feature activation-2.36e+00

Pos logit contributions
vals+0.043
 Ple+0.042
GY+0.042
 gal+0.041
 AMA+0.041
Neg logit contributions
irtual-0.054
asket-0.045
��-0.042
IDENT-0.040
dam-0.039
Token thrust
Token ID14613
Feature activation-8.82e-01

Pos logit contributions
vals+0.298
 Ple+0.287
GY+0.283
 gal+0.279
 AMA+0.279
Neg logit contributions
irtual-0.533
asket-0.455
��-0.429
IDENT-0.414
dam-0.404
Tokens
Token ID82
Feature activation+1.05e-02

Pos logit contributions
vals+0.138
 Ple+0.134
GY+0.133
 gal+0.131
 AMA+0.131
Neg logit contributions
irtual-0.172
asket-0.143
��-0.133
IDENT-0.127
dam-0.124
Token at
Token ID379
Feature activation-1.09e-01

Pos logit contributions
irtual+0.002
asket+0.002
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.001
GY-0.001
 gal-0.001
 AMA-0.001
Token her
Token ID607
Feature activation+4.70e-01

Pos logit contributions
vals+0.018
 Ple+0.018
GY+0.018
 AMA+0.017
 gal+0.017
Neg logit contributions
irtual-0.020
asket-0.017
��-0.015
IDENT-0.015
dam-0.014
Token buttocks
Token ID50193
Feature activation-1.49e+00

Pos logit contributions
irtual+0.101
asket+0.087
��+0.080
IDENT+0.079
dam+0.077
Neg logit contributions
vals-0.063
GY-0.059
 Ple-0.059
 AMA-0.058
-0.058
Token 1945
Token ID15761
Feature activation+8.70e-01

Pos logit contributions
vals+0.000
 Ple+0.000
GY+0.000
 gal+0.000
 AMA+0.000
Neg logit contributions
irtual-0.000
asket-0.000
��-0.000
IDENT-0.000
dam-0.000
Token,
Token ID11
Feature activation-2.08e-02

Pos logit contributions
irtual+0.185
asket+0.156
��+0.147
IDENT+0.141
dam+0.137
Neg logit contributions
vals-0.121
 Ple-0.118
 gal-0.116
GY-0.116
 AMA-0.115
Token Northeast
Token ID26315
Feature activation+7.45e-01

Pos logit contributions
vals+0.003
 Ple+0.003
GY+0.003
 gal+0.003
 AMA+0.003
Neg logit contributions
irtual-0.004
asket-0.004
��-0.003
IDENT-0.003
dam-0.003
Token Ohio
Token ID6835
Feature activation-5.35e-02

Pos logit contributions
irtual+0.154
asket+0.130
��+0.122
IDENT+0.117
dam+0.114
Neg logit contributions
vals-0.108
 Ple-0.104
GY-0.103
 gal-0.102
 AMA-0.101
Token to
Token ID284
Feature activation-1.08e-02

Pos logit contributions
vals+0.008
 Ple+0.008
GY+0.007
 gal+0.007
 AMA+0.007
Neg logit contributions
irtual-0.011
asket-0.009
��-0.009
IDENT-0.008
dam-0.008
Tokenasted
Token ID8992
Feature activation-2.35e+00

Pos logit contributions
vals+0.002
 Ple+0.001
GY+0.001
 gal+0.001
 AMA+0.001
Neg logit contributions
irtual-0.002
asket-0.002
��-0.002
IDENT-0.002
dam-0.002
Token its
Token ID663
Feature activation-3.14e-01

Pos logit contributions
vals+0.342
 Ple+0.331
GY+0.326
 gal+0.324
 AMA+0.323
Neg logit contributions
irtual-0.485
asket-0.408
��-0.382
IDENT-0.366
dam-0.356
Token world
Token ID995
Feature activation+7.70e-01

Pos logit contributions
vals+0.051
 Ple+0.049
GY+0.049
 gal+0.048
 AMA+0.048
Neg logit contributions
irtual-0.060
asket-0.049
��-0.046
IDENT-0.044
dam-0.043
Token champion
Token ID8783
Feature activation+5.46e-02

Pos logit contributions
irtual+0.154
asket+0.129
��+0.120
IDENT+0.116
dam+0.113
Neg logit contributions
vals-0.116
 Ple-0.113
GY-0.111
 AMA-0.110
 gal-0.110
Token Cleveland
Token ID10306
Feature activation+9.32e-02

Pos logit contributions
irtual+0.011
asket+0.009
��+0.009
IDENT+0.008
dam+0.008
Neg logit contributions
vals-0.008
 Ple-0.008
GY-0.008
 gal-0.008
 AMA-0.008
Token Rams
Token ID16077
Feature activation+2.29e-01

Pos logit contributions
irtual+0.019
asket+0.016
��+0.015
IDENT+0.014
dam+0.014
Neg logit contributions
vals-0.014
 Ple-0.013
GY-0.013
 AMA-0.013
 gal-0.013
Token months
Token ID1933
Feature activation-1.63e-01

Pos logit contributions
vals+0.061
 Ple+0.059
GY+0.058
 gal+0.058
 AMA+0.058
Neg logit contributions
irtual-0.080
asket-0.067
��-0.063
IDENT-0.060
dam-0.058
Token ago
Token ID2084
Feature activation-4.19e-01

Pos logit contributions
vals+0.021
 Ple+0.020
GY+0.020
 gal+0.019
 AMA+0.019
Neg logit contributions
irtual-0.037
asket-0.031
��-0.029
IDENT-0.028
dam-0.028
Token,
Token ID11
Feature activation-6.91e-04

Pos logit contributions
vals+0.060
 Ple+0.059
GY+0.058
 gal+0.057
 AMA+0.057
Neg logit contributions
irtual-0.087
asket-0.073
��-0.069
IDENT-0.066
dam-0.064
Token Jo
Token ID5302
Feature activation+3.58e-01

Pos logit contributions
vals+0.000
 Ple+0.000
GY+0.000
 gal+0.000
 AMA+0.000
Neg logit contributions
irtual-0.000
asket-0.000
��-0.000
IDENT-0.000
dam-0.000
Token Song
Token ID10940
Feature activation+1.91e-01

Pos logit contributions
irtual+0.072
asket+0.061
��+0.057
IDENT+0.054
dam+0.053
Neg logit contributions
vals-0.054
 Ple-0.052
GY-0.051
 gal-0.051
 AMA-0.051
Token Gy
Token ID18488
Feature activation-2.34e+00

Pos logit contributions
irtual+0.040
asket+0.034
��+0.032
IDENT+0.030
dam+0.029
Neg logit contributions
vals-0.027
 Ple-0.026
GY-0.026
 gal-0.026
 AMA-0.026
Tokenu
Token ID84
Feature activation-1.82e-01

Pos logit contributions
vals+0.339
 Ple+0.328
GY+0.323
 gal+0.320
 AMA+0.320
Neg logit contributions
irtual-0.484
asket-0.408
��-0.382
IDENT-0.366
dam-0.356
Token,
Token ID11
Feature activation-1.08e-02

Pos logit contributions
vals+0.026
 Ple+0.025
GY+0.025
 AMA+0.024
 gal+0.024
Neg logit contributions
irtual-0.038
asket-0.032
��-0.030
IDENT-0.029
dam-0.028
Token director
Token ID3437
Feature activation-3.53e-01

Pos logit contributions
vals+0.002
 Ple+0.001
GY+0.001
 gal+0.001
 AMA+0.001
Neg logit contributions
irtual-0.002
asket-0.002
��-0.002
IDENT-0.002
dam-0.002
Token of
Token ID286
Feature activation+2.32e-02

Pos logit contributions
vals+0.054
 Ple+0.053
GY+0.052
 gal+0.052
 AMA+0.052
Neg logit contributions
irtual-0.070
asket-0.058
��-0.054
IDENT-0.052
dam-0.050
Token the
Token ID262
Feature activation-9.47e-02

Pos logit contributions
irtual+0.005
asket+0.004
��+0.004
IDENT+0.004
dam+0.004
Neg logit contributions
vals-0.003
 Ple-0.003
GY-0.003
 gal-0.003
 AMA-0.003
Token Luxembourg
Token ID31864
Feature activation+8.97e-02

Pos logit contributions
vals+0.093
 Ple+0.090
GY+0.089
 gal+0.088
 AMA+0.088
Neg logit contributions
irtual-0.119
asket-0.099
��-0.093
IDENT-0.089
dam-0.086
Token premier
Token ID18256
Feature activation+1.59e-01

Pos logit contributions
irtual+0.019
asket+0.017
��+0.016
IDENT+0.015
dam+0.015
Neg logit contributions
vals-0.012
 Ple-0.012
GY-0.011
 AMA-0.011
 gal-0.011
Token Jean
Token ID11320
Feature activation+9.73e-01

Pos logit contributions
irtual+0.029
asket+0.024
��+0.022
IDENT+0.021
dam+0.020
Neg logit contributions
vals-0.027
 Ple-0.026
GY-0.026
 gal-0.026
 AMA-0.026
Token-
Token ID12
Feature activation+4.33e-02

Pos logit contributions
irtual+0.213
asket+0.180
��+0.170
IDENT+0.163
dam+0.158
Neg logit contributions
vals-0.131
 Ple-0.127
 gal-0.124
GY-0.123
 AMA-0.122
TokenCla
Token ID47404
Feature activation+2.11e-01

Pos logit contributions
irtual+0.007
asket+0.006
��+0.005
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.008
 Ple-0.008
GY-0.008
 gal-0.008
 AMA-0.008
Tokenude
Token ID2507
Feature activation-2.33e+00

Pos logit contributions
irtual+0.065
asket+0.059
��+0.056
IDENT+0.055
dam+0.054
Neg logit contributions
vals-0.009
 Ple-0.008
GY-0.007
 gal-0.007
 AMA-0.007
Token Jun
Token ID7653
Feature activation+2.44e-01

Pos logit contributions
vals+0.364
 Ple+0.353
GY+0.349
 gal+0.346
 AMA+0.345
Neg logit contributions
irtual-0.457
asket-0.380
��-0.355
IDENT-0.339
dam-0.329
Tokencker
Token ID15280
Feature activation-3.76e-01

Pos logit contributions
irtual+0.058
asket+0.050
��+0.047
IDENT+0.046
dam+0.045
Neg logit contributions
vals-0.028
 Ple-0.027
GY-0.026
 gal-0.026
 AMA-0.026
Token,
Token ID11
Feature activation-2.21e-02

Pos logit contributions
vals+0.057
 Ple+0.056
GY+0.055
 gal+0.054
 AMA+0.054
Neg logit contributions
irtual-0.075
asket-0.063
��-0.059
IDENT-0.056
dam-0.054
Token she
Token ID673
Feature activation+6.71e-02

Pos logit contributions
vals+0.003
 Ple+0.003
GY+0.003
 gal+0.003
 AMA+0.003
Neg logit contributions
irtual-0.004
asket-0.004
��-0.004
IDENT-0.003
dam-0.003
Token hoped
Token ID10719
Feature activation-2.54e-02

Pos logit contributions
irtual+0.013
asket+0.011
��+0.010
IDENT+0.010
dam+0.009
Neg logit contributions
vals-0.011
 Ple-0.010
GY-0.010
 gal-0.010
 AMA-0.010
Token (
Token ID357
Feature activation+1.66e-01

Pos logit contributions
irtual+0.238
asket+0.200
��+0.186
IDENT+0.180
dam+0.175
Neg logit contributions
vals-0.175
 Ple-0.169
GY-0.167
 gal-0.164
 AMA-0.164
Tokenverified
Token ID47684
Feature activation-6.99e-01

Pos logit contributions
irtual+0.030
asket+0.024
��+0.022
IDENT+0.021
dam+0.020
Neg logit contributions
vals-0.029
 Ple-0.028
GY-0.028
 gal-0.028
 AMA-0.028
Token)
Token ID8
Feature activation-1.28e-01

Pos logit contributions
vals+0.113
 Ple+0.110
GY+0.109
 gal+0.108
 AMA+0.108
Neg logit contributions
irtual-0.133
asket-0.110
��-0.102
IDENT-0.097
dam-0.094
Token from
Token ID422
Feature activation-1.94e-01

Pos logit contributions
vals+0.019
 Ple+0.019
GY+0.018
 gal+0.018
 AMA+0.018
Neg logit contributions
irtual-0.026
asket-0.022
��-0.020
IDENT-0.019
dam-0.019
Token the
Token ID262
Feature activation-1.04e-01

Pos logit contributions
vals+0.030
 Ple+0.029
GY+0.029
 gal+0.028
 AMA+0.028
Neg logit contributions
irtual-0.038
asket-0.032
��-0.030
IDENT-0.028
dam-0.028
Token registry
Token ID20478
Feature activation-2.33e+00

Pos logit contributions
vals+0.013
 Ple+0.013
GY+0.012
 gal+0.012
 AMA+0.012
Neg logit contributions
irtual-0.023
asket-0.020
��-0.019
IDENT-0.018
dam-0.018
Token.
Token ID13
Feature activation-5.24e-02

Pos logit contributions
vals+0.340
 Ple+0.328
GY+0.324
 gal+0.321
 AMA+0.320
Neg logit contributions
irtual-0.480
asket-0.404
��-0.378
IDENT-0.364
dam-0.353
Token So
Token ID1406
Feature activation+2.06e-02

Pos logit contributions
vals+0.008
 Ple+0.008
GY+0.008
 gal+0.008
 AMA+0.008
Neg logit contributions
irtual-0.010
asket-0.009
��-0.008
IDENT-0.008
dam-0.008
Token technically
Token ID14497
Feature activation+7.76e-01

Pos logit contributions
irtual+0.004
asket+0.003
��+0.003
IDENT+0.003
dam+0.003
Neg logit contributions
vals-0.003
 Ple-0.003
GY-0.003
 gal-0.003
 AMA-0.003
Token the
Token ID262
Feature activation-1.31e-01

Pos logit contributions
irtual+0.155
asket+0.130
��+0.121
IDENT+0.116
dam+0.112
Neg logit contributions
vals-0.118
 Ple-0.114
GY-0.113
 gal-0.112
 AMA-0.112
Token director
Token ID3437
Feature activation-3.75e-01

Pos logit contributions
vals+0.018
 Ple+0.018
GY+0.017
 gal+0.017
 AMA+0.017
Neg logit contributions
irtual-0.028
asket-0.023
��-0.022
IDENT-0.021
dam-0.021
Token qualifying
Token ID18677
Feature activation+7.31e-01

Pos logit contributions
irtual+0.111
asket+0.094
��+0.087
IDENT+0.084
dam+0.081
Neg logit contributions
vals-0.080
 Ple-0.077
GY-0.077
 gal-0.076
 AMA-0.075
Token applicants
Token ID17057
Feature activation-5.50e-01

Pos logit contributions
irtual+0.150
asket+0.126
��+0.118
IDENT+0.113
dam+0.110
Neg logit contributions
vals-0.106
 Ple-0.102
GY-0.102
 AMA-0.100
 gal-0.100
Token,
Token ID11
Feature activation-1.33e-02

Pos logit contributions
vals+0.080
 Ple+0.077
GY+0.076
 gal+0.075
 AMA+0.075
Neg logit contributions
irtual-0.113
asket-0.096
��-0.089
IDENT-0.086
dam-0.084
Token enabling
Token ID15882
Feature activation-8.99e-01

Pos logit contributions
vals+0.002
 Ple+0.002
GY+0.002
 gal+0.002
 AMA+0.002
Neg logit contributions
irtual-0.003
asket-0.002
��-0.002
IDENT-0.002
dam-0.002
Token every
Token ID790
Feature activation-2.23e-01

Pos logit contributions
vals+0.142
 Ple+0.138
GY+0.136
 gal+0.135
 AMA+0.134
Neg logit contributions
irtual-0.173
asket-0.143
��-0.133
IDENT-0.128
dam-0.123
Token Newcastle
Token ID22410
Feature activation-2.32e+00

Pos logit contributions
vals+0.030
 Ple+0.029
GY+0.029
 gal+0.028
 AMA+0.028
Neg logit contributions
irtual-0.048
asket-0.041
��-0.038
IDENT-0.037
dam-0.035
Token United
Token ID1578
Feature activation+2.69e-02

Pos logit contributions
vals+0.331
 Ple+0.318
GY+0.314
 gal+0.309
 AMA+0.309
Neg logit contributions
irtual-0.483
asket-0.409
��-0.384
IDENT-0.368
dam-0.357
Token supporter
Token ID15525
Feature activation-4.86e-01

Pos logit contributions
irtual+0.006
asket+0.005
��+0.005
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.003
 Ple-0.003
GY-0.003
 AMA-0.003
 gal-0.003
Token to
Token ID284
Feature activation-4.40e-02

Pos logit contributions
vals+0.076
 Ple+0.074
GY+0.073
 gal+0.072
 AMA+0.072
Neg logit contributions
irtual-0.094
asket-0.078
��-0.073
IDENT-0.070
dam-0.068
Token stand
Token ID1302
Feature activation+9.77e-02

Pos logit contributions
vals+0.007
 Ple+0.006
GY+0.006
 gal+0.006
 AMA+0.006
Neg logit contributions
irtual-0.009
asket-0.007
��-0.007
IDENT-0.007
dam-0.006
Token a
Token ID257
Feature activation+1.83e-02

Pos logit contributions
irtual+0.020
asket+0.016
��+0.015
IDENT+0.015
dam+0.014
Neg logit contributions
vals-0.015
 Ple-0.014
GY-0.014
 gal-0.014
 AMA-0.014
Token N
Token ID399
Feature activation-1.34e-01

Pos logit contributions
irtual+0.008
asket+0.007
��+0.007
IDENT+0.007
dam+0.006
Neg logit contributions
vals-0.003
 Ple-0.003
GY-0.003
 gal-0.003
 AMA-0.003
TokenUFC
Token ID44534
Feature activation-1.79e+00

Pos logit contributions
vals+0.020
 Ple+0.019
GY+0.019
 gal+0.019
 AMA+0.019
Neg logit contributions
irtual-0.027
asket-0.023
��-0.022
IDENT-0.021
dam-0.020
Token Fans
Token ID24964
Feature activation-3.90e-01

Pos logit contributions
vals+0.253
 Ple+0.244
GY+0.241
 gal+0.239
 AMA+0.238
Neg logit contributions
irtual-0.377
asket-0.319
��-0.299
IDENT-0.287
dam-0.279
Token United
Token ID1578
Feature activation+4.42e-02

Pos logit contributions
vals+0.062
 Ple+0.060
GY+0.059
 gal+0.059
 AMA+0.058
Neg logit contributions
irtual-0.075
asket-0.062
��-0.058
IDENT-0.056
dam-0.054
Token and
Token ID290
Feature activation-8.40e-02

Pos logit contributions
irtual+0.008
asket+0.007
��+0.007
IDENT+0.006
dam+0.006
Neg logit contributions
vals-0.007
 Ple-0.007
GY-0.007
 gal-0.007
 AMA-0.007
Token Newcastle
Token ID22410
Feature activation-2.31e+00

Pos logit contributions
vals+0.010
 Ple+0.010
GY+0.010
 gal+0.009
 AMA+0.009
Neg logit contributions
irtual-0.019
asket-0.017
��-0.016
IDENT-0.015
dam-0.015
Token United
Token ID1578
Feature activation+4.34e-02

Pos logit contributions
vals+0.346
 Ple+0.334
GY+0.331
 gal+0.327
 AMA+0.326
Neg logit contributions
irtual-0.465
asket-0.391
��-0.365
IDENT-0.349
dam-0.340
Token Disabled
Token ID43201
Feature activation+7.44e-01

Pos logit contributions
irtual+0.011
asket+0.009
��+0.009
IDENT+0.009
dam+0.008
Neg logit contributions
vals-0.004
 Ple-0.004
GY-0.004
 gal-0.004
 AMA-0.004
Token Supporters
Token ID39995
Feature activation-1.18e+00

Pos logit contributions
irtual+0.168
asket+0.144
��+0.136
IDENT+0.131
dam+0.128
Neg logit contributions
vals-0.093
 Ple-0.088
GY-0.088
 gal-0.087
 AMA-0.086
Token Association
Token ID5396
Feature activation+1.07e+00

Pos logit contributions
vals+0.191
 Ple+0.185
GY+0.183
 gal+0.182
 AMA+0.181
Neg logit contributions
irtual-0.223
asket-0.185
��-0.172
IDENT-0.164
dam-0.159
Token (
Token ID357
Feature activation+1.48e-01

Pos logit contributions
irtual+0.212
asket+0.178
��+0.166
IDENT+0.159
dam+0.154
Neg logit contributions
vals-0.163
 Ple-0.158
GY-0.156
 gal-0.155
 AMA-0.154
Token of
Token ID286
Feature activation-3.88e-03

Pos logit contributions
vals+0.017
 Ple+0.016
GY+0.016
 gal+0.016
 AMA+0.016
Neg logit contributions
irtual-0.021
asket-0.018
��-0.016
IDENT-0.016
dam-0.015
Token 5
Token ID642
Feature activation-8.52e-02

Pos logit contributions
vals+0.001
 Ple+0.000
GY+0.000
 AMA+0.000
 gal+0.000
Neg logit contributions
irtual-0.001
asket-0.001
��-0.001
IDENT-0.001
dam-0.001
Token by
Token ID416
Feature activation+2.91e-01

Pos logit contributions
vals+0.015
 Ple+0.015
GY+0.015
 AMA+0.014
 gal+0.014
Neg logit contributions
irtual-0.015
asket-0.012
��-0.011
IDENT-0.011
dam-0.010
Token Quake
Token ID42901
Feature activation+1.69e+00

Pos logit contributions
irtual+0.060
asket+0.050
��+0.047
IDENT+0.045
dam+0.044
Neg logit contributions
vals-0.043
 Ple-0.041
GY-0.041
 gal-0.040
 AMA-0.040
TokenD
Token ID35
Feature activation+1.89e-01

Pos logit contributions
irtual+0.346
asket+0.292
��+0.273
IDENT+0.263
dam+0.258
Neg logit contributions
vals-0.245
 Ple-0.238
GY-0.233
 AMA-0.232
 gal-0.231
Tokenude
Token ID2507
Feature activation-2.31e+00

Pos logit contributions
irtual+0.040
asket+0.034
��+0.031
IDENT+0.030
dam+0.030
Neg logit contributions
vals-0.026
 Ple-0.026
GY-0.025
 AMA-0.025
 gal-0.025
Token from
Token ID422
Feature activation-2.17e-01

Pos logit contributions
vals+0.332
 Ple+0.320
GY+0.315
 gal+0.313
 AMA+0.313
Neg logit contributions
irtual-0.478
asket-0.404
��-0.378
IDENT-0.364
dam-0.355
Token Explosive
Token ID48178
Feature activation+9.50e-01

Pos logit contributions
vals+0.030
 Ple+0.028
GY+0.028
 gal+0.028
 AMA+0.028
Neg logit contributions
irtual-0.046
asket-0.039
��-0.037
IDENT-0.035
dam-0.035
Token!!!!!!!!
Token ID34635
Feature activation-2.11e-01

Pos logit contributions
irtual+0.194
asket+0.164
��+0.152
IDENT+0.147
dam+0.144
Neg logit contributions
vals-0.139
GY-0.132
 Ple-0.132
 gal-0.130
 AMA-0.130
Token If
Token ID1002
Feature activation+3.39e-01

Pos logit contributions
vals+0.028
 Ple+0.027
GY+0.026
 gal+0.026
 AMA+0.026
Neg logit contributions
irtual-0.046
asket-0.039
��-0.037
IDENT-0.036
dam-0.035
Token you
Token ID345
Feature activation+2.36e-01

Pos logit contributions
irtual+0.061
asket+0.050
��+0.046
IDENT+0.044
dam+0.043
Neg logit contributions
vals-0.058
 Ple-0.056
GY-0.056
 gal-0.055
 AMA-0.055
Tokenthe
Token ID1169
Feature activation-1.77e-01

Pos logit contributions
irtual+0.027
asket+0.022
��+0.021
IDENT+0.020
dam+0.020
Neg logit contributions
vals-0.018
 Ple-0.017
GY-0.017
 gal-0.017
 AMA-0.017
Token-
Token ID12
Feature activation+1.26e-01

Pos logit contributions
vals+0.025
 Ple+0.024
GY+0.024
 gal+0.024
 AMA+0.024
Neg logit contributions
irtual-0.037
asket-0.031
��-0.029
IDENT-0.028
dam-0.028
Tokencounter
Token ID24588
Feature activation-1.50e+00

Pos logit contributions
irtual+0.032
asket+0.028
��+0.026
IDENT+0.025
dam+0.025
Neg logit contributions
vals-0.013
 Ple-0.012
GY-0.012
 gal-0.012
 AMA-0.012
Token supplement
Token ID10327
Feature activation-1.63e-01

Pos logit contributions
vals+0.213
 Ple+0.206
GY+0.203
 gal+0.202
 AMA+0.201
Neg logit contributions
irtual-0.315
asket-0.265
��-0.249
IDENT-0.239
dam-0.232
Token called
Token ID1444
Feature activation-4.18e-02

Pos logit contributions
vals+0.025
 Ple+0.024
GY+0.023
 AMA+0.023
 gal+0.023
Neg logit contributions
irtual-0.033
asket-0.028
��-0.026
IDENT-0.025
dam-0.024
Token gal
Token ID9426
Feature activation-2.31e+00

Pos logit contributions
vals+0.006
 Ple+0.006
GY+0.006
 AMA+0.005
 gal+0.005
Neg logit contributions
irtual-0.009
asket-0.007
��-0.007
IDENT-0.007
dam-0.007
Tokenant
Token ID415
Feature activation-2.68e-02

Pos logit contributions
vals+0.341
 Ple+0.335
GY+0.329
 AMA+0.328
 gal+0.325
Neg logit contributions
irtual-0.469
asket-0.392
��-0.362
IDENT-0.349
dam-0.344
Tokenamine
Token ID9862
Feature activation+8.71e-01

Pos logit contributions
vals+0.005
 Ple+0.005
GY+0.005
 gal+0.005
 AMA+0.005
Neg logit contributions
irtual-0.004
asket-0.003
��-0.003
IDENT-0.003
dam-0.003
Token,
Token ID11
Feature activation+1.06e-01

Pos logit contributions
irtual+0.178
asket+0.149
��+0.139
IDENT+0.134
dam+0.130
Neg logit contributions
vals-0.129
 Ple-0.124
GY-0.123
 gal-0.121
 AMA-0.121
Token which
Token ID543
Feature activation-3.14e-01

Pos logit contributions
irtual+0.022
asket+0.018
��+0.017
IDENT+0.017
dam+0.016
Neg logit contributions
vals-0.015
 Ple-0.015
GY-0.014
 AMA-0.014
 gal-0.014
Token increases
Token ID5732
Feature activation+1.99e-01

Pos logit contributions
vals+0.049
 Ple+0.047
GY+0.046
 AMA+0.046
 gal+0.046
Neg logit contributions
irtual-0.062
asket-0.052
��-0.048
IDENT-0.046
dam-0.045
Tokenth
Token ID400
Feature activation+2.27e-01

Pos logit contributions
vals+0.121
 Ple+0.118
GY+0.117
 gal+0.116
 AMA+0.116
Neg logit contributions
irtual-0.121
asket-0.098
��-0.091
IDENT-0.086
dam-0.083
Token Premier
Token ID9952
Feature activation+7.11e-02

Pos logit contributions
irtual+0.049
asket+0.042
��+0.039
IDENT+0.038
dam+0.037
Neg logit contributions
vals-0.031
 Ple-0.030
GY-0.029
 gal-0.029
 AMA-0.029
Token League
Token ID4041
Feature activation+1.33e+00

Pos logit contributions
irtual+0.011
asket+0.008
��+0.008
IDENT+0.007
dam+0.007
Neg logit contributions
vals-0.014
 Ple-0.014
GY-0.014
 AMA-0.014
 gal-0.014
Token goal
Token ID3061
Feature activation-6.66e-01

Pos logit contributions
irtual+0.299
asket+0.257
��+0.242
IDENT+0.234
dam+0.229
Neg logit contributions
vals-0.168
 Ple-0.158
GY-0.157
 gal-0.154
 AMA-0.154
Token at
Token ID379
Feature activation-1.74e-01

Pos logit contributions
vals+0.100
 Ple+0.096
GY+0.095
 gal+0.094
 AMA+0.094
Neg logit contributions
irtual-0.135
asket-0.113
��-0.105
IDENT-0.101
dam-0.098
Token Newcastle
Token ID22410
Feature activation-2.30e+00

Pos logit contributions
vals+0.022
 Ple+0.021
GY+0.021
+0.020
 gal+0.020
Neg logit contributions
irtual-0.039
asket-0.034
��-0.032
IDENT-0.031
dam-0.030
Token on
Token ID319
Feature activation+1.40e-01

Pos logit contributions
vals+0.351
 Ple+0.339
GY+0.335
 gal+0.332
 AMA+0.331
Neg logit contributions
irtual-0.459
asket-0.384
��-0.358
IDENT-0.343
dam-0.333
Token Saturday
Token ID3909
Feature activation-3.12e-01

Pos logit contributions
irtual+0.030
asket+0.026
��+0.024
IDENT+0.023
dam+0.023
Neg logit contributions
vals-0.019
 Ple-0.018
GY-0.018
 gal-0.018
 AMA-0.018
Token,
Token ID11
Feature activation+1.47e-03

Pos logit contributions
vals+0.043
 Ple+0.042
GY+0.041
 gal+0.041
 AMA+0.041
Neg logit contributions
irtual-0.066
asket-0.056
��-0.053
IDENT-0.051
dam-0.049
Token becoming
Token ID5033
Feature activation+8.03e-02

Pos logit contributions
irtual+0.000
asket+0.000
��+0.000
IDENT+0.000
dam+0.000
Neg logit contributions
vals-0.000
 Ple-0.000
GY-0.000
 gal-0.000
 AMA-0.000
Token the
Token ID262
Feature activation-1.48e-01

Pos logit contributions
irtual+0.016
asket+0.014
��+0.013
IDENT+0.012
dam+0.012
Neg logit contributions
vals-0.012
 Ple-0.011
GY-0.011
 gal-0.011
 AMA-0.011
Tokenmb
Token ID2022
Feature activation-5.14e-01

Pos logit contributions
vals+0.003
 Ple+0.003
GY+0.003
 gal+0.003
 AMA+0.003
Neg logit contributions
irtual-0.004
asket-0.003
��-0.003
IDENT-0.003
dam-0.003
Tokenowl
Token ID4883
Feature activation+6.66e-02

Pos logit contributions
vals+0.077
 Ple+0.074
GY+0.073
 gal+0.073
 AMA+0.073
Neg logit contributions
irtual-0.104
asket-0.087
��-0.081
IDENT-0.078
dam-0.076
Token 96
Token ID9907
Feature activation-6.02e-01

Pos logit contributions
irtual+0.014
asket+0.012
��+0.011
IDENT+0.010
dam+0.010
Neg logit contributions
vals-0.010
 Ple-0.009
GY-0.009
 gal-0.009
 AMA-0.009
Token-
Token ID12
Feature activation+3.04e-02

Pos logit contributions
vals+0.085
 Ple+0.082
GY+0.081
 gal+0.081
 AMA+0.080
Neg logit contributions
irtual-0.126
asket-0.107
��-0.100
IDENT-0.096
dam-0.093
TokenD
Token ID35
Feature activation+1.75e-01

Pos logit contributions
irtual+0.006
asket+0.005
��+0.005
IDENT+0.005
dam+0.004
Neg logit contributions
vals-0.005
 Ple-0.004
GY-0.004
 gal-0.004
 AMA-0.004
Tokenude
Token ID2507
Feature activation-2.30e+00

Pos logit contributions
irtual+0.036
asket+0.031
��+0.029
IDENT+0.028
dam+0.027
Neg logit contributions
vals-0.025
 Ple-0.024
GY-0.024
 gal-0.024
 AMA-0.024
Token Bro
Token ID2806
Feature activation+5.37e-01

Pos logit contributions
vals+0.317
 Ple+0.303
GY+0.301
 gal+0.298
 AMA+0.295
Neg logit contributions
irtual-0.490
asket-0.416
��-0.389
IDENT-0.375
dam-0.368
Token M
Token ID337
Feature activation-1.76e-01

Pos logit contributions
irtual+0.113
asket+0.096
��+0.089
IDENT+0.086
dam+0.085
Neg logit contributions
vals-0.075
 Ple-0.071
GY-0.071
 gal-0.070
 AMA-0.070
Tokenuther
Token ID12866
Feature activation-1.90e-01

Pos logit contributions
vals+0.025
 Ple+0.024
GY+0.024
 gal+0.023
 AMA+0.023
Neg logit contributions
irtual-0.037
asket-0.032
��-0.029
IDENT-0.028
dam-0.028
Tokenh
Token ID71
Feature activation+6.08e-01

Pos logit contributions
vals+0.028
 Ple+0.026
GY+0.026
 gal+0.026
 AMA+0.026
Neg logit contributions
irtual-0.039
asket-0.033
��-0.031
IDENT-0.030
dam-0.029
Tokenucker
Token ID12603
Feature activation+6.72e-01

Pos logit contributions
irtual+0.123
asket+0.103
��+0.096
IDENT+0.092
dam+0.090
Neg logit contributions
vals-0.091
 Ple-0.088
GY-0.087
 gal-0.086
 AMA-0.086
Token to
Token ID284
Feature activation-3.35e-02

Pos logit contributions
vals+0.105
 Ple+0.102
GY+0.100
 gal+0.099
 AMA+0.099
Neg logit contributions
irtual-0.144
asket-0.121
��-0.113
IDENT-0.108
dam-0.105
Token established
Token ID4920
Feature activation-6.39e-01

Pos logit contributions
vals+0.005
 Ple+0.005
GY+0.004
 gal+0.004
 AMA+0.004
Neg logit contributions
irtual-0.007
asket-0.006
��-0.006
IDENT-0.005
dam-0.005
Token fan
Token ID4336
Feature activation+1.26e-01

Pos logit contributions
vals+0.080
 Ple+0.077
GY+0.076
 gal+0.075
 AMA+0.075
Neg logit contributions
irtual-0.144
asket-0.123
��-0.116
IDENT-0.112
dam-0.109
Token groups
Token ID2628
Feature activation+1.46e-01

Pos logit contributions
irtual+0.025
asket+0.021
��+0.019
IDENT+0.018
dam+0.018
Neg logit contributions
vals-0.019
 Ple-0.019
GY-0.019
 gal-0.018
 AMA-0.018
Token -
Token ID532
Feature activation-1.87e-01

Pos logit contributions
irtual+0.030
asket+0.025
��+0.024
IDENT+0.023
dam+0.022
Neg logit contributions
vals-0.021
 Ple-0.020
GY-0.020
 gal-0.020
 AMA-0.020
Token Newcastle
Token ID22410
Feature activation-2.29e+00

Pos logit contributions
vals+0.027
 Ple+0.025
GY+0.025
 gal+0.025
 AMA+0.025
Neg logit contributions
irtual-0.039
asket-0.033
��-0.031
IDENT-0.030
dam-0.029
Token United
Token ID1578
Feature activation+3.31e-02

Pos logit contributions
vals+0.346
 Ple+0.333
GY+0.330
 gal+0.326
 AMA+0.326
Neg logit contributions
irtual-0.461
asket-0.386
��-0.361
IDENT-0.345
dam-0.336
Token Supporters
Token ID39995
Feature activation-1.18e+00

Pos logit contributions
irtual+0.008
asket+0.006
��+0.006
IDENT+0.006
dam+0.006
Neg logit contributions
vals-0.004
 Ple-0.004
GY-0.004
 AMA-0.004
 gal-0.004
Token Trust
Token ID9870
Feature activation+7.07e-01

Pos logit contributions
vals+0.157
 Ple+0.149
GY+0.149
 gal+0.147
 AMA+0.146
Neg logit contributions
irtual-0.257
asket-0.218
��-0.205
IDENT-0.197
dam-0.193
Token (
Token ID357
Feature activation+1.28e-01

Pos logit contributions
irtual+0.144
asket+0.121
��+0.113
IDENT+0.109
dam+0.106
Neg logit contributions
vals-0.104
 Ple-0.100
GY-0.099
 gal-0.098
 AMA-0.098
TokenN
Token ID45
Feature activation-2.25e-01

Pos logit contributions
irtual+0.026
asket+0.022
��+0.020
IDENT+0.020
dam+0.019
Neg logit contributions
vals-0.019
 Ple-0.019
 gal-0.018
 AMA-0.018
GY-0.018
Token levels
Token ID2974
Feature activation+1.49e-01

Pos logit contributions
vals+0.270
 Ple+0.262
GY+0.259
 gal+0.256
 AMA+0.256
Neg logit contributions
irtual-0.341
asket-0.284
��-0.265
IDENT-0.254
dam-0.246
Token)
Token ID8
Feature activation-1.51e-01

Pos logit contributions
irtual+0.029
asket+0.024
��+0.023
IDENT+0.022
dam+0.021
Neg logit contributions
vals-0.023
 Ple-0.022
GY-0.022
 AMA-0.022
 gal-0.022
Token but
Token ID475
Feature activation-6.72e-01

Pos logit contributions
vals+0.020
 Ple+0.019
GY+0.019
 gal+0.019
 AMA+0.019
Neg logit contributions
irtual-0.033
asket-0.028
��-0.027
IDENT-0.026
dam-0.025
Token with
Token ID351
Feature activation+1.45e-01

Pos logit contributions
vals+0.111
 Ple+0.108
GY+0.107
 gal+0.106
 AMA+0.106
Neg logit contributions
irtual-0.126
asket-0.103
��-0.096
IDENT-0.091
dam-0.088
Token limited
Token ID3614
Feature activation-5.92e-02

Pos logit contributions
irtual+0.028
asket+0.023
��+0.021
IDENT+0.020
dam+0.020
Neg logit contributions
vals-0.023
 Ple-0.022
GY-0.022
 gal-0.022
 AMA-0.022
Token latitude
Token ID32477
Feature activation-2.29e+00

Pos logit contributions
vals+0.009
 Ple+0.009
GY+0.009
 gal+0.009
 AMA+0.009
Neg logit contributions
irtual-0.012
asket-0.010
��-0.009
IDENT-0.009
dam-0.008
Token for
Token ID329
Feature activation-1.07e-01

Pos logit contributions
vals+0.334
 Ple+0.323
GY+0.319
 gal+0.316
 AMA+0.315
Neg logit contributions
irtual-0.471
asket-0.395
��-0.370
IDENT-0.355
dam-0.345
Token highlight
Token ID7238
Feature activation-2.23e-01

Pos logit contributions
vals+0.017
 Ple+0.017
GY+0.016
 gal+0.016
 AMA+0.016
Neg logit contributions
irtual-0.021
asket-0.017
��-0.016
IDENT-0.015
dam-0.015
Token recovery
Token ID7628
Feature activation+1.60e-01

Pos logit contributions
vals+0.033
 Ple+0.032
GY+0.032
 gal+0.032
 AMA+0.032
Neg logit contributions
irtual-0.045
asket-0.038
��-0.035
IDENT-0.034
dam-0.033
Token,
Token ID11
Feature activation-2.81e-03

Pos logit contributions
irtual+0.034
asket+0.029
��+0.027
IDENT+0.026
dam+0.025
Neg logit contributions
vals-0.023
 Ple-0.022
GY-0.022
 gal-0.021
 AMA-0.021
Token or
Token ID393
Feature activation+1.45e-01

Pos logit contributions
vals+0.000
 Ple+0.000
GY+0.000
 gal+0.000
 AMA+0.000
Neg logit contributions
irtual-0.001
asket-0.000
��-0.000
IDENT-0.000
dam-0.000
Token City
Token ID2254
Feature activation-8.57e-01

Pos logit contributions
irtual+0.126
asket+0.106
��+0.099
IDENT+0.096
dam+0.093
Neg logit contributions
vals-0.085
GY-0.081
 Ple-0.081
 AMA-0.079
 gal-0.079
Token Hall
Token ID4789
Feature activation+1.03e+00

Pos logit contributions
vals+0.172
 Ple+0.167
GY+0.166
 gal+0.165
 AMA+0.165
Neg logit contributions
irtual-0.130
asket-0.102
��-0.092
IDENT-0.086
dam-0.083
Token.
Token ID13
Feature activation+1.24e-02

Pos logit contributions
irtual+0.204
asket+0.171
��+0.159
IDENT+0.153
dam+0.149
Neg logit contributions
vals-0.157
 Ple-0.151
GY-0.150
 AMA-0.148
 gal-0.148
Token He
Token ID679
Feature activation-2.32e-01

Pos logit contributions
irtual+0.003
asket+0.002
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Token lamb
Token ID19343
Feature activation+1.49e+00

Pos logit contributions
vals+0.034
 Ple+0.033
GY+0.033
 gal+0.032
 AMA+0.032
Neg logit contributions
irtual-0.047
asket-0.040
��-0.037
IDENT-0.036
dam-0.035
Tokenasted
Token ID8992
Feature activation-2.28e+00

Pos logit contributions
irtual+0.474
asket+0.426
��+0.407
IDENT+0.397
dam+0.393
Neg logit contributions
vals-0.050
 Ple-0.045
GY-0.041
 AMA-0.039
 gal-0.039
Token journalists
Token ID9046
Feature activation+2.45e-01

Pos logit contributions
vals+0.348
 Ple+0.337
GY+0.332
 gal+0.330
 AMA+0.329
Neg logit contributions
irtual-0.455
asket-0.380
��-0.355
IDENT-0.340
dam-0.330
Token,
Token ID11
Feature activation+5.57e-02

Pos logit contributions
irtual+0.053
asket+0.045
��+0.042
IDENT+0.041
dam+0.040
Neg logit contributions
vals-0.033
 Ple-0.032
GY-0.032
 AMA-0.031
 gal-0.031
Token calling
Token ID4585
Feature activation+3.42e-01

Pos logit contributions
irtual+0.011
asket+0.009
��+0.009
IDENT+0.008
dam+0.008
Neg logit contributions
vals-0.009
 Ple-0.008
GY-0.008
 gal-0.008
 AMA-0.008
Token the
Token ID262
Feature activation-4.19e-02

Pos logit contributions
irtual+0.054
asket+0.043
��+0.039
IDENT+0.037
dam+0.036
Neg logit contributions
vals-0.066
 Ple-0.064
GY-0.064
 gal-0.063
 AMA-0.063
Token media
Token ID2056
Feature activation+2.43e-01

Pos logit contributions
vals+0.007
 Ple+0.007
GY+0.006
 gal+0.006
 AMA+0.006
Neg logit contributions
irtual-0.008
asket-0.007
��-0.006
IDENT-0.006
dam-0.006
TokenWe
Token ID1135
Feature activation-3.56e-01

Pos logit contributions
vals+0.037
 Ple+0.036
 gal+0.036
 AMA+0.036
GY+0.035
Neg logit contributions
irtual-0.045
asket-0.038
��-0.035
IDENT-0.035
dam-0.033
Token welcome
Token ID7062
Feature activation-1.22e+00

Pos logit contributions
vals+0.054
 Ple+0.052
GY+0.052
 gal+0.051
 AMA+0.051
Neg logit contributions
irtual-0.071
asket-0.060
��-0.055
IDENT-0.053
dam-0.052
Token the
Token ID262
Feature activation-1.15e-01

Pos logit contributions
vals+0.182
 Ple+0.175
GY+0.174
 gal+0.172
 AMA+0.171
Neg logit contributions
irtual-0.244
asket-0.204
��-0.190
IDENT-0.184
dam-0.178
Token initiative
Token ID10219
Feature activation+4.69e-02

Pos logit contributions
vals+0.015
GY+0.014
 Ple+0.014
 gal+0.014
+0.014
Neg logit contributions
irtual-0.025
asket-0.021
��-0.020
IDENT-0.019
dam-0.019
Token by
Token ID416
Feature activation+3.15e-01

Pos logit contributions
irtual+0.009
asket+0.008
��+0.007
IDENT+0.007
dam+0.007
Neg logit contributions
vals-0.007
 Ple-0.007
GY-0.007
 AMA-0.007
 gal-0.007
Token Newcastle
Token ID22410
Feature activation-2.27e+00

Pos logit contributions
irtual+0.080
asket+0.070
��+0.066
IDENT+0.065
dam+0.063
Neg logit contributions
vals-0.030
 Ple-0.028
GY-0.028
 gal-0.027
 AMA-0.027
Token United
Token ID1578
Feature activation+5.74e-02

Pos logit contributions
vals+0.349
 Ple+0.336
GY+0.333
 gal+0.329
 AMA+0.328
Neg logit contributions
irtual-0.451
asket-0.377
��-0.352
IDENT-0.337
dam-0.326
Token to
Token ID284
Feature activation+1.52e-02

Pos logit contributions
irtual+0.012
asket+0.010
��+0.009
IDENT+0.009
dam+0.008
Neg logit contributions
vals-0.009
 Ple-0.008
GY-0.008
 gal-0.008
 AMA-0.008
Token launch
Token ID4219
Feature activation-6.91e-01

Pos logit contributions
irtual+0.003
asket+0.003
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Token their
Token ID511
Feature activation-1.30e-01

Pos logit contributions
vals+0.104
 Ple+0.100
GY+0.100
 gal+0.098
 AMA+0.098
Neg logit contributions
irtual-0.139
asket-0.115
��-0.108
IDENT-0.104
dam-0.100
Token new
Token ID649
Feature activation-2.03e-01

Pos logit contributions
vals+0.018
 Ple+0.017
GY+0.017
 gal+0.017
 AMA+0.017
Neg logit contributions
irtual-0.028
asket-0.023
��-0.022
IDENT-0.021
dam-0.021
Token are
Token ID389
Feature activation-4.06e-02

Pos logit contributions
vals+0.226
 Ple+0.219
GY+0.216
 gal+0.215
 AMA+0.214
Neg logit contributions
irtual-0.289
asket-0.241
��-0.226
IDENT-0.215
dam-0.209
Token connected
Token ID5884
Feature activation+9.75e-01

Pos logit contributions
vals+0.006
 Ple+0.006
GY+0.006
 gal+0.006
 AMA+0.006
Neg logit contributions
irtual-0.008
asket-0.007
��-0.006
IDENT-0.006
dam-0.006
Token to
Token ID284
Feature activation+5.30e-02

Pos logit contributions
irtual+0.194
asket+0.161
��+0.151
IDENT+0.144
dam+0.140
Neg logit contributions
vals-0.150
 Ple-0.145
GY-0.143
 gal-0.142
 AMA-0.141
Token the
Token ID262
Feature activation-6.34e-02

Pos logit contributions
irtual+0.011
asket+0.009
��+0.008
IDENT+0.008
dam+0.008
Neg logit contributions
vals-0.008
 Ple-0.008
GY-0.007
 gal-0.007
 AMA-0.007
Token state
Token ID1181
Feature activation+1.82e-01

Pos logit contributions
vals+0.009
 Ple+0.009
GY+0.009
 gal+0.009
 AMA+0.009
Neg logit contributions
irtual-0.013
asket-0.011
��-0.010
IDENT-0.010
dam-0.010
Token registry
Token ID20478
Feature activation-2.27e+00

Pos logit contributions
irtual+0.039
asket+0.033
��+0.030
IDENT+0.029
dam+0.029
Neg logit contributions
vals-0.026
 Ple-0.025
GY-0.024
 gal-0.024
 AMA-0.024
Token via
Token ID2884
Feature activation-1.91e-01

Pos logit contributions
vals+0.324
 Ple+0.314
GY+0.309
 gal+0.308
 AMA+0.307
Neg logit contributions
irtual-0.475
asket-0.400
��-0.377
IDENT-0.360
dam-0.350
Token official
Token ID1743
Feature activation+8.69e-01

Pos logit contributions
vals+0.029
 Ple+0.028
GY+0.027
 gal+0.027
 AMA+0.027
Neg logit contributions
irtual-0.039
asket-0.032
��-0.030
IDENT-0.029
dam-0.028
Token API
Token ID7824
Feature activation-9.30e-01

Pos logit contributions
irtual+0.172
asket+0.143
��+0.134
IDENT+0.128
dam+0.124
Neg logit contributions
vals-0.134
 Ple-0.129
GY-0.128
 gal-0.127
 AMA-0.126
Token,
Token ID11
Feature activation+5.85e-02

Pos logit contributions
vals+0.132
 Ple+0.127
GY+0.125
 gal+0.124
 AMA+0.124
Neg logit contributions
irtual-0.195
asket-0.165
��-0.154
IDENT-0.149
dam-0.144
Token and
Token ID290
Feature activation-4.04e-03

Pos logit contributions
irtual+0.012
asket+0.010
��+0.010
IDENT+0.009
dam+0.009
Neg logit contributions
vals-0.008
 Ple-0.008
GY-0.008
 gal-0.008
 AMA-0.008
Token),
Token ID828
Feature activation+4.04e-02

Pos logit contributions
vals+0.018
 Ple+0.018
GY+0.018
 gal+0.017
 AMA+0.017
Neg logit contributions
irtual-0.024
asket-0.020
��-0.018
IDENT-0.018
dam-0.017
Token surplus
Token ID18201
Feature activation-1.03e+00

Pos logit contributions
irtual+0.008
asket+0.007
��+0.006
IDENT+0.006
dam+0.006
Neg logit contributions
vals-0.006
 Ple-0.006
GY-0.006
 gal-0.006
 AMA-0.006
Token (
Token ID357
Feature activation+1.73e-01

Pos logit contributions
vals+0.157
 Ple+0.152
GY+0.150
 gal+0.149
 AMA+0.149
Neg logit contributions
irtual-0.205
asket-0.171
��-0.160
IDENT-0.153
dam-0.148
Tokenh
Token ID71
Feature activation+6.24e-01

Pos logit contributions
irtual+0.036
asket+0.030
��+0.028
IDENT+0.027
dam+0.026
Neg logit contributions
vals-0.025
 Ple-0.025
GY-0.024
 gal-0.024
 AMA-0.024
Tokenul
Token ID377
Feature activation-2.76e-01

Pos logit contributions
irtual+0.126
asket+0.106
��+0.099
IDENT+0.094
dam+0.092
Neg logit contributions
vals-0.093
 Ple-0.090
GY-0.089
 gal-0.088
 AMA-0.088
Tokenagu
Token ID11433
Feature activation+2.55e+00

Pos logit contributions
vals+0.040
 Ple+0.039
GY+0.038
 gal+0.038
 AMA+0.038
Neg logit contributions
irtual-0.057
asket-0.048
��-0.045
IDENT-0.043
dam-0.042
Token),
Token ID828
Feature activation+3.59e-02

Pos logit contributions
irtual+0.507
asket+0.424
��+0.396
IDENT+0.377
dam+0.369
Neg logit contributions
vals-0.388
 Ple-0.375
GY-0.370
 AMA-0.367
 gal-0.366
Token[
Token ID58
Feature activation+8.21e-02

Pos logit contributions
irtual+0.007
asket+0.006
��+0.006
IDENT+0.006
dam+0.005
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 AMA-0.005
 gal-0.005
Token3
Token ID18
Feature activation-6.04e-02

Pos logit contributions
irtual+0.017
asket+0.014
��+0.013
IDENT+0.013
dam+0.012
Neg logit contributions
vals-0.012
 Ple-0.012
GY-0.012
 gal-0.011
 AMA-0.011
Token]
Token ID60
Feature activation+2.38e-01

Pos logit contributions
vals+0.011
 Ple+0.011
GY+0.010
 gal+0.010
 AMA+0.010
Neg logit contributions
irtual-0.010
asket-0.008
��-0.008
IDENT-0.007
dam-0.007
Token blue
Token ID4171
Feature activation+2.51e-01

Pos logit contributions
irtual+0.049
asket+0.041
��+0.038
IDENT+0.037
dam+0.036
Neg logit contributions
vals-0.035
 Ple-0.034
GY-0.033
 gal-0.033
 AMA-0.033
Token --
Token ID1377
Feature activation+1.87e-01

Pos logit contributions
vals+0.021
 Ple+0.020
GY+0.020
 gal+0.019
 AMA+0.019
Neg logit contributions
irtual-0.030
asket-0.026
��-0.024
IDENT-0.023
dam-0.022
Token could
Token ID714
Feature activation-1.15e-01

Pos logit contributions
irtual+0.038
asket+0.032
��+0.029
IDENT+0.028
dam+0.027
Neg logit contributions
vals-0.028
 Ple-0.027
GY-0.027
 gal-0.027
 AMA-0.027
Token you
Token ID345
Feature activation+2.64e-01

Pos logit contributions
vals+0.019
 Ple+0.018
GY+0.018
 gal+0.018
 AMA+0.018
Neg logit contributions
irtual-0.022
asket-0.018
��-0.017
IDENT-0.016
dam-0.015
Token place
Token ID1295
Feature activation-7.44e-01

Pos logit contributions
irtual+0.053
asket+0.044
��+0.041
IDENT+0.039
dam+0.038
Neg logit contributions
vals-0.040
 Ple-0.039
GY-0.039
 gal-0.039
 AMA-0.038
Token your
Token ID534
Feature activation-2.03e-01

Pos logit contributions
vals+0.119
 Ple+0.117
 gal+0.115
GY+0.115
 AMA+0.114
Neg logit contributions
irtual-0.142
asket-0.117
��-0.110
IDENT-0.104
dam-0.101
Token seal
Token ID13810
Feature activation+2.35e+00

Pos logit contributions
vals+0.031
 Ple+0.030
GY+0.030
 gal+0.030
 AMA+0.030
Neg logit contributions
irtual-0.040
asket-0.033
��-0.031
IDENT-0.030
dam-0.029
Token here
Token ID994
Feature activation+5.72e-01

Pos logit contributions
irtual+0.463
asket+0.383
��+0.360
IDENT+0.342
dam+0.332
Neg logit contributions
vals-0.364
 Ple-0.354
GY-0.348
 gal-0.347
 AMA-0.345
Token...
Token ID986
Feature activation-1.90e-01

Pos logit contributions
irtual+0.117
asket+0.098
��+0.092
IDENT+0.087
dam+0.085
Neg logit contributions
vals-0.085
 Ple-0.082
GY-0.081
 gal-0.081
 AMA-0.080
Token huh
Token ID24926
Feature activation-3.83e-01

Pos logit contributions
vals+0.029
 Ple+0.028
GY+0.027
 gal+0.027
 AMA+0.027
Neg logit contributions
irtual-0.038
asket-0.032
��-0.030
IDENT-0.029
dam-0.028
Token?
Token ID30
Feature activation+5.74e-02

Pos logit contributions
vals+0.055
 Ple+0.054
GY+0.053
 gal+0.053
 AMA+0.052
Neg logit contributions
irtual-0.080
asket-0.067
��-0.063
IDENT-0.060
dam-0.058
Token
Token ID447
Feature activation-4.98e-03

Pos logit contributions
irtual+0.012
asket+0.010
��+0.009
IDENT+0.009
dam+0.008
Neg logit contributions
vals-0.009
 Ple-0.008
GY-0.008
 gal-0.008
 AMA-0.008
Tokenorg
Token ID2398
Feature activation-1.23e-01

Pos logit contributions
vals+0.027
 Ple+0.026
GY+0.025
 gal+0.025
 AMA+0.025
Neg logit contributions
irtual-0.038
asket-0.032
��-0.029
IDENT-0.028
dam-0.028
Tokenersen
Token ID46516
Feature activation+2.25e+00

Pos logit contributions
vals+0.036
 Ple+0.035
GY+0.035
 gal+0.035
 AMA+0.035
Neg logit contributions
irtual-0.007
asket-0.003
��-0.002
IDENT-0.001
dam-0.001
Token and
Token ID290
Feature activation-4.84e-02

Pos logit contributions
irtual+0.454
asket+0.381
��+0.356
IDENT+0.342
dam+0.332
Neg logit contributions
vals-0.336
 Ple-0.325
GY-0.321
 gal-0.317
 AMA-0.317
Token other
Token ID584
Feature activation-4.91e-01

Pos logit contributions
vals+0.007
 Ple+0.007
GY+0.007
 gal+0.007
 AMA+0.007
Neg logit contributions
irtual-0.010
asket-0.008
��-0.008
IDENT-0.007
dam-0.007
Token Sad
Token ID14668
Feature activation+1.04e+00

Pos logit contributions
vals+0.086
 Ple+0.083
GY+0.082
 gal+0.082
 AMA+0.082
Neg logit contributions
irtual-0.087
asket-0.071
��-0.066
IDENT-0.062
dam-0.060
Token Pupp
Token ID20926
Feature activation+2.42e+00

Pos logit contributions
irtual+0.094
asket+0.060
��+0.048
IDENT+0.042
dam+0.037
Neg logit contributions
vals-0.274
 Ple-0.269
GY-0.267
 gal-0.266
 AMA-0.265
Tokeny
Token ID88
Feature activation+2.46e-01

Pos logit contributions
irtual+0.560
asket+0.481
��+0.454
IDENT+0.437
dam+0.428
Neg logit contributions
vals-0.291
 Ple-0.280
GY-0.275
 gal-0.272
 AMA-0.272
Token
Token ID564
Feature activation-5.33e-02

Pos logit contributions
irtual+0.049
asket+0.041
��+0.039
IDENT+0.037
dam+0.036
Neg logit contributions
vals-0.037
 Ple-0.036
GY-0.035
 gal-0.035
 AMA-0.035
Token
Token ID250
Feature activation-2.74e-02

Pos logit contributions
vals+0.010
 Ple+0.009
GY+0.009
 gal+0.009
 AMA+0.009
Neg logit contributions
irtual-0.009
asket-0.007
��-0.007
IDENT-0.006
dam-0.006
Tokenmod
Token ID4666
Feature activation-6.13e-01

Pos logit contributions
vals+0.004
 Ple+0.004
GY+0.004
 gal+0.004
 AMA+0.004
Neg logit contributions
irtual-0.006
asket-0.005
��-0.005
IDENT-0.004
dam-0.004
Tokener
Token ID263
Feature activation-2.68e-01

Pos logit contributions
vals+0.077
 Ple+0.074
GY+0.072
 gal+0.072
 AMA+0.071
Neg logit contributions
irtual-0.140
asket-0.119
��-0.113
IDENT-0.109
dam-0.106
Token
Token ID564
Feature activation-2.24e-03

Pos logit contributions
irtual+0.003
asket+0.002
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Token
Token ID250
Feature activation+1.16e-02

Pos logit contributions
vals+0.000
 Ple+0.000
GY+0.000
 gal+0.000
 AMA+0.000
Neg logit contributions
irtual-0.000
asket-0.000
��-0.000
IDENT-0.000
dam-0.000
Tokenturn
Token ID15344
Feature activation+6.32e-01

Pos logit contributions
irtual+0.002
asket+0.002
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Tokening
Token ID278
Feature activation-1.31e-01

Pos logit contributions
irtual+0.130
asket+0.109
��+0.102
IDENT+0.098
dam+0.095
Neg logit contributions
vals-0.093
 Ple-0.090
GY-0.089
 gal-0.088
 AMA-0.088
Token a
Token ID257
Feature activation+8.18e-02

Pos logit contributions
vals+0.019
 Ple+0.018
GY+0.018
 gal+0.018
 AMA+0.018
Neg logit contributions
irtual-0.027
asket-0.023
��-0.021
IDENT-0.020
dam-0.020
Token blind
Token ID7770
Feature activation+2.58e+00

Pos logit contributions
irtual+0.009
asket+0.006
��+0.005
IDENT+0.004
dam+0.004
Neg logit contributions
vals-0.020
 Ple-0.020
GY-0.020
 gal-0.020
 AMA-0.020
Token eye
Token ID4151
Feature activation+3.82e-01

Pos logit contributions
irtual+0.463
asket+0.382
��+0.349
IDENT+0.332
dam+0.326
Neg logit contributions
vals-0.440
 Ple-0.423
GY-0.422
 AMA-0.417
 gal-0.415
Token to
Token ID284
Feature activation+4.77e-02

Pos logit contributions
irtual+0.076
asket+0.063
��+0.059
IDENT+0.056
dam+0.055
Neg logit contributions
vals-0.059
 Ple-0.057
GY-0.056
 gal-0.056
 AMA-0.056
Token the
Token ID262
Feature activation-6.60e-02

Pos logit contributions
irtual+0.009
asket+0.008
��+0.007
IDENT+0.007
dam+0.007
Neg logit contributions
vals-0.007
 Ple-0.007
GY-0.007
 gal-0.007
 AMA-0.007
Token shortcomings
Token ID31731
Feature activation-2.68e-01

Pos logit contributions
vals+0.011
 Ple+0.010
GY+0.010
 gal+0.010
 AMA+0.010
Neg logit contributions
irtual-0.012
asket-0.010
��-0.010
IDENT-0.009
dam-0.009
Token of
Token ID286
Feature activation+9.09e-02

Pos logit contributions
vals+0.041
 Ple+0.040
GY+0.039
 gal+0.039
 AMA+0.039
Neg logit contributions
irtual-0.053
asket-0.045
��-0.042
IDENT-0.040
dam-0.039
Token Guardian
Token ID8283
Feature activation+1.47e+00

Pos logit contributions
irtual+0.025
asket+0.021
��+0.019
IDENT+0.019
dam+0.018
Neg logit contributions
vals-0.021
 Ple-0.020
GY-0.020
 gal-0.020
 AMA-0.019
Token
Token ID447
Feature activation+5.47e-02

Pos logit contributions
irtual+0.304
asket+0.255
��+0.239
IDENT+0.230
dam+0.224
Neg logit contributions
vals-0.213
 Ple-0.205
GY-0.203
 gal-0.201
 AMA-0.200
Token
Token ID247
Feature activation+6.48e-03

Pos logit contributions
irtual+0.009
asket+0.007
��+0.007
IDENT+0.006
dam+0.006
Neg logit contributions
vals-0.010
 Ple-0.010
GY-0.010
 AMA-0.010
 gal-0.010
Tokens
Token ID82
Feature activation+2.01e-02

Pos logit contributions
irtual+0.001
asket+0.001
��+0.001
IDENT+0.001
dam+0.001
Neg logit contributions
vals-0.001
 Ple-0.001
GY-0.001
 gal-0.001
 AMA-0.001
Token Luke
Token ID11336
Feature activation+2.45e-01

Pos logit contributions
irtual+0.004
asket+0.003
��+0.003
IDENT+0.003
dam+0.003
Neg logit contributions
vals-0.003
 Ple-0.003
GY-0.003
 gal-0.003
 AMA-0.003
Token Harding
Token ID45732
Feature activation+2.28e+00

Pos logit contributions
irtual+0.032
asket+0.024
��+0.021
IDENT+0.020
dam+0.018
Neg logit contributions
vals-0.054
 Ple-0.053
 gal-0.052
GY-0.052
 AMA-0.052
Token,
Token ID11
Feature activation+7.41e-02

Pos logit contributions
irtual+0.468
asket+0.393
��+0.368
IDENT+0.353
dam+0.344
Neg logit contributions
vals-0.334
 Ple-0.322
GY-0.319
 gal-0.316
 AMA-0.315
Token Stephanie
Token ID26085
Feature activation-3.63e-01

Pos logit contributions
irtual+0.015
asket+0.013
��+0.012
IDENT+0.011
dam+0.011
Neg logit contributions
vals-0.011
 Ple-0.011
GY-0.011
 gal-0.011
 AMA-0.010
Token Kir
Token ID7385
Feature activation+1.85e-02

Pos logit contributions
vals+0.053
 Ple+0.052
GY+0.051
 gal+0.051
 AMA+0.050
Neg logit contributions
irtual-0.074
asket-0.062
��-0.059
IDENT-0.056
dam-0.055
Tokench
Token ID354
Feature activation-3.45e-01

Pos logit contributions
irtual+0.003
asket+0.003
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.003
 Ple-0.003
GY-0.003
 gal-0.003
 AMA-0.003
Tokenga
Token ID4908
Feature activation-8.76e-01

Pos logit contributions
vals+0.054
 Ple+0.053
GY+0.052
 gal+0.052
 AMA+0.052
Neg logit contributions
irtual-0.067
asket-0.055
��-0.052
IDENT-0.049
dam-0.048
Token tentative
Token ID38358
Feature activation-2.12e-01

Pos logit contributions
irtual+0.010
asket+0.008
��+0.008
IDENT+0.007
dam+0.007
Neg logit contributions
vals-0.007
 Ple-0.007
GY-0.007
 AMA-0.007
 gal-0.007
Token inf
Token ID1167
Feature activation-1.76e-01

Pos logit contributions
vals+0.033
 Ple+0.032
GY+0.031
 gal+0.031
 AMA+0.031
Neg logit contributions
irtual-0.042
asket-0.035
��-0.032
IDENT-0.031
dam-0.030
Tokenruct
Token ID1356
Feature activation-1.41e-01

Pos logit contributions
vals+0.028
 Ple+0.027
GY+0.027
 gal+0.026
 AMA+0.026
Neg logit contributions
irtual-0.034
asket-0.028
��-0.026
IDENT-0.025
dam-0.025
Tokenue
Token ID518
Feature activation-4.03e-02

Pos logit contributions
vals+0.023
 Ple+0.022
GY+0.022
 gal+0.021
 AMA+0.021
Neg logit contributions
irtual-0.027
asket-0.022
��-0.021
IDENT-0.020
dam-0.019
Tokenuse
Token ID1904
Feature activation+6.58e-01

Pos logit contributions
vals+0.006
 Ple+0.006
GY+0.006
 gal+0.006
 AMA+0.006
Neg logit contributions
irtual-0.008
asket-0.007
��-0.006
IDENT-0.006
dam-0.006
Token de
Token ID390
Feature activation+6.99e-01

Pos logit contributions
irtual+0.126
asket+0.105
��+0.097
IDENT+0.093
dam+0.091
Neg logit contributions
vals-0.105
 Ple-0.101
GY-0.101
 AMA-0.099
 gal-0.099
Token coup
Token ID12092
Feature activation-7.11e-02

Pos logit contributions
irtual+0.139
asket+0.116
��+0.108
IDENT+0.104
dam+0.101
Neg logit contributions
vals-0.106
 Ple-0.102
GY-0.102
 AMA-0.100
 gal-0.100
Token d
Token ID288
Feature activation+4.51e-01

Pos logit contributions
vals+0.012
 Ple+0.011
GY+0.011
 gal+0.011
 AMA+0.011
Neg logit contributions
irtual-0.013
asket-0.011
��-0.010
IDENT-0.010
dam-0.009
Token'
Token ID6
Feature activation-1.32e-01

Pos logit contributions
irtual+0.103
asket+0.088
��+0.083
IDENT+0.080
dam+0.079
Neg logit contributions
vals-0.056
 Ple-0.053
GY-0.053
 AMA-0.052
 gal-0.052
TokenE
Token ID36
Feature activation+1.39e+00

Pos logit contributions
vals+0.023
 Ple+0.022
GY+0.022
 gal+0.022
 AMA+0.022
Neg logit contributions
irtual-0.024
asket-0.020
��-0.018
IDENT-0.017
dam-0.017
Tokent
Token ID83
Feature activation+5.00e-01

Pos logit contributions
irtual+0.350
asket+0.305
��+0.289
IDENT+0.281
dam+0.276
Neg logit contributions
vals-0.139
 Ple-0.133
GY-0.129
 AMA-0.129
 gal-0.129
Token intentions
Token ID14953
Feature activation-1.62e-02

Pos logit contributions
vals+0.089
 Ple+0.086
GY+0.085
 gal+0.084
 AMA+0.084
Neg logit contributions
irtual-0.114
asket-0.095
��-0.089
IDENT-0.085
dam-0.082
Token,
Token ID11
Feature activation-5.67e-02

Pos logit contributions
vals+0.002
 Ple+0.002
GY+0.002
 AMA+0.002
 gal+0.002
Neg logit contributions
irtual-0.003
asket-0.003
��-0.003
IDENT-0.003
dam-0.002
Token what
Token ID644
Feature activation-1.29e-01

Pos logit contributions
vals+0.009
 Ple+0.008
GY+0.008
 gal+0.008
 AMA+0.008
Neg logit contributions
irtual-0.011
asket-0.009
��-0.009
IDENT-0.008
dam-0.008
Token can
Token ID460
Feature activation-1.14e-01

Pos logit contributions
vals+0.020
 Ple+0.019
GY+0.019
 gal+0.018
 AMA+0.018
Neg logit contributions
irtual-0.026
asket-0.022
��-0.020
IDENT-0.019
dam-0.019
Token you
Token ID345
Feature activation+1.82e-01

Pos logit contributions
vals+0.018
 Ple+0.017
GY+0.017
 gal+0.017
 AMA+0.017
Neg logit contributions
irtual-0.023
asket-0.019
��-0.018
IDENT-0.017
dam-0.016
Token tell
Token ID1560
Feature activation+6.20e-01

Pos logit contributions
irtual+0.032
asket+0.026
��+0.024
IDENT+0.023
dam+0.022
Neg logit contributions
vals-0.032
 Ple-0.031
GY-0.031
 gal-0.030
 AMA-0.030
Token us
Token ID514
Feature activation+2.50e-01

Pos logit contributions
irtual+0.116
asket+0.095
��+0.089
IDENT+0.084
dam+0.082
Neg logit contributions
vals-0.102
 Ple-0.099
GY-0.098
 gal-0.098
 AMA-0.097
Token about
Token ID546
Feature activation-4.19e-01

Pos logit contributions
irtual+0.057
asket+0.049
��+0.046
IDENT+0.044
dam+0.043
Neg logit contributions
vals-0.031
 Ple-0.030
GY-0.029
 gal-0.029
 AMA-0.029
Token the
Token ID262
Feature activation-1.97e-01

Pos logit contributions
vals+0.063
 Ple+0.062
 gal+0.061
GY+0.061
 AMA+0.061
Neg logit contributions
irtual-0.084
asket-0.070
��-0.066
IDENT-0.063
dam-0.061
Token Mah
Token ID8882
Feature activation-4.29e-01

Pos logit contributions
vals+0.030
 Ple+0.029
GY+0.028
 gal+0.028
 AMA+0.028
Neg logit contributions
irtual-0.040
asket-0.033
��-0.031
IDENT-0.030
dam-0.029
Tokenone
Token ID505
Feature activation-7.74e-01

Pos logit contributions
vals+0.062
 Ple+0.060
GY+0.059
 gal+0.058
 AMA+0.058
Neg logit contributions
irtual-0.089
asket-0.076
��-0.070
IDENT-0.067
dam-0.066
Token the
Token ID262
Feature activation-1.13e-01

Pos logit contributions
vals+0.004
 Ple+0.004
GY+0.004
 gal+0.004
 AMA+0.004
Neg logit contributions
irtual-0.006
asket-0.005
��-0.005
IDENT-0.004
dam-0.004
Token list
Token ID1351
Feature activation-3.09e-01

Pos logit contributions
vals+0.015
 Ple+0.014
GY+0.014
 gal+0.014
 AMA+0.014
Neg logit contributions
irtual-0.025
asket-0.021
��-0.020
IDENT-0.019
dam-0.018
Token.
Token ID13
Feature activation-5.88e-02

Pos logit contributions
vals+0.045
 Ple+0.044
GY+0.043
 gal+0.043
 AMA+0.042
Neg logit contributions
irtual-0.064
asket-0.054
��-0.050
IDENT-0.048
dam-0.047
Token
Token ID198
Feature activation-1.44e-02

Pos logit contributions
vals+0.009
 Ple+0.008
GY+0.008
 AMA+0.008
 gal+0.008
Neg logit contributions
irtual-0.012
asket-0.010
��-0.009
IDENT-0.009
dam-0.009
Token
Token ID198
Feature activation-2.57e-02

Pos logit contributions
vals+0.002
 Ple+0.002
GY+0.002
 gal+0.002
 AMA+0.002
Neg logit contributions
irtual-0.003
asket-0.002
��-0.002
IDENT-0.002
dam-0.002
TokenAs
Token ID1722
Feature activation+6.00e-01

Pos logit contributions
vals+0.004
 Ple+0.004
GY+0.004
 gal+0.004
 AMA+0.004
Neg logit contributions
irtual-0.005
asket-0.004
��-0.004
IDENT-0.004
dam-0.004
Token the
Token ID262
Feature activation-9.99e-02

Pos logit contributions
irtual+0.120
asket+0.100
��+0.093
IDENT+0.090
dam+0.087
Neg logit contributions
vals-0.091
 Ple-0.088
GY-0.087
 gal-0.087
 AMA-0.086
Token voluntary
Token ID16171
Feature activation+4.27e-01

Pos logit contributions
vals+0.015
 Ple+0.014
GY+0.014
 gal+0.014
 AMA+0.014
Neg logit contributions
irtual-0.020
asket-0.017
��-0.016
IDENT-0.015
dam-0.015
Token filtering
Token ID25431
Feature activation-6.31e-02

Pos logit contributions
irtual+0.087
asket+0.073
��+0.068
IDENT+0.065
dam+0.063
Neg logit contributions
vals-0.063
 Ple-0.061
GY-0.060
 gal-0.060
 AMA-0.060
Token regime
Token ID7142
Feature activation-1.19e+00

Pos logit contributions
vals+0.010
 Ple+0.010
GY+0.009
 gal+0.009
 AMA+0.009
Neg logit contributions
irtual-0.012
asket-0.010
��-0.010
IDENT-0.009
dam-0.009
Token comes
Token ID2058
Feature activation-1.79e-01

Pos logit contributions
vals+0.180
 Ple+0.175
GY+0.172
 gal+0.171
 AMA+0.171
Neg logit contributions
irtual-0.237
asket-0.198
��-0.186
IDENT-0.177
dam-0.172
Token it
Token ID340
Feature activation-1.15e-01

Pos logit contributions
irtual+0.071
asket+0.060
��+0.056
IDENT+0.053
dam+0.052
Neg logit contributions
vals-0.054
 Ple-0.053
GY-0.052
 gal-0.052
 AMA-0.051
Token was
Token ID373
Feature activation+2.76e-02

Pos logit contributions
vals+0.018
 Ple+0.017
GY+0.017
 gal+0.017
 AMA+0.017
Neg logit contributions
irtual-0.023
asket-0.019
��-0.018
IDENT-0.017
dam-0.016
Token ready
Token ID3492
Feature activation-1.46e-01

Pos logit contributions
irtual+0.006
asket+0.005
��+0.004
IDENT+0.004
dam+0.004
Neg logit contributions
vals-0.004
 Ple-0.004
GY-0.004
 gal-0.004
 AMA-0.004
Token to
Token ID284
Feature activation+1.47e-02

Pos logit contributions
vals+0.022
 Ple+0.021
GY+0.021
 AMA+0.021
 gal+0.021
Neg logit contributions
irtual-0.029
asket-0.024
��-0.023
IDENT-0.022
dam-0.021
Token loan
Token ID8063
Feature activation+9.74e-02

Pos logit contributions
irtual+0.003
asket+0.003
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 AMA-0.002
 gal-0.002
Token him
Token ID683
Feature activation+5.41e-01

Pos logit contributions
irtual+0.016
asket+0.013
��+0.012
IDENT+0.011
dam+0.011
Neg logit contributions
vals-0.018
 Ple-0.018
GY-0.017
 gal-0.017
 AMA-0.017
Token to
Token ID284
Feature activation-7.71e-03

Pos logit contributions
irtual+0.108
asket+0.091
��+0.084
IDENT+0.081
dam+0.079
Neg logit contributions
vals-0.082
 Ple-0.079
GY-0.078
 gal-0.078
 AMA-0.077
Token Bel
Token ID3944
Feature activation+4.80e-01

Pos logit contributions
vals+0.001
 Ple+0.001
GY+0.001
 gal+0.001
 AMA+0.001
Neg logit contributions
irtual-0.002
asket-0.001
��-0.001
IDENT-0.001
dam-0.001
Tokenen
Token ID268
Feature activation+4.98e-02

Pos logit contributions
irtual+0.086
asket+0.070
��+0.064
IDENT+0.061
dam+0.060
Neg logit contributions
vals-0.083
 Ple-0.081
GY-0.080
 AMA-0.079
 gal-0.079
Tokenenses
Token ID4541
Feature activation-1.27e-01

Pos logit contributions
irtual+0.010
asket+0.008
��+0.007
IDENT+0.007
dam+0.007
Neg logit contributions
vals-0.008
 Ple-0.008
GY-0.007
 gal-0.007
 AMA-0.007
Token,
Token ID11
Feature activation+7.15e-03

Pos logit contributions
vals+0.018
 Ple+0.018
GY+0.018
 gal+0.017
 AMA+0.017
Neg logit contributions
irtual-0.026
asket-0.022
��-0.021
IDENT-0.020
dam-0.019
Token,
Token ID11
Feature activation+7.01e-03

Pos logit contributions
vals+0.027
 Ple+0.026
GY+0.026
 gal+0.025
 AMA+0.025
Neg logit contributions
irtual-0.039
asket-0.033
��-0.031
IDENT-0.030
dam-0.029
Token a
Token ID257
Feature activation+3.38e-02

Pos logit contributions
irtual+0.001
asket+0.001
��+0.001
IDENT+0.001
dam+0.001
Neg logit contributions
vals-0.001
 Ple-0.001
GY-0.001
 gal-0.001
 AMA-0.001
Token gallon
Token ID26860
Feature activation+2.83e-01

Pos logit contributions
irtual+0.007
asket+0.006
��+0.005
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 gal-0.005
 AMA-0.005
Token of
Token ID286
Feature activation+2.39e-02

Pos logit contributions
irtual+0.056
asket+0.047
��+0.044
IDENT+0.042
dam+0.041
Neg logit contributions
vals-0.043
 Ple-0.042
GY-0.041
 gal-0.041
 AMA-0.041
Token gasoline
Token ID21408
Feature activation+1.20e+00

Pos logit contributions
irtual+0.004
asket+0.003
��+0.003
IDENT+0.003
dam+0.003
Neg logit contributions
vals-0.004
 Ple-0.004
GY-0.004
 gal-0.004
 AMA-0.004
Token costs
Token ID3484
Feature activation+7.43e-01

Pos logit contributions
irtual+0.234
asket+0.193
��+0.182
IDENT+0.172
dam+0.166
Neg logit contributions
vals-0.190
 Ple-0.185
 gal-0.183
GY-0.183
 AMA-0.181
Token us
Token ID514
Feature activation+3.01e-01

Pos logit contributions
irtual+0.153
asket+0.129
��+0.121
IDENT+0.116
dam+0.113
Neg logit contributions
vals-0.108
 Ple-0.104
GY-0.103
 gal-0.102
 AMA-0.102
Token about
Token ID546
Feature activation-4.14e-01

Pos logit contributions
irtual+0.061
asket+0.052
��+0.048
IDENT+0.046
dam+0.045
Neg logit contributions
vals-0.045
 Ple-0.043
GY-0.042
 AMA-0.042
 gal-0.042
Token $
Token ID720
Feature activation-1.40e-02

Pos logit contributions
vals+0.059
 Ple+0.056
GY+0.056
 AMA+0.055
 gal+0.055
Neg logit contributions
irtual-0.087
asket-0.074
��-0.069
IDENT-0.066
dam-0.064
Token3
Token ID18
Feature activation-8.02e-02

Pos logit contributions
vals+0.002
 Ple+0.002
GY+0.002
 gal+0.002
 AMA+0.002
Neg logit contributions
irtual-0.003
asket-0.002
��-0.002
IDENT-0.002
dam-0.002
Token.
Token ID13
Feature activation-8.47e-02

Pos logit contributions
vals+0.011
 Ple+0.011
GY+0.011
 gal+0.011
 AMA+0.011
Neg logit contributions
irtual-0.017
asket-0.014
��-0.013
IDENT-0.013
dam-0.012
Token in
Token ID287
Feature activation-2.28e-01

Pos logit contributions
irtual+0.012
asket+0.010
��+0.010
IDENT+0.009
dam+0.009
Neg logit contributions
vals-0.007
 Ple-0.007
GY-0.007
 gal-0.007
 AMA-0.007
Token their
Token ID511
Feature activation-1.43e-01

Pos logit contributions
vals+0.033
 Ple+0.032
GY+0.031
 gal+0.031
 AMA+0.031
Neg logit contributions
irtual-0.047
asket-0.040
��-0.038
IDENT-0.036
dam-0.035
Token opposition
Token ID5471
Feature activation+1.39e-02

Pos logit contributions
vals+0.019
 Ple+0.019
GY+0.018
 gal+0.018
 AMA+0.018
Neg logit contributions
irtual-0.031
asket-0.026
��-0.025
IDENT-0.024
dam-0.023
Token to
Token ID284
Feature activation-2.02e-02

Pos logit contributions
irtual+0.003
asket+0.002
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Token the
Token ID262
Feature activation-9.61e-02

Pos logit contributions
vals+0.003
 Ple+0.003
GY+0.003
 gal+0.003
 AMA+0.003
Neg logit contributions
irtual-0.004
asket-0.003
��-0.003
IDENT-0.003
dam-0.003
Token United
Token ID1578
Feature activation+3.78e-02

Pos logit contributions
vals+0.015
 Ple+0.015
GY+0.014
 gal+0.014
 AMA+0.014
Neg logit contributions
irtual-0.019
asket-0.016
��-0.015
IDENT-0.014
dam-0.013
Token States
Token ID1829
Feature activation-1.32e-01

Pos logit contributions
irtual+0.008
asket+0.007
��+0.007
IDENT+0.006
dam+0.006
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 gal-0.005
 AMA-0.005
Token,
Token ID11
Feature activation-1.62e-02

Pos logit contributions
vals+0.019
 Ple+0.018
GY+0.018
 gal+0.018
 AMA+0.018
Neg logit contributions
irtual-0.027
asket-0.023
��-0.022
IDENT-0.021
dam-0.020
Token Haiti
Token ID25051
Feature activation+5.86e-01

Pos logit contributions
vals+0.002
 Ple+0.002
GY+0.002
 gal+0.002
 AMA+0.002
Neg logit contributions
irtual-0.003
asket-0.003
��-0.003
IDENT-0.002
dam-0.002
Token does
Token ID857
Feature activation+1.48e-01

Pos logit contributions
irtual+0.118
asket+0.099
��+0.092
IDENT+0.088
dam+0.086
Neg logit contributions
vals-0.088
 Ple-0.085
GY-0.084
 gal-0.084
 AMA-0.083
Token not
Token ID407
Feature activation-1.34e-01

Pos logit contributions
irtual+0.028
asket+0.023
��+0.021
IDENT+0.021
dam+0.020
Neg logit contributions
vals-0.024
 Ple-0.023
GY-0.023
 gal-0.023
 AMA-0.023
Token556
Token ID37864
Feature activation-4.16e-01

Pos logit contributions
vals+0.082
 Ple+0.079
GY+0.078
 gal+0.078
 AMA+0.078
Neg logit contributions
irtual-0.095
asket-0.079
��-0.073
IDENT-0.070
dam-0.068
Token x
Token ID2124
Feature activation-3.23e-02

Pos logit contributions
vals+0.053
 Ple+0.051
GY+0.050
 gal+0.049
 AMA+0.049
Neg logit contributions
irtual-0.094
asket-0.080
��-0.075
IDENT-0.073
dam-0.071
Token.
Token ID13
Feature activation+2.37e-02

Pos logit contributions
vals+0.004
 Ple+0.004
GY+0.004
 gal+0.004
 AMA+0.004
Neg logit contributions
irtual-0.007
asket-0.006
��-0.005
IDENT-0.005
dam-0.005
Tokenx
Token ID87
Feature activation-9.26e-01

Pos logit contributions
irtual+0.006
asket+0.005
��+0.005
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Token.
Token ID13
Feature activation+1.62e-02

Pos logit contributions
vals+0.127
 Ple+0.122
GY+0.120
 gal+0.119
 AMA+0.119
Neg logit contributions
irtual-0.198
asket-0.169
��-0.158
IDENT-0.153
dam-0.149
Tokenx
Token ID87
Feature activation-9.52e-01

Pos logit contributions
irtual+0.004
asket+0.004
��+0.003
IDENT+0.003
dam+0.003
Neg logit contributions
vals-0.001
 Ple-0.001
GY-0.001
 gal-0.001
 AMA-0.001
Token.
Token ID13
Feature activation+3.78e-03

Pos logit contributions
vals+0.128
 Ple+0.123
GY+0.121
 gal+0.120
 AMA+0.120
Neg logit contributions
irtual-0.208
asket-0.176
��-0.166
IDENT-0.159
dam-0.155
Token145
Token ID18781
Feature activation-3.38e-01

Pos logit contributions
irtual+0.001
asket+0.001
��+0.001
IDENT+0.001
 Ukrain+0.001
Neg logit contributions
vals-0.001
 Ple-0.000
GY-0.000
 AMA-0.000
 gal-0.000
Token 12
Token ID1105
Feature activation-1.17e-01

Pos logit contributions
vals+0.044
 Ple+0.042
GY+0.042
 gal+0.041
 AMA+0.041
Neg logit contributions
irtual-0.074
asket-0.064
��-0.060
IDENT-0.058
dam-0.057
Token457
Token ID33032
Feature activation+5.42e-01

Pos logit contributions
vals+0.020
 Ple+0.020
GY+0.019
 gal+0.019
 AMA+0.019
Neg logit contributions
irtual-0.021
asket-0.017
��-0.016
IDENT-0.015
dam-0.015
Token01
Token ID486
Feature activation+2.93e-01

Pos logit contributions
irtual+0.098
asket+0.080
��+0.074
IDENT+0.071
dam+0.068
Neg logit contributions
vals-0.093
 Ple-0.090
GY-0.089
 gal-0.088
 AMA-0.088
Token pomp
Token ID49435
Feature activation+5.85e-02

Pos logit contributions
vals+0.010
 Ple+0.010
GY+0.010
 gal+0.010
 AMA+0.010
Neg logit contributions
irtual-0.012
asket-0.010
��-0.009
IDENT-0.009
dam-0.009
Token and
Token ID290
Feature activation-7.64e-03

Pos logit contributions
irtual+0.012
asket+0.011
��+0.010
IDENT+0.010
dam+0.009
Neg logit contributions
vals-0.008
 Ple-0.008
GY-0.008
 gal-0.008
 AMA-0.008
Token ceremony
Token ID10542
Feature activation+1.46e+00

Pos logit contributions
vals+0.001
 Ple+0.001
 gal+0.001
GY+0.001
 AMA+0.001
Neg logit contributions
irtual-0.001
asket-0.001
��-0.001
IDENT-0.001
dam-0.001
Token,
Token ID11
Feature activation+3.68e-02

Pos logit contributions
irtual+0.287
asket+0.239
��+0.223
IDENT+0.213
dam+0.207
Neg logit contributions
vals-0.227
 Ple-0.220
GY-0.217
 gal-0.215
 AMA-0.215
Token nothing
Token ID2147
Feature activation-6.37e-01

Pos logit contributions
irtual+0.007
asket+0.006
��+0.006
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.006
 Ple-0.006
GY-0.006
 gal-0.005
 AMA-0.005
Token in
Token ID287
Feature activation-1.74e-01

Pos logit contributions
vals+0.097
 Ple+0.093
GY+0.092
 AMA+0.091
 gal+0.091
Neg logit contributions
irtual-0.127
asket-0.107
��-0.099
IDENT-0.096
dam-0.093
Token the
Token ID262
Feature activation-6.08e-02

Pos logit contributions
vals+0.027
 Ple+0.026
GY+0.026
 gal+0.025
 AMA+0.025
Neg logit contributions
irtual-0.035
asket-0.029
��-0.027
IDENT-0.026
dam-0.025
Token president
Token ID1893
Feature activation-1.66e-01

Pos logit contributions
vals+0.009
 Ple+0.008
GY+0.008
 gal+0.008
 AMA+0.008
Neg logit contributions
irtual-0.013
asket-0.011
��-0.010
IDENT-0.010
dam-0.009
Token
Token ID447
Feature activation+3.17e-02

Pos logit contributions
vals+0.021
 Ple+0.020
GY+0.020
 gal+0.019
 AMA+0.019
Neg logit contributions
irtual-0.038
asket-0.032
��-0.031
IDENT-0.029
dam-0.029
Token
Token ID247
Feature activation+1.26e-02

Pos logit contributions
irtual+0.006
asket+0.005
��+0.005
IDENT+0.005
dam+0.004
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 gal-0.005
 AMA-0.005
Tokens
Token ID82
Feature activation+5.42e-02

Pos logit contributions
irtual+0.003
asket+0.002
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Token brought
Token ID3181
Feature activation-5.90e-01

Pos logit contributions
vals+0.001
 Ple+0.001
GY+0.001
 gal+0.001
 AMA+0.001
Neg logit contributions
irtual-0.001
asket-0.001
��-0.001
IDENT-0.001
dam-0.001
Token in
Token ID287
Feature activation-1.68e-01

Pos logit contributions
vals+0.090
 Ple+0.087
GY+0.086
 gal+0.085
 AMA+0.085
Neg logit contributions
irtual-0.118
asket-0.098
��-0.092
IDENT-0.088
dam-0.085
Token an
Token ID281
Feature activation+3.16e-02

Pos logit contributions
vals+0.026
 Ple+0.025
GY+0.024
 gal+0.024
 AMA+0.024
Neg logit contributions
irtual-0.034
asket-0.028
��-0.026
IDENT-0.025
dam-0.024
Token unknown
Token ID6439
Feature activation-2.70e-01

Pos logit contributions
irtual+0.006
asket+0.005
��+0.005
IDENT+0.005
dam+0.004
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 gal-0.005
 AMA-0.005
Token to
Token ID284
Feature activation+3.09e-02

Pos logit contributions
vals+0.037
 Ple+0.036
GY+0.035
 gal+0.035
 AMA+0.035
Neg logit contributions
irtual-0.058
asket-0.049
��-0.046
IDENT-0.044
dam-0.043
Token rewrite
Token ID28183
Feature activation+1.15e-01

Pos logit contributions
irtual+0.006
asket+0.005
��+0.004
IDENT+0.004
dam+0.004
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 gal-0.005
 AMA-0.005
Token it
Token ID340
Feature activation-1.07e-01

Pos logit contributions
irtual+0.023
asket+0.020
��+0.018
IDENT+0.018
dam+0.017
Neg logit contributions
vals-0.017
 Ple-0.017
GY-0.016
 gal-0.016
 AMA-0.016
Token.
Token ID13
Feature activation-6.40e-04

Pos logit contributions
vals+0.015
 Ple+0.014
GY+0.014
 gal+0.014
 AMA+0.014
Neg logit contributions
irtual-0.023
asket-0.019
��-0.018
IDENT-0.017
dam-0.017
Token David
Token ID3271
Feature activation+1.42e-02

Pos logit contributions
vals+0.000
 Ple+0.000
GY+0.000
 gal+0.000
 AMA+0.000
Neg logit contributions
irtual-0.000
asket-0.000
��-0.000
IDENT-0.000
dam-0.000
Token S
Token ID311
Feature activation+2.77e-01

Pos logit contributions
irtual+0.003
asket+0.002
��+0.002
IDENT+0.002
dam+0.002
Neg logit contributions
vals-0.002
 Ple-0.002
GY-0.002
 gal-0.002
 AMA-0.002
Tokenelt
Token ID2120
Feature activation+4.55e-01

Pos logit contributions
irtual+0.057
asket+0.048
��+0.045
IDENT+0.043
dam+0.042
Neg logit contributions
vals-0.041
 Ple-0.039
GY-0.039
 gal-0.039
 AMA-0.038
Token to
Token ID284
Feature activation+4.07e-02

Pos logit contributions
vals+0.095
 Ple+0.093
GY+0.091
 gal+0.091
 AMA+0.091
Neg logit contributions
irtual-0.120
asket-0.100
��-0.094
IDENT-0.089
dam-0.087
Token your
Token ID534
Feature activation-1.66e-01

Pos logit contributions
irtual+0.008
asket+0.007
��+0.007
IDENT+0.006
dam+0.006
Neg logit contributions
vals-0.006
 Ple-0.006
GY-0.006
 gal-0.006
 AMA-0.006
Token monitor
Token ID5671
Feature activation-4.18e-01

Pos logit contributions
vals+0.025
 Ple+0.024
GY+0.024
 gal+0.024
 AMA+0.024
Neg logit contributions
irtual-0.033
asket-0.027
��-0.026
IDENT-0.025
dam-0.024
Token or
Token ID393
Feature activation+1.76e-01

Pos logit contributions
vals+0.065
 Ple+0.063
GY+0.063
 gal+0.062
 AMA+0.062
Neg logit contributions
irtual-0.082
asket-0.068
��-0.063
IDENT-0.061
dam-0.059
Token tv
Token ID31557
Feature activation-2.55e-03

Pos logit contributions
irtual+0.035
asket+0.029
��+0.027
IDENT+0.026
dam+0.025
Neg logit contributions
vals-0.027
 Ple-0.026
GY-0.026
 gal-0.026
 AMA-0.026
Token you
Token ID345
Feature activation+2.95e-01

Pos logit contributions
vals+0.000
 Ple+0.000
GY+0.000
 gal+0.000
 AMA+0.000
Neg logit contributions
irtual-0.001
asket-0.000
��-0.000
IDENT-0.000
dam-0.000
Token'll
Token ID1183
Feature activation+2.88e-01

Pos logit contributions
irtual+0.059
asket+0.049
��+0.046
IDENT+0.044
dam+0.043
Neg logit contributions
vals-0.044
 Ple-0.043
GY-0.043
 gal-0.042
 AMA-0.042
Token see
Token ID766
Feature activation+7.13e-01

Pos logit contributions
irtual+0.052
asket+0.042
��+0.039
IDENT+0.037
dam+0.036
Neg logit contributions
vals-0.050
 Ple-0.048
GY-0.048
 gal-0.048
 AMA-0.047
Token the
Token ID262
Feature activation-3.69e-02

Pos logit contributions
irtual+0.143
asket+0.120
��+0.112
IDENT+0.107
dam+0.104
Neg logit contributions
vals-0.108
 Ple-0.104
GY-0.103
 gal-0.102
 AMA-0.102
Token same
Token ID976
Feature activation-4.01e-01

Pos logit contributions
vals+0.005
 Ple+0.005
GY+0.005
 gal+0.005
 AMA+0.005
Neg logit contributions
irtual-0.008
asket-0.006
��-0.006
IDENT-0.006
dam-0.005
Token thing
Token ID1517
Feature activation-6.77e-01

Pos logit contributions
vals+0.058
 Ple+0.057
GY+0.056
 gal+0.055
 AMA+0.055
Neg logit contributions
irtual-0.083
asket-0.070
��-0.065
IDENT-0.063
dam-0.061
Token situation
Token ID3074
Feature activation+5.35e-01

Pos logit contributions
irtual+0.030
asket+0.025
��+0.023
IDENT+0.022
dam+0.022
Neg logit contributions
vals-0.024
 Ple-0.024
GY-0.023
 gal-0.023
 AMA-0.023
Token to
Token ID284
Feature activation+3.14e-02

Pos logit contributions
irtual+0.108
asket+0.090
��+0.085
IDENT+0.081
dam+0.078
Neg logit contributions
vals-0.080
 Ple-0.078
 gal-0.077
GY-0.076
 AMA-0.076
Token enter
Token ID3802
Feature activation+3.51e-01

Pos logit contributions
irtual+0.006
asket+0.005
��+0.005
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 gal-0.004
 AMA-0.004
Token the
Token ID262
Feature activation-9.26e-02

Pos logit contributions
irtual+0.072
asket+0.060
��+0.056
IDENT+0.054
dam+0.052
Neg logit contributions
vals-0.052
 Ple-0.050
GY-0.049
 gal-0.049
 AMA-0.049
Token Free
Token ID3232
Feature activation+3.45e-01

Pos logit contributions
vals+0.014
 Ple+0.014
GY+0.014
 gal+0.014
 AMA+0.014
Neg logit contributions
irtual-0.018
asket-0.015
��-0.014
IDENT-0.014
dam-0.013
Token Speech
Token ID24709
Feature activation-1.53e+00

Pos logit contributions
irtual+0.093
asket+0.082
��+0.078
IDENT+0.076
dam+0.074
Neg logit contributions
vals-0.028
 Ple-0.027
GY-0.026
 gal-0.026
 AMA-0.026
Token Zone
Token ID13035
Feature activation-8.46e-01

Pos logit contributions
vals+0.159
 Ple+0.150
GY+0.148
 gal+0.146
 AMA+0.146
Neg logit contributions
irtual-0.379
asket-0.329
��-0.312
IDENT-0.302
dam-0.296
Token outside
Token ID2354
Feature activation+1.71e-01

Pos logit contributions
vals+0.123
 Ple+0.120
GY+0.118
 gal+0.117
 AMA+0.117
Neg logit contributions
irtual-0.174
asket-0.146
��-0.138
IDENT-0.131
dam-0.127
Token the
Token ID262
Feature activation-8.85e-02

Pos logit contributions
irtual+0.034
asket+0.028
��+0.026
IDENT+0.025
dam+0.024
Neg logit contributions
vals-0.026
 Ple-0.026
GY-0.025
 gal-0.025
 AMA-0.025
Token pre
Token ID662
Feature activation-4.17e-01

Pos logit contributions
vals+0.014
 Ple+0.013
GY+0.013
 gal+0.013
 AMA+0.013
Neg logit contributions
irtual-0.018
asket-0.015
��-0.014
IDENT-0.013
dam-0.013
Tokenz
Token ID89
Feature activation-8.17e-01

Pos logit contributions
vals+0.058
 Ple+0.056
GY+0.055
 gal+0.055
 AMA+0.055
Neg logit contributions
irtual-0.089
asket-0.075
��-0.071
IDENT-0.068
dam-0.066
Token
Token ID447
Feature activation-5.55e-02

Pos logit contributions
vals+0.118
 Ple+0.114
GY+0.112
 gal+0.111
 AMA+0.111
Neg logit contributions
irtual-0.161
asket-0.135
��-0.127
IDENT-0.121
dam-0.118
Token
Token ID247
Feature activation-6.96e-02

Pos logit contributions
vals+0.011
 Ple+0.010
GY+0.010
 AMA+0.010
 gal+0.010
Neg logit contributions
irtual-0.009
asket-0.007
��-0.007
IDENT-0.006
dam-0.006
Tokens
Token ID82
Feature activation-4.81e-02

Pos logit contributions
vals+0.011
 Ple+0.011
GY+0.011
 AMA+0.010
 gal+0.010
Neg logit contributions
irtual-0.013
asket-0.011
��-0.010
IDENT-0.010
dam-0.010
Token Director
Token ID5890
Feature activation-2.82e-01

Pos logit contributions
vals+0.007
 Ple+0.007
GY+0.007
 gal+0.007
 AMA+0.007
Neg logit contributions
irtual-0.010
asket-0.008
��-0.008
IDENT-0.007
dam-0.007
Token of
Token ID286
Feature activation+3.35e-02

Pos logit contributions
vals+0.044
 Ple+0.042
GY+0.042
 gal+0.042
 AMA+0.042
Neg logit contributions
irtual-0.055
asket-0.046
��-0.043
IDENT-0.041
dam-0.040
Token Parking
Token ID29259
Feature activation-1.99e+00

Pos logit contributions
irtual+0.007
asket+0.006
��+0.005
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 gal-0.005
 AMA-0.005
Token,
Token ID11
Feature activation+3.57e-02

Pos logit contributions
vals+0.279
 Ple+0.268
GY+0.266
 gal+0.263
 AMA+0.262
Neg logit contributions
irtual-0.420
asket-0.355
��-0.331
IDENT-0.319
dam-0.311
Token Transportation
Token ID15198
Feature activation-3.35e-01

Pos logit contributions
irtual+0.007
asket+0.006
��+0.006
IDENT+0.006
dam+0.005
Neg logit contributions
vals-0.005
 Ple-0.005
GY-0.005
 gal-0.005
 AMA-0.005
Token and
Token ID290
Feature activation-2.17e-04

Pos logit contributions
vals+0.046
 Ple+0.044
GY+0.043
 gal+0.043
 AMA+0.043
Neg logit contributions
irtual-0.072
asket-0.061
��-0.058
IDENT-0.055
dam-0.054
Token Services
Token ID6168
Feature activation+1.71e-01

Pos logit contributions
vals+0.000
 Ple+0.000
GY+0.000
 gal+0.000
 AMA+0.000
Neg logit contributions
irtual-0.000
asket-0.000
��-0.000
IDENT-0.000
dam-0.000
Token Kevin
Token ID7939
Feature activation-1.18e-01

Pos logit contributions
irtual+0.035
asket+0.029
��+0.027
IDENT+0.026
dam+0.026
Neg logit contributions
vals-0.025
 Ple-0.024
GY-0.024
 gal-0.024
 AMA-0.024
Token,
Token ID11
Feature activation+8.74e-02

Pos logit contributions
vals+0.025
 Ple+0.024
GY+0.023
 gal+0.023
 AMA+0.023
Neg logit contributions
irtual-0.035
asket-0.029
��-0.027
IDENT-0.026
dam-0.026
Token and
Token ID290
Feature activation+5.41e-02

Pos logit contributions
irtual+0.018
asket+0.015
��+0.014
IDENT+0.013
dam+0.013
Neg logit contributions
vals-0.013
 Ple-0.013
GY-0.012
 gal-0.012
 AMA-0.012
Token by
Token ID416
Feature activation+3.91e-01

Pos logit contributions
irtual+0.010
asket+0.009
��+0.008
IDENT+0.008
dam+0.007
Neg logit contributions
vals-0.009
 Ple-0.008
GY-0.008
 gal-0.008
 AMA-0.008
Token prohibiting
Token ID30363
Feature activation-1.66e-01

Pos logit contributions
irtual+0.073
asket+0.060
��+0.056
IDENT+0.053
dam+0.051
Neg logit contributions
vals-0.065
 Ple-0.063
GY-0.062
 gal-0.062
 AMA-0.061
Token free
Token ID1479
Feature activation+2.37e-01

Pos logit contributions
vals+0.026
 Ple+0.025
GY+0.025
 gal+0.024
 AMA+0.024
Neg logit contributions
irtual-0.033
asket-0.027
��-0.025
IDENT-0.024
dam-0.024
Token assault
Token ID4641
Feature activation-1.26e+00

Pos logit contributions
irtual+0.050
asket+0.042
��+0.040
IDENT+0.038
dam+0.037
Neg logit contributions
vals-0.033
 Ple-0.032
GY-0.031
 gal-0.031
 AMA-0.031
Token on
Token ID319
Feature activation+2.99e-01

Pos logit contributions
vals+0.190
 Ple+0.184
GY+0.181
 gal+0.181
 AMA+0.180
Neg logit contributions
irtual-0.255
asket-0.213
��-0.200
IDENT-0.191
dam-0.185
Token imperial
Token ID14312
Feature activation-6.05e-01

Pos logit contributions
irtual+0.059
asket+0.049
��+0.046
IDENT+0.044
dam+0.043
Neg logit contributions
vals-0.046
GY-0.044
 Ple-0.044
 AMA-0.043
 gal-0.043
Token citizens
Token ID4290
Feature activation+3.89e-01

Pos logit contributions
vals+0.086
GY+0.082
 Ple+0.082
 AMA+0.080
+0.079
Neg logit contributions
irtual-0.127
asket-0.106
��-0.101
IDENT-0.096
dam-0.094
Token.
Token ID13
Feature activation+4.43e-02

Pos logit contributions
irtual+0.080
asket+0.067
��+0.063
IDENT+0.060
dam+0.059
Neg logit contributions
vals-0.057
 Ple-0.055
GY-0.054
 AMA-0.053
 gal-0.053
Token Practices
Token ID42134
Feature activation+5.08e-01

Pos logit contributions
irtual+0.009
asket+0.007
��+0.007
IDENT+0.007
dam+0.006
Neg logit contributions
vals-0.007
 Ple-0.006
GY-0.006
 gal-0.006
 AMA-0.006
Token
Token ID247
Feature activation-5.99e-03

Pos logit contributions
irtual+0.007
asket+0.005
��+0.005
IDENT+0.005
dam+0.005
Neg logit contributions
vals-0.008
 Ple-0.008
GY-0.008
 AMA-0.008
 gal-0.008
Tokenre
Token ID260
Feature activation-4.40e-01

Pos logit contributions
vals+0.001
 Ple+0.001
GY+0.001
 gal+0.001
 AMA+0.001
Neg logit contributions
irtual-0.001
asket-0.001
��-0.001
IDENT-0.001
dam-0.001
Token talking
Token ID3375
Feature activation+6.48e-01

Pos logit contributions
vals+0.085
 Ple+0.083
GY+0.082
 gal+0.081
 AMA+0.081
Neg logit contributions
irtual-0.070
asket-0.056
��-0.051
IDENT-0.048
dam-0.046
Token about
Token ID546
Feature activation-3.20e-01

Pos logit contributions
irtual+0.147
asket+0.127
��+0.119
IDENT+0.115
dam+0.112
Neg logit contributions
vals-0.081
 Ple-0.077
GY-0.076
 AMA-0.075
 gal-0.075
Token.
Token ID13
Feature activation+2.50e-03

Pos logit contributions
vals+0.045
 Ple+0.043
GY+0.043
 gal+0.042
 AMA+0.042
Neg logit contributions
irtual-0.068
asket-0.057
��-0.054
IDENT-0.052
dam-0.050
Token Like
Token ID4525
Feature activation-1.15e+00

Pos logit contributions
irtual+0.001
asket+0.000
��+0.000
IDENT+0.000
dam+0.000
Neg logit contributions
vals-0.000
 Ple-0.000
GY-0.000
 gal-0.000
 AMA-0.000
Token when
Token ID618
Feature activation+2.26e-01

Pos logit contributions
vals+0.173
 Ple+0.167
GY+0.165
 gal+0.164
 AMA+0.163
Neg logit contributions
irtual-0.233
asket-0.195
��-0.183
IDENT-0.175
dam-0.170
Token I
Token ID314
Feature activation+4.52e-02

Pos logit contributions
irtual+0.043
asket+0.036
��+0.033
IDENT+0.032
dam+0.031
Neg logit contributions
vals-0.036
 Ple-0.035
GY-0.035
 gal-0.034
 AMA-0.034
Token talk
Token ID1561
Feature activation+3.19e-01

Pos logit contributions
irtual+0.009
asket+0.007
��+0.007
IDENT+0.007
dam+0.006
Neg logit contributions
vals-0.007
 Ple-0.007
GY-0.007
 gal-0.007
 AMA-0.007
Token to
Token ID284
Feature activation+7.61e-02

Pos logit contributions
irtual+0.070
asket+0.059
��+0.056
IDENT+0.054
dam+0.052
Neg logit contributions
vals-0.042
 Ple-0.041
GY-0.040
 gal-0.040
 AMA-0.040
Token a
Token ID257
Feature activation+7.83e-02

Pos logit contributions
irtual+0.014
asket+0.012
��+0.011
IDENT+0.011
dam+0.010
Neg logit contributions
vals-0.012
 Ple-0.012
GY-0.012
 gal-0.012
 AMA-0.012
Tokenin
Token ID259
Feature activation+7.11e-02

Pos logit contributions
vals+0.065
 Ple+0.063
GY+0.063
 gal+0.062
 AMA+0.062
Neg logit contributions
irtual-0.092
asket-0.077
��-0.072
IDENT-0.069
dam-0.068
Tokenik
Token ID1134
Feature activation+2.68e-01

Pos logit contributions
irtual+0.014
asket+0.012
��+0.011
IDENT+0.011
dam+0.011
Neg logit contributions
vals-0.011
 Ple-0.010
GY-0.010
 gal-0.010
 AMA-0.010
Tokenain
Token ID391
Feature activation+1.32e-01

Pos logit contributions
irtual+0.050
asket+0.041
��+0.039
IDENT+0.037
dam+0.036
Neg logit contributions
vals-0.044
 Ple-0.043
GY-0.042
 gal-0.042
 AMA-0.042
Tokenen
Token ID268
Feature activation+1.56e-01

Pos logit contributions
irtual+0.026
asket+0.022
��+0.021
IDENT+0.020
dam+0.019
Neg logit contributions
vals-0.020
 Ple-0.019
GY-0.019
 gal-0.019
 AMA-0.019
Token @
Token ID2488
Feature activation-1.74e-01

Pos logit contributions
irtual+0.030
asket+0.025
��+0.023
IDENT+0.022
dam+0.021
Neg logit contributions
vals-0.025
 Ple-0.024
GY-0.024
 gal-0.024
 AMA-0.024
Token il
Token ID4229
Feature activation-1.27e+00

Pos logit contributions
vals+0.026
 Ple+0.025
GY+0.025
 gal+0.024
 AMA+0.024
Neg logit contributions
irtual-0.035
asket-0.030
��-0.028
IDENT-0.027
dam-0.026
Tokenmedia
Token ID11431
Feature activation+2.82e-01

Pos logit contributions
vals+0.186
 Ple+0.180
GY+0.178
 gal+0.176
 AMA+0.176
Neg logit contributions
irtual-0.260
asket-0.218
��-0.203
IDENT-0.195
dam-0.190
Token.
Token ID13
Feature activation+2.01e-01

Pos logit contributions
irtual+0.058
asket+0.048
��+0.045
IDENT+0.044
dam+0.042
Neg logit contributions
vals-0.041
 Ple-0.040
GY-0.039
 gal-0.039
 AMA-0.039
Tokenfi
Token ID12463
Feature activation+4.36e-01

Pos logit contributions
irtual+0.042
asket+0.035
��+0.033
IDENT+0.032
dam+0.031
Neg logit contributions
vals-0.029
 Ple-0.028
GY-0.028
 gal-0.027
 AMA-0.027
TokenTranslation
Token ID48313
Feature activation+8.79e-01

Pos logit contributions
irtual+0.089
asket+0.075
��+0.070
IDENT+0.067
dam+0.066
Neg logit contributions
vals-0.064
 Ple-0.062
GY-0.061
 gal-0.061
 AMA-0.060
Token to
Token ID284
Feature activation-2.87e-02

Pos logit contributions
irtual+0.180
asket+0.151
��+0.142
IDENT+0.136
dam+0.132
Neg logit contributions
vals-0.129
 Ple-0.125
GY-0.123
 gal-0.123
 AMA-0.122