TOP ACTIVATIONS
MAX = 0.587

months without rain . Th irst y Marshall ese , many
( t ) = exp (- ( t - t 0
( 1 exp (- 1 / u ))
Wilson . R ANK 4 Xavier Rhodes , Vikings
season .) R ANK 3 Patrick Peterson , Cardinals
awaits . R ANK 6 Jal en Ramsey ,
rankings . R ANK 1 Josh Norman , Redskins
iggs . R ANK 2 Richard Sherman , Seahawks
a molecule expressed as exp (- t / u ) where
headed chance and forcing Gu iller mo O cho a into
Ms . Set zer said . These
As my colleague Nicole O zer has discussed , there is
THE L AND OF EXT REM ELY EXP ENS IVE BE
jas Gil ( M ov istar Team ) s . t
kind of f iller between moves ,
Facebook ) I hor Mal anch ak is the CEO of
fades . R ANK 5 Jimmy Smith , Ravens
run - off between Gu iller mo L asso and Lenin
Von Miller (@ von m iller ) on Sep 30 ,
FR ANK M ORD ICA , V

MIN ACTIVATIONS
MIN = -0.543

Sh ams ia Hass ani , Afghanistan
with two flat head screw drivers . It can be a
St . and Bou lt bee Ave . area , north
is a security algorithm for IEEE 802 . 11 wireless networks
said G ools bee . This is
W PA 2 ), the IEEE declared that both W EP
privacy component of the original IEEE 802 . 11 standard ratified
with money and coat - tails than to minorities . The
f 4 b be 66 IEEE 802 . 11 b gn
San Francisco Board of Super visors San Francisco ,
f 4 b be 66 IEEE 802 . 11 b gn
fer ometer Grav itational - wave Observatory ( L IG O
, or whether a honey bee might also demonstrate rule learning
petty - bour ge ois intellig ents ia resisted en masse
How Bees See <|endoftext|> The fuss is in the flavors :
petty - bour ge ois intellig ents ia forged in haste
petty - bour ge ois intellig ents ia could vind icate
. Mr . Walton s physical contact
Zub air Bl ocked Un block Follow Following Nov 21 ,
Chicago - based he i ress Beat rice Joyce Ke an

INTERVAL 0.305 - 0.587
CONTAINS 0.235%

pages . Join the revolution TODAY ! It s
hen ry . c ordes @ ow h . com ,
, but less severe than Chern obyl . A
Post - p and emic : activity returns to normal
adapt able , decentral ised tool that is the English language

INTERVAL 0.022 - 0.305
CONTAINS 29.362%

The next round was a coding phone interview .
each other in a one - on - one , single
to be a modern states man . Enter the
by his character that they failed to name him to their
passer rating against the two was 104 . 20 and 111

INTERVAL -0.260 - 0.022
CONTAINS 69.729%

M itch has progressed through a comprehensive rehabilitation
Of course , I may be the gateway drug to opportunities
health , which is now a management priority , but that
Australia has six states New South Wales ( NS
and they didn t really have much competition for

INTERVAL -0.543 - -0.260
CONTAINS 0.674%

its propriet or , alone ly young man named Norman Bates
also identified with the middle class , despite not necessarily enjoying
Self less , Moral , Vigil ant - Cond
ation , the shift in demand for cleaner , cheaper energy
t actually load anything . It
Token months
Token ID1933
Feature activation-4.67e-02

Pos logit contributions
vg+0.078
irst+0.073
ANK+0.058
iller+0.057
(-+0.056
Neg logit contributions
drivers-0.083
AFTA-0.081
 IEEE-0.079
bee-0.078
ulus-0.077
Token without
Token ID1231
Feature activation-5.58e-02

Pos logit contributions
drivers+0.035
AFTA+0.034
 IEEE+0.033
bee+0.033
ulus+0.032
Neg logit contributions
vg-0.035
irst-0.033
ANK-0.026
iller-0.026
(--0.026
Token rain
Token ID6290
Feature activation-1.40e-01

Pos logit contributions
drivers+0.041
AFTA+0.040
 IEEE+0.039
bee+0.038
ulus+0.038
Neg logit contributions
vg-0.043
irst-0.040
ANK-0.033
iller-0.032
(--0.032
Token.
Token ID13
Feature activation+1.45e-03

Pos logit contributions
drivers+0.102
AFTA+0.098
 IEEE+0.096
bee+0.095
ulus+0.093
Neg logit contributions
vg-0.110
irst-0.103
ANK-0.084
iller-0.082
(--0.082
Token Th
Token ID536
Feature activation+1.22e-01

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
Tokenirst
Token ID667
Feature activation+5.87e-01

Pos logit contributions
vg+0.085
irst+0.080
ANK+0.062
iller+0.061
(-+0.060
Neg logit contributions
drivers-0.099
AFTA-0.096
 IEEE-0.095
bee-0.093
ulus-0.092
Tokeny
Token ID88
Feature activation-7.97e-03

Pos logit contributions
vg+0.465
irst+0.441
ANK+0.357
iller+0.351
(-+0.349
Neg logit contributions
drivers-0.416
AFTA-0.406
 IEEE-0.395
bee-0.387
ulus-0.381
Token Marshall
Token ID13606
Feature activation-2.88e-02

Pos logit contributions
drivers+0.006
AFTA+0.005
 IEEE+0.005
bee+0.005
ulus+0.005
Neg logit contributions
vg-0.006
irst-0.006
ANK-0.005
iller-0.005
(--0.005
Tokenese
Token ID2771
Feature activation-4.61e-02

Pos logit contributions
drivers+0.021
AFTA+0.020
 IEEE+0.020
bee+0.019
ulus+0.019
Neg logit contributions
vg-0.023
irst-0.021
ANK-0.017
iller-0.017
(--0.017
Token,
Token ID11
Feature activation-4.93e-03

Pos logit contributions
drivers+0.033
AFTA+0.032
 IEEE+0.032
bee+0.031
ulus+0.031
Neg logit contributions
vg-0.036
irst-0.034
ANK-0.027
iller-0.027
(--0.027
Token many
Token ID867
Feature activation+1.85e-01

Pos logit contributions
drivers+0.004
AFTA+0.003
 IEEE+0.003
bee+0.003
ulus+0.003
Neg logit contributions
vg-0.004
irst-0.004
ANK-0.003
iller-0.003
(--0.003
Token(
Token ID7
Feature activation+2.83e-01

Pos logit contributions
drivers+0.137
AFTA+0.132
 IEEE+0.128
bee+0.126
ulus+0.123
Neg logit contributions
vg-0.190
irst-0.180
ANK-0.150
(--0.147
iller-0.147
Tokent
Token ID83
Feature activation-4.69e-02

Pos logit contributions
vg+0.228
irst+0.215
ANK+0.175
(-+0.173
iller+0.171
Neg logit contributions
drivers-0.199
AFTA-0.193
 IEEE-0.188
bee-0.184
ratulations-0.183
Token)
Token ID8
Feature activation+3.41e-02

Pos logit contributions
drivers+0.036
AFTA+0.035
 IEEE+0.034
bee+0.033
ulus+0.033
Neg logit contributions
vg-0.035
irst-0.033
ANK-0.026
iller-0.025
(--0.025
Token =
Token ID796
Feature activation+1.58e-01

Pos logit contributions
vg+0.024
irst+0.023
ANK+0.018
iller+0.017
(-+0.017
Neg logit contributions
drivers-0.027
AFTA-0.026
 IEEE-0.026
bee-0.025
ulus-0.025
Token exp
Token ID1033
Feature activation-6.11e-02

Pos logit contributions
vg+0.130
irst+0.123
ANK+0.101
(-+0.100
iller+0.099
Neg logit contributions
drivers-0.107
AFTA-0.103
 IEEE-0.100
bee-0.099
ratulations-0.098
Token(-
Token ID32590
Feature activation+4.91e-01

Pos logit contributions
drivers+0.068
AFTA+0.067
 IEEE+0.066
bee+0.065
ulus+0.064
Neg logit contributions
vg-0.024
irst-0.021
ANK-0.012
iller-0.012
(--0.012
Token (
Token ID357
Feature activation+2.35e-02

Pos logit contributions
vg+0.407
irst+0.383
ANK+0.315
(-+0.313
iller+0.309
Neg logit contributions
drivers-0.333
AFTA-0.322
 IEEE-0.314
bee-0.308
ratulations-0.304
Tokent
Token ID83
Feature activation-5.09e-02

Pos logit contributions
vg+0.019
irst+0.018
ANK+0.015
(-+0.015
iller+0.015
Neg logit contributions
drivers-0.016
AFTA-0.016
 IEEE-0.015
bee-0.015
ratulations-0.015
Token-
Token ID12
Feature activation+2.60e-02

Pos logit contributions
drivers+0.039
AFTA+0.038
 IEEE+0.037
bee+0.036
ulus+0.036
Neg logit contributions
vg-0.038
irst-0.036
ANK-0.028
(--0.028
iller-0.028
Tokent
Token ID83
Feature activation-5.28e-02

Pos logit contributions
vg+0.020
irst+0.019
ANK+0.015
(-+0.015
iller+0.015
Neg logit contributions
drivers-0.019
AFTA-0.018
 IEEE-0.018
bee-0.018
ratulations-0.017
Token 0
Token ID657
Feature activation-2.23e-03

Pos logit contributions
drivers+0.043
AFTA+0.042
 IEEE+0.041
bee+0.040
ulus+0.040
Neg logit contributions
vg-0.037
irst-0.034
ANK-0.027
(--0.026
iller-0.026
Token
Token ID784
Feature activation-2.25e-01

Pos logit contributions
vg+0.005
irst+0.005
ANK+0.004
(-+0.004
iller+0.004
Neg logit contributions
drivers-0.006
AFTA-0.006
 IEEE-0.006
bee-0.006
ratulations-0.006
Token (
Token ID357
Feature activation+2.52e-02

Pos logit contributions
drivers+0.157
AFTA+0.152
 IEEE+0.147
bee+0.145
ratulations+0.143
Neg logit contributions
vg-0.182
irst-0.172
ANK-0.140
(--0.138
iller-0.137
Token1
Token ID16
Feature activation+2.70e-02

Pos logit contributions
vg+0.020
irst+0.019
ANK+0.015
(-+0.015
iller+0.015
Neg logit contributions
drivers-0.018
AFTA-0.017
 IEEE-0.017
bee-0.016
ratulations-0.016
Token
Token ID784
Feature activation-2.26e-01

Pos logit contributions
vg+0.021
irst+0.020
ANK+0.016
(-+0.016
iller+0.016
Neg logit contributions
drivers-0.020
AFTA-0.019
 IEEE-0.019
bee-0.018
ratulations-0.018
Token exp
Token ID1033
Feature activation-6.56e-02

Pos logit contributions
drivers+0.154
AFTA+0.149
 IEEE+0.144
bee+0.142
ratulations+0.140
Neg logit contributions
vg-0.187
irst-0.177
ANK-0.145
(--0.143
iller-0.142
Token(-
Token ID32590
Feature activation+4.88e-01

Pos logit contributions
drivers+0.069
AFTA+0.067
 IEEE+0.066
bee+0.065
ulus+0.065
Neg logit contributions
vg-0.030
irst-0.027
ANK-0.018
iller-0.017
(--0.017
Token1
Token ID16
Feature activation+4.00e-02

Pos logit contributions
vg+0.391
irst+0.369
ANK+0.302
(-+0.298
iller+0.295
Neg logit contributions
drivers-0.343
AFTA-0.332
 IEEE-0.323
bee-0.318
ratulations-0.313
Token/
Token ID14
Feature activation-1.59e-01

Pos logit contributions
vg+0.029
irst+0.027
ANK+0.022
(-+0.022
iller+0.021
Neg logit contributions
drivers-0.031
AFTA-0.030
 IEEE-0.029
bee-0.029
ratulations-0.028
Tokenu
Token ID84
Feature activation-9.31e-03

Pos logit contributions
drivers+0.119
AFTA+0.116
 IEEE+0.113
bee+0.111
ulus+0.110
Neg logit contributions
vg-0.119
irst-0.112
ANK-0.090
(--0.088
iller-0.088
Token))
Token ID4008
Feature activation+2.13e-01

Pos logit contributions
drivers+0.008
AFTA+0.008
 IEEE+0.008
bee+0.008
ulus+0.007
Neg logit contributions
vg-0.006
irst-0.006
ANK-0.004
iller-0.004
(--0.004
Token
Token ID6184
Feature activation-4.47e-02

Pos logit contributions
vg+0.170
irst+0.160
ANK+0.130
(-+0.129
iller+0.128
Neg logit contributions
drivers-0.151
AFTA-0.146
 IEEE-0.142
bee-0.140
ratulations-0.138
Token Wilson
Token ID8127
Feature activation-8.75e-02

Pos logit contributions
drivers+0.111
AFTA+0.106
 IEEE+0.104
bee+0.101
ulus+0.100
Neg logit contributions
vg-0.160
irst-0.151
ANK-0.126
iller-0.123
(--0.122
Token.
Token ID13
Feature activation+2.07e-03

Pos logit contributions
drivers+0.066
AFTA+0.064
 IEEE+0.062
bee+0.061
ulus+0.061
Neg logit contributions
vg-0.066
irst-0.062
ANK-0.050
(--0.049
iller-0.049
Token
Token ID198
Feature activation+6.49e-03

Pos logit contributions
vg+0.002
irst+0.001
ANK+0.001
(-+0.001
iller+0.001
Neg logit contributions
drivers-0.002
AFTA-0.001
 IEEE-0.001
bee-0.001
ratulations-0.001
Token
Token ID198
Feature activation+6.57e-03

Pos logit contributions
vg+0.005
irst+0.005
ANK+0.004
iller+0.004
(-+0.004
Neg logit contributions
drivers-0.005
AFTA-0.005
 IEEE-0.005
bee-0.005
ulus-0.004
TokenR
Token ID49
Feature activation+1.47e-01

Pos logit contributions
vg+0.004
irst+0.004
ANK+0.003
iller+0.003
(-+0.003
Neg logit contributions
drivers-0.006
AFTA-0.005
 IEEE-0.005
bee-0.005
ulus-0.005
TokenANK
Token ID15154
Feature activation+4.87e-01

Pos logit contributions
vg+0.029
irst+0.022
ANK+0.000
iller-0.000
(--0.001
Neg logit contributions
drivers-0.192
AFTA-0.189
 IEEE-0.186
bee-0.185
ulus-0.184
Token 4
Token ID604
Feature activation-9.99e-02

Pos logit contributions
vg+0.433
irst+0.411
ANK+0.346
(-+0.338
iller+0.337
Neg logit contributions
drivers-0.299
AFTA-0.289
 IEEE-0.278
bee-0.274
ratulations-0.271
Token Xavier
Token ID30825
Feature activation+3.05e-01

Pos logit contributions
drivers+0.077
AFTA+0.075
 IEEE+0.073
bee+0.072
ulus+0.071
Neg logit contributions
vg-0.073
irst-0.069
ANK-0.055
iller-0.054
(--0.054
Token Rhodes
Token ID25656
Feature activation-1.84e-01

Pos logit contributions
vg+0.316
irst+0.302
ANK+0.259
iller+0.255
(-+0.254
Neg logit contributions
drivers-0.143
AFTA-0.136
 IEEE-0.131
bee-0.128
ulus-0.125
Token,
Token ID11
Feature activation-1.07e-03

Pos logit contributions
drivers+0.133
AFTA+0.129
 IEEE+0.126
bee+0.124
ulus+0.122
Neg logit contributions
vg-0.143
irst-0.135
ANK-0.109
iller-0.107
(--0.106
Token Vikings
Token ID16420
Feature activation+1.36e-01

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
(--0.001
iller-0.001
Token season
Token ID1622
Feature activation-5.12e-02

Pos logit contributions
drivers+0.027
AFTA+0.026
 IEEE+0.026
bee+0.025
ulus+0.025
Neg logit contributions
vg-0.032
irst-0.031
ANK-0.025
iller-0.025
(--0.025
Token.)
Token ID2014
Feature activation+4.23e-02

Pos logit contributions
drivers+0.039
AFTA+0.038
 IEEE+0.037
bee+0.036
ulus+0.036
Neg logit contributions
vg-0.038
irst-0.036
ANK-0.029
iller-0.028
(--0.028
Token
Token ID198
Feature activation+3.22e-03

Pos logit contributions
vg+0.031
irst+0.029
ANK+0.023
iller+0.023
(-+0.023
Neg logit contributions
drivers-0.032
AFTA-0.031
 IEEE-0.031
bee-0.030
ulus-0.030
Token
Token ID198
Feature activation+8.09e-03

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
TokenR
Token ID49
Feature activation+1.42e-01

Pos logit contributions
vg+0.006
irst+0.005
ANK+0.004
iller+0.004
(-+0.004
Neg logit contributions
drivers-0.007
AFTA-0.006
 IEEE-0.006
bee-0.006
ulus-0.006
TokenANK
Token ID15154
Feature activation+4.84e-01

Pos logit contributions
vg+0.027
irst+0.021
ANK+0.001
iller-0.001
(--0.001
Neg logit contributions
drivers-0.186
AFTA-0.183
 IEEE-0.181
bee-0.179
ulus-0.178
Token 3
Token ID513
Feature activation-2.71e-02

Pos logit contributions
vg+0.391
irst+0.372
ANK+0.304
(-+0.297
iller+0.297
Neg logit contributions
drivers-0.336
AFTA-0.325
 IEEE-0.317
bee-0.312
ulus-0.307
Token Patrick
Token ID9925
Feature activation+1.37e-01

Pos logit contributions
drivers+0.021
AFTA+0.020
 IEEE+0.020
bee+0.019
ulus+0.019
Neg logit contributions
vg-0.020
irst-0.019
ANK-0.015
(--0.015
iller-0.015
Token Peterson
Token ID18914
Feature activation-2.01e-01

Pos logit contributions
vg+0.137
irst+0.130
ANK+0.112
(-+0.110
iller+0.110
Neg logit contributions
drivers-0.069
AFTA-0.066
 IEEE-0.063
bee-0.062
ulus-0.061
Token,
Token ID11
Feature activation+1.04e-03

Pos logit contributions
drivers+0.145
AFTA+0.140
 IEEE+0.137
bee+0.135
ulus+0.133
Neg logit contributions
vg-0.157
irst-0.149
ANK-0.120
iller-0.117
(--0.117
Token Cardinals
Token ID18071
Feature activation+1.13e-01

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.001
(-+0.001
iller+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ratulations-0.001
Token awaits
Token ID37180
Feature activation-1.37e-01

Pos logit contributions
drivers+0.077
AFTA+0.074
 IEEE+0.073
bee+0.072
ulus+0.071
Neg logit contributions
vg-0.086
irst-0.081
ANK-0.066
iller-0.064
(--0.064
Token.
Token ID13
Feature activation-1.86e-03

Pos logit contributions
drivers+0.102
AFTA+0.099
 IEEE+0.096
bee+0.095
ulus+0.094
Neg logit contributions
vg-0.105
irst-0.099
ANK-0.079
iller-0.078
(--0.078
Token
Token ID198
Feature activation+3.38e-03

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
iller-0.001
(--0.001
Token
Token ID198
Feature activation-4.29e-03

Pos logit contributions
vg+0.003
irst+0.002
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.003
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
TokenR
Token ID49
Feature activation+1.47e-01

Pos logit contributions
drivers+0.004
AFTA+0.004
 IEEE+0.004
bee+0.004
ulus+0.003
Neg logit contributions
vg-0.003
irst-0.003
ANK-0.002
iller-0.002
 decay-0.002
TokenANK
Token ID15154
Feature activation+4.83e-01

Pos logit contributions
vg+0.029
irst+0.022
ANK+0.000
iller-0.000
(--0.001
Neg logit contributions
drivers-0.191
AFTA-0.188
 IEEE-0.186
bee-0.184
ulus-0.183
Token 6
Token ID718
Feature activation-1.22e-01

Pos logit contributions
vg+0.442
irst+0.420
ANK+0.353
(-+0.347
iller+0.346
Neg logit contributions
drivers-0.284
AFTA-0.273
 IEEE-0.264
bee-0.259
ulus-0.255
Token Jal
Token ID28649
Feature activation+2.16e-02

Pos logit contributions
drivers+0.095
AFTA+0.092
 IEEE+0.090
bee+0.089
ulus+0.088
Neg logit contributions
vg-0.088
irst-0.083
ANK-0.066
(--0.064
iller-0.064
Tokenen
Token ID268
Feature activation-6.36e-02

Pos logit contributions
vg+0.019
irst+0.018
ANK+0.015
iller+0.015
(-+0.014
Neg logit contributions
drivers-0.014
AFTA-0.013
 IEEE-0.013
bee-0.013
ulus-0.012
Token Ramsey
Token ID30552
Feature activation-2.26e-01

Pos logit contributions
drivers+0.028
AFTA+0.027
 IEEE+0.026
bee+0.026
 Hass+0.025
Neg logit contributions
vg-0.067
irst-0.064
ANK-0.055
BuyableInstoreAndOnline-0.055
iller-0.054
Token,
Token ID11
Feature activation-1.95e-03

Pos logit contributions
drivers+0.162
AFTA+0.157
 IEEE+0.154
bee+0.151
ulus+0.149
Neg logit contributions
vg-0.177
irst-0.167
ANK-0.135
iller-0.132
(--0.131
Token rankings
Token ID16905
Feature activation+2.52e-02

Pos logit contributions
drivers+0.006
AFTA+0.006
 IEEE+0.006
bee+0.006
ulus+0.006
Neg logit contributions
vg-0.007
irst-0.006
ANK-0.005
iller-0.005
(--0.005
Token.
Token ID13
Feature activation-5.67e-03

Pos logit contributions
vg+0.019
irst+0.018
ANK+0.015
iller+0.014
(-+0.014
Neg logit contributions
drivers-0.019
AFTA-0.018
 IEEE-0.018
bee-0.017
ulus-0.017
Token
Token ID198
Feature activation+5.22e-03

Pos logit contributions
drivers+0.004
AFTA+0.004
 IEEE+0.004
bee+0.004
ulus+0.004
Neg logit contributions
vg-0.004
irst-0.004
ANK-0.003
iller-0.003
(--0.003
Token
Token ID198
Feature activation-4.33e-03

Pos logit contributions
vg+0.004
irst+0.004
ANK+0.003
iller+0.003
(-+0.003
Neg logit contributions
drivers-0.004
AFTA-0.004
 IEEE-0.004
bee-0.004
ulus-0.004
TokenR
Token ID49
Feature activation+1.38e-01

Pos logit contributions
drivers+0.003
AFTA+0.003
 IEEE+0.003
bee+0.003
ulus+0.003
Neg logit contributions
vg-0.003
irst-0.003
ANK-0.003
(--0.003
iller-0.003
TokenANK
Token ID15154
Feature activation+4.81e-01

Pos logit contributions
vg+0.104
irst+0.099
ANK+0.080
iller+0.077
(-+0.077
Neg logit contributions
drivers-0.104
AFTA-0.101
 IEEE-0.099
bee-0.097
ratulations-0.096
Token 1
Token ID352
Feature activation-1.14e-01

Pos logit contributions
vg+0.359
irst+0.339
ANK+0.275
(-+0.268
iller+0.265
Neg logit contributions
drivers-0.364
AFTA-0.353
 IEEE-0.342
bee-0.338
ratulations-0.335
Token Josh
Token ID8518
Feature activation+1.45e-01

Pos logit contributions
drivers+0.085
AFTA+0.082
 IEEE+0.080
bee+0.079
ratulations+0.078
Neg logit contributions
vg-0.086
irst-0.081
ANK-0.066
(--0.064
iller-0.064
Token Norman
Token ID19636
Feature activation-7.85e-02

Pos logit contributions
vg+0.125
irst+0.118
ANK+0.097
iller+0.096
(-+0.096
Neg logit contributions
drivers-0.093
AFTA-0.089
 IEEE-0.087
bee-0.086
ulus-0.085
Token,
Token ID11
Feature activation-5.71e-03

Pos logit contributions
drivers+0.055
AFTA+0.054
 IEEE+0.052
bee+0.051
ulus+0.051
Neg logit contributions
vg-0.063
irst-0.059
ANK-0.048
(--0.047
iller-0.047
Token Redskins
Token ID22038
Feature activation+8.42e-02

Pos logit contributions
drivers+0.004
AFTA+0.004
 IEEE+0.004
bee+0.004
ulus+0.004
Neg logit contributions
vg-0.005
irst-0.004
ANK-0.004
(--0.004
iller-0.004
Tokeniggs
Token ID20340
Feature activation-7.50e-02

Pos logit contributions
drivers+0.007
AFTA+0.007
 IEEE+0.006
bee+0.006
ulus+0.006
Neg logit contributions
vg-0.012
irst-0.012
BuyableInstoreAndOnline-0.010
istar-0.010
 drip-0.010
Token.
Token ID13
Feature activation-2.73e-04

Pos logit contributions
drivers+0.054
AFTA+0.053
 IEEE+0.051
bee+0.051
ulus+0.050
Neg logit contributions
vg-0.058
irst-0.055
ANK-0.045
iller-0.044
(--0.043
Token
Token ID198
Feature activation+2.13e-03

Pos logit contributions
drivers+0.000
AFTA+0.000
 IEEE+0.000
bee+0.000
ulus+0.000
Neg logit contributions
vg-0.000
irst-0.000
ANK-0.000
iller-0.000
(--0.000
Token
Token ID198
Feature activation+6.54e-04

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.001
ulus-0.001
TokenR
Token ID49
Feature activation+1.41e-01

Pos logit contributions
vg+0.000
irst+0.000
ANK+0.000
iller+0.000
 decay+0.000
Neg logit contributions
drivers-0.000
AFTA-0.000
 IEEE-0.000
bee-0.000
ulus-0.000
TokenANK
Token ID15154
Feature activation+4.81e-01

Pos logit contributions
vg+0.051
irst+0.044
ANK+0.024
iller+0.023
(-+0.023
Neg logit contributions
drivers-0.161
AFTA-0.158
 IEEE-0.155
bee-0.154
ulus-0.153
Token 2
Token ID362
Feature activation-8.78e-03

Pos logit contributions
vg+0.375
irst+0.354
ANK+0.288
iller+0.281
(-+0.280
Neg logit contributions
drivers-0.349
AFTA-0.337
 IEEE-0.328
bee-0.324
ulus-0.319
Token Richard
Token ID6219
Feature activation-6.03e-02

Pos logit contributions
drivers+0.007
AFTA+0.007
 IEEE+0.006
bee+0.006
ulus+0.006
Neg logit contributions
vg-0.006
irst-0.006
ANK-0.005
iller-0.005
(--0.005
Token Sherman
Token ID23198
Feature activation-8.94e-02

Pos logit contributions
drivers+0.037
AFTA+0.036
 IEEE+0.035
bee+0.034
ulus+0.034
Neg logit contributions
vg-0.054
irst-0.051
ANK-0.042
(--0.042
iller-0.042
Token,
Token ID11
Feature activation-6.12e-03

Pos logit contributions
drivers+0.066
AFTA+0.064
 IEEE+0.063
bee+0.062
ulus+0.061
Neg logit contributions
vg-0.068
irst-0.064
ANK-0.052
iller-0.051
(--0.051
Token Seahawks
Token ID16732
Feature activation-2.20e-01

Pos logit contributions
drivers+0.003
AFTA+0.003
 IEEE+0.003
bee+0.003
ratulations+0.003
Neg logit contributions
vg-0.006
irst-0.006
ANK-0.005
(--0.005
iller-0.005
Token a
Token ID257
Feature activation-8.65e-03

Pos logit contributions
drivers+0.006
AFTA+0.006
 IEEE+0.006
bee+0.005
ulus+0.005
Neg logit contributions
vg-0.006
irst-0.006
ANK-0.005
iller-0.005
(--0.005
Token molecule
Token ID27756
Feature activation+6.09e-02

Pos logit contributions
drivers+0.006
AFTA+0.006
 IEEE+0.006
bee+0.006
ulus+0.006
Neg logit contributions
vg-0.007
irst-0.007
ANK-0.005
iller-0.005
(--0.005
Token expressed
Token ID6241
Feature activation-1.07e-02

Pos logit contributions
vg+0.047
irst+0.044
ANK+0.036
iller+0.035
(-+0.035
Neg logit contributions
drivers-0.045
AFTA-0.043
 IEEE-0.042
bee-0.041
ulus-0.041
Token as
Token ID355
Feature activation+2.90e-02

Pos logit contributions
drivers+0.008
AFTA+0.008
 IEEE+0.007
bee+0.007
ulus+0.007
Neg logit contributions
vg-0.008
irst-0.008
ANK-0.006
iller-0.006
(--0.006
Token exp
Token ID1033
Feature activation-7.44e-02

Pos logit contributions
vg+0.022
irst+0.021
ANK+0.017
(-+0.017
iller+0.016
Neg logit contributions
drivers-0.022
AFTA-0.021
 IEEE-0.020
bee-0.020
visors-0.020
Token(-
Token ID32590
Feature activation+4.78e-01

Pos logit contributions
drivers+0.062
AFTA+0.060
 IEEE+0.059
bee+0.058
ulus+0.058
Neg logit contributions
vg-0.050
irst-0.047
ANK-0.036
iller-0.035
 drip-0.035
Tokent
Token ID83
Feature activation-6.51e-02

Pos logit contributions
vg+0.361
irst+0.340
ANK+0.272
(-+0.267
iller+0.266
Neg logit contributions
drivers-0.359
AFTA-0.348
 IEEE-0.340
bee-0.334
ratulations-0.330
Token/
Token ID14
Feature activation-1.78e-01

Pos logit contributions
drivers+0.050
AFTA+0.049
 IEEE+0.048
bee+0.047
ulus+0.047
Neg logit contributions
vg-0.047
irst-0.044
ANK-0.035
iller-0.034
(--0.034
Tokenu
Token ID84
Feature activation-2.01e-02

Pos logit contributions
drivers+0.130
AFTA+0.126
 IEEE+0.123
bee+0.121
ulus+0.119
Neg logit contributions
vg-0.138
irst-0.130
ANK-0.105
iller-0.102
(--0.102
Token)
Token ID8
Feature activation+2.32e-02

Pos logit contributions
drivers+0.016
AFTA+0.015
 IEEE+0.015
bee+0.015
ulus+0.014
Neg logit contributions
vg-0.015
irst-0.014
ANK-0.011
iller-0.011
(--0.011
Token where
Token ID810
Feature activation+4.71e-02

Pos logit contributions
vg+0.018
irst+0.017
ANK+0.013
iller+0.013
(-+0.013
Neg logit contributions
drivers-0.017
AFTA-0.017
 IEEE-0.016
bee-0.016
ulus-0.016
Token headed
Token ID9153
Feature activation-2.48e-02

Pos logit contributions
vg+0.108
irst+0.103
ANK+0.083
(-+0.082
iller+0.082
Neg logit contributions
drivers-0.093
AFTA-0.090
 IEEE-0.088
bee-0.086
ratulations-0.085
Token chance
Token ID2863
Feature activation+1.75e-01

Pos logit contributions
drivers+0.017
AFTA+0.016
 IEEE+0.015
bee+0.015
ratulations+0.015
Neg logit contributions
vg-0.021
irst-0.020
ANK-0.016
(--0.016
iller-0.016
Token and
Token ID290
Feature activation-7.53e-03

Pos logit contributions
vg+0.133
irst+0.126
ANK+0.101
(-+0.099
iller+0.099
Neg logit contributions
drivers-0.130
AFTA-0.126
 IEEE-0.123
bee-0.121
ratulations-0.120
Token forcing
Token ID10833
Feature activation-5.61e-02

Pos logit contributions
drivers+0.005
AFTA+0.005
 IEEE+0.005
bee+0.005
ulus+0.005
Neg logit contributions
vg-0.006
irst-0.006
ANK-0.004
(--0.004
iller-0.004
Token Gu
Token ID1962
Feature activation-5.21e-02

Pos logit contributions
drivers+0.042
AFTA+0.040
 IEEE+0.039
bee+0.039
ulus+0.038
Neg logit contributions
vg-0.043
irst-0.040
ANK-0.032
iller-0.032
(--0.032
Tokeniller
Token ID4665
Feature activation+4.77e-01

Pos logit contributions
drivers+0.064
AFTA+0.063
 IEEE+0.062
bee+0.061
ulus+0.061
Neg logit contributions
vg-0.014
irst-0.012
ANK-0.005
(--0.004
 drip-0.004
Tokenmo
Token ID5908
Feature activation+3.76e-02

Pos logit contributions
vg+0.379
irst+0.359
ANK+0.298
(-+0.288
BuyableInstoreAndOnline+0.286
Neg logit contributions
drivers-0.333
AFTA-0.322
 IEEE-0.316
bee-0.312
visors-0.306
Token O
Token ID440
Feature activation-4.65e-02

Pos logit contributions
vg+0.029
irst+0.028
ANK+0.022
iller+0.022
(-+0.022
Neg logit contributions
drivers-0.027
AFTA-0.026
 IEEE-0.026
bee-0.026
ulus-0.025
Tokencho
Token ID6679
Feature activation+2.24e-02

Pos logit contributions
drivers+0.034
AFTA+0.032
 IEEE+0.032
bee+0.032
ulus+0.031
Neg logit contributions
vg-0.036
irst-0.034
ANK-0.028
(--0.027
 decay-0.027
Tokena
Token ID64
Feature activation-5.52e-02

Pos logit contributions
vg+0.019
irst+0.018
ANK+0.015
iller+0.015
(-+0.015
Neg logit contributions
drivers-0.015
AFTA-0.014
 IEEE-0.014
bee-0.013
ulus-0.013
Token into
Token ID656
Feature activation+5.41e-02

Pos logit contributions
drivers+0.041
AFTA+0.040
 IEEE+0.039
bee+0.038
ulus+0.038
Neg logit contributions
vg-0.042
irst-0.040
ANK-0.032
iller-0.031
(--0.031
Token
Token ID447
Feature activation-4.31e-03

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
Token
Token ID251
Feature activation+2.79e-03

Pos logit contributions
drivers+0.003
AFTA+0.003
 IEEE+0.003
bee+0.003
ulus+0.003
Neg logit contributions
vg-0.003
irst-0.003
ANK-0.003
(--0.003
iller-0.003
Token Ms
Token ID6997
Feature activation-6.12e-02

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
Token.
Token ID13
Feature activation-1.32e-04

Pos logit contributions
drivers+0.045
AFTA+0.044
 IEEE+0.043
bee+0.042
ulus+0.042
Neg logit contributions
vg-0.047
irst-0.044
ANK-0.035
iller-0.035
BuyableInstoreAndOnline-0.034
Token Set
Token ID5345
Feature activation-1.99e-01

Pos logit contributions
drivers+0.000
AFTA+0.000
 IEEE+0.000
bee+0.000
ulus+0.000
Neg logit contributions
vg-0.000
irst-0.000
ANK-0.000
iller-0.000
(--0.000
Tokenzer
Token ID9107
Feature activation+4.76e-01

Pos logit contributions
drivers+0.254
AFTA+0.247
bee+0.245
 IEEE+0.244
ulus+0.242
Neg logit contributions
vg-0.046
irst-0.037
ANK-0.010
BuyableInstoreAndOnline-0.009
 decay-0.006
Token said
Token ID531
Feature activation+6.63e-03

Pos logit contributions
vg+0.367
irst+0.347
ANK+0.280
iller+0.274
(-+0.273
Neg logit contributions
drivers-0.348
AFTA-0.337
 IEEE-0.329
bee-0.324
ulus-0.319
Token.
Token ID13
Feature activation-1.86e-03

Pos logit contributions
vg+0.005
irst+0.005
ANK+0.004
iller+0.004
(-+0.004
Neg logit contributions
drivers-0.005
AFTA-0.005
 IEEE-0.005
bee-0.005
ulus-0.005
Token
Token ID564
Feature activation+3.05e-03

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
iller-0.001
(--0.001
Token
Token ID250
Feature activation-1.42e-02

Pos logit contributions
vg+0.003
irst+0.002
ANK+0.002
(-+0.002
iller+0.002
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
TokenThese
Token ID4711
Feature activation-2.12e-01

Pos logit contributions
drivers+0.010
AFTA+0.010
 IEEE+0.009
bee+0.009
ulus+0.009
Neg logit contributions
vg-0.012
irst-0.011
ANK-0.009
iller-0.009
(--0.009
Token As
Token ID1081
Feature activation-5.67e-02

Pos logit contributions
vg+0.004
irst+0.003
ANK+0.003
iller+0.003
(-+0.003
Neg logit contributions
drivers-0.003
AFTA-0.003
 IEEE-0.003
bee-0.003
ulus-0.003
Token my
Token ID616
Feature activation+3.15e-02

Pos logit contributions
drivers+0.042
AFTA+0.040
 IEEE+0.040
bee+0.039
ulus+0.038
Neg logit contributions
vg-0.044
irst-0.041
ANK-0.033
iller-0.032
(--0.032
Token colleague
Token ID16008
Feature activation-1.81e-01

Pos logit contributions
vg+0.028
irst+0.027
ANK+0.022
iller+0.022
(-+0.022
Neg logit contributions
drivers-0.019
AFTA-0.019
 IEEE-0.018
bee-0.018
ulus-0.017
Token Nicole
Token ID25176
Feature activation+1.05e-01

Pos logit contributions
drivers+0.142
AFTA+0.138
 IEEE+0.135
bee+0.133
ulus+0.131
Neg logit contributions
vg-0.130
irst-0.122
ANK-0.097
iller-0.094
(--0.094
Token O
Token ID440
Feature activation-4.31e-02

Pos logit contributions
vg+0.080
irst+0.075
ANK+0.060
iller+0.059
(-+0.059
Neg logit contributions
drivers-0.079
AFTA-0.076
 IEEE-0.074
bee-0.073
ulus-0.072
Tokenzer
Token ID9107
Feature activation+4.75e-01

Pos logit contributions
drivers+0.034
AFTA+0.033
 IEEE+0.032
bee+0.031
ulus+0.031
Neg logit contributions
vg-0.031
irst-0.029
ANK-0.023
iller-0.023
(--0.023
Token has
Token ID468
Feature activation-7.77e-02

Pos logit contributions
vg+0.390
irst+0.371
ANK+0.303
iller+0.299
(-+0.298
Neg logit contributions
drivers-0.323
AFTA-0.313
 IEEE-0.304
bee-0.296
ulus-0.294
Token discussed
Token ID6693
Feature activation-3.21e-02

Pos logit contributions
drivers+0.054
AFTA+0.052
 IEEE+0.051
bee+0.050
ulus+0.049
Neg logit contributions
vg-0.063
irst-0.060
ANK-0.049
iller-0.048
(--0.047
Token,
Token ID11
Feature activation+8.48e-04

Pos logit contributions
drivers+0.024
AFTA+0.024
 IEEE+0.023
bee+0.023
ulus+0.023
Neg logit contributions
vg-0.024
irst-0.022
ANK-0.018
iller-0.017
(--0.017
Token there
Token ID612
Feature activation+3.04e-02

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.000
iller+0.000
(-+0.000
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
Token is
Token ID318
Feature activation+2.92e-02

Pos logit contributions
vg+0.024
irst+0.022
ANK+0.018
iller+0.018
(-+0.018
Neg logit contributions
drivers-0.022
AFTA-0.021
 IEEE-0.021
bee-0.020
ulus-0.020
Token THE
Token ID3336
Feature activation+1.06e-02

Pos logit contributions
drivers+0.025
AFTA+0.024
 IEEE+0.024
bee+0.023
ulus+0.023
Neg logit contributions
vg-0.026
irst-0.024
ANK-0.020
(--0.019
iller-0.019
Token L
Token ID406
Feature activation+4.29e-02

Pos logit contributions
vg+0.009
irst+0.008
ANK+0.007
(-+0.006
iller+0.006
Neg logit contributions
drivers-0.007
AFTA-0.007
 IEEE-0.007
bee-0.007
ulus-0.007
TokenAND
Token ID6981
Feature activation+1.52e-01

Pos logit contributions
vg+0.036
irst+0.034
ANK+0.028
(-+0.028
iller+0.028
Neg logit contributions
drivers-0.029
AFTA-0.027
 IEEE-0.027
bee-0.026
visors-0.026
Token OF
Token ID3963
Feature activation+5.38e-02

Pos logit contributions
vg+0.116
irst+0.110
ANK+0.088
iller+0.086
(-+0.086
Neg logit contributions
drivers-0.112
AFTA-0.108
 IEEE-0.105
bee-0.104
ulus-0.103
Token EXT
Token ID27489
Feature activation-1.75e-01

Pos logit contributions
vg+0.042
irst+0.039
ANK+0.032
iller+0.031
(-+0.031
Neg logit contributions
drivers-0.039
AFTA-0.038
 IEEE-0.037
bee-0.037
ulus-0.036
TokenREM
Token ID40726
Feature activation+4.75e-01

Pos logit contributions
drivers+0.122
AFTA+0.118
 IEEE+0.115
bee+0.113
ulus+0.112
Neg logit contributions
vg-0.141
irst-0.133
ANK-0.108
iller-0.106
(--0.106
TokenELY
Token ID30943
Feature activation-1.34e-01

Pos logit contributions
vg+0.437
irst+0.416
ANK+0.349
iller+0.343
(-+0.343
Neg logit contributions
drivers-0.276
AFTA-0.266
 IEEE-0.257
bee-0.252
ulus-0.248
Token EXP
Token ID25703
Feature activation-1.42e-03

Pos logit contributions
drivers+0.094
AFTA+0.091
 IEEE+0.089
bee+0.088
ulus+0.086
Neg logit contributions
vg-0.107
irst-0.102
ANK-0.082
iller-0.081
(--0.081
TokenENS
Token ID16938
Feature activation-8.17e-03

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
iller-0.001
(--0.001
TokenIVE
Token ID9306
Feature activation+3.54e-02

Pos logit contributions
drivers+0.006
AFTA+0.006
 IEEE+0.006
bee+0.006
ulus+0.005
Neg logit contributions
vg-0.006
irst-0.006
ANK-0.005
iller-0.005
(--0.005
Token BE
Token ID9348
Feature activation+1.41e-02

Pos logit contributions
vg+0.028
irst+0.026
ANK+0.021
iller+0.021
(-+0.021
Neg logit contributions
drivers-0.025
AFTA-0.024
 IEEE-0.024
bee-0.024
ulus-0.023
Tokenjas
Token ID28121
Feature activation-1.54e-01

Pos logit contributions
drivers+0.124
AFTA+0.118
 IEEE+0.114
bee+0.111
ratulations+0.109
Neg logit contributions
vg-0.214
irst-0.205
iller-0.172
ANK-0.171
zer-0.171
Token Gil
Token ID10689
Feature activation-1.60e-01

Pos logit contributions
drivers+0.112
AFTA+0.109
 IEEE+0.106
bee+0.105
ulus+0.103
Neg logit contributions
vg-0.119
irst-0.112
ANK-0.090
iller-0.088
(--0.088
Token (
Token ID357
Feature activation+2.97e-02

Pos logit contributions
drivers+0.120
AFTA+0.116
 IEEE+0.113
bee+0.112
ulus+0.110
Neg logit contributions
vg-0.121
irst-0.114
ANK-0.091
iller-0.090
(--0.089
TokenM
Token ID44
Feature activation+1.38e-01

Pos logit contributions
vg+0.024
irst+0.023
ANK+0.018
iller+0.018
(-+0.018
Neg logit contributions
drivers-0.021
AFTA-0.020
 IEEE-0.020
bee-0.019
ulus-0.019
Tokenov
Token ID709
Feature activation-1.07e-01

Pos logit contributions
vg+0.122
irst+0.116
ANK+0.097
iller+0.095
(-+0.095
Neg logit contributions
drivers-0.085
AFTA-0.082
 IEEE-0.080
bee-0.078
ratulations-0.077
Tokenistar
Token ID47229
Feature activation+4.74e-01

Pos logit contributions
drivers+0.135
AFTA+0.133
 IEEE+0.131
bee+0.130
ulus+0.130
Neg logit contributions
vg-0.025
irst-0.020
ANK-0.006
(--0.004
BuyableInstoreAndOnline-0.004
Token Team
Token ID4816
Feature activation+2.56e-01

Pos logit contributions
vg+0.253
irst+0.231
ANK+0.164
iller+0.158
(-+0.157
Neg logit contributions
drivers-0.460
AFTA-0.449
 IEEE-0.442
bee-0.437
ulus-0.433
Token)
Token ID8
Feature activation+2.71e-02

Pos logit contributions
vg+0.186
irst+0.175
ANK+0.139
iller+0.135
(-+0.135
Neg logit contributions
drivers-0.198
AFTA-0.193
 IEEE-0.188
bee-0.186
ulus-0.183
Token s
Token ID264
Feature activation-1.19e-01

Pos logit contributions
vg+0.024
irst+0.023
ANK+0.019
iller+0.019
(-+0.019
Neg logit contributions
drivers-0.017
AFTA-0.016
 IEEE-0.016
bee-0.015
ulus-0.015
Token.
Token ID13
Feature activation-1.68e-04

Pos logit contributions
drivers+0.088
AFTA+0.085
 IEEE+0.084
bee+0.083
ulus+0.081
Neg logit contributions
vg-0.090
irst-0.085
ANK-0.068
iller-0.067
(--0.066
Tokent
Token ID83
Feature activation-5.10e-02

Pos logit contributions
drivers+0.000
AFTA+0.000
 IEEE+0.000
bee+0.000
ulus+0.000
Neg logit contributions
vg-0.000
irst-0.000
ANK-0.000
iller-0.000
(--0.000
Token kind
Token ID1611
Feature activation+8.92e-02

Pos logit contributions
vg+0.014
irst+0.013
ANK+0.010
iller+0.010
(-+0.010
Neg logit contributions
drivers-0.015
AFTA-0.015
 IEEE-0.015
bee-0.014
ulus-0.014
Token of
Token ID286
Feature activation+1.28e-02

Pos logit contributions
vg+0.069
irst+0.065
ANK+0.053
iller+0.051
(-+0.051
Neg logit contributions
drivers-0.065
AFTA-0.063
 IEEE-0.061
bee-0.061
ulus-0.060
Token
Token ID564
Feature activation+8.37e-03

Pos logit contributions
vg+0.010
irst+0.009
ANK+0.008
iller+0.007
(-+0.007
Neg logit contributions
drivers-0.009
AFTA-0.009
 IEEE-0.009
bee-0.009
ulus-0.008
Token
Token ID250
Feature activation-7.11e-03

Pos logit contributions
vg+0.007
irst+0.006
ANK+0.005
iller+0.005
(-+0.005
Neg logit contributions
drivers-0.006
AFTA-0.006
 IEEE-0.006
bee-0.006
ulus-0.005
Tokenf
Token ID69
Feature activation-1.74e-02

Pos logit contributions
drivers+0.005
AFTA+0.005
 IEEE+0.005
bee+0.004
ulus+0.004
Neg logit contributions
vg-0.006
irst-0.006
ANK-0.005
iller-0.004
(--0.004
Tokeniller
Token ID4665
Feature activation+4.74e-01

Pos logit contributions
drivers+0.013
AFTA+0.013
 IEEE+0.013
bee+0.013
ulus+0.012
Neg logit contributions
vg-0.013
irst-0.012
ANK-0.010
(--0.009
iller-0.009
Token
Token ID447
Feature activation+1.41e-03

Pos logit contributions
vg+0.361
irst+0.339
ANK+0.273
iller+0.267
(-+0.266
Neg logit contributions
drivers-0.352
AFTA-0.341
 IEEE-0.333
bee-0.328
ulus-0.324
Token
Token ID251
Feature activation-2.99e-03

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.001
(-+0.001
iller+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
Token between
Token ID1022
Feature activation-2.29e-03

Pos logit contributions
drivers+0.002
AFTA+0.002
 IEEE+0.002
bee+0.002
ulus+0.002
Neg logit contributions
vg-0.002
irst-0.002
ANK-0.002
iller-0.002
(--0.002
Token moves
Token ID6100
Feature activation-1.45e-02

Pos logit contributions
drivers+0.002
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.002
irst-0.002
ANK-0.001
(--0.001
iller-0.001
Token,
Token ID11
Feature activation-2.51e-03

Pos logit contributions
drivers+0.011
AFTA+0.010
 IEEE+0.010
bee+0.010
ulus+0.010
Neg logit contributions
vg-0.011
irst-0.010
ANK-0.008
iller-0.008
(--0.008
Token Facebook
Token ID3203
Feature activation-4.48e-02

Pos logit contributions
vg+0.025
irst+0.024
ANK+0.020
iller+0.019
(-+0.019
Neg logit contributions
drivers-0.019
AFTA-0.019
 IEEE-0.018
bee-0.018
ulus-0.018
Token)
Token ID8
Feature activation+4.59e-02

Pos logit contributions
drivers+0.036
AFTA+0.035
 IEEE+0.034
bee+0.033
ratulations+0.033
Neg logit contributions
vg-0.032
irst-0.030
ANK-0.024
(--0.023
iller-0.023
TokenI
Token ID40
Feature activation+3.00e-02

Pos logit contributions
vg+0.034
irst+0.032
ANK+0.026
(-+0.025
iller+0.025
Neg logit contributions
drivers-0.035
AFTA-0.034
 IEEE-0.033
bee-0.033
ratulations-0.032
Tokenhor
Token ID17899
Feature activation+5.75e-02

Pos logit contributions
vg+0.026
irst+0.024
ANK+0.020
iller+0.020
(-+0.020
Neg logit contributions
drivers-0.019
AFTA-0.019
 IEEE-0.018
bee-0.018
ulus-0.018
Token Mal
Token ID4434
Feature activation-6.17e-03

Pos logit contributions
vg+0.043
irst+0.040
ANK+0.032
iller+0.031
(-+0.031
Neg logit contributions
drivers-0.044
AFTA-0.043
 IEEE-0.042
bee-0.041
ulus-0.041
Tokenanch
Token ID3702
Feature activation+4.73e-01

Pos logit contributions
drivers+0.008
AFTA+0.008
 IEEE+0.007
bee+0.007
ulus+0.007
Neg logit contributions
vg-0.002
irst-0.001
ANK-0.000
BuyableInstoreAndOnline-0.000
iller-0.000
Tokenak
Token ID461
Feature activation+3.65e-02

Pos logit contributions
vg+0.354
irst+0.333
ANK+0.266
iller+0.260
(-+0.260
Neg logit contributions
drivers-0.358
AFTA-0.347
 IEEE-0.339
bee-0.334
ulus-0.330
Token is
Token ID318
Feature activation+4.41e-02

Pos logit contributions
vg+0.028
irst+0.026
ANK+0.021
iller+0.021
(-+0.021
Neg logit contributions
drivers-0.027
AFTA-0.026
 IEEE-0.026
bee-0.025
ulus-0.025
Token the
Token ID262
Feature activation+6.67e-03

Pos logit contributions
vg+0.034
irst+0.032
ANK+0.025
iller+0.025
(-+0.025
Neg logit contributions
drivers-0.033
AFTA-0.032
 IEEE-0.031
bee-0.030
ulus-0.030
Token CEO
Token ID6123
Feature activation-6.84e-02

Pos logit contributions
vg+0.005
irst+0.005
ANK+0.004
iller+0.004
(-+0.004
Neg logit contributions
drivers-0.005
AFTA-0.004
 IEEE-0.004
bee-0.004
ulus-0.004
Token of
Token ID286
Feature activation+1.48e-02

Pos logit contributions
drivers+0.049
AFTA+0.048
 IEEE+0.047
bee+0.046
ulus+0.045
Neg logit contributions
vg-0.054
irst-0.051
ANK-0.041
iller-0.040
(--0.040
Token fades
Token ID41381
Feature activation+2.17e-03

Pos logit contributions
drivers+0.014
AFTA+0.014
 IEEE+0.014
bee+0.013
ulus+0.013
Neg logit contributions
vg-0.016
irst-0.015
ANK-0.012
iller-0.012
(--0.012
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.001
Token
Token ID198
Feature activation-4.06e-03

Pos logit contributions
drivers+0.007
AFTA+0.007
 IEEE+0.007
bee+0.007
ulus+0.007
Neg logit contributions
vg-0.008
irst-0.007
ANK-0.006
iller-0.006
(--0.006
Token
Token ID198
Feature activation-1.80e-03

Pos logit contributions
drivers+0.003
AFTA+0.003
 IEEE+0.003
bee+0.003
ulus+0.003
Neg logit contributions
vg-0.003
irst-0.003
ANK-0.002
iller-0.002
(--0.002
TokenR
Token ID49
Feature activation+1.33e-01

Pos logit contributions
drivers+0.002
AFTA+0.002
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
iller-0.001
 decay-0.001
TokenANK
Token ID15154
Feature activation+4.72e-01

Pos logit contributions
vg+0.026
irst+0.020
ANK+0.000
iller-0.000
(--0.001
Neg logit contributions
drivers-0.174
AFTA-0.171
 IEEE-0.168
bee-0.167
ulus-0.166
Token 5
Token ID642
Feature activation-8.24e-02

Pos logit contributions
vg+0.408
irst+0.387
ANK+0.322
(-+0.315
iller+0.315
Neg logit contributions
drivers-0.302
AFTA-0.292
 IEEE-0.283
bee-0.278
ulus-0.274
Token Jimmy
Token ID12963
Feature activation+3.00e-02

Pos logit contributions
drivers+0.064
AFTA+0.062
 IEEE+0.060
bee+0.059
ulus+0.059
Neg logit contributions
vg-0.060
irst-0.057
ANK-0.045
(--0.044
iller-0.044
Token Smith
Token ID4176
Feature activation+4.13e-02

Pos logit contributions
vg+0.025
irst+0.024
ANK+0.019
iller+0.019
(-+0.019
Neg logit contributions
drivers-0.020
AFTA-0.019
 IEEE-0.019
bee-0.019
ulus-0.018
Token,
Token ID11
Feature activation-1.79e-02

Pos logit contributions
vg+0.032
irst+0.030
ANK+0.024
iller+0.024
(-+0.024
Neg logit contributions
drivers-0.030
AFTA-0.029
 IEEE-0.028
bee-0.028
ulus-0.028
Token Ravens
Token ID20091
Feature activation-1.97e-01

Pos logit contributions
drivers+0.012
AFTA+0.011
 IEEE+0.011
bee+0.011
ratulations+0.011
Neg logit contributions
vg-0.015
irst-0.014
ANK-0.012
(--0.012
iller-0.012
Token run
Token ID1057
Feature activation+1.26e-03

Pos logit contributions
vg+0.101
irst+0.095
ANK+0.076
(-+0.074
iller+0.074
Neg logit contributions
drivers-0.102
AFTA-0.099
 IEEE-0.096
bee-0.095
ulus-0.094
Token-
Token ID12
Feature activation+2.67e-02

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
Tokenoff
Token ID2364
Feature activation+3.60e-02

Pos logit contributions
vg+0.021
irst+0.020
ANK+0.016
iller+0.016
(-+0.016
Neg logit contributions
drivers-0.019
AFTA-0.019
 IEEE-0.018
bee-0.018
ulus-0.018
Token between
Token ID1022
Feature activation-4.07e-03

Pos logit contributions
vg+0.027
irst+0.026
ANK+0.021
(-+0.020
iller+0.020
Neg logit contributions
drivers-0.027
AFTA-0.026
 IEEE-0.025
bee-0.025
ulus-0.025
Token Gu
Token ID1962
Feature activation-4.50e-02

Pos logit contributions
drivers+0.003
AFTA+0.003
 IEEE+0.003
bee+0.003
ulus+0.003
Neg logit contributions
vg-0.003
irst-0.003
ANK-0.002
(--0.002
iller-0.002
Tokeniller
Token ID4665
Feature activation+4.71e-01

Pos logit contributions
drivers+0.033
AFTA+0.032
 IEEE+0.031
bee+0.031
ulus+0.030
Neg logit contributions
vg-0.035
irst-0.033
ANK-0.026
(--0.026
 decay-0.026
Tokenmo
Token ID5908
Feature activation+3.77e-02

Pos logit contributions
vg+0.375
irst+0.353
ANK+0.289
(-+0.284
 decay+0.280
Neg logit contributions
drivers-0.330
AFTA-0.322
 IEEE-0.313
bee-0.310
visors-0.304
Token L
Token ID406
Feature activation+4.61e-02

Pos logit contributions
vg+0.027
irst+0.025
ANK+0.020
(-+0.020
iller+0.020
Neg logit contributions
drivers-0.029
AFTA-0.029
 IEEE-0.028
bee-0.028
ulus-0.027
Tokenasso
Token ID28372
Feature activation+1.45e-01

Pos logit contributions
vg+0.038
irst+0.036
ANK+0.029
iller+0.028
(-+0.028
Neg logit contributions
drivers-0.032
AFTA-0.031
 IEEE-0.030
bee-0.029
ulus-0.029
Token and
Token ID290
Feature activation-2.51e-03

Pos logit contributions
vg+0.113
irst+0.107
ANK+0.086
iller+0.085
(-+0.085
Neg logit contributions
drivers-0.105
AFTA-0.101
 IEEE-0.098
bee-0.097
ulus-0.095
Token Lenin
Token ID22899
Feature activation+7.81e-02

Pos logit contributions
drivers+0.002
AFTA+0.002
 IEEE+0.002
bee+0.002
ulus+0.002
Neg logit contributions
vg-0.002
irst-0.002
ANK-0.001
iller-0.001
(--0.001
Token Von
Token ID26985
Feature activation-3.24e-01

Pos logit contributions
vg+0.000
irst+0.000
ANK+0.000
iller+0.000
(-+0.000
Neg logit contributions
drivers-0.000
AFTA-0.000
 IEEE-0.000
bee-0.000
ulus-0.000
Token Miller
Token ID7920
Feature activation+1.61e-01

Pos logit contributions
drivers+0.266
AFTA+0.259
 IEEE+0.254
bee+0.250
ulus+0.247
Neg logit contributions
vg-0.221
irst-0.206
ANK-0.161
iller-0.157
(--0.156
Token (@
Token ID4275
Feature activation+1.20e-01

Pos logit contributions
vg+0.108
irst+0.101
ANK+0.078
iller+0.076
(-+0.076
Neg logit contributions
drivers-0.134
AFTA-0.131
 IEEE-0.128
bee-0.126
ulus-0.125
Tokenvon
Token ID26982
Feature activation-3.09e-01

Pos logit contributions
vg+0.101
irst+0.096
ANK+0.079
iller+0.077
(-+0.077
Neg logit contributions
drivers-0.079
AFTA-0.077
 IEEE-0.075
bee-0.073
ulus-0.072
Tokenm
Token ID76
Feature activation+2.47e-02

Pos logit contributions
drivers+0.237
AFTA+0.228
 IEEE+0.223
bee+0.221
ulus+0.218
Neg logit contributions
vg-0.227
irst-0.214
ANK-0.171
(--0.166
iller-0.165
Tokeniller
Token ID4665
Feature activation+4.70e-01

Pos logit contributions
vg+0.009
irst+0.008
ANK+0.005
(-+0.004
BuyableInstoreAndOnline+0.004
Neg logit contributions
drivers-0.028
AFTA-0.027
 IEEE-0.027
bee-0.027
ulus-0.027
Token)
Token ID8
Feature activation+3.40e-02

Pos logit contributions
vg+0.356
irst+0.334
ANK+0.268
iller+0.263
(-+0.261
Neg logit contributions
drivers-0.351
AFTA-0.342
 IEEE-0.334
bee-0.327
ulus-0.323
Token on
Token ID319
Feature activation-9.32e-03

Pos logit contributions
vg+0.027
irst+0.026
ANK+0.021
iller+0.021
BuyableInstoreAndOnline+0.021
Neg logit contributions
drivers-0.024
AFTA-0.023
 IEEE-0.023
bee-0.022
ulus-0.022
Token Sep
Token ID8621
Feature activation-9.21e-03

Pos logit contributions
drivers+0.007
AFTA+0.007
 IEEE+0.007
bee+0.006
ulus+0.006
Neg logit contributions
vg-0.007
irst-0.007
ANK-0.005
iller-0.005
(--0.005
Token 30
Token ID1542
Feature activation+1.24e-02

Pos logit contributions
drivers+0.007
AFTA+0.006
 IEEE+0.006
bee+0.006
ulus+0.006
Neg logit contributions
vg-0.007
irst-0.007
ANK-0.006
iller-0.005
(--0.005
Token,
Token ID11
Feature activation-2.49e-03

Pos logit contributions
vg+0.010
irst+0.009
ANK+0.007
iller+0.007
(-+0.007
Neg logit contributions
drivers-0.009
AFTA-0.009
 IEEE-0.008
bee-0.008
ulus-0.008
Token
Token ID447
Feature activation-2.19e-02

Pos logit contributions
drivers+0.009
AFTA+0.009
 IEEE+0.009
bee+0.009
ulus+0.009
Neg logit contributions
vg-0.010
irst-0.010
ANK-0.008
iller-0.008
(--0.007
Token
Token ID251
Feature activation-1.15e-02

Pos logit contributions
drivers+0.017
AFTA+0.016
 IEEE+0.016
bee+0.016
ulus+0.015
Neg logit contributions
vg-0.016
irst-0.015
ANK-0.012
iller-0.012
(--0.012
Token
Token ID198
Feature activation-1.71e-02

Pos logit contributions
drivers+0.008
AFTA+0.008
 IEEE+0.008
bee+0.008
ulus+0.008
Neg logit contributions
vg-0.009
irst-0.008
ANK-0.007
iller-0.007
(--0.007
Token
Token ID198
Feature activation-1.37e-02

Pos logit contributions
drivers+0.013
AFTA+0.012
 IEEE+0.012
bee+0.012
ulus+0.012
Neg logit contributions
vg-0.013
irst-0.012
ANK-0.010
iller-0.010
BuyableInstoreAndOnline-0.010
TokenFR
Token ID10913
Feature activation-1.14e-01

Pos logit contributions
drivers+0.010
AFTA+0.009
 IEEE+0.009
bee+0.009
ulus+0.009
Neg logit contributions
vg-0.011
irst-0.010
ANK-0.008
iller-0.008
(--0.008
TokenANK
Token ID15154
Feature activation+4.70e-01

Pos logit contributions
drivers+0.095
AFTA+0.092
 IEEE+0.090
bee+0.089
ulus+0.088
Neg logit contributions
vg-0.076
irst-0.071
ANK-0.055
iller-0.053
(--0.053
Token M
Token ID337
Feature activation+7.75e-02

Pos logit contributions
vg+0.358
irst+0.337
ANK+0.273
(-+0.265
iller+0.265
Neg logit contributions
drivers-0.348
AFTA-0.337
 IEEE-0.328
bee-0.324
ulus-0.320
TokenORD
Token ID12532
Feature activation-2.94e-02

Pos logit contributions
vg+0.057
irst+0.054
ANK+0.043
iller+0.042
(-+0.042
Neg logit contributions
drivers-0.059
AFTA-0.057
 IEEE-0.056
bee-0.055
ulus-0.055
TokenICA
Token ID25241
Feature activation-7.63e-03

Pos logit contributions
drivers+0.021
AFTA+0.020
 IEEE+0.020
bee+0.020
ulus+0.019
Neg logit contributions
vg-0.023
irst-0.022
ANK-0.018
iller-0.017
(--0.017
Token,
Token ID11
Feature activation-1.35e-02

Pos logit contributions
drivers+0.005
AFTA+0.005
 IEEE+0.005
bee+0.005
ulus+0.005
Neg logit contributions
vg-0.006
irst-0.006
ANK-0.005
iller-0.005
(--0.005
Token V
Token ID569
Feature activation-3.78e-02

Pos logit contributions
drivers+0.010
AFTA+0.009
 IEEE+0.009
bee+0.009
ulus+0.009
Neg logit contributions
vg-0.011
irst-0.010
ANK-0.008
iller-0.008
(--0.008
Token
Token ID198
Feature activation+3.47e-03

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
iller-0.001
(--0.001
Token
Token ID198
Feature activation+4.60e-03

Pos logit contributions
vg+0.003
irst+0.002
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.003
AFTA-0.003
 IEEE-0.002
bee-0.002
ulus-0.002
TokenSh
Token ID2484
Feature activation+1.11e-02

Pos logit contributions
vg+0.004
irst+0.004
ANK+0.003
(-+0.003
iller+0.003
Neg logit contributions
drivers-0.003
AFTA-0.003
 IEEE-0.003
bee-0.003
ulus-0.003
Tokenams
Token ID4105
Feature activation-1.84e-01

Pos logit contributions
vg+0.009
irst+0.008
ANK+0.007
iller+0.007
(-+0.007
Neg logit contributions
drivers-0.008
AFTA-0.008
 IEEE-0.007
bee-0.007
visors-0.007
Tokenia
Token ID544
Feature activation-1.18e-01

Pos logit contributions
drivers+0.134
AFTA+0.130
 IEEE+0.127
bee+0.125
ulus+0.123
Neg logit contributions
vg-0.143
irst-0.134
ANK-0.109
iller-0.106
(--0.106
Token Hass
Token ID20300
Feature activation-5.25e-01

Pos logit contributions
drivers+0.086
AFTA+0.083
 IEEE+0.081
bee+0.080
ulus+0.079
Neg logit contributions
vg-0.091
irst-0.086
ANK-0.069
iller-0.068
(--0.068
Tokenani
Token ID3216
Feature activation-2.16e-01

Pos logit contributions
drivers+0.346
AFTA+0.335
 IEEE+0.325
bee+0.320
ulus+0.315
Neg logit contributions
vg-0.443
irst-0.420
ANK-0.345
iller-0.339
(--0.339
Token,
Token ID11
Feature activation+3.62e-03

Pos logit contributions
drivers+0.157
AFTA+0.152
 IEEE+0.149
bee+0.146
ulus+0.144
Neg logit contributions
vg-0.168
irst-0.159
ANK-0.128
iller-0.125
(--0.125
Token Afghanistan
Token ID8037
Feature activation+3.24e-03

Pos logit contributions
vg+0.003
irst+0.003
ANK+0.002
(-+0.002
iller+0.002
Neg logit contributions
drivers-0.003
AFTA-0.003
 IEEE-0.002
bee-0.002
ulus-0.002
Token
Token ID447
Feature activation-2.91e-03

Pos logit contributions
vg+0.003
irst+0.002
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
Token
Token ID247
Feature activation+1.26e-02

Pos logit contributions
drivers+0.002
AFTA+0.002
 IEEE+0.002
bee+0.002
ulus+0.002
Neg logit contributions
vg-0.002
irst-0.002
ANK-0.002
iller-0.002
(--0.002
Token with
Token ID351
Feature activation+1.51e-02

Pos logit contributions
vg+0.055
irst+0.052
ANK+0.042
iller+0.041
(-+0.041
Neg logit contributions
drivers-0.053
AFTA-0.051
 IEEE-0.050
bee-0.049
ulus-0.049
Token two
Token ID734
Feature activation+6.31e-02

Pos logit contributions
vg+0.011
irst+0.011
ANK+0.009
(-+0.008
iller+0.008
Neg logit contributions
drivers-0.011
AFTA-0.011
 IEEE-0.011
bee-0.011
ulus-0.010
Token flat
Token ID6228
Feature activation+4.31e-02

Pos logit contributions
vg+0.050
irst+0.047
ANK+0.038
(-+0.038
iller+0.037
Neg logit contributions
drivers-0.045
AFTA-0.044
 IEEE-0.042
bee-0.042
ulus-0.041
Tokenhead
Token ID2256
Feature activation-2.04e-01

Pos logit contributions
vg+0.035
irst+0.033
ANK+0.027
(-+0.027
iller+0.027
Neg logit contributions
drivers-0.029
AFTA-0.029
 IEEE-0.028
bee-0.027
ulus-0.027
Token screw
Token ID9580
Feature activation+1.62e-01

Pos logit contributions
drivers+0.152
AFTA+0.149
 IEEE+0.145
bee+0.142
ulus+0.141
Neg logit contributions
vg-0.154
irst-0.144
ANK-0.116
(--0.113
iller-0.113
Tokendrivers
Token ID36702
Feature activation-5.17e-01

Pos logit contributions
vg+0.215
irst+0.209
ANK+0.186
(-+0.185
iller+0.185
Neg logit contributions
AFTA-0.025
drivers-0.023
 IEEE-0.021
bee-0.018
ratulations-0.018
Token.
Token ID13
Feature activation-7.12e-04

Pos logit contributions
drivers+0.379
AFTA+0.366
 IEEE+0.358
bee+0.353
ulus+0.348
Neg logit contributions
vg-0.400
irst-0.377
ANK-0.304
iller-0.297
(--0.296
Token It
Token ID632
Feature activation-3.24e-02

Pos logit contributions
drivers+0.001
AFTA+0.000
 IEEE+0.000
bee+0.000
ulus+0.000
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.000
iller-0.000
(--0.000
Token can
Token ID460
Feature activation+5.25e-02

Pos logit contributions
drivers+0.023
AFTA+0.023
 IEEE+0.022
bee+0.022
ulus+0.022
Neg logit contributions
vg-0.025
irst-0.024
ANK-0.019
iller-0.019
(--0.019
Token be
Token ID307
Feature activation-3.36e-02

Pos logit contributions
vg+0.042
irst+0.040
ANK+0.032
iller+0.032
(-+0.031
Neg logit contributions
drivers-0.037
AFTA-0.036
 IEEE-0.035
bee-0.034
ulus-0.034
Token a
Token ID257
Feature activation+1.02e-02

Pos logit contributions
drivers+0.027
AFTA+0.026
 IEEE+0.025
bee+0.025
ulus+0.025
Neg logit contributions
vg-0.024
irst-0.022
ANK-0.018
(--0.017
iller-0.017
Token St
Token ID520
Feature activation+5.24e-02

Pos logit contributions
drivers+0.110
AFTA+0.107
 IEEE+0.104
bee+0.103
ulus+0.101
Neg logit contributions
vg-0.117
irst-0.110
ANK-0.089
iller-0.087
(--0.087
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
vg+0.040
irst+0.037
ANK+0.030
(-+0.029
iller+0.029
Neg logit contributions
drivers-0.039
AFTA-0.038
 IEEE-0.037
bee-0.036
ulus-0.036
Token and
Token ID290
Feature activation-2.09e-02

Pos logit contributions
drivers+0.008
AFTA+0.008
 IEEE+0.008
bee+0.007
ulus+0.007
Neg logit contributions
vg-0.009
irst-0.008
ANK-0.007
(--0.006
iller-0.006
Token Bou
Token ID14551
Feature activation+2.67e-02

Pos logit contributions
drivers+0.016
AFTA+0.015
 IEEE+0.015
bee+0.015
ulus+0.015
Neg logit contributions
vg-0.016
irst-0.015
ANK-0.012
iller-0.011
(--0.011
Tokenlt
Token ID2528
Feature activation+1.54e-01

Pos logit contributions
vg+0.017
irst+0.016
ANK+0.012
(-+0.012
iller+0.012
Neg logit contributions
drivers-0.023
AFTA-0.022
 IEEE-0.022
bee-0.022
ulus-0.022
Tokenbee
Token ID20963
Feature activation-5.16e-01

Pos logit contributions
vg+0.114
irst+0.107
ANK+0.085
(-+0.083
iller+0.083
Neg logit contributions
drivers-0.118
AFTA-0.114
 IEEE-0.112
bee-0.110
ulus-0.109
Token Ave
Token ID12761
Feature activation-1.46e-01

Pos logit contributions
drivers+0.385
AFTA+0.372
 IEEE+0.363
bee+0.360
ulus+0.354
Neg logit contributions
vg-0.391
irst-0.366
ANK-0.296
(--0.288
iller-0.287
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
drivers+0.108
AFTA+0.105
 IEEE+0.102
bee+0.101
ulus+0.099
Neg logit contributions
vg-0.111
irst-0.104
ANK-0.084
iller-0.082
(--0.081
Token area
Token ID1989
Feature activation-2.00e-02

Pos logit contributions
drivers+0.008
AFTA+0.007
 IEEE+0.007
bee+0.007
ulus+0.007
Neg logit contributions
vg-0.009
irst-0.009
ANK-0.007
iller-0.007
(--0.007
Token,
Token ID11
Feature activation-1.39e-02

Pos logit contributions
drivers+0.014
AFTA+0.013
 IEEE+0.013
bee+0.013
ulus+0.012
Neg logit contributions
vg-0.017
irst-0.016
ANK-0.013
iller-0.013
(--0.013
Token north
Token ID5093
Feature activation+2.01e-02

Pos logit contributions
drivers+0.010
AFTA+0.010
 IEEE+0.010
bee+0.010
ulus+0.010
Neg logit contributions
vg-0.010
irst-0.010
ANK-0.008
iller-0.008
(--0.008
Token is
Token ID318
Feature activation+2.83e-02

Pos logit contributions
vg+0.029
irst+0.027
ANK+0.022
iller+0.022
(-+0.022
Neg logit contributions
drivers-0.027
AFTA-0.026
 IEEE-0.025
bee-0.025
ulus-0.024
Token a
Token ID257
Feature activation+1.02e-02

Pos logit contributions
vg+0.022
irst+0.020
ANK+0.016
iller+0.016
(-+0.016
Neg logit contributions
drivers-0.021
AFTA-0.020
 IEEE-0.020
bee-0.020
ulus-0.019
Token security
Token ID2324
Feature activation-4.82e-02

Pos logit contributions
vg+0.008
irst+0.008
ANK+0.006
(-+0.006
iller+0.006
Neg logit contributions
drivers-0.007
AFTA-0.007
 IEEE-0.007
bee-0.007
ulus-0.006
Token algorithm
Token ID11862
Feature activation-6.20e-02

Pos logit contributions
drivers+0.034
AFTA+0.033
 IEEE+0.032
bee+0.032
ulus+0.031
Neg logit contributions
vg-0.038
irst-0.036
ANK-0.029
(--0.029
iller-0.029
Token for
Token ID329
Feature activation-2.20e-02

Pos logit contributions
drivers+0.046
AFTA+0.044
 IEEE+0.043
bee+0.043
ulus+0.042
Neg logit contributions
vg-0.047
irst-0.045
ANK-0.036
iller-0.035
(--0.035
Token IEEE
Token ID40552
Feature activation-5.14e-01

Pos logit contributions
drivers+0.015
AFTA+0.015
 IEEE+0.014
bee+0.014
ulus+0.014
Neg logit contributions
vg-0.018
irst-0.017
ANK-0.014
(--0.013
iller-0.013
Token 802
Token ID33121
Feature activation-3.22e-01

Pos logit contributions
drivers+0.233
AFTA+0.223
ulus+0.205
bee+0.205
 IEEE+0.204
Neg logit contributions
vg-0.536
irst-0.514
ANK-0.445
(--0.440
 decay-0.434
Token.
Token ID13
Feature activation-4.45e-03

Pos logit contributions
drivers+0.243
AFTA+0.235
 IEEE+0.230
bee+0.226
ulus+0.223
Neg logit contributions
vg-0.241
irst-0.227
ANK-0.182
iller-0.178
(--0.178
Token11
Token ID1157
Feature activation+4.20e-02

Pos logit contributions
drivers+0.004
AFTA+0.004
ulus+0.003
ratulations+0.003
visors+0.003
Neg logit contributions
vg-0.003
irst-0.003
ANK-0.002
(--0.002
 drip-0.002
Token wireless
Token ID12521
Feature activation+4.52e-02

Pos logit contributions
vg+0.033
irst+0.031
ANK+0.025
(-+0.025
iller+0.025
Neg logit contributions
drivers-0.030
AFTA-0.029
ratulations-0.028
ulus-0.028
visors-0.028
Token networks
Token ID7686
Feature activation+6.94e-02

Pos logit contributions
vg+0.036
irst+0.034
ANK+0.028
(-+0.027
iller+0.027
Neg logit contributions
drivers-0.032
AFTA-0.031
bee-0.030
 IEEE-0.030
ulus-0.029
Token
Token ID447
Feature activation-1.41e-02

Pos logit contributions
drivers+0.009
AFTA+0.009
 IEEE+0.008
bee+0.008
ulus+0.008
Neg logit contributions
vg-0.010
irst-0.009
ANK-0.007
iller-0.007
(--0.007
Token
Token ID251
Feature activation-1.17e-02

Pos logit contributions
drivers+0.011
AFTA+0.010
 IEEE+0.010
bee+0.010
ulus+0.010
Neg logit contributions
vg-0.011
irst-0.010
ANK-0.008
(--0.008
iller-0.008
Token said
Token ID531
Feature activation-4.73e-03

Pos logit contributions
drivers+0.008
AFTA+0.008
 IEEE+0.008
bee+0.008
ulus+0.008
Neg logit contributions
vg-0.009
irst-0.009
ANK-0.007
iller-0.007
(--0.007
Token G
Token ID402
Feature activation+1.33e-01

Pos logit contributions
drivers+0.004
AFTA+0.003
 IEEE+0.003
bee+0.003
ulus+0.003
Neg logit contributions
vg-0.004
irst-0.003
ANK-0.003
iller-0.003
(--0.003
Tokenools
Token ID10141
Feature activation+2.91e-01

Pos logit contributions
vg+0.102
irst+0.096
ANK+0.077
iller+0.076
(-+0.075
Neg logit contributions
drivers-0.098
AFTA-0.094
 IEEE-0.092
bee-0.091
ulus-0.090
Tokenbee
Token ID20963
Feature activation-5.12e-01

Pos logit contributions
vg+0.419
irst+0.406
ANK+0.365
iller+0.361
(-+0.361
Neg logit contributions
drivers-0.018
AFTA-0.012
 IEEE-0.007
bee-0.003
ulus-0.001
Token.
Token ID13
Feature activation-7.27e-03

Pos logit contributions
drivers+0.372
AFTA+0.361
 IEEE+0.354
bee+0.348
ulus+0.342
Neg logit contributions
vg-0.398
irst-0.373
ANK-0.302
iller-0.294
(--0.294
Token
Token ID564
Feature activation-1.01e-04

Pos logit contributions
drivers+0.005
AFTA+0.005
 IEEE+0.005
bee+0.005
ulus+0.005
Neg logit contributions
vg-0.006
irst-0.005
ANK-0.004
iller-0.004
(--0.004
Token
Token ID250
Feature activation-1.53e-02

Pos logit contributions
drivers+0.000
AFTA+0.000
 IEEE+0.000
bee+0.000
ulus+0.000
Neg logit contributions
vg-0.000
irst-0.000
ANK-0.000
(--0.000
iller-0.000
TokenThis
Token ID1212
Feature activation-5.18e-02

Pos logit contributions
drivers+0.011
AFTA+0.010
 IEEE+0.010
bee+0.010
ulus+0.010
Neg logit contributions
vg-0.012
irst-0.012
ANK-0.010
iller-0.009
(--0.009
Token is
Token ID318
Feature activation+2.49e-02

Pos logit contributions
drivers+0.038
AFTA+0.037
 IEEE+0.036
bee+0.035
ulus+0.035
Neg logit contributions
vg-0.040
irst-0.038
ANK-0.030
iller-0.030
(--0.030
Token W
Token ID370
Feature activation+5.06e-02

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
bee-0.001
 IEEE-0.001
ratulations-0.001
TokenPA
Token ID4537
Feature activation-1.04e-01

Pos logit contributions
vg+0.045
irst+0.042
ANK+0.035
iller+0.035
(-+0.035
Neg logit contributions
drivers-0.032
AFTA-0.030
 IEEE-0.029
bee-0.029
ulus-0.029
Token2
Token ID17
Feature activation+1.07e-01

Pos logit contributions
drivers+0.079
AFTA+0.076
 IEEE+0.074
bee+0.073
ulus+0.073
Neg logit contributions
vg-0.078
irst-0.072
ANK-0.059
(--0.057
iller-0.057
Token),
Token ID828
Feature activation+8.90e-02

Pos logit contributions
vg+0.075
irst+0.070
ANK+0.055
(-+0.054
iller+0.054
Neg logit contributions
drivers-0.086
AFTA-0.084
 IEEE-0.081
bee-0.080
ratulations-0.080
Token the
Token ID262
Feature activation-3.64e-03

Pos logit contributions
vg+0.068
irst+0.064
ANK+0.052
(-+0.051
iller+0.051
Neg logit contributions
drivers-0.065
AFTA-0.064
bee-0.061
 IEEE-0.061
ulus-0.060
Token IEEE
Token ID40552
Feature activation-5.09e-01

Pos logit contributions
drivers+0.002
AFTA+0.002
 IEEE+0.002
bee+0.002
ulus+0.002
Neg logit contributions
vg-0.003
irst-0.003
ANK-0.002
iller-0.002
(--0.002
Token declared
Token ID6875
Feature activation-1.15e-01

Pos logit contributions
drivers+0.314
AFTA+0.302
 IEEE+0.287
bee+0.286
ulus+0.284
Neg logit contributions
vg-0.451
irst-0.428
ANK-0.358
(--0.353
iller-0.351
Token that
Token ID326
Feature activation+1.92e-02

Pos logit contributions
drivers+0.089
AFTA+0.086
 IEEE+0.084
bee+0.083
ulus+0.082
Neg logit contributions
vg-0.084
irst-0.079
ANK-0.062
iller-0.061
(--0.061
Token both
Token ID1111
Feature activation-4.75e-03

Pos logit contributions
vg+0.014
irst+0.014
ANK+0.011
(-+0.011
iller+0.011
Neg logit contributions
drivers-0.014
AFTA-0.014
 IEEE-0.013
bee-0.013
ulus-0.013
Token W
Token ID370
Feature activation+5.22e-02

Pos logit contributions
drivers+0.003
AFTA+0.003
bee+0.003
 IEEE+0.003
ratulations+0.003
Neg logit contributions
vg-0.004
irst-0.004
ANK-0.003
(--0.003
iller-0.003
TokenEP
Token ID8905
Feature activation-2.82e-02

Pos logit contributions
vg+0.045
irst+0.043
ANK+0.036
iller+0.035
(-+0.035
Neg logit contributions
drivers-0.033
AFTA-0.032
 IEEE-0.031
bee-0.030
ulus-0.030
Token privacy
Token ID6782
Feature activation+4.48e-02

Pos logit contributions
drivers+0.003
AFTA+0.003
bee+0.003
 IEEE+0.003
ulus+0.003
Neg logit contributions
vg-0.003
irst-0.003
ANK-0.002
iller-0.002
(--0.002
Token component
Token ID7515
Feature activation+9.61e-02

Pos logit contributions
vg+0.040
irst+0.038
ANK+0.031
(-+0.031
iller+0.031
Neg logit contributions
drivers-0.027
AFTA-0.027
 IEEE-0.025
bee-0.025
ratulations-0.025
Token of
Token ID286
Feature activation+1.45e-03

Pos logit contributions
vg+0.073
irst+0.068
ANK+0.055
(-+0.054
iller+0.054
Neg logit contributions
drivers-0.071
AFTA-0.069
 IEEE-0.067
bee-0.067
ulus-0.066
Token the
Token ID262
Feature activation-7.26e-03

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.001
(-+0.001
iller+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
bee-0.001
 IEEE-0.001
ulus-0.001
Token original
Token ID2656
Feature activation+1.17e-01

Pos logit contributions
drivers+0.004
AFTA+0.004
bee+0.004
 IEEE+0.004
ratulations+0.004
Neg logit contributions
vg-0.007
irst-0.007
ANK-0.006
iller-0.005
(--0.005
Token IEEE
Token ID40552
Feature activation-5.07e-01

Pos logit contributions
vg+0.129
irst+0.124
ANK+0.107
(-+0.106
iller+0.106
Neg logit contributions
drivers-0.047
AFTA-0.044
bee-0.040
ratulations-0.040
 IEEE-0.040
Token 802
Token ID33121
Feature activation-3.13e-01

Pos logit contributions
drivers+0.181
AFTA+0.172
bee+0.153
ratulations+0.152
ulus+0.151
Neg logit contributions
vg-0.578
irst-0.557
ANK-0.488
(--0.481
iller-0.479
Token.
Token ID13
Feature activation-4.37e-03

Pos logit contributions
drivers+0.235
AFTA+0.227
 IEEE+0.222
bee+0.219
ulus+0.216
Neg logit contributions
vg-0.235
irst-0.222
ANK-0.177
iller-0.174
(--0.173
Token11
Token ID1157
Feature activation+3.44e-02

Pos logit contributions
drivers+0.004
AFTA+0.003
 IEEE+0.003
bee+0.003
ulus+0.003
Neg logit contributions
vg-0.003
irst-0.003
ANK-0.002
(--0.002
iller-0.002
Token standard
Token ID3210
Feature activation-1.45e-01

Pos logit contributions
vg+0.032
irst+0.030
ANK+0.025
(-+0.025
iller+0.025
Neg logit contributions
drivers-0.020
AFTA-0.019
ratulations-0.018
 IEEE-0.018
bee-0.018
Token ratified
Token ID39067
Feature activation-6.79e-02

Pos logit contributions
drivers+0.109
AFTA+0.106
 IEEE+0.102
bee+0.101
ratulations+0.100
Neg logit contributions
vg-0.109
irst-0.103
ANK-0.083
(--0.081
iller-0.081
Token with
Token ID351
Feature activation+4.17e-03

Pos logit contributions
vg+0.019
irst+0.017
ANK+0.014
iller+0.013
(-+0.013
Neg logit contributions
drivers-0.020
AFTA-0.019
 IEEE-0.019
bee-0.019
ulus-0.018
Token money
Token ID1637
Feature activation+2.13e-03

Pos logit contributions
vg+0.003
irst+0.003
ANK+0.003
iller+0.003
(-+0.003
Neg logit contributions
drivers-0.003
AFTA-0.003
 IEEE-0.003
bee-0.003
ulus-0.003
Token and
Token ID290
Feature activation-1.92e-02

Pos logit contributions
vg+0.002
irst+0.001
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
Token coat
Token ID13209
Feature activation-7.32e-02

Pos logit contributions
drivers+0.016
AFTA+0.015
 IEEE+0.015
bee+0.015
ulus+0.015
Neg logit contributions
vg-0.013
irst-0.012
ANK-0.010
iller-0.009
(--0.009
Token-
Token ID12
Feature activation+1.55e-02

Pos logit contributions
drivers+0.039
AFTA+0.037
 IEEE+0.036
bee+0.035
ulus+0.035
Neg logit contributions
vg-0.071
irst-0.068
ANK-0.058
(--0.057
iller-0.057
Tokentails
Token ID26404
Feature activation-5.07e-01

Pos logit contributions
vg+0.021
irst+0.021
ANK+0.018
iller+0.018
(-+0.018
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
ulus-0.001
bee-0.001
Token than
Token ID621
Feature activation+1.32e-01

Pos logit contributions
drivers+0.418
AFTA+0.407
 IEEE+0.399
bee+0.394
ulus+0.389
Neg logit contributions
vg-0.344
irst-0.320
ANK-0.249
iller-0.242
(--0.241
Token to
Token ID284
Feature activation-2.50e-02

Pos logit contributions
vg+0.109
irst+0.104
ANK+0.085
iller+0.083
(-+0.083
Neg logit contributions
drivers-0.090
AFTA-0.086
 IEEE-0.085
bee-0.083
ulus-0.082
Token minorities
Token ID15557
Feature activation+3.93e-02

Pos logit contributions
drivers+0.014
AFTA+0.014
 IEEE+0.013
bee+0.013
ulus+0.013
Neg logit contributions
vg-0.023
irst-0.022
ANK-0.019
(--0.018
iller-0.018
Token.
Token ID13
Feature activation-1.38e-02

Pos logit contributions
vg+0.030
irst+0.029
ANK+0.023
iller+0.022
(-+0.022
Neg logit contributions
drivers-0.029
AFTA-0.028
 IEEE-0.027
bee-0.027
ulus-0.027
Token The
Token ID383
Feature activation+3.28e-02

Pos logit contributions
drivers+0.010
AFTA+0.010
 IEEE+0.010
bee+0.009
ulus+0.009
Neg logit contributions
vg-0.011
irst-0.010
ANK-0.008
iller-0.008
(--0.008
Tokenf
Token ID69
Feature activation-1.68e-02

Pos logit contributions
vg+0.032
irst+0.030
ANK+0.024
iller+0.024
(-+0.024
Neg logit contributions
drivers-0.032
AFTA-0.031
 IEEE-0.030
bee-0.029
ulus-0.029
Token4
Token ID19
Feature activation+3.37e-02

Pos logit contributions
drivers+0.012
AFTA+0.012
 IEEE+0.012
bee+0.012
ratulations+0.011
Neg logit contributions
vg-0.013
irst-0.012
ANK-0.010
iller-0.009
(--0.009
Tokenb
Token ID65
Feature activation+9.46e-03

Pos logit contributions
vg+0.026
irst+0.024
ANK+0.019
iller+0.019
(-+0.019
Neg logit contributions
drivers-0.025
AFTA-0.024
 IEEE-0.024
bee-0.023
ulus-0.023
Tokenbe
Token ID1350
Feature activation-5.52e-02

Pos logit contributions
vg+0.007
irst+0.007
ANK+0.006
iller+0.005
(-+0.005
Neg logit contributions
drivers-0.007
AFTA-0.007
 IEEE-0.007
bee-0.006
ratulations-0.006
Token66
Token ID2791
Feature activation-9.79e-02

Pos logit contributions
drivers+0.040
AFTA+0.039
 IEEE+0.038
bee+0.037
ratulations+0.037
Neg logit contributions
vg-0.043
irst-0.040
ANK-0.032
iller-0.032
(--0.032
Token IEEE
Token ID40552
Feature activation-5.07e-01

Pos logit contributions
drivers+0.072
AFTA+0.070
 IEEE+0.069
bee+0.067
ulus+0.067
Neg logit contributions
vg-0.075
irst-0.070
ANK-0.057
iller-0.055
(--0.055
Token 802
Token ID33121
Feature activation-3.11e-01

Pos logit contributions
drivers+0.210
AFTA+0.203
 IEEE+0.185
visors+0.183
bee+0.183
Neg logit contributions
vg-0.549
irst-0.528
ANK-0.457
(--0.453
iller-0.449
Token.
Token ID13
Feature activation+7.54e-03

Pos logit contributions
drivers+0.234
AFTA+0.227
 IEEE+0.221
bee+0.218
ulus+0.215
Neg logit contributions
vg-0.233
irst-0.219
ANK-0.176
iller-0.172
(--0.171
Token11
Token ID1157
Feature activation+4.48e-02

Pos logit contributions
vg+0.005
irst+0.005
ANK+0.004
(-+0.004
iller+0.004
Neg logit contributions
drivers-0.006
AFTA-0.006
 IEEE-0.006
visors-0.006
ulus-0.006
Tokenb
Token ID65
Feature activation+3.32e-03

Pos logit contributions
vg+0.035
irst+0.033
ANK+0.027
(-+0.027
iller+0.027
Neg logit contributions
drivers-0.032
AFTA-0.031
 IEEE-0.030
ratulations-0.030
bee-0.029
Tokengn
Token ID4593
Feature activation+1.97e-01

Pos logit contributions
vg+0.003
irst+0.003
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ratulations-0.002
TokenSan
Token ID15017
Feature activation-1.30e-01

Pos logit contributions
vg+0.003
irst+0.003
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.003
AFTA-0.003
 IEEE-0.003
bee-0.003
ulus-0.003
Token Francisco
Token ID6033
Feature activation-1.65e-01

Pos logit contributions
drivers+0.084
AFTA+0.081
 IEEE+0.079
bee+0.078
ulus+0.076
Neg logit contributions
vg-0.111
irst-0.105
ANK-0.087
(--0.085
iller-0.085
Token Board
Token ID5926
Feature activation+2.61e-02

Pos logit contributions
drivers+0.105
AFTA+0.101
 IEEE+0.097
bee+0.096
ulus+0.095
Neg logit contributions
vg-0.143
irst-0.136
ANK-0.113
iller-0.111
(--0.111
Token of
Token ID286
Feature activation+7.53e-03

Pos logit contributions
vg+0.020
irst+0.019
ANK+0.015
iller+0.015
(-+0.015
Neg logit contributions
drivers-0.019
AFTA-0.019
 IEEE-0.018
bee-0.018
ulus-0.018
Token Super
Token ID3115
Feature activation+5.53e-02

Pos logit contributions
vg+0.006
irst+0.005
ANK+0.004
iller+0.004
(-+0.004
Neg logit contributions
drivers-0.006
AFTA-0.006
 IEEE-0.006
bee-0.005
ulus-0.005
Tokenvisors
Token ID27681
Feature activation-5.04e-01

Pos logit contributions
vg+0.079
irst+0.077
ANK+0.069
iller+0.068
(-+0.068
Neg logit contributions
drivers-0.004
AFTA-0.003
 IEEE-0.002
bee-0.001
ulus-0.001
Token
Token ID198
Feature activation+2.88e-03

Pos logit contributions
drivers+0.374
AFTA+0.363
 IEEE+0.353
bee+0.349
ulus+0.345
Neg logit contributions
vg-0.383
irst-0.361
ANK-0.290
iller-0.284
(--0.283
Token
Token ID198
Feature activation-1.62e-03

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
TokenSan
Token ID15017
Feature activation-1.32e-01

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
(--0.001
iller-0.001
Token Francisco
Token ID6033
Feature activation-1.66e-01

Pos logit contributions
drivers+0.074
AFTA+0.071
 IEEE+0.068
bee+0.067
ulus+0.066
Neg logit contributions
vg-0.125
irst-0.119
ANK-0.101
iller-0.099
(--0.099
Token,
Token ID11
Feature activation-5.69e-03

Pos logit contributions
drivers+0.117
AFTA+0.113
 IEEE+0.110
bee+0.108
ulus+0.107
Neg logit contributions
vg-0.133
irst-0.126
ANK-0.102
iller-0.100
(--0.100
Tokenf
Token ID69
Feature activation-1.52e-02

Pos logit contributions
vg+0.037
irst+0.035
ANK+0.029
iller+0.028
(-+0.028
Neg logit contributions
drivers-0.032
AFTA-0.031
 IEEE-0.030
bee-0.030
ulus-0.029
Token4
Token ID19
Feature activation+2.41e-02

Pos logit contributions
drivers+0.012
AFTA+0.011
 IEEE+0.011
bee+0.011
ulus+0.011
Neg logit contributions
vg-0.011
irst-0.011
ANK-0.009
iller-0.008
(--0.008
Tokenb
Token ID65
Feature activation+8.87e-03

Pos logit contributions
vg+0.019
irst+0.018
ANK+0.014
(-+0.014
iller+0.014
Neg logit contributions
drivers-0.017
AFTA-0.017
 IEEE-0.016
bee-0.016
ulus-0.016
Tokenbe
Token ID1350
Feature activation-5.32e-02

Pos logit contributions
vg+0.008
irst+0.007
ANK+0.006
(-+0.006
iller+0.006
Neg logit contributions
drivers-0.006
AFTA-0.006
 IEEE-0.005
ratulations-0.005
ulus-0.005
Token66
Token ID2791
Feature activation-1.01e-01

Pos logit contributions
drivers+0.033
AFTA+0.032
 IEEE+0.031
bee+0.030
ratulations+0.030
Neg logit contributions
vg-0.047
irst-0.044
ANK-0.037
(--0.036
iller-0.036
Token IEEE
Token ID40552
Feature activation-5.03e-01

Pos logit contributions
drivers+0.039
AFTA+0.037
bee+0.034
 IEEE+0.033
visors+0.033
Neg logit contributions
vg-0.113
irst-0.109
ANK-0.095
(--0.093
iller-0.093
Token 802
Token ID33121
Feature activation-3.19e-01

Pos logit contributions
drivers+0.165
AFTA+0.156
 IEEE+0.142
bee+0.140
ratulations+0.136
Neg logit contributions
vg-0.590
irst-0.568
ANK-0.498
(--0.492
iller-0.490
Token.
Token ID13
Feature activation+4.91e-04

Pos logit contributions
drivers+0.235
AFTA+0.226
 IEEE+0.223
bee+0.220
ulus+0.216
Neg logit contributions
vg-0.244
irst-0.230
ANK-0.184
iller-0.181
BuyableInstoreAndOnline-0.180
Token11
Token ID1157
Feature activation+3.90e-02

Pos logit contributions
vg+0.000
irst+0.000
ANK+0.000
(-+0.000
iller+0.000
Neg logit contributions
drivers-0.000
AFTA-0.000
 IEEE-0.000
bee-0.000
ratulations-0.000
Tokenb
Token ID65
Feature activation-2.84e-03

Pos logit contributions
vg+0.031
irst+0.029
ANK+0.023
(-+0.023
iller+0.023
Neg logit contributions
drivers-0.028
AFTA-0.027
 IEEE-0.026
bee-0.026
ulus-0.026
Tokengn
Token ID4593
Feature activation+1.93e-01

Pos logit contributions
drivers+0.003
AFTA+0.003
 IEEE+0.003
bee+0.002
ulus+0.002
Neg logit contributions
vg-0.002
irst-0.001
ANK-0.001
(--0.001
iller-0.001
Tokenfer
Token ID2232
Feature activation-2.08e-02

Pos logit contributions
drivers+0.073
AFTA+0.071
 IEEE+0.070
ratulations+0.067
ulus+0.067
Neg logit contributions
vg-0.081
irst-0.076
(--0.061
ANK-0.061
zer-0.061
Tokenometer
Token ID15635
Feature activation+3.50e-02

Pos logit contributions
drivers+0.015
AFTA+0.015
 IEEE+0.015
bee+0.014
ulus+0.014
Neg logit contributions
vg-0.016
irst-0.015
ANK-0.012
iller-0.012
(--0.012
Token Grav
Token ID32599
Feature activation+3.45e-01

Pos logit contributions
vg+0.014
irst+0.013
ANK+0.008
iller+0.007
(-+0.007
Neg logit contributions
drivers-0.038
AFTA-0.038
 IEEE-0.037
bee-0.037
ulus-0.036
Tokenitational
Token ID22181
Feature activation-1.70e-01

Pos logit contributions
vg+0.344
irst+0.332
ANK+0.280
iller+0.280
(-+0.279
Neg logit contributions
drivers-0.171
AFTA-0.164
 IEEE-0.160
ratulations-0.152
bee-0.151
Token-
Token ID12
Feature activation+2.33e-02

Pos logit contributions
drivers+0.126
AFTA+0.122
 IEEE+0.120
bee+0.118
ulus+0.116
Neg logit contributions
vg-0.131
irst-0.123
ANK-0.099
iller-0.096
(--0.095
Tokenwave
Token ID19204
Feature activation-5.00e-01

Pos logit contributions
vg+0.031
irst+0.030
ANK+0.026
(-+0.026
iller+0.026
Neg logit contributions
drivers-0.004
AFTA-0.004
 IEEE-0.003
bee-0.003
ratulations-0.003
Token Observatory
Token ID25758
Feature activation-2.23e-01

Pos logit contributions
drivers+0.207
AFTA+0.196
 IEEE+0.189
bee+0.183
ulus+0.177
Neg logit contributions
vg-0.546
irst-0.524
ANK-0.455
iller-0.446
(--0.445
Token (
Token ID357
Feature activation+1.76e-02

Pos logit contributions
drivers+0.163
AFTA+0.158
 IEEE+0.154
bee+0.152
ulus+0.150
Neg logit contributions
vg-0.172
irst-0.163
ANK-0.131
iller-0.128
(--0.128
TokenL
Token ID43
Feature activation+1.04e-01

Pos logit contributions
vg+0.012
irst+0.011
ANK+0.008
iller+0.008
(-+0.008
Neg logit contributions
drivers-0.015
AFTA-0.015
 IEEE-0.014
bee-0.014
ulus-0.014
TokenIG
Token ID3528
Feature activation+1.61e-01

Pos logit contributions
vg+0.066
irst+0.061
ANK+0.046
iller+0.045
(-+0.045
Neg logit contributions
drivers-0.091
AFTA-0.089
 IEEE-0.087
bee-0.086
ulus-0.085
TokenO
Token ID46
Feature activation+3.35e-02

Pos logit contributions
vg+0.119
irst+0.110
ANK+0.089
(-+0.087
iller+0.086
Neg logit contributions
drivers-0.125
AFTA-0.121
 IEEE-0.118
bee-0.116
ratulations-0.115
Token,
Token ID11
Feature activation-1.52e-03

Pos logit contributions
drivers+0.110
AFTA+0.106
 IEEE+0.104
bee+0.102
ulus+0.101
Neg logit contributions
vg-0.113
irst-0.107
ANK-0.086
iller-0.084
(--0.084
Token or
Token ID393
Feature activation-2.94e-02

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
iller-0.001
(--0.001
Token whether
Token ID1771
Feature activation-5.16e-02

Pos logit contributions
drivers+0.020
AFTA+0.019
 IEEE+0.019
bee+0.018
ulus+0.018
Neg logit contributions
vg-0.024
irst-0.023
ANK-0.019
iller-0.018
(--0.018
Token a
Token ID257
Feature activation+9.11e-03

Pos logit contributions
drivers+0.037
AFTA+0.036
 IEEE+0.035
ulus+0.034
bee+0.034
Neg logit contributions
vg-0.040
irst-0.038
ANK-0.030
(--0.030
 drip-0.030
Token honey
Token ID12498
Feature activation-2.57e-01

Pos logit contributions
vg+0.007
irst+0.007
ANK+0.006
iller+0.006
(-+0.006
Neg logit contributions
drivers-0.006
AFTA-0.006
 IEEE-0.006
bee-0.006
ratulations-0.006
Tokenbee
Token ID20963
Feature activation-5.00e-01

Pos logit contributions
drivers+0.025
AFTA+0.020
 IEEE+0.015
ulus+0.010
ratulations+0.009
Neg logit contributions
vg-0.361
irst-0.351
ANK-0.314
iller-0.312
(--0.311
Token might
Token ID1244
Feature activation+5.56e-02

Pos logit contributions
drivers+0.337
AFTA+0.327
 IEEE+0.317
bee+0.311
ulus+0.308
Neg logit contributions
vg-0.415
irst-0.393
ANK-0.322
iller-0.316
(--0.316
Token also
Token ID635
Feature activation+7.90e-03

Pos logit contributions
vg+0.044
irst+0.041
ANK+0.033
iller+0.033
(-+0.033
Neg logit contributions
drivers-0.040
AFTA-0.039
 IEEE-0.038
bee-0.037
ulus-0.037
Token demonstrate
Token ID10176
Feature activation+6.01e-02

Pos logit contributions
vg+0.006
irst+0.006
ANK+0.005
iller+0.005
(-+0.005
Neg logit contributions
drivers-0.006
AFTA-0.005
 IEEE-0.005
bee-0.005
ulus-0.005
Token rule
Token ID3896
Feature activation-1.79e-02

Pos logit contributions
vg+0.047
irst+0.044
ANK+0.035
iller+0.035
(-+0.035
Neg logit contributions
drivers-0.044
AFTA-0.042
 IEEE-0.041
bee-0.041
ulus-0.040
Token learning
Token ID4673
Feature activation+1.88e-02

Pos logit contributions
drivers+0.014
AFTA+0.014
 IEEE+0.013
bee+0.013
ulus+0.013
Neg logit contributions
vg-0.013
irst-0.012
ANK-0.010
(--0.010
iller-0.009
Token petty
Token ID25229
Feature activation-2.24e-02

Pos logit contributions
vg+0.048
irst+0.046
ANK+0.038
iller+0.037
(-+0.037
Neg logit contributions
drivers-0.040
AFTA-0.039
 IEEE-0.038
bee-0.037
ulus-0.036
Token-
Token ID12
Feature activation+2.29e-02

Pos logit contributions
drivers+0.015
AFTA+0.015
 IEEE+0.014
bee+0.014
ulus+0.014
Neg logit contributions
vg-0.018
irst-0.017
ANK-0.014
(--0.014
iller-0.014
Tokenbour
Token ID6084
Feature activation+1.52e-01

Pos logit contributions
vg+0.014
irst+0.013
ANK+0.010
iller+0.010
(-+0.010
Neg logit contributions
drivers-0.020
AFTA-0.020
 IEEE-0.019
bee-0.019
ulus-0.019
Tokenge
Token ID469
Feature activation+7.23e-02

Pos logit contributions
vg+0.108
irst+0.101
ANK+0.080
iller+0.078
(-+0.078
Neg logit contributions
drivers-0.120
AFTA-0.117
 IEEE-0.114
bee-0.113
ulus-0.111
Tokenois
Token ID10924
Feature activation-3.84e-01

Pos logit contributions
vg+0.092
irst+0.089
ANK+0.079
iller+0.078
(-+0.078
Neg logit contributions
drivers-0.017
AFTA-0.015
 IEEE-0.014
bee-0.013
ulus-0.012
Token intellig
Token ID10878
Feature activation-4.88e-01

Pos logit contributions
drivers+0.166
AFTA+0.158
 IEEE+0.151
bee+0.147
ulus+0.144
Neg logit contributions
vg-0.411
irst-0.394
ANK-0.340
iller-0.336
(--0.335
Tokenents
Token ID658
Feature activation-6.19e-02

Pos logit contributions
drivers+0.303
AFTA+0.293
 IEEE+0.283
bee+0.278
ulus+0.273
Neg logit contributions
vg-0.432
irst-0.411
ANK-0.340
iller-0.337
(--0.335
Tokenia
Token ID544
Feature activation-1.16e-01

Pos logit contributions
drivers+0.035
AFTA+0.034
 IEEE+0.033
bee+0.032
ulus+0.032
Neg logit contributions
vg-0.058
irst-0.055
ANK-0.046
iller-0.046
(--0.046
Token resisted
Token ID26643
Feature activation-1.69e-01

Pos logit contributions
drivers+0.085
AFTA+0.082
 IEEE+0.080
bee+0.079
ulus+0.078
Neg logit contributions
vg-0.090
irst-0.085
ANK-0.069
iller-0.067
(--0.067
Token en
Token ID551
Feature activation-1.03e-01

Pos logit contributions
drivers+0.125
AFTA+0.121
 IEEE+0.119
bee+0.117
ulus+0.115
Neg logit contributions
vg-0.128
irst-0.121
ANK-0.097
iller-0.095
(--0.095
Token masse
Token ID41869
Feature activation-3.32e-02

Pos logit contributions
drivers+0.073
AFTA+0.071
 IEEE+0.069
bee+0.068
ulus+0.067
Neg logit contributions
vg-0.081
irst-0.077
ANK-0.062
iller-0.061
(--0.061
Token How
Token ID1374
Feature activation-4.17e-02

Pos logit contributions
vg+0.082
irst+0.078
ANK+0.063
iller+0.062
zer+0.062
Neg logit contributions
drivers-0.072
AFTA-0.070
 IEEE-0.067
ratulations-0.067
visors-0.066
Token Bees
Token ID46698
Feature activation-3.15e-01

Pos logit contributions
drivers+0.026
AFTA+0.026
 IEEE+0.025
ratulations+0.024
ulus+0.024
Neg logit contributions
vg-0.036
irst-0.034
ANK-0.028
iller-0.028
zer-0.028
Token See
Token ID4091
Feature activation-1.50e-01

Pos logit contributions
drivers+0.224
AFTA+0.217
 IEEE+0.210
bee+0.207
ulus+0.205
Neg logit contributions
vg-0.250
irst-0.236
ANK-0.191
iller-0.188
(--0.188
Token<|endoftext|>
Token ID50256
Feature activation+8.36e-02

Pos logit contributions
drivers+0.113
AFTA+0.110
 IEEE+0.106
bee+0.104
ratulations+0.104
Neg logit contributions
vg-0.112
irst-0.105
ANK-0.084
(--0.083
iller-0.082
TokenThe
Token ID464
Feature activation+2.55e-03

Pos logit contributions
vg+0.067
irst+0.063
ANK+0.052
(-+0.051
iller+0.051
Neg logit contributions
drivers-0.059
AFTA-0.057
 IEEE-0.055
bee-0.054
ulus-0.054
Token fuss
Token ID34297
Feature activation-4.87e-01

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.002
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
Token is
Token ID318
Feature activation+2.25e-02

Pos logit contributions
drivers+0.336
AFTA+0.324
 IEEE+0.316
bee+0.312
ulus+0.307
Neg logit contributions
vg-0.397
irst-0.375
ANK-0.307
iller-0.301
(--0.300
Token in
Token ID287
Feature activation-2.13e-02

Pos logit contributions
vg+0.017
irst+0.016
ANK+0.013
(-+0.013
iller+0.013
Neg logit contributions
drivers-0.017
AFTA-0.016
 IEEE-0.016
bee-0.015
ulus-0.015
Token the
Token ID262
Feature activation-1.14e-02

Pos logit contributions
drivers+0.016
AFTA+0.015
 IEEE+0.015
bee+0.015
ratulations+0.015
Neg logit contributions
vg-0.016
irst-0.015
ANK-0.012
(--0.012
iller-0.012
Token flavors
Token ID17361
Feature activation-1.95e-01

Pos logit contributions
drivers+0.009
AFTA+0.009
 IEEE+0.008
bee+0.008
ulus+0.008
Neg logit contributions
vg-0.008
irst-0.008
ANK-0.006
iller-0.006
(--0.006
Token:
Token ID25
Feature activation+1.49e-02

Pos logit contributions
drivers+0.144
AFTA+0.140
 IEEE+0.137
bee+0.135
ulus+0.133
Neg logit contributions
vg-0.148
irst-0.139
ANK-0.112
iller-0.110
(--0.109
Token petty
Token ID25229
Feature activation-2.21e-02

Pos logit contributions
vg+0.052
irst+0.049
ANK+0.040
iller+0.040
(-+0.040
Neg logit contributions
drivers-0.042
AFTA-0.041
 IEEE-0.039
bee-0.039
ulus-0.038
Token-
Token ID12
Feature activation+2.68e-02

Pos logit contributions
drivers+0.017
AFTA+0.016
 IEEE+0.016
bee+0.015
ulus+0.015
Neg logit contributions
vg-0.017
irst-0.016
ANK-0.013
(--0.012
iller-0.012
Tokenbour
Token ID6084
Feature activation+1.48e-01

Pos logit contributions
vg+0.017
irst+0.016
ANK+0.012
iller+0.012
(-+0.012
Neg logit contributions
drivers-0.023
AFTA-0.023
 IEEE-0.022
bee-0.022
ulus-0.022
Tokenge
Token ID469
Feature activation+6.90e-02

Pos logit contributions
vg+0.106
irst+0.100
ANK+0.079
(-+0.077
iller+0.077
Neg logit contributions
drivers-0.116
AFTA-0.113
 IEEE-0.110
bee-0.109
ulus-0.107
Tokenois
Token ID10924
Feature activation-3.78e-01

Pos logit contributions
vg+0.088
irst+0.085
ANK+0.075
iller+0.075
(-+0.074
Neg logit contributions
drivers-0.016
AFTA-0.014
 IEEE-0.013
bee-0.012
ulus-0.012
Token intellig
Token ID10878
Feature activation-4.84e-01

Pos logit contributions
drivers+0.067
AFTA+0.059
 IEEE+0.052
bee+0.048
ulus+0.045
Neg logit contributions
vg-0.502
irst-0.485
ANK-0.431
iller-0.428
(--0.427
Tokenents
Token ID658
Feature activation-5.97e-02

Pos logit contributions
drivers+0.296
AFTA+0.286
 IEEE+0.277
bee+0.272
ulus+0.267
Neg logit contributions
vg-0.431
irst-0.410
ANK-0.341
iller-0.336
(--0.335
Tokenia
Token ID544
Feature activation-1.22e-01

Pos logit contributions
drivers+0.034
AFTA+0.032
 IEEE+0.031
bee+0.031
ulus+0.030
Neg logit contributions
vg-0.056
irst-0.053
ANK-0.045
iller-0.044
(--0.044
Token forged
Token ID24918
Feature activation+1.64e-02

Pos logit contributions
drivers+0.086
AFTA+0.084
 IEEE+0.081
bee+0.080
ulus+0.079
Neg logit contributions
vg-0.097
irst-0.091
ANK-0.074
iller-0.073
(--0.073
Token in
Token ID287
Feature activation-1.57e-02

Pos logit contributions
vg+0.012
irst+0.012
ANK+0.009
iller+0.009
(-+0.009
Neg logit contributions
drivers-0.012
AFTA-0.012
 IEEE-0.012
bee-0.012
ulus-0.011
Token haste
Token ID36797
Feature activation-3.66e-02

Pos logit contributions
drivers+0.012
AFTA+0.012
 IEEE+0.011
bee+0.011
ulus+0.011
Neg logit contributions
vg-0.012
irst-0.011
ANK-0.009
iller-0.009
(--0.009
Token petty
Token ID25229
Feature activation-2.62e-02

Pos logit contributions
drivers+0.005
AFTA+0.005
 IEEE+0.004
bee+0.004
ulus+0.004
Neg logit contributions
vg-0.006
irst-0.005
ANK-0.004
iller-0.004
(--0.004
Token-
Token ID12
Feature activation+2.28e-02

Pos logit contributions
drivers+0.017
AFTA+0.017
 IEEE+0.016
bee+0.016
ulus+0.016
Neg logit contributions
vg-0.022
irst-0.021
ANK-0.017
iller-0.017
(--0.017
Tokenbour
Token ID6084
Feature activation+1.53e-01

Pos logit contributions
vg+0.015
irst+0.014
ANK+0.011
(-+0.011
iller+0.011
Neg logit contributions
drivers-0.019
AFTA-0.018
 IEEE-0.018
bee-0.018
ulus-0.018
Tokenge
Token ID469
Feature activation+7.65e-02

Pos logit contributions
vg+0.108
irst+0.101
ANK+0.080
iller+0.078
(-+0.078
Neg logit contributions
drivers-0.122
AFTA-0.118
 IEEE-0.116
bee-0.114
ulus-0.113
Tokenois
Token ID10924
Feature activation-3.84e-01

Pos logit contributions
vg+0.098
irst+0.095
ANK+0.084
iller+0.083
(-+0.083
Neg logit contributions
drivers-0.017
AFTA-0.015
 IEEE-0.014
bee-0.013
ulus-0.012
Token intellig
Token ID10878
Feature activation-4.83e-01

Pos logit contributions
drivers+0.274
AFTA+0.265
 IEEE+0.259
bee+0.254
ulus+0.251
Neg logit contributions
vg-0.303
irst-0.286
ANK-0.232
iller-0.228
(--0.227
Tokenents
Token ID658
Feature activation-6.89e-02

Pos logit contributions
drivers+0.298
AFTA+0.288
 IEEE+0.279
bee+0.273
ulus+0.268
Neg logit contributions
vg-0.430
irst-0.409
ANK-0.339
iller-0.336
(--0.333
Tokenia
Token ID544
Feature activation-1.12e-01

Pos logit contributions
drivers+0.039
AFTA+0.038
 IEEE+0.036
bee+0.036
ulus+0.035
Neg logit contributions
vg-0.065
irst-0.061
ANK-0.052
iller-0.051
(--0.051
Token could
Token ID714
Feature activation+1.25e-01

Pos logit contributions
drivers+0.082
AFTA+0.080
 IEEE+0.078
bee+0.077
ulus+0.076
Neg logit contributions
vg-0.086
irst-0.081
ANK-0.065
iller-0.064
(--0.064
Token vind
Token ID29178
Feature activation-3.56e-02

Pos logit contributions
vg+0.093
irst+0.088
ANK+0.070
iller+0.068
(-+0.068
Neg logit contributions
drivers-0.094
AFTA-0.091
 IEEE-0.089
bee-0.088
ulus-0.087
Tokenicate
Token ID5344
Feature activation+1.11e-01

Pos logit contributions
drivers+0.030
AFTA+0.029
 IEEE+0.028
bee+0.028
ulus+0.027
Neg logit contributions
vg-0.024
irst-0.023
ANK-0.017
iller-0.017
(--0.017
Token.
Token ID13
Feature activation-2.28e-03

Pos logit contributions
vg+0.014
irst+0.013
ANK+0.011
(-+0.010
iller+0.010
Neg logit contributions
drivers-0.014
AFTA-0.014
 IEEE-0.014
bee-0.013
ulus-0.013
Token
Token ID564
Feature activation-1.99e-03

Pos logit contributions
drivers+0.002
AFTA+0.002
 IEEE+0.002
bee+0.002
ulus+0.002
Neg logit contributions
vg-0.002
irst-0.002
ANK-0.001
iller-0.001
(--0.001
Token
Token ID250
Feature activation-1.20e-02

Pos logit contributions
drivers+0.002
AFTA+0.002
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
iller-0.001
(--0.001
TokenMr
Token ID5246
Feature activation+1.33e-03

Pos logit contributions
drivers+0.009
AFTA+0.008
 IEEE+0.008
bee+0.008
ulus+0.008
Neg logit contributions
vg-0.009
irst-0.009
ANK-0.007
iller-0.007
(--0.007
Token.
Token ID13
Feature activation-2.96e-03

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
Token Walton
Token ID34468
Feature activation-4.75e-01

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.004
irst-0.004
ANK-0.003
iller-0.003
(--0.003
Token
Token ID447
Feature activation-1.25e-02

Pos logit contributions
drivers+0.339
AFTA+0.328
 IEEE+0.321
bee+0.318
ulus+0.312
Neg logit contributions
vg-0.375
irst-0.353
ANK-0.287
iller-0.280
(--0.278
Token
Token ID247
Feature activation+3.65e-03

Pos logit contributions
drivers+0.009
AFTA+0.009
 IEEE+0.009
bee+0.009
ulus+0.008
Neg logit contributions
vg-0.010
irst-0.009
ANK-0.007
iller-0.007
(--0.007
Tokens
Token ID82
Feature activation+1.20e-02

Pos logit contributions
vg+0.003
irst+0.003
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.003
AFTA-0.003
 IEEE-0.003
bee-0.002
ulus-0.002
Token physical
Token ID3518
Feature activation-5.19e-03

Pos logit contributions
vg+0.010
irst+0.009
ANK+0.007
iller+0.007
(-+0.007
Neg logit contributions
drivers-0.008
AFTA-0.008
 IEEE-0.008
bee-0.008
ulus-0.008
Token contact
Token ID2800
Feature activation-1.38e-01

Pos logit contributions
drivers+0.004
AFTA+0.003
 IEEE+0.003
bee+0.003
ulus+0.003
Neg logit contributions
vg-0.004
irst-0.004
ANK-0.003
iller-0.003
(--0.003
Token Zub
Token ID47828
Feature activation+1.30e-01

Pos logit contributions
vg+0.021
irst+0.020
ANK+0.016
iller+0.015
(-+0.015
Neg logit contributions
drivers-0.020
AFTA-0.019
 IEEE-0.019
bee-0.018
ulus-0.018
Tokenair
Token ID958
Feature activation+1.70e-01

Pos logit contributions
vg+0.096
irst+0.090
ANK+0.072
iller+0.070
(-+0.070
Neg logit contributions
drivers-0.099
AFTA-0.097
 IEEE-0.094
bee-0.093
ulus-0.092
Token Bl
Token ID1086
Feature activation-1.52e-02

Pos logit contributions
vg+0.134
irst+0.126
ANK+0.102
iller+0.100
(-+0.100
Neg logit contributions
drivers-0.122
AFTA-0.118
 IEEE-0.115
bee-0.113
ulus-0.112
Tokenocked
Token ID3543
Feature activation-4.99e-02

Pos logit contributions
drivers+0.011
AFTA+0.011
 IEEE+0.011
bee+0.011
ulus+0.010
Neg logit contributions
vg-0.012
irst-0.011
ANK-0.009
iller-0.009
(--0.009
Token Un
Token ID791
Feature activation-1.00e-01

Pos logit contributions
drivers+0.036
AFTA+0.035
 IEEE+0.034
bee+0.034
ulus+0.033
Neg logit contributions
vg-0.039
irst-0.037
ANK-0.030
iller-0.029
(--0.029
Tokenblock
Token ID9967
Feature activation-4.74e-01

Pos logit contributions
drivers+0.064
AFTA+0.062
 IEEE+0.060
bee+0.059
ulus+0.058
Neg logit contributions
vg-0.087
irst-0.082
ANK-0.068
iller-0.067
(--0.067
Token Follow
Token ID7281
Feature activation-5.78e-02

Pos logit contributions
drivers+0.439
AFTA+0.428
 IEEE+0.421
bee+0.416
ulus+0.411
Neg logit contributions
vg-0.274
irst-0.253
ANK-0.186
iller-0.180
(--0.179
Token Following
Token ID14207
Feature activation-1.08e-01

Pos logit contributions
drivers+0.041
AFTA+0.039
 IEEE+0.038
bee+0.038
ulus+0.037
Neg logit contributions
vg-0.046
irst-0.044
ANK-0.036
iller-0.035
(--0.035
Token Nov
Token ID5267
Feature activation-4.73e-02

Pos logit contributions
drivers+0.077
AFTA+0.075
 IEEE+0.073
bee+0.072
ulus+0.071
Neg logit contributions
vg-0.085
irst-0.081
ANK-0.065
iller-0.064
(--0.064
Token 21
Token ID2310
Feature activation-1.08e-02

Pos logit contributions
drivers+0.035
AFTA+0.034
 IEEE+0.033
bee+0.033
ratulations+0.032
Neg logit contributions
vg-0.036
irst-0.034
ANK-0.027
iller-0.027
(--0.027
Token,
Token ID11
Feature activation-2.04e-02

Pos logit contributions
drivers+0.008
AFTA+0.008
 IEEE+0.008
bee+0.007
ulus+0.007
Neg logit contributions
vg-0.008
irst-0.008
ANK-0.006
iller-0.006
(--0.006
Token Chicago
Token ID4842
Feature activation+1.22e-01

Pos logit contributions
drivers+0.032
AFTA+0.031
 IEEE+0.030
bee+0.029
ulus+0.029
Neg logit contributions
vg-0.037
irst-0.035
ANK-0.029
iller-0.028
(--0.028
Token-
Token ID12
Feature activation+1.45e-02

Pos logit contributions
vg+0.095
irst+0.089
ANK+0.072
iller+0.070
(-+0.070
Neg logit contributions
drivers-0.089
AFTA-0.086
 IEEE-0.084
bee-0.083
ulus-0.082
Tokenbased
Token ID3106
Feature activation-1.92e-01

Pos logit contributions
vg+0.015
irst+0.015
ANK+0.013
iller+0.012
(-+0.012
Neg logit contributions
drivers-0.007
AFTA-0.006
 IEEE-0.006
bee-0.006
ulus-0.006
Token he
Token ID339
Feature activation-7.48e-02

Pos logit contributions
drivers+0.137
AFTA+0.132
 IEEE+0.129
bee+0.127
ulus+0.125
Neg logit contributions
vg-0.151
irst-0.143
ANK-0.116
iller-0.113
(--0.113
Tokeni
Token ID72
Feature activation+1.40e-02

Pos logit contributions
drivers+0.056
AFTA+0.055
 IEEE+0.053
bee+0.053
ulus+0.052
Neg logit contributions
vg-0.056
irst-0.053
ANK-0.042
iller-0.041
(--0.041
Tokenress
Token ID601
Feature activation-4.72e-01

Pos logit contributions
vg+0.019
irst+0.018
ANK+0.016
iller+0.016
(-+0.016
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.001
ulus-0.001
Token Beat
Token ID12568
Feature activation-2.27e-01

Pos logit contributions
drivers+0.347
AFTA+0.336
 IEEE+0.328
bee+0.324
ulus+0.319
Neg logit contributions
vg-0.362
irst-0.342
ANK-0.275
iller-0.269
(--0.268
Tokenrice
Token ID20970
Feature activation-4.06e-02

Pos logit contributions
drivers+0.152
AFTA+0.147
 IEEE+0.143
bee+0.140
ratulations+0.138
Neg logit contributions
vg-0.190
irst-0.180
ANK-0.148
iller-0.145
(--0.145
Token Joyce
Token ID25936
Feature activation+1.72e-01

Pos logit contributions
drivers+0.030
AFTA+0.029
 IEEE+0.028
bee+0.028
ulus+0.028
Neg logit contributions
vg-0.031
irst-0.029
ANK-0.024
iller-0.023
(--0.023
Token Ke
Token ID3873
Feature activation+2.52e-02

Pos logit contributions
vg+0.131
irst+0.124
ANK+0.100
iller+0.097
(-+0.097
Neg logit contributions
drivers-0.126
AFTA-0.123
 IEEE-0.120
bee-0.118
ulus-0.116
Tokenan
Token ID272
Feature activation+8.44e-02

Pos logit contributions
vg+0.019
irst+0.018
ANK+0.015
iller+0.014
(-+0.014
Neg logit contributions
drivers-0.019
AFTA-0.018
 IEEE-0.018
bee-0.017
ulus-0.017
Token pages
Token ID5468
Feature activation-4.92e-02

Pos logit contributions
drivers+0.036
AFTA+0.035
 IEEE+0.034
bee+0.034
ulus+0.033
Neg logit contributions
vg-0.047
irst-0.044
ANK-0.037
iller-0.036
(--0.036
Token.
Token ID13
Feature activation-3.98e-03

Pos logit contributions
drivers+0.036
AFTA+0.035
 IEEE+0.034
bee+0.034
ulus+0.033
Neg logit contributions
vg-0.038
irst-0.036
ANK-0.029
iller-0.028
(--0.028
Token Join
Token ID15251
Feature activation-3.48e-02

Pos logit contributions
drivers+0.003
AFTA+0.003
 IEEE+0.003
bee+0.003
ulus+0.003
Neg logit contributions
vg-0.003
irst-0.003
ANK-0.002
iller-0.002
(--0.002
Token the
Token ID262
Feature activation+7.82e-04

Pos logit contributions
drivers+0.025
AFTA+0.024
 IEEE+0.024
bee+0.024
ulus+0.023
Neg logit contributions
vg-0.027
irst-0.026
ANK-0.021
iller-0.020
(--0.020
Token revolution
Token ID5854
Feature activation-2.34e-01

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.000
iller+0.000
(-+0.000
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
Token TODAY
Token ID20092
Feature activation+3.14e-01

Pos logit contributions
drivers+0.172
AFTA+0.167
 IEEE+0.163
bee+0.160
ulus+0.158
Neg logit contributions
vg-0.179
irst-0.169
ANK-0.136
iller-0.133
(--0.132
Token!
Token ID0
Feature activation+3.97e-02

Pos logit contributions
vg+0.235
irst+0.221
ANK+0.177
iller+0.173
(-+0.173
Neg logit contributions
drivers-0.237
AFTA-0.230
 IEEE-0.224
bee-0.221
ulus-0.218
Token It
Token ID632
Feature activation-3.83e-02

Pos logit contributions
vg+0.029
irst+0.027
ANK+0.021
iller+0.021
(-+0.021
Neg logit contributions
drivers-0.031
AFTA-0.030
 IEEE-0.029
bee-0.029
ulus-0.029
Token
Token ID447
Feature activation-8.18e-03

Pos logit contributions
drivers+0.027
AFTA+0.026
 IEEE+0.025
bee+0.025
ulus+0.025
Neg logit contributions
vg-0.031
irst-0.029
ANK-0.024
iller-0.023
(--0.023
Token
Token ID247
Feature activation+1.55e-03

Pos logit contributions
drivers+0.006
AFTA+0.006
 IEEE+0.005
bee+0.005
ulus+0.005
Neg logit contributions
vg-0.007
irst-0.006
ANK-0.005
iller-0.005
(--0.005
Tokens
Token ID82
Feature activation+1.07e-02

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
Tokenhen
Token ID831
Feature activation-1.00e-01

Pos logit contributions
vg+0.000
irst+0.000
ANK+0.000
iller+0.000
(-+0.000
Neg logit contributions
drivers-0.000
AFTA-0.000
 IEEE-0.000
bee-0.000
ulus-0.000
Tokenry
Token ID563
Feature activation+4.63e-04

Pos logit contributions
drivers+0.073
AFTA+0.070
 IEEE+0.069
bee+0.068
ulus+0.067
Neg logit contributions
vg-0.078
irst-0.073
ANK-0.059
iller-0.058
(--0.058
Token.
Token ID13
Feature activation+3.30e-03

Pos logit contributions
vg+0.000
irst+0.000
ANK+0.000
iller+0.000
(-+0.000
Neg logit contributions
drivers-0.000
AFTA-0.000
 IEEE-0.000
bee-0.000
ulus-0.000
Tokenc
Token ID66
Feature activation+1.69e-01

Pos logit contributions
vg+0.003
irst+0.002
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
Tokenordes
Token ID35770
Feature activation-1.82e-01

Pos logit contributions
vg+0.136
irst+0.129
ANK+0.105
iller+0.103
(-+0.102
Neg logit contributions
drivers-0.118
AFTA-0.115
 IEEE-0.112
bee-0.109
ulus-0.108
Token@
Token ID31
Feature activation+3.84e-01

Pos logit contributions
drivers+0.167
AFTA+0.163
 IEEE+0.160
bee+0.159
ulus+0.157
Neg logit contributions
vg-0.106
irst-0.099
ANK-0.073
(--0.070
iller-0.070
Tokenow
Token ID322
Feature activation-3.31e-02

Pos logit contributions
vg+0.304
irst+0.287
ANK+0.232
iller+0.228
(-+0.227
Neg logit contributions
drivers-0.273
AFTA-0.266
 IEEE-0.259
bee-0.253
ulus-0.250
Tokenh
Token ID71
Feature activation-1.85e-01

Pos logit contributions
drivers+0.025
AFTA+0.025
 IEEE+0.024
bee+0.023
ratulations+0.023
Neg logit contributions
vg-0.025
irst-0.024
ANK-0.019
iller-0.018
(--0.018
Token.
Token ID13
Feature activation-1.85e-03

Pos logit contributions
drivers+0.145
AFTA+0.141
 IEEE+0.137
bee+0.136
ulus+0.134
Neg logit contributions
vg-0.134
irst-0.125
ANK-0.099
(--0.097
iller-0.096
Tokencom
Token ID785
Feature activation-9.88e-02

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
ratulations+0.001
bee+0.001
Neg logit contributions
vg-0.002
irst-0.002
ANK-0.001
zer-0.001
(--0.001
Token,
Token ID11
Feature activation-4.59e-03

Pos logit contributions
drivers+0.078
AFTA+0.076
 IEEE+0.075
bee+0.073
ulus+0.073
Neg logit contributions
vg-0.070
irst-0.066
ANK-0.052
iller-0.051
(--0.050
Token,
Token ID11
Feature activation-1.67e-02

Pos logit contributions
vg+0.032
irst+0.030
ANK+0.024
iller+0.024
(-+0.024
Neg logit contributions
drivers-0.031
AFTA-0.030
 IEEE-0.029
bee-0.029
ulus-0.029
Token but
Token ID475
Feature activation-6.43e-02

Pos logit contributions
drivers+0.012
AFTA+0.011
 IEEE+0.011
bee+0.011
ulus+0.010
Neg logit contributions
vg-0.014
irst-0.013
ANK-0.010
iller-0.010
(--0.010
Token less
Token ID1342
Feature activation-1.01e-02

Pos logit contributions
drivers+0.047
AFTA+0.046
 IEEE+0.045
bee+0.044
ulus+0.044
Neg logit contributions
vg-0.049
irst-0.046
ANK-0.037
(--0.036
iller-0.036
Token severe
Token ID6049
Feature activation+3.17e-03

Pos logit contributions
drivers+0.008
AFTA+0.008
 IEEE+0.008
bee+0.008
ulus+0.008
Neg logit contributions
vg-0.007
irst-0.006
ANK-0.005
iller-0.005
(--0.005
Token than
Token ID621
Feature activation+1.32e-01

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
Token Chern
Token ID37243
Feature activation+3.45e-01

Pos logit contributions
vg+0.099
irst+0.093
ANK+0.074
iller+0.073
(-+0.072
Neg logit contributions
drivers-0.100
AFTA-0.097
 IEEE-0.095
bee-0.094
ulus-0.093
Tokenobyl
Token ID46666
Feature activation-1.39e-01

Pos logit contributions
vg+0.351
irst+0.336
ANK+0.287
iller+0.283
(-+0.282
Neg logit contributions
drivers-0.168
AFTA-0.161
 IEEE-0.155
bee-0.151
ulus-0.147
Token.
Token ID13
Feature activation-1.74e-02

Pos logit contributions
drivers+0.101
AFTA+0.098
 IEEE+0.095
bee+0.094
ulus+0.093
Neg logit contributions
vg-0.108
irst-0.102
ANK-0.082
iller-0.080
(--0.080
Token
Token ID198
Feature activation-1.13e-02

Pos logit contributions
drivers+0.013
AFTA+0.013
 IEEE+0.012
bee+0.012
ulus+0.012
Neg logit contributions
vg-0.013
irst-0.012
ANK-0.010
iller-0.010
(--0.010
Token
Token ID198
Feature activation-7.79e-03

Pos logit contributions
drivers+0.008
AFTA+0.008
 IEEE+0.008
bee+0.008
ulus+0.008
Neg logit contributions
vg-0.009
irst-0.008
ANK-0.006
iller-0.006
(--0.006
TokenA
Token ID32
Feature activation-8.32e-04

Pos logit contributions
drivers+0.005
AFTA+0.005
 IEEE+0.005
bee+0.005
ulus+0.005
Neg logit contributions
vg-0.006
irst-0.006
ANK-0.005
iller-0.005
(--0.005
Token
Token ID198
Feature activation-1.23e-03

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
TokenPost
Token ID6307
Feature activation-1.78e-02

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
iller-0.001
(--0.001
Token-
Token ID12
Feature activation+2.69e-02

Pos logit contributions
drivers+0.014
AFTA+0.013
 IEEE+0.013
bee+0.013
ulus+0.012
Neg logit contributions
vg-0.013
irst-0.012
ANK-0.010
iller-0.010
(--0.010
Tokenp
Token ID79
Feature activation-1.54e-01

Pos logit contributions
vg+0.027
irst+0.026
ANK+0.022
iller+0.021
(-+0.021
Neg logit contributions
drivers-0.014
AFTA-0.013
 IEEE-0.013
bee-0.012
ulus-0.012
Tokenand
Token ID392
Feature activation+2.67e-01

Pos logit contributions
drivers+0.131
AFTA+0.128
 IEEE+0.125
bee+0.124
ulus+0.123
Neg logit contributions
vg-0.100
irst-0.093
ANK-0.072
(--0.070
iller-0.069
Tokenemic
Token ID5314
Feature activation+3.54e-01

Pos logit contributions
vg+0.099
irst+0.086
ANK+0.050
(-+0.046
iller+0.046
Neg logit contributions
drivers-0.302
AFTA-0.296
 IEEE-0.291
bee-0.289
ulus-0.287
Token:
Token ID25
Feature activation+2.00e-02

Pos logit contributions
vg+0.272
irst+0.257
ANK+0.207
iller+0.203
(-+0.202
Neg logit contributions
drivers-0.260
AFTA-0.252
 IEEE-0.245
bee-0.242
ulus-0.239
Token activity
Token ID3842
Feature activation-3.24e-02

Pos logit contributions
vg+0.015
irst+0.015
ANK+0.012
iller+0.011
(-+0.011
Neg logit contributions
drivers-0.015
AFTA-0.014
 IEEE-0.014
bee-0.014
ulus-0.013
Token returns
Token ID5860
Feature activation-1.53e-02

Pos logit contributions
drivers+0.020
AFTA+0.019
 IEEE+0.019
bee+0.018
ulus+0.018
Neg logit contributions
vg-0.029
irst-0.027
ANK-0.023
iller-0.022
(--0.022
Token to
Token ID284
Feature activation+6.49e-04

Pos logit contributions
drivers+0.011
AFTA+0.011
 IEEE+0.011
bee+0.010
ulus+0.010
Neg logit contributions
vg-0.012
irst-0.011
ANK-0.009
iller-0.009
(--0.009
Token normal
Token ID3487
Feature activation+5.39e-02

Pos logit contributions
vg+0.001
irst+0.000
ANK+0.000
iller+0.000
(-+0.000
Neg logit contributions
drivers-0.000
AFTA-0.000
 IEEE-0.000
bee-0.000
ulus-0.000
Token adapt
Token ID6068
Feature activation-9.93e-03

Pos logit contributions
vg+0.028
irst+0.026
ANK+0.021
iller+0.021
(-+0.021
Neg logit contributions
drivers-0.026
AFTA-0.025
 IEEE-0.025
bee-0.024
ulus-0.024
Tokenable
Token ID540
Feature activation+2.52e-01

Pos logit contributions
drivers+0.010
AFTA+0.010
 IEEE+0.010
bee+0.009
ulus+0.009
Neg logit contributions
vg-0.005
irst-0.005
ANK-0.003
iller-0.003
(--0.003
Token,
Token ID11
Feature activation-9.51e-03

Pos logit contributions
vg+0.202
irst+0.190
ANK+0.155
iller+0.152
(-+0.151
Neg logit contributions
drivers-0.177
AFTA-0.172
 IEEE-0.167
bee-0.165
ulus-0.163
Token decentral
Token ID20649
Feature activation-5.70e-02

Pos logit contributions
drivers+0.007
AFTA+0.007
 IEEE+0.006
bee+0.006
ulus+0.006
Neg logit contributions
vg-0.008
irst-0.007
ANK-0.006
iller-0.006
(--0.006
Tokenised
Token ID1417
Feature activation+1.69e-01

Pos logit contributions
drivers+0.054
AFTA+0.053
 IEEE+0.052
bee+0.051
ulus+0.051
Neg logit contributions
vg-0.032
irst-0.029
ANK-0.021
iller-0.020
(--0.020
Token tool
Token ID2891
Feature activation+3.15e-01

Pos logit contributions
vg+0.131
irst+0.124
ANK+0.100
iller+0.098
(-+0.097
Neg logit contributions
drivers-0.124
AFTA-0.120
 IEEE-0.117
bee-0.115
ulus-0.114
Token that
Token ID326
Feature activation+1.16e-02

Pos logit contributions
vg+0.247
irst+0.233
ANK+0.189
iller+0.185
(-+0.184
Neg logit contributions
drivers-0.226
AFTA-0.220
 IEEE-0.214
bee-0.211
ratulations-0.208
Token is
Token ID318
Feature activation+2.16e-02

Pos logit contributions
vg+0.009
irst+0.008
ANK+0.007
iller+0.007
(-+0.007
Neg logit contributions
drivers-0.009
AFTA-0.008
 IEEE-0.008
bee-0.008
ulus-0.008
Token the
Token ID262
Feature activation-7.34e-03

Pos logit contributions
vg+0.016
irst+0.015
ANK+0.012
iller+0.012
(-+0.012
Neg logit contributions
drivers-0.016
AFTA-0.016
 IEEE-0.016
bee-0.015
ulus-0.015
Token English
Token ID3594
Feature activation-1.18e-01

Pos logit contributions
drivers+0.005
AFTA+0.005
 IEEE+0.005
bee+0.005
ulus+0.005
Neg logit contributions
vg-0.006
irst-0.006
ANK-0.005
iller-0.004
(--0.004
Token language
Token ID3303
Feature activation+4.69e-02

Pos logit contributions
drivers+0.089
AFTA+0.087
 IEEE+0.085
bee+0.084
ulus+0.082
Neg logit contributions
vg-0.088
irst-0.083
ANK-0.066
iller-0.065
(--0.065
Token
Token ID198
Feature activation-3.11e-03

Pos logit contributions
drivers+0.156
AFTA+0.152
 IEEE+0.148
bee+0.146
ulus+0.144
Neg logit contributions
vg-0.157
irst-0.148
ANK-0.119
(--0.116
iller-0.116
Token
Token ID198
Feature activation-7.18e-03

Pos logit contributions
drivers+0.002
AFTA+0.002
 IEEE+0.002
bee+0.002
ulus+0.002
Neg logit contributions
vg-0.002
irst-0.002
ANK-0.002
iller-0.002
(--0.002
TokenThe
Token ID464
Feature activation-4.53e-03

Pos logit contributions
drivers+0.005
AFTA+0.005
 IEEE+0.005
bee+0.004
ratulations+0.004
Neg logit contributions
vg-0.006
irst-0.006
ANK-0.005
(--0.005
iller-0.005
Token next
Token ID1306
Feature activation-2.09e-01

Pos logit contributions
drivers+0.003
AFTA+0.003
 IEEE+0.003
bee+0.003
ulus+0.003
Neg logit contributions
vg-0.003
irst-0.003
ANK-0.003
iller-0.003
(--0.003
Token round
Token ID2835
Feature activation-2.24e-01

Pos logit contributions
drivers+0.160
AFTA+0.155
 IEEE+0.152
bee+0.149
ulus+0.147
Neg logit contributions
vg-0.155
irst-0.145
ANK-0.116
iller-0.113
(--0.113
Token was
Token ID373
Feature activation+3.37e-02

Pos logit contributions
drivers+0.166
AFTA+0.161
 IEEE+0.157
bee+0.154
ulus+0.152
Neg logit contributions
vg-0.171
irst-0.161
ANK-0.129
iller-0.126
(--0.126
Token a
Token ID257
Feature activation+7.17e-03

Pos logit contributions
vg+0.026
irst+0.025
ANK+0.020
iller+0.020
(-+0.020
Neg logit contributions
drivers-0.024
AFTA-0.024
 IEEE-0.023
bee-0.023
ulus-0.022
Token coding
Token ID19617
Feature activation-8.24e-02

Pos logit contributions
vg+0.006
irst+0.005
ANK+0.004
iller+0.004
(-+0.004
Neg logit contributions
drivers-0.005
AFTA-0.005
 IEEE-0.005
bee-0.005
ulus-0.005
Token phone
Token ID3072
Feature activation+1.29e-01

Pos logit contributions
drivers+0.058
AFTA+0.056
 IEEE+0.054
bee+0.053
ulus+0.053
Neg logit contributions
vg-0.066
irst-0.063
ANK-0.051
(--0.050
iller-0.050
Token interview
Token ID2720
Feature activation+1.51e-01

Pos logit contributions
vg+0.081
irst+0.075
ANK+0.057
iller+0.055
(-+0.055
Neg logit contributions
drivers-0.113
AFTA-0.110
 IEEE-0.108
bee-0.106
ulus-0.105
Token.
Token ID13
Feature activation-9.12e-03

Pos logit contributions
vg+0.114
irst+0.107
ANK+0.086
iller+0.084
(-+0.084
Neg logit contributions
drivers-0.114
AFTA-0.111
 IEEE-0.108
bee-0.106
ulus-0.105
Token each
Token ID1123
Feature activation+1.44e-02

Pos logit contributions
vg+0.169
irst+0.159
ANK+0.129
(-+0.127
iller+0.127
Neg logit contributions
drivers-0.153
AFTA-0.149
 IEEE-0.144
bee-0.143
ulus-0.141
Token other
Token ID584
Feature activation-2.07e-01

Pos logit contributions
vg+0.015
irst+0.014
ANK+0.012
iller+0.012
(-+0.012
Neg logit contributions
drivers-0.007
AFTA-0.007
 IEEE-0.006
bee-0.006
ulus-0.006
Token in
Token ID287
Feature activation-1.25e-02

Pos logit contributions
drivers+0.151
AFTA+0.146
 IEEE+0.143
bee+0.141
ulus+0.139
Neg logit contributions
vg-0.160
irst-0.151
ANK-0.122
iller-0.119
(--0.119
Token a
Token ID257
Feature activation+6.05e-03

Pos logit contributions
drivers+0.009
AFTA+0.009
 IEEE+0.009
bee+0.009
ratulations+0.009
Neg logit contributions
vg-0.010
irst-0.009
ANK-0.007
(--0.007
iller-0.007
Token one
Token ID530
Feature activation-1.07e-01

Pos logit contributions
vg+0.005
irst+0.005
ANK+0.004
iller+0.004
(-+0.004
Neg logit contributions
drivers-0.004
AFTA-0.004
 IEEE-0.004
bee-0.004
ulus-0.004
Token-
Token ID12
Feature activation+2.37e-02

Pos logit contributions
drivers+0.083
AFTA+0.081
 IEEE+0.079
bee+0.078
ulus+0.077
Neg logit contributions
vg-0.078
irst-0.074
ANK-0.058
(--0.057
iller-0.057
Tokenon
Token ID261
Feature activation+2.25e-03

Pos logit contributions
vg+0.020
irst+0.019
ANK+0.015
iller+0.015
(-+0.015
Neg logit contributions
drivers-0.016
AFTA-0.016
 IEEE-0.015
bee-0.015
ulus-0.015
Token-
Token ID12
Feature activation+2.59e-02

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
Tokenone
Token ID505
Feature activation-3.62e-02

Pos logit contributions
vg+0.022
irst+0.021
ANK+0.017
iller+0.017
(-+0.017
Neg logit contributions
drivers-0.017
AFTA-0.016
 IEEE-0.016
bee-0.016
ulus-0.015
Token,
Token ID11
Feature activation-5.83e-03

Pos logit contributions
drivers+0.025
AFTA+0.025
 IEEE+0.024
bee+0.023
ulus+0.023
Neg logit contributions
vg-0.029
irst-0.027
ANK-0.022
iller-0.022
(--0.022
Token single
Token ID2060
Feature activation-8.17e-02

Pos logit contributions
drivers+0.004
AFTA+0.004
 IEEE+0.004
bee+0.004
ulus+0.004
Neg logit contributions
vg-0.005
irst-0.004
ANK-0.003
iller-0.003
(--0.003
Token to
Token ID284
Feature activation-8.69e-03

Pos logit contributions
drivers+0.029
AFTA+0.028
 IEEE+0.027
bee+0.027
ulus+0.027
Neg logit contributions
vg-0.031
irst-0.029
ANK-0.024
iller-0.023
(--0.023
Token be
Token ID307
Feature activation-3.47e-02

Pos logit contributions
drivers+0.006
AFTA+0.006
 IEEE+0.006
bee+0.006
ulus+0.006
Neg logit contributions
vg-0.007
irst-0.006
ANK-0.005
iller-0.005
(--0.005
Token a
Token ID257
Feature activation+6.62e-03

Pos logit contributions
drivers+0.025
AFTA+0.024
 IEEE+0.024
bee+0.023
ulus+0.023
Neg logit contributions
vg-0.027
irst-0.026
ANK-0.021
iller-0.020
(--0.020
Token modern
Token ID3660
Feature activation+5.31e-02

Pos logit contributions
vg+0.005
irst+0.005
ANK+0.004
iller+0.004
(-+0.004
Neg logit contributions
drivers-0.005
AFTA-0.005
 IEEE-0.004
bee-0.004
ulus-0.004
Token states
Token ID2585
Feature activation-1.78e-01

Pos logit contributions
vg+0.042
irst+0.039
ANK+0.032
iller+0.031
(-+0.031
Neg logit contributions
drivers-0.038
AFTA-0.037
 IEEE-0.036
bee-0.036
ulus-0.035
Tokenman
Token ID805
Feature activation+2.42e-02

Pos logit contributions
drivers+0.124
AFTA+0.119
 IEEE+0.116
bee+0.116
ulus+0.112
Neg logit contributions
vg-0.144
irst-0.137
ANK-0.112
(--0.108
BuyableInstoreAndOnline-0.108
Token.
Token ID13
Feature activation-5.68e-03

Pos logit contributions
vg+0.018
irst+0.017
ANK+0.014
iller+0.014
(-+0.014
Neg logit contributions
drivers-0.018
AFTA-0.017
 IEEE-0.017
bee-0.017
ulus-0.016
Token
Token ID198
Feature activation+3.72e-03

Pos logit contributions
drivers+0.004
AFTA+0.004
 IEEE+0.004
bee+0.004
ulus+0.004
Neg logit contributions
vg-0.004
irst-0.004
ANK-0.003
iller-0.003
(--0.003
Token
Token ID198
Feature activation+1.28e-03

Pos logit contributions
vg+0.003
irst+0.003
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.003
AFTA-0.003
 IEEE-0.003
bee-0.003
ulus-0.003
TokenEnter
Token ID17469
Feature activation-3.30e-02

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
Token the
Token ID262
Feature activation-5.87e-03

Pos logit contributions
drivers+0.025
AFTA+0.024
 IEEE+0.024
bee+0.024
ulus+0.023
Neg logit contributions
vg-0.024
irst-0.023
ANK-0.018
iller-0.018
(--0.018
Token by
Token ID416
Feature activation+3.83e-03

Pos logit contributions
vg+0.091
irst+0.086
ANK+0.069
iller+0.068
(-+0.068
Neg logit contributions
drivers-0.084
AFTA-0.081
 IEEE-0.079
bee-0.078
ulus-0.077
Token his
Token ID465
Feature activation-3.28e-02

Pos logit contributions
vg+0.003
irst+0.003
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.003
AFTA-0.003
 IEEE-0.003
bee-0.003
ulus-0.003
Token character
Token ID2095
Feature activation-5.81e-02

Pos logit contributions
drivers+0.023
AFTA+0.023
 IEEE+0.022
bee+0.022
ulus+0.021
Neg logit contributions
vg-0.026
irst-0.024
ANK-0.020
iller-0.019
(--0.019
Token that
Token ID326
Feature activation+2.22e-02

Pos logit contributions
drivers+0.043
AFTA+0.042
 IEEE+0.041
bee+0.040
ulus+0.040
Neg logit contributions
vg-0.044
irst-0.042
ANK-0.033
iller-0.033
(--0.032
Token they
Token ID484
Feature activation+3.35e-02

Pos logit contributions
vg+0.017
irst+0.016
ANK+0.013
iller+0.013
(-+0.013
Neg logit contributions
drivers-0.016
AFTA-0.016
 IEEE-0.015
bee-0.015
ulus-0.015
Token failed
Token ID4054
Feature activation+9.78e-02

Pos logit contributions
vg+0.027
irst+0.026
ANK+0.021
iller+0.020
(-+0.020
Neg logit contributions
drivers-0.023
AFTA-0.023
 IEEE-0.022
bee-0.022
ulus-0.021
Token to
Token ID284
Feature activation-6.93e-03

Pos logit contributions
vg+0.077
irst+0.072
ANK+0.058
iller+0.057
(-+0.057
Neg logit contributions
drivers-0.070
AFTA-0.068
 IEEE-0.066
bee-0.065
ulus-0.065
Token name
Token ID1438
Feature activation-3.58e-02

Pos logit contributions
drivers+0.005
AFTA+0.005
 IEEE+0.005
bee+0.005
ulus+0.005
Neg logit contributions
vg-0.005
irst-0.005
ANK-0.004
iller-0.004
(--0.004
Token him
Token ID683
Feature activation+1.06e-02

Pos logit contributions
drivers+0.027
AFTA+0.026
 IEEE+0.025
bee+0.025
ulus+0.025
Neg logit contributions
vg-0.027
irst-0.025
ANK-0.020
iller-0.020
(--0.020
Token to
Token ID284
Feature activation-1.37e-03

Pos logit contributions
vg+0.008
irst+0.008
ANK+0.006
iller+0.006
(-+0.006
Neg logit contributions
drivers-0.008
AFTA-0.008
 IEEE-0.007
bee-0.007
ulus-0.007
Token their
Token ID511
Feature activation+5.61e-02

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
iller-0.001
(--0.001
Token passer
Token ID30286
Feature activation-1.05e-01

Pos logit contributions
drivers+0.029
AFTA+0.029
 IEEE+0.028
bee+0.027
ulus+0.027
Neg logit contributions
vg-0.032
irst-0.030
ANK-0.024
iller-0.024
(--0.024
Token rating
Token ID7955
Feature activation+2.63e-01

Pos logit contributions
drivers+0.112
AFTA+0.110
 IEEE+0.108
bee+0.107
ulus+0.106
Neg logit contributions
vg-0.046
irst-0.041
ANK-0.027
(--0.025
iller-0.025
Token against
Token ID1028
Feature activation+2.18e-01

Pos logit contributions
vg+0.177
irst+0.166
ANK+0.129
iller+0.126
(-+0.125
Neg logit contributions
drivers-0.217
AFTA-0.210
 IEEE-0.207
bee-0.204
ulus-0.202
Token the
Token ID262
Feature activation-9.62e-03

Pos logit contributions
vg+0.172
irst+0.163
ANK+0.132
iller+0.129
(-+0.129
Neg logit contributions
drivers-0.155
AFTA-0.150
 IEEE-0.146
bee-0.144
ulus-0.142
Token two
Token ID734
Feature activation+6.71e-02

Pos logit contributions
drivers+0.006
AFTA+0.006
 IEEE+0.006
bee+0.006
ulus+0.006
Neg logit contributions
vg-0.008
irst-0.008
ANK-0.006
iller-0.006
(--0.006
Token was
Token ID373
Feature activation+3.74e-02

Pos logit contributions
vg+0.051
irst+0.048
ANK+0.038
iller+0.037
(-+0.037
Neg logit contributions
drivers-0.050
AFTA-0.049
 IEEE-0.048
bee-0.047
ulus-0.046
Token 104
Token ID14436
Feature activation-7.17e-02

Pos logit contributions
vg+0.030
irst+0.028
ANK+0.023
iller+0.023
(-+0.022
Neg logit contributions
drivers-0.026
AFTA-0.025
 IEEE-0.025
bee-0.024
ulus-0.024
Token.
Token ID13
Feature activation-6.38e-03

Pos logit contributions
drivers+0.054
AFTA+0.052
 IEEE+0.051
bee+0.050
ulus+0.050
Neg logit contributions
vg-0.054
irst-0.051
ANK-0.041
iller-0.040
(--0.040
Token20
Token ID1238
Feature activation+4.52e-02

Pos logit contributions
drivers+0.005
AFTA+0.005
 IEEE+0.004
bee+0.004
ulus+0.004
Neg logit contributions
vg-0.005
irst-0.005
ANK-0.004
iller-0.004
(--0.004
Token and
Token ID290
Feature activation-1.07e-02

Pos logit contributions
vg+0.035
irst+0.033
ANK+0.026
iller+0.026
(-+0.026
Neg logit contributions
drivers-0.033
AFTA-0.032
 IEEE-0.031
bee-0.031
ulus-0.031
Token 111
Token ID13374
Feature activation-3.37e-02

Pos logit contributions
drivers+0.007
AFTA+0.007
 IEEE+0.007
bee+0.007
ulus+0.007
Neg logit contributions
vg-0.009
irst-0.008
ANK-0.007
iller-0.007
(--0.007
Token
Token ID198
Feature activation+9.25e-03

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
Token
Token ID447
Feature activation-1.56e-03

Pos logit contributions
vg+0.007
irst+0.007
ANK+0.006
iller+0.006
(-+0.006
Neg logit contributions
drivers-0.006
AFTA-0.006
 IEEE-0.006
bee-0.006
ulus-0.006
Token
Token ID250
Feature activation+2.06e-03

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
(--0.001
iller-0.001
TokenM
Token ID44
Feature activation+1.44e-01

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.001
iller+0.001
 decay+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
Tokenitch
Token ID2007
Feature activation-7.21e-02

Pos logit contributions
vg+0.123
irst+0.117
ANK+0.097
iller+0.095
(-+0.094
Neg logit contributions
drivers-0.094
AFTA-0.090
 IEEE-0.088
bee-0.086
ulus-0.085
Token has
Token ID468
Feature activation-7.70e-02

Pos logit contributions
drivers+0.052
AFTA+0.050
 IEEE+0.049
bee+0.048
ulus+0.048
Neg logit contributions
vg-0.057
irst-0.053
ANK-0.043
iller-0.042
(--0.042
Token progressed
Token ID28775
Feature activation+1.46e-01

Pos logit contributions
drivers+0.057
AFTA+0.055
 IEEE+0.054
bee+0.053
ulus+0.053
Neg logit contributions
vg-0.058
irst-0.055
ANK-0.044
iller-0.043
BuyableInstoreAndOnline-0.043
Token through
Token ID832
Feature activation-8.59e-03

Pos logit contributions
vg+0.105
irst+0.099
ANK+0.078
iller+0.076
BuyableInstoreAndOnline+0.076
Neg logit contributions
drivers-0.114
AFTA-0.111
 IEEE-0.109
bee-0.108
ulus-0.106
Token a
Token ID257
Feature activation+1.03e-02

Pos logit contributions
drivers+0.006
AFTA+0.006
 IEEE+0.006
bee+0.006
ulus+0.006
Neg logit contributions
vg-0.007
irst-0.006
ANK-0.005
iller-0.005
(--0.005
Token comprehensive
Token ID9815
Feature activation-2.00e-02

Pos logit contributions
vg+0.008
irst+0.008
ANK+0.006
iller+0.006
(-+0.006
Neg logit contributions
drivers-0.007
AFTA-0.007
 IEEE-0.007
bee-0.007
ulus-0.007
Token rehabilitation
Token ID23668
Feature activation+1.16e-01

Pos logit contributions
drivers+0.016
AFTA+0.015
 IEEE+0.015
bee+0.015
ulus+0.014
Neg logit contributions
vg-0.014
irst-0.014
ANK-0.011
iller-0.011
BuyableInstoreAndOnline-0.010
Token Of
Token ID3226
Feature activation+2.46e-03

Pos logit contributions
vg+0.006
irst+0.005
ANK+0.004
iller+0.004
(-+0.004
Neg logit contributions
drivers-0.006
AFTA-0.005
 IEEE-0.005
bee-0.005
ulus-0.005
Token course
Token ID1781
Feature activation-1.66e-02

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
Token,
Token ID11
Feature activation+1.05e-02

Pos logit contributions
drivers+0.012
AFTA+0.012
 IEEE+0.012
bee+0.011
ulus+0.011
Neg logit contributions
vg-0.013
irst-0.012
ANK-0.010
iller-0.010
(--0.010
Token I
Token ID314
Feature activation-2.95e-03

Pos logit contributions
vg+0.008
irst+0.008
ANK+0.006
iller+0.006
(-+0.006
Neg logit contributions
drivers-0.008
AFTA-0.007
 IEEE-0.007
bee-0.007
ulus-0.007
Token may
Token ID743
Feature activation-4.99e-02

Pos logit contributions
drivers+0.002
AFTA+0.002
 IEEE+0.002
bee+0.002
ulus+0.002
Neg logit contributions
vg-0.002
irst-0.002
ANK-0.002
iller-0.002
(--0.002
Token be
Token ID307
Feature activation-2.54e-02

Pos logit contributions
drivers+0.038
AFTA+0.037
 IEEE+0.037
bee+0.036
ulus+0.036
Neg logit contributions
vg-0.036
irst-0.034
ANK-0.027
iller-0.027
(--0.026
Token the
Token ID262
Feature activation+9.28e-03

Pos logit contributions
drivers+0.020
AFTA+0.019
 IEEE+0.019
bee+0.018
ulus+0.018
Neg logit contributions
vg-0.019
irst-0.017
ANK-0.014
iller-0.014
(--0.013
Token gateway
Token ID24308
Feature activation-8.63e-02

Pos logit contributions
vg+0.008
irst+0.007
ANK+0.006
iller+0.006
(-+0.006
Neg logit contributions
drivers-0.006
AFTA-0.006
 IEEE-0.006
bee-0.006
ulus-0.006
Token drug
Token ID2563
Feature activation+8.66e-02

Pos logit contributions
drivers+0.063
AFTA+0.061
 IEEE+0.060
bee+0.059
ulus+0.058
Neg logit contributions
vg-0.067
irst-0.063
ANK-0.051
iller-0.050
(--0.050
Token to
Token ID284
Feature activation-3.67e-04

Pos logit contributions
vg+0.067
irst+0.063
ANK+0.051
iller+0.050
(-+0.050
Neg logit contributions
drivers-0.063
AFTA-0.061
 IEEE-0.060
bee-0.059
ulus-0.058
Token opportunities
Token ID6443
Feature activation+1.42e-01

Pos logit contributions
drivers+0.000
AFTA+0.000
 IEEE+0.000
bee+0.000
ulus+0.000
Neg logit contributions
vg-0.000
irst-0.000
ANK-0.000
iller-0.000
(--0.000
Token health
Token ID1535
Feature activation+1.56e-01

Pos logit contributions
vg+0.029
irst+0.027
ANK+0.021
iller+0.020
(-+0.020
Neg logit contributions
drivers-0.040
AFTA-0.039
 IEEE-0.038
bee-0.038
ulus-0.037
Token,
Token ID11
Feature activation+5.37e-03

Pos logit contributions
vg+0.120
irst+0.113
ANK+0.091
iller+0.089
(-+0.089
Neg logit contributions
drivers-0.115
AFTA-0.112
 IEEE-0.109
bee-0.107
ulus-0.106
Token which
Token ID543
Feature activation+6.47e-02

Pos logit contributions
vg+0.004
irst+0.004
ANK+0.003
iller+0.003
(-+0.003
Neg logit contributions
drivers-0.004
AFTA-0.004
 IEEE-0.004
bee-0.004
ulus-0.004
Token is
Token ID318
Feature activation+3.16e-02

Pos logit contributions
vg+0.051
irst+0.048
ANK+0.039
iller+0.038
(-+0.038
Neg logit contributions
drivers-0.047
AFTA-0.045
 IEEE-0.044
bee-0.043
ulus-0.043
Token now
Token ID783
Feature activation-7.81e-02

Pos logit contributions
vg+0.025
irst+0.024
ANK+0.019
iller+0.019
(-+0.019
Neg logit contributions
drivers-0.023
AFTA-0.022
 IEEE-0.021
bee-0.021
ulus-0.021
Token a
Token ID257
Feature activation+1.18e-02

Pos logit contributions
drivers+0.056
AFTA+0.054
 IEEE+0.052
bee+0.052
ulus+0.051
Neg logit contributions
vg-0.062
irst-0.058
ANK-0.047
iller-0.046
(--0.046
Token management
Token ID4542
Feature activation+7.16e-02

Pos logit contributions
vg+0.010
irst+0.009
ANK+0.007
iller+0.007
(-+0.007
Neg logit contributions
drivers-0.008
AFTA-0.008
 IEEE-0.008
bee-0.008
ulus-0.007
Token priority
Token ID8475
Feature activation-8.00e-02

Pos logit contributions
vg+0.053
irst+0.049
ANK+0.039
iller+0.038
(-+0.038
Neg logit contributions
drivers-0.055
AFTA-0.053
 IEEE-0.052
bee-0.051
ulus-0.051
Token,
Token ID11
Feature activation+2.03e-03

Pos logit contributions
drivers+0.058
AFTA+0.056
 IEEE+0.055
bee+0.054
ulus+0.053
Neg logit contributions
vg-0.062
irst-0.058
ANK-0.047
iller-0.046
(--0.046
Token but
Token ID475
Feature activation-4.33e-02

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.001
iller+0.001
(-+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
Token that
Token ID326
Feature activation+1.59e-02

Pos logit contributions
drivers+0.032
AFTA+0.031
 IEEE+0.030
bee+0.030
ulus+0.029
Neg logit contributions
vg-0.033
irst-0.031
ANK-0.025
iller-0.025
(--0.024
Token
Token ID198
Feature activation+3.19e-03

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.000
iller+0.000
(-+0.000
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ulus-0.001
TokenAustralia
Token ID27429
Feature activation-1.37e-01

Pos logit contributions
vg+0.003
irst+0.003
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
Token has
Token ID468
Feature activation-8.14e-02

Pos logit contributions
drivers+0.096
AFTA+0.092
 IEEE+0.090
bee+0.089
ulus+0.087
Neg logit contributions
vg-0.111
irst-0.105
ANK-0.085
iller-0.083
(--0.083
Token six
Token ID2237
Feature activation+2.25e-02

Pos logit contributions
drivers+0.062
AFTA+0.060
 IEEE+0.059
bee+0.058
ulus+0.057
Neg logit contributions
vg-0.061
irst-0.057
ANK-0.046
iller-0.045
(--0.044
Token states
Token ID2585
Feature activation-1.81e-01

Pos logit contributions
vg+0.019
irst+0.018
ANK+0.014
iller+0.014
(-+0.014
Neg logit contributions
drivers-0.015
AFTA-0.015
 IEEE-0.014
bee-0.014
ulus-0.014
Token
Token ID851
Feature activation-8.47e-04

Pos logit contributions
drivers+0.132
AFTA+0.128
 IEEE+0.125
bee+0.123
ulus+0.121
Neg logit contributions
vg-0.140
irst-0.132
ANK-0.106
iller-0.104
(--0.104
Token New
Token ID968
Feature activation-1.23e-01

Pos logit contributions
drivers+0.001
AFTA+0.001
 IEEE+0.001
bee+0.001
ulus+0.001
Neg logit contributions
vg-0.001
irst-0.001
ANK-0.001
iller-0.001
(--0.001
Token South
Token ID2520
Feature activation+6.91e-02

Pos logit contributions
drivers+0.097
AFTA+0.094
 IEEE+0.092
bee+0.092
ulus+0.090
Neg logit contributions
vg-0.087
irst-0.082
ANK-0.065
iller-0.063
(--0.063
Token Wales
Token ID11769
Feature activation-5.53e-02

Pos logit contributions
vg+0.059
irst+0.056
ANK+0.046
iller+0.045
(-+0.045
Neg logit contributions
drivers-0.045
AFTA-0.044
 IEEE-0.042
bee-0.042
ulus-0.041
Token (
Token ID357
Feature activation+1.63e-02

Pos logit contributions
drivers+0.040
AFTA+0.039
 IEEE+0.038
bee+0.037
ulus+0.037
Neg logit contributions
vg-0.043
irst-0.041
ANK-0.033
iller-0.032
(--0.032
TokenNS
Token ID8035
Feature activation-1.64e-01

Pos logit contributions
vg+0.013
irst+0.012
ANK+0.010
iller+0.010
(-+0.010
Neg logit contributions
drivers-0.011
AFTA-0.011
 IEEE-0.011
bee-0.011
ulus-0.010
Token and
Token ID290
Feature activation-2.46e-02

Pos logit contributions
vg+0.043
irst+0.040
ANK+0.033
iller+0.032
(-+0.032
Neg logit contributions
drivers-0.040
AFTA-0.039
 IEEE-0.038
bee-0.037
ulus-0.037
Token they
Token ID484
Feature activation+2.89e-02

Pos logit contributions
drivers+0.018
AFTA+0.017
 IEEE+0.017
bee+0.017
ulus+0.016
Neg logit contributions
vg-0.019
irst-0.018
ANK-0.015
iller-0.014
(--0.014
Token didn
Token ID1422
Feature activation-1.43e-01

Pos logit contributions
vg+0.023
irst+0.022
ANK+0.018
iller+0.017
(-+0.017
Neg logit contributions
drivers-0.020
AFTA-0.020
 IEEE-0.019
bee-0.019
ulus-0.019
Token
Token ID447
Feature activation-1.15e-02

Pos logit contributions
drivers+0.083
AFTA+0.079
 IEEE+0.077
bee+0.076
ulus+0.074
Neg logit contributions
vg-0.132
irst-0.126
ANK-0.106
iller-0.104
(--0.104
Token
Token ID247
Feature activation+2.47e-03

Pos logit contributions
drivers+0.008
AFTA+0.008
 IEEE+0.008
bee+0.008
ulus+0.008
Neg logit contributions
vg-0.009
irst-0.009
ANK-0.007
iller-0.007
(--0.007
Tokent
Token ID83
Feature activation-5.34e-02

Pos logit contributions
vg+0.002
irst+0.002
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
Token really
Token ID1107
Feature activation-2.26e-02

Pos logit contributions
drivers+0.039
AFTA+0.038
 IEEE+0.037
bee+0.037
ulus+0.036
Neg logit contributions
vg-0.041
irst-0.038
ANK-0.031
BuyableInstoreAndOnline-0.031
iller-0.030
Token have
Token ID423
Feature activation-7.96e-02

Pos logit contributions
drivers+0.017
AFTA+0.016
 IEEE+0.016
bee+0.015
ulus+0.015
Neg logit contributions
vg-0.017
irst-0.016
ANK-0.013
iller-0.013
BuyableInstoreAndOnline-0.013
Token much
Token ID881
Feature activation+7.50e-02

Pos logit contributions
drivers+0.062
AFTA+0.060
 IEEE+0.059
bee+0.058
ulus+0.057
Neg logit contributions
vg-0.058
irst-0.054
ANK-0.043
iller-0.042
(--0.042
Token competition
Token ID5449
Feature activation-2.11e-01

Pos logit contributions
vg+0.060
irst+0.057
ANK+0.046
(-+0.045
iller+0.045
Neg logit contributions
drivers-0.053
AFTA-0.051
 IEEE-0.050
bee-0.049
ulus-0.048
Token for
Token ID329
Feature activation-2.51e-02

Pos logit contributions
drivers+0.158
AFTA+0.154
 IEEE+0.150
bee+0.147
ulus+0.145
Neg logit contributions
vg-0.160
irst-0.150
ANK-0.121
iller-0.118
(--0.118
Token its
Token ID663
Feature activation+2.49e-01

Pos logit contributions
vg+0.016
irst+0.015
ANK+0.012
iller+0.012
(-+0.012
Neg logit contributions
drivers-0.015
AFTA-0.015
 IEEE-0.015
bee-0.014
ulus-0.014
Token propriet
Token ID38090
Feature activation-8.77e-02

Pos logit contributions
vg+0.202
irst+0.191
ANK+0.156
iller+0.153
(-+0.153
Neg logit contributions
drivers-0.172
AFTA-0.167
 IEEE-0.163
bee-0.160
ulus-0.158
Tokenor
Token ID273
Feature activation+1.17e-01

Pos logit contributions
drivers+0.074
AFTA+0.073
 IEEE+0.071
bee+0.070
ulus+0.069
Neg logit contributions
vg-0.057
irst-0.054
ANK-0.041
iller-0.040
(--0.040
Token,
Token ID11
Feature activation+1.23e-02

Pos logit contributions
vg+0.090
irst+0.084
ANK+0.068
iller+0.066
(-+0.066
Neg logit contributions
drivers-0.087
AFTA-0.084
 IEEE-0.082
bee-0.081
ulus-0.080
Token alone
Token ID3436
Feature activation+3.80e-01

Pos logit contributions
vg+0.009
irst+0.009
ANK+0.007
iller+0.007
(-+0.007
Neg logit contributions
drivers-0.009
AFTA-0.009
 IEEE-0.009
bee-0.008
ulus-0.008
Tokenly
Token ID306
Feature activation-3.10e-01

Pos logit contributions
vg+0.293
irst+0.277
ANK+0.223
iller+0.219
(-+0.219
Neg logit contributions
drivers-0.277
AFTA-0.269
 IEEE-0.263
bee-0.258
ulus-0.255
Token young
Token ID1862
Feature activation-5.38e-02

Pos logit contributions
drivers+0.231
AFTA+0.224
 IEEE+0.219
bee+0.216
ulus+0.213
Neg logit contributions
vg-0.234
irst-0.221
ANK-0.177
iller-0.173
(--0.173
Token man
Token ID582
Feature activation+5.56e-02

Pos logit contributions
drivers+0.039
AFTA+0.038
 IEEE+0.037
bee+0.037
ulus+0.036
Neg logit contributions
vg-0.042
irst-0.039
ANK-0.032
iller-0.031
(--0.031
Token named
Token ID3706
Feature activation-7.54e-02

Pos logit contributions
vg+0.042
irst+0.040
ANK+0.032
(-+0.031
iller+0.031
Neg logit contributions
drivers-0.041
AFTA-0.040
 IEEE-0.039
bee-0.038
ulus-0.038
Token Norman
Token ID19636
Feature activation-6.31e-02

Pos logit contributions
drivers+0.057
AFTA+0.055
 IEEE+0.054
bee+0.053
ulus+0.052
Neg logit contributions
vg-0.057
irst-0.053
ANK-0.043
iller-0.042
(--0.042
Token Bates
Token ID35591
Feature activation-6.88e-02

Pos logit contributions
drivers+0.047
AFTA+0.045
 IEEE+0.044
bee+0.043
ulus+0.043
Neg logit contributions
vg-0.048
irst-0.045
ANK-0.037
(--0.036
iller-0.036
Token also
Token ID635
Feature activation+1.45e-02

Pos logit contributions
drivers+0.098
AFTA+0.095
 IEEE+0.092
bee+0.091
ulus+0.090
Neg logit contributions
vg-0.109
irst-0.103
ANK-0.084
iller-0.082
(--0.082
Token identified
Token ID5174
Feature activation-1.05e-01

Pos logit contributions
vg+0.011
irst+0.010
ANK+0.008
(-+0.008
iller+0.008
Neg logit contributions
drivers-0.011
AFTA-0.010
 IEEE-0.010
bee-0.010
ulus-0.010
Token with
Token ID351
Feature activation+1.37e-02

Pos logit contributions
drivers+0.090
AFTA+0.088
 IEEE+0.086
bee+0.085
ulus+0.084
Neg logit contributions
vg-0.067
irst-0.062
ANK-0.048
iller-0.046
(--0.046
Token the
Token ID262
Feature activation+7.38e-04

Pos logit contributions
vg+0.011
irst+0.010
ANK+0.008
(-+0.008
iller+0.008
Neg logit contributions
drivers-0.010
AFTA-0.010
 IEEE-0.009
bee-0.009
ratulations-0.009
Token middle
Token ID3504
Feature activation-1.34e-01

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.000
(-+0.000
iller+0.000
Neg logit contributions
drivers-0.000
AFTA-0.000
 IEEE-0.000
bee-0.000
ulus-0.000
Token class
Token ID1398
Feature activation-2.75e-01

Pos logit contributions
drivers+0.056
AFTA+0.053
 IEEE+0.050
bee+0.049
ulus+0.048
Neg logit contributions
vg-0.145
irst-0.140
ANK-0.121
(--0.119
iller-0.119
Token,
Token ID11
Feature activation+2.67e-04

Pos logit contributions
drivers+0.203
AFTA+0.197
 IEEE+0.192
bee+0.189
ulus+0.187
Neg logit contributions
vg-0.210
irst-0.199
ANK-0.160
(--0.157
iller-0.156
Token despite
Token ID3805
Feature activation-6.45e-02

Pos logit contributions
vg+0.000
irst+0.000
ANK+0.000
iller+0.000
(-+0.000
Neg logit contributions
drivers-0.000
AFTA-0.000
 IEEE-0.000
bee-0.000
ulus-0.000
Token not
Token ID407
Feature activation+1.04e-01

Pos logit contributions
drivers+0.048
AFTA+0.047
 IEEE+0.046
bee+0.045
ulus+0.044
Neg logit contributions
vg-0.049
irst-0.046
ANK-0.037
iller-0.036
(--0.036
Token necessarily
Token ID6646
Feature activation+6.77e-02

Pos logit contributions
vg+0.084
irst+0.079
ANK+0.065
iller+0.063
(-+0.063
Neg logit contributions
drivers-0.072
AFTA-0.070
 IEEE-0.068
bee-0.067
ulus-0.066
Token enjoying
Token ID13226
Feature activation+1.48e-01

Pos logit contributions
vg+0.053
irst+0.050
ANK+0.041
iller+0.040
(-+0.040
Neg logit contributions
drivers-0.048
AFTA-0.047
 IEEE-0.046
bee-0.045
ulus-0.044
Token Self
Token ID12189
Feature activation+1.13e-01

Pos logit contributions
vg+0.160
irst+0.151
ANK+0.122
(-+0.119
iller+0.119
Neg logit contributions
drivers-0.151
AFTA-0.146
 IEEE-0.142
bee-0.140
ulus-0.138
Tokenless
Token ID1203
Feature activation+2.18e-01

Pos logit contributions
vg+0.082
irst+0.077
ANK+0.061
iller+0.060
(-+0.060
Neg logit contributions
drivers-0.088
AFTA-0.086
 IEEE-0.084
bee-0.083
ulus-0.082
Token,
Token ID11
Feature activation-2.91e-03

Pos logit contributions
vg+0.169
irst+0.160
ANK+0.129
iller+0.126
(-+0.126
Neg logit contributions
drivers-0.159
AFTA-0.154
 IEEE-0.150
bee-0.148
ulus-0.146
Token Moral
Token ID39020
Feature activation+1.37e-02

Pos logit contributions
drivers+0.002
AFTA+0.002
 IEEE+0.002
bee+0.002
ulus+0.002
Neg logit contributions
vg-0.002
irst-0.002
ANK-0.002
iller-0.002
(--0.002
Token,
Token ID11
Feature activation-2.21e-05

Pos logit contributions
vg+0.010
irst+0.010
ANK+0.008
iller+0.008
(-+0.008
Neg logit contributions
drivers-0.010
AFTA-0.010
 IEEE-0.010
bee-0.010
ulus-0.010
Token Vigil
Token ID39840
Feature activation-3.54e-01

Pos logit contributions
drivers+0.000
AFTA+0.000
 IEEE+0.000
bee+0.000
ulus+0.000
Neg logit contributions
vg-0.000
irst-0.000
ANK-0.000
iller-0.000
(--0.000
Tokenant
Token ID415
Feature activation+3.89e-02

Pos logit contributions
drivers+0.251
AFTA+0.243
 IEEE+0.237
bee+0.233
ulus+0.230
Neg logit contributions
vg-0.282
irst-0.266
ANK-0.216
iller-0.212
(--0.212
Token
Token ID198
Feature activation+7.56e-03

Pos logit contributions
vg+0.030
irst+0.028
ANK+0.022
iller+0.022
(-+0.022
Neg logit contributions
drivers-0.029
AFTA-0.028
 IEEE-0.027
bee-0.027
ulus-0.027
Token
Token ID198
Feature activation+3.37e-03

Pos logit contributions
vg+0.006
irst+0.005
ANK+0.004
iller+0.004
(-+0.004
Neg logit contributions
drivers-0.006
AFTA-0.005
 IEEE-0.005
bee-0.005
ulus-0.005
Token-
Token ID12
Feature activation+2.53e-02

Pos logit contributions
vg+0.003
irst+0.002
ANK+0.002
iller+0.002
(-+0.002
Neg logit contributions
drivers-0.002
AFTA-0.002
 IEEE-0.002
bee-0.002
ulus-0.002
Token Cond
Token ID9724
Feature activation+1.73e-01

Pos logit contributions
vg+0.020
irst+0.019
ANK+0.015
iller+0.015
(-+0.015
Neg logit contributions
drivers-0.018
AFTA-0.018
 IEEE-0.017
bee-0.017
ulus-0.017
Tokenation
Token ID341
Feature activation-1.84e-02

Pos logit contributions
vg+0.094
irst+0.089
ANK+0.071
iller+0.070
(-+0.070
Neg logit contributions
drivers-0.092
AFTA-0.089
 IEEE-0.087
bee-0.085
ulus-0.084
Token,
Token ID11
Feature activation-6.79e-03

Pos logit contributions
drivers+0.013
AFTA+0.013
 IEEE+0.013
bee+0.013
ulus+0.012
Neg logit contributions
vg-0.014
irst-0.013
ANK-0.011
iller-0.011
(--0.011
Token the
Token ID262
Feature activation-1.42e-02

Pos logit contributions
drivers+0.005
AFTA+0.004
 IEEE+0.004
bee+0.004
ulus+0.004
Neg logit contributions
vg-0.006
irst-0.005
ANK-0.004
iller-0.004
(--0.004
Token shift
Token ID6482
Feature activation-2.22e-02

Pos logit contributions
drivers+0.010
AFTA+0.010
 IEEE+0.010
bee+0.010
ulus+0.010
Neg logit contributions
vg-0.011
irst-0.010
ANK-0.008
iller-0.008
(--0.008
Token in
Token ID287
Feature activation-1.87e-02

Pos logit contributions
drivers+0.016
AFTA+0.016
 IEEE+0.015
bee+0.015
ulus+0.015
Neg logit contributions
vg-0.017
irst-0.016
ANK-0.013
iller-0.013
(--0.013
Token demand
Token ID3512
Feature activation-2.65e-01

Pos logit contributions
drivers+0.013
AFTA+0.013
 IEEE+0.013
bee+0.012
ulus+0.012
Neg logit contributions
vg-0.015
irst-0.014
ANK-0.011
iller-0.011
(--0.011
Token for
Token ID329
Feature activation-3.16e-02

Pos logit contributions
drivers+0.192
AFTA+0.186
 IEEE+0.181
bee+0.179
ulus+0.176
Neg logit contributions
vg-0.206
irst-0.195
ANK-0.157
iller-0.154
(--0.154
Token cleaner
Token ID21723
Feature activation-6.30e-02

Pos logit contributions
drivers+0.021
AFTA+0.021
 IEEE+0.020
bee+0.020
ulus+0.019
Neg logit contributions
vg-0.026
irst-0.025
ANK-0.020
iller-0.020
(--0.020
Token,
Token ID11
Feature activation-9.51e-03

Pos logit contributions
drivers+0.046
AFTA+0.045
 IEEE+0.044
bee+0.043
ulus+0.042
Neg logit contributions
vg-0.049
irst-0.046
ANK-0.037
(--0.036
iller-0.036
Token cheaper
Token ID11721
Feature activation-8.58e-02

Pos logit contributions
drivers+0.006
AFTA+0.006
 IEEE+0.006
bee+0.006
ulus+0.006
Neg logit contributions
vg-0.008
irst-0.008
ANK-0.006
iller-0.006
(--0.006
Token energy
Token ID2568
Feature activation-7.74e-02

Pos logit contributions
drivers+0.062
AFTA+0.060
 IEEE+0.059
bee+0.058
ulus+0.057
Neg logit contributions
vg-0.067
irst-0.063
ANK-0.051
iller-0.050
(--0.050
Token
Token ID247
Feature activation-3.69e-03

Pos logit contributions
drivers+0.010
AFTA+0.009
 IEEE+0.009
bee+0.009
ulus+0.009
Neg logit contributions
vg-0.010
irst-0.009
ANK-0.007
iller-0.007
(--0.007
Tokent
Token ID83
Feature activation-6.34e-02

Pos logit contributions
drivers+0.003
AFTA+0.003
 IEEE+0.003
bee+0.002
ulus+0.002
Neg logit contributions
vg-0.003
irst-0.003
ANK-0.002
(--0.002
iller-0.002
Token actually
Token ID1682
Feature activation+4.82e-02

Pos logit contributions
drivers+0.049
AFTA+0.047
 IEEE+0.046
bee+0.046
ulus+0.045
Neg logit contributions
vg-0.046
irst-0.044
ANK-0.035
iller-0.034
(--0.034
Token
Token ID564
Feature activation-8.35e-03

Pos logit contributions
vg+0.037
irst+0.035
ANK+0.028
iller+0.028
(-+0.028
Neg logit contributions
drivers-0.035
AFTA-0.034
 IEEE-0.033
bee-0.033
ulus-0.032
Token
Token ID250
Feature activation-1.66e-02

Pos logit contributions
drivers+0.006
AFTA+0.006
 IEEE+0.006
bee+0.006
ulus+0.006
Neg logit contributions
vg-0.007
irst-0.006
ANK-0.005
iller-0.005
(--0.005
Tokenload
Token ID2220
Feature activation-3.00e-01

Pos logit contributions
drivers+0.012
AFTA+0.012
 IEEE+0.011
bee+0.011
ulus+0.011
Neg logit contributions
vg-0.013
irst-0.012
ANK-0.010
iller-0.010
(--0.010
Token
Token ID447
Feature activation-1.32e-02

Pos logit contributions
drivers+0.217
AFTA+0.211
 IEEE+0.205
bee+0.202
ratulations+0.200
Neg logit contributions
vg-0.234
irst-0.220
ANK-0.178
(--0.174
iller-0.174
Token
Token ID251
Feature activation-1.28e-02

Pos logit contributions
drivers+0.009
AFTA+0.009
 IEEE+0.009
bee+0.009
ulus+0.008
Neg logit contributions
vg-0.011
irst-0.010
ANK-0.008
iller-0.008
(--0.008
Token anything
Token ID1997
Feature activation+1.85e-03

Pos logit contributions
drivers+0.009
AFTA+0.009
 IEEE+0.009
bee+0.009
ulus+0.009
Neg logit contributions
vg-0.010
irst-0.009
ANK-0.007
iller-0.007
(--0.007
Token.
Token ID13
Feature activation-1.79e-02

Pos logit contributions
vg+0.001
irst+0.001
ANK+0.001
(-+0.001
iller+0.001
Neg logit contributions
drivers-0.001
AFTA-0.001
 IEEE-0.001
bee-0.001
ratulations-0.001
Token It
Token ID632
Feature activation-4.87e-02

Pos logit contributions
drivers+0.013
AFTA+0.013
 IEEE+0.012
bee+0.012
ulus+0.012
Neg logit contributions
vg-0.014
irst-0.013
ANK-0.011
iller-0.010
(--0.010