TOP ACTIVATIONS
MAX = 0.527

A large arsenal of the Ban u Qur ay za which
leading online stores like T ajor i . p k and
execution of the men of Ban u Qur ay za ,
- known stores like T ajor i . p k is
call for his " Travel Ban " on Twitter as an
! K el sey Ban field Food 52
died . The siege of Ban u Q ura iza
of the siege laid to Ban u Q ura iza ,
No woman of Ban u Qur ay za was
. We need the Travel Ban as an extra level of
had joined the ranks of Ban u Q ura iza when
. OS HA ( Occup ational Health and Safety Administration )
often depicted as overly confront ational . Yet Mac ri is
to the unstable situation in Baghdad , both matches are going
to create a rapid vibr ational effect . [
gov / home andre cre ational safety / falls / pub
flipping out and being confront ational towards the audience , like
in an educ ational program and
't be a bad rot ational lineman , but it would
, wildlife sightings , navig ational challenges , even solitude .

MIN ACTIVATIONS
MIN = -0.580

of cassette tapes in the glove box of our old Pont
face gently with his boxing glove s padded red
they would with a baseball glove ! In Wii Sports Club
at his eye with his glove , as if trying to
required guns be locked in glove comp artments of tr unks
two metal clips near the glove box . Make sure the
last year . Hur tt has also tut ored
<|endoftext|> Haw aii Congress woman Tuls i G abb ard is
the most serious charges ever leve lled at a musical genius
Game 74 v Blues Lost 4 - 0
Caps out - shot the Blues 10 - 8 . The
and more against an impressive Blues outfit , however it proved
Inter pol list which Tel stra is examining is believed to
. However , a Tel stra spokesperson today said the tel
morning sunshine for those " cold " AM hours . I
Regina , if you are Indigenous , you re
or poly gon al crater iform pap ules developing along the
Two Mind s blog - Kind le <|endoftext|> A Review of
According to the Hof stra University website , security precautions
do you expect using paint Kind of hard to do a

INTERVAL 0.250 - 0.527
CONTAINS 0.856%

last week s episode , Ann al ise is
utilize easily with excellent Arduino compatibility . I m
of toxic al gal blo oms directly to ocean warming and
J y ot irling a Temple 9 . Sw
World War II . Nearly 400 , 000 others have committed

INTERVAL -0.027 - 0.250
CONTAINS 74.538%

on employees , suppliers and landlords affected by the closure of
apt itudes past a certain age , and when an older
The fourth big Project DI va X update is in
unveiling ( of sorts ) of Destiny via its official web
Dream ers from deportation , there will be

INTERVAL -0.304 - -0.027
CONTAINS 24.374%

everyone does it ? Hillary Clinton isn t proposing
$ 840 to cover expenses working downtown . ( CBC News
snap off anyway Once it 's mostly detached you just need
on the list . From Edward Has b rou ck 's
el it ist bloggers like John Scal zi

INTERVAL -0.580 - -0.304
CONTAINS 0.233%

ot is group identifies additional tax a . Australian Journal of
world enters the age of climate change , the tiny islands
Donald J . Trump (@ realDonaldTrump ) November 3 , 2017
. The Florida senator has also refused to answer
drugs come off their cr utch . There are
TokenA
Token ID32
Feature activation-1.14e-01

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
 Ban-0.001
-0.001
 Dates-0.001
ajor-0.001
Token large
Token ID1588
Feature activation-1.02e-02

Pos logit contributions
culosis+0.085
 glove+0.082
 concess+0.080
 isEnabled+0.078
PLA+0.078
Neg logit contributions
 Canter-0.081
-0.072
 Ban-0.072
 Dates-0.070
ajor-0.069
Token arsenal
Token ID24375
Feature activation-3.27e-02

Pos logit contributions
culosis+0.008
 glove+0.008
 concess+0.007
 isEnabled+0.007
PLA+0.007
Neg logit contributions
 Canter-0.007
-0.006
 Ban-0.006
 Dates-0.006
ajor-0.006
Token of
Token ID286
Feature activation+1.23e-02

Pos logit contributions
culosis+0.024
 glove+0.023
 concess+0.023
 isEnabled+0.022
PLA+0.022
Neg logit contributions
 Canter-0.024
-0.021
 Ban-0.021
ajor-0.020
 Dates-0.020
Token the
Token ID262
Feature activation+7.41e-03

Pos logit contributions
 Canter+0.009
+0.008
 Ban+0.008
ajor+0.008
 Dates+0.008
Neg logit contributions
culosis-0.009
 glove-0.009
 concess-0.008
 isEnabled-0.008
PLA-0.008
Token Ban
Token ID10274
Feature activation+5.27e-01

Pos logit contributions
 Canter+0.005
+0.005
 Ban+0.005
 Dates+0.005
ajor+0.005
Neg logit contributions
culosis-0.005
 glove-0.005
 concess-0.005
 isEnabled-0.005
PLA-0.005
Tokenu
Token ID84
Feature activation-1.21e-01

Pos logit contributions
 Canter+0.462
 Ban+0.422
+0.421
 Dates+0.408
ajor+0.408
Neg logit contributions
culosis-0.312
 glove-0.294
 concess-0.286
 isEnabled-0.280
PLA-0.276
Token Qur
Token ID17599
Feature activation-2.64e-02

Pos logit contributions
culosis+0.086
 glove+0.081
 concess+0.080
 isEnabled+0.078
PLA+0.077
Neg logit contributions
 Canter-0.092
-0.082
 Ban-0.082
 Dates-0.080
ajor-0.080
Tokenay
Token ID323
Feature activation-9.12e-03

Pos logit contributions
culosis+0.019
 glove+0.018
 concess+0.018
 isEnabled+0.017
PLA+0.017
Neg logit contributions
 Canter-0.020
-0.018
 Ban-0.018
 Dates-0.017
ajor-0.017
Tokenza
Token ID4496
Feature activation+6.94e-02

Pos logit contributions
culosis+0.008
 glove+0.007
 concess+0.007
 isEnabled+0.007
PLA+0.007
Neg logit contributions
 Canter-0.006
-0.005
 Ban-0.005
 Dates-0.005
ajor-0.005
Token which
Token ID543
Feature activation+1.03e-02

Pos logit contributions
 Canter+0.050
+0.044
 Ban+0.044
ajor+0.043
 Dates+0.043
Neg logit contributions
culosis-0.052
 glove-0.050
 concess-0.049
 isEnabled-0.048
PLA-0.047
Token leading
Token ID3756
Feature activation+1.12e-02

Pos logit contributions
 Canter+0.001
 Ban+0.001
+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token online
Token ID2691
Feature activation-1.37e-01

Pos logit contributions
 Canter+0.008
 Ban+0.007
+0.007
 Dates+0.007
ajor+0.007
Neg logit contributions
culosis-0.009
 glove-0.008
 concess-0.008
 isEnabled-0.008
PLA-0.008
Token stores
Token ID7000
Feature activation+8.84e-02

Pos logit contributions
culosis+0.107
 glove+0.102
 concess+0.100
 isEnabled+0.098
PLA+0.097
Neg logit contributions
 Canter-0.094
 Ban-0.083
-0.083
 Dates-0.080
ajor-0.080
Token like
Token ID588
Feature activation+6.07e-03

Pos logit contributions
 Canter+0.064
 Ban+0.057
+0.057
 Dates+0.055
ajor+0.055
Neg logit contributions
culosis-0.066
 glove-0.063
 concess-0.061
 isEnabled-0.060
PLA-0.060
Token T
Token ID309
Feature activation-5.03e-02

Pos logit contributions
 Canter+0.004
 Ban+0.004
+0.004
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.005
 glove-0.004
 concess-0.004
 isEnabled-0.004
PLA-0.004
Tokenajor
Token ID1518
Feature activation+5.27e-01

Pos logit contributions
culosis+0.039
 glove+0.037
 concess+0.036
 isEnabled+0.035
PLA+0.035
Neg logit contributions
 Canter-0.035
 Ban-0.031
-0.031
ajor-0.030
 Dates-0.030
Tokeni
Token ID72
Feature activation-1.05e-01

Pos logit contributions
 Canter+0.382
 Ban+0.341
+0.339
ajor+0.328
 Dates+0.328
Neg logit contributions
culosis-0.391
 glove-0.374
 concess-0.367
 isEnabled-0.361
PLA-0.356
Token.
Token ID13
Feature activation+2.01e-04

Pos logit contributions
culosis+0.079
 glove+0.075
 concess+0.074
 isEnabled+0.073
PLA+0.072
Neg logit contributions
 Canter-0.076
 Ban-0.068
-0.067
 Dates-0.065
ajor-0.065
Tokenp
Token ID79
Feature activation-8.76e-02

Pos logit contributions
 Canter+0.000
+0.000
 Ban+0.000
 Dates+0.000
ajor+0.000
Neg logit contributions
culosis-0.000
 glove-0.000
 concess-0.000
 isEnabled-0.000
PLA-0.000
Tokenk
Token ID74
Feature activation-6.40e-03

Pos logit contributions
culosis+0.066
 glove+0.063
 concess+0.062
 isEnabled+0.061
PLA+0.060
Neg logit contributions
 Canter-0.063
-0.055
 Ban-0.055
 Dates-0.054
ajor-0.053
Token and
Token ID290
Feature activation-4.76e-03

Pos logit contributions
culosis+0.005
 glove+0.005
 concess+0.005
 isEnabled+0.004
PLA+0.004
Neg logit contributions
 Canter-0.005
 Ban-0.004
-0.004
 Dates-0.004
ajor-0.004
Token execution
Token ID9706
Feature activation+1.25e-01

Pos logit contributions
 Canter+0.001
+0.001
 Ban+0.001
ajor+0.001
 Dates+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token of
Token ID286
Feature activation+8.17e-03

Pos logit contributions
 Canter+0.090
 Ban+0.080
+0.080
 Dates+0.077
ajor+0.077
Neg logit contributions
culosis-0.093
 glove-0.089
 concess-0.087
 isEnabled-0.086
PLA-0.085
Token the
Token ID262
Feature activation+2.84e-03

Pos logit contributions
 Canter+0.006
+0.005
 Ban+0.005
ajor+0.005
 Dates+0.005
Neg logit contributions
culosis-0.006
 glove-0.006
 concess-0.006
 isEnabled-0.006
PLA-0.006
Token men
Token ID1450
Feature activation+3.78e-02

Pos logit contributions
 Canter+0.002
+0.002
 Ban+0.002
ajor+0.002
 Dates+0.002
Neg logit contributions
culosis-0.002
 glove-0.002
 concess-0.002
 isEnabled-0.002
PLA-0.002
Token of
Token ID286
Feature activation+7.63e-03

Pos logit contributions
 Canter+0.027
+0.024
 Ban+0.024
 Dates+0.023
ajor+0.023
Neg logit contributions
culosis-0.028
 glove-0.027
 concess-0.026
 isEnabled-0.026
PLA-0.026
Token Ban
Token ID10274
Feature activation+5.21e-01

Pos logit contributions
 Canter+0.005
+0.005
 Ban+0.005
 Dates+0.005
ajor+0.005
Neg logit contributions
culosis-0.006
 glove-0.006
 concess-0.005
 isEnabled-0.005
PLA-0.005
Tokenu
Token ID84
Feature activation-1.28e-01

Pos logit contributions
 Canter+0.413
 Ban+0.372
+0.371
 Dates+0.359
ajor+0.358
Neg logit contributions
culosis-0.352
 glove-0.335
 concess-0.327
 isEnabled-0.321
PLA-0.317
Token Qur
Token ID17599
Feature activation-3.11e-02

Pos logit contributions
culosis+0.093
 glove+0.089
 concess+0.087
 isEnabled+0.085
PLA+0.084
Neg logit contributions
 Canter-0.095
-0.085
 Ban-0.084
 Dates-0.082
ajor-0.082
Tokenay
Token ID323
Feature activation-8.25e-03

Pos logit contributions
culosis+0.025
 glove+0.023
 concess+0.023
 isEnabled+0.023
PLA+0.022
Neg logit contributions
 Canter-0.021
-0.019
 Ban-0.019
 Dates-0.018
ajor-0.018
Tokenza
Token ID4496
Feature activation+6.68e-02

Pos logit contributions
culosis+0.006
 glove+0.006
 concess+0.006
 isEnabled+0.006
PLA+0.006
Neg logit contributions
 Canter-0.006
-0.005
 Ban-0.005
 Dates-0.005
ajor-0.005
Token,
Token ID11
Feature activation+8.78e-03

Pos logit contributions
 Canter+0.049
+0.043
 Ban+0.043
 Dates+0.042
ajor+0.042
Neg logit contributions
culosis-0.049
 glove-0.047
 concess-0.046
 isEnabled-0.045
PLA-0.044
Token-
Token ID12
Feature activation+3.72e-03

Pos logit contributions
culosis+0.016
 glove+0.016
 concess+0.015
 isEnabled+0.015
PLA+0.015
Neg logit contributions
 Canter-0.017
-0.015
 Ban-0.015
 Dates-0.015
ajor-0.015
Tokenknown
Token ID4002
Feature activation-7.92e-02

Pos logit contributions
 Canter+0.003
+0.003
 Ban+0.003
 Dates+0.003
ajor+0.003
Neg logit contributions
culosis-0.002
 glove-0.002
 concess-0.002
 isEnabled-0.002
PLA-0.002
Token stores
Token ID7000
Feature activation+8.62e-02

Pos logit contributions
culosis+0.058
 glove+0.055
 concess+0.054
 isEnabled+0.053
PLA+0.053
Neg logit contributions
 Canter-0.058
 Ban-0.052
-0.052
 Dates-0.050
ajor-0.050
Token like
Token ID588
Feature activation+9.07e-03

Pos logit contributions
 Canter+0.065
 Ban+0.058
+0.058
 Dates+0.056
ajor+0.056
Neg logit contributions
culosis-0.062
 glove-0.059
 concess-0.058
 isEnabled-0.057
PLA-0.056
Token T
Token ID309
Feature activation-4.86e-02

Pos logit contributions
 Canter+0.007
 Ban+0.006
+0.006
 Dates+0.006
ajor+0.006
Neg logit contributions
culosis-0.007
 glove-0.006
 concess-0.006
 isEnabled-0.006
PLA-0.006
Tokenajor
Token ID1518
Feature activation+5.20e-01

Pos logit contributions
culosis+0.063
 glove+0.062
 concess+0.061
 isEnabled+0.060
PLA+0.060
Neg logit contributions
 Canter-0.008
 Ban-0.004
-0.004
 Dates-0.003
ajor-0.003
Tokeni
Token ID72
Feature activation-1.09e-01

Pos logit contributions
 Canter+0.437
 Ban+0.397
+0.396
ajor+0.385
 Dates+0.384
Neg logit contributions
culosis-0.327
 glove-0.309
 concess-0.303
 isEnabled-0.294
PLA-0.291
Token.
Token ID13
Feature activation+4.47e-03

Pos logit contributions
culosis+0.081
 glove+0.077
 concess+0.075
 isEnabled+0.074
PLA+0.073
Neg logit contributions
 Canter-0.079
-0.070
 Ban-0.070
 Dates-0.068
ajor-0.068
Tokenp
Token ID79
Feature activation-8.69e-02

Pos logit contributions
 Canter+0.004
+0.003
 Ban+0.003
 Dates+0.003
ajor+0.003
Neg logit contributions
culosis-0.003
 glove-0.003
 concess-0.003
 isEnabled-0.003
PLA-0.003
Tokenk
Token ID74
Feature activation-7.32e-03

Pos logit contributions
culosis+0.064
 glove+0.061
 concess+0.060
 isEnabled+0.059
PLA+0.058
Neg logit contributions
 Canter-0.063
-0.056
 Ban-0.056
 Dates-0.054
ajor-0.054
Token is
Token ID318
Feature activation-1.72e-02

Pos logit contributions
culosis+0.005
 glove+0.005
 concess+0.005
 isEnabled+0.005
PLA+0.005
Neg logit contributions
 Canter-0.005
 Ban-0.005
-0.005
 Dates-0.005
ajor-0.005
Token call
Token ID869
Feature activation+1.30e-02

Pos logit contributions
 Canter+0.020
+0.018
 Ban+0.018
 Dates+0.017
ajor+0.017
Neg logit contributions
culosis-0.020
 glove-0.019
 concess-0.019
 isEnabled-0.018
PLA-0.018
Token for
Token ID329
Feature activation+4.16e-03

Pos logit contributions
 Canter+0.010
+0.008
 Ban+0.008
 Dates+0.008
ajor+0.008
Neg logit contributions
culosis-0.010
 glove-0.009
 concess-0.009
 isEnabled-0.009
PLA-0.009
Token his
Token ID465
Feature activation+2.40e-02

Pos logit contributions
 Canter+0.003
+0.003
 Ban+0.003
 Dates+0.003
ajor+0.003
Neg logit contributions
culosis-0.003
 glove-0.003
 concess-0.003
 isEnabled-0.003
PLA-0.003
Token "
Token ID366
Feature activation+3.94e-02

Pos logit contributions
 Canter+0.017
+0.015
 Ban+0.015
 Dates+0.014
ajor+0.014
Neg logit contributions
culosis-0.018
 glove-0.018
 concess-0.017
 isEnabled-0.017
PLA-0.017
TokenTravel
Token ID33074
Feature activation+1.11e-01

Pos logit contributions
 Canter+0.029
+0.026
 Ban+0.026
 Dates+0.025
ajor+0.025
Neg logit contributions
culosis-0.028
 glove-0.027
 concess-0.027
 isEnabled-0.026
PLA-0.026
Token Ban
Token ID10274
Feature activation+5.18e-01

Pos logit contributions
 Canter+0.014
+0.005
 Ban+0.003
ajor+0.002
 Dates+0.002
Neg logit contributions
culosis-0.149
 glove-0.145
 concess-0.143
 isEnabled-0.142
PLA-0.141
Token"
Token ID1
Feature activation-9.53e-04

Pos logit contributions
 Canter+0.381
+0.339
 Ban+0.339
 Dates+0.328
ajor+0.327
Neg logit contributions
culosis-0.379
 glove-0.361
 concess-0.354
 isEnabled-0.347
PLA-0.343
Token on
Token ID319
Feature activation+1.49e-02

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
-0.001
 Ban-0.001
ajor-0.001
 Dates-0.001
Token Twitter
Token ID3009
Feature activation+1.15e-01

Pos logit contributions
 Canter+0.011
+0.010
 Ban+0.010
 Dates+0.010
ajor+0.009
Neg logit contributions
culosis-0.011
 glove-0.010
 concess-0.010
 isEnabled-0.010
PLA-0.010
Token as
Token ID355
Feature activation+2.50e-02

Pos logit contributions
 Canter+0.083
+0.074
 Ban+0.074
 Dates+0.071
ajor+0.071
Neg logit contributions
culosis-0.085
 glove-0.081
 concess-0.079
 isEnabled-0.078
PLA-0.077
Token an
Token ID281
Feature activation+5.00e-02

Pos logit contributions
 Canter+0.018
+0.016
 Ban+0.016
 Dates+0.016
ajor+0.016
Neg logit contributions
culosis-0.018
 glove-0.017
 concess-0.017
 isEnabled-0.017
PLA-0.017
Token!
Token ID0
Feature activation+6.45e-02

Pos logit contributions
 Canter+0.115
+0.102
 Ban+0.102
 Dates+0.098
ajor+0.098
Neg logit contributions
culosis-0.122
 glove-0.117
 concess-0.114
 isEnabled-0.112
PLA-0.111
Token
Token ID851
Feature activation-9.97e-02

Pos logit contributions
 Canter+0.045
+0.040
 Ban+0.039
ajor+0.039
 Dates+0.038
Neg logit contributions
culosis-0.049
 glove-0.047
 concess-0.046
 isEnabled-0.045
PLA-0.045
TokenK
Token ID42
Feature activation-1.19e-02

Pos logit contributions
culosis+0.076
 glove+0.072
 concess+0.071
 isEnabled+0.069
PLA+0.069
Neg logit contributions
 Canter-0.071
-0.063
 Ban-0.062
ajor-0.060
 Dates-0.060
Tokenel
Token ID417
Feature activation+9.01e-02

Pos logit contributions
culosis+0.009
 glove+0.009
 concess+0.008
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.008
-0.008
 Ban-0.008
 Dates-0.007
ajor-0.007
Tokensey
Token ID4397
Feature activation+2.68e-01

Pos logit contributions
 Canter+0.046
+0.039
 Ban+0.039
 Dates+0.037
ajor+0.037
Neg logit contributions
culosis-0.086
 glove-0.083
 concess-0.081
 isEnabled-0.080
PLA-0.080
Token Ban
Token ID10274
Feature activation+5.17e-01

Pos logit contributions
 Canter+0.188
+0.166
 Ban+0.166
 Dates+0.160
ajor+0.160
Neg logit contributions
culosis-0.205
 glove-0.196
 concess-0.192
 isEnabled-0.189
PLA-0.187
Tokenfield
Token ID3245
Feature activation+1.37e-01

Pos logit contributions
 Canter+0.348
+0.306
 Ban+0.305
 Dates+0.295
ajor+0.293
Neg logit contributions
culosis-0.413
 glove-0.394
 concess-0.387
 isEnabled-0.380
PLA-0.377
Token
Token ID198
Feature activation-3.27e-04

Pos logit contributions
 Canter+0.094
+0.083
 Ban+0.082
ajor+0.080
 Dates+0.080
Neg logit contributions
culosis-0.107
 glove-0.102
 concess-0.100
 isEnabled-0.098
PLA-0.097
Token
Token ID198
Feature activation-6.22e-03

Pos logit contributions
culosis+0.000
 glove+0.000
 concess+0.000
 isEnabled+0.000
PLA+0.000
Neg logit contributions
 Canter-0.000
-0.000
 Ban-0.000
 Dates-0.000
ajor-0.000
TokenFood
Token ID24602
Feature activation+1.23e-01

Pos logit contributions
culosis+0.005
 glove+0.005
 concess+0.005
 isEnabled+0.005
PLA+0.005
Neg logit contributions
 Canter-0.004
 Ban-0.004
-0.004
 Dates-0.004
ajor-0.004
Token52
Token ID4309
Feature activation-5.65e-02

Pos logit contributions
 Canter+0.078
+0.068
 Ban+0.068
ajor+0.065
 Dates+0.065
Neg logit contributions
culosis-0.103
 glove-0.099
 concess-0.097
 isEnabled-0.095
PLA-0.094
Token died
Token ID3724
Feature activation+3.97e-02

Pos logit contributions
 Canter+0.003
+0.002
 Ban+0.002
 Dates+0.002
ajor+0.002
Neg logit contributions
culosis-0.003
 glove-0.003
 concess-0.003
 isEnabled-0.003
PLA-0.002
Token.
Token ID13
Feature activation+7.00e-03

Pos logit contributions
 Canter+0.030
 Ban+0.027
+0.027
 Dates+0.026
ajor+0.026
Neg logit contributions
culosis-0.028
 glove-0.026
 concess-0.026
 isEnabled-0.025
PLA-0.025
Token The
Token ID383
Feature activation+1.14e-02

Pos logit contributions
 Canter+0.005
+0.005
 Ban+0.004
ajor+0.004
 Dates+0.004
Neg logit contributions
culosis-0.005
 glove-0.005
 concess-0.005
 isEnabled-0.005
PLA-0.005
Token siege
Token ID20690
Feature activation+1.36e-01

Pos logit contributions
 Canter+0.008
+0.007
 Ban+0.007
ajor+0.007
 Dates+0.007
Neg logit contributions
culosis-0.008
 glove-0.008
 concess-0.008
 isEnabled-0.008
PLA-0.008
Token of
Token ID286
Feature activation+7.41e-03

Pos logit contributions
 Canter+0.090
+0.079
 Ban+0.078
ajor+0.076
 Dates+0.076
Neg logit contributions
culosis-0.110
 glove-0.106
 concess-0.104
 isEnabled-0.102
PLA-0.101
Token Ban
Token ID10274
Feature activation+5.16e-01

Pos logit contributions
 Canter+0.004
+0.003
 Ban+0.003
ajor+0.003
 Dates+0.003
Neg logit contributions
culosis-0.007
 glove-0.007
 concess-0.007
 isEnabled-0.006
PLA-0.006
Tokenu
Token ID84
Feature activation-1.25e-01

Pos logit contributions
 Canter+0.462
+0.420
 Ban+0.419
 Dates+0.409
ajor+0.408
Neg logit contributions
culosis-0.297
 glove-0.279
 concess-0.271
 isEnabled-0.265
PLA-0.261
Token Q
Token ID1195
Feature activation+3.12e-02

Pos logit contributions
culosis+0.098
 glove+0.094
 concess+0.092
 isEnabled+0.090
PLA+0.089
Neg logit contributions
 Canter-0.085
-0.075
 Ban-0.074
 Dates-0.072
ajor-0.072
Tokenura
Token ID5330
Feature activation-3.33e-01

Pos logit contributions
 Canter+0.037
+0.034
 Ban+0.034
 Dates+0.034
ajor+0.033
Neg logit contributions
culosis-0.009
 glove-0.008
 concess-0.007
 isEnabled-0.007
PLA-0.007
Tokeniza
Token ID23638
Feature activation+5.31e-02

Pos logit contributions
culosis+0.263
 glove+0.249
 concess+0.243
 isEnabled+0.241
PLA+0.239
Neg logit contributions
 Canter-0.228
-0.198
 Ban-0.197
 Dates-0.192
ajor-0.191
Token
Token ID447
Feature activation+9.69e-03

Pos logit contributions
 Canter+0.036
+0.032
 Ban+0.031
ajor+0.031
 Dates+0.030
Neg logit contributions
culosis-0.042
 glove-0.040
 concess-0.039
 isEnabled-0.039
PLA-0.038
Token of
Token ID286
Feature activation+2.46e-03

Pos logit contributions
 Canter+0.041
+0.037
 Ban+0.036
ajor+0.036
 Dates+0.036
Neg logit contributions
culosis-0.041
 glove-0.040
 concess-0.039
 isEnabled-0.038
PLA-0.037
Token the
Token ID262
Feature activation-7.32e-03

Pos logit contributions
 Canter+0.002
+0.002
 Ban+0.002
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.002
 glove-0.002
 concess-0.002
 isEnabled-0.002
PLA-0.002
Token siege
Token ID20690
Feature activation+1.29e-01

Pos logit contributions
culosis+0.006
 glove+0.005
 concess+0.005
 isEnabled+0.005
PLA+0.005
Neg logit contributions
 Canter-0.005
-0.005
 Ban-0.005
 Dates-0.004
ajor-0.004
Token laid
Token ID8104
Feature activation-1.59e-01

Pos logit contributions
 Canter+0.094
+0.084
 Ban+0.083
ajor+0.081
 Dates+0.081
Neg logit contributions
culosis-0.095
 glove-0.090
 concess-0.088
 isEnabled-0.086
PLA-0.086
Token to
Token ID284
Feature activation-1.83e-03

Pos logit contributions
culosis+0.107
 glove+0.102
 concess+0.099
 isEnabled+0.097
PLA+0.096
Neg logit contributions
 Canter-0.127
-0.114
 Ban-0.114
 Dates-0.110
ajor-0.110
Token Ban
Token ID10274
Feature activation+5.10e-01

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.002
-0.001
 Ban-0.001
 Dates-0.001
ajor-0.001
Tokenu
Token ID84
Feature activation-1.30e-01

Pos logit contributions
 Canter+0.449
 Ban+0.410
+0.409
 Dates+0.397
ajor+0.397
Neg logit contributions
culosis-0.299
 glove-0.282
 concess-0.275
 isEnabled-0.268
PLA-0.264
Token Q
Token ID1195
Feature activation+2.44e-02

Pos logit contributions
culosis+0.097
 glove+0.092
 concess+0.091
 isEnabled+0.089
PLA+0.088
Neg logit contributions
 Canter-0.094
-0.084
 Ban-0.083
ajor-0.081
 Dates-0.081
Tokenura
Token ID5330
Feature activation-3.35e-01

Pos logit contributions
 Canter+0.028
+0.026
 Ban+0.026
 Dates+0.026
ajor+0.026
Neg logit contributions
culosis-0.008
 glove-0.007
 concess-0.006
 isEnabled-0.006
PLA-0.006
Tokeniza
Token ID23638
Feature activation+4.36e-02

Pos logit contributions
culosis+0.267
 glove+0.254
 concess+0.249
 isEnabled+0.245
PLA+0.243
Neg logit contributions
 Canter-0.226
-0.197
 Ban-0.196
 Dates-0.191
ajor-0.190
Token,
Token ID11
Feature activation+6.71e-03

Pos logit contributions
 Canter+0.032
+0.028
 Ban+0.028
ajor+0.027
 Dates+0.027
Neg logit contributions
culosis-0.032
 glove-0.031
 concess-0.030
 isEnabled-0.029
PLA-0.029
Token
Token ID198
Feature activation-2.31e-03

Pos logit contributions
culosis+0.013
 glove+0.012
 concess+0.012
 isEnabled+0.012
PLA+0.012
Neg logit contributions
 Canter-0.013
-0.011
 Ban-0.011
 Dates-0.011
ajor-0.011
Token
Token ID198
Feature activation-4.47e-03

Pos logit contributions
culosis+0.002
 glove+0.002
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.002
 Ban-0.001
-0.001
 Dates-0.001
ajor-0.001
TokenNo
Token ID2949
Feature activation-1.22e-02

Pos logit contributions
culosis+0.003
 glove+0.003
 concess+0.003
 isEnabled+0.003
PLA+0.003
Neg logit contributions
 Canter-0.003
 Ban-0.003
-0.003
 Dates-0.003
ajor-0.003
Token woman
Token ID2415
Feature activation-3.33e-02

Pos logit contributions
culosis+0.009
 glove+0.009
 concess+0.008
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.009
-0.008
 Ban-0.008
 Dates-0.008
ajor-0.008
Token of
Token ID286
Feature activation+5.04e-03

Pos logit contributions
culosis+0.025
 glove+0.024
 concess+0.024
 isEnabled+0.023
PLA+0.023
Neg logit contributions
 Canter-0.024
 Ban-0.021
-0.021
 Dates-0.020
ajor-0.020
Token Ban
Token ID10274
Feature activation+5.09e-01

Pos logit contributions
 Canter+0.004
+0.003
 Ban+0.003
 Dates+0.003
ajor+0.003
Neg logit contributions
culosis-0.004
 glove-0.004
 concess-0.003
 isEnabled-0.003
PLA-0.003
Tokenu
Token ID84
Feature activation-1.29e-01

Pos logit contributions
 Canter+0.447
 Ban+0.408
+0.407
ajor+0.395
 Dates+0.395
Neg logit contributions
culosis-0.300
 glove-0.283
 concess-0.276
 isEnabled-0.269
PLA-0.265
Token Qur
Token ID17599
Feature activation-2.90e-02

Pos logit contributions
culosis+0.101
 glove+0.097
 concess+0.095
 isEnabled+0.093
PLA+0.092
Neg logit contributions
 Canter-0.088
-0.077
 Ban-0.077
 Dates-0.075
ajor-0.074
Tokenay
Token ID323
Feature activation-1.16e-02

Pos logit contributions
culosis+0.021
 glove+0.020
 concess+0.019
 isEnabled+0.019
PLA+0.019
Neg logit contributions
 Canter-0.022
-0.020
 Ban-0.019
 Dates-0.019
ajor-0.019
Tokenza
Token ID4496
Feature activation+6.12e-02

Pos logit contributions
culosis+0.010
 glove+0.009
 concess+0.009
 isEnabled+0.009
PLA+0.009
Neg logit contributions
 Canter-0.007
-0.007
 Ban-0.007
 Dates-0.006
ajor-0.006
Token was
Token ID373
Feature activation+4.45e-02

Pos logit contributions
 Canter+0.043
 Ban+0.038
+0.038
 Dates+0.037
ajor+0.037
Neg logit contributions
culosis-0.047
 glove-0.045
 concess-0.044
 isEnabled-0.043
PLA-0.043
Token.
Token ID13
Feature activation+5.10e-03

Pos logit contributions
 Canter+0.048
+0.043
 Ban+0.043
 Dates+0.041
ajor+0.041
Neg logit contributions
culosis-0.050
 glove-0.048
 concess-0.047
 isEnabled-0.046
PLA-0.046
Token We
Token ID775
Feature activation+5.15e-02

Pos logit contributions
 Canter+0.004
 Ban+0.003
+0.003
 Dates+0.003
ajor+0.003
Neg logit contributions
culosis-0.004
 glove-0.004
 concess-0.004
 isEnabled-0.003
PLA-0.003
Token need
Token ID761
Feature activation+1.69e-02

Pos logit contributions
 Canter+0.038
 Ban+0.034
+0.034
 Dates+0.033
ajor+0.033
Neg logit contributions
culosis-0.037
 glove-0.036
 concess-0.035
 isEnabled-0.034
PLA-0.034
Token the
Token ID262
Feature activation-6.21e-03

Pos logit contributions
 Canter+0.013
+0.011
 Ban+0.011
 Dates+0.011
ajor+0.011
Neg logit contributions
culosis-0.012
 glove-0.012
 concess-0.011
 isEnabled-0.011
PLA-0.011
Token Travel
Token ID13524
Feature activation-1.64e-02

Pos logit contributions
culosis+0.005
 glove+0.004
 concess+0.004
 isEnabled+0.004
PLA+0.004
Neg logit contributions
 Canter-0.005
-0.004
 Ban-0.004
ajor-0.004
 Dates-0.004
Token Ban
Token ID10274
Feature activation+5.05e-01

Pos logit contributions
culosis+0.015
 glove+0.014
 concess+0.014
 isEnabled+0.014
PLA+0.014
Neg logit contributions
 Canter-0.009
-0.008
 Ban-0.008
ajor-0.008
 Dates-0.008
Token as
Token ID355
Feature activation+1.39e-02

Pos logit contributions
 Canter+0.359
+0.317
 Ban+0.315
ajor+0.306
 Dates+0.306
Neg logit contributions
culosis-0.383
 glove-0.368
 concess-0.359
 isEnabled-0.354
PLA-0.350
Token an
Token ID281
Feature activation+3.67e-02

Pos logit contributions
 Canter+0.010
+0.009
 Ban+0.009
ajor+0.009
 Dates+0.009
Neg logit contributions
culosis-0.010
 glove-0.010
 isEnabled-0.010
 concess-0.010
PLA-0.009
Token extra
Token ID3131
Feature activation+4.13e-02

Pos logit contributions
 Canter+0.025
+0.022
 Ban+0.022
 Dates+0.021
ajor+0.021
Neg logit contributions
culosis-0.029
 glove-0.028
 concess-0.027
 isEnabled-0.027
PLA-0.027
Token level
Token ID1241
Feature activation-5.01e-02

Pos logit contributions
 Canter+0.032
 Ban+0.029
+0.029
ajor+0.028
 Dates+0.028
Neg logit contributions
culosis-0.029
 glove-0.027
 concess-0.026
 isEnabled-0.026
PLA-0.026
Token of
Token ID286
Feature activation-1.67e-04

Pos logit contributions
culosis+0.037
 glove+0.035
 concess+0.035
 isEnabled+0.034
PLA+0.034
Neg logit contributions
 Canter-0.037
-0.032
 Ban-0.032
 Dates-0.031
ajor-0.031
Token had
Token ID550
Feature activation+6.53e-02

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
-0.001
 Ban-0.001
 Dates-0.001
ajor-0.001
Token joined
Token ID5399
Feature activation-1.15e-01

Pos logit contributions
 Canter+0.048
+0.042
 Ban+0.042
 Dates+0.041
ajor+0.041
Neg logit contributions
culosis-0.048
 glove-0.046
 concess-0.045
 isEnabled-0.044
PLA-0.043
Token the
Token ID262
Feature activation-5.08e-04

Pos logit contributions
culosis+0.084
 glove+0.080
 concess+0.078
 isEnabled+0.077
PLA+0.076
Neg logit contributions
 Canter-0.085
-0.075
 Ban-0.075
 Dates-0.073
ajor-0.073
Token ranks
Token ID9803
Feature activation-7.37e-02

Pos logit contributions
culosis+0.000
 glove+0.000
 concess+0.000
 isEnabled+0.000
PLA+0.000
Neg logit contributions
 Canter-0.000
-0.000
 Ban-0.000
 Dates-0.000
ajor-0.000
Token of
Token ID286
Feature activation+6.32e-03

Pos logit contributions
culosis+0.055
 glove+0.053
 concess+0.052
 isEnabled+0.051
PLA+0.050
Neg logit contributions
 Canter-0.053
 Ban-0.047
-0.047
 Dates-0.045
ajor-0.045
Token Ban
Token ID10274
Feature activation+5.05e-01

Pos logit contributions
 Canter+0.005
+0.004
 Ban+0.004
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.005
 glove-0.004
 concess-0.004
 isEnabled-0.004
PLA-0.004
Tokenu
Token ID84
Feature activation-1.32e-01

Pos logit contributions
 Canter+0.444
 Ban+0.405
+0.405
ajor+0.393
 Dates+0.393
Neg logit contributions
culosis-0.297
 glove-0.280
 concess-0.273
 isEnabled-0.266
PLA-0.262
Token Q
Token ID1195
Feature activation+2.29e-02

Pos logit contributions
culosis+0.093
 glove+0.088
 concess+0.086
 isEnabled+0.084
PLA+0.084
Neg logit contributions
 Canter-0.101
-0.090
 Ban-0.089
 Dates-0.087
ajor-0.087
Tokenura
Token ID5330
Feature activation-3.38e-01

Pos logit contributions
 Canter+0.017
+0.016
 Ban+0.016
 Dates+0.015
ajor+0.015
Neg logit contributions
culosis-0.016
 glove-0.015
 concess-0.015
 isEnabled-0.015
PLA-0.015
Tokeniza
Token ID23638
Feature activation+4.10e-02

Pos logit contributions
culosis+0.291
 glove+0.279
 concess+0.275
 isEnabled+0.269
PLA+0.268
Neg logit contributions
 Canter-0.205
-0.175
 Ban-0.174
 Dates-0.170
ajor-0.168
Token when
Token ID618
Feature activation-7.19e-02

Pos logit contributions
 Canter+0.031
+0.027
 Ban+0.027
 Dates+0.026
ajor+0.026
Neg logit contributions
culosis-0.030
 glove-0.028
 concess-0.028
 isEnabled-0.027
PLA-0.027
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
 Canter+0.015
+0.013
 Ban+0.013
ajor+0.013
 Dates+0.013
Neg logit contributions
culosis-0.015
 glove-0.014
 concess-0.014
 isEnabled-0.014
PLA-0.014
Token OS
Token ID7294
Feature activation+1.24e-01

Pos logit contributions
 Canter+0.009
+0.008
 Ban+0.008
ajor+0.008
 Dates+0.008
Neg logit contributions
culosis-0.010
 glove-0.009
 concess-0.009
 isEnabled-0.009
PLA-0.009
TokenHA
Token ID7801
Feature activation-2.53e-02

Pos logit contributions
 Canter+0.090
+0.080
 Ban+0.080
 Dates+0.077
ajor+0.077
Neg logit contributions
culosis-0.091
 glove-0.087
 concess-0.085
 isEnabled-0.084
PLA-0.083
Token (
Token ID357
Feature activation+2.45e-02

Pos logit contributions
culosis+0.018
 glove+0.018
 concess+0.017
 isEnabled+0.017
PLA+0.017
Neg logit contributions
 Canter-0.019
-0.017
 Ban-0.017
ajor-0.016
 Dates-0.016
TokenOccup
Token ID47658
Feature activation-4.22e-02

Pos logit contributions
 Canter+0.018
 Ban+0.016
+0.016
 Dates+0.016
ajor+0.016
Neg logit contributions
culosis-0.018
 glove-0.017
 concess-0.017
 isEnabled-0.016
PLA-0.016
Tokenational
Token ID864
Feature activation+4.80e-01

Pos logit contributions
culosis+0.054
 glove+0.053
 concess+0.052
 isEnabled+0.051
PLA+0.051
Neg logit contributions
 Canter-0.008
-0.005
 Ban-0.004
ajor-0.004
 Dates-0.004
Token Health
Token ID3893
Feature activation+1.23e-01

Pos logit contributions
 Canter+0.272
 Ban+0.234
+0.230
 Dates+0.224
ajor+0.217
Neg logit contributions
culosis-0.433
 glove-0.417
 concess-0.412
 isEnabled-0.408
PLA-0.403
Token and
Token ID290
Feature activation-1.70e-03

Pos logit contributions
 Canter+0.094
+0.084
 Ban+0.084
ajor+0.082
 Dates+0.081
Neg logit contributions
culosis-0.086
 glove-0.082
 concess-0.080
 isEnabled-0.079
PLA-0.077
Token Safety
Token ID11233
Feature activation+1.28e-01

Pos logit contributions
culosis+0.002
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
 Ban-0.001
-0.001
 Dates-0.001
ajor-0.001
Token Administration
Token ID8694
Feature activation+1.18e-01

Pos logit contributions
 Canter+0.080
+0.070
ajor+0.069
 Ban+0.068
zek+0.067
Neg logit contributions
culosis-0.107
 glove-0.103
 concess-0.100
 isEnabled-0.099
PLA-0.098
Token)
Token ID8
Feature activation+4.05e-02

Pos logit contributions
 Canter+0.079
+0.070
 Ban+0.070
 Dates+0.067
ajor+0.067
Neg logit contributions
culosis-0.094
 glove-0.090
 concess-0.088
 isEnabled-0.086
PLA-0.085
Token often
Token ID1690
Feature activation-1.25e-02

Pos logit contributions
 Canter+0.001
 Ban+0.001
+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token depicted
Token ID18904
Feature activation-2.62e-01

Pos logit contributions
culosis+0.009
 glove+0.009
 concess+0.008
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.009
 Ban-0.008
-0.008
 Dates-0.008
ajor-0.008
Token as
Token ID355
Feature activation+1.80e-02

Pos logit contributions
culosis+0.201
 glove+0.192
 concess+0.188
 isEnabled+0.185
PLA+0.183
Neg logit contributions
 Canter-0.184
-0.164
 Ban-0.163
 Dates-0.157
ajor-0.157
Token overly
Token ID17698
Feature activation+1.51e-01

Pos logit contributions
 Canter+0.013
+0.012
 Ban+0.012
 Dates+0.011
ajor+0.011
Neg logit contributions
culosis-0.013
 glove-0.013
 concess-0.013
 isEnabled-0.012
PLA-0.012
Token confront
Token ID7239
Feature activation+3.55e-03

Pos logit contributions
 Canter+0.109
+0.096
 Ban+0.096
 Dates+0.093
ajor+0.093
Neg logit contributions
culosis-0.113
 glove-0.109
 concess-0.107
 isEnabled-0.105
PLA-0.103
Tokenational
Token ID864
Feature activation+4.79e-01

Pos logit contributions
 Canter+0.001
 Ban+0.000
+0.000
 Dates+0.000
ajor+0.000
Neg logit contributions
culosis-0.005
 glove-0.004
 concess-0.004
 isEnabled-0.004
PLA-0.004
Token.
Token ID13
Feature activation-9.70e-04

Pos logit contributions
 Canter+0.352
+0.313
 Ban+0.313
 Dates+0.303
ajor+0.302
Neg logit contributions
culosis-0.352
 glove-0.336
 concess-0.329
 isEnabled-0.323
PLA-0.319
Token Yet
Token ID6430
Feature activation-2.03e-01

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
-0.001
 Ban-0.001
ajor-0.001
 Dates-0.001
Token Mac
Token ID4100
Feature activation+1.58e-01

Pos logit contributions
culosis+0.150
 glove+0.143
 concess+0.140
 isEnabled+0.137
PLA+0.136
Neg logit contributions
 Canter-0.147
-0.131
 Ban-0.130
 Dates-0.127
ajor-0.127
Tokenri
Token ID380
Feature activation-1.58e-01

Pos logit contributions
 Canter+0.149
 Ban+0.136
+0.136
 Dates+0.133
ajor+0.132
Neg logit contributions
culosis-0.083
 glove-0.077
 concess-0.075
 isEnabled-0.073
PLA-0.072
Token is
Token ID318
Feature activation-2.02e-02

Pos logit contributions
culosis+0.115
 glove+0.109
 concess+0.107
 isEnabled+0.105
PLA+0.104
Neg logit contributions
 Canter-0.118
 Ban-0.105
-0.105
 Dates-0.102
ajor-0.101
Token to
Token ID284
Feature activation+6.75e-03

Pos logit contributions
culosis+0.063
 glove+0.060
 concess+0.059
 isEnabled+0.058
PLA+0.057
Neg logit contributions
 Canter-0.064
 Ban-0.057
-0.057
ajor-0.055
 Dates-0.055
Token the
Token ID262
Feature activation+8.53e-03

Pos logit contributions
 Canter+0.005
+0.004
 Ban+0.004
ajor+0.004
 Dates+0.004
Neg logit contributions
culosis-0.005
 glove-0.005
 concess-0.005
 isEnabled-0.005
PLA-0.005
Token unstable
Token ID21354
Feature activation+2.03e-01

Pos logit contributions
 Canter+0.006
+0.006
 Ban+0.006
 Dates+0.005
ajor+0.005
Neg logit contributions
culosis-0.006
 glove-0.006
 concess-0.006
 isEnabled-0.006
PLA-0.006
Token situation
Token ID3074
Feature activation+5.62e-02

Pos logit contributions
 Canter+0.145
+0.129
 Ban+0.128
 Dates+0.124
ajor+0.124
Neg logit contributions
culosis-0.154
 glove-0.147
 concess-0.144
 isEnabled-0.141
PLA-0.140
Token in
Token ID287
Feature activation-5.20e-03

Pos logit contributions
 Canter+0.042
+0.037
 Ban+0.037
 Dates+0.036
ajor+0.036
Neg logit contributions
culosis-0.040
 glove-0.039
 concess-0.038
 isEnabled-0.037
PLA-0.037
Token Baghdad
Token ID19761
Feature activation+4.78e-01

Pos logit contributions
culosis+0.004
 glove+0.004
 concess+0.004
 isEnabled+0.004
PLA+0.004
Neg logit contributions
 Canter-0.004
-0.003
 Ban-0.003
ajor-0.003
 Dates-0.003
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
 Canter+0.347
+0.308
 Ban+0.308
 Dates+0.298
ajor+0.297
Neg logit contributions
culosis-0.356
 glove-0.339
 concess-0.332
 isEnabled-0.326
PLA-0.322
Token both
Token ID1111
Feature activation-4.18e-02

Pos logit contributions
 Canter+0.008
+0.007
 Ban+0.007
ajor+0.007
 Dates+0.007
Neg logit contributions
culosis-0.009
 glove-0.008
 concess-0.008
 isEnabled-0.008
PLA-0.008
Token matches
Token ID7466
Feature activation+6.05e-02

Pos logit contributions
culosis+0.031
 glove+0.030
 concess+0.029
 isEnabled+0.029
PLA+0.028
Neg logit contributions
 Canter-0.030
-0.027
 Ban-0.026
ajor-0.026
 Dates-0.026
Token are
Token ID389
Feature activation-1.50e-02

Pos logit contributions
 Canter+0.046
+0.041
 Ban+0.041
 Dates+0.039
ajor+0.039
Neg logit contributions
culosis-0.043
 glove-0.041
 concess-0.040
 isEnabled-0.040
PLA-0.039
Token going
Token ID1016
Feature activation+1.03e-01

Pos logit contributions
culosis+0.011
 glove+0.011
 concess+0.010
 isEnabled+0.010
PLA+0.010
Neg logit contributions
 Canter-0.011
-0.010
 Ban-0.010
 Dates-0.010
ajor-0.009
Token to
Token ID284
Feature activation-7.48e-03

Pos logit contributions
 Canter+0.029
+0.026
 Ban+0.026
ajor+0.025
 Dates+0.025
Neg logit contributions
culosis-0.030
 glove-0.029
 concess-0.028
 isEnabled-0.027
PLA-0.027
Token create
Token ID2251
Feature activation-1.75e-02

Pos logit contributions
culosis+0.006
 glove+0.005
 concess+0.005
 isEnabled+0.005
PLA+0.005
Neg logit contributions
 Canter-0.005
-0.005
 Ban-0.005
 Dates-0.005
ajor-0.005
Token a
Token ID257
Feature activation-1.66e-02

Pos logit contributions
culosis+0.013
 glove+0.012
 concess+0.012
 isEnabled+0.012
PLA+0.011
Neg logit contributions
 Canter-0.013
-0.012
 Ban-0.012
ajor-0.011
 Dates-0.011
Token rapid
Token ID5801
Feature activation-3.52e-02

Pos logit contributions
culosis+0.013
 glove+0.012
 concess+0.012
 isEnabled+0.012
PLA+0.012
Neg logit contributions
 Canter-0.011
-0.010
 Ban-0.010
 Dates-0.010
ajor-0.010
Token vibr
Token ID12611
Feature activation+1.05e-01

Pos logit contributions
culosis+0.027
 glove+0.026
 concess+0.025
 isEnabled+0.025
PLA+0.024
Neg logit contributions
 Canter-0.025
-0.022
 Ban-0.022
 Dates-0.021
ajor-0.021
Tokenational
Token ID864
Feature activation+4.77e-01

Pos logit contributions
 Canter+0.055
+0.046
 Ban+0.046
 Dates+0.045
ajor+0.044
Neg logit contributions
culosis-0.099
 glove-0.096
 concess-0.094
 isEnabled-0.092
PLA-0.091
Token effect
Token ID1245
Feature activation-2.12e-02

Pos logit contributions
 Canter+0.328
+0.291
 Ban+0.289
 Dates+0.282
ajor+0.281
Neg logit contributions
culosis-0.371
 glove-0.359
 concess-0.350
 isEnabled-0.342
PLA-0.338
Token.
Token ID13
Feature activation-2.86e-03

Pos logit contributions
culosis+0.016
 glove+0.015
 concess+0.015
 isEnabled+0.014
PLA+0.014
Neg logit contributions
 Canter-0.015
 Ban-0.014
-0.014
 Dates-0.013
ajor-0.013
Token
Token ID198
Feature activation-4.91e-03

Pos logit contributions
culosis+0.002
 glove+0.002
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.002
-0.002
 Ban-0.002
ajor-0.002
 Dates-0.002
Token
Token ID198
Feature activation-1.21e-02

Pos logit contributions
culosis+0.004
 glove+0.003
 concess+0.003
 isEnabled+0.003
PLA+0.003
Neg logit contributions
 Canter-0.004
-0.003
 Ban-0.003
 Dates-0.003
ajor-0.003
Token[
Token ID58
Feature activation-3.76e-02

Pos logit contributions
culosis+0.009
 glove+0.009
 concess+0.009
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.009
 Ban-0.008
-0.008
 Dates-0.007
ajor-0.007
Tokengov
Token ID9567
Feature activation-2.51e-02

Pos logit contributions
 Canter+0.006
 Ban+0.005
+0.005
 Dates+0.005
ajor+0.005
Neg logit contributions
culosis-0.006
 glove-0.005
 concess-0.005
 isEnabled-0.005
PLA-0.005
Token/
Token ID14
Feature activation-2.35e-02

Pos logit contributions
culosis+0.018
 glove+0.017
 concess+0.017
 isEnabled+0.017
PLA+0.017
Neg logit contributions
 Canter-0.019
 Ban-0.017
-0.017
 Dates-0.016
ajor-0.016
Tokenhome
Token ID11195
Feature activation+1.56e-01

Pos logit contributions
culosis+0.020
 glove+0.019
 concess+0.019
 isEnabled+0.019
PLA+0.018
Neg logit contributions
 Canter-0.015
-0.013
 Ban-0.013
 Dates-0.012
ajor-0.012
Tokenandre
Token ID49078
Feature activation+2.96e-02

Pos logit contributions
 Canter+0.115
+0.102
 Ban+0.102
 Dates+0.099
ajor+0.099
Neg logit contributions
culosis-0.115
 glove-0.109
 concess-0.107
 isEnabled-0.105
PLA-0.104
Tokencre
Token ID7513
Feature activation+1.95e-01

Pos logit contributions
 Canter+0.021
 Ban+0.019
+0.019
 Dates+0.018
ajor+0.018
Neg logit contributions
culosis-0.022
 glove-0.021
 concess-0.021
 isEnabled-0.021
PLA-0.020
Tokenational
Token ID864
Feature activation+4.76e-01

Pos logit contributions
 Canter+0.135
 Ban+0.120
+0.120
 Dates+0.115
ajor+0.115
Neg logit contributions
culosis-0.151
 glove-0.144
 concess-0.142
 isEnabled-0.139
PLA-0.138
Tokensafety
Token ID44708
Feature activation+1.42e-01

Pos logit contributions
 Canter+0.341
+0.304
 Ban+0.304
 Dates+0.294
ajor+0.293
Neg logit contributions
culosis-0.356
 glove-0.341
 concess-0.334
 isEnabled-0.328
PLA-0.324
Token/
Token ID14
Feature activation-2.36e-02

Pos logit contributions
 Canter+0.103
 Ban+0.092
+0.092
 Dates+0.089
ajor+0.089
Neg logit contributions
culosis-0.105
 glove-0.100
 concess-0.098
 isEnabled-0.096
PLA-0.095
Tokenfalls
Token ID23348
Feature activation-3.48e-02

Pos logit contributions
culosis+0.018
 glove+0.017
 concess+0.017
 isEnabled+0.017
PLA+0.017
Neg logit contributions
 Canter-0.016
 Ban-0.014
-0.014
 Dates-0.014
ajor-0.014
Token/
Token ID14
Feature activation-2.50e-02

Pos logit contributions
culosis+0.026
 glove+0.025
 concess+0.025
 isEnabled+0.024
PLA+0.024
Neg logit contributions
 Canter-0.025
-0.022
 Ban-0.022
 Dates-0.021
ajor-0.021
Tokenpub
Token ID12984
Feature activation-3.09e-02

Pos logit contributions
culosis+0.019
 glove+0.018
 concess+0.018
 isEnabled+0.018
PLA+0.018
Neg logit contributions
 Canter-0.017
 Ban-0.015
-0.015
 Dates-0.015
ajor-0.015
Token flipping
Token ID33097
Feature activation+1.03e-02

Pos logit contributions
culosis+0.010
 glove+0.010
 concess+0.010
 isEnabled+0.009
PLA+0.009
Neg logit contributions
 Canter-0.010
-0.009
 Ban-0.009
ajor-0.008
 Dates-0.008
Token out
Token ID503
Feature activation+3.46e-02

Pos logit contributions
 Canter+0.007
+0.007
 Ban+0.007
ajor+0.006
 Dates+0.006
Neg logit contributions
culosis-0.008
 glove-0.007
 concess-0.007
 isEnabled-0.007
PLA-0.007
Token and
Token ID290
Feature activation-6.53e-03

Pos logit contributions
 Canter+0.025
+0.022
 Ban+0.022
 Dates+0.022
ajor+0.022
Neg logit contributions
culosis-0.026
 glove-0.024
 concess-0.024
 isEnabled-0.023
PLA-0.023
Token being
Token ID852
Feature activation+4.54e-03

Pos logit contributions
culosis+0.005
 glove+0.005
 concess+0.005
 isEnabled+0.005
PLA+0.005
Neg logit contributions
 Canter-0.005
 Ban-0.004
-0.004
 Dates-0.004
ajor-0.004
Token confront
Token ID7239
Feature activation+1.28e-02

Pos logit contributions
 Canter+0.003
+0.003
 Ban+0.003
ajor+0.003
 Dates+0.003
Neg logit contributions
culosis-0.003
 glove-0.003
 concess-0.003
 isEnabled-0.003
PLA-0.003
Tokenational
Token ID864
Feature activation+4.74e-01

Pos logit contributions
 Canter+0.002
 Ban+0.001
+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.017
 glove-0.016
 concess-0.016
 isEnabled-0.016
PLA-0.016
Token towards
Token ID3371
Feature activation-9.62e-02

Pos logit contributions
 Canter+0.345
+0.306
 Ban+0.306
 Dates+0.297
ajor+0.296
Neg logit contributions
culosis-0.351
 glove-0.335
 concess-0.328
 isEnabled-0.322
PLA-0.319
Token the
Token ID262
Feature activation+4.55e-03

Pos logit contributions
culosis+0.071
 glove+0.068
 concess+0.066
 isEnabled+0.065
PLA+0.064
Neg logit contributions
 Canter-0.070
-0.063
 Ban-0.062
ajor-0.060
 Dates-0.060
Token audience
Token ID5386
Feature activation+6.81e-02

Pos logit contributions
 Canter+0.003
+0.003
 Ban+0.003
 Dates+0.003
ajor+0.003
Neg logit contributions
culosis-0.003
 glove-0.003
 concess-0.003
 isEnabled-0.003
PLA-0.003
Token,
Token ID11
Feature activation+1.95e-03

Pos logit contributions
 Canter+0.049
+0.044
 Ban+0.044
 Dates+0.042
ajor+0.042
Neg logit contributions
culosis-0.051
 glove-0.049
 concess-0.048
 isEnabled-0.047
PLA-0.046
Token like
Token ID588
Feature activation+1.11e-03

Pos logit contributions
 Canter+0.001
+0.001
 Ban+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token in
Token ID287
Feature activation-1.59e-02

Pos logit contributions
culosis+0.108
 glove+0.102
 concess+0.100
 isEnabled+0.098
PLA+0.097
Neg logit contributions
 Canter-0.112
-0.100
 Ban-0.100
 Dates-0.097
ajor-0.097
Token an
Token ID281
Feature activation+4.90e-02

Pos logit contributions
culosis+0.012
 glove+0.011
 concess+0.011
 isEnabled+0.011
PLA+0.011
Neg logit contributions
 Canter-0.012
-0.010
 Ban-0.010
ajor-0.010
 Dates-0.010
Token
Token ID564
Feature activation-2.65e-02

Pos logit contributions
 Canter+0.037
+0.033
 Ban+0.033
 Dates+0.032
ajor+0.032
Neg logit contributions
culosis-0.035
 glove-0.033
 concess-0.033
 isEnabled-0.032
PLA-0.032
Token
Token ID250
Feature activation+2.21e-03

Pos logit contributions
culosis+0.021
 glove+0.020
 concess+0.020
 isEnabled+0.019
PLA+0.019
Neg logit contributions
 Canter-0.018
 Ban-0.016
-0.016
 Dates-0.015
ajor-0.015
Tokeneduc
Token ID18123
Feature activation-4.15e-02

Pos logit contributions
 Canter+0.002
+0.001
 Ban+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.002
 glove-0.002
 concess-0.002
 isEnabled-0.002
PLA-0.001
Tokenational
Token ID864
Feature activation+4.70e-01

Pos logit contributions
culosis+0.053
 glove+0.052
 concess+0.051
 isEnabled+0.050
PLA+0.050
Neg logit contributions
 Canter-0.008
-0.004
 Ban-0.004
 Dates-0.004
ajor-0.003
Token program
Token ID1430
Feature activation+1.63e-01

Pos logit contributions
 Canter+0.329
+0.290
 Ban+0.290
 Dates+0.281
ajor+0.280
Neg logit contributions
culosis-0.361
 glove-0.346
 concess-0.339
 isEnabled-0.332
PLA-0.329
Token
Token ID447
Feature activation+5.45e-03

Pos logit contributions
 Canter+0.119
+0.106
 Ban+0.106
 Dates+0.103
ajor+0.103
Neg logit contributions
culosis-0.120
 glove-0.115
 concess-0.112
 isEnabled-0.110
PLA-0.109
Token
Token ID251
Feature activation-5.73e-03

Pos logit contributions
 Canter+0.004
 Ban+0.003
+0.003
 Dates+0.003
ajor+0.003
Neg logit contributions
culosis-0.004
 glove-0.004
 concess-0.004
 isEnabled-0.004
PLA-0.004
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
culosis+0.004
 glove+0.004
 concess+0.004
 isEnabled+0.004
PLA+0.004
Neg logit contributions
 Canter-0.004
-0.004
 Ban-0.004
 Dates-0.004
ajor-0.004
Token
Token ID564
Feature activation-2.83e-02

Pos logit contributions
culosis+0.009
 glove+0.009
 concess+0.008
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.010
-0.009
 Ban-0.009
 Dates-0.009
ajor-0.009
Token't
Token ID470
Feature activation+1.63e-02

Pos logit contributions
 Canter+0.001
 Ban+0.001
+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token be
Token ID307
Feature activation-2.04e-02

Pos logit contributions
 Canter+0.012
+0.011
 Ban+0.011
 Dates+0.011
ajor+0.011
Neg logit contributions
culosis-0.012
 glove-0.011
 concess-0.011
 isEnabled-0.011
PLA-0.010
Token a
Token ID257
Feature activation-1.82e-02

Pos logit contributions
culosis+0.015
 glove+0.014
 concess+0.014
 isEnabled+0.013
PLA+0.013
Neg logit contributions
 Canter-0.015
-0.014
 Ban-0.014
 Dates-0.013
ajor-0.013
Token bad
Token ID2089
Feature activation-1.50e-01

Pos logit contributions
culosis+0.013
 glove+0.013
 concess+0.012
 isEnabled+0.012
PLA+0.012
Neg logit contributions
 Canter-0.013
-0.012
 Ban-0.012
 Dates-0.012
ajor-0.012
Token rot
Token ID5724
Feature activation-6.00e-02

Pos logit contributions
culosis+0.109
 glove+0.106
 concess+0.102
 isEnabled+0.100
PLA+0.098
Neg logit contributions
 Canter-0.111
-0.099
 Ban-0.098
 Dates-0.095
ajor-0.095
Tokenational
Token ID864
Feature activation+4.69e-01

Pos logit contributions
culosis+0.076
 glove+0.075
 concess+0.073
 isEnabled+0.073
PLA+0.072
Neg logit contributions
 Canter-0.012
-0.007
 Ban-0.007
 Dates-0.006
ajor-0.005
Token lineman
Token ID33157
Feature activation-2.22e-01

Pos logit contributions
 Canter+0.388
+0.350
 Ban+0.348
 Dates+0.341
ajor+0.340
Neg logit contributions
culosis-0.300
 glove-0.290
 concess-0.279
 isEnabled-0.271
PLA-0.265
Token,
Token ID11
Feature activation-2.86e-03

Pos logit contributions
culosis+0.164
 glove+0.160
 concess+0.153
 isEnabled+0.152
PLA+0.149
Neg logit contributions
 Canter-0.160
-0.142
 Ban-0.141
 Dates-0.138
ajor-0.137
Token but
Token ID475
Feature activation-1.79e-02

Pos logit contributions
culosis+0.002
 glove+0.002
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.002
-0.002
 Ban-0.002
 Dates-0.002
ajor-0.002
Token it
Token ID340
Feature activation-2.01e-02

Pos logit contributions
culosis+0.013
 glove+0.013
 concess+0.012
 isEnabled+0.012
PLA+0.012
Neg logit contributions
 Canter-0.013
-0.012
 Ban-0.012
ajor-0.011
 Dates-0.011
Token would
Token ID561
Feature activation-2.81e-02

Pos logit contributions
culosis+0.014
 glove+0.013
 concess+0.013
 isEnabled+0.013
PLA+0.013
Neg logit contributions
 Canter-0.015
 Ban-0.014
-0.014
 Dates-0.013
ajor-0.013
Token,
Token ID11
Feature activation-1.77e-03

Pos logit contributions
 Canter+0.020
+0.018
 Ban+0.018
 Dates+0.017
ajor+0.017
Neg logit contributions
culosis-0.021
 glove-0.020
 concess-0.020
 isEnabled-0.019
PLA-0.019
Token wildlife
Token ID15599
Feature activation-2.02e-01

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
-0.001
 Ban-0.001
 Dates-0.001
ajor-0.001
Token sightings
Token ID36242
Feature activation-1.20e-01

Pos logit contributions
culosis+0.138
 glove+0.131
 concess+0.128
 isEnabled+0.125
PLA+0.123
Neg logit contributions
 Canter-0.159
-0.142
 Ban-0.142
 Dates-0.138
ajor-0.138
Token,
Token ID11
Feature activation-8.55e-03

Pos logit contributions
culosis+0.089
 glove+0.085
 concess+0.083
 isEnabled+0.081
PLA+0.080
Neg logit contributions
 Canter-0.088
-0.078
 Ban-0.078
 Dates-0.075
ajor-0.075
Token navig
Token ID20436
Feature activation-1.41e-01

Pos logit contributions
culosis+0.006
 glove+0.006
 concess+0.006
 isEnabled+0.006
PLA+0.006
Neg logit contributions
 Canter-0.006
 Ban-0.006
-0.006
 Dates-0.006
ajor-0.005
Tokenational
Token ID864
Feature activation+4.69e-01

Pos logit contributions
culosis+0.179
 glove+0.173
 concess+0.171
 isEnabled+0.169
PLA+0.168
Neg logit contributions
 Canter-0.029
-0.018
 Ban-0.017
 Dates-0.015
ajor-0.014
Token challenges
Token ID6459
Feature activation+6.86e-02

Pos logit contributions
 Canter+0.344
 Ban+0.306
+0.306
 Dates+0.296
ajor+0.295
Neg logit contributions
culosis-0.345
 glove-0.328
 concess-0.321
 isEnabled-0.316
PLA-0.312
Token,
Token ID11
Feature activation-2.16e-03

Pos logit contributions
 Canter+0.050
 Ban+0.045
+0.045
 Dates+0.043
ajor+0.043
Neg logit contributions
culosis-0.051
 glove-0.048
 concess-0.047
 isEnabled-0.047
PLA-0.046
Token even
Token ID772
Feature activation-9.95e-02

Pos logit contributions
culosis+0.002
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.002
 Ban-0.001
-0.001
 Dates-0.001
ajor-0.001
Token solitude
Token ID49917
Feature activation-1.89e-01

Pos logit contributions
culosis+0.073
 glove+0.070
 concess+0.068
 isEnabled+0.067
PLA+0.066
Neg logit contributions
 Canter-0.073
-0.065
 Ban-0.065
 Dates-0.063
ajor-0.063
Token.
Token ID13
Feature activation-2.92e-03

Pos logit contributions
culosis+0.141
 glove+0.135
 concess+0.132
 isEnabled+0.129
PLA+0.128
Neg logit contributions
 Canter-0.136
-0.121
 Ban-0.121
 Dates-0.117
ajor-0.117
Token of
Token ID286
Feature activation-8.78e-05

Pos logit contributions
 Canter+0.013
 Ban+0.011
+0.011
 Dates+0.011
ajor+0.011
Neg logit contributions
culosis-0.013
 glove-0.012
 concess-0.012
 isEnabled-0.012
PLA-0.012
Token cassette
Token ID42812
Feature activation+1.40e-01

Pos logit contributions
culosis+0.000
 glove+0.000
 concess+0.000
 isEnabled+0.000
PLA+0.000
Neg logit contributions
 Canter-0.000
-0.000
 Ban-0.000
ajor-0.000
 Dates-0.000
Token tapes
Token ID23695
Feature activation+2.81e-01

Pos logit contributions
 Canter+0.081
+0.070
 Ban+0.070
 Dates+0.067
ajor+0.067
Neg logit contributions
culosis-0.125
 glove-0.120
 concess-0.118
 isEnabled-0.116
PLA-0.115
Token in
Token ID287
Feature activation-1.40e-02

Pos logit contributions
 Canter+0.204
+0.181
 Ban+0.181
 Dates+0.175
ajor+0.175
Neg logit contributions
culosis-0.209
 glove-0.200
 concess-0.196
 isEnabled-0.192
PLA-0.190
Token the
Token ID262
Feature activation-6.03e-03

Pos logit contributions
culosis+0.010
 glove+0.010
 concess+0.010
 isEnabled+0.010
PLA+0.009
Neg logit contributions
 Canter-0.010
-0.009
 Ban-0.009
 Dates-0.009
ajor-0.009
Token glove
Token ID29144
Feature activation-5.80e-01

Pos logit contributions
culosis+0.005
 glove+0.004
 concess+0.004
 isEnabled+0.004
PLA+0.004
Neg logit contributions
 Canter-0.004
-0.004
 Ban-0.004
 Dates-0.004
ajor-0.004
Tokenbox
Token ID3524
Feature activation-1.74e-01

Pos logit contributions
culosis+0.394
 glove+0.372
 concess+0.365
 isEnabled+0.357
PLA+0.353
Neg logit contributions
 Canter-0.460
 Ban-0.413
-0.413
 Dates-0.400
ajor-0.399
Token of
Token ID286
Feature activation-2.46e-03

Pos logit contributions
culosis+0.131
 glove+0.125
 concess+0.122
 isEnabled+0.120
PLA+0.119
Neg logit contributions
 Canter-0.125
-0.111
 Ban-0.111
 Dates-0.108
ajor-0.107
Token our
Token ID674
Feature activation-9.74e-03

Pos logit contributions
culosis+0.002
 glove+0.002
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.002
 Ban-0.002
-0.002
 Dates-0.002
ajor-0.002
Token old
Token ID1468
Feature activation-1.58e-02

Pos logit contributions
culosis+0.007
 glove+0.007
 concess+0.006
 isEnabled+0.006
PLA+0.006
Neg logit contributions
 Canter-0.007
-0.007
 Ban-0.007
 Dates-0.006
ajor-0.006
Token Pont
Token ID26345
Feature activation-8.08e-02

Pos logit contributions
culosis+0.011
 glove+0.011
 concess+0.011
 isEnabled+0.010
PLA+0.010
Neg logit contributions
 Canter-0.012
-0.011
 Ban-0.011
 Dates-0.010
ajor-0.010
Token face
Token ID1986
Feature activation-2.42e-02

Pos logit contributions
 Canter+0.013
 Ban+0.012
+0.012
 Dates+0.012
ajor+0.011
Neg logit contributions
culosis-0.013
 glove-0.012
 concess-0.012
 isEnabled-0.012
PLA-0.012
Token gently
Token ID15165
Feature activation-1.62e-01

Pos logit contributions
culosis+0.018
 glove+0.017
 concess+0.017
 isEnabled+0.017
PLA+0.016
Neg logit contributions
 Canter-0.017
-0.015
 Ban-0.015
 Dates-0.015
ajor-0.015
Token with
Token ID351
Feature activation+1.65e-03

Pos logit contributions
culosis+0.122
 glove+0.117
 concess+0.114
 isEnabled+0.112
PLA+0.111
Neg logit contributions
 Canter-0.117
 Ban-0.103
-0.103
 Dates-0.100
ajor-0.100
Token his
Token ID465
Feature activation+2.10e-02

Pos logit contributions
 Canter+0.001
 Ban+0.001
+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token boxing
Token ID21576
Feature activation-1.03e-01

Pos logit contributions
 Canter+0.017
 Ban+0.015
+0.015
 Dates+0.015
ajor+0.015
Neg logit contributions
culosis-0.014
 glove-0.013
 concess-0.013
 isEnabled-0.013
PLA-0.013
Token glove
Token ID29144
Feature activation-5.80e-01

Pos logit contributions
culosis+0.041
 concess+0.035
 glove+0.035
 isEnabled+0.034
PLA+0.034
Neg logit contributions
 Canter-0.111
 Ban-0.104
-0.103
ajor-0.101
 Dates-0.101
Token
Token ID447
Feature activation+9.50e-03

Pos logit contributions
culosis+0.438
 glove+0.417
 concess+0.410
 isEnabled+0.402
PLA+0.398
Neg logit contributions
 Canter-0.414
-0.367
 Ban-0.367
 Dates-0.354
ajor-0.354
Token
Token ID247
Feature activation-3.40e-03

Pos logit contributions
 Canter+0.006
+0.005
 Ban+0.005
 Dates+0.005
ajor+0.005
Neg logit contributions
culosis-0.008
 glove-0.008
 concess-0.008
 isEnabled-0.007
PLA-0.007
Tokens
Token ID82
Feature activation-9.08e-04

Pos logit contributions
culosis+0.003
 glove+0.002
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.002
-0.002
 Ban-0.002
 Dates-0.002
ajor-0.002
Token padded
Token ID44582
Feature activation-6.81e-02

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
-0.001
 Ban-0.001
ajor-0.001
 Dates-0.001
Token red
Token ID2266
Feature activation-8.75e-02

Pos logit contributions
culosis+0.044
 glove+0.041
 concess+0.040
 isEnabled+0.039
PLA+0.039
Neg logit contributions
 Canter-0.056
 Ban-0.051
-0.051
ajor-0.049
 Dates-0.049
Token they
Token ID484
Feature activation-1.47e-02

Pos logit contributions
 Canter+0.005
+0.005
 Ban+0.005
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.005
 glove-0.005
 concess-0.005
 isEnabled-0.005
PLA-0.005
Token would
Token ID561
Feature activation-1.45e-02

Pos logit contributions
culosis+0.011
 glove+0.011
 concess+0.011
 isEnabled+0.010
PLA+0.010
Neg logit contributions
 Canter-0.010
-0.009
 Ban-0.009
 Dates-0.009
ajor-0.009
Token with
Token ID351
Feature activation+1.36e-02

Pos logit contributions
culosis+0.011
 glove+0.010
 concess+0.010
 isEnabled+0.010
PLA+0.010
Neg logit contributions
 Canter-0.011
 Ban-0.009
-0.009
 Dates-0.009
ajor-0.009
Token a
Token ID257
Feature activation-1.32e-02

Pos logit contributions
 Canter+0.010
 Ban+0.009
+0.009
 Dates+0.009
ajor+0.009
Neg logit contributions
culosis-0.010
 glove-0.010
 concess-0.009
 isEnabled-0.009
PLA-0.009
Token baseball
Token ID9283
Feature activation+9.74e-03

Pos logit contributions
culosis+0.010
 glove+0.009
 concess+0.009
 isEnabled+0.009
PLA+0.009
Neg logit contributions
 Canter-0.010
 Ban-0.009
-0.009
 Dates-0.008
ajor-0.008
Token glove
Token ID29144
Feature activation-5.77e-01

Pos logit contributions
 Canter+0.007
 Ban+0.007
+0.007
ajor+0.006
 Dates+0.006
Neg logit contributions
culosis-0.007
 concess-0.006
 glove-0.006
 isEnabled-0.006
PLA-0.006
Token!
Token ID0
Feature activation+6.62e-02

Pos logit contributions
culosis+0.432
 glove+0.411
 concess+0.403
 isEnabled+0.396
PLA+0.391
Neg logit contributions
 Canter-0.416
 Ban-0.369
-0.369
 Dates-0.357
ajor-0.356
Token In
Token ID554
Feature activation-8.62e-03

Pos logit contributions
 Canter+0.047
 Ban+0.042
+0.042
 Dates+0.040
ajor+0.040
Neg logit contributions
culosis-0.050
 glove-0.048
 concess-0.047
 isEnabled-0.046
PLA-0.046
Token Wii
Token ID16591
Feature activation-3.29e-02

Pos logit contributions
culosis+0.007
 glove+0.006
 concess+0.006
 isEnabled+0.006
PLA+0.006
Neg logit contributions
 Canter-0.006
 Ban-0.005
-0.005
 Dates-0.005
ajor-0.005
Token Sports
Token ID7092
Feature activation-7.06e-02

Pos logit contributions
culosis+0.022
 glove+0.021
 concess+0.021
 isEnabled+0.020
PLA+0.020
Neg logit contributions
 Canter-0.026
-0.024
 Ban-0.023
ajor-0.023
 Dates-0.023
Token Club
Token ID6289
Feature activation+1.27e-01

Pos logit contributions
culosis+0.063
 glove+0.061
 concess+0.060
 isEnabled+0.059
PLA+0.058
Neg logit contributions
 Canter-0.040
-0.035
 Ban-0.035
 Dates-0.033
ajor-0.033
Token at
Token ID379
Feature activation+8.80e-03

Pos logit contributions
culosis+0.054
 glove+0.052
 concess+0.051
 isEnabled+0.050
PLA+0.049
Neg logit contributions
 Canter-0.051
-0.045
 Ban-0.045
 Dates-0.044
ajor-0.044
Token his
Token ID465
Feature activation+3.35e-02

Pos logit contributions
 Canter+0.006
 Ban+0.006
+0.006
 Dates+0.005
ajor+0.005
Neg logit contributions
culosis-0.007
 glove-0.006
 concess-0.006
 isEnabled-0.006
PLA-0.006
Token eye
Token ID4151
Feature activation+1.20e-01

Pos logit contributions
 Canter+0.024
 Ban+0.021
+0.021
 Dates+0.020
ajor+0.020
Neg logit contributions
culosis-0.026
 glove-0.024
 concess-0.024
 isEnabled-0.023
PLA-0.023
Token with
Token ID351
Feature activation+2.02e-02

Pos logit contributions
 Canter+0.086
+0.077
 Ban+0.077
 Dates+0.074
ajor+0.074
Neg logit contributions
culosis-0.090
 glove-0.086
 concess-0.085
 isEnabled-0.083
PLA-0.082
Token his
Token ID465
Feature activation+3.69e-02

Pos logit contributions
 Canter+0.015
+0.013
 Ban+0.013
 Dates+0.013
ajor+0.013
Neg logit contributions
culosis-0.015
 glove-0.014
 concess-0.014
 isEnabled-0.014
PLA-0.014
Token glove
Token ID29144
Feature activation-5.75e-01

Pos logit contributions
 Canter+0.028
 Ban+0.025
+0.025
 Dates+0.024
ajor+0.024
Neg logit contributions
culosis-0.027
 glove-0.025
 concess-0.025
 isEnabled-0.024
PLA-0.024
Token,
Token ID11
Feature activation+1.12e-02

Pos logit contributions
culosis+0.434
 glove+0.410
 concess+0.405
 isEnabled+0.397
PLA+0.393
Neg logit contributions
 Canter-0.412
-0.366
 Ban-0.366
 Dates-0.352
ajor-0.352
Token as
Token ID355
Feature activation+3.27e-02

Pos logit contributions
 Canter+0.008
 Ban+0.007
+0.007
 Dates+0.007
ajor+0.007
Neg logit contributions
culosis-0.008
 glove-0.008
 concess-0.008
 isEnabled-0.008
PLA-0.008
Token if
Token ID611
Feature activation+1.01e-01

Pos logit contributions
 Canter+0.022
 Ban+0.019
+0.019
 Dates+0.018
ajor+0.018
Neg logit contributions
culosis-0.027
 glove-0.025
 concess-0.025
 isEnabled-0.025
PLA-0.024
Token trying
Token ID2111
Feature activation-7.15e-02

Pos logit contributions
 Canter+0.074
+0.066
 Ban+0.066
 Dates+0.064
ajor+0.064
Neg logit contributions
culosis-0.074
 glove-0.070
 concess-0.069
 isEnabled-0.068
PLA-0.067
Token to
Token ID284
Feature activation+2.14e-03

Pos logit contributions
culosis+0.053
 glove+0.050
 concess+0.049
 isEnabled+0.048
PLA+0.048
Neg logit contributions
 Canter-0.052
-0.046
 Ban-0.046
 Dates-0.045
ajor-0.045
Token required
Token ID2672
Feature activation+9.67e-02

Pos logit contributions
 Canter+0.013
+0.011
 Ban+0.011
 Dates+0.011
ajor+0.011
Neg logit contributions
culosis-0.017
 glove-0.016
 concess-0.016
 isEnabled-0.016
PLA-0.015
Token guns
Token ID6541
Feature activation-6.66e-02

Pos logit contributions
 Canter+0.063
+0.056
 Ban+0.055
ajor+0.053
 Dates+0.053
Neg logit contributions
culosis-0.079
 glove-0.076
 concess-0.074
 isEnabled-0.073
PLA-0.072
Token be
Token ID307
Feature activation-5.62e-03

Pos logit contributions
culosis+0.049
 glove+0.047
 concess+0.046
 isEnabled+0.045
PLA+0.044
Neg logit contributions
 Canter-0.049
-0.044
 Ban-0.044
 Dates-0.042
ajor-0.042
Token locked
Token ID8970
Feature activation-2.09e-02

Pos logit contributions
culosis+0.004
 glove+0.004
 concess+0.004
 isEnabled+0.004
PLA+0.004
Neg logit contributions
 Canter-0.004
-0.004
 Ban-0.004
 Dates-0.004
ajor-0.004
Token in
Token ID287
Feature activation+2.04e-03

Pos logit contributions
culosis+0.014
 glove+0.013
 concess+0.013
 isEnabled+0.013
PLA+0.013
Neg logit contributions
 Canter-0.017
 Ban-0.015
-0.015
 Dates-0.015
ajor-0.014
Token glove
Token ID29144
Feature activation-5.66e-01

Pos logit contributions
 Canter+0.002
 Ban+0.001
+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token comp
Token ID552
Feature activation-3.10e-02

Pos logit contributions
culosis+0.350
 glove+0.327
 concess+0.322
 isEnabled+0.314
PLA+0.311
Neg logit contributions
 Canter-0.482
 Ban-0.437
-0.436
 Dates-0.424
ajor-0.422
Tokenartments
Token ID32514
Feature activation-7.03e-03

Pos logit contributions
culosis+0.023
 glove+0.021
 concess+0.021
 isEnabled+0.021
PLA+0.020
Neg logit contributions
 Canter-0.023
 Ban-0.021
-0.021
ajor-0.020
 Dates-0.020
Token of
Token ID286
Feature activation+1.28e-02

Pos logit contributions
culosis+0.005
 glove+0.005
 concess+0.005
 isEnabled+0.005
PLA+0.005
Neg logit contributions
 Canter-0.005
-0.004
 Ban-0.004
 Dates-0.004
ajor-0.004
Token tr
Token ID491
Feature activation-6.73e-02

Pos logit contributions
 Canter+0.010
 Ban+0.009
+0.009
 Dates+0.009
ajor+0.009
Neg logit contributions
culosis-0.009
 glove-0.008
 concess-0.008
 isEnabled-0.008
PLA-0.008
Tokenunks
Token ID14125
Feature activation+6.74e-02

Pos logit contributions
culosis+0.056
 glove+0.054
 concess+0.053
 isEnabled+0.052
PLA+0.052
Neg logit contributions
 Canter-0.042
 Ban-0.037
-0.037
ajor-0.036
 Dates-0.035
Token two
Token ID734
Feature activation-4.50e-02

Pos logit contributions
 Canter+0.007
+0.006
 Ban+0.006
 Dates+0.006
ajor+0.006
Neg logit contributions
culosis-0.007
 glove-0.006
 concess-0.006
 isEnabled-0.006
PLA-0.006
Token metal
Token ID6147
Feature activation+8.38e-02

Pos logit contributions
culosis+0.033
 glove+0.032
 concess+0.031
 isEnabled+0.031
PLA+0.030
Neg logit contributions
 Canter-0.033
-0.029
 Ban-0.029
 Dates-0.028
ajor-0.028
Token clips
Token ID19166
Feature activation+2.44e-01

Pos logit contributions
 Canter+0.060
+0.054
 Ban+0.053
 Dates+0.052
ajor+0.052
Neg logit contributions
culosis-0.063
 glove-0.060
 concess-0.059
 isEnabled-0.058
PLA-0.057
Token near
Token ID1474
Feature activation-1.90e-02

Pos logit contributions
 Canter+0.180
+0.160
 Ban+0.160
 Dates+0.155
ajor+0.155
Neg logit contributions
culosis-0.179
 glove-0.171
 concess-0.167
 isEnabled-0.164
PLA-0.162
Token the
Token ID262
Feature activation+7.19e-03

Pos logit contributions
culosis+0.014
 glove+0.013
 concess+0.013
 isEnabled+0.013
PLA+0.013
Neg logit contributions
 Canter-0.014
-0.012
 Ban-0.012
 Dates-0.012
ajor-0.012
Token glove
Token ID29144
Feature activation-5.66e-01

Pos logit contributions
 Canter+0.005
 Ban+0.004
+0.004
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.006
 glove-0.006
 concess-0.005
 isEnabled-0.005
PLA-0.005
Tokenbox
Token ID3524
Feature activation-1.64e-01

Pos logit contributions
culosis+0.375
 glove+0.346
 concess+0.345
 isEnabled+0.336
PLA+0.333
Neg logit contributions
 Canter-0.459
 Ban-0.416
-0.414
 Dates-0.401
ajor-0.400
Token.
Token ID13
Feature activation+6.07e-03

Pos logit contributions
culosis+0.122
 glove+0.116
 concess+0.114
 isEnabled+0.112
PLA+0.110
Neg logit contributions
 Canter-0.119
 Ban-0.106
-0.106
 Dates-0.102
ajor-0.102
Token Make
Token ID6889
Feature activation+8.70e-02

Pos logit contributions
 Canter+0.004
+0.004
 Ban+0.004
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.005
 glove-0.004
 concess-0.004
 isEnabled-0.004
PLA-0.004
Token sure
Token ID1654
Feature activation+1.47e-02

Pos logit contributions
 Canter+0.062
+0.055
 Ban+0.055
 Dates+0.053
ajor+0.053
Neg logit contributions
culosis-0.066
 glove-0.063
 concess-0.061
 isEnabled-0.060
PLA-0.060
Token the
Token ID262
Feature activation+9.41e-03

Pos logit contributions
 Canter+0.011
 Ban+0.010
+0.010
 Dates+0.009
ajor+0.009
Neg logit contributions
culosis-0.011
 glove-0.010
 concess-0.010
 isEnabled-0.010
PLA-0.010
Token last
Token ID938
Feature activation+2.64e-02

Pos logit contributions
 Canter+0.068
+0.060
 Ban+0.060
 Dates+0.058
ajor+0.058
Neg logit contributions
culosis-0.066
 glove-0.063
 concess-0.061
 isEnabled-0.060
PLA-0.060
Token year
Token ID614
Feature activation+2.31e-02

Pos logit contributions
 Canter+0.019
 Ban+0.017
+0.017
 Dates+0.016
ajor+0.016
Neg logit contributions
culosis-0.020
 glove-0.019
 concess-0.019
 isEnabled-0.018
PLA-0.018
Token.
Token ID13
Feature activation+5.89e-03

Pos logit contributions
 Canter+0.017
+0.015
 Ban+0.015
 Dates+0.014
ajor+0.014
Neg logit contributions
culosis-0.017
 glove-0.016
 concess-0.016
 isEnabled-0.016
PLA-0.016
Token
Token ID198
Feature activation-3.32e-03

Pos logit contributions
 Canter+0.004
 Ban+0.004
+0.004
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.004
 glove-0.004
 concess-0.004
 isEnabled-0.004
PLA-0.004
Token
Token ID198
Feature activation+3.78e-04

Pos logit contributions
culosis+0.002
 glove+0.002
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.002
 Ban-0.002
-0.002
 Dates-0.002
ajor-0.002
TokenHur
Token ID42633
Feature activation-5.09e-01

Pos logit contributions
 Canter+0.000
+0.000
 Ban+0.000
 Dates+0.000
ajor+0.000
Neg logit contributions
culosis-0.000
 glove-0.000
 concess-0.000
 isEnabled-0.000
PLA-0.000
Tokentt
Token ID926
Feature activation-1.64e-01

Pos logit contributions
culosis+0.274
 glove+0.257
 concess+0.250
 isEnabled+0.242
PLA+0.239
Neg logit contributions
 Canter-0.474
 Ban-0.433
-0.433
 Dates-0.422
ajor-0.421
Token has
Token ID468
Feature activation-9.69e-04

Pos logit contributions
culosis+0.117
 glove+0.113
 concess+0.109
 isEnabled+0.108
PLA+0.105
Neg logit contributions
 Canter-0.123
-0.110
 Ban-0.109
 Dates-0.106
ajor-0.106
Token also
Token ID635
Feature activation+6.66e-02

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
-0.001
 Ban-0.001
 Dates-0.001
ajor-0.001
Token tut
Token ID9732
Feature activation+2.47e-03

Pos logit contributions
 Canter+0.052
+0.046
 Ban+0.046
 Dates+0.045
ajor+0.045
Neg logit contributions
culosis-0.046
 glove-0.044
 concess-0.043
 isEnabled-0.042
PLA-0.041
Tokenored
Token ID1850
Feature activation-3.11e-01

Pos logit contributions
 Canter+0.003
 Ban+0.003
+0.003
ajor+0.003
 Dates+0.003
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token<|endoftext|>
Token ID50256
Feature activation+7.58e-02

Pos logit contributions
culosis+0.007
 glove+0.007
 concess+0.007
 isEnabled+0.007
PLA+0.007
Neg logit contributions
 Canter-0.007
-0.006
 Ban-0.006
ajor-0.006
 Dates-0.006
TokenHaw
Token ID33055
Feature activation-2.43e-01

Pos logit contributions
 Canter+0.054
 Ban+0.048
+0.048
 Dates+0.046
ajor+0.046
Neg logit contributions
culosis-0.057
 glove-0.055
 concess-0.054
 isEnabled-0.053
PLA-0.052
Tokenaii
Token ID42648
Feature activation-2.22e-04

Pos logit contributions
culosis+0.179
 glove+0.170
 concess+0.167
 isEnabled+0.164
PLA+0.162
Neg logit contributions
 Canter-0.179
 Ban-0.160
-0.159
 Dates-0.154
ajor-0.154
Token Congress
Token ID3162
Feature activation+1.07e-02

Pos logit contributions
culosis+0.000
 glove+0.000
 concess+0.000
 isEnabled+0.000
PLA+0.000
Neg logit contributions
 Canter-0.000
 Ban-0.000
-0.000
 Dates-0.000
ajor-0.000
Tokenwoman
Token ID8580
Feature activation-1.58e-01

Pos logit contributions
 Canter+0.009
 Ban+0.008
+0.008
 Dates+0.008
ajor+0.008
Neg logit contributions
culosis-0.007
 glove-0.006
 concess-0.006
 isEnabled-0.006
PLA-0.006
Token Tuls
Token ID33219
Feature activation-4.90e-01

Pos logit contributions
culosis+0.078
 glove+0.072
 concess+0.071
 isEnabled+0.068
PLA+0.068
Neg logit contributions
 Canter-0.155
 Ban-0.143
-0.141
 Dates-0.139
ajor-0.137
Tokeni
Token ID72
Feature activation-1.19e-01

Pos logit contributions
culosis+0.331
 concess+0.319
 glove+0.316
 isEnabled+0.301
PLA+0.298
Neg logit contributions
 Canter-0.387
 Ban-0.359
-0.349
 Dates-0.347
ajor-0.342
Token G
Token ID402
Feature activation-2.59e-02

Pos logit contributions
culosis+0.099
 concess+0.096
 glove+0.094
 isEnabled+0.090
PLA+0.090
Neg logit contributions
 Canter-0.077
 Ban-0.072
-0.068
 Dates-0.067
 PAGE-0.066
Tokenabb
Token ID6485
Feature activation-2.25e-01

Pos logit contributions
culosis+0.013
 glove+0.012
 concess+0.012
 isEnabled+0.011
PLA+0.011
Neg logit contributions
 Canter-0.025
 Ban-0.023
-0.023
ajor-0.023
 Dates-0.023
Tokenard
Token ID446
Feature activation-2.10e-01

Pos logit contributions
culosis+0.127
 glove+0.119
 concess+0.117
 isEnabled+0.112
PLA+0.111
Neg logit contributions
 Canter-0.205
 Ban-0.187
-0.185
 Dates-0.183
ajor-0.183
Token is
Token ID318
Feature activation-2.77e-02

Pos logit contributions
culosis+0.154
 glove+0.146
 concess+0.143
 isEnabled+0.141
PLA+0.139
Neg logit contributions
 Canter-0.156
 Ban-0.139
-0.138
 Dates-0.134
ajor-0.133
Token the
Token ID262
Feature activation+1.07e-02

Pos logit contributions
 Canter+0.012
 Ban+0.011
+0.010
 Dates+0.010
ajor+0.010
Neg logit contributions
culosis-0.013
 glove-0.012
 concess-0.012
 isEnabled-0.012
PLA-0.012
Token most
Token ID749
Feature activation+6.41e-03

Pos logit contributions
 Canter+0.008
+0.007
 Ban+0.007
 Dates+0.007
ajor+0.007
Neg logit contributions
culosis-0.008
 glove-0.007
 concess-0.007
 isEnabled-0.007
PLA-0.007
Token serious
Token ID2726
Feature activation-4.66e-02

Pos logit contributions
 Canter+0.005
 Ban+0.004
+0.004
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.005
 glove-0.004
 concess-0.004
 isEnabled-0.004
PLA-0.004
Token charges
Token ID4530
Feature activation-5.23e-02

Pos logit contributions
culosis+0.031
 glove+0.029
 concess+0.028
 isEnabled+0.028
PLA+0.027
Neg logit contributions
 Canter-0.038
 Ban-0.034
-0.034
 Dates-0.033
ajor-0.033
Token ever
Token ID1683
Feature activation-1.44e-01

Pos logit contributions
culosis+0.040
 glove+0.037
 concess+0.037
 isEnabled+0.036
PLA+0.036
Neg logit contributions
 Canter-0.038
 Ban-0.034
-0.034
 Dates-0.032
ajor-0.032
Token leve
Token ID34002
Feature activation-4.87e-01

Pos logit contributions
culosis+0.077
 glove+0.070
 concess+0.069
 isEnabled+0.068
PLA+0.067
Neg logit contributions
 Canter-0.136
 Ban-0.125
-0.124
 Dates-0.121
ajor-0.120
Tokenlled
Token ID3353
Feature activation-1.91e-02

Pos logit contributions
culosis+0.346
 glove+0.328
 concess+0.322
 isEnabled+0.315
PLA+0.311
Neg logit contributions
 Canter-0.370
 Ban-0.332
-0.331
ajor-0.320
 Dates-0.320
Token at
Token ID379
Feature activation+1.14e-02

Pos logit contributions
culosis+0.015
 glove+0.014
 concess+0.014
 isEnabled+0.014
PLA+0.014
Neg logit contributions
 Canter-0.013
 Ban-0.012
-0.012
 Dates-0.011
ajor-0.011
Token a
Token ID257
Feature activation-7.49e-03

Pos logit contributions
 Canter+0.008
 Ban+0.007
+0.007
 Dates+0.007
ajor+0.007
Neg logit contributions
culosis-0.009
 glove-0.008
 concess-0.008
 isEnabled-0.008
PLA-0.008
Token musical
Token ID10530
Feature activation+2.24e-01

Pos logit contributions
culosis+0.006
 glove+0.006
 concess+0.005
 isEnabled+0.005
PLA+0.005
Neg logit contributions
 Canter-0.005
 Ban-0.005
-0.005
 Dates-0.004
ajor-0.004
Token genius
Token ID15632
Feature activation+1.87e-03

Pos logit contributions
 Canter+0.153
+0.135
 Ban+0.134
 Dates+0.130
ajor+0.129
Neg logit contributions
culosis-0.176
 glove-0.169
 concess-0.165
 isEnabled-0.162
PLA-0.161
Token
Token ID198
Feature activation-7.46e-03

Pos logit contributions
culosis+0.008
 glove+0.008
 concess+0.008
 isEnabled+0.008
PLA+0.007
Neg logit contributions
 Canter-0.008
 Ban-0.007
-0.007
 Dates-0.007
ajor-0.007
Token
Token ID198
Feature activation-3.75e-03

Pos logit contributions
culosis+0.006
 glove+0.005
 concess+0.005
 isEnabled+0.005
PLA+0.005
Neg logit contributions
 Canter-0.005
 Ban-0.005
-0.005
 Dates-0.005
ajor-0.005
TokenGame
Token ID8777
Feature activation-2.07e-01

Pos logit contributions
culosis+0.003
 glove+0.003
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.003
-0.003
 Ban-0.003
 Dates-0.002
ajor-0.002
Token 74
Token ID8915
Feature activation+1.01e-01

Pos logit contributions
culosis+0.172
 glove+0.164
 concess+0.160
 isEnabled+0.158
PLA+0.157
Neg logit contributions
 Canter-0.133
-0.117
 Ban-0.115
ajor-0.112
 Dates-0.111
Token v
Token ID410
Feature activation+2.15e-01

Pos logit contributions
 Canter+0.046
+0.038
 Ban+0.038
 Dates+0.036
ajor+0.036
Neg logit contributions
culosis-0.102
 glove-0.098
 concess-0.097
 isEnabled-0.095
PLA-0.095
Token Blues
Token ID15261
Feature activation-4.85e-01

Pos logit contributions
 Canter+0.184
 Ban+0.167
+0.165
 Dates+0.161
ajor+0.159
Neg logit contributions
culosis-0.136
 glove-0.126
 concess-0.126
 isEnabled-0.121
PLA-0.119
Token
Token ID784
Feature activation-7.82e-02

Pos logit contributions
culosis+0.338
 glove+0.319
 concess+0.316
 isEnabled+0.308
PLA+0.304
Neg logit contributions
 Canter-0.374
 Ban-0.338
-0.337
 Dates-0.326
ajor-0.324
Token Lost
Token ID9164
Feature activation-1.66e-01

Pos logit contributions
culosis+0.057
 glove+0.055
 concess+0.054
 isEnabled+0.053
PLA+0.052
Neg logit contributions
 Canter-0.058
 Ban-0.051
-0.051
 Dates-0.050
ajor-0.049
Token 4
Token ID604
Feature activation+1.14e-02

Pos logit contributions
culosis+0.127
 glove+0.121
 concess+0.119
 isEnabled+0.117
PLA+0.116
Neg logit contributions
 Canter-0.117
 Ban-0.104
-0.104
 Dates-0.100
ajor-0.100
Token-
Token ID12
Feature activation-5.96e-03

Pos logit contributions
 Canter+0.008
 Ban+0.007
+0.007
 Dates+0.007
ajor+0.007
Neg logit contributions
culosis-0.008
 glove-0.008
 concess-0.008
 isEnabled-0.008
PLA-0.008
Token0
Token ID15
Feature activation-1.86e-01

Pos logit contributions
culosis+0.004
 glove+0.004
 concess+0.004
 isEnabled+0.004
PLA+0.004
Neg logit contributions
 Canter-0.004
-0.004
 Ban-0.004
 Dates-0.004
ajor-0.004
Token Caps
Token ID23534
Feature activation-2.05e-01

Pos logit contributions
culosis+0.000
 glove+0.000
 concess+0.000
 isEnabled+0.000
PLA+0.000
Neg logit contributions
 Canter-0.001
 Ban-0.001
-0.001
 Dates-0.001
ajor-0.001
Token out
Token ID503
Feature activation+3.60e-02

Pos logit contributions
culosis+0.152
 glove+0.144
 concess+0.142
 isEnabled+0.139
PLA+0.137
Neg logit contributions
 Canter-0.149
 Ban-0.133
-0.133
 Dates-0.128
ajor-0.128
Token-
Token ID12
Feature activation+6.56e-04

Pos logit contributions
 Canter+0.021
+0.018
 Ban+0.018
 Dates+0.017
ajor+0.017
Neg logit contributions
culosis-0.032
 glove-0.030
 concess-0.030
 isEnabled-0.030
PLA-0.029
Tokenshot
Token ID9442
Feature activation+1.40e-01

Pos logit contributions
 Canter+0.000
+0.000
 Ban+0.000
ajor+0.000
 Dates+0.000
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token the
Token ID262
Feature activation-6.03e-03

Pos logit contributions
 Canter+0.103
 Ban+0.092
+0.091
 Dates+0.088
ajor+0.087
Neg logit contributions
culosis-0.105
 glove-0.099
 concess-0.098
 isEnabled-0.096
PLA-0.094
Token Blues
Token ID15261
Feature activation-4.79e-01

Pos logit contributions
culosis+0.002
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.007
 Ban-0.007
-0.007
 Dates-0.007
ajor-0.007
Token 10
Token ID838
Feature activation+3.72e-02

Pos logit contributions
culosis+0.354
 glove+0.337
 concess+0.330
 isEnabled+0.324
PLA+0.320
Neg logit contributions
 Canter-0.351
-0.312
 Ban-0.312
 Dates-0.302
ajor-0.301
Token-
Token ID12
Feature activation-8.40e-03

Pos logit contributions
 Canter+0.026
+0.024
 Ban+0.024
 Dates+0.023
ajor+0.023
Neg logit contributions
culosis-0.028
 glove-0.027
 concess-0.026
 isEnabled-0.026
PLA-0.026
Token8
Token ID23
Feature activation+1.37e-02

Pos logit contributions
culosis+0.006
 glove+0.006
 concess+0.006
 isEnabled+0.006
PLA+0.006
Neg logit contributions
 Canter-0.006
-0.005
 Ban-0.005
 Dates-0.005
ajor-0.005
Token.
Token ID13
Feature activation+2.95e-03

Pos logit contributions
 Canter+0.010
+0.009
 Ban+0.009
 Dates+0.009
ajor+0.009
Neg logit contributions
culosis-0.010
 glove-0.010
 concess-0.010
 isEnabled-0.009
PLA-0.009
Token The
Token ID383
Feature activation+1.08e-03

Pos logit contributions
 Canter+0.002
 Ban+0.002
+0.002
 Dates+0.002
ajor+0.002
Neg logit contributions
culosis-0.002
 concess-0.002
 glove-0.002
 isEnabled-0.002
PLA-0.002
Token and
Token ID290
Feature activation-1.54e-02

Pos logit contributions
 Canter+0.006
 Ban+0.005
+0.005
 Dates+0.005
ajor+0.005
Neg logit contributions
culosis-0.006
 glove-0.005
 concess-0.005
 isEnabled-0.005
PLA-0.005
Token more
Token ID517
Feature activation-1.06e-03

Pos logit contributions
culosis+0.011
 glove+0.011
 concess+0.011
 isEnabled+0.010
PLA+0.010
Neg logit contributions
 Canter-0.011
 Ban-0.010
-0.010
 Dates-0.010
ajor-0.010
Token against
Token ID1028
Feature activation+3.33e-02

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
-0.001
 Ban-0.001
 Dates-0.001
ajor-0.001
Token an
Token ID281
Feature activation+4.47e-02

Pos logit contributions
 Canter+0.025
 Ban+0.022
+0.022
 Dates+0.021
ajor+0.021
Neg logit contributions
culosis-0.024
 glove-0.023
 concess-0.023
 isEnabled-0.022
PLA-0.022
Token impressive
Token ID8036
Feature activation+1.31e-02

Pos logit contributions
 Canter+0.032
 Ban+0.029
+0.029
 Dates+0.027
ajor+0.027
Neg logit contributions
culosis-0.034
 glove-0.032
 concess-0.031
 isEnabled-0.031
PLA-0.031
Token Blues
Token ID15261
Feature activation-4.79e-01

Pos logit contributions
 Canter+0.012
 Ban+0.011
+0.011
 Dates+0.011
ajor+0.011
Neg logit contributions
culosis-0.007
 glove-0.007
 concess-0.007
 isEnabled-0.006
PLA-0.006
Token outfit
Token ID16313
Feature activation-1.66e-02

Pos logit contributions
culosis+0.299
 glove+0.281
 concess+0.275
 isEnabled+0.269
PLA+0.265
Neg logit contributions
 Canter-0.406
 Ban-0.367
-0.367
 Dates-0.356
ajor-0.356
Token,
Token ID11
Feature activation+6.04e-03

Pos logit contributions
culosis+0.012
 glove+0.012
 concess+0.012
 isEnabled+0.011
PLA+0.011
Neg logit contributions
 Canter-0.012
-0.011
 Ban-0.011
 Dates-0.010
ajor-0.010
Token however
Token ID2158
Feature activation+4.68e-02

Pos logit contributions
 Canter+0.004
 Ban+0.004
+0.004
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.005
 glove-0.004
 concess-0.004
 isEnabled-0.004
PLA-0.004
Token it
Token ID340
Feature activation-1.65e-02

Pos logit contributions
 Canter+0.034
 Ban+0.031
+0.031
 Dates+0.030
ajor+0.029
Neg logit contributions
culosis-0.035
 glove-0.033
 concess-0.032
 isEnabled-0.032
PLA-0.031
Token proved
Token ID8302
Feature activation+2.04e-01

Pos logit contributions
culosis+0.013
 glove+0.012
 concess+0.012
 isEnabled+0.012
PLA+0.012
Neg logit contributions
 Canter-0.011
 Ban-0.010
-0.010
 Dates-0.010
ajor-0.010
Token Inter
Token ID4225
Feature activation+8.05e-02

Pos logit contributions
 Canter+0.009
+0.008
 Ban+0.008
 Dates+0.007
ajor+0.007
Neg logit contributions
culosis-0.009
 glove-0.009
 concess-0.008
 isEnabled-0.008
PLA-0.008
Tokenpol
Token ID16104
Feature activation-6.95e-02

Pos logit contributions
 Canter+0.059
 Ban+0.054
+0.053
 Dates+0.051
zek+0.051
Neg logit contributions
culosis-0.058
 glove-0.056
 concess-0.055
 isEnabled-0.053
PLA-0.053
Token list
Token ID1351
Feature activation-4.77e-02

Pos logit contributions
culosis+0.051
 glove+0.048
 concess+0.047
 isEnabled+0.046
PLA+0.046
Neg logit contributions
 Canter-0.052
-0.046
 Ban-0.046
ajor-0.044
 Dates-0.044
Token which
Token ID543
Feature activation+3.35e-04

Pos logit contributions
culosis+0.034
 glove+0.033
 concess+0.032
 isEnabled+0.031
PLA+0.031
Neg logit contributions
 Canter-0.036
-0.032
 Ban-0.032
 Dates-0.031
ajor-0.031
Token Tel
Token ID12088
Feature activation+1.85e-01

Pos logit contributions
 Canter+0.000
+0.000
 Ban+0.000
ajor+0.000
 Dates+0.000
Neg logit contributions
culosis-0.000
 glove-0.000
 concess-0.000
 isEnabled-0.000
PLA-0.000
Tokenstra
Token ID12044
Feature activation-4.78e-01

Pos logit contributions
 Canter+0.235
+0.220
 Ban+0.220
ajor+0.216
 Dates+0.216
Neg logit contributions
culosis-0.037
 glove-0.031
 concess-0.028
 isEnabled-0.026
PLA-0.024
Token is
Token ID318
Feature activation-2.28e-02

Pos logit contributions
culosis+0.342
 glove+0.328
 concess+0.322
 isEnabled+0.314
PLA+0.309
Neg logit contributions
 Canter-0.359
-0.321
 Ban-0.319
ajor-0.310
 Dates-0.309
Token examining
Token ID17247
Feature activation-1.23e-02

Pos logit contributions
culosis+0.017
 glove+0.016
 concess+0.016
 isEnabled+0.016
PLA+0.015
Neg logit contributions
 Canter-0.017
-0.015
 Ban-0.015
 Dates-0.014
ajor-0.014
Token is
Token ID318
Feature activation-2.57e-02

Pos logit contributions
culosis+0.009
 glove+0.008
 concess+0.008
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.009
-0.008
 Ban-0.008
 Dates-0.008
ajor-0.008
Token believed
Token ID4762
Feature activation-8.00e-02

Pos logit contributions
culosis+0.019
 glove+0.018
 concess+0.017
 isEnabled+0.017
PLA+0.017
Neg logit contributions
 Canter-0.019
-0.017
 Ban-0.017
 Dates-0.017
ajor-0.017
Token to
Token ID284
Feature activation-2.08e-03

Pos logit contributions
culosis+0.059
 glove+0.056
 concess+0.055
 isEnabled+0.054
PLA+0.053
Neg logit contributions
 Canter-0.059
 Ban-0.052
-0.052
 Dates-0.051
ajor-0.050
Token.
Token ID13
Feature activation-6.82e-03

Pos logit contributions
culosis+0.052
 glove+0.049
 concess+0.048
 isEnabled+0.047
PLA+0.047
Neg logit contributions
 Canter-0.051
-0.045
 Ban-0.045
 Dates-0.044
ajor-0.043
Token However
Token ID2102
Feature activation+7.17e-03

Pos logit contributions
culosis+0.005
 glove+0.005
 concess+0.005
 isEnabled+0.005
PLA+0.005
Neg logit contributions
 Canter-0.005
-0.004
 Ban-0.004
 Dates-0.004
ajor-0.004
Token,
Token ID11
Feature activation-7.13e-03

Pos logit contributions
 Canter+0.005
+0.005
 Ban+0.005
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.005
 glove-0.005
 concess-0.005
 isEnabled-0.005
PLA-0.005
Token a
Token ID257
Feature activation-2.44e-02

Pos logit contributions
culosis+0.005
 glove+0.005
 concess+0.005
 isEnabled+0.005
PLA+0.005
Neg logit contributions
 Canter-0.005
-0.005
 Ban-0.005
 Dates-0.005
ajor-0.005
Token Tel
Token ID12088
Feature activation+1.82e-01

Pos logit contributions
culosis+0.018
 glove+0.017
 concess+0.017
 isEnabled+0.017
PLA+0.017
Neg logit contributions
 Canter-0.018
-0.016
 Ban-0.016
 Dates-0.015
ajor-0.015
Tokenstra
Token ID12044
Feature activation-4.78e-01

Pos logit contributions
 Canter+0.163
 Ban+0.148
+0.148
 Dates+0.144
ajor+0.144
Neg logit contributions
culosis-0.104
 glove-0.098
 concess-0.096
 isEnabled-0.093
PLA-0.092
Token spokesperson
Token ID11821
Feature activation+5.03e-02

Pos logit contributions
culosis+0.364
 glove+0.348
 concess+0.342
 isEnabled+0.334
PLA+0.331
Neg logit contributions
 Canter-0.338
-0.299
 Ban-0.299
 Dates-0.289
ajor-0.289
Token today
Token ID1909
Feature activation-1.52e-02

Pos logit contributions
 Canter+0.034
+0.030
 Ban+0.030
 Dates+0.029
ajor+0.029
Neg logit contributions
culosis-0.039
 glove-0.038
 concess-0.037
 isEnabled-0.036
PLA-0.036
Token said
Token ID531
Feature activation+2.80e-02

Pos logit contributions
culosis+0.011
 glove+0.011
 concess+0.011
 isEnabled+0.010
PLA+0.010
Neg logit contributions
 Canter-0.011
-0.010
 Ban-0.010
 Dates-0.010
ajor-0.009
Token the
Token ID262
Feature activation-1.02e-02

Pos logit contributions
 Canter+0.020
 Ban+0.018
+0.018
 Dates+0.018
ajor+0.017
Neg logit contributions
culosis-0.021
 glove-0.020
 concess-0.019
 isEnabled-0.019
PLA-0.019
Token tel
Token ID13632
Feature activation-2.76e-02

Pos logit contributions
culosis+0.008
 glove+0.008
 concess+0.007
 isEnabled+0.007
PLA+0.007
Neg logit contributions
 Canter-0.007
-0.006
 Ban-0.006
 Dates-0.006
ajor-0.006
Token morning
Token ID3329
Feature activation+1.72e-01

Pos logit contributions
culosis+0.050
 glove+0.048
 concess+0.047
 isEnabled+0.046
PLA+0.046
Neg logit contributions
 Canter-0.048
 Ban-0.043
-0.043
 Dates-0.041
ajor-0.041
Token sunshine
Token ID34488
Feature activation-2.11e-02

Pos logit contributions
 Canter+0.128
 Ban+0.114
+0.114
 Dates+0.111
ajor+0.110
Neg logit contributions
culosis-0.125
 glove-0.119
 concess-0.117
 isEnabled-0.115
PLA-0.113
Token for
Token ID329
Feature activation-4.28e-03

Pos logit contributions
culosis+0.016
 glove+0.015
 concess+0.015
 isEnabled+0.014
PLA+0.014
Neg logit contributions
 Canter-0.015
-0.014
 Ban-0.014
ajor-0.013
 Dates-0.013
Token those
Token ID883
Feature activation-2.43e-02

Pos logit contributions
culosis+0.003
 glove+0.003
 concess+0.003
 isEnabled+0.003
PLA+0.003
Neg logit contributions
 Canter-0.003
 Ban-0.003
-0.003
 Dates-0.003
ajor-0.003
Token "
Token ID366
Feature activation+3.27e-02

Pos logit contributions
culosis+0.018
 glove+0.017
 concess+0.017
 isEnabled+0.017
PLA+0.017
Neg logit contributions
 Canter-0.018
 Ban-0.016
-0.016
 Dates-0.015
ajor-0.015
Tokencold
Token ID36673
Feature activation-4.77e-01

Pos logit contributions
 Canter+0.024
+0.021
 Ban+0.021
 Dates+0.021
ajor+0.021
Neg logit contributions
culosis-0.024
 glove-0.023
 concess-0.022
 isEnabled-0.022
PLA-0.022
Token"
Token ID1
Feature activation-2.04e-02

Pos logit contributions
culosis+0.348
 glove+0.331
 concess+0.324
 isEnabled+0.318
PLA+0.314
Neg logit contributions
 Canter-0.354
-0.315
 Ban-0.315
 Dates-0.305
ajor-0.304
Token AM
Token ID3001
Feature activation+3.17e-01

Pos logit contributions
culosis+0.014
 glove+0.013
 concess+0.013
 isEnabled+0.012
PLA+0.012
Neg logit contributions
 Canter-0.016
 Ban-0.015
-0.015
 Dates-0.014
ajor-0.014
Token hours
Token ID2250
Feature activation-4.78e-02

Pos logit contributions
 Canter+0.235
 Ban+0.209
+0.209
ajor+0.202
 Dates+0.202
Neg logit contributions
culosis-0.231
 glove-0.219
 concess-0.216
 isEnabled-0.212
PLA-0.208
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
culosis+0.036
 glove+0.034
 concess+0.033
 isEnabled+0.033
PLA+0.032
Neg logit contributions
 Canter-0.035
-0.031
 Ban-0.031
 Dates-0.030
ajor-0.030
Token I
Token ID314
Feature activation-1.27e-02

Pos logit contributions
culosis+0.009
 glove+0.008
 concess+0.008
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.008
-0.007
 Ban-0.007
ajor-0.007
 Dates-0.007
Token Regina
Token ID38739
Feature activation-1.42e-02

Pos logit contributions
culosis+0.004
 glove+0.004
 concess+0.004
 isEnabled+0.004
PLA+0.004
Neg logit contributions
 Canter-0.004
-0.003
 Ban-0.003
ajor-0.003
 Dates-0.003
Token,
Token ID11
Feature activation-1.01e-02

Pos logit contributions
culosis+0.010
 glove+0.010
 concess+0.010
 isEnabled+0.010
PLA+0.009
Neg logit contributions
 Canter-0.010
-0.009
 Ban-0.009
 Dates-0.009
ajor-0.009
Token if
Token ID611
Feature activation+7.44e-02

Pos logit contributions
culosis+0.007
 glove+0.007
 concess+0.007
 isEnabled+0.006
PLA+0.006
Neg logit contributions
 Canter-0.008
-0.007
 Ban-0.007
 Dates-0.007
ajor-0.007
Token you
Token ID345
Feature activation-2.44e-02

Pos logit contributions
 Canter+0.056
+0.049
 Ban+0.049
 Dates+0.048
ajor+0.048
Neg logit contributions
culosis-0.054
 glove-0.051
 concess-0.050
 isEnabled-0.049
PLA-0.049
Token are
Token ID389
Feature activation-3.76e-02

Pos logit contributions
culosis+0.018
 glove+0.017
 concess+0.017
 isEnabled+0.016
PLA+0.016
Neg logit contributions
 Canter-0.018
 Ban-0.016
-0.016
 Dates-0.016
ajor-0.016
Token Indigenous
Token ID24033
Feature activation-4.75e-01

Pos logit contributions
culosis+0.028
 glove+0.027
 concess+0.026
 isEnabled+0.026
PLA+0.026
Neg logit contributions
 Canter-0.027
 Ban-0.024
-0.024
 Dates-0.023
ajor-0.023
Token,
Token ID11
Feature activation-1.24e-02

Pos logit contributions
culosis+0.351
 glove+0.335
 concess+0.328
 isEnabled+0.322
PLA+0.318
Neg logit contributions
 Canter-0.347
 Ban-0.308
-0.308
 Dates-0.298
ajor-0.297
Token you
Token ID345
Feature activation-2.49e-02

Pos logit contributions
culosis+0.009
 glove+0.009
 concess+0.008
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.009
 Ban-0.008
-0.008
 Dates-0.008
ajor-0.008
Token
Token ID447
Feature activation-1.63e-03

Pos logit contributions
culosis+0.018
 glove+0.017
 concess+0.017
 isEnabled+0.017
PLA+0.016
Neg logit contributions
 Canter-0.018
-0.016
 Ban-0.016
 Dates-0.016
ajor-0.016
Token
Token ID247
Feature activation-1.25e-02

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
 Ban-0.001
-0.001
 Dates-0.001
ajor-0.001
Tokenre
Token ID260
Feature activation-6.11e-02

Pos logit contributions
culosis+0.009
 glove+0.009
 concess+0.009
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.009
 Ban-0.008
-0.008
 Dates-0.008
ajor-0.008
Token or
Token ID393
Feature activation-1.31e-03

Pos logit contributions
 Canter+0.142
+0.126
 Ban+0.126
 Dates+0.122
ajor+0.121
Neg logit contributions
culosis-0.145
 glove-0.139
 concess-0.136
 isEnabled-0.133
PLA-0.132
Token poly
Token ID7514
Feature activation-1.38e-03

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
-0.001
 Ban-0.001
 Dates-0.001
ajor-0.001
Tokengon
Token ID14520
Feature activation+5.41e-02

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
-0.001
 Ban-0.001
ajor-0.001
 Dates-0.001
Tokenal
Token ID282
Feature activation+2.26e-01

Pos logit contributions
 Canter+0.026
 Ban+0.022
+0.022
 Dates+0.021
ajor+0.021
Neg logit contributions
culosis-0.053
 glove-0.051
 concess-0.050
 isEnabled-0.050
PLA-0.049
Token crater
Token ID32593
Feature activation+2.76e-01

Pos logit contributions
 Canter+0.164
 Ban+0.146
+0.146
 Dates+0.141
ajor+0.140
Neg logit contributions
culosis-0.168
 glove-0.160
 concess-0.157
 isEnabled-0.154
PLA-0.152
Tokeniform
Token ID6933
Feature activation-4.74e-01

Pos logit contributions
 Canter+0.209
+0.187
 Ban+0.187
 Dates+0.181
ajor+0.180
Neg logit contributions
culosis-0.197
 glove-0.187
 concess-0.183
 isEnabled-0.180
PLA-0.178
Token pap
Token ID20461
Feature activation+1.12e-01

Pos logit contributions
culosis+0.356
 glove+0.340
 concess+0.333
 isEnabled+0.327
PLA+0.323
Neg logit contributions
 Canter-0.341
-0.302
 Ban-0.302
 Dates-0.292
ajor-0.291
Tokenules
Token ID5028
Feature activation+1.26e-03

Pos logit contributions
 Canter+0.083
+0.074
 Ban+0.074
ajor+0.072
 Dates+0.072
Neg logit contributions
culosis-0.080
 glove-0.077
 concess-0.075
 isEnabled-0.074
PLA-0.073
Token developing
Token ID5922
Feature activation-1.11e-02

Pos logit contributions
 Canter+0.001
+0.001
 Ban+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token along
Token ID1863
Feature activation-2.66e-02

Pos logit contributions
culosis+0.009
 glove+0.008
 concess+0.008
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.008
-0.007
 Ban-0.007
 Dates-0.007
ajor-0.007
Token the
Token ID262
Feature activation+1.49e-02

Pos logit contributions
culosis+0.020
 glove+0.019
 concess+0.019
 isEnabled+0.018
PLA+0.018
Neg logit contributions
 Canter-0.019
 Ban-0.017
-0.017
 Dates-0.016
ajor-0.016
Token Two
Token ID4930
Feature activation-4.51e-02

Pos logit contributions
 Canter+0.027
+0.024
 Ban+0.024
 Dates+0.023
ajor+0.023
Neg logit contributions
culosis-0.026
 glove-0.025
 concess-0.024
 isEnabled-0.024
PLA-0.023
Token Mind
Token ID10175
Feature activation-1.44e-01

Pos logit contributions
culosis+0.027
 glove+0.025
 concess+0.025
 isEnabled+0.024
PLA+0.024
Neg logit contributions
 Canter-0.039
-0.036
 Ban-0.036
ajor-0.035
 Dates-0.035
Tokens
Token ID82
Feature activation-6.04e-03

Pos logit contributions
culosis+0.106
 glove+0.101
 concess+0.099
 isEnabled+0.097
PLA+0.096
Neg logit contributions
 Canter-0.106
 Ban-0.094
-0.094
 Dates-0.091
ajor-0.091
Token blog
Token ID4130
Feature activation-1.47e-01

Pos logit contributions
culosis+0.005
 glove+0.004
 concess+0.004
 isEnabled+0.004
PLA+0.004
Neg logit contributions
 Canter-0.004
-0.004
 Ban-0.004
 Dates-0.004
ajor-0.004
Token-
Token ID12
Feature activation-7.60e-03

Pos logit contributions
culosis+0.110
 glove+0.105
 concess+0.102
 isEnabled+0.100
PLA+0.099
Neg logit contributions
 Canter-0.107
-0.095
 Ban-0.095
ajor-0.092
 Dates-0.092
TokenKind
Token ID35854
Feature activation-4.74e-01

Pos logit contributions
culosis+0.006
 glove+0.005
 concess+0.005
 isEnabled+0.005
PLA+0.005
Neg logit contributions
 Canter-0.005
 Ban-0.005
-0.005
 Dates-0.005
ajor-0.005
Tokenle
Token ID293
Feature activation+1.60e-03

Pos logit contributions
culosis+0.335
 glove+0.321
 concess+0.312
 isEnabled+0.310
PLA+0.307
Neg logit contributions
 Canter-0.354
-0.316
 Ban-0.315
ajor-0.310
 Dates-0.306
Token<|endoftext|>
Token ID50256
Feature activation+8.33e-02

Pos logit contributions
 Canter+0.001
 Ban+0.001
+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
TokenA
Token ID32
Feature activation-1.32e-01

Pos logit contributions
 Canter+0.059
 Ban+0.053
+0.053
 Dates+0.051
ajor+0.051
Neg logit contributions
culosis-0.063
 glove-0.060
 concess-0.059
 isEnabled-0.058
PLA-0.057
Token Review
Token ID6602
Feature activation-3.58e-02

Pos logit contributions
culosis+0.098
 glove+0.093
 concess+0.091
 isEnabled+0.089
PLA+0.088
Neg logit contributions
 Canter-0.096
-0.085
 Ban-0.085
 Dates-0.082
ajor-0.082
Token of
Token ID286
Feature activation+3.19e-03

Pos logit contributions
culosis+0.027
 glove+0.026
 concess+0.025
 isEnabled+0.025
PLA+0.025
Neg logit contributions
 Canter-0.026
 Ban-0.023
-0.023
 Dates-0.022
ajor-0.022
Token
Token ID198
Feature activation-1.20e-02

Pos logit contributions
culosis+0.005
 glove+0.005
 concess+0.005
 isEnabled+0.005
PLA+0.005
Neg logit contributions
 Canter-0.005
-0.005
 Ban-0.005
 Dates-0.004
ajor-0.004
TokenAccording
Token ID4821
Feature activation+9.92e-02

Pos logit contributions
culosis+0.009
 glove+0.009
 concess+0.008
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.009
-0.008
 Ban-0.008
 Dates-0.007
ajor-0.007
Token to
Token ID284
Feature activation-8.90e-03

Pos logit contributions
 Canter+0.072
 Ban+0.064
+0.064
 Dates+0.062
ajor+0.061
Neg logit contributions
culosis-0.074
 glove-0.071
 concess-0.070
 isEnabled-0.068
PLA-0.067
Token the
Token ID262
Feature activation-9.83e-03

Pos logit contributions
culosis+0.006
 glove+0.006
 concess+0.006
 isEnabled+0.006
PLA+0.006
Neg logit contributions
 Canter-0.007
 Ban-0.006
-0.006
 Dates-0.006
ajor-0.006
Token Hof
Token ID37745
Feature activation-2.68e-02

Pos logit contributions
culosis+0.007
 glove+0.006
 concess+0.006
 isEnabled+0.006
PLA+0.006
Neg logit contributions
 Canter-0.008
-0.007
 Ban-0.007
 Dates-0.007
ajor-0.007
Tokenstra
Token ID12044
Feature activation-4.72e-01

Pos logit contributions
culosis+0.017
 glove+0.016
 concess+0.016
 isEnabled+0.015
PLA+0.015
Neg logit contributions
 Canter-0.022
 Ban-0.020
-0.020
 Dates-0.020
ajor-0.019
Token University
Token ID2059
Feature activation-8.16e-02

Pos logit contributions
culosis+0.340
 glove+0.324
 concess+0.316
 isEnabled+0.310
PLA+0.307
Neg logit contributions
 Canter-0.354
-0.316
 Ban-0.315
 Dates-0.305
ajor-0.305
Token website
Token ID3052
Feature activation-2.28e-01

Pos logit contributions
culosis+0.062
 glove+0.059
 concess+0.058
 isEnabled+0.057
PLA+0.056
Neg logit contributions
 Canter-0.058
-0.051
 Ban-0.051
ajor-0.050
 Dates-0.050
Token,
Token ID11
Feature activation-2.63e-03

Pos logit contributions
culosis+0.169
 glove+0.162
 concess+0.158
 isEnabled+0.155
PLA+0.154
Neg logit contributions
 Canter-0.166
-0.147
 Ban-0.147
 Dates-0.142
ajor-0.142
Token security
Token ID2324
Feature activation+6.64e-02

Pos logit contributions
culosis+0.002
 glove+0.002
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.002
 Ban-0.002
-0.002
 Dates-0.002
ajor-0.002
Token precautions
Token ID31320
Feature activation+5.06e-02

Pos logit contributions
 Canter+0.050
 Ban+0.045
+0.045
 Dates+0.044
ajor+0.044
Neg logit contributions
culosis-0.047
 glove-0.045
 concess-0.044
 isEnabled-0.043
PLA-0.042
Token do
Token ID466
Feature activation+5.49e-02

Pos logit contributions
 Canter+0.005
 Ban+0.004
+0.004
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.005
 glove-0.005
 concess-0.004
 isEnabled-0.004
PLA-0.004
Token you
Token ID345
Feature activation-1.13e-02

Pos logit contributions
 Canter+0.043
+0.039
 Ban+0.039
 Dates+0.038
ajor+0.038
Neg logit contributions
culosis-0.037
 glove-0.036
 concess-0.035
 isEnabled-0.034
PLA-0.034
Token expect
Token ID1607
Feature activation-1.68e-02

Pos logit contributions
culosis+0.008
 glove+0.008
 concess+0.008
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.008
-0.007
 Ban-0.007
 Dates-0.007
ajor-0.007
Token using
Token ID1262
Feature activation-1.24e-01

Pos logit contributions
culosis+0.012
 glove+0.012
 concess+0.011
 isEnabled+0.011
PLA+0.011
Neg logit contributions
 Canter-0.013
 Ban-0.011
-0.011
 Dates-0.011
ajor-0.011
Token paint
Token ID7521
Feature activation-9.23e-02

Pos logit contributions
culosis+0.091
 glove+0.087
 concess+0.085
 isEnabled+0.084
PLA+0.083
Neg logit contributions
 Canter-0.092
-0.082
 Ban-0.081
 Dates-0.079
ajor-0.079
TokenKind
Token ID35854
Feature activation-4.68e-01

Pos logit contributions
culosis+0.068
 glove+0.064
 concess+0.063
 isEnabled+0.062
PLA+0.061
Neg logit contributions
 Canter-0.068
 Ban-0.060
-0.060
 Dates-0.058
ajor-0.058
Token of
Token ID286
Feature activation+1.05e-03

Pos logit contributions
culosis+0.344
 glove+0.328
 concess+0.321
 isEnabled+0.316
PLA+0.312
Neg logit contributions
 Canter-0.344
-0.306
 Ban-0.306
 Dates-0.295
ajor-0.295
Token hard
Token ID1327
Feature activation-3.57e-02

Pos logit contributions
 Canter+0.001
 Ban+0.001
+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token to
Token ID284
Feature activation+2.43e-03

Pos logit contributions
culosis+0.026
 glove+0.025
 concess+0.024
 isEnabled+0.024
PLA+0.023
Neg logit contributions
 Canter-0.027
-0.024
 Ban-0.024
 Dates-0.023
ajor-0.023
Token do
Token ID466
Feature activation+5.44e-02

Pos logit contributions
 Canter+0.002
+0.002
 Ban+0.002
 Dates+0.002
ajor+0.002
Neg logit contributions
culosis-0.002
 glove-0.002
 concess-0.002
 isEnabled-0.002
PLA-0.002
Token a
Token ID257
Feature activation-1.13e-02

Pos logit contributions
 Canter+0.040
+0.036
 Ban+0.035
 Dates+0.034
ajor+0.034
Neg logit contributions
culosis-0.040
 glove-0.038
 concess-0.037
 isEnabled-0.037
PLA-0.036
Token last
Token ID938
Feature activation+2.99e-02

Pos logit contributions
culosis+0.004
 glove+0.004
 concess+0.004
 isEnabled+0.004
PLA+0.004
Neg logit contributions
 Canter-0.004
 Ban-0.004
-0.004
 Dates-0.003
ajor-0.003
Token week
Token ID1285
Feature activation+4.93e-02

Pos logit contributions
 Canter+0.020
 Ban+0.018
+0.018
 Dates+0.017
ajor+0.017
Neg logit contributions
culosis-0.024
 glove-0.023
 concess-0.022
 isEnabled-0.022
PLA-0.022
Token
Token ID447
Feature activation+1.75e-02

Pos logit contributions
 Canter+0.038
+0.034
 Ban+0.034
 Dates+0.033
ajor+0.033
Neg logit contributions
culosis-0.034
 glove-0.032
 concess-0.032
 isEnabled-0.031
PLA-0.031
Token
Token ID247
Feature activation+1.97e-03

Pos logit contributions
 Canter+0.013
 Ban+0.012
+0.012
ajor+0.011
 Dates+0.011
Neg logit contributions
culosis-0.013
 glove-0.012
 concess-0.012
 isEnabled-0.012
PLA-0.011
Tokens
Token ID82
Feature activation+8.53e-03

Pos logit contributions
 Canter+0.001
+0.001
 Ban+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token episode
Token ID4471
Feature activation+2.73e-01

Pos logit contributions
 Canter+0.006
+0.005
 Ban+0.005
 Dates+0.005
ajor+0.005
Neg logit contributions
culosis-0.007
 glove-0.007
 concess-0.007
 isEnabled-0.006
PLA-0.006
Token,
Token ID11
Feature activation+8.92e-03

Pos logit contributions
 Canter+0.195
 Ban+0.173
+0.173
 Dates+0.167
ajor+0.166
Neg logit contributions
culosis-0.207
 glove-0.197
 concess-0.193
 isEnabled-0.190
PLA-0.187
Token Ann
Token ID5506
Feature activation+5.21e-02

Pos logit contributions
 Canter+0.006
+0.006
 Ban+0.006
ajor+0.006
 Dates+0.006
Neg logit contributions
culosis-0.007
 glove-0.006
 concess-0.006
 isEnabled-0.006
PLA-0.006
Tokenal
Token ID282
Feature activation+2.22e-01

Pos logit contributions
 Canter+0.026
+0.021
 Ban+0.021
 Dates+0.020
ajor+0.020
Neg logit contributions
culosis-0.051
 glove-0.049
 concess-0.048
 isEnabled-0.048
PLA-0.047
Tokenise
Token ID786
Feature activation+1.63e-02

Pos logit contributions
 Canter+0.163
+0.146
 Ban+0.145
 Dates+0.141
ajor+0.140
Neg logit contributions
culosis-0.163
 glove-0.155
 concess-0.152
 isEnabled-0.149
PLA-0.147
Token is
Token ID318
Feature activation-1.24e-02

Pos logit contributions
 Canter+0.012
 Ban+0.011
+0.011
 Dates+0.010
ajor+0.010
Neg logit contributions
culosis-0.012
 glove-0.011
 concess-0.011
 isEnabled-0.011
PLA-0.011
Token utilize
Token ID17624
Feature activation-5.95e-02

Pos logit contributions
culosis+0.002
 glove+0.002
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.002
-0.002
 Ban-0.002
 Dates-0.002
ajor-0.002
Token easily
Token ID3538
Feature activation-6.69e-02

Pos logit contributions
culosis+0.043
 glove+0.042
 concess+0.041
 isEnabled+0.040
PLA+0.039
Neg logit contributions
 Canter-0.044
-0.039
 Ban-0.039
 Dates-0.038
ajor-0.038
Token with
Token ID351
Feature activation+1.49e-02

Pos logit contributions
culosis+0.050
 glove+0.048
 concess+0.047
 isEnabled+0.046
PLA+0.045
Neg logit contributions
 Canter-0.048
-0.043
 Ban-0.043
 Dates-0.041
ajor-0.041
Token excellent
Token ID6275
Feature activation+4.80e-02

Pos logit contributions
 Canter+0.011
+0.010
 Ban+0.010
 Dates+0.009
ajor+0.009
Neg logit contributions
culosis-0.011
 glove-0.010
 concess-0.010
 isEnabled-0.010
PLA-0.010
Token Arduino
Token ID27634
Feature activation-9.39e-02

Pos logit contributions
 Canter+0.033
 Ban+0.029
+0.029
 Dates+0.028
ajor+0.028
Neg logit contributions
culosis-0.038
 glove-0.036
 concess-0.035
 isEnabled-0.035
PLA-0.034
Token compatibility
Token ID17764
Feature activation+2.51e-01

Pos logit contributions
culosis+0.069
 glove+0.065
 concess+0.064
 isEnabled+0.063
PLA+0.062
Neg logit contributions
 Canter-0.069
 Ban-0.062
-0.062
 Dates-0.059
ajor-0.059
Token.
Token ID13
Feature activation+6.96e-03

Pos logit contributions
 Canter+0.181
+0.160
 Ban+0.160
 Dates+0.155
ajor+0.154
Neg logit contributions
culosis-0.188
 glove-0.179
 concess-0.176
 isEnabled-0.172
PLA-0.170
Token I
Token ID314
Feature activation+1.27e-02

Pos logit contributions
 Canter+0.005
+0.004
 Ban+0.004
ajor+0.004
 Dates+0.004
Neg logit contributions
culosis-0.005
 glove-0.005
 concess-0.005
 isEnabled-0.005
PLA-0.005
Token
Token ID447
Feature activation+1.64e-02

Pos logit contributions
 Canter+0.009
 Ban+0.008
+0.008
 Dates+0.008
ajor+0.008
Neg logit contributions
culosis-0.010
 glove-0.009
 concess-0.009
 isEnabled-0.009
PLA-0.009
Token
Token ID247
Feature activation+3.70e-04

Pos logit contributions
 Canter+0.012
 Ban+0.011
+0.011
 Dates+0.011
ajor+0.010
Neg logit contributions
culosis-0.012
 glove-0.011
 concess-0.011
 isEnabled-0.011
PLA-0.011
Tokenm
Token ID76
Feature activation-6.47e-02

Pos logit contributions
 Canter+0.000
 Ban+0.000
+0.000
 Dates+0.000
ajor+0.000
Neg logit contributions
culosis-0.000
 glove-0.000
 concess-0.000
 isEnabled-0.000
PLA-0.000
Token of
Token ID286
Feature activation+1.14e-02

Pos logit contributions
culosis+0.230
 glove+0.219
 concess+0.215
 isEnabled+0.211
PLA+0.208
Neg logit contributions
 Canter-0.222
-0.197
 Ban-0.197
 Dates-0.190
ajor-0.190
Token toxic
Token ID11422
Feature activation+8.19e-02

Pos logit contributions
 Canter+0.009
 Ban+0.008
+0.008
 Dates+0.008
ajor+0.008
Neg logit contributions
culosis-0.008
 glove-0.007
 concess-0.007
 isEnabled-0.007
PLA-0.007
Token al
Token ID435
Feature activation-8.80e-02

Pos logit contributions
 Canter+0.072
 Ban+0.066
+0.066
 Dates+0.064
ajor+0.063
Neg logit contributions
culosis-0.048
 glove-0.045
 concess-0.044
 isEnabled-0.043
PLA-0.042
Tokengal
Token ID13528
Feature activation-1.26e-01

Pos logit contributions
culosis+0.054
 glove+0.051
 concess+0.050
 isEnabled+0.049
PLA+0.048
Neg logit contributions
 Canter-0.075
 Ban-0.068
-0.068
 Dates-0.066
ajor-0.066
Token blo
Token ID24924
Feature activation+5.67e-02

Pos logit contributions
culosis+0.105
 glove+0.100
 concess+0.099
 isEnabled+0.097
PLA+0.096
Neg logit contributions
 Canter-0.081
 Ban-0.071
-0.071
 Dates-0.068
ajor-0.068
Tokenoms
Token ID3150
Feature activation+3.17e-01

Pos logit contributions
 Canter+0.020
+0.016
 Ban+0.016
 Dates+0.015
ajor+0.015
Neg logit contributions
culosis-0.063
 glove-0.061
 concess-0.060
 isEnabled-0.059
PLA-0.059
Token directly
Token ID3264
Feature activation+5.39e-02

Pos logit contributions
 Canter+0.229
+0.204
 Ban+0.204
 Dates+0.196
ajor+0.196
Neg logit contributions
culosis-0.237
 glove-0.225
 concess-0.221
 isEnabled-0.217
PLA-0.214
Token to
Token ID284
Feature activation+3.06e-03

Pos logit contributions
 Canter+0.039
+0.035
 Ban+0.035
 Dates+0.034
ajor+0.034
Neg logit contributions
culosis-0.040
 glove-0.038
 concess-0.037
 isEnabled-0.036
PLA-0.036
Token ocean
Token ID9151
Feature activation-2.48e-01

Pos logit contributions
 Canter+0.002
 Ban+0.002
+0.002
 Dates+0.002
ajor+0.002
Neg logit contributions
culosis-0.002
 glove-0.002
 concess-0.002
 isEnabled-0.002
PLA-0.002
Token warming
Token ID9917
Feature activation-1.97e-01

Pos logit contributions
culosis+0.130
 glove+0.121
 concess+0.118
 isEnabled+0.116
PLA+0.113
Neg logit contributions
 Canter-0.232
-0.214
 Ban-0.214
 Dates-0.208
ajor-0.207
Token and
Token ID290
Feature activation+1.92e-04

Pos logit contributions
culosis+0.147
 glove+0.140
 concess+0.137
 isEnabled+0.135
PLA+0.133
Neg logit contributions
 Canter-0.142
-0.127
 Ban-0.127
 Dates-0.122
ajor-0.122
Token J
Token ID449
Feature activation-2.18e-02

Pos logit contributions
 Canter+0.053
 Ban+0.047
+0.047
 Dates+0.045
ajor+0.045
Neg logit contributions
culosis-0.052
 glove-0.050
 concess-0.049
 isEnabled-0.048
PLA-0.047
Tokeny
Token ID88
Feature activation+1.00e-01

Pos logit contributions
culosis+0.020
 glove+0.019
 concess+0.018
 isEnabled+0.018
PLA+0.018
Neg logit contributions
 Canter-0.012
-0.011
 Ban-0.011
 Dates-0.010
ajor-0.010
Tokenot
Token ID313
Feature activation-4.40e-02

Pos logit contributions
 Canter+0.078
 Ban+0.070
+0.070
ajor+0.068
 Dates+0.068
Neg logit contributions
culosis-0.069
 glove-0.066
 concess-0.064
 isEnabled-0.063
PLA-0.062
Tokenirling
Token ID24297
Feature activation+1.43e-01

Pos logit contributions
culosis+0.040
 glove+0.038
 concess+0.038
 isEnabled+0.037
PLA+0.037
Neg logit contributions
 Canter-0.025
-0.021
 Ban-0.021
 Dates-0.020
ajor-0.020
Tokena
Token ID64
Feature activation-1.03e-01

Pos logit contributions
 Canter+0.123
 Ban+0.112
+0.111
 Dates+0.108
ajor+0.108
Neg logit contributions
culosis-0.087
 glove-0.082
 concess-0.080
 isEnabled-0.078
PLA-0.077
Token Temple
Token ID10857
Feature activation+3.01e-01

Pos logit contributions
culosis+0.112
 glove+0.109
 concess+0.107
 isEnabled+0.106
PLA+0.105
Neg logit contributions
 Canter-0.039
-0.030
 Ban-0.030
ajor-0.028
 Dates-0.028
Token
Token ID198
Feature activation-2.35e-03

Pos logit contributions
 Canter+0.254
 Ban+0.231
+0.228
 Dates+0.224
ajor+0.221
Neg logit contributions
culosis-0.191
 glove-0.180
 concess-0.177
 isEnabled-0.171
PLA-0.169
Token
Token ID198
Feature activation-4.18e-03

Pos logit contributions
culosis+0.002
 glove+0.002
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.002
 Ban-0.002
-0.002
 Dates-0.001
ajor-0.001
Token9
Token ID24
Feature activation-1.33e-02

Pos logit contributions
culosis+0.003
 glove+0.003
 concess+0.003
 isEnabled+0.003
PLA+0.003
Neg logit contributions
 Canter-0.003
-0.003
 Ban-0.003
 Dates-0.003
ajor-0.003
Token.
Token ID13
Feature activation+8.62e-03

Pos logit contributions
culosis+0.010
 glove+0.010
 concess+0.009
 isEnabled+0.009
PLA+0.009
Neg logit contributions
 Canter-0.009
-0.008
 Ban-0.008
 Dates-0.008
ajor-0.008
Token Sw
Token ID2451
Feature activation+1.19e-01

Pos logit contributions
 Canter+0.006
 Ban+0.006
+0.006
 Dates+0.006
ajor+0.006
Neg logit contributions
culosis-0.006
 glove-0.006
 concess-0.006
 isEnabled-0.006
PLA-0.006
Token World
Token ID2159
Feature activation-5.33e-02

Pos logit contributions
culosis+0.068
 glove+0.065
 concess+0.064
 isEnabled+0.063
PLA+0.062
Neg logit contributions
 Canter-0.069
 Ban-0.062
-0.062
 Dates-0.059
ajor-0.059
Token War
Token ID1810
Feature activation+3.96e-03

Pos logit contributions
culosis+0.042
 glove+0.040
 concess+0.040
 isEnabled+0.039
PLA+0.039
Neg logit contributions
 Canter-0.036
 Ban-0.032
-0.032
 Dates-0.031
ajor-0.031
Token II
Token ID2873
Feature activation+2.04e-02

Pos logit contributions
 Canter+0.003
 Ban+0.002
+0.002
 Dates+0.002
ajor+0.002
Neg logit contributions
culosis-0.003
 glove-0.003
 concess-0.003
 isEnabled-0.003
PLA-0.003
Token.
Token ID13
Feature activation-6.11e-04

Pos logit contributions
 Canter+0.015
+0.013
 Ban+0.013
 Dates+0.013
ajor+0.013
Neg logit contributions
culosis-0.015
 glove-0.015
 concess-0.014
 isEnabled-0.014
PLA-0.014
Token Nearly
Token ID21233
Feature activation+3.20e-02

Pos logit contributions
culosis+0.000
 glove+0.000
 concess+0.000
 isEnabled+0.000
PLA+0.000
Neg logit contributions
 Canter-0.000
 Ban-0.000
-0.000
 Dates-0.000
ajor-0.000
Token 400
Token ID7337
Feature activation+3.13e-01

Pos logit contributions
 Canter+0.022
+0.019
 Ban+0.019
 Dates+0.018
ajor+0.018
Neg logit contributions
culosis-0.025
 glove-0.024
 concess-0.024
 isEnabled-0.023
PLA-0.023
Token,
Token ID11
Feature activation-1.24e-03

Pos logit contributions
 Canter+0.223
 Ban+0.198
+0.198
 Dates+0.192
ajor+0.191
Neg logit contributions
culosis-0.236
 glove-0.225
 concess-0.221
 isEnabled-0.217
PLA-0.214
Token000
Token ID830
Feature activation+1.67e-01

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
-0.001
 Ban-0.001
 Dates-0.001
ajor-0.000
Token others
Token ID1854
Feature activation+4.27e-02

Pos logit contributions
 Canter+0.118
 Ban+0.105
+0.104
 Dates+0.101
ajor+0.101
Neg logit contributions
culosis-0.128
 glove-0.122
 concess-0.119
 isEnabled-0.117
PLA-0.116
Token have
Token ID423
Feature activation-1.11e-02

Pos logit contributions
 Canter+0.032
+0.028
 Ban+0.028
 Dates+0.027
ajor+0.027
Neg logit contributions
culosis-0.031
 glove-0.030
 concess-0.029
 isEnabled-0.029
PLA-0.028
Token committed
Token ID5364
Feature activation-4.58e-02

Pos logit contributions
culosis+0.008
 glove+0.008
 concess+0.008
 isEnabled+0.008
PLA+0.007
Neg logit contributions
 Canter-0.008
-0.007
 Ban-0.007
 Dates-0.007
ajor-0.007
Token on
Token ID319
Feature activation+1.50e-03

Pos logit contributions
 Canter+0.073
+0.065
 Ban+0.064
 Dates+0.063
ajor+0.063
Neg logit contributions
culosis-0.071
 glove-0.068
 concess-0.067
 isEnabled-0.065
PLA-0.065
Token employees
Token ID4409
Feature activation+7.03e-02

Pos logit contributions
 Canter+0.001
+0.001
 Ban+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token,
Token ID11
Feature activation-1.36e-02

Pos logit contributions
 Canter+0.052
+0.046
 Ban+0.046
ajor+0.045
 Dates+0.044
Neg logit contributions
culosis-0.051
 glove-0.049
 concess-0.048
 isEnabled-0.047
PLA-0.046
Token suppliers
Token ID20499
Feature activation+3.27e-02

Pos logit contributions
culosis+0.010
 glove+0.010
 concess+0.010
 isEnabled+0.009
PLA+0.009
Neg logit contributions
 Canter-0.010
-0.009
 Ban-0.009
 Dates-0.008
ajor-0.008
Token and
Token ID290
Feature activation-2.59e-02

Pos logit contributions
 Canter+0.025
+0.022
 Ban+0.022
 Dates+0.022
ajor+0.021
Neg logit contributions
culosis-0.023
 glove-0.022
 concess-0.022
 isEnabled-0.021
PLA-0.021
Token landlords
Token ID30658
Feature activation+3.75e-02

Pos logit contributions
culosis+0.021
 glove+0.020
 concess+0.019
 isEnabled+0.019
PLA+0.019
Neg logit contributions
 Canter-0.017
-0.015
 Ban-0.015
 Dates-0.015
ajor-0.015
Token affected
Token ID5676
Feature activation+6.46e-02

Pos logit contributions
 Canter+0.027
+0.024
 Ban+0.024
ajor+0.024
 Dates+0.024
Neg logit contributions
culosis-0.027
 glove-0.026
 concess-0.026
 isEnabled-0.025
PLA-0.025
Token by
Token ID416
Feature activation-1.19e-02

Pos logit contributions
 Canter+0.049
+0.044
 Ban+0.043
 Dates+0.042
ajor+0.042
Neg logit contributions
culosis-0.046
 glove-0.044
 concess-0.043
 isEnabled-0.042
PLA-0.041
Token the
Token ID262
Feature activation-8.03e-03

Pos logit contributions
culosis+0.009
 glove+0.008
 concess+0.008
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.009
-0.008
 Ban-0.008
 Dates-0.007
ajor-0.007
Token closure
Token ID16512
Feature activation+6.96e-02

Pos logit contributions
culosis+0.006
 glove+0.006
 concess+0.006
 isEnabled+0.006
PLA+0.006
Neg logit contributions
 Canter-0.005
-0.005
 Ban-0.005
 Dates-0.004
ajor-0.004
Token of
Token ID286
Feature activation-2.35e-03

Pos logit contributions
 Canter+0.051
+0.046
 Ban+0.045
ajor+0.044
 Dates+0.044
Neg logit contributions
culosis-0.050
 glove-0.049
 concess-0.048
 isEnabled-0.046
PLA-0.046
Token apt
Token ID15409
Feature activation+4.12e-02

Pos logit contributions
culosis+0.006
 glove+0.006
 concess+0.006
 isEnabled+0.006
PLA+0.006
Neg logit contributions
 Canter-0.006
 Ban-0.005
-0.005
 Dates-0.005
ajor-0.005
Tokenitudes
Token ID10455
Feature activation-1.03e-01

Pos logit contributions
 Canter+0.032
 Ban+0.028
+0.028
 Dates+0.028
ajor+0.028
Neg logit contributions
culosis-0.029
 glove-0.027
 concess-0.027
 isEnabled-0.026
PLA-0.026
Token past
Token ID1613
Feature activation+1.82e-01

Pos logit contributions
culosis+0.076
 glove+0.072
 concess+0.071
 isEnabled+0.069
PLA+0.069
Neg logit contributions
 Canter-0.075
 Ban-0.067
-0.067
 Dates-0.065
ajor-0.064
Token a
Token ID257
Feature activation-1.45e-02

Pos logit contributions
 Canter+0.132
+0.117
 Ban+0.117
ajor+0.113
 Dates+0.113
Neg logit contributions
culosis-0.136
 glove-0.130
 concess-0.127
 isEnabled-0.124
PLA-0.123
Token certain
Token ID1728
Feature activation-3.19e-02

Pos logit contributions
culosis+0.011
 glove+0.010
 concess+0.010
 isEnabled+0.010
PLA+0.010
Neg logit contributions
 Canter-0.011
-0.009
 Ban-0.009
 Dates-0.009
ajor-0.009
Token age
Token ID2479
Feature activation-4.12e-05

Pos logit contributions
culosis+0.022
 glove+0.021
 concess+0.020
 isEnabled+0.020
PLA+0.020
Neg logit contributions
 Canter-0.025
 Ban-0.022
-0.022
 Dates-0.022
ajor-0.022
Token,
Token ID11
Feature activation-5.18e-03

Pos logit contributions
culosis+0.000
 glove+0.000
 concess+0.000
 isEnabled+0.000
PLA+0.000
Neg logit contributions
 Canter-0.000
-0.000
 Ban-0.000
 Dates-0.000
ajor-0.000
Token and
Token ID290
Feature activation-1.04e-02

Pos logit contributions
culosis+0.004
 glove+0.004
 concess+0.004
 isEnabled+0.003
PLA+0.003
Neg logit contributions
 Canter-0.004
-0.003
 Ban-0.003
 Dates-0.003
ajor-0.003
Token when
Token ID618
Feature activation-6.72e-02

Pos logit contributions
culosis+0.008
 glove+0.007
 concess+0.007
 isEnabled+0.007
PLA+0.007
Neg logit contributions
 Canter-0.008
-0.007
 Ban-0.007
 Dates-0.007
ajor-0.007
Token an
Token ID281
Feature activation+4.25e-02

Pos logit contributions
culosis+0.050
 glove+0.048
 concess+0.046
 isEnabled+0.046
PLA+0.045
Neg logit contributions
 Canter-0.049
-0.044
 Ban-0.043
ajor-0.042
 Dates-0.042
Token older
Token ID4697
Feature activation-1.22e-02

Pos logit contributions
 Canter+0.030
+0.026
 Ban+0.026
 Dates+0.026
ajor+0.026
Neg logit contributions
culosis-0.032
 glove-0.031
 concess-0.030
 isEnabled-0.030
PLA-0.029
Token
Token ID198
Feature activation-4.05e-03

Pos logit contributions
culosis+0.002
 glove+0.002
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.002
 Ban-0.002
-0.002
 Dates-0.002
ajor-0.002
TokenThe
Token ID464
Feature activation+1.70e-02

Pos logit contributions
culosis+0.003
 glove+0.003
 concess+0.003
 isEnabled+0.003
PLA+0.003
Neg logit contributions
 Canter-0.003
-0.003
 Ban-0.003
 Dates-0.003
ajor-0.003
Token fourth
Token ID5544
Feature activation+9.47e-02

Pos logit contributions
 Canter+0.012
+0.011
 Ban+0.011
 Dates+0.011
ajor+0.011
Neg logit contributions
culosis-0.013
 glove-0.012
 concess-0.012
 isEnabled-0.011
PLA-0.011
Token big
Token ID1263
Feature activation-1.10e-01

Pos logit contributions
 Canter+0.067
 Ban+0.060
+0.059
 Dates+0.058
ajor+0.057
Neg logit contributions
culosis-0.072
 glove-0.069
 concess-0.068
 isEnabled-0.066
PLA-0.066
Token Project
Token ID4935
Feature activation-4.08e-02

Pos logit contributions
culosis+0.084
 glove+0.080
 concess+0.079
 isEnabled+0.077
PLA+0.076
Neg logit contributions
 Canter-0.078
-0.069
 Ban-0.069
 Dates-0.067
ajor-0.067
Token DI
Token ID14766
Feature activation+1.52e-01

Pos logit contributions
culosis+0.037
 glove+0.036
 concess+0.035
 isEnabled+0.035
PLA+0.034
Neg logit contributions
 Canter-0.023
 Ban-0.020
-0.020
 Dates-0.019
ajor-0.019
Tokenva
Token ID6862
Feature activation+1.95e-01

Pos logit contributions
 Canter+0.106
+0.093
 Ban+0.093
 Dates+0.090
ajor+0.090
Neg logit contributions
culosis-0.118
 glove-0.113
 concess-0.111
 isEnabled-0.109
PLA-0.108
Token X
Token ID1395
Feature activation-4.02e-02

Pos logit contributions
 Canter+0.141
+0.126
 Ban+0.125
ajor+0.121
 Dates+0.121
Neg logit contributions
culosis-0.145
 glove-0.139
 concess-0.136
 isEnabled-0.133
PLA-0.132
Token update
Token ID4296
Feature activation+1.78e-01

Pos logit contributions
culosis+0.031
 glove+0.030
 concess+0.029
 isEnabled+0.029
PLA+0.028
Neg logit contributions
 Canter-0.028
-0.025
 Ban-0.024
ajor-0.024
 Dates-0.024
Token is
Token ID318
Feature activation-1.39e-02

Pos logit contributions
 Canter+0.132
+0.118
 Ban+0.117
ajor+0.114
 Dates+0.113
Neg logit contributions
culosis-0.128
 glove-0.123
 concess-0.120
 isEnabled-0.118
PLA-0.116
Token in
Token ID287
Feature activation-1.58e-02

Pos logit contributions
culosis+0.011
 glove+0.010
 concess+0.010
 isEnabled+0.010
PLA+0.010
Neg logit contributions
 Canter-0.010
 Ban-0.009
-0.009
 Dates-0.008
ajor-0.008
Token unveiling
Token ID41438
Feature activation-1.60e-01

Pos logit contributions
 Canter+0.000
 Ban+0.000
+0.000
 Dates+0.000
ajor+0.000
Neg logit contributions
culosis-0.000
 glove-0.000
 concess-0.000
 isEnabled-0.000
PLA-0.000
Token (
Token ID357
Feature activation+1.32e-02

Pos logit contributions
culosis+0.121
 glove+0.115
 concess+0.113
 isEnabled+0.111
PLA+0.109
Neg logit contributions
 Canter-0.115
 Ban-0.102
-0.102
 Dates-0.099
ajor-0.098
Tokenof
Token ID1659
Feature activation-1.86e-02

Pos logit contributions
 Canter+0.009
+0.008
 Ban+0.008
 Dates+0.007
ajor+0.007
Neg logit contributions
culosis-0.010
 glove-0.010
 concess-0.010
 isEnabled-0.010
PLA-0.010
Token sorts
Token ID10524
Feature activation-1.16e-02

Pos logit contributions
culosis+0.014
 glove+0.013
 concess+0.013
 isEnabled+0.012
PLA+0.012
Neg logit contributions
 Canter-0.014
-0.012
 Ban-0.012
 Dates-0.012
ajor-0.012
Token)
Token ID8
Feature activation+3.86e-02

Pos logit contributions
culosis+0.009
 glove+0.008
 concess+0.008
 isEnabled+0.008
PLA+0.008
Neg logit contributions
 Canter-0.008
-0.008
 Ban-0.008
 Dates-0.007
ajor-0.007
Token of
Token ID286
Feature activation+3.01e-03

Pos logit contributions
 Canter+0.028
+0.025
 Ban+0.025
 Dates+0.024
ajor+0.024
Neg logit contributions
culosis-0.029
 glove-0.027
 concess-0.027
 isEnabled-0.026
PLA-0.026
Token Destiny
Token ID17886
Feature activation+5.27e-03

Pos logit contributions
 Canter+0.002
+0.002
 Ban+0.002
 Dates+0.002
ajor+0.002
Neg logit contributions
culosis-0.002
 glove-0.002
 concess-0.002
 isEnabled-0.002
PLA-0.002
Token via
Token ID2884
Feature activation+9.45e-04

Pos logit contributions
 Canter+0.004
 Ban+0.003
+0.003
 Dates+0.003
ajor+0.003
Neg logit contributions
culosis-0.004
 glove-0.004
 concess-0.004
 isEnabled-0.004
PLA-0.004
Token its
Token ID663
Feature activation+2.07e-02

Pos logit contributions
 Canter+0.001
+0.001
 Ban+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token official
Token ID1743
Feature activation-1.22e-01

Pos logit contributions
 Canter+0.016
+0.014
 Ban+0.014
 Dates+0.014
ajor+0.014
Neg logit contributions
culosis-0.015
 glove-0.014
 concess-0.014
 isEnabled-0.013
PLA-0.013
Token web
Token ID3992
Feature activation-8.04e-02

Pos logit contributions
culosis+0.088
 glove+0.084
 concess+0.083
 isEnabled+0.081
PLA+0.080
Neg logit contributions
 Canter-0.091
-0.081
 Ban-0.081
 Dates-0.079
ajor-0.078
Token
Token ID246
Feature activation-4.22e-02

Pos logit contributions
culosis+0.033
 glove+0.031
 concess+0.031
 isEnabled+0.030
PLA+0.030
Neg logit contributions
 Canter-0.021
-0.018
 Ban-0.018
 Dates-0.017
ajor-0.017
TokenDream
Token ID30571
Feature activation-1.92e-02

Pos logit contributions
culosis+0.032
 glove+0.030
 concess+0.030
 isEnabled+0.029
PLA+0.029
Neg logit contributions
 Canter-0.030
-0.027
 Ban-0.027
 Dates-0.026
ajor-0.026
Tokeners
Token ID364
Feature activation-5.75e-02

Pos logit contributions
culosis+0.013
 glove+0.012
 concess+0.012
 isEnabled+0.011
PLA+0.011
Neg logit contributions
 Canter-0.015
-0.014
 Ban-0.014
ajor-0.014
 Dates-0.014
Token
Token ID447
Feature activation+3.85e-03

Pos logit contributions
culosis+0.043
 glove+0.041
 concess+0.041
 isEnabled+0.040
PLA+0.039
Neg logit contributions
 Canter-0.041
-0.036
 Ban-0.036
 Dates-0.035
ajor-0.035
Token
Token ID247
Feature activation-5.13e-03

Pos logit contributions
 Canter+0.003
 Ban+0.002
+0.002
 Dates+0.002
ajor+0.002
Neg logit contributions
culosis-0.003
 glove-0.003
 concess-0.003
 isEnabled-0.003
PLA-0.003
Token from
Token ID422
Feature activation-2.33e-02

Pos logit contributions
culosis+0.004
 glove+0.004
 concess+0.004
 isEnabled+0.003
PLA+0.003
Neg logit contributions
 Canter-0.004
-0.003
 Ban-0.003
 Dates-0.003
ajor-0.003
Token deportation
Token ID24031
Feature activation-2.11e-02

Pos logit contributions
culosis+0.017
 glove+0.016
 concess+0.015
 isEnabled+0.015
PLA+0.015
Neg logit contributions
 Canter-0.018
-0.016
 Ban-0.016
 Dates-0.015
ajor-0.015
Token,
Token ID11
Feature activation-1.52e-03

Pos logit contributions
culosis+0.016
 glove+0.015
 concess+0.015
 isEnabled+0.015
PLA+0.014
Neg logit contributions
 Canter-0.015
 Ban-0.013
-0.013
 Dates-0.013
ajor-0.013
Token there
Token ID612
Feature activation-2.58e-02

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
 Ban-0.001
-0.001
 Dates-0.001
ajor-0.001
Token will
Token ID481
Feature activation-1.14e-03

Pos logit contributions
culosis+0.018
 glove+0.017
 concess+0.017
 isEnabled+0.016
PLA+0.016
Neg logit contributions
 Canter-0.020
-0.018
 Ban-0.018
 Dates-0.017
ajor-0.017
Token be
Token ID307
Feature activation-1.76e-02

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
-0.001
 Ban-0.001
 Dates-0.001
ajor-0.001
Token everyone
Token ID2506
Feature activation+6.36e-02

Pos logit contributions
 Canter+0.015
+0.013
 Ban+0.013
 Dates+0.013
ajor+0.013
Neg logit contributions
culosis-0.015
 glove-0.014
 concess-0.014
 isEnabled-0.013
PLA-0.013
Token does
Token ID857
Feature activation+2.57e-02

Pos logit contributions
 Canter+0.049
+0.044
 Ban+0.044
 Dates+0.042
ajor+0.042
Neg logit contributions
culosis-0.045
 glove-0.042
 concess-0.041
 isEnabled-0.041
PLA-0.040
Token it
Token ID340
Feature activation-1.81e-04

Pos logit contributions
 Canter+0.019
+0.017
 Ban+0.017
 Dates+0.016
ajor+0.016
Neg logit contributions
culosis-0.019
 glove-0.018
 concess-0.018
 isEnabled-0.017
PLA-0.017
Token?
Token ID30
Feature activation-2.07e-02

Pos logit contributions
culosis+0.000
 glove+0.000
 concess+0.000
 isEnabled+0.000
PLA+0.000
Neg logit contributions
 Canter-0.000
-0.000
 Ban-0.000
 Dates-0.000
ajor-0.000
Token Hillary
Token ID5073
Feature activation-5.81e-02

Pos logit contributions
culosis+0.016
 glove+0.015
 concess+0.015
 isEnabled+0.015
PLA+0.014
Neg logit contributions
 Canter-0.015
-0.013
 Ban-0.013
 Dates-0.012
ajor-0.012
Token Clinton
Token ID2605
Feature activation-4.12e-02

Pos logit contributions
culosis+0.041
 glove+0.039
 concess+0.038
 isEnabled+0.038
PLA+0.037
Neg logit contributions
 Canter-0.044
 Ban-0.040
-0.040
 Dates-0.038
ajor-0.038
Token isn
Token ID2125
Feature activation+1.44e-02

Pos logit contributions
culosis+0.030
 glove+0.029
 concess+0.028
 isEnabled+0.027
PLA+0.027
Neg logit contributions
 Canter-0.031
 Ban-0.027
-0.027
 Dates-0.026
ajor-0.026
Token
Token ID447
Feature activation+2.16e-02

Pos logit contributions
 Canter+0.010
 Ban+0.009
+0.009
 Dates+0.009
ajor+0.009
Neg logit contributions
culosis-0.011
 glove-0.011
 concess-0.010
 isEnabled-0.010
PLA-0.010
Token
Token ID247
Feature activation+8.99e-03

Pos logit contributions
 Canter+0.016
 Ban+0.014
+0.014
 Dates+0.014
ajor+0.014
Neg logit contributions
culosis-0.016
 glove-0.015
 concess-0.015
 isEnabled-0.015
PLA-0.014
Tokent
Token ID83
Feature activation+1.67e-02

Pos logit contributions
 Canter+0.007
+0.006
 Ban+0.006
 Dates+0.006
ajor+0.006
Neg logit contributions
culosis-0.007
 glove-0.006
 concess-0.006
 isEnabled-0.006
PLA-0.006
Token proposing
Token ID24634
Feature activation-1.14e-01

Pos logit contributions
 Canter+0.012
 Ban+0.011
+0.011
 Dates+0.010
ajor+0.010
Neg logit contributions
culosis-0.013
 glove-0.012
 concess-0.012
 isEnabled-0.012
PLA-0.011
Token $
Token ID720
Feature activation-3.40e-02

Pos logit contributions
culosis+0.075
 glove+0.071
 concess+0.069
 isEnabled+0.068
PLA+0.068
Neg logit contributions
 Canter-0.078
 Ban-0.069
-0.069
 Dates-0.067
ajor-0.067
Token840
Token ID40675
Feature activation+9.76e-02

Pos logit contributions
culosis+0.026
 glove+0.025
 concess+0.025
 isEnabled+0.024
PLA+0.024
Neg logit contributions
 Canter-0.024
-0.021
 Ban-0.021
 Dates-0.020
ajor-0.020
Token to
Token ID284
Feature activation-3.21e-03

Pos logit contributions
 Canter+0.073
 Ban+0.065
+0.065
 Dates+0.063
ajor+0.063
Neg logit contributions
culosis-0.071
 glove-0.067
 concess-0.065
 isEnabled-0.065
PLA-0.064
Token cover
Token ID3002
Feature activation+4.83e-02

Pos logit contributions
culosis+0.002
 glove+0.002
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.002
-0.002
 Ban-0.002
 Dates-0.002
ajor-0.002
Token expenses
Token ID9307
Feature activation+2.50e-02

Pos logit contributions
 Canter+0.035
 Ban+0.032
+0.031
 Dates+0.030
ajor+0.030
Neg logit contributions
culosis-0.036
 glove-0.034
 concess-0.033
 isEnabled-0.033
PLA-0.032
Token working
Token ID1762
Feature activation-1.45e-01

Pos logit contributions
 Canter+0.019
+0.017
 Ban+0.017
 Dates+0.016
ajor+0.016
Neg logit contributions
culosis-0.018
 glove-0.017
 concess-0.017
 isEnabled-0.016
PLA-0.016
Token downtown
Token ID9436
Feature activation+1.95e-02

Pos logit contributions
culosis+0.110
 glove+0.104
 concess+0.102
 isEnabled+0.101
PLA+0.099
Neg logit contributions
 Canter-0.104
 Ban-0.093
-0.092
 Dates-0.089
ajor-0.089
Token.
Token ID13
Feature activation+1.40e-03

Pos logit contributions
 Canter+0.014
 Ban+0.013
+0.013
 Dates+0.012
ajor+0.012
Neg logit contributions
culosis-0.015
 glove-0.014
 concess-0.014
 isEnabled-0.013
PLA-0.013
Token (
Token ID357
Feature activation+1.52e-02

Pos logit contributions
 Canter+0.001
 Ban+0.001
+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
TokenCBC
Token ID29208
Feature activation-6.77e-02

Pos logit contributions
 Canter+0.011
+0.010
 Ban+0.010
ajor+0.010
 Dates+0.010
Neg logit contributions
culosis-0.011
 glove-0.010
 concess-0.010
 isEnabled-0.010
PLA-0.010
Token News
Token ID3000
Feature activation-7.20e-02

Pos logit contributions
culosis+0.054
 glove+0.052
 concess+0.051
 isEnabled+0.050
PLA+0.049
Neg logit contributions
 Canter-0.045
 Ban-0.040
-0.040
 Dates-0.039
ajor-0.038
Token snap
Token ID11495
Feature activation+2.50e-01

Pos logit contributions
culosis+0.017
 glove+0.017
 concess+0.016
 isEnabled+0.016
PLA+0.016
Neg logit contributions
 Canter-0.019
-0.017
 Ban-0.017
 Dates-0.016
ajor-0.016
Token off
Token ID572
Feature activation-1.19e-01

Pos logit contributions
 Canter+0.194
+0.174
 Ban+0.174
 Dates+0.169
ajor+0.168
Neg logit contributions
culosis-0.173
 glove-0.164
 concess-0.160
 isEnabled-0.157
PLA-0.155
Token anyway
Token ID6949
Feature activation-7.00e-02

Pos logit contributions
culosis+0.088
 glove+0.084
 concess+0.082
 isEnabled+0.081
PLA+0.080
Neg logit contributions
 Canter-0.086
 Ban-0.077
-0.077
 Dates-0.074
ajor-0.074
TokenOnce
Token ID7454
Feature activation-4.05e-02

Pos logit contributions
culosis+0.053
 glove+0.050
 concess+0.049
 isEnabled+0.048
PLA+0.048
Neg logit contributions
 Canter-0.050
-0.045
 Ban-0.044
 Dates-0.043
ajor-0.043
Token it
Token ID340
Feature activation-1.77e-02

Pos logit contributions
culosis+0.029
 glove+0.028
 concess+0.028
 isEnabled+0.027
PLA+0.027
Neg logit contributions
 Canter-0.030
 Ban-0.027
-0.027
 Dates-0.026
ajor-0.026
Token's
Token ID338
Feature activation-6.22e-02

Pos logit contributions
culosis+0.013
 glove+0.012
 concess+0.012
PLA+0.012
 isEnabled+0.012
Neg logit contributions
 Canter-0.013
-0.012
 Ban-0.012
 Dates-0.011
ajor-0.011
Token mostly
Token ID4632
Feature activation+5.03e-02

Pos logit contributions
culosis+0.047
 glove+0.044
 concess+0.044
 isEnabled+0.043
PLA+0.043
Neg logit contributions
 Canter-0.045
 Ban-0.040
-0.040
ajor-0.038
 Dates-0.038
Token detached
Token ID30795
Feature activation-7.59e-02

Pos logit contributions
 Canter+0.036
 Ban+0.032
+0.032
 Dates+0.030
ajor+0.030
Neg logit contributions
culosis-0.039
 glove-0.036
 concess-0.036
 isEnabled-0.035
PLA-0.035
Token you
Token ID345
Feature activation-1.11e-02

Pos logit contributions
culosis+0.056
 glove+0.053
 concess+0.052
 isEnabled+0.051
PLA+0.051
Neg logit contributions
 Canter-0.056
 Ban-0.050
-0.050
 Dates-0.048
ajor-0.048
Token just
Token ID655
Feature activation+1.48e-02

Pos logit contributions
culosis+0.008
 glove+0.007
 concess+0.007
 isEnabled+0.007
PLA+0.007
Neg logit contributions
 Canter-0.009
-0.008
 Ban-0.008
 Dates-0.007
ajor-0.007
Token need
Token ID761
Feature activation+2.67e-02

Pos logit contributions
 Canter+0.011
+0.010
 Ban+0.010
 Dates+0.009
ajor+0.009
Neg logit contributions
culosis-0.011
 glove-0.010
 concess-0.010
 isEnabled-0.010
PLA-0.010
Token on
Token ID319
Feature activation+1.18e-02

Pos logit contributions
 Canter+0.017
+0.015
 Ban+0.015
 Dates+0.014
ajor+0.014
Neg logit contributions
culosis-0.017
 glove-0.016
 concess-0.016
 isEnabled-0.016
PLA-0.015
Token the
Token ID262
Feature activation+3.62e-03

Pos logit contributions
 Canter+0.008
 Ban+0.008
+0.008
 Dates+0.007
ajor+0.007
Neg logit contributions
culosis-0.009
 glove-0.008
 concess-0.008
 isEnabled-0.008
PLA-0.008
Token list
Token ID1351
Feature activation-3.86e-02

Pos logit contributions
 Canter+0.003
 Ban+0.002
+0.002
 Dates+0.002
ajor+0.002
Neg logit contributions
culosis-0.003
 glove-0.003
 concess-0.003
 isEnabled-0.002
PLA-0.002
Token.
Token ID13
Feature activation-2.53e-04

Pos logit contributions
culosis+0.029
 glove+0.028
 concess+0.027
 isEnabled+0.027
PLA+0.027
Neg logit contributions
 Canter-0.028
 Ban-0.024
-0.024
 Dates-0.024
ajor-0.023
Token From
Token ID3574
Feature activation-7.60e-02

Pos logit contributions
culosis+0.000
 glove+0.000
 concess+0.000
 isEnabled+0.000
PLA+0.000
Neg logit contributions
 Canter-0.000
-0.000
 Ban-0.000
 Dates-0.000
ajor-0.000
Token Edward
Token ID10443
Feature activation-6.89e-02

Pos logit contributions
culosis+0.056
 glove+0.054
 concess+0.053
 isEnabled+0.052
PLA+0.051
Neg logit contributions
 Canter-0.055
 Ban-0.049
-0.049
 Dates-0.047
ajor-0.047
Token Has
Token ID7875
Feature activation+2.30e-02

Pos logit contributions
culosis+0.043
 glove+0.041
 concess+0.040
 isEnabled+0.039
PLA+0.039
Neg logit contributions
 Canter-0.058
-0.052
 Ban-0.052
 Dates-0.051
ajor-0.051
Tokenb
Token ID65
Feature activation-1.72e-02

Pos logit contributions
 Canter+0.014
 Ban+0.012
+0.012
 Dates+0.012
ajor+0.012
Neg logit contributions
culosis-0.020
 glove-0.019
 concess-0.019
 isEnabled-0.018
PLA-0.018
Tokenrou
Token ID472
Feature activation-3.27e-01

Pos logit contributions
culosis+0.008
 glove+0.007
 concess+0.007
 isEnabled+0.007
PLA+0.006
Neg logit contributions
 Canter-0.018
 Ban-0.016
-0.016
ajor-0.016
 Dates-0.016
Tokenck
Token ID694
Feature activation+2.41e-01

Pos logit contributions
culosis+0.333
 glove+0.321
 concess+0.316
 isEnabled+0.312
PLA+0.309
Neg logit contributions
 Canter-0.148
-0.122
 Ban-0.121
 Dates-0.115
ajor-0.114
Token's
Token ID338
Feature activation-5.57e-02

Pos logit contributions
 Canter+0.178
 Ban+0.159
+0.158
 Dates+0.153
ajor+0.153
Neg logit contributions
culosis-0.177
 glove-0.168
 concess-0.165
 isEnabled-0.162
PLA-0.160
Token el
Token ID1288
Feature activation-4.84e-02

Pos logit contributions
culosis+0.050
 glove+0.047
 concess+0.046
 isEnabled+0.045
PLA+0.045
Neg logit contributions
 Canter-0.054
 Ban-0.048
-0.048
 Dates-0.047
ajor-0.047
Tokenit
Token ID270
Feature activation-1.37e-02

Pos logit contributions
culosis+0.035
 glove+0.034
 concess+0.033
 isEnabled+0.032
PLA+0.032
Neg logit contributions
 Canter-0.036
-0.032
 Ban-0.032
 Dates-0.031
ajor-0.031
Tokenist
Token ID396
Feature activation-1.01e-01

Pos logit contributions
culosis+0.008
 glove+0.008
 concess+0.008
 isEnabled+0.007
PLA+0.007
Neg logit contributions
 Canter-0.012
-0.011
 Ban-0.011
 Dates-0.011
ajor-0.011
Token
Token ID447
Feature activation+2.29e-02

Pos logit contributions
culosis+0.071
 glove+0.068
 concess+0.066
 isEnabled+0.065
PLA+0.064
Neg logit contributions
 Canter-0.077
 Ban-0.069
-0.069
 Dates-0.067
ajor-0.067
Token
Token ID251
Feature activation+3.75e-03

Pos logit contributions
 Canter+0.015
+0.014
 Ban+0.014
 Dates+0.013
ajor+0.013
Neg logit contributions
culosis-0.018
 glove-0.017
 concess-0.017
 isEnabled-0.017
PLA-0.017
Token bloggers
Token ID30357
Feature activation-1.43e-01

Pos logit contributions
 Canter+0.003
 Ban+0.002
+0.002
 Dates+0.002
ajor+0.002
Neg logit contributions
culosis-0.003
 glove-0.003
 concess-0.003
 isEnabled-0.003
PLA-0.002
Token like
Token ID588
Feature activation+1.50e-02

Pos logit contributions
culosis+0.110
 glove+0.105
 concess+0.103
 isEnabled+0.101
PLA+0.100
Neg logit contributions
 Canter-0.100
-0.088
 Ban-0.088
 Dates-0.085
ajor-0.085
Token John
Token ID1757
Feature activation-1.24e-01

Pos logit contributions
 Canter+0.011
 Ban+0.010
+0.010
 Dates+0.010
ajor+0.010
Neg logit contributions
culosis-0.011
 glove-0.010
 concess-0.010
 isEnabled-0.010
PLA-0.010
Token Scal
Token ID34529
Feature activation-2.18e-01

Pos logit contributions
culosis+0.091
 glove+0.087
 concess+0.085
 isEnabled+0.084
PLA+0.083
Neg logit contributions
 Canter-0.091
 Ban-0.081
-0.081
 Dates-0.078
ajor-0.078
Tokenzi
Token ID17027
Feature activation-2.55e-02

Pos logit contributions
culosis+0.141
 glove+0.134
 isEnabled+0.128
 concess+0.128
PLA+0.127
Neg logit contributions
 Canter-0.179
-0.162
 Ban-0.160
 Dates-0.156
ajor-0.155
Token
Token ID960
Feature activation+4.70e-02

Pos logit contributions
culosis+0.020
 glove+0.019
 concess+0.018
 isEnabled+0.018
PLA+0.018
Neg logit contributions
 Canter-0.018
-0.016
 Ban-0.016
 Dates-0.015
ajor-0.015
Tokenot
Token ID313
Feature activation-4.39e-02

Pos logit contributions
 Canter+0.040
 Ban+0.036
+0.036
ajor+0.035
 Dates+0.035
Neg logit contributions
culosis-0.036
 glove-0.035
 concess-0.034
 isEnabled-0.033
PLA-0.033
Tokenis
Token ID271
Feature activation+3.35e-02

Pos logit contributions
culosis+0.034
 glove+0.032
 concess+0.031
 isEnabled+0.031
PLA+0.031
Neg logit contributions
 Canter-0.031
 Ban-0.027
-0.027
 Dates-0.026
ajor-0.026
Token group
Token ID1448
Feature activation+1.90e-01

Pos logit contributions
 Canter+0.026
 Ban+0.024
+0.024
 Dates+0.023
ajor+0.023
Neg logit contributions
culosis-0.023
 glove-0.022
 concess-0.021
 isEnabled-0.021
PLA-0.021
Token identifies
Token ID21079
Feature activation-1.63e-01

Pos logit contributions
 Canter+0.137
 Ban+0.122
+0.121
 Dates+0.117
ajor+0.117
Neg logit contributions
culosis-0.142
 glove-0.136
 concess-0.133
 isEnabled-0.131
PLA-0.129
Token additional
Token ID3224
Feature activation+2.59e-02

Pos logit contributions
culosis+0.120
 glove+0.114
 concess+0.112
 isEnabled+0.109
PLA+0.108
Neg logit contributions
 Canter-0.120
 Ban-0.107
-0.107
 Dates-0.103
ajor-0.103
Token tax
Token ID1687
Feature activation-3.50e-01

Pos logit contributions
 Canter+0.020
 Ban+0.018
+0.018
 Dates+0.017
ajor+0.017
Neg logit contributions
culosis-0.018
 glove-0.017
 concess-0.017
 isEnabled-0.017
PLA-0.016
Tokena
Token ID64
Feature activation-1.06e-01

Pos logit contributions
culosis+0.210
 glove+0.198
 concess+0.193
 isEnabled+0.188
PLA+0.185
Neg logit contributions
 Canter-0.305
 Ban-0.277
-0.276
 Dates-0.270
ajor-0.269
Token.
Token ID13
Feature activation-1.01e-03

Pos logit contributions
culosis+0.079
 glove+0.075
 concess+0.074
 isEnabled+0.073
PLA+0.072
Neg logit contributions
 Canter-0.077
 Ban-0.068
-0.068
 Dates-0.066
ajor-0.066
Token Australian
Token ID6638
Feature activation-5.97e-02

Pos logit contributions
culosis+0.001
 glove+0.001
 concess+0.001
 isEnabled+0.001
PLA+0.001
Neg logit contributions
 Canter-0.001
 Ban-0.001
-0.001
 Dates-0.001
ajor-0.001
Token Journal
Token ID4913
Feature activation-7.11e-02

Pos logit contributions
culosis+0.042
 glove+0.040
 concess+0.039
 isEnabled+0.038
PLA+0.038
Neg logit contributions
 Canter-0.046
 Ban-0.042
-0.041
 Dates-0.040
ajor-0.040
Token of
Token ID286
Feature activation+5.30e-04

Pos logit contributions
culosis+0.054
 glove+0.051
 concess+0.051
 isEnabled+0.050
PLA+0.049
Neg logit contributions
 Canter-0.051
 Ban-0.045
-0.045
 Dates-0.043
ajor-0.043
Token world
Token ID995
Feature activation+3.58e-02

Pos logit contributions
 Canter+0.005
 Ban+0.004
+0.004
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.004
 glove-0.004
 concess-0.004
 isEnabled-0.004
PLA-0.004
Token enters
Token ID14170
Feature activation-2.14e-02

Pos logit contributions
 Canter+0.027
 Ban+0.025
+0.025
 Dates+0.024
ajor+0.024
Neg logit contributions
culosis-0.025
 glove-0.024
 concess-0.024
 isEnabled-0.023
PLA-0.023
Token the
Token ID262
Feature activation+4.03e-03

Pos logit contributions
culosis+0.016
 glove+0.015
 concess+0.015
 isEnabled+0.015
PLA+0.015
Neg logit contributions
 Canter-0.015
 Ban-0.014
-0.014
ajor-0.013
 Dates-0.013
Token age
Token ID2479
Feature activation+6.67e-03

Pos logit contributions
 Canter+0.003
 Ban+0.003
+0.003
 Dates+0.002
ajor+0.002
Neg logit contributions
culosis-0.003
 glove-0.003
 concess-0.003
 isEnabled-0.003
PLA-0.003
Token of
Token ID286
Feature activation+1.43e-02

Pos logit contributions
 Canter+0.005
+0.004
 Ban+0.004
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.005
 glove-0.005
 concess-0.005
 isEnabled-0.005
PLA-0.005
Token climate
Token ID4258
Feature activation-3.37e-01

Pos logit contributions
 Canter+0.012
 Ban+0.011
+0.011
 Dates+0.011
ajor+0.011
Neg logit contributions
culosis-0.009
 glove-0.008
 concess-0.008
 isEnabled-0.008
PLA-0.008
Token change
Token ID1487
Feature activation+2.82e-01

Pos logit contributions
culosis+0.358
 glove+0.348
 concess+0.343
 isEnabled+0.337
PLA+0.333
Neg logit contributions
 Canter-0.138
-0.109
 Ban-0.108
 Dates-0.103
ajor-0.102
Token,
Token ID11
Feature activation+6.67e-03

Pos logit contributions
 Canter+0.199
 Ban+0.177
+0.177
 Dates+0.170
ajor+0.170
Neg logit contributions
culosis-0.215
 glove-0.205
 concess-0.201
 isEnabled-0.198
PLA-0.196
Token the
Token ID262
Feature activation+9.36e-03

Pos logit contributions
 Canter+0.005
 Ban+0.004
+0.004
 Dates+0.004
ajor+0.004
Neg logit contributions
culosis-0.005
 glove-0.005
 concess-0.005
 isEnabled-0.004
PLA-0.004
Token tiny
Token ID7009
Feature activation+2.59e-02

Pos logit contributions
 Canter+0.007
 Ban+0.006
+0.006
 Dates+0.006
ajor+0.006
Neg logit contributions
culosis-0.007
 glove-0.007
 concess-0.007
 isEnabled-0.006
PLA-0.006
Token islands
Token ID14807
Feature activation-3.90e-02

Pos logit contributions
 Canter+0.020
 Ban+0.018
+0.018
 Dates+0.018
ajor+0.018
Neg logit contributions
culosis-0.018
 glove-0.017
 concess-0.016
 isEnabled-0.016
PLA-0.016
Token Donald
Token ID3759
Feature activation-5.71e-02

Pos logit contributions
culosis+0.073
 glove+0.069
 concess+0.068
 isEnabled+0.066
PLA+0.065
Neg logit contributions
 Canter-0.086
-0.077
 Ban-0.077
 Dates-0.075
ajor-0.075
Token J
Token ID449
Feature activation-2.90e-02

Pos logit contributions
culosis+0.042
 glove+0.040
 concess+0.039
 isEnabled+0.038
PLA+0.038
Neg logit contributions
 Canter-0.042
-0.038
 Ban-0.038
 Dates-0.036
ajor-0.036
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
culosis+0.022
 glove+0.021
 concess+0.020
 isEnabled+0.020
PLA+0.020
Neg logit contributions
 Canter-0.021
-0.019
 Ban-0.018
 Dates-0.018
ajor-0.018
Token Trump
Token ID1301
Feature activation-2.08e-02

Pos logit contributions
culosis+0.007
 glove+0.007
 concess+0.007
 isEnabled+0.007
PLA+0.007
Neg logit contributions
 Canter-0.008
-0.007
 Ban-0.007
ajor-0.007
 Dates-0.007
Token (@
Token ID4275
Feature activation+1.09e-01

Pos logit contributions
culosis+0.019
 glove+0.018
 concess+0.018
 isEnabled+0.018
PLA+0.017
Neg logit contributions
 Canter-0.012
 Ban-0.010
-0.010
 Dates-0.010
ajor-0.010
TokenrealDonaldTrump
Token ID28024
Feature activation-3.38e-01

Pos logit contributions
 Canter+0.123
 Ban+0.113
+0.113
 Dates+0.111
ajor+0.111
Neg logit contributions
culosis-0.037
 glove-0.034
 concess-0.032
 isEnabled-0.031
PLA-0.031
Token)
Token ID8
Feature activation+3.91e-02

Pos logit contributions
culosis+0.258
 glove+0.250
 isEnabled+0.243
 concess+0.241
PLA+0.241
Neg logit contributions
 Canter-0.235
 Ban-0.205
-0.204
 Dates-0.199
ajor-0.198
Token November
Token ID3389
Feature activation+5.53e-02

Pos logit contributions
 Canter+0.026
+0.023
 Ban+0.023
ajor+0.022
 Dates+0.022
Neg logit contributions
culosis-0.031
 glove-0.030
 concess-0.029
 isEnabled-0.029
PLA-0.029
Token 3
Token ID513
Feature activation-2.66e-02

Pos logit contributions
 Canter+0.042
 Ban+0.037
+0.037
 Dates+0.036
ajor+0.036
Neg logit contributions
culosis-0.040
 glove-0.038
 concess-0.037
 isEnabled-0.036
PLA-0.036
Token,
Token ID11
Feature activation-5.21e-03

Pos logit contributions
culosis+0.020
 glove+0.019
 concess+0.018
 isEnabled+0.018
PLA+0.018
Neg logit contributions
 Canter-0.019
-0.017
 Ban-0.017
 Dates-0.017
ajor-0.017
Token 2017
Token ID2177
Feature activation-1.19e-02

Pos logit contributions
culosis+0.004
 glove+0.004
 concess+0.003
 isEnabled+0.003
PLA+0.003
Neg logit contributions
 Canter-0.004
-0.003
 Ban-0.003
 Dates-0.003
ajor-0.003
Token.
Token ID13
Feature activation-2.67e-03

Pos logit contributions
culosis+0.091
 glove+0.087
 concess+0.085
 isEnabled+0.083
PLA+0.082
Neg logit contributions
 Canter-0.088
-0.079
 Ban-0.079
 Dates-0.076
ajor-0.076
Token
Token ID198
Feature activation-9.66e-03

Pos logit contributions
culosis+0.002
 glove+0.002
 concess+0.002
 isEnabled+0.002
PLA+0.002
Neg logit contributions
 Canter-0.002
-0.002
 Ban-0.002
ajor-0.002
 Dates-0.002
Token
Token ID198
Feature activation-1.04e-02

Pos logit contributions
culosis+0.007
 glove+0.007
 concess+0.007
 isEnabled+0.007
PLA+0.007
Neg logit contributions
 Canter-0.007
-0.006
 Ban-0.006
 Dates-0.006
ajor-0.006
TokenThe
Token ID464
Feature activation+1.70e-02

Pos logit contributions
culosis+0.008
 glove+0.007
 concess+0.007
 isEnabled+0.007
PLA+0.007
Neg logit contributions
 Canter-0.008
 Ban-0.007
-0.007
 Dates-0.007
ajor-0.007
Token Florida
Token ID4744
Feature activation-1.11e-01

Pos logit contributions
 Canter+0.013
 Ban+0.012
+0.012
 Dates+0.012
ajor+0.012
Neg logit contributions
culosis-0.012
 glove-0.011
 concess-0.011
 isEnabled-0.011
PLA-0.010
Token senator
Token ID12329
Feature activation-3.67e-01

Pos logit contributions
culosis+0.076
 glove+0.073
 concess+0.071
 isEnabled+0.070
PLA+0.069
Neg logit contributions
 Canter-0.087
-0.078
 Ban-0.078
 Dates-0.075
ajor-0.075
Token has
Token ID468
Feature activation-8.62e-03

Pos logit contributions
culosis+0.273
 glove+0.261
 concess+0.255
 isEnabled+0.251
PLA+0.248
Neg logit contributions
 Canter-0.265
-0.236
 Ban-0.236
 Dates-0.228
ajor-0.227
Token also
Token ID635
Feature activation+6.27e-02

Pos logit contributions
culosis+0.006
 glove+0.006
 concess+0.006
 isEnabled+0.006
PLA+0.006
Neg logit contributions
 Canter-0.006
 Ban-0.006
-0.006
 Dates-0.006
ajor-0.005
Token refused
Token ID6520
Feature activation-3.80e-02

Pos logit contributions
 Canter+0.048
 Ban+0.043
+0.043
 Dates+0.041
ajor+0.041
Neg logit contributions
culosis-0.044
 glove-0.042
 concess-0.041
 isEnabled-0.040
PLA-0.040
Token to
Token ID284
Feature activation-5.90e-03

Pos logit contributions
culosis+0.029
 glove+0.027
 concess+0.027
 isEnabled+0.026
PLA+0.026
Neg logit contributions
 Canter-0.027
 Ban-0.024
-0.024
 Dates-0.023
ajor-0.023
Token answer
Token ID3280
Feature activation-1.09e-01

Pos logit contributions
culosis+0.004
 glove+0.004
 concess+0.004
 isEnabled+0.004
PLA+0.004
Neg logit contributions
 Canter-0.005
 Ban-0.004
-0.004
 Dates-0.004
ajor-0.004
Token drugs
Token ID5010
Feature activation-4.97e-02

Pos logit contributions
culosis+0.047
 glove+0.045
 concess+0.043
 isEnabled+0.043
PLA+0.042
Neg logit contributions
 Canter-0.048
-0.042
 Ban-0.042
ajor-0.041
 Dates-0.041
Token come
Token ID1282
Feature activation+4.34e-02

Pos logit contributions
culosis+0.037
 glove+0.035
 concess+0.034
 isEnabled+0.034
PLA+0.033
Neg logit contributions
 Canter-0.036
 Ban-0.032
-0.032
 Dates-0.031
ajor-0.031
Token off
Token ID572
Feature activation-1.07e-01

Pos logit contributions
 Canter+0.031
 Ban+0.028
+0.028
 Dates+0.027
ajor+0.027
Neg logit contributions
culosis-0.033
 glove-0.031
 concess-0.030
 isEnabled-0.030
PLA-0.030
Token their
Token ID511
Feature activation+4.14e-02

Pos logit contributions
culosis+0.079
 glove+0.075
 concess+0.073
 isEnabled+0.072
PLA+0.071
Neg logit contributions
 Canter-0.079
-0.070
 Ban-0.070
ajor-0.068
 Dates-0.068
Token cr
Token ID1067
Feature activation-3.95e-02

Pos logit contributions
 Canter+0.031
+0.028
 Ban+0.028
 Dates+0.027
ajor+0.027
Neg logit contributions
culosis-0.030
 glove-0.029
 concess-0.028
 isEnabled-0.027
PLA-0.027
Tokenutch
Token ID7140
Feature activation-3.54e-01

Pos logit contributions
culosis+0.025
 glove+0.024
 concess+0.023
 isEnabled+0.022
PLA+0.022
Neg logit contributions
 Canter-0.033
-0.030
 Ban-0.030
 Dates-0.029
ajor-0.029
Token.
Token ID13
Feature activation+6.25e-03

Pos logit contributions
culosis+0.264
 glove+0.252
 concess+0.247
 isEnabled+0.242
PLA+0.239
Neg logit contributions
 Canter-0.256
 Ban-0.228
-0.228
 Dates-0.220
ajor-0.220
Token
Token ID198
Feature activation+4.93e-03

Pos logit contributions
 Canter+0.004
+0.004
 Ban+0.004
ajor+0.004
 Dates+0.004
Neg logit contributions
culosis-0.005
 glove-0.004
 concess-0.004
 isEnabled-0.004
PLA-0.004
Token
Token ID198
Feature activation+1.22e-03

Pos logit contributions
 Canter+0.004
+0.003
 Ban+0.003
 Dates+0.003
ajor+0.003
Neg logit contributions
culosis-0.004
 glove-0.004
 concess-0.003
 isEnabled-0.003
PLA-0.003
TokenThere
Token ID1858
Feature activation-3.13e-02

Pos logit contributions
 Canter+0.001
 Ban+0.001
+0.001
 Dates+0.001
ajor+0.001
Neg logit contributions
culosis-0.001
 glove-0.001
 concess-0.001
 isEnabled-0.001
PLA-0.001
Token are
Token ID389
Feature activation-1.77e-02

Pos logit contributions
culosis+0.022
 glove+0.021
 concess+0.020
 isEnabled+0.020
PLA+0.020
Neg logit contributions
 Canter-0.024
 Ban-0.021
-0.021
 Dates-0.021
ajor-0.021