TOP ACTIVATIONS
MAX = 0.678

plugins on Windows , Audio Unit plugins on OS X ,
4 . Expl ained Murd y ,
zz lement and char lat anism , all later dropped .
problem . Expl ained Marin , When
he says , sounding p ained . So you
alf , Gim li the Dwarf , Leg olas the Elf
two m aint ained frequent , direct contact
on the k art ing circuit , starting at the tender
b unch of mag g ots .
ways . Hist orian Regina Mor antz highlighted the
Ay at ollah Ali Kham enei , warned that the day
Ole ch owski , a Qu ist spokeswoman , said the
of a political dial ogue .
handling . Best for precision circuit rac ers who aim for
. En light enment is like that . EVERY
attack ads casting Mr . Qu ist as more
poll shows that Mr . Qu ist has cut Mr .
different concepts from the arg uable king of curse , Scott
ho arded and ready for immediate export to aid his first
revision . Cons idered the economy most vulnerable to

MIN ACTIVATIONS
MIN = -0.595

offered a tweet of condol ence to those affected by the
FO OT T OC AK NOW LED G ED NET CALL
library together : pub struct Atomic < T >
drinks . Plan ning a Christmas or new year
toss in hyper bole here . PART
' a . pub struct Shared < ' a
are basically continuing to be equally as tight - f isted
E ER T OC AK NOW LED G ED NET CALL
SEM INAL MA IN AK NOW LED G ED NET CALL
rules are designed to work equally well whether or not you
majority of China 's air traffic is still concentrated among a
steel as an " organ ization of distinction ."
William Carpenter , Mont pel ier Con Hogan
capacity and very high decentral ization . CL TV ( BIP
. Will Dun ning , who has his own
endure great mechanical stress ( bending open and slamming
's Halo super fans Ryan McC aff rey and Brian Albert
in jury / research / pub / alcohol - laws /
adults ); pres ence of critical illness - related

INTERVAL 0.360 - 0.678
CONTAINS 0.184%

minute scene that ends with Sed aris covered in blood ,
it s an assemb l age of disparate things
one had happened . A Redd itor posted what they claimed
, Lv iv and Iv ano - Frank iv sk .
even modify your interior . Susp ension and engine performance can

INTERVAL 0.042 - 0.360
CONTAINS 28.913%

report concluded that Ike a managers were aware of the possibility
Hollande has set out an ambitious goal for the Paris meeting
of Part icle :: Get Mass ( ambient D ensity ).
ker atin 9 and ker atin 16 .[ 22 ]
prime minister should take Mr Hes elt ine 's comments "

INTERVAL -0.276 - 0.042
CONTAINS 69.930%

Finally , the Obama administration might reach out to
You see fewer people on the In man 300
! The segment is actually painful to
judge you for playing some Pokemon while in line . In
ashamed of in a woman s organization , and

INTERVAL -0.595 - -0.276
CONTAINS 0.973%

and seven hundred were be headed therein .[ 1 ][ 27
thought to have a distinct microbiome across all women was the
range at will . If no direction is held during the
have been changed for the present article to use the new
V V p DM 2 V INE : Im Justin

TOKEN DETAILS

Token plugins
Token ID20652
Feature activation+2.70e-01

Pos logit contributions
ascript+0.023
Unit+0.021
anse+0.019
arius+0.019
dial+0.018
Neg logit contributions
\/\/-0.017
ivas-0.016
 equally-0.014
NESS-0.014
 pastry-0.014
Token on
Token ID319
Feature activation+1.71e-01

Pos logit contributions
ascript+0.238
Unit+0.225
anse+0.204
arius+0.203
dial+0.191
Neg logit contributions
\/\/-0.177
ivas-0.165
 equally-0.148
NESS-0.146
 pastry-0.145
Token Windows
Token ID3964
Feature activation+1.54e-01

Pos logit contributions
ascript+0.151
Unit+0.143
anse+0.130
arius+0.129
dial+0.121
Neg logit contributions
\/\/-0.111
ivas-0.104
 equally-0.092
NESS-0.092
 pastry-0.091
Token,
Token ID11
Feature activation+4.12e-02

Pos logit contributions
ascript+0.136
Unit+0.129
anse+0.116
arius+0.116
dial+0.109
Neg logit contributions
\/\/-0.100
ivas-0.093
NESS-0.083
 equally-0.083
 pastry-0.082
Token Audio
Token ID13491
Feature activation-9.36e-03

Pos logit contributions
ascript+0.037
Unit+0.035
anse+0.031
arius+0.031
dial+0.029
Neg logit contributions
\/\/-0.027
ivas-0.025
 equally-0.022
NESS-0.022
 pastry-0.022
TokenUnit
Token ID26453
Feature activation+6.78e-01

Pos logit contributions
\/\/+0.007
ivas+0.006
 equally+0.006
NESS+0.006
 pastry+0.006
Neg logit contributions
ascript-0.008
Unit-0.007
anse-0.006
arius-0.006
dial-0.006
Token plugins
Token ID20652
Feature activation+2.22e-01

Pos logit contributions
ascript+0.614
Unit+0.582
anse+0.528
arius+0.526
dial+0.497
Neg logit contributions
\/\/-0.428
ivas-0.397
 equally-0.351
NESS-0.350
 pastry-0.347
Token on
Token ID319
Feature activation+1.20e-01

Pos logit contributions
ascript+0.195
Unit+0.185
anse+0.168
arius+0.167
dial+0.157
Neg logit contributions
\/\/-0.144
ivas-0.134
 equally-0.121
NESS-0.119
 pastry-0.119
Token OS
Token ID7294
Feature activation-1.00e-01

Pos logit contributions
ascript+0.106
Unit+0.100
anse+0.091
arius+0.090
dial+0.085
Neg logit contributions
\/\/-0.078
ivas-0.073
 equally-0.065
NESS-0.065
 pastry-0.065
TokenX
Token ID55
Feature activation-1.30e-01

Pos logit contributions
\/\/+0.058
ivas+0.053
 equally+0.047
NESS+0.046
 pastry+0.046
Neg logit contributions
ascript-0.096
Unit-0.091
anse-0.083
arius-0.083
dial-0.078
Token,
Token ID11
Feature activation+3.78e-02

Pos logit contributions
\/\/+0.080
ivas+0.075
 equally+0.066
NESS+0.065
 pastry+0.065
Neg logit contributions
ascript-0.119
Unit-0.113
anse-0.103
arius-0.103
dial-0.097
Token 4
Token ID604
Feature activation-1.46e-02

Pos logit contributions
ascript+0.046
Unit+0.043
anse+0.039
arius+0.039
dial+0.036
Neg logit contributions
\/\/-0.039
ivas-0.037
 equally-0.033
NESS-0.033
 pastry-0.033
Token.
Token ID13
Feature activation+3.47e-02

Pos logit contributions
\/\/+0.010
ivas+0.009
 equally+0.008
NESS+0.008
 pastry+0.008
Neg logit contributions
ascript-0.013
Unit-0.012
anse-0.011
arius-0.011
dial-0.010
Token
Token ID198
Feature activation+5.54e-02

Pos logit contributions
ascript+0.031
Unit+0.029
anse+0.026
arius+0.026
dial+0.025
Neg logit contributions
\/\/-0.023
ivas-0.021
 equally-0.019
NESS-0.019
 pastry-0.019
Token
Token ID198
Feature activation+5.98e-02

Pos logit contributions
ascript+0.049
Unit+0.046
anse+0.042
arius+0.042
dial+0.039
Neg logit contributions
\/\/-0.036
ivas-0.034
 equally-0.030
NESS-0.030
 pastry-0.030
TokenExpl
Token ID18438
Feature activation-2.07e-02

Pos logit contributions
ascript+0.049
Unit+0.046
anse+0.041
arius+0.041
dial+0.038
Neg logit contributions
\/\/-0.043
ivas-0.041
 equally-0.036
NESS-0.036
 pastry-0.036
Tokenained
Token ID1328
Feature activation+6.72e-01

Pos logit contributions
\/\/+0.018
ivas+0.017
 equally+0.016
NESS+0.016
 pastry+0.015
Neg logit contributions
ascript-0.014
Unit-0.013
anse-0.011
arius-0.011
dial-0.010
Token Murd
Token ID20305
Feature activation+2.32e-01

Pos logit contributions
ascript+0.571
Unit+0.540
anse+0.486
arius+0.483
dial+0.456
Neg logit contributions
\/\/-0.460
ivas-0.430
NESS-0.382
 equally-0.379
 pastry-0.377
Tokeny
Token ID88
Feature activation-2.16e-02

Pos logit contributions
ascript+0.239
Unit+0.228
anse+0.209
arius+0.208
dial+0.199
Neg logit contributions
\/\/-0.117
ivas-0.107
 equally-0.091
NESS-0.090
 pastry-0.090
Token,
Token ID11
Feature activation+3.22e-02

Pos logit contributions
\/\/+0.014
ivas+0.013
NESS+0.011
 equally+0.011
 pastry+0.011
Neg logit contributions
ascript-0.019
Unit-0.018
anse-0.017
arius-0.016
dial-0.016
Token
Token ID564
Feature activation+5.69e-02

Pos logit contributions
ascript+0.025
Unit+0.023
anse+0.021
arius+0.021
dial+0.019
Neg logit contributions
\/\/-0.025
ivas-0.023
NESS-0.021
 equally-0.021
 pastry-0.021
Token
Token ID250
Feature activation-2.39e-02

Pos logit contributions
ascript+0.044
Unit+0.041
anse+0.036
arius+0.036
dial+0.034
Neg logit contributions
\/\/-0.044
ivas-0.041
 equally-0.037
NESS-0.037
 pastry-0.037
Tokenzz
Token ID3019
Feature activation+1.34e-01

Pos logit contributions
\/\/+0.039
ivas+0.037
NESS+0.034
 equally+0.034
 pastry+0.034
Neg logit contributions
ascript-0.033
Unit-0.031
anse-0.028
arius-0.027
dial-0.025
Tokenlement
Token ID1732
Feature activation+1.03e-01

Pos logit contributions
ascript+0.110
Unit+0.104
anse+0.094
arius+0.093
dial+0.087
Neg logit contributions
\/\/-0.096
ivas-0.090
NESS-0.081
 equally-0.081
 framing-0.080
Token and
Token ID290
Feature activation+5.63e-02

Pos logit contributions
ascript+0.089
Unit+0.084
anse+0.076
arius+0.076
dial+0.071
Neg logit contributions
\/\/-0.069
ivas-0.064
NESS-0.057
 equally-0.057
 pastry-0.056
Token char
Token ID1149
Feature activation-7.22e-02

Pos logit contributions
ascript+0.050
Unit+0.048
anse+0.043
arius+0.043
dial+0.041
Neg logit contributions
\/\/-0.036
ivas-0.034
NESS-0.030
 equally-0.029
 BALL-0.029
Tokenlat
Token ID15460
Feature activation-4.32e-02

Pos logit contributions
\/\/+0.031
ivas+0.028
 equally+0.023
NESS+0.023
 pastry+0.022
Neg logit contributions
ascript-0.080
Unit-0.077
anse-0.071
arius-0.071
dial-0.067
Tokenanism
Token ID48162
Feature activation+6.46e-01

Pos logit contributions
\/\/+0.051
ivas+0.049
 equally+0.046
NESS+0.046
 pastry+0.046
Neg logit contributions
ascript-0.016
Unit-0.014
anse-0.010
arius-0.010
dial-0.008
Token,
Token ID11
Feature activation+3.23e-03

Pos logit contributions
ascript+0.554
Unit+0.523
anse+0.471
arius+0.470
dial+0.442
Neg logit contributions
\/\/-0.437
ivas-0.408
NESS-0.364
 equally-0.360
 BALL-0.360
Token all
Token ID477
Feature activation+3.00e-02

Pos logit contributions
ascript+0.003
Unit+0.003
anse+0.002
arius+0.002
dial+0.002
Neg logit contributions
\/\/-0.002
ivas-0.002
 equally-0.002
NESS-0.002
 pastry-0.002
Token later
Token ID1568
Feature activation-7.14e-02

Pos logit contributions
ascript+0.027
Unit+0.025
anse+0.023
arius+0.023
dial+0.022
Neg logit contributions
\/\/-0.019
ivas-0.018
NESS-0.016
 equally-0.016
 pastry-0.016
Token dropped
Token ID5710
Feature activation+1.22e-01

Pos logit contributions
\/\/+0.055
ivas+0.052
 equally+0.047
NESS+0.047
 pastry+0.047
Neg logit contributions
ascript-0.055
Unit-0.051
anse-0.046
arius-0.045
dial-0.042
Token.
Token ID13
Feature activation-5.70e-03

Pos logit contributions
ascript+0.103
Unit+0.097
anse+0.087
arius+0.087
dial+0.082
Neg logit contributions
\/\/-0.084
ivas-0.078
NESS-0.070
 pastry-0.069
 equally-0.069
Token problem
Token ID1917
Feature activation-1.33e-01

Pos logit contributions
\/\/+0.003
ivas+0.003
NESS+0.003
 equally+0.003
 pastry+0.003
Neg logit contributions
ascript-0.005
Unit-0.005
anse-0.004
arius-0.004
dial-0.004
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
\/\/+0.093
ivas+0.087
NESS+0.077
 pastry+0.077
 equally+0.077
Neg logit contributions
ascript-0.111
Unit-0.105
anse-0.094
arius-0.094
dial-0.088
Token
Token ID447
Feature activation+1.75e-02

Pos logit contributions
ascript+0.013
Unit+0.012
anse+0.011
arius+0.011
dial+0.010
Neg logit contributions
\/\/-0.010
ivas-0.009
 equally-0.008
NESS-0.008
 pastry-0.008
Token
Token ID251
Feature activation+1.07e-02

Pos logit contributions
ascript+0.011
Unit+0.010
anse+0.009
arius+0.009
dial+0.008
Neg logit contributions
\/\/-0.016
ivas-0.015
NESS-0.014
 equally-0.014
 pastry-0.014
Token Expl
Token ID5905
Feature activation-1.64e-01

Pos logit contributions
ascript+0.009
Unit+0.009
anse+0.008
arius+0.008
dial+0.008
Neg logit contributions
\/\/-0.007
ivas-0.007
 equally-0.006
NESS-0.006
 pastry-0.006
Tokenained
Token ID1328
Feature activation+6.20e-01

Pos logit contributions
\/\/+0.158
ivas+0.151
 equally+0.140
NESS+0.139
 pastry+0.138
Neg logit contributions
ascript-0.094
Unit-0.086
anse-0.072
arius-0.072
dial-0.065
Token Marin
Token ID36384
Feature activation+3.98e-02

Pos logit contributions
ascript+0.509
Unit+0.482
anse+0.431
arius+0.431
dial+0.404
Neg logit contributions
\/\/-0.440
ivas-0.412
NESS-0.369
 equally-0.367
 pastry-0.365
Token,
Token ID11
Feature activation-2.00e-02

Pos logit contributions
ascript+0.036
Unit+0.034
anse+0.031
arius+0.030
dial+0.029
Neg logit contributions
\/\/-0.026
ivas-0.024
NESS-0.021
 pastry-0.021
 BALL-0.021
Token
Token ID564
Feature activation-1.07e-02

Pos logit contributions
\/\/+0.015
ivas+0.014
 equally+0.013
NESS+0.013
 pastry+0.013
Neg logit contributions
ascript-0.016
Unit-0.015
anse-0.013
arius-0.013
dial-0.012
Token
Token ID250
Feature activation+2.08e-02

Pos logit contributions
\/\/+0.008
ivas+0.008
 equally+0.007
NESS+0.007
 framing+0.007
Neg logit contributions
ascript-0.008
Unit-0.008
anse-0.007
arius-0.007
dial-0.006
TokenWhen
Token ID2215
Feature activation-1.74e-02

Pos logit contributions
ascript+0.016
Unit+0.015
anse+0.013
arius+0.013
dial+0.012
Neg logit contributions
\/\/-0.016
ivas-0.015
 equally-0.014
NESS-0.014
 pastry-0.014
Token he
Token ID339
Feature activation-1.66e-02

Pos logit contributions
ascript+0.005
Unit+0.004
anse+0.004
arius+0.004
dial+0.004
Neg logit contributions
\/\/-0.003
ivas-0.003
 equally-0.003
NESS-0.003
 pastry-0.003
Token says
Token ID1139
Feature activation+1.33e-02

Pos logit contributions
\/\/+0.013
ivas+0.012
NESS+0.011
 equally+0.011
 pastry+0.011
Neg logit contributions
ascript-0.013
Unit-0.012
anse-0.011
arius-0.011
dial-0.010
Token,
Token ID11
Feature activation-6.50e-02

Pos logit contributions
ascript+0.011
Unit+0.010
anse+0.009
arius+0.009
dial+0.009
Neg logit contributions
\/\/-0.009
ivas-0.009
NESS-0.008
 equally-0.008
 pastry-0.008
Token sounding
Token ID22655
Feature activation-2.46e-01

Pos logit contributions
\/\/+0.049
ivas+0.046
 equally+0.042
NESS+0.041
 pastry+0.041
Neg logit contributions
ascript-0.051
Unit-0.048
anse-0.043
arius-0.042
dial-0.040
Token p
Token ID279
Feature activation-7.85e-02

Pos logit contributions
\/\/+0.161
ivas+0.150
 equally+0.134
NESS+0.133
 pastry+0.132
Neg logit contributions
ascript-0.218
Unit-0.206
anse-0.186
arius-0.185
dial-0.175
Tokenained
Token ID1328
Feature activation+5.83e-01

Pos logit contributions
\/\/+0.091
ivas+0.088
 equally+0.082
 pastry+0.082
NESS+0.081
Neg logit contributions
ascript-0.030
Unit-0.026
arius-0.019
anse-0.019
dial-0.016
Token.
Token ID13
Feature activation-2.02e-02

Pos logit contributions
ascript+0.494
Unit+0.463
anse+0.418
arius+0.417
dial+0.393
Neg logit contributions
\/\/-0.402
ivas-0.377
NESS-0.335
 pastry-0.331
 BALL-0.331
Token
Token ID198
Feature activation-2.89e-02

Pos logit contributions
\/\/+0.015
ivas+0.014
NESS+0.013
 equally+0.013
 pastry+0.013
Neg logit contributions
ascript-0.016
Unit-0.015
anse-0.014
arius-0.013
dial-0.013
Token
Token ID198
Feature activation-2.43e-02

Pos logit contributions
\/\/+0.019
ivas+0.017
NESS+0.015
 equally+0.015
 BALL+0.015
Neg logit contributions
ascript-0.026
Unit-0.024
anse-0.022
arius-0.022
dial-0.021
TokenSo
Token ID2396
Feature activation-6.83e-02

Pos logit contributions
\/\/+0.018
ivas+0.016
NESS+0.015
 equally+0.015
 pastry+0.015
Neg logit contributions
ascript-0.020
Unit-0.019
anse-0.017
arius-0.017
dial-0.016
Token you
Token ID345
Feature activation+1.92e-02

Pos logit contributions
\/\/+0.046
ivas+0.043
 equally+0.038
NESS+0.038
 pastry+0.038
Neg logit contributions
ascript-0.059
Unit-0.056
anse-0.050
arius-0.050
dial-0.047
Tokenalf
Token ID1604
Feature activation-1.69e-02

Pos logit contributions
ascript+0.077
Unit+0.072
anse+0.063
arius+0.063
dial+0.059
Neg logit contributions
\/\/-0.076
ivas-0.071
 equally-0.065
 pastry-0.064
NESS-0.064
Token,
Token ID11
Feature activation-2.67e-02

Pos logit contributions
\/\/+0.010
ivas+0.010
 equally+0.009
NESS+0.008
 pastry+0.008
Neg logit contributions
ascript-0.016
Unit-0.015
anse-0.013
arius-0.013
dial-0.013
Token Gim
Token ID41123
Feature activation+1.30e-01

Pos logit contributions
\/\/+0.018
ivas+0.017
 equally+0.015
NESS+0.015
 pastry+0.015
Neg logit contributions
ascript-0.023
Unit-0.022
anse-0.019
arius-0.019
dial-0.018
Tokenli
Token ID4528
Feature activation-2.61e-01

Pos logit contributions
ascript+0.135
Unit+0.130
anse+0.120
arius+0.120
dial+0.115
Neg logit contributions
\/\/-0.064
ivas-0.057
NESS-0.048
 equally-0.048
 framing-0.048
Token the
Token ID262
Feature activation-2.69e-02

Pos logit contributions
\/\/+0.161
ivas+0.150
 equally+0.132
NESS+0.131
 pastry+0.130
Neg logit contributions
ascript-0.240
Unit-0.227
anse-0.206
arius-0.205
dial-0.194
Token Dwarf
Token ID33196
Feature activation+5.66e-01

Pos logit contributions
\/\/+0.020
ivas+0.018
 equally+0.017
 pastry+0.017
 BALL+0.017
Neg logit contributions
ascript-0.022
Unit-0.020
anse-0.018
arius-0.018
dial-0.017
Token,
Token ID11
Feature activation-3.33e-02

Pos logit contributions
ascript+0.504
Unit+0.479
anse+0.433
arius+0.433
dial+0.409
Neg logit contributions
\/\/-0.364
ivas-0.339
NESS-0.299
 equally-0.298
 framing-0.296
Token Leg
Token ID3564
Feature activation+2.95e-02

Pos logit contributions
\/\/+0.023
ivas+0.021
 equally+0.019
NESS+0.019
 pastry+0.019
Neg logit contributions
ascript-0.028
Unit-0.027
anse-0.024
arius-0.024
dial-0.023
Tokenolas
Token ID12456
Feature activation+1.03e-02

Pos logit contributions
ascript+0.028
Unit+0.027
anse+0.024
arius+0.024
enment+0.023
Neg logit contributions
\/\/-0.017
ivas-0.016
 equally-0.014
 pastry-0.013
notes-0.013
Token the
Token ID262
Feature activation-1.49e-02

Pos logit contributions
ascript+0.009
Unit+0.009
anse+0.008
arius+0.008
dial+0.008
Neg logit contributions
\/\/-0.006
ivas-0.006
NESS-0.005
 equally-0.005
 pastry-0.005
Token Elf
Token ID19067
Feature activation+9.09e-02

Pos logit contributions
\/\/+0.012
ivas+0.011
 equally+0.010
NESS+0.010
 pastry+0.010
Neg logit contributions
ascript-0.011
Unit-0.011
anse-0.009
arius-0.009
dial-0.009
Token two
Token ID734
Feature activation-4.89e-02

Pos logit contributions
\/\/+0.013
ivas+0.012
 equally+0.011
NESS+0.011
 pastry+0.011
Neg logit contributions
ascript-0.016
Unit-0.015
anse-0.013
arius-0.013
dial-0.012
Token
Token ID564
Feature activation+4.37e-02

Pos logit contributions
\/\/+0.033
ivas+0.031
NESS+0.028
 equally+0.028
 pastry+0.028
Neg logit contributions
ascript-0.042
Unit-0.039
anse-0.035
arius-0.035
dial-0.033
Token
Token ID250
Feature activation-5.81e-03

Pos logit contributions
ascript+0.034
Unit+0.032
anse+0.028
arius+0.028
dial+0.026
Neg logit contributions
\/\/-0.033
ivas-0.032
 equally-0.029
NESS-0.028
 framing-0.028
Tokenm
Token ID76
Feature activation-2.78e-01

Pos logit contributions
\/\/+0.004
ivas+0.004
 equally+0.004
NESS+0.004
 pastry+0.004
Neg logit contributions
ascript-0.005
Unit-0.004
anse-0.004
arius-0.004
dial-0.004
Tokenaint
Token ID2913
Feature activation+1.90e-01

Pos logit contributions
\/\/+0.223
ivas+0.211
 equally+0.192
NESS+0.191
 pastry+0.190
Neg logit contributions
ascript-0.203
Unit-0.190
anse-0.168
arius-0.166
dial-0.156
Tokenained
Token ID1328
Feature activation+5.63e-01

Pos logit contributions
ascript+0.052
Unit+0.043
anse+0.027
arius+0.026
dial+0.019
Neg logit contributions
\/\/-0.241
ivas-0.233
 equally-0.220
NESS-0.218
 pastry-0.218
Token frequent
Token ID10792
Feature activation+1.86e-02

Pos logit contributions
ascript+0.479
Unit+0.453
anse+0.408
arius+0.407
dial+0.383
Neg logit contributions
\/\/-0.384
ivas-0.360
NESS-0.321
 equally-0.318
 pastry-0.317
Token,
Token ID11
Feature activation-1.81e-02

Pos logit contributions
ascript+0.016
Unit+0.015
anse+0.014
arius+0.014
dial+0.013
Neg logit contributions
\/\/-0.012
ivas-0.011
NESS-0.010
 equally-0.010
 pastry-0.010
Token direct
Token ID1277
Feature activation+8.46e-02

Pos logit contributions
\/\/+0.012
ivas+0.011
 equally+0.010
NESS+0.010
 pastry+0.010
Neg logit contributions
ascript-0.016
Unit-0.015
anse-0.013
arius-0.013
dial-0.012
Token contact
Token ID2800
Feature activation-1.20e-01

Pos logit contributions
ascript+0.075
Unit+0.071
anse+0.064
arius+0.064
dial+0.060
Neg logit contributions
\/\/-0.055
ivas-0.051
NESS-0.045
 equally-0.045
 pastry-0.045
Token
Token ID447
Feature activation+4.17e-03

Pos logit contributions
\/\/+0.087
ivas+0.082
 equally+0.073
NESS+0.073
 pastry+0.073
Neg logit contributions
ascript-0.098
Unit-0.092
anse-0.083
arius-0.082
dial-0.077
Token on
Token ID319
Feature activation+5.56e-02

Pos logit contributions
ascript+0.130
Unit+0.124
anse+0.112
arius+0.112
dial+0.106
Neg logit contributions
\/\/-0.086
ivas-0.080
NESS-0.070
 equally-0.070
 pastry-0.069
Token the
Token ID262
Feature activation+1.00e-03

Pos logit contributions
ascript+0.048
Unit+0.045
anse+0.041
arius+0.040
dial+0.038
Neg logit contributions
\/\/-0.038
ivas-0.035
 equally-0.032
NESS-0.031
 pastry-0.031
Token k
Token ID479
Feature activation-4.23e-02

Pos logit contributions
ascript+0.001
Unit+0.001
anse+0.001
arius+0.001
dial+0.001
Neg logit contributions
\/\/-0.001
ivas-0.001
 equally-0.001
 pastry-0.001
NESS-0.001
Tokenart
Token ID433
Feature activation-1.41e-01

Pos logit contributions
\/\/+0.025
ivas+0.023
NESS+0.021
 equally+0.020
 framing+0.020
Neg logit contributions
ascript-0.040
Unit-0.038
anse-0.034
arius-0.034
dial-0.032
Tokening
Token ID278
Feature activation-1.95e-01

Pos logit contributions
\/\/+0.097
ivas+0.090
NESS+0.080
 equally+0.080
 pastry+0.080
Neg logit contributions
ascript-0.121
Unit-0.114
anse-0.103
arius-0.102
dial-0.097
Token circuit
Token ID10349
Feature activation+5.58e-01

Pos logit contributions
\/\/+0.218
ivas+0.210
 equally+0.198
 pastry+0.197
NESS+0.196
Neg logit contributions
ascript-0.081
Unit-0.071
anse-0.055
arius-0.055
dial-0.046
Token,
Token ID11
Feature activation+2.60e-02

Pos logit contributions
ascript+0.501
Unit+0.474
anse+0.429
arius+0.428
dial+0.404
Neg logit contributions
\/\/-0.356
ivas-0.331
NESS-0.292
 equally-0.292
 pastry-0.290
Token starting
Token ID3599
Feature activation+4.92e-02

Pos logit contributions
ascript+0.023
Unit+0.022
anse+0.020
arius+0.020
dial+0.019
Neg logit contributions
\/\/-0.017
ivas-0.016
 equally-0.014
 pastry-0.014
NESS-0.014
Token at
Token ID379
Feature activation-7.13e-02

Pos logit contributions
ascript+0.046
Unit+0.043
anse+0.039
arius+0.039
dial+0.037
Neg logit contributions
\/\/-0.030
ivas-0.028
 equally-0.025
NESS-0.025
 pastry-0.025
Token the
Token ID262
Feature activation+2.64e-03

Pos logit contributions
\/\/+0.050
ivas+0.046
 equally+0.042
NESS+0.042
 pastry+0.041
Neg logit contributions
ascript-0.060
Unit-0.056
anse-0.051
arius-0.050
dial-0.047
Token tender
Token ID15403
Feature activation-5.47e-02

Pos logit contributions
ascript+0.002
Unit+0.002
arius+0.001
anse+0.001
dial+0.001
Neg logit contributions
\/\/-0.002
ivas-0.002
 equally-0.002
 pastry-0.002
NESS-0.002
Token
Token ID564
Feature activation+1.24e-01

Pos logit contributions
ascript+0.022
Unit+0.021
anse+0.019
arius+0.019
dial+0.017
Neg logit contributions
\/\/-0.021
ivas-0.020
NESS-0.018
 equally-0.018
 pastry-0.018
Token
Token ID250
Feature activation+1.48e-02

Pos logit contributions
ascript+0.086
Unit+0.080
anse+0.070
arius+0.070
dial+0.065
Neg logit contributions
\/\/-0.105
ivas-0.100
 equally-0.091
NESS-0.091
 pastry-0.090
Tokenb
Token ID65
Feature activation-2.37e-02

Pos logit contributions
ascript+0.012
Unit+0.012
anse+0.010
arius+0.010
dial+0.010
Neg logit contributions
\/\/-0.010
ivas-0.010
 equally-0.009
NESS-0.009
 pastry-0.009
Tokenunch
Token ID3316
Feature activation+1.51e-01

Pos logit contributions
\/\/+0.019
ivas+0.018
 equally+0.016
NESS+0.016
 pastry+0.016
Neg logit contributions
ascript-0.017
Unit-0.016
anse-0.014
arius-0.014
dial-0.013
Token of
Token ID286
Feature activation+3.28e-02

Pos logit contributions
ascript+0.137
Unit+0.130
anse+0.118
arius+0.117
dial+0.111
Neg logit contributions
\/\/-0.095
ivas-0.088
NESS-0.078
 equally-0.077
 pastry-0.077
Token mag
Token ID2153
Feature activation+5.55e-01

Pos logit contributions
ascript+0.029
Unit+0.028
anse+0.025
arius+0.025
dial+0.024
Neg logit contributions
\/\/-0.021
ivas-0.020
NESS-0.017
 equally-0.017
 pastry-0.017
Tokeng
Token ID70
Feature activation+1.27e-02

Pos logit contributions
ascript+0.507
Unit+0.481
anse+0.437
arius+0.434
dial+0.412
Neg logit contributions
\/\/-0.345
ivas-0.318
NESS-0.282
 equally-0.281
 pastry-0.279
Tokenots
Token ID1747
Feature activation+3.53e-03

Pos logit contributions
ascript+0.011
Unit+0.011
anse+0.010
arius+0.010
dial+0.009
Neg logit contributions
\/\/-0.008
ivas-0.008
 equally-0.007
NESS-0.007
 pastry-0.007
Token.
Token ID13
Feature activation+6.53e-02

Pos logit contributions
ascript+0.003
Unit+0.003
anse+0.003
arius+0.003
dial+0.002
Neg logit contributions
\/\/-0.002
ivas-0.002
NESS-0.002
 equally-0.002
 pastry-0.002
Token
Token ID447
Feature activation+7.10e-02

Pos logit contributions
ascript+0.058
Unit+0.054
anse+0.049
arius+0.049
dial+0.046
Neg logit contributions
\/\/-0.043
ivas-0.040
 equally-0.035
NESS-0.035
 BALL-0.035
Token
Token ID251
Feature activation+2.54e-02

Pos logit contributions
ascript+0.061
Unit+0.057
anse+0.051
arius+0.051
dial+0.048
Neg logit contributions
\/\/-0.048
ivas-0.045
 equally-0.040
NESS-0.040
 pastry-0.040
Token ways
Token ID2842
Feature activation-1.06e-01

Pos logit contributions
ascript+0.120
Unit+0.114
anse+0.105
arius+0.104
dial+0.099
Neg logit contributions
\/\/-0.067
ivas-0.062
 equally-0.053
NESS-0.053
 pastry-0.053
Token.
Token ID13
Feature activation+6.36e-02

Pos logit contributions
\/\/+0.073
ivas+0.068
NESS+0.060
 equally+0.060
 pastry+0.060
Neg logit contributions
ascript-0.090
Unit-0.085
anse-0.077
arius-0.076
dial-0.072
Token
Token ID198
Feature activation+6.78e-02

Pos logit contributions
ascript+0.057
Unit+0.054
anse+0.049
arius+0.049
dial+0.046
Neg logit contributions
\/\/-0.041
ivas-0.038
 equally-0.034
NESS-0.034
 pastry-0.033
Token
Token ID198
Feature activation+8.71e-02

Pos logit contributions
ascript+0.060
Unit+0.057
anse+0.051
arius+0.051
dial+0.048
Neg logit contributions
\/\/-0.044
ivas-0.041
 equally-0.036
NESS-0.036
 pastry-0.036
TokenHist
Token ID13749
Feature activation+1.95e-01

Pos logit contributions
ascript+0.071
Unit+0.067
anse+0.060
arius+0.060
dial+0.056
Neg logit contributions
\/\/-0.063
ivas-0.059
NESS-0.053
 equally-0.053
 pastry-0.052
Tokenorian
Token ID22618
Feature activation+5.48e-01

Pos logit contributions
ascript+0.114
Unit+0.105
anse+0.089
arius+0.088
dial+0.080
Neg logit contributions
\/\/-0.186
ivas-0.177
NESS-0.164
 equally-0.164
 pastry-0.163
Token Regina
Token ID38739
Feature activation+1.21e-01

Pos logit contributions
ascript+0.485
Unit+0.461
anse+0.417
arius+0.415
dial+0.392
Neg logit contributions
\/\/-0.354
ivas-0.329
NESS-0.293
 equally-0.290
 pastry-0.289
Token Mor
Token ID3461
Feature activation+3.58e-02

Pos logit contributions
ascript+0.103
Unit+0.097
anse+0.088
arius+0.087
dial+0.082
Neg logit contributions
\/\/-0.083
ivas-0.077
 equally-0.069
NESS-0.069
 pastry-0.068
Tokenantz
Token ID46269
Feature activation+1.16e-02

Pos logit contributions
ascript+0.029
Unit+0.028
anse+0.025
arius+0.025
dial+0.023
Neg logit contributions
\/\/-0.026
ivas-0.024
NESS-0.022
 equally-0.022
 pastry-0.022
Token highlighted
Token ID14537
Feature activation+9.33e-02

Pos logit contributions
ascript+0.010
Unit+0.009
anse+0.008
arius+0.008
dial+0.008
Neg logit contributions
\/\/-0.008
ivas-0.008
NESS-0.007
 equally-0.007
 pastry-0.007
Token the
Token ID262
Feature activation+7.79e-02

Pos logit contributions
ascript+0.084
Unit+0.079
anse+0.072
arius+0.072
dial+0.068
Neg logit contributions
\/\/-0.060
ivas-0.055
NESS-0.049
 equally-0.048
 pastry-0.048
Token Ay
Token ID13709
Feature activation-1.39e-01

Pos logit contributions
ascript+0.018
Unit+0.017
anse+0.016
arius+0.016
dial+0.015
Neg logit contributions
\/\/-0.010
ivas-0.009
NESS-0.008
 equally-0.008
 pastry-0.008
Tokenat
Token ID265
Feature activation+9.29e-02

Pos logit contributions
\/\/+0.102
ivas+0.096
 equally+0.087
NESS+0.086
 BALL+0.086
Neg logit contributions
ascript-0.111
Unit-0.105
anse-0.093
arius-0.093
dial-0.087
Tokenollah
Token ID17999
Feature activation+5.79e-03

Pos logit contributions
ascript+0.074
Unit+0.069
anse+0.062
arius+0.062
dial+0.058
Neg logit contributions
\/\/-0.069
ivas-0.065
NESS-0.058
 equally-0.058
 framing-0.058
Token Ali
Token ID12104
Feature activation-5.67e-02

Pos logit contributions
ascript+0.006
Unit+0.005
anse+0.005
arius+0.005
dial+0.005
Neg logit contributions
\/\/-0.003
ivas-0.003
 equally-0.003
NESS-0.003
 BALL-0.003
Token Kham
Token ID45159
Feature activation+2.44e-01

Pos logit contributions
\/\/+0.053
ivas+0.050
NESS+0.047
 pastry+0.046
 equally+0.046
Neg logit contributions
ascript-0.034
Unit-0.032
anse-0.027
arius-0.027
dial-0.025
Tokenenei
Token ID46009
Feature activation+5.46e-01

Pos logit contributions
ascript+0.110
Unit+0.098
anse+0.078
arius+0.078
dial+0.067
Neg logit contributions
\/\/-0.266
ivas-0.255
 equally-0.238
NESS-0.237
 framing-0.236
Token,
Token ID11
Feature activation+3.33e-02

Pos logit contributions
ascript+0.476
Unit+0.452
anse+0.409
arius+0.409
dial+0.383
Neg logit contributions
\/\/-0.362
ivas-0.334
NESS-0.299
 equally-0.295
 pastry-0.295
Token warned
Token ID7728
Feature activation+9.39e-02

Pos logit contributions
ascript+0.027
Unit+0.026
anse+0.023
arius+0.023
dial+0.021
Neg logit contributions
\/\/-0.024
ivas-0.022
 equally-0.020
NESS-0.020
 pastry-0.020
Token that
Token ID326
Feature activation-3.83e-02

Pos logit contributions
ascript+0.085
Unit+0.080
anse+0.073
arius+0.073
dial+0.069
Neg logit contributions
\/\/-0.060
ivas-0.055
NESS-0.049
 pastry-0.048
 equally-0.048
Token the
Token ID262
Feature activation+4.36e-03

Pos logit contributions
\/\/+0.024
ivas+0.022
NESS+0.020
 equally+0.020
 pastry+0.020
Neg logit contributions
ascript-0.035
Unit-0.033
anse-0.030
arius-0.030
dial-0.028
Token day
Token ID1110
Feature activation+5.00e-02

Pos logit contributions
ascript+0.004
Unit+0.003
anse+0.003
arius+0.003
dial+0.003
Neg logit contributions
\/\/-0.003
ivas-0.003
 equally-0.003
 framing-0.003
 pastry-0.003
Token Ole
Token ID30093
Feature activation+9.13e-02

Pos logit contributions
ascript+0.046
Unit+0.043
anse+0.039
arius+0.039
dial+0.036
Neg logit contributions
\/\/-0.037
ivas-0.035
 equally-0.031
NESS-0.031
 pastry-0.031
Tokench
Token ID354
Feature activation+2.15e-03

Pos logit contributions
ascript+0.077
Unit+0.073
anse+0.066
arius+0.065
dial+0.062
Neg logit contributions
\/\/-0.063
ivas-0.059
 equally-0.052
NESS-0.052
 pastry-0.052
Tokenowski
Token ID12079
Feature activation+4.01e-02

Pos logit contributions
ascript+0.002
Unit+0.002
anse+0.002
arius+0.002
dial+0.001
Neg logit contributions
\/\/-0.001
ivas-0.001
NESS-0.001
 equally-0.001
 pastry-0.001
Token,
Token ID11
Feature activation+1.30e-02

Pos logit contributions
ascript+0.036
Unit+0.034
anse+0.031
arius+0.031
dial+0.029
Neg logit contributions
\/\/-0.025
ivas-0.024
NESS-0.021
 equally-0.021
 pastry-0.021
Token a
Token ID257
Feature activation+2.82e-02

Pos logit contributions
ascript+0.012
Unit+0.011
anse+0.010
arius+0.010
dial+0.009
Neg logit contributions
\/\/-0.008
ivas-0.008
NESS-0.007
 equally-0.007
 pastry-0.007
Token Qu
Token ID2264
Feature activation+5.43e-01

Pos logit contributions
ascript+0.024
Unit+0.023
anse+0.021
arius+0.021
dial+0.019
Neg logit contributions
\/\/-0.019
ivas-0.018
 equally-0.016
NESS-0.016
 pastry-0.016
Tokenist
Token ID396
Feature activation+4.39e-02

Pos logit contributions
ascript+0.469
Unit+0.442
anse+0.398
arius+0.396
dial+0.374
Neg logit contributions
\/\/-0.366
ivas-0.342
 equally-0.304
NESS-0.303
 pastry-0.301
Token spokeswoman
Token ID12262
Feature activation+5.00e-02

Pos logit contributions
ascript+0.038
Unit+0.036
anse+0.033
arius+0.032
dial+0.030
Neg logit contributions
\/\/-0.029
ivas-0.027
 equally-0.024
NESS-0.024
 pastry-0.024
Token,
Token ID11
Feature activation+2.36e-03

Pos logit contributions
ascript+0.045
Unit+0.043
anse+0.039
arius+0.039
dial+0.037
Neg logit contributions
\/\/-0.032
ivas-0.029
NESS-0.026
 BALL-0.026
 equally-0.026
Token said
Token ID531
Feature activation+3.33e-04

Pos logit contributions
ascript+0.002
Unit+0.002
anse+0.002
arius+0.002
dial+0.001
Neg logit contributions
\/\/-0.002
ivas-0.002
NESS-0.002
 equally-0.002
 pastry-0.002
Token the
Token ID262
Feature activation-3.63e-02

Pos logit contributions
ascript+0.000
Unit+0.000
anse+0.000
arius+0.000
dial+0.000
Neg logit contributions
\/\/-0.000
ivas-0.000
NESS-0.000
 equally-0.000
 BALL-0.000
Token of
Token ID286
Feature activation+2.28e-02

Pos logit contributions
\/\/+0.039
ivas+0.036
NESS+0.032
 equally+0.032
 pastry+0.032
Neg logit contributions
ascript-0.058
Unit-0.055
anse-0.050
arius-0.050
dial-0.047
Token a
Token ID257
Feature activation+4.99e-02

Pos logit contributions
ascript+0.020
Unit+0.019
anse+0.017
arius+0.017
dial+0.016
Neg logit contributions
\/\/-0.015
ivas-0.014
 equally-0.013
NESS-0.012
 framing-0.012
Token political
Token ID1964
Feature activation+8.86e-02

Pos logit contributions
ascript+0.042
Unit+0.039
anse+0.035
arius+0.035
dial+0.033
Neg logit contributions
\/\/-0.035
ivas-0.033
 equally-0.029
NESS-0.029
 framing-0.029
Token
Token ID564
Feature activation+5.96e-02

Pos logit contributions
ascript+0.076
Unit+0.072
anse+0.065
arius+0.065
dial+0.061
Neg logit contributions
\/\/-0.060
ivas-0.056
NESS-0.050
 equally-0.050
 pastry-0.049
Token
Token ID246
Feature activation+1.71e-01

Pos logit contributions
ascript+0.036
Unit+0.033
anse+0.029
arius+0.028
dial+0.026
Neg logit contributions
\/\/-0.055
ivas-0.053
 equally-0.049
NESS-0.049
 framing-0.048
Tokendial
Token ID38969
Feature activation+5.32e-01

Pos logit contributions
ascript+0.140
Unit+0.132
anse+0.118
arius+0.118
dial+0.110
Neg logit contributions
\/\/-0.122
ivas-0.114
 equally-0.103
NESS-0.103
 pastry-0.102
Tokenogue
Token ID5119
Feature activation-1.29e-01

Pos logit contributions
ascript+0.541
Unit+0.516
anse+0.473
arius+0.471
dial+0.449
Neg logit contributions
\/\/-0.276
ivas-0.252
 equally-0.215
NESS-0.215
 pastry-0.213
Token
Token ID447
Feature activation+7.09e-02

Pos logit contributions
\/\/+0.086
ivas+0.080
 equally+0.072
NESS+0.071
 framing+0.071
Neg logit contributions
ascript-0.112
Unit-0.106
anse-0.095
arius-0.095
dial-0.090
Token
Token ID247
Feature activation+8.65e-02

Pos logit contributions
ascript+0.045
Unit+0.042
anse+0.036
arius+0.036
dial+0.033
Neg logit contributions
\/\/-0.064
ivas-0.061
NESS-0.056
 equally-0.056
 pastry-0.056
Token.
Token ID13
Feature activation-3.64e-03

Pos logit contributions
ascript+0.076
Unit+0.072
anse+0.065
arius+0.064
dial+0.061
Neg logit contributions
\/\/-0.057
ivas-0.053
 equally-0.047
NESS-0.047
 pastry-0.047
Token
Token ID198
Feature activation-6.52e-03

Pos logit contributions
\/\/+0.002
ivas+0.002
 equally+0.002
NESS+0.002
 pastry+0.002
Neg logit contributions
ascript-0.003
Unit-0.003
anse-0.003
arius-0.003
dial-0.003
Token handling
Token ID9041
Feature activation-8.40e-02

Pos logit contributions
\/\/+0.000
ivas+0.000
 equally+0.000
NESS+0.000
 framing+0.000
Neg logit contributions
ascript-0.001
Unit-0.000
anse-0.000
arius-0.000
dial-0.000
Token.
Token ID13
Feature activation+6.26e-03

Pos logit contributions
\/\/+0.056
ivas+0.052
 equally+0.046
NESS+0.046
 framing+0.046
Neg logit contributions
ascript-0.074
Unit-0.069
anse-0.063
arius-0.063
dial-0.059
Token Best
Token ID6705
Feature activation-1.36e-01

Pos logit contributions
ascript+0.005
Unit+0.005
anse+0.005
arius+0.005
dial+0.004
Neg logit contributions
\/\/-0.004
ivas-0.004
 equally-0.004
NESS-0.004
 BALL-0.003
Token for
Token ID329
Feature activation+1.38e-02

Pos logit contributions
\/\/+0.087
ivas+0.081
 equally+0.072
NESS+0.072
 pastry+0.071
Neg logit contributions
ascript-0.122
Unit-0.115
anse-0.104
arius-0.104
dial-0.098
Token precision
Token ID15440
Feature activation+1.53e-02

Pos logit contributions
ascript+0.012
Unit+0.012
anse+0.011
arius+0.010
dial+0.010
Neg logit contributions
\/\/-0.009
ivas-0.008
 equally-0.007
NESS-0.007
 framing-0.007
Token circuit
Token ID10349
Feature activation+5.29e-01

Pos logit contributions
ascript+0.014
Unit+0.013
anse+0.012
arius+0.012
dial+0.011
Neg logit contributions
\/\/-0.009
ivas-0.009
 equally-0.008
NESS-0.008
 framing-0.008
Token rac
Token ID3444
Feature activation-1.89e-01

Pos logit contributions
ascript+0.417
Unit+0.392
anse+0.349
arius+0.347
dial+0.325
Neg logit contributions
\/\/-0.396
ivas-0.372
 equally-0.336
NESS-0.335
 pastry-0.334
Tokeners
Token ID364
Feature activation-3.34e-01

Pos logit contributions
\/\/+0.075
ivas+0.067
 equally+0.054
NESS+0.054
 pastry+0.053
Neg logit contributions
ascript-0.215
Unit-0.206
anse-0.191
arius-0.190
dial-0.182
Token who
Token ID508
Feature activation+4.71e-02

Pos logit contributions
\/\/+0.224
ivas+0.209
 equally+0.188
 framing+0.186
NESS+0.186
Neg logit contributions
ascript-0.290
Unit-0.274
anse-0.247
arius-0.246
dial-0.231
Token aim
Token ID4031
Feature activation+1.81e-01

Pos logit contributions
ascript+0.042
Unit+0.040
anse+0.036
arius+0.036
dial+0.034
Neg logit contributions
\/\/-0.030
ivas-0.028
 equally-0.025
 pastry-0.025
NESS-0.025
Token for
Token ID329
Feature activation-4.22e-02

Pos logit contributions
ascript+0.161
Unit+0.152
anse+0.138
arius+0.137
dial+0.130
Neg logit contributions
\/\/-0.117
ivas-0.108
NESS-0.096
 equally-0.096
 pastry-0.095
Token.
Token ID13
Feature activation+3.27e-02

Pos logit contributions
ascript+0.022
Unit+0.021
anse+0.019
arius+0.019
dial+0.018
Neg logit contributions
\/\/-0.016
ivas-0.015
 equally-0.013
NESS-0.013
 pastry-0.013
Token
Token ID198
Feature activation+3.52e-02

Pos logit contributions
ascript+0.030
Unit+0.028
anse+0.026
arius+0.026
dial+0.024
Neg logit contributions
\/\/-0.020
ivas-0.019
NESS-0.016
 equally-0.016
 pastry-0.016
Token
Token ID198
Feature activation+3.94e-02

Pos logit contributions
ascript+0.031
Unit+0.029
anse+0.027
arius+0.026
dial+0.025
Neg logit contributions
\/\/-0.023
ivas-0.021
 equally-0.019
NESS-0.019
 pastry-0.019
TokenEn
Token ID4834
Feature activation+1.55e-02

Pos logit contributions
ascript+0.033
Unit+0.031
anse+0.028
arius+0.028
dial+0.026
Neg logit contributions
\/\/-0.028
ivas-0.026
NESS-0.023
 equally-0.023
 pastry-0.023
Tokenlight
Token ID2971
Feature activation-1.05e-01

Pos logit contributions
ascript+0.013
Unit+0.013
anse+0.011
arius+0.011
dial+0.011
Neg logit contributions
\/\/-0.010
ivas-0.010
 equally-0.009
NESS-0.009
 pastry-0.009
Tokenenment
Token ID23242
Feature activation+5.28e-01

Pos logit contributions
\/\/+0.134
ivas+0.128
NESS+0.122
 equally+0.122
 pastry+0.121
Neg logit contributions
ascript-0.027
Unit-0.022
anse-0.014
arius-0.014
dial-0.009
Token is
Token ID318
Feature activation+8.24e-03

Pos logit contributions
ascript+0.458
Unit+0.434
anse+0.391
arius+0.390
dial+0.368
Neg logit contributions
\/\/-0.353
ivas-0.329
 equally-0.291
NESS-0.291
 pastry-0.290
Token like
Token ID588
Feature activation+2.14e-01

Pos logit contributions
ascript+0.007
Unit+0.007
anse+0.006
arius+0.006
dial+0.006
Neg logit contributions
\/\/-0.006
ivas-0.005
 equally-0.005
NESS-0.005
 pastry-0.005
Token that
Token ID326
Feature activation-2.17e-02

Pos logit contributions
ascript+0.188
Unit+0.178
anse+0.161
arius+0.160
dial+0.151
Neg logit contributions
\/\/-0.141
ivas-0.131
NESS-0.116
 equally-0.116
 pastry-0.115
Token.
Token ID13
Feature activation+3.93e-02

Pos logit contributions
\/\/+0.014
ivas+0.013
NESS+0.012
 equally+0.012
 pastry+0.012
Neg logit contributions
ascript-0.019
Unit-0.018
anse-0.016
arius-0.016
dial-0.015
Token EVERY
Token ID27222
Feature activation+6.05e-02

Pos logit contributions
ascript+0.035
Unit+0.033
anse+0.030
arius+0.030
dial+0.028
Neg logit contributions
\/\/-0.025
ivas-0.023
NESS-0.021
 equally-0.021
 BALL-0.020
Token attack
Token ID1368
Feature activation-4.55e-02

Pos logit contributions
\/\/+0.073
ivas+0.068
 equally+0.060
NESS+0.060
 pastry+0.060
Neg logit contributions
ascript-0.094
Unit-0.089
anse-0.080
arius-0.080
dial-0.075
Token ads
Token ID9011
Feature activation-9.97e-04

Pos logit contributions
\/\/+0.025
ivas+0.022
NESS+0.019
 equally+0.019
 pastry+0.019
Neg logit contributions
ascript-0.045
Unit-0.043
anse-0.040
arius-0.039
dial-0.038
Token casting
Token ID13092
Feature activation+8.61e-02

Pos logit contributions
\/\/+0.001
ivas+0.001
NESS+0.001
 equally+0.001
 pastry+0.001
Neg logit contributions
ascript-0.001
Unit-0.001
anse-0.001
arius-0.001
dial-0.001
Token Mr
Token ID1770
Feature activation-2.19e-02

Pos logit contributions
ascript+0.076
Unit+0.072
arius+0.065
anse+0.065
dial+0.062
Neg logit contributions
\/\/-0.056
ivas-0.052
NESS-0.046
 equally-0.045
 pastry-0.045
Token.
Token ID13
Feature activation+4.36e-02

Pos logit contributions
\/\/+0.015
ivas+0.014
 equally+0.012
 BALL+0.012
NESS+0.012
Neg logit contributions
ascript-0.019
Unit-0.018
anse-0.016
arius-0.016
dial-0.015
Token Qu
Token ID2264
Feature activation+5.27e-01

Pos logit contributions
ascript+0.039
Unit+0.037
anse+0.033
arius+0.033
dial+0.031
Neg logit contributions
\/\/-0.028
ivas-0.026
NESS-0.023
 equally-0.023
 pastry-0.023
Tokenist
Token ID396
Feature activation+9.40e-02

Pos logit contributions
ascript+0.484
Unit+0.463
anse+0.421
arius+0.420
dial+0.395
Neg logit contributions
\/\/-0.325
ivas-0.296
NESS-0.266
 equally-0.264
 framing-0.263
Token as
Token ID355
Feature activation-1.06e-01

Pos logit contributions
ascript+0.090
Unit+0.086
arius+0.078
anse+0.078
dial+0.074
Neg logit contributions
\/\/-0.054
ivas-0.049
NESS-0.043
 pastry-0.043
 BALL-0.042
Token
Token ID564
Feature activation+1.45e-01

Pos logit contributions
\/\/+0.071
ivas+0.067
NESS+0.059
 equally+0.059
 pastry+0.059
Neg logit contributions
ascript-0.092
Unit-0.087
anse-0.079
arius-0.078
dial-0.074
Token
Token ID250
Feature activation+6.09e-02

Pos logit contributions
ascript+0.103
Unit+0.097
anse+0.085
arius+0.085
dial+0.078
Neg logit contributions
\/\/-0.119
ivas-0.113
 equally-0.103
NESS-0.103
 framing-0.102
Tokenmore
Token ID3549
Feature activation-5.00e-02

Pos logit contributions
ascript+0.049
Unit+0.046
anse+0.041
arius+0.041
dial+0.038
Neg logit contributions
\/\/-0.045
ivas-0.042
 equally-0.038
NESS-0.038
 pastry-0.038
Token poll
Token ID3278
Feature activation+1.10e-01

Pos logit contributions
ascript+0.190
Unit+0.178
anse+0.161
arius+0.160
dial+0.150
Neg logit contributions
\/\/-0.151
ivas-0.141
 equally-0.127
 pastry-0.127
NESS-0.127
Token shows
Token ID2523
Feature activation+6.04e-02

Pos logit contributions
ascript+0.095
Unit+0.089
anse+0.081
arius+0.080
dial+0.076
Neg logit contributions
\/\/-0.074
ivas-0.069
NESS-0.061
 equally-0.061
 pastry-0.061
Token that
Token ID326
Feature activation+2.65e-02

Pos logit contributions
ascript+0.057
Unit+0.054
anse+0.049
arius+0.049
dial+0.047
Neg logit contributions
\/\/-0.036
ivas-0.033
NESS-0.029
 equally-0.029
 pastry-0.029
Token Mr
Token ID1770
Feature activation-2.42e-02

Pos logit contributions
ascript+0.024
Unit+0.023
anse+0.021
arius+0.021
dial+0.019
Neg logit contributions
\/\/-0.017
ivas-0.016
NESS-0.014
 equally-0.014
 pastry-0.014
Token.
Token ID13
Feature activation+6.70e-02

Pos logit contributions
\/\/+0.016
ivas+0.015
 equally+0.013
NESS+0.013
 BALL+0.013
Neg logit contributions
ascript-0.021
Unit-0.020
anse-0.018
arius-0.018
dial-0.017
Token Qu
Token ID2264
Feature activation+5.26e-01

Pos logit contributions
ascript+0.038
Unit+0.035
anse+0.029
arius+0.029
dial+0.026
Neg logit contributions
\/\/-0.065
ivas-0.062
 equally-0.057
NESS-0.057
 BALL-0.057
Tokenist
Token ID396
Feature activation+6.60e-02

Pos logit contributions
ascript+0.454
Unit+0.428
anse+0.385
arius+0.384
dial+0.362
Neg logit contributions
\/\/-0.355
ivas-0.331
 equally-0.295
NESS-0.294
 pastry-0.292
Token has
Token ID468
Feature activation-3.78e-02

Pos logit contributions
ascript+0.060
Unit+0.057
anse+0.052
arius+0.052
dial+0.049
Neg logit contributions
\/\/-0.041
ivas-0.038
 equally-0.034
NESS-0.033
 pastry-0.033
Token cut
Token ID2005
Feature activation+2.55e-01

Pos logit contributions
\/\/+0.027
ivas+0.025
 equally+0.023
NESS+0.022
 pastry+0.022
Neg logit contributions
ascript-0.031
Unit-0.030
anse-0.026
arius-0.026
dial-0.025
Token Mr
Token ID1770
Feature activation-4.58e-02

Pos logit contributions
ascript+0.232
Unit+0.220
anse+0.199
arius+0.198
dial+0.188
Neg logit contributions
\/\/-0.160
ivas-0.149
NESS-0.131
 equally-0.131
 pastry-0.130
Token.
Token ID13
Feature activation+4.55e-02

Pos logit contributions
\/\/+0.031
ivas+0.029
 equally+0.026
NESS+0.026
 BALL+0.026
Neg logit contributions
ascript-0.039
Unit-0.037
anse-0.033
arius-0.033
dial-0.031
Token different
Token ID1180
Feature activation-5.22e-02

Pos logit contributions
\/\/+0.009
ivas+0.008
NESS+0.007
 equally+0.007
 BALL+0.007
Neg logit contributions
ascript-0.013
Unit-0.012
anse-0.011
arius-0.011
dial-0.010
Token concepts
Token ID10838
Feature activation+1.10e-01

Pos logit contributions
\/\/+0.033
ivas+0.031
NESS+0.027
 equally+0.027
 BALL+0.027
Neg logit contributions
ascript-0.047
Unit-0.045
anse-0.040
arius-0.040
dial-0.038
Token from
Token ID422
Feature activation-4.71e-02

Pos logit contributions
ascript+0.099
Unit+0.094
anse+0.085
arius+0.085
dial+0.080
Neg logit contributions
\/\/-0.069
ivas-0.064
NESS-0.057
 equally-0.056
 pastry-0.056
Token the
Token ID262
Feature activation+7.07e-03

Pos logit contributions
\/\/+0.031
ivas+0.029
NESS+0.026
 equally+0.026
 pastry+0.025
Neg logit contributions
ascript-0.041
Unit-0.039
anse-0.035
arius-0.035
dial-0.033
Token arg
Token ID1822
Feature activation+4.86e-02

Pos logit contributions
ascript+0.006
Unit+0.006
anse+0.005
arius+0.005
dial+0.005
Neg logit contributions
\/\/-0.005
ivas-0.005
 equally-0.004
NESS-0.004
 pastry-0.004
Tokenuable
Token ID7153
Feature activation+5.24e-01

Pos logit contributions
ascript+0.042
Unit+0.039
arius+0.035
anse+0.035
dial+0.033
Neg logit contributions
\/\/-0.033
ivas-0.031
 equally-0.027
 pastry-0.027
NESS-0.027
Token king
Token ID5822
Feature activation-3.34e-02

Pos logit contributions
ascript+0.449
Unit+0.424
anse+0.382
arius+0.380
dial+0.358
Neg logit contributions
\/\/-0.356
ivas-0.332
 equally-0.296
NESS-0.296
 pastry-0.294
Token of
Token ID286
Feature activation+3.35e-02

Pos logit contributions
\/\/+0.021
ivas+0.019
NESS+0.017
 equally+0.017
 pastry+0.017
Neg logit contributions
ascript-0.031
Unit-0.029
anse-0.026
arius-0.026
dial-0.025
Token curse
Token ID17328
Feature activation-8.85e-02

Pos logit contributions
ascript+0.030
Unit+0.028
anse+0.025
arius+0.025
dial+0.024
Neg logit contributions
\/\/-0.022
ivas-0.020
NESS-0.018
 equally-0.018
 pastry-0.018
Token,
Token ID11
Feature activation+2.33e-03

Pos logit contributions
\/\/+0.054
ivas+0.050
NESS+0.044
 equally+0.044
 pastry+0.043
Neg logit contributions
ascript-0.082
Unit-0.078
anse-0.071
arius-0.071
dial-0.067
Token Scott
Token ID4746
Feature activation+2.41e-02

Pos logit contributions
ascript+0.002
Unit+0.002
anse+0.002
arius+0.002
dial+0.002
Neg logit contributions
\/\/-0.002
ivas-0.001
NESS-0.001
 equally-0.001
 pastry-0.001
Token ho
Token ID8169
Feature activation+1.36e-01

Pos logit contributions
ascript+0.106
Unit+0.101
anse+0.091
arius+0.090
dial+0.085
Neg logit contributions
\/\/-0.082
ivas-0.076
NESS-0.068
 equally-0.068
 pastry-0.067
Tokenarded
Token ID10676
Feature activation+2.38e-01

Pos logit contributions
ascript+0.087
Unit+0.081
anse+0.070
arius+0.069
dial+0.064
Neg logit contributions
\/\/-0.121
ivas-0.115
 equally-0.106
NESS-0.106
 pastry-0.105
Token and
Token ID290
Feature activation+3.41e-02

Pos logit contributions
ascript+0.217
Unit+0.205
anse+0.186
arius+0.186
dial+0.175
Neg logit contributions
\/\/-0.150
ivas-0.139
NESS-0.122
 equally-0.121
 pastry-0.121
Token ready
Token ID3492
Feature activation+2.14e-01

Pos logit contributions
ascript+0.028
Unit+0.026
anse+0.024
arius+0.023
dial+0.022
Neg logit contributions
\/\/-0.024
ivas-0.023
 equally-0.021
NESS-0.021
 pastry-0.020
Token for
Token ID329
Feature activation+4.20e-02

Pos logit contributions
ascript+0.195
Unit+0.185
anse+0.167
arius+0.167
dial+0.158
Neg logit contributions
\/\/-0.135
ivas-0.125
 equally-0.110
NESS-0.110
 pastry-0.109
Token immediate
Token ID7103
Feature activation+5.22e-01

Pos logit contributions
ascript+0.038
Unit+0.036
anse+0.033
arius+0.033
dial+0.031
Neg logit contributions
\/\/-0.026
ivas-0.024
 equally-0.022
NESS-0.021
 pastry-0.021
Token export
Token ID10784
Feature activation+2.36e-02

Pos logit contributions
ascript+0.482
Unit+0.458
anse+0.416
arius+0.414
dial+0.392
Neg logit contributions
\/\/-0.320
ivas-0.296
NESS-0.260
 equally-0.260
 pastry-0.258
Token to
Token ID284
Feature activation+3.07e-02

Pos logit contributions
ascript+0.020
Unit+0.019
anse+0.017
arius+0.017
dial+0.016
Neg logit contributions
\/\/-0.016
ivas-0.015
NESS-0.013
 equally-0.013
 pastry-0.013
Token aid
Token ID6133
Feature activation-8.36e-02

Pos logit contributions
ascript+0.028
Unit+0.026
anse+0.024
arius+0.024
dial+0.023
Neg logit contributions
\/\/-0.019
ivas-0.018
NESS-0.016
 equally-0.016
 pastry-0.016
Token his
Token ID465
Feature activation-8.61e-03

Pos logit contributions
\/\/+0.053
ivas+0.049
NESS+0.043
 equally+0.043
 pastry+0.042
Neg logit contributions
ascript-0.076
Unit-0.072
anse-0.065
arius-0.065
dial-0.062
Token first
Token ID717
Feature activation+5.79e-02

Pos logit contributions
\/\/+0.006
ivas+0.005
 equally+0.005
NESS+0.005
 pastry+0.005
Neg logit contributions
ascript-0.007
Unit-0.007
anse-0.006
arius-0.006
dial-0.006
Token revision
Token ID18440
Feature activation+4.91e-04

Pos logit contributions
\/\/+0.058
ivas+0.054
 equally+0.048
NESS+0.048
 framing+0.048
Neg logit contributions
ascript-0.079
Unit-0.074
anse-0.067
arius-0.067
dial-0.063
Token.
Token ID13
Feature activation+5.43e-02

Pos logit contributions
ascript+0.000
Unit+0.000
anse+0.000
arius+0.000
dial+0.000
Neg logit contributions
\/\/-0.000
ivas-0.000
 equally-0.000
NESS-0.000
 framing-0.000
Token
Token ID198
Feature activation+3.44e-02

Pos logit contributions
ascript+0.048
Unit+0.045
anse+0.041
arius+0.041
dial+0.039
Neg logit contributions
\/\/-0.035
ivas-0.033
 equally-0.029
NESS-0.029
 pastry-0.029
Token
Token ID198
Feature activation+3.50e-02

Pos logit contributions
ascript+0.031
Unit+0.029
anse+0.026
arius+0.026
dial+0.025
Neg logit contributions
\/\/-0.022
ivas-0.021
 equally-0.018
NESS-0.018
 pastry-0.018
TokenCons
Token ID9444
Feature activation+1.59e-01

Pos logit contributions
ascript+0.028
Unit+0.027
anse+0.024
arius+0.024
dial+0.022
Neg logit contributions
\/\/-0.025
ivas-0.024
NESS-0.021
 equally-0.021
 pastry-0.021
Tokenidered
Token ID3089
Feature activation+5.21e-01

Pos logit contributions
ascript+0.128
Unit+0.121
anse+0.108
arius+0.107
dial+0.101
Neg logit contributions
\/\/-0.116
ivas-0.109
NESS-0.098
 equally-0.098
 pastry-0.098
Token the
Token ID262
Feature activation+3.35e-02

Pos logit contributions
ascript+0.457
Unit+0.433
anse+0.391
arius+0.389
dial+0.367
Neg logit contributions
\/\/-0.342
ivas-0.319
NESS-0.282
 equally-0.281
 pastry-0.280
Token economy
Token ID3773
Feature activation+4.57e-02

Pos logit contributions
ascript+0.029
Unit+0.027
anse+0.024
arius+0.024
dial+0.023
Neg logit contributions
\/\/-0.023
ivas-0.021
 equally-0.019
 framing-0.019
 pastry-0.019
Token most
Token ID749
Feature activation+1.39e-01

Pos logit contributions
ascript+0.039
Unit+0.037
anse+0.034
arius+0.033
dial+0.032
Neg logit contributions
\/\/-0.031
ivas-0.029
 equally-0.026
NESS-0.026
 framing-0.025
Token vulnerable
Token ID8826
Feature activation+1.04e-01

Pos logit contributions
ascript+0.122
Unit+0.115
anse+0.105
arius+0.104
dial+0.098
Neg logit contributions
\/\/-0.091
ivas-0.084
 equally-0.076
NESS-0.076
 framing-0.075
Token to
Token ID284
Feature activation+6.00e-02

Pos logit contributions
ascript+0.094
Unit+0.089
anse+0.080
arius+0.080
dial+0.076
Neg logit contributions
\/\/-0.066
ivas-0.061
 equally-0.054
NESS-0.054
 pastry-0.054
Token offered
Token ID4438
Feature activation-9.83e-03

Pos logit contributions
\/\/+0.016
ivas+0.015
 equally+0.014
NESS+0.014
 pastry+0.014
Neg logit contributions
ascript-0.017
Unit-0.016
anse-0.014
arius-0.014
dial-0.013
Token a
Token ID257
Feature activation+6.24e-03

Pos logit contributions
\/\/+0.006
ivas+0.006
NESS+0.005
 equally+0.005
 pastry+0.005
Neg logit contributions
ascript-0.009
Unit-0.008
anse-0.007
arius-0.007
dial-0.007
Token tweet
Token ID6126
Feature activation-8.57e-02

Pos logit contributions
ascript+0.006
Unit+0.005
anse+0.005
arius+0.005
dial+0.004
Neg logit contributions
\/\/-0.004
ivas-0.004
 equally-0.003
NESS-0.003
 pastry-0.003
Token of
Token ID286
Feature activation-3.68e-02

Pos logit contributions
\/\/+0.054
ivas+0.050
NESS+0.044
 equally+0.043
 pastry+0.043
Neg logit contributions
ascript-0.078
Unit-0.074
anse-0.067
arius-0.067
dial-0.063
Token condol
Token ID35059
Feature activation-4.43e-02

Pos logit contributions
\/\/+0.021
ivas+0.020
NESS+0.017
 equally+0.017
 pastry+0.017
Neg logit contributions
ascript-0.035
Unit-0.034
anse-0.031
arius-0.031
dial-0.029
Tokenence
Token ID594
Feature activation-5.95e-01

Pos logit contributions
\/\/+0.014
ivas+0.012
NESS+0.009
 equally+0.009
 pastry+0.009
Neg logit contributions
ascript-0.054
Unit-0.052
arius-0.049
anse-0.049
dial-0.047
Token to
Token ID284
Feature activation-1.15e-02

Pos logit contributions
\/\/+0.388
ivas+0.362
 equally+0.322
NESS+0.321
 pastry+0.319
Neg logit contributions
ascript-0.525
Unit-0.497
anse-0.449
arius-0.446
dial-0.421
Token those
Token ID883
Feature activation-1.57e-02

Pos logit contributions
\/\/+0.007
ivas+0.007
 equally+0.006
NESS+0.006
 pastry+0.006
Neg logit contributions
ascript-0.010
Unit-0.010
anse-0.009
arius-0.009
dial-0.008
Token affected
Token ID5676
Feature activation-2.18e-02

Pos logit contributions
\/\/+0.013
ivas+0.012
NESS+0.011
 equally+0.011
 pastry+0.011
Neg logit contributions
ascript-0.011
Unit-0.011
anse-0.009
arius-0.009
dial-0.009
Token by
Token ID416
Feature activation+3.12e-02

Pos logit contributions
\/\/+0.016
ivas+0.015
 equally+0.013
NESS+0.013
 pastry+0.013
Neg logit contributions
ascript-0.018
Unit-0.017
anse-0.015
arius-0.015
dial-0.014
Token the
Token ID262
Feature activation-3.43e-02

Pos logit contributions
ascript+0.027
Unit+0.026
arius+0.023
anse+0.023
dial+0.022
Neg logit contributions
\/\/-0.021
ivas-0.019
NESS-0.017
 equally-0.017
 pastry-0.017
TokenFO
Token ID6080
Feature activation+6.04e-02

Pos logit contributions
ascript+0.173
Unit+0.163
anse+0.144
arius+0.143
dial+0.134
Neg logit contributions
\/\/-0.181
ivas-0.171
 equally-0.155
 pastry-0.154
NESS-0.154
TokenOT
Token ID2394
Feature activation-1.81e-02

Pos logit contributions
ascript+0.062
Unit+0.060
anse+0.055
arius+0.055
dial+0.052
Neg logit contributions
\/\/-0.030
ivas-0.028
 equally-0.023
NESS-0.023
 pastry-0.023
Token T
Token ID309
Feature activation+3.84e-02

Pos logit contributions
\/\/+0.013
ivas+0.012
NESS+0.010
 equally+0.010
 BALL+0.010
Neg logit contributions
ascript-0.015
Unit-0.014
anse-0.013
arius-0.013
dial-0.012
TokenOC
Token ID4503
Feature activation+7.13e-02

Pos logit contributions
ascript+0.035
Unit+0.033
anse+0.030
arius+0.030
dial+0.028
Neg logit contributions
\/\/-0.024
ivas-0.022
NESS-0.020
 equally-0.020
 BALL-0.019
Token AK
Token ID15837
Feature activation-1.32e-01

Pos logit contributions
ascript+0.058
Unit+0.055
anse+0.049
arius+0.049
dial+0.046
Neg logit contributions
\/\/-0.051
ivas-0.048
NESS-0.043
 equally-0.043
 BALL-0.043
TokenNOW
Token ID45669
Feature activation-5.76e-01

Pos logit contributions
\/\/+0.067
ivas+0.061
 equally+0.052
NESS+0.052
 pastry+0.051
Neg logit contributions
ascript-0.136
Unit-0.130
anse-0.119
arius-0.119
dial-0.113
TokenLED
Token ID30465
Feature activation-8.49e-02

Pos logit contributions
\/\/+0.332
ivas+0.306
NESS+0.274
 equally+0.267
 BALL+0.266
Neg logit contributions
ascript-0.557
Unit-0.523
anse-0.481
arius-0.478
dial-0.449
TokenG
Token ID38
Feature activation+1.12e-01

Pos logit contributions
\/\/+0.064
ivas+0.060
NESS+0.055
 equally+0.054
 BALL+0.054
Neg logit contributions
ascript-0.067
Unit-0.062
anse-0.056
arius-0.055
dial-0.051
TokenED
Token ID1961
Feature activation+4.05e-02

Pos logit contributions
ascript+0.103
Unit+0.097
anse+0.088
arius+0.088
dial+0.083
Neg logit contributions
\/\/-0.069
ivas-0.064
NESS-0.057
 equally-0.057
 BALL-0.056
Token NET
Token ID30502
Feature activation-1.45e-01

Pos logit contributions
ascript+0.033
Unit+0.032
anse+0.028
arius+0.028
dial+0.027
Neg logit contributions
\/\/-0.029
ivas-0.027
 equally-0.024
NESS-0.024
 pastry-0.024
Token CALL
Token ID42815
Feature activation+1.12e-01

Pos logit contributions
\/\/+0.130
ivas+0.123
 equally+0.113
NESS+0.113
 pastry+0.113
Neg logit contributions
ascript-0.093
Unit-0.086
anse-0.075
arius-0.074
dial-0.068
Token library
Token ID5888
Feature activation-2.13e-01

Pos logit contributions
\/\/+0.019
ivas+0.017
 equally+0.016
NESS+0.016
 framing+0.016
Neg logit contributions
ascript-0.023
Unit-0.021
anse-0.019
arius-0.019
dial-0.018
Token together
Token ID1978
Feature activation-1.72e-01

Pos logit contributions
\/\/+0.142
ivas+0.132
 equally+0.119
NESS+0.118
 framing+0.117
Neg logit contributions
ascript-0.186
Unit-0.175
anse-0.158
arius-0.157
dial-0.149
Token:
Token ID25
Feature activation-1.91e-02

Pos logit contributions
\/\/+0.117
ivas+0.109
NESS+0.097
 pastry+0.096
 equally+0.096
Neg logit contributions
ascript-0.147
Unit-0.139
anse-0.125
arius-0.125
dial-0.117
Token
Token ID198
Feature activation-1.13e-01

Pos logit contributions
\/\/+0.013
ivas+0.012
NESS+0.010
 equally+0.010
 pastry+0.010
Neg logit contributions
ascript-0.017
Unit-0.016
anse-0.014
arius-0.014
dial-0.014
Token
Token ID198
Feature activation-9.50e-02

Pos logit contributions
\/\/+0.074
ivas+0.069
NESS+0.061
 equally+0.061
 pastry+0.061
Neg logit contributions
ascript-0.100
Unit-0.095
anse-0.085
arius-0.085
dial-0.080
Tokenpub
Token ID12984
Feature activation-5.71e-01

Pos logit contributions
\/\/+0.049
ivas+0.045
 equally+0.038
NESS+0.038
 pastry+0.038
Neg logit contributions
ascript-0.097
Unit-0.093
anse-0.085
arius-0.085
dial-0.081
Token struct
Token ID2878
Feature activation-2.40e-01

Pos logit contributions
\/\/+0.284
ivas+0.258
 equally+0.219
NESS+0.218
 pastry+0.216
Neg logit contributions
ascript-0.594
Unit-0.566
anse-0.520
arius-0.519
dial-0.495
Token Atomic
Token ID28976
Feature activation-1.54e-01

Pos logit contributions
\/\/+0.154
ivas+0.143
 equally+0.127
NESS+0.126
 pastry+0.125
Neg logit contributions
ascript-0.216
Unit-0.203
anse-0.184
arius-0.184
dial-0.174
Token <
Token ID1279
Feature activation-9.81e-02

Pos logit contributions
\/\/+0.096
ivas+0.089
 equally+0.078
NESS+0.078
 pastry+0.077
Neg logit contributions
ascript-0.141
Unit-0.134
anse-0.121
arius-0.121
dial-0.114
Token T
Token ID309
Feature activation-8.92e-02

Pos logit contributions
\/\/+0.067
ivas+0.062
 equally+0.056
NESS+0.056
 pastry+0.055
Neg logit contributions
ascript-0.084
Unit-0.079
anse-0.071
arius-0.071
dial-0.067
Token >
Token ID1875
Feature activation-9.54e-02

Pos logit contributions
\/\/+0.063
ivas+0.059
 equally+0.053
NESS+0.053
 pastry+0.053
Neg logit contributions
ascript-0.074
Unit-0.070
anse-0.062
arius-0.062
dial-0.058
Token drinks
Token ID11758
Feature activation+1.84e-01

Pos logit contributions
ascript+0.075
Unit+0.071
anse+0.064
arius+0.064
dial+0.060
Neg logit contributions
\/\/-0.056
ivas-0.052
NESS-0.047
 BALL-0.046
 equally-0.046
Token.
Token ID13
Feature activation-2.97e-02

Pos logit contributions
ascript+0.163
Unit+0.154
anse+0.139
arius+0.139
dial+0.131
Neg logit contributions
\/\/-0.120
ivas-0.112
 equally-0.100
 pastry-0.099
NESS-0.099
Token
Token ID198
Feature activation-1.27e-03

Pos logit contributions
\/\/+0.019
ivas+0.017
 equally+0.015
NESS+0.015
 pastry+0.015
Neg logit contributions
ascript-0.027
Unit-0.026
anse-0.023
arius-0.023
dial-0.022
Token
Token ID198
Feature activation+4.96e-03

Pos logit contributions
\/\/+0.001
ivas+0.001
 equally+0.001
NESS+0.001
 pastry+0.001
Neg logit contributions
ascript-0.001
Unit-0.001
anse-0.001
arius-0.001
dial-0.001
TokenPlan
Token ID20854
Feature activation-2.13e-02

Pos logit contributions
ascript+0.004
Unit+0.004
anse+0.003
arius+0.003
dial+0.003
Neg logit contributions
\/\/-0.004
ivas-0.003
NESS-0.003
 equally-0.003
 pastry-0.003
Tokenning
Token ID768
Feature activation-5.45e-01

Pos logit contributions
\/\/+0.011
ivas+0.010
 equally+0.009
NESS+0.008
 pastry+0.008
Neg logit contributions
ascript-0.022
Unit-0.021
anse-0.019
arius-0.019
dial-0.018
Token a
Token ID257
Feature activation-4.79e-02

Pos logit contributions
\/\/+0.359
ivas+0.333
 equally+0.298
NESS+0.297
 pastry+0.295
Neg logit contributions
ascript-0.479
Unit-0.452
anse-0.409
arius-0.407
dial-0.384
Token Christmas
Token ID6786
Feature activation-1.48e-02

Pos logit contributions
\/\/+0.030
ivas+0.027
 equally+0.024
NESS+0.024
 pastry+0.024
Neg logit contributions
ascript-0.044
Unit-0.042
anse-0.038
arius-0.038
dial-0.036
Token or
Token ID393
Feature activation+1.74e-03

Pos logit contributions
\/\/+0.009
ivas+0.009
NESS+0.008
 equally+0.008
 pastry+0.008
Neg logit contributions
ascript-0.013
Unit-0.012
anse-0.011
arius-0.011
dial-0.011
Token new
Token ID649
Feature activation-9.28e-03

Pos logit contributions
ascript+0.001
Unit+0.001
anse+0.001
arius+0.001
dial+0.001
Neg logit contributions
\/\/-0.001
ivas-0.001
NESS-0.001
 equally-0.001
 pastry-0.001
Token year
Token ID614
Feature activation+6.07e-02

Pos logit contributions
\/\/+0.007
ivas+0.007
 equally+0.006
NESS+0.006
 pastry+0.006
Neg logit contributions
ascript-0.007
Unit-0.007
anse-0.006
arius-0.006
dial-0.005
Token toss
Token ID12153
Feature activation-1.04e-01

Pos logit contributions
ascript+0.015
Unit+0.014
anse+0.012
arius+0.012
dial+0.012
Neg logit contributions
\/\/-0.011
ivas-0.010
 equally-0.009
NESS-0.009
 framing-0.009
Tokenin
Token ID259
Feature activation-4.70e-04

Pos logit contributions
\/\/+0.075
ivas+0.071
 equally+0.063
NESS+0.063
 pastry+0.063
Neg logit contributions
ascript-0.085
Unit-0.080
anse-0.071
arius-0.071
dial-0.067
Token
Token ID447
Feature activation-7.02e-02

Pos logit contributions
\/\/+0.000
ivas+0.000
NESS+0.000
 equally+0.000
 pastry+0.000
Neg logit contributions
ascript-0.000
Unit-0.000
anse-0.000
arius-0.000
dial-0.000
Token
Token ID247
Feature activation-2.82e-02

Pos logit contributions
\/\/+0.064
ivas+0.061
 equally+0.056
NESS+0.056
 framing+0.056
Neg logit contributions
ascript-0.044
Unit-0.040
anse-0.035
arius-0.035
dial-0.032
Token hyper
Token ID8718
Feature activation-1.24e-01

Pos logit contributions
\/\/+0.019
ivas+0.018
 equally+0.016
NESS+0.016
 pastry+0.016
Neg logit contributions
ascript-0.024
Unit-0.022
anse-0.020
arius-0.020
dial-0.019
Tokenbole
Token ID45693
Feature activation-5.21e-01

Pos logit contributions
\/\/+0.048
ivas+0.043
 equally+0.034
NESS+0.034
 pastry+0.034
Neg logit contributions
ascript-0.141
Unit-0.135
anse-0.126
arius-0.125
dial-0.120
Token here
Token ID994
Feature activation-1.50e-01

Pos logit contributions
\/\/+0.317
ivas+0.294
 equally+0.258
NESS+0.258
 pastry+0.256
Neg logit contributions
ascript-0.483
Unit-0.458
anse-0.416
arius-0.415
dial-0.393
Token.
Token ID13
Feature activation-3.42e-02

Pos logit contributions
\/\/+0.097
ivas+0.090
 equally+0.080
NESS+0.080
 pastry+0.079
Neg logit contributions
ascript-0.133
Unit-0.126
anse-0.114
arius-0.114
dial-0.107
Token
Token ID198
Feature activation-3.46e-02

Pos logit contributions
\/\/+0.022
ivas+0.020
NESS+0.018
 equally+0.018
 BALL+0.017
Neg logit contributions
ascript-0.031
Unit-0.029
anse-0.027
arius-0.027
dial-0.025
Token
Token ID198
Feature activation-1.99e-02

Pos logit contributions
\/\/+0.022
ivas+0.021
 equally+0.018
NESS+0.018
 pastry+0.018
Neg logit contributions
ascript-0.031
Unit-0.029
anse-0.026
arius-0.026
dial-0.025
TokenPART
Token ID30709
Feature activation-4.15e-02

Pos logit contributions
\/\/+0.014
ivas+0.013
NESS+0.011
 equally+0.011
 BALL+0.011
Neg logit contributions
ascript-0.017
Unit-0.016
anse-0.014
arius-0.014
dial-0.013
Token '
Token ID705
Feature activation-3.21e-02

Pos logit contributions
\/\/+0.062
ivas+0.058
 equally+0.052
NESS+0.052
 pastry+0.052
Neg logit contributions
ascript-0.078
Unit-0.073
anse-0.066
arius-0.066
dial-0.062
Tokena
Token ID64
Feature activation+1.74e-01

Pos logit contributions
\/\/+0.030
ivas+0.029
 equally+0.026
NESS+0.026
 pastry+0.026
Neg logit contributions
ascript-0.019
Unit-0.018
anse-0.015
arius-0.015
dial-0.014
Token.
Token ID764
Feature activation-1.88e-01

Pos logit contributions
ascript+0.158
Unit+0.149
anse+0.135
arius+0.135
dial+0.127
Neg logit contributions
\/\/-0.110
ivas-0.102
 equally-0.090
NESS-0.090
 pastry-0.090
Token
Token ID198
Feature activation-5.80e-02

Pos logit contributions
\/\/+0.120
ivas+0.111
NESS+0.098
 equally+0.098
 pastry+0.097
Neg logit contributions
ascript-0.168
Unit-0.160
anse-0.145
arius-0.144
dial-0.136
Token
Token ID198
Feature activation-4.29e-02

Pos logit contributions
\/\/+0.038
ivas+0.035
 equally+0.031
NESS+0.031
 pastry+0.031
Neg logit contributions
ascript-0.051
Unit-0.049
anse-0.044
arius-0.044
dial-0.041
Tokenpub
Token ID12984
Feature activation-5.19e-01

Pos logit contributions
\/\/+0.031
ivas+0.029
 equally+0.026
NESS+0.026
 pastry+0.026
Neg logit contributions
ascript-0.035
Unit-0.033
anse-0.029
arius-0.029
dial-0.028
Token struct
Token ID2878
Feature activation-1.80e-01

Pos logit contributions
\/\/+0.261
ivas+0.237
 equally+0.202
NESS+0.201
 pastry+0.199
Neg logit contributions
ascript-0.537
Unit-0.512
anse-0.470
arius-0.469
dial-0.447
Token Shared
Token ID39403
Feature activation-2.03e-01

Pos logit contributions
\/\/+0.116
ivas+0.108
 equally+0.095
NESS+0.095
 pastry+0.094
Neg logit contributions
ascript-0.162
Unit-0.153
anse-0.138
arius-0.138
dial-0.130
Token <
Token ID1279
Feature activation-8.76e-02

Pos logit contributions
\/\/+0.128
ivas+0.118
NESS+0.104
 equally+0.104
 pastry+0.104
Neg logit contributions
ascript-0.184
Unit-0.176
anse-0.159
arius-0.158
dial-0.149
Token '
Token ID705
Feature activation-4.04e-02

Pos logit contributions
\/\/+0.060
ivas+0.056
 equally+0.050
NESS+0.050
 pastry+0.049
Neg logit contributions
ascript-0.075
Unit-0.071
anse-0.064
arius-0.063
dial-0.060
Tokena
Token ID64
Feature activation+1.68e-01

Pos logit contributions
\/\/+0.038
ivas+0.037
NESS+0.034
 equally+0.034
 pastry+0.034
Neg logit contributions
ascript-0.024
Unit-0.022
anse-0.018
arius-0.018
dial-0.017
Token are
Token ID389
Feature activation-2.32e-02

Pos logit contributions
\/\/+0.070
ivas+0.066
 equally+0.059
NESS+0.059
 pastry+0.058
Neg logit contributions
ascript-0.086
Unit-0.082
anse-0.073
arius-0.073
dial-0.069
Token basically
Token ID6209
Feature activation-1.05e-01

Pos logit contributions
\/\/+0.015
ivas+0.014
NESS+0.012
 equally+0.012
 pastry+0.012
Neg logit contributions
ascript-0.021
Unit-0.020
anse-0.018
arius-0.018
dial-0.017
Token continuing
Token ID8282
Feature activation+4.65e-02

Pos logit contributions
\/\/+0.079
ivas+0.074
NESS+0.067
 equally+0.066
 BALL+0.066
Neg logit contributions
ascript-0.082
Unit-0.077
anse-0.069
arius-0.069
dial-0.064
Token to
Token ID284
Feature activation-1.66e-02

Pos logit contributions
ascript+0.040
Unit+0.038
anse+0.034
arius+0.034
dial+0.032
Neg logit contributions
\/\/-0.031
ivas-0.029
NESS-0.026
 equally-0.026
 pastry-0.025
Token be
Token ID307
Feature activation-4.89e-02

Pos logit contributions
\/\/+0.010
ivas+0.009
NESS+0.008
 equally+0.008
 pastry+0.008
Neg logit contributions
ascript-0.015
Unit-0.015
anse-0.013
arius-0.013
dial-0.013
Token equally
Token ID8603
Feature activation-5.12e-01

Pos logit contributions
\/\/+0.008
ivas+0.006
NESS+0.003
 BALL+0.003
 pastry+0.003
Neg logit contributions
ascript-0.067
Unit-0.064
anse-0.061
arius-0.060
dial-0.058
Token as
Token ID355
Feature activation-1.65e-01

Pos logit contributions
\/\/+0.258
ivas+0.236
NESS+0.200
 pastry+0.197
 BALL+0.196
Neg logit contributions
ascript-0.528
Unit-0.504
anse-0.463
arius-0.462
dial-0.440
Token tight
Token ID5381
Feature activation+8.57e-02

Pos logit contributions
\/\/+0.129
ivas+0.122
 equally+0.111
NESS+0.110
 pastry+0.110
Neg logit contributions
ascript-0.124
Unit-0.116
anse-0.103
arius-0.102
dial-0.095
Token-
Token ID12
Feature activation-5.69e-02

Pos logit contributions
ascript+0.081
Unit+0.077
anse+0.070
arius+0.070
dial+0.066
Neg logit contributions
\/\/-0.051
ivas-0.047
 equally-0.041
NESS-0.041
 pastry-0.040
Tokenf
Token ID69
Feature activation+7.64e-02

Pos logit contributions
\/\/+0.044
ivas+0.041
 equally+0.038
NESS+0.037
 pastry+0.037
Neg logit contributions
ascript-0.043
Unit-0.041
anse-0.036
arius-0.036
dial-0.033
Tokenisted
Token ID6347
Feature activation+9.66e-02

Pos logit contributions
ascript+0.053
Unit+0.050
anse+0.043
arius+0.043
dial+0.040
Neg logit contributions
\/\/-0.064
ivas-0.061
 equally-0.055
 pastry-0.055
NESS-0.055
Token are
Token ID389
Feature activation-2.05e-03

Pos logit contributions
\/\/+0.017
ivas+0.016
NESS+0.014
 pastry+0.014
 equally+0.014
Neg logit contributions
ascript-0.025
Unit-0.024
anse-0.022
arius-0.022
dial-0.021
Token basically
Token ID6209
Feature activation-1.03e-01

Pos logit contributions
\/\/+0.001
ivas+0.001
NESS+0.001
 equally+0.001
 pastry+0.001
Neg logit contributions
ascript-0.002
Unit-0.002
anse-0.002
arius-0.002
dial-0.002
Token continuing
Token ID8282
Feature activation+6.45e-02

Pos logit contributions
\/\/+0.072
ivas+0.067
NESS+0.060
 equally+0.060
 pastry+0.059
Neg logit contributions
ascript-0.086
Unit-0.082
anse-0.073
arius-0.073
dial-0.069
Token to
Token ID284
Feature activation-3.40e-02

Pos logit contributions
ascript+0.056
Unit+0.053
anse+0.048
arius+0.048
dial+0.045
Neg logit contributions
\/\/-0.043
ivas-0.040
NESS-0.035
 equally-0.035
 pastry-0.035
Token be
Token ID307
Feature activation-5.20e-02

Pos logit contributions
\/\/+0.021
ivas+0.020
NESS+0.017
 equally+0.017
 pastry+0.017
Neg logit contributions
ascript-0.031
Unit-0.030
anse-0.027
arius-0.027
dial-0.025
Token equally
Token ID8603
Feature activation-5.12e-01

Pos logit contributions
\/\/+0.035
ivas+0.032
NESS+0.029
 pastry+0.029
 equally+0.029
Neg logit contributions
ascript-0.045
Unit-0.043
arius-0.038
anse-0.038
dial-0.036
Token as
Token ID355
Feature activation-1.75e-01

Pos logit contributions
\/\/+0.337
ivas+0.313
NESS+0.278
 equally+0.277
 pastry+0.276
Neg logit contributions
ascript-0.451
Unit-0.427
anse-0.385
arius-0.384
dial-0.362
Token tight
Token ID5381
Feature activation+1.10e-01

Pos logit contributions
\/\/+0.116
ivas+0.108
 equally+0.096
NESS+0.096
 pastry+0.095
Neg logit contributions
ascript-0.153
Unit-0.144
anse-0.130
arius-0.130
dial-0.122
Token-
Token ID12
Feature activation-5.20e-02

Pos logit contributions
ascript+0.102
Unit+0.097
anse+0.088
arius+0.088
dial+0.083
Neg logit contributions
\/\/-0.067
ivas-0.062
 equally-0.054
NESS-0.054
 pastry-0.054
Tokenf
Token ID69
Feature activation+7.78e-02

Pos logit contributions
\/\/+0.031
ivas+0.028
 equally+0.025
NESS+0.025
 pastry+0.025
Neg logit contributions
ascript-0.049
Unit-0.047
anse-0.043
arius-0.042
dial-0.040
Tokenisted
Token ID6347
Feature activation+1.07e-01

Pos logit contributions
ascript+0.055
Unit+0.051
anse+0.045
arius+0.044
dial+0.041
Neg logit contributions
\/\/-0.064
ivas-0.062
 pastry-0.056
 equally-0.056
 framing-0.055
TokenE
Token ID36
Feature activation+2.27e-01

Pos logit contributions
ascript+0.187
Unit+0.176
anse+0.157
arius+0.157
dial+0.147
Neg logit contributions
\/\/-0.165
ivas-0.155
NESS-0.139
 equally-0.139
 pastry-0.138
TokenER
Token ID1137
Feature activation+3.80e-02

Pos logit contributions
ascript+0.188
Unit+0.178
anse+0.159
arius+0.159
dial+0.149
Neg logit contributions
\/\/-0.160
ivas-0.149
 equally-0.134
NESS-0.134
 pastry-0.133
Token T
Token ID309
Feature activation+5.75e-02

Pos logit contributions
ascript+0.032
Unit+0.030
anse+0.027
arius+0.027
dial+0.025
Neg logit contributions
\/\/-0.027
ivas-0.025
NESS-0.023
 equally-0.023
 pastry-0.022
TokenOC
Token ID4503
Feature activation+2.35e-02

Pos logit contributions
ascript+0.052
Unit+0.049
anse+0.044
arius+0.044
dial+0.042
Neg logit contributions
\/\/-0.037
ivas-0.034
 equally-0.030
NESS-0.030
 pastry-0.030
Token AK
Token ID15837
Feature activation-1.21e-01

Pos logit contributions
ascript+0.020
Unit+0.018
anse+0.017
arius+0.016
dial+0.015
Neg logit contributions
\/\/-0.017
ivas-0.015
NESS-0.014
 equally-0.014
 BALL-0.014
TokenNOW
Token ID45669
Feature activation-5.10e-01

Pos logit contributions
\/\/+0.035
ivas+0.030
NESS+0.021
 equally+0.021
 BALL+0.020
Neg logit contributions
ascript-0.152
Unit-0.145
anse-0.136
arius-0.136
dial-0.130
TokenLED
Token ID30465
Feature activation-8.79e-02

Pos logit contributions
\/\/+0.295
ivas+0.271
NESS+0.242
 equally+0.237
 BALL+0.236
Neg logit contributions
ascript-0.492
Unit-0.462
anse-0.425
arius-0.422
dial-0.397
TokenG
Token ID38
Feature activation+1.20e-01

Pos logit contributions
\/\/+0.067
ivas+0.063
NESS+0.057
 equally+0.057
 BALL+0.056
Neg logit contributions
ascript-0.069
Unit-0.064
anse-0.057
arius-0.057
dial-0.053
TokenED
Token ID1961
Feature activation+5.63e-02

Pos logit contributions
ascript+0.110
Unit+0.103
anse+0.094
arius+0.094
dial+0.088
Neg logit contributions
\/\/-0.074
ivas-0.069
NESS-0.062
 equally-0.061
 BALL-0.061
Token NET
Token ID30502
Feature activation-1.32e-01

Pos logit contributions
ascript+0.050
Unit+0.048
anse+0.043
arius+0.043
dial+0.041
Neg logit contributions
\/\/-0.036
ivas-0.034
 equally-0.030
 pastry-0.030
NESS-0.030
Token CALL
Token ID42815
Feature activation+1.14e-01

Pos logit contributions
\/\/+0.116
ivas+0.110
NESS+0.102
 equally+0.101
 BALL+0.101
Neg logit contributions
ascript-0.087
Unit-0.080
anse-0.070
arius-0.070
dial-0.064
Token SEM
Token ID48603
Feature activation+7.01e-02

Pos logit contributions
ascript+0.006
Unit+0.006
anse+0.005
arius+0.005
dial+0.005
Neg logit contributions
\/\/-0.006
ivas-0.005
 equally-0.005
NESS-0.005
 pastry-0.005
TokenINAL
Token ID17961
Feature activation+1.39e-01

Pos logit contributions
ascript+0.057
Unit+0.053
anse+0.048
arius+0.047
dial+0.045
Neg logit contributions
\/\/-0.051
ivas-0.048
NESS-0.043
 equally-0.043
 pastry-0.043
Token MA
Token ID8779
Feature activation+1.02e-01

Pos logit contributions
ascript+0.113
Unit+0.107
anse+0.096
arius+0.095
dial+0.089
Neg logit contributions
\/\/-0.101
ivas-0.094
 equally-0.085
NESS-0.085
 pastry-0.084
TokenIN
Token ID1268
Feature activation+1.19e-01

Pos logit contributions
ascript+0.080
Unit+0.075
anse+0.067
arius+0.067
dial+0.062
Neg logit contributions
\/\/-0.077
ivas-0.072
 equally-0.065
NESS-0.065
 pastry-0.065
Token AK
Token ID15837
Feature activation-1.28e-01

Pos logit contributions
ascript+0.098
Unit+0.092
anse+0.082
arius+0.082
dial+0.077
Neg logit contributions
\/\/-0.085
ivas-0.080
NESS-0.071
 equally-0.071
 pastry-0.071
TokenNOW
Token ID45669
Feature activation-5.09e-01

Pos logit contributions
\/\/+0.051
ivas+0.045
 equally+0.036
NESS+0.036
 pastry+0.035
Neg logit contributions
ascript-0.147
Unit-0.141
anse-0.130
arius-0.130
dial-0.125
TokenLED
Token ID30465
Feature activation-9.89e-02

Pos logit contributions
\/\/+0.293
ivas+0.270
NESS+0.241
 equally+0.236
 BALL+0.234
Neg logit contributions
ascript-0.491
Unit-0.462
anse-0.424
arius-0.422
dial-0.396
TokenG
Token ID38
Feature activation+8.40e-02

Pos logit contributions
\/\/+0.075
ivas+0.070
NESS+0.064
 equally+0.063
 BALL+0.063
Neg logit contributions
ascript-0.078
Unit-0.072
anse-0.065
arius-0.064
dial-0.060
TokenED
Token ID1961
Feature activation+7.89e-03

Pos logit contributions
ascript+0.077
Unit+0.073
anse+0.066
arius+0.066
dial+0.062
Neg logit contributions
\/\/-0.052
ivas-0.048
NESS-0.043
 equally-0.043
 BALL-0.042
Token NET
Token ID30502
Feature activation-1.50e-01

Pos logit contributions
ascript+0.007
Unit+0.006
anse+0.006
arius+0.006
dial+0.005
Neg logit contributions
\/\/-0.006
ivas-0.005
 equally-0.005
NESS-0.005
 pastry-0.005
Token CALL
Token ID42815
Feature activation+1.02e-01

Pos logit contributions
\/\/+0.133
ivas+0.126
 equally+0.116
NESS+0.116
 pastry+0.115
Neg logit contributions
ascript-0.097
Unit-0.090
anse-0.078
arius-0.078
dial-0.071
Token rules
Token ID3173
Feature activation+2.08e-02

Pos logit contributions
ascript+0.012
Unit+0.012
anse+0.011
arius+0.010
dial+0.010
Neg logit contributions
\/\/-0.009
ivas-0.008
 equally-0.007
NESS-0.007
 pastry-0.007
Token are
Token ID389
Feature activation+2.36e-03

Pos logit contributions
ascript+0.018
Unit+0.017
anse+0.015
arius+0.015
dial+0.014
Neg logit contributions
\/\/-0.014
ivas-0.013
NESS-0.012
 equally-0.012
 pastry-0.012
Token designed
Token ID3562
Feature activation+8.41e-04

Pos logit contributions
ascript+0.002
Unit+0.002
anse+0.002
arius+0.002
dial+0.002
Neg logit contributions
\/\/-0.002
ivas-0.001
 equally-0.001
NESS-0.001
 pastry-0.001
Token to
Token ID284
Feature activation-1.04e-02

Pos logit contributions
ascript+0.001
Unit+0.001
anse+0.001
arius+0.001
dial+0.001
Neg logit contributions
\/\/-0.001
ivas-0.001
NESS-0.000
 pastry-0.000
 equally-0.000
Token work
Token ID670
Feature activation-1.06e-01

Pos logit contributions
\/\/+0.006
ivas+0.006
NESS+0.005
 equally+0.005
 pastry+0.005
Neg logit contributions
ascript-0.010
Unit-0.009
anse-0.008
arius-0.008
dial-0.008
Token equally
Token ID8603
Feature activation-5.07e-01

Pos logit contributions
\/\/+0.067
ivas+0.062
NESS+0.055
 equally+0.055
 pastry+0.054
Neg logit contributions
ascript-0.096
Unit-0.091
anse-0.083
arius-0.082
dial-0.078
Token well
Token ID880
Feature activation-3.35e-02

Pos logit contributions
\/\/+0.328
ivas+0.306
NESS+0.270
 pastry+0.268
 equally+0.267
Neg logit contributions
ascript-0.451
Unit-0.427
anse-0.386
arius-0.385
dial-0.363
Token whether
Token ID1771
Feature activation+1.05e-01

Pos logit contributions
\/\/+0.022
ivas+0.020
NESS+0.018
 pastry+0.018
 equally+0.018
Neg logit contributions
ascript-0.030
Unit-0.028
anse-0.025
arius-0.025
dial-0.024
Token or
Token ID393
Feature activation-5.87e-03

Pos logit contributions
ascript+0.085
Unit+0.080
anse+0.072
arius+0.071
dial+0.067
Neg logit contributions
\/\/-0.076
ivas-0.071
 equally-0.064
NESS-0.064
 pastry-0.064
Token not
Token ID407
Feature activation+2.99e-02

Pos logit contributions
\/\/+0.004
ivas+0.004
 equally+0.004
NESS+0.004
 pastry+0.004
Neg logit contributions
ascript-0.005
Unit-0.004
anse-0.004
arius-0.004
dial-0.004
Token you
Token ID345
Feature activation-1.77e-02

Pos logit contributions
ascript+0.026
Unit+0.024
anse+0.022
arius+0.022
dial+0.021
Neg logit contributions
\/\/-0.020
ivas-0.019
 equally-0.017
NESS-0.017
 pastry-0.017
Token majority
Token ID3741
Feature activation-6.93e-02

Pos logit contributions
\/\/+0.011
ivas+0.011
 equally+0.009
NESS+0.009
 pastry+0.009
Neg logit contributions
ascript-0.015
Unit-0.014
anse-0.013
arius-0.013
dial-0.012
Token of
Token ID286
Feature activation-3.52e-02

Pos logit contributions
\/\/+0.044
ivas+0.041
NESS+0.036
 equally+0.036
 framing+0.036
Neg logit contributions
ascript-0.062
Unit-0.059
anse-0.054
arius-0.053
dial-0.050
Token China
Token ID2807
Feature activation-4.43e-02

Pos logit contributions
\/\/+0.023
ivas+0.021
NESS+0.019
 equally+0.019
 pastry+0.019
Neg logit contributions
ascript-0.031
Unit-0.030
anse-0.027
arius-0.027
dial-0.025
Token's
Token ID338
Feature activation+2.77e-02

Pos logit contributions
\/\/+0.033
ivas+0.030
NESS+0.027
 equally+0.027
 framing+0.027
Neg logit contributions
ascript-0.036
Unit-0.034
anse-0.030
arius-0.030
dial-0.028
Token air
Token ID1633
Feature activation-3.17e-01

Pos logit contributions
ascript+0.026
Unit+0.025
anse+0.022
arius+0.022
dial+0.021
Neg logit contributions
\/\/-0.017
ivas-0.015
NESS-0.013
 equally-0.013
 pastry-0.013
Token traffic
Token ID4979
Feature activation-5.05e-01

Pos logit contributions
\/\/+0.141
ivas+0.127
NESS+0.104
 pastry+0.102
 framing+0.101
Neg logit contributions
ascript-0.344
Unit-0.333
arius-0.306
anse-0.305
dial-0.293
Token is
Token ID318
Feature activation-1.21e-02

Pos logit contributions
\/\/+0.316
ivas+0.294
NESS+0.258
 equally+0.255
 BALL+0.255
Neg logit contributions
ascript-0.460
Unit-0.437
anse-0.396
arius-0.395
dial-0.374
Token still
Token ID991
Feature activation-7.44e-02

Pos logit contributions
\/\/+0.008
ivas+0.007
NESS+0.006
 pastry+0.006
 BALL+0.006
Neg logit contributions
ascript-0.011
Unit-0.010
arius-0.009
anse-0.009
dial-0.009
Token concentrated
Token ID17298
Feature activation-5.11e-02

Pos logit contributions
\/\/+0.049
ivas+0.046
NESS+0.040
 equally+0.040
 pastry+0.040
Neg logit contributions
ascript-0.065
Unit-0.062
arius-0.056
anse-0.056
dial-0.052
Token among
Token ID1871
Feature activation-8.28e-02

Pos logit contributions
\/\/+0.030
ivas+0.028
NESS+0.024
 BALL+0.024
 pastry+0.024
Neg logit contributions
ascript-0.049
Unit-0.046
anse-0.042
arius-0.042
dial-0.040
Token a
Token ID257
Feature activation-1.18e-02

Pos logit contributions
\/\/+0.055
ivas+0.052
 equally+0.046
NESS+0.046
 pastry+0.046
Neg logit contributions
ascript-0.072
Unit-0.068
anse-0.061
arius-0.061
dial-0.058
Tokensteel
Token ID44822
Feature activation-1.61e-01

Pos logit contributions
\/\/+0.070
ivas+0.064
NESS+0.058
 framing+0.058
 equally+0.057
Neg logit contributions
ascript-0.083
Unit-0.079
arius-0.071
anse-0.070
dial-0.067
Token as
Token ID355
Feature activation-1.66e-01

Pos logit contributions
\/\/+0.104
ivas+0.096
 equally+0.085
NESS+0.085
 pastry+0.085
Neg logit contributions
ascript-0.143
Unit-0.136
anse-0.123
arius-0.122
dial-0.116
Token an
Token ID281
Feature activation-7.70e-03

Pos logit contributions
\/\/+0.109
ivas+0.100
NESS+0.089
 equally+0.088
 pastry+0.088
Neg logit contributions
ascript-0.146
Unit-0.139
anse-0.126
arius-0.125
dial-0.118
Token "
Token ID366
Feature activation+6.06e-02

Pos logit contributions
\/\/+0.005
ivas+0.005
NESS+0.004
 equally+0.004
 pastry+0.004
Neg logit contributions
ascript-0.007
Unit-0.006
anse-0.006
arius-0.006
dial-0.005
Tokenorgan
Token ID9971
Feature activation+2.75e-02

Pos logit contributions
ascript+0.047
Unit+0.045
anse+0.040
arius+0.039
dial+0.037
Neg logit contributions
\/\/-0.046
ivas-0.043
 equally-0.039
NESS-0.039
 pastry-0.039
Tokenization
Token ID1634
Feature activation-5.04e-01

Pos logit contributions
ascript+0.028
Unit+0.026
anse+0.024
arius+0.024
dial+0.023
Neg logit contributions
\/\/-0.015
ivas-0.013
 equally-0.012
NESS-0.012
 pastry-0.011
Token of
Token ID286
Feature activation-7.19e-02

Pos logit contributions
\/\/+0.343
ivas+0.322
 equally+0.288
NESS+0.286
 pastry+0.286
Neg logit contributions
ascript-0.430
Unit-0.405
anse-0.365
arius-0.362
dial-0.342
Token distinction
Token ID12941
Feature activation+1.52e-01

Pos logit contributions
\/\/+0.045
ivas+0.041
 equally+0.037
NESS+0.036
 pastry+0.036
Neg logit contributions
ascript-0.066
Unit-0.062
anse-0.057
arius-0.056
dial-0.053
Token."
Token ID526
Feature activation-1.35e-02

Pos logit contributions
ascript+0.117
Unit+0.109
anse+0.097
arius+0.097
dial+0.090
Neg logit contributions
\/\/-0.117
ivas-0.110
 equally-0.099
NESS-0.099
 pastry-0.099
Token
Token ID198
Feature activation-7.23e-02

Pos logit contributions
\/\/+0.009
ivas+0.008
 equally+0.007
NESS+0.007
 pastry+0.007
Neg logit contributions
ascript-0.012
Unit-0.011
anse-0.010
arius-0.010
dial-0.010
Token
Token ID198
Feature activation-5.38e-02

Pos logit contributions
\/\/+0.047
ivas+0.044
 equally+0.039
NESS+0.039
 pastry+0.038
Neg logit contributions
ascript-0.064
Unit-0.061
anse-0.055
arius-0.055
dial-0.052
Token
Token ID198
Feature activation+9.73e-03

Pos logit contributions
\/\/+0.002
ivas+0.002
 equally+0.002
NESS+0.002
 pastry+0.002
Neg logit contributions
ascript-0.003
Unit-0.003
anse-0.003
arius-0.003
dial-0.003
TokenWilliam
Token ID17121
Feature activation-1.49e-01

Pos logit contributions
ascript+0.008
Unit+0.008
anse+0.007
arius+0.007
dial+0.006
Neg logit contributions
\/\/-0.007
ivas-0.006
NESS-0.006
 equally-0.006
 BALL-0.006
Token Carpenter
Token ID32659
Feature activation-1.10e-01

Pos logit contributions
\/\/+0.101
ivas+0.095
 equally+0.085
NESS+0.084
 pastry+0.084
Neg logit contributions
ascript-0.127
Unit-0.120
anse-0.108
arius-0.108
dial-0.101
Token,
Token ID11
Feature activation+6.73e-03

Pos logit contributions
\/\/+0.072
ivas+0.067
NESS+0.060
 equally+0.060
 pastry+0.059
Neg logit contributions
ascript-0.097
Unit-0.092
anse-0.083
arius-0.083
dial-0.078
Token Mont
Token ID5575
Feature activation+3.32e-02

Pos logit contributions
ascript+0.006
Unit+0.005
anse+0.005
arius+0.005
dial+0.004
Neg logit contributions
\/\/-0.005
ivas-0.004
 equally-0.004
NESS-0.004
 pastry-0.004
Tokenpel
Token ID30242
Feature activation-5.01e-01

Pos logit contributions
ascript+0.043
Unit+0.042
anse+0.039
arius+0.039
dial+0.038
Neg logit contributions
\/\/-0.008
ivas-0.006
 equally-0.004
NESS-0.004
 pastry-0.004
Tokenier
Token ID959
Feature activation-1.23e-01

Pos logit contributions
\/\/+0.272
ivas+0.249
 equally+0.214
NESS+0.212
 pastry+0.211
Neg logit contributions
ascript-0.500
Unit-0.475
anse-0.433
arius-0.431
dial-0.411
Token
Token ID198
Feature activation+1.71e-02

Pos logit contributions
\/\/+0.081
ivas+0.075
 equally+0.067
NESS+0.067
 pastry+0.066
Neg logit contributions
ascript-0.108
Unit-0.102
anse-0.092
arius-0.092
dial-0.086
Token
Token ID198
Feature activation+3.40e-02

Pos logit contributions
ascript+0.015
Unit+0.014
anse+0.013
arius+0.013
dial+0.012
Neg logit contributions
\/\/-0.011
ivas-0.010
 equally-0.009
NESS-0.009
 pastry-0.009
TokenCon
Token ID3103
Feature activation-4.66e-03

Pos logit contributions
ascript+0.029
Unit+0.027
anse+0.024
arius+0.024
dial+0.023
Neg logit contributions
\/\/-0.024
ivas-0.022
NESS-0.020
 equally-0.020
 pastry-0.020
Token Hogan
Token ID25316
Feature activation-2.45e-01

Pos logit contributions
\/\/+0.003
ivas+0.003
 equally+0.003
NESS+0.003
 pastry+0.003
Neg logit contributions
ascript-0.004
Unit-0.004
anse-0.003
arius-0.003
dial-0.003
Token capacity
Token ID5339
Feature activation+9.73e-02

Pos logit contributions
\/\/+0.066
ivas+0.062
 equally+0.056
 framing+0.055
NESS+0.055
Neg logit contributions
ascript-0.085
Unit-0.080
anse-0.072
arius-0.072
dial-0.068
Token and
Token ID290
Feature activation-5.83e-03

Pos logit contributions
ascript+0.084
Unit+0.079
anse+0.071
arius+0.071
dial+0.067
Neg logit contributions
\/\/-0.066
ivas-0.061
 equally-0.055
NESS-0.055
 framing-0.054
Token very
Token ID845
Feature activation-6.40e-02

Pos logit contributions
\/\/+0.004
ivas+0.004
 equally+0.003
NESS+0.003
 framing+0.003
Neg logit contributions
ascript-0.005
Unit-0.005
anse-0.004
arius-0.004
dial-0.004
Token high
Token ID1029
Feature activation-7.08e-02

Pos logit contributions
\/\/+0.043
ivas+0.040
 equally+0.036
NESS+0.036
 framing+0.036
Neg logit contributions
ascript-0.055
Unit-0.052
anse-0.047
arius-0.047
dial-0.044
Token decentral
Token ID20649
Feature activation+1.24e-01

Pos logit contributions
\/\/+0.045
ivas+0.042
 equally+0.037
NESS+0.037
 pastry+0.037
Neg logit contributions
ascript-0.063
Unit-0.060
anse-0.054
arius-0.054
dial-0.051
Tokenization
Token ID1634
Feature activation-5.00e-01

Pos logit contributions
ascript+0.153
Unit+0.147
anse+0.137
arius+0.136
dial+0.131
Neg logit contributions
\/\/-0.038
ivas-0.033
 equally-0.024
NESS-0.024
 pastry-0.024
Token.
Token ID13
Feature activation-4.33e-02

Pos logit contributions
\/\/+0.331
ivas+0.309
NESS+0.274
 equally+0.274
 pastry+0.272
Neg logit contributions
ascript-0.436
Unit-0.413
anse-0.372
arius-0.371
dial-0.350
Token CL
Token ID7852
Feature activation+1.67e-01

Pos logit contributions
\/\/+0.027
ivas+0.025
 equally+0.022
NESS+0.022
 pastry+0.022
Neg logit contributions
ascript-0.039
Unit-0.037
anse-0.034
arius-0.034
dial-0.032
TokenTV
Token ID6849
Feature activation+2.07e-02

Pos logit contributions
ascript+0.137
Unit+0.129
anse+0.116
arius+0.115
dial+0.108
Neg logit contributions
\/\/-0.119
ivas-0.111
 equally-0.100
NESS-0.100
 pastry-0.099
Token (
Token ID357
Feature activation+8.25e-02

Pos logit contributions
ascript+0.018
Unit+0.017
anse+0.015
arius+0.015
dial+0.014
Neg logit contributions
\/\/-0.014
ivas-0.013
 equally-0.011
NESS-0.011
 pastry-0.011
TokenBIP
Token ID47772
Feature activation-8.09e-02

Pos logit contributions
ascript+0.064
Unit+0.060
anse+0.053
arius+0.053
dial+0.049
Neg logit contributions
\/\/-0.063
ivas-0.059
 equally-0.054
NESS-0.054
 framing-0.053
Token.
Token ID13
Feature activation+2.04e-02

Pos logit contributions
ascript+0.074
Unit+0.070
anse+0.063
arius+0.063
dial+0.059
Neg logit contributions
\/\/-0.064
ivas-0.060
NESS-0.054
 equally-0.053
 pastry-0.053
Token
Token ID198
Feature activation+1.17e-02

Pos logit contributions
ascript+0.018
Unit+0.017
anse+0.015
arius+0.015
dial+0.015
Neg logit contributions
\/\/-0.013
ivas-0.012
 equally-0.011
NESS-0.011
 pastry-0.011
Token
Token ID198
Feature activation+2.78e-02

Pos logit contributions
ascript+0.010
Unit+0.010
anse+0.009
arius+0.009
dial+0.008
Neg logit contributions
\/\/-0.008
ivas-0.007
 equally-0.006
NESS-0.006
 pastry-0.006
TokenWill
Token ID8743
Feature activation-8.20e-02

Pos logit contributions
ascript+0.022
Unit+0.021
anse+0.018
arius+0.018
dial+0.017
Neg logit contributions
\/\/-0.021
ivas-0.019
NESS-0.018
 equally-0.018
 pastry-0.017
Token Dun
Token ID5648
Feature activation+1.10e-02

Pos logit contributions
\/\/+0.052
ivas+0.049
NESS+0.043
 equally+0.043
 pastry+0.043
Neg logit contributions
ascript-0.074
Unit-0.070
anse-0.063
arius-0.063
dial-0.059
Tokenning
Token ID768
Feature activation-4.99e-01

Pos logit contributions
ascript+0.010
Unit+0.010
anse+0.009
arius+0.009
dial+0.009
Neg logit contributions
\/\/-0.006
ivas-0.006
NESS-0.005
 equally-0.005
 pastry-0.005
Token,
Token ID11
Feature activation-2.93e-02

Pos logit contributions
\/\/+0.325
ivas+0.302
NESS+0.267
 equally+0.266
 pastry+0.265
Neg logit contributions
ascript-0.442
Unit-0.419
anse-0.379
arius-0.378
dial-0.356
Token who
Token ID508
Feature activation+5.05e-02

Pos logit contributions
\/\/+0.018
ivas+0.017
NESS+0.015
 equally+0.015
 pastry+0.014
Neg logit contributions
ascript-0.027
Unit-0.026
anse-0.023
arius-0.023
dial-0.022
Token has
Token ID468
Feature activation-1.39e-01

Pos logit contributions
ascript+0.044
Unit+0.042
anse+0.038
arius+0.038
dial+0.036
Neg logit contributions
\/\/-0.033
ivas-0.031
NESS-0.027
 BALL-0.027
 equally-0.027
Token his
Token ID465
Feature activation-6.37e-02

Pos logit contributions
\/\/+0.104
ivas+0.097
 equally+0.088
NESS+0.088
 pastry+0.087
Neg logit contributions
ascript-0.110
Unit-0.103
anse-0.092
arius-0.091
dial-0.086
Token own
Token ID898
Feature activation-1.66e-03

Pos logit contributions
\/\/+0.047
ivas+0.044
 equally+0.040
 pastry+0.040
 framing+0.040
Neg logit contributions
ascript-0.051
Unit-0.048
anse-0.043
arius-0.042
dial-0.040
Token endure
Token ID21178
Feature activation+1.12e-01

Pos logit contributions
ascript+0.014
Unit+0.013
anse+0.012
arius+0.012
dial+0.011
Neg logit contributions
\/\/-0.009
ivas-0.009
NESS-0.007
 equally-0.007
 pastry-0.007
Token great
Token ID1049
Feature activation-7.19e-02

Pos logit contributions
ascript+0.096
Unit+0.090
anse+0.081
arius+0.081
dial+0.076
Neg logit contributions
\/\/-0.077
ivas-0.072
NESS-0.064
 equally-0.064
 pastry-0.064
Token mechanical
Token ID12370
Feature activation-1.16e-01

Pos logit contributions
\/\/+0.043
ivas+0.040
 equally+0.035
NESS+0.035
 pastry+0.034
Neg logit contributions
ascript-0.068
Unit-0.064
anse-0.058
arius-0.058
dial-0.055
Token stress
Token ID5503
Feature activation-1.48e-01

Pos logit contributions
\/\/+0.066
ivas+0.061
 equally+0.053
NESS+0.053
 pastry+0.052
Neg logit contributions
ascript-0.112
Unit-0.107
anse-0.097
arius-0.097
dial-0.092
Token (
Token ID357
Feature activation+8.63e-02

Pos logit contributions
\/\/+0.098
ivas+0.092
NESS+0.081
 pastry+0.081
 equally+0.081
Neg logit contributions
ascript-0.130
Unit-0.123
anse-0.110
arius-0.110
dial-0.104
Tokenbending
Token ID49667
Feature activation-4.97e-01

Pos logit contributions
ascript+0.066
Unit+0.062
anse+0.055
arius+0.055
dial+0.051
Neg logit contributions
\/\/-0.066
ivas-0.062
 equally-0.057
NESS-0.057
 pastry-0.056
Token open
Token ID1280
Feature activation-3.38e-02

Pos logit contributions
\/\/+0.333
ivas+0.311
NESS+0.276
 equally+0.276
 pastry+0.274
Neg logit contributions
ascript-0.431
Unit-0.408
anse-0.367
arius-0.366
dial-0.345
Token and
Token ID290
Feature activation+5.38e-02

Pos logit contributions
\/\/+0.023
ivas+0.021
NESS+0.019
 equally+0.019
 pastry+0.018
Neg logit contributions
ascript-0.029
Unit-0.028
anse-0.025
arius-0.025
dial-0.024
Token slamming
Token ID39603
Feature activation-5.61e-02

Pos logit contributions
ascript+0.046
Unit+0.044
anse+0.040
arius+0.039
dial+0.037
Neg logit contributions
\/\/-0.036
ivas-0.034
 equally-0.030
NESS-0.030
 pastry-0.030
Token
Token ID198
Feature activation+2.12e-02

Pos logit contributions
\/\/+0.037
ivas+0.034
 equally+0.031
NESS+0.031
 pastry+0.030
Neg logit contributions
ascript-0.049
Unit-0.047
anse-0.042
arius-0.042
dial-0.040
Token
Token ID198
Feature activation+3.14e-02

Pos logit contributions
ascript+0.019
Unit+0.018
anse+0.016
arius+0.016
dial+0.015
Neg logit contributions
\/\/-0.014
ivas-0.013
 equally-0.011
NESS-0.011
 pastry-0.011
Token's
Token ID338
Feature activation-5.50e-02

Pos logit contributions
ascript+0.072
Unit+0.068
anse+0.062
arius+0.062
dial+0.058
Neg logit contributions
\/\/-0.051
ivas-0.047
NESS-0.042
 equally-0.041
 pastry-0.041
Token Halo
Token ID18597
Feature activation+1.44e-01

Pos logit contributions
\/\/+0.038
ivas+0.036
NESS+0.032
 equally+0.032
 pastry+0.032
Neg logit contributions
ascript-0.046
Unit-0.044
anse-0.039
arius-0.039
dial-0.037
Token super
Token ID2208
Feature activation-2.29e-02

Pos logit contributions
ascript+0.121
Unit+0.115
anse+0.103
arius+0.102
dial+0.096
Neg logit contributions
\/\/-0.100
ivas-0.094
NESS-0.083
 equally-0.083
 pastry-0.082
Token fans
Token ID3296
Feature activation-2.71e-01

Pos logit contributions
\/\/+0.016
ivas+0.014
 equally+0.013
NESS+0.013
 pastry+0.013
Neg logit contributions
ascript-0.020
Unit-0.019
anse-0.017
arius-0.017
dial-0.016
Token Ryan
Token ID6047
Feature activation-2.25e-01

Pos logit contributions
\/\/+0.177
ivas+0.165
 equally+0.146
NESS+0.146
 pastry+0.145
Neg logit contributions
ascript-0.240
Unit-0.227
anse-0.205
arius-0.205
dial-0.193
Token McC
Token ID5108
Feature activation-4.96e-01

Pos logit contributions
\/\/+0.154
ivas+0.144
 equally+0.129
NESS+0.128
 pastry+0.128
Neg logit contributions
ascript-0.192
Unit-0.181
anse-0.163
arius-0.162
dial-0.153
Tokenaff
Token ID2001
Feature activation+1.46e-02

Pos logit contributions
\/\/+0.358
ivas+0.334
NESS+0.301
 equally+0.301
 pastry+0.299
Neg logit contributions
ascript-0.405
Unit-0.381
anse-0.342
arius-0.341
dial-0.319
Tokenrey
Token ID4364
Feature activation-1.17e-03

Pos logit contributions
ascript+0.013
Unit+0.013
anse+0.012
arius+0.011
dial+0.011
Neg logit contributions
\/\/-0.009
ivas-0.008
 equally-0.007
NESS-0.007
 pastry-0.007
Token and
Token ID290
Feature activation+6.94e-04

Pos logit contributions
\/\/+0.001
ivas+0.001
NESS+0.001
 equally+0.001
 pastry+0.001
Neg logit contributions
ascript-0.001
Unit-0.001
anse-0.001
arius-0.001
dial-0.001
Token Brian
Token ID8403
Feature activation-1.47e-01

Pos logit contributions
ascript+0.001
Unit+0.001
anse+0.001
arius+0.001
dial+0.000
Neg logit contributions
\/\/-0.000
ivas-0.000
NESS-0.000
 equally-0.000
 pastry-0.000
Token Albert
Token ID9966
Feature activation-1.34e-01

Pos logit contributions
\/\/+0.100
ivas+0.093
NESS+0.084
 equally+0.083
 pastry+0.083
Neg logit contributions
ascript-0.126
Unit-0.119
anse-0.107
arius-0.107
dial-0.100
Tokenin
Token ID259
Feature activation-5.22e-04

Pos logit contributions
\/\/+0.067
ivas+0.062
 equally+0.055
NESS+0.055
 pastry+0.055
Neg logit contributions
ascript-0.083
Unit-0.078
anse-0.070
arius-0.070
dial-0.066
Tokenjury
Token ID21871
Feature activation-2.96e-02

Pos logit contributions
\/\/+0.000
ivas+0.000
 equally+0.000
NESS+0.000
 pastry+0.000
Neg logit contributions
ascript-0.000
Unit-0.000
anse-0.000
arius-0.000
dial-0.000
Token/
Token ID14
Feature activation-9.42e-02

Pos logit contributions
\/\/+0.020
ivas+0.018
NESS+0.016
 equally+0.016
 pastry+0.016
Neg logit contributions
ascript-0.026
Unit-0.024
anse-0.022
arius-0.022
dial-0.021
Tokenresearch
Token ID34033
Feature activation-6.78e-02

Pos logit contributions
\/\/+0.065
ivas+0.061
 equally+0.054
NESS+0.054
 pastry+0.054
Neg logit contributions
ascript-0.080
Unit-0.075
anse-0.068
arius-0.068
dial-0.064
Token/
Token ID14
Feature activation-7.67e-02

Pos logit contributions
\/\/+0.045
ivas+0.042
NESS+0.037
 equally+0.037
 pastry+0.037
Neg logit contributions
ascript-0.059
Unit-0.056
anse-0.051
arius-0.050
dial-0.048
Tokenpub
Token ID12984
Feature activation-4.95e-01

Pos logit contributions
\/\/+0.053
ivas+0.049
 equally+0.044
NESS+0.044
 pastry+0.043
Neg logit contributions
ascript-0.065
Unit-0.062
anse-0.055
arius-0.055
dial-0.052
Token/
Token ID14
Feature activation-6.76e-02

Pos logit contributions
\/\/+0.380
ivas+0.358
 equally+0.324
NESS+0.324
 pastry+0.323
Neg logit contributions
ascript-0.381
Unit-0.358
anse-0.318
arius-0.316
dial-0.296
Tokenalcohol
Token ID42142
Feature activation-2.11e-01

Pos logit contributions
\/\/+0.046
ivas+0.043
 equally+0.038
NESS+0.038
 pastry+0.038
Neg logit contributions
ascript-0.058
Unit-0.055
anse-0.049
arius-0.049
dial-0.046
Token-
Token ID12
Feature activation-9.93e-02

Pos logit contributions
\/\/+0.137
ivas+0.128
 equally+0.114
NESS+0.113
 pastry+0.113
Neg logit contributions
ascript-0.187
Unit-0.177
anse-0.160
arius-0.159
dial-0.151
Tokenlaws
Token ID29317
Feature activation-3.57e-02

Pos logit contributions
\/\/+0.070
ivas+0.065
 equally+0.059
NESS+0.059
 pastry+0.058
Neg logit contributions
ascript-0.083
Unit-0.078
anse-0.070
arius-0.070
dial-0.065
Token/
Token ID14
Feature activation-9.09e-02

Pos logit contributions
\/\/+0.024
ivas+0.023
 equally+0.020
 pastry+0.020
NESS+0.020
Neg logit contributions
ascript-0.031
Unit-0.029
anse-0.026
arius-0.026
dial-0.025
Token adults
Token ID6490
Feature activation-4.27e-02

Pos logit contributions
\/\/+0.020
ivas+0.019
 equally+0.017
 pastry+0.017
NESS+0.017
Neg logit contributions
ascript-0.030
Unit-0.028
anse-0.025
arius-0.025
dial-0.024
Token);
Token ID1776
Feature activation+1.30e-01

Pos logit contributions
\/\/+0.028
ivas+0.026
NESS+0.023
 equally+0.023
 pastry+0.023
Neg logit contributions
ascript-0.037
Unit-0.035
anse-0.032
arius-0.032
dial-0.030
Token
Token ID198
Feature activation-3.95e-02

Pos logit contributions
ascript+0.113
Unit+0.107
anse+0.096
arius+0.096
dial+0.090
Neg logit contributions
\/\/-0.087
ivas-0.081
 equally-0.072
NESS-0.072
 pastry-0.072
Token
Token ID198
Feature activation-1.96e-02

Pos logit contributions
\/\/+0.026
ivas+0.024
 equally+0.021
NESS+0.021
 pastry+0.021
Neg logit contributions
ascript-0.035
Unit-0.033
anse-0.030
arius-0.030
dial-0.028
Tokenpres
Token ID18302
Feature activation-1.06e-01

Pos logit contributions
\/\/+0.014
ivas+0.013
NESS+0.011
 equally+0.011
 pastry+0.011
Neg logit contributions
ascript-0.017
Unit-0.016
anse-0.014
arius-0.014
dial-0.013
Tokenence
Token ID594
Feature activation-4.88e-01

Pos logit contributions
\/\/+0.055
ivas+0.050
 equally+0.043
NESS+0.043
 pastry+0.042
Neg logit contributions
ascript-0.108
Unit-0.103
anse-0.094
arius-0.094
dial-0.089
Token of
Token ID286
Feature activation-6.27e-02

Pos logit contributions
\/\/+0.303
ivas+0.281
 equally+0.248
NESS+0.247
 pastry+0.245
Neg logit contributions
ascript-0.447
Unit-0.424
anse-0.384
arius-0.383
dial-0.362
Token critical
Token ID4688
Feature activation+2.62e-02

Pos logit contributions
\/\/+0.042
ivas+0.039
 equally+0.035
NESS+0.035
 pastry+0.035
Neg logit contributions
ascript-0.054
Unit-0.051
anse-0.046
arius-0.046
dial-0.043
Token illness
Token ID8526
Feature activation-3.32e-02

Pos logit contributions
ascript+0.023
Unit+0.021
anse+0.019
arius+0.019
dial+0.018
Neg logit contributions
\/\/-0.017
ivas-0.016
 equally-0.015
NESS-0.015
 framing-0.015
Token-
Token ID12
Feature activation-8.96e-02

Pos logit contributions
\/\/+0.022
ivas+0.020
 equally+0.018
NESS+0.018
 pastry+0.018
Neg logit contributions
ascript-0.029
Unit-0.028
anse-0.025
arius-0.025
dial-0.023
Tokenrelated
Token ID5363
Feature activation-1.76e-01

Pos logit contributions
\/\/+0.057
ivas+0.053
 equally+0.047
NESS+0.047
 framing+0.046
Neg logit contributions
ascript-0.081
Unit-0.077
anse-0.069
arius-0.069
dial-0.065
Tokenminute
Token ID11374
Feature activation-2.42e-02

Pos logit contributions
\/\/+0.008
ivas+0.007
 equally+0.006
NESS+0.006
 framing+0.006
Neg logit contributions
ascript-0.012
Unit-0.011
anse-0.010
arius-0.010
dial-0.010
Token scene
Token ID3715
Feature activation+1.28e-01

Pos logit contributions
\/\/+0.017
ivas+0.016
 equally+0.014
NESS+0.014
 pastry+0.014
Neg logit contributions
ascript-0.020
Unit-0.019
anse-0.017
arius-0.017
dial-0.016
Token that
Token ID326
Feature activation-8.24e-03

Pos logit contributions
ascript+0.109
Unit+0.103
anse+0.093
arius+0.092
dial+0.087
Neg logit contributions
\/\/-0.088
ivas-0.082
 equally-0.074
NESS-0.073
 pastry-0.073
Token ends
Token ID5645
Feature activation-2.10e-02

Pos logit contributions
\/\/+0.005
ivas+0.005
NESS+0.005
 equally+0.005
 pastry+0.004
Neg logit contributions
ascript-0.007
Unit-0.007
anse-0.006
arius-0.006
dial-0.006
Token with
Token ID351
Feature activation+9.95e-02

Pos logit contributions
\/\/+0.015
ivas+0.014
NESS+0.013
 equally+0.012
 pastry+0.012
Neg logit contributions
ascript-0.017
Unit-0.016
anse-0.015
arius-0.015
dial-0.014
Token Sed
Token ID22710
Feature activation+4.25e-01

Pos logit contributions
ascript+0.087
Unit+0.082
anse+0.074
arius+0.074
dial+0.069
Neg logit contributions
\/\/-0.066
ivas-0.062
NESS-0.055
 equally-0.055
 pastry-0.054
Tokenaris
Token ID20066
Feature activation+2.47e-01

Pos logit contributions
ascript+0.323
Unit+0.303
anse+0.269
arius+0.269
dial+0.250
Neg logit contributions
\/\/-0.329
ivas-0.310
NESS-0.282
 equally-0.281
 pastry-0.280
Token covered
Token ID5017
Feature activation+1.12e-01

Pos logit contributions
ascript+0.217
Unit+0.205
anse+0.185
arius+0.185
dial+0.174
Neg logit contributions
\/\/-0.162
ivas-0.151
NESS-0.134
 equally-0.134
 pastry-0.133
Token in
Token ID287
Feature activation-6.74e-02

Pos logit contributions
ascript+0.110
Unit+0.104
anse+0.095
arius+0.095
dial+0.090
Neg logit contributions
\/\/-0.063
ivas-0.058
NESS-0.050
 equally-0.049
 pastry-0.049
Token blood
Token ID2910
Feature activation+1.85e-02

Pos logit contributions
\/\/+0.044
ivas+0.041
NESS+0.037
 equally+0.037
 pastry+0.036
Neg logit contributions
ascript-0.059
Unit-0.056
anse-0.051
arius-0.050
dial-0.048
Token,
Token ID11
Feature activation-2.19e-02

Pos logit contributions
ascript+0.014
Unit+0.013
anse+0.012
arius+0.012
dial+0.011
Neg logit contributions
\/\/-0.015
ivas-0.014
NESS-0.012
 equally-0.012
 pastry-0.012
Token it
Token ID340
Feature activation+1.27e-01

Pos logit contributions
ascript+0.114
Unit+0.108
anse+0.097
arius+0.097
dial+0.091
Neg logit contributions
\/\/-0.093
ivas-0.087
NESS-0.077
 equally-0.077
 pastry-0.076
Token
Token ID447
Feature activation-6.88e-02

Pos logit contributions
ascript+0.109
Unit+0.103
anse+0.093
arius+0.093
dial+0.087
Neg logit contributions
\/\/-0.086
ivas-0.080
NESS-0.072
 equally-0.072
 pastry-0.071
Token
Token ID247
Feature activation+2.90e-02

Pos logit contributions
\/\/+0.062
ivas+0.059
 equally+0.054
NESS+0.054
 framing+0.054
Neg logit contributions
ascript-0.044
Unit-0.040
anse-0.035
arius-0.035
dial-0.032
Tokens
Token ID82
Feature activation+5.41e-02

Pos logit contributions
ascript+0.022
Unit+0.021
anse+0.019
arius+0.019
dial+0.017
Neg logit contributions
\/\/-0.022
ivas-0.021
 equally-0.019
NESS-0.019
 pastry-0.019
Token an
Token ID281
Feature activation+7.13e-02

Pos logit contributions
ascript+0.047
Unit+0.045
anse+0.040
arius+0.040
dial+0.038
Neg logit contributions
\/\/-0.036
ivas-0.034
NESS-0.030
 equally-0.030
 pastry-0.029
Token assemb
Token ID11156
Feature activation+4.17e-01

Pos logit contributions
ascript+0.062
Unit+0.058
anse+0.053
arius+0.052
dial+0.049
Neg logit contributions
\/\/-0.048
ivas-0.045
NESS-0.040
 equally-0.040
 pastry-0.039
Tokenl
Token ID75
Feature activation-7.44e-02

Pos logit contributions
ascript+0.387
Unit+0.368
anse+0.334
arius+0.333
dial+0.316
Neg logit contributions
\/\/-0.254
ivas-0.235
NESS-0.206
 equally-0.205
 pastry-0.204
Tokenage
Token ID496
Feature activation-1.05e-01

Pos logit contributions
\/\/+0.045
ivas+0.041
 equally+0.036
NESS+0.036
 pastry+0.036
Neg logit contributions
ascript-0.070
Unit-0.066
anse-0.060
arius-0.060
dial-0.057
Token of
Token ID286
Feature activation+3.81e-03

Pos logit contributions
\/\/+0.066
ivas+0.061
NESS+0.054
 equally+0.054
 pastry+0.053
Neg logit contributions
ascript-0.096
Unit-0.091
anse-0.083
arius-0.082
dial-0.078
Token disparate
Token ID37433
Feature activation+6.38e-02

Pos logit contributions
ascript+0.003
Unit+0.003
anse+0.003
arius+0.003
dial+0.003
Neg logit contributions
\/\/-0.002
ivas-0.002
NESS-0.002
 equally-0.002
 pastry-0.002
Token things
Token ID1243
Feature activation+1.29e-02

Pos logit contributions
ascript+0.065
Unit+0.062
anse+0.057
arius+0.057
dial+0.054
Neg logit contributions
\/\/-0.033
ivas-0.030
NESS-0.025
 BALL-0.025
 equally-0.025
Token one
Token ID530
Feature activation-2.80e-02

Pos logit contributions
ascript+0.101
Unit+0.095
anse+0.086
arius+0.086
dial+0.081
Neg logit contributions
\/\/-0.075
ivas-0.070
NESS-0.062
 equally-0.062
 pastry-0.061
Token had
Token ID550
Feature activation+1.03e-01

Pos logit contributions
\/\/+0.017
ivas+0.016
 equally+0.014
 pastry+0.014
 framing+0.014
Neg logit contributions
ascript-0.026
Unit-0.024
anse-0.022
arius-0.022
dial-0.021
Token happened
Token ID3022
Feature activation-4.03e-02

Pos logit contributions
ascript+0.091
Unit+0.086
anse+0.077
arius+0.077
dial+0.073
Neg logit contributions
\/\/-0.068
ivas-0.063
 equally-0.057
NESS-0.056
 pastry-0.056
Token.
Token ID13
Feature activation+5.90e-02

Pos logit contributions
\/\/+0.025
ivas+0.023
 equally+0.020
NESS+0.020
 pastry+0.020
Neg logit contributions
ascript-0.037
Unit-0.035
anse-0.032
arius-0.032
dial-0.030
Token A
Token ID317
Feature activation+3.60e-02

Pos logit contributions
ascript+0.053
Unit+0.050
anse+0.046
arius+0.045
dial+0.043
Neg logit contributions
\/\/-0.038
ivas-0.035
NESS-0.031
 equally-0.031
 BALL-0.030
Token Redd
Token ID42734
Feature activation+4.13e-01

Pos logit contributions
ascript+0.031
Unit+0.029
anse+0.026
arius+0.026
dial+0.025
Neg logit contributions
\/\/-0.024
ivas-0.023
 equally-0.020
NESS-0.020
 pastry-0.020
Tokenitor
Token ID2072
Feature activation-1.55e-01

Pos logit contributions
ascript+0.346
Unit+0.324
anse+0.291
arius+0.290
dial+0.273
Neg logit contributions
\/\/-0.290
ivas-0.271
 equally-0.244
NESS-0.242
 pastry-0.242
Token posted
Token ID4481
Feature activation+2.09e-01

Pos logit contributions
\/\/+0.116
ivas+0.109
NESS+0.098
 equally+0.098
 pastry+0.097
Neg logit contributions
ascript-0.123
Unit-0.115
anse-0.103
arius-0.102
dial-0.096
Token what
Token ID644
Feature activation+1.18e-01

Pos logit contributions
ascript+0.183
Unit+0.173
anse+0.156
arius+0.156
dial+0.147
Neg logit contributions
\/\/-0.138
ivas-0.129
NESS-0.114
 equally-0.114
 pastry-0.113
Token they
Token ID484
Feature activation-5.48e-03

Pos logit contributions
ascript+0.104
Unit+0.098
anse+0.089
arius+0.088
dial+0.084
Neg logit contributions
\/\/-0.077
ivas-0.072
NESS-0.063
 equally-0.063
 pastry-0.063
Token claimed
Token ID4752
Feature activation+1.69e-01

Pos logit contributions
\/\/+0.004
ivas+0.004
 equally+0.003
NESS+0.003
 pastry+0.003
Neg logit contributions
ascript-0.004
Unit-0.004
anse-0.004
arius-0.004
dial-0.003
Token,
Token ID11
Feature activation-3.87e-02

Pos logit contributions
ascript+0.009
Unit+0.008
anse+0.007
arius+0.007
dial+0.007
Neg logit contributions
\/\/-0.007
ivas-0.006
NESS-0.006
 equally-0.006
 pastry-0.006
Token Lv
Token ID24388
Feature activation-1.18e-01

Pos logit contributions
\/\/+0.027
ivas+0.025
 equally+0.022
NESS+0.022
 pastry+0.022
Neg logit contributions
ascript-0.033
Unit-0.031
anse-0.028
arius-0.028
dial-0.026
Tokeniv
Token ID452
Feature activation+1.52e-02

Pos logit contributions
\/\/+0.075
ivas+0.072
 equally+0.062
 pastry+0.062
NESS+0.061
Neg logit contributions
ascript-0.107
Unit-0.100
arius-0.090
anse-0.090
dial-0.086
Token and
Token ID290
Feature activation-6.72e-04

Pos logit contributions
ascript+0.014
Unit+0.013
anse+0.012
arius+0.012
dial+0.011
Neg logit contributions
\/\/-0.010
ivas-0.009
 equally-0.008
NESS-0.008
 pastry-0.008
Token Iv
Token ID16975
Feature activation-2.73e-01

Pos logit contributions
\/\/+0.000
ivas+0.000
NESS+0.000
 equally+0.000
 pastry+0.000
Neg logit contributions
ascript-0.001
Unit-0.001
anse-0.001
arius-0.001
dial-0.000
Tokenano
Token ID5733
Feature activation+4.14e-01

Pos logit contributions
\/\/+0.316
ivas+0.306
 equally+0.285
 pastry+0.284
NESS+0.283
Neg logit contributions
ascript-0.103
Unit-0.090
anse-0.065
arius-0.065
dial-0.054
Token-
Token ID12
Feature activation-8.43e-02

Pos logit contributions
ascript+0.392
Unit+0.372
anse+0.338
arius+0.337
dial+0.320
Neg logit contributions
\/\/-0.245
ivas-0.226
 equally-0.198
NESS-0.197
 pastry-0.196
TokenFrank
Token ID17439
Feature activation-9.74e-02

Pos logit contributions
\/\/+0.054
ivas+0.051
NESS+0.045
 BALL+0.045
 pastry+0.045
Neg logit contributions
ascript-0.074
Unit-0.072
arius-0.064
anse-0.064
dial-0.061
Tokeniv
Token ID452
Feature activation+2.58e-02

Pos logit contributions
\/\/+0.062
ivas+0.059
 equally+0.051
NESS+0.051
 pastry+0.051
Neg logit contributions
ascript-0.088
Unit-0.082
anse-0.075
arius-0.074
dial-0.070
Tokensk
Token ID8135
Feature activation+1.26e-01

Pos logit contributions
ascript+0.022
Unit+0.020
anse+0.018
arius+0.018
dial+0.017
Neg logit contributions
\/\/-0.018
ivas-0.017
 equally-0.015
 pastry-0.015
NESS-0.015
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
ascript+0.110
Unit+0.104
anse+0.094
arius+0.093
dial+0.088
Neg logit contributions
\/\/-0.083
ivas-0.078
 equally-0.069
NESS-0.069
 pastry-0.068
Token even
Token ID772
Feature activation+4.83e-02

Pos logit contributions
ascript+0.036
Unit+0.034
anse+0.030
arius+0.030
dial+0.028
Neg logit contributions
\/\/-0.029
ivas-0.027
 equally-0.024
 framing-0.024
NESS-0.024
Token modify
Token ID13096
Feature activation+2.52e-01

Pos logit contributions
ascript+0.042
Unit+0.039
anse+0.035
arius+0.035
dial+0.033
Neg logit contributions
\/\/-0.033
ivas-0.031
 equally-0.028
NESS-0.027
 framing-0.027
Token your
Token ID534
Feature activation+1.28e-01

Pos logit contributions
ascript+0.219
Unit+0.206
anse+0.186
arius+0.185
dial+0.174
Neg logit contributions
\/\/-0.170
ivas-0.158
 equally-0.142
NESS-0.141
 framing-0.140
Token interior
Token ID11087
Feature activation+2.19e-01

Pos logit contributions
ascript+0.103
Unit+0.097
anse+0.087
arius+0.087
dial+0.081
Neg logit contributions
\/\/-0.093
ivas-0.087
 equally-0.078
NESS-0.078
 pastry-0.078
Token.
Token ID13
Feature activation+7.06e-02

Pos logit contributions
ascript+0.189
Unit+0.178
anse+0.161
arius+0.160
dial+0.151
Neg logit contributions
\/\/-0.147
ivas-0.137
 equally-0.122
NESS-0.122
 pastry-0.121
Token Susp
Token ID31922
Feature activation+3.80e-01

Pos logit contributions
ascript+0.064
Unit+0.060
anse+0.054
arius+0.054
dial+0.051
Neg logit contributions
\/\/-0.045
ivas-0.042
 equally-0.037
NESS-0.037
 BALL-0.037
Tokenension
Token ID3004
Feature activation+2.27e-01

Pos logit contributions
ascript+0.292
Unit+0.272
arius+0.241
anse+0.241
dial+0.224
Neg logit contributions
\/\/-0.292
ivas-0.277
 equally-0.250
NESS-0.248
 pastry-0.247
Token and
Token ID290
Feature activation+7.17e-02

Pos logit contributions
ascript+0.199
Unit+0.188
anse+0.170
arius+0.169
dial+0.159
Neg logit contributions
\/\/-0.150
ivas-0.140
 equally-0.125
NESS-0.124
 framing-0.124
Token engine
Token ID3113
Feature activation+9.74e-02

Pos logit contributions
ascript+0.061
Unit+0.058
anse+0.052
arius+0.052
dial+0.049
Neg logit contributions
\/\/-0.049
ivas-0.046
NESS-0.041
 equally-0.041
 pastry-0.040
Token performance
Token ID2854
Feature activation+1.30e-01

Pos logit contributions
ascript+0.080
Unit+0.075
anse+0.067
arius+0.067
dial+0.062
Neg logit contributions
\/\/-0.070
ivas-0.066
 equally-0.060
 framing-0.059
NESS-0.059
Token can
Token ID460
Feature activation+1.15e-01

Pos logit contributions
ascript+0.111
Unit+0.105
anse+0.095
arius+0.094
dial+0.088
Neg logit contributions
\/\/-0.089
ivas-0.082
 equally-0.075
 framing-0.074
NESS-0.073
Token report
Token ID989
Feature activation-5.22e-02

Pos logit contributions
\/\/+0.033
ivas+0.031
 equally+0.028
NESS+0.028
 pastry+0.028
Neg logit contributions
ascript-0.038
Unit-0.036
anse-0.032
arius-0.032
dial-0.030
Token concluded
Token ID8391
Feature activation+1.95e-01

Pos logit contributions
\/\/+0.036
ivas+0.033
NESS+0.030
 equally+0.030
 pastry+0.030
Neg logit contributions
ascript-0.045
Unit-0.042
anse-0.038
arius-0.038
dial-0.035
Token that
Token ID326
Feature activation-1.02e-02

Pos logit contributions
ascript+0.174
Unit+0.165
anse+0.149
arius+0.149
dial+0.141
Neg logit contributions
\/\/-0.125
ivas-0.116
NESS-0.102
 equally-0.102
 pastry-0.101
Token Ike
Token ID34880
Feature activation-1.80e-01

Pos logit contributions
\/\/+0.007
ivas+0.006
 equally+0.006
NESS+0.006
 pastry+0.006
Neg logit contributions
ascript-0.009
Unit-0.008
anse-0.008
arius-0.008
dial-0.007
Tokena
Token ID64
Feature activation+9.69e-02

Pos logit contributions
\/\/+0.174
ivas+0.166
 equally+0.153
NESS+0.153
 pastry+0.153
Neg logit contributions
ascript-0.102
Unit-0.094
anse-0.079
arius-0.079
dial-0.071
Token managers
Token ID11663
Feature activation+1.03e-01

Pos logit contributions
ascript+0.082
Unit+0.077
anse+0.069
arius+0.069
dial+0.065
Neg logit contributions
\/\/-0.067
ivas-0.063
 equally-0.056
NESS-0.056
 pastry-0.056
Token were
Token ID547
Feature activation-5.67e-03

Pos logit contributions
ascript+0.087
Unit+0.082
anse+0.074
arius+0.073
dial+0.069
Neg logit contributions
\/\/-0.071
ivas-0.066
 equally-0.060
 framing-0.060
 pastry-0.059
Token aware
Token ID3910
Feature activation+1.09e-01

Pos logit contributions
\/\/+0.004
ivas+0.003
 equally+0.003
NESS+0.003
 pastry+0.003
Neg logit contributions
ascript-0.005
Unit-0.005
anse-0.004
arius-0.004
dial-0.004
Token of
Token ID286
Feature activation+2.33e-02

Pos logit contributions
ascript+0.098
Unit+0.093
anse+0.085
arius+0.084
dial+0.080
Neg logit contributions
\/\/-0.069
ivas-0.064
NESS-0.056
 equally-0.056
 pastry-0.055
Token the
Token ID262
Feature activation+1.50e-02

Pos logit contributions
ascript+0.021
Unit+0.020
anse+0.018
arius+0.018
dial+0.017
Neg logit contributions
\/\/-0.015
ivas-0.014
 equally-0.012
 pastry-0.012
 framing-0.012
Token possibility
Token ID5885
Feature activation+3.78e-02

Pos logit contributions
ascript+0.014
Unit+0.013
anse+0.012
arius+0.012
dial+0.011
Neg logit contributions
\/\/-0.010
ivas-0.009
 equally-0.008
 framing-0.008
 pastry-0.008
Token Hollande
Token ID33590
Feature activation-6.81e-02

Pos logit contributions
ascript+0.026
Unit+0.025
anse+0.022
arius+0.022
dial+0.021
Neg logit contributions
\/\/-0.017
ivas-0.016
 equally-0.014
NESS-0.014
 BALL-0.014
Token has
Token ID468
Feature activation-6.79e-02

Pos logit contributions
\/\/+0.045
ivas+0.042
 equally+0.038
NESS+0.037
 pastry+0.037
Neg logit contributions
ascript-0.059
Unit-0.056
anse-0.051
arius-0.050
dial-0.048
Token set
Token ID900
Feature activation-8.65e-02

Pos logit contributions
\/\/+0.049
ivas+0.046
NESS+0.041
 equally+0.041
 pastry+0.041
Neg logit contributions
ascript-0.055
Unit-0.052
anse-0.047
arius-0.047
dial-0.044
Token out
Token ID503
Feature activation+1.76e-01

Pos logit contributions
\/\/+0.057
ivas+0.053
 equally+0.048
NESS+0.047
 framing+0.047
Neg logit contributions
ascript-0.076
Unit-0.071
anse-0.065
arius-0.064
dial-0.060
Token an
Token ID281
Feature activation+5.89e-02

Pos logit contributions
ascript+0.158
Unit+0.150
anse+0.136
arius+0.135
dial+0.128
Neg logit contributions
\/\/-0.113
ivas-0.105
 equally-0.093
NESS-0.093
 pastry-0.092
Token ambitious
Token ID14742
Feature activation+8.35e-02

Pos logit contributions
ascript+0.048
Unit+0.045
anse+0.040
arius+0.040
dial+0.038
Neg logit contributions
\/\/-0.042
ivas-0.040
 equally-0.036
NESS-0.036
 framing-0.036
Token goal
Token ID3061
Feature activation-4.34e-02

Pos logit contributions
ascript+0.075
Unit+0.071
anse+0.064
arius+0.064
dial+0.060
Neg logit contributions
\/\/-0.054
ivas-0.050
 equally-0.044
 framing-0.044
NESS-0.044
Token for
Token ID329
Feature activation+2.34e-02

Pos logit contributions
\/\/+0.027
ivas+0.025
 equally+0.022
NESS+0.022
 pastry+0.022
Neg logit contributions
ascript-0.040
Unit-0.038
anse-0.034
arius-0.034
dial-0.032
Token the
Token ID262
Feature activation+3.09e-02

Pos logit contributions
ascript+0.021
Unit+0.020
anse+0.018
arius+0.018
dial+0.017
Neg logit contributions
\/\/-0.015
ivas-0.013
 equally-0.012
 framing-0.012
 pastry-0.012
Token Paris
Token ID6342
Feature activation+1.27e-01

Pos logit contributions
ascript+0.026
Unit+0.025
anse+0.022
arius+0.022
dial+0.020
Neg logit contributions
\/\/-0.021
ivas-0.020
 equally-0.018
 framing-0.018
 pastry-0.018
Token meeting
Token ID3249
Feature activation-6.18e-02

Pos logit contributions
ascript+0.115
Unit+0.109
anse+0.098
arius+0.098
dial+0.092
Neg logit contributions
\/\/-0.080
ivas-0.074
 equally-0.066
 framing-0.066
 pastry-0.065
Token of
Token ID286
Feature activation-9.60e-02

Pos logit contributions
\/\/+0.183
ivas+0.170
 equally+0.149
NESS+0.149
 pastry+0.148
Neg logit contributions
ascript-0.273
Unit-0.259
anse-0.235
arius-0.234
dial-0.222
Token Part
Token ID2142
Feature activation-3.93e-02

Pos logit contributions
\/\/+0.067
ivas+0.062
 equally+0.056
NESS+0.056
 pastry+0.055
Neg logit contributions
ascript-0.081
Unit-0.076
anse-0.068
arius-0.068
dial-0.064
Tokenicle
Token ID1548
Feature activation-7.71e-03

Pos logit contributions
\/\/+0.027
ivas+0.025
 equally+0.023
NESS+0.023
 pastry+0.023
Neg logit contributions
ascript-0.033
Unit-0.031
anse-0.028
arius-0.028
dial-0.026
Token::
Token ID3712
Feature activation+1.33e-01

Pos logit contributions
\/\/+0.007
ivas+0.006
 equally+0.006
NESS+0.006
 pastry+0.006
Neg logit contributions
ascript-0.005
Unit-0.005
anse-0.004
arius-0.004
dial-0.004
TokenGet
Token ID3855
Feature activation-1.97e-01

Pos logit contributions
ascript+0.118
Unit+0.111
anse+0.101
arius+0.100
dial+0.095
Neg logit contributions
\/\/-0.087
ivas-0.080
NESS-0.071
 equally-0.071
 pastry-0.071
TokenMass
Token ID20273
Feature activation+6.07e-02

Pos logit contributions
\/\/+0.150
ivas+0.141
NESS+0.127
 equally+0.127
 pastry+0.126
Neg logit contributions
ascript-0.154
Unit-0.143
anse-0.128
arius-0.128
dial-0.119
Token(
Token ID7
Feature activation-2.13e-02

Pos logit contributions
ascript+0.051
Unit+0.048
anse+0.043
arius+0.043
dial+0.041
Neg logit contributions
\/\/-0.042
ivas-0.039
 equally-0.035
NESS-0.035
 pastry-0.035
Token ambient
Token ID25237
Feature activation-1.66e-01

Pos logit contributions
\/\/+0.014
ivas+0.013
 equally+0.011
NESS+0.011
 pastry+0.011
Neg logit contributions
ascript-0.019
Unit-0.018
anse-0.016
arius-0.016
dial-0.015
TokenD
Token ID35
Feature activation+6.32e-02

Pos logit contributions
\/\/+0.129
ivas+0.121
 equally+0.110
NESS+0.110
 pastry+0.109
Neg logit contributions
ascript-0.126
Unit-0.119
anse-0.105
arius-0.105
dial-0.098
Tokenensity
Token ID6377
Feature activation-8.35e-02

Pos logit contributions
ascript+0.054
Unit+0.051
anse+0.046
arius+0.046
dial+0.043
Neg logit contributions
\/\/-0.043
ivas-0.040
 equally-0.036
NESS-0.036
 pastry-0.035
Token ).
Token ID6739
Feature activation+3.46e-02

Pos logit contributions
\/\/+0.063
ivas+0.059
 equally+0.053
NESS+0.053
 pastry+0.053
Neg logit contributions
ascript-0.066
Unit-0.061
anse-0.055
arius-0.055
dial-0.051
Token ker
Token ID41927
Feature activation+1.28e-02

Pos logit contributions
ascript+0.070
Unit+0.066
anse+0.059
arius+0.059
dial+0.056
Neg logit contributions
\/\/-0.057
ivas-0.053
NESS-0.047
 equally-0.047
 pastry-0.047
Tokenatin
Token ID10680
Feature activation+2.44e-01

Pos logit contributions
ascript+0.010
Unit+0.009
anse+0.008
arius+0.008
dial+0.008
Neg logit contributions
\/\/-0.010
ivas-0.009
 equally-0.008
NESS-0.008
 pastry-0.008
Token 9
Token ID860
Feature activation-2.22e-02

Pos logit contributions
ascript+0.235
Unit+0.223
anse+0.204
arius+0.204
dial+0.193
Neg logit contributions
\/\/-0.140
ivas-0.128
NESS-0.112
 equally-0.111
 pastry-0.110
Token and
Token ID290
Feature activation+1.29e-02

Pos logit contributions
\/\/+0.015
ivas+0.014
 equally+0.012
NESS+0.012
 pastry+0.012
Neg logit contributions
ascript-0.019
Unit-0.018
anse-0.016
arius-0.016
dial-0.015
Token ker
Token ID41927
Feature activation+1.43e-02

Pos logit contributions
ascript+0.011
Unit+0.011
anse+0.009
arius+0.009
dial+0.009
Neg logit contributions
\/\/-0.009
ivas-0.008
 equally-0.007
NESS-0.007
 pastry-0.007
Tokenatin
Token ID10680
Feature activation+2.33e-01

Pos logit contributions
ascript+0.010
Unit+0.010
anse+0.008
arius+0.008
dial+0.008
Neg logit contributions
\/\/-0.012
ivas-0.011
 equally-0.010
NESS-0.010
 pastry-0.010
Token 16
Token ID1467
Feature activation-7.67e-02

Pos logit contributions
ascript+0.210
Unit+0.199
anse+0.181
arius+0.180
dial+0.170
Neg logit contributions
\/\/-0.147
ivas-0.137
NESS-0.121
 equally-0.121
 pastry-0.120
Token.[
Token ID3693
Feature activation+1.53e-01

Pos logit contributions
\/\/+0.050
ivas+0.047
 equally+0.042
NESS+0.042
 pastry+0.041
Neg logit contributions
ascript-0.067
Unit-0.064
anse-0.057
arius-0.057
dial-0.054
Token22
Token ID1828
Feature activation-1.32e-01

Pos logit contributions
ascript+0.143
Unit+0.136
anse+0.124
arius+0.123
dial+0.117
Neg logit contributions
\/\/-0.092
ivas-0.085
 equally-0.074
NESS-0.074
 pastry-0.074
Token]
Token ID60
Feature activation-1.39e-01

Pos logit contributions
\/\/+0.068
ivas+0.062
NESS+0.053
 pastry+0.052
 equally+0.052
Neg logit contributions
ascript-0.135
Unit-0.129
anse-0.118
arius-0.118
dial-0.112
Token
Token ID198
Feature activation+1.09e-03

Pos logit contributions
\/\/+0.090
ivas+0.083
 equally+0.074
NESS+0.074
 pastry+0.073
Neg logit contributions
ascript-0.125
Unit-0.118
anse-0.106
arius-0.106
dial-0.100
Token prime
Token ID6994
Feature activation-2.08e-01

Pos logit contributions
\/\/+0.024
ivas+0.022
NESS+0.020
 equally+0.020
 pastry+0.020
Neg logit contributions
ascript-0.035
Unit-0.033
anse-0.030
arius-0.030
dial-0.028
Token minister
Token ID5342
Feature activation-1.99e-01

Pos logit contributions
\/\/+0.102
ivas+0.093
NESS+0.078
 equally+0.078
 pastry+0.077
Neg logit contributions
ascript-0.218
Unit-0.208
anse-0.191
arius-0.190
dial-0.182
Token should
Token ID815
Feature activation+2.07e-02

Pos logit contributions
\/\/+0.142
ivas+0.133
 equally+0.119
NESS+0.119
 pastry+0.118
Neg logit contributions
ascript-0.163
Unit-0.154
anse-0.138
arius-0.137
dial-0.129
Token take
Token ID1011
Feature activation+2.80e-02

Pos logit contributions
ascript+0.018
Unit+0.017
anse+0.015
arius+0.015
dial+0.014
Neg logit contributions
\/\/-0.014
ivas-0.013
 equally-0.011
NESS-0.011
 pastry-0.011
Token Mr
Token ID1770
Feature activation-1.34e-01

Pos logit contributions
ascript+0.024
Unit+0.023
anse+0.021
arius+0.021
dial+0.020
Neg logit contributions
\/\/-0.019
ivas-0.017
NESS-0.015
 equally-0.015
 pastry-0.015
Token Hes
Token ID44773
Feature activation+8.45e-02

Pos logit contributions
\/\/+0.091
ivas+0.085
NESS+0.076
 equally+0.075
 pastry+0.075
Neg logit contributions
ascript-0.115
Unit-0.108
arius-0.097
anse-0.097
dial-0.092
Tokenelt
Token ID2120
Feature activation-3.39e-02

Pos logit contributions
ascript+0.063
Unit+0.059
anse+0.052
arius+0.052
dial+0.048
Neg logit contributions
\/\/-0.067
ivas-0.063
NESS-0.057
 equally-0.057
 pastry-0.057
Tokenine
Token ID500
Feature activation+1.14e-01

Pos logit contributions
\/\/+0.029
ivas+0.028
NESS+0.025
 equally+0.025
 pastry+0.025
Neg logit contributions
ascript-0.023
Unit-0.021
anse-0.019
arius-0.019
dial-0.017
Token's
Token ID338
Feature activation-1.34e-02

Pos logit contributions
ascript+0.095
Unit+0.089
anse+0.080
arius+0.080
dial+0.075
Neg logit contributions
\/\/-0.080
ivas-0.075
NESS-0.067
 equally-0.067
 pastry-0.066
Token comments
Token ID3651
Feature activation-7.80e-02

Pos logit contributions
\/\/+0.008
ivas+0.007
NESS+0.006
 pastry+0.006
 equally+0.006
Neg logit contributions
ascript-0.013
Unit-0.012
arius-0.011
anse-0.011
dial-0.010
Token "
Token ID366
Feature activation+9.97e-04

Pos logit contributions
\/\/+0.052
ivas+0.048
NESS+0.042
 pastry+0.042
 equally+0.042
Neg logit contributions
ascript-0.068
Unit-0.065
anse-0.058
arius-0.058
dial-0.055
Token
Token ID198
Feature activation-2.04e-03

Pos logit contributions
ascript+0.013
Unit+0.012
anse+0.011
arius+0.011
dial+0.010
Neg logit contributions
\/\/-0.009
ivas-0.009
 equally-0.008
NESS-0.008
 pastry-0.007
Token
Token ID198
Feature activation+2.59e-03

Pos logit contributions
\/\/+0.001
ivas+0.001
 equally+0.001
NESS+0.001
 pastry+0.001
Neg logit contributions
ascript-0.002
Unit-0.002
anse-0.002
arius-0.002
dial-0.001
TokenFinally
Token ID11158
Feature activation-1.15e-01

Pos logit contributions
ascript+0.002
Unit+0.002
anse+0.002
arius+0.002
dial+0.002
Neg logit contributions
\/\/-0.002
ivas-0.002
NESS-0.002
 equally-0.002
 pastry-0.001
Token,
Token ID11
Feature activation+4.63e-02

Pos logit contributions
\/\/+0.072
ivas+0.066
NESS+0.059
 equally+0.058
 pastry+0.058
Neg logit contributions
ascript-0.105
Unit-0.100
arius-0.090
anse-0.090
dial-0.085
Token the
Token ID262
Feature activation-2.10e-03

Pos logit contributions
ascript+0.041
Unit+0.039
anse+0.036
arius+0.035
dial+0.034
Neg logit contributions
\/\/-0.030
ivas-0.028
NESS-0.024
 equally-0.024
 pastry-0.024
Token Obama
Token ID2486
Feature activation-1.34e-02

Pos logit contributions
\/\/+0.001
ivas+0.001
 equally+0.001
NESS+0.001
 pastry+0.001
Neg logit contributions
ascript-0.002
Unit-0.002
anse-0.002
arius-0.002
dial-0.002
Token administration
Token ID3662
Feature activation-3.51e-02

Pos logit contributions
\/\/+0.009
ivas+0.009
NESS+0.008
 equally+0.008
 pastry+0.008
Neg logit contributions
ascript-0.011
Unit-0.011
anse-0.010
arius-0.010
dial-0.009
Token might
Token ID1244
Feature activation+6.48e-02

Pos logit contributions
\/\/+0.023
ivas+0.021
NESS+0.019
 equally+0.019
 pastry+0.019
Neg logit contributions
ascript-0.031
Unit-0.029
anse-0.026
arius-0.026
dial-0.025
Token reach
Token ID3151
Feature activation-9.01e-03

Pos logit contributions
ascript+0.058
Unit+0.055
anse+0.050
arius+0.050
dial+0.047
Neg logit contributions
\/\/-0.041
ivas-0.038
NESS-0.034
 pastry-0.033
 equally-0.033
Token out
Token ID503
Feature activation+1.58e-01

Pos logit contributions
\/\/+0.006
ivas+0.006
 equally+0.005
NESS+0.005
 pastry+0.005
Neg logit contributions
ascript-0.007
Unit-0.007
anse-0.006
arius-0.006
dial-0.006
Token to
Token ID284
Feature activation+3.10e-02

Pos logit contributions
ascript+0.136
Unit+0.128
anse+0.115
arius+0.115
dial+0.108
Neg logit contributions
\/\/-0.107
ivas-0.099
NESS-0.088
 pastry-0.088
 BALL-0.087
Token
Token ID447
Feature activation+2.26e-02

Pos logit contributions
ascript+0.017
Unit+0.016
anse+0.014
arius+0.014
dial+0.013
Neg logit contributions
\/\/-0.014
ivas-0.013
 equally-0.012
NESS-0.012
 pastry-0.012
Token
Token ID250
Feature activation-4.78e-02

Pos logit contributions
ascript+0.015
Unit+0.014
anse+0.012
arius+0.012
dial+0.011
Neg logit contributions
\/\/-0.020
ivas-0.019
NESS-0.017
 equally-0.017
 pastry-0.017
TokenYou
Token ID1639
Feature activation-5.07e-02

Pos logit contributions
\/\/+0.032
ivas+0.030
 equally+0.026
NESS+0.026
 pastry+0.026
Neg logit contributions
ascript-0.042
Unit-0.039
anse-0.036
arius-0.035
dial-0.033
Token see
Token ID766
Feature activation+8.08e-03

Pos logit contributions
\/\/+0.036
ivas+0.034
 equally+0.030
NESS+0.030
 pastry+0.030
Neg logit contributions
ascript-0.042
Unit-0.039
anse-0.035
arius-0.035
dial-0.033
Token fewer
Token ID7380
Feature activation-3.00e-02

Pos logit contributions
ascript+0.007
Unit+0.007
anse+0.006
arius+0.006
dial+0.006
Neg logit contributions
\/\/-0.005
ivas-0.005
NESS-0.004
 equally-0.004
 pastry-0.004
Token people
Token ID661
Feature activation-5.30e-02

Pos logit contributions
\/\/+0.019
ivas+0.018
NESS+0.016
 equally+0.016
 pastry+0.016
Neg logit contributions
ascript-0.027
Unit-0.025
anse-0.023
arius-0.023
dial-0.021
Token on
Token ID319
Feature activation+4.74e-02

Pos logit contributions
\/\/+0.034
ivas+0.031
 equally+0.028
NESS+0.028
 pastry+0.027
Neg logit contributions
ascript-0.048
Unit-0.045
anse-0.041
arius-0.041
dial-0.039
Token the
Token ID262
Feature activation-2.93e-02

Pos logit contributions
ascript+0.042
Unit+0.040
anse+0.036
arius+0.036
dial+0.034
Neg logit contributions
\/\/-0.030
ivas-0.028
 equally-0.025
NESS-0.025
 pastry-0.025
Token In
Token ID554
Feature activation-9.41e-02

Pos logit contributions
\/\/+0.018
ivas+0.016
NESS+0.014
 equally+0.014
 pastry+0.014
Neg logit contributions
ascript-0.027
Unit-0.026
anse-0.024
arius-0.023
dial-0.022
Tokenman
Token ID805
Feature activation+8.06e-02

Pos logit contributions
\/\/+0.078
ivas+0.074
 equally+0.067
NESS+0.067
 pastry+0.067
Neg logit contributions
ascript-0.067
Unit-0.062
anse-0.054
arius-0.054
dial-0.050
Token 300
Token ID5867
Feature activation+9.18e-02

Pos logit contributions
ascript+0.066
Unit+0.062
anse+0.055
arius+0.055
dial+0.052
Neg logit contributions
\/\/-0.058
ivas-0.054
 equally-0.049
NESS-0.049
 pastry-0.049
Token!
Token ID0
Feature activation+7.49e-02

Pos logit contributions
\/\/+0.066
ivas+0.062
NESS+0.055
 equally+0.055
 pastry+0.055
Neg logit contributions
ascript-0.078
Unit-0.074
anse-0.066
arius-0.066
dial-0.062
Token
Token ID447
Feature activation-7.61e-03

Pos logit contributions
ascript+0.061
Unit+0.057
anse+0.051
arius+0.051
dial+0.048
Neg logit contributions
\/\/-0.054
ivas-0.051
 equally-0.046
NESS-0.046
 pastry-0.045
Token
Token ID251
Feature activation-1.73e-02

Pos logit contributions
\/\/+0.006
ivas+0.005
 equally+0.005
 pastry+0.005
NESS+0.005
Neg logit contributions
ascript-0.006
Unit-0.006
anse-0.005
arius-0.005
dial-0.005
Token\n
Token ID198
Feature activation+4.28e-03

Pos logit contributions
\/\/+0.011
ivas+0.011
 equally+0.009
NESS+0.009
 pastry+0.009
Neg logit contributions
ascript-0.015
Unit-0.014
anse-0.013
arius-0.013
dial-0.012
Token\n
Token ID198
Feature activation+1.19e-02

Pos logit contributions
ascript+0.004
Unit+0.004
anse+0.003
arius+0.003
dial+0.003
Neg logit contributions
\/\/-0.003
ivas-0.003
NESS-0.002
 equally-0.002
 pastry-0.002
TokenThe
Token ID464
Feature activation-9.22e-03

Pos logit contributions
ascript+0.010
Unit+0.009
anse+0.008
arius+0.008
dial+0.008
Neg logit contributions
\/\/-0.009
ivas-0.008
NESS-0.007
 equally-0.007
 pastry-0.007
Token segment
Token ID10618
Feature activation+5.70e-02

Pos logit contributions
\/\/+0.007
ivas+0.006
 equally+0.006
 framing+0.006
 pastry+0.006
Neg logit contributions
ascript-0.008
Unit-0.007
anse-0.006
arius-0.006
dial-0.006
Token is
Token ID318
Feature activation+5.67e-03

Pos logit contributions
ascript+0.049
Unit+0.046
anse+0.042
arius+0.042
dial+0.039
Neg logit contributions
\/\/-0.039
ivas-0.036
 equally-0.032
NESS-0.032
 pastry-0.032
Token actually
Token ID1682
Feature activation+4.32e-02

Pos logit contributions
ascript+0.005
Unit+0.005
anse+0.004
arius+0.004
dial+0.004
Neg logit contributions
\/\/-0.004
ivas-0.003
NESS-0.003
 equally-0.003
 pastry-0.003
Token painful
Token ID12132
Feature activation-1.19e-01

Pos logit contributions
ascript+0.039
Unit+0.037
anse+0.033
arius+0.033
dial+0.031
Neg logit contributions
\/\/-0.028
ivas-0.026
NESS-0.023
 equally-0.023
 pastry-0.022
Token to
Token ID284
Feature activation+3.83e-02

Pos logit contributions
\/\/+0.076
ivas+0.071
NESS+0.062
 equally+0.062
 pastry+0.062
Neg logit contributions
ascript-0.106
Unit-0.100
anse-0.091
arius-0.091
dial-0.086
Token judge
Token ID5052
Feature activation-1.88e-02

Pos logit contributions
ascript+0.023
Unit+0.022
anse+0.020
arius+0.020
dial+0.019
Neg logit contributions
\/\/-0.016
ivas-0.015
 equally-0.013
NESS-0.013
 pastry-0.013
Token you
Token ID345
Feature activation-3.59e-02

Pos logit contributions
\/\/+0.013
ivas+0.012
 equally+0.011
NESS+0.011
 pastry+0.011
Neg logit contributions
ascript-0.016
Unit-0.015
anse-0.013
arius-0.013
dial-0.013
Token for
Token ID329
Feature activation-6.40e-03

Pos logit contributions
\/\/+0.023
ivas+0.022
 equally+0.019
NESS+0.019
 pastry+0.019
Neg logit contributions
ascript-0.032
Unit-0.030
anse-0.027
arius-0.027
dial-0.026
Token playing
Token ID2712
Feature activation-1.39e-01

Pos logit contributions
\/\/+0.004
ivas+0.004
 equally+0.004
 pastry+0.004
 framing+0.004
Neg logit contributions
ascript-0.005
Unit-0.005
anse-0.005
arius-0.005
dial-0.004
Token some
Token ID617
Feature activation+1.30e-01

Pos logit contributions
\/\/+0.091
ivas+0.085
 equally+0.075
NESS+0.075
 pastry+0.075
Neg logit contributions
ascript-0.122
Unit-0.115
anse-0.104
arius-0.104
dial-0.098
Token Pokemon
Token ID14878
Feature activation-9.99e-02

Pos logit contributions
ascript+0.118
Unit+0.112
anse+0.101
arius+0.101
dial+0.095
Neg logit contributions
\/\/-0.082
ivas-0.076
NESS-0.067
 equally-0.067
 pastry-0.066
Token while
Token ID981
Feature activation-6.99e-02

Pos logit contributions
\/\/+0.066
ivas+0.061
NESS+0.054
 equally+0.054
 pastry+0.054
Neg logit contributions
ascript-0.087
Unit-0.083
anse-0.075
arius-0.074
dial-0.070
Token in
Token ID287
Feature activation-6.45e-02

Pos logit contributions
\/\/+0.045
ivas+0.042
NESS+0.037
 equally+0.037
 pastry+0.037
Neg logit contributions
ascript-0.062
Unit-0.059
anse-0.053
arius-0.053
dial-0.050
Token line
Token ID1627
Feature activation-1.03e-01

Pos logit contributions
\/\/+0.039
ivas+0.037
NESS+0.032
 equally+0.032
 pastry+0.032
Neg logit contributions
ascript-0.060
Unit-0.057
anse-0.051
arius-0.051
dial-0.049
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
\/\/+0.068
ivas+0.063
 equally+0.056
NESS+0.056
 pastry+0.055
Neg logit contributions
ascript-0.091
Unit-0.086
anse-0.078
arius-0.078
dial-0.073
Token In
Token ID554
Feature activation-6.63e-02

Pos logit contributions
\/\/+0.008
ivas+0.007
NESS+0.006
 equally+0.006
 pastry+0.006
Neg logit contributions
ascript-0.012
Unit-0.011
anse-0.010
arius-0.010
dial-0.009
Token ashamed
Token ID22461
Feature activation+1.03e-01

Pos logit contributions
\/\/+0.011
ivas+0.011
 equally+0.010
NESS+0.010
 pastry+0.010
Neg logit contributions
ascript-0.009
Unit-0.009
anse-0.008
arius-0.008
dial-0.007
Token of
Token ID286
Feature activation-3.01e-02

Pos logit contributions
ascript+0.093
Unit+0.088
anse+0.080
arius+0.080
dial+0.075
Neg logit contributions
\/\/-0.064
ivas-0.060
 equally-0.053
NESS-0.053
 pastry-0.052
Token in
Token ID287
Feature activation-4.08e-02

Pos logit contributions
\/\/+0.020
ivas+0.018
 equally+0.016
NESS+0.016
 pastry+0.016
Neg logit contributions
ascript-0.027
Unit-0.025
anse-0.023
arius-0.023
dial-0.021
Token a
Token ID257
Feature activation+2.13e-02

Pos logit contributions
\/\/+0.026
ivas+0.025
 equally+0.022
NESS+0.022
 framing+0.022
Neg logit contributions
ascript-0.036
Unit-0.034
anse-0.031
arius-0.031
dial-0.029
Token woman
Token ID2415
Feature activation+5.98e-02

Pos logit contributions
ascript+0.018
Unit+0.016
anse+0.015
arius+0.015
dial+0.014
Neg logit contributions
\/\/-0.015
ivas-0.014
 equally-0.013
 pastry-0.013
NESS-0.013
Token
Token ID447
Feature activation+8.29e-03

Pos logit contributions
ascript+0.049
Unit+0.046
anse+0.041
arius+0.041
dial+0.039
Neg logit contributions
\/\/-0.043
ivas-0.040
 equally-0.036
NESS-0.036
 pastry-0.036
Token
Token ID247
Feature activation-1.94e-02

Pos logit contributions
ascript+0.005
Unit+0.005
anse+0.004
arius+0.004
dial+0.004
Neg logit contributions
\/\/-0.007
ivas-0.007
 equally-0.006
NESS-0.006
 framing-0.006
Tokens
Token ID82
Feature activation+1.68e-02

Pos logit contributions
\/\/+0.015
ivas+0.014
NESS+0.013
 equally+0.013
 pastry+0.013
Neg logit contributions
ascript-0.015
Unit-0.014
anse-0.012
arius-0.012
dial-0.011
Token organization
Token ID4009
Feature activation+9.53e-02

Pos logit contributions
ascript+0.014
Unit+0.013
anse+0.012
arius+0.012
dial+0.011
Neg logit contributions
\/\/-0.012
ivas-0.011
 equally-0.010
NESS-0.010
 pastry-0.010
Token,
Token ID11
Feature activation+1.87e-02

Pos logit contributions
ascript+0.082
Unit+0.077
anse+0.070
arius+0.069
dial+0.065
Neg logit contributions
\/\/-0.065
ivas-0.060
 equally-0.054
NESS-0.054
 pastry-0.053
Token and
Token ID290
Feature activation+2.91e-02

Pos logit contributions
ascript+0.016
Unit+0.015
anse+0.013
arius+0.013
dial+0.013
Neg logit contributions
\/\/-0.013
ivas-0.012
 equally-0.011
NESS-0.011
 pastry-0.011
Token and
Token ID290
Feature activation-2.92e-02

Pos logit contributions
\/\/+0.030
ivas+0.028
NESS+0.025
 equally+0.025
 pastry+0.025
Neg logit contributions
ascript-0.037
Unit-0.035
anse-0.032
arius-0.032
dial-0.030
Token seven
Token ID3598
Feature activation+4.25e-02

Pos logit contributions
\/\/+0.020
ivas+0.018
 equally+0.017
NESS+0.016
 pastry+0.016
Neg logit contributions
ascript-0.025
Unit-0.024
anse-0.021
arius-0.021
dial-0.020
Token hundred
Token ID3470
Feature activation+1.83e-01

Pos logit contributions
ascript+0.035
Unit+0.033
anse+0.029
arius+0.029
dial+0.028
Neg logit contributions
\/\/-0.030
ivas-0.028
 equally-0.026
 pastry-0.025
NESS-0.025
Token were
Token ID547
Feature activation+4.26e-02

Pos logit contributions
ascript+0.154
Unit+0.144
anse+0.129
arius+0.129
dial+0.121
Neg logit contributions
\/\/-0.129
ivas-0.120
 equally-0.109
 pastry-0.108
NESS-0.108
Token be
Token ID307
Feature activation-1.09e-02

Pos logit contributions
ascript+0.032
Unit+0.030
anse+0.026
arius+0.026
dial+0.024
Neg logit contributions
\/\/-0.034
ivas-0.032
 equally-0.029
 pastry-0.029
 framing-0.029
Tokenheaded
Token ID15353
Feature activation-3.22e-01

Pos logit contributions
\/\/+0.003
ivas+0.003
 equally+0.002
NESS+0.002
 pastry+0.002
Neg logit contributions
ascript-0.014
Unit-0.013
anse-0.012
arius-0.012
dial-0.012
Token therein
Token ID27258
Feature activation-7.64e-02

Pos logit contributions
\/\/+0.210
ivas+0.195
 equally+0.177
 framing+0.173
NESS+0.173
Neg logit contributions
ascript-0.284
Unit-0.270
anse-0.244
arius-0.242
dial-0.228
Token.[
Token ID3693
Feature activation+1.32e-01

Pos logit contributions
\/\/+0.050
ivas+0.047
 equally+0.042
 framing+0.042
NESS+0.042
Neg logit contributions
ascript-0.067
Unit-0.063
anse-0.057
arius-0.057
dial-0.054
Token1
Token ID16
Feature activation+5.14e-02

Pos logit contributions
ascript+0.118
Unit+0.112
anse+0.102
arius+0.101
dial+0.096
Neg logit contributions
\/\/-0.085
ivas-0.079
NESS-0.070
 equally-0.070
 pastry-0.069
Token][
Token ID7131
Feature activation-1.26e-01

Pos logit contributions
ascript+0.053
Unit+0.050
anse+0.046
arius+0.046
dial+0.044
Neg logit contributions
\/\/-0.026
ivas-0.024
NESS-0.020
 equally-0.020
 pastry-0.020
Token27
Token ID1983
Feature activation-7.63e-03

Pos logit contributions
\/\/+0.077
ivas+0.071
NESS+0.063
 equally+0.063
 pastry+0.062
Neg logit contributions
ascript-0.116
Unit-0.110
anse-0.100
arius-0.100
dial-0.094
Token thought
Token ID1807
Feature activation+7.96e-02

Pos logit contributions
ascript+0.159
Unit+0.151
anse+0.136
arius+0.135
dial+0.128
Neg logit contributions
\/\/-0.122
ivas-0.114
 equally-0.101
NESS-0.101
 pastry-0.100
Token to
Token ID284
Feature activation+5.70e-02

Pos logit contributions
ascript+0.069
Unit+0.065
anse+0.059
arius+0.059
dial+0.055
Neg logit contributions
\/\/-0.053
ivas-0.050
NESS-0.044
 equally-0.044
 pastry-0.044
Token have
Token ID423
Feature activation+1.93e-03

Pos logit contributions
ascript+0.052
Unit+0.050
anse+0.045
arius+0.045
dial+0.042
Neg logit contributions
\/\/-0.035
ivas-0.033
NESS-0.029
 equally-0.028
 pastry-0.028
Token a
Token ID257
Feature activation+7.64e-02

Pos logit contributions
ascript+0.002
Unit+0.002
anse+0.001
arius+0.001
dial+0.001
Neg logit contributions
\/\/-0.001
ivas-0.001
NESS-0.001
 equally-0.001
 pastry-0.001
Token distinct
Token ID7310
Feature activation-8.61e-02

Pos logit contributions
ascript+0.065
Unit+0.061
anse+0.055
arius+0.055
dial+0.052
Neg logit contributions
\/\/-0.052
ivas-0.049
 equally-0.044
NESS-0.044
 pastry-0.043
Token microbiome
Token ID48465
Feature activation-3.50e-01

Pos logit contributions
\/\/+0.059
ivas+0.055
 equally+0.049
NESS+0.049
 pastry+0.048
Neg logit contributions
ascript-0.074
Unit-0.070
anse-0.063
arius-0.063
dial-0.059
Token across
Token ID1973
Feature activation-8.99e-02

Pos logit contributions
\/\/+0.233
ivas+0.217
NESS+0.192
 equally+0.192
 pastry+0.191
Neg logit contributions
ascript-0.306
Unit-0.289
anse-0.261
arius-0.260
dial-0.245
Token all
Token ID477
Feature activation+3.30e-02

Pos logit contributions
\/\/+0.061
ivas+0.057
NESS+0.051
 equally+0.051
 pastry+0.051
Neg logit contributions
ascript-0.076
Unit-0.073
anse-0.065
arius-0.065
dial-0.061
Token women
Token ID1466
Feature activation+9.19e-02

Pos logit contributions
ascript+0.027
Unit+0.026
anse+0.023
arius+0.023
dial+0.022
Neg logit contributions
\/\/-0.023
ivas-0.022
NESS-0.019
 equally-0.019
 pastry-0.019
Token was
Token ID373
Feature activation+6.28e-03

Pos logit contributions
ascript+0.079
Unit+0.075
anse+0.068
arius+0.068
dial+0.064
Neg logit contributions
\/\/-0.062
ivas-0.057
NESS-0.051
 pastry-0.051
 BALL-0.051
Token the
Token ID262
Feature activation+5.21e-02

Pos logit contributions
ascript+0.005
Unit+0.005
anse+0.005
arius+0.005
dial+0.004
Neg logit contributions
\/\/-0.004
ivas-0.004
NESS-0.004
 equally-0.003
 pastry-0.003
Token range
Token ID2837
Feature activation+2.49e-02

Pos logit contributions
\/\/+0.027
ivas+0.025
 equally+0.022
NESS+0.022
 pastry+0.022
Neg logit contributions
ascript-0.036
Unit-0.034
anse-0.031
arius-0.031
dial-0.029
Token at
Token ID379
Feature activation-1.22e-01

Pos logit contributions
ascript+0.021
Unit+0.020
anse+0.018
arius+0.018
dial+0.017
Neg logit contributions
\/\/-0.017
ivas-0.016
NESS-0.014
 equally-0.014
 pastry-0.014
Token will
Token ID481
Feature activation-6.83e-02

Pos logit contributions
\/\/+0.079
ivas+0.073
 equally+0.065
NESS+0.064
 framing+0.064
Neg logit contributions
ascript-0.109
Unit-0.103
anse-0.093
arius-0.093
dial-0.088
Token.
Token ID13
Feature activation-4.80e-02

Pos logit contributions
\/\/+0.047
ivas+0.044
NESS+0.039
 equally+0.039
 pastry+0.039
Neg logit contributions
ascript-0.058
Unit-0.054
anse-0.049
arius-0.049
dial-0.046
Token If
Token ID1002
Feature activation-6.13e-02

Pos logit contributions
\/\/+0.030
ivas+0.027
 equally+0.024
NESS+0.024
 pastry+0.024
Neg logit contributions
ascript-0.044
Unit-0.042
anse-0.038
arius-0.038
dial-0.036
Token no
Token ID645
Feature activation-3.21e-01

Pos logit contributions
\/\/+0.033
ivas+0.030
NESS+0.026
 equally+0.025
 pastry+0.025
Neg logit contributions
ascript-0.061
Unit-0.059
anse-0.054
arius-0.054
dial-0.051
Token direction
Token ID4571
Feature activation-2.79e-01

Pos logit contributions
\/\/+0.201
ivas+0.186
NESS+0.164
 equally+0.164
 pastry+0.163
Neg logit contributions
ascript-0.293
Unit-0.278
anse-0.252
arius-0.251
dial-0.237
Token is
Token ID318
Feature activation-7.31e-02

Pos logit contributions
\/\/+0.181
ivas+0.169
 equally+0.149
NESS+0.149
 pastry+0.148
Neg logit contributions
ascript-0.248
Unit-0.235
anse-0.212
arius-0.211
dial-0.199
Token held
Token ID2714
Feature activation+5.77e-03

Pos logit contributions
\/\/+0.052
ivas+0.049
 equally+0.044
NESS+0.044
 pastry+0.044
Neg logit contributions
ascript-0.060
Unit-0.057
anse-0.051
arius-0.050
dial-0.047
Token during
Token ID1141
Feature activation-6.65e-02

Pos logit contributions
ascript+0.005
Unit+0.005
anse+0.005
arius+0.005
dial+0.004
Neg logit contributions
\/\/-0.004
ivas-0.003
NESS-0.003
 equally-0.003
 pastry-0.003
Token the
Token ID262
Feature activation-4.98e-02

Pos logit contributions
\/\/+0.039
ivas+0.036
NESS+0.031
 equally+0.031
 pastry+0.031
Neg logit contributions
ascript-0.063
Unit-0.060
anse-0.055
arius-0.055
dial-0.052
Token have
Token ID423
Feature activation-7.92e-02

Pos logit contributions
\/\/+0.006
ivas+0.005
NESS+0.005
 equally+0.005
 pastry+0.005
Neg logit contributions
ascript-0.007
Unit-0.007
anse-0.006
arius-0.006
dial-0.006
Token been
Token ID587
Feature activation+6.74e-03

Pos logit contributions
\/\/+0.053
ivas+0.049
NESS+0.044
 equally+0.044
 pastry+0.043
Neg logit contributions
ascript-0.069
Unit-0.065
anse-0.059
arius-0.059
dial-0.055
Token changed
Token ID3421
Feature activation+8.30e-02

Pos logit contributions
ascript+0.005
Unit+0.005
anse+0.004
arius+0.004
dial+0.004
Neg logit contributions
\/\/-0.005
ivas-0.005
NESS-0.004
 equally-0.004
 pastry-0.004
Token for
Token ID329
Feature activation+3.00e-03

Pos logit contributions
ascript+0.073
Unit+0.070
arius+0.062
anse+0.062
dial+0.059
Neg logit contributions
\/\/-0.054
ivas-0.050
NESS-0.045
 pastry-0.044
 BALL-0.044
Token the
Token ID262
Feature activation-1.68e-02

Pos logit contributions
ascript+0.003
Unit+0.003
anse+0.002
arius+0.002
dial+0.002
Neg logit contributions
\/\/-0.002
ivas-0.002
 equally-0.002
NESS-0.002
 pastry-0.002
Token present
Token ID1944
Feature activation-2.78e-01

Pos logit contributions
\/\/+0.012
ivas+0.011
 equally+0.010
NESS+0.010
 pastry+0.010
Neg logit contributions
ascript-0.014
Unit-0.013
anse-0.011
arius-0.011
dial-0.011
Token article
Token ID2708
Feature activation-1.95e-02

Pos logit contributions
\/\/+0.197
ivas+0.185
 equally+0.166
NESS+0.165
 pastry+0.164
Neg logit contributions
ascript-0.230
Unit-0.217
anse-0.194
arius-0.193
dial-0.182
Token to
Token ID284
Feature activation+2.56e-02

Pos logit contributions
\/\/+0.013
ivas+0.012
NESS+0.011
 pastry+0.011
 equally+0.011
Neg logit contributions
ascript-0.017
Unit-0.016
anse-0.014
arius-0.014
dial-0.013
Token use
Token ID779
Feature activation-8.52e-03

Pos logit contributions
ascript+0.022
Unit+0.021
anse+0.019
arius+0.019
dial+0.018
Neg logit contributions
\/\/-0.017
ivas-0.016
NESS-0.014
 equally-0.014
 pastry-0.014
Token the
Token ID262
Feature activation-7.32e-03

Pos logit contributions
\/\/+0.006
ivas+0.005
NESS+0.005
 equally+0.005
 pastry+0.005
Neg logit contributions
ascript-0.007
Unit-0.007
anse-0.006
arius-0.006
dial-0.006
Token new
Token ID649
Feature activation+2.63e-03

Pos logit contributions
\/\/+0.005
ivas+0.005
 equally+0.004
NESS+0.004
 pastry+0.004
Neg logit contributions
ascript-0.006
Unit-0.006
anse-0.005
arius-0.005
dial-0.005
TokenV
Token ID53
Feature activation+2.18e-02

Pos logit contributions
\/\/+0.081
ivas+0.075
 equally+0.067
NESS+0.067
 pastry+0.066
Neg logit contributions
ascript-0.101
Unit-0.096
anse-0.086
arius-0.086
dial-0.081
TokenV
Token ID53
Feature activation+5.96e-02

Pos logit contributions
ascript+0.019
Unit+0.018
anse+0.016
arius+0.016
dial+0.015
Neg logit contributions
\/\/-0.015
ivas-0.014
 equally-0.012
NESS-0.012
 pastry-0.012
Tokenp
Token ID79
Feature activation-2.91e-01

Pos logit contributions
ascript+0.052
Unit+0.049
anse+0.044
arius+0.044
dial+0.042
Neg logit contributions
\/\/-0.040
ivas-0.037
 equally-0.033
NESS-0.033
 pastry-0.033
TokenDM
Token ID23127
Feature activation+5.14e-02

Pos logit contributions
\/\/+0.200
ivas+0.186
NESS+0.166
 equally+0.166
 pastry+0.165
Neg logit contributions
ascript-0.248
Unit-0.234
anse-0.211
arius-0.210
dial-0.197
Token2
Token ID17
Feature activation+4.74e-02

Pos logit contributions
ascript+0.045
Unit+0.042
anse+0.038
arius+0.038
dial+0.036
Neg logit contributions
\/\/-0.034
ivas-0.032
 equally-0.028
NESS-0.028
 pastry-0.028
Token
Token ID851
Feature activation-3.32e-01

Pos logit contributions
ascript+0.050
Unit+0.048
anse+0.044
arius+0.044
dial+0.042
Neg logit contributions
\/\/-0.022
ivas-0.020
NESS-0.017
 equally-0.017
 pastry-0.017
Token V
Token ID569
Feature activation+4.04e-02

Pos logit contributions
\/\/+0.218
ivas+0.203
 equally+0.180
NESS+0.180
 pastry+0.179
Neg logit contributions
ascript-0.292
Unit-0.276
anse-0.249
arius-0.248
dial-0.234
TokenINE
Token ID8881
Feature activation+1.55e-01

Pos logit contributions
ascript+0.034
Unit+0.032
anse+0.029
arius+0.029
dial+0.027
Neg logit contributions
\/\/-0.028
ivas-0.026
 equally-0.023
NESS-0.023
 pastry-0.023
Token:
Token ID25
Feature activation-3.55e-03

Pos logit contributions
ascript+0.127
Unit+0.120
anse+0.108
arius+0.107
dial+0.101
Neg logit contributions
\/\/-0.111
ivas-0.104
 equally-0.093
NESS-0.093
 pastry-0.092
Token Im
Token ID1846
Feature activation+8.29e-02

Pos logit contributions
\/\/+0.002
ivas+0.002
 equally+0.002
NESS+0.002
 pastry+0.002
Neg logit contributions
ascript-0.003
Unit-0.003
anse-0.003
arius-0.003
dial-0.002
TokenJustin
Token ID33229
Feature activation+4.41e-02

Pos logit contributions
ascript+0.071
Unit+0.067
anse+0.060
arius+0.060
dial+0.056
Neg logit contributions
\/\/-0.057
ivas-0.053
 equally-0.047
NESS-0.047
 pastry-0.047