TOP ACTIVATIONS
MAX = 0.561

wrote : Cl arity Gaming will also be releasing
regarding the future of Cl arity Gaming Cl arity Gaming will
of Cl arity Gaming Cl arity Gaming will also be releasing
regarding the future of Cl arity Gaming Good updates
regarding the future of Cl arity Gaming Cl arity Gaming will
wrote : Cl arity Gaming will also be releasing
of Cl arity Gaming Cl arity Gaming will also be releasing
regarding the future of Cl arity Gaming Good updates
knowledge and understanding about LGBT youth that takes account of their
safety of LGBT children , youth , and their families by
years made no distinction between youth and adult punishment or between
something I experienced in my youth , from a dysfunctional family
old to be in a youth facility , he should be
Jay Gr uden praised the tempo and level of play produced
was the right mix of youth and leadership that allowed the
, the strength in the youth , the struggle against repression
murder only makes sense as youth sentence . And because he
[ Table [ graph List [[ # , 1 ]] <
make an application before a youth court judge for the inmate
]] < -> graph List [[ # , 2 , i

MIN ACTIVATIONS
MIN = -0.607

grad s will be better equipped to thrive in many different
pulse j ets . When equipped with a thrust augment er
R 78 70 Hawk , equipped with AMD 's 28 nm
. These Italian volunteers were equipped and trained by the British
, and they showed up equipped to do so with BB
it is , without further ado : one man
serving the fil thy ph allic idol of sod
. 75 million ( 16 th among running backs )
lawyer Mc Car thy , a native of County
. 75 million ( 35 th among guards )
of pharmaceutical s . Home opathy advocated a like
9 th to 12 th Grade , No Di ploma
] < met ros xual th ug > lol m 2
stops Kell Brook in 11 th round to win I BF
of Columbia Pictures and 20 th Century Fox NOT
and then generate a 5 th code tape by X OR
$ 6 million ( 15 th among running backs )
On Sunday , November 16 th , I woke up with
Less than 9 th Grade and
event also marks the 8 th annual Hemp History Week which

INTERVAL 0.269 - 0.561
CONTAINS 0.777%

ically issue informal guidance and policy memor anda intended
al ani ō pu u . E
to its post - H alo project by the cod ename
another character from the TM NT universe will be showing his
a prime suspect . Vill agers also reported that they saw

INTERVAL -0.023 - 0.269
CONTAINS 68.225%

ak shi temple , in this temple there are 7 musical
the protesters have a right to occupy public spaces " as
it would try to mitigate the impact on employees , suppliers
, Mom , this is a sh anty town
ne h aha Creek and Lynd ale Avenue goes before the

INTERVAL -0.315 - -0.023
CONTAINS 30.621%

all in good fun . <|endoftext|> Once
to female veterans and female reporters ). Asked about sexual harassment
that usefulness was good - in - it self . For
( when he discovered I was going to be writing an
, housed in what was then the N 64

INTERVAL -0.607 - -0.315
CONTAINS 0.378%

, wolf - wh ist ling or just ch or uses
Mr . Mor neau was repeatedly on the firing
off side in the 73 rd minute and Ireland also had
I m shocked , I m
from Mitch McConnell and Chuck Grassley would be the end result
Token wrote
Token ID2630
Feature activation+3.74e-02

Pos logit contributions
arity+0.128
 sage+0.121
wikipedia+0.119
idas+0.119
[[+0.118
Neg logit contributions
 equipped-0.134
 ado-0.129
thy-0.127
 BOX-0.126
opathy-0.124
Token:
Token ID25
Feature activation-5.18e-03

Pos logit contributions
arity+0.026
 sage+0.025
wikipedia+0.024
idas+0.024
[[+0.024
Neg logit contributions
 equipped-0.027
 ado-0.026
thy-0.025
 BOX-0.025
opathy-0.025
Token
Token ID198
Feature activation+2.30e-02

Pos logit contributions
 equipped+0.004
 ado+0.004
thy+0.004
 BOX+0.004
opathy+0.004
Neg logit contributions
arity-0.003
 sage-0.003
wikipedia-0.003
idas-0.003
[[-0.003
Token
Token ID198
Feature activation+2.04e-02

Pos logit contributions
arity+0.016
 sage+0.015
wikipedia+0.014
idas+0.014
[[+0.014
Neg logit contributions
 equipped-0.017
 ado-0.016
thy-0.016
 BOX-0.016
opathy-0.016
TokenCl
Token ID2601
Feature activation+2.04e-02

Pos logit contributions
arity+0.016
 sage+0.015
wikipedia+0.015
idas+0.015
[[+0.014
Neg logit contributions
 equipped-0.013
 ado-0.013
thy-0.013
 BOX-0.012
opathy-0.012
Tokenarity
Token ID6806
Feature activation+5.61e-01

Pos logit contributions
arity+0.015
 sage+0.014
wikipedia+0.014
idas+0.014
[[+0.014
Neg logit contributions
 equipped-0.014
 ado-0.013
thy-0.013
 BOX-0.013
opathy-0.013
Token Gaming
Token ID14426
Feature activation+1.86e-02

Pos logit contributions
arity+0.388
 sage+0.365
idas+0.359
wikipedia+0.359
[[+0.358
Neg logit contributions
 equipped-0.405
 ado-0.389
thy-0.383
 BOX-0.381
opathy-0.376
Token will
Token ID481
Feature activation+3.36e-02

Pos logit contributions
arity+0.013
 sage+0.012
wikipedia+0.012
idas+0.012
[[+0.012
Neg logit contributions
 equipped-0.013
 ado-0.013
thy-0.013
 BOX-0.013
opathy-0.012
Token also
Token ID635
Feature activation-9.32e-02

Pos logit contributions
arity+0.023
 sage+0.021
wikipedia+0.021
idas+0.021
[[+0.021
Neg logit contributions
 equipped-0.025
 ado-0.024
thy-0.023
 BOX-0.023
opathy-0.023
Token be
Token ID307
Feature activation+1.10e-01

Pos logit contributions
 equipped+0.070
 ado+0.067
thy+0.066
 BOX+0.065
opathy+0.065
Neg logit contributions
arity-0.062
 sage-0.059
wikipedia-0.058
idas-0.058
[[-0.057
Token releasing
Token ID13011
Feature activation+1.46e-01

Pos logit contributions
arity+0.073
 sage+0.068
idas+0.067
wikipedia+0.067
[[+0.067
Neg logit contributions
 equipped-0.082
 ado-0.079
thy-0.078
 BOX-0.078
opathy-0.077
Token regarding
Token ID5115
Feature activation-9.22e-02

Pos logit contributions
arity+0.029
 sage+0.027
wikipedia+0.027
idas+0.027
[[+0.027
Neg logit contributions
 equipped-0.032
 ado-0.031
thy-0.030
 BOX-0.030
opathy-0.030
Token the
Token ID262
Feature activation+1.09e-02

Pos logit contributions
 equipped+0.067
 ado+0.065
thy+0.064
 BOX+0.063
opathy+0.062
Neg logit contributions
arity-0.063
 sage-0.059
wikipedia-0.059
idas-0.058
[[-0.058
Token future
Token ID2003
Feature activation+1.55e-01

Pos logit contributions
arity+0.007
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.008
 ado-0.008
thy-0.008
 BOX-0.008
opathy-0.008
Token of
Token ID286
Feature activation+1.90e-02

Pos logit contributions
arity+0.104
 sage+0.098
wikipedia+0.097
idas+0.097
[[+0.096
Neg logit contributions
 equipped-0.115
 ado-0.111
thy-0.108
 BOX-0.108
opathy-0.107
Token Cl
Token ID1012
Feature activation+1.05e-01

Pos logit contributions
arity+0.013
 sage+0.012
wikipedia+0.012
idas+0.012
[[+0.012
Neg logit contributions
 equipped-0.014
 ado-0.014
thy-0.013
 BOX-0.013
opathy-0.013
Tokenarity
Token ID6806
Feature activation+5.55e-01

Pos logit contributions
arity+0.000
 sage-0.001
wikipedia-0.002
 Continued-0.002
idas-0.003
Neg logit contributions
 equipped-0.146
 ado-0.143
thy-0.141
 BOX-0.140
opathy-0.140
Token Gaming
Token ID14426
Feature activation+1.11e-02

Pos logit contributions
arity+0.386
 sage+0.363
idas+0.358
wikipedia+0.357
[[+0.356
Neg logit contributions
 equipped-0.399
 ado-0.384
thy-0.378
 BOX-0.375
opathy-0.371
Token Cl
Token ID1012
Feature activation+1.06e-01

Pos logit contributions
arity+0.008
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.008
 ado-0.008
thy-0.008
 BOX-0.008
opathy-0.007
Tokenarity
Token ID6806
Feature activation+5.53e-01

Pos logit contributions
arity+0.000
 sage-0.001
wikipedia-0.002
idas-0.003
 Continued-0.003
Neg logit contributions
 equipped-0.147
 ado-0.143
thy-0.142
opathy-0.141
 BOX-0.141
Token Gaming
Token ID14426
Feature activation+1.62e-02

Pos logit contributions
arity+0.387
 sage+0.364
idas+0.359
wikipedia+0.358
[[+0.357
Neg logit contributions
 equipped-0.395
 ado-0.380
thy-0.374
 BOX-0.371
opathy-0.367
Token will
Token ID481
Feature activation+3.03e-02

Pos logit contributions
arity+0.011
 sage+0.010
idas+0.010
wikipedia+0.010
[[+0.010
Neg logit contributions
 equipped-0.012
 ado-0.012
thy-0.011
 BOX-0.011
opathy-0.011
Token of
Token ID286
Feature activation+1.90e-02

Pos logit contributions
arity+0.104
 sage+0.098
wikipedia+0.097
idas+0.097
[[+0.096
Neg logit contributions
 equipped-0.115
 ado-0.111
thy-0.108
 BOX-0.108
opathy-0.107
Token Cl
Token ID1012
Feature activation+1.05e-01

Pos logit contributions
arity+0.013
 sage+0.012
wikipedia+0.012
idas+0.012
[[+0.012
Neg logit contributions
 equipped-0.014
 ado-0.014
thy-0.013
 BOX-0.013
opathy-0.013
Tokenarity
Token ID6806
Feature activation+5.55e-01

Pos logit contributions
arity+0.000
 sage-0.001
wikipedia-0.002
 Continued-0.002
idas-0.003
Neg logit contributions
 equipped-0.146
 ado-0.143
thy-0.141
 BOX-0.140
opathy-0.140
Token Gaming
Token ID14426
Feature activation+1.11e-02

Pos logit contributions
arity+0.386
 sage+0.363
idas+0.358
wikipedia+0.357
[[+0.356
Neg logit contributions
 equipped-0.399
 ado-0.384
thy-0.378
 BOX-0.375
opathy-0.371
Token Cl
Token ID1012
Feature activation+1.06e-01

Pos logit contributions
arity+0.008
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.008
 ado-0.008
thy-0.008
 BOX-0.008
opathy-0.007
Tokenarity
Token ID6806
Feature activation+5.53e-01

Pos logit contributions
arity+0.000
 sage-0.001
wikipedia-0.002
idas-0.003
 Continued-0.003
Neg logit contributions
 equipped-0.147
 ado-0.143
thy-0.142
opathy-0.141
 BOX-0.141
Token Gaming
Token ID14426
Feature activation+1.62e-02

Pos logit contributions
arity+0.387
 sage+0.364
idas+0.359
wikipedia+0.358
[[+0.357
Neg logit contributions
 equipped-0.395
 ado-0.380
thy-0.374
 BOX-0.371
opathy-0.367
Token will
Token ID481
Feature activation+3.03e-02

Pos logit contributions
arity+0.011
 sage+0.010
idas+0.010
wikipedia+0.010
[[+0.010
Neg logit contributions
 equipped-0.012
 ado-0.012
thy-0.011
 BOX-0.011
opathy-0.011
Token also
Token ID635
Feature activation-1.03e-01

Pos logit contributions
arity+0.025
 sage+0.023
idas+0.023
wikipedia+0.023
[[+0.023
Neg logit contributions
 equipped-0.018
 ado-0.017
thy-0.017
 BOX-0.017
opathy-0.017
Token be
Token ID307
Feature activation+9.54e-02

Pos logit contributions
 equipped+0.087
 ado+0.085
thy+0.083
 BOX+0.083
opathy+0.082
Neg logit contributions
arity-0.058
 sage-0.053
idas-0.052
wikipedia-0.052
[[-0.052
Token releasing
Token ID13011
Feature activation+1.44e-01

Pos logit contributions
arity+0.051
 sage+0.047
idas+0.047
wikipedia+0.046
[[+0.046
Neg logit contributions
 equipped-0.083
 ado-0.081
thy-0.080
 BOX-0.079
opathy-0.078
Token regarding
Token ID5115
Feature activation-9.48e-02

Pos logit contributions
arity+0.028
 sage+0.026
idas+0.026
wikipedia+0.026
[[+0.026
Neg logit contributions
 equipped-0.020
 ado-0.019
thy-0.018
 BOX-0.018
opathy-0.018
Token the
Token ID262
Feature activation+5.78e-04

Pos logit contributions
 equipped+0.068
 ado+0.065
thy+0.064
 BOX+0.063
opathy+0.063
Neg logit contributions
arity-0.066
 sage-0.063
wikipedia-0.062
idas-0.062
[[-0.061
Token future
Token ID2003
Feature activation+1.41e-01

Pos logit contributions
arity+0.000
 sage+0.000
idas+0.000
wikipedia+0.000
[[+0.000
Neg logit contributions
 equipped-0.001
 ado-0.000
thy-0.000
 BOX-0.000
opathy-0.000
Token of
Token ID286
Feature activation+9.59e-03

Pos logit contributions
arity+0.096
 sage+0.090
wikipedia+0.089
idas+0.088
[[+0.088
Neg logit contributions
 equipped-0.103
 ado-0.099
thy-0.098
 BOX-0.097
opathy-0.096
Token Cl
Token ID1012
Feature activation+1.04e-01

Pos logit contributions
arity+0.006
 sage+0.005
wikipedia+0.005
idas+0.005
[[+0.005
Neg logit contributions
 equipped-0.008
 ado-0.008
thy-0.008
 BOX-0.008
opathy-0.008
Tokenarity
Token ID6806
Feature activation+5.47e-01

Pos logit contributions
arity+0.000
 sage-0.001
wikipedia-0.003
 Continued-0.003
[[-0.003
Neg logit contributions
 equipped-0.144
 ado-0.141
thy-0.141
opathy-0.140
 BOX-0.139
Token Gaming
Token ID14426
Feature activation+4.00e-03

Pos logit contributions
arity+0.377
 sage+0.352
idas+0.348
[[+0.347
wikipedia+0.346
Neg logit contributions
 equipped-0.398
 ado-0.382
thy-0.378
 BOX-0.373
opathy-0.368
Token
Token ID198
Feature activation+1.08e-02

Pos logit contributions
arity+0.003
 sage+0.003
idas+0.003
wikipedia+0.003
[[+0.003
Neg logit contributions
 equipped-0.003
 ado-0.003
thy-0.003
 BOX-0.003
opathy-0.003
Token
Token ID198
Feature activation+1.00e-02

Pos logit contributions
arity+0.007
 sage+0.007
idas+0.007
wikipedia+0.007
[[+0.007
Neg logit contributions
 equipped-0.008
 ado-0.008
thy-0.007
 BOX-0.007
opathy-0.007
TokenGood
Token ID10248
Feature activation-1.68e-01

Pos logit contributions
arity+0.007
 sage+0.007
idas+0.007
wikipedia+0.007
[[+0.007
Neg logit contributions
 equipped-0.007
 ado-0.007
thy-0.006
 BOX-0.006
opathy-0.006
Token updates
Token ID5992
Feature activation+3.67e-02

Pos logit contributions
 equipped+0.128
 ado+0.123
thy+0.120
 BOX+0.120
opathy+0.118
Neg logit contributions
arity-0.111
 sage-0.104
idas-0.103
wikipedia-0.103
[[-0.102
Token regarding
Token ID5115
Feature activation-1.04e-01

Pos logit contributions
arity+0.025
 sage+0.023
wikipedia+0.023
idas+0.023
[[+0.023
Neg logit contributions
 equipped-0.016
 ado-0.015
thy-0.015
 BOX-0.015
opathy-0.015
Token the
Token ID262
Feature activation-2.51e-03

Pos logit contributions
 equipped+0.074
 ado+0.071
thy+0.070
 BOX+0.069
opathy+0.068
Neg logit contributions
arity-0.073
 sage-0.068
idas-0.067
wikipedia-0.067
[[-0.067
Token future
Token ID2003
Feature activation+1.44e-01

Pos logit contributions
 equipped+0.002
 ado+0.002
thy+0.002
 BOX+0.002
opathy+0.002
Neg logit contributions
arity-0.001
 sage-0.001
idas-0.001
wikipedia-0.001
[[-0.001
Token of
Token ID286
Feature activation+4.68e-03

Pos logit contributions
arity+0.099
 sage+0.093
wikipedia+0.091
idas+0.091
[[+0.091
Neg logit contributions
 equipped-0.106
 ado-0.102
thy-0.100
 BOX-0.099
opathy-0.098
Token Cl
Token ID1012
Feature activation+9.27e-02

Pos logit contributions
arity+0.003
 sage+0.003
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.004
 ado-0.004
thy-0.004
 BOX-0.004
opathy-0.004
Tokenarity
Token ID6806
Feature activation+5.45e-01

Pos logit contributions
arity+0.000
 sage-0.000
 Continued-0.001
wikipedia-0.001
[[-0.002
Neg logit contributions
 equipped-0.127
 ado-0.125
thy-0.124
opathy-0.123
 BOX-0.122
Token Gaming
Token ID14426
Feature activation+5.22e-03

Pos logit contributions
arity+0.379
 sage+0.355
idas+0.351
wikipedia+0.350
[[+0.349
Neg logit contributions
 equipped-0.393
 ado-0.377
thy-0.372
 BOX-0.368
opathy-0.363
Token Cl
Token ID1012
Feature activation+9.25e-02

Pos logit contributions
arity+0.003
 sage+0.003
idas+0.003
wikipedia+0.003
[[+0.003
Neg logit contributions
 equipped-0.004
 ado-0.004
thy-0.004
 BOX-0.004
opathy-0.004
Tokenarity
Token ID6806
Feature activation+5.44e-01

Pos logit contributions
arity+0.000
 sage-0.001
wikipedia-0.002
 Continued-0.002
idas-0.002
Neg logit contributions
 equipped-0.128
 ado-0.125
thy-0.124
opathy-0.124
 BOX-0.123
Token Gaming
Token ID14426
Feature activation+3.15e-03

Pos logit contributions
arity+0.379
 sage+0.356
idas+0.351
wikipedia+0.350
[[+0.349
Neg logit contributions
 equipped-0.391
 ado-0.376
thy-0.371
 BOX-0.367
opathy-0.362
Token will
Token ID481
Feature activation+1.56e-02

Pos logit contributions
arity+0.002
 sage+0.002
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.002
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Token wrote
Token ID2630
Feature activation+8.26e-03

Pos logit contributions
arity+0.108
 sage+0.101
idas+0.100
wikipedia+0.099
[[+0.099
Neg logit contributions
 equipped-0.120
 ado-0.115
thy-0.113
 BOX-0.113
opathy-0.111
Token:
Token ID25
Feature activation-2.85e-02

Pos logit contributions
arity+0.006
 sage+0.005
wikipedia+0.005
idas+0.005
[[+0.005
Neg logit contributions
 equipped-0.006
 ado-0.006
thy-0.006
 BOX-0.006
opathy-0.005
Token
Token ID198
Feature activation-5.18e-03

Pos logit contributions
 equipped+0.021
 ado+0.020
thy+0.020
 BOX+0.020
opathy+0.020
Neg logit contributions
arity-0.019
 sage-0.018
idas-0.018
wikipedia-0.018
[[-0.018
Token
Token ID198
Feature activation+1.01e-03

Pos logit contributions
 equipped+0.004
 ado+0.004
thy+0.004
 BOX+0.004
opathy+0.004
Neg logit contributions
arity-0.003
 sage-0.003
idas-0.003
wikipedia-0.003
[[-0.003
TokenCl
Token ID2601
Feature activation+7.61e-04

Pos logit contributions
arity+0.001
 sage+0.001
idas+0.001
wikipedia+0.001
[[+0.001
Neg logit contributions
 equipped-0.001
 ado-0.001
thy-0.001
 BOX-0.001
opathy-0.001
Tokenarity
Token ID6806
Feature activation+5.44e-01

Pos logit contributions
arity+0.000
 sage-0.000
wikipedia-0.000
idas-0.000
 Continued-0.000
Neg logit contributions
 equipped-0.001
 ado-0.001
thy-0.001
opathy-0.001
 BOX-0.001
Token Gaming
Token ID14426
Feature activation-3.97e-03

Pos logit contributions
arity+0.380
 sage+0.356
idas+0.352
wikipedia+0.351
[[+0.350
Neg logit contributions
 equipped-0.390
 ado-0.375
thy-0.370
 BOX-0.366
opathy-0.362
Token will
Token ID481
Feature activation+8.17e-03

Pos logit contributions
 equipped+0.003
 ado+0.003
thy+0.003
 BOX+0.003
opathy+0.003
Neg logit contributions
arity-0.003
 sage-0.003
idas-0.002
wikipedia-0.002
[[-0.002
Token also
Token ID635
Feature activation-1.13e-01

Pos logit contributions
arity+0.007
 sage+0.006
idas+0.006
wikipedia+0.006
[[+0.006
Neg logit contributions
 equipped-0.005
 ado-0.005
thy-0.005
 BOX-0.004
opathy-0.004
Token be
Token ID307
Feature activation+8.15e-02

Pos logit contributions
 equipped+0.096
 ado+0.093
thy+0.091
 BOX+0.091
opathy+0.090
Neg logit contributions
arity-0.064
 sage-0.059
idas-0.058
wikipedia-0.058
[[-0.058
Token releasing
Token ID13011
Feature activation+1.37e-01

Pos logit contributions
arity+0.045
 sage+0.041
idas+0.041
wikipedia+0.041
[[+0.041
Neg logit contributions
 equipped-0.070
 ado-0.068
thy-0.067
 BOX-0.067
opathy-0.066
Token of
Token ID286
Feature activation+4.68e-03

Pos logit contributions
arity+0.099
 sage+0.093
wikipedia+0.091
idas+0.091
[[+0.091
Neg logit contributions
 equipped-0.106
 ado-0.102
thy-0.100
 BOX-0.099
opathy-0.098
Token Cl
Token ID1012
Feature activation+9.27e-02

Pos logit contributions
arity+0.003
 sage+0.003
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.004
 ado-0.004
thy-0.004
 BOX-0.004
opathy-0.004
Tokenarity
Token ID6806
Feature activation+5.45e-01

Pos logit contributions
arity+0.000
 sage-0.000
 Continued-0.001
wikipedia-0.001
[[-0.002
Neg logit contributions
 equipped-0.127
 ado-0.125
thy-0.124
opathy-0.123
 BOX-0.122
Token Gaming
Token ID14426
Feature activation+5.22e-03

Pos logit contributions
arity+0.379
 sage+0.355
idas+0.351
wikipedia+0.350
[[+0.349
Neg logit contributions
 equipped-0.393
 ado-0.377
thy-0.372
 BOX-0.368
opathy-0.363
Token Cl
Token ID1012
Feature activation+9.25e-02

Pos logit contributions
arity+0.003
 sage+0.003
idas+0.003
wikipedia+0.003
[[+0.003
Neg logit contributions
 equipped-0.004
 ado-0.004
thy-0.004
 BOX-0.004
opathy-0.004
Tokenarity
Token ID6806
Feature activation+5.44e-01

Pos logit contributions
arity+0.000
 sage-0.001
wikipedia-0.002
 Continued-0.002
idas-0.002
Neg logit contributions
 equipped-0.128
 ado-0.125
thy-0.124
opathy-0.124
 BOX-0.123
Token Gaming
Token ID14426
Feature activation+3.15e-03

Pos logit contributions
arity+0.379
 sage+0.356
idas+0.351
wikipedia+0.350
[[+0.349
Neg logit contributions
 equipped-0.391
 ado-0.376
thy-0.371
 BOX-0.367
opathy-0.362
Token will
Token ID481
Feature activation+1.56e-02

Pos logit contributions
arity+0.002
 sage+0.002
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.002
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Token also
Token ID635
Feature activation-1.02e-01

Pos logit contributions
arity+0.013
 sage+0.012
wikipedia+0.012
idas+0.012
[[+0.012
Neg logit contributions
 equipped-0.009
 ado-0.009
thy-0.009
 BOX-0.009
opathy-0.009
Token be
Token ID307
Feature activation+8.61e-02

Pos logit contributions
 equipped+0.087
 ado+0.084
thy+0.083
 BOX+0.083
opathy+0.082
Neg logit contributions
arity-0.058
 sage-0.054
idas-0.053
wikipedia-0.053
[[-0.053
Token releasing
Token ID13011
Feature activation+1.36e-01

Pos logit contributions
arity+0.047
 sage+0.044
idas+0.043
wikipedia+0.043
[[+0.043
Neg logit contributions
 equipped-0.075
 ado-0.072
thy-0.071
 BOX-0.071
opathy-0.070
Token regarding
Token ID5115
Feature activation-1.02e-01

Pos logit contributions
arity+0.023
 sage+0.022
wikipedia+0.021
idas+0.021
[[+0.021
Neg logit contributions
 equipped-0.015
 ado-0.015
thy-0.014
 BOX-0.014
opathy-0.014
Token the
Token ID262
Feature activation-2.45e-03

Pos logit contributions
 equipped+0.073
 ado+0.070
thy+0.069
 BOX+0.069
opathy+0.068
Neg logit contributions
arity-0.071
 sage-0.067
idas-0.066
wikipedia-0.066
[[-0.066
Token future
Token ID2003
Feature activation+1.41e-01

Pos logit contributions
 equipped+0.002
 ado+0.002
thy+0.002
 BOX+0.002
opathy+0.002
Neg logit contributions
arity-0.001
 sage-0.001
idas-0.001
wikipedia-0.001
[[-0.001
Token of
Token ID286
Feature activation-2.31e-04

Pos logit contributions
arity+0.096
 sage+0.090
wikipedia+0.089
idas+0.089
[[+0.088
Neg logit contributions
 equipped-0.104
 ado-0.100
thy-0.098
 BOX-0.098
opathy-0.097
Token Cl
Token ID1012
Feature activation+9.16e-02

Pos logit contributions
 equipped+0.000
 ado+0.000
thy+0.000
 BOX+0.000
opathy+0.000
Neg logit contributions
arity-0.000
 sage-0.000
idas-0.000
wikipedia-0.000
[[-0.000
Tokenarity
Token ID6806
Feature activation+5.44e-01

Pos logit contributions
arity+0.000
 sage-0.000
 Continued-0.001
wikipedia-0.001
[[-0.002
Neg logit contributions
 equipped-0.126
 ado-0.123
thy-0.123
opathy-0.122
 BOX-0.121
Token Gaming
Token ID14426
Feature activation+6.31e-03

Pos logit contributions
arity+0.379
 sage+0.356
idas+0.351
wikipedia+0.350
[[+0.349
Neg logit contributions
 equipped-0.391
 ado-0.375
thy-0.370
 BOX-0.366
opathy-0.362
Token
Token ID198
Feature activation-2.89e-04

Pos logit contributions
arity+0.004
 sage+0.004
idas+0.004
wikipedia+0.004
[[+0.004
Neg logit contributions
 equipped-0.005
 ado-0.005
thy-0.005
 BOX-0.004
opathy-0.004
Token
Token ID198
Feature activation+1.07e-03

Pos logit contributions
 equipped+0.000
 ado+0.000
thy+0.000
 BOX+0.000
opathy+0.000
Neg logit contributions
arity-0.000
 sage-0.000
wikipedia-0.000
idas-0.000
[[-0.000
TokenGood
Token ID10248
Feature activation-1.81e-01

Pos logit contributions
arity+0.001
 sage+0.001
idas+0.001
wikipedia+0.001
[[+0.001
Neg logit contributions
 equipped-0.001
 ado-0.001
thy-0.001
 BOX-0.001
opathy-0.001
Token updates
Token ID5992
Feature activation+2.19e-02

Pos logit contributions
 equipped+0.132
 ado+0.127
thy+0.125
 BOX+0.124
opathy+0.122
Neg logit contributions
arity-0.123
 sage-0.116
wikipedia-0.115
idas-0.115
[[-0.114
Token knowledge
Token ID3725
Feature activation+1.14e-01

Pos logit contributions
arity+0.043
 sage+0.040
wikipedia+0.040
idas+0.040
[[+0.039
Neg logit contributions
 equipped-0.048
 ado-0.046
thy-0.045
 BOX-0.045
opathy-0.044
Token and
Token ID290
Feature activation-4.14e-03

Pos logit contributions
arity+0.079
 sage+0.075
wikipedia+0.074
idas+0.073
[[+0.073
Neg logit contributions
 equipped-0.083
 ado-0.079
thy-0.078
 BOX-0.077
opathy-0.077
Token understanding
Token ID4547
Feature activation+6.70e-03

Pos logit contributions
 equipped+0.003
 ado+0.003
thy+0.003
 BOX+0.003
opathy+0.003
Neg logit contributions
arity-0.003
 sage-0.002
wikipedia-0.002
idas-0.002
[[-0.002
Token about
Token ID546
Feature activation-3.21e-02

Pos logit contributions
arity+0.005
 sage+0.004
wikipedia+0.004
idas+0.004
[[+0.004
Neg logit contributions
 equipped-0.005
 ado-0.005
thy-0.004
 BOX-0.004
opathy-0.004
Token LGBT
Token ID9916
Feature activation+7.48e-02

Pos logit contributions
 equipped+0.024
 ado+0.023
thy+0.023
 BOX+0.023
opathy+0.022
Neg logit contributions
arity-0.021
 sage-0.020
wikipedia-0.020
idas-0.020
[[-0.020
Token youth
Token ID6205
Feature activation+5.24e-01

Pos logit contributions
arity+0.025
 sage+0.022
wikipedia+0.022
idas+0.021
[[+0.021
Neg logit contributions
 equipped-0.081
 ado-0.078
thy-0.078
 BOX-0.077
opathy-0.077
Token that
Token ID326
Feature activation+1.65e-02

Pos logit contributions
arity+0.364
 sage+0.342
idas+0.337
wikipedia+0.336
[[+0.336
Neg logit contributions
 equipped-0.376
 ado-0.363
thy-0.356
 BOX-0.355
opathy-0.349
Token takes
Token ID2753
Feature activation+9.50e-02

Pos logit contributions
arity+0.011
 sage+0.011
wikipedia+0.010
idas+0.010
[[+0.010
Neg logit contributions
 equipped-0.012
 ado-0.012
thy-0.012
 BOX-0.011
opathy-0.011
Token account
Token ID1848
Feature activation+2.84e-02

Pos logit contributions
arity+0.062
 sage+0.058
wikipedia+0.058
idas+0.057
[[+0.057
Neg logit contributions
 equipped-0.073
 ado-0.069
thy-0.068
 BOX-0.068
opathy-0.067
Token of
Token ID286
Feature activation+2.16e-02

Pos logit contributions
arity+0.019
 sage+0.018
wikipedia+0.018
idas+0.018
[[+0.018
Neg logit contributions
 equipped-0.021
 ado-0.020
thy-0.020
 BOX-0.020
opathy-0.019
Token their
Token ID511
Feature activation-1.66e-02

Pos logit contributions
arity+0.015
 sage+0.014
wikipedia+0.014
idas+0.014
[[+0.014
Neg logit contributions
 equipped-0.016
 ado-0.015
thy-0.015
 BOX-0.015
opathy-0.014
Token safety
Token ID3747
Feature activation+1.88e-01

Pos logit contributions
arity+0.001
 sage+0.001
wikipedia+0.001
idas+0.001
[[+0.001
Neg logit contributions
 equipped-0.002
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Token of
Token ID286
Feature activation+2.03e-02

Pos logit contributions
arity+0.128
 sage+0.120
wikipedia+0.118
idas+0.118
[[+0.117
Neg logit contributions
 equipped-0.138
 ado-0.132
thy-0.130
 BOX-0.130
opathy-0.128
Token LGBT
Token ID9916
Feature activation+8.06e-02

Pos logit contributions
arity+0.012
 sage+0.012
wikipedia+0.011
idas+0.011
[[+0.011
Neg logit contributions
 equipped-0.016
 ado-0.016
thy-0.016
 BOX-0.015
opathy-0.015
Token children
Token ID1751
Feature activation+5.96e-02

Pos logit contributions
arity+0.020
 sage+0.017
wikipedia+0.017
idas+0.016
[[+0.016
Neg logit contributions
 equipped-0.094
 ado-0.091
thy-0.091
 BOX-0.090
opathy-0.089
Token,
Token ID11
Feature activation+2.17e-02

Pos logit contributions
arity+0.042
 sage+0.040
idas+0.039
wikipedia+0.039
[[+0.039
Neg logit contributions
 equipped-0.042
 ado-0.041
thy-0.040
 BOX-0.040
opathy-0.039
Token youth
Token ID6205
Feature activation+5.23e-01

Pos logit contributions
arity+0.015
 sage+0.014
wikipedia+0.014
idas+0.014
[[+0.014
Neg logit contributions
 equipped-0.016
 ado-0.015
thy-0.015
 BOX-0.015
opathy-0.015
Token,
Token ID11
Feature activation+2.05e-02

Pos logit contributions
arity+0.360
 sage+0.339
wikipedia+0.335
idas+0.334
[[+0.332
Neg logit contributions
 equipped-0.380
 ado-0.364
thy-0.358
 BOX-0.356
opathy-0.352
Token and
Token ID290
Feature activation+1.60e-03

Pos logit contributions
arity+0.014
 sage+0.014
wikipedia+0.013
idas+0.013
[[+0.013
Neg logit contributions
 equipped-0.015
 ado-0.014
thy-0.014
 BOX-0.014
opathy-0.014
Token their
Token ID511
Feature activation-1.52e-02

Pos logit contributions
arity+0.001
 sage+0.001
idas+0.001
wikipedia+0.001
[[+0.001
Neg logit contributions
 equipped-0.001
 ado-0.001
thy-0.001
 BOX-0.001
opathy-0.001
Token families
Token ID4172
Feature activation-1.33e-01

Pos logit contributions
 equipped+0.008
 ado+0.008
thy+0.008
 BOX+0.008
opathy+0.008
Neg logit contributions
arity-0.013
 sage-0.012
idas-0.012
wikipedia-0.012
[[-0.012
Token by
Token ID416
Feature activation-3.29e-02

Pos logit contributions
 equipped+0.093
 ado+0.089
thy+0.088
 BOX+0.087
opathy+0.086
Neg logit contributions
arity-0.095
 sage-0.089
wikipedia-0.088
idas-0.088
[[-0.088
Token years
Token ID812
Feature activation-1.89e-03

Pos logit contributions
 equipped+0.022
 ado+0.021
thy+0.021
 BOX+0.021
opathy+0.020
Neg logit contributions
arity-0.021
 sage-0.020
idas-0.020
wikipedia-0.020
[[-0.020
Token made
Token ID925
Feature activation+1.48e-02

Pos logit contributions
 equipped+0.001
 ado+0.001
thy+0.001
 BOX+0.001
opathy+0.001
Neg logit contributions
arity-0.001
 sage-0.001
wikipedia-0.001
idas-0.001
[[-0.001
Token no
Token ID645
Feature activation-1.29e-01

Pos logit contributions
arity+0.010
 sage+0.010
idas+0.009
wikipedia+0.009
[[+0.009
Neg logit contributions
 equipped-0.011
 ado-0.010
thy-0.010
 BOX-0.010
opathy-0.010
Token distinction
Token ID12941
Feature activation-6.98e-02

Pos logit contributions
 equipped+0.101
 ado+0.097
thy+0.096
 BOX+0.095
opathy+0.094
Neg logit contributions
arity-0.082
 sage-0.077
wikipedia-0.075
idas-0.075
[[-0.075
Token between
Token ID1022
Feature activation+1.38e-02

Pos logit contributions
 equipped+0.051
 ado+0.049
thy+0.049
 BOX+0.048
opathy+0.048
Neg logit contributions
arity-0.047
 sage-0.045
wikipedia-0.044
idas-0.044
[[-0.044
Token youth
Token ID6205
Feature activation+5.22e-01

Pos logit contributions
arity+0.010
 sage+0.009
wikipedia+0.009
idas+0.009
[[+0.009
Neg logit contributions
 equipped-0.010
 ado-0.010
thy-0.009
 BOX-0.009
opathy-0.009
Token and
Token ID290
Feature activation-3.06e-03

Pos logit contributions
arity+0.359
 sage+0.338
wikipedia+0.333
idas+0.332
[[+0.331
Neg logit contributions
 equipped-0.380
 ado-0.365
thy-0.359
 BOX-0.357
opathy-0.353
Token adult
Token ID4044
Feature activation+9.27e-02

Pos logit contributions
 equipped+0.002
 ado+0.002
thy+0.002
 BOX+0.002
opathy+0.002
Neg logit contributions
arity-0.002
 sage-0.002
wikipedia-0.002
idas-0.002
[[-0.002
Token punishment
Token ID9837
Feature activation-3.50e-02

Pos logit contributions
arity+0.067
 sage+0.063
wikipedia+0.062
idas+0.062
[[+0.062
Neg logit contributions
 equipped-0.065
 ado-0.062
thy-0.060
 BOX-0.060
opathy-0.060
Token or
Token ID393
Feature activation+1.94e-01

Pos logit contributions
 equipped+0.025
 ado+0.025
thy+0.024
 BOX+0.024
opathy+0.024
Neg logit contributions
arity-0.024
 sage-0.023
idas-0.022
wikipedia-0.022
[[-0.022
Token between
Token ID1022
Feature activation+1.06e-02

Pos logit contributions
arity+0.136
 sage+0.128
wikipedia+0.127
idas+0.127
[[+0.126
Neg logit contributions
 equipped-0.139
 ado-0.133
thy-0.131
 BOX-0.130
opathy-0.128
Token something
Token ID1223
Feature activation+1.20e-01

Pos logit contributions
 equipped+0.025
 ado+0.024
thy+0.023
 BOX+0.023
opathy+0.023
Neg logit contributions
arity-0.025
 sage-0.024
wikipedia-0.024
idas-0.024
[[-0.024
Token I
Token ID314
Feature activation-5.91e-02

Pos logit contributions
arity+0.087
 sage+0.082
wikipedia+0.081
idas+0.081
[[+0.080
Neg logit contributions
 equipped-0.083
 ado-0.079
thy-0.078
 BOX-0.077
opathy-0.076
Token experienced
Token ID5924
Feature activation-4.76e-02

Pos logit contributions
 equipped+0.043
 ado+0.042
thy+0.041
 BOX+0.041
opathy+0.040
Neg logit contributions
arity-0.040
 sage-0.038
idas-0.037
wikipedia-0.037
[[-0.037
Token in
Token ID287
Feature activation+2.58e-02

Pos logit contributions
 equipped+0.034
 ado+0.033
thy+0.033
 BOX+0.032
opathy+0.032
Neg logit contributions
arity-0.033
 sage-0.031
idas-0.030
wikipedia-0.030
[[-0.030
Token my
Token ID616
Feature activation-4.23e-02

Pos logit contributions
arity+0.018
 sage+0.017
idas+0.016
wikipedia+0.016
[[+0.016
Neg logit contributions
 equipped-0.019
 ado-0.018
thy-0.018
 BOX-0.018
opathy-0.018
Token youth
Token ID6205
Feature activation+5.19e-01

Pos logit contributions
 equipped+0.035
 ado+0.034
thy+0.034
 BOX+0.033
opathy+0.033
Neg logit contributions
arity-0.025
 sage-0.023
idas-0.022
wikipedia-0.022
[[-0.022
Token,
Token ID11
Feature activation+1.11e-02

Pos logit contributions
arity+0.355
 sage+0.334
idas+0.329
wikipedia+0.328
[[+0.327
Neg logit contributions
 equipped-0.379
 ado-0.365
thy-0.358
 BOX-0.357
opathy-0.352
Token from
Token ID422
Feature activation-2.46e-02

Pos logit contributions
arity+0.008
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.008
 ado-0.008
thy-0.008
 BOX-0.008
opathy-0.008
Token a
Token ID257
Feature activation+1.07e-02

Pos logit contributions
 equipped+0.018
 ado+0.017
thy+0.017
 BOX+0.017
opathy+0.016
Neg logit contributions
arity-0.017
 sage-0.016
wikipedia-0.016
idas-0.016
[[-0.016
Token dysfunctional
Token ID38127
Feature activation-1.94e-01

Pos logit contributions
arity+0.008
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.008
 ado-0.007
thy-0.007
 BOX-0.007
opathy-0.007
Token family
Token ID1641
Feature activation-1.23e-01

Pos logit contributions
 equipped+0.138
 ado+0.132
thy+0.129
 BOX+0.129
opathy+0.127
Neg logit contributions
arity-0.137
 sage-0.129
wikipedia-0.128
idas-0.128
[[-0.127
Token old
Token ID1468
Feature activation+2.14e-01

Pos logit contributions
arity+0.013
 sage+0.011
wikipedia+0.011
idas+0.011
[[+0.011
Neg logit contributions
 equipped-0.027
 ado-0.027
thy-0.026
 BOX-0.026
opathy-0.026
Token to
Token ID284
Feature activation+3.40e-03

Pos logit contributions
arity+0.145
 sage+0.137
idas+0.135
wikipedia+0.134
[[+0.134
Neg logit contributions
 equipped-0.157
 ado-0.151
thy-0.149
 BOX-0.148
opathy-0.146
Token be
Token ID307
Feature activation+9.73e-02

Pos logit contributions
arity+0.002
 sage+0.002
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.002
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Token in
Token ID287
Feature activation+2.79e-02

Pos logit contributions
arity+0.068
 sage+0.063
wikipedia+0.063
idas+0.063
[[+0.063
Neg logit contributions
 equipped-0.071
 ado-0.067
 BOX-0.066
thy-0.066
opathy-0.065
Token a
Token ID257
Feature activation+1.04e-02

Pos logit contributions
arity+0.018
 sage+0.017
wikipedia+0.017
idas+0.017
[[+0.017
Neg logit contributions
 equipped-0.021
 ado-0.020
thy-0.020
 BOX-0.020
opathy-0.020
Token youth
Token ID6205
Feature activation+5.19e-01

Pos logit contributions
arity+0.007
 sage+0.006
wikipedia+0.006
idas+0.006
[[+0.006
Neg logit contributions
 equipped-0.008
 ado-0.008
thy-0.007
 BOX-0.007
opathy-0.007
Token facility
Token ID6841
Feature activation-1.21e-01

Pos logit contributions
arity+0.348
 sage+0.327
wikipedia+0.322
idas+0.322
[[+0.320
Neg logit contributions
 equipped-0.387
 ado-0.372
thy-0.365
 BOX-0.364
opathy-0.359
Token,
Token ID11
Feature activation+1.39e-02

Pos logit contributions
 equipped+0.089
 ado+0.085
thy+0.084
 BOX+0.084
opathy+0.082
Neg logit contributions
arity-0.082
 sage-0.077
wikipedia-0.076
idas-0.076
[[-0.075
Token he
Token ID339
Feature activation+8.02e-02

Pos logit contributions
arity+0.009
 sage+0.009
idas+0.009
wikipedia+0.009
[[+0.009
Neg logit contributions
 equipped-0.010
 ado-0.010
thy-0.010
 BOX-0.010
opathy-0.010
Token should
Token ID815
Feature activation-2.45e-02

Pos logit contributions
arity+0.054
 sage+0.051
wikipedia+0.050
idas+0.050
[[+0.050
Neg logit contributions
 equipped-0.059
 ado-0.057
thy-0.056
 BOX-0.056
opathy-0.055
Token be
Token ID307
Feature activation+9.65e-02

Pos logit contributions
 equipped+0.019
 ado+0.018
thy+0.018
 BOX+0.018
opathy+0.018
Neg logit contributions
arity-0.016
 sage-0.015
idas-0.014
wikipedia-0.014
[[-0.014
Token Jay
Token ID9180
Feature activation+5.48e-02

Pos logit contributions
arity+0.024
 sage+0.023
idas+0.022
wikipedia+0.022
[[+0.022
Neg logit contributions
 equipped-0.032
 ado-0.031
thy-0.030
 BOX-0.030
opathy-0.030
Token Gr
Token ID1902
Feature activation-4.55e-02

Pos logit contributions
arity+0.042
 sage+0.040
idas+0.040
[[+0.040
wikipedia+0.040
Neg logit contributions
 equipped-0.035
 ado-0.034
thy-0.033
 BOX-0.033
opathy-0.032
Tokenuden
Token ID44452
Feature activation+5.58e-02

Pos logit contributions
 equipped+0.029
thy+0.029
th+0.028
 ado+0.028
 BOX+0.028
Neg logit contributions
 sage-0.033
wikipedia-0.033
arity-0.033
 Continued-0.033
TPS-0.032
Token praised
Token ID15342
Feature activation+7.52e-02

Pos logit contributions
arity+0.037
 sage+0.034
wikipedia+0.034
idas+0.034
[[+0.034
Neg logit contributions
 equipped-0.042
 ado-0.041
thy-0.040
 BOX-0.040
opathy-0.039
Token the
Token ID262
Feature activation+1.50e-02

Pos logit contributions
arity+0.052
 sage+0.048
wikipedia+0.048
idas+0.048
[[+0.047
Neg logit contributions
 equipped-0.055
 ado-0.053
thy-0.052
 BOX-0.052
opathy-0.051
Token tempo
Token ID28691
Feature activation+5.19e-01

Pos logit contributions
arity+0.010
 sage+0.010
wikipedia+0.010
idas+0.010
[[+0.010
Neg logit contributions
 equipped-0.011
 ado-0.010
thy-0.010
 BOX-0.010
opathy-0.010
Token and
Token ID290
Feature activation+8.75e-04

Pos logit contributions
arity+0.358
 sage+0.337
wikipedia+0.333
idas+0.332
[[+0.331
Neg logit contributions
 equipped-0.377
 ado-0.361
thy-0.355
 BOX-0.353
opathy-0.349
Token level
Token ID1241
Feature activation+1.86e-02

Pos logit contributions
arity+0.001
 sage+0.001
wikipedia+0.001
idas+0.001
[[+0.001
Neg logit contributions
 equipped-0.001
 ado-0.001
thy-0.001
 BOX-0.001
opathy-0.001
Token of
Token ID286
Feature activation+1.64e-02

Pos logit contributions
arity+0.013
 sage+0.012
wikipedia+0.012
idas+0.012
[[+0.012
Neg logit contributions
 equipped-0.014
 ado-0.013
thy-0.013
 BOX-0.013
opathy-0.013
Token play
Token ID711
Feature activation-1.76e-02

Pos logit contributions
arity+0.011
 sage+0.010
wikipedia+0.010
idas+0.010
[[+0.010
Neg logit contributions
 equipped-0.012
 ado-0.012
thy-0.012
 BOX-0.012
opathy-0.011
Token produced
Token ID4635
Feature activation+5.02e-02

Pos logit contributions
 equipped+0.012
 ado+0.012
thy+0.012
 BOX+0.011
opathy+0.011
Neg logit contributions
arity-0.013
 sage-0.012
wikipedia-0.012
idas-0.012
[[-0.012
Token was
Token ID373
Feature activation-4.53e-02

Pos logit contributions
arity+0.005
 sage+0.005
wikipedia+0.005
idas+0.005
[[+0.005
Neg logit contributions
 equipped-0.005
 ado-0.005
thy-0.005
 BOX-0.005
opathy-0.005
Token the
Token ID262
Feature activation+1.22e-02

Pos logit contributions
 equipped+0.033
 ado+0.031
thy+0.031
 BOX+0.031
opathy+0.030
Neg logit contributions
arity-0.032
 sage-0.030
idas-0.029
wikipedia-0.029
[[-0.029
Token right
Token ID826
Feature activation-8.38e-02

Pos logit contributions
arity+0.009
 sage+0.008
wikipedia+0.008
idas+0.008
[[+0.008
Neg logit contributions
 equipped-0.008
 ado-0.008
thy-0.008
 BOX-0.008
opathy-0.008
Token mix
Token ID5022
Feature activation+1.57e-01

Pos logit contributions
 equipped+0.064
 ado+0.061
thy+0.060
 BOX+0.060
opathy+0.059
Neg logit contributions
arity-0.055
 sage-0.052
wikipedia-0.051
idas-0.051
[[-0.051
Token of
Token ID286
Feature activation+1.76e-02

Pos logit contributions
arity+0.106
 sage+0.100
wikipedia+0.099
idas+0.099
[[+0.098
Neg logit contributions
 equipped-0.116
 ado-0.111
thy-0.109
 BOX-0.109
opathy-0.107
Token youth
Token ID6205
Feature activation+5.19e-01

Pos logit contributions
arity+0.012
 sage+0.011
wikipedia+0.011
idas+0.011
[[+0.011
Neg logit contributions
 equipped-0.013
 ado-0.012
thy-0.012
 BOX-0.012
opathy-0.012
Token and
Token ID290
Feature activation-5.48e-03

Pos logit contributions
arity+0.356
 sage+0.334
idas+0.330
wikipedia+0.329
[[+0.328
Neg logit contributions
 equipped-0.378
 ado-0.364
thy-0.357
 BOX-0.355
opathy-0.351
Token leadership
Token ID5531
Feature activation+1.44e-01

Pos logit contributions
 equipped+0.004
 ado+0.004
thy+0.004
 BOX+0.004
opathy+0.004
Neg logit contributions
arity-0.004
 sage-0.003
wikipedia-0.003
idas-0.003
[[-0.003
Token that
Token ID326
Feature activation+1.63e-02

Pos logit contributions
arity+0.098
 sage+0.093
idas+0.091
wikipedia+0.091
[[+0.091
Neg logit contributions
 equipped-0.105
 ado-0.101
thy-0.100
 BOX-0.099
opathy-0.098
Token allowed
Token ID3142
Feature activation-1.19e-01

Pos logit contributions
arity+0.011
 sage+0.011
wikipedia+0.010
idas+0.010
[[+0.010
Neg logit contributions
 equipped-0.012
 ado-0.011
thy-0.011
 BOX-0.011
opathy-0.011
Token the
Token ID262
Feature activation+1.39e-02

Pos logit contributions
 equipped+0.087
 ado+0.084
thy+0.083
 BOX+0.082
opathy+0.081
Neg logit contributions
arity-0.081
 sage-0.076
idas-0.075
wikipedia-0.075
[[-0.075
Token,
Token ID11
Feature activation+2.02e-02

Pos logit contributions
 equipped+0.055
 ado+0.053
thy+0.052
 BOX+0.051
opathy+0.051
Neg logit contributions
arity-0.054
 sage-0.051
wikipedia-0.050
idas-0.050
[[-0.049
Token the
Token ID262
Feature activation+8.19e-03

Pos logit contributions
arity+0.014
 sage+0.013
idas+0.013
wikipedia+0.013
[[+0.013
Neg logit contributions
 equipped-0.015
 ado-0.014
 BOX-0.014
thy-0.014
opathy-0.014
Token strength
Token ID4202
Feature activation-2.25e-02

Pos logit contributions
arity+0.005
 sage+0.005
idas+0.005
wikipedia+0.005
[[+0.005
Neg logit contributions
 equipped-0.006
 ado-0.006
 BOX-0.006
thy-0.006
opathy-0.006
Token in
Token ID287
Feature activation+2.07e-02

Pos logit contributions
 equipped+0.016
 ado+0.016
thy+0.015
 BOX+0.015
opathy+0.015
Neg logit contributions
arity-0.015
 sage-0.015
idas-0.014
wikipedia-0.014
[[-0.014
Token the
Token ID262
Feature activation+6.99e-03

Pos logit contributions
arity+0.014
 sage+0.013
idas+0.013
wikipedia+0.013
[[+0.013
Neg logit contributions
 equipped-0.015
 ado-0.014
thy-0.014
 BOX-0.014
opathy-0.014
Token youth
Token ID6205
Feature activation+5.18e-01

Pos logit contributions
arity+0.005
 sage+0.004
idas+0.004
wikipedia+0.004
[[+0.004
Neg logit contributions
 equipped-0.005
 ado-0.005
 BOX-0.005
thy-0.005
opathy-0.005
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
arity+0.319
 sage+0.298
wikipedia+0.293
idas+0.293
[[+0.291
Neg logit contributions
 equipped-0.414
 ado-0.399
thy-0.393
 BOX-0.391
opathy-0.387
Token the
Token ID262
Feature activation+4.58e-03

Pos logit contributions
arity+0.008
 sage+0.007
idas+0.007
wikipedia+0.007
[[+0.007
Neg logit contributions
 equipped-0.009
 ado-0.008
thy-0.008
 BOX-0.008
opathy-0.008
Token struggle
Token ID6531
Feature activation+8.33e-02

Pos logit contributions
arity+0.003
 sage+0.003
idas+0.003
wikipedia+0.003
[[+0.003
Neg logit contributions
 equipped-0.003
 ado-0.003
 BOX-0.003
thy-0.003
opathy-0.003
Token against
Token ID1028
Feature activation+2.03e-01

Pos logit contributions
arity+0.051
 sage+0.048
wikipedia+0.047
idas+0.047
[[+0.047
Neg logit contributions
 equipped-0.067
 ado-0.064
thy-0.063
 BOX-0.063
opathy-0.062
Token repression
Token ID28279
Feature activation-9.95e-02

Pos logit contributions
arity+0.154
 sage+0.146
idas+0.144
wikipedia+0.144
[[+0.143
Neg logit contributions
 equipped-0.132
 ado-0.127
thy-0.124
 BOX-0.123
opathy-0.122
Token murder
Token ID5123
Feature activation-8.27e-02

Pos logit contributions
 equipped+0.096
 ado+0.091
thy+0.089
 BOX+0.089
opathy+0.088
Neg logit contributions
arity-0.114
 sage-0.108
wikipedia-0.107
idas-0.107
[[-0.106
Token only
Token ID691
Feature activation-1.02e-01

Pos logit contributions
 equipped+0.060
 ado+0.058
thy+0.057
 BOX+0.056
opathy+0.056
Neg logit contributions
arity-0.057
 sage-0.054
wikipedia-0.053
idas-0.053
[[-0.053
Token makes
Token ID1838
Feature activation+1.09e-02

Pos logit contributions
 equipped+0.074
 ado+0.071
thy+0.070
 BOX+0.069
opathy+0.068
Neg logit contributions
arity-0.070
 sage-0.066
wikipedia-0.065
idas-0.065
[[-0.065
Token sense
Token ID2565
Feature activation-1.20e-01

Pos logit contributions
arity+0.008
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.008
 ado-0.007
thy-0.007
 BOX-0.007
opathy-0.007
Token as
Token ID355
Feature activation+9.30e-02

Pos logit contributions
 equipped+0.084
 ado+0.081
thy+0.080
 BOX+0.079
opathy+0.078
Neg logit contributions
arity-0.086
 sage-0.081
wikipedia-0.080
idas-0.080
[[-0.080
Token youth
Token ID6205
Feature activation+5.18e-01

Pos logit contributions
arity+0.063
 sage+0.059
wikipedia+0.058
idas+0.058
[[+0.058
Neg logit contributions
 equipped-0.069
 ado-0.066
thy-0.065
 BOX-0.064
opathy-0.064
Token sentence
Token ID6827
Feature activation-3.89e-02

Pos logit contributions
arity+0.359
 sage+0.338
wikipedia+0.334
idas+0.333
[[+0.331
Neg logit contributions
 equipped-0.376
 ado-0.360
thy-0.352
 BOX-0.350
opathy-0.347
Token.
Token ID13
Feature activation+6.70e-03

Pos logit contributions
 equipped+0.028
 ado+0.027
thy+0.027
 BOX+0.027
opathy+0.026
Neg logit contributions
arity-0.027
 sage-0.025
wikipedia-0.025
idas-0.025
[[-0.025
Token And
Token ID843
Feature activation-7.60e-02

Pos logit contributions
arity+0.005
 sage+0.004
idas+0.004
wikipedia+0.004
[[+0.004
Neg logit contributions
 equipped-0.005
 ado-0.005
thy-0.005
 BOX-0.005
opathy-0.004
Token because
Token ID780
Feature activation+9.61e-02

Pos logit contributions
 equipped+0.053
 ado+0.051
thy+0.050
 BOX+0.049
opathy+0.049
Neg logit contributions
arity-0.055
 sage-0.051
wikipedia-0.051
idas-0.051
[[-0.050
Token he
Token ID339
Feature activation+7.12e-02

Pos logit contributions
arity+0.066
 sage+0.062
idas+0.061
wikipedia+0.061
[[+0.061
Neg logit contributions
 equipped-0.070
 ado-0.067
thy-0.066
 BOX-0.066
opathy-0.065
Token[
Token ID58
Feature activation+2.09e-01

Pos logit contributions
 equipped+0.025
 ado+0.024
thy+0.024
 BOX+0.024
opathy+0.024
Neg logit contributions
arity-0.012
 sage-0.011
wikipedia-0.010
idas-0.010
[[-0.010
TokenTable
Token ID10962
Feature activation+6.62e-02

Pos logit contributions
arity+0.132
 sage+0.124
wikipedia+0.122
idas+0.122
[[+0.121
Neg logit contributions
 equipped-0.163
 ado-0.157
thy-0.154
 BOX-0.153
opathy-0.151
Token[
Token ID58
Feature activation+2.02e-01

Pos logit contributions
arity+0.030
 sage+0.028
wikipedia+0.027
idas+0.027
[[+0.026
Neg logit contributions
 equipped-0.064
 ado-0.062
thy-0.061
 BOX-0.061
opathy-0.060
Token graph
Token ID4823
Feature activation-5.48e-02

Pos logit contributions
arity+0.123
 sage+0.115
wikipedia+0.113
idas+0.113
[[+0.112
Neg logit contributions
 equipped-0.162
 ado-0.156
thy-0.154
 BOX-0.153
opathy-0.151
TokenList
Token ID8053
Feature activation-6.94e-02

Pos logit contributions
 equipped+0.042
 ado+0.040
thy+0.039
 BOX+0.039
opathy+0.039
Neg logit contributions
arity-0.036
 sage-0.034
wikipedia-0.033
idas-0.033
[[-0.032
Token[[
Token ID30109
Feature activation+5.18e-01

Pos logit contributions
 equipped+0.058
 ado+0.056
thy+0.055
 BOX+0.054
opathy+0.054
Neg logit contributions
arity-0.041
 sage-0.038
wikipedia-0.037
idas-0.037
[[-0.037
Token#
Token ID2
Feature activation+1.85e-01

Pos logit contributions
arity+0.315
 sage+0.294
wikipedia+0.289
idas+0.289
[[+0.287
Neg logit contributions
 equipped-0.418
 ado-0.403
thy-0.396
 BOX-0.395
opathy-0.390
Token,
Token ID11
Feature activation-3.56e-03

Pos logit contributions
arity+0.124
 sage+0.117
idas+0.115
wikipedia+0.115
[[+0.114
Neg logit contributions
 equipped-0.137
 ado-0.132
thy-0.130
 BOX-0.129
opathy-0.127
Token 1
Token ID352
Feature activation+7.36e-02

Pos logit contributions
 equipped+0.003
 ado+0.003
thy+0.003
 BOX+0.003
opathy+0.003
Neg logit contributions
arity-0.002
 sage-0.002
wikipedia-0.002
idas-0.002
[[-0.002
Token]]
Token ID11907
Feature activation+2.52e-03

Pos logit contributions
arity+0.050
 sage+0.047
idas+0.046
wikipedia+0.046
[[+0.046
Neg logit contributions
 equipped-0.054
 ado-0.052
thy-0.051
 BOX-0.051
opathy-0.051
Token <
Token ID1279
Feature activation-6.15e-02

Pos logit contributions
arity+0.002
 sage+0.002
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.002
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Token make
Token ID787
Feature activation-1.44e-02

Pos logit contributions
arity+0.037
 sage+0.035
idas+0.034
wikipedia+0.034
[[+0.034
Neg logit contributions
 equipped-0.038
 ado-0.037
thy-0.036
 BOX-0.036
opathy-0.036
Token an
Token ID281
Feature activation-3.88e-02

Pos logit contributions
 equipped+0.010
 ado+0.010
thy+0.010
 BOX+0.010
opathy+0.009
Neg logit contributions
arity-0.010
 sage-0.010
idas-0.009
wikipedia-0.009
[[-0.009
Token application
Token ID3586
Feature activation+4.57e-02

Pos logit contributions
 equipped+0.026
 ado+0.025
thy+0.024
 BOX+0.024
opathy+0.024
Neg logit contributions
arity-0.029
 sage-0.027
idas-0.027
wikipedia-0.027
[[-0.027
Token before
Token ID878
Feature activation-4.51e-02

Pos logit contributions
arity+0.031
 sage+0.029
idas+0.028
wikipedia+0.028
[[+0.028
Neg logit contributions
 equipped-0.034
 ado-0.033
thy-0.032
 BOX-0.032
opathy-0.031
Token a
Token ID257
Feature activation+1.19e-02

Pos logit contributions
 equipped+0.034
 ado+0.033
thy+0.032
 BOX+0.032
opathy+0.032
Neg logit contributions
arity-0.030
 sage-0.028
wikipedia-0.028
idas-0.028
[[-0.027
Token youth
Token ID6205
Feature activation+5.15e-01

Pos logit contributions
arity+0.008
 sage+0.008
wikipedia+0.008
idas+0.008
[[+0.008
Neg logit contributions
 equipped-0.008
 ado-0.008
thy-0.008
 BOX-0.008
opathy-0.008
Token court
Token ID2184
Feature activation+2.08e-02

Pos logit contributions
arity+0.351
 sage+0.329
wikipedia+0.325
idas+0.325
[[+0.323
Neg logit contributions
 equipped-0.378
 ado-0.363
thy-0.357
 BOX-0.355
opathy-0.351
Token judge
Token ID5052
Feature activation+3.38e-02

Pos logit contributions
arity+0.014
 sage+0.013
idas+0.013
wikipedia+0.013
[[+0.013
Neg logit contributions
 equipped-0.015
 ado-0.015
thy-0.014
 BOX-0.014
opathy-0.014
Token for
Token ID329
Feature activation-2.91e-03

Pos logit contributions
arity+0.023
 sage+0.022
idas+0.021
wikipedia+0.021
[[+0.021
Neg logit contributions
 equipped-0.025
 ado-0.024
thy-0.023
 BOX-0.023
opathy-0.023
Token the
Token ID262
Feature activation+8.35e-03

Pos logit contributions
 equipped+0.002
 ado+0.002
thy+0.002
 BOX+0.002
opathy+0.002
Neg logit contributions
arity-0.002
 sage-0.002
wikipedia-0.002
idas-0.002
[[-0.002
Token inmate
Token ID28414
Feature activation-2.74e-02

Pos logit contributions
arity+0.006
 sage+0.005
wikipedia+0.005
idas+0.005
[[+0.005
Neg logit contributions
 equipped-0.006
 ado-0.006
thy-0.006
 BOX-0.006
opathy-0.006
Token]]
Token ID11907
Feature activation+2.52e-03

Pos logit contributions
arity+0.050
 sage+0.047
idas+0.046
wikipedia+0.046
[[+0.046
Neg logit contributions
 equipped-0.054
 ado-0.052
thy-0.051
 BOX-0.051
opathy-0.051
Token <
Token ID1279
Feature activation-6.15e-02

Pos logit contributions
arity+0.002
 sage+0.002
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.002
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Token->
Token ID3784
Feature activation+9.19e-02

Pos logit contributions
 equipped+0.053
 ado+0.052
thy+0.051
 BOX+0.051
opathy+0.050
Neg logit contributions
arity-0.033
 sage-0.031
wikipedia-0.030
idas-0.030
[[-0.030
Token graph
Token ID4823
Feature activation-5.37e-02

Pos logit contributions
arity+0.058
 sage+0.055
idas+0.054
wikipedia+0.054
[[+0.053
Neg logit contributions
 equipped-0.072
 ado-0.069
thy-0.068
 BOX-0.068
opathy-0.067
TokenList
Token ID8053
Feature activation-7.38e-02

Pos logit contributions
 equipped+0.037
 ado+0.035
thy+0.035
 BOX+0.035
opathy+0.034
Neg logit contributions
arity-0.039
 sage-0.037
wikipedia-0.036
idas-0.036
[[-0.036
Token[[
Token ID30109
Feature activation+5.15e-01

Pos logit contributions
 equipped+0.093
 ado+0.090
thy+0.089
 BOX+0.089
opathy+0.088
Neg logit contributions
arity-0.012
 sage-0.009
idas-0.008
wikipedia-0.008
 Continued-0.007
Token#
Token ID2
Feature activation+1.77e-01

Pos logit contributions
arity+0.235
 sage+0.214
wikipedia+0.210
idas+0.209
[[+0.208
Neg logit contributions
 equipped-0.493
 ado-0.478
thy-0.472
 BOX-0.470
opathy-0.466
Token,
Token ID11
Feature activation-9.88e-04

Pos logit contributions
arity+0.121
 sage+0.114
wikipedia+0.112
idas+0.112
[[+0.111
Neg logit contributions
 equipped-0.130
 ado-0.125
thy-0.123
 BOX-0.122
opathy-0.121
Token 2
Token ID362
Feature activation+1.50e-01

Pos logit contributions
 equipped+0.001
 ado+0.001
thy+0.001
 BOX+0.001
opathy+0.001
Neg logit contributions
arity-0.001
 sage-0.000
idas-0.000
wikipedia-0.000
[[-0.000
Token,
Token ID11
Feature activation-2.92e-03

Pos logit contributions
arity+0.102
 sage+0.096
wikipedia+0.095
idas+0.095
[[+0.094
Neg logit contributions
 equipped-0.110
 ado-0.105
thy-0.104
 BOX-0.103
opathy-0.102
Token i
Token ID1312
Feature activation+8.97e-02

Pos logit contributions
 equipped+0.002
 ado+0.002
thy+0.002
 BOX+0.002
opathy+0.002
Neg logit contributions
arity-0.002
 sage-0.002
idas-0.002
wikipedia-0.001
[[-0.001
Token grad
Token ID3915
Feature activation+1.66e-01

Pos logit contributions
arity+0.037
 sage+0.035
wikipedia+0.034
idas+0.034
[[+0.034
Neg logit contributions
 equipped-0.035
 ado-0.033
thy-0.033
 BOX-0.032
opathy-0.032
Tokens
Token ID82
Feature activation+2.20e-03

Pos logit contributions
arity+0.117
 sage+0.111
wikipedia+0.110
idas+0.109
[[+0.109
Neg logit contributions
 equipped-0.118
 ado-0.112
thy-0.110
 BOX-0.110
opathy-0.109
Token will
Token ID481
Feature activation+1.96e-02

Pos logit contributions
arity+0.001
 sage+0.001
idas+0.001
wikipedia+0.001
[[+0.001
Neg logit contributions
 equipped-0.002
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Token be
Token ID307
Feature activation+8.39e-02

Pos logit contributions
arity+0.014
 sage+0.013
wikipedia+0.013
idas+0.013
[[+0.013
Neg logit contributions
 equipped-0.014
 ado-0.013
thy-0.013
 BOX-0.013
opathy-0.013
Token better
Token ID1365
Feature activation-4.50e-03

Pos logit contributions
arity+0.068
 sage+0.064
wikipedia+0.064
[[+0.064
idas+0.064
Neg logit contributions
 equipped-0.052
 ado-0.048
thy-0.047
 BOX-0.047
opathy-0.046
Token equipped
Token ID10911
Feature activation-6.07e-01

Pos logit contributions
 equipped+0.003
 ado+0.003
thy+0.002
 BOX+0.002
opathy+0.002
Neg logit contributions
arity-0.004
 sage-0.004
wikipedia-0.004
idas-0.004
[[-0.003
Token to
Token ID284
Feature activation-1.43e-02

Pos logit contributions
 equipped+0.438
 ado+0.416
thy+0.410
 BOX+0.407
opathy+0.402
Neg logit contributions
arity-0.423
 sage-0.398
wikipedia-0.395
idas-0.394
[[-0.391
Token thrive
Token ID22191
Feature activation-1.53e-02

Pos logit contributions
 equipped+0.010
 ado+0.009
thy+0.009
 BOX+0.009
opathy+0.009
Neg logit contributions
arity-0.010
wikipedia-0.010
 sage-0.010
idas-0.010
[[-0.010
Token in
Token ID287
Feature activation+1.07e-02

Pos logit contributions
 equipped+0.011
 ado+0.011
thy+0.011
 BOX+0.011
opathy+0.011
Neg logit contributions
arity-0.010
 sage-0.010
idas-0.010
wikipedia-0.010
[[-0.009
Token many
Token ID867
Feature activation-1.61e-02

Pos logit contributions
arity+0.007
 sage+0.007
idas+0.007
wikipedia+0.007
[[+0.007
Neg logit contributions
 equipped-0.008
 ado-0.007
thy-0.007
 BOX-0.007
opathy-0.007
Token different
Token ID1180
Feature activation-1.27e-01

Pos logit contributions
 equipped+0.010
 ado+0.010
thy+0.010
 BOX+0.010
opathy+0.010
Neg logit contributions
arity-0.012
 sage-0.012
idas-0.011
wikipedia-0.011
[[-0.011
Token pulse
Token ID19445
Feature activation-4.80e-02

Pos logit contributions
arity+0.004
 sage+0.004
wikipedia+0.003
idas+0.003
[[+0.003
Neg logit contributions
 equipped-0.004
 ado-0.004
thy-0.004
 BOX-0.004
opathy-0.004
Tokenj
Token ID73
Feature activation+9.92e-02

Pos logit contributions
 equipped+0.035
 ado+0.034
thy+0.033
 BOX+0.033
opathy+0.033
Neg logit contributions
arity-0.033
 sage-0.031
idas-0.030
wikipedia-0.030
[[-0.030
Tokenets
Token ID1039
Feature activation+3.64e-01

Pos logit contributions
arity+0.033
 sage+0.029
wikipedia+0.028
idas+0.028
[[+0.027
Neg logit contributions
 equipped-0.108
 ado-0.105
thy-0.104
 BOX-0.103
opathy-0.103
Token.
Token ID13
Feature activation-3.47e-03

Pos logit contributions
arity+0.248
 sage+0.233
idas+0.230
wikipedia+0.230
[[+0.229
Neg logit contributions
 equipped-0.265
 ado-0.257
thy-0.252
 BOX-0.251
opathy-0.247
Token When
Token ID1649
Feature activation-4.23e-02

Pos logit contributions
 equipped+0.002
 ado+0.002
thy+0.002
 BOX+0.002
opathy+0.002
Neg logit contributions
arity-0.002
 sage-0.002
wikipedia-0.002
idas-0.002
[[-0.002
Token equipped
Token ID10911
Feature activation-6.00e-01

Pos logit contributions
 equipped+0.029
 ado+0.028
thy+0.028
 BOX+0.027
opathy+0.027
Neg logit contributions
arity-0.030
 sage-0.029
wikipedia-0.028
idas-0.028
[[-0.028
Token with
Token ID351
Feature activation+1.99e-02

Pos logit contributions
 equipped+0.452
 ado+0.431
thy+0.423
 BOX+0.421
opathy+0.416
Neg logit contributions
arity-0.400
 sage-0.375
wikipedia-0.372
idas-0.372
[[-0.368
Token a
Token ID257
Feature activation-2.42e-04

Pos logit contributions
arity+0.014
 sage+0.013
idas+0.013
wikipedia+0.013
[[+0.013
Neg logit contributions
 equipped-0.014
 ado-0.014
thy-0.014
 BOX-0.014
opathy-0.013
Token thrust
Token ID14613
Feature activation+9.92e-02

Pos logit contributions
 equipped+0.000
 ado+0.000
thy+0.000
 BOX+0.000
opathy+0.000
Neg logit contributions
arity-0.000
 sage-0.000
wikipedia-0.000
idas-0.000
[[-0.000
Token augment
Token ID35016
Feature activation-2.09e-01

Pos logit contributions
arity+0.069
 sage+0.066
idas+0.064
wikipedia+0.064
[[+0.064
Neg logit contributions
 equipped-0.070
 ado-0.068
thy-0.067
 BOX-0.066
opathy-0.066
Tokener
Token ID263
Feature activation+3.97e-01

Pos logit contributions
 equipped+0.161
 ado+0.155
thy+0.152
 BOX+0.152
opathy+0.150
Neg logit contributions
arity-0.135
 sage-0.127
wikipedia-0.125
idas-0.125
[[-0.124
Token R
Token ID371
Feature activation-1.92e-02

Pos logit contributions
arity+0.072
 sage+0.068
idas+0.066
wikipedia+0.066
[[+0.066
Neg logit contributions
 equipped-0.075
 ado-0.072
thy-0.071
 BOX-0.070
opathy-0.070
Token78
Token ID3695
Feature activation-7.05e-03

Pos logit contributions
 equipped+0.014
 ado+0.014
thy+0.013
 BOX+0.013
opathy+0.013
Neg logit contributions
arity-0.013
 sage-0.012
wikipedia-0.012
idas-0.012
[[-0.012
Token70
Token ID2154
Feature activation+8.71e-03

Pos logit contributions
 equipped+0.005
 ado+0.005
thy+0.005
 BOX+0.005
opathy+0.005
Neg logit contributions
arity-0.004
 sage-0.004
[[-0.004
wikipedia-0.004
idas-0.004
Token Hawk
Token ID26698
Feature activation-1.60e-01

Pos logit contributions
arity+0.006
 sage+0.006
idas+0.006
[[+0.006
wikipedia+0.006
Neg logit contributions
 equipped-0.006
 ado-0.006
thy-0.006
 BOX-0.006
opathy-0.005
Token,
Token ID11
Feature activation-2.65e-03

Pos logit contributions
 equipped+0.115
 ado+0.112
thy+0.110
 BOX+0.109
opathy+0.108
Neg logit contributions
arity-0.111
 sage-0.104
idas-0.103
[[-0.102
wikipedia-0.102
Token equipped
Token ID10911
Feature activation-6.00e-01

Pos logit contributions
 equipped+0.002
 ado+0.002
thy+0.002
 BOX+0.002
opathy+0.002
Neg logit contributions
arity-0.002
 sage-0.002
wikipedia-0.002
idas-0.002
[[-0.002
Token with
Token ID351
Feature activation+2.02e-02

Pos logit contributions
 equipped+0.453
 ado+0.434
thy+0.426
 BOX+0.425
opathy+0.418
Neg logit contributions
arity-0.398
 sage-0.373
wikipedia-0.369
idas-0.368
[[-0.366
Token AMD
Token ID10324
Feature activation+5.98e-02

Pos logit contributions
arity+0.014
 sage+0.013
wikipedia+0.013
idas+0.013
[[+0.013
Neg logit contributions
 equipped-0.015
 ado-0.014
thy-0.014
 BOX-0.014
opathy-0.014
Token's
Token ID338
Feature activation-8.64e-02

Pos logit contributions
arity+0.045
 sage+0.042
wikipedia+0.042
idas+0.042
[[+0.042
Neg logit contributions
 equipped-0.040
 ado-0.038
thy-0.037
 BOX-0.037
opathy-0.037
Token 28
Token ID2579
Feature activation+5.38e-02

Pos logit contributions
 equipped+0.068
 ado+0.065
thy+0.064
 BOX+0.063
opathy+0.062
Neg logit contributions
arity-0.055
 sage-0.051
wikipedia-0.051
idas-0.051
[[-0.051
Tokennm
Token ID21533
Feature activation+9.24e-02

Pos logit contributions
arity+0.032
 sage+0.029
wikipedia+0.029
idas+0.029
[[+0.029
Neg logit contributions
 equipped-0.044
 ado-0.043
thy-0.042
 BOX-0.042
opathy-0.042
Token.
Token ID13
Feature activation+7.60e-03

Pos logit contributions
arity+0.096
 sage+0.090
idas+0.089
wikipedia+0.089
[[+0.088
Neg logit contributions
 equipped-0.099
 ado-0.095
thy-0.094
 BOX-0.093
opathy-0.092
Token These
Token ID2312
Feature activation+1.56e-01

Pos logit contributions
arity+0.005
 sage+0.005
wikipedia+0.005
idas+0.005
[[+0.005
Neg logit contributions
 equipped-0.005
 ado-0.005
thy-0.005
 BOX-0.005
opathy-0.005
Token Italian
Token ID8200
Feature activation+4.45e-02

Pos logit contributions
arity+0.106
 sage+0.100
wikipedia+0.099
idas+0.098
[[+0.098
Neg logit contributions
 equipped-0.115
 ado-0.110
thy-0.107
 BOX-0.107
opathy-0.105
Token volunteers
Token ID11661
Feature activation-2.46e-02

Pos logit contributions
arity+0.032
 sage+0.030
wikipedia+0.030
idas+0.030
[[+0.030
Neg logit contributions
 equipped-0.031
 ado-0.030
thy-0.029
 BOX-0.029
opathy-0.029
Token were
Token ID547
Feature activation+1.11e-01

Pos logit contributions
 equipped+0.018
 ado+0.018
thy+0.017
 BOX+0.017
opathy+0.017
Neg logit contributions
arity-0.016
 sage-0.015
wikipedia-0.015
idas-0.015
[[-0.015
Token equipped
Token ID10911
Feature activation-5.89e-01

Pos logit contributions
arity+0.081
 sage+0.076
wikipedia+0.076
idas+0.076
[[+0.075
Neg logit contributions
 equipped-0.077
 ado-0.073
thy-0.071
 BOX-0.071
opathy-0.070
Token and
Token ID290
Feature activation-1.82e-03

Pos logit contributions
 equipped+0.443
 ado+0.420
thy+0.414
 BOX+0.411
opathy+0.405
Neg logit contributions
arity-0.395
 sage-0.371
idas-0.367
wikipedia-0.366
[[-0.364
Token trained
Token ID8776
Feature activation-2.19e-01

Pos logit contributions
 equipped+0.001
 ado+0.001
thy+0.001
 BOX+0.001
opathy+0.001
Neg logit contributions
arity-0.002
 sage-0.002
wikipedia-0.002
idas-0.002
[[-0.002
Token by
Token ID416
Feature activation-3.34e-02

Pos logit contributions
 equipped+0.157
 ado+0.150
thy+0.148
 BOX+0.147
opathy+0.145
Neg logit contributions
arity-0.154
 sage-0.145
wikipedia-0.143
idas-0.143
[[-0.142
Token the
Token ID262
Feature activation+1.41e-02

Pos logit contributions
 equipped+0.024
 ado+0.023
thy+0.023
 BOX+0.023
opathy+0.022
Neg logit contributions
arity-0.023
 sage-0.022
idas-0.021
wikipedia-0.021
[[-0.021
Token British
Token ID3517
Feature activation-4.04e-02

Pos logit contributions
arity+0.010
 sage+0.009
wikipedia+0.009
idas+0.009
[[+0.009
Neg logit contributions
 equipped-0.010
 ado-0.010
thy-0.009
 BOX-0.009
opathy-0.009
Token,
Token ID11
Feature activation+7.73e-03

Pos logit contributions
 equipped+0.138
 ado+0.133
thy+0.131
 BOX+0.130
opathy+0.129
Neg logit contributions
arity-0.128
 sage-0.120
wikipedia-0.118
idas-0.118
[[-0.117
Token and
Token ID290
Feature activation-3.07e-03

Pos logit contributions
arity+0.005
 sage+0.005
wikipedia+0.005
idas+0.005
[[+0.005
Neg logit contributions
 equipped-0.006
 ado-0.005
thy-0.005
 BOX-0.005
opathy-0.005
Token they
Token ID484
Feature activation+4.30e-02

Pos logit contributions
 equipped+0.002
 ado+0.002
thy+0.002
 BOX+0.002
opathy+0.002
Neg logit contributions
arity-0.002
 sage-0.002
wikipedia-0.002
idas-0.002
[[-0.002
Token showed
Token ID3751
Feature activation+5.57e-02

Pos logit contributions
arity+0.028
 sage+0.027
wikipedia+0.026
idas+0.026
[[+0.026
Neg logit contributions
 equipped-0.033
 ado-0.031
thy-0.031
 BOX-0.031
opathy-0.030
Token up
Token ID510
Feature activation-9.35e-02

Pos logit contributions
arity+0.041
 sage+0.039
idas+0.039
wikipedia+0.039
[[+0.038
Neg logit contributions
 equipped-0.037
 ado-0.036
thy-0.035
 BOX-0.035
opathy-0.034
Token equipped
Token ID10911
Feature activation-5.86e-01

Pos logit contributions
 equipped+0.066
 ado+0.064
thy+0.063
 BOX+0.062
opathy+0.062
Neg logit contributions
arity-0.066
 sage-0.062
idas-0.061
wikipedia-0.061
[[-0.061
Token to
Token ID284
Feature activation+1.32e-03

Pos logit contributions
 equipped+0.443
 ado+0.426
thy+0.418
 BOX+0.416
opathy+0.411
Neg logit contributions
arity-0.387
 sage-0.364
wikipedia-0.359
idas-0.359
[[-0.356
Token do
Token ID466
Feature activation-4.59e-02

Pos logit contributions
arity+0.001
 sage+0.001
idas+0.001
wikipedia+0.001
[[+0.001
Neg logit contributions
 equipped-0.001
 ado-0.001
thy-0.001
 BOX-0.001
opathy-0.001
Token so
Token ID523
Feature activation-1.97e-02

Pos logit contributions
 equipped+0.033
 ado+0.032
thy+0.031
 BOX+0.031
opathy+0.031
Neg logit contributions
arity-0.032
 sage-0.030
idas-0.030
wikipedia-0.030
[[-0.029
Token with
Token ID351
Feature activation+2.76e-02

Pos logit contributions
 equipped+0.014
 ado+0.014
thy+0.014
 BOX+0.013
opathy+0.013
Neg logit contributions
arity-0.014
 sage-0.013
idas-0.013
wikipedia-0.012
[[-0.012
Token BB
Token ID12597
Feature activation+1.01e-01

Pos logit contributions
arity+0.020
 sage+0.018
wikipedia+0.018
idas+0.018
[[+0.018
Neg logit contributions
 equipped-0.019
 ado-0.019
thy-0.018
 BOX-0.018
opathy-0.018
Token it
Token ID340
Feature activation-5.51e-02

Pos logit contributions
arity+0.084
 sage+0.079
idas+0.077
wikipedia+0.077
[[+0.077
Neg logit contributions
 equipped-0.089
 ado-0.085
thy-0.084
 BOX-0.084
opathy-0.083
Token is
Token ID318
Feature activation-2.97e-02

Pos logit contributions
 equipped+0.039
 ado+0.037
thy+0.036
 BOX+0.036
opathy+0.036
Neg logit contributions
arity-0.039
 sage-0.037
wikipedia-0.037
idas-0.037
[[-0.036
Token,
Token ID11
Feature activation-6.74e-03

Pos logit contributions
 equipped+0.022
 ado+0.021
thy+0.021
 BOX+0.021
opathy+0.020
Neg logit contributions
arity-0.020
 sage-0.019
wikipedia-0.019
idas-0.019
[[-0.019
Token without
Token ID1231
Feature activation+1.46e-01

Pos logit contributions
 equipped+0.005
 ado+0.005
thy+0.005
 BOX+0.005
opathy+0.005
Neg logit contributions
arity-0.005
 sage-0.004
wikipedia-0.004
idas-0.004
[[-0.004
Token further
Token ID2252
Feature activation-1.19e-01

Pos logit contributions
arity+0.109
 sage+0.103
wikipedia+0.102
idas+0.102
[[+0.101
Neg logit contributions
 equipped-0.097
 ado-0.093
thy-0.092
 BOX-0.091
opathy-0.090
Token ado
Token ID50129
Feature activation-5.78e-01

Pos logit contributions
 equipped+0.006
thy+0.002
 ado+0.001
 BOX+0.001
opathy-0.000
Neg logit contributions
arity-0.163
 sage-0.158
wikipedia-0.157
idas-0.157
[[-0.156
Token:
Token ID25
Feature activation-3.33e-02

Pos logit contributions
 equipped+0.427
 ado+0.411
thy+0.403
 BOX+0.401
opathy+0.396
Neg logit contributions
arity-0.391
 sage-0.368
wikipedia-0.363
idas-0.362
[[-0.360
Token one
Token ID530
Feature activation+9.95e-03

Pos logit contributions
 equipped+0.025
 ado+0.024
thy+0.023
 BOX+0.023
opathy+0.023
Neg logit contributions
arity-0.023
 sage-0.021
wikipedia-0.021
idas-0.021
[[-0.021
Token man
Token ID582
Feature activation+7.51e-02

Pos logit contributions
arity+0.007
 sage+0.006
wikipedia+0.006
idas+0.006
[[+0.006
Neg logit contributions
 equipped-0.007
 ado-0.007
thy-0.007
 BOX-0.007
opathy-0.007
Token
Token ID447
Feature activation-4.82e-03

Pos logit contributions
arity+0.056
 sage+0.053
wikipedia+0.052
idas+0.052
[[+0.052
Neg logit contributions
 equipped-0.050
 ado-0.048
thy-0.047
 BOX-0.047
opathy-0.046
Token
Token ID247
Feature activation-1.12e-02

Pos logit contributions
 equipped+0.004
 ado+0.003
 BOX+0.003
thy+0.003
opathy+0.003
Neg logit contributions
arity-0.003
 sage-0.003
[[-0.003
idas-0.003
wikipedia-0.003
Token serving
Token ID7351
Feature activation-3.35e-02

Pos logit contributions
arity+0.075
 sage+0.071
wikipedia+0.070
idas+0.070
[[+0.069
Neg logit contributions
 equipped-0.079
 ado-0.076
thy-0.074
 BOX-0.074
opathy-0.073
Token the
Token ID262
Feature activation-1.20e-02

Pos logit contributions
 equipped+0.026
 ado+0.025
thy+0.025
 BOX+0.025
opathy+0.024
Neg logit contributions
arity-0.021
 sage-0.020
idas-0.020
wikipedia-0.020
[[-0.019
Token
Token ID564
Feature activation-1.17e-02

Pos logit contributions
 equipped+0.009
 ado+0.009
thy+0.009
 BOX+0.008
opathy+0.008
Neg logit contributions
arity-0.008
 sage-0.007
idas-0.007
wikipedia-0.007
[[-0.007
Token
Token ID250
Feature activation+3.48e-03

Pos logit contributions
 equipped+0.008
 ado+0.008
thy+0.008
 BOX+0.008
opathy+0.007
Neg logit contributions
arity-0.009
 sage-0.008
wikipedia-0.008
idas-0.008
[[-0.008
Tokenfil
Token ID10379
Feature activation+1.52e-01

Pos logit contributions
arity+0.002
 sage+0.002
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.003
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Tokenthy
Token ID20057
Feature activation-5.68e-01

Pos logit contributions
arity+0.156
 sage+0.149
idas+0.148
wikipedia+0.148
[[+0.147
Neg logit contributions
 equipped-0.060
 ado-0.055
 BOX-0.053
opathy-0.051
thy-0.051
Token ph
Token ID872
Feature activation-2.15e-01

Pos logit contributions
 equipped+0.392
 ado+0.375
thy+0.368
 BOX+0.366
opathy+0.361
Neg logit contributions
arity-0.412
 sage-0.389
wikipedia-0.383
idas-0.383
[[-0.381
Tokenallic
Token ID18196
Feature activation-2.46e-01

Pos logit contributions
 equipped+0.133
 ado+0.127
 BOX+0.123
thy+0.122
opathy+0.120
Neg logit contributions
arity-0.174
 sage-0.164
idas-0.162
wikipedia-0.161
[[-0.161
Token idol
Token ID21580
Feature activation+3.91e-02

Pos logit contributions
 equipped+0.166
 ado+0.159
thy+0.156
 BOX+0.155
opathy+0.153
Neg logit contributions
arity-0.182
 sage-0.172
wikipedia-0.169
idas-0.169
[[-0.168
Token of
Token ID286
Feature activation-7.41e-03

Pos logit contributions
arity+0.027
 sage+0.025
wikipedia+0.025
idas+0.025
[[+0.025
Neg logit contributions
 equipped-0.029
 ado-0.027
thy-0.027
 BOX-0.027
opathy-0.027
Token sod
Token ID32581
Feature activation+1.00e-01

Pos logit contributions
 equipped+0.005
 ado+0.005
thy+0.005
 BOX+0.005
opathy+0.005
Neg logit contributions
arity-0.005
 sage-0.005
wikipedia-0.005
idas-0.005
[[-0.005
Token.
Token ID13
Feature activation-1.95e-02

Pos logit contributions
arity+0.040
 sage+0.038
idas+0.037
wikipedia+0.037
[[+0.037
Neg logit contributions
 equipped-0.044
 ado-0.042
thy-0.041
 BOX-0.041
opathy-0.041
Token75
Token ID2425
Feature activation-8.23e-02

Pos logit contributions
 equipped+0.013
 ado+0.013
thy+0.013
 BOX+0.013
opathy+0.012
Neg logit contributions
arity-0.014
 sage-0.013
wikipedia-0.013
idas-0.013
[[-0.013
Token million
Token ID1510
Feature activation+2.93e-02

Pos logit contributions
 equipped+0.066
 ado+0.063
thy+0.062
 BOX+0.062
opathy+0.061
Neg logit contributions
arity-0.051
 sage-0.048
idas-0.047
wikipedia-0.047
[[-0.046
Token (
Token ID357
Feature activation-9.92e-02

Pos logit contributions
arity+0.023
 sage+0.022
idas+0.022
wikipedia+0.022
[[+0.022
Neg logit contributions
 equipped-0.018
 ado-0.017
thy-0.017
 BOX-0.017
opathy-0.017
Token16
Token ID1433
Feature activation-1.05e-01

Pos logit contributions
 equipped+0.059
 ado+0.056
thy+0.055
 BOX+0.055
opathy+0.054
Neg logit contributions
arity-0.082
 sage-0.077
wikipedia-0.077
idas-0.077
[[-0.076
Tokenth
Token ID400
Feature activation-5.65e-01

Pos logit contributions
 equipped+0.011
 ado+0.009
 BOX+0.007
opathy+0.006
thy+0.005
Neg logit contributions
arity-0.137
 sage-0.133
idas-0.132
wikipedia-0.132
[[-0.132
Token among
Token ID1871
Feature activation+4.09e-02

Pos logit contributions
 equipped+0.454
 ado+0.441
thy+0.434
 BOX+0.430
opathy+0.426
Neg logit contributions
arity-0.343
 sage-0.322
idas-0.315
wikipedia-0.313
[[-0.312
Token running
Token ID2491
Feature activation-8.50e-02

Pos logit contributions
arity+0.030
 sage+0.029
idas+0.028
[[+0.028
wikipedia+0.028
Neg logit contributions
 equipped-0.028
 ado-0.027
thy-0.026
 BOX-0.026
opathy-0.026
Token backs
Token ID12983
Feature activation+9.92e-02

Pos logit contributions
 equipped+0.074
 ado+0.072
thy+0.071
 BOX+0.071
opathy+0.070
Neg logit contributions
arity-0.046
 sage-0.042
idas-0.041
[[-0.041
wikipedia-0.041
Token)
Token ID8
Feature activation-1.87e-02

Pos logit contributions
arity+0.066
 sage+0.062
idas+0.061
wikipedia+0.061
[[+0.061
Neg logit contributions
 equipped-0.074
 ado-0.072
thy-0.071
 BOX-0.070
opathy-0.069
Token
Token ID198
Feature activation-1.07e-02

Pos logit contributions
 equipped+0.014
 ado+0.013
thy+0.013
 BOX+0.013
opathy+0.013
Neg logit contributions
arity-0.012
 sage-0.012
wikipedia-0.011
idas-0.011
[[-0.011
Token lawyer
Token ID6853
Feature activation+7.88e-02

Pos logit contributions
arity+0.009
 sage+0.008
wikipedia+0.008
idas+0.008
[[+0.008
Neg logit contributions
 equipped-0.011
 ado-0.010
thy-0.010
 BOX-0.010
opathy-0.010
Token
Token ID198
Feature activation-9.10e-04

Pos logit contributions
arity+0.055
 sage+0.052
idas+0.051
wikipedia+0.051
[[+0.051
Neg logit contributions
 equipped-0.056
 ado-0.054
thy-0.053
 BOX-0.053
opathy-0.052
Token
Token ID198
Feature activation+1.24e-03

Pos logit contributions
 equipped+0.001
 ado+0.001
thy+0.001
 BOX+0.001
opathy+0.001
Neg logit contributions
arity-0.001
 sage-0.001
idas-0.001
wikipedia-0.001
[[-0.001
TokenMc
Token ID9742
Feature activation+7.66e-02

Pos logit contributions
arity+0.001
 sage+0.001
idas+0.001
wikipedia+0.001
[[+0.001
Neg logit contributions
 equipped-0.001
 ado-0.001
thy-0.001
 BOX-0.001
opathy-0.001
TokenCar
Token ID9914
Feature activation-4.86e-02

Pos logit contributions
arity+0.053
[[+0.049
 sage+0.049
wikipedia+0.048
mart+0.048
Neg logit contributions
 equipped-0.057
 ado-0.054
 BOX-0.053
opathy-0.053
thy-0.052
Tokenthy
Token ID20057
Feature activation-5.64e-01

Pos logit contributions
 equipped+0.006
 ado+0.004
 BOX+0.004
opathy+0.003
 encl+0.002
Neg logit contributions
arity-0.064
mart-0.061
av-0.061
wikipedia-0.061
 sage-0.061
Token,
Token ID11
Feature activation-2.22e-03

Pos logit contributions
 equipped+0.403
 ado+0.387
thy+0.379
 BOX+0.378
opathy+0.373
Neg logit contributions
arity-0.396
 sage-0.373
idas-0.368
wikipedia-0.368
[[-0.366
Token a
Token ID257
Feature activation-7.15e-04

Pos logit contributions
 equipped+0.002
 ado+0.001
thy+0.001
 BOX+0.001
opathy+0.001
Neg logit contributions
arity-0.002
 sage-0.001
wikipedia-0.001
idas-0.001
[[-0.001
Token native
Token ID6868
Feature activation-1.01e-02

Pos logit contributions
 equipped+0.001
 ado+0.001
thy+0.000
 BOX+0.000
opathy+0.000
Neg logit contributions
arity-0.000
 sage-0.000
idas-0.000
wikipedia-0.000
[[-0.000
Token of
Token ID286
Feature activation+7.12e-03

Pos logit contributions
 equipped+0.007
 ado+0.007
thy+0.007
 BOX+0.007
opathy+0.007
Neg logit contributions
arity-0.007
 sage-0.006
wikipedia-0.006
idas-0.006
[[-0.006
Token County
Token ID3418
Feature activation-2.04e-01

Pos logit contributions
arity+0.005
 sage+0.005
wikipedia+0.005
idas+0.005
[[+0.005
Neg logit contributions
 equipped-0.005
 ado-0.005
thy-0.005
 BOX-0.005
opathy-0.005
Token.
Token ID13
Feature activation-1.61e-02

Pos logit contributions
arity+0.105
 sage+0.099
idas+0.097
wikipedia+0.097
[[+0.097
Neg logit contributions
 equipped-0.117
 ado-0.112
thy-0.110
 BOX-0.110
opathy-0.109
Token75
Token ID2425
Feature activation-7.96e-02

Pos logit contributions
 equipped+0.012
 ado+0.011
thy+0.011
 BOX+0.011
opathy+0.011
Neg logit contributions
arity-0.011
 sage-0.011
idas-0.010
wikipedia-0.010
[[-0.010
Token million
Token ID1510
Feature activation+2.54e-02

Pos logit contributions
 equipped+0.063
 ado+0.061
thy+0.060
 BOX+0.059
opathy+0.059
Neg logit contributions
arity-0.050
 sage-0.047
idas-0.046
wikipedia-0.046
[[-0.045
Token (
Token ID357
Feature activation-1.05e-01

Pos logit contributions
arity+0.017
 sage+0.016
idas+0.016
wikipedia+0.016
[[+0.016
Neg logit contributions
 equipped-0.018
 ado-0.018
thy-0.017
 BOX-0.017
opathy-0.017
Token35
Token ID2327
Feature activation-2.67e-01

Pos logit contributions
 equipped+0.080
 ado+0.076
thy+0.075
 BOX+0.075
opathy+0.074
Neg logit contributions
arity-0.070
 sage-0.065
idas-0.064
wikipedia-0.064
[[-0.064
Tokenth
Token ID400
Feature activation-5.64e-01

Pos logit contributions
 equipped+0.162
 ado+0.154
thy+0.151
 BOX+0.150
opathy+0.148
Neg logit contributions
arity-0.215
 sage-0.205
idas-0.202
wikipedia-0.202
[[-0.201
Token among
Token ID1871
Feature activation+3.86e-02

Pos logit contributions
 equipped+0.415
 ado+0.398
 BOX+0.390
thy+0.390
opathy+0.384
Neg logit contributions
arity-0.384
 sage-0.359
wikipedia-0.357
[[-0.354
idas-0.353
Token guards
Token ID10942
Feature activation-7.88e-02

Pos logit contributions
arity+0.027
 sage+0.026
idas+0.025
[[+0.025
wikipedia+0.025
Neg logit contributions
 equipped-0.027
 ado-0.026
thy-0.026
 BOX-0.026
opathy-0.026
Token)
Token ID8
Feature activation-2.19e-02

Pos logit contributions
 equipped+0.059
 ado+0.057
thy+0.056
 BOX+0.055
opathy+0.055
Neg logit contributions
arity-0.053
 sage-0.050
idas-0.049
wikipedia-0.049
[[-0.048
Token
Token ID198
Feature activation-1.07e-02

Pos logit contributions
 equipped+0.016
 ado+0.016
thy+0.015
 BOX+0.015
opathy+0.015
Neg logit contributions
arity-0.015
 sage-0.014
idas-0.014
wikipedia-0.014
[[-0.014
Token
Token ID198
Feature activation-5.32e-03

Pos logit contributions
 equipped+0.008
 ado+0.008
thy+0.008
 BOX+0.008
opathy+0.007
Neg logit contributions
arity-0.007
 sage-0.007
wikipedia-0.007
idas-0.007
[[-0.007
Token of
Token ID286
Feature activation-1.39e-02

Pos logit contributions
 equipped+0.042
 ado+0.041
thy+0.040
 BOX+0.040
opathy+0.039
Neg logit contributions
arity-0.039
 sage-0.037
wikipedia-0.036
idas-0.036
[[-0.036
Token pharmaceutical
Token ID18423
Feature activation-2.45e-02

Pos logit contributions
 equipped+0.010
 ado+0.010
thy+0.010
 BOX+0.010
opathy+0.009
Neg logit contributions
arity-0.009
 sage-0.009
wikipedia-0.009
idas-0.009
[[-0.009
Tokens
Token ID82
Feature activation-6.57e-03

Pos logit contributions
 equipped+0.018
 ado+0.018
thy+0.017
 BOX+0.017
opathy+0.017
Neg logit contributions
arity-0.016
 sage-0.015
wikipedia-0.015
idas-0.015
[[-0.015
Token.
Token ID13
Feature activation-1.74e-02

Pos logit contributions
 equipped+0.005
 ado+0.005
thy+0.005
 BOX+0.005
opathy+0.004
Neg logit contributions
arity-0.004
 sage-0.004
idas-0.004
wikipedia-0.004
[[-0.004
Token Home
Token ID5995
Feature activation+6.10e-02

Pos logit contributions
 equipped+0.013
 ado+0.012
thy+0.012
 BOX+0.012
opathy+0.012
Neg logit contributions
arity-0.012
 sage-0.011
idas-0.011
wikipedia-0.011
[[-0.011
Tokenopathy
Token ID27189
Feature activation-5.62e-01

Pos logit contributions
arity+0.080
 sage+0.077
idas+0.076
wikipedia+0.076
[[+0.076
Neg logit contributions
 equipped-0.006
 ado-0.005
thy-0.004
 BOX-0.004
th-0.003
Token advocated
Token ID25828
Feature activation+1.18e-01

Pos logit contributions
 equipped+0.408
 ado+0.391
thy+0.384
 BOX+0.382
opathy+0.378
Neg logit contributions
arity-0.388
 sage-0.364
wikipedia-0.360
idas-0.360
[[-0.357
Token a
Token ID257
Feature activation-1.02e-02

Pos logit contributions
arity+0.082
 sage+0.077
idas+0.076
wikipedia+0.076
[[+0.075
Neg logit contributions
 equipped-0.084
 ado-0.081
thy-0.080
 BOX-0.079
opathy-0.078
Token
Token ID564
Feature activation-1.77e-02

Pos logit contributions
 equipped+0.007
 ado+0.007
thy+0.007
 BOX+0.007
opathy+0.006
Neg logit contributions
arity-0.007
 sage-0.007
idas-0.007
wikipedia-0.007
[[-0.007
Token
Token ID250
Feature activation+1.11e-03

Pos logit contributions
 equipped+0.012
 ado+0.011
thy+0.011
 BOX+0.011
opathy+0.011
Neg logit contributions
arity-0.013
 sage-0.013
wikipedia-0.012
idas-0.012
[[-0.012
Tokenlike
Token ID2339
Feature activation+1.90e-01

Pos logit contributions
arity+0.001
 sage+0.001
wikipedia+0.001
idas+0.001
[[+0.001
Neg logit contributions
 equipped-0.001
 ado-0.001
thy-0.001
 BOX-0.001
opathy-0.001
Token
Token ID250
Feature activation+1.94e-03

Pos logit contributions
 equipped+0.011
 ado+0.010
thy+0.010
 BOX+0.010
opathy+0.010
Neg logit contributions
arity-0.010
 sage-0.010
idas-0.010
wikipedia-0.010
[[-0.010
Token9
Token ID24
Feature activation-2.25e-02

Pos logit contributions
arity+0.001
 sage+0.001
idas+0.001
wikipedia+0.001
[[+0.001
Neg logit contributions
 equipped-0.001
 ado-0.001
thy-0.001
 BOX-0.001
opathy-0.001
Tokenth
Token ID400
Feature activation-5.59e-01

Pos logit contributions
 equipped+0.002
 ado+0.002
 BOX+0.001
thy+0.001
opathy+0.001
Neg logit contributions
arity-0.030
 sage-0.029
idas-0.028
wikipedia-0.028
[[-0.028
Token to
Token ID284
Feature activation-2.29e-02

Pos logit contributions
 equipped+0.326
 ado+0.310
thy+0.304
 BOX+0.301
opathy+0.297
Neg logit contributions
arity-0.464
 sage-0.442
idas-0.436
wikipedia-0.436
[[-0.435
Token 12
Token ID1105
Feature activation-1.99e-02

Pos logit contributions
 equipped+0.017
 ado+0.016
thy+0.016
 BOX+0.016
opathy+0.016
Neg logit contributions
arity-0.016
 sage-0.015
wikipedia-0.015
[[-0.014
idas-0.014
Tokenth
Token ID400
Feature activation-5.61e-01

Pos logit contributions
 equipped+0.002
 ado+0.002
 BOX+0.001
thy+0.001
opathy+0.001
Neg logit contributions
arity-0.026
 sage-0.025
idas-0.025
[[-0.025
wikipedia-0.025
Token Grade
Token ID22653
Feature activation-1.99e-01

Pos logit contributions
 equipped+0.322
 ado+0.305
thy+0.299
 BOX+0.297
opathy+0.292
Neg logit contributions
arity-0.473
 sage-0.450
wikipedia-0.445
idas-0.444
[[-0.443
Token,
Token ID11
Feature activation-1.11e-02

Pos logit contributions
 equipped+0.145
 ado+0.139
thy+0.137
 BOX+0.136
opathy+0.134
Neg logit contributions
arity-0.137
 sage-0.129
wikipedia-0.127
idas-0.127
[[-0.126
Token No
Token ID1400
Feature activation-2.67e-02

Pos logit contributions
 equipped+0.008
 ado+0.008
thy+0.008
 BOX+0.007
opathy+0.007
Neg logit contributions
arity-0.008
 sage-0.007
wikipedia-0.007
idas-0.007
[[-0.007
Token Di
Token ID6031
Feature activation+1.53e-01

Pos logit contributions
 equipped+0.019
 ado+0.019
thy+0.018
 BOX+0.018
opathy+0.018
Neg logit contributions
arity-0.018
 sage-0.017
idas-0.017
wikipedia-0.017
[[-0.017
Tokenploma
Token ID35728
Feature activation-8.33e-02

Pos logit contributions
arity+0.110
 sage+0.104
idas+0.103
wikipedia+0.103
[[+0.102
Neg logit contributions
 equipped-0.106
 ado-0.102
thy-0.100
 BOX-0.099
opathy-0.098
Token]
Token ID60
Feature activation-4.96e-02

Pos logit contributions
 equipped+0.045
 ado+0.043
thy+0.043
 BOX+0.042
opathy+0.042
Neg logit contributions
arity-0.046
 sage-0.043
wikipedia-0.042
idas-0.042
[[-0.042
Token <
Token ID1279
Feature activation-7.37e-02

Pos logit contributions
 equipped+0.034
 ado+0.032
thy+0.032
 BOX+0.031
opathy+0.031
Neg logit contributions
arity-0.036
 sage-0.034
wikipedia-0.034
idas-0.034
[[-0.034
Tokenmet
Token ID4164
Feature activation+7.59e-02

Pos logit contributions
 equipped+0.055
 ado+0.053
thy+0.053
 BOX+0.052
opathy+0.051
Neg logit contributions
arity-0.049
 sage-0.046
idas-0.045
wikipedia-0.045
[[-0.045
Tokenros
Token ID4951
Feature activation-4.44e-02

Pos logit contributions
arity+0.053
 sage+0.050
idas+0.049
wikipedia+0.049
[[+0.049
Neg logit contributions
 equipped-0.055
 ado-0.053
thy-0.052
 BOX-0.052
opathy-0.051
Tokenxual
Token ID5541
Feature activation-9.21e-02

Pos logit contributions
 equipped+0.033
 ado+0.031
thy+0.031
 BOX+0.031
opathy+0.030
Neg logit contributions
arity-0.030
 sage-0.028
wikipedia-0.028
idas-0.028
[[-0.028
Tokenth
Token ID400
Feature activation-5.61e-01

Pos logit contributions
 equipped+0.070
 ado+0.068
thy+0.066
 BOX+0.066
opathy+0.065
Neg logit contributions
arity-0.060
 sage-0.056
wikipedia-0.055
idas-0.055
[[-0.055
Tokenug
Token ID1018
Feature activation-2.89e-01

Pos logit contributions
 equipped+0.383
 ado+0.367
thy+0.361
 BOX+0.358
opathy+0.354
Neg logit contributions
arity-0.409
 sage-0.388
wikipedia-0.382
idas-0.381
[[-0.380
Token>
Token ID29
Feature activation-2.00e-02

Pos logit contributions
 equipped+0.212
 ado+0.203
thy+0.201
 BOX+0.199
opathy+0.196
Neg logit contributions
arity-0.196
 sage-0.185
wikipedia-0.182
idas-0.182
[[-0.181
Token lol
Token ID19462
Feature activation-3.09e-02

Pos logit contributions
 equipped+0.014
 ado+0.014
thy+0.014
 BOX+0.013
opathy+0.013
Neg logit contributions
arity-0.014
 sage-0.013
idas-0.013
wikipedia-0.013
[[-0.013
Token m
Token ID285
Feature activation+8.42e-02

Pos logit contributions
 equipped+0.026
 ado+0.025
thy+0.025
 BOX+0.024
opathy+0.024
Neg logit contributions
arity-0.018
 sage-0.017
idas-0.016
wikipedia-0.016
[[-0.016
Token2
Token ID17
Feature activation+1.59e-01

Pos logit contributions
arity+0.058
 sage+0.055
wikipedia+0.054
idas+0.054
[[+0.053
Neg logit contributions
 equipped-0.061
 ado-0.059
thy-0.058
 BOX-0.057
opathy-0.057
Token stops
Token ID9911
Feature activation+4.25e-02

Pos logit contributions
 equipped+0.028
 ado+0.027
thy+0.027
 BOX+0.027
opathy+0.026
Neg logit contributions
arity-0.027
 sage-0.026
wikipedia-0.025
idas-0.025
[[-0.025
Token Kell
Token ID25043
Feature activation-7.72e-03

Pos logit contributions
arity+0.029
 sage+0.027
wikipedia+0.027
idas+0.027
[[+0.027
Neg logit contributions
 equipped-0.031
 ado-0.030
 BOX-0.029
thy-0.029
opathy-0.029
Token Brook
Token ID7590
Feature activation-1.83e-01

Pos logit contributions
 equipped+0.004
 ado+0.004
thy+0.004
 BOX+0.004
opathy+0.004
Neg logit contributions
arity-0.007
 sage-0.006
idas-0.006
wikipedia-0.006
[[-0.006
Token in
Token ID287
Feature activation+5.46e-03

Pos logit contributions
 equipped+0.129
 ado+0.124
thy+0.121
 BOX+0.121
opathy+0.119
Neg logit contributions
arity-0.131
 sage-0.123
idas-0.121
wikipedia-0.121
[[-0.121
Token 11
Token ID1367
Feature activation-6.29e-02

Pos logit contributions
arity+0.004
 sage+0.004
wikipedia+0.004
idas+0.003
[[+0.003
Neg logit contributions
 equipped-0.004
 ado-0.004
thy-0.004
 BOX-0.004
opathy-0.004
Tokenth
Token ID400
Feature activation-5.61e-01

Pos logit contributions
 equipped+0.027
 ado+0.025
thy+0.024
 BOX+0.024
opathy+0.024
Neg logit contributions
arity-0.062
 sage-0.059
idas-0.059
wikipedia-0.059
[[-0.058
Token round
Token ID2835
Feature activation-1.97e-01

Pos logit contributions
 equipped+0.302
 ado+0.287
 BOX+0.279
thy+0.278
opathy+0.270
Neg logit contributions
arity-0.494
 sage-0.469
wikipedia-0.466
idas-0.464
[[-0.463
Token to
Token ID284
Feature activation-2.30e-02

Pos logit contributions
 equipped+0.142
 ado+0.137
 BOX+0.133
thy+0.133
opathy+0.131
Neg logit contributions
arity-0.138
 sage-0.130
wikipedia-0.128
idas-0.128
[[-0.127
Token win
Token ID1592
Feature activation-5.96e-05

Pos logit contributions
 equipped+0.016
 ado+0.015
thy+0.015
 BOX+0.015
opathy+0.015
Neg logit contributions
arity-0.017
 sage-0.016
idas-0.015
wikipedia-0.015
[[-0.015
Token I
Token ID314
Feature activation-8.02e-02

Pos logit contributions
 equipped+0.000
 ado+0.000
thy+0.000
 BOX+0.000
opathy+0.000
Neg logit contributions
arity-0.000
 sage-0.000
wikipedia-0.000
idas-0.000
[[-0.000
TokenBF
Token ID29499
Feature activation+1.59e-01

Pos logit contributions
 equipped+0.076
 ado+0.073
thy+0.072
opathy+0.071
 BOX+0.071
Neg logit contributions
arity-0.038
 sage-0.035
idas-0.034
wikipedia-0.034
[[-0.034
Token of
Token ID286
Feature activation-1.10e-02

Pos logit contributions
 equipped+0.252
 ado+0.241
 BOX+0.236
thy+0.236
opathy+0.233
Neg logit contributions
arity-0.238
 sage-0.223
idas-0.221
wikipedia-0.219
[[-0.219
Token Columbia
Token ID9309
Feature activation+3.30e-02

Pos logit contributions
 equipped+0.008
 ado+0.008
thy+0.008
 BOX+0.008
opathy+0.008
Neg logit contributions
arity-0.007
 sage-0.007
idas-0.007
wikipedia-0.006
[[-0.006
Token Pictures
Token ID20051
Feature activation-1.53e-01

Pos logit contributions
arity+0.028
 sage+0.027
wikipedia+0.027
idas+0.027
[[+0.026
Neg logit contributions
 equipped-0.018
 ado-0.018
thy-0.017
 BOX-0.017
opathy-0.017
Token and
Token ID290
Feature activation-2.63e-02

Pos logit contributions
 equipped+0.111
 ado+0.107
thy+0.105
 BOX+0.105
opathy+0.103
Neg logit contributions
arity-0.106
 sage-0.099
idas-0.098
wikipedia-0.098
[[-0.097
Token 20
Token ID1160
Feature activation-9.23e-03

Pos logit contributions
 equipped+0.020
 ado+0.019
thy+0.019
 BOX+0.018
opathy+0.018
Neg logit contributions
arity-0.018
 sage-0.017
wikipedia-0.016
idas-0.016
[[-0.016
Tokenth
Token ID400
Feature activation-5.61e-01

Pos logit contributions
 equipped+0.001
 ado+0.001
thy+0.001
 BOX+0.001
opathy+0.001
Neg logit contributions
arity-0.012
 sage-0.012
idas-0.011
[[-0.011
wikipedia-0.011
Token Century
Token ID13641
Feature activation+7.75e-02

Pos logit contributions
 equipped+0.466
 ado+0.448
 BOX+0.441
thy+0.440
opathy+0.433
Neg logit contributions
arity-0.331
 sage-0.306
wikipedia-0.302
idas-0.300
[[-0.298
Token Fox
Token ID5426
Feature activation-1.18e-01

Pos logit contributions
arity+0.064
 sage+0.061
wikipedia+0.060
idas+0.060
[[+0.060
Neg logit contributions
 equipped-0.046
 ado-0.043
 BOX-0.042
thy-0.042
opathy-0.041
Token
Token ID198
Feature activation-1.28e-02

Pos logit contributions
 equipped+0.086
 ado+0.083
thy+0.081
 BOX+0.081
opathy+0.080
Neg logit contributions
arity-0.081
 sage-0.076
idas-0.075
wikipedia-0.075
[[-0.075
Token
Token ID198
Feature activation-8.60e-03

Pos logit contributions
 equipped+0.010
 ado+0.009
thy+0.009
 BOX+0.009
opathy+0.009
Neg logit contributions
arity-0.009
 sage-0.008
wikipedia-0.008
idas-0.008
[[-0.008
TokenNOT
Token ID11929
Feature activation+8.85e-02

Pos logit contributions
 equipped+0.006
 ado+0.006
thy+0.006
 BOX+0.006
opathy+0.006
Neg logit contributions
arity-0.006
 sage-0.006
idas-0.005
wikipedia-0.005
[[-0.005
Token and
Token ID290
Feature activation-2.92e-02

Pos logit contributions
arity+0.091
 sage+0.085
wikipedia+0.084
idas+0.084
[[+0.084
Neg logit contributions
 equipped-0.096
 ado-0.093
thy-0.091
 BOX-0.090
opathy-0.089
Token then
Token ID788
Feature activation-1.45e-01

Pos logit contributions
 equipped+0.020
 ado+0.019
thy+0.019
 BOX+0.018
opathy+0.018
Neg logit contributions
arity-0.021
 sage-0.020
wikipedia-0.020
idas-0.020
[[-0.020
Token generate
Token ID7716
Feature activation+3.80e-03

Pos logit contributions
 equipped+0.098
 ado+0.094
thy+0.092
 BOX+0.092
opathy+0.091
Neg logit contributions
arity-0.107
 sage-0.101
wikipedia-0.100
idas-0.100
[[-0.099
Token a
Token ID257
Feature activation-1.05e-02

Pos logit contributions
arity+0.003
 sage+0.002
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.003
 ado-0.003
thy-0.003
 BOX-0.003
opathy-0.003
Token 5
Token ID642
Feature activation-3.22e-02

Pos logit contributions
 equipped+0.007
 ado+0.007
thy+0.007
 BOX+0.007
opathy+0.007
Neg logit contributions
arity-0.008
 sage-0.007
idas-0.007
wikipedia-0.007
[[-0.007
Tokenth
Token ID400
Feature activation-5.61e-01

Pos logit contributions
 equipped+0.022
 ado+0.021
thy+0.020
 BOX+0.020
opathy+0.020
Neg logit contributions
arity-0.024
 sage-0.023
idas-0.022
wikipedia-0.022
[[-0.022
Token code
Token ID2438
Feature activation-1.63e-01

Pos logit contributions
 equipped+0.404
 ado+0.387
thy+0.380
 BOX+0.379
opathy+0.373
Neg logit contributions
arity-0.391
 sage-0.368
wikipedia-0.363
idas-0.363
[[-0.360
Token tape
Token ID9154
Feature activation-2.59e-02

Pos logit contributions
 equipped+0.118
 ado+0.113
thy+0.111
 BOX+0.110
opathy+0.109
Neg logit contributions
arity-0.113
 sage-0.107
wikipedia-0.105
idas-0.105
[[-0.105
Token by
Token ID416
Feature activation-6.36e-02

Pos logit contributions
 equipped+0.019
 ado+0.018
thy+0.018
 BOX+0.018
opathy+0.017
Neg logit contributions
arity-0.018
 sage-0.017
wikipedia-0.017
idas-0.017
[[-0.017
Token X
Token ID1395
Feature activation+3.54e-03

Pos logit contributions
 equipped+0.046
 ado+0.044
thy+0.044
 BOX+0.043
opathy+0.043
Neg logit contributions
arity-0.044
 sage-0.041
idas-0.041
wikipedia-0.041
[[-0.040
TokenOR
Token ID1581
Feature activation-4.14e-02

Pos logit contributions
arity+0.002
 sage+0.002
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.003
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Token $
Token ID720
Feature activation+1.48e-01

Pos logit contributions
 equipped+0.036
 ado+0.035
thy+0.034
 BOX+0.034
opathy+0.034
Neg logit contributions
arity-0.017
 sage-0.015
wikipedia-0.015
idas-0.015
[[-0.015
Token6
Token ID21
Feature activation-3.28e-02

Pos logit contributions
arity+0.097
 sage+0.091
wikipedia+0.089
idas+0.089
[[+0.089
Neg logit contributions
 equipped-0.113
 ado-0.109
thy-0.107
 BOX-0.106
opathy-0.105
Token million
Token ID1510
Feature activation+2.05e-02

Pos logit contributions
 equipped+0.024
 ado+0.023
thy+0.023
 BOX+0.023
opathy+0.022
Neg logit contributions
arity-0.022
 sage-0.021
idas-0.021
wikipedia-0.021
[[-0.020
Token (
Token ID357
Feature activation-1.08e-01

Pos logit contributions
arity+0.016
 sage+0.015
idas+0.015
wikipedia+0.015
[[+0.015
Neg logit contributions
 equipped-0.013
 ado-0.012
thy-0.012
 BOX-0.012
opathy-0.012
Token15
Token ID1314
Feature activation-1.13e-01

Pos logit contributions
 equipped+0.067
 ado+0.064
thy+0.063
 BOX+0.063
opathy+0.062
Neg logit contributions
arity-0.085
 sage-0.080
wikipedia-0.080
idas-0.079
[[-0.079
Tokenth
Token ID400
Feature activation-5.60e-01

Pos logit contributions
 equipped+0.011
 ado+0.009
 BOX+0.006
thy+0.006
opathy+0.006
Neg logit contributions
arity-0.148
 sage-0.144
idas-0.143
wikipedia-0.143
[[-0.142
Token among
Token ID1871
Feature activation+3.98e-02

Pos logit contributions
 equipped+0.448
 ado+0.434
thy+0.427
 BOX+0.424
opathy+0.419
Neg logit contributions
arity-0.343
 sage-0.323
idas-0.315
wikipedia-0.314
[[-0.313
Token running
Token ID2491
Feature activation-8.55e-02

Pos logit contributions
arity+0.029
 sage+0.028
idas+0.027
[[+0.027
wikipedia+0.027
Neg logit contributions
 equipped-0.027
 ado-0.026
thy-0.026
 BOX-0.025
opathy-0.025
Token backs
Token ID12983
Feature activation+9.59e-02

Pos logit contributions
 equipped+0.075
 ado+0.073
thy+0.072
 BOX+0.071
opathy+0.071
Neg logit contributions
arity-0.046
 sage-0.042
idas-0.042
[[-0.041
wikipedia-0.041
Token)
Token ID8
Feature activation-1.65e-02

Pos logit contributions
arity+0.064
 sage+0.060
idas+0.059
wikipedia+0.059
[[+0.059
Neg logit contributions
 equipped-0.072
 ado-0.069
thy-0.068
 BOX-0.068
opathy-0.067
Token
Token ID198
Feature activation-1.07e-02

Pos logit contributions
 equipped+0.012
 ado+0.012
thy+0.012
 BOX+0.012
opathy+0.011
Neg logit contributions
arity-0.011
 sage-0.010
wikipedia-0.010
idas-0.010
[[-0.010
Token On
Token ID1550
Feature activation-1.11e-01

Pos logit contributions
 equipped+0.014
 ado+0.014
thy+0.013
 BOX+0.013
opathy+0.013
Neg logit contributions
arity-0.015
 sage-0.014
idas-0.014
wikipedia-0.014
[[-0.014
Token Sunday
Token ID3502
Feature activation+6.30e-04

Pos logit contributions
 equipped+0.080
 ado+0.077
thy+0.076
 BOX+0.075
opathy+0.074
Neg logit contributions
arity-0.077
 sage-0.073
idas-0.072
wikipedia-0.072
[[-0.072
Token,
Token ID11
Feature activation-1.37e-02

Pos logit contributions
arity+0.000
 sage+0.000
idas+0.000
wikipedia+0.000
[[+0.000
Neg logit contributions
 equipped-0.000
 ado-0.000
thy-0.000
 BOX-0.000
opathy-0.000
Token November
Token ID3389
Feature activation-1.18e-03

Pos logit contributions
 equipped+0.009
 ado+0.009
thy+0.009
 BOX+0.009
opathy+0.008
Neg logit contributions
arity-0.010
 sage-0.010
wikipedia-0.010
idas-0.010
[[-0.009
Token 16
Token ID1467
Feature activation+1.08e-02

Pos logit contributions
 equipped+0.001
 ado+0.001
thy+0.001
 BOX+0.001
opathy+0.001
Neg logit contributions
arity-0.001
 sage-0.001
wikipedia-0.001
idas-0.001
[[-0.001
Tokenth
Token ID400
Feature activation-5.60e-01

Pos logit contributions
arity+0.012
 sage+0.011
wikipedia+0.011
idas+0.011
[[+0.011
Neg logit contributions
 equipped-0.003
 ado-0.003
 BOX-0.003
thy-0.003
opathy-0.003
Token,
Token ID11
Feature activation-1.28e-02

Pos logit contributions
 equipped+0.407
 ado+0.390
thy+0.384
 BOX+0.382
opathy+0.377
Neg logit contributions
arity-0.386
 sage-0.363
idas-0.358
wikipedia-0.358
[[-0.356
Token I
Token ID314
Feature activation-8.26e-02

Pos logit contributions
 equipped+0.009
 ado+0.009
thy+0.009
 BOX+0.009
opathy+0.008
Neg logit contributions
arity-0.009
 sage-0.008
idas-0.008
wikipedia-0.008
[[-0.008
Token woke
Token ID19092
Feature activation-2.81e-01

Pos logit contributions
 equipped+0.058
 ado+0.056
thy+0.055
 BOX+0.054
opathy+0.054
Neg logit contributions
arity-0.059
 sage-0.055
wikipedia-0.055
idas-0.055
[[-0.054
Token up
Token ID510
Feature activation-1.13e-01

Pos logit contributions
 equipped+0.174
 ado+0.167
thy+0.163
 BOX+0.163
opathy+0.161
Neg logit contributions
arity-0.222
 sage-0.212
idas-0.208
wikipedia-0.208
[[-0.207
Token with
Token ID351
Feature activation+3.87e-03

Pos logit contributions
 equipped+0.082
 ado+0.078
thy+0.077
 BOX+0.077
opathy+0.076
Neg logit contributions
arity-0.079
 sage-0.074
wikipedia-0.073
idas-0.073
[[-0.073
Token
Token ID564
Feature activation-1.36e-02

Pos logit contributions
 equipped+0.005
 ado+0.005
thy+0.005
 BOX+0.005
opathy+0.005
Neg logit contributions
arity-0.005
 sage-0.005
wikipedia-0.005
idas-0.005
[[-0.005
Token
Token ID250
Feature activation+2.21e-03

Pos logit contributions
 equipped+0.009
 ado+0.009
thy+0.009
 BOX+0.009
opathy+0.009
Neg logit contributions
arity-0.010
 sage-0.009
wikipedia-0.009
idas-0.009
[[-0.009
TokenLess
Token ID22058
Feature activation+9.62e-02

Pos logit contributions
arity+0.001
 sage+0.001
wikipedia+0.001
idas+0.001
[[+0.001
Neg logit contributions
 equipped-0.002
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Token than
Token ID621
Feature activation-1.11e-01

Pos logit contributions
arity+0.066
 sage+0.062
wikipedia+0.061
idas+0.061
[[+0.061
Neg logit contributions
 equipped-0.070
 ado-0.067
thy-0.066
 BOX-0.066
opathy-0.065
Token 9
Token ID860
Feature activation-3.80e-02

Pos logit contributions
 equipped+0.089
 ado+0.086
thy+0.084
 BOX+0.084
opathy+0.083
Neg logit contributions
arity-0.068
 sage-0.063
wikipedia-0.062
idas-0.062
[[-0.062
Tokenth
Token ID400
Feature activation-5.59e-01

Pos logit contributions
 equipped+0.029
 ado+0.028
thy+0.027
 BOX+0.027
opathy+0.027
Neg logit contributions
arity-0.025
 sage-0.024
idas-0.023
[[-0.023
wikipedia-0.023
Token Grade
Token ID22653
Feature activation-1.99e-01

Pos logit contributions
 equipped+0.417
 ado+0.401
thy+0.392
 BOX+0.392
opathy+0.387
Neg logit contributions
arity-0.375
 sage-0.351
wikipedia-0.348
idas-0.346
[[-0.344
Token
Token ID447
Feature activation-1.68e-02

Pos logit contributions
 equipped+0.143
 ado+0.137
thy+0.135
 BOX+0.134
opathy+0.132
Neg logit contributions
arity-0.139
 sage-0.131
wikipedia-0.129
idas-0.129
[[-0.128
Token
Token ID251
Feature activation-9.90e-03

Pos logit contributions
 equipped+0.009
 ado+0.009
thy+0.009
 BOX+0.009
opathy+0.008
Neg logit contributions
arity-0.014
 sage-0.014
idas-0.014
wikipedia-0.014
[[-0.013
Token and
Token ID290
Feature activation-2.92e-02

Pos logit contributions
 equipped+0.007
 ado+0.007
thy+0.007
 BOX+0.007
opathy+0.007
Neg logit contributions
arity-0.007
 sage-0.006
wikipedia-0.006
idas-0.006
[[-0.006
Token
Token ID564
Feature activation-1.49e-02

Pos logit contributions
 equipped+0.021
 ado+0.020
thy+0.020
 BOX+0.020
opathy+0.020
Neg logit contributions
arity-0.020
 sage-0.019
wikipedia-0.019
idas-0.019
[[-0.019
Token event
Token ID1785
Feature activation+5.22e-02

Pos logit contributions
 equipped+0.094
 ado+0.090
thy+0.088
 BOX+0.088
opathy+0.087
Neg logit contributions
arity-0.101
 sage-0.096
idas-0.094
wikipedia-0.094
[[-0.094
Token also
Token ID635
Feature activation-1.21e-01

Pos logit contributions
arity+0.036
 sage+0.034
wikipedia+0.034
idas+0.034
[[+0.033
Neg logit contributions
 equipped-0.038
 ado-0.036
thy-0.035
 BOX-0.035
opathy-0.035
Token marks
Token ID8849
Feature activation-3.20e-02

Pos logit contributions
 equipped+0.085
 ado+0.082
thy+0.080
 BOX+0.080
opathy+0.079
Neg logit contributions
arity-0.086
 sage-0.081
idas-0.080
wikipedia-0.080
[[-0.079
Token the
Token ID262
Feature activation-9.79e-03

Pos logit contributions
 equipped+0.023
 ado+0.022
thy+0.022
 BOX+0.022
opathy+0.022
Neg logit contributions
arity-0.022
 sage-0.021
wikipedia-0.020
idas-0.020
[[-0.020
Token 8
Token ID807
Feature activation-2.89e-02

Pos logit contributions
 equipped+0.007
 ado+0.006
thy+0.006
 BOX+0.006
opathy+0.006
Neg logit contributions
arity-0.007
 sage-0.007
wikipedia-0.007
idas-0.007
[[-0.007
Tokenth
Token ID400
Feature activation-5.59e-01

Pos logit contributions
 equipped+0.003
 ado+0.002
thy+0.002
 BOX+0.002
opathy+0.001
Neg logit contributions
arity-0.038
 sage-0.037
idas-0.037
[[-0.037
wikipedia-0.036
Token annual
Token ID5079
Feature activation+1.39e-01

Pos logit contributions
 equipped+0.425
 ado+0.409
thy+0.403
 BOX+0.400
opathy+0.396
Neg logit contributions
arity-0.366
 sage-0.344
idas-0.338
wikipedia-0.337
[[-0.336
Token Hemp
Token ID50060
Feature activation-7.19e-02

Pos logit contributions
arity+0.103
 sage+0.098
idas+0.096
wikipedia+0.096
[[+0.096
Neg logit contributions
 equipped-0.093
 ado-0.089
thy-0.088
 BOX-0.087
opathy-0.086
Token History
Token ID7443
Feature activation-1.45e-01

Pos logit contributions
 equipped+0.052
 ado+0.050
thy+0.049
 BOX+0.049
opathy+0.048
Neg logit contributions
arity-0.049
 sage-0.047
wikipedia-0.046
idas-0.046
[[-0.045
Token Week
Token ID6119
Feature activation-7.87e-02

Pos logit contributions
 equipped+0.097
 ado+0.093
thy+0.091
 BOX+0.090
opathy+0.089
Neg logit contributions
arity-0.108
 sage-0.103
wikipedia-0.101
idas-0.101
[[-0.101
Token which
Token ID543
Feature activation-6.91e-02

Pos logit contributions
 equipped+0.057
 ado+0.055
thy+0.054
 BOX+0.053
opathy+0.053
Neg logit contributions
arity-0.054
 sage-0.051
wikipedia-0.050
idas-0.050
[[-0.050
Tokenically
Token ID1146
Feature activation-9.98e-02

Pos logit contributions
 equipped+0.035
 ado+0.033
thy+0.032
 BOX+0.032
opathy+0.032
Neg logit contributions
arity-0.046
 sage-0.044
wikipedia-0.044
idas-0.044
[[-0.043
Token
Token ID447
Feature activation+1.63e-02

Pos logit contributions
 equipped+0.068
 ado+0.066
thy+0.064
 BOX+0.064
opathy+0.063
Neg logit contributions
arity-0.073
 sage-0.069
wikipedia-0.068
idas-0.068
[[-0.067
Token
Token ID251
Feature activation+2.45e-02

Pos logit contributions
arity+0.012
 sage+0.012
wikipedia+0.011
[[+0.011
idas+0.011
Neg logit contributions
 equipped-0.011
 ado-0.010
thy-0.010
 BOX-0.010
opathy-0.010
Token issue
Token ID2071
Feature activation-1.17e-02

Pos logit contributions
arity+0.018
 sage+0.017
wikipedia+0.017
idas+0.017
[[+0.017
Neg logit contributions
 equipped-0.016
 ado-0.016
thy-0.015
 BOX-0.015
opathy-0.015
Token informal
Token ID22176
Feature activation-3.93e-02

Pos logit contributions
 equipped+0.009
 ado+0.009
thy+0.009
 BOX+0.009
opathy+0.009
Neg logit contributions
arity-0.007
wikipedia-0.007
idas-0.007
 sage-0.007
[[-0.006
Token guidance
Token ID11154
Feature activation+3.45e-01

Pos logit contributions
 equipped+0.032
 ado+0.031
thy+0.030
 BOX+0.030
opathy+0.030
Neg logit contributions
arity-0.024
idas-0.022
wikipedia-0.022
 sage-0.022
[[-0.022
Token and
Token ID290
Feature activation+3.53e-03

Pos logit contributions
arity+0.232
 sage+0.218
wikipedia+0.216
idas+0.215
[[+0.213
Neg logit contributions
 equipped-0.257
 ado-0.246
thy-0.241
 BOX-0.241
opathy-0.238
Token policy
Token ID2450
Feature activation-2.61e-02

Pos logit contributions
arity+0.002
 sage+0.002
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.003
 ado-0.003
thy-0.003
 BOX-0.003
opathy-0.003
Token memor
Token ID16181
Feature activation+1.64e-01

Pos logit contributions
 equipped+0.023
 ado+0.022
 BOX+0.022
thy+0.022
opathy+0.021
Neg logit contributions
arity-0.014
wikipedia-0.013
 sage-0.013
idas-0.013
[[-0.013
Tokenanda
Token ID5282
Feature activation+1.59e-01

Pos logit contributions
arity+0.090
 sage+0.084
wikipedia+0.083
[[+0.082
idas+0.082
Neg logit contributions
 equipped-0.141
 ado-0.136
thy-0.135
 BOX-0.134
opathy-0.132
Token intended
Token ID5292
Feature activation+1.78e-02

Pos logit contributions
arity+0.110
 sage+0.104
wikipedia+0.102
idas+0.102
[[+0.102
Neg logit contributions
 equipped-0.115
 ado-0.110
thy-0.108
 BOX-0.108
opathy-0.106
Tokenal
Token ID282
Feature activation+1.08e-02

Pos logit contributions
 equipped+0.141
 ado+0.135
thy+0.132
 BOX+0.132
opathy+0.130
Neg logit contributions
arity-0.147
 sage-0.139
idas-0.137
wikipedia-0.137
[[-0.136
Tokenani
Token ID3216
Feature activation+1.53e-01

Pos logit contributions
arity+0.007
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.008
 ado-0.008
thy-0.008
 BOX-0.008
opathy-0.008
Token
Token ID134
Feature activation+3.06e-01

Pos logit contributions
arity+0.101
 sage+0.095
wikipedia+0.094
idas+0.093
[[+0.093
Neg logit contributions
 equipped-0.115
 ado-0.111
thy-0.109
 BOX-0.108
opathy-0.107
Token
Token ID119
Feature activation-1.34e-01

Pos logit contributions
arity+0.262
 sage+0.249
idas+0.246
wikipedia+0.246
[[+0.246
Neg logit contributions
 equipped-0.172
 ado-0.163
thy-0.159
 BOX-0.158
opathy-0.155
Tokenō
Token ID13090
Feature activation-2.14e-01

Pos logit contributions
 equipped+0.106
 ado+0.102
thy+0.101
 BOX+0.100
opathy+0.099
Neg logit contributions
arity-0.083
 sage-0.078
wikipedia-0.076
idas-0.076
[[-0.076
Tokenpu
Token ID19944
Feature activation+2.94e-01

Pos logit contributions
 equipped+0.162
 ado+0.155
thy+0.153
 BOX+0.152
opathy+0.150
Neg logit contributions
arity-0.142
 sage-0.133
wikipedia-0.131
idas-0.131
[[-0.130
Token
Token ID134
Feature activation+3.04e-01

Pos logit contributions
arity+0.190
 sage+0.178
wikipedia+0.176
idas+0.175
[[+0.174
Neg logit contributions
 equipped-0.226
 ado-0.217
thy-0.214
 BOX-0.212
opathy-0.210
Token
Token ID119
Feature activation-1.33e-01

Pos logit contributions
arity+0.265
 sage+0.253
idas+0.250
wikipedia+0.250
[[+0.249
Neg logit contributions
 equipped-0.165
 ado-0.157
thy-0.153
 BOX-0.152
opathy-0.149
Tokenu
Token ID84
Feature activation+1.45e-01

Pos logit contributions
 equipped+0.103
 ado+0.099
thy+0.097
 BOX+0.097
opathy+0.096
Neg logit contributions
arity-0.086
 sage-0.080
wikipedia-0.079
idas-0.079
[[-0.079
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
arity+0.096
 sage+0.090
wikipedia+0.089
idas+0.088
[[+0.088
Neg logit contributions
 equipped-0.110
 ado-0.106
thy-0.104
 BOX-0.103
opathy-0.102
Token E
Token ID412
Feature activation-6.58e-02

Pos logit contributions
arity+0.007
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.007
 ado-0.007
thy-0.007
 BOX-0.007
opathy-0.007
Token to
Token ID284
Feature activation-4.53e-03

Pos logit contributions
arity+0.087
 sage+0.082
wikipedia+0.080
idas+0.080
[[+0.080
Neg logit contributions
 equipped-0.094
 ado-0.091
thy-0.090
 BOX-0.089
opathy-0.088
Token its
Token ID663
Feature activation-8.70e-03

Pos logit contributions
 equipped+0.003
 ado+0.003
thy+0.003
 BOX+0.003
opathy+0.003
Neg logit contributions
arity-0.003
 sage-0.003
wikipedia-0.003
idas-0.003
[[-0.003
Token post
Token ID1281
Feature activation-1.14e-02

Pos logit contributions
 equipped+0.006
 ado+0.006
thy+0.006
 BOX+0.006
opathy+0.006
Neg logit contributions
arity-0.006
 sage-0.006
wikipedia-0.005
idas-0.005
[[-0.005
Token-
Token ID12
Feature activation+2.28e-02

Pos logit contributions
 equipped+0.009
 ado+0.008
thy+0.008
 BOX+0.008
opathy+0.008
Neg logit contributions
arity-0.007
 sage-0.007
wikipedia-0.007
idas-0.007
[[-0.007
TokenH
Token ID39
Feature activation-2.16e-01

Pos logit contributions
arity+0.014
 sage+0.013
idas+0.013
wikipedia+0.013
[[+0.013
Neg logit contributions
 equipped-0.018
 ado-0.017
thy-0.017
 BOX-0.017
opathy-0.017
Tokenalo
Token ID7335
Feature activation+3.22e-01

Pos logit contributions
 equipped+0.244
 ado+0.237
 BOX+0.234
thy+0.234
opathy+0.232
Neg logit contributions
arity-0.063
 sage-0.053
idas-0.051
wikipedia-0.051
[[-0.050
Token project
Token ID1628
Feature activation+8.75e-02

Pos logit contributions
arity+0.212
 sage+0.199
wikipedia+0.196
idas+0.195
[[+0.194
Neg logit contributions
 equipped-0.243
 ado-0.234
thy-0.231
 BOX-0.229
opathy-0.226
Token by
Token ID416
Feature activation-4.53e-02

Pos logit contributions
arity+0.052
 sage+0.048
wikipedia+0.047
idas+0.047
[[+0.047
Neg logit contributions
 equipped-0.072
 ado-0.069
thy-0.068
 BOX-0.068
opathy-0.067
Token the
Token ID262
Feature activation+1.76e-04

Pos logit contributions
 equipped+0.032
 ado+0.030
thy+0.030
 BOX+0.030
opathy+0.029
Neg logit contributions
arity-0.032
 sage-0.031
wikipedia-0.030
idas-0.030
[[-0.030
Token cod
Token ID14873
Feature activation+4.75e-02

Pos logit contributions
arity+0.000
 sage+0.000
wikipedia+0.000
idas+0.000
[[+0.000
Neg logit contributions
 equipped-0.000
 ado-0.000
thy-0.000
 BOX-0.000
opathy-0.000
Tokenename
Token ID12453
Feature activation+8.38e-02

Pos logit contributions
arity+0.027
 sage+0.025
idas+0.024
wikipedia+0.024
[[+0.024
Neg logit contributions
 equipped-0.040
 ado-0.039
thy-0.038
 BOX-0.038
opathy-0.038
Token another
Token ID1194
Feature activation-3.93e-02

Pos logit contributions
 equipped+0.003
 ado+0.003
thy+0.003
 BOX+0.003
opathy+0.003
Neg logit contributions
arity-0.003
 sage-0.003
idas-0.003
wikipedia-0.003
[[-0.003
Token character
Token ID2095
Feature activation+7.01e-02

Pos logit contributions
 equipped+0.027
 ado+0.026
thy+0.025
 BOX+0.025
opathy+0.025
Neg logit contributions
arity-0.028
 sage-0.027
wikipedia-0.027
idas-0.027
[[-0.026
Token from
Token ID422
Feature activation-3.97e-02

Pos logit contributions
arity+0.049
 sage+0.046
wikipedia+0.046
idas+0.045
[[+0.045
Neg logit contributions
 equipped-0.050
 ado-0.048
thy-0.047
 BOX-0.047
opathy-0.047
Token the
Token ID262
Feature activation+3.42e-03

Pos logit contributions
 equipped+0.029
 ado+0.028
thy+0.027
 BOX+0.027
opathy+0.027
Neg logit contributions
arity-0.027
 sage-0.026
idas-0.025
wikipedia-0.025
[[-0.025
Token TM
Token ID21232
Feature activation+2.57e-01

Pos logit contributions
arity+0.002
idas+0.002
 sage+0.002
wikipedia+0.002
[[+0.002
Neg logit contributions
 equipped-0.003
 ado-0.002
 BOX-0.002
thy-0.002
opathy-0.002
TokenNT
Token ID11251
Feature activation+3.11e-01

Pos logit contributions
arity+0.098
 sage+0.087
wikipedia+0.086
idas+0.085
[[+0.084
Neg logit contributions
 equipped-0.267
 ado-0.259
thy-0.256
 BOX-0.255
opathy-0.253
Token universe
Token ID6881
Feature activation-9.57e-02

Pos logit contributions
arity+0.208
 sage+0.194
idas+0.193
wikipedia+0.193
[[+0.191
Neg logit contributions
 equipped-0.233
 ado-0.224
thy-0.220
 BOX-0.219
opathy-0.216
Token will
Token ID481
Feature activation+3.42e-02

Pos logit contributions
 equipped+0.068
 ado+0.065
thy+0.064
 BOX+0.063
opathy+0.063
Neg logit contributions
arity-0.068
 sage-0.064
wikipedia-0.063
idas-0.063
[[-0.063
Token be
Token ID307
Feature activation+9.97e-02

Pos logit contributions
arity+0.024
 sage+0.023
wikipedia+0.023
idas+0.023
[[+0.023
Neg logit contributions
 equipped-0.024
 ado-0.023
thy-0.023
 BOX-0.023
opathy-0.022
Token showing
Token ID4478
Feature activation+3.03e-02

Pos logit contributions
arity+0.067
 sage+0.063
wikipedia+0.062
idas+0.062
[[+0.062
Neg logit contributions
 equipped-0.074
 ado-0.071
thy-0.070
 BOX-0.069
opathy-0.069
Token his
Token ID465
Feature activation-5.01e-02

Pos logit contributions
arity+0.024
 sage+0.023
idas+0.022
wikipedia+0.022
[[+0.022
Neg logit contributions
 equipped-0.019
 ado-0.018
thy-0.018
 BOX-0.017
opathy-0.017
Token a
Token ID257
Feature activation-1.95e-02

Pos logit contributions
 equipped+0.021
 ado+0.020
thy+0.020
 BOX+0.020
opathy+0.020
Neg logit contributions
arity-0.023
 sage-0.022
wikipedia-0.021
idas-0.021
[[-0.021
Token prime
Token ID6994
Feature activation+2.06e-01

Pos logit contributions
 equipped+0.014
 ado+0.014
thy+0.013
 BOX+0.013
opathy+0.013
Neg logit contributions
arity-0.013
 sage-0.012
idas-0.012
wikipedia-0.012
[[-0.012
Token suspect
Token ID4099
Feature activation-1.35e-01

Pos logit contributions
arity+0.140
 sage+0.131
wikipedia+0.130
idas+0.130
[[+0.129
Neg logit contributions
 equipped-0.152
 ado-0.146
thy-0.144
 BOX-0.143
opathy-0.141
Token.
Token ID13
Feature activation-1.43e-02

Pos logit contributions
 equipped+0.100
 ado+0.096
thy+0.094
 BOX+0.094
opathy+0.092
Neg logit contributions
arity-0.092
 sage-0.087
idas-0.085
wikipedia-0.085
[[-0.085
Token Vill
Token ID9757
Feature activation-6.32e-02

Pos logit contributions
 equipped+0.010
 ado+0.010
thy+0.010
 BOX+0.010
opathy+0.009
Neg logit contributions
arity-0.010
 sage-0.009
wikipedia-0.009
idas-0.009
[[-0.009
Tokenagers
Token ID10321
Feature activation+3.60e-01

Pos logit contributions
 equipped+0.072
 ado+0.070
thy+0.070
 BOX+0.069
opathy+0.069
Neg logit contributions
arity-0.017
 sage-0.015
wikipedia-0.014
idas-0.014
[[-0.014
Token also
Token ID635
Feature activation-1.32e-01

Pos logit contributions
arity+0.254
 sage+0.240
wikipedia+0.236
idas+0.236
[[+0.235
Neg logit contributions
 equipped-0.255
 ado-0.245
thy-0.241
 BOX-0.239
opathy-0.237
Token reported
Token ID2098
Feature activation-2.34e-01

Pos logit contributions
 equipped+0.088
 ado+0.084
thy+0.083
 BOX+0.082
opathy+0.081
Neg logit contributions
arity-0.098
 sage-0.093
wikipedia-0.092
idas-0.092
[[-0.091
Token that
Token ID326
Feature activation-1.27e-02

Pos logit contributions
 equipped+0.171
 ado+0.164
thy+0.161
 BOX+0.160
opathy+0.158
Neg logit contributions
arity-0.160
 sage-0.150
wikipedia-0.148
idas-0.148
[[-0.147
Token they
Token ID484
Feature activation+1.74e-02

Pos logit contributions
 equipped+0.009
 ado+0.009
thy+0.009
 BOX+0.009
opathy+0.009
Neg logit contributions
arity-0.009
 sage-0.008
wikipedia-0.008
idas-0.008
[[-0.008
Token saw
Token ID2497
Feature activation+2.94e-01

Pos logit contributions
arity+0.011
 sage+0.011
idas+0.011
wikipedia+0.011
[[+0.010
Neg logit contributions
 equipped-0.013
 ado-0.013
thy-0.013
 BOX-0.013
opathy-0.012
Tokenak
Token ID461
Feature activation-1.91e-02

Pos logit contributions
 equipped+0.033
 ado+0.032
thy+0.031
 BOX+0.031
opathy+0.031
Neg logit contributions
arity-0.032
 sage-0.030
wikipedia-0.030
[[-0.030
idas-0.029
Tokenshi
Token ID44019
Feature activation+1.21e-02

Pos logit contributions
 equipped+0.014
 ado+0.013
 BOX+0.013
thy+0.013
opathy+0.013
Neg logit contributions
arity-0.013
 sage-0.012
idas-0.012
wikipedia-0.012
[[-0.012
Token temple
Token ID12505
Feature activation+4.39e-02

Pos logit contributions
arity+0.008
 sage+0.008
idas+0.008
wikipedia+0.008
[[+0.008
Neg logit contributions
 equipped-0.009
 ado-0.009
thy-0.008
 BOX-0.008
opathy-0.008
Token,
Token ID11
Feature activation-4.27e-03

Pos logit contributions
arity+0.030
 sage+0.028
idas+0.027
wikipedia+0.027
[[+0.027
Neg logit contributions
 equipped-0.032
 ado-0.031
thy-0.031
 BOX-0.031
opathy-0.030
Token in
Token ID287
Feature activation+1.40e-02

Pos logit contributions
 equipped+0.003
 ado+0.003
thy+0.003
 BOX+0.003
opathy+0.003
Neg logit contributions
arity-0.003
 sage-0.003
idas-0.003
wikipedia-0.003
[[-0.003
Token this
Token ID428
Feature activation-3.50e-03

Pos logit contributions
arity+0.010
 sage+0.009
idas+0.009
wikipedia+0.009
[[+0.009
Neg logit contributions
 equipped-0.010
 ado-0.010
thy-0.010
 BOX-0.010
opathy-0.009
Token temple
Token ID12505
Feature activation+4.94e-02

Pos logit contributions
 equipped+0.003
 ado+0.003
thy+0.003
 BOX+0.003
opathy+0.003
Neg logit contributions
arity-0.002
 sage-0.002
wikipedia-0.002
idas-0.002
[[-0.002
Token there
Token ID612
Feature activation+2.57e-02

Pos logit contributions
arity+0.033
 sage+0.032
idas+0.031
wikipedia+0.031
[[+0.031
Neg logit contributions
 equipped-0.036
 ado-0.035
 BOX-0.034
thy-0.034
opathy-0.034
Token are
Token ID389
Feature activation+4.85e-02

Pos logit contributions
arity+0.017
 sage+0.016
idas+0.015
wikipedia+0.015
[[+0.015
Neg logit contributions
 equipped-0.020
 ado-0.019
 BOX-0.019
thy-0.019
opathy-0.018
Token 7
Token ID767
Feature activation+5.80e-02

Pos logit contributions
arity+0.034
 sage+0.032
wikipedia+0.031
idas+0.031
[[+0.031
Neg logit contributions
 equipped-0.035
 ado-0.033
thy-0.033
 BOX-0.033
opathy-0.032
Token musical
Token ID10530
Feature activation-3.73e-02

Pos logit contributions
arity+0.041
 sage+0.039
idas+0.038
wikipedia+0.038
[[+0.038
Neg logit contributions
 equipped-0.041
 ado-0.039
 BOX-0.038
thy-0.038
opathy-0.038
Token the
Token ID262
Feature activation+5.50e-03

Pos logit contributions
arity+0.005
 sage+0.004
wikipedia+0.004
idas+0.004
[[+0.004
Neg logit contributions
 equipped-0.005
 ado-0.004
thy-0.004
 BOX-0.004
opathy-0.004
Token protesters
Token ID9584
Feature activation-3.22e-02

Pos logit contributions
arity+0.004
 sage+0.004
wikipedia+0.004
idas+0.004
[[+0.003
Neg logit contributions
 equipped-0.004
 ado-0.004
thy-0.004
 BOX-0.004
opathy-0.004
Token have
Token ID423
Feature activation-4.39e-02

Pos logit contributions
 equipped+0.024
 ado+0.023
thy+0.023
 BOX+0.022
opathy+0.022
Neg logit contributions
arity-0.022
 sage-0.020
wikipedia-0.020
idas-0.020
[[-0.020
Token a
Token ID257
Feature activation+1.52e-02

Pos logit contributions
 equipped+0.030
 ado+0.028
thy+0.028
 BOX+0.028
opathy+0.027
Neg logit contributions
arity-0.032
 sage-0.031
wikipedia-0.030
idas-0.030
[[-0.030
Token right
Token ID826
Feature activation-8.97e-02

Pos logit contributions
arity+0.012
 sage+0.011
idas+0.011
wikipedia+0.011
[[+0.011
Neg logit contributions
 equipped-0.010
 ado-0.009
thy-0.009
 BOX-0.009
opathy-0.009
Token to
Token ID284
Feature activation+8.69e-04

Pos logit contributions
 equipped+0.066
 ado+0.064
thy+0.063
 BOX+0.062
opathy+0.061
Neg logit contributions
arity-0.061
 sage-0.057
idas-0.056
wikipedia-0.056
[[-0.056
Token occupy
Token ID22265
Feature activation-1.00e-01

Pos logit contributions
arity+0.001
 sage+0.001
wikipedia+0.001
idas+0.001
[[+0.001
Neg logit contributions
 equipped-0.001
 ado-0.001
thy-0.001
 BOX-0.001
opathy-0.001
Token public
Token ID1171
Feature activation-5.26e-02

Pos logit contributions
 equipped+0.070
 ado+0.067
thy+0.066
 BOX+0.065
opathy+0.064
Neg logit contributions
arity-0.072
 sage-0.067
idas-0.067
wikipedia-0.067
[[-0.066
Token spaces
Token ID9029
Feature activation-1.58e-02

Pos logit contributions
 equipped+0.038
 ado+0.037
thy+0.036
 BOX+0.035
opathy+0.035
Neg logit contributions
arity-0.037
 sage-0.034
idas-0.034
wikipedia-0.034
[[-0.034
Token "
Token ID366
Feature activation-1.27e-01

Pos logit contributions
 equipped+0.012
 ado+0.011
thy+0.011
 BOX+0.011
opathy+0.011
Neg logit contributions
arity-0.011
 sage-0.010
idas-0.010
wikipedia-0.010
[[-0.010
Tokenas
Token ID292
Feature activation+8.12e-02

Pos logit contributions
 equipped+0.101
 ado+0.098
thy+0.096
 BOX+0.096
opathy+0.095
Neg logit contributions
arity-0.078
 sage-0.073
wikipedia-0.072
idas-0.072
[[-0.071
Token it
Token ID340
Feature activation-6.35e-02

Pos logit contributions
arity+0.039
 sage+0.037
wikipedia+0.037
idas+0.036
[[+0.036
Neg logit contributions
 equipped-0.040
 ado-0.038
thy-0.038
 BOX-0.037
opathy-0.037
Token would
Token ID561
Feature activation+1.52e-03

Pos logit contributions
 equipped+0.045
 ado+0.043
thy+0.043
 BOX+0.042
opathy+0.042
Neg logit contributions
arity-0.045
 sage-0.042
wikipedia-0.042
idas-0.042
[[-0.041
Token try
Token ID1949
Feature activation-2.44e-02

Pos logit contributions
arity+0.001
 sage+0.001
wikipedia+0.001
idas+0.001
[[+0.001
Neg logit contributions
 equipped-0.001
 ado-0.001
thy-0.001
 BOX-0.001
opathy-0.001
Token to
Token ID284
Feature activation-1.65e-02

Pos logit contributions
 equipped+0.018
 ado+0.017
thy+0.017
 BOX+0.017
opathy+0.017
Neg logit contributions
arity-0.016
 sage-0.016
wikipedia-0.015
idas-0.015
[[-0.015
Token mitigate
Token ID24237
Feature activation+9.30e-02

Pos logit contributions
 equipped+0.011
 ado+0.011
thy+0.011
 BOX+0.011
opathy+0.011
Neg logit contributions
arity-0.012
 sage-0.011
wikipedia-0.011
idas-0.011
[[-0.011
Token the
Token ID262
Feature activation-1.02e-02

Pos logit contributions
arity+0.065
 sage+0.062
wikipedia+0.061
idas+0.061
[[+0.060
Neg logit contributions
 equipped-0.066
 ado-0.064
thy-0.062
 BOX-0.062
opathy-0.061
Token impact
Token ID2928
Feature activation+2.72e-01

Pos logit contributions
 equipped+0.008
 ado+0.007
thy+0.007
 BOX+0.007
opathy+0.007
Neg logit contributions
arity-0.007
 sage-0.006
wikipedia-0.006
idas-0.006
[[-0.006
Token on
Token ID319
Feature activation+4.62e-02

Pos logit contributions
arity+0.185
 sage+0.174
wikipedia+0.171
idas+0.171
[[+0.170
Neg logit contributions
 equipped-0.201
 ado-0.192
thy-0.189
 BOX-0.188
opathy-0.186
Token employees
Token ID4409
Feature activation-2.22e-02

Pos logit contributions
arity+0.032
 sage+0.030
wikipedia+0.030
idas+0.030
[[+0.030
Neg logit contributions
 equipped-0.033
 ado-0.032
thy-0.031
 BOX-0.031
opathy-0.030
Token,
Token ID11
Feature activation-3.38e-03

Pos logit contributions
 equipped+0.016
 ado+0.015
thy+0.015
 BOX+0.015
opathy+0.015
Neg logit contributions
arity-0.015
 sage-0.015
wikipedia-0.014
idas-0.014
[[-0.014
Token suppliers
Token ID20499
Feature activation-5.24e-02

Pos logit contributions
 equipped+0.002
 ado+0.002
thy+0.002
 BOX+0.002
opathy+0.002
Neg logit contributions
arity-0.002
 sage-0.002
wikipedia-0.002
idas-0.002
[[-0.002
Token,
Token ID11
Feature activation+1.55e-02

Pos logit contributions
arity+0.049
 sage+0.046
wikipedia+0.046
idas+0.045
[[+0.045
Neg logit contributions
 equipped-0.057
 ado-0.055
thy-0.054
 BOX-0.054
opathy-0.053
Token
Token ID564
Feature activation+9.37e-03

Pos logit contributions
arity+0.013
 sage+0.012
wikipedia+0.012
idas+0.012
[[+0.012
Neg logit contributions
 equipped-0.009
 ado-0.009
thy-0.009
 BOX-0.009
opathy-0.009
Token
Token ID250
Feature activation+2.05e-02

Pos logit contributions
arity+0.007
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.006
 ado-0.006
thy-0.006
 BOX-0.006
opathy-0.006
TokenMom
Token ID29252
Feature activation+1.86e-01

Pos logit contributions
arity+0.013
 sage+0.012
idas+0.012
wikipedia+0.012
[[+0.012
Neg logit contributions
 equipped-0.016
 ado-0.015
thy-0.015
 BOX-0.015
opathy-0.015
Token,
Token ID11
Feature activation+7.33e-03

Pos logit contributions
arity+0.126
 sage+0.119
wikipedia+0.117
idas+0.117
[[+0.116
Neg logit contributions
 equipped-0.137
 ado-0.132
thy-0.130
 BOX-0.129
opathy-0.127
Token this
Token ID428
Feature activation+1.48e-02

Pos logit contributions
arity+0.005
 sage+0.005
idas+0.005
wikipedia+0.005
[[+0.005
Neg logit contributions
 equipped-0.005
 ado-0.005
thy-0.005
 BOX-0.005
opathy-0.005
Token is
Token ID318
Feature activation-1.06e-02

Pos logit contributions
arity+0.010
 sage+0.010
idas+0.010
wikipedia+0.010
[[+0.010
Neg logit contributions
 equipped-0.011
 ado-0.010
thy-0.010
 BOX-0.010
opathy-0.010
Token a
Token ID257
Feature activation+1.03e-02

Pos logit contributions
 equipped+0.008
 ado+0.007
thy+0.007
 BOX+0.007
opathy+0.007
Neg logit contributions
arity-0.007
 sage-0.007
idas-0.007
wikipedia-0.007
[[-0.007
Token sh
Token ID427
Feature activation-7.42e-02

Pos logit contributions
arity+0.007
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.007
 ado-0.007
thy-0.007
 BOX-0.007
opathy-0.007
Tokenanty
Token ID46098
Feature activation-5.51e-02

Pos logit contributions
 equipped+0.049
 ado+0.047
thy+0.046
 BOX+0.046
opathy+0.045
Neg logit contributions
arity-0.056
 sage-0.053
wikipedia-0.052
idas-0.052
[[-0.052
Tokentown
Token ID12735
Feature activation-2.72e-02

Pos logit contributions
 equipped+0.039
 ado+0.038
thy+0.037
 BOX+0.037
opathy+0.037
Neg logit contributions
arity-0.038
 sage-0.036
wikipedia-0.036
idas-0.036
[[-0.035
Tokenne
Token ID710
Feature activation-1.84e-01

Pos logit contributions
arity+0.024
 sage+0.023
wikipedia+0.022
idas+0.022
[[+0.022
Neg logit contributions
 equipped-0.029
 ado-0.028
thy-0.027
 BOX-0.027
opathy-0.027
Tokenh
Token ID71
Feature activation-2.76e-01

Pos logit contributions
 equipped+0.092
 ado+0.086
thy+0.084
 BOX+0.083
opathy+0.082
Neg logit contributions
arity-0.169
 sage-0.161
wikipedia-0.160
[[-0.160
idas-0.160
Tokenaha
Token ID12236
Feature activation-4.02e-01

Pos logit contributions
 equipped+0.077
 ado+0.069
 BOX+0.066
thy+0.064
opathy+0.062
Neg logit contributions
arity-0.315
 sage-0.303
idas-0.300
wikipedia-0.299
[[-0.299
Token Creek
Token ID13509
Feature activation+2.14e-02

Pos logit contributions
 equipped+0.289
 ado+0.277
thy+0.272
 BOX+0.271
opathy+0.267
Neg logit contributions
arity-0.281
 sage-0.264
wikipedia-0.261
idas-0.260
[[-0.259
Token and
Token ID290
Feature activation-2.60e-02

Pos logit contributions
arity+0.015
 sage+0.014
wikipedia+0.014
idas+0.014
[[+0.014
Neg logit contributions
 equipped-0.016
 ado-0.015
thy-0.015
 BOX-0.015
opathy-0.014
Token Lynd
Token ID33561
Feature activation+1.07e-01

Pos logit contributions
 equipped+0.019
 ado+0.018
thy+0.018
 BOX+0.018
opathy+0.017
Neg logit contributions
arity-0.018
 sage-0.017
idas-0.017
wikipedia-0.017
[[-0.017
Tokenale
Token ID1000
Feature activation-9.49e-02

Pos logit contributions
arity+0.086
 sage+0.082
idas+0.081
wikipedia+0.081
[[+0.080
Neg logit contributions
 equipped-0.065
 ado-0.062
thy-0.061
 BOX-0.060
opathy-0.059
Token Avenue
Token ID8878
Feature activation-5.13e-02

Pos logit contributions
 equipped+0.070
 ado+0.067
thy+0.066
 BOX+0.066
opathy+0.065
Neg logit contributions
arity-0.064
 sage-0.060
idas-0.059
wikipedia-0.059
[[-0.059
Token goes
Token ID2925
Feature activation-2.13e-03

Pos logit contributions
 equipped+0.037
 ado+0.036
thy+0.035
 BOX+0.035
opathy+0.034
Neg logit contributions
arity-0.036
 sage-0.033
wikipedia-0.033
idas-0.033
[[-0.033
Token before
Token ID878
Feature activation-5.91e-02

Pos logit contributions
 equipped+0.002
 ado+0.001
thy+0.001
 BOX+0.001
opathy+0.001
Neg logit contributions
arity-0.001
 sage-0.001
idas-0.001
wikipedia-0.001
[[-0.001
Token the
Token ID262
Feature activation-7.45e-03

Pos logit contributions
 equipped+0.043
 ado+0.042
thy+0.041
 BOX+0.041
opathy+0.040
Neg logit contributions
arity-0.040
 sage-0.038
idas-0.037
wikipedia-0.037
[[-0.037
Token
Token ID564
Feature activation+2.97e-03

Pos logit contributions
arity+0.003
 sage+0.003
wikipedia+0.003
idas+0.003
[[+0.003
Neg logit contributions
 equipped-0.003
 ado-0.003
thy-0.003
 BOX-0.002
opathy-0.002
Token
Token ID250
Feature activation+1.68e-02

Pos logit contributions
arity+0.002
 sage+0.002
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.002
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Tokenall
Token ID439
Feature activation-6.09e-02

Pos logit contributions
arity+0.010
 sage+0.010
idas+0.009
wikipedia+0.009
[[+0.009
Neg logit contributions
 equipped-0.013
 ado-0.013
thy-0.013
 BOX-0.013
opathy-0.013
Token in
Token ID287
Feature activation+1.87e-02

Pos logit contributions
 equipped+0.043
 ado+0.041
thy+0.041
 BOX+0.041
opathy+0.040
Neg logit contributions
arity-0.043
 sage-0.040
idas-0.040
wikipedia-0.040
[[-0.039
Token good
Token ID922
Feature activation-1.37e-01

Pos logit contributions
arity+0.014
 sage+0.013
wikipedia+0.013
idas+0.013
[[+0.013
Neg logit contributions
 equipped-0.013
 ado-0.012
thy-0.012
 BOX-0.012
opathy-0.012
Token fun
Token ID1257
Feature activation-1.79e-01

Pos logit contributions
 equipped+0.079
 ado+0.075
thy+0.073
 BOX+0.073
opathy+0.072
Neg logit contributions
arity-0.115
 sage-0.109
wikipedia-0.108
idas-0.108
[[-0.107
Token.
Token ID13
Feature activation+2.94e-03

Pos logit contributions
 equipped+0.129
 ado+0.124
thy+0.122
 BOX+0.121
opathy+0.120
Neg logit contributions
arity-0.123
 sage-0.116
wikipedia-0.114
idas-0.114
[[-0.114
Token
Token ID447
Feature activation+4.25e-03

Pos logit contributions
arity+0.002
 sage+0.002
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.002
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Token
Token ID251
Feature activation+1.66e-02

Pos logit contributions
arity+0.003
 sage+0.003
wikipedia+0.003
idas+0.003
[[+0.003
Neg logit contributions
 equipped-0.003
 ado-0.003
thy-0.003
 BOX-0.003
opathy-0.003
Token<|endoftext|>
Token ID50256
Feature activation-2.67e-01

Pos logit contributions
arity+0.011
 sage+0.011
wikipedia+0.010
idas+0.010
[[+0.010
Neg logit contributions
 equipped-0.012
 ado-0.012
thy-0.012
 BOX-0.011
opathy-0.011
TokenOnce
Token ID7454
Feature activation-1.02e-01

Pos logit contributions
 equipped+0.194
 ado+0.187
thy+0.184
 BOX+0.183
opathy+0.181
Neg logit contributions
arity-0.183
 sage-0.172
idas-0.170
wikipedia-0.169
[[-0.168
Token to
Token ID284
Feature activation-1.76e-02

Pos logit contributions
 equipped+0.053
 ado+0.051
thy+0.050
 BOX+0.050
opathy+0.049
Neg logit contributions
arity-0.055
 sage-0.052
wikipedia-0.051
idas-0.051
[[-0.050
Token female
Token ID4048
Feature activation-1.02e-01

Pos logit contributions
 equipped+0.012
 ado+0.012
thy+0.012
 BOX+0.012
opathy+0.012
Neg logit contributions
arity-0.012
 sage-0.012
wikipedia-0.012
[[-0.011
idas-0.011
Token veterans
Token ID11255
Feature activation+3.07e-01

Pos logit contributions
 equipped+0.070
 ado+0.068
thy+0.066
 BOX+0.066
opathy+0.065
Neg logit contributions
arity-0.073
 sage-0.069
wikipedia-0.068
idas-0.068
[[-0.068
Token and
Token ID290
Feature activation-3.03e-02

Pos logit contributions
arity+0.216
 sage+0.204
wikipedia+0.200
idas+0.200
[[+0.199
Neg logit contributions
 equipped-0.217
 ado-0.209
thy-0.205
 BOX-0.204
opathy-0.201
Token female
Token ID4048
Feature activation-1.02e-01

Pos logit contributions
 equipped+0.022
 ado+0.022
thy+0.021
 BOX+0.021
opathy+0.021
Neg logit contributions
arity-0.020
 sage-0.019
wikipedia-0.019
idas-0.019
[[-0.019
Token reporters
Token ID7638
Feature activation-2.06e-01

Pos logit contributions
 equipped+0.070
 ado+0.068
thy+0.067
 BOX+0.066
opathy+0.065
Neg logit contributions
arity-0.074
 sage-0.070
wikipedia-0.069
idas-0.069
[[-0.068
Token).
Token ID737
Feature activation+3.33e-02

Pos logit contributions
 equipped+0.140
 ado+0.135
thy+0.132
 BOX+0.131
opathy+0.130
Neg logit contributions
arity-0.150
 sage-0.142
idas-0.140
wikipedia-0.140
[[-0.140
Token Asked
Token ID26220
Feature activation+2.69e-02

Pos logit contributions
arity+0.023
 sage+0.022
wikipedia+0.022
idas+0.022
[[+0.022
Neg logit contributions
 equipped-0.024
 ado-0.023
thy-0.022
 BOX-0.022
opathy-0.022
Token about
Token ID546
Feature activation-5.86e-02

Pos logit contributions
arity+0.020
 sage+0.019
wikipedia+0.018
idas+0.018
[[+0.018
Neg logit contributions
 equipped-0.018
 ado-0.018
thy-0.017
 BOX-0.017
opathy-0.017
Token sexual
Token ID3206
Feature activation-1.25e-01

Pos logit contributions
 equipped+0.042
 ado+0.040
thy+0.039
 BOX+0.039
opathy+0.039
Neg logit contributions
arity-0.041
 sage-0.039
wikipedia-0.038
idas-0.038
[[-0.038
Token harassment
Token ID10556
Feature activation-9.86e-02

Pos logit contributions
 equipped+0.081
 ado+0.077
thy+0.076
 BOX+0.076
opathy+0.074
Neg logit contributions
arity-0.096
 sage-0.091
idas-0.089
wikipedia-0.089
[[-0.089
Token that
Token ID326
Feature activation+1.10e-02

Pos logit contributions
 equipped+0.103
 ado+0.099
thy+0.097
 BOX+0.096
opathy+0.095
Neg logit contributions
arity-0.096
 sage-0.090
wikipedia-0.089
idas-0.089
[[-0.089
Token usefulness
Token ID37496
Feature activation-1.28e-01

Pos logit contributions
arity+0.008
 sage+0.007
wikipedia+0.007
[[+0.007
idas+0.007
Neg logit contributions
 equipped-0.008
 ado-0.007
thy-0.007
 BOX-0.007
opathy-0.007
Token was
Token ID373
Feature activation-3.90e-02

Pos logit contributions
 equipped+0.092
 ado+0.088
thy+0.087
 BOX+0.086
opathy+0.085
Neg logit contributions
arity-0.090
 sage-0.084
wikipedia-0.083
idas-0.083
[[-0.083
Token good
Token ID922
Feature activation-1.27e-01

Pos logit contributions
 equipped+0.028
 ado+0.027
thy+0.027
 BOX+0.026
opathy+0.026
Neg logit contributions
arity-0.027
 sage-0.026
wikipedia-0.025
idas-0.025
[[-0.025
Token-
Token ID12
Feature activation+3.65e-02

Pos logit contributions
 equipped+0.094
 ado+0.091
thy+0.089
 BOX+0.089
opathy+0.088
Neg logit contributions
arity-0.086
 sage-0.081
wikipedia-0.079
idas-0.079
[[-0.079
Tokenin
Token ID259
Feature activation-8.53e-02

Pos logit contributions
arity+0.021
 sage+0.020
wikipedia+0.019
idas+0.019
[[+0.019
Neg logit contributions
 equipped-0.031
 ado-0.030
thy-0.029
 BOX-0.029
opathy-0.029
Token-
Token ID12
Feature activation+4.32e-02

Pos logit contributions
 equipped+0.066
 ado+0.063
thy+0.062
 BOX+0.062
opathy+0.061
Neg logit contributions
arity-0.055
 sage-0.051
idas-0.051
wikipedia-0.051
[[-0.051
Tokenit
Token ID270
Feature activation+2.09e-01

Pos logit contributions
arity+0.025
 sage+0.023
idas+0.023
wikipedia+0.023
[[+0.023
Neg logit contributions
 equipped-0.036
 ado-0.035
thy-0.034
 BOX-0.034
opathy-0.034
Tokenself
Token ID944
Feature activation+4.21e-02

Pos logit contributions
arity+0.135
 sage+0.125
wikipedia+0.123
idas+0.123
[[+0.122
Neg logit contributions
 equipped-0.163
 ado-0.156
 BOX-0.154
thy-0.153
opathy-0.152
Token.
Token ID13
Feature activation+7.81e-03

Pos logit contributions
arity+0.029
 sage+0.027
wikipedia+0.027
idas+0.027
[[+0.027
Neg logit contributions
 equipped-0.031
 ado-0.030
thy-0.029
 BOX-0.029
opathy-0.029
Token For
Token ID1114
Feature activation-2.02e-02

Pos logit contributions
arity+0.005
 sage+0.005
wikipedia+0.005
idas+0.005
[[+0.005
Neg logit contributions
 equipped-0.006
 ado-0.005
thy-0.005
 BOX-0.005
opathy-0.005
Token (
Token ID357
Feature activation-7.28e-02

Pos logit contributions
 equipped+0.036
 ado+0.035
thy+0.034
 BOX+0.034
opathy+0.034
Neg logit contributions
arity-0.035
 sage-0.033
idas-0.033
wikipedia-0.033
[[-0.032
Tokenwhen
Token ID12518
Feature activation-9.70e-02

Pos logit contributions
 equipped+0.057
 ado+0.055
thy+0.054
 BOX+0.054
opathy+0.053
Neg logit contributions
arity-0.046
 sage-0.043
idas-0.042
wikipedia-0.042
[[-0.042
Token he
Token ID339
Feature activation+8.11e-02

Pos logit contributions
 equipped+0.071
 ado+0.069
thy+0.068
 BOX+0.067
opathy+0.067
Neg logit contributions
arity-0.066
 sage-0.062
wikipedia-0.061
idas-0.061
[[-0.060
Token discovered
Token ID5071
Feature activation+6.16e-02

Pos logit contributions
arity+0.057
 sage+0.054
idas+0.053
wikipedia+0.053
[[+0.053
Neg logit contributions
 equipped-0.058
 ado-0.055
thy-0.054
 BOX-0.054
opathy-0.053
Token I
Token ID314
Feature activation-5.38e-02

Pos logit contributions
arity+0.042
 sage+0.040
idas+0.039
wikipedia+0.039
[[+0.039
Neg logit contributions
 equipped-0.045
 ado-0.043
thy-0.042
 BOX-0.042
opathy-0.042
Token was
Token ID373
Feature activation-3.95e-02

Pos logit contributions
 equipped+0.037
 ado+0.036
 BOX+0.035
thy+0.035
opathy+0.034
Neg logit contributions
arity-0.039
 sage-0.037
idas-0.036
wikipedia-0.036
[[-0.036
Token going
Token ID1016
Feature activation+3.51e-03

Pos logit contributions
 equipped+0.027
 ado+0.026
thy+0.025
 BOX+0.025
opathy+0.025
Neg logit contributions
arity-0.029
 sage-0.028
idas-0.027
[[-0.027
wikipedia-0.027
Token to
Token ID284
Feature activation+1.16e-02

Pos logit contributions
arity+0.002
 sage+0.002
idas+0.002
wikipedia+0.002
[[+0.002
Neg logit contributions
 equipped-0.003
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Token be
Token ID307
Feature activation+1.01e-01

Pos logit contributions
arity+0.008
 sage+0.007
idas+0.007
wikipedia+0.007
[[+0.007
Neg logit contributions
 equipped-0.009
 ado-0.008
thy-0.008
 BOX-0.008
opathy-0.008
Token writing
Token ID3597
Feature activation-4.50e-02

Pos logit contributions
arity+0.072
 sage+0.068
idas+0.067
wikipedia+0.067
[[+0.066
Neg logit contributions
 equipped-0.071
 ado-0.069
thy-0.067
 BOX-0.067
opathy-0.066
Token an
Token ID281
Feature activation-3.38e-02

Pos logit contributions
 equipped+0.032
 ado+0.031
thy+0.030
 BOX+0.030
opathy+0.030
Neg logit contributions
arity-0.032
 sage-0.030
wikipedia-0.029
idas-0.029
[[-0.029
Token,
Token ID11
Feature activation+1.65e-02

Pos logit contributions
 equipped+0.026
 ado+0.024
thy+0.024
 BOX+0.024
opathy+0.024
Neg logit contributions
arity-0.025
 sage-0.023
wikipedia-0.023
idas-0.023
[[-0.023
Token housed
Token ID23707
Feature activation-3.10e-01

Pos logit contributions
arity+0.012
 sage+0.011
idas+0.011
wikipedia+0.011
[[+0.011
Neg logit contributions
 equipped-0.012
 ado-0.011
thy-0.011
 BOX-0.011
opathy-0.011
Token in
Token ID287
Feature activation+3.42e-02

Pos logit contributions
 equipped+0.224
 ado+0.215
thy+0.211
 BOX+0.210
opathy+0.207
Neg logit contributions
arity-0.215
 sage-0.202
idas-0.200
wikipedia-0.200
[[-0.198
Token what
Token ID644
Feature activation+1.34e-02

Pos logit contributions
arity+0.023
 sage+0.022
idas+0.022
wikipedia+0.022
[[+0.022
Neg logit contributions
 equipped-0.025
 ado-0.024
thy-0.024
 BOX-0.023
opathy-0.023
Token was
Token ID373
Feature activation-4.29e-02

Pos logit contributions
arity+0.009
 sage+0.009
wikipedia+0.008
idas+0.008
[[+0.008
Neg logit contributions
 equipped-0.010
 ado-0.009
thy-0.009
 BOX-0.009
opathy-0.009
Token then
Token ID788
Feature activation-1.17e-01

Pos logit contributions
 equipped+0.030
 ado+0.029
thy+0.028
 BOX+0.028
opathy+0.028
Neg logit contributions
arity-0.031
 sage-0.029
idas-0.029
wikipedia-0.029
[[-0.028
Token the
Token ID262
Feature activation+1.30e-02

Pos logit contributions
 equipped+0.086
 ado+0.083
thy+0.081
 BOX+0.081
opathy+0.080
Neg logit contributions
arity-0.080
 sage-0.075
idas-0.074
wikipedia-0.074
[[-0.074
Token N
Token ID399
Feature activation+7.00e-02

Pos logit contributions
arity+0.009
 sage+0.008
idas+0.008
wikipedia+0.008
[[+0.008
Neg logit contributions
 equipped-0.010
 ado-0.009
thy-0.009
 BOX-0.009
opathy-0.009
Token64
Token ID2414
Feature activation-6.92e-02

Pos logit contributions
arity+0.054
 sage+0.051
idas+0.050
wikipedia+0.050
[[+0.050
Neg logit contributions
 equipped-0.045
 ado-0.043
thy-0.042
 BOX-0.042
opathy-0.041
Token
Token ID447
Feature activation+2.08e-02

Pos logit contributions
 equipped+0.047
 ado+0.045
thy+0.044
 BOX+0.044
opathy+0.043
Neg logit contributions
arity-0.051
 sage-0.048
wikipedia-0.048
idas-0.048
[[-0.048
Token
Token ID247
Feature activation+1.68e-02

Pos logit contributions
arity+0.014
 sage+0.013
idas+0.013
[[+0.013
wikipedia+0.013
Neg logit contributions
 equipped-0.016
 ado-0.015
thy-0.015
 BOX-0.015
opathy-0.014
Token,
Token ID11
Feature activation+3.24e-03

Pos logit contributions
arity+0.097
 sage+0.091
idas+0.090
wikipedia+0.090
[[+0.089
Neg logit contributions
 equipped-0.115
 ado-0.110
thy-0.109
 BOX-0.108
opathy-0.107
Token wolf
Token ID17481
Feature activation-2.33e-02

Pos logit contributions
arity+0.002
 sage+0.002
wikipedia+0.002
idas+0.002
[[+0.002
Neg logit contributions
 equipped-0.002
 ado-0.002
thy-0.002
 BOX-0.002
opathy-0.002
Token-
Token ID12
Feature activation+3.29e-02

Pos logit contributions
 equipped+0.017
 ado+0.017
thy+0.016
 BOX+0.016
opathy+0.016
Neg logit contributions
arity-0.016
 sage-0.015
wikipedia-0.014
idas-0.014
[[-0.014
Tokenwh
Token ID1929
Feature activation-1.66e-01

Pos logit contributions
arity+0.021
wikipedia+0.020
 sage+0.020
idas+0.020
[[+0.020
Neg logit contributions
 equipped-0.025
 ado-0.025
 BOX-0.024
thy-0.024
opathy-0.024
Tokenist
Token ID396
Feature activation-6.83e-02

Pos logit contributions
 equipped+0.104
 ado+0.099
thy+0.097
 BOX+0.096
opathy+0.095
Neg logit contributions
arity-0.131
 sage-0.124
wikipedia-0.123
idas-0.123
[[-0.122
Tokenling
Token ID1359
Feature activation-3.43e-01

Pos logit contributions
 equipped+0.022
 ado+0.020
thy+0.020
 BOX+0.019
opathy+0.019
Neg logit contributions
arity-0.074
 sage-0.071
idas-0.071
wikipedia-0.071
[[-0.071
Token or
Token ID393
Feature activation+1.84e-01

Pos logit contributions
 equipped+0.297
 ado+0.287
thy+0.283
 BOX+0.282
opathy+0.279
Neg logit contributions
arity-0.189
 sage-0.175
wikipedia-0.172
idas-0.172
[[-0.170
Token just
Token ID655
Feature activation+5.96e-02

Pos logit contributions
arity+0.128
 sage+0.121
idas+0.119
wikipedia+0.119
[[+0.119
Neg logit contributions
 equipped-0.132
 ado-0.127
thy-0.125
 BOX-0.124
opathy-0.122
Token ch
Token ID442
Feature activation-6.32e-02

Pos logit contributions
arity+0.032
 sage+0.030
wikipedia+0.029
idas+0.029
[[+0.029
Neg logit contributions
 equipped-0.052
 ado-0.050
thy-0.050
 BOX-0.049
opathy-0.049
Tokenor
Token ID273
Feature activation+1.38e-01

Pos logit contributions
 equipped+0.041
 ado+0.039
thy+0.038
 BOX+0.038
opathy+0.037
Neg logit contributions
arity-0.049
 sage-0.046
wikipedia-0.046
idas-0.045
[[-0.045
Tokenuses
Token ID2664
Feature activation+8.60e-02

Pos logit contributions
arity+0.086
 sage+0.079
idas+0.078
wikipedia+0.078
[[+0.077
Neg logit contributions
 equipped-0.111
 ado-0.107
thy-0.105
 BOX-0.105
opathy-0.103
Token
Token ID198
Feature activation-4.85e-03

Pos logit contributions
 equipped+0.012
 ado+0.012
thy+0.011
 BOX+0.011
opathy+0.011
Neg logit contributions
arity-0.012
 sage-0.012
wikipedia-0.012
idas-0.012
[[-0.012
Token
Token ID198
Feature activation-9.27e-03

Pos logit contributions
 equipped+0.004
 ado+0.003
thy+0.003
 BOX+0.003
opathy+0.003
Neg logit contributions
arity-0.003
 sage-0.003
idas-0.003
wikipedia-0.003
[[-0.003
TokenMr
Token ID5246
Feature activation+9.98e-02

Pos logit contributions
 equipped+0.007
 ado+0.007
thy+0.006
 BOX+0.006
opathy+0.006
Neg logit contributions
arity-0.006
 sage-0.006
idas-0.006
wikipedia-0.006
[[-0.006
Token.
Token ID13
Feature activation-1.75e-02

Pos logit contributions
arity+0.070
 sage+0.066
wikipedia+0.065
idas+0.065
[[+0.064
Neg logit contributions
 equipped-0.071
 ado-0.069
thy-0.067
 BOX-0.067
opathy-0.066
Token Mor
Token ID3461
Feature activation-9.30e-02

Pos logit contributions
 equipped+0.012
 ado+0.012
thy+0.012
 BOX+0.012
opathy+0.011
Neg logit contributions
arity-0.012
 sage-0.012
wikipedia-0.012
idas-0.012
[[-0.011
Tokenneau
Token ID46533
Feature activation-3.55e-01

Pos logit contributions
 equipped+0.029
 ado+0.026
 BOX+0.025
thy+0.024
opathy+0.024
Neg logit contributions
arity-0.103
 sage-0.099
idas-0.098
wikipedia-0.098
[[-0.098
Token was
Token ID373
Feature activation-7.07e-02

Pos logit contributions
 equipped+0.260
 ado+0.249
thy+0.245
 BOX+0.243
opathy+0.240
Neg logit contributions
arity-0.242
 sage-0.228
wikipedia-0.225
idas-0.225
[[-0.224
Token repeatedly
Token ID7830
Feature activation+2.43e-01

Pos logit contributions
 equipped+0.049
 ado+0.047
thy+0.046
 BOX+0.046
opathy+0.045
Neg logit contributions
arity-0.051
 sage-0.048
wikipedia-0.048
idas-0.048
[[-0.048
Token on
Token ID319
Feature activation+4.20e-02

Pos logit contributions
arity+0.168
 sage+0.158
idas+0.155
wikipedia+0.155
[[+0.154
Neg logit contributions
 equipped-0.175
 ado-0.169
thy-0.166
 BOX-0.165
opathy-0.163
Token the
Token ID262
Feature activation-1.19e-02

Pos logit contributions
arity+0.028
 sage+0.026
wikipedia+0.026
idas+0.026
[[+0.026
Neg logit contributions
 equipped-0.031
 ado-0.030
thy-0.030
 BOX-0.029
opathy-0.029
Token firing
Token ID9645
Feature activation+1.70e-02

Pos logit contributions
 equipped+0.009
 ado+0.008
thy+0.008
 BOX+0.008
opathy+0.008
Neg logit contributions
arity-0.008
 sage-0.008
wikipedia-0.008
idas-0.008
[[-0.008
Token off
Token ID572
Feature activation-4.15e-02

Pos logit contributions
arity+0.007
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.006
 ado-0.005
thy-0.005
 BOX-0.005
opathy-0.005
Tokenside
Token ID1589
Feature activation+6.36e-02

Pos logit contributions
 equipped+0.033
 ado+0.032
thy+0.031
 BOX+0.031
opathy+0.031
Neg logit contributions
arity-0.026
 sage-0.024
idas-0.024
wikipedia-0.024
[[-0.023
Token in
Token ID287
Feature activation+3.76e-02

Pos logit contributions
arity+0.044
 sage+0.041
wikipedia+0.041
idas+0.041
[[+0.041
Neg logit contributions
 equipped-0.046
 ado-0.044
thy-0.043
 BOX-0.043
opathy-0.043
Token the
Token ID262
Feature activation+1.69e-02

Pos logit contributions
arity+0.027
 sage+0.026
wikipedia+0.025
idas+0.025
[[+0.025
Neg logit contributions
 equipped-0.026
 ado-0.025
thy-0.024
 BOX-0.024
opathy-0.024
Token 73
Token ID8854
Feature activation+1.54e-02

Pos logit contributions
arity+0.011
 sage+0.011
wikipedia+0.011
idas+0.011
[[+0.010
Neg logit contributions
 equipped-0.013
 ado-0.012
thy-0.012
 BOX-0.012
opathy-0.012
Tokenrd
Token ID4372
Feature activation-3.60e-01

Pos logit contributions
arity+0.017
 sage+0.017
wikipedia+0.016
idas+0.016
[[+0.016
Neg logit contributions
 equipped-0.005
 ado-0.004
 BOX-0.004
thy-0.004
opathy-0.004
Token minute
Token ID5664
Feature activation+1.80e-01

Pos logit contributions
 equipped+0.336
 ado+0.326
thy+0.321
 BOX+0.320
opathy+0.317
Neg logit contributions
arity-0.174
 sage-0.159
idas-0.156
wikipedia-0.156
[[-0.155
Token and
Token ID290
Feature activation+1.60e-03

Pos logit contributions
arity+0.124
 sage+0.117
wikipedia+0.116
idas+0.115
[[+0.115
Neg logit contributions
 equipped-0.131
 ado-0.126
thy-0.123
 BOX-0.123
opathy-0.121
Token Ireland
Token ID7517
Feature activation+7.21e-03

Pos logit contributions
arity+0.001
 sage+0.001
wikipedia+0.001
idas+0.001
[[+0.001
Neg logit contributions
 equipped-0.001
 ado-0.001
thy-0.001
 BOX-0.001
opathy-0.001
Token also
Token ID635
Feature activation-9.30e-02

Pos logit contributions
arity+0.005
 sage+0.005
idas+0.005
wikipedia+0.005
[[+0.005
Neg logit contributions
 equipped-0.005
 ado-0.005
thy-0.005
 BOX-0.005
opathy-0.005
Token had
Token ID550
Feature activation-1.81e-02

Pos logit contributions
 equipped+0.067
 ado+0.064
thy+0.063
 BOX+0.063
opathy+0.062
Neg logit contributions
arity-0.065
 sage-0.061
wikipedia-0.060
idas-0.060
[[-0.060
Token
Token ID250
Feature activation+2.22e-02

Pos logit contributions
arity+0.007
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.006
 ado-0.006
thy-0.006
 BOX-0.006
opathy-0.006
TokenI
Token ID40
Feature activation+6.55e-02

Pos logit contributions
arity+0.016
 sage+0.015
idas+0.015
wikipedia+0.014
[[+0.014
Neg logit contributions
 equipped-0.016
 ado-0.015
thy-0.015
 BOX-0.015
opathy-0.015
Token
Token ID447
Feature activation+1.63e-02

Pos logit contributions
arity+0.049
 sage+0.046
wikipedia+0.046
idas+0.046
[[+0.045
Neg logit contributions
 equipped-0.044
 ado-0.042
thy-0.041
 BOX-0.041
opathy-0.040
Token
Token ID247
Feature activation+7.48e-03

Pos logit contributions
arity+0.012
 sage+0.012
wikipedia+0.011
[[+0.011
idas+0.011
Neg logit contributions
 equipped-0.011
 ado-0.010
thy-0.010
 BOX-0.010
opathy-0.010
Tokenm
Token ID76
Feature activation+2.20e-01

Pos logit contributions
arity+0.005
 sage+0.005
wikipedia+0.005
idas+0.005
[[+0.005
Neg logit contributions
 equipped-0.006
 ado-0.005
thy-0.005
 BOX-0.005
opathy-0.005
Token shocked
Token ID11472
Feature activation-3.43e-01

Pos logit contributions
arity+0.159
 sage+0.150
wikipedia+0.148
idas+0.147
[[+0.147
Neg logit contributions
 equipped-0.152
 ado-0.147
thy-0.144
 BOX-0.143
opathy-0.141
Token,
Token ID11
Feature activation+7.96e-03

Pos logit contributions
 equipped+0.240
 ado+0.230
thy+0.226
 BOX+0.225
opathy+0.222
Neg logit contributions
arity-0.245
 sage-0.231
wikipedia-0.228
idas-0.228
[[-0.227
Token I
Token ID314
Feature activation-6.39e-02

Pos logit contributions
arity+0.007
 sage+0.007
wikipedia+0.007
[[+0.007
idas+0.007
Neg logit contributions
 equipped-0.004
 ado-0.004
thy-0.004
 BOX-0.004
opathy-0.004
Token
Token ID447
Feature activation+1.10e-02

Pos logit contributions
 equipped+0.042
 ado+0.040
thy+0.040
 BOX+0.039
opathy+0.039
Neg logit contributions
arity-0.048
 sage-0.046
wikipedia-0.045
idas-0.045
[[-0.045
Token
Token ID247
Feature activation+1.05e-02

Pos logit contributions
arity+0.008
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.008
 ado-0.008
 BOX-0.008
thy-0.008
opathy-0.008
Tokenm
Token ID76
Feature activation+2.12e-01

Pos logit contributions
arity+0.006
 sage+0.005
wikipedia+0.005
idas+0.005
[[+0.005
Neg logit contributions
 equipped-0.009
 ado-0.009
thy-0.009
 BOX-0.009
opathy-0.008
Token from
Token ID422
Feature activation-3.95e-02

Pos logit contributions
arity+0.164
 sage+0.155
wikipedia+0.152
idas+0.152
[[+0.152
Neg logit contributions
 equipped-0.164
 ado-0.157
thy-0.154
 BOX-0.153
opathy-0.151
Token Mitch
Token ID20472
Feature activation-5.27e-02

Pos logit contributions
 equipped+0.028
 ado+0.027
thy+0.027
 BOX+0.026
opathy+0.026
Neg logit contributions
arity-0.028
 sage-0.026
wikipedia-0.026
idas-0.026
[[-0.026
Token McConnell
Token ID18184
Feature activation-1.00e-01

Pos logit contributions
 equipped+0.033
 ado+0.031
thy+0.031
 BOX+0.030
opathy+0.030
Neg logit contributions
arity-0.042
 sage-0.040
idas-0.039
wikipedia-0.039
[[-0.039
Token and
Token ID290
Feature activation-9.20e-03

Pos logit contributions
 equipped+0.075
 ado+0.072
thy+0.070
 BOX+0.070
opathy+0.069
Neg logit contributions
arity-0.067
 sage-0.063
wikipedia-0.062
idas-0.062
[[-0.062
Token Chuck
Token ID14663
Feature activation+2.19e-02

Pos logit contributions
 equipped+0.007
 ado+0.006
thy+0.006
 BOX+0.006
opathy+0.006
Neg logit contributions
arity-0.006
 sage-0.006
wikipedia-0.006
idas-0.006
[[-0.006
Token Grassley
Token ID44050
Feature activation-3.18e-01

Pos logit contributions
arity+0.021
 sage+0.020
idas+0.020
wikipedia+0.020
[[+0.020
Neg logit contributions
 equipped-0.010
 ado-0.010
thy-0.009
 BOX-0.009
opathy-0.009
Token would
Token ID561
Feature activation+1.17e-02

Pos logit contributions
 equipped+0.233
 ado+0.224
thy+0.220
 BOX+0.218
opathy+0.216
Neg logit contributions
arity-0.217
 sage-0.204
wikipedia-0.201
idas-0.201
[[-0.200
Token be
Token ID307
Feature activation+9.30e-02

Pos logit contributions
arity+0.008
 sage+0.007
wikipedia+0.007
idas+0.007
[[+0.007
Neg logit contributions
 equipped-0.009
 ado-0.008
thy-0.008
 BOX-0.008
opathy-0.008
Token the
Token ID262
Feature activation+5.75e-03

Pos logit contributions
arity+0.064
 sage+0.061
wikipedia+0.060
idas+0.060
[[+0.059
Neg logit contributions
 equipped-0.067
 ado-0.064
thy-0.063
 BOX-0.063
opathy-0.062
Token end
Token ID886
Feature activation+4.24e-02

Pos logit contributions
arity+0.004
 sage+0.004
wikipedia+0.003
idas+0.003
[[+0.003
Neg logit contributions
 equipped-0.004
 ado-0.004
thy-0.004
 BOX-0.004
opathy-0.004
Token result
Token ID1255
Feature activation+1.95e-01

Pos logit contributions
arity+0.028
 sage+0.026
wikipedia+0.026
idas+0.026
[[+0.026
Neg logit contributions
 equipped-0.032
 ado-0.031
thy-0.030
 BOX-0.030
opathy-0.030