TOP ACTIVATIONS
MAX = 0.128

from the junk in her parent â s garage
his thumb in both his parent 's hands . As they
was walking outside with his parent . The sky was very
way up with her smart parent by her side . <|endoftext|>
to do well on his parent 's tests . He got
the star fish in his mother â s face
it of all the family members . The little
make a great fer ret parent . The
and felt safe in her parent 's arms . From then
from her kitchen into her mother 's biggest curtain ! She
to play tricks on his friends . He would often make
who loved playing in her family 's kitchen . She would
to play tricks on his friends . One day , he
borrow something from her team members . â
took care of her family members and was always very kind
a small farm with his parent . Every day , the
they c uddled with their parent on the couch and listened
my enjoyed travelling with his family . They had been on
table eating dinner with his family . He noticed the light
to play tricks on her friends . One day , Lily

MIN ACTIVATIONS
MIN = -0.153

can 't . The ze br as have their own food
could see the biggest , ju ic iest cher ries ever
The eggs were blue and spe ck led . They looked
a little bit ." Mom squee zes some shampoo on her
soon nothing but a small spe ck in the sky .
be the rip est , ju ic iest pear in the
The eggs were blue and spe ck led . L ila
to get the biggest , ju ic iest carrots .
scree ched . The ze br as ne ighed . The
She said , " A br ac ad ab ra !"
he found the biggest , ju ic iest squash he had
." L ily squee zes his hand . "
said together , " A br ac ad ab ra !"
She found the biggest and ju ic iest one , but
of the frog . He spe ared the lazy frog and
and saying , " A br ac ad ab ra !"
small for Ben . He squee zes in , but it
was the sweet est and ju ic iest cherry he had
rewarded with the biggest , ju ic iest cher ries !
some of the biggest , ju ic iest berries in the

INTERVAL 0.058 - 0.128
CONTAINS 2.582%

hugged her friend to comfort her . Her friend stopped crying
it , Jane ?" her mother asked . "
He played happily with his mom until it was time for
and round er . He thought it would make a more
want to leave . She wanted to keep playing . The

INTERVAL -0.012 - 0.058
CONTAINS 66.215%

The dog destroys the tubes . The tunnels and bridges and
did not give up . She tried again to unt ie
the bird . She opened her umbrella and put it on
wanted her to disappear again . <|endoftext|> Once there was a
your feelings . If something makes you feel bad , you

INTERVAL -0.083 - -0.012
CONTAINS 30.973%

They surprised dad when he came home . He was happy
new shoes . As the three of them were walking ,
it . The cater pillar said hello back .
so happy that he started to jump up and down in
pretty carpet on the wall ?" asked Alice .

INTERVAL -0.153 - -0.083
CONTAINS 0.230%

gigg led and said We can
wed and che wed and che wed on the shoe .
o never forgot the fast che et ah he had seen
than Uncle ! Un cle was so happy he
the window and saw the re ind eer too . Tim
Token from
Token ID422
Feature activation+9.37e-03

Pos logit contributions
 pose+0.103
 surely+0.101
olog+0.096
 score+0.094
 drib+0.092
Neg logit contributions
lessness-0.094
pt-0.084
aky-0.080
 crystals-0.077
ovable-0.074
Token the
Token ID262
Feature activation+9.43e-03

Pos logit contributions
lessness+0.014
pt+0.013
aky+0.011
 twins+0.011
 crystals+0.011
Neg logit contributions
 pose-0.019
 surely-0.019
 drib-0.017
 score-0.017
zes-0.017
Token junk
Token ID18556
Feature activation-6.08e-03

Pos logit contributions
lessness+0.017
pt+0.017
aky+0.015
 crystals+0.014
 twins+0.014
Neg logit contributions
 pose-0.016
 surely-0.015
 drib-0.014
 score-0.014
 salon-0.014
Token in
Token ID287
Feature activation-2.81e-02

Pos logit contributions
 pose+0.014
 surely+0.014
 drib+0.013
olog+0.013
 som+0.013
Neg logit contributions
lessness-0.007
pt-0.007
aky-0.005
 twins-0.005
 crystals-0.005
Token her
Token ID607
Feature activation+8.02e-02

Pos logit contributions
 surely+0.042
 pose+0.042
 score+0.039
 drib+0.038
 jugg+0.038
Neg logit contributions
lessness-0.055
pt-0.053
aky-0.048
ovable-0.046
 crystals-0.046
Token parent
Token ID2560
Feature activation+1.28e-01

Pos logit contributions
lessness+0.134
pt+0.128
 crystals+0.108
aky+0.107
ruck+0.106
Neg logit contributions
 pose-0.145
 surely-0.139
 drib-0.136
 score-0.134
 salon-0.131
Tokenâ
Token ID22940
Feature activation-2.69e-02

Pos logit contributions
lessness+0.251
pt+0.236
 crystals+0.212
 twins+0.200
ovable+0.200
Neg logit contributions
 pose-0.198
 drib-0.178
 surely-0.174
 score-0.173
 salon-0.172
Token
Token ID26391
Feature activation-3.39e-02

Pos logit contributions
olog+0.034
 dod+0.034
 som+0.033
 surely+0.033
 seventeen+0.033
Neg logit contributions
aky-0.051
pt-0.050
lessness-0.050
ady-0.046
 twins-0.046
Token
Token ID8151
Feature activation-2.61e-02

Pos logit contributions
 surely+0.059
 pose+0.056
 seventeen+0.056
 sixty+0.053
olog+0.053
Neg logit contributions
lessness-0.057
pt-0.055
 twins-0.052
 everywhere-0.051
 who-0.049
Tokens
Token ID82
Feature activation+6.00e-02

Pos logit contributions
 surely+0.055
 pose+0.055
 drib+0.053
olog+0.053
 seventeen+0.052
Neg logit contributions
lessness-0.031
pt-0.031
 crystals-0.027
aky-0.025
 twins-0.024
Token garage
Token ID15591
Feature activation+2.14e-02

Pos logit contributions
lessness+0.104
pt+0.097
aky+0.088
ruck+0.085
ovable+0.084
Neg logit contributions
 pose-0.104
 drib-0.095
 score-0.094
 salon-0.093
 destination-0.091
Token his
Token ID465
Feature activation+2.40e-02

Pos logit contributions
 surely+0.020
 pose+0.020
 drib+0.019
olog+0.018
 som+0.018
Neg logit contributions
lessness-0.013
pt-0.012
 twins-0.011
 crystals-0.010
aky-0.010
Token thumb
Token ID15683
Feature activation+2.46e-02

Pos logit contributions
lessness+0.041
pt+0.040
ruck+0.033
aky+0.033
 twins+0.032
Neg logit contributions
 pose-0.043
 surely-0.043
 score-0.039
 drib-0.039
zes-0.038
Token in
Token ID287
Feature activation-3.30e-02

Pos logit contributions
lessness+0.029
pt+0.025
 twins+0.023
 crystals+0.021
ruck+0.020
Neg logit contributions
 surely-0.055
 pose-0.055
 drib-0.052
 score-0.051
olog-0.051
Token both
Token ID1111
Feature activation+7.26e-02

Pos logit contributions
 surely+0.065
 pose+0.065
 drib+0.062
olog+0.061
 som+0.060
Neg logit contributions
lessness-0.049
pt-0.047
aky-0.040
 twins-0.040
 crystals-0.040
Token his
Token ID465
Feature activation+7.42e-02

Pos logit contributions
pt+0.115
lessness+0.114
aky+0.096
 crystals+0.095
 twins+0.095
Neg logit contributions
 pose-0.135
 surely-0.132
 drib-0.125
 score-0.124
 som-0.119
Token parent
Token ID2560
Feature activation+1.25e-01

Pos logit contributions
lessness+0.129
pt+0.126
ruck+0.107
 twins+0.107
aky+0.106
Neg logit contributions
 pose-0.128
 surely-0.126
 drib-0.117
 score-0.117
-0.115
Token's
Token ID338
Feature activation+6.58e-02

Pos logit contributions
lessness+0.226
pt+0.219
 crystals+0.196
 twins+0.192
aky+0.188
Neg logit contributions
 pose-0.201
 surely-0.194
 drib-0.186
olog-0.184
 score-0.179
Token hands
Token ID2832
Feature activation+4.85e-02

Pos logit contributions
lessness+0.112
pt+0.109
ruck+0.093
aky+0.092
 crystals+0.092
Neg logit contributions
 pose-0.112
 surely-0.111
 drib-0.104
-0.103
 score-0.102
Token.
Token ID13
Feature activation+2.21e-02

Pos logit contributions
lessness+0.060
pt+0.055
 crystals+0.051
 twins+0.050
aky+0.049
Neg logit contributions
 surely-0.102
 pose-0.098
 drib-0.098
olog-0.094
 score-0.092
Token As
Token ID1081
Feature activation+5.40e-03

Pos logit contributions
lessness+0.028
pt+0.025
 twins+0.025
 everywhere+0.024
 of+0.021
Neg logit contributions
 pose-0.048
 surely-0.046
olog-0.045
 som-0.045
 jugg-0.045
Token they
Token ID484
Feature activation+2.25e-02

Pos logit contributions
lessness+0.008
pt+0.007
 everywhere+0.007
 twins+0.007
 crystals+0.006
Neg logit contributions
 pose-0.011
olog-0.010
 seventeen-0.010
 surely-0.010
 drib-0.010
Token was
Token ID373
Feature activation+3.06e-02

Pos logit contributions
lessness+0.044
 crystals+0.037
ovable+0.035
 importantly+0.034
taker+0.034
Neg logit contributions
-0.018
 surely-0.018
 drib-0.018
zes-0.017
oh-0.017
Token walking
Token ID6155
Feature activation-1.25e-02

Pos logit contributions
lessness+0.045
pt+0.043
 crystals+0.036
aky+0.035
 twins+0.033
Neg logit contributions
 surely-0.062
 pose-0.060
 drib-0.059
olog-0.058
 score-0.056
Token outside
Token ID2354
Feature activation-2.61e-02

Pos logit contributions
 drib+0.025
 surely+0.024
 score+0.024
 pose+0.024
zes+0.022
Neg logit contributions
lessness-0.019
pt-0.017
aky-0.015
ovable-0.015
 crystals-0.015
Token with
Token ID351
Feature activation+3.51e-02

Pos logit contributions
 pose+0.051
 surely+0.050
 drib+0.050
zes+0.047
 score+0.047
Neg logit contributions
lessness-0.040
pt-0.037
aky-0.033
 crystals-0.032
 twins-0.031
Token his
Token ID465
Feature activation+6.74e-02

Pos logit contributions
lessness+0.068
pt+0.059
aky+0.056
ovable+0.055
iness+0.051
Neg logit contributions
 drib-0.054
 pose-0.050
 jugg-0.050
 som-0.050
 surely-0.049
Token parent
Token ID2560
Feature activation+1.22e-01

Pos logit contributions
lessness+0.128
pt+0.119
aky+0.106
 crystals+0.096
ovable+0.095
Neg logit contributions
 pose-0.106
 drib-0.106
 surely-0.104
 score-0.100
-0.094
Token.
Token ID13
Feature activation+3.40e-02

Pos logit contributions
lessness+0.204
pt+0.179
aky+0.164
ovable+0.164
 crystals+0.162
Neg logit contributions
 drib-0.202
 pose-0.199
 surely-0.187
-0.186
tons-0.185
Token The
Token ID383
Feature activation-1.85e-02

Pos logit contributions
lessness+0.059
pt+0.056
aky+0.054
ovable+0.049
 crystals+0.048
Neg logit contributions
 surely-0.056
 drib-0.055
olog-0.054
-0.054
 pose-0.052
Token sky
Token ID6766
Feature activation-2.90e-02

Pos logit contributions
 surely+0.030
 drib+0.028
olog+0.027
 pose+0.027
 jugg+0.026
Neg logit contributions
pt-0.033
lessness-0.032
aky-0.029
ovable-0.027
 crystals-0.026
Token was
Token ID373
Feature activation+1.27e-02

Pos logit contributions
 pose+0.059
olog+0.057
 surely+0.057
 seventeen+0.055
 drib+0.055
Neg logit contributions
lessness-0.041
pt-0.040
 everywhere-0.035
 twins-0.034
 crystals-0.033
Token very
Token ID845
Feature activation-6.31e-03

Pos logit contributions
lessness+0.016
pt+0.015
 twins+0.012
aky+0.012
 crystals+0.012
Neg logit contributions
 surely-0.028
 pose-0.027
 drib-0.026
olog-0.026
 seventeen-0.026
Token way
Token ID835
Feature activation+1.16e-03

Pos logit contributions
lessness+0.012
pt+0.011
aky+0.009
 everywhere+0.009
 crystals+0.009
Neg logit contributions
 pose-0.016
 surely-0.016
 drib-0.015
 som-0.015
 score-0.015
Token up
Token ID510
Feature activation-2.93e-02

Pos logit contributions
lessness+0.002
pt+0.002
aky+0.001
 twins+0.001
 crystals+0.001
Neg logit contributions
 surely-0.002
 pose-0.002
 drib-0.002
olog-0.002
 som-0.002
Token with
Token ID351
Feature activation+1.33e-02

Pos logit contributions
 surely+0.064
 pose+0.064
olog+0.062
 drib+0.061
 seventeen+0.060
Neg logit contributions
lessness-0.036
pt-0.034
 twins-0.029
 crystals-0.028
aky-0.027
Token her
Token ID607
Feature activation+5.72e-02

Pos logit contributions
lessness+0.018
pt+0.017
aky+0.015
 twins+0.015
 crystals+0.014
Neg logit contributions
 surely-0.027
 pose-0.027
 drib-0.026
 som-0.025
olog-0.025
Token smart
Token ID4451
Feature activation+7.45e-03

Pos logit contributions
lessness+0.080
pt+0.077
aky+0.065
 twins+0.064
 crystals+0.064
Neg logit contributions
 surely-0.116
 pose-0.116
 drib-0.110
olog-0.109
 score-0.108
Token parent
Token ID2560
Feature activation+1.21e-01

Pos logit contributions
lessness+0.012
pt+0.011
aky+0.010
 twins+0.010
 crystals+0.010
Neg logit contributions
 pose-0.014
 surely-0.014
 drib-0.013
olog-0.013
 som-0.013
Token by
Token ID416
Feature activation-2.01e-02

Pos logit contributions
lessness+0.137
pt+0.133
 crystals+0.113
 twins+0.107
aky+0.107
Neg logit contributions
 surely-0.271
 pose-0.269
 drib-0.254
olog-0.253
 score-0.251
Token her
Token ID607
Feature activation+5.97e-02

Pos logit contributions
 surely+0.036
 pose+0.036
olog+0.035
 seventeen+0.033
ven+0.033
Neg logit contributions
lessness-0.033
pt-0.032
 twins-0.029
 everywhere-0.028
 crystals-0.027
Token side
Token ID1735
Feature activation+2.43e-02

Pos logit contributions
lessness+0.093
pt+0.089
 twins+0.079
 everywhere+0.077
 crystals+0.077
Neg logit contributions
 pose-0.111
 surely-0.111
olog-0.107
 drib-0.104
 seventeen-0.104
Token.
Token ID13
Feature activation+2.19e-02

Pos logit contributions
lessness+0.025
pt+0.024
 crystals+0.019
 twins+0.019
aky+0.018
Neg logit contributions
 pose-0.058
 surely-0.057
 drib-0.054
olog-0.054
 som-0.054
Token<|endoftext|>
Token ID50256
Feature activation+2.27e-02

Pos logit contributions
lessness+0.030
pt+0.028
 twins+0.025
 everywhere+0.024
 crystals+0.023
Neg logit contributions
 pose-0.046
 surely-0.045
olog-0.044
 seventeen-0.043
 drib-0.043
Token to
Token ID284
Feature activation-4.60e-02

Pos logit contributions
 pose+0.092
 surely+0.090
 drib+0.086
olog+0.085
 som+0.085
Neg logit contributions
lessness-0.053
pt-0.053
 twins-0.043
aky-0.042
 crystals-0.042
Token do
Token ID466
Feature activation-7.07e-03

Pos logit contributions
olog+0.078
 seventeen+0.075
+0.072
Too+0.071
 surely+0.071
Neg logit contributions
pt-0.075
lessness-0.075
 who-0.069
 twins-0.068
 everywhere-0.067
Token well
Token ID880
Feature activation-3.33e-02

Pos logit contributions
 surely+0.015
 pose+0.014
olog+0.014
 seventeen+0.014
 drib+0.014
Neg logit contributions
lessness-0.010
pt-0.009
 twins-0.008
 everywhere-0.008
 crystals-0.008
Token on
Token ID319
Feature activation-3.83e-02

Pos logit contributions
 surely+0.076
 pose+0.076
olog+0.074
 seventeen+0.073
 drib+0.073
Neg logit contributions
lessness-0.037
pt-0.035
 twins-0.029
 everywhere-0.028
 crystals-0.028
Token his
Token ID465
Feature activation+6.14e-02

Pos logit contributions
 surely+0.087
olog+0.087
 pose+0.085
 seventeen+0.084
+0.081
Neg logit contributions
lessness-0.041
pt-0.040
 twins-0.037
 everywhere-0.035
 crystals-0.034
Token parent
Token ID2560
Feature activation+1.18e-01

Pos logit contributions
lessness+0.089
 twins+0.088
pt+0.085
 toys+0.084
 crystals+0.083
Neg logit contributions
 seventeen-0.114
olog-0.111
 surely-0.110
ven-0.102
 jugg-0.101
Token's
Token ID338
Feature activation+5.56e-02

Pos logit contributions
lessness+0.183
pt+0.176
 twins+0.155
 everywhere+0.152
 crystals+0.151
Neg logit contributions
 surely-0.219
 pose-0.218
olog-0.211
 drib-0.205
 seventeen-0.205
Token tests
Token ID5254
Feature activation-2.83e-04

Pos logit contributions
lessness+0.090
pt+0.087
 twins+0.082
 everywhere+0.079
 crystals+0.078
Neg logit contributions
 surely-0.097
 seventeen-0.095
olog-0.094
 pose-0.090
ven-0.089
Token.
Token ID13
Feature activation+2.26e-02

Pos logit contributions
 surely+0.001
 pose+0.001
olog+0.001
 drib+0.001
 seventeen+0.001
Neg logit contributions
lessness-0.000
pt-0.000
 twins-0.000
 everywhere-0.000
 crystals-0.000
Token He
Token ID679
Feature activation-2.83e-04

Pos logit contributions
lessness+0.031
pt+0.029
 twins+0.026
 everywhere+0.024
 crystals+0.024
Neg logit contributions
 pose-0.047
 surely-0.046
olog-0.045
 seventeen-0.044
 som-0.044
Token got
Token ID1392
Feature activation+1.42e-02

Pos logit contributions
 seventeen+0.001
olog+0.001
 pose+0.001
 surely+0.001
+0.001
Neg logit contributions
pt-0.000
lessness-0.000
 twins-0.000
aky-0.000
 everywhere-0.000
Token the
Token ID262
Feature activation+3.22e-03

Pos logit contributions
lessness+0.021
pt+0.020
 twins+0.017
aky+0.017
 crystals+0.017
Neg logit contributions
 surely-0.027
 pose-0.027
olog-0.026
 drib-0.025
 som-0.025
Token star
Token ID3491
Feature activation-2.09e-02

Pos logit contributions
lessness+0.005
pt+0.005
aky+0.005
 everywhere+0.004
 twins+0.004
Neg logit contributions
 surely-0.006
 pose-0.006
 score-0.005
 drib-0.005
 seventeen-0.005
Tokenfish
Token ID11084
Feature activation+2.15e-02

Pos logit contributions
 pose+0.041
 surely+0.041
 drib+0.038
 score+0.037
 som+0.037
Neg logit contributions
lessness-0.030
pt-0.029
 twins-0.026
 crystals-0.024
aky-0.024
Token in
Token ID287
Feature activation-1.99e-02

Pos logit contributions
lessness+0.028
pt+0.026
 twins+0.023
aky+0.022
 crystals+0.022
Neg logit contributions
 surely-0.046
 pose-0.046
 drib-0.043
olog-0.043
 som-0.042
Token his
Token ID465
Feature activation+8.27e-02

Pos logit contributions
 surely+0.035
 pose+0.034
 drib+0.033
 score+0.033
 jugg+0.032
Neg logit contributions
lessness-0.034
pt-0.033
aky-0.029
 twins-0.029
 crystals-0.028
Token mother
Token ID2802
Feature activation+1.17e-01

Pos logit contributions
lessness+0.134
pt+0.128
 crystals+0.110
 twins+0.109
aky+0.108
Neg logit contributions
 pose-0.151
 surely-0.151
 drib-0.143
 score-0.141
 som-0.140
Tokenâ
Token ID22940
Feature activation-3.29e-02

Pos logit contributions
lessness+0.189
pt+0.180
 twins+0.158
 crystals+0.157
aky+0.154
Neg logit contributions
 pose-0.214
 surely-0.210
 drib-0.200
olog-0.200
 seventeen-0.195
Token
Token ID26391
Feature activation-4.62e-02

Pos logit contributions
 jugg+0.042
 som+0.042
 pose+0.042
 salon+0.041
 dod+0.040
Neg logit contributions
pt-0.063
lessness-0.061
aky-0.060
ady-0.057
 company-0.053
Token
Token ID8151
Feature activation-3.21e-02

Pos logit contributions
 surely+0.090
 pose+0.088
 seventeen+0.085
olog+0.083
 sixty+0.082
Neg logit contributions
lessness-0.068
pt-0.065
 twins-0.060
 everywhere-0.058
 who-0.057
Tokens
Token ID82
Feature activation+6.04e-02

Pos logit contributions
 surely+0.060
 pose+0.059
olog+0.057
 drib+0.057
 seventeen+0.056
Neg logit contributions
lessness-0.048
pt-0.047
 crystals-0.042
 twins-0.040
aky-0.040
Token face
Token ID1986
Feature activation+5.56e-02

Pos logit contributions
lessness+0.100
pt+0.095
aky+0.088
 crystals+0.087
 twins+0.084
Neg logit contributions
 pose-0.103
 surely-0.101
 drib-0.100
 som-0.097
 score-0.097
Token it
Token ID340
Feature activation+2.39e-02

Pos logit contributions
 surely+0.057
 pose+0.054
 drib+0.053
 som+0.052
 jugg+0.052
Neg logit contributions
lessness-0.053
pt-0.050
aky-0.045
 twins-0.043
ovable-0.043
Token of
Token ID286
Feature activation+4.19e-03

Pos logit contributions
lessness+0.031
pt+0.029
 twins+0.025
aky+0.024
 crystals+0.024
Neg logit contributions
 surely-0.052
 pose-0.051
 drib-0.049
olog-0.049
 score-0.047
Token all
Token ID477
Feature activation+4.31e-02

Pos logit contributions
lessness+0.006
pt+0.006
aky+0.005
 twins+0.005
ovable+0.005
Neg logit contributions
 surely-0.008
 pose-0.008
 drib-0.008
 jugg-0.007
 som-0.007
Token the
Token ID262
Feature activation+3.00e-02

Pos logit contributions
pt+0.073
lessness+0.071
 twins+0.061
 crystals+0.061
aky+0.060
Neg logit contributions
 surely-0.075
 pose-0.071
 drib-0.070
 score-0.068
 jugg-0.065
Token family
Token ID1641
Feature activation+9.45e-02

Pos logit contributions
lessness+0.041
pt+0.040
aky+0.030
ovable+0.029
 crystals+0.029
Neg logit contributions
 pose-0.062
 surely-0.061
 drib-0.060
 score-0.059
olog-0.058
Token members
Token ID1866
Feature activation+1.17e-01

Pos logit contributions
pt+0.137
lessness+0.132
aky+0.111
 crystals+0.106
ovable+0.105
Neg logit contributions
 surely-0.176
 pose-0.174
 drib-0.172
 score-0.168
 jugg-0.165
Token.
Token ID13
Feature activation+3.60e-02

Pos logit contributions
pt+0.152
lessness+0.150
aky+0.130
 crystals+0.128
ovable+0.126
Neg logit contributions
 surely-0.215
 drib-0.213
 pose-0.207
 score-0.205
 jugg-0.202
Token
Token ID198
Feature activation+1.37e-02

Pos logit contributions
lessness+0.051
pt+0.049
 twins+0.044
 everywhere+0.042
 crystals+0.041
Neg logit contributions
 pose-0.073
 surely-0.072
olog-0.070
 seventeen-0.068
 drib-0.068
Token
Token ID198
Feature activation+1.66e-03

Pos logit contributions
lessness+0.022
pt+0.022
 crystals+0.019
 twins+0.019
aky+0.018
Neg logit contributions
 pose-0.024
 surely-0.024
 drib-0.023
olog-0.023
 jugg-0.023
TokenThe
Token ID464
Feature activation-3.87e-02

Pos logit contributions
lessness+0.003
pt+0.003
 twins+0.003
aky+0.003
 everywhere+0.003
Neg logit contributions
 surely-0.003
 pose-0.003
olog-0.003
 seventeen-0.003
zes-0.002
Token little
Token ID1310
Feature activation-4.47e-02

Pos logit contributions
 surely+0.072
+0.065
olog+0.064
 drib+0.064
 jugg+0.064
Neg logit contributions
pt-0.064
lessness-0.061
aky-0.053
ovable-0.049
sm-0.047
Token make
Token ID787
Feature activation+5.66e-03

Pos logit contributions
 pose+0.041
 surely+0.041
olog+0.039
 seventeen+0.038
 drib+0.038
Neg logit contributions
lessness-0.047
pt-0.046
 twins-0.041
 everywhere-0.040
 crystals-0.040
Token a
Token ID257
Feature activation+2.17e-02

Pos logit contributions
lessness+0.008
pt+0.007
aky+0.006
 crystals+0.006
 twins+0.006
Neg logit contributions
 pose-0.012
 surely-0.012
olog-0.011
 drib-0.011
 som-0.011
Token great
Token ID1049
Feature activation-2.19e-02

Pos logit contributions
lessness+0.025
pt+0.024
 twins+0.022
 everywhere+0.021
 crystals+0.020
Neg logit contributions
 surely-0.047
 pose-0.046
 seventeen-0.045
olog-0.045
-0.045
Token fer
Token ID11354
Feature activation-9.76e-02

Pos logit contributions
 pose+0.038
 surely+0.037
olog+0.036
 drib+0.035
 som+0.034
Neg logit contributions
lessness-0.038
pt-0.037
 twins-0.032
 crystals-0.032
aky-0.032
Tokenret
Token ID1186
Feature activation+3.29e-02

Pos logit contributions
 pose+0.170
 surely+0.168
+0.161
tons+0.157
olog+0.157
Neg logit contributions
lessness-0.158
pt-0.152
aky-0.147
 of-0.132
 everywhere-0.131
Token parent
Token ID2560
Feature activation+1.17e-01

Pos logit contributions
pt+0.045
lessness+0.045
 crystals+0.037
 twins+0.037
 everywhere+0.035
Neg logit contributions
 pose-0.068
 surely-0.067
olog-0.064
 drib-0.063
 score-0.062
Token.
Token ID13
Feature activation+2.14e-02

Pos logit contributions
pt+0.152
lessness+0.151
 crystals+0.127
 twins+0.121
aky+0.120
Neg logit contributions
 pose-0.248
 surely-0.248
 score-0.232
 drib-0.228
olog-0.226
Token
Token ID220
Feature activation-8.06e-03

Pos logit contributions
lessness+0.028
pt+0.025
 twins+0.024
 everywhere+0.022
 of+0.021
Neg logit contributions
 pose-0.046
 surely-0.045
olog-0.044
 seventeen-0.044
 som-0.043
Token
Token ID198
Feature activation+1.14e-02

Pos logit contributions
 surely+0.016
 pose+0.016
olog+0.015
 drib+0.015
 seventeen+0.015
Neg logit contributions
lessness-0.011
pt-0.011
 twins-0.009
 crystals-0.009
 everywhere-0.009
Token
Token ID198
Feature activation-7.27e-04

Pos logit contributions
lessness+0.016
pt+0.015
 twins+0.013
 everywhere+0.013
 crystals+0.013
Neg logit contributions
 pose-0.023
 surely-0.023
olog-0.022
 drib-0.022
 seventeen-0.022
TokenThe
Token ID464
Feature activation-4.71e-02

Pos logit contributions
 pose+0.001
 surely+0.001
olog+0.001
zes+0.001
 seventeen+0.001
Neg logit contributions
lessness-0.001
pt-0.001
aky-0.001
 twins-0.001
 of-0.001
Token and
Token ID290
Feature activation-2.18e-02

Pos logit contributions
 pose+0.016
 surely+0.016
 drib+0.015
 som+0.015
olog+0.015
Neg logit contributions
pt-0.012
lessness-0.012
aky-0.012
 crystals-0.011
 twins-0.010
Token felt
Token ID2936
Feature activation+3.65e-02

Pos logit contributions
 pose+0.043
 surely+0.043
olog+0.042
 seventeen+0.041
 drib+0.041
Neg logit contributions
lessness-0.032
pt-0.030
 twins-0.026
 everywhere-0.026
 crystals-0.025
Token safe
Token ID3338
Feature activation-8.88e-03

Pos logit contributions
lessness+0.048
pt+0.047
 crystals+0.044
 twins+0.042
 bricks+0.041
Neg logit contributions
 surely-0.077
 pose-0.071
 drib-0.067
olog-0.067
 som-0.067
Token in
Token ID287
Feature activation-3.21e-02

Pos logit contributions
 surely+0.018
 pose+0.017
 drib+0.016
olog+0.016
 som+0.016
Neg logit contributions
lessness-0.012
pt-0.012
aky-0.011
 crystals-0.010
ovable-0.010
Token her
Token ID607
Feature activation+6.11e-02

Pos logit contributions
 surely+0.068
 pose+0.068
 drib+0.064
 som+0.063
 score+0.062
Neg logit contributions
lessness-0.042
pt-0.041
aky-0.035
 crystals-0.033
 twins-0.033
Token parent
Token ID2560
Feature activation+1.16e-01

Pos logit contributions
lessness+0.094
pt+0.091
 twins+0.079
aky+0.078
 crystals+0.078
Neg logit contributions
 pose-0.115
 surely-0.115
olog-0.109
 drib-0.107
 som-0.105
Token's
Token ID338
Feature activation+6.24e-02

Pos logit contributions
lessness+0.206
pt+0.203
 crystals+0.182
 twins+0.177
aky+0.175
Neg logit contributions
 pose-0.190
 surely-0.189
olog-0.177
 drib-0.174
 score-0.169
Token arms
Token ID5101
Feature activation+4.54e-02

Pos logit contributions
lessness+0.101
pt+0.101
 crystals+0.090
aky+0.087
ruck+0.085
Neg logit contributions
 pose-0.108
 surely-0.105
zes-0.098
 drib-0.097
 score-0.096
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
lessness+0.055
pt+0.052
 crystals+0.051
 twins+0.049
aky+0.047
Neg logit contributions
 surely-0.096
 pose-0.091
 drib-0.089
olog-0.086
 score-0.086
Token From
Token ID3574
Feature activation-6.82e-03

Pos logit contributions
lessness+0.033
pt+0.032
 twins+0.028
 everywhere+0.027
 crystals+0.027
Neg logit contributions
 pose-0.039
 surely-0.039
olog-0.038
 drib-0.037
 seventeen-0.037
Token then
Token ID788
Feature activation-3.15e-02

Pos logit contributions
 surely+0.011
 pose+0.011
olog+0.010
 drib+0.010
zes+0.010
Neg logit contributions
lessness-0.012
pt-0.012
 twins-0.011
aky-0.010
 crystals-0.010
Token from
Token ID422
Feature activation+5.40e-03

Pos logit contributions
 surely+0.011
 pose+0.011
olog+0.010
 drib+0.010
ven+0.010
Neg logit contributions
lessness-0.008
pt-0.008
 twins-0.007
aky-0.007
 crystals-0.006
Token her
Token ID607
Feature activation+6.12e-02

Pos logit contributions
lessness+0.008
pt+0.008
 twins+0.007
 everywhere+0.007
 crystals+0.007
Neg logit contributions
 pose-0.010
 surely-0.010
olog-0.010
 drib-0.010
 seventeen-0.010
Token kitchen
Token ID9592
Feature activation+5.96e-03

Pos logit contributions
lessness+0.094
pt+0.089
aky+0.075
 twins+0.073
 crystals+0.073
Neg logit contributions
 pose-0.119
 surely-0.117
 drib-0.113
olog-0.112
zes-0.109
Token into
Token ID656
Feature activation-4.28e-03

Pos logit contributions
lessness+0.007
pt+0.006
 twins+0.005
 crystals+0.005
 everywhere+0.005
Neg logit contributions
 pose-0.014
 surely-0.014
olog-0.013
 drib-0.013
 seventeen-0.013
Token her
Token ID607
Feature activation+8.30e-02

Pos logit contributions
 pose+0.007
 surely+0.007
 drib+0.006
 score+0.006
 jugg+0.006
Neg logit contributions
lessness-0.008
pt-0.008
ovable-0.007
aky-0.007
 twins-0.006
Token mother
Token ID2802
Feature activation+1.15e-01

Pos logit contributions
lessness+0.129
pt+0.125
aky+0.101
 crystals+0.101
 twins+0.100
Neg logit contributions
 pose-0.161
 surely-0.156
 drib-0.151
olog-0.149
zes-0.146
Token's
Token ID338
Feature activation+5.98e-02

Pos logit contributions
lessness+0.218
pt+0.209
 crystals+0.175
 twins+0.174
aky+0.173
Neg logit contributions
 pose-0.189
 surely-0.176
 drib-0.171
olog-0.166
 score-0.165
Token biggest
Token ID4094
Feature activation+1.83e-02

Pos logit contributions
lessness+0.096
pt+0.092
aky+0.076
ovable+0.072
 crystals+0.072
Neg logit contributions
 pose-0.113
 drib-0.107
 surely-0.107
olog-0.104
zes-0.103
Token curtain
Token ID29461
Feature activation+1.84e-03

Pos logit contributions
lessness+0.030
pt+0.029
aky+0.025
 twins+0.024
 everywhere+0.024
Neg logit contributions
 pose-0.033
 surely-0.033
 drib-0.031
olog-0.031
 som-0.030
Token!
Token ID0
Feature activation+2.33e-02

Pos logit contributions
lessness+0.002
pt+0.002
 twins+0.002
aky+0.002
 crystals+0.001
Neg logit contributions
 pose-0.004
 surely-0.004
olog-0.004
 drib-0.004
 som-0.004
Token She
Token ID1375
Feature activation-6.47e-03

Pos logit contributions
lessness+0.029
pt+0.028
 twins+0.024
 everywhere+0.023
 crystals+0.023
Neg logit contributions
 pose-0.051
 surely-0.051
olog-0.049
 drib-0.048
 seventeen-0.048
Token to
Token ID284
Feature activation-2.48e-02

Pos logit contributions
lessness+0.090
pt+0.085
aky+0.073
 twins+0.069
ovable+0.067
Neg logit contributions
 pose-0.114
 drib-0.110
 som-0.108
 surely-0.108
 jugg-0.105
Token play
Token ID711
Feature activation+2.29e-02

Pos logit contributions
 surely+0.043
olog+0.042
 pose+0.039
 seventeen+0.039
+0.039
Neg logit contributions
pt-0.041
lessness-0.039
 crystals-0.037
 twins-0.036
 everywhere-0.035
Token tricks
Token ID15910
Feature activation+2.21e-02

Pos logit contributions
lessness+0.035
pt+0.033
 twins+0.028
 crystals+0.027
aky+0.027
Neg logit contributions
 pose-0.045
 surely-0.044
 drib-0.043
olog-0.042
 som-0.042
Token on
Token ID319
Feature activation-9.35e-03

Pos logit contributions
lessness+0.028
pt+0.027
aky+0.023
 crystals+0.023
 twins+0.022
Neg logit contributions
 pose-0.047
 surely-0.045
 drib-0.044
 som-0.043
 jugg-0.043
Token his
Token ID465
Feature activation+8.40e-02

Pos logit contributions
 surely+0.022
olog+0.022
 pose+0.021
 seventeen+0.021
ven+0.020
Neg logit contributions
pt-0.009
lessness-0.009
 everywhere-0.009
 twins-0.008
 crystals-0.008
Token friends
Token ID2460
Feature activation+1.15e-01

Pos logit contributions
lessness+0.107
pt+0.103
 twins+0.087
 everywhere+0.085
 crystals+0.085
Neg logit contributions
 pose-0.180
 surely-0.180
olog-0.174
 drib-0.171
 seventeen-0.170
Token.
Token ID13
Feature activation+3.83e-02

Pos logit contributions
lessness+0.159
pt+0.133
 crystals+0.111
 twins+0.111
aky+0.107
Neg logit contributions
 pose-0.255
 drib-0.246
 score-0.235
 jugg-0.235
 surely-0.231
Token He
Token ID679
Feature activation+9.09e-04

Pos logit contributions
lessness+0.040
 twins+0.037
pt+0.037
 everywhere+0.034
 who+0.033
Neg logit contributions
 surely-0.087
olog-0.085
 pose-0.085
zes-0.084
 seventeen-0.080
Token would
Token ID561
Feature activation-3.09e-02

Pos logit contributions
pt+0.001
lessness+0.001
 everywhere+0.001
 twins+0.001
aky+0.001
Neg logit contributions
 surely-0.002
olog-0.002
 pose-0.002
 seventeen-0.002
-0.002
Token often
Token ID1690
Feature activation-7.94e-04

Pos logit contributions
olog+0.049
 surely+0.048
 pose+0.046
 seventeen+0.046
+0.046
Neg logit contributions
pt-0.053
lessness-0.053
 everywhere-0.051
 twins-0.050
 who-0.048
Token make
Token ID787
Feature activation+1.76e-02

Pos logit contributions
 surely+0.001
olog+0.001
 pose+0.001
 seventeen+0.001
+0.001
Neg logit contributions
pt-0.001
lessness-0.001
 everywhere-0.001
 twins-0.001
 crystals-0.001
Token who
Token ID508
Feature activation+1.12e-02

Pos logit contributions
lessness+0.032
pt+0.029
 crystals+0.026
 everywhere+0.024
 twins+0.024
Neg logit contributions
 pose-0.034
 surely-0.032
 drib-0.031
zes-0.031
 jugg-0.030
Token loved
Token ID6151
Feature activation+5.71e-02

Pos logit contributions
lessness+0.015
pt+0.014
 crystals+0.013
 bricks+0.012
 twins+0.012
Neg logit contributions
 pose-0.023
 surely-0.023
 drib-0.022
olog-0.022
-0.021
Token playing
Token ID2712
Feature activation+2.90e-02

Pos logit contributions
lessness+0.088
pt+0.082
aky+0.073
 twins+0.067
ovable+0.066
Neg logit contributions
 pose-0.111
 drib-0.107
 surely-0.105
 som-0.104
 jugg-0.103
Token in
Token ID287
Feature activation-7.64e-03

Pos logit contributions
lessness+0.040
pt+0.036
aky+0.030
 twins+0.030
 crystals+0.029
Neg logit contributions
 pose-0.063
 drib-0.060
 surely-0.059
 score-0.058
olog-0.058
Token her
Token ID607
Feature activation+9.13e-02

Pos logit contributions
 pose+0.014
 surely+0.014
 drib+0.013
olog+0.013
 som+0.013
Neg logit contributions
lessness-0.013
pt-0.012
aky-0.010
 twins-0.010
 everywhere-0.010
Token family
Token ID1641
Feature activation+1.15e-01

Pos logit contributions
lessness+0.140
pt+0.132
aky+0.107
 everywhere+0.106
 crystals+0.106
Neg logit contributions
 pose-0.177
 surely-0.173
 drib-0.168
zes-0.165
-0.165
Token's
Token ID338
Feature activation+7.61e-02

Pos logit contributions
lessness+0.167
pt+0.154
ovable+0.116
 crystals+0.116
aky+0.116
Neg logit contributions
 pose-0.239
 drib-0.234
-0.223
 surely-0.222
 score-0.221
Token kitchen
Token ID9592
Feature activation+2.53e-02

Pos logit contributions
lessness+0.142
pt+0.129
ovable+0.108
iness+0.107
aky+0.106
Neg logit contributions
 drib-0.123
 pose-0.122
zes-0.118
 score-0.118
 salon-0.116
Token.
Token ID13
Feature activation+3.31e-02

Pos logit contributions
lessness+0.029
pt+0.027
aky+0.021
 crystals+0.020
 twins+0.020
Neg logit contributions
 pose-0.059
 drib-0.056
 surely-0.056
olog-0.056
 som-0.055
Token She
Token ID1375
Feature activation-2.97e-03

Pos logit contributions
lessness+0.042
pt+0.040
 twins+0.037
 everywhere+0.035
 crystals+0.034
Neg logit contributions
 surely-0.070
 pose-0.070
olog-0.068
 seventeen-0.066
 drib-0.066
Token would
Token ID561
Feature activation-3.61e-02

Pos logit contributions
 pose+0.006
 surely+0.006
 drib+0.006
olog+0.006
 som+0.006
Neg logit contributions
lessness-0.004
pt-0.004
 twins-0.003
 crystals-0.003
 everywhere-0.003
Token to
Token ID284
Feature activation-2.62e-02

Pos logit contributions
lessness+0.084
pt+0.079
aky+0.068
 twins+0.065
 crystals+0.063
Neg logit contributions
 pose-0.113
 surely-0.109
 drib-0.109
 som-0.108
 jugg-0.106
Token play
Token ID711
Feature activation+2.22e-02

Pos logit contributions
 surely+0.042
olog+0.041
 pose+0.040
 seventeen+0.039
+0.039
Neg logit contributions
pt-0.047
lessness-0.045
 twins-0.041
 crystals-0.041
 everywhere-0.040
Token tricks
Token ID15910
Feature activation+2.10e-02

Pos logit contributions
lessness+0.030
pt+0.027
 twins+0.023
 crystals+0.023
aky+0.023
Neg logit contributions
 pose-0.048
 surely-0.047
 drib-0.046
olog-0.045
 som-0.045
Token on
Token ID319
Feature activation-9.88e-03

Pos logit contributions
lessness+0.026
pt+0.025
 crystals+0.021
aky+0.021
 twins+0.020
Neg logit contributions
 pose-0.045
 surely-0.044
 drib-0.043
 som-0.042
 jugg-0.042
Token his
Token ID465
Feature activation+8.56e-02

Pos logit contributions
olog+0.024
 surely+0.024
 seventeen+0.022
 pose+0.022
ven+0.022
Neg logit contributions
pt-0.010
lessness-0.010
 everywhere-0.009
 twins-0.009
 crystals-0.009
Token friends
Token ID2460
Feature activation+1.15e-01

Pos logit contributions
lessness+0.133
pt+0.128
 twins+0.110
 crystals+0.108
 everywhere+0.108
Neg logit contributions
 pose-0.162
 surely-0.161
olog-0.154
 drib-0.153
 som-0.150
Token.
Token ID13
Feature activation+4.02e-02

Pos logit contributions
lessness+0.165
pt+0.137
 crystals+0.119
 twins+0.118
ruck+0.116
Neg logit contributions
 pose-0.246
 drib-0.237
 score-0.226
 jugg-0.225
 surely-0.222
Token One
Token ID1881
Feature activation+1.07e-02

Pos logit contributions
lessness+0.050
pt+0.046
 twins+0.045
 everywhere+0.041
 crystals+0.040
Neg logit contributions
 surely-0.086
 pose-0.085
olog-0.084
zes-0.082
 seventeen-0.081
Token day
Token ID1110
Feature activation-1.68e-02

Pos logit contributions
lessness+0.017
pt+0.016
 twins+0.014
 crystals+0.014
 everywhere+0.014
Neg logit contributions
 pose-0.020
 surely-0.020
 drib-0.019
olog-0.019
 som-0.018
Token,
Token ID11
Feature activation-7.07e-03

Pos logit contributions
 pose+0.035
 surely+0.035
olog+0.034
 seventeen+0.033
 drib+0.033
Neg logit contributions
lessness-0.023
pt-0.022
 everywhere-0.019
 twins-0.019
 crystals-0.018
Token he
Token ID339
Feature activation-1.43e-03

Pos logit contributions
 pose+0.015
 surely+0.014
olog+0.014
 seventeen+0.014
 drib+0.014
Neg logit contributions
lessness-0.010
pt-0.009
 twins-0.008
 everywhere-0.008
 crystals-0.008
Token borrow
Token ID8804
Feature activation+7.12e-03

Pos logit contributions
 seventeen+0.094
olog+0.093
 pose+0.092
 surely+0.089
zes+0.087
Neg logit contributions
pt-0.095
lessness-0.091
 of-0.085
 twins-0.083
 everywhere-0.081
Token something
Token ID1223
Feature activation+6.43e-02

Pos logit contributions
lessness+0.009
pt+0.008
 twins+0.007
aky+0.007
 everywhere+0.007
Neg logit contributions
 pose-0.016
 surely-0.016
olog-0.015
 drib-0.015
 seventeen-0.015
Token from
Token ID422
Feature activation-4.66e-04

Pos logit contributions
lessness+0.091
pt+0.083
aky+0.073
 twins+0.073
 crystals+0.068
Neg logit contributions
 pose-0.133
 surely-0.128
 drib-0.125
olog-0.124
 som-0.122
Token her
Token ID607
Feature activation+6.32e-02

Pos logit contributions
 pose+0.001
 surely+0.001
olog+0.001
 seventeen+0.001
 drib+0.001
Neg logit contributions
lessness-0.001
pt-0.001
 twins-0.001
 everywhere-0.001
 crystals-0.001
Token team
Token ID1074
Feature activation+4.13e-02

Pos logit contributions
lessness+0.088
pt+0.085
 twins+0.075
 everywhere+0.073
 crystals+0.072
Neg logit contributions
 surely-0.127
 pose-0.126
olog-0.124
 seventeen-0.121
 drib-0.119
Token members
Token ID1866
Feature activation+1.15e-01

Pos logit contributions
lessness+0.046
pt+0.042
 crystals+0.034
 twins+0.033
aky+0.032
Neg logit contributions
 pose-0.098
 surely-0.095
 drib-0.092
olog-0.092
 score-0.091
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
lessness+0.134
pt+0.125
 crystals+0.104
aky+0.101
 twins+0.100
Neg logit contributions
 pose-0.259
 surely-0.254
 drib-0.247
 score-0.242
olog-0.241
Token
Token ID198
Feature activation+7.97e-03

Pos logit contributions
lessness+0.018
pt+0.017
 twins+0.014
 everywhere+0.014
 crystals+0.013
Neg logit contributions
 pose-0.032
 surely-0.031
 seventeen-0.030
olog-0.030
 jugg-0.029
Token
Token ID198
Feature activation-2.32e-03

Pos logit contributions
lessness+0.011
pt+0.011
 twins+0.009
 everywhere+0.009
aky+0.009
Neg logit contributions
 pose-0.016
 surely-0.016
olog-0.015
 drib-0.015
 seventeen-0.015
Tokenâ
Token ID22940
Feature activation-7.23e-02

Pos logit contributions
 pose+0.004
 surely+0.004
olog+0.004
 seventeen+0.003
 drib+0.003
Neg logit contributions
lessness-0.004
pt-0.004
 twins-0.004
aky-0.004
 everywhere-0.004
Token
Token ID26391
Feature activation-7.97e-02

Pos logit contributions
 pose+0.088
 som+0.087
 salon+0.085
 jugg+0.084
 surely+0.084
Neg logit contributions
lessness-0.152
pt-0.142
aky-0.137
 who-0.128
ady-0.127
Token took
Token ID1718
Feature activation+1.63e-02

Pos logit contributions
 surely+0.042
 pose+0.042
 seventeen+0.042
olog+0.041
 drib+0.040
Neg logit contributions
pt-0.027
lessness-0.027
 twins-0.023
aky-0.022
 everywhere-0.022
Token care
Token ID1337
Feature activation+6.58e-02

Pos logit contributions
lessness+0.022
pt+0.020
 twins+0.017
aky+0.017
 crystals+0.017
Neg logit contributions
 pose-0.035
 surely-0.034
olog-0.033
 drib-0.032
 som-0.032
Token of
Token ID286
Feature activation+9.47e-04

Pos logit contributions
pt+0.069
 crystals+0.050
lessness+0.049
 twins+0.048
aky+0.044
Neg logit contributions
 surely-0.153
 pose-0.151
 drib-0.137
olog-0.135
ro-0.133
Token her
Token ID607
Feature activation+5.48e-02

Pos logit contributions
lessness+0.001
pt+0.001
 twins+0.001
aky+0.001
 crystals+0.001
Neg logit contributions
 pose-0.002
 surely-0.002
olog-0.002
 drib-0.002
 som-0.002
Token family
Token ID1641
Feature activation+9.29e-02

Pos logit contributions
lessness+0.073
pt+0.069
aky+0.062
ovable+0.053
ruck+0.051
Neg logit contributions
 pose-0.114
 score-0.108
 surely-0.106
 drib-0.105
zes-0.102
Token members
Token ID1866
Feature activation+1.14e-01

Pos logit contributions
lessness+0.129
pt+0.127
aky+0.103
 crystals+0.102
 twins+0.099
Neg logit contributions
 pose-0.196
 surely-0.188
 drib-0.181
 score-0.177
olog-0.176
Token and
Token ID290
Feature activation+5.32e-03

Pos logit contributions
pt+0.153
lessness+0.150
 crystals+0.124
aky+0.122
 twins+0.115
Neg logit contributions
 pose-0.239
 surely-0.231
 drib-0.222
 score-0.217
 jugg-0.212
Token was
Token ID373
Feature activation+1.57e-02

Pos logit contributions
lessness+0.007
pt+0.007
aky+0.006
 twins+0.006
 crystals+0.006
Neg logit contributions
 pose-0.011
 surely-0.011
olog-0.010
 drib-0.010
 som-0.010
Token always
Token ID1464
Feature activation-8.50e-03

Pos logit contributions
lessness+0.017
pt+0.016
 twins+0.014
 everywhere+0.013
aky+0.013
Neg logit contributions
 pose-0.037
olog-0.036
 surely-0.036
 seventeen-0.035
 drib-0.035
Token very
Token ID845
Feature activation-1.73e-02

Pos logit contributions
 pose+0.019
 surely+0.019
olog+0.018
 drib+0.018
 seventeen+0.018
Neg logit contributions
lessness-0.010
pt-0.010
 twins-0.008
 crystals-0.008
 everywhere-0.008
Token kind
Token ID1611
Feature activation+1.88e-02

Pos logit contributions
 pose+0.038
 surely+0.037
olog+0.037
 seventeen+0.036
 drib+0.036
Neg logit contributions
lessness-0.022
pt-0.020
 twins-0.018
aky-0.017
 everywhere-0.017
Token a
Token ID257
Feature activation+4.98e-02

Pos logit contributions
 pose+0.078
 surely+0.077
 drib+0.075
olog+0.074
 som+0.073
Neg logit contributions
lessness-0.034
pt-0.031
 twins-0.024
aky-0.024
 crystals-0.023
Token small
Token ID1402
Feature activation-8.86e-04

Pos logit contributions
lessness+0.063
pt+0.062
 everywhere+0.057
 twins+0.056
aky+0.054
Neg logit contributions
 surely-0.104
olog-0.102
 pose-0.101
 seventeen-0.100
ven-0.097
Token farm
Token ID5318
Feature activation-1.36e-02

Pos logit contributions
 pose+0.002
 surely+0.002
olog+0.001
 drib+0.001
 seventeen+0.001
Neg logit contributions
lessness-0.001
pt-0.001
 twins-0.001
 everywhere-0.001
 crystals-0.001
Token with
Token ID351
Feature activation+2.94e-02

Pos logit contributions
 pose+0.023
 surely+0.022
 drib+0.022
 jugg+0.021
 som+0.021
Neg logit contributions
lessness-0.024
pt-0.023
 crystals-0.020
aky-0.020
 twins-0.019
Token his
Token ID465
Feature activation+5.44e-02

Pos logit contributions
lessness+0.054
pt+0.048
aky+0.042
iness+0.042
itive+0.040
Neg logit contributions
 drib-0.045
 seventeen-0.044
 som-0.043
 jugg-0.043
 sixty-0.043
Token parent
Token ID2560
Feature activation+1.14e-01

Pos logit contributions
lessness+0.068
pt+0.061
aky+0.048
ruck+0.048
 crystals+0.045
Neg logit contributions
 pose-0.121
 surely-0.118
 drib-0.116
 score-0.113
-0.112
Token.
Token ID13
Feature activation+3.54e-02

Pos logit contributions
lessness+0.194
pt+0.170
ovable+0.153
 crystals+0.144
 treasures+0.142
Neg logit contributions
 pose-0.190
 drib-0.184
 score-0.176
 jugg-0.173
 surely-0.171
Token Every
Token ID3887
Feature activation+1.18e-02

Pos logit contributions
lessness+0.058
pt+0.056
 twins+0.049
 everywhere+0.048
 crystals+0.048
Neg logit contributions
 pose-0.063
 surely-0.063
olog-0.060
 drib-0.059
 seventeen-0.059
Token day
Token ID1110
Feature activation-2.45e-02

Pos logit contributions
lessness+0.018
pt+0.018
 twins+0.015
 crystals+0.015
 everywhere+0.015
Neg logit contributions
 pose-0.022
 surely-0.022
olog-0.021
 drib-0.021
 seventeen-0.021
Token,
Token ID11
Feature activation-1.77e-03

Pos logit contributions
 pose+0.051
 surely+0.051
olog+0.049
 drib+0.048
 seventeen+0.048
Neg logit contributions
lessness-0.033
pt-0.032
 twins-0.028
 everywhere-0.028
 crystals-0.027
Token the
Token ID262
Feature activation-3.45e-02

Pos logit contributions
 pose+0.003
 surely+0.003
olog+0.003
 drib+0.003
 seventeen+0.003
Neg logit contributions
lessness-0.003
pt-0.003
 twins-0.002
 everywhere-0.002
 crystals-0.002
Token they
Token ID484
Feature activation-7.62e-04

Pos logit contributions
 pose+0.080
 surely+0.080
olog+0.079
 seventeen+0.077
 drib+0.077
Neg logit contributions
lessness-0.055
pt-0.054
 twins-0.048
 everywhere-0.046
 crystals-0.045
Token c
Token ID269
Feature activation-1.86e-02

Pos logit contributions
 pose+0.001
 surely+0.001
olog+0.001
 drib+0.001
 seventeen+0.001
Neg logit contributions
lessness-0.001
pt-0.001
 twins-0.001
 everywhere-0.001
 crystals-0.001
Tokenuddled
Token ID32745
Feature activation+1.44e-02

Pos logit contributions
 pose+0.029
 surely+0.029
olog+0.028
 drib+0.027
 seventeen+0.027
Neg logit contributions
lessness-0.034
pt-0.033
 twins-0.030
 everywhere-0.029
 crystals-0.029
Token with
Token ID351
Feature activation+2.67e-02

Pos logit contributions
lessness+0.016
pt+0.016
 everywhere+0.014
 twins+0.013
 crystals+0.013
Neg logit contributions
 pose-0.033
 surely-0.033
olog-0.032
 seventeen-0.032
 drib-0.031
Token their
Token ID511
Feature activation+3.92e-02

Pos logit contributions
lessness+0.049
aky+0.044
pt+0.044
 crystals+0.039
ruck+0.039
Neg logit contributions
 surely-0.042
 pose-0.041
 drib-0.040
 som-0.040
 score-0.039
Token parent
Token ID2560
Feature activation+1.13e-01

Pos logit contributions
lessness+0.048
pt+0.045
aky+0.038
 twins+0.035
 crystals+0.034
Neg logit contributions
 pose-0.087
 surely-0.086
 drib-0.084
olog-0.082
 score-0.081
Token on
Token ID319
Feature activation-2.99e-02

Pos logit contributions
lessness+0.156
pt+0.147
 crystals+0.127
aky+0.123
 twins+0.115
Neg logit contributions
 pose-0.235
 surely-0.228
 drib-0.220
 score-0.217
olog-0.212
Token the
Token ID262
Feature activation+1.61e-02

Pos logit contributions
 surely+0.057
 pose+0.057
olog+0.056
 seventeen+0.055
 drib+0.053
Neg logit contributions
lessness-0.044
pt-0.044
 everywhere-0.039
 twins-0.038
 crystals-0.038
Token couch
Token ID18507
Feature activation-1.04e-03

Pos logit contributions
lessness+0.033
pt+0.033
aky+0.029
 twins+0.028
 crystals+0.028
Neg logit contributions
 pose-0.023
 surely-0.022
 drib-0.021
olog-0.020
 seventeen-0.020
Token and
Token ID290
Feature activation-8.05e-03

Pos logit contributions
 pose+0.002
 surely+0.002
olog+0.002
 drib+0.002
 som+0.002
Neg logit contributions
lessness-0.001
pt-0.001
 crystals-0.001
 twins-0.001
aky-0.001
Token listened
Token ID16399
Feature activation+1.27e-02

Pos logit contributions
 surely+0.015
 pose+0.015
olog+0.015
 seventeen+0.015
+0.014
Neg logit contributions
pt-0.012
lessness-0.012
 twins-0.011
 everywhere-0.011
 crystals-0.010
Tokenmy
Token ID1820
Feature activation+4.70e-02

Pos logit contributions
lessness+0.027
pt+0.022
 crystals+0.020
 twins+0.019
 everywhere+0.018
Neg logit contributions
 surely-0.044
 drib-0.044
 pose-0.044
olog-0.042
 score-0.042
Token enjoyed
Token ID8359
Feature activation+3.80e-02

Pos logit contributions
lessness+0.070
pt+0.065
 twins+0.057
 crystals+0.056
aky+0.056
Neg logit contributions
 pose-0.091
 surely-0.091
olog-0.087
 drib-0.087
 som-0.085
Token travelling
Token ID16574
Feature activation+2.65e-02

Pos logit contributions
lessness+0.049
pt+0.047
 twins+0.041
 everywhere+0.041
 crystals+0.040
Neg logit contributions
 surely-0.081
 pose-0.080
olog-0.079
 seventeen-0.077
 drib-0.075
Token with
Token ID351
Feature activation+4.42e-02

Pos logit contributions
lessness+0.035
pt+0.034
aky+0.028
 twins+0.028
 crystals+0.028
Neg logit contributions
 pose-0.055
 surely-0.055
 drib-0.053
olog-0.052
 som-0.052
Token his
Token ID465
Feature activation+6.79e-02

Pos logit contributions
lessness+0.087
pt+0.077
aky+0.069
ovable+0.066
iness+0.062
Neg logit contributions
 drib-0.069
 som-0.068
 jugg-0.067
 pose-0.067
 score-0.065
Token family
Token ID1641
Feature activation+1.12e-01

Pos logit contributions
lessness+0.115
pt+0.107
aky+0.095
 crystals+0.085
ruck+0.084
Neg logit contributions
 surely-0.123
 pose-0.122
 drib-0.120
 score-0.117
tons-0.113
Token.
Token ID13
Feature activation+4.43e-02

Pos logit contributions
lessness+0.158
pt+0.150
aky+0.125
 crystals+0.123
 twins+0.117
Neg logit contributions
 pose-0.228
 surely-0.228
 drib-0.226
 score-0.216
olog-0.213
Token They
Token ID1119
Feature activation+1.41e-02

Pos logit contributions
lessness+0.060
pt+0.057
 twins+0.051
 everywhere+0.049
 crystals+0.048
Neg logit contributions
 pose-0.092
 surely-0.091
olog-0.088
 drib-0.085
 seventeen-0.085
Token had
Token ID550
Feature activation+2.72e-02

Pos logit contributions
lessness+0.019
pt+0.017
 twins+0.015
 crystals+0.015
aky+0.014
Neg logit contributions
 surely-0.030
 drib-0.029
 pose-0.028
olog-0.028
 jugg-0.028
Token been
Token ID587
Feature activation+2.46e-02

Pos logit contributions
lessness+0.041
pt+0.038
 crystals+0.032
iness+0.030
aky+0.030
Neg logit contributions
 som-0.054
 surely-0.054
 pose-0.053
 drib-0.051
 jugg-0.051
Token on
Token ID319
Feature activation-4.08e-02

Pos logit contributions
lessness+0.030
pt+0.029
 twins+0.023
 crystals+0.023
 everywhere+0.023
Neg logit contributions
 surely-0.054
 pose-0.054
olog-0.052
 drib-0.052
 seventeen-0.051
Token table
Token ID3084
Feature activation+4.53e-04

Pos logit contributions
 pose+0.000
 drib+0.000
 surely+0.000
olog+0.000
 som+0.000
Neg logit contributions
lessness-0.000
pt-0.000
aky-0.000
 crystals-0.000
 twins-0.000
Token eating
Token ID6600
Feature activation+3.13e-02

Pos logit contributions
lessness+0.001
pt+0.001
aky+0.001
 crystals+0.001
 twins+0.001
Neg logit contributions
 pose-0.001
 surely-0.001
 drib-0.001
 som-0.001
zes-0.001
Token dinner
Token ID8073
Feature activation+3.19e-02

Pos logit contributions
lessness+0.045
pt+0.042
aky+0.036
ovable+0.035
 twins+0.034
Neg logit contributions
 pose-0.062
 drib-0.061
 surely-0.061
 som-0.059
 jugg-0.059
Token with
Token ID351
Feature activation+3.73e-02

Pos logit contributions
lessness+0.032
pt+0.031
 twins+0.024
aky+0.024
 crystals+0.023
Neg logit contributions
 pose-0.077
 surely-0.076
olog-0.074
 drib-0.074
 som-0.072
Token his
Token ID465
Feature activation+6.36e-02

Pos logit contributions
lessness+0.069
pt+0.062
ovable+0.055
aky+0.054
iness+0.050
Neg logit contributions
 drib-0.064
 som-0.059
 jugg-0.059
 pose-0.058
 surely-0.057
Token family
Token ID1641
Feature activation+1.12e-01

Pos logit contributions
lessness+0.119
pt+0.115
aky+0.098
ruck+0.095
 crystals+0.091
Neg logit contributions
 pose-0.102
 drib-0.102
 surely-0.097
 score-0.096
zes-0.092
Token.
Token ID13
Feature activation+4.50e-02

Pos logit contributions
pt+0.166
lessness+0.165
aky+0.133
ovable+0.132
 crystals+0.130
Neg logit contributions
 drib-0.214
 pose-0.209
 surely-0.201
 score-0.198
 jugg-0.196
Token He
Token ID679
Feature activation+4.49e-03

Pos logit contributions
lessness+0.067
pt+0.064
 twins+0.059
 everywhere+0.057
 crystals+0.054
Neg logit contributions
 pose-0.087
 surely-0.086
olog-0.083
 seventeen-0.080
zes-0.079
Token noticed
Token ID6810
Feature activation+4.46e-02

Pos logit contributions
lessness+0.006
pt+0.006
 twins+0.005
 everywhere+0.005
 crystals+0.005
Neg logit contributions
 pose-0.009
 surely-0.009
olog-0.009
 seventeen-0.009
 drib-0.009
Token the
Token ID262
Feature activation-1.39e-02

Pos logit contributions
lessness+0.073
pt+0.065
aky+0.060
ovable+0.056
 twins+0.053
Neg logit contributions
 jugg-0.079
 drib-0.079
 pose-0.078
 som-0.077
 surely-0.077
Token light
Token ID1657
Feature activation-2.18e-02

Pos logit contributions
 pose+0.024
 drib+0.023
 surely+0.023
 jugg+0.022
 score+0.022
Neg logit contributions
pt-0.025
lessness-0.024
aky-0.020
ovable-0.020
 who-0.017
Token to
Token ID284
Feature activation-2.91e-02

Pos logit contributions
lessness+0.076
pt+0.072
aky+0.062
 twins+0.059
 crystals+0.058
Neg logit contributions
 pose-0.108
 surely-0.105
 drib-0.103
olog-0.101
 som-0.101
Token play
Token ID711
Feature activation+1.60e-02

Pos logit contributions
 surely+0.046
 pose+0.046
olog+0.045
 seventeen+0.043
 drib+0.043
Neg logit contributions
lessness-0.053
pt-0.052
 twins-0.046
 crystals-0.045
 everywhere-0.045
Token tricks
Token ID15910
Feature activation+1.72e-02

Pos logit contributions
lessness+0.021
pt+0.020
 twins+0.016
 crystals+0.016
aky+0.016
Neg logit contributions
 pose-0.035
 surely-0.034
olog-0.033
 drib-0.033
 som-0.032
Token on
Token ID319
Feature activation-1.42e-02

Pos logit contributions
lessness+0.021
pt+0.021
 crystals+0.018
aky+0.018
 twins+0.017
Neg logit contributions
 pose-0.037
 surely-0.036
 drib-0.034
olog-0.034
 som-0.033
Token her
Token ID607
Feature activation+8.81e-02

Pos logit contributions
 surely+0.036
olog+0.035
 pose+0.035
 seventeen+0.035
 drib+0.034
Neg logit contributions
lessness-0.013
pt-0.012
 twins-0.011
 everywhere-0.010
 crystals-0.010
Token friends
Token ID2460
Feature activation+1.12e-01

Pos logit contributions
lessness+0.120
pt+0.117
aky+0.093
 crystals+0.091
ruck+0.089
Neg logit contributions
 pose-0.190
 drib-0.178
zes-0.173
 score-0.173
 surely-0.170
Token.
Token ID13
Feature activation+3.78e-02

Pos logit contributions
lessness+0.153
pt+0.132
 crystals+0.119
ruck+0.112
 twins+0.109
Neg logit contributions
 pose-0.246
 drib-0.235
 surely-0.225
 score-0.223
-0.221
Token One
Token ID1881
Feature activation+7.34e-03

Pos logit contributions
lessness+0.063
pt+0.061
 twins+0.055
 everywhere+0.054
 crystals+0.053
Neg logit contributions
 surely-0.067
 pose-0.067
olog-0.064
 seventeen-0.062
 drib-0.062
Token day
Token ID1110
Feature activation-2.04e-02

Pos logit contributions
pt+0.013
lessness+0.013
 crystals+0.011
 twins+0.011
 everywhere+0.011
Neg logit contributions
 pose-0.013
 surely-0.012
 drib-0.012
 score-0.011
 jugg-0.011
Token,
Token ID11
Feature activation-1.11e-02

Pos logit contributions
 surely+0.043
 pose+0.042
olog+0.041
 seventeen+0.040
 drib+0.040
Neg logit contributions
lessness-0.028
pt-0.027
 twins-0.023
 everywhere-0.023
 crystals-0.022
Token Lily
Token ID20037
Feature activation-1.18e-02

Pos logit contributions
 pose+0.021
 surely+0.021
olog+0.020
 drib+0.020
 som+0.019
Neg logit contributions
lessness-0.017
pt-0.017
aky-0.014
 twins-0.014
 crystals-0.014
Token can
Token ID460
Feature activation-5.94e-02

Pos logit contributions
 surely+0.052
 pose+0.051
olog+0.049
 drib+0.049
 jugg+0.048
Neg logit contributions
lessness-0.047
pt-0.041
aky-0.038
 crystals-0.038
 twins-0.037
Token't
Token ID470
Feature activation-1.90e-02

Pos logit contributions
olog+0.100
 pose+0.099
 surely+0.098
 seventeen+0.098
Too+0.095
Neg logit contributions
pt-0.098
lessness-0.090
 twins-0.089
 everywhere-0.083
 crystals-0.083
Token.
Token ID13
Feature activation-2.16e-02

Pos logit contributions
 pose+0.034
 surely+0.034
olog+0.032
 drib+0.032
 seventeen+0.031
Neg logit contributions
lessness-0.031
pt-0.030
 twins-0.027
 everywhere-0.026
 crystals-0.026
Token The
Token ID383
Feature activation-4.76e-02

Pos logit contributions
 pose+0.034
 surely+0.034
 drib+0.034
 som+0.032
olog+0.032
Neg logit contributions
lessness-0.039
pt-0.038
 twins-0.035
 crystals-0.033
 who-0.032
Token ze
Token ID41271
Feature activation-5.24e-02

Pos logit contributions
 pose+0.094
 surely+0.092
 seventeen+0.087
olog+0.087
 drib+0.087
Neg logit contributions
lessness-0.070
pt-0.068
aky-0.056
 everywhere-0.056
 twins-0.056
Tokenbr
Token ID1671
Feature activation-1.53e-01

Pos logit contributions
 surely+0.070
 pose+0.067
olog+0.066
 drib+0.064
 seventeen+0.061
Neg logit contributions
lessness-0.110
pt-0.107
 twins-0.098
 everywhere-0.097
 crystals-0.095
Tokenas
Token ID292
Feature activation-1.29e-02

Pos logit contributions
 pose+0.341
 surely+0.340
olog+0.329
 drib+0.323
 seventeen+0.323
Neg logit contributions
lessness-0.182
pt-0.173
 twins-0.145
 everywhere-0.140
 crystals-0.140
Token have
Token ID423
Feature activation-2.61e-03

Pos logit contributions
 surely+0.027
 pose+0.027
olog+0.026
 drib+0.026
 score+0.025
Neg logit contributions
lessness-0.017
pt-0.016
 twins-0.014
 crystals-0.014
aky-0.013
Token their
Token ID511
Feature activation+1.58e-02

Pos logit contributions
 pose+0.005
 surely+0.005
 seventeen+0.005
 som+0.005
 drib+0.005
Neg logit contributions
lessness-0.004
pt-0.004
 twins-0.003
aky-0.003
 crystals-0.003
Token own
Token ID898
Feature activation+4.92e-02

Pos logit contributions
lessness+0.026
pt+0.025
 twins+0.022
aky+0.021
 crystals+0.021
Neg logit contributions
 pose-0.028
 surely-0.028
olog-0.026
 drib-0.026
 seventeen-0.026
Token food
Token ID2057
Feature activation+1.51e-02

Pos logit contributions
lessness+0.078
pt+0.073
 twins+0.064
aky+0.063
 crystals+0.062
Neg logit contributions
 pose-0.093
 surely-0.089
olog-0.086
 drib-0.086
 som-0.085
Token could
Token ID714
Feature activation-3.06e-02

Pos logit contributions
pt+0.014
 everywhere+0.012
lessness+0.012
 were+0.011
 who+0.011
Neg logit contributions
 seventeen-0.036
 pose-0.036
olog-0.035
 surely-0.034
-0.034
Token see
Token ID766
Feature activation+4.02e-02

Pos logit contributions
olog+0.057
 pose+0.056
+0.055
 seventeen+0.055
 surely+0.054
Neg logit contributions
lessness-0.045
pt-0.045
 everywhere-0.041
 of-0.040
 twins-0.038
Token the
Token ID262
Feature activation-5.87e-03

Pos logit contributions
lessness+0.069
pt+0.066
aky+0.059
 twins+0.054
 crystals+0.052
Neg logit contributions
 surely-0.067
 pose-0.066
 seventeen-0.065
 som-0.065
 drib-0.063
Token biggest
Token ID4094
Feature activation+1.03e-03

Pos logit contributions
 surely+0.011
 pose+0.011
olog+0.011
 seventeen+0.010
 drib+0.010
Neg logit contributions
lessness-0.009
pt-0.009
 twins-0.008
aky-0.007
 crystals-0.007
Token,
Token ID11
Feature activation-9.75e-03

Pos logit contributions
lessness+0.002
pt+0.002
 twins+0.001
aky+0.001
 crystals+0.001
Neg logit contributions
 pose-0.002
 surely-0.002
olog-0.002
 seventeen-0.002
 drib-0.002
Token ju
Token ID7544
Feature activation-1.49e-01

Pos logit contributions
 surely+0.018
 pose+0.018
 seventeen+0.018
olog+0.017
 som+0.017
Neg logit contributions
lessness-0.015
pt-0.015
 twins-0.012
aky-0.012
 everywhere-0.012
Tokenic
Token ID291
Feature activation-2.60e-02

Pos logit contributions
 pose+0.256
 surely+0.254
olog+0.245
 drib+0.235
 seventeen+0.231
Neg logit contributions
lessness-0.247
pt-0.245
aky-0.221
 twins-0.220
 crystals-0.216
Tokeniest
Token ID6386
Feature activation+6.73e-03

Pos logit contributions
 surely+0.067
 pose+0.067
olog+0.064
 seventeen+0.064
 drib+0.064
Neg logit contributions
lessness-0.021
pt-0.020
 twins-0.015
 everywhere-0.015
aky-0.014
Token cher
Token ID21382
Feature activation-9.91e-02

Pos logit contributions
lessness+0.012
pt+0.012
aky+0.010
 twins+0.010
 everywhere+0.010
Neg logit contributions
 pose-0.011
 surely-0.011
olog-0.010
 seventeen-0.010
 drib-0.010
Tokenries
Token ID1678
Feature activation-2.03e-02

Pos logit contributions
 pose+0.186
 surely+0.184
olog+0.178
 drib+0.174
 seventeen+0.172
Neg logit contributions
lessness-0.154
pt-0.149
 twins-0.131
 crystals-0.130
 everywhere-0.129
Token ever
Token ID1683
Feature activation-1.60e-02

Pos logit contributions
 surely+0.042
 pose+0.041
 drib+0.040
olog+0.040
 seventeen+0.039
Neg logit contributions
lessness-0.027
pt-0.026
 twins-0.022
aky-0.022
 crystals-0.021
Token The
Token ID383
Feature activation-3.85e-02

Pos logit contributions
lessness+0.011
pt+0.010
 twins+0.009
 everywhere+0.009
 crystals+0.009
Neg logit contributions
 pose-0.014
 surely-0.014
olog-0.013
 drib-0.013
 seventeen-0.013
Token eggs
Token ID9653
Feature activation-3.65e-02

Pos logit contributions
 pose+0.076
 surely+0.076
 seventeen+0.071
olog+0.071
 drib+0.070
Neg logit contributions
lessness-0.057
pt-0.055
 everywhere-0.045
aky-0.045
 crystals-0.044
Token were
Token ID547
Feature activation+7.90e-03

Pos logit contributions
 surely+0.077
 pose+0.074
 drib+0.071
zes+0.070
olog+0.069
Neg logit contributions
lessness-0.047
pt-0.043
 twins-0.040
aky-0.039
 crystals-0.037
Token blue
Token ID4171
Feature activation-3.46e-02

Pos logit contributions
lessness+0.011
pt+0.011
 twins+0.009
 crystals+0.009
aky+0.009
Neg logit contributions
 surely-0.016
 pose-0.015
 seventeen-0.015
 drib-0.015
 som-0.014
Token and
Token ID290
Feature activation-2.68e-02

Pos logit contributions
 pose+0.055
 seventeen+0.055
ro+0.053
 reaches+0.052
 surely+0.052
Neg logit contributions
lessness-0.059
pt-0.051
ovable-0.050
aky-0.049
taker-0.045
Token spe
Token ID693
Feature activation-1.49e-01

Pos logit contributions
 pose+0.057
 surely+0.056
olog+0.056
 drib+0.054
zes+0.052
Neg logit contributions
lessness-0.036
pt-0.034
 twins-0.030
 crystals-0.029
 everywhere-0.029
Tokenck
Token ID694
Feature activation+5.55e-03

Pos logit contributions
 surely+0.256
olog+0.247
 pose+0.247
 jugg+0.236
 instructor+0.225
Neg logit contributions
pt-0.256
aky-0.240
lessness-0.236
 crystals-0.217
 twins-0.214
Tokenled
Token ID992
Feature activation+5.08e-03

Pos logit contributions
lessness+0.007
pt+0.007
 twins+0.006
aky+0.006
 crystals+0.006
Neg logit contributions
 pose-0.010
 surely-0.010
 seventeen-0.010
 dod-0.010
 som-0.010
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
lessness+0.006
pt+0.006
aky+0.005
 twins+0.005
 crystals+0.004
Neg logit contributions
 surely-0.011
 pose-0.011
 drib-0.010
 seventeen-0.010
 som-0.010
Token They
Token ID1119
Feature activation-1.08e-02

Pos logit contributions
lessness+0.017
pt+0.016
 twins+0.014
 everywhere+0.014
 crystals+0.014
Neg logit contributions
 pose-0.021
 surely-0.021
olog-0.020
 drib-0.020
 seventeen-0.019
Token looked
Token ID3114
Feature activation+8.02e-03

Pos logit contributions
 surely+0.021
 pose+0.021
 drib+0.020
olog+0.020
 jugg+0.020
Neg logit contributions
lessness-0.016
pt-0.015
 twins-0.013
aky-0.013
 crystals-0.013
Token a
Token ID257
Feature activation-1.53e-02

Pos logit contributions
 surely+0.170
olog+0.168
 pose+0.166
 seventeen+0.165
 drib+0.163
Neg logit contributions
lessness-0.076
pt-0.074
 twins-0.067
 everywhere-0.064
 crystals-0.062
Token little
Token ID1310
Feature activation-2.11e-02

Pos logit contributions
 surely+0.034
olog+0.034
ven+0.033
 seventeen+0.032
 pose+0.032
Neg logit contributions
lessness-0.017
pt-0.017
 twins-0.016
 everywhere-0.014
aky-0.014
Token bit
Token ID1643
Feature activation-3.25e-02

Pos logit contributions
 surely+0.031
olog+0.029
 pose+0.029
 seventeen+0.028
 drib+0.028
Neg logit contributions
lessness-0.041
pt-0.040
 twins-0.037
 everywhere-0.036
 crystals-0.036
Token."
Token ID526
Feature activation-3.54e-02

Pos logit contributions
 pose+0.077
 surely+0.074
olog+0.072
 drib+0.071
 som+0.070
Neg logit contributions
lessness-0.035
pt-0.033
 crystals-0.027
 twins-0.026
 everywhere-0.026
Token Mom
Token ID11254
Feature activation+9.72e-03

Pos logit contributions
 pose+0.069
 surely+0.069
olog+0.067
 drib+0.065
 seventeen+0.065
Neg logit contributions
lessness-0.052
pt-0.050
 twins-0.043
 crystals-0.043
aky-0.043
Token squee
Token ID13170
Feature activation-1.49e-01

Pos logit contributions
 twins+0.014
pt+0.014
lessness+0.013
 everywhere+0.013
 who+0.012
Neg logit contributions
 surely-0.019
 som-0.018
olog-0.018
-0.018
 seventeen-0.017
Tokenzes
Token ID12271
Feature activation-1.36e-02

Pos logit contributions
 surely+0.142
Ready+0.115
 pose+0.114
olog+0.108
 seventeen+0.107
Neg logit contributions
pt-0.372
 twins-0.320
 who-0.311
 everywhere-0.306
lessness-0.302
Token some
Token ID617
Feature activation+3.42e-02

Pos logit contributions
 pose+0.028
 surely+0.028
olog+0.027
 drib+0.027
 seventeen+0.026
Neg logit contributions
lessness-0.019
pt-0.018
 twins-0.016
 everywhere-0.015
 crystals-0.015
Token shampoo
Token ID47259
Feature activation-2.19e-02

Pos logit contributions
lessness+0.042
pt+0.040
 twins+0.034
aky+0.033
 crystals+0.033
Neg logit contributions
 pose-0.075
 surely-0.074
 drib-0.071
 seventeen-0.070
olog-0.070
Token on
Token ID319
Feature activation-4.46e-02

Pos logit contributions
 pose+0.052
 surely+0.052
olog+0.050
 drib+0.049
 seventeen+0.049
Neg logit contributions
lessness-0.023
pt-0.021
 twins-0.017
aky-0.017
 crystals-0.017
Token her
Token ID607
Feature activation+5.04e-02

Pos logit contributions
olog+0.101
 pose+0.101
 surely+0.101
ven+0.095
 seventeen+0.095
Neg logit contributions
lessness-0.045
pt-0.045
 twins-0.045
 everywhere-0.042
 who-0.037
Token soon
Token ID2582
Feature activation+1.48e-02

Pos logit contributions
lessness+0.027
pt+0.026
 twins+0.022
 crystals+0.021
aky+0.021
Neg logit contributions
 surely-0.048
 pose-0.047
 drib-0.044
olog-0.044
 seventeen-0.044
Token nothing
Token ID2147
Feature activation+3.56e-02

Pos logit contributions
lessness+0.022
pt+0.021
 twins+0.018
aky+0.018
 crystals+0.018
Neg logit contributions
 surely-0.029
 pose-0.029
olog-0.027
 drib-0.027
 som-0.027
Token but
Token ID475
Feature activation-5.03e-04

Pos logit contributions
lessness+0.051
pt+0.049
 twins+0.046
aky+0.044
 crystals+0.044
Neg logit contributions
 pose-0.070
 surely-0.069
 som-0.065
olog-0.064
 drib-0.064
Token a
Token ID257
Feature activation-2.16e-03

Pos logit contributions
 pose+0.001
 surely+0.001
olog+0.001
 seventeen+0.001
 drib+0.001
Neg logit contributions
lessness-0.001
pt-0.001
 twins-0.001
 everywhere-0.001
 crystals-0.001
Token small
Token ID1402
Feature activation-3.36e-02

Pos logit contributions
 surely+0.005
 pose+0.005
olog+0.005
 drib+0.005
 som+0.005
Neg logit contributions
lessness-0.003
pt-0.002
 twins-0.002
 crystals-0.002
aky-0.002
Token spe
Token ID693
Feature activation-1.48e-01

Pos logit contributions
 pose+0.067
 surely+0.066
olog+0.062
 drib+0.061
 som+0.061
Neg logit contributions
lessness-0.049
pt-0.047
aky-0.040
 twins-0.040
 crystals-0.040
Tokenck
Token ID694
Feature activation-3.64e-03

Pos logit contributions
 pose+0.215
olog+0.212
 surely+0.210
+0.197
ven+0.196
Neg logit contributions
pt-0.286
lessness-0.285
aky-0.262
 everywhere-0.245
 of-0.236
Token in
Token ID287
Feature activation-4.35e-02

Pos logit contributions
 surely+0.007
 pose+0.007
 som+0.006
 score+0.006
+0.006
Neg logit contributions
lessness-0.005
pt-0.005
 twins-0.004
 crystals-0.004
aky-0.004
Token the
Token ID262
Feature activation+1.99e-02

Pos logit contributions
 surely+0.080
 pose+0.079
 drib+0.074
 som+0.074
olog+0.073
Neg logit contributions
lessness-0.071
pt-0.067
 twins-0.059
aky-0.059
 crystals-0.058
Token sky
Token ID6766
Feature activation-1.20e-02

Pos logit contributions
lessness+0.035
pt+0.034
aky+0.030
 twins+0.030
 crystals+0.029
Neg logit contributions
 surely-0.034
 pose-0.033
 seventeen-0.031
 score-0.030
 som-0.030
Token.
Token ID13
Feature activation+2.23e-02

Pos logit contributions
 pose+0.029
 surely+0.029
olog+0.028
 drib+0.027
 seventeen+0.027
Neg logit contributions
lessness-0.012
pt-0.011
 twins-0.009
 crystals-0.009
 everywhere-0.009
Token be
Token ID307
Feature activation-1.14e-02

Pos logit contributions
olog+0.125
 surely+0.113
+0.112
ven+0.109
 seventeen+0.108
Neg logit contributions
pt-0.117
 crystals-0.110
lessness-0.108
 of-0.103
 who-0.102
Token the
Token ID262
Feature activation-1.30e-02

Pos logit contributions
 pose+0.024
 surely+0.024
 som+0.023
 drib+0.023
 score+0.022
Neg logit contributions
lessness-0.015
pt-0.015
 twins-0.011
 everywhere-0.011
aky-0.011
Token rip
Token ID18344
Feature activation-2.21e-02

Pos logit contributions
 surely+0.026
 pose+0.025
 drib+0.024
 score+0.024
 som+0.024
Neg logit contributions
lessness-0.019
pt-0.019
aky-0.015
 twins-0.015
 everywhere-0.014
Tokenest
Token ID395
Feature activation+1.71e-02

Pos logit contributions
 pose+0.045
 surely+0.043
 som+0.041
 score+0.040
 drib+0.040
Neg logit contributions
lessness-0.031
pt-0.029
 twins-0.026
aky-0.025
ovable-0.023
Token,
Token ID11
Feature activation-1.30e-02

Pos logit contributions
lessness+0.029
pt+0.027
aky+0.025
ovable+0.023
 twins+0.023
Neg logit contributions
 pose-0.030
 som-0.028
 surely-0.028
 score-0.028
 drib-0.028
Token ju
Token ID7544
Feature activation-1.48e-01

Pos logit contributions
 surely+0.027
 pose+0.027
olog+0.026
 drib+0.026
 seventeen+0.026
Neg logit contributions
lessness-0.017
pt-0.017
 twins-0.014
aky-0.014
 everywhere-0.014
Tokenic
Token ID291
Feature activation-2.70e-02

Pos logit contributions
 surely+0.237
 pose+0.237
olog+0.228
 drib+0.220
 seventeen+0.218
Neg logit contributions
lessness-0.267
pt-0.262
 twins-0.233
 crystals-0.230
aky-0.230
Tokeniest
Token ID6386
Feature activation+6.17e-03

Pos logit contributions
 pose+0.065
 surely+0.064
 som+0.061
olog+0.061
 drib+0.061
Neg logit contributions
lessness-0.026
pt-0.025
 everywhere-0.020
 twins-0.020
 crystals-0.019
Token pear
Token ID25286
Feature activation-5.44e-02

Pos logit contributions
lessness+0.011
pt+0.010
aky+0.009
ovable+0.009
 twins+0.009
Neg logit contributions
 pose-0.010
 som-0.010
 surely-0.010
 drib-0.010
 score-0.010
Token in
Token ID287
Feature activation-3.85e-02

Pos logit contributions
 pose+0.113
 surely+0.109
 som+0.103
 drib+0.103
olog+0.102
Neg logit contributions
lessness-0.074
pt-0.071
 twins-0.062
aky-0.060
ruck-0.055
Token the
Token ID262
Feature activation+2.50e-02

Pos logit contributions
 surely+0.072
 pose+0.072
 drib+0.067
olog+0.067
 som+0.066
Neg logit contributions
lessness-0.061
pt-0.058
 twins-0.051
aky-0.050
 everywhere-0.049
Token The
Token ID383
Feature activation-3.40e-02

Pos logit contributions
lessness+0.022
pt+0.021
 twins+0.021
 everywhere+0.019
 of+0.019
Neg logit contributions
 surely-0.029
 pose-0.028
olog-0.026
zes-0.026
 seventeen-0.026
Token eggs
Token ID9653
Feature activation-3.23e-02

Pos logit contributions
 pose+0.065
 surely+0.064
 seventeen+0.060
olog+0.060
 som+0.059
Neg logit contributions
lessness-0.054
pt-0.051
aky-0.042
 everywhere-0.042
ovable-0.040
Token were
Token ID547
Feature activation+8.95e-03

Pos logit contributions
 surely+0.067
 pose+0.063
 drib+0.061
 jugg+0.061
zes+0.061
Neg logit contributions
lessness-0.043
pt-0.038
 twins-0.035
aky-0.035
ovable-0.033
Token blue
Token ID4171
Feature activation-3.13e-02

Pos logit contributions
lessness+0.012
pt+0.012
 twins+0.010
 crystals+0.009
aky+0.009
Neg logit contributions
 surely-0.019
 pose-0.018
 seventeen-0.018
 drib-0.017
olog-0.017
Token and
Token ID290
Feature activation-2.28e-02

Pos logit contributions
 seventeen+0.057
 pose+0.056
 surely+0.053
ro+0.052
 sixty+0.052
Neg logit contributions
lessness-0.050
pt-0.044
aky-0.041
ovable-0.040
 twins-0.037
Token spe
Token ID693
Feature activation-1.46e-01

Pos logit contributions
 pose+0.048
 surely+0.048
olog+0.048
 drib+0.046
zes+0.045
Neg logit contributions
lessness-0.030
pt-0.028
 twins-0.025
 crystals-0.025
 everywhere-0.024
Tokenck
Token ID694
Feature activation+1.83e-03

Pos logit contributions
 surely+0.241
 pose+0.234
olog+0.233
 jugg+0.217
 score+0.208
Neg logit contributions
pt-0.267
lessness-0.246
aky-0.239
 crystals-0.229
 everywhere-0.226
Tokenled
Token ID992
Feature activation+4.49e-03

Pos logit contributions
lessness+0.003
pt+0.002
 twins+0.002
aky+0.002
 company+0.002
Neg logit contributions
 pose-0.003
 surely-0.003
 seventeen-0.003
 som-0.003
 dod-0.003
Token.
Token ID13
Feature activation+1.89e-02

Pos logit contributions
lessness+0.006
pt+0.005
aky+0.005
 twins+0.005
ovable+0.004
Neg logit contributions
 surely-0.009
 pose-0.009
 som-0.009
 seventeen-0.009
 drib-0.009
Token L
Token ID406
Feature activation-1.54e-02

Pos logit contributions
lessness+0.029
pt+0.028
 twins+0.028
 everywhere+0.025
 of+0.025
Neg logit contributions
 surely-0.035
 pose-0.035
olog-0.033
zes-0.032
 drib-0.032
Tokenila
Token ID10102
Feature activation-1.11e-02

Pos logit contributions
 pose+0.025
 surely+0.025
olog+0.024
 drib+0.024
 seventeen+0.023
Neg logit contributions
lessness-0.027
pt-0.027
 twins-0.024
 everywhere-0.023
 crystals-0.023
Token to
Token ID284
Feature activation-1.47e-02

Pos logit contributions
 surely+0.037
 pose+0.036
olog+0.035
 drib+0.035
 seventeen+0.034
Neg logit contributions
lessness-0.019
pt-0.019
 twins-0.016
 crystals-0.015
aky-0.015
Token get
Token ID651
Feature activation+1.39e-02

Pos logit contributions
olog+0.027
 seventeen+0.026
Too+0.025
+0.025
 surely+0.025
Neg logit contributions
pt-0.021
lessness-0.021
 who-0.020
 everywhere-0.020
 twins-0.019
Token the
Token ID262
Feature activation+8.92e-03

Pos logit contributions
lessness+0.021
pt+0.019
aky+0.018
 twins+0.018
ovable+0.016
Neg logit contributions
 surely-0.027
 pose-0.025
 drib-0.025
 som-0.025
olog-0.024
Token biggest
Token ID4094
Feature activation+4.01e-03

Pos logit contributions
lessness+0.014
pt+0.013
 twins+0.011
aky+0.011
 crystals+0.010
Neg logit contributions
 surely-0.018
 pose-0.018
 drib-0.017
 score-0.016
olog-0.016
Token,
Token ID11
Feature activation-5.57e-03

Pos logit contributions
lessness+0.006
pt+0.006
aky+0.005
 twins+0.005
 everywhere+0.005
Neg logit contributions
 pose-0.008
 surely-0.008
 som-0.007
 drib-0.007
 score-0.007
Token ju
Token ID7544
Feature activation-1.46e-01

Pos logit contributions
 surely+0.011
 pose+0.011
 seventeen+0.010
 som+0.010
 drib+0.010
Neg logit contributions
pt-0.009
lessness-0.008
 twins-0.007
aky-0.007
 everywhere-0.007
Tokenic
Token ID291
Feature activation-2.48e-02

Pos logit contributions
 pose+0.246
 surely+0.244
olog+0.237
 drib+0.225
 seventeen+0.225
Neg logit contributions
lessness-0.246
pt-0.244
aky-0.220
 twins-0.217
 crystals-0.214
Tokeniest
Token ID6386
Feature activation+1.14e-02

Pos logit contributions
 pose+0.062
 surely+0.062
 som+0.059
 drib+0.059
olog+0.058
Neg logit contributions
pt-0.022
lessness-0.021
 everywhere-0.017
 twins-0.017
 crystals-0.015
Token carrots
Token ID34397
Feature activation-1.49e-02

Pos logit contributions
lessness+0.017
pt+0.016
 twins+0.013
aky+0.013
ovable+0.013
Neg logit contributions
 pose-0.022
 surely-0.022
 som-0.021
 drib-0.021
 score-0.021
Token.
Token ID13
Feature activation+2.36e-02

Pos logit contributions
 surely+0.031
 pose+0.031
 drib+0.029
olog+0.029
 score+0.029
Neg logit contributions
lessness-0.019
pt-0.018
 twins-0.017
aky-0.016
 crystals-0.015
Token
Token ID198
Feature activation+1.13e-02

Pos logit contributions
lessness+0.034
pt+0.032
 twins+0.029
 everywhere+0.028
aky+0.027
Neg logit contributions
 pose-0.047
 surely-0.046
olog-0.045
 seventeen-0.044
zes-0.044
Token scree
Token ID43465
Feature activation-3.35e-02

Pos logit contributions
 pose+0.073
 surely+0.073
 drib+0.069
olog+0.069
 seventeen+0.068
Neg logit contributions
lessness-0.037
pt-0.034
 twins-0.029
 crystals-0.029
aky-0.028
Tokenched
Token ID1740
Feature activation+1.20e-02

Pos logit contributions
 pose+0.053
 surely+0.053
olog+0.051
 drib+0.049
 seventeen+0.049
Neg logit contributions
lessness-0.061
pt-0.059
 twins-0.054
 crystals-0.053
 everywhere-0.052
Token.
Token ID13
Feature activation+1.69e-02

Pos logit contributions
lessness+0.016
pt+0.015
 twins+0.013
 everywhere+0.013
 crystals+0.013
Neg logit contributions
 pose-0.025
 surely-0.025
olog-0.024
 drib-0.024
 seventeen-0.023
Token The
Token ID383
Feature activation-3.58e-02

Pos logit contributions
lessness+0.024
pt+0.023
 twins+0.021
 everywhere+0.019
 crystals+0.019
Neg logit contributions
 pose-0.034
 surely-0.034
olog-0.033
 drib-0.032
zes-0.032
Token ze
Token ID41271
Feature activation-4.38e-02

Pos logit contributions
 surely+0.066
 pose+0.066
olog+0.064
 drib+0.062
 seventeen+0.062
Neg logit contributions
lessness-0.056
pt-0.054
 twins-0.048
 everywhere-0.046
 crystals-0.046
Tokenbr
Token ID1671
Feature activation-1.45e-01

Pos logit contributions
 surely+0.065
 pose+0.063
olog+0.062
 drib+0.060
zes+0.058
Neg logit contributions
lessness-0.086
pt-0.082
 twins-0.075
 everywhere-0.074
aky-0.073
Tokenas
Token ID292
Feature activation-3.11e-03

Pos logit contributions
 pose+0.350
 surely+0.345
olog+0.335
 seventeen+0.334
 drib+0.331
Neg logit contributions
lessness-0.147
pt-0.139
 twins-0.113
 crystals-0.112
 everywhere-0.108
Token ne
Token ID497
Feature activation-1.03e-01

Pos logit contributions
 surely+0.006
 pose+0.006
olog+0.005
 drib+0.005
 seventeen+0.005
Neg logit contributions
lessness-0.005
pt-0.005
 twins-0.004
 crystals-0.004
 everywhere-0.004
Tokenighed
Token ID12570
Feature activation+3.10e-03

Pos logit contributions
 surely+0.184
 salon+0.174
 subt+0.174
olog+0.173
 pose+0.170
Neg logit contributions
aky-0.175
pt-0.165
lessness-0.155
ady-0.139
ak-0.137
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
lessness+0.003
pt+0.003
 everywhere+0.002
 twins+0.002
aky+0.002
Neg logit contributions
 surely-0.008
 pose-0.007
olog-0.007
 drib-0.007
 seventeen-0.007
Token The
Token ID383
Feature activation-3.57e-02

Pos logit contributions
lessness+0.023
pt+0.021
 twins+0.020
 everywhere+0.018
 who+0.018
Neg logit contributions
 pose-0.033
 surely-0.033
olog-0.032
 drib-0.031
zes-0.031
Token She
Token ID1375
Feature activation-1.27e-02

Pos logit contributions
lessness+0.002
pt+0.002
 twins+0.002
 everywhere+0.002
 crystals+0.002
Neg logit contributions
 pose-0.004
 surely-0.004
olog-0.004
 seventeen-0.004
 drib-0.004
Token said
Token ID531
Feature activation+3.13e-02

Pos logit contributions
 surely+0.025
olog+0.025
 pose+0.025
 seventeen+0.025
+0.024
Neg logit contributions
pt-0.019
lessness-0.017
 who-0.016
aky-0.016
 twins-0.015
Token,
Token ID11
Feature activation-3.96e-02

Pos logit contributions
lessness+0.044
pt+0.043
 twins+0.037
 everywhere+0.036
 crystals+0.036
Neg logit contributions
 surely-0.063
 pose-0.063
olog-0.061
 drib-0.060
 seventeen-0.059
Token "
Token ID366
Feature activation-5.44e-02

Pos logit contributions
 pose+0.073
 surely+0.071
olog+0.071
 drib+0.068
 jugg+0.067
Neg logit contributions
lessness-0.065
pt-0.061
 everywhere-0.056
 twins-0.055
 who-0.054
TokenA
Token ID32
Feature activation-8.85e-03

Pos logit contributions
olog+0.093
 surely+0.093
 pose+0.092
 som+0.090
 drib+0.088
Neg logit contributions
lessness-0.082
pt-0.081
aky-0.077
 twins-0.072
 who-0.071
Tokenbr
Token ID1671
Feature activation-1.44e-01

Pos logit contributions
 surely+0.016
olog+0.015
 drib+0.015
 seventeen+0.015
 salon+0.014
Neg logit contributions
lessness-0.014
pt-0.014
 crystals-0.012
 of-0.012
 twins-0.012
Tokenac
Token ID330
Feature activation-5.60e-02

Pos logit contributions
 pose+0.279
 surely+0.278
olog+0.268
 drib+0.261
 seventeen+0.261
Neg logit contributions
lessness-0.216
pt-0.207
 twins-0.183
 crystals-0.179
 everywhere-0.178
Tokenad
Token ID324
Feature activation-2.15e-03

Pos logit contributions
 surely+0.107
 pose+0.105
olog+0.103
 drib+0.101
+0.098
Neg logit contributions
lessness-0.086
pt-0.085
 twins-0.074
aky-0.074
 crystals-0.071
Tokenab
Token ID397
Feature activation-1.08e-01

Pos logit contributions
 surely+0.003
 pose+0.003
olog+0.003
 drib+0.003
 seventeen+0.003
Neg logit contributions
lessness-0.004
pt-0.004
 twins-0.003
aky-0.003
 everywhere-0.003
Tokenra
Token ID430
Feature activation-3.34e-02

Pos logit contributions
 squats+0.133
 subt+0.132
 drib+0.131
 pose+0.129
 sidelines+0.128
Neg logit contributions
pt-0.224
lessness-0.203
 of-0.194
 twins-0.187
 crystals-0.186
Token!"
Token ID2474
Feature activation-3.89e-02

Pos logit contributions
 surely+0.072
 pose+0.071
olog+0.069
 drib+0.069
 seventeen+0.068
Neg logit contributions
pt-0.042
lessness-0.039
 everywhere-0.033
 of-0.032
 twins-0.032
Token he
Token ID339
Feature activation+1.43e-02

Pos logit contributions
 surely+0.072
 pose+0.072
 drib+0.069
olog+0.068
 som+0.067
Neg logit contributions
lessness-0.062
pt-0.060
 twins-0.053
aky-0.052
 crystals-0.052
Token found
Token ID1043
Feature activation+4.62e-02

Pos logit contributions
lessness+0.019
pt+0.019
 twins+0.016
 everywhere+0.016
 crystals+0.016
Neg logit contributions
 pose-0.029
 surely-0.029
olog-0.028
 seventeen-0.028
 drib-0.027
Token the
Token ID262
Feature activation-3.10e-03

Pos logit contributions
lessness+0.073
pt+0.069
aky+0.063
 twins+0.057
ovable+0.054
Neg logit contributions
 surely-0.086
 som-0.081
 drib-0.080
 pose-0.079
 seventeen-0.077
Token biggest
Token ID4094
Feature activation+9.07e-03

Pos logit contributions
 surely+0.006
 pose+0.006
 score+0.006
 drib+0.006
+0.005
Neg logit contributions
pt-0.005
lessness-0.005
aky-0.004
ovable-0.004
 twins-0.004
Token,
Token ID11
Feature activation-7.07e-03

Pos logit contributions
lessness+0.015
pt+0.014
aky+0.012
 twins+0.012
ovable+0.012
Neg logit contributions
 surely-0.017
 pose-0.016
 score-0.016
 drib-0.016
 seventeen-0.015
Token ju
Token ID7544
Feature activation-1.44e-01

Pos logit contributions
 surely+0.014
 pose+0.013
 seventeen+0.012
 som+0.012
 drib+0.012
Neg logit contributions
pt-0.011
lessness-0.011
 twins-0.009
aky-0.009
 everywhere-0.009
Tokenic
Token ID291
Feature activation-2.35e-02

Pos logit contributions
 pose+0.234
 surely+0.233
olog+0.224
 drib+0.215
 seventeen+0.215
Neg logit contributions
lessness-0.256
pt-0.251
 twins-0.225
aky-0.223
 crystals-0.220
Tokeniest
Token ID6386
Feature activation+8.39e-03

Pos logit contributions
 surely+0.057
 pose+0.056
 jugg+0.054
 drib+0.054
 som+0.053
Neg logit contributions
pt-0.022
lessness-0.021
 everywhere-0.018
 twins-0.017
aky-0.016
Token squash
Token ID34613
Feature activation-2.75e-02

Pos logit contributions
lessness+0.014
pt+0.014
aky+0.012
ovable+0.012
 twins+0.011
Neg logit contributions
 surely-0.015
 pose-0.014
 score-0.014
 drib-0.014
-0.014
Token he
Token ID339
Feature activation+2.07e-02

Pos logit contributions
 pose+0.056
 surely+0.056
 drib+0.054
 jugg+0.053
 som+0.053
Neg logit contributions
lessness-0.037
pt-0.036
 twins-0.031
aky-0.031
 crystals-0.028
Token had
Token ID550
Feature activation+4.55e-02

Pos logit contributions
lessness+0.027
pt+0.026
 twins+0.022
 crystals+0.022
 everywhere+0.021
Neg logit contributions
 surely-0.044
 pose-0.044
olog-0.042
 drib-0.042
 seventeen-0.041
Token."
Token ID526
Feature activation-3.39e-02

Pos logit contributions
 surely+0.138
 pose+0.138
olog+0.134
 drib+0.131
 seventeen+0.131
Neg logit contributions
lessness-0.073
pt-0.070
 twins-0.059
 everywhere-0.057
 crystals-0.056
Token
Token ID198
Feature activation-3.62e-02

Pos logit contributions
 pose+0.061
 surely+0.061
olog+0.058
 drib+0.058
 som+0.056
Neg logit contributions
lessness-0.054
pt-0.053
 crystals-0.046
aky-0.046
 everywhere-0.044
Token
Token ID198
Feature activation-4.32e-02

Pos logit contributions
 pose+0.066
 surely+0.065
olog+0.063
 drib+0.062
 som+0.061
Neg logit contributions
lessness-0.057
pt-0.055
 crystals-0.048
aky-0.047
 twins-0.046
TokenL
Token ID43
Feature activation-3.89e-02

Pos logit contributions
 surely+0.081
 pose+0.078
olog+0.076
 seventeen+0.076
 drib+0.075
Neg logit contributions
lessness-0.070
pt-0.065
 twins-0.062
aky-0.062
 who-0.056
Tokenily
Token ID813
Feature activation-2.46e-02

Pos logit contributions
 pose+0.094
 surely+0.090
olog+0.087
 drib+0.087
 score+0.086
Neg logit contributions
lessness-0.039
pt-0.037
 everywhere-0.033
 crystals-0.030
 twins-0.028
Token squee
Token ID13170
Feature activation-1.44e-01

Pos logit contributions
 surely+0.047
olog+0.043
 seventeen+0.043
+0.042
 avail+0.041
Neg logit contributions
pt-0.036
 twins-0.036
 who-0.033
 everywhere-0.033
 children-0.030
Tokenzes
Token ID12271
Feature activation-3.17e-03

Pos logit contributions
 surely+0.136
Ready+0.114
olog+0.113
 seventeen+0.105
 salon+0.100
Neg logit contributions
pt-0.366
 twins-0.314
 everywhere-0.297
aky-0.295
 toys-0.293
Token his
Token ID465
Feature activation+2.74e-02

Pos logit contributions
 surely+0.006
 pose+0.006
olog+0.006
 seventeen+0.006
 som+0.006
Neg logit contributions
lessness-0.004
pt-0.004
 everywhere-0.004
 twins-0.004
 crystals-0.004
Token hand
Token ID1021
Feature activation+1.06e-02

Pos logit contributions
lessness+0.042
pt+0.041
 twins+0.037
 crystals+0.035
 everywhere+0.035
Neg logit contributions
 surely-0.051
olog-0.051
 pose-0.050
 seventeen-0.048
-0.047
Token.
Token ID13
Feature activation+4.14e-03

Pos logit contributions
lessness+0.011
pt+0.010
 twins+0.008
 everywhere+0.008
 crystals+0.008
Neg logit contributions
 pose-0.025
 surely-0.025
olog-0.025
 drib-0.024
 seventeen-0.024
Token "
Token ID366
Feature activation-4.27e-02

Pos logit contributions
lessness+0.005
pt+0.005
 twins+0.004
 everywhere+0.004
 crystals+0.004
Neg logit contributions
 surely-0.009
 pose-0.009
olog-0.009
 seventeen-0.009
 drib-0.009
Token said
Token ID531
Feature activation+3.17e-02

Pos logit contributions
 pose+0.014
 surely+0.014
olog+0.014
 drib+0.014
 seventeen+0.014
Neg logit contributions
lessness-0.009
pt-0.009
 twins-0.008
 everywhere-0.008
 crystals-0.008
Token together
Token ID1978
Feature activation-1.88e-02

Pos logit contributions
lessness+0.047
pt+0.046
 twins+0.040
 everywhere+0.039
 crystals+0.038
Neg logit contributions
 surely-0.062
 pose-0.062
olog-0.059
 drib-0.058
 seventeen-0.058
Token,
Token ID11
Feature activation-2.84e-02

Pos logit contributions
 pose+0.038
 surely+0.038
olog+0.036
 drib+0.036
 jugg+0.035
Neg logit contributions
lessness-0.026
pt-0.024
aky-0.023
 crystals-0.022
 twins-0.021
Token "
Token ID366
Feature activation-4.70e-02

Pos logit contributions
 pose+0.059
 surely+0.057
olog+0.055
 som+0.054
 score+0.054
Neg logit contributions
lessness-0.038
 who-0.036
 twins-0.036
 everywhere-0.034
pt-0.032
TokenA
Token ID32
Feature activation-5.65e-03

Pos logit contributions
 som+0.092
 pose+0.091
olog+0.090
 surely+0.090
 drib+0.089
Neg logit contributions
lessness-0.062
pt-0.057
aky-0.055
 who-0.055
 twins-0.054
Tokenbr
Token ID1671
Feature activation-1.44e-01

Pos logit contributions
 surely+0.011
 drib+0.010
olog+0.010
 seventeen+0.010
zes+0.010
Neg logit contributions
lessness-0.008
pt-0.007
 twins-0.007
 of-0.007
 who-0.007
Tokenac
Token ID330
Feature activation-4.53e-02

Pos logit contributions
 surely+0.295
 pose+0.294
olog+0.286
 drib+0.280
 seventeen+0.277
Neg logit contributions
lessness-0.197
pt-0.189
 twins-0.163
aky-0.159
 everywhere-0.158
Tokenad
Token ID324
Feature activation+4.04e-03

Pos logit contributions
 surely+0.082
 pose+0.082
olog+0.079
 drib+0.078
 som+0.076
Neg logit contributions
lessness-0.073
pt-0.072
 twins-0.064
aky-0.062
 crystals-0.061
Tokenab
Token ID397
Feature activation-9.79e-02

Pos logit contributions
lessness+0.007
pt+0.007
 twins+0.006
aky+0.006
 everywhere+0.006
Neg logit contributions
 surely-0.006
olog-0.006
 pose-0.006
 drib-0.006
 seventeen-0.006
Tokenra
Token ID430
Feature activation-2.71e-02

Pos logit contributions
 drib+0.127
 squats+0.125
 sidelines+0.121
 subt+0.120
 jugg+0.118
Neg logit contributions
pt-0.192
lessness-0.173
 of-0.170
 twins-0.166
 who-0.165
Token!"
Token ID2474
Feature activation-3.92e-02

Pos logit contributions
 surely+0.057
 pose+0.056
olog+0.055
 drib+0.054
 seventeen+0.053
Neg logit contributions
pt-0.036
lessness-0.034
 everywhere-0.029
 twins-0.029
aky-0.028
Token She
Token ID1375
Feature activation-6.54e-03

Pos logit contributions
lessness+0.038
pt+0.036
 twins+0.033
 everywhere+0.033
 crystals+0.029
Neg logit contributions
 surely-0.075
 pose-0.075
 seventeen-0.072
olog-0.072
 jugg-0.071
Token found
Token ID1043
Feature activation+3.73e-02

Pos logit contributions
 seventeen+0.016
 pose+0.016
 surely+0.016
olog+0.015
ident+0.014
Neg logit contributions
pt-0.006
 everywhere-0.006
 who-0.006
lessness-0.006
 twins-0.005
Token the
Token ID262
Feature activation-5.26e-03

Pos logit contributions
lessness+0.063
pt+0.056
aky+0.050
iness+0.046
ruck+0.046
Neg logit contributions
 surely-0.066
 seventeen-0.064
 sixty-0.063
 pose-0.063
 thirty-0.061
Token biggest
Token ID4094
Feature activation-8.25e-04

Pos logit contributions
 surely+0.010
 pose+0.010
 drib+0.009
 seventeen+0.009
 som+0.009
Neg logit contributions
lessness-0.008
pt-0.008
 twins-0.007
aky-0.007
 crystals-0.006
Token and
Token ID290
Feature activation-1.59e-02

Pos logit contributions
 pose+0.001
 surely+0.001
olog+0.001
 drib+0.001
 seventeen+0.001
Neg logit contributions
lessness-0.001
pt-0.001
 twins-0.001
 everywhere-0.001
 crystals-0.001
Token ju
Token ID7544
Feature activation-1.43e-01

Pos logit contributions
 pose+0.033
 surely+0.033
olog+0.032
zes+0.031
 drib+0.031
Neg logit contributions
lessness-0.021
pt-0.020
 everywhere-0.018
 crystals-0.018
 twins-0.018
Tokenic
Token ID291
Feature activation-2.47e-02

Pos logit contributions
 surely+0.241
 pose+0.240
olog+0.231
 seventeen+0.223
 drib+0.221
Neg logit contributions
lessness-0.243
pt-0.242
aky-0.215
 twins-0.215
 crystals-0.212
Tokeniest
Token ID6386
Feature activation+7.41e-03

Pos logit contributions
 surely+0.060
 pose+0.060
 drib+0.057
 som+0.057
olog+0.057
Neg logit contributions
lessness-0.022
pt-0.021
 twins-0.017
 everywhere-0.017
aky-0.016
Token one
Token ID530
Feature activation+3.55e-02

Pos logit contributions
lessness+0.014
pt+0.013
 twins+0.011
aky+0.011
 everywhere+0.011
Neg logit contributions
 pose-0.012
 surely-0.012
 drib-0.011
 som-0.011
olog-0.011
Token,
Token ID11
Feature activation-3.39e-03

Pos logit contributions
lessness+0.050
pt+0.049
 twins+0.042
aky+0.040
 crystals+0.039
Neg logit contributions
 pose-0.072
 surely-0.071
 drib-0.067
olog-0.066
 som-0.066
Token but
Token ID475
Feature activation-6.66e-03

Pos logit contributions
 surely+0.007
 pose+0.007
 seventeen+0.007
 drib+0.007
 som+0.007
Neg logit contributions
lessness-0.005
pt-0.005
 twins-0.004
aky-0.004
 crystals-0.003
Token of
Token ID286
Feature activation+3.46e-03

Pos logit contributions
lessness+0.049
pt+0.047
 twins+0.039
aky+0.039
 crystals+0.037
Neg logit contributions
 surely-0.106
 pose-0.106
 drib-0.101
olog-0.100
 som-0.099
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
lessness+0.005
pt+0.005
 twins+0.004
aky+0.004
 crystals+0.004
Neg logit contributions
 pose-0.007
 surely-0.007
olog-0.006
 drib-0.006
 seventeen-0.006
Token frog
Token ID21264
Feature activation-8.82e-04

Pos logit contributions
lessness+0.018
pt+0.017
 twins+0.015
aky+0.015
 crystals+0.015
Neg logit contributions
 surely-0.019
 pose-0.019
olog-0.018
 drib-0.018
 seventeen-0.018
Token.
Token ID13
Feature activation+2.99e-02

Pos logit contributions
 pose+0.002
 surely+0.002
olog+0.002
 drib+0.002
 seventeen+0.002
Neg logit contributions
lessness-0.001
pt-0.001
 twins-0.001
 crystals-0.001
aky-0.001
Token He
Token ID679
Feature activation+2.25e-03

Pos logit contributions
lessness+0.039
pt+0.036
 twins+0.034
 everywhere+0.031
 of+0.031
Neg logit contributions
 pose-0.064
 surely-0.061
olog-0.060
zes-0.059
 seventeen-0.059
Token spe
Token ID693
Feature activation-1.43e-01

Pos logit contributions
pt+0.003
lessness+0.003
 everywhere+0.003
 who+0.003
 of+0.003
Neg logit contributions
 pose-0.004
-0.004
olog-0.004
 surely-0.004
 seventeen-0.004
Tokenared
Token ID1144
Feature activation-1.20e-02

Pos logit contributions
 surely+0.245
 pose+0.239
olog+0.237
+0.221
 jugg+0.216
Neg logit contributions
pt-0.242
aky-0.236
lessness-0.229
ruck-0.206
 crystals-0.199
Token the
Token ID262
Feature activation-1.23e-02

Pos logit contributions
 surely+0.024
 pose+0.024
olog+0.023
 drib+0.023
 seventeen+0.023
Neg logit contributions
lessness-0.017
pt-0.016
 twins-0.014
 crystals-0.013
aky-0.013
Token lazy
Token ID16931
Feature activation-1.06e-02

Pos logit contributions
 pose+0.022
 surely+0.022
olog+0.021
 drib+0.020
 seventeen+0.020
Neg logit contributions
lessness-0.020
pt-0.019
 twins-0.017
 everywhere-0.017
 crystals-0.017
Token frog
Token ID21264
Feature activation-4.60e-03

Pos logit contributions
 surely+0.021
olog+0.021
 pose+0.021
ven+0.019
 drib+0.019
Neg logit contributions
lessness-0.015
pt-0.015
 twins-0.013
 crystals-0.013
 everywhere-0.013
Token and
Token ID290
Feature activation-1.31e-03

Pos logit contributions
 surely+0.009
 pose+0.008
olog+0.008
 seventeen+0.008
 drib+0.008
Neg logit contributions
lessness-0.007
pt-0.007
 everywhere-0.006
 twins-0.006
 crystals-0.006
Token and
Token ID290
Feature activation-2.88e-03

Pos logit contributions
lessness+0.026
pt+0.024
 twins+0.020
 crystals+0.019
aky+0.019
Neg logit contributions
 pose-0.063
 surely-0.063
olog-0.061
 drib-0.060
 seventeen-0.059
Token saying
Token ID2282
Feature activation+5.21e-02

Pos logit contributions
olog+0.006
+0.005
 seventeen+0.005
Too+0.005
 surely+0.005
Neg logit contributions
pt-0.004
lessness-0.004
 everywhere-0.004
 who-0.004
 twins-0.004
Token,
Token ID11
Feature activation-2.26e-02

Pos logit contributions
lessness+0.065
pt+0.063
 twins+0.053
 everywhere+0.052
 crystals+0.051
Neg logit contributions
 surely-0.114
 pose-0.112
olog-0.109
 drib-0.109
 seventeen-0.106
Token "
Token ID366
Feature activation-5.11e-02

Pos logit contributions
 pose+0.045
olog+0.045
 surely+0.045
Too+0.045
ven+0.043
Neg logit contributions
lessness-0.029
 who-0.029
 everywhere-0.028
 twins-0.028
pt-0.026
TokenA
Token ID32
Feature activation-1.01e-02

Pos logit contributions
 pose+0.091
olog+0.089
 som+0.087
 surely+0.087
 seventeen+0.086
Neg logit contributions
lessness-0.083
pt-0.071
aky-0.071
 twins-0.066
 who-0.066
Tokenbr
Token ID1671
Feature activation-1.43e-01

Pos logit contributions
 surely+0.020
olog+0.020
zes+0.020
+0.020
 drib+0.019
Neg logit contributions
pt-0.012
 of-0.012
lessness-0.012
 twins-0.011
 who-0.011
Tokenac
Token ID330
Feature activation-5.75e-02

Pos logit contributions
 pose+0.280
 surely+0.280
olog+0.270
 drib+0.264
 seventeen+0.262
Neg logit contributions
lessness-0.211
pt-0.202
 twins-0.176
 everywhere-0.172
 crystals-0.172
Tokenad
Token ID324
Feature activation-2.68e-03

Pos logit contributions
 surely+0.109
 pose+0.108
olog+0.106
 drib+0.103
+0.101
Neg logit contributions
lessness-0.088
pt-0.087
 twins-0.076
aky-0.076
 crystals-0.074
Tokenab
Token ID397
Feature activation-1.06e-01

Pos logit contributions
 surely+0.004
 pose+0.004
olog+0.004
 drib+0.004
 seventeen+0.004
Neg logit contributions
lessness-0.005
pt-0.005
 twins-0.004
aky-0.004
 everywhere-0.004
Tokenra
Token ID430
Feature activation-3.44e-02

Pos logit contributions
 squats+0.143
 subt+0.140
 sidelines+0.136
 pose+0.135
 drib+0.134
Neg logit contributions
pt-0.204
 of-0.182
lessness-0.182
 twins-0.173
 who-0.172
Token!"
Token ID2474
Feature activation-3.78e-02

Pos logit contributions
 surely+0.073
 pose+0.071
olog+0.070
 drib+0.070
 seventeen+0.069
Neg logit contributions
pt-0.044
lessness-0.042
 twins-0.035
 everywhere-0.035
 of-0.034
Token small
Token ID1402
Feature activation-2.25e-02

Pos logit contributions
 pose+0.059
 surely+0.058
olog+0.057
 som+0.056
+0.056
Neg logit contributions
lessness-0.032
pt-0.030
aky-0.027
 everywhere-0.026
 twins-0.025
Token for
Token ID329
Feature activation+8.00e-04

Pos logit contributions
 pose+0.046
 drib+0.042
 surely+0.042
olog+0.041
 score+0.041
Neg logit contributions
lessness-0.031
pt-0.030
aky-0.027
 crystals-0.026
ovable-0.025
Token Ben
Token ID3932
Feature activation+1.68e-02

Pos logit contributions
lessness+0.001
pt+0.001
aky+0.001
 twins+0.001
 crystals+0.001
Neg logit contributions
 pose-0.002
 surely-0.002
 drib-0.001
 score-0.001
 som-0.001
Token.
Token ID13
Feature activation+3.89e-03

Pos logit contributions
lessness+0.021
pt+0.019
 crystals+0.019
 twins+0.017
 bricks+0.016
Neg logit contributions
 pose-0.037
 drib-0.035
 surely-0.035
 score-0.034
olog-0.033
Token He
Token ID679
Feature activation-1.08e-02

Pos logit contributions
lessness+0.005
pt+0.004
 twins+0.004
 everywhere+0.004
 of+0.004
Neg logit contributions
 pose-0.008
 surely-0.008
olog-0.008
 salon-0.008
zes-0.008
Token squee
Token ID13170
Feature activation-1.42e-01

Pos logit contributions
 surely+0.018
olog+0.017
+0.017
 pose+0.016
 seventeen+0.016
Neg logit contributions
pt-0.020
 everywhere-0.018
 twins-0.017
lessness-0.017
 who-0.016
Tokenzes
Token ID12271
Feature activation-9.86e-03

Pos logit contributions
 surely+0.126
 pose+0.106
olog+0.101
 salon+0.101
 seventeen+0.096
Neg logit contributions
pt-0.370
 everywhere-0.327
 twins-0.325
 toys-0.316
lessness-0.312
Token in
Token ID287
Feature activation-3.25e-02

Pos logit contributions
 pose+0.018
 surely+0.018
olog+0.018
 seventeen+0.017
 drib+0.017
Neg logit contributions
lessness-0.016
pt-0.015
 everywhere-0.014
 twins-0.013
 crystals-0.013
Token,
Token ID11
Feature activation-6.69e-03

Pos logit contributions
 pose+0.062
 surely+0.061
 drib+0.059
 seventeen+0.058
olog+0.058
Neg logit contributions
lessness-0.050
pt-0.049
 twins-0.041
aky-0.041
 crystals-0.041
Token but
Token ID475
Feature activation-1.03e-02

Pos logit contributions
olog+0.016
 surely+0.015
 pose+0.015
ven+0.015
+0.015
Neg logit contributions
lessness-0.007
pt-0.007
 everywhere-0.006
 twins-0.006
 who-0.006
Token it
Token ID340
Feature activation-2.36e-02

Pos logit contributions
olog+0.023
+0.022
 pose+0.022
 seventeen+0.022
 coaster+0.021
Neg logit contributions
lessness-0.013
 everywhere-0.012
pt-0.011
 of-0.011
 twins-0.010
Token was
Token ID373
Feature activation+6.67e-03

Pos logit contributions
 surely+0.011
 pose+0.011
olog+0.011
 drib+0.010
 som+0.010
Neg logit contributions
lessness-0.010
pt-0.010
 twins-0.009
 crystals-0.008
aky-0.008
Token the
Token ID262
Feature activation-1.59e-02

Pos logit contributions
lessness+0.009
pt+0.009
 twins+0.008
 everywhere+0.007
 crystals+0.007
Neg logit contributions
 pose-0.014
 surely-0.014
olog-0.013
 drib-0.013
 seventeen-0.013
Token sweet
Token ID6029
Feature activation-1.91e-02

Pos logit contributions
 surely+0.033
 pose+0.033
 drib+0.031
olog+0.031
 seventeen+0.031
Neg logit contributions
lessness-0.022
pt-0.021
aky-0.017
 twins-0.017
 crystals-0.017
Tokenest
Token ID395
Feature activation+5.42e-03

Pos logit contributions
 pose+0.035
 surely+0.035
 seventeen+0.033
olog+0.033
 drib+0.033
Neg logit contributions
lessness-0.030
pt-0.030
aky-0.026
 everywhere-0.025
 twins-0.024
Token and
Token ID290
Feature activation-2.00e-02

Pos logit contributions
lessness+0.010
pt+0.010
aky+0.008
 twins+0.008
 everywhere+0.008
Neg logit contributions
 pose-0.009
 surely-0.009
olog-0.008
 drib-0.008
 seventeen-0.008
Token ju
Token ID7544
Feature activation-1.42e-01

Pos logit contributions
 pose+0.043
olog+0.043
 surely+0.042
zes+0.041
 drib+0.041
Neg logit contributions
lessness-0.025
pt-0.023
 crystals-0.021
 twins-0.021
 everywhere-0.020
Tokenic
Token ID291
Feature activation-1.99e-02

Pos logit contributions
 pose+0.220
 surely+0.220
olog+0.210
 drib+0.204
 seventeen+0.203
Neg logit contributions
lessness-0.267
pt-0.259
 twins-0.233
 everywhere-0.228
 crystals-0.228
Tokeniest
Token ID6386
Feature activation+9.56e-03

Pos logit contributions
 pose+0.047
 surely+0.047
 drib+0.045
 som+0.045
 jugg+0.044
Neg logit contributions
lessness-0.018
pt-0.017
 everywhere-0.015
 crystals-0.014
 twins-0.014
Token cherry
Token ID23612
Feature activation-7.06e-02

Pos logit contributions
lessness+0.017
pt+0.016
aky+0.015
 twins+0.014
 everywhere+0.014
Neg logit contributions
 pose-0.016
 surely-0.016
 seventeen-0.015
 som-0.015
 drib-0.015
Token he
Token ID339
Feature activation+1.99e-02

Pos logit contributions
 pose+0.144
 surely+0.144
olog+0.137
 drib+0.136
 seventeen+0.135
Neg logit contributions
lessness-0.096
pt-0.094
 twins-0.079
aky-0.077
 crystals-0.075
Token had
Token ID550
Feature activation+4.11e-02

Pos logit contributions
lessness+0.025
pt+0.024
 twins+0.021
 everywhere+0.020
 crystals+0.020
Neg logit contributions
 pose-0.043
 surely-0.043
olog-0.041
 seventeen-0.040
 drib-0.040
Token rewarded
Token ID20945
Feature activation-4.97e-03

Pos logit contributions
lessness+0.006
pt+0.006
 everywhere+0.005
 twins+0.005
 of+0.005
Neg logit contributions
-0.018
olog-0.018
 pose-0.018
Too-0.018
 surely-0.017
Token with
Token ID351
Feature activation+1.25e-02

Pos logit contributions
 surely+0.010
 pose+0.010
 seventeen+0.010
olog+0.010
 drib+0.010
Neg logit contributions
lessness-0.007
pt-0.006
 everywhere-0.006
 twins-0.006
 crystals-0.005
Token the
Token ID262
Feature activation+3.82e-04

Pos logit contributions
lessness+0.016
pt+0.015
aky+0.013
 twins+0.013
 crystals+0.012
Neg logit contributions
 surely-0.027
 pose-0.027
 drib-0.026
olog-0.026
 som-0.025
Token biggest
Token ID4094
Feature activation+1.46e-02

Pos logit contributions
lessness+0.001
pt+0.001
 twins+0.001
aky+0.001
 everywhere+0.001
Neg logit contributions
 surely-0.001
 pose-0.001
olog-0.001
 seventeen-0.001
 drib-0.001
Token,
Token ID11
Feature activation-8.24e-03

Pos logit contributions
lessness+0.026
pt+0.025
 twins+0.022
 everywhere+0.022
aky+0.022
Neg logit contributions
 pose-0.024
 surely-0.024
olog-0.023
 drib-0.022
 seventeen-0.022
Token ju
Token ID7544
Feature activation-1.42e-01

Pos logit contributions
 surely+0.017
 pose+0.016
 seventeen+0.016
olog+0.016
 som+0.015
Neg logit contributions
lessness-0.012
pt-0.011
 twins-0.010
aky-0.009
 everywhere-0.009
Tokenic
Token ID291
Feature activation-2.30e-02

Pos logit contributions
 pose+0.231
 surely+0.230
olog+0.220
 drib+0.214
 seventeen+0.211
Neg logit contributions
lessness-0.253
pt-0.247
 twins-0.221
aky-0.218
 crystals-0.217
Tokeniest
Token ID6386
Feature activation+1.24e-02

Pos logit contributions
 pose+0.055
 surely+0.055
olog+0.052
 som+0.052
 jugg+0.052
Neg logit contributions
pt-0.021
lessness-0.020
 everywhere-0.017
 twins-0.016
 crystals-0.016
Token cher
Token ID21382
Feature activation-9.54e-02

Pos logit contributions
lessness+0.022
pt+0.022
aky+0.019
 twins+0.019
 everywhere+0.019
Neg logit contributions
 pose-0.020
 surely-0.020
 seventeen-0.019
olog-0.019
 som-0.019
Tokenries
Token ID1678
Feature activation-1.93e-02

Pos logit contributions
 surely+0.170
 pose+0.169
olog+0.163
 drib+0.159
 seventeen+0.158
Neg logit contributions
lessness-0.156
pt-0.151
 twins-0.133
 everywhere-0.130
aky-0.129
Token!
Token ID0
Feature activation+1.59e-02

Pos logit contributions
 surely+0.039
 pose+0.038
olog+0.037
 drib+0.036
 seventeen+0.036
Neg logit contributions
lessness-0.026
pt-0.026
 twins-0.023
aky-0.022
 crystals-0.021
Token some
Token ID617
Feature activation+3.32e-02

Pos logit contributions
lessness+0.113
pt+0.092
aky+0.088
 twins+0.086
iness+0.083
Neg logit contributions
 pose-0.109
 som-0.102
 surely-0.101
 reaches-0.094
 sixty-0.094
Token of
Token ID286
Feature activation-1.98e-02

Pos logit contributions
lessness+0.060
pt+0.050
aky+0.045
 twins+0.045
 alien+0.045
Neg logit contributions
 pose-0.058
 som-0.053
 drib-0.052
 squats-0.050
 score-0.048
Token the
Token ID262
Feature activation+1.23e-02

Pos logit contributions
 pose+0.040
 surely+0.039
 drib+0.038
olog+0.038
 som+0.038
Neg logit contributions
lessness-0.029
pt-0.027
 twins-0.023
aky-0.023
 crystals-0.022
Token biggest
Token ID4094
Feature activation+9.61e-03

Pos logit contributions
lessness+0.021
pt+0.020
aky+0.016
ovable+0.016
ruck+0.015
Neg logit contributions
 pose-0.022
 drib-0.020
 surely-0.020
 som-0.020
-0.019
Token,
Token ID11
Feature activation-4.03e-03

Pos logit contributions
lessness+0.015
pt+0.014
aky+0.012
 twins+0.012
 crystals+0.012
Neg logit contributions
 pose-0.019
 surely-0.017
 som-0.017
 drib-0.017
olog-0.017
Token ju
Token ID7544
Feature activation-1.42e-01

Pos logit contributions
 pose+0.008
 surely+0.008
 som+0.007
 seventeen+0.007
olog+0.007
Neg logit contributions
lessness-0.006
pt-0.006
 twins-0.005
aky-0.005
 everywhere-0.005
Tokenic
Token ID291
Feature activation-2.36e-02

Pos logit contributions
 surely+0.241
 pose+0.238
olog+0.231
 seventeen+0.222
 drib+0.220
Neg logit contributions
lessness-0.239
pt-0.238
aky-0.211
 twins-0.210
 crystals-0.209
Tokeniest
Token ID6386
Feature activation+6.42e-03

Pos logit contributions
 pose+0.057
 surely+0.055
 som+0.052
ede+0.052
ven+0.052
Neg logit contributions
lessness-0.023
pt-0.022
 everywhere-0.018
 twins-0.018
aky-0.016
Token berries
Token ID36322
Feature activation-1.35e-02

Pos logit contributions
lessness+0.010
pt+0.009
aky+0.008
 twins+0.008
ovable+0.007
Neg logit contributions
 pose-0.013
 som-0.012
 drib-0.012
 surely-0.012
-0.011
Token in
Token ID287
Feature activation-4.42e-02

Pos logit contributions
 surely+0.025
 pose+0.025
 drib+0.023
olog+0.022
 som+0.022
Neg logit contributions
lessness-0.019
pt-0.018
 twins-0.018
aky-0.017
ruck-0.016
Token the
Token ID262
Feature activation+2.46e-02

Pos logit contributions
 surely+0.078
 pose+0.078
olog+0.074
 drib+0.073
 seventeen+0.072
Neg logit contributions
lessness-0.074
pt-0.071
 twins-0.063
 crystals-0.062
aky-0.061
Token hugged
Token ID41130
Feature activation+2.99e-02

Pos logit contributions
 surely+0.015
 pose+0.015
olog+0.015
 drib+0.014
 seventeen+0.014
Neg logit contributions
lessness-0.013
pt-0.012
 twins-0.011
 crystals-0.011
 everywhere-0.011
Token her
Token ID607
Feature activation+4.69e-02

Pos logit contributions
lessness+0.029
pt+0.028
 twins+0.023
 everywhere+0.022
 crystals+0.021
Neg logit contributions
 pose-0.074
 surely-0.073
olog-0.072
 seventeen-0.070
 drib-0.070
Token friend
Token ID1545
Feature activation+7.10e-02

Pos logit contributions
lessness+0.081
pt+0.074
 crystals+0.062
aky+0.062
ruck+0.058
Neg logit contributions
 pose-0.085
 surely-0.084
 score-0.079
olog-0.079
 drib-0.078
Token to
Token ID284
Feature activation-1.72e-02

Pos logit contributions
lessness+0.099
pt+0.092
 crystals+0.082
aky+0.076
 twins+0.072
Neg logit contributions
 pose-0.146
 surely-0.143
 drib-0.140
olog-0.140
 score-0.135
Token comfort
Token ID4467
Feature activation+3.50e-02

Pos logit contributions
 surely+0.037
 pose+0.036
 drib+0.035
olog+0.035
 jugg+0.034
Neg logit contributions
lessness-0.023
pt-0.021
aky-0.019
 crystals-0.018
 twins-0.018
Token her
Token ID607
Feature activation+6.33e-02

Pos logit contributions
lessness+0.047
pt+0.041
 crystals+0.039
aky+0.039
 twins+0.038
Neg logit contributions
 surely-0.072
 pose-0.070
olog-0.068
zes-0.067
-0.067
Token.
Token ID13
Feature activation+2.92e-02

Pos logit contributions
lessness+0.090
pt+0.083
 crystals+0.069
aky+0.068
ruck+0.063
Neg logit contributions
 pose-0.131
 surely-0.128
 score-0.121
olog-0.121
 drib-0.120
Token Her
Token ID2332
Feature activation+6.33e-03

Pos logit contributions
lessness+0.030
pt+0.029
 twins+0.027
 everywhere+0.025
 crystals+0.022
Neg logit contributions
 pose-0.070
 surely-0.069
 seventeen-0.066
 som-0.066
 jugg-0.065
Token friend
Token ID1545
Feature activation+3.67e-02

Pos logit contributions
lessness+0.010
pt+0.009
 crystals+0.008
aky+0.007
 everywhere+0.007
Neg logit contributions
 pose-0.013
 surely-0.012
olog-0.012
 drib-0.012
 score-0.012
Token stopped
Token ID5025
Feature activation+3.81e-03

Pos logit contributions
lessness+0.068
pt+0.060
 crystals+0.060
aky+0.055
 everywhere+0.054
Neg logit contributions
 surely-0.058
 pose-0.057
olog-0.056
 drib-0.055
 jugg-0.053
Token crying
Token ID13774
Feature activation+4.32e-02

Pos logit contributions
lessness+0.005
pt+0.004
 twins+0.003
 crystals+0.003
aky+0.003
Neg logit contributions
 pose-0.009
 surely-0.009
 drib-0.008
olog-0.008
 som-0.008
Token it
Token ID340
Feature activation-2.78e-03

Pos logit contributions
 pose+0.040
 surely+0.040
olog+0.039
 drib+0.038
 seventeen+0.038
Neg logit contributions
lessness-0.032
pt-0.031
 twins-0.027
 everywhere-0.027
 crystals-0.026
Token,
Token ID11
Feature activation-1.47e-02

Pos logit contributions
 pose+0.005
 surely+0.005
olog+0.005
 drib+0.005
 seventeen+0.005
Neg logit contributions
lessness-0.004
pt-0.004
 twins-0.003
aky-0.003
 crystals-0.003
Token Jane
Token ID12091
Feature activation-1.21e-02

Pos logit contributions
 pose+0.026
 surely+0.026
olog+0.025
 drib+0.025
 seventeen+0.024
Neg logit contributions
lessness-0.024
pt-0.023
 twins-0.021
 everywhere-0.020
 crystals-0.020
Token?"
Token ID1701
Feature activation-1.70e-02

Pos logit contributions
 pose+0.021
 surely+0.021
olog+0.020
 drib+0.020
 jugg+0.019
Neg logit contributions
lessness-0.020
pt-0.019
 everywhere-0.016
 crystals-0.016
 twins-0.016
Token her
Token ID607
Feature activation-5.20e-03

Pos logit contributions
 pose+0.035
 surely+0.035
olog+0.034
 seventeen+0.033
 drib+0.032
Neg logit contributions
lessness-0.023
pt-0.022
 twins-0.020
 crystals-0.019
 everywhere-0.019
Token mother
Token ID2802
Feature activation+6.58e-02

Pos logit contributions
 pose+0.010
 surely+0.010
olog+0.010
 seventeen+0.009
 drib+0.009
Neg logit contributions
lessness-0.008
pt-0.007
 twins-0.007
 everywhere-0.006
 crystals-0.006
Token asked
Token ID1965
Feature activation+2.99e-02

Pos logit contributions
pt+0.088
lessness+0.084
 who+0.075
 twins+0.074
 of+0.072
Neg logit contributions
 surely-0.137
 pose-0.134
olog-0.133
 seventeen-0.133
 som-0.129
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
lessness+0.035
pt+0.032
aky+0.029
 crystals+0.028
 twins+0.027
Neg logit contributions
 pose-0.065
 surely-0.065
olog-0.064
 drib-0.062
zes-0.062
Token
Token ID198
Feature activation+8.69e-03

Pos logit contributions
lessness+0.025
pt+0.023
 twins+0.022
 everywhere+0.020
 who+0.020
Neg logit contributions
 pose-0.042
 surely-0.040
olog-0.038
 som-0.038
 seventeen-0.038
Token
Token ID198
Feature activation-1.78e-03

Pos logit contributions
lessness+0.011
pt+0.011
 twins+0.009
 everywhere+0.009
aky+0.009
Neg logit contributions
 pose-0.018
 surely-0.018
olog-0.018
 drib-0.017
 seventeen-0.017
Token"
Token ID1
Feature activation+3.74e-03

Pos logit contributions
 pose+0.003
 surely+0.003
 som+0.003
olog+0.003
 drib+0.003
Neg logit contributions
lessness-0.003
pt-0.003
aky-0.003
 twins-0.003
 of-0.003
Token He
Token ID679
Feature activation+3.54e-03

Pos logit contributions
olog+0.060
 surely+0.059
 pose+0.058
 seventeen+0.058
ven+0.056
Neg logit contributions
lessness-0.042
pt-0.042
 everywhere-0.038
 twins-0.037
 crystals-0.035
Token played
Token ID2826
Feature activation+2.37e-02

Pos logit contributions
pt+0.005
lessness+0.005
 twins+0.004
 everywhere+0.004
 crystals+0.004
Neg logit contributions
 surely-0.007
 pose-0.007
olog-0.007
 seventeen-0.006
-0.006
Token happily
Token ID18177
Feature activation+1.71e-03

Pos logit contributions
lessness+0.026
 everywhere+0.025
pt+0.024
 twins+0.023
 of+0.021
Neg logit contributions
 surely-0.054
olog-0.053
-0.052
 seventeen-0.052
 pose-0.052
Token with
Token ID351
Feature activation+4.04e-02

Pos logit contributions
lessness+0.002
pt+0.002
 twins+0.001
 everywhere+0.001
 crystals+0.001
Neg logit contributions
 surely-0.004
 pose-0.004
olog-0.004
 seventeen-0.004
 drib-0.004
Token his
Token ID465
Feature activation+5.85e-02

Pos logit contributions
lessness+0.067
pt+0.062
aky+0.058
 crystals+0.052
 twins+0.051
Neg logit contributions
 pose-0.071
 surely-0.070
 drib-0.069
 som-0.068
 score-0.068
Token mom
Token ID1995
Feature activation+7.50e-02

Pos logit contributions
lessness+0.071
pt+0.067
aky+0.054
 twins+0.053
 crystals+0.052
Neg logit contributions
 pose-0.130
 surely-0.129
 drib-0.125
olog-0.123
 score-0.122
Token until
Token ID1566
Feature activation+1.33e-02

Pos logit contributions
lessness+0.096
pt+0.089
 crystals+0.077
aky+0.071
 twins+0.069
Neg logit contributions
 pose-0.160
 surely-0.159
 drib-0.153
olog-0.151
 score-0.147
Token it
Token ID340
Feature activation-2.42e-02

Pos logit contributions
lessness+0.020
pt+0.019
 twins+0.017
 everywhere+0.017
 crystals+0.016
Neg logit contributions
 pose-0.026
olog-0.025
 surely-0.025
 seventeen-0.024
 drib-0.024
Token was
Token ID373
Feature activation-1.82e-03

Pos logit contributions
 pose+0.049
 surely+0.048
olog+0.047
 seventeen+0.047
 drib+0.046
Neg logit contributions
lessness-0.034
pt-0.034
 everywhere-0.029
 twins-0.028
aky-0.027
Token time
Token ID640
Feature activation-1.72e-03

Pos logit contributions
 surely+0.003
 pose+0.003
 drib+0.003
olog+0.003
 seventeen+0.003
Neg logit contributions
lessness-0.003
pt-0.003
 twins-0.003
 crystals-0.003
aky-0.003
Token for
Token ID329
Feature activation+8.88e-03

Pos logit contributions
 surely+0.003
 pose+0.003
olog+0.003
 drib+0.003
 som+0.002
Neg logit contributions
lessness-0.003
pt-0.003
aky-0.003
 crystals-0.003
ovable-0.003
Token and
Token ID290
Feature activation-2.52e-02

Pos logit contributions
 pose+0.151
 surely+0.147
 drib+0.141
olog+0.141
 som+0.140
Neg logit contributions
lessness-0.088
pt-0.084
 twins-0.069
aky-0.068
 everywhere-0.068
Token round
Token ID2835
Feature activation-4.71e-02

Pos logit contributions
+0.053
olog+0.052
zes+0.052
 pose+0.052
uren+0.050
Neg logit contributions
lessness-0.030
 crystals-0.030
pt-0.028
 everywhere-0.028
 twins-0.027
Tokener
Token ID263
Feature activation-1.87e-02

Pos logit contributions
 pose+0.111
 surely+0.108
 drib+0.105
 seventeen+0.104
 som+0.104
Neg logit contributions
lessness-0.051
pt-0.047
aky-0.038
 twins-0.038
 everywhere-0.036
Token.
Token ID13
Feature activation+1.01e-03

Pos logit contributions
 pose+0.045
 surely+0.045
olog+0.044
 drib+0.043
 seventeen+0.043
Neg logit contributions
lessness-0.018
pt-0.017
 twins-0.014
 crystals-0.013
 everywhere-0.013
Token He
Token ID679
Feature activation-7.31e-03

Pos logit contributions
lessness+0.001
pt+0.001
aky+0.001
 twins+0.001
 everywhere+0.001
Neg logit contributions
 surely-0.002
 pose-0.002
olog-0.002
 drib-0.002
 seventeen-0.002
Token thought
Token ID1807
Feature activation+7.91e-02

Pos logit contributions
 pose+0.015
 surely+0.015
olog+0.015
+0.015
 seventeen+0.015
Neg logit contributions
pt-0.009
lessness-0.009
 who-0.008
 everywhere-0.008
aky-0.008
Token it
Token ID340
Feature activation-2.77e-02

Pos logit contributions
lessness+0.132
pt+0.128
 twins+0.114
aky+0.113
 crystals+0.110
Neg logit contributions
 surely-0.139
 pose-0.138
olog-0.132
 drib-0.130
 som-0.129
Token would
Token ID561
Feature activation-4.55e-02

Pos logit contributions
 pose+0.049
 surely+0.049
olog+0.047
 drib+0.046
 som+0.045
Neg logit contributions
lessness-0.046
pt-0.044
 twins-0.039
aky-0.038
 crystals-0.038
Token make
Token ID787
Feature activation-5.79e-03

Pos logit contributions
 pose+0.054
olog+0.054
 surely+0.054
+0.050
 drib+0.050
Neg logit contributions
lessness-0.099
pt-0.098
 twins-0.089
 everywhere-0.088
 crystals-0.088
Token a
Token ID257
Feature activation+1.06e-02

Pos logit contributions
 pose+0.013
 surely+0.013
olog+0.013
 drib+0.012
 seventeen+0.012
Neg logit contributions
lessness-0.007
pt-0.006
 twins-0.005
 everywhere-0.005
 crystals-0.005
Token more
Token ID517
Feature activation-3.09e-02

Pos logit contributions
lessness+0.013
pt+0.012
 twins+0.012
 everywhere+0.011
 crystals+0.011
Neg logit contributions
 surely-0.022
-0.022
olog-0.022
 pose-0.021
ven-0.021
Token want
Token ID765
Feature activation+5.64e-02

Pos logit contributions
 pose+0.032
 surely+0.031
 score+0.029
olog+0.029
 drib+0.028
Neg logit contributions
lessness-0.032
pt-0.031
 crystals-0.029
 everywhere-0.028
 twins-0.028
Token to
Token ID284
Feature activation-6.43e-02

Pos logit contributions
lessness+0.088
pt+0.080
aky+0.074
 twins+0.070
 crystals+0.067
Neg logit contributions
 pose-0.110
 surely-0.109
 drib-0.104
 som-0.103
olog-0.102
Token leave
Token ID2666
Feature activation+1.05e-02

Pos logit contributions
 surely+0.113
 seventeen+0.113
 pose+0.112
olog+0.110
 drib+0.106
Neg logit contributions
pt-0.105
lessness-0.102
 twins-0.092
 everywhere-0.087
 of-0.086
Token.
Token ID13
Feature activation+1.50e-02

Pos logit contributions
lessness+0.015
pt+0.014
aky+0.012
 twins+0.012
 crystals+0.012
Neg logit contributions
 surely-0.021
 pose-0.021
olog-0.020
 drib-0.020
 som-0.019
Token She
Token ID1375
Feature activation-9.97e-03

Pos logit contributions
lessness+0.017
pt+0.016
 twins+0.015
 everywhere+0.014
 crystals+0.013
Neg logit contributions
 pose-0.034
 surely-0.034
 jugg-0.033
 drib-0.032
 som-0.032
Token wanted
Token ID2227
Feature activation+6.62e-02

Pos logit contributions
 surely+0.022
 pose+0.022
 seventeen+0.022
olog+0.021
+0.021
Neg logit contributions
pt-0.013
lessness-0.012
 twins-0.010
aky-0.010
 everywhere-0.010
Token to
Token ID284
Feature activation-5.41e-02

Pos logit contributions
lessness+0.102
pt+0.094
aky+0.085
 twins+0.081
 crystals+0.078
Neg logit contributions
 pose-0.129
 surely-0.128
 drib-0.122
olog-0.120
 som-0.120
Token keep
Token ID1394
Feature activation-1.15e-02

Pos logit contributions
 seventeen+0.096
olog+0.093
 surely+0.093
+0.092
 pose+0.091
Neg logit contributions
pt-0.089
lessness-0.082
 twins-0.077
 of-0.074
 everywhere-0.074
Token playing
Token ID2712
Feature activation+1.70e-02

Pos logit contributions
 surely+0.023
 pose+0.023
olog+0.022
 drib+0.022
 seventeen+0.021
Neg logit contributions
lessness-0.017
pt-0.016
 twins-0.014
aky-0.014
 crystals-0.014
Token.
Token ID13
Feature activation+2.32e-02

Pos logit contributions
lessness+0.020
pt+0.019
 twins+0.016
 crystals+0.016
aky+0.016
Neg logit contributions
 pose-0.038
 surely-0.038
olog-0.036
 drib-0.036
 som-0.036
Token The
Token ID383
Feature activation-3.12e-02

Pos logit contributions
lessness+0.030
pt+0.029
 twins+0.027
 everywhere+0.025
 of+0.024
Neg logit contributions
 surely-0.050
 pose-0.049
 jugg-0.047
 som-0.046
olog-0.046
Token The
Token ID383
Feature activation-3.99e-02

Pos logit contributions
lessness+0.025
pt+0.024
 twins+0.022
 everywhere+0.021
 crystals+0.020
Neg logit contributions
 surely-0.036
 pose-0.036
olog-0.035
 drib-0.034
 seventeen-0.034
Token dog
Token ID3290
Feature activation-2.52e-02

Pos logit contributions
 pose+0.078
 surely+0.076
 drib+0.074
olog+0.074
 score+0.073
Neg logit contributions
pt-0.059
lessness-0.058
 crystals-0.048
aky-0.048
 everywhere-0.046
Token destroys
Token ID27838
Feature activation+1.71e-03

Pos logit contributions
 surely+0.046
olog+0.043
+0.042
 avail+0.041
ven+0.041
Neg logit contributions
 everywhere-0.040
pt-0.040
 twins-0.038
 who-0.037
lessness-0.036
Token the
Token ID262
Feature activation-2.64e-03

Pos logit contributions
lessness+0.003
pt+0.003
aky+0.002
 twins+0.002
 crystals+0.002
Neg logit contributions
 pose-0.003
 surely-0.003
 drib-0.003
olog-0.003
 seventeen-0.003
Token tubes
Token ID21103
Feature activation+3.60e-02

Pos logit contributions
 pose+0.005
 drib+0.005
 score+0.005
 surely+0.005
olog+0.005
Neg logit contributions
pt-0.004
lessness-0.004
aky-0.003
 crystals-0.003
ovable-0.003
Token.
Token ID13
Feature activation+2.03e-02

Pos logit contributions
lessness+0.042
pt+0.042
 twins+0.035
aky+0.033
 crystals+0.033
Neg logit contributions
 surely-0.081
 pose-0.080
 drib-0.076
olog-0.076
 seventeen-0.075
Token The
Token ID383
Feature activation-3.79e-02

Pos logit contributions
lessness+0.031
pt+0.029
 twins+0.026
 everywhere+0.025
 crystals+0.025
Neg logit contributions
 pose-0.039
 surely-0.039
olog-0.038
 drib-0.037
 seventeen-0.037
Token tunnels
Token ID22642
Feature activation-1.15e-02

Pos logit contributions
 pose+0.074
 surely+0.072
 drib+0.070
olog+0.069
 score+0.069
Neg logit contributions
pt-0.056
lessness-0.055
 crystals-0.045
aky-0.045
 everywhere-0.044
Token and
Token ID290
Feature activation-4.80e-02

Pos logit contributions
 surely+0.022
 pose+0.022
olog+0.021
 drib+0.021
 seventeen+0.021
Neg logit contributions
lessness-0.017
pt-0.016
 twins-0.014
aky-0.014
 crystals-0.014
Token bridges
Token ID19432
Feature activation-3.47e-02

Pos logit contributions
 surely+0.097
olog+0.097
 pose+0.094
ven+0.092
+0.090
Neg logit contributions
lessness-0.068
pt-0.064
 everywhere-0.060
 twins-0.060
 crystals-0.058
Token and
Token ID290
Feature activation-4.40e-02

Pos logit contributions
 surely+0.073
 pose+0.073
 drib+0.069
olog+0.069
 seventeen+0.069
Neg logit contributions
lessness-0.046
pt-0.043
aky-0.037
 twins-0.036
 crystals-0.036
Token did
Token ID750
Feature activation-2.16e-02

Pos logit contributions
 surely+0.011
 pose+0.011
 seventeen+0.010
olog+0.010
 som+0.010
Neg logit contributions
pt-0.007
lessness-0.007
 twins-0.006
 everywhere-0.006
aky-0.006
Token not
Token ID407
Feature activation-1.11e-02

Pos logit contributions
 seventeen+0.043
 surely+0.039
 pose+0.039
olog+0.038
 jugg+0.038
Neg logit contributions
pt-0.031
lessness-0.031
 everywhere-0.029
 who-0.028
 twins-0.027
Token give
Token ID1577
Feature activation+8.80e-04

Pos logit contributions
 pose+0.020
 surely+0.019
 score+0.018
olog+0.018
 drib+0.018
Neg logit contributions
lessness-0.019
pt-0.018
 twins-0.016
 crystals-0.016
 everywhere-0.016
Token up
Token ID510
Feature activation-4.19e-02

Pos logit contributions
lessness+0.001
pt+0.001
 twins+0.001
 everywhere+0.001
 crystals+0.001
Neg logit contributions
 surely-0.002
 pose-0.002
olog-0.001
 drib-0.001
 seventeen-0.001
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
 pose+0.096
 surely+0.095
 drib+0.090
olog+0.090
 som+0.090
Neg logit contributions
lessness-0.046
pt-0.043
aky-0.036
 crystals-0.035
 twins-0.035
Token She
Token ID1375
Feature activation-9.42e-03

Pos logit contributions
lessness+0.013
pt+0.012
 twins+0.010
 everywhere+0.010
 crystals+0.009
Neg logit contributions
 surely-0.031
 pose-0.031
 seventeen-0.030
 drib-0.029
olog-0.029
Token tried
Token ID3088
Feature activation+8.86e-03

Pos logit contributions
 surely+0.019
 pose+0.019
olog+0.018
 seventeen+0.018
 som+0.018
Neg logit contributions
pt-0.013
lessness-0.013
 twins-0.011
 everywhere-0.011
aky-0.011
Token again
Token ID757
Feature activation-4.07e-02

Pos logit contributions
lessness+0.010
pt+0.010
 everywhere+0.008
 twins+0.008
 crystals+0.008
Neg logit contributions
 surely-0.020
 pose-0.020
olog-0.020
 seventeen-0.020
-0.019
Token to
Token ID284
Feature activation-5.12e-02

Pos logit contributions
 pose+0.091
 surely+0.089
olog+0.085
 drib+0.085
 score+0.083
Neg logit contributions
lessness-0.051
pt-0.048
 twins-0.041
 crystals-0.039
 everywhere-0.038
Token unt
Token ID1418
Feature activation-3.41e-02

Pos logit contributions
 seventeen+0.087
olog+0.085
 surely+0.082
 pose+0.082
ven+0.080
Neg logit contributions
pt-0.086
lessness-0.081
 everywhere-0.075
 twins-0.075
 who-0.074
Tokenie
Token ID494
Feature activation+5.08e-02

Pos logit contributions
 surely+0.074
 pose+0.073
 drib+0.071
zes+0.069
olog+0.069
Neg logit contributions
lessness-0.043
pt-0.039
 twins-0.034
 crystals-0.032
aky-0.031
Token the
Token ID262
Feature activation-3.79e-03

Pos logit contributions
lessness+0.012
pt+0.012
 everywhere+0.011
 twins+0.011
 crystals+0.010
Neg logit contributions
 surely-0.019
 seventeen-0.019
olog-0.018
 pose-0.018
 drib-0.018
Token bird
Token ID6512
Feature activation-6.28e-03

Pos logit contributions
 surely+0.007
 pose+0.007
 drib+0.006
olog+0.006
 score+0.006
Neg logit contributions
lessness-0.006
pt-0.006
aky-0.005
 everywhere-0.005
 crystals-0.005
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
 pose+0.012
 surely+0.011
 score+0.010
 drib+0.010
 som+0.010
Neg logit contributions
lessness-0.011
pt-0.009
 twins-0.009
aky-0.009
ruck-0.008
Token She
Token ID1375
Feature activation-5.23e-03

Pos logit contributions
lessness+0.021
pt+0.018
 twins+0.016
 everywhere+0.016
 of+0.015
Neg logit contributions
 surely-0.050
 pose-0.049
 seventeen-0.049
 jugg-0.048
 som-0.047
Token opened
Token ID4721
Feature activation-1.47e-02

Pos logit contributions
 surely+0.010
 pose+0.010
 seventeen+0.010
olog+0.010
+0.010
Neg logit contributions
pt-0.008
lessness-0.007
aky-0.006
 everywhere-0.006
 twins-0.006
Token her
Token ID607
Feature activation+5.32e-02

Pos logit contributions
 surely+0.030
 pose+0.030
olog+0.028
 drib+0.028
 seventeen+0.028
Neg logit contributions
lessness-0.021
pt-0.020
 twins-0.017
aky-0.017
 crystals-0.016
Token umbrella
Token ID25510
Feature activation+1.93e-02

Pos logit contributions
lessness+0.067
pt+0.064
 twins+0.054
 everywhere+0.052
 crystals+0.052
Neg logit contributions
 pose-0.116
 surely-0.116
olog-0.112
 drib-0.110
 seventeen-0.109
Token and
Token ID290
Feature activation-6.47e-03

Pos logit contributions
lessness+0.018
pt+0.017
 everywhere+0.014
 twins+0.013
 crystals+0.013
Neg logit contributions
 pose-0.048
 surely-0.047
olog-0.046
 seventeen-0.046
 drib-0.045
Token put
Token ID1234
Feature activation-6.46e-03

Pos logit contributions
 pose+0.012
 surely+0.012
 seventeen+0.012
olog+0.012
 drib+0.012
Neg logit contributions
pt-0.010
lessness-0.009
 everywhere-0.008
 crystals-0.008
 twins-0.008
Token it
Token ID340
Feature activation+1.32e-02

Pos logit contributions
 surely+0.012
 pose+0.011
 drib+0.011
olog+0.011
 som+0.011
Neg logit contributions
lessness-0.010
pt-0.010
 twins-0.009
aky-0.009
 crystals-0.008
Token on
Token ID319
Feature activation-4.29e-02

Pos logit contributions
lessness+0.017
 twins+0.015
pt+0.014
ruck+0.013
aky+0.013
Neg logit contributions
 surely-0.029
 pose-0.027
 drib-0.026
olog-0.025
zes-0.025
Token wanted
Token ID2227
Feature activation+6.67e-02

Pos logit contributions
 pose+0.061
 surely+0.060
olog+0.058
 drib+0.057
 seventeen+0.056
Neg logit contributions
lessness-0.051
pt-0.048
 twins-0.043
 crystals-0.042
 everywhere-0.042
Token her
Token ID607
Feature activation+2.15e-02

Pos logit contributions
lessness+0.103
pt+0.095
aky+0.089
 twins+0.085
 crystals+0.079
Neg logit contributions
 pose-0.129
 surely-0.129
 drib-0.119
 som-0.119
olog-0.119
Token to
Token ID284
Feature activation-5.30e-02

Pos logit contributions
lessness+0.041
aky+0.041
pt+0.039
ruck+0.035
ovable+0.035
Neg logit contributions
 surely-0.033
 pose-0.032
 score-0.029
 drib-0.028
 destination-0.028
Token disappear
Token ID10921
Feature activation+1.01e-02

Pos logit contributions
 pose+0.083
 surely+0.083
olog+0.080
 drib+0.077
 seventeen+0.077
Neg logit contributions
lessness-0.098
pt-0.096
 twins-0.086
 everywhere-0.084
 crystals-0.084
Token again
Token ID757
Feature activation-2.25e-02

Pos logit contributions
lessness+0.020
pt+0.019
 twins+0.017
aky+0.017
 crystals+0.016
Neg logit contributions
 pose-0.015
 surely-0.015
olog-0.014
 drib-0.014
 som-0.014
Token.
Token ID13
Feature activation+2.43e-02

Pos logit contributions
 pose+0.056
 surely+0.055
olog+0.053
 drib+0.053
 som+0.052
Neg logit contributions
lessness-0.022
pt-0.020
 twins-0.016
 crystals-0.016
aky-0.015
Token<|endoftext|>
Token ID50256
Feature activation+2.79e-02

Pos logit contributions
lessness+0.032
pt+0.031
 twins+0.027
 everywhere+0.026
 crystals+0.025
Neg logit contributions
 pose-0.052
 surely-0.051
olog-0.049
 seventeen-0.049
 drib-0.049
TokenOnce
Token ID7454
Feature activation+3.41e-02

Pos logit contributions
lessness+0.049
pt+0.046
aky+0.042
 crystals+0.042
 twins+0.040
Neg logit contributions
 surely-0.045
 pose-0.045
 drib-0.042
olog-0.042
-0.042
Token there
Token ID612
Feature activation-9.07e-03

Pos logit contributions
pt+0.049
lessness+0.049
aky+0.046
 crystals+0.044
ruck+0.042
Neg logit contributions
 surely-0.061
 pose-0.057
ue-0.056
 drib-0.056
 salon-0.054
Token was
Token ID373
Feature activation+1.76e-02

Pos logit contributions
 surely+0.016
 pose+0.016
olog+0.015
 drib+0.015
 som+0.015
Neg logit contributions
lessness-0.015
pt-0.014
 twins-0.013
aky-0.013
 crystals-0.013
Token a
Token ID257
Feature activation+1.08e-02

Pos logit contributions
lessness+0.033
pt+0.029
iness+0.025
aky+0.023
 crystals+0.023
Neg logit contributions
 drib-0.028
 surely-0.028
 pose-0.026
tons-0.026
V-0.025
Token your
Token ID534
Feature activation+2.74e-03

Pos logit contributions
 surely+0.077
 pose+0.075
olog+0.075
 seventeen+0.072
 drib+0.071
Neg logit contributions
lessness-0.060
pt-0.059
 twins-0.055
 everywhere-0.052
 crystals-0.051
Token feelings
Token ID7666
Feature activation+7.45e-02

Pos logit contributions
lessness+0.004
pt+0.004
 crystals+0.003
 everywhere+0.003
aky+0.003
Neg logit contributions
 pose-0.005
 surely-0.005
olog-0.005
 drib-0.005
 som-0.005
Token.
Token ID13
Feature activation+4.98e-03

Pos logit contributions
lessness+0.088
pt+0.084
 crystals+0.071
 twins+0.069
aky+0.068
Neg logit contributions
 pose-0.165
 surely-0.159
olog-0.152
 drib-0.150
 score-0.149
Token If
Token ID1002
Feature activation-5.98e-03

Pos logit contributions
lessness+0.008
pt+0.007
 twins+0.007
aky+0.006
 crystals+0.006
Neg logit contributions
 pose-0.009
 surely-0.009
 drib-0.009
 som-0.009
olog-0.009
Token something
Token ID1223
Feature activation+3.58e-02

Pos logit contributions
 seventeen+0.012
 surely+0.012
olog+0.012
 pose+0.012
 drib+0.012
Neg logit contributions
lessness-0.008
pt-0.008
 twins-0.008
 crystals-0.007
 everywhere-0.007
Token makes
Token ID1838
Feature activation-3.29e-03

Pos logit contributions
lessness+0.067
pt+0.064
aky+0.057
 twins+0.057
 crystals+0.056
Neg logit contributions
 pose-0.057
 surely-0.056
olog-0.053
 drib-0.052
 som-0.051
Token you
Token ID345
Feature activation+3.75e-02

Pos logit contributions
 pose+0.006
 surely+0.006
olog+0.006
 drib+0.005
 seventeen+0.005
Neg logit contributions
lessness-0.005
pt-0.005
 twins-0.005
 crystals-0.004
aky-0.004
Token feel
Token ID1254
Feature activation+3.12e-02

Pos logit contributions
lessness+0.060
pt+0.053
 crystals+0.049
aky+0.048
 twins+0.047
Neg logit contributions
 pose-0.071
 surely-0.070
 score-0.064
 som-0.062
olog-0.062
Token bad
Token ID2089
Feature activation+1.41e-02

Pos logit contributions
lessness+0.050
pt+0.047
 crystals+0.045
 twins+0.042
aky+0.042
Neg logit contributions
 surely-0.054
 pose-0.052
 som-0.048
olog-0.048
 drib-0.047
Token,
Token ID11
Feature activation-2.74e-02

Pos logit contributions
lessness+0.024
pt+0.022
 crystals+0.021
aky+0.021
ovable+0.020
Neg logit contributions
 pose-0.024
 surely-0.022
olog-0.022
zes-0.021
 score-0.021
Token you
Token ID345
Feature activation+1.21e-03

Pos logit contributions
olog+0.052
 pose+0.052
 surely+0.051
 seventeen+0.051
 drib+0.050
Neg logit contributions
lessness-0.040
pt-0.038
 twins-0.036
 who-0.034
 of-0.034
Token They
Token ID1119
Feature activation-1.13e-02

Pos logit contributions
lessness+0.006
pt+0.006
aky+0.005
 crystals+0.005
 everywhere+0.005
Neg logit contributions
 surely-0.006
 pose-0.006
olog-0.006
 seventeen-0.006
 drib-0.006
Token surprised
Token ID6655
Feature activation-4.45e-03

Pos logit contributions
 surely+0.022
 pose+0.021
 jugg+0.021
 drib+0.021
 seventeen+0.020
Neg logit contributions
lessness-0.017
pt-0.016
aky-0.014
 twins-0.014
 crystals-0.013
Token dad
Token ID9955
Feature activation+4.87e-02

Pos logit contributions
 pose+0.008
 surely+0.008
 drib+0.008
 score+0.008
zes+0.008
Neg logit contributions
lessness-0.007
pt-0.007
aky-0.006
ovable-0.006
 crystals-0.005
Token when
Token ID618
Feature activation+6.69e-03

Pos logit contributions
lessness+0.078
pt+0.074
 crystals+0.064
aky+0.063
 twins+0.060
Neg logit contributions
 pose-0.092
 surely-0.089
 drib-0.086
olog-0.086
 score-0.084
Token he
Token ID339
Feature activation-1.36e-02

Pos logit contributions
lessness+0.012
pt+0.011
aky+0.010
 everywhere+0.010
 crystals+0.010
Neg logit contributions
 surely-0.011
 pose-0.011
olog-0.010
 drib-0.010
 som-0.010
Token came
Token ID1625
Feature activation-2.39e-02

Pos logit contributions
 pose+0.026
 surely+0.026
olog+0.026
 seventeen+0.024
 drib+0.024
Neg logit contributions
pt-0.020
lessness-0.020
 twins-0.017
 everywhere-0.017
 crystals-0.017
Token home
Token ID1363
Feature activation+1.08e-02

Pos logit contributions
 surely+0.043
 pose+0.041
 score+0.039
 drib+0.038
zes+0.036
Neg logit contributions
lessness-0.041
pt-0.036
aky-0.032
 crystals-0.032
ovable-0.031
Token.
Token ID13
Feature activation+7.79e-03

Pos logit contributions
lessness+0.011
pt+0.011
 twins+0.008
 crystals+0.008
aky+0.008
Neg logit contributions
 pose-0.026
 surely-0.026
olog-0.025
 drib-0.024
 seventeen-0.024
Token He
Token ID679
Feature activation-1.57e-02

Pos logit contributions
pt+0.014
lessness+0.014
aky+0.012
 crystals+0.012
 everywhere+0.012
Neg logit contributions
 surely-0.012
olog-0.012
 pose-0.011
 seventeen-0.011
 drib-0.011
Token was
Token ID373
Feature activation-5.34e-04

Pos logit contributions
 surely+0.032
 pose+0.032
olog+0.031
+0.030
 seventeen+0.030
Neg logit contributions
pt-0.021
lessness-0.021
 twins-0.019
 everywhere-0.018
 crystals-0.018
Token happy
Token ID3772
Feature activation+1.45e-02

Pos logit contributions
 surely+0.001
 pose+0.001
olog+0.001
 drib+0.001
 seventeen+0.001
Neg logit contributions
lessness-0.001
pt-0.001
 crystals-0.001
 twins-0.001
 everywhere-0.001
Token new
Token ID649
Feature activation+1.81e-02

Pos logit contributions
lessness+0.080
pt+0.075
 twins+0.064
 crystals+0.064
aky+0.063
Neg logit contributions
 pose-0.094
 surely-0.092
 drib-0.089
 score-0.087
 som-0.087
Token shoes
Token ID10012
Feature activation+1.20e-02

Pos logit contributions
lessness+0.027
pt+0.026
 twins+0.022
 everywhere+0.022
aky+0.022
Neg logit contributions
 pose-0.035
 surely-0.034
olog-0.033
 drib-0.033
 som-0.032
Token.
Token ID13
Feature activation+2.94e-02

Pos logit contributions
lessness+0.015
pt+0.014
 twins+0.012
 crystals+0.012
aky+0.011
Neg logit contributions
 surely-0.026
 pose-0.026
 drib-0.025
olog-0.025
 score-0.024
Token As
Token ID1081
Feature activation+2.87e-03

Pos logit contributions
lessness+0.046
pt+0.045
 twins+0.039
 crystals+0.039
 everywhere+0.039
Neg logit contributions
 surely-0.054
 pose-0.053
olog-0.052
 drib-0.051
 seventeen-0.050
Token the
Token ID262
Feature activation-3.04e-02

Pos logit contributions
lessness+0.005
pt+0.004
 twins+0.004
 everywhere+0.004
 crystals+0.004
Neg logit contributions
 pose-0.005
 surely-0.005
olog-0.005
 seventeen-0.005
 drib-0.005
Token three
Token ID1115
Feature activation-3.51e-02

Pos logit contributions
 pose+0.059
 surely+0.059
olog+0.057
 drib+0.055
 seventeen+0.055
Neg logit contributions
lessness-0.046
pt-0.044
 twins-0.038
 everywhere-0.037
 crystals-0.037
Token of
Token ID286
Feature activation-3.09e-02

Pos logit contributions
 pose+0.073
 surely+0.073
olog+0.071
 drib+0.069
 seventeen+0.069
Neg logit contributions
lessness-0.047
pt-0.045
 twins-0.038
 everywhere-0.037
 crystals-0.037
Token them
Token ID606
Feature activation+3.57e-02

Pos logit contributions
 surely+0.060
 pose+0.059
olog+0.057
 drib+0.057
 seventeen+0.056
Neg logit contributions
lessness-0.046
pt-0.044
 twins-0.039
 everywhere-0.038
 crystals-0.038
Token were
Token ID547
Feature activation+2.65e-02

Pos logit contributions
pt+0.053
lessness+0.052
 everywhere+0.045
 twins+0.043
aky+0.042
Neg logit contributions
 pose-0.069
 surely-0.068
olog-0.068
 seventeen-0.068
 som-0.065
Token walking
Token ID6155
Feature activation-1.36e-02

Pos logit contributions
lessness+0.038
pt+0.038
 everywhere+0.034
 twins+0.033
aky+0.032
Neg logit contributions
 surely-0.052
 pose-0.052
olog-0.051
-0.049
 seventeen-0.049
Token,
Token ID11
Feature activation+2.52e-03

Pos logit contributions
 surely+0.025
 pose+0.024
olog+0.023
 drib+0.023
+0.022
Neg logit contributions
lessness-0.022
pt-0.021
 twins-0.019
 crystals-0.019
aky-0.019
Token it
Token ID340
Feature activation+2.94e-03

Pos logit contributions
olog+0.077
 pose+0.074
 surely+0.073
 seventeen+0.073
ven+0.070
Neg logit contributions
pt-0.043
 twins-0.040
 who-0.040
lessness-0.040
 everywhere-0.039
Token.
Token ID13
Feature activation+5.99e-03

Pos logit contributions
lessness+0.003
pt+0.002
 twins+0.002
 everywhere+0.002
 crystals+0.002
Neg logit contributions
 surely-0.008
 pose-0.008
olog-0.007
 drib-0.007
 seventeen-0.007
Token
Token ID198
Feature activation-2.13e-02

Pos logit contributions
lessness+0.005
pt+0.005
 twins+0.005
 of+0.004
 who+0.004
Neg logit contributions
 pose-0.015
 surely-0.015
olog-0.015
 coaster-0.014
 som-0.014
Token
Token ID198
Feature activation-3.41e-02

Pos logit contributions
 surely+0.047
 pose+0.046
olog+0.045
 seventeen+0.044
 drib+0.044
Neg logit contributions
lessness-0.026
pt-0.025
 twins-0.022
 everywhere-0.021
aky-0.020
TokenThe
Token ID464
Feature activation-5.17e-02

Pos logit contributions
 surely+0.078
 coaster+0.075
olog+0.074
 pose+0.074
 destination+0.074
Neg logit contributions
 of-0.035
 twins-0.033
aky-0.033
 who-0.032
lessness-0.032
Token cater
Token ID20825
Feature activation-6.47e-02

Pos logit contributions
 surely+0.094
 pose+0.093
olog+0.091
 seventeen+0.088
 drib+0.088
Neg logit contributions
lessness-0.083
pt-0.080
 twins-0.072
 crystals-0.070
 everywhere-0.070
Tokenpillar
Token ID41643
Feature activation-2.53e-02

Pos logit contributions
 surely+0.133
 pose+0.131
olog+0.130
 seventeen+0.126
 drib+0.125
Neg logit contributions
lessness-0.091
pt-0.086
 twins-0.074
 crystals-0.073
aky-0.073
Token said
Token ID531
Feature activation+2.61e-02

Pos logit contributions
 surely+0.052
olog+0.052
 seventeen+0.050
+0.049
 pose+0.049
Neg logit contributions
pt-0.034
lessness-0.032
 who-0.030
 everywhere-0.030
 twins-0.030
Token hello
Token ID23748
Feature activation-1.05e-02

Pos logit contributions
lessness+0.034
pt+0.033
 twins+0.028
 everywhere+0.027
 crystals+0.027
Neg logit contributions
 surely-0.056
 pose-0.056
olog-0.054
 drib-0.054
 seventeen-0.052
Token back
Token ID736
Feature activation-2.76e-02

Pos logit contributions
 pose+0.022
 surely+0.022
olog+0.021
 drib+0.020
 seventeen+0.020
Neg logit contributions
lessness-0.014
pt-0.013
 crystals-0.011
 twins-0.011
aky-0.011
Token.
Token ID13
Feature activation-9.26e-03

Pos logit contributions
 pose+0.069
 surely+0.069
olog+0.067
 drib+0.066
 seventeen+0.065
Neg logit contributions
lessness-0.026
pt-0.024
 twins-0.019
 crystals-0.018
aky-0.018
Token so
Token ID523
Feature activation-2.13e-03

Pos logit contributions
lessness+0.004
pt+0.004
 twins+0.003
 everywhere+0.003
aky+0.003
Neg logit contributions
 pose-0.011
 surely-0.011
olog-0.011
 drib-0.011
 seventeen-0.011
Token happy
Token ID3772
Feature activation+7.11e-03

Pos logit contributions
 surely+0.005
 pose+0.005
olog+0.004
 drib+0.004
 seventeen+0.004
Neg logit contributions
lessness-0.003
pt-0.003
 twins-0.002
 crystals-0.002
 everywhere-0.002
Token that
Token ID326
Feature activation+6.25e-03

Pos logit contributions
lessness+0.010
pt+0.010
aky+0.009
 twins+0.008
 crystals+0.008
Neg logit contributions
 pose-0.014
 surely-0.014
 drib-0.014
 som-0.014
olog-0.013
Token he
Token ID339
Feature activation-2.24e-03

Pos logit contributions
lessness+0.008
pt+0.007
 twins+0.007
 everywhere+0.007
 crystals+0.006
Neg logit contributions
 pose-0.013
olog-0.013
 surely-0.013
 seventeen-0.013
 drib-0.013
Token started
Token ID2067
Feature activation+8.19e-03

Pos logit contributions
 pose+0.005
 surely+0.004
 seventeen+0.004
olog+0.004
+0.004
Neg logit contributions
lessness-0.003
pt-0.003
 everywhere-0.003
 twins-0.002
aky-0.002
Token to
Token ID284
Feature activation-3.78e-02

Pos logit contributions
lessness+0.011
pt+0.010
 twins+0.009
 everywhere+0.009
 crystals+0.009
Neg logit contributions
 surely-0.017
 pose-0.017
olog-0.017
 drib-0.016
 seventeen-0.016
Token jump
Token ID4391
Feature activation-1.53e-02

Pos logit contributions
 surely+0.062
olog+0.061
 pose+0.061
 seventeen+0.058
ven+0.057
Neg logit contributions
lessness-0.065
pt-0.065
 twins-0.058
 everywhere-0.056
 crystals-0.056
Token up
Token ID510
Feature activation-2.09e-02

Pos logit contributions
 surely+0.032
 pose+0.032
olog+0.030
 drib+0.030
 seventeen+0.030
Neg logit contributions
lessness-0.021
pt-0.020
 twins-0.017
aky-0.016
 crystals-0.016
Token and
Token ID290
Feature activation+1.20e-02

Pos logit contributions
 pose+0.048
 surely+0.047
olog+0.046
 seventeen+0.045
 drib+0.045
Neg logit contributions
lessness-0.024
pt-0.022
 twins-0.019
 everywhere-0.019
 crystals-0.018
Token down
Token ID866
Feature activation-4.12e-02

Pos logit contributions
pt+0.017
lessness+0.017
 everywhere+0.017
 crystals+0.015
 twins+0.015
Neg logit contributions
olog-0.022
ven-0.021
 seventeen-0.021
Too-0.021
 surely-0.021
Token in
Token ID287
Feature activation-3.15e-02

Pos logit contributions
 surely+0.089
 pose+0.089
olog+0.089
 seventeen+0.086
 drib+0.085
Neg logit contributions
lessness-0.051
pt-0.048
 everywhere-0.041
 twins-0.041
 crystals-0.040
Token pretty
Token ID2495
Feature activation-2.83e-02

Pos logit contributions
 pose+0.003
 surely+0.003
 drib+0.003
olog+0.003
 score+0.003
Neg logit contributions
lessness-0.003
pt-0.003
aky-0.002
 twins-0.002
ovable-0.002
Token carpet
Token ID20710
Feature activation+2.18e-03

Pos logit contributions
 pose+0.057
 surely+0.056
olog+0.054
 drib+0.053
 seventeen+0.052
Neg logit contributions
lessness-0.041
pt-0.039
 twins-0.033
aky-0.033
 everywhere-0.032
Token on
Token ID319
Feature activation-3.20e-02

Pos logit contributions
lessness+0.003
pt+0.003
 twins+0.002
 everywhere+0.002
 crystals+0.002
Neg logit contributions
 surely-0.005
 pose-0.005
olog-0.005
 drib-0.004
 seventeen-0.004
Token the
Token ID262
Feature activation+1.21e-02

Pos logit contributions
 pose+0.060
 surely+0.059
olog+0.059
 seventeen+0.057
 drib+0.057
Neg logit contributions
lessness-0.048
pt-0.047
 everywhere-0.042
 twins-0.042
 crystals-0.041
Token wall
Token ID3355
Feature activation-1.76e-04

Pos logit contributions
lessness+0.021
pt+0.020
 twins+0.018
 everywhere+0.018
 crystals+0.018
Neg logit contributions
 pose-0.020
 surely-0.020
olog-0.019
 drib-0.019
 seventeen-0.019
Token?"
Token ID1701
Feature activation-2.80e-02

Pos logit contributions
 surely+0.000
 pose+0.000
olog+0.000
 drib+0.000
 seventeen+0.000
Neg logit contributions
lessness-0.000
pt-0.000
 twins-0.000
 everywhere-0.000
 crystals-0.000
Token asked
Token ID1965
Feature activation+1.90e-02

Pos logit contributions
 surely+0.058
 pose+0.057
 seventeen+0.057
olog+0.057
 drib+0.055
Neg logit contributions
pt-0.036
lessness-0.035
 twins-0.033
 who-0.032
 everywhere-0.031
Token Alice
Token ID14862
Feature activation+4.17e-03

Pos logit contributions
lessness+0.033
pt+0.032
 twins+0.029
 everywhere+0.028
 crystals+0.028
Neg logit contributions
 pose-0.032
 surely-0.032
olog-0.031
 drib-0.030
 seventeen-0.030
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
lessness+0.004
pt+0.003
 twins+0.003
 everywhere+0.003
 crystals+0.003
Neg logit contributions
 pose-0.011
 surely-0.011
olog-0.010
 drib-0.010
 seventeen-0.010
Token
Token ID220
Feature activation-1.49e-02

Pos logit contributions
lessness+0.012
 twins+0.012
pt+0.011
 who+0.011
 everywhere+0.011
Neg logit contributions
 pose-0.034
 surely-0.033
 som-0.033
 seventeen-0.032
olog-0.032
Token
Token ID198
Feature activation+7.66e-04

Pos logit contributions
 surely+0.027
 pose+0.027
olog+0.026
 drib+0.026
 seventeen+0.025
Neg logit contributions
lessness-0.023
pt-0.022
 crystals-0.020
aky-0.019
 twins-0.019
Token gigg
Token ID30442
Feature activation-1.10e-02

Pos logit contributions
lessness+0.073
pt+0.071
 twins+0.062
 everywhere+0.060
 crystals+0.059
Neg logit contributions
 pose-0.100
 surely-0.100
olog-0.097
 seventeen-0.095
 drib-0.094
Tokenled
Token ID992
Feature activation+1.97e-02

Pos logit contributions
 surely+0.025
 pose+0.024
 drib+0.024
olog+0.024
zes+0.023
Neg logit contributions
lessness-0.013
pt-0.011
 twins-0.010
 crystals-0.010
 everywhere-0.010
Token and
Token ID290
Feature activation+1.33e-02

Pos logit contributions
lessness+0.028
pt+0.026
 crystals+0.023
 twins+0.022
aky+0.022
Neg logit contributions
 surely-0.039
 pose-0.038
 drib-0.038
olog-0.037
-0.036
Token said
Token ID531
Feature activation+4.28e-02

Pos logit contributions
lessness+0.013
pt+0.013
 who+0.011
 everywhere+0.011
 twins+0.010
Neg logit contributions
 surely-0.031
 pose-0.031
olog-0.031
 seventeen-0.031
elve-0.030
Token
Token ID6184
Feature activation-7.71e-02

Pos logit contributions
lessness+0.061
pt+0.061
 twins+0.050
 everywhere+0.050
 of+0.049
Neg logit contributions
 pose-0.086
 surely-0.085
 drib-0.081
olog-0.080
 seventeen-0.079
Token
Token ID95
Feature activation-8.38e-02

Pos logit contributions
 surely+0.093
 seventeen+0.086
 pose+0.085
 halftime+0.080
 leans+0.077
Neg logit contributions
pt-0.155
lessness-0.146
 who-0.137
ovable-0.127
sm-0.126
Token
Token ID26391
Feature activation-8.22e-02

Pos logit contributions
 surely+0.121
 halftime+0.114
 pose+0.111
 jugg+0.110
 dod+0.108
Neg logit contributions
lessness-0.143
aky-0.130
 who-0.128
ady-0.127
pt-0.124
Token
Token ID129
Feature activation-9.36e-02

Pos logit contributions
 surely+0.141
 seventeen+0.129
 pose+0.128
 drib+0.126
 sixty+0.123
Neg logit contributions
lessness-0.138
 twins-0.128
pt-0.125
 who-0.123
 everywhere-0.119
Token
Token ID241
Feature activation+6.87e-03

Pos logit contributions
 surely+0.145
 pose+0.136
 salon+0.133
 halftime+0.133
 jugg+0.129
Neg logit contributions
ady-0.147
pt-0.140
aky-0.136
 adorable-0.134
 twins-0.133
TokenWe
Token ID1135
Feature activation-9.67e-03

Pos logit contributions
lessness+0.012
pt+0.011
 twins+0.010
 everywhere+0.010
aky+0.010
Neg logit contributions
 surely-0.012
 pose-0.012
olog-0.011
 seventeen-0.011
 drib-0.011
Token can
Token ID460
Feature activation-4.47e-02

Pos logit contributions
 surely+0.019
 pose+0.018
olog+0.018
 drib+0.018
 jugg+0.017
Neg logit contributions
lessness-0.015
pt-0.013
 twins-0.013
 crystals-0.012
 everywhere-0.012
Tokenwed
Token ID19103
Feature activation+1.95e-02

Pos logit contributions
 surely+0.168
 pose+0.168
olog+0.162
 drib+0.157
 seventeen+0.155
Neg logit contributions
lessness-0.172
pt-0.167
 twins-0.148
 crystals-0.145
 everywhere-0.145
Token and
Token ID290
Feature activation-1.47e-02

Pos logit contributions
lessness+0.023
pt+0.022
 twins+0.019
 everywhere+0.019
 crystals+0.018
Neg logit contributions
 pose-0.043
 surely-0.043
olog-0.042
 drib-0.041
 seventeen-0.041
Token che
Token ID1125
Feature activation-8.94e-02

Pos logit contributions
olog+0.026
 surely+0.025
 pose+0.025
+0.024
 seventeen+0.023
Neg logit contributions
pt-0.024
lessness-0.024
 crystals-0.022
 everywhere-0.022
 twins-0.021
Tokenwed
Token ID19103
Feature activation+1.87e-02

Pos logit contributions
 surely+0.167
 pose+0.165
olog+0.162
 drib+0.153
 seventeen+0.152
Neg logit contributions
lessness-0.140
pt-0.139
aky-0.120
 crystals-0.117
 twins-0.116
Token and
Token ID290
Feature activation-1.37e-02

Pos logit contributions
lessness+0.025
pt+0.024
 twins+0.020
 everywhere+0.020
 crystals+0.020
Neg logit contributions
 pose-0.039
 surely-0.039
olog-0.038
 drib-0.037
 seventeen-0.037
Token che
Token ID1125
Feature activation-8.79e-02

Pos logit contributions
olog+0.022
 surely+0.021
+0.021
 pose+0.021
Too+0.020
Neg logit contributions
lessness-0.024
pt-0.024
 everywhere-0.023
 crystals-0.023
 twins-0.021
Tokenwed
Token ID19103
Feature activation+1.68e-02

Pos logit contributions
 surely+0.168
 pose+0.165
olog+0.163
 drib+0.152
+0.152
Neg logit contributions
pt-0.135
lessness-0.134
aky-0.117
 crystals-0.112
 everywhere-0.112
Token on
Token ID319
Feature activation-4.91e-02

Pos logit contributions
lessness+0.024
pt+0.023
 twins+0.019
 everywhere+0.019
 crystals+0.019
Neg logit contributions
 pose-0.034
 surely-0.034
olog-0.033
 drib-0.032
 seventeen-0.032
Token the
Token ID262
Feature activation+7.09e-03

Pos logit contributions
 surely+0.101
olog+0.100
 pose+0.099
 seventeen+0.094
 drib+0.093
Neg logit contributions
lessness-0.064
pt-0.062
 everywhere-0.058
 twins-0.057
 crystals-0.055
Token shoe
Token ID17292
Feature activation-6.88e-03

Pos logit contributions
lessness+0.012
pt+0.011
 twins+0.010
 everywhere+0.010
 crystals+0.010
Neg logit contributions
 surely-0.012
 pose-0.012
olog-0.012
 drib-0.012
 seventeen-0.012
Token.
Token ID13
Feature activation+7.51e-03

Pos logit contributions
 surely+0.017
 pose+0.017
olog+0.016
 seventeen+0.016
 drib+0.016
Neg logit contributions
lessness-0.007
pt-0.006
 everywhere-0.005
 twins-0.005
 crystals-0.005
Tokeno
Token ID78
Feature activation-3.60e-02

Pos logit contributions
 pose+0.097
 surely+0.097
olog+0.093
 drib+0.092
 seventeen+0.091
Neg logit contributions
lessness-0.084
pt-0.082
 twins-0.072
 crystals-0.070
 everywhere-0.070
Token never
Token ID1239
Feature activation-3.30e-02

Pos logit contributions
 pose+0.070
 surely+0.070
olog+0.068
 seventeen+0.067
 som+0.066
Neg logit contributions
lessness-0.051
pt-0.051
 twins-0.044
 everywhere-0.043
aky-0.043
Token forgot
Token ID16453
Feature activation+5.29e-02

Pos logit contributions
 surely+0.063
 pose+0.063
olog+0.061
 seventeen+0.060
 drib+0.059
Neg logit contributions
lessness-0.050
pt-0.048
 twins-0.042
 everywhere-0.041
 crystals-0.041
Token the
Token ID262
Feature activation-1.05e-02

Pos logit contributions
lessness+0.082
pt+0.077
aky+0.071
 twins+0.068
 crystals+0.066
Neg logit contributions
 surely-0.097
 pose-0.097
 drib-0.093
 som-0.093
olog-0.091
Token fast
Token ID3049
Feature activation-4.85e-02

Pos logit contributions
 surely+0.020
 pose+0.019
olog+0.019
 drib+0.018
 seventeen+0.018
Neg logit contributions
lessness-0.016
pt-0.016
 twins-0.014
 crystals-0.014
 everywhere-0.014
Token che
Token ID1125
Feature activation-9.71e-02

Pos logit contributions
 surely+0.090
olog+0.088
 pose+0.085
+0.082
ven+0.082
Neg logit contributions
lessness-0.074
pt-0.073
 twins-0.068
 crystals-0.066
 of-0.065
Tokenet
Token ID316
Feature activation-5.65e-02

Pos logit contributions
olog+0.155
 surely+0.147
+0.140
 pose+0.140
 jugg+0.130
Neg logit contributions
pt-0.180
lessness-0.178
aky-0.170
 crystals-0.158
ruck-0.152
Tokenah
Token ID993
Feature activation-2.62e-02

Pos logit contributions
olog+0.109
 pose+0.108
 surely+0.108
+0.102
 seventeen+0.102
Neg logit contributions
lessness-0.082
pt-0.078
 of-0.070
 twins-0.068
 everywhere-0.067
Token he
Token ID339
Feature activation+1.33e-02

Pos logit contributions
 surely+0.044
olog+0.043
 pose+0.043
 drib+0.041
+0.041
Neg logit contributions
lessness-0.046
pt-0.043
aky-0.039
 twins-0.039
 everywhere-0.038
Token had
Token ID550
Feature activation+4.06e-02

Pos logit contributions
pt+0.016
lessness+0.016
aky+0.013
 everywhere+0.013
 twins+0.013
Neg logit contributions
 pose-0.029
 surely-0.029
olog-0.028
 seventeen-0.028
 salon-0.027
Token seen
Token ID1775
Feature activation+7.16e-02

Pos logit contributions
lessness+0.054
pt+0.051
 twins+0.044
 crystals+0.043
 everywhere+0.043
Neg logit contributions
 pose-0.085
 surely-0.085
olog-0.082
 drib-0.081
 som-0.080
Token than
Token ID621
Feature activation-3.95e-03

Pos logit contributions
 surely+0.236
 pose+0.234
olog+0.230
 drib+0.224
 seventeen+0.222
Neg logit contributions
lessness-0.127
pt-0.121
 twins-0.103
 crystals-0.101
 everywhere-0.100
Token Uncle
Token ID23169
Feature activation+2.45e-02

Pos logit contributions
 surely+0.007
 pose+0.007
olog+0.006
 drib+0.006
 som+0.006
Neg logit contributions
lessness-0.007
pt-0.007
 twins-0.006
aky-0.006
 crystals-0.006
Token!
Token ID0
Feature activation+2.04e-02

Pos logit contributions
lessness+0.042
pt+0.040
 twins+0.036
 crystals+0.035
 everywhere+0.035
Neg logit contributions
 pose-0.042
 surely-0.042
olog-0.040
 drib-0.039
 seventeen-0.039
Token
Token ID198
Feature activation+4.14e-03

Pos logit contributions
lessness+0.029
pt+0.027
 twins+0.025
 everywhere+0.023
 crystals+0.022
Neg logit contributions
 pose-0.042
 surely-0.041
olog-0.039
 drib-0.039
zes-0.038
Token
Token ID198
Feature activation-5.81e-03

Pos logit contributions
lessness+0.006
pt+0.005
 twins+0.005
 everywhere+0.004
aky+0.004
Neg logit contributions
 pose-0.009
 surely-0.009
olog-0.008
 drib-0.008
 seventeen-0.008
TokenUn
Token ID3118
Feature activation-1.08e-01

Pos logit contributions
 pose+0.009
 surely+0.009
olog+0.009
 drib+0.009
zes+0.009
Neg logit contributions
lessness-0.010
pt-0.010
aky-0.009
 twins-0.009
 who-0.009
Tokencle
Token ID2375
Feature activation+1.11e-02

Pos logit contributions
 pose+0.165
 drib+0.156
 som+0.153
 surely+0.150
 score+0.149
Neg logit contributions
lessness-0.209
pt-0.193
ruck-0.181
aky-0.181
 twins-0.176
Token was
Token ID373
Feature activation+4.57e-03

Pos logit contributions
lessness+0.018
pt+0.017
 twins+0.016
 everywhere+0.015
 crystals+0.015
Neg logit contributions
 pose-0.020
 surely-0.020
olog-0.019
 drib-0.018
 seventeen-0.018
Token so
Token ID523
Feature activation-2.59e-03

Pos logit contributions
lessness+0.004
pt+0.003
 twins+0.003
 everywhere+0.003
aky+0.002
Neg logit contributions
 pose-0.012
olog-0.012
 surely-0.012
-0.012
zes-0.011
Token happy
Token ID3772
Feature activation+8.62e-03

Pos logit contributions
 pose+0.006
 surely+0.006
olog+0.005
 drib+0.005
 seventeen+0.005
Neg logit contributions
lessness-0.003
pt-0.003
 twins-0.003
 everywhere-0.003
aky-0.003
Token he
Token ID339
Feature activation+2.46e-03

Pos logit contributions
lessness+0.012
pt+0.012
aky+0.011
 crystals+0.010
 twins+0.010
Neg logit contributions
 pose-0.017
 surely-0.017
 drib-0.016
olog-0.016
 som-0.016
Token the
Token ID262
Feature activation+1.32e-02

Pos logit contributions
lessness+0.013
pt+0.012
aky+0.012
ovable+0.011
 twins+0.011
Neg logit contributions
 surely-0.014
 drib-0.013
 pose-0.013
 score-0.013
 som-0.013
Token window
Token ID4324
Feature activation+8.09e-03

Pos logit contributions
lessness+0.025
pt+0.024
ovable+0.020
 crystals+0.020
aky+0.020
Neg logit contributions
 pose-0.020
 surely-0.020
 drib-0.020
 score-0.019
 salon-0.019
Token and
Token ID290
Feature activation-4.50e-03

Pos logit contributions
lessness+0.010
pt+0.010
aky+0.008
 crystals+0.008
 twins+0.008
Neg logit contributions
 surely-0.018
 pose-0.018
olog-0.017
 drib-0.017
 seventeen-0.016
Token saw
Token ID2497
Feature activation+4.27e-02

Pos logit contributions
 pose+0.009
 surely+0.009
olog+0.009
 seventeen+0.009
 drib+0.009
Neg logit contributions
lessness-0.006
pt-0.006
 twins-0.005
 everywhere-0.005
 crystals-0.005
Token the
Token ID262
Feature activation-1.83e-02

Pos logit contributions
lessness+0.102
aky+0.098
ady+0.088
ovable+0.088
pt+0.086
Neg logit contributions
 coaster-0.041
 seventeen-0.040
 surely-0.039
 som-0.039
 drib-0.039
Token re
Token ID302
Feature activation-8.32e-02

Pos logit contributions
 jugg+0.026
 surely+0.025
 pose+0.025
 coaster+0.025
 bar+0.025
Neg logit contributions
lessness-0.036
pt-0.034
aky-0.029
ovable-0.029
 crystals-0.028
Tokenind
Token ID521
Feature activation-1.96e-02

Pos logit contributions
 pose+0.161
 surely+0.161
olog+0.156
 drib+0.152
 seventeen+0.151
Neg logit contributions
lessness-0.124
pt-0.119
 twins-0.104
 everywhere-0.101
 crystals-0.101
Tokeneer
Token ID28153
Feature activation+1.22e-02

Pos logit contributions
 pose+0.040
 surely+0.039
zes+0.038
ro+0.037
 bar+0.037
Neg logit contributions
pt-0.023
lessness-0.023
 twins-0.017
 everywhere-0.017
 crystals-0.017
Token too
Token ID1165
Feature activation-2.12e-02

Pos logit contributions
lessness+0.019
pt+0.017
aky+0.016
 crystals+0.015
ovable+0.014
Neg logit contributions
 pose-0.023
 surely-0.022
 drib-0.022
 score-0.021
 jugg-0.021
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
 pose+0.049
 surely+0.048
 drib+0.047
olog+0.047
 jugg+0.046
Neg logit contributions
lessness-0.023
pt-0.022
 crystals-0.017
aky-0.017
 everywhere-0.017
Token Tim
Token ID5045
Feature activation-9.88e-03

Pos logit contributions
lessness+0.022
pt+0.021
 twins+0.019
 everywhere+0.019
 crystals+0.018
Neg logit contributions
 pose-0.041
 surely-0.040
olog-0.037
 score-0.037
 drib-0.037