TOP ACTIVATIONS
MAX = 12.301

do not like mosquitoes . Mos quit oes are small and
're welcome , Mr . Mos qu ito ! I 'm
" Hello , Mr . Mos qu ito ! What are
that ?" " Mos quit oes !" Tom said
!" Lily said . " Mos quit oes are bad .
. She said , " Mos quit oes are important too
hurt you . But you shouldn 't go into the garage
t you know you shouldn â t speed
said . " But you shouldn 't go to the hill
said . " But you shouldn 't go to the woods
fun . But maybe we shouldn 't eat it . Maybe
, B ongo . We shouldn 't go near it ."
a mosquito flying around . Mos quit oes were Tim my
You 're right . We shouldn 't eat it . But
're safe . But you shouldn 't have crossed that bridge
the arm chair . We shouldn 't jump on it ."
very mean boy . He shouldn 't have taken your doll
's dangerous . And you shouldn 't take lumber that doesn
's not nice . We shouldn 't do that ."
she said . " We shouldn 't do that . It

MIN ACTIVATIONS
MIN = -13.183

am , until finally he arrived at the end of the
than when the woman had arrived . <|endoftext|> It was a
continued on their journey and arrived at the hot land .
the lake was bigger and deeper than the stream , but
Lily 's friend Tim my arrived at her house . He
and finally , the license arrived in the mail . It
finish line , Bob had arrived first ! Peter was very
It was coming from down underground . She pe ered down
felt like forever , they arrived at the zoo . Little
's house . When they arrived , Lily 's mom knocked
bolts , but when he arrived the lid was wide open
when suddenly , something strange arrived . They stopped dancing and
were glad that he had arrived . He was a very
until he was b owing deeper than anyone else ! The
stops they made until they arrived at the store .
Finally , they arrived at the man â
After an hour , they arrived at Steve 's home .
was so excited when Daddy arrived . She ran to him
When they arrived , she saw so many
held the man until backup arrived . Fortunately for the man

INTERVAL 5.930 - 12.301
CONTAINS 3.778%

one . " Yes , it is . But
<|endoftext|> Tom and Mia are friends . They like to play
scared , and he could not sleep at night . The
says . " No , it 's mine !"
igator . It said " Don 't be afraid , I

INTERVAL -0.441 - 5.930
CONTAINS 58.396%

Milo chased a butterfly into a me adow . He disappeared
inside to find something to wipe her hands with . As
. We made him together . We should share him ."
special ." Fl uffy and Lucy were very happy and went
pony asked L izzy , " Do you want to take

INTERVAL -6.812 - -0.441
CONTAINS 32.290%

a nap under the tree when he felt hot and dry
to be together . <|endoftext|> Once there was a town with
land . Suddenly , he saw a storm coming .
. The boy eagerly walked through the broken door
, he went to a toy store and filled his car

INTERVAL -13.183 - -6.812
CONTAINS 5.536%

use it again . They buried it with Lily and hoped
care of it , so he could play with it .
the tracks . Every morning , the train conductor would make
of them laughed and laughed , feeling silly and relieved .
M um said , " Let 's sit down
Token do
Token ID466
Feature activation+2.15e+00

Pos logit contributions
pit+0.042
 nicely+0.042
,+0.042
ider+0.038
 gym+0.038
Neg logit contributions
leaf-0.078
keeping-0.072
 sorts-0.070
vous-0.064
 kinds-0.062
Token not
Token ID407
Feature activation+6.33e+00

Pos logit contributions
leaf+0.069
 sorts+0.066
keeping+0.060
vous+0.059
 kinds+0.058
Neg logit contributions
pit-0.044
ider-0.041
uddle-0.037
bles-0.035
angers-0.035
Token like
Token ID588
Feature activation+3.39e+00

Pos logit contributions
leaf+0.209
keeping+0.181
 sorts+0.176
Isn+0.162
vous+0.161
Neg logit contributions
 nicely-0.133
,-0.132
pit-0.122
ider-0.115
 gym-0.113
Token mosquitoes
Token ID35573
Feature activation+7.53e+00

Pos logit contributions
leaf+0.142
vous+0.125
 sorts+0.125
Dear+0.122
keeping+0.122
Neg logit contributions
,-0.049
 gym-0.043
 nicely-0.042
pit-0.039
 hockey-0.037
Token.
Token ID13
Feature activation+6.06e-01

Pos logit contributions
leaf+0.290
 sorts+0.262
keeping+0.253
vous+0.241
 kinds+0.234
Neg logit contributions
pit-0.122
ider-0.111
 nicely-0.103
uddle-0.099
,-0.097
Token Mos
Token ID5826
Feature activation+1.23e+01

Pos logit contributions
leaf+0.020
 sorts+0.019
Isn+0.018
keeping+0.017
 kinds+0.017
Neg logit contributions
pit-0.011
ider-0.011
bles-0.009
uddle-0.009
 patiently-0.009
Tokenquit
Token ID47391
Feature activation+3.31e+00

Pos logit contributions
leaf+0.330
keeping+0.276
 sorts+0.268
Dear+0.244
 faith+0.243
Neg logit contributions
pit-0.324
uddle-0.312
ider-0.311
,-0.310
 gym-0.309
Tokenoes
Token ID3028
Feature activation-1.76e-01

Pos logit contributions
leaf+0.108
keeping+0.093
 sorts+0.089
 faith+0.087
vous+0.085
Neg logit contributions
pit-0.073
 nicely-0.070
,-0.069
ider-0.065
uddle-0.064
Token are
Token ID389
Feature activation+8.54e-01

Pos logit contributions
,+0.004
 nicely+0.004
pit+0.003
uddle+0.003
ider+0.003
Neg logit contributions
leaf-0.006
keeping-0.005
 sorts-0.005
vous-0.005
Dear-0.005
Token small
Token ID1402
Feature activation-1.00e+00

Pos logit contributions
leaf+0.032
keeping+0.029
 sorts+0.028
Dear+0.026
vous+0.025
Neg logit contributions
,-0.015
 nicely-0.014
balls-0.014
ider-0.013
pit-0.013
Token and
Token ID290
Feature activation+4.56e+00

Pos logit contributions
pit+0.010
,+0.009
ider+0.008
 nicely+0.008
uddle+0.007
Neg logit contributions
leaf-0.046
 sorts-0.041
keeping-0.040
vous-0.039
Dear-0.038
Token're
Token ID821
Feature activation+7.96e-01

Pos logit contributions
keeping+0.202
leaf+0.196
 sorts+0.170
 harm+0.169
 harmony+0.162
Neg logit contributions
,-0.135
 nicely-0.113
bles-0.110
pit-0.107
ingu-0.107
Token welcome
Token ID7062
Feature activation+6.17e-01

Pos logit contributions
leaf+0.023
keeping+0.020
 sorts+0.020
vous+0.018
Dear+0.017
Neg logit contributions
pit-0.020
 nicely-0.020
,-0.019
ider-0.019
 Later-0.018
Token,
Token ID11
Feature activation-8.39e+00

Pos logit contributions
leaf+0.026
 sorts+0.023
keeping+0.023
vous+0.022
 fright+0.021
Neg logit contributions
pit-0.008
ider-0.007
uddle-0.006
 nicely-0.006
bles-0.005
Token Mr
Token ID1770
Feature activation+3.69e+00

Pos logit contributions
pit+0.127
,+0.120
 nicely+0.117
ider+0.110
 hockey+0.107
Neg logit contributions
leaf-0.333
keeping-0.310
 sorts-0.298
vous-0.275
Dear-0.270
Token.
Token ID13
Feature activation-2.78e-01

Pos logit contributions
leaf+0.138
 sorts+0.124
keeping+0.120
vous+0.114
 kinds+0.111
Neg logit contributions
pit-0.064
ider-0.060
 nicely-0.055
uddle-0.053
,-0.050
Token Mos
Token ID5826
Feature activation+1.20e+01

Pos logit contributions
pit+0.007
ider+0.006
,+0.006
 gym+0.006
uddle+0.006
Neg logit contributions
leaf-0.009
 sorts-0.008
keeping-0.007
 kinds-0.007
vous-0.007
Tokenqu
Token ID421
Feature activation-4.94e+00

Pos logit contributions
leaf+0.342
 sorts+0.293
keeping+0.291
 faith+0.269
 rid+0.261
Neg logit contributions
ider-0.297
pit-0.295
uddle-0.283
,-0.278
 nicely-0.274
Tokenito
Token ID10094
Feature activation+5.55e+00

Pos logit contributions
pit+0.090
ider+0.084
uddle+0.080
 nicely+0.079
,+0.072
Neg logit contributions
leaf-0.180
keeping-0.162
 sorts-0.161
vous-0.146
 fright-0.145
Token!
Token ID0
Feature activation+1.25e+00

Pos logit contributions
leaf+0.188
 sorts+0.165
keeping+0.163
vous+0.150
Dear+0.147
Neg logit contributions
pit-0.116
ider-0.109
,-0.103
 nicely-0.102
uddle-0.099
Token I
Token ID314
Feature activation+6.16e+00

Pos logit contributions
leaf+0.040
keeping+0.037
 sorts+0.036
 harm+0.034
 faith+0.032
Neg logit contributions
ider-0.026
pit-0.025
 His-0.024
 nicely-0.024
 Later-0.024
Token'm
Token ID1101
Feature activation+2.83e+00

Pos logit contributions
leaf+0.182
keeping+0.177
ymes+0.152
weet+0.150
icated+0.146
Neg logit contributions
,-0.154
 nicely-0.142
ider-0.141
pit-0.137
 patiently-0.135
Token "
Token ID366
Feature activation+4.51e+00

Pos logit contributions
pit+0.098
ider+0.086
uddle+0.073
 nicely+0.070
 camps+0.060
Neg logit contributions
leaf-0.369
 sorts-0.337
keeping-0.325
vous-0.313
 kinds-0.306
TokenHello
Token ID15496
Feature activation-1.87e+00

Pos logit contributions
leaf+0.147
 sorts+0.128
keeping+0.117
 outage+0.115
 kinds+0.113
Neg logit contributions
pit-0.102
 Later-0.093
ider-0.090
 nicely-0.089
 One-0.089
Token,
Token ID11
Feature activation-8.38e+00

Pos logit contributions
pit+0.017
ider+0.016
 nicely+0.011
angers+0.010
bles+0.010
Neg logit contributions
leaf-0.082
 sorts-0.077
keeping-0.073
vous-0.073
 kinds-0.071
Token Mr
Token ID1770
Feature activation+3.54e+00

Pos logit contributions
,+0.175
 nicely+0.169
uddle+0.163
pit+0.152
 gym+0.147
Neg logit contributions
keeping-0.295
leaf-0.282
 sorts-0.266
 kinds-0.239
vous-0.235
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
leaf+0.129
 sorts+0.116
keeping+0.113
vous+0.107
 kinds+0.104
Neg logit contributions
pit-0.064
ider-0.059
 nicely-0.054
uddle-0.053
bles-0.049
Token Mos
Token ID5826
Feature activation+1.20e+01

Pos logit contributions
leaf+0.004
 sorts+0.003
keeping+0.003
 kinds+0.003
vous+0.003
Neg logit contributions
pit-0.004
ider-0.004
,-0.003
uddle-0.003
 nicely-0.003
Tokenqu
Token ID421
Feature activation-4.83e+00

Pos logit contributions
leaf+0.314
 sorts+0.268
keeping+0.257
 faith+0.235
 kinds+0.228
Neg logit contributions
pit-0.327
ider-0.322
uddle-0.306
,-0.300
 nicely-0.299
Tokenito
Token ID10094
Feature activation+5.66e+00

Pos logit contributions
pit+0.101
ider+0.095
uddle+0.094
 nicely+0.090
,+0.084
Neg logit contributions
leaf-0.163
keeping-0.146
 sorts-0.145
vous-0.129
Dear-0.128
Token!
Token ID0
Feature activation+1.53e+00

Pos logit contributions
leaf+0.198
 sorts+0.173
keeping+0.172
vous+0.158
Dear+0.155
Neg logit contributions
pit-0.113
ider-0.106
,-0.101
 nicely-0.100
uddle-0.099
Token What
Token ID1867
Feature activation+3.32e+00

Pos logit contributions
leaf+0.044
 sorts+0.039
keeping+0.038
vous+0.034
 kinds+0.034
Neg logit contributions
pit-0.039
ider-0.038
 nicely-0.035
uddle-0.034
,-0.034
Token are
Token ID389
Feature activation+5.64e-01

Pos logit contributions
leaf+0.119
keeping+0.110
 sorts+0.102
Dear+0.095
vous+0.094
Neg logit contributions
,-0.062
pit-0.058
 nicely-0.057
 gym-0.052
 hockey-0.051
Token that
Token ID326
Feature activation-4.34e-01

Pos logit contributions
leaf+0.054
 sorts+0.047
keeping+0.046
vous+0.043
 fright+0.041
Neg logit contributions
pit-0.041
ider-0.038
uddle-0.035
 nicely-0.035
 Later-0.033
Token?"
Token ID1701
Feature activation+3.74e+00

Pos logit contributions
pit+0.010
ider+0.009
 nicely+0.009
uddle+0.009
,+0.009
Neg logit contributions
leaf-0.014
 sorts-0.012
keeping-0.012
vous-0.011
 kinds-0.011
Token
Token ID198
Feature activation-3.44e-01

Pos logit contributions
leaf+0.127
 sorts+0.114
keeping+0.110
vous+0.103
Dear+0.100
Neg logit contributions
pit-0.077
ider-0.072
uddle-0.065
 nicely-0.065
bles-0.061
Token
Token ID198
Feature activation-2.92e-01

Pos logit contributions
pit+0.008
ider+0.008
uddle+0.007
 nicely+0.007
 camps+0.006
Neg logit contributions
leaf-0.011
 sorts-0.009
keeping-0.009
Dear-0.008
vous-0.008
Token"
Token ID1
Feature activation+4.51e+00

Pos logit contributions
pit+0.009
ider+0.008
 camps+0.007
uddle+0.007
angers+0.007
Neg logit contributions
leaf-0.007
keeping-0.006
Dear-0.006
 sorts-0.006
vous-0.006
TokenMos
Token ID32668
Feature activation+1.17e+01

Pos logit contributions
leaf+0.137
 sorts+0.133
keeping+0.124
 kinds+0.116
 faith+0.110
Neg logit contributions
ider-0.111
pit-0.102
balls-0.101
 nicely-0.097
bles-0.096
Tokenquit
Token ID47391
Feature activation+3.33e+00

Pos logit contributions
leaf+0.298
 sorts+0.259
keeping+0.250
vous+0.222
 kinds+0.220
Neg logit contributions
pit-0.329
ider-0.320
uddle-0.303
 nicely-0.298
,-0.289
Tokenoes
Token ID3028
Feature activation-2.60e-01

Pos logit contributions
leaf+0.107
keeping+0.090
 sorts+0.089
 faith+0.084
vous+0.081
Neg logit contributions
pit-0.078
,-0.077
bles-0.071
 nicely-0.071
ider-0.070
Token!"
Token ID2474
Feature activation+4.20e+00

Pos logit contributions
,+0.006
pit+0.005
 nicely+0.005
uddle+0.005
ider+0.005
Neg logit contributions
leaf-0.009
keeping-0.008
 sorts-0.008
vous-0.007
Dear-0.007
Token Tom
Token ID4186
Feature activation-3.00e-01

Pos logit contributions
leaf+0.158
 sorts+0.142
keeping+0.138
vous+0.129
 fright+0.128
Neg logit contributions
pit-0.072
ider-0.066
uddle-0.058
 nicely-0.057
bles-0.054
Token said
Token ID531
Feature activation-2.26e+00

Pos logit contributions
pit+0.007
ider+0.007
 nicely+0.007
uddle+0.006
,+0.006
Neg logit contributions
leaf-0.009
 sorts-0.008
keeping-0.008
vous-0.007
 kinds-0.007
Token!"
Token ID2474
Feature activation+4.57e+00

Pos logit contributions
leaf+0.273
keeping+0.236
 sorts+0.230
vous+0.209
Isn+0.209
Neg logit contributions
,-0.143
 nicely-0.119
uddle-0.115
pit-0.112
ider-0.110
Token Lily
Token ID20037
Feature activation+1.50e+00

Pos logit contributions
leaf+0.172
 sorts+0.158
keeping+0.154
vous+0.146
 fright+0.143
Neg logit contributions
pit-0.076
ider-0.068
uddle-0.061
bles-0.057
 camps-0.057
Token said
Token ID531
Feature activation-2.12e+00

Pos logit contributions
leaf+0.044
 sorts+0.039
keeping+0.038
vous+0.034
Dear+0.033
Neg logit contributions
pit-0.038
ider-0.036
 nicely-0.034
,-0.034
uddle-0.034
Token.
Token ID13
Feature activation+2.17e-01

Pos logit contributions
pit+0.054
 camps+0.048
uddle+0.047
ider+0.046
 Later+0.045
Neg logit contributions
leaf-0.060
keeping-0.053
 sorts-0.051
 fright-0.051
 harm-0.048
Token "
Token ID366
Feature activation+4.79e+00

Pos logit contributions
leaf+0.007
keeping+0.006
 sorts+0.006
 fright+0.006
don+0.005
Neg logit contributions
pit-0.005
 camps-0.005
ig-0.004
ider-0.004
uddle-0.004
TokenMos
Token ID32668
Feature activation+1.16e+01

Pos logit contributions
leaf+0.146
 sorts+0.143
keeping+0.132
 kinds+0.125
 faith+0.118
Neg logit contributions
 nicely-0.102
ider-0.101
His-0.096
bles-0.094
balls-0.093
Tokenquit
Token ID47391
Feature activation+3.17e+00

Pos logit contributions
leaf+0.284
 sorts+0.244
keeping+0.242
 kinds+0.207
 faith+0.207
Neg logit contributions
pit-0.329
ider-0.324
uddle-0.308
 nicely-0.301
 gym-0.296
Tokenoes
Token ID3028
Feature activation-3.82e-01

Pos logit contributions
leaf+0.111
keeping+0.095
 faith+0.092
 sorts+0.092
 fright+0.086
Neg logit contributions
,-0.064
pit-0.063
 nicely-0.059
bles-0.057
ider-0.055
Token are
Token ID389
Feature activation+6.89e-01

Pos logit contributions
,+0.008
pit+0.008
 nicely+0.008
ider+0.008
uddle+0.008
Neg logit contributions
leaf-0.013
keeping-0.011
 sorts-0.011
vous-0.010
Dear-0.010
Token bad
Token ID2089
Feature activation+4.34e+00

Pos logit contributions
leaf+0.026
keeping+0.023
 sorts+0.023
Dear+0.021
vous+0.020
Neg logit contributions
,-0.012
 nicely-0.012
balls-0.011
ider-0.011
pit-0.011
Token.
Token ID13
Feature activation+7.45e-02

Pos logit contributions
leaf+0.168
 sorts+0.148
keeping+0.141
vous+0.136
Dear+0.132
Neg logit contributions
,-0.073
pit-0.073
ider-0.068
 nicely-0.063
uddle-0.061
Token.
Token ID13
Feature activation+5.87e-01

Pos logit contributions
pit+0.072
ider+0.067
uddle+0.061
 nicely+0.058
 camps+0.057
Neg logit contributions
leaf-0.098
 sorts-0.089
keeping-0.087
 fright-0.079
vous-0.078
Token She
Token ID1375
Feature activation-4.98e+00

Pos logit contributions
leaf+0.015
keeping+0.014
 sorts+0.014
 shouldn+0.014
don+0.013
Neg logit contributions
ider-0.015
pit-0.014
bles-0.013
uing-0.013
 camps-0.013
Token said
Token ID531
Feature activation-2.44e+00

Pos logit contributions
pit+0.126
ider+0.120
 nicely+0.111
uddle+0.109
bles+0.103
Neg logit contributions
leaf-0.146
 sorts-0.128
keeping-0.121
vous-0.115
 shouldn-0.111
Token,
Token ID11
Feature activation-8.43e+00

Pos logit contributions
pit+0.048
ider+0.037
 rink+0.036
 camps+0.035
uddle+0.033
Neg logit contributions
leaf-0.090
 shouldn-0.080
 fright-0.078
 sorts-0.075
sm-0.075
Token "
Token ID366
Feature activation+4.59e+00

Pos logit contributions
pit+0.174
uddle+0.153
ider+0.142
 camps+0.137
erve+0.131
Neg logit contributions
leaf-0.294
 shouldn-0.244
 sorts-0.241
Isn-0.241
keeping-0.239
TokenMos
Token ID32668
Feature activation+1.15e+01

Pos logit contributions
leaf+0.132
 sorts+0.119
keeping+0.116
 kinds+0.104
 faith+0.102
Neg logit contributions
His-0.107
pit-0.102
erve-0.098
 Later-0.098
 nicely-0.097
Tokenquit
Token ID47391
Feature activation+3.22e+00

Pos logit contributions
leaf+0.273
 sorts+0.234
keeping+0.227
vous+0.197
 kinds+0.195
Neg logit contributions
pit-0.343
ider-0.335
uddle-0.318
 nicely-0.314
,-0.302
Tokenoes
Token ID3028
Feature activation-4.36e-01

Pos logit contributions
leaf+0.105
keeping+0.091
 sorts+0.090
 faith+0.084
vous+0.083
Neg logit contributions
pit-0.071
,-0.066
 nicely-0.066
ider-0.064
uddle-0.062
Token are
Token ID389
Feature activation+8.68e-01

Pos logit contributions
pit+0.010
,+0.009
 nicely+0.009
ider+0.009
uddle+0.009
Neg logit contributions
leaf-0.014
keeping-0.012
 sorts-0.012
vous-0.011
Dear-0.011
Token important
Token ID1593
Feature activation+5.71e+00

Pos logit contributions
leaf+0.031
keeping+0.028
 sorts+0.028
Dear+0.026
vous+0.025
Neg logit contributions
,-0.016
 nicely-0.015
balls-0.015
ider-0.014
pit-0.014
Token too
Token ID1165
Feature activation+6.81e+00

Pos logit contributions
leaf+0.215
 sorts+0.197
keeping+0.186
vous+0.185
Dear+0.179
Neg logit contributions
,-0.092
pit-0.089
ider-0.089
 nicely-0.084
balls-0.080
Token hurt
Token ID5938
Feature activation+2.39e+00

Pos logit contributions
leaf+0.209
 sorts+0.182
keeping+0.174
vous+0.161
 kinds+0.156
Neg logit contributions
pit-0.191
ider-0.181
 nicely-0.172
uddle-0.168
,-0.163
Token you
Token ID345
Feature activation+5.77e+00

Pos logit contributions
leaf+0.076
 sorts+0.068
vous+0.058
 kinds+0.057
keeping+0.056
Neg logit contributions
,-0.058
 nicely-0.057
pit-0.055
ider-0.052
bles-0.050
Token.
Token ID13
Feature activation-1.79e-01

Pos logit contributions
leaf+0.205
 sorts+0.183
keeping+0.171
vous+0.165
 kinds+0.161
Neg logit contributions
pit-0.112
ider-0.106
 nicely-0.103
,-0.099
uddle-0.096
Token But
Token ID887
Feature activation+1.05e+00

Pos logit contributions
pit+0.003
ider+0.003
 nicely+0.003
uddle+0.003
,+0.003
Neg logit contributions
leaf-0.006
 sorts-0.006
keeping-0.006
vous-0.005
 kinds-0.005
Token you
Token ID345
Feature activation+5.76e+00

Pos logit contributions
leaf+0.031
 sorts+0.027
keeping+0.026
vous+0.024
 kinds+0.023
Neg logit contributions
pit-0.027
ider-0.026
 nicely-0.024
uddle-0.024
,-0.023
Token shouldn
Token ID6584
Feature activation+1.15e+01

Pos logit contributions
leaf+0.201
 sorts+0.176
keeping+0.175
ymes+0.158
 harmony+0.158
Neg logit contributions
,-0.130
 nicely-0.108
 One-0.107
ingu-0.094
bles-0.093
Token't
Token ID470
Feature activation+7.29e+00

Pos logit contributions
leaf+0.419
 sorts+0.377
keeping+0.349
 kinds+0.340
vous+0.338
Neg logit contributions
pit-0.204
,-0.201
ider-0.192
 nicely-0.189
uddle-0.179
Token go
Token ID467
Feature activation-3.80e+00

Pos logit contributions
leaf+0.230
 sorts+0.204
keeping+0.196
vous+0.184
 kinds+0.177
Neg logit contributions
pit-0.170
ider-0.159
 nicely-0.147
uddle-0.146
bles-0.138
Token into
Token ID656
Feature activation-4.19e+00

Pos logit contributions
pit+0.096
ider+0.086
uddle+0.082
oner+0.081
uing+0.079
Neg logit contributions
leaf-0.102
keeping-0.099
 harm-0.092
 sorts-0.091
 fright-0.087
Token the
Token ID262
Feature activation+3.26e-02

Pos logit contributions
,+0.110
 nicely+0.107
pit+0.106
 hockey+0.103
ider+0.101
Neg logit contributions
leaf-0.123
 sorts-0.098
vous-0.092
keeping-0.086
Dear-0.084
Token garage
Token ID15591
Feature activation-4.64e+00

Pos logit contributions
leaf+0.001
 sorts+0.001
keeping+0.001
vous+0.001
 kinds+0.001
Neg logit contributions
pit-0.001
ider-0.001
 nicely-0.000
uddle-0.000
bles-0.000
Token
Token ID8151
Feature activation+5.56e+00

Pos logit contributions
pit+0.043
ider+0.041
 nicely+0.038
uddle+0.037
bles+0.035
Neg logit contributions
leaf-0.060
 sorts-0.053
keeping-0.051
vous-0.048
 kinds-0.046
Tokent
Token ID83
Feature activation+8.88e+00

Pos logit contributions
leaf+0.156
 sorts+0.137
keeping+0.131
vous+0.122
 kinds+0.117
Neg logit contributions
pit-0.144
ider-0.137
 nicely-0.129
uddle-0.127
bles-0.121
Token you
Token ID345
Feature activation+5.91e+00

Pos logit contributions
leaf+0.312
 sorts+0.269
keeping+0.268
vous+0.243
Dear+0.240
Neg logit contributions
pit-0.171
,-0.167
 nicely-0.162
ider-0.162
uddle-0.148
Token know
Token ID760
Feature activation+2.54e+00

Pos logit contributions
leaf+0.198
keeping+0.178
 sorts+0.176
vous+0.158
 kinds+0.155
Neg logit contributions
,-0.124
pit-0.118
 nicely-0.117
ider-0.116
uddle-0.108
Token you
Token ID345
Feature activation+5.87e+00

Pos logit contributions
leaf+0.095
 sorts+0.085
keeping+0.082
vous+0.078
 kinds+0.077
Neg logit contributions
pit-0.043
ider-0.040
uddle-0.035
 nicely-0.034
 Later-0.032
Token shouldn
Token ID6584
Feature activation+1.15e+01

Pos logit contributions
leaf+0.194
keeping+0.173
 sorts+0.167
vous+0.156
Oh+0.148
Neg logit contributions
,-0.137
 nicely-0.123
pit-0.116
ider-0.111
 One-0.108
Tokenâ
Token ID22940
Feature activation+4.26e+00

Pos logit contributions
leaf+0.395
 sorts+0.354
keeping+0.338
vous+0.317
 kinds+0.315
Neg logit contributions
pit-0.225
,-0.216
ider-0.214
 nicely-0.207
uddle-0.200
Token
Token ID26391
Feature activation-1.82e+00

Pos logit contributions
leaf+0.115
 sorts+0.099
keeping+0.095
vous+0.088
 kinds+0.084
Neg logit contributions
pit-0.118
ider-0.112
 nicely-0.106
uddle-0.105
,-0.100
Token
Token ID8151
Feature activation+5.62e+00

Pos logit contributions
pit+0.040
ider+0.038
 nicely+0.035
uddle+0.035
,+0.034
Neg logit contributions
leaf-0.060
 sorts-0.053
keeping-0.050
vous-0.048
 kinds-0.047
Tokent
Token ID83
Feature activation+8.89e+00

Pos logit contributions
leaf+0.169
 sorts+0.148
keeping+0.141
vous+0.132
 faith+0.129
Neg logit contributions
pit-0.138
ider-0.131
 nicely-0.125
uddle-0.122
,-0.121
Token speed
Token ID2866
Feature activation-1.34e+00

Pos logit contributions
leaf+0.311
 sorts+0.273
keeping+0.267
vous+0.251
Dear+0.242
Neg logit contributions
pit-0.176
ider-0.164
 nicely-0.156
,-0.151
uddle-0.149
Token said
Token ID531
Feature activation-2.04e+00

Pos logit contributions
leaf+0.127
 sorts+0.100
vous+0.083
keeping+0.082
Dear+0.081
Neg logit contributions
pit-0.215
ider-0.204
 camps-0.193
 gym-0.192
uddle-0.192
Token.
Token ID13
Feature activation+2.75e-01

Pos logit contributions
pit+0.056
 camps+0.049
uddle+0.048
 gym+0.048
 rink+0.046
Neg logit contributions
leaf-0.056
keeping-0.051
 sorts-0.047
 fright-0.044
 harm-0.043
Token "
Token ID366
Feature activation+5.18e+00

Pos logit contributions
leaf+0.007
Dear+0.006
Isn+0.006
keeping+0.006
Don+0.006
Neg logit contributions
pit-0.007
uddle-0.007
ider-0.007
ig-0.007
erve-0.006
TokenBut
Token ID1537
Feature activation+1.27e+00

Pos logit contributions
leaf+0.130
 sorts+0.129
 kinds+0.113
keeping+0.106
 fright+0.105
Neg logit contributions
ider-0.144
 nicely-0.141
pit-0.134
His-0.128
balls-0.126
Token you
Token ID345
Feature activation+5.81e+00

Pos logit contributions
leaf+0.033
 sorts+0.028
keeping+0.027
vous+0.024
 faith+0.024
Neg logit contributions
pit-0.036
ider-0.034
uddle-0.033
 nicely-0.031
 Later-0.031
Token shouldn
Token ID6584
Feature activation+1.15e+01

Pos logit contributions
leaf+0.201
keeping+0.175
 sorts+0.171
ymes+0.163
 harmony+0.153
Neg logit contributions
,-0.133
 One-0.110
 nicely-0.103
ingu-0.096
 p-0.094
Token't
Token ID470
Feature activation+7.27e+00

Pos logit contributions
leaf+0.448
 sorts+0.411
 kinds+0.376
keeping+0.376
vous+0.368
Neg logit contributions
,-0.180
pit-0.164
 gym-0.155
 nicely-0.155
ider-0.155
Token go
Token ID467
Feature activation-3.59e+00

Pos logit contributions
leaf+0.245
 sorts+0.218
keeping+0.210
vous+0.197
 kinds+0.192
Neg logit contributions
pit-0.151
ider-0.143
 nicely-0.136
uddle-0.132
,-0.129
Token to
Token ID284
Feature activation+3.40e+00

Pos logit contributions
pit+0.088
ider+0.077
uddle+0.075
ingu+0.072
 Days+0.072
Neg logit contributions
leaf-0.104
keeping-0.094
 sorts-0.090
 harm-0.090
 fright-0.087
Token the
Token ID262
Feature activation+2.11e-01

Pos logit contributions
leaf+0.098
 sorts+0.085
keeping+0.081
vous+0.076
 kinds+0.073
Neg logit contributions
pit-0.087
ider-0.082
 nicely-0.079
uddle-0.077
,-0.076
Token hill
Token ID12788
Feature activation-2.26e+00

Pos logit contributions
leaf+0.007
 sorts+0.006
keeping+0.006
vous+0.006
 fright+0.006
Neg logit contributions
pit-0.004
ider-0.004
bles-0.004
uddle-0.004
inn-0.004
Token said
Token ID531
Feature activation-1.99e+00

Pos logit contributions
pit+0.163
ider+0.156
 nicely+0.148
,+0.146
uddle+0.146
Neg logit contributions
leaf-0.194
 sorts-0.172
keeping-0.169
vous-0.155
 kinds-0.148
Token.
Token ID13
Feature activation+1.00e-01

Pos logit contributions
pit+0.049
ider+0.042
uddle+0.041
 camps+0.041
 gym+0.039
Neg logit contributions
leaf-0.062
keeping-0.054
 sorts-0.052
 fright-0.049
 rid-0.047
Token "
Token ID366
Feature activation+5.12e+00

Pos logit contributions
leaf+0.003
 sorts+0.003
keeping+0.003
Dear+0.003
Isn+0.003
Neg logit contributions
pit-0.002
uddle-0.002
ider-0.002
uing-0.002
ig-0.002
TokenBut
Token ID1537
Feature activation+1.18e+00

Pos logit contributions
leaf+0.123
 sorts+0.122
 kinds+0.105
keeping+0.099
 shouldn+0.098
Neg logit contributions
ider-0.152
 nicely-0.144
pit-0.141
balls-0.137
bles-0.131
Token you
Token ID345
Feature activation+5.80e+00

Pos logit contributions
leaf+0.036
 sorts+0.032
keeping+0.030
vous+0.029
 kinds+0.027
Neg logit contributions
pit-0.029
ider-0.027
 nicely-0.026
uddle-0.025
,-0.025
Token shouldn
Token ID6584
Feature activation+1.15e+01

Pos logit contributions
leaf+0.209
keeping+0.182
 sorts+0.178
ymes+0.166
 harmony+0.159
Neg logit contributions
,-0.125
 One-0.102
 nicely-0.099
ingu-0.095
bles-0.091
Token't
Token ID470
Feature activation+7.30e+00

Pos logit contributions
leaf+0.427
 sorts+0.389
keeping+0.358
 kinds+0.351
vous+0.348
Neg logit contributions
pit-0.191
,-0.189
ider-0.180
 nicely-0.175
balls-0.173
Token go
Token ID467
Feature activation-3.66e+00

Pos logit contributions
leaf+0.252
 sorts+0.225
keeping+0.216
vous+0.203
 kinds+0.200
Neg logit contributions
pit-0.143
ider-0.138
 nicely-0.133
,-0.129
uddle-0.127
Token to
Token ID284
Feature activation+3.38e+00

Pos logit contributions
pit+0.086
ider+0.075
uddle+0.072
oner+0.068
 Days+0.068
Neg logit contributions
leaf-0.111
keeping-0.100
 sorts-0.097
 harm-0.094
 fright-0.093
Token the
Token ID262
Feature activation+2.06e-01

Pos logit contributions
leaf+0.099
 sorts+0.087
keeping+0.082
vous+0.077
 kinds+0.074
Neg logit contributions
pit-0.085
ider-0.080
 nicely-0.077
uddle-0.075
,-0.074
Token woods
Token ID16479
Feature activation-6.36e+00

Pos logit contributions
leaf+0.007
 sorts+0.007
keeping+0.006
vous+0.006
Dear+0.006
Neg logit contributions
pit-0.004
ider-0.004
 nicely-0.003
uddle-0.003
bles-0.003
Token fun
Token ID1257
Feature activation+1.32e+00

Pos logit contributions
pit+0.013
ider+0.013
 nicely+0.013
,+0.012
uddle+0.012
Neg logit contributions
leaf-0.021
keeping-0.018
 sorts-0.018
vous-0.017
Dear-0.016
Token.
Token ID13
Feature activation+5.69e-02

Pos logit contributions
leaf+0.057
 sorts+0.050
keeping+0.049
vous+0.048
Dear+0.048
Neg logit contributions
,-0.016
pit-0.016
ider-0.014
 nicely-0.013
uddle-0.012
Token But
Token ID887
Feature activation+1.16e+00

Pos logit contributions
leaf+0.002
 sorts+0.002
keeping+0.002
vous+0.001
 kinds+0.001
Neg logit contributions
pit-0.001
ider-0.001
 nicely-0.001
,-0.001
 Later-0.001
Token maybe
Token ID3863
Feature activation+5.54e+00

Pos logit contributions
leaf+0.034
 sorts+0.030
keeping+0.029
vous+0.027
 kinds+0.026
Neg logit contributions
pit-0.029
ider-0.027
 nicely-0.026
uddle-0.025
,-0.024
Token we
Token ID356
Feature activation+6.13e+00

Pos logit contributions
leaf+0.164
 sorts+0.144
keeping+0.138
vous+0.128
 kinds+0.125
Neg logit contributions
pit-0.138
ider-0.130
 nicely-0.122
uddle-0.121
bles-0.114
Token shouldn
Token ID6584
Feature activation+1.15e+01

Pos logit contributions
leaf+0.205
keeping+0.181
 sorts+0.180
vous+0.165
 kinds+0.160
Neg logit contributions
,-0.130
 nicely-0.127
pit-0.122
uddle-0.112
 One-0.112
Token't
Token ID470
Feature activation+7.37e+00

Pos logit contributions
leaf+0.371
 sorts+0.330
keeping+0.315
vous+0.297
 kinds+0.289
Neg logit contributions
pit-0.256
ider-0.243
 nicely-0.229
uddle-0.223
,-0.222
Token eat
Token ID4483
Feature activation-2.33e+00

Pos logit contributions
leaf+0.212
 sorts+0.191
keeping+0.181
 fright+0.172
 shouldn+0.171
Neg logit contributions
pit-0.189
ider-0.176
uddle-0.160
angers-0.158
 camps-0.156
Token it
Token ID340
Feature activation-1.36e+00

Pos logit contributions
pit+0.061
ider+0.058
 nicely+0.054
uddle+0.054
 Later+0.051
Neg logit contributions
leaf-0.066
 sorts-0.058
keeping-0.055
vous-0.051
 kinds-0.050
Token.
Token ID13
Feature activation-3.42e-02

Pos logit contributions
pit+0.032
ider+0.030
uddle+0.029
 Days+0.027
 Later+0.027
Neg logit contributions
leaf-0.039
 sorts-0.035
keeping-0.034
vous-0.032
 kinds-0.032
Token Maybe
Token ID6674
Feature activation+4.72e+00

Pos logit contributions
ider+0.001
pit+0.001
 nicely+0.001
 Later+0.001
,+0.001
Neg logit contributions
leaf-0.001
 sorts-0.001
keeping-0.001
vous-0.001
 kinds-0.001
Token,
Token ID11
Feature activation-8.29e+00

Pos logit contributions
leaf+0.003
 sorts+0.003
keeping+0.003
 faith+0.003
Dear+0.003
Neg logit contributions
pit-0.001
,-0.001
ider-0.001
 nicely-0.001
uddle-0.001
Token B
Token ID347
Feature activation+8.40e-01

Pos logit contributions
pit+0.172
,+0.167
 nicely+0.160
ider+0.158
uddle+0.156
Neg logit contributions
leaf-0.279
 sorts-0.242
keeping-0.241
vous-0.229
 kinds-0.210
Tokenongo
Token ID25162
Feature activation+5.21e-01

Pos logit contributions
leaf+0.029
 sorts+0.025
keeping+0.025
vous+0.024
 shouldn+0.023
Neg logit contributions
pit-0.017
ider-0.016
 nicely-0.015
uddle-0.014
 camps-0.013
Token.
Token ID13
Feature activation+6.36e-01

Pos logit contributions
leaf+0.020
 sorts+0.018
keeping+0.017
vous+0.016
 faith+0.016
Neg logit contributions
pit-0.009
ider-0.008
 nicely-0.007
,-0.007
uddle-0.007
Token We
Token ID775
Feature activation+6.96e+00

Pos logit contributions
leaf+0.020
 sorts+0.018
keeping+0.017
vous+0.016
 kinds+0.015
Neg logit contributions
pit-0.014
ider-0.014
 nicely-0.013
 Later-0.013
,-0.013
Token shouldn
Token ID6584
Feature activation+1.15e+01

Pos logit contributions
leaf+0.197
keeping+0.167
 sorts+0.155
ymes+0.145
vous+0.143
Neg logit contributions
,-0.191
 nicely-0.177
 One-0.174
pit-0.168
 at-0.160
Token't
Token ID470
Feature activation+7.30e+00

Pos logit contributions
leaf+0.406
 sorts+0.361
keeping+0.343
 kinds+0.322
vous+0.321
Neg logit contributions
pit-0.221
,-0.208
ider-0.207
 nicely-0.198
uddle-0.191
Token go
Token ID467
Feature activation-3.69e+00

Pos logit contributions
leaf+0.250
 sorts+0.226
keeping+0.216
vous+0.206
 kinds+0.200
Neg logit contributions
pit-0.150
ider-0.141
 nicely-0.127
uddle-0.127
bles-0.119
Token near
Token ID1474
Feature activation-2.48e+00

Pos logit contributions
pit+0.079
ider+0.073
uddle+0.068
 nicely+0.064
 Later+0.062
Neg logit contributions
leaf-0.122
 sorts-0.109
keeping-0.107
vous-0.100
 fright-0.099
Token it
Token ID340
Feature activation-1.30e+00

Pos logit contributions
,+0.057
 nicely+0.051
pit+0.050
 car+0.046
 towers+0.045
Neg logit contributions
leaf-0.088
 sorts-0.074
vous-0.070
keeping-0.070
Dear-0.066
Token."
Token ID526
Feature activation+3.94e+00

Pos logit contributions
pit+0.035
ider+0.033
uddle+0.031
balls+0.030
bles+0.030
Neg logit contributions
leaf-0.031
 sorts-0.031
keeping-0.029
vous-0.027
 kinds-0.027
Token a
Token ID257
Feature activation+1.43e+00

Pos logit contributions
pit+0.067
,+0.066
 nicely+0.064
 hockey+0.059
 gym+0.057
Neg logit contributions
leaf-0.147
keeping-0.131
 sorts-0.129
Dear-0.122
vous-0.122
Token mosquito
Token ID35446
Feature activation+5.03e+00

Pos logit contributions
leaf+0.053
keeping+0.048
 sorts+0.047
vous+0.044
 kinds+0.043
Neg logit contributions
pit-0.026
uddle-0.023
,-0.022
 nicely-0.022
ider-0.022
Token flying
Token ID7348
Feature activation-7.93e+00

Pos logit contributions
leaf+0.199
 sorts+0.174
keeping+0.172
vous+0.161
 faith+0.160
Neg logit contributions
pit-0.085
,-0.080
ider-0.077
 nicely-0.069
uddle-0.068
Token around
Token ID1088
Feature activation-1.35e+00

Pos logit contributions
,+0.122
pit+0.117
 nicely+0.115
ider+0.102
 at+0.089
Neg logit contributions
leaf-0.327
keeping-0.283
 sorts-0.273
 faith-0.266
Dear-0.260
Token.
Token ID13
Feature activation+2.07e-01

Pos logit contributions
,+0.025
pit+0.022
 nicely+0.022
 until+0.019
 hockey+0.018
Neg logit contributions
leaf-0.054
 sorts-0.046
vous-0.044
keeping-0.044
Dear-0.044
Token Mos
Token ID5826
Feature activation+1.15e+01

Pos logit contributions
leaf+0.008
 sorts+0.008
keeping+0.007
vous+0.007
 kinds+0.007
Neg logit contributions
pit-0.003
ider-0.003
 nicely-0.002
uddle-0.002
,-0.002
Tokenquit
Token ID47391
Feature activation+2.42e+00

Pos logit contributions
leaf+0.304
 sorts+0.257
keeping+0.249
Dear+0.230
 faith+0.227
Neg logit contributions
pit-0.304
ider-0.291
uddle-0.287
,-0.283
 gym-0.282
Tokenoes
Token ID3028
Feature activation-1.07e+00

Pos logit contributions
leaf+0.080
keeping+0.067
 sorts+0.067
 faith+0.064
vous+0.062
Neg logit contributions
pit-0.054
,-0.049
 nicely-0.048
ider-0.047
uddle-0.046
Token were
Token ID547
Feature activation-9.42e-01

Pos logit contributions
pit+0.023
,+0.023
 nicely+0.022
ider+0.021
uddle+0.021
Neg logit contributions
leaf-0.035
keeping-0.030
 sorts-0.029
vous-0.027
Dear-0.027
Token Tim
Token ID5045
Feature activation+4.65e-01

Pos logit contributions
,+0.018
pit+0.017
 nicely+0.016
balls+0.016
ider+0.016
Neg logit contributions
leaf-0.033
keeping-0.029
 sorts-0.029
Dear-0.027
vous-0.027
Tokenmy
Token ID1820
Feature activation+1.90e+00

Pos logit contributions
leaf+0.017
 sorts+0.014
keeping+0.014
Dear+0.013
 faith+0.013
Neg logit contributions
pit-0.010
,-0.010
 nicely-0.009
bles-0.008
ider-0.008
TokenYou
Token ID1639
Feature activation+6.25e+00

Pos logit contributions
leaf+0.151
 sorts+0.134
keeping+0.131
 kinds+0.115
weet+0.113
Neg logit contributions
ider-0.121
pit-0.117
 nicely-0.114
balls-0.102
uddle-0.102
Token're
Token ID821
Feature activation+1.16e+00

Pos logit contributions
leaf+0.220
keeping+0.198
 sorts+0.194
vous+0.175
 harmony+0.169
Neg logit contributions
,-0.142
 One-0.120
 nicely-0.118
bles-0.112
pit-0.109
Token right
Token ID826
Feature activation-1.55e+00

Pos logit contributions
leaf+0.043
 sorts+0.039
keeping+0.038
vous+0.035
 kinds+0.035
Neg logit contributions
pit-0.020
ider-0.019
 nicely-0.018
,-0.017
uddle-0.017
Token.
Token ID13
Feature activation-1.39e-01

Pos logit contributions
pit+0.016
ider+0.015
uddle+0.013
 nicely+0.011
angers+0.011
Neg logit contributions
leaf-0.063
 sorts-0.059
keeping-0.058
 shouldn-0.055
 fright-0.055
Token We
Token ID775
Feature activation+7.08e+00

Pos logit contributions
pit+0.003
ider+0.003
 nicely+0.003
,+0.003
 Later+0.003
Neg logit contributions
leaf-0.004
 sorts-0.004
keeping-0.004
vous-0.004
 kinds-0.003
Token shouldn
Token ID6584
Feature activation+1.15e+01

Pos logit contributions
leaf+0.217
keeping+0.207
 sorts+0.183
vous+0.172
ymes+0.170
Neg logit contributions
,-0.180
 nicely-0.161
 One-0.156
uddle-0.145
pit-0.144
Token't
Token ID470
Feature activation+7.35e+00

Pos logit contributions
leaf+0.426
 sorts+0.385
keeping+0.360
vous+0.344
 kinds+0.343
Neg logit contributions
pit-0.200
,-0.197
ider-0.189
 nicely-0.181
balls-0.173
Token eat
Token ID4483
Feature activation-2.39e+00

Pos logit contributions
leaf+0.254
 sorts+0.224
keeping+0.219
vous+0.204
 faith+0.198
Neg logit contributions
pit-0.145
ider-0.138
 nicely-0.134
,-0.131
uddle-0.126
Token it
Token ID340
Feature activation-1.44e+00

Pos logit contributions
pit+0.059
ider+0.056
uddle+0.052
 nicely+0.051
 Later+0.049
Neg logit contributions
leaf-0.070
 sorts-0.062
keeping-0.060
vous-0.055
 kinds-0.054
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
pit+0.034
ider+0.030
uddle+0.029
 Days+0.027
 Later+0.027
Neg logit contributions
leaf-0.042
 sorts-0.038
keeping-0.037
vous-0.035
 kinds-0.034
Token But
Token ID887
Feature activation+9.67e-01

Pos logit contributions
ider+0.005
 nicely+0.005
pit+0.004
,+0.004
 Later+0.004
Neg logit contributions
leaf-0.008
 sorts-0.007
keeping-0.007
vous-0.006
 kinds-0.006
Token're
Token ID821
Feature activation+1.14e+00

Pos logit contributions
leaf+0.191
keeping+0.184
 sorts+0.168
 mosquit+0.161
ymes+0.161
Neg logit contributions
,-0.119
 nicely-0.099
pit-0.084
ider-0.084
 trade-0.083
Token safe
Token ID3338
Feature activation-2.23e-01

Pos logit contributions
leaf+0.036
 sorts+0.032
keeping+0.030
vous+0.028
 kinds+0.028
Neg logit contributions
pit-0.026
ider-0.025
 nicely-0.024
uddle-0.023
,-0.022
Token.
Token ID13
Feature activation-2.19e-01

Pos logit contributions
,+0.003
ider+0.003
pit+0.003
 nicely+0.003
 patiently+0.002
Neg logit contributions
leaf-0.009
 sorts-0.008
vous-0.008
keeping-0.007
 faith-0.007
Token But
Token ID887
Feature activation+1.10e+00

Pos logit contributions
ider+0.005
pit+0.005
 nicely+0.005
 Later+0.004
,+0.004
Neg logit contributions
leaf-0.007
 sorts-0.006
keeping-0.006
vous-0.005
 kinds-0.005
Token you
Token ID345
Feature activation+5.73e+00

Pos logit contributions
leaf+0.032
 sorts+0.028
keeping+0.027
vous+0.025
 kinds+0.024
Neg logit contributions
pit-0.028
ider-0.027
 nicely-0.026
uddle-0.025
,-0.024
Token shouldn
Token ID6584
Feature activation+1.14e+01

Pos logit contributions
leaf+0.209
keeping+0.192
 sorts+0.181
ymes+0.170
 kinds+0.162
Neg logit contributions
,-0.119
 One-0.096
 nicely-0.091
uddle-0.080
 trade-0.080
Token't
Token ID470
Feature activation+7.26e+00

Pos logit contributions
leaf+0.448
 sorts+0.409
keeping+0.378
 kinds+0.373
vous+0.368
Neg logit contributions
,-0.178
pit-0.163
 nicely-0.160
ider-0.160
 gym-0.156
Token have
Token ID423
Feature activation+3.81e+00

Pos logit contributions
leaf+0.258
 sorts+0.229
keeping+0.219
vous+0.209
 kinds+0.206
Neg logit contributions
 nicely-0.131
pit-0.131
,-0.131
ider-0.130
uddle-0.120
Token crossed
Token ID12606
Feature activation-4.93e+00

Pos logit contributions
leaf+0.108
 sorts+0.096
keeping+0.089
vous+0.084
 kinds+0.081
Neg logit contributions
pit-0.096
ider-0.093
 nicely-0.092
,-0.087
uddle-0.086
Token that
Token ID326
Feature activation-2.74e-01

Pos logit contributions
,+0.094
 nicely+0.093
pit+0.086
 hockey+0.079
 camps+0.075
Neg logit contributions
leaf-0.181
 sorts-0.159
vous-0.150
Dear-0.142
keeping-0.139
Token bridge
Token ID7696
Feature activation-3.41e+00

Pos logit contributions
,+0.005
pit+0.005
uddle+0.005
 gym+0.004
 car+0.004
Neg logit contributions
leaf-0.012
keeping-0.010
 sorts-0.009
vous-0.009
Dear-0.009
Token the
Token ID262
Feature activation+3.01e-01

Pos logit contributions
leaf+0.002
 sorts+0.002
keeping+0.002
vous+0.002
Dear+0.002
Neg logit contributions
,-0.001
 nicely-0.001
pit-0.001
 gym-0.001
ider-0.001
Token arm
Token ID3211
Feature activation+4.79e+00

Pos logit contributions
leaf+0.011
 sorts+0.010
keeping+0.010
 kinds+0.009
 faith+0.009
Neg logit contributions
pit-0.005
ider-0.005
 gym-0.005
 nicely-0.005
,-0.004
Tokenchair
Token ID16337
Feature activation+1.77e+00

Pos logit contributions
leaf+0.168
 sorts+0.142
keeping+0.139
 faith+0.131
vous+0.129
Neg logit contributions
pit-0.101
ider-0.093
,-0.091
 nicely-0.089
uddle-0.087
Token.
Token ID13
Feature activation+1.97e-01

Pos logit contributions
leaf+0.071
 sorts+0.061
keeping+0.059
vous+0.057
 faith+0.056
Neg logit contributions
pit-0.029
ider-0.027
 nicely-0.026
,-0.026
uddle-0.023
Token We
Token ID775
Feature activation+7.20e+00

Pos logit contributions
leaf+0.007
 sorts+0.006
keeping+0.006
vous+0.005
 kinds+0.005
Neg logit contributions
ider-0.004
pit-0.004
 nicely-0.004
 Later-0.004
 One-0.004
Token shouldn
Token ID6584
Feature activation+1.14e+01

Pos logit contributions
leaf+0.238
keeping+0.218
 sorts+0.201
vous+0.195
ymes+0.192
Neg logit contributions
,-0.167
 nicely-0.150
 One-0.141
 at-0.133
uddle-0.131
Token't
Token ID470
Feature activation+7.34e+00

Pos logit contributions
leaf+0.439
 sorts+0.403
keeping+0.374
 kinds+0.368
vous+0.361
Neg logit contributions
,-0.184
pit-0.176
 nicely-0.173
ider-0.167
 gym-0.163
Token jump
Token ID4391
Feature activation-3.92e+00

Pos logit contributions
leaf+0.245
 sorts+0.217
keeping+0.209
vous+0.197
 kinds+0.191
Neg logit contributions
pit-0.154
ider-0.145
 nicely-0.141
,-0.136
uddle-0.134
Token on
Token ID319
Feature activation+1.16e-01

Pos logit contributions
pit+0.064
 nicely+0.061
ider+0.060
,+0.060
uddle+0.056
Neg logit contributions
leaf-0.151
 sorts-0.136
keeping-0.130
vous-0.125
 kinds-0.121
Token it
Token ID340
Feature activation-1.27e+00

Pos logit contributions
leaf+0.004
 sorts+0.004
vous+0.004
keeping+0.004
Dear+0.003
Neg logit contributions
,-0.002
 nicely-0.002
 hockey-0.002
 car-0.002
 until-0.002
Token."
Token ID526
Feature activation+3.55e+00

Pos logit contributions
pit+0.027
ider+0.025
uddle+0.023
 nicely+0.023
bles+0.021
Neg logit contributions
leaf-0.042
 sorts-0.038
keeping-0.036
vous-0.034
 kinds-0.033
Token very
Token ID845
Feature activation+3.96e+00

Pos logit contributions
leaf+0.077
 sorts+0.070
keeping+0.069
 kinds+0.064
vous+0.062
Neg logit contributions
pit-0.036
 nicely-0.033
ider-0.032
uddle-0.032
,-0.031
Token mean
Token ID1612
Feature activation+3.88e+00

Pos logit contributions
leaf+0.112
 sorts+0.095
vous+0.092
 fright+0.084
 shouldn+0.083
Neg logit contributions
pit-0.110
ider-0.092
 camps-0.089
angers-0.089
 Later-0.086
Token boy
Token ID2933
Feature activation+5.37e+00

Pos logit contributions
leaf+0.135
keeping+0.118
 sorts+0.115
vous+0.107
Isn+0.105
Neg logit contributions
pit-0.080
,-0.076
uddle-0.070
ider-0.068
 nicely-0.068
Token.
Token ID13
Feature activation-2.17e-01

Pos logit contributions
leaf+0.225
 sorts+0.205
keeping+0.194
vous+0.187
 faith+0.182
Neg logit contributions
,-0.069
ider-0.068
pit-0.066
 nicely-0.064
uddle-0.051
Token He
Token ID679
Feature activation-5.75e+00

Pos logit contributions
pit+0.005
ider+0.005
 nicely+0.005
,+0.004
uddle+0.004
Neg logit contributions
leaf-0.007
 sorts-0.006
keeping-0.006
vous-0.005
 kinds-0.005
Token shouldn
Token ID6584
Feature activation+1.14e+01

Pos logit contributions
,+0.097
ider+0.078
uddle+0.078
 nicely+0.077
 car+0.073
Neg logit contributions
keeping-0.222
leaf-0.206
 sorts-0.194
 harmony-0.189
weet-0.188
Token't
Token ID470
Feature activation+7.31e+00

Pos logit contributions
leaf+0.439
 sorts+0.406
 kinds+0.371
keeping+0.368
vous+0.359
Neg logit contributions
,-0.171
pit-0.169
ider-0.165
 nicely-0.161
 gym-0.156
Token have
Token ID423
Feature activation+3.83e+00

Pos logit contributions
leaf+0.254
 sorts+0.226
keeping+0.215
vous+0.203
 kinds+0.202
Neg logit contributions
,-0.140
 nicely-0.137
ider-0.127
 One-0.125
pit-0.122
Token taken
Token ID2077
Feature activation-6.99e-01

Pos logit contributions
leaf+0.107
 sorts+0.095
keeping+0.088
vous+0.083
 kinds+0.080
Neg logit contributions
pit-0.099
ider-0.095
 nicely-0.094
,-0.090
uddle-0.089
Token your
Token ID534
Feature activation-2.19e+00

Pos logit contributions
 nicely+0.013
,+0.013
pit+0.011
 hockey+0.010
balls+0.010
Neg logit contributions
leaf-0.025
 sorts-0.023
vous-0.022
keeping-0.021
Dear-0.021
Token doll
Token ID3654
Feature activation+2.92e+00

Pos logit contributions
pit+0.033
,+0.032
ider+0.031
uddle+0.030
 shoes+0.029
Neg logit contributions
leaf-0.087
 sorts-0.080
vous-0.074
keeping-0.073
 shouldn-0.071
Token's
Token ID338
Feature activation-2.88e-01

Pos logit contributions
leaf+0.034
keeping+0.030
 sorts+0.030
 faith+0.027
weet+0.026
Neg logit contributions
,-0.014
pit-0.014
ider-0.013
uddle-0.012
 nicely-0.012
Token dangerous
Token ID4923
Feature activation+3.15e+00

Pos logit contributions
 nicely+0.004
,+0.004
pit+0.004
ider+0.004
 Later+0.004
Neg logit contributions
leaf-0.012
 sorts-0.010
keeping-0.010
 kinds-0.009
vous-0.009
Token.
Token ID13
Feature activation-6.32e-02

Pos logit contributions
leaf+0.122
 sorts+0.109
keeping+0.098
vous+0.097
 faith+0.094
Neg logit contributions
,-0.055
pit-0.052
ider-0.050
 nicely-0.049
balls-0.043
Token And
Token ID843
Feature activation+2.31e+00

Pos logit contributions
ider+0.001
 One+0.001
 Later+0.001
 nicely+0.001
 His+0.001
Neg logit contributions
leaf-0.002
 sorts-0.002
keeping-0.002
 kinds-0.002
vous-0.002
Token you
Token ID345
Feature activation+5.76e+00

Pos logit contributions
leaf+0.081
 sorts+0.072
keeping+0.069
vous+0.069
Dear+0.062
Neg logit contributions
,-0.051
 nicely-0.048
ider-0.041
 at-0.040
pit-0.038
Token shouldn
Token ID6584
Feature activation+1.14e+01

Pos logit contributions
leaf+0.182
 sorts+0.163
keeping+0.156
 kinds+0.140
ymes+0.138
Neg logit contributions
,-0.143
 nicely-0.120
 One-0.118
uddle-0.115
bles-0.112
Token't
Token ID470
Feature activation+7.28e+00

Pos logit contributions
leaf+0.397
 sorts+0.356
keeping+0.333
 kinds+0.318
vous+0.318
Neg logit contributions
pit-0.223
ider-0.208
,-0.206
 nicely-0.201
uddle-0.194
Token take
Token ID1011
Feature activation+5.90e-01

Pos logit contributions
leaf+0.256
 sorts+0.229
keeping+0.218
vous+0.206
 kinds+0.204
Neg logit contributions
pit-0.137
ider-0.131
 nicely-0.127
,-0.124
uddle-0.120
Token lumber
Token ID34840
Feature activation-2.93e+00

Pos logit contributions
leaf+0.019
 sorts+0.017
keeping+0.016
vous+0.015
 kinds+0.015
Neg logit contributions
 nicely-0.013
,-0.013
pit-0.012
ider-0.012
 hockey-0.011
Token that
Token ID326
Feature activation-2.29e-01

Pos logit contributions
pit+0.060
ider+0.056
 nicely+0.051
uddle+0.051
,+0.047
Neg logit contributions
leaf-0.100
 sorts-0.089
keeping-0.087
vous-0.081
 kinds-0.079
Token doesn
Token ID1595
Feature activation+9.20e+00

Pos logit contributions
,+0.005
pit+0.004
 nicely+0.004
ider+0.004
 gym+0.004
Neg logit contributions
leaf-0.008
 sorts-0.007
keeping-0.007
vous-0.006
 faith-0.006
Token's
Token ID338
Feature activation-3.45e-01

Pos logit contributions
,+0.007
pit+0.006
 nicely+0.006
 hockey+0.006
uddle+0.005
Neg logit contributions
leaf-0.014
keeping-0.013
 sorts-0.011
vous-0.011
Dear-0.010
Token not
Token ID407
Feature activation+6.21e+00

Pos logit contributions
pit+0.006
 nicely+0.006
ider+0.006
,+0.006
bles+0.005
Neg logit contributions
leaf-0.013
keeping-0.011
 sorts-0.011
vous-0.010
Dear-0.010
Token nice
Token ID3621
Feature activation+9.96e-01

Pos logit contributions
leaf+0.229
keeping+0.200
 sorts+0.199
vous+0.185
Isn+0.181
Neg logit contributions
 nicely-0.104
pit-0.104
,-0.100
ider-0.092
 hockey-0.092
Token.
Token ID13
Feature activation+6.62e-01

Pos logit contributions
leaf+0.042
 sorts+0.036
keeping+0.035
vous+0.035
Dear+0.034
Neg logit contributions
,-0.015
pit-0.014
 nicely-0.012
uddle-0.012
ider-0.011
Token We
Token ID775
Feature activation+7.01e+00

Pos logit contributions
leaf+0.021
 sorts+0.019
keeping+0.018
vous+0.017
 kinds+0.017
Neg logit contributions
pit-0.015
ider-0.014
 nicely-0.014
 Later-0.013
,-0.013
Token shouldn
Token ID6584
Feature activation+1.14e+01

Pos logit contributions
leaf+0.209
keeping+0.185
 sorts+0.172
 harmony+0.165
ymes+0.163
Neg logit contributions
,-0.186
 nicely-0.166
 at-0.158
 One-0.157
uddle-0.152
Token't
Token ID470
Feature activation+7.32e+00

Pos logit contributions
leaf+0.427
 sorts+0.397
keeping+0.367
 kinds+0.357
vous+0.352
Neg logit contributions
,-0.189
pit-0.182
 nicely-0.173
ider-0.173
 gym-0.163
Token do
Token ID466
Feature activation+1.92e+00

Pos logit contributions
leaf+0.239
 sorts+0.212
keeping+0.205
vous+0.191
 kinds+0.185
Neg logit contributions
pit-0.153
 nicely-0.147
ider-0.146
,-0.143
uddle-0.133
Token that
Token ID326
Feature activation-3.77e-01

Pos logit contributions
leaf+0.063
 sorts+0.057
keeping+0.054
vous+0.053
 kinds+0.049
Neg logit contributions
 nicely-0.042
,-0.041
pit-0.038
ider-0.038
balls-0.037
Token."
Token ID526
Feature activation+4.20e+00

Pos logit contributions
pit+0.007
,+0.007
ider+0.007
 nicely+0.007
uddle+0.006
Neg logit contributions
leaf-0.014
 sorts-0.012
keeping-0.012
vous-0.011
 faith-0.010
Token
Token ID198
Feature activation+7.04e-02

Pos logit contributions
leaf+0.158
 sorts+0.142
keeping+0.138
vous+0.131
 kinds+0.127
Neg logit contributions
pit-0.072
ider-0.066
 nicely-0.061
uddle-0.059
,-0.055
Token she
Token ID673
Feature activation-5.88e+00

Pos logit contributions
leaf+0.105
keeping+0.096
 sorts+0.092
vous+0.090
 harm+0.086
Neg logit contributions
,-0.036
pit-0.035
 nicely-0.033
ider-0.030
 patiently-0.029
Token said
Token ID531
Feature activation-2.03e+00

Pos logit contributions
pit+0.168
ider+0.159
 nicely+0.150
uddle+0.150
 camps+0.145
Neg logit contributions
leaf-0.153
 sorts-0.133
keeping-0.123
vous-0.117
 kinds-0.112
Token.
Token ID13
Feature activation+5.65e-01

Pos logit contributions
pit+0.040
ider+0.036
uddle+0.034
 camps+0.033
 Later+0.031
Neg logit contributions
leaf-0.070
 sorts-0.063
keeping-0.062
vous-0.057
 fright-0.057
Token "
Token ID366
Feature activation+4.86e+00

Pos logit contributions
leaf+0.019
 sorts+0.017
keeping+0.017
vous+0.015
 shouldn+0.015
Neg logit contributions
pit-0.011
ider-0.011
uddle-0.010
 camps-0.009
 nicely-0.009
TokenWe
Token ID1135
Feature activation+7.00e+00

Pos logit contributions
leaf+0.144
 sorts+0.139
keeping+0.130
 kinds+0.124
 harmony+0.117
Neg logit contributions
 nicely-0.109
ider-0.109
pit-0.106
His-0.095
bles-0.094
Token shouldn
Token ID6584
Feature activation+1.14e+01

Pos logit contributions
leaf+0.200
keeping+0.173
 sorts+0.165
vous+0.150
 harmony+0.150
Neg logit contributions
,-0.179
pit-0.175
 nicely-0.173
uddle-0.161
ider-0.161
Token't
Token ID470
Feature activation+7.19e+00

Pos logit contributions
leaf+0.408
 sorts+0.370
keeping+0.344
 kinds+0.331
vous+0.329
Neg logit contributions
pit-0.210
,-0.198
ider-0.198
 nicely-0.189
 gym-0.179
Token do
Token ID466
Feature activation+1.84e+00

Pos logit contributions
leaf+0.230
 sorts+0.203
keeping+0.196
vous+0.183
 kinds+0.177
Neg logit contributions
pit-0.162
ider-0.153
 nicely-0.146
uddle-0.140
,-0.139
Token that
Token ID326
Feature activation-3.91e-01

Pos logit contributions
leaf+0.056
 sorts+0.050
keeping+0.047
vous+0.046
Dear+0.043
Neg logit contributions
pit-0.043
 nicely-0.042
ider-0.041
,-0.041
balls-0.038
Token.
Token ID13
Feature activation+3.11e-01

Pos logit contributions
pit+0.008
ider+0.007
 nicely+0.007
,+0.007
uddle+0.007
Neg logit contributions
leaf-0.014
 sorts-0.012
keeping-0.012
vous-0.011
Dear-0.011
Token It
Token ID632
Feature activation+7.82e-01

Pos logit contributions
leaf+0.010
 sorts+0.009
keeping+0.008
vous+0.008
 kinds+0.008
Neg logit contributions
pit-0.007
ider-0.007
 nicely-0.007
 Later-0.006
,-0.006
Tokenam
Token ID321
Feature activation-8.94e+00

Pos logit contributions
pit+0.117
ider+0.107
 nicely+0.096
uddle+0.093
,+0.086
Neg logit contributions
leaf-0.288
 sorts-0.261
keeping-0.254
vous-0.242
 kinds-0.235
Token,
Token ID11
Feature activation-9.26e+00

Pos logit contributions
pit+0.185
ider+0.172
uddle+0.157
 nicely+0.157
 camps+0.145
Neg logit contributions
leaf-0.302
 sorts-0.270
keeping-0.259
vous-0.245
 kinds-0.238
Token until
Token ID1566
Feature activation-4.15e+00

Pos logit contributions
pit+0.158
,+0.145
 nicely+0.144
ider+0.138
balls+0.128
Neg logit contributions
leaf-0.349
 sorts-0.314
keeping-0.310
vous-0.295
 kinds-0.286
Token finally
Token ID3443
Feature activation-3.41e+00

Pos logit contributions
pit+0.091
ider+0.086
uddle+0.079
 nicely+0.079
bles+0.073
Neg logit contributions
leaf-0.134
 sorts-0.119
keeping-0.116
vous-0.108
 fright-0.105
Token he
Token ID339
Feature activation-7.65e+00

Pos logit contributions
pit+0.063
ider+0.060
uddle+0.053
 nicely+0.052
 camps+0.049
Neg logit contributions
leaf-0.122
 sorts-0.111
keeping-0.107
vous-0.101
 shouldn-0.099
Token arrived
Token ID5284
Feature activation-1.24e+01

Pos logit contributions
,+0.145
pit+0.143
 nicely+0.132
uddle+0.124
ider+0.118
Neg logit contributions
leaf-0.278
keeping-0.241
 sorts-0.240
vous-0.219
 kinds-0.218
Token at
Token ID379
Feature activation-3.00e+00

Pos logit contributions
pit+0.188
ider+0.171
 nicely+0.154
uddle+0.150
,+0.138
Neg logit contributions
leaf-0.487
 sorts-0.442
keeping-0.429
vous-0.409
 kinds-0.398
Token the
Token ID262
Feature activation-3.11e-01

Pos logit contributions
 nicely+0.046
,+0.045
pit+0.044
 hockey+0.041
 car+0.038
Neg logit contributions
leaf-0.117
vous-0.100
 sorts-0.097
 shouldn-0.096
ymes-0.096
Token end
Token ID886
Feature activation-3.19e+00

Pos logit contributions
pit+0.005
 nicely+0.004
uddle+0.004
 camps+0.004
 rink+0.004
Neg logit contributions
leaf-0.013
 sorts-0.011
keeping-0.011
vous-0.011
 kinds-0.010
Token of
Token ID286
Feature activation+1.76e+00

Pos logit contributions
pit+0.081
ider+0.076
,+0.074
 nicely+0.073
uddle+0.072
Neg logit contributions
leaf-0.095
 sorts-0.081
keeping-0.079
vous-0.075
Dear-0.072
Token the
Token ID262
Feature activation-2.33e-01

Pos logit contributions
leaf+0.069
vous+0.060
 sorts+0.058
 shouldn+0.058
Dear+0.056
Neg logit contributions
,-0.027
pit-0.026
 nicely-0.025
 hockey-0.025
 patiently-0.021
Token than
Token ID621
Feature activation+1.14e+00

Pos logit contributions
pit+0.014
,+0.014
 nicely+0.012
ider+0.012
uddle+0.010
Neg logit contributions
leaf-0.043
 sorts-0.038
vous-0.035
keeping-0.035
 faith-0.034
Token when
Token ID618
Feature activation-3.32e+00

Pos logit contributions
leaf+0.040
 sorts+0.036
keeping+0.034
vous+0.034
Dear+0.033
Neg logit contributions
 nicely-0.021
pit-0.021
,-0.020
ider-0.018
 hockey-0.017
Token the
Token ID262
Feature activation-1.07e+00

Pos logit contributions
,+0.052
pit+0.051
 nicely+0.047
ider+0.045
 gym+0.042
Neg logit contributions
leaf-0.132
 sorts-0.115
keeping-0.111
vous-0.110
 kinds-0.102
Token woman
Token ID2415
Feature activation+6.38e+00

Pos logit contributions
pit+0.022
 nicely+0.021
,+0.021
 gym+0.020
ider+0.020
Neg logit contributions
leaf-0.037
 sorts-0.034
keeping-0.032
vous-0.030
 kinds-0.029
Token had
Token ID550
Feature activation+4.75e-01

Pos logit contributions
leaf+0.215
 sorts+0.190
keeping+0.187
vous+0.172
 kinds+0.166
Neg logit contributions
pit-0.136
,-0.126
ider-0.125
 nicely-0.121
uddle-0.115
Token arrived
Token ID5284
Feature activation-1.24e+01

Pos logit contributions
leaf+0.016
 sorts+0.014
keeping+0.013
vous+0.013
icated+0.013
Neg logit contributions
,-0.010
pit-0.009
 nicely-0.009
ider-0.008
 patiently-0.008
Token.
Token ID13
Feature activation-5.96e-01

Pos logit contributions
pit+0.190
ider+0.168
 nicely+0.160
,+0.150
uddle+0.149
Neg logit contributions
leaf-0.491
 sorts-0.442
keeping-0.427
vous-0.415
 kinds-0.398
Token<|endoftext|>
Token ID50256
Feature activation+3.89e+00

Pos logit contributions
pit+0.010
ider+0.009
 nicely+0.008
uddle+0.008
,+0.007
Neg logit contributions
leaf-0.023
 sorts-0.021
keeping-0.020
vous-0.019
 kinds-0.019
TokenIt
Token ID1026
Feature activation-1.48e+00

Pos logit contributions
leaf+0.152
 sorts+0.134
keeping+0.127
vous+0.125
 kinds+0.121
Neg logit contributions
pit-0.063
,-0.057
 nicely-0.054
ider-0.053
uddle-0.051
Token was
Token ID373
Feature activation+7.17e-01

Pos logit contributions
pit+0.032
ider+0.029
 nicely+0.028
,+0.028
uddle+0.027
Neg logit contributions
leaf-0.050
 sorts-0.044
keeping-0.043
vous-0.040
 kinds-0.038
Token a
Token ID257
Feature activation+1.70e+00

Pos logit contributions
leaf+0.022
keeping+0.019
 sorts+0.019
vous+0.018
 fright+0.017
Neg logit contributions
pit-0.017
ider-0.015
 nicely-0.015
,-0.015
 Later-0.014
Token continued
Token ID3767
Feature activation-1.02e+01

Pos logit contributions
leaf+0.214
 sorts+0.191
keeping+0.185
vous+0.180
 fright+0.169
Neg logit contributions
pit-0.123
 nicely-0.117
,-0.117
ider-0.112
 patiently-0.104
Token on
Token ID319
Feature activation-2.06e+00

Pos logit contributions
pit+0.196
ider+0.177
 nicely+0.173
,+0.167
uddle+0.162
Neg logit contributions
leaf-0.359
 sorts-0.317
keeping-0.310
vous-0.298
Dear-0.284
Token their
Token ID511
Feature activation-5.43e+00

Pos logit contributions
pit+0.042
ider+0.039
 nicely+0.037
,+0.035
uddle+0.035
Neg logit contributions
leaf-0.070
 sorts-0.062
keeping-0.060
vous-0.058
Dear-0.055
Token journey
Token ID7002
Feature activation-4.74e+00

Pos logit contributions
pit+0.107
 nicely+0.098
ider+0.098
,+0.094
uddle+0.093
Neg logit contributions
leaf-0.189
 sorts-0.168
keeping-0.162
vous-0.154
Dear-0.151
Token and
Token ID290
Feature activation+2.81e+00

Pos logit contributions
pit+0.074
ider+0.068
 nicely+0.061
uddle+0.060
,+0.055
Neg logit contributions
leaf-0.183
 sorts-0.166
keeping-0.161
vous-0.153
 kinds-0.149
Token arrived
Token ID5284
Feature activation-1.22e+01

Pos logit contributions
leaf+0.092
 sorts+0.079
keeping+0.077
vous+0.074
 fright+0.070
Neg logit contributions
pit-0.063
 nicely-0.056
ider-0.056
,-0.054
uddle-0.052
Token at
Token ID379
Feature activation-2.95e+00

Pos logit contributions
pit+0.167
ider+0.157
uddle+0.131
 nicely+0.122
balls+0.120
Neg logit contributions
leaf-0.489
 sorts-0.449
keeping-0.434
 kinds-0.405
vous-0.402
Token the
Token ID262
Feature activation-6.47e-01

Pos logit contributions
 nicely+0.049
pit+0.048
,+0.047
 gym+0.044
 hockey+0.043
Neg logit contributions
leaf-0.110
vous-0.096
 shouldn-0.093
ymes-0.090
 sorts-0.089
Token hot
Token ID3024
Feature activation-1.63e-01

Pos logit contributions
pit+0.011
 nicely+0.009
 camps+0.009
ider+0.009
 gym+0.009
Neg logit contributions
leaf-0.025
 sorts-0.023
keeping-0.023
vous-0.021
 kinds-0.021
Token land
Token ID1956
Feature activation-5.45e+00

Pos logit contributions
pit+0.004
,+0.003
uddle+0.003
ider+0.003
 nicely+0.003
Neg logit contributions
leaf-0.005
 sorts-0.004
keeping-0.004
vous-0.004
Dear-0.004
Token.
Token ID13
Feature activation-2.61e-01

Pos logit contributions
pit+0.091
ider+0.082
 nicely+0.077
,+0.075
uddle+0.071
Neg logit contributions
leaf-0.213
 sorts-0.189
keeping-0.184
vous-0.175
 faith-0.172
Token the
Token ID262
Feature activation+2.42e-01

Pos logit contributions
pit+0.143
,+0.142
 nicely+0.138
 gym+0.125
ider+0.117
Neg logit contributions
leaf-0.357
 sorts-0.315
keeping-0.309
vous-0.306
Dear-0.286
Token lake
Token ID13546
Feature activation-4.13e+00

Pos logit contributions
leaf+0.009
 sorts+0.008
keeping+0.008
vous+0.007
 kinds+0.007
Neg logit contributions
pit-0.005
 nicely-0.004
,-0.004
 gym-0.004
ider-0.004
Token was
Token ID373
Feature activation-3.05e-01

Pos logit contributions
pit+0.063
,+0.059
 nicely+0.053
ider+0.052
uddle+0.048
Neg logit contributions
leaf-0.171
 sorts-0.145
keeping-0.145
vous-0.141
Dear-0.137
Token bigger
Token ID5749
Feature activation-3.98e+00

Pos logit contributions
,+0.006
pit+0.006
 nicely+0.005
ider+0.005
 patiently+0.005
Neg logit contributions
leaf-0.011
keeping-0.010
 sorts-0.009
ymes-0.009
icated-0.009
Token and
Token ID290
Feature activation+2.86e+00

Pos logit contributions
pit+0.071
,+0.061
ider+0.061
 nicely+0.056
uddle+0.054
Neg logit contributions
leaf-0.154
 sorts-0.133
keeping-0.129
vous-0.127
Dear-0.125
Token deeper
Token ID9211
Feature activation-1.21e+01

Pos logit contributions
leaf+0.110
keeping+0.098
 sorts+0.096
vous+0.089
Dear+0.087
Neg logit contributions
pit-0.049
 nicely-0.044
ider-0.044
,-0.041
 patiently-0.039
Token than
Token ID621
Feature activation+1.34e+00

Pos logit contributions
pit+0.226
ider+0.211
 nicely+0.193
uddle+0.191
bles+0.175
Neg logit contributions
leaf-0.432
 sorts-0.390
keeping-0.379
vous-0.355
 kinds-0.348
Token the
Token ID262
Feature activation-6.48e-01

Pos logit contributions
leaf+0.046
 sorts+0.040
keeping+0.038
vous+0.037
Dear+0.037
Neg logit contributions
pit-0.028
 nicely-0.027
,-0.025
ider-0.024
 patiently-0.023
Token stream
Token ID4269
Feature activation-5.64e+00

Pos logit contributions
pit+0.012
 camps+0.011
,+0.011
 nicely+0.011
 car+0.011
Neg logit contributions
leaf-0.024
 sorts-0.021
keeping-0.020
vous-0.019
Dear-0.019
Token,
Token ID11
Feature activation-9.18e+00

Pos logit contributions
pit+0.091
ider+0.081
,+0.078
 nicely+0.077
uddle+0.070
Neg logit contributions
leaf-0.227
 sorts-0.199
keeping-0.189
Dear-0.185
 faith-0.183
Token but
Token ID475
Feature activation+3.11e+00

Pos logit contributions
pit+0.167
 nicely+0.146
,+0.144
ider+0.144
balls+0.130
Neg logit contributions
leaf-0.346
 sorts-0.302
keeping-0.297
vous-0.289
Dear-0.281
Token Lily
Token ID20037
Feature activation+1.45e+00

Pos logit contributions
pit+0.148
,+0.140
 nicely+0.130
ider+0.119
 gym+0.116
Neg logit contributions
leaf-0.373
 sorts-0.334
keeping-0.325
vous-0.316
 kinds-0.300
Token's
Token ID338
Feature activation-1.89e+00

Pos logit contributions
leaf+0.059
 sorts+0.052
keeping+0.051
vous+0.048
 fright+0.047
Neg logit contributions
,-0.022
pit-0.022
 nicely-0.019
ider-0.018
uddle-0.017
Token friend
Token ID1545
Feature activation+7.40e+00

Pos logit contributions
pit+0.027
 nicely+0.023
ider+0.023
,+0.022
uddle+0.022
Neg logit contributions
leaf-0.079
 sorts-0.071
keeping-0.069
vous-0.067
 kinds-0.064
Token Tim
Token ID5045
Feature activation+1.87e-01

Pos logit contributions
leaf+0.306
 sorts+0.274
keeping+0.269
vous+0.253
 kinds+0.247
Neg logit contributions
pit-0.102
ider-0.087
,-0.086
 nicely-0.086
uddle-0.079
Tokenmy
Token ID1820
Feature activation+1.27e+00

Pos logit contributions
leaf+0.006
 sorts+0.005
keeping+0.005
vous+0.004
 faith+0.004
Neg logit contributions
pit-0.004
,-0.004
 nicely-0.004
ider-0.004
uddle-0.004
Token arrived
Token ID5284
Feature activation-1.21e+01

Pos logit contributions
leaf+0.046
 sorts+0.041
keeping+0.040
vous+0.038
 kinds+0.036
Neg logit contributions
pit-0.023
,-0.022
ider-0.020
 nicely-0.020
uddle-0.019
Token at
Token ID379
Feature activation-2.91e+00

Pos logit contributions
pit+0.117
ider+0.097
 nicely+0.092
,+0.084
uddle+0.079
Neg logit contributions
leaf-0.548
 sorts-0.502
keeping-0.491
vous-0.481
 kinds-0.462
Token her
Token ID607
Feature activation-4.68e+00

Pos logit contributions
 nicely+0.038
pit+0.035
,+0.035
 hockey+0.034
 gym+0.030
Neg logit contributions
leaf-0.117
 sorts-0.103
vous-0.103
 shouldn-0.102
ymes-0.098
Token house
Token ID2156
Feature activation-1.40e+00

Pos logit contributions
pit+0.078
ider+0.068
 nicely+0.067
uddle+0.065
,+0.064
Neg logit contributions
leaf-0.183
 sorts-0.162
keeping-0.157
vous-0.153
 kinds-0.146
Token.
Token ID13
Feature activation+3.72e-02

Pos logit contributions
pit+0.024
ider+0.022
 nicely+0.020
uddle+0.019
,+0.019
Neg logit contributions
leaf-0.053
 sorts-0.048
keeping-0.046
vous-0.044
 kinds-0.043
Token He
Token ID679
Feature activation-6.23e+00

Pos logit contributions
leaf+0.001
 sorts+0.001
keeping+0.001
Isn+0.001
vous+0.001
Neg logit contributions
pit-0.001
ider-0.001
uddle-0.001
bles-0.001
 nicely-0.001
Token and
Token ID290
Feature activation+3.02e+00

Pos logit contributions
leaf+0.131
 sorts+0.117
keeping+0.112
vous+0.106
 kinds+0.102
Neg logit contributions
pit-0.089
ider-0.083
 nicely-0.078
uddle-0.076
,-0.072
Token finally
Token ID3443
Feature activation-2.93e+00

Pos logit contributions
leaf+0.107
 sorts+0.092
keeping+0.090
vous+0.090
Dear+0.084
Neg logit contributions
pit-0.059
,-0.055
 nicely-0.055
ider-0.048
 gym-0.048
Token,
Token ID11
Feature activation-9.49e+00

Pos logit contributions
pit+0.064
ider+0.059
 nicely+0.056
uddle+0.055
,+0.053
Neg logit contributions
leaf-0.096
 sorts-0.085
keeping-0.082
vous-0.077
 kinds-0.075
Token the
Token ID262
Feature activation-3.45e-01

Pos logit contributions
pit+0.181
,+0.173
 nicely+0.166
 gym+0.144
ider+0.141
Neg logit contributions
leaf-0.348
 sorts-0.302
vous-0.295
keeping-0.290
 kinds-0.274
Token license
Token ID5964
Feature activation+3.40e-01

Pos logit contributions
pit+0.007
 nicely+0.006
ider+0.006
uddle+0.006
,+0.006
Neg logit contributions
leaf-0.012
 sorts-0.012
keeping-0.011
vous-0.010
 kinds-0.010
Token arrived
Token ID5284
Feature activation-1.21e+01

Pos logit contributions
leaf+0.012
 sorts+0.010
keeping+0.010
vous+0.009
 fright+0.009
Neg logit contributions
pit-0.007
ider-0.007
,-0.007
 nicely-0.006
uddle-0.006
Token in
Token ID287
Feature activation-3.18e+00

Pos logit contributions
pit+0.201
ider+0.181
 nicely+0.169
uddle+0.162
,+0.158
Neg logit contributions
leaf-0.466
 sorts-0.417
keeping-0.404
vous-0.389
 kinds-0.374
Token the
Token ID262
Feature activation-4.58e-01

Pos logit contributions
,+0.061
pit+0.058
 nicely+0.053
 hockey+0.048
 patiently+0.046
Neg logit contributions
leaf-0.116
vous-0.098
 shouldn-0.094
icated-0.094
ymes-0.091
Token mail
Token ID6920
Feature activation-2.53e+00

Pos logit contributions
pit+0.009
 nicely+0.008
uddle+0.008
 car+0.007
 rink+0.007
Neg logit contributions
leaf-0.017
keeping-0.015
 sorts-0.014
vous-0.013
 kinds-0.013
Token.
Token ID13
Feature activation-2.43e-01

Pos logit contributions
pit+0.039
,+0.035
ider+0.033
 nicely+0.032
uddle+0.028
Neg logit contributions
leaf-0.102
 sorts-0.090
keeping-0.086
vous-0.086
 fright-0.083
Token It
Token ID632
Feature activation+7.85e-02

Pos logit contributions
pit+0.004
ider+0.004
 nicely+0.004
 Later+0.003
,+0.003
Neg logit contributions
leaf-0.009
 sorts-0.008
keeping-0.008
vous-0.008
 kinds-0.008
Token finish
Token ID5461
Feature activation-5.74e+00

Pos logit contributions
pit+0.015
 nicely+0.013
 gym+0.012
,+0.012
ider+0.012
Neg logit contributions
leaf-0.035
 sorts-0.032
keeping-0.032
vous-0.030
 kinds-0.029
Token line
Token ID1627
Feature activation-8.44e-01

Pos logit contributions
pit+0.085
ider+0.073
 nicely+0.066
,+0.065
uddle+0.064
Neg logit contributions
leaf-0.236
 sorts-0.209
keeping-0.204
vous-0.198
Dear-0.192
Token,
Token ID11
Feature activation-9.60e+00

Pos logit contributions
pit+0.014
ider+0.013
 nicely+0.011
uddle+0.011
 camps+0.010
Neg logit contributions
leaf-0.032
 sorts-0.029
keeping-0.028
vous-0.027
 kinds-0.026
Token Bob
Token ID5811
Feature activation-2.58e+00

Pos logit contributions
pit+0.158
 nicely+0.147
,+0.145
ider+0.128
 gym+0.122
Neg logit contributions
leaf-0.370
 sorts-0.330
vous-0.324
keeping-0.322
Dear-0.308
Token had
Token ID550
Feature activation+1.09e+00

Pos logit contributions
pit+0.053
,+0.047
ider+0.047
 nicely+0.045
uddle+0.044
Neg logit contributions
leaf-0.089
 sorts-0.079
keeping-0.077
vous-0.072
 fright-0.070
Token arrived
Token ID5284
Feature activation-1.20e+01

Pos logit contributions
leaf+0.038
 sorts+0.034
keeping+0.033
vous+0.031
icated+0.031
Neg logit contributions
pit-0.020
,-0.020
 nicely-0.019
 hockey-0.018
ider-0.017
Token first
Token ID717
Feature activation-8.91e-01

Pos logit contributions
pit+0.169
ider+0.152
 nicely+0.137
uddle+0.132
,+0.122
Neg logit contributions
leaf-0.485
 sorts-0.441
keeping-0.429
vous-0.410
 kinds-0.398
Token!
Token ID0
Feature activation+4.99e-01

Pos logit contributions
pit+0.017
,+0.015
 nicely+0.015
ider+0.015
uddle+0.013
Neg logit contributions
leaf-0.032
 sorts-0.028
keeping-0.028
vous-0.027
Dear-0.026
Token Peter
Token ID5613
Feature activation-1.94e+00

Pos logit contributions
leaf+0.020
 sorts+0.018
keeping+0.017
vous+0.016
 kinds+0.016
Neg logit contributions
pit-0.008
ider-0.007
 nicely-0.006
uddle-0.006
,-0.006
Token was
Token ID373
Feature activation+4.77e-01

Pos logit contributions
pit+0.040
,+0.039
 nicely+0.036
ider+0.036
uddle+0.034
Neg logit contributions
leaf-0.066
keeping-0.058
 sorts-0.058
vous-0.053
Dear-0.052
Token very
Token ID845
Feature activation+2.60e+00

Pos logit contributions
leaf+0.015
 sorts+0.013
keeping+0.013
vous+0.012
Dear+0.012
Neg logit contributions
pit-0.011
ider-0.010
 nicely-0.010
,-0.010
 patiently-0.009
Token It
Token ID632
Feature activation+1.59e+00

Pos logit contributions
leaf+0.085
keeping+0.076
vous+0.074
Dear+0.074
 shouldn+0.072
Neg logit contributions
,-0.035
 nicely-0.031
pit-0.030
 His-0.028
 hockey-0.026
Token was
Token ID373
Feature activation-3.67e-01

Pos logit contributions
leaf+0.062
keeping+0.055
 sorts+0.051
vous+0.049
weet+0.047
Neg logit contributions
,-0.028
pit-0.028
 nicely-0.026
uddle-0.024
ider-0.024
Token coming
Token ID2406
Feature activation-5.77e+00

Pos logit contributions
,+0.007
 nicely+0.007
pit+0.007
 patiently+0.006
ider+0.006
Neg logit contributions
leaf-0.013
keeping-0.012
 sorts-0.011
vous-0.011
ymes-0.010
Token from
Token ID422
Feature activation-3.84e+00

Pos logit contributions
,+0.098
 nicely+0.093
ider+0.085
pit+0.085
 patiently+0.074
Neg logit contributions
leaf-0.226
 sorts-0.204
keeping-0.192
vous-0.191
 kinds-0.186
Token down
Token ID866
Feature activation-8.73e+00

Pos logit contributions
,+0.071
pit+0.064
 nicely+0.063
 camps+0.063
 hockey+0.059
Neg logit contributions
leaf-0.144
 sorts-0.121
vous-0.118
keeping-0.110
 shouldn-0.110
Token underground
Token ID11447
Feature activation-1.19e+01

Pos logit contributions
,+0.156
pit+0.140
 nicely+0.132
ider+0.126
 gym+0.116
Neg logit contributions
leaf-0.347
 sorts-0.294
keeping-0.282
vous-0.278
icated-0.276
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
pit+0.179
,+0.165
ider+0.151
 nicely+0.150
uddle+0.132
Neg logit contributions
leaf-0.502
 sorts-0.424
vous-0.413
keeping-0.411
Dear-0.397
Token She
Token ID1375
Feature activation-4.96e+00

Pos logit contributions
pit+0.004
ider+0.004
 nicely+0.003
uddle+0.003
,+0.003
Neg logit contributions
leaf-0.009
 sorts-0.008
keeping-0.008
vous-0.007
 kinds-0.007
Token pe
Token ID613
Feature activation-6.08e+00

Pos logit contributions
pit+0.109
ider+0.100
 nicely+0.097
,+0.095
uddle+0.093
Neg logit contributions
leaf-0.163
 sorts-0.144
keeping-0.141
vous-0.131
 kinds-0.127
Tokenered
Token ID1068
Feature activation-6.47e+00

Pos logit contributions
pit+0.141
ider+0.135
 nicely+0.129
uddle+0.126
 gym+0.125
Neg logit contributions
leaf-0.186
 sorts-0.163
keeping-0.156
vous-0.146
 kinds-0.144
Token down
Token ID866
Feature activation-7.77e+00

Pos logit contributions
pit+0.127
ider+0.119
 nicely+0.109
uddle+0.108
bles+0.100
Neg logit contributions
leaf-0.225
 sorts-0.202
keeping-0.195
vous-0.184
 kinds-0.179
Token felt
Token ID2936
Feature activation-8.61e-01

Pos logit contributions
leaf+0.014
keeping+0.013
 sorts+0.012
vous+0.012
Dear+0.011
Neg logit contributions
,-0.008
 nicely-0.006
pit-0.006
 gym-0.006
 at-0.006
Token like
Token ID588
Feature activation+1.01e+00

Pos logit contributions
,+0.016
 nicely+0.015
 until+0.012
 at+0.012
pit+0.012
Neg logit contributions
leaf-0.033
keeping-0.029
 sorts-0.028
vous-0.027
 faith-0.026
Token forever
Token ID8097
Feature activation-1.48e+00

Pos logit contributions
leaf+0.036
vous+0.030
 sorts+0.030
keeping+0.030
Dear+0.028
Neg logit contributions
,-0.021
 nicely-0.018
pit-0.018
 gym-0.016
 hockey-0.016
Token,
Token ID11
Feature activation-9.45e+00

Pos logit contributions
pit+0.024
,+0.023
ider+0.021
 nicely+0.021
uddle+0.019
Neg logit contributions
leaf-0.059
 sorts-0.050
vous-0.048
keeping-0.048
Dear-0.046
Token they
Token ID484
Feature activation-2.72e+00

Pos logit contributions
,+0.155
pit+0.136
 until+0.130
 nicely+0.129
 patiently+0.113
Neg logit contributions
leaf-0.379
keeping-0.324
vous-0.322
 sorts-0.312
ymes-0.307
Token arrived
Token ID5284
Feature activation-1.19e+01

Pos logit contributions
,+0.053
pit+0.049
 nicely+0.049
 patiently+0.042
ider+0.042
Neg logit contributions
leaf-0.097
keeping-0.087
 sorts-0.083
vous-0.082
 faith-0.076
Token at
Token ID379
Feature activation-2.98e+00

Pos logit contributions
pit+0.158
ider+0.138
 nicely+0.127
uddle+0.119
,+0.114
Neg logit contributions
leaf-0.495
 sorts-0.450
keeping-0.439
vous-0.424
 kinds-0.408
Token the
Token ID262
Feature activation-4.32e-01

Pos logit contributions
 nicely+0.049
,+0.046
pit+0.041
 gym+0.039
 hockey+0.038
Neg logit contributions
leaf-0.116
vous-0.103
 shouldn-0.099
 sorts-0.093
Isn-0.093
Token zoo
Token ID26626
Feature activation-4.50e+00

Pos logit contributions
pit+0.007
 nicely+0.006
 gym+0.006
 camps+0.005
ider+0.005
Neg logit contributions
leaf-0.018
keeping-0.016
 sorts-0.016
vous-0.015
 kinds-0.014
Token.
Token ID13
Feature activation-4.69e-01

Pos logit contributions
pit+0.068
,+0.057
 nicely+0.056
ider+0.056
uddle+0.051
Neg logit contributions
leaf-0.186
 sorts-0.163
keeping-0.160
vous-0.156
Dear-0.152
Token Little
Token ID7703
Feature activation+3.45e+00

Pos logit contributions
pit+0.007
 nicely+0.006
ider+0.006
 Later+0.006
 One+0.006
Neg logit contributions
leaf-0.018
 sorts-0.016
keeping-0.016
vous-0.015
 kinds-0.015
Token's
Token ID338
Feature activation-1.73e+00

Pos logit contributions
leaf+0.186
 sorts+0.165
keeping+0.159
vous+0.149
 harmony+0.147
Neg logit contributions
pit-0.106
,-0.096
 nicely-0.090
ider-0.088
uddle-0.081
Token house
Token ID2156
Feature activation-2.32e+00

Pos logit contributions
pit+0.031
,+0.029
 nicely+0.029
ider+0.026
 gym+0.025
Neg logit contributions
leaf-0.065
 sorts-0.056
keeping-0.054
vous-0.054
Isn-0.051
Token.
Token ID13
Feature activation-4.52e-01

Pos logit contributions
pit+0.038
,+0.034
ider+0.033
 nicely+0.033
uddle+0.028
Neg logit contributions
leaf-0.093
 sorts-0.080
keeping-0.078
vous-0.075
 faith-0.074
Token When
Token ID1649
Feature activation-2.77e+00

Pos logit contributions
pit+0.006
ider+0.006
 nicely+0.005
uddle+0.005
,+0.005
Neg logit contributions
leaf-0.018
 sorts-0.016
keeping-0.016
vous-0.015
 kinds-0.015
Token they
Token ID484
Feature activation-2.54e+00

Pos logit contributions
pit+0.048
ider+0.043
 nicely+0.042
,+0.041
uddle+0.038
Neg logit contributions
leaf-0.104
 sorts-0.093
keeping-0.089
vous-0.087
 kinds-0.083
Token arrived
Token ID5284
Feature activation-1.19e+01

Pos logit contributions
pit+0.056
ider+0.054
uddle+0.049
 nicely+0.048
bles+0.046
Neg logit contributions
leaf-0.082
 sorts-0.074
keeping-0.070
vous-0.066
 kinds-0.064
Token,
Token ID11
Feature activation-9.48e+00

Pos logit contributions
ider+0.142
pit+0.135
 camps+0.104
uddle+0.104
bles+0.098
Neg logit contributions
leaf-0.477
 sorts-0.446
keeping-0.445
 kinds-0.405
 shouldn-0.404
Token Lily
Token ID20037
Feature activation+1.11e+00

Pos logit contributions
pit+0.140
,+0.139
 nicely+0.134
 gym+0.113
balls+0.100
Neg logit contributions
leaf-0.384
vous-0.336
 sorts-0.333
keeping-0.329
Dear-0.311
Token's
Token ID338
Feature activation-1.31e+00

Pos logit contributions
leaf+0.037
 sorts+0.033
keeping+0.031
vous+0.030
 kinds+0.029
Neg logit contributions
pit-0.024
ider-0.022
 nicely-0.021
uddle-0.020
,-0.020
Token mom
Token ID1995
Feature activation+6.78e+00

Pos logit contributions
pit+0.024
ider+0.021
 nicely+0.020
uddle+0.020
,+0.019
Neg logit contributions
leaf-0.049
 sorts-0.044
keeping-0.043
vous-0.040
 kinds-0.039
Token knocked
Token ID13642
Feature activation-5.48e+00

Pos logit contributions
leaf+0.214
 sorts+0.186
keeping+0.183
vous+0.169
 kinds+0.162
Neg logit contributions
pit-0.159
ider-0.145
 nicely-0.140
,-0.139
uddle-0.136
Token bolts
Token ID24667
Feature activation+4.84e-01

Pos logit contributions
leaf+0.103
 sorts+0.090
keeping+0.086
vous+0.084
Dear+0.080
Neg logit contributions
pit-0.057
 nicely-0.051
ider-0.051
,-0.048
uddle-0.046
Token,
Token ID11
Feature activation-9.13e+00

Pos logit contributions
leaf+0.018
 sorts+0.016
keeping+0.015
vous+0.015
 kinds+0.014
Neg logit contributions
pit-0.009
ider-0.008
 nicely-0.008
,-0.007
uddle-0.007
Token but
Token ID475
Feature activation+3.23e+00

Pos logit contributions
pit+0.155
ider+0.137
 nicely+0.132
,+0.127
balls+0.116
Neg logit contributions
leaf-0.352
 sorts-0.312
keeping-0.303
vous-0.292
Dear-0.279
Token when
Token ID618
Feature activation-2.53e+00

Pos logit contributions
leaf+0.118
 sorts+0.104
keeping+0.101
vous+0.097
Dear+0.093
Neg logit contributions
pit-0.061
 nicely-0.055
ider-0.054
,-0.053
balls-0.048
Token he
Token ID339
Feature activation-7.37e+00

Pos logit contributions
pit+0.044
ider+0.040
 nicely+0.038
,+0.037
uddle+0.035
Neg logit contributions
leaf-0.096
 sorts-0.085
keeping-0.082
vous-0.079
 kinds-0.076
Token arrived
Token ID5284
Feature activation-1.19e+01

Pos logit contributions
pit+0.159
ider+0.148
 nicely+0.140
,+0.138
uddle+0.137
Neg logit contributions
leaf-0.247
 sorts-0.218
keeping-0.216
vous-0.194
 kinds-0.192
Token the
Token ID262
Feature activation-1.89e-01

Pos logit contributions
pit+0.159
ider+0.157
uddle+0.135
 camps+0.132
angers+0.124
Neg logit contributions
leaf-0.437
keeping-0.400
 sorts-0.393
 shouldn-0.371
 faith-0.370
Token lid
Token ID19789
Feature activation+6.52e-01

Pos logit contributions
pit+0.003
ider+0.003
 nicely+0.003
uddle+0.003
,+0.003
Neg logit contributions
leaf-0.007
 sorts-0.006
keeping-0.006
vous-0.006
 kinds-0.006
Token was
Token ID373
Feature activation+3.22e-01

Pos logit contributions
leaf+0.026
 sorts+0.024
keeping+0.023
vous+0.022
 shouldn+0.022
Neg logit contributions
pit-0.008
ider-0.008
 nicely-0.007
uddle-0.006
 Later-0.006
Token wide
Token ID3094
Feature activation-3.94e+00

Pos logit contributions
leaf+0.010
 sorts+0.009
keeping+0.008
vous+0.008
Dear+0.008
Neg logit contributions
pit-0.008
ider-0.007
uddle-0.007
 nicely-0.006
balls-0.006
Token open
Token ID1280
Feature activation-6.66e+00

Pos logit contributions
pit+0.057
ider+0.054
uddle+0.049
 nicely+0.047
bles+0.044
Neg logit contributions
leaf-0.149
 sorts-0.140
keeping-0.136
 kinds-0.128
vous-0.127
Token when
Token ID618
Feature activation-2.66e+00

Pos logit contributions
pit+0.178
ider+0.171
uddle+0.156
 nicely+0.155
 Later+0.143
Neg logit contributions
leaf-0.305
 sorts-0.274
keeping-0.262
vous-0.245
 kinds-0.241
Token suddenly
Token ID6451
Feature activation-4.26e+00

Pos logit contributions
pit+0.045
,+0.040
 nicely+0.039
ider+0.039
 gym+0.036
Neg logit contributions
leaf-0.104
 sorts-0.092
keeping-0.089
vous-0.088
 kinds-0.082
Token,
Token ID11
Feature activation-9.26e+00

Pos logit contributions
pit+0.076
ider+0.068
,+0.067
 nicely+0.065
uddle+0.062
Neg logit contributions
leaf-0.158
 sorts-0.141
keeping-0.137
vous-0.131
 kinds-0.125
Token something
Token ID1223
Feature activation-5.20e-01

Pos logit contributions
pit+0.134
,+0.132
 gym+0.115
 nicely+0.113
balls+0.108
Neg logit contributions
leaf-0.380
 sorts-0.341
vous-0.338
keeping-0.319
Dear-0.310
Token strange
Token ID6283
Feature activation-1.50e+00

Pos logit contributions
pit+0.011
ider+0.011
 nicely+0.011
uddle+0.010
,+0.010
Neg logit contributions
leaf-0.016
keeping-0.015
 sorts-0.015
vous-0.013
 kinds-0.013
Token arrived
Token ID5284
Feature activation-1.18e+01

Pos logit contributions
pit+0.042
ider+0.040
,+0.040
uddle+0.037
 nicely+0.037
Neg logit contributions
leaf-0.040
 sorts-0.034
keeping-0.033
vous-0.030
 faith-0.028
Token.
Token ID13
Feature activation-3.15e-01

Pos logit contributions
pit+0.268
ider+0.243
uddle+0.230
,+0.228
 nicely+0.225
Neg logit contributions
leaf-0.399
 sorts-0.349
keeping-0.325
vous-0.320
 faith-0.306
Token They
Token ID1119
Feature activation-2.54e+00

Pos logit contributions
pit+0.005
ider+0.005
 nicely+0.005
 Later+0.005
,+0.005
Neg logit contributions
leaf-0.011
 sorts-0.010
keeping-0.010
vous-0.010
 kinds-0.010
Token stopped
Token ID5025
Feature activation-5.77e+00

Pos logit contributions
pit+0.053
,+0.050
 nicely+0.046
ider+0.044
bles+0.042
Neg logit contributions
leaf-0.088
keeping-0.077
 sorts-0.076
vous-0.073
 faith-0.068
Token dancing
Token ID15360
Feature activation-4.23e+00

Pos logit contributions
pit+0.088
,+0.087
 nicely+0.078
 patiently+0.074
ider+0.073
Neg logit contributions
leaf-0.227
keeping-0.206
 sorts-0.206
vous-0.202
 kinds-0.188
Token and
Token ID290
Feature activation+3.78e+00

Pos logit contributions
pit+0.067
ider+0.058
,+0.056
 nicely+0.056
uddle+0.052
Neg logit contributions
leaf-0.167
 sorts-0.150
keeping-0.145
vous-0.142
 kinds-0.135
Token were
Token ID547
Feature activation-1.69e+00

Pos logit contributions
,+0.058
pit+0.055
 nicely+0.055
 patiently+0.049
ider+0.046
Neg logit contributions
leaf-0.104
keeping-0.094
 sorts-0.091
vous-0.089
 faith-0.084
Token glad
Token ID9675
Feature activation+1.95e+00

Pos logit contributions
pit+0.034
 nicely+0.031
ider+0.031
,+0.029
uddle+0.028
Neg logit contributions
leaf-0.058
 sorts-0.052
keeping-0.051
vous-0.048
Dear-0.046
Token that
Token ID326
Feature activation-1.49e+00

Pos logit contributions
leaf+0.085
 sorts+0.077
vous+0.077
keeping+0.074
Dear+0.070
Neg logit contributions
,-0.026
 nicely-0.021
pit-0.021
 patiently-0.017
ider-0.017
Token he
Token ID339
Feature activation-7.57e+00

Pos logit contributions
,+0.020
 nicely+0.019
pit+0.018
 gym+0.016
 hockey+0.015
Neg logit contributions
leaf-0.064
keeping-0.056
vous-0.054
 sorts-0.054
Dear-0.051
Token had
Token ID550
Feature activation+1.11e+00

Pos logit contributions
,+0.126
 nicely+0.107
ider+0.099
pit+0.098
 at+0.092
Neg logit contributions
leaf-0.276
 sorts-0.254
keeping-0.245
weet-0.243
icated-0.238
Token arrived
Token ID5284
Feature activation-1.18e+01

Pos logit contributions
leaf+0.032
 sorts+0.028
keeping+0.026
vous+0.024
Dear+0.023
Neg logit contributions
pit-0.028
 nicely-0.027
,-0.027
ider-0.026
 hockey-0.024
Token.
Token ID13
Feature activation-5.56e-01

Pos logit contributions
pit+0.146
 nicely+0.143
,+0.139
ider+0.112
 patiently+0.110
Neg logit contributions
leaf-0.510
 sorts-0.447
vous-0.443
keeping-0.437
icated-0.421
Token He
Token ID679
Feature activation-6.52e+00

Pos logit contributions
pit+0.008
 nicely+0.007
 Later+0.007
ider+0.007
,+0.006
Neg logit contributions
leaf-0.022
 sorts-0.020
keeping-0.019
vous-0.019
 fright-0.018
Token was
Token ID373
Feature activation+5.47e-01

Pos logit contributions
,+0.129
pit+0.119
 nicely+0.117
 patiently+0.114
 proudly+0.103
Neg logit contributions
leaf-0.223
keeping-0.209
 sorts-0.193
vous-0.186
ymes-0.183
Token a
Token ID257
Feature activation+1.42e+00

Pos logit contributions
leaf+0.018
keeping+0.016
 sorts+0.016
vous+0.014
 kinds+0.014
Neg logit contributions
pit-0.012
 nicely-0.011
ider-0.011
,-0.011
 patiently-0.010
Token very
Token ID845
Feature activation+2.83e+00

Pos logit contributions
leaf+0.050
keeping+0.047
 sorts+0.044
 kinds+0.041
vous+0.040
Neg logit contributions
 nicely-0.027
pit-0.027
,-0.026
ider-0.025
 patiently-0.023
Token until
Token ID1566
Feature activation-3.83e+00

Pos logit contributions
pit+0.018
,+0.017
 nicely+0.017
ider+0.017
uddle+0.016
Neg logit contributions
leaf-0.028
keeping-0.024
 sorts-0.024
Dear-0.022
vous-0.022
Token he
Token ID339
Feature activation-7.35e+00

Pos logit contributions
pit+0.084
ider+0.078
 nicely+0.077
,+0.076
uddle+0.071
Neg logit contributions
leaf-0.125
 sorts-0.111
keeping-0.106
vous-0.103
 kinds-0.097
Token was
Token ID373
Feature activation+3.95e-01

Pos logit contributions
pit+0.147
,+0.136
 nicely+0.133
ider+0.131
uddle+0.126
Neg logit contributions
leaf-0.264
 sorts-0.230
keeping-0.228
 kinds-0.205
icated-0.204
Token b
Token ID275
Feature activation-9.45e-01

Pos logit contributions
leaf+0.013
 sorts+0.011
keeping+0.011
vous+0.010
 kinds+0.010
Neg logit contributions
pit-0.009
ider-0.008
 nicely-0.008
uddle-0.007
,-0.007
Tokenowing
Token ID7855
Feature activation-3.60e+00

Pos logit contributions
pit+0.028
ider+0.026
 nicely+0.025
uddle+0.024
 camps+0.024
Neg logit contributions
leaf-0.024
 sorts-0.020
keeping-0.019
vous-0.018
 kinds-0.017
Token deeper
Token ID9211
Feature activation-1.18e+01

Pos logit contributions
pit+0.064
ider+0.058
 nicely+0.055
,+0.053
uddle+0.052
Neg logit contributions
leaf-0.134
 sorts-0.120
keeping-0.116
vous-0.110
 kinds-0.107
Token than
Token ID621
Feature activation+1.67e+00

Pos logit contributions
pit+0.225
ider+0.213
uddle+0.193
 nicely+0.192
bles+0.177
Neg logit contributions
leaf-0.410
 sorts-0.374
keeping-0.363
vous-0.338
 kinds-0.334
Token anyone
Token ID2687
Feature activation+4.03e+00

Pos logit contributions
leaf+0.059
 sorts+0.053
keeping+0.050
Dear+0.049
vous+0.049
Neg logit contributions
pit-0.031
 nicely-0.031
,-0.029
ider-0.028
 patiently-0.026
Token else
Token ID2073
Feature activation-4.59e-01

Pos logit contributions
leaf+0.146
 sorts+0.128
keeping+0.126
Dear+0.117
vous+0.115
Neg logit contributions
pit-0.074
,-0.073
 nicely-0.067
ider-0.066
uddle-0.060
Token!
Token ID0
Feature activation+7.77e-01

Pos logit contributions
pit+0.008
,+0.007
 nicely+0.007
ider+0.007
uddle+0.006
Neg logit contributions
leaf-0.018
 sorts-0.015
keeping-0.015
vous-0.014
Dear-0.014
Token The
Token ID383
Feature activation+8.15e-01

Pos logit contributions
leaf+0.031
 sorts+0.028
keeping+0.027
vous+0.026
 kinds+0.025
Neg logit contributions
pit-0.012
ider-0.011
 nicely-0.010
 Later-0.010
,-0.009
Token stops
Token ID9911
Feature activation-5.40e+00

Pos logit contributions
leaf+0.057
keeping+0.050
vous+0.049
 sorts+0.048
Dear+0.048
Neg logit contributions
pit-0.032
 nicely-0.030
,-0.029
 hockey-0.027
 gym-0.027
Token they
Token ID484
Feature activation-2.88e+00

Pos logit contributions
pit+0.088
ider+0.078
 nicely+0.076
,+0.075
uddle+0.066
Neg logit contributions
leaf-0.211
 sorts-0.189
keeping-0.183
vous-0.179
 kinds-0.169
Token made
Token ID925
Feature activation-9.95e-01

Pos logit contributions
pit+0.057
ider+0.051
 nicely+0.051
,+0.049
uddle+0.046
Neg logit contributions
leaf-0.101
 sorts-0.089
keeping-0.088
vous-0.081
 faith-0.080
Token until
Token ID1566
Feature activation-3.66e+00

Pos logit contributions
,+0.014
pit+0.014
 nicely+0.013
ider+0.011
 patiently+0.011
Neg logit contributions
leaf-0.040
 sorts-0.035
keeping-0.035
vous-0.034
Dear-0.034
Token they
Token ID484
Feature activation-2.94e+00

Pos logit contributions
pit+0.065
 nicely+0.059
,+0.058
ider+0.057
 patiently+0.051
Neg logit contributions
leaf-0.135
 sorts-0.121
keeping-0.116
vous-0.115
 kinds-0.108
Token arrived
Token ID5284
Feature activation-1.18e+01

Pos logit contributions
pit+0.066
ider+0.060
 nicely+0.059
,+0.057
uddle+0.055
Neg logit contributions
leaf-0.096
 sorts-0.084
keeping-0.082
vous-0.076
 kinds-0.074
Token at
Token ID379
Feature activation-2.88e+00

Pos logit contributions
pit+0.146
ider+0.132
 nicely+0.112
uddle+0.110
bles+0.097
Neg logit contributions
leaf-0.494
 sorts-0.453
keeping-0.440
vous-0.417
 kinds-0.411
Token the
Token ID262
Feature activation-5.00e-01

Pos logit contributions
 nicely+0.051
,+0.048
pit+0.047
 patiently+0.039
 gym+0.039
Neg logit contributions
leaf-0.109
vous-0.095
 sorts-0.090
 shouldn-0.090
weet-0.087
Token store
Token ID3650
Feature activation-2.98e+00

Pos logit contributions
pit+0.008
ider+0.007
 nicely+0.007
uddle+0.006
 gym+0.006
Neg logit contributions
leaf-0.020
 sorts-0.018
keeping-0.017
vous-0.016
 kinds-0.016
Token.
Token ID13
Feature activation-4.94e-01

Pos logit contributions
pit+0.051
ider+0.046
 nicely+0.043
uddle+0.041
,+0.039
Neg logit contributions
leaf-0.112
 sorts-0.101
keeping-0.098
vous-0.093
 kinds-0.090
Token
Token ID198
Feature activation-5.45e-01

Pos logit contributions
pit+0.009
ider+0.008
 nicely+0.008
 Later+0.008
,+0.008
Neg logit contributions
leaf-0.018
 sorts-0.016
keeping-0.015
vous-0.015
 kinds-0.014
Token
Token ID198
Feature activation+5.89e-01

Pos logit contributions
pit+0.038
ider+0.036
uddle+0.032
 Later+0.030
angers+0.030
Neg logit contributions
leaf-0.075
 sorts-0.071
keeping-0.064
 kinds-0.063
 fright-0.062
Token
Token ID198
Feature activation+3.23e-01

Pos logit contributions
leaf+0.022
 sorts+0.019
keeping+0.019
 kinds+0.018
vous+0.018
Neg logit contributions
pit-0.010
ider-0.010
 nicely-0.010
 Later-0.009
,-0.008
TokenFinally
Token ID11158
Feature activation-1.71e+00

Pos logit contributions
leaf+0.012
 sorts+0.010
keeping+0.010
vous+0.009
 kinds+0.009
Neg logit contributions
pit-0.006
ider-0.006
 nicely-0.005
 Later-0.005
uddle-0.005
Token,
Token ID11
Feature activation-9.06e+00

Pos logit contributions
pit+0.015
ider+0.014
uddle+0.011
 nicely+0.010
bles+0.009
Neg logit contributions
leaf-0.075
 sorts-0.071
keeping-0.070
 faith-0.065
 fright-0.065
Token they
Token ID484
Feature activation-2.23e+00

Pos logit contributions
pit+0.172
,+0.155
 nicely+0.155
ider+0.149
balls+0.136
Neg logit contributions
leaf-0.327
 sorts-0.293
vous-0.278
keeping-0.275
 kinds-0.265
Token arrived
Token ID5284
Feature activation-1.18e+01

Pos logit contributions
pit+0.044
 nicely+0.039
,+0.039
ider+0.037
uddle+0.035
Neg logit contributions
leaf-0.079
 sorts-0.070
keeping-0.068
vous-0.065
 kinds-0.063
Token at
Token ID379
Feature activation-2.90e+00

Pos logit contributions
pit+0.136
ider+0.119
 nicely+0.103
uddle+0.099
,+0.088
Neg logit contributions
leaf-0.508
 sorts-0.466
keeping-0.453
vous-0.434
 kinds-0.423
Token the
Token ID262
Feature activation-4.50e-01

Pos logit contributions
 nicely+0.044
,+0.040
pit+0.036
 hockey+0.035
 gym+0.035
Neg logit contributions
leaf-0.114
vous-0.104
 shouldn-0.101
odic-0.094
don-0.094
Token man
Token ID582
Feature activation+3.40e+00

Pos logit contributions
pit+0.007
 nicely+0.006
 gym+0.006
 camps+0.005
ider+0.005
Neg logit contributions
leaf-0.018
 sorts-0.016
keeping-0.016
vous-0.015
 kinds-0.015
Tokenâ
Token ID22940
Feature activation+3.56e+00

Pos logit contributions
leaf+0.122
 sorts+0.109
keeping+0.106
vous+0.100
 kinds+0.097
Neg logit contributions
pit-0.064
ider-0.059
 nicely-0.054
uddle-0.053
,-0.050
Token
Token ID26391
Feature activation-2.53e+00

Pos logit contributions
leaf+0.098
 sorts+0.086
keeping+0.081
vous+0.075
 kinds+0.073
Neg logit contributions
pit-0.096
ider-0.091
 nicely-0.086
uddle-0.084
,-0.081
Token After
Token ID2293
Feature activation+1.57e-01

Pos logit contributions
pit+0.002
ider+0.001
 nicely+0.001
uddle+0.001
,+0.001
Neg logit contributions
leaf-0.005
 sorts-0.004
keeping-0.004
vous-0.004
 kinds-0.004
Token an
Token ID281
Feature activation+2.32e+00

Pos logit contributions
leaf+0.006
 sorts+0.005
keeping+0.005
vous+0.005
Dear+0.004
Neg logit contributions
pit-0.003
,-0.003
ider-0.003
 nicely-0.003
balls-0.003
Token hour
Token ID1711
Feature activation+2.19e+00

Pos logit contributions
leaf+0.066
 sorts+0.058
keeping+0.054
vous+0.052
 kinds+0.050
Neg logit contributions
pit-0.061
ider-0.057
uddle-0.054
 nicely-0.054
bles-0.051
Token,
Token ID11
Feature activation-9.22e+00

Pos logit contributions
leaf+0.088
 sorts+0.084
keeping+0.079
 kinds+0.075
vous+0.075
Neg logit contributions
pit-0.028
ider-0.027
uddle-0.023
 nicely-0.021
bles-0.021
Token they
Token ID484
Feature activation-2.56e+00

Pos logit contributions
,+0.166
pit+0.158
 nicely+0.157
 gym+0.139
ider+0.134
Neg logit contributions
leaf-0.345
 sorts-0.299
keeping-0.292
vous-0.289
Dear-0.272
Token arrived
Token ID5284
Feature activation-1.18e+01

Pos logit contributions
pit+0.048
 nicely+0.043
,+0.043
ider+0.040
uddle+0.038
Neg logit contributions
leaf-0.092
 sorts-0.081
keeping-0.081
vous-0.076
 kinds-0.074
Token at
Token ID379
Feature activation-2.70e+00

Pos logit contributions
pit+0.153
ider+0.146
uddle+0.117
 nicely+0.113
 camps+0.106
Neg logit contributions
leaf-0.483
 sorts-0.444
keeping-0.430
 kinds-0.400
vous-0.400
Token Steve
Token ID6542
Feature activation-7.94e-01

Pos logit contributions
 nicely+0.040
,+0.038
 hockey+0.033
 car+0.032
pit+0.032
Neg logit contributions
leaf-0.105
vous-0.095
 shouldn-0.091
ymes-0.088
Isn-0.088
Token's
Token ID338
Feature activation-1.26e+00

Pos logit contributions
pit+0.015
ider+0.014
 nicely+0.013
uddle+0.013
,+0.013
Neg logit contributions
leaf-0.028
 sorts-0.025
keeping-0.024
vous-0.023
 kinds-0.022
Token home
Token ID1363
Feature activation-4.56e+00

Pos logit contributions
pit+0.023
ider+0.020
 nicely+0.019
uddle+0.019
,+0.018
Neg logit contributions
leaf-0.047
 sorts-0.042
keeping-0.041
vous-0.039
Dear-0.037
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
pit+0.076
ider+0.069
 nicely+0.064
,+0.060
uddle+0.060
Neg logit contributions
leaf-0.176
 sorts-0.158
keeping-0.152
vous-0.147
Dear-0.141
Token was
Token ID373
Feature activation+6.70e-01

Pos logit contributions
pit+0.133
ider+0.124
 nicely+0.118
uddle+0.115
,+0.111
Neg logit contributions
leaf-0.181
 sorts-0.160
keeping-0.155
vous-0.144
 kinds-0.140
Token so
Token ID523
Feature activation+3.60e+00

Pos logit contributions
leaf+0.020
 sorts+0.018
keeping+0.017
vous+0.016
 kinds+0.016
Neg logit contributions
pit-0.016
ider-0.015
 nicely-0.014
uddle-0.014
bles-0.013
Token excited
Token ID6568
Feature activation+1.20e+00

Pos logit contributions
leaf+0.113
 sorts+0.099
keeping+0.096
vous+0.089
 kinds+0.086
Neg logit contributions
pit-0.084
ider-0.080
 nicely-0.075
uddle-0.073
,-0.071
Token when
Token ID618
Feature activation-2.76e+00

Pos logit contributions
leaf+0.044
 sorts+0.039
keeping+0.038
vous+0.036
Dear+0.035
Neg logit contributions
pit-0.022
ider-0.020
 nicely-0.019
,-0.019
uddle-0.018
Token Daddy
Token ID26926
Feature activation+2.84e+00

Pos logit contributions
pit+0.035
,+0.035
 nicely+0.031
ider+0.029
 patiently+0.027
Neg logit contributions
leaf-0.120
 sorts-0.105
vous-0.103
keeping-0.101
 kinds-0.096
Token arrived
Token ID5284
Feature activation-1.18e+01

Pos logit contributions
leaf+0.092
 sorts+0.082
keeping+0.079
vous+0.074
 kinds+0.071
Neg logit contributions
pit-0.063
ider-0.058
 nicely-0.055
uddle-0.054
,-0.053
Token.
Token ID13
Feature activation-1.29e-01

Pos logit contributions
pit+0.195
ider+0.187
uddle+0.164
 nicely+0.160
 camps+0.153
Neg logit contributions
leaf-0.430
 sorts-0.394
keeping-0.387
 kinds-0.353
vous-0.347
Token She
Token ID1375
Feature activation-5.18e+00

Pos logit contributions
pit+0.002
ider+0.002
uddle+0.002
bles+0.002
 nicely+0.002
Neg logit contributions
leaf-0.005
 sorts-0.004
keeping-0.004
vous-0.004
Dear-0.004
Token ran
Token ID4966
Feature activation-4.69e+00

Pos logit contributions
pit+0.108
,+0.098
 nicely+0.097
ider+0.096
 patiently+0.093
Neg logit contributions
leaf-0.172
keeping-0.156
 sorts-0.151
vous-0.142
icated-0.135
Token to
Token ID284
Feature activation+2.78e+00

Pos logit contributions
pit+0.104
ider+0.100
uddle+0.092
 nicely+0.088
bles+0.086
Neg logit contributions
leaf-0.154
 sorts-0.135
keeping-0.131
vous-0.119
 fright-0.118
Token him
Token ID683
Feature activation-3.74e+00

Pos logit contributions
leaf+0.096
vous+0.090
keeping+0.086
 sorts+0.086
 shouldn+0.085
Neg logit contributions
,-0.050
 nicely-0.049
 patiently-0.045
pit-0.044
 gym-0.043
Token
Token ID220
Feature activation-1.90e+00

Pos logit contributions
leaf+0.004
 sorts+0.004
keeping+0.004
 kinds+0.003
vous+0.003
Neg logit contributions
pit-0.002
ider-0.002
uddle-0.002
 nicely-0.002
 gym-0.002
Token
Token ID198
Feature activation-7.20e-02

Pos logit contributions
pit+0.035
ider+0.033
 nicely+0.031
uddle+0.030
,+0.028
Neg logit contributions
leaf-0.068
 sorts-0.061
keeping-0.059
vous-0.056
 kinds-0.055
Token
Token ID198
Feature activation-8.24e-03

Pos logit contributions
pit+0.001
 nicely+0.001
 Later+0.001
ider+0.001
,+0.001
Neg logit contributions
leaf-0.003
 sorts-0.002
keeping-0.002
 kinds-0.002
vous-0.002
TokenWhen
Token ID2215
Feature activation-2.15e+00

Pos logit contributions
pit+0.000
 nicely+0.000
ider+0.000
 Later+0.000
 gym+0.000
Neg logit contributions
leaf-0.000
 sorts-0.000
keeping-0.000
 kinds-0.000
vous-0.000
Token they
Token ID484
Feature activation-2.31e+00

Pos logit contributions
pit+0.039
ider+0.034
,+0.034
 nicely+0.034
uddle+0.031
Neg logit contributions
leaf-0.079
 sorts-0.070
keeping-0.068
vous-0.066
 kinds-0.063
Token arrived
Token ID5284
Feature activation-1.18e+01

Pos logit contributions
pit+0.046
ider+0.042
 nicely+0.040
uddle+0.038
,+0.038
Neg logit contributions
leaf-0.080
 sorts-0.071
keeping-0.070
vous-0.065
 kinds-0.063
Token,
Token ID11
Feature activation-9.51e+00

Pos logit contributions
pit+0.167
ider+0.167
uddle+0.131
 camps+0.128
balls+0.124
Neg logit contributions
leaf-0.454
 sorts-0.417
keeping-0.411
 kinds-0.374
 faith-0.373
Token she
Token ID673
Feature activation-6.25e+00

Pos logit contributions
pit+0.174
,+0.167
 nicely+0.165
 gym+0.139
ider+0.136
Neg logit contributions
leaf-0.356
 sorts-0.308
vous-0.307
keeping-0.298
 harm-0.285
Token saw
Token ID2497
Feature activation-3.82e+00

Pos logit contributions
pit+0.130
,+0.119
 nicely+0.117
ider+0.113
uddle+0.108
Neg logit contributions
leaf-0.215
keeping-0.188
 sorts-0.186
vous-0.176
 kinds-0.167
Token so
Token ID523
Feature activation+3.66e+00

Pos logit contributions
pit+0.072
,+0.071
 nicely+0.068
 gym+0.058
 hockey+0.056
Neg logit contributions
leaf-0.135
 sorts-0.120
keeping-0.119
vous-0.115
Isn-0.109
Token many
Token ID867
Feature activation+1.37e+00

Pos logit contributions
leaf+0.130
 sorts+0.118
keeping+0.112
vous+0.107
 kinds+0.105
Neg logit contributions
pit-0.070
ider-0.065
uddle-0.059
 nicely-0.058
bles-0.054
Token held
Token ID2714
Feature activation-5.74e+00

Pos logit contributions
leaf+0.023
 sorts+0.021
keeping+0.020
vous+0.019
 kinds+0.018
Neg logit contributions
pit-0.016
ider-0.015
 nicely-0.014
,-0.014
uddle-0.014
Token the
Token ID262
Feature activation-2.43e-01

Pos logit contributions
 nicely+0.098
,+0.096
pit+0.094
balls+0.082
 hockey+0.082
Neg logit contributions
leaf-0.204
 sorts-0.187
vous-0.182
keeping-0.177
Dear-0.172
Token man
Token ID582
Feature activation+3.65e+00

Pos logit contributions
pit+0.004
 car+0.004
ider+0.004
 nicely+0.004
uddle+0.003
Neg logit contributions
leaf-0.009
 sorts-0.009
vous-0.008
keeping-0.008
 kinds-0.008
Token until
Token ID1566
Feature activation-3.40e+00

Pos logit contributions
leaf+0.133
 sorts+0.118
keeping+0.115
vous+0.110
Dear+0.107
Neg logit contributions
pit-0.065
ider-0.060
 nicely-0.059
,-0.057
uddle-0.054
Token backup
Token ID11559
Feature activation-4.68e+00

Pos logit contributions
pit+0.063
ider+0.058
 nicely+0.054
uddle+0.052
,+0.051
Neg logit contributions
leaf-0.122
 sorts-0.110
keeping-0.106
vous-0.101
 kinds-0.098
Token arrived
Token ID5284
Feature activation-1.18e+01

Pos logit contributions
pit+0.085
,+0.081
ider+0.079
 nicely+0.077
balls+0.070
Neg logit contributions
leaf-0.170
 sorts-0.152
keeping-0.139
vous-0.137
 faith-0.135
Token.
Token ID13
Feature activation-1.89e-01

Pos logit contributions
pit+0.181
ider+0.161
 nicely+0.153
,+0.144
uddle+0.141
Neg logit contributions
leaf-0.469
 sorts-0.423
keeping-0.404
vous-0.397
 kinds-0.380
Token Fortunately
Token ID20525
Feature activation-1.21e+00

Pos logit contributions
pit+0.003
ider+0.003
 nicely+0.002
uddle+0.002
,+0.002
Neg logit contributions
leaf-0.007
 sorts-0.007
keeping-0.006
vous-0.006
 kinds-0.006
Token for
Token ID329
Feature activation+4.89e+00

Pos logit contributions
pit+0.013
ider+0.012
uddle+0.010
 camps+0.008
angers+0.008
Neg logit contributions
leaf-0.051
 sorts-0.049
keeping-0.047
vous-0.045
 kinds-0.045
Token the
Token ID262
Feature activation+1.73e-01

Pos logit contributions
leaf+0.159
 sorts+0.140
keeping+0.131
vous+0.129
Dear+0.123
Neg logit contributions
pit-0.105
 nicely-0.100
ider-0.100
,-0.096
 patiently-0.090
Token man
Token ID582
Feature activation+3.57e+00

Pos logit contributions
leaf+0.006
 sorts+0.006
keeping+0.005
 kinds+0.005
vous+0.005
Neg logit contributions
pit-0.003
ider-0.003
 nicely-0.003
,-0.003
 gym-0.003
Token one
Token ID530
Feature activation-2.54e+00

Pos logit contributions
pit+0.001
ider+0.001
 nicely+0.000
uddle+0.000
,+0.000
Neg logit contributions
leaf-0.001
 sorts-0.001
keeping-0.001
vous-0.001
 kinds-0.001
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
pit+0.046
ider+0.042
 nicely+0.038
uddle+0.038
bles+0.035
Neg logit contributions
leaf-0.092
 sorts-0.083
keeping-0.080
vous-0.076
 kinds-0.074
Token
Token ID198
Feature activation+3.42e-01

Pos logit contributions
leaf+0.001
 sorts+0.000
Isn+0.000
 shouldn+0.000
Dear+0.000
Neg logit contributions
pit-0.000
ider-0.000
uddle-0.000
 camps-0.000
uing-0.000
Token
Token ID198
Feature activation+3.28e-01

Pos logit contributions
leaf+0.010
 sorts+0.008
keeping+0.008
Dear+0.008
 fright+0.007
Neg logit contributions
pit-0.009
ider-0.009
uddle-0.008
 throne-0.007
 camps-0.007
Token"
Token ID1
Feature activation+5.02e+00

Pos logit contributions
leaf+0.009
Dear+0.008
keeping+0.007
 sorts+0.007
Don+0.007
Neg logit contributions
pit-0.009
 throne-0.008
ider-0.008
ingu-0.007
angers-0.007
TokenYes
Token ID5297
Feature activation+9.02e+00

Pos logit contributions
leaf+0.127
keeping+0.117
 sorts+0.115
vous+0.099
 kinds+0.097
Neg logit contributions
ider-0.141
pit-0.139
 nicely-0.134
 gym-0.121
uddle-0.121
Token,
Token ID11
Feature activation-7.97e+00

Pos logit contributions
 fright+0.272
leaf+0.271
 shouldn+0.261
 sorts+0.253
weet+0.239
Neg logit contributions
pit-0.159
ider-0.155
angers-0.145
 rink-0.136
 Later-0.128
Token it
Token ID340
Feature activation-1.28e+00

Pos logit contributions
pit+0.154
ider+0.143
 nicely+0.130
uddle+0.129
bles+0.119
Neg logit contributions
leaf-0.283
 sorts-0.254
keeping-0.245
vous-0.232
 kinds-0.226
Token is
Token ID318
Feature activation+1.86e+00

Pos logit contributions
pit+0.029
ider+0.027
 nicely+0.026
uddle+0.025
,+0.024
Neg logit contributions
leaf-0.041
 sorts-0.036
keeping-0.035
vous-0.033
 kinds-0.031
Token.
Token ID13
Feature activation-2.24e-01

Pos logit contributions
leaf+0.069
 sorts+0.061
keeping+0.060
vous+0.057
Dear+0.054
Neg logit contributions
pit-0.033
ider-0.031
 nicely-0.030
,-0.029
uddle-0.028
Token But
Token ID887
Feature activation+1.17e+00

Pos logit contributions
pit+0.005
ider+0.005
 nicely+0.005
uddle+0.004
,+0.004
Neg logit contributions
leaf-0.007
 sorts-0.006
keeping-0.006
vous-0.006
 kinds-0.005
Token<|endoftext|>
Token ID50256
Feature activation+4.82e+00

Pos logit contributions
pit+0.008
ider+0.007
uddle+0.007
bles+0.007
 nicely+0.007
Neg logit contributions
leaf-0.012
 sorts-0.011
keeping-0.010
Dear-0.010
vous-0.009
TokenTom
Token ID13787
Feature activation+3.04e-01

Pos logit contributions
leaf+0.169
 sorts+0.150
keeping+0.145
vous+0.138
 kinds+0.133
Neg logit contributions
pit-0.095
ider-0.087
 nicely-0.082
,-0.081
uddle-0.080
Token and
Token ID290
Feature activation+4.21e+00

Pos logit contributions
leaf+0.010
 sorts+0.009
keeping+0.009
vous+0.008
 kinds+0.008
Neg logit contributions
pit-0.006
ider-0.006
 nicely-0.005
uddle-0.005
,-0.005
Token Mia
Token ID32189
Feature activation+1.68e-01

Pos logit contributions
leaf+0.184
 sorts+0.168
keeping+0.165
ymes+0.165
vous+0.164
Neg logit contributions
,-0.046
 His-0.044
 One-0.031
 nicely-0.030
 He-0.029
Token are
Token ID389
Feature activation+7.12e-01

Pos logit contributions
leaf+0.006
 sorts+0.005
keeping+0.005
vous+0.004
 fright+0.004
Neg logit contributions
pit-0.004
ider-0.003
 nicely-0.003
,-0.003
uddle-0.003
Token friends
Token ID2460
Feature activation+8.11e+00

Pos logit contributions
leaf+0.026
 sorts+0.023
keeping+0.022
 kinds+0.021
vous+0.021
Neg logit contributions
pit-0.013
ider-0.012
uddle-0.011
 nicely-0.011
balls-0.010
Token.
Token ID13
Feature activation+5.43e-01

Pos logit contributions
leaf+0.293
 sorts+0.265
keeping+0.254
vous+0.241
 kinds+0.236
Neg logit contributions
pit-0.149
ider-0.138
uddle-0.125
 nicely-0.124
bles-0.115
Token They
Token ID1119
Feature activation-2.53e+00

Pos logit contributions
leaf+0.018
 sorts+0.017
 kinds+0.015
keeping+0.015
vous+0.015
Neg logit contributions
ider-0.011
pit-0.011
uddle-0.010
bles-0.010
balls-0.009
Token like
Token ID588
Feature activation+3.32e+00

Pos logit contributions
,+0.052
pit+0.052
 nicely+0.052
ider+0.048
 gym+0.046
Neg logit contributions
leaf-0.080
keeping-0.076
 sorts-0.070
vous-0.067
 faith-0.065
Token to
Token ID284
Feature activation+3.33e+00

Pos logit contributions
leaf+0.132
vous+0.125
keeping+0.125
 sorts+0.124
 shouldn+0.118
Neg logit contributions
,-0.037
 gym-0.034
 hockey-0.034
 nicely-0.034
pit-0.028
Token play
Token ID711
Feature activation-1.45e+00

Pos logit contributions
leaf+0.121
 sorts+0.109
keeping+0.106
vous+0.100
 kinds+0.097
Neg logit contributions
pit-0.061
ider-0.055
 nicely-0.052
uddle-0.050
,-0.049
Token scared
Token ID12008
Feature activation+4.88e+00

Pos logit contributions
leaf+0.133
 sorts+0.117
keeping+0.112
vous+0.105
 kinds+0.102
Neg logit contributions
pit-0.092
ider-0.088
 nicely-0.086
,-0.083
uddle-0.081
Token,
Token ID11
Feature activation-8.39e+00

Pos logit contributions
leaf+0.164
 sorts+0.148
keeping+0.143
vous+0.136
 fright+0.136
Neg logit contributions
pit-0.101
ider-0.093
uddle-0.085
 camps-0.080
bles-0.079
Token and
Token ID290
Feature activation+4.10e+00

Pos logit contributions
pit+0.115
ider+0.104
 nicely+0.099
,+0.098
balls+0.085
Neg logit contributions
leaf-0.346
 sorts-0.313
keeping-0.301
vous-0.292
 kinds-0.284
Token he
Token ID339
Feature activation-6.94e+00

Pos logit contributions
leaf+0.165
 sorts+0.149
keeping+0.143
vous+0.138
 kinds+0.134
Neg logit contributions
pit-0.060
ider-0.054
 nicely-0.054
,-0.051
uddle-0.047
Token could
Token ID714
Feature activation+3.11e+00

Pos logit contributions
pit+0.165
ider+0.156
 nicely+0.148
uddle+0.145
,+0.140
Neg logit contributions
leaf-0.214
 sorts-0.189
keeping-0.182
vous-0.169
 kinds-0.164
Token not
Token ID407
Feature activation+6.00e+00

Pos logit contributions
leaf+0.106
 sorts+0.094
keeping+0.093
 fright+0.089
 shouldn+0.086
Neg logit contributions
pit-0.064
ider-0.059
uddle-0.053
angers-0.050
 nicely-0.050
Token sleep
Token ID3993
Feature activation-2.51e+00

Pos logit contributions
leaf+0.203
 sorts+0.182
keeping+0.174
vous+0.164
 kinds+0.161
Neg logit contributions
pit-0.124
ider-0.117
 nicely-0.110
uddle-0.106
,-0.103
Token at
Token ID379
Feature activation-1.94e+00

Pos logit contributions
pit+0.049
ider+0.043
uddle+0.042
 nicely+0.038
 Later+0.038
Neg logit contributions
leaf-0.085
 sorts-0.078
keeping-0.078
vous-0.072
 fright-0.070
Token night
Token ID1755
Feature activation-3.45e+00

Pos logit contributions
pit+0.039
 nicely+0.038
,+0.036
 gym+0.034
ider+0.033
Neg logit contributions
leaf-0.069
 sorts-0.057
vous-0.055
keeping-0.055
Dear-0.053
Token.
Token ID13
Feature activation-1.95e-01

Pos logit contributions
pit+0.064
ider+0.059
uddle+0.054
 nicely+0.053
bles+0.050
Neg logit contributions
leaf-0.123
 sorts-0.112
keeping-0.109
vous-0.102
 kinds-0.100
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
pit+0.003
ider+0.003
 nicely+0.003
uddle+0.003
,+0.002
Neg logit contributions
leaf-0.007
 sorts-0.007
keeping-0.007
vous-0.006
 kinds-0.006
Token says
Token ID1139
Feature activation-2.62e+00

Pos logit contributions
pit+0.000
ider+0.000
 nicely+0.000
uddle+0.000
bles+0.000
Neg logit contributions
leaf-0.000
 sorts-0.000
keeping-0.000
vous-0.000
 kinds-0.000
Token.
Token ID13
Feature activation+2.03e-01

Pos logit contributions
pit+0.068
 camps+0.059
uddle+0.059
 rink+0.054
ider+0.054
Neg logit contributions
leaf-0.078
keeping-0.067
 sorts-0.065
 fright-0.063
 harm-0.059
Token
Token ID198
Feature activation-2.84e-02

Pos logit contributions
leaf+0.007
 sorts+0.006
keeping+0.006
Dear+0.005
 fright+0.005
Neg logit contributions
pit-0.004
ider-0.004
uddle-0.004
 camps-0.004
 throne-0.004
Token
Token ID198
Feature activation-5.71e-02

Pos logit contributions
pit+0.001
ider+0.001
uddle+0.001
 throne+0.001
 camps+0.001
Neg logit contributions
leaf-0.001
 sorts-0.001
keeping-0.001
Dear-0.001
 faith-0.001
Token"
Token ID1
Feature activation+4.32e+00

Pos logit contributions
pit+0.001
ider+0.001
 camps+0.001
 throne+0.001
uddle+0.001
Neg logit contributions
leaf-0.002
 sorts-0.001
Dear-0.001
keeping-0.001
 faith-0.001
TokenNo
Token ID2949
Feature activation+9.45e+00

Pos logit contributions
leaf+0.116
keeping+0.107
 sorts+0.107
vous+0.092
 kinds+0.091
Neg logit contributions
ider-0.114
pit-0.111
 nicely-0.108
uddle-0.101
balls-0.100
Token,
Token ID11
Feature activation-8.39e+00

Pos logit contributions
leaf+0.397
 sorts+0.382
 kinds+0.358
 fright+0.355
keeping+0.351
Neg logit contributions
ider-0.099
pit-0.096
 Later-0.071
angers-0.070
 nicely-0.070
Token it
Token ID340
Feature activation-1.80e+00

Pos logit contributions
pit+0.206
ider+0.195
,+0.195
 nicely+0.195
uddle+0.181
Neg logit contributions
leaf-0.250
 sorts-0.223
keeping-0.217
vous-0.205
Dear-0.192
Token's
Token ID338
Feature activation-7.82e-01

Pos logit contributions
pit+0.042
ider+0.040
 nicely+0.037
uddle+0.035
angers+0.035
Neg logit contributions
leaf-0.055
 sorts-0.050
keeping-0.045
vous-0.045
 kinds-0.044
Token mine
Token ID6164
Feature activation-1.08e+00

Pos logit contributions
pit+0.015
ider+0.014
 nicely+0.013
uddle+0.013
,+0.012
Neg logit contributions
leaf-0.028
 sorts-0.025
keeping-0.024
vous-0.023
 kinds-0.022
Token!"
Token ID2474
Feature activation+4.78e+00

Pos logit contributions
pit+0.034
ider+0.032
uddle+0.031
 Later+0.031
 Days+0.030
Neg logit contributions
leaf-0.023
 sorts-0.019
keeping-0.019
 kinds-0.016
vous-0.016
Tokenigator
Token ID23823
Feature activation+1.75e+00

Pos logit contributions
leaf+0.075
keeping+0.064
 faith+0.063
 shouldn+0.057
Dear+0.057
Neg logit contributions
,-0.069
pit-0.064
 at-0.057
 nicely-0.056
 gym-0.054
Token.
Token ID13
Feature activation+5.70e-01

Pos logit contributions
leaf+0.069
 sorts+0.062
keeping+0.060
vous+0.058
 kinds+0.056
Neg logit contributions
pit-0.027
ider-0.025
 nicely-0.023
,-0.023
uddle-0.022
Token It
Token ID632
Feature activation+6.63e-01

Pos logit contributions
leaf+0.021
 sorts+0.019
keeping+0.018
vous+0.017
Dear+0.017
Neg logit contributions
pit-0.011
ider-0.010
uddle-0.009
 nicely-0.009
bles-0.009
Token said
Token ID531
Feature activation-2.21e+00

Pos logit contributions
leaf+0.022
keeping+0.019
 sorts+0.019
vous+0.017
 kinds+0.017
Neg logit contributions
pit-0.014
ider-0.013
,-0.013
 nicely-0.013
uddle-0.012
Token "
Token ID366
Feature activation+4.83e+00

Pos logit contributions
pit+0.026
ider+0.020
uddle+0.018
 camps+0.016
 rink+0.016
Neg logit contributions
leaf-0.096
 sorts-0.086
keeping-0.084
 fright-0.083
 shouldn-0.082
TokenDon
Token ID3987
Feature activation+6.44e+00

Pos logit contributions
leaf+0.141
 sorts+0.135
 kinds+0.118
keeping+0.115
 outage+0.109
Neg logit contributions
His-0.095
ider-0.095
pit-0.095
 nicely-0.092
 Later-0.087
Token't
Token ID470
Feature activation+7.29e+00

Pos logit contributions
leaf+0.236
 sorts+0.221
keeping+0.212
 kinds+0.201
vous+0.193
Neg logit contributions
ider-0.106
,-0.104
 nicely-0.099
pit-0.098
 Later-0.095
Token be
Token ID307
Feature activation-3.65e-01

Pos logit contributions
leaf+0.246
 sorts+0.214
keeping+0.205
 kinds+0.187
Isn+0.183
Neg logit contributions
,-0.157
 nicely-0.147
pit-0.140
ider-0.134
 One-0.131
Token afraid
Token ID7787
Feature activation+4.95e+00

Pos logit contributions
pit+0.010
,+0.010
 nicely+0.010
ider+0.010
uddle+0.009
Neg logit contributions
leaf-0.009
 sorts-0.008
keeping-0.008
 kinds-0.007
Dear-0.007
Token,
Token ID11
Feature activation-8.46e+00

Pos logit contributions
leaf+0.235
 sorts+0.211
keeping+0.207
vous+0.198
Dear+0.196
Neg logit contributions
,-0.050
 nicely-0.037
pit-0.034
ider-0.028
 gym-0.025
Token I
Token ID314
Feature activation+6.20e+00

Pos logit contributions
,+0.159
 nicely+0.138
pit+0.120
ider+0.115
 One+0.115
Neg logit contributions
leaf-0.326
 sorts-0.293
keeping-0.285
vous-0.275
 kinds-0.259
Token Milo
Token ID32662
Feature activation+3.91e-01

Pos logit contributions
pit+0.137
,+0.127
ider+0.127
 nicely+0.126
uddle+0.112
Neg logit contributions
leaf-0.317
 sorts-0.282
keeping-0.282
vous-0.265
 faith-0.252
Token chased
Token ID26172
Feature activation-5.69e-01

Pos logit contributions
leaf+0.014
 sorts+0.013
keeping+0.012
 kinds+0.011
vous+0.011
Neg logit contributions
pit-0.007
,-0.007
 nicely-0.006
ider-0.006
uddle-0.006
Token a
Token ID257
Feature activation+2.23e+00

Pos logit contributions
pit+0.012
ider+0.011
 nicely+0.011
,+0.010
uddle+0.010
Neg logit contributions
leaf-0.019
 sorts-0.017
keeping-0.016
vous-0.016
Dear-0.015
Token butterfly
Token ID35113
Feature activation+4.42e+00

Pos logit contributions
leaf+0.073
 sorts+0.066
keeping+0.062
vous+0.060
 fright+0.058
Neg logit contributions
pit-0.048
ider-0.046
 nicely-0.041
 Later-0.039
uddle-0.039
Token into
Token ID656
Feature activation-4.13e+00

Pos logit contributions
leaf+0.150
 sorts+0.137
keeping+0.129
vous+0.124
 kinds+0.120
Neg logit contributions
pit-0.089
ider-0.084
uddle-0.076
 nicely-0.076
bles-0.071
Token a
Token ID257
Feature activation+2.18e+00

Pos logit contributions
pit+0.077
 nicely+0.075
,+0.074
 hockey+0.072
 gym+0.069
Neg logit contributions
leaf-0.145
keeping-0.122
 sorts-0.118
 shouldn-0.117
vous-0.117
Token me
Token ID502
Feature activation+7.01e-01

Pos logit contributions
leaf+0.075
 sorts+0.066
keeping+0.065
vous+0.060
 kinds+0.059
Neg logit contributions
pit-0.045
ider-0.041
 nicely-0.039
uddle-0.039
,-0.037
Tokenadow
Token ID4584
Feature activation-4.04e+00

Pos logit contributions
leaf+0.021
 sorts+0.018
keeping+0.017
vous+0.015
Dear+0.015
Neg logit contributions
pit-0.018
ider-0.017
 nicely-0.016
uddle-0.016
,-0.016
Token.
Token ID13
Feature activation+3.23e-01

Pos logit contributions
pit+0.062
ider+0.057
,+0.051
 nicely+0.051
uddle+0.046
Neg logit contributions
leaf-0.165
 sorts-0.145
keeping-0.141
 faith-0.134
vous-0.134
Token He
Token ID679
Feature activation-6.13e+00

Pos logit contributions
leaf+0.010
 sorts+0.009
keeping+0.008
vous+0.008
 kinds+0.008
Neg logit contributions
pit-0.008
ider-0.007
bles-0.007
uddle-0.007
ooming-0.007
Token disappeared
Token ID12120
Feature activation-7.85e+00

Pos logit contributions
pit+0.123
,+0.111
ider+0.110
 nicely+0.109
uddle+0.107
Neg logit contributions
leaf-0.214
keeping-0.188
 sorts-0.186
vous-0.171
Dear-0.167
Token inside
Token ID2641
Feature activation-6.32e+00

Pos logit contributions
pit+0.090
ider+0.081
 nicely+0.078
uddle+0.075
,+0.075
Neg logit contributions
leaf-0.162
 sorts-0.144
keeping-0.140
vous-0.134
 kinds-0.129
Token to
Token ID284
Feature activation+3.17e+00

Pos logit contributions
pit+0.120
,+0.111
 nicely+0.110
ider+0.107
 patiently+0.100
Neg logit contributions
leaf-0.225
keeping-0.197
 sorts-0.194
vous-0.187
Dear-0.177
Token find
Token ID1064
Feature activation-2.28e+00

Pos logit contributions
leaf+0.108
keeping+0.095
 sorts+0.095
vous+0.090
ymes+0.088
Neg logit contributions
pit-0.058
,-0.056
 nicely-0.056
 gym-0.051
 Later-0.050
Token something
Token ID1223
Feature activation+1.13e-01

Pos logit contributions
,+0.040
 nicely+0.037
pit+0.036
 His+0.034
 camps+0.032
Neg logit contributions
leaf-0.079
keeping-0.075
 sorts-0.070
vous-0.068
 shouldn-0.066
Token to
Token ID284
Feature activation+3.12e+00

Pos logit contributions
leaf+0.004
 sorts+0.004
keeping+0.003
vous+0.003
 kinds+0.003
Neg logit contributions
pit-0.002
ider-0.002
 nicely-0.002
uddle-0.002
,-0.002
Token wipe
Token ID19916
Feature activation-4.27e-01

Pos logit contributions
leaf+0.100
keeping+0.089
 sorts+0.089
vous+0.082
Isn+0.079
Neg logit contributions
pit-0.063
 nicely-0.062
,-0.062
ider-0.059
uddle-0.056
Token her
Token ID607
Feature activation-4.10e+00

Pos logit contributions
,+0.007
 nicely+0.006
pit+0.006
 until+0.005
erve+0.005
Neg logit contributions
leaf-0.017
vous-0.014
 sorts-0.014
keeping-0.014
Dear-0.014
Token hands
Token ID2832
Feature activation+6.56e+00

Pos logit contributions
uddle+0.053
,+0.049
pit+0.049
 nicely+0.042
balls+0.042
Neg logit contributions
leaf-0.174
 sorts-0.163
vous-0.150
icated-0.148
weet-0.143
Token with
Token ID351
Feature activation-6.95e-01

Pos logit contributions
leaf+0.260
 sorts+0.230
keeping+0.223
vous+0.218
 faith+0.210
Neg logit contributions
pit-0.100
 nicely-0.093
ider-0.090
,-0.090
uddle-0.079
Token.
Token ID13
Feature activation+2.47e-01

Pos logit contributions
,+0.013
 nicely+0.011
 until+0.010
 hockey+0.010
 One+0.009
Neg logit contributions
leaf-0.026
vous-0.023
 sorts-0.023
keeping-0.022
Dear-0.021
Token As
Token ID1081
Feature activation-1.42e+00

Pos logit contributions
leaf+0.009
 sorts+0.008
keeping+0.008
 kinds+0.007
vous+0.007
Neg logit contributions
pit-0.005
ider-0.005
uddle-0.004
bles-0.004
 nicely-0.004
Token.
Token ID13
Feature activation-2.37e-01

Pos logit contributions
leaf+0.013
 sorts+0.012
keeping+0.011
vous+0.011
 faith+0.011
Neg logit contributions
pit-0.005
ider-0.004
,-0.004
 nicely-0.004
balls-0.003
Token We
Token ID775
Feature activation+7.31e+00

Pos logit contributions
pit+0.003
ider+0.003
 nicely+0.003
uddle+0.003
,+0.003
Neg logit contributions
leaf-0.009
 sorts-0.009
keeping-0.008
vous-0.008
 kinds-0.008
Token made
Token ID925
Feature activation-3.32e-01

Pos logit contributions
leaf+0.233
keeping+0.213
 sorts+0.200
vous+0.183
Isn+0.182
Neg logit contributions
,-0.164
 nicely-0.158
pit-0.148
 One-0.143
ider-0.140
Token him
Token ID683
Feature activation-3.26e+00

Pos logit contributions
,+0.006
 nicely+0.006
 hockey+0.006
balls+0.006
uddle+0.005
Neg logit contributions
leaf-0.012
keeping-0.011
 sorts-0.011
vous-0.010
Isn-0.010
Token together
Token ID1978
Feature activation-4.37e+00

Pos logit contributions
pit+0.063
 nicely+0.059
,+0.058
ider+0.057
uddle+0.056
Neg logit contributions
leaf-0.116
 sorts-0.100
keeping-0.100
vous-0.094
Isn-0.093
Token.
Token ID13
Feature activation-4.21e-01

Pos logit contributions
pit+0.070
 nicely+0.068
,+0.067
ider+0.062
 patiently+0.054
Neg logit contributions
leaf-0.173
keeping-0.150
 sorts-0.150
vous-0.145
 shouldn-0.136
Token We
Token ID775
Feature activation+7.23e+00

Pos logit contributions
pit+0.010
ider+0.010
 nicely+0.009
uddle+0.009
,+0.009
Neg logit contributions
leaf-0.013
 sorts-0.011
keeping-0.011
vous-0.010
 kinds-0.010
Token should
Token ID815
Feature activation+7.16e+00

Pos logit contributions
leaf+0.230
keeping+0.211
 sorts+0.195
vous+0.179
weet+0.178
Neg logit contributions
,-0.164
 nicely-0.159
pit-0.149
ider-0.141
 One-0.141
Token share
Token ID2648
Feature activation-2.21e+00

Pos logit contributions
leaf+0.239
 sorts+0.214
keeping+0.207
vous+0.194
 kinds+0.189
Neg logit contributions
pit-0.151
ider-0.141
uddle-0.129
 nicely-0.127
bles-0.120
Token him
Token ID683
Feature activation-3.21e+00

Pos logit contributions
,+0.036
 nicely+0.035
pit+0.029
 hockey+0.028
 camps+0.026
Neg logit contributions
leaf-0.089
vous-0.079
 sorts-0.079
keeping-0.075
 shouldn-0.071
Token."
Token ID526
Feature activation+4.38e+00

Pos logit contributions
pit+0.049
 nicely+0.048
ider+0.045
,+0.043
uddle+0.041
Neg logit contributions
leaf-0.128
 sorts-0.115
keeping-0.111
vous-0.107
 shouldn-0.104
Token special
Token ID2041
Feature activation+1.05e+00

Pos logit contributions
leaf+0.088
 sorts+0.077
keeping+0.076
vous+0.069
Dear+0.068
Neg logit contributions
pit-0.060
ider-0.057
 nicely-0.055
uddle-0.053
,-0.051
Token."
Token ID526
Feature activation+3.84e+00

Pos logit contributions
leaf+0.039
 sorts+0.034
vous+0.034
keeping+0.033
Dear+0.031
Neg logit contributions
,-0.019
pit-0.019
ider-0.017
 nicely-0.016
balls-0.015
Token Fl
Token ID1610
Feature activation-3.08e+00

Pos logit contributions
leaf+0.130
 sorts+0.117
keeping+0.114
vous+0.104
 kinds+0.103
Neg logit contributions
pit-0.079
ider-0.074
uddle-0.066
 nicely-0.065
bles-0.062
Tokenuffy
Token ID15352
Feature activation+1.69e+00

Pos logit contributions
pit+0.046
ider+0.035
 gym+0.035
 nicely+0.035
 camps+0.034
Neg logit contributions
leaf-0.109
 sorts-0.101
keeping-0.098
vous-0.097
 shouldn-0.094
Token and
Token ID290
Feature activation+3.94e+00

Pos logit contributions
leaf+0.053
 sorts+0.046
keeping+0.045
vous+0.042
 kinds+0.040
Neg logit contributions
pit-0.040
ider-0.037
 nicely-0.035
uddle-0.034
,-0.033
Token Lucy
Token ID22162
Feature activation+4.68e-01

Pos logit contributions
leaf+0.153
 sorts+0.138
keeping+0.134
vous+0.128
 kinds+0.124
Neg logit contributions
pit-0.063
ider-0.056
 nicely-0.052
uddle-0.050
,-0.048
Token were
Token ID547
Feature activation-7.55e-01

Pos logit contributions
leaf+0.015
 sorts+0.013
keeping+0.013
vous+0.012
 kinds+0.011
Neg logit contributions
pit-0.011
ider-0.010
 nicely-0.010
,-0.009
uddle-0.009
Token very
Token ID845
Feature activation+3.05e+00

Pos logit contributions
pit+0.016
ider+0.015
 nicely+0.015
,+0.014
uddle+0.014
Neg logit contributions
leaf-0.025
 sorts-0.022
keeping-0.021
vous-0.020
 kinds-0.019
Token happy
Token ID3772
Feature activation+2.16e+00

Pos logit contributions
leaf+0.098
 sorts+0.087
keeping+0.084
vous+0.078
 kinds+0.076
Neg logit contributions
pit-0.069
ider-0.065
 nicely-0.061
uddle-0.060
,-0.057
Token and
Token ID290
Feature activation+3.81e+00

Pos logit contributions
leaf+0.086
 sorts+0.077
keeping+0.074
vous+0.074
Dear+0.070
Neg logit contributions
pit-0.033
,-0.030
ider-0.030
 nicely-0.028
uddle-0.026
Token went
Token ID1816
Feature activation-5.37e+00

Pos logit contributions
leaf+0.135
 sorts+0.119
keeping+0.117
vous+0.113
Dear+0.108
Neg logit contributions
,-0.067
 nicely-0.065
pit-0.064
ider-0.062
balls-0.060
Token pony
Token ID26902
Feature activation+2.51e+00

Pos logit contributions
leaf+0.055
 sorts+0.050
keeping+0.049
vous+0.045
 kinds+0.044
Neg logit contributions
pit-0.028
ider-0.026
 nicely-0.024
,-0.023
uddle-0.023
Token asked
Token ID1965
Feature activation-1.81e+00

Pos logit contributions
leaf+0.077
 sorts+0.068
keeping+0.065
vous+0.061
 kinds+0.059
Neg logit contributions
pit-0.059
ider-0.056
 nicely-0.052
uddle-0.052
bles-0.049
Token L
Token ID406
Feature activation+2.96e+00

Pos logit contributions
pit+0.046
ider+0.044
uddle+0.041
 rink+0.037
 camps+0.037
Neg logit contributions
leaf-0.051
 sorts-0.049
 kinds-0.043
keeping-0.042
 fright-0.041
Tokenizzy
Token ID40593
Feature activation+2.56e+00

Pos logit contributions
leaf+0.081
 sorts+0.071
keeping+0.067
vous+0.063
 kinds+0.060
Neg logit contributions
pit-0.080
ider-0.076
 nicely-0.072
uddle-0.071
,-0.068
Token,
Token ID11
Feature activation-8.35e+00

Pos logit contributions
leaf+0.085
 sorts+0.077
keeping+0.072
 kinds+0.068
vous+0.068
Neg logit contributions
pit-0.053
ider-0.052
uddle-0.045
 nicely-0.041
 Later-0.041
Token "
Token ID366
Feature activation+4.80e+00

Pos logit contributions
pit+0.077
ider+0.067
 nicely+0.059
,+0.050
uddle+0.049
Neg logit contributions
leaf-0.376
 sorts-0.347
keeping-0.341
vous-0.325
 kinds-0.318
TokenDo
Token ID5211
Feature activation+3.58e+00

Pos logit contributions
leaf+0.142
 sorts+0.134
keeping+0.128
 kinds+0.117
vous+0.117
Neg logit contributions
pit-0.119
ider-0.109
 nicely-0.105
 camps-0.100
 patiently-0.096
Token you
Token ID345
Feature activation+5.52e+00

Pos logit contributions
leaf+0.088
 sorts+0.076
keeping+0.074
vous+0.065
 kinds+0.062
Neg logit contributions
pit-0.105
ider-0.102
 nicely-0.099
,-0.096
uddle-0.096
Token want
Token ID765
Feature activation+6.50e+00

Pos logit contributions
keeping+0.196
leaf+0.191
ymes+0.163
 sorts+0.163
Congratulations+0.155
Neg logit contributions
,-0.116
 One-0.099
 nicely-0.094
ider-0.082
 at-0.081
Token to
Token ID284
Feature activation+3.34e+00

Pos logit contributions
keeping+0.257
leaf+0.257
 sorts+0.250
vous+0.237
lling+0.224
Neg logit contributions
,-0.092
 nicely-0.076
 hockey-0.069
 One-0.066
 His-0.058
Token take
Token ID1011
Feature activation+4.14e-01

Pos logit contributions
leaf+0.117
 sorts+0.105
keeping+0.103
vous+0.097
 kinds+0.093
Neg logit contributions
pit-0.063
ider-0.059
 nicely-0.059
,-0.058
uddle-0.054
Token a
Token ID257
Feature activation+2.00e+00

Pos logit contributions
pit+0.033
ider+0.031
 nicely+0.030
uddle+0.029
,+0.028
Neg logit contributions
leaf-0.040
 sorts-0.035
keeping-0.034
vous-0.032
 kinds-0.030
Token nap
Token ID25422
Feature activation-4.44e+00

Pos logit contributions
leaf+0.076
keeping+0.071
 sorts+0.069
vous+0.065
 kinds+0.064
Neg logit contributions
uddle-0.029
pit-0.029
,-0.029
 nicely-0.028
ider-0.026
Token under
Token ID739
Feature activation-1.43e+00

Pos logit contributions
pit+0.076
ider+0.070
 nicely+0.065
uddle+0.062
,+0.060
Neg logit contributions
leaf-0.168
 sorts-0.150
keeping-0.145
vous-0.139
 kinds-0.134
Token the
Token ID262
Feature activation+4.59e-01

Pos logit contributions
,+0.024
pit+0.022
 nicely+0.021
 hockey+0.020
 camps+0.020
Neg logit contributions
leaf-0.057
 sorts-0.049
keeping-0.047
vous-0.046
Dear-0.046
Token tree
Token ID5509
Feature activation-9.91e-01

Pos logit contributions
leaf+0.017
 sorts+0.016
keeping+0.015
vous+0.014
 kinds+0.014
Neg logit contributions
pit-0.008
,-0.007
uddle-0.007
 nicely-0.007
 camps-0.007
Token when
Token ID618
Feature activation-2.23e+00

Pos logit contributions
pit+0.017
ider+0.016
uddle+0.015
 nicely+0.015
bles+0.014
Neg logit contributions
leaf-0.036
 sorts-0.033
keeping-0.032
vous-0.030
 kinds-0.029
Token he
Token ID339
Feature activation-6.87e+00

Pos logit contributions
pit+0.025
ider+0.021
 nicely+0.020
,+0.019
 gym+0.017
Neg logit contributions
leaf-0.099
 sorts-0.089
keeping-0.086
vous-0.083
Dear-0.081
Token felt
Token ID2936
Feature activation+8.84e-01

Pos logit contributions
pit+0.147
ider+0.136
 nicely+0.131
uddle+0.129
,+0.128
Neg logit contributions
leaf-0.232
 sorts-0.201
keeping-0.198
vous-0.180
 kinds-0.177
Token hot
Token ID3024
Feature activation+5.88e-01

Pos logit contributions
leaf+0.032
keeping+0.028
 sorts+0.028
vous+0.025
 kinds+0.025
Neg logit contributions
 nicely-0.016
,-0.016
pit-0.015
ider-0.015
uddle-0.014
Token and
Token ID290
Feature activation+4.12e+00

Pos logit contributions
leaf+0.022
 sorts+0.020
keeping+0.019
vous+0.019
 kinds+0.018
Neg logit contributions
pit-0.010
ider-0.009
,-0.008
 nicely-0.008
uddle-0.008
Token dry
Token ID5894
Feature activation-1.32e+00

Pos logit contributions
leaf+0.160
keeping+0.135
 sorts+0.131
vous+0.122
ymes+0.121
Neg logit contributions
,-0.069
 nicely-0.068
pit-0.062
ider-0.060
 patiently-0.059
Token to
Token ID284
Feature activation+2.92e+00

Pos logit contributions
leaf+0.083
 sorts+0.074
vous+0.073
keeping+0.070
Dear+0.067
Neg logit contributions
,-0.032
pit-0.032
 nicely-0.030
ider-0.029
balls-0.027
Token be
Token ID307
Feature activation-6.01e-01

Pos logit contributions
leaf+0.088
 sorts+0.078
keeping+0.075
vous+0.070
 kinds+0.068
Neg logit contributions
pit-0.072
ider-0.068
uddle-0.063
 nicely-0.062
bles-0.059
Token together
Token ID1978
Feature activation-4.62e+00

Pos logit contributions
pit+0.012
 nicely+0.011
ider+0.011
,+0.011
 patiently+0.010
Neg logit contributions
leaf-0.021
 sorts-0.018
keeping-0.018
vous-0.017
icated-0.017
Token.
Token ID13
Feature activation+5.94e-02

Pos logit contributions
pit+0.066
 nicely+0.061
,+0.061
ider+0.060
 patiently+0.051
Neg logit contributions
leaf-0.190
 sorts-0.168
keeping-0.166
vous-0.161
Dear-0.151
Token<|endoftext|>
Token ID50256
Feature activation+4.56e+00

Pos logit contributions
leaf+0.002
keeping+0.002
 sorts+0.002
vous+0.002
ymes+0.002
Neg logit contributions
pit-0.001
 Later-0.001
 nicely-0.001
,-0.001
 One-0.001
TokenOnce
Token ID7454
Feature activation-6.42e+00

Pos logit contributions
leaf+0.182
 sorts+0.162
keeping+0.154
vous+0.152
 kinds+0.147
Neg logit contributions
pit-0.068
,-0.062
 nicely-0.060
ider-0.058
uddle-0.055
Token there
Token ID612
Feature activation-4.02e+00

Pos logit contributions
pit+0.164
ider+0.164
uddle+0.139
balls+0.137
erve+0.137
Neg logit contributions
leaf-0.172
 sorts-0.163
keeping-0.140
 kinds-0.140
 shouldn-0.138
Token was
Token ID373
Feature activation+1.06e+00

Pos logit contributions
pit+0.102
ider+0.100
uddle+0.091
 nicely+0.090
bles+0.087
Neg logit contributions
leaf-0.114
 sorts-0.102
keeping-0.096
vous-0.090
 kinds-0.088
Token a
Token ID257
Feature activation+2.08e+00

Pos logit contributions
leaf+0.033
 sorts+0.029
keeping+0.029
vous+0.026
 fright+0.025
Neg logit contributions
pit-0.024
ider-0.022
 nicely-0.022
,-0.022
 camps-0.021
Token town
Token ID3240
Feature activation-1.54e+00

Pos logit contributions
leaf+0.061
 sorts+0.053
vous+0.050
Dear+0.048
 fright+0.047
Neg logit contributions
pit-0.051
ider-0.048
angers-0.044
 Later-0.043
iked-0.043
Token with
Token ID351
Feature activation-3.98e-01

Pos logit contributions
pit+0.026
ider+0.024
uddle+0.021
 nicely+0.021
bles+0.020
Neg logit contributions
leaf-0.058
 sorts-0.053
keeping-0.051
vous-0.048
 kinds-0.048
Token land
Token ID1956
Feature activation-4.62e+00

Pos logit contributions
pit+0.046
,+0.044
 nicely+0.043
 camps+0.041
 hockey+0.038
Neg logit contributions
leaf-0.091
keeping-0.077
 sorts-0.077
vous-0.074
Dear-0.071
Token.
Token ID13
Feature activation+5.16e-01

Pos logit contributions
pit+0.085
ider+0.077
 nicely+0.073
,+0.072
uddle+0.069
Neg logit contributions
leaf-0.171
 sorts-0.153
keeping-0.146
vous-0.140
Dear-0.136
Token Suddenly
Token ID24975
Feature activation-5.72e+00

Pos logit contributions
leaf+0.015
 sorts+0.014
keeping+0.014
vous+0.013
Isn+0.012
Neg logit contributions
pit-0.012
ider-0.012
bles-0.011
uddle-0.011
uing-0.010
Token,
Token ID11
Feature activation-8.57e+00

Pos logit contributions
pit+0.089
ider+0.081
uddle+0.069
uing+0.060
angers+0.059
Neg logit contributions
leaf-0.211
 sorts-0.200
 faith-0.191
 fright-0.189
 shouldn-0.188
Token he
Token ID339
Feature activation-7.12e+00

Pos logit contributions
pit+0.091
ider+0.080
,+0.075
 nicely+0.075
 camps+0.063
Neg logit contributions
leaf-0.375
 sorts-0.345
keeping-0.335
vous-0.322
 kinds-0.315
Token saw
Token ID2497
Feature activation-2.98e+00

Pos logit contributions
pit+0.180
ider+0.170
 nicely+0.160
uddle+0.159
,+0.153
Neg logit contributions
leaf-0.209
 sorts-0.184
keeping-0.176
vous-0.163
 kinds-0.158
Token a
Token ID257
Feature activation+2.14e+00

Pos logit contributions
pit+0.056
,+0.056
 nicely+0.053
 hockey+0.050
 camps+0.049
Neg logit contributions
leaf-0.104
 sorts-0.092
keeping-0.092
vous-0.083
Dear-0.081
Token storm
Token ID6388
Feature activation-3.62e-01

Pos logit contributions
leaf+0.082
keeping+0.076
 sorts+0.073
vous+0.069
 kinds+0.067
Neg logit contributions
pit-0.036
uddle-0.033
,-0.033
 nicely-0.031
 camps-0.030
Token coming
Token ID2406
Feature activation-3.21e+00

Pos logit contributions
pit+0.009
ider+0.008
,+0.008
 nicely+0.008
uddle+0.007
Neg logit contributions
leaf-0.011
 sorts-0.010
keeping-0.009
vous-0.009
 faith-0.008
Token.
Token ID13
Feature activation+7.27e-01

Pos logit contributions
,+0.058
pit+0.056
ider+0.055
 nicely+0.053
 camps+0.049
Neg logit contributions
leaf-0.116
 sorts-0.103
keeping-0.100
vous-0.095
 faith-0.094
Token
Token ID198
Feature activation-6.23e-02

Pos logit contributions
leaf+0.026
 sorts+0.024
keeping+0.023
vous+0.022
 fright+0.021
Neg logit contributions
pit-0.014
ider-0.013
 nicely-0.011
uddle-0.011
bles-0.011
Token.
Token ID13
Feature activation+4.31e-01

Pos logit contributions
pit+0.030
ider+0.029
uddle+0.026
angers+0.024
balls+0.024
Neg logit contributions
leaf-0.050
 sorts-0.050
keeping-0.045
 kinds-0.045
vous-0.043
Token
Token ID198
Feature activation-1.11e-01

Pos logit contributions
leaf+0.015
 sorts+0.014
keeping+0.013
vous+0.013
 kinds+0.013
Neg logit contributions
pit-0.008
ider-0.008
bles-0.008
uddle-0.007
ooming-0.007
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
pit+0.002
ider+0.002
 nicely+0.002
uddle+0.002
,+0.002
Neg logit contributions
leaf-0.004
 sorts-0.004
keeping-0.003
vous-0.003
 kinds-0.003
TokenThe
Token ID464
Feature activation+1.35e+00

Pos logit contributions
pit+0.002
ider+0.002
uddle+0.002
 nicely+0.002
bles+0.002
Neg logit contributions
leaf-0.004
 sorts-0.003
keeping-0.003
vous-0.003
 kinds-0.003
Token boy
Token ID2933
Feature activation+5.11e+00

Pos logit contributions
leaf+0.037
 sorts+0.032
vous+0.031
keeping+0.029
 fright+0.029
Neg logit contributions
pit-0.035
ider-0.032
angers-0.030
uing-0.030
inn-0.030
Token eagerly
Token ID30130
Feature activation-1.46e+00

Pos logit contributions
leaf+0.142
 sorts+0.133
vous+0.119
keeping+0.119
 kinds+0.116
Neg logit contributions
pit-0.130
ider-0.127
uddle-0.117
 nicely-0.115
bles-0.112
Token walked
Token ID6807
Feature activation-7.68e+00

Pos logit contributions
ider+0.038
pit+0.038
uddle+0.036
 nicely+0.034
angers+0.034
Neg logit contributions
leaf-0.040
 sorts-0.037
keeping-0.033
vous-0.032
 kinds-0.032
Token through
Token ID832
Feature activation-3.75e+00

Pos logit contributions
pit+0.189
ider+0.183
uddle+0.174
 nicely+0.156
bles+0.155
Neg logit contributions
leaf-0.232
 sorts-0.212
keeping-0.198
 kinds-0.183
vous-0.180
Token the
Token ID262
Feature activation+8.12e-02

Pos logit contributions
pit+0.095
ider+0.088
 nicely+0.086
,+0.084
uddle+0.082
Neg logit contributions
leaf-0.111
 sorts-0.094
keeping-0.091
vous-0.087
Dear-0.083
Token broken
Token ID5445
Feature activation-5.07e-01

Pos logit contributions
 sorts+0.002
leaf+0.002
 kinds+0.002
vous+0.002
keeping+0.002
Neg logit contributions
ider-0.002
pit-0.002
uing-0.002
bles-0.002
balls-0.002
Token door
Token ID3420
Feature activation-2.19e+00

Pos logit contributions
ider+0.013
pit+0.012
 nicely+0.011
balls+0.011
uing+0.011
Neg logit contributions
 sorts-0.013
leaf-0.013
keeping-0.012
 kinds-0.012
 fright-0.011
Token,
Token ID11
Feature activation-8.55e+00

Pos logit contributions
pit+0.034
ider+0.029
uddle+0.027
uing+0.023
aj+0.023
Neg logit contributions
leaf-0.056
 sorts-0.051
 shouldn-0.048
 kinds-0.047
 fright-0.047
Token he
Token ID339
Feature activation-7.40e+00

Pos logit contributions
pit+0.119
ider+0.108
 nicely+0.097
uddle+0.092
,+0.087
Neg logit contributions
leaf-0.349
 sorts-0.318
keeping-0.310
vous-0.295
 kinds-0.287
Token went
Token ID1816
Feature activation-5.08e+00

Pos logit contributions
pit+0.167
ider+0.156
 nicely+0.147
uddle+0.144
,+0.138
Neg logit contributions
leaf-0.238
 sorts-0.211
keeping-0.203
vous-0.190
 kinds-0.184
Token to
Token ID284
Feature activation+3.14e+00

Pos logit contributions
pit+0.153
ider+0.142
uddle+0.138
 Days+0.133
uing+0.130
Neg logit contributions
leaf-0.133
 sorts-0.107
keeping-0.096
Dear-0.086
 kinds-0.084
Token a
Token ID257
Feature activation+2.12e+00

Pos logit contributions
leaf+0.100
 sorts+0.085
keeping+0.084
vous+0.081
 shouldn+0.076
Neg logit contributions
pit-0.069
 nicely-0.065
,-0.063
 gym-0.062
ider-0.060
Token toy
Token ID13373
Feature activation-6.84e-01

Pos logit contributions
leaf+0.065
 sorts+0.060
vous+0.054
 fright+0.052
Dear+0.051
Neg logit contributions
ider-0.050
pit-0.049
bles-0.044
balls-0.041
angers-0.041
Token store
Token ID3650
Feature activation-2.51e+00

Pos logit contributions
pit+0.012
ider+0.012
bles+0.011
aj+0.011
uing+0.011
Neg logit contributions
leaf-0.024
 sorts-0.022
keeping-0.020
Dear-0.020
vous-0.019
Token and
Token ID290
Feature activation+3.93e+00

Pos logit contributions
pit+0.053
ider+0.050
uddle+0.049
 Days+0.044
 camps+0.043
Neg logit contributions
leaf-0.081
 sorts-0.075
keeping-0.073
 kinds-0.068
vous-0.065
Token filled
Token ID5901
Feature activation-4.34e+00

Pos logit contributions
leaf+0.123
 sorts+0.110
keeping+0.106
vous+0.099
 kinds+0.095
Neg logit contributions
pit-0.091
ider-0.088
uddle-0.081
 nicely-0.077
bles-0.076
Token his
Token ID465
Feature activation-4.44e+00

Pos logit contributions
pit+0.089
ider+0.084
 nicely+0.076
uddle+0.076
,+0.071
Neg logit contributions
leaf-0.148
 sorts-0.133
keeping-0.128
vous-0.120
 kinds-0.117
Token car
Token ID1097
Feature activation-1.84e+00

Pos logit contributions
pit+0.072
ider+0.068
 nicely+0.058
uddle+0.054
bles+0.054
Neg logit contributions
leaf-0.167
 sorts-0.153
keeping-0.149
vous-0.140
 kinds-0.137
Token use
Token ID779
Feature activation-1.11e+00

Pos logit contributions
leaf+0.082
 sorts+0.074
keeping+0.071
vous+0.068
 fright+0.066
Neg logit contributions
pit-0.071
ider-0.067
uddle-0.064
balls-0.061
angers-0.061
Token it
Token ID340
Feature activation-1.62e+00

Pos logit contributions
,+0.024
 nicely+0.022
pit+0.021
 until+0.019
 patiently+0.019
Neg logit contributions
leaf-0.039
vous-0.034
 sorts-0.032
keeping-0.031
Dear-0.031
Token again
Token ID757
Feature activation-1.98e+00

Pos logit contributions
pit+0.033
ider+0.030
uddle+0.028
 nicely+0.027
 Later+0.026
Neg logit contributions
leaf-0.054
 sorts-0.049
keeping-0.048
vous-0.045
 kinds-0.043
Token.
Token ID13
Feature activation+1.88e-01

Pos logit contributions
pit+0.028
,+0.026
 nicely+0.025
ider+0.025
 patiently+0.020
Neg logit contributions
leaf-0.083
 sorts-0.072
vous-0.069
keeping-0.069
Dear-0.065
Token They
Token ID1119
Feature activation-2.18e+00

Pos logit contributions
leaf+0.007
 sorts+0.007
keeping+0.006
vous+0.006
 kinds+0.006
Neg logit contributions
pit-0.003
ider-0.003
 nicely-0.003
uddle-0.002
 Later-0.002
Token buried
Token ID11694
Feature activation-7.13e+00

Pos logit contributions
pit+0.044
 nicely+0.043
,+0.042
 patiently+0.040
ider+0.038
Neg logit contributions
leaf-0.073
keeping-0.065
 sorts-0.062
vous-0.061
icated-0.058
Token it
Token ID340
Feature activation-1.53e+00

Pos logit contributions
,+0.139
 nicely+0.135
pit+0.134
 until+0.113
 patiently+0.110
Neg logit contributions
leaf-0.265
vous-0.242
Dear-0.223
 sorts-0.219
weet-0.211
Token with
Token ID351
Feature activation-6.13e-01

Pos logit contributions
pit+0.028
ider+0.026
uddle+0.024
 nicely+0.024
 Later+0.022
Neg logit contributions
leaf-0.055
 sorts-0.050
keeping-0.048
vous-0.045
 kinds-0.044
Token Lily
Token ID20037
Feature activation+1.44e+00

Pos logit contributions
,+0.011
 nicely+0.010
 until+0.008
pit+0.008
 towers+0.008
Neg logit contributions
leaf-0.024
vous-0.021
keeping-0.020
Dear-0.020
 outage-0.019
Token and
Token ID290
Feature activation+4.25e+00

Pos logit contributions
leaf+0.055
 sorts+0.049
keeping+0.047
vous+0.045
Dear+0.044
Neg logit contributions
pit-0.025
ider-0.022
,-0.022
 nicely-0.022
uddle-0.020
Token hoped
Token ID10719
Feature activation+1.07e+00

Pos logit contributions
leaf+0.160
vous+0.135
 sorts+0.134
keeping+0.128
Isn+0.122
Neg logit contributions
pit-0.076
,-0.074
 nicely-0.071
 patiently-0.065
oner-0.063
Token care
Token ID1337
Feature activation+1.12e+00

Pos logit contributions
leaf+0.011
 sorts+0.009
keeping+0.009
vous+0.009
Dear+0.008
Neg logit contributions
 nicely-0.008
,-0.008
pit-0.008
ider-0.008
 hockey-0.007
Token of
Token ID286
Feature activation+2.52e+00

Pos logit contributions
leaf+0.033
 sorts+0.028
keeping+0.028
vous+0.026
Dear+0.026
Neg logit contributions
pit-0.027
 nicely-0.027
ider-0.026
,-0.026
bles-0.024
Token it
Token ID340
Feature activation-1.46e+00

Pos logit contributions
leaf+0.089
vous+0.076
 sorts+0.074
Dear+0.074
keeping+0.071
Neg logit contributions
 nicely-0.051
,-0.050
pit-0.048
 hockey-0.043
 patiently-0.043
Token,
Token ID11
Feature activation-8.21e+00

Pos logit contributions
pit+0.027
ider+0.025
uddle+0.023
 nicely+0.022
bles+0.021
Neg logit contributions
leaf-0.052
 sorts-0.047
keeping-0.045
vous-0.043
 kinds-0.042
Token so
Token ID523
Feature activation+4.56e+00

Pos logit contributions
,+0.133
pit+0.130
 nicely+0.127
 until+0.118
 patiently+0.117
Neg logit contributions
leaf-0.304
vous-0.269
keeping-0.269
ymes-0.266
 sorts-0.265
Token he
Token ID339
Feature activation-6.91e+00

Pos logit contributions
leaf+0.198
 sorts+0.180
keeping+0.177
vous+0.169
Dear+0.164
Neg logit contributions
pit-0.051
ider-0.045
 nicely-0.043
,-0.039
uddle-0.035
Token could
Token ID714
Feature activation+3.32e+00

Pos logit contributions
pit+0.140
ider+0.133
 nicely+0.129
,+0.125
uddle+0.121
Neg logit contributions
leaf-0.233
 sorts-0.210
keeping-0.203
vous-0.193
 kinds-0.186
Token play
Token ID711
Feature activation-1.68e+00

Pos logit contributions
leaf+0.112
 sorts+0.100
keeping+0.097
vous+0.092
 kinds+0.089
Neg logit contributions
pit-0.067
ider-0.064
 nicely-0.061
,-0.059
uddle-0.058
Token with
Token ID351
Feature activation-5.57e-01

Pos logit contributions
pit+0.023
 nicely+0.021
ider+0.021
,+0.018
uddle+0.018
Neg logit contributions
leaf-0.069
 sorts-0.062
keeping-0.060
vous-0.058
Dear-0.056
Token it
Token ID340
Feature activation-1.51e+00

Pos logit contributions
,+0.011
 nicely+0.011
pit+0.010
 hockey+0.010
 towers+0.010
Neg logit contributions
leaf-0.020
Dear-0.017
vous-0.017
 sorts-0.016
keeping-0.016
Token.
Token ID13
Feature activation+9.75e-02

Pos logit contributions
pit+0.025
ider+0.023
 nicely+0.021
uddle+0.021
,+0.020
Neg logit contributions
leaf-0.057
 sorts-0.051
keeping-0.050
vous-0.047
 kinds-0.046
Token the
Token ID262
Feature activation-1.08e-01

Pos logit contributions
pit+0.113
 nicely+0.111
,+0.111
ider+0.105
 gym+0.099
Neg logit contributions
leaf-0.165
 sorts-0.139
keeping-0.138
vous-0.133
Dear-0.126
Token tracks
Token ID8339
Feature activation-5.92e-01

Pos logit contributions
pit+0.002
ider+0.002
 nicely+0.002
uddle+0.002
,+0.002
Neg logit contributions
leaf-0.004
 sorts-0.003
keeping-0.003
vous-0.003
 kinds-0.003
Token.
Token ID13
Feature activation+4.72e-01

Pos logit contributions
pit+0.009
ider+0.008
 nicely+0.007
uddle+0.007
,+0.006
Neg logit contributions
leaf-0.024
 sorts-0.021
keeping-0.021
vous-0.020
 kinds-0.019
Token Every
Token ID3887
Feature activation+6.92e-01

Pos logit contributions
leaf+0.019
keeping+0.017
 sorts+0.017
vous+0.016
 kinds+0.015
Neg logit contributions
pit-0.007
 nicely-0.006
ider-0.006
,-0.006
 One-0.006
Token morning
Token ID3329
Feature activation-3.68e+00

Pos logit contributions
leaf+0.025
keeping+0.023
 sorts+0.019
Dear+0.018
 shouldn+0.018
Neg logit contributions
,-0.014
pit-0.013
 gym-0.013
 camps-0.012
 nicely-0.012
Token,
Token ID11
Feature activation-8.47e+00

Pos logit contributions
pit+0.025
ider+0.019
uddle+0.012
 nicely+0.012
bles+0.008
Neg logit contributions
leaf-0.177
 sorts-0.165
keeping-0.159
vous-0.154
 kinds-0.152
Token the
Token ID262
Feature activation-9.45e-02

Pos logit contributions
,+0.169
 nicely+0.144
 hockey+0.132
 gym+0.132
 car+0.132
Neg logit contributions
leaf-0.321
keeping-0.294
vous-0.282
 sorts-0.277
Dear-0.251
Token train
Token ID4512
Feature activation-3.90e-01

Pos logit contributions
pit+0.001
ider+0.001
uddle+0.001
,+0.001
 car+0.001
Neg logit contributions
leaf-0.004
keeping-0.003
 sorts-0.003
vous-0.003
 kinds-0.003
Token conductor
Token ID39206
Feature activation+3.42e+00

Pos logit contributions
pit+0.009
ider+0.008
 nicely+0.008
,+0.008
uddle+0.007
Neg logit contributions
leaf-0.013
 sorts-0.011
keeping-0.011
vous-0.010
 kinds-0.010
Token would
Token ID561
Feature activation+2.08e+00

Pos logit contributions
leaf+0.120
keeping+0.111
 sorts+0.108
vous+0.097
 fright+0.097
Neg logit contributions
,-0.064
pit-0.062
 nicely-0.057
ider-0.057
 patiently-0.051
Token make
Token ID787
Feature activation+2.37e+00

Pos logit contributions
leaf+0.073
 sorts+0.065
keeping+0.065
vous+0.061
 kinds+0.058
Neg logit contributions
pit-0.039
ider-0.036
 nicely-0.036
,-0.035
uddle-0.032
Token of
Token ID286
Feature activation+2.62e+00

Pos logit contributions
leaf+0.024
 sorts+0.021
keeping+0.020
vous+0.019
 kinds+0.019
Neg logit contributions
pit-0.019
ider-0.018
 nicely-0.017
uddle-0.017
 Later-0.016
Token them
Token ID606
Feature activation-1.71e+00

Pos logit contributions
leaf+0.105
keeping+0.098
 sorts+0.095
vous+0.093
 shouldn+0.091
Neg logit contributions
pit-0.033
 nicely-0.031
,-0.030
 camps-0.027
inn-0.025
Token laughed
Token ID13818
Feature activation-2.04e+00

Pos logit contributions
pit+0.034
 nicely+0.032
,+0.031
ider+0.031
uddle+0.030
Neg logit contributions
leaf-0.058
keeping-0.052
 sorts-0.052
vous-0.047
 kinds-0.046
Token and
Token ID290
Feature activation+4.05e+00

Pos logit contributions
pit+0.041
ider+0.038
uddle+0.035
 nicely+0.033
bles+0.032
Neg logit contributions
leaf-0.069
 sorts-0.063
keeping-0.059
vous-0.056
 kinds-0.056
Token laughed
Token ID13818
Feature activation-2.11e+00

Pos logit contributions
leaf+0.133
 sorts+0.117
keeping+0.114
vous+0.111
Dear+0.105
Neg logit contributions
pit-0.087
,-0.080
 nicely-0.079
ider-0.077
uddle-0.073
Token,
Token ID11
Feature activation-8.44e+00

Pos logit contributions
pit+0.041
ider+0.038
uddle+0.034
angers+0.032
 gym+0.031
Neg logit contributions
leaf-0.073
 sorts-0.066
keeping-0.062
 kinds-0.059
 fright-0.059
Token feeling
Token ID4203
Feature activation+2.22e+00

Pos logit contributions
pit+0.148
ider+0.134
 nicely+0.130
,+0.126
 patiently+0.118
Neg logit contributions
leaf-0.312
 sorts-0.282
keeping-0.276
vous-0.264
 kinds-0.254
Token silly
Token ID14397
Feature activation+4.76e+00

Pos logit contributions
leaf+0.078
keeping+0.072
 sorts+0.066
icated+0.064
Dear+0.063
Neg logit contributions
,-0.044
 nicely-0.042
pit-0.036
 hockey-0.035
 until-0.033
Token and
Token ID290
Feature activation+4.03e+00

Pos logit contributions
leaf+0.182
 sorts+0.164
keeping+0.158
vous+0.152
Dear+0.147
Neg logit contributions
pit-0.079
ider-0.073
 nicely-0.066
,-0.064
uddle-0.064
Token relieved
Token ID22598
Feature activation+8.97e-01

Pos logit contributions
leaf+0.142
keeping+0.125
 sorts+0.119
Dear+0.116
vous+0.115
Neg logit contributions
,-0.076
 nicely-0.076
pit-0.067
balls-0.063
ider-0.063
Token.
Token ID13
Feature activation-2.24e-01

Pos logit contributions
leaf+0.036
 sorts+0.033
keeping+0.032
vous+0.031
Dear+0.029
Neg logit contributions
pit-0.013
ider-0.012
,-0.012
 nicely-0.012
uddle-0.010
Token
Token ID198
Feature activation-6.32e-02

Pos logit contributions
pit+0.038
ider+0.036
uddle+0.033
 nicely+0.032
 camps+0.030
Neg logit contributions
leaf-0.061
 sorts-0.055
keeping-0.053
vous-0.049
 kinds-0.048
Token
Token ID198
Feature activation-4.37e-02

Pos logit contributions
pit+0.002
ider+0.001
uddle+0.001
 nicely+0.001
angers+0.001
Neg logit contributions
leaf-0.002
 sorts-0.002
keeping-0.002
vous-0.001
Dear-0.001
TokenM
Token ID44
Feature activation+2.22e-01

Pos logit contributions
pit+0.001
ider+0.001
uddle+0.001
 nicely+0.001
bles+0.001
Neg logit contributions
leaf-0.001
 sorts-0.001
keeping-0.001
vous-0.001
Dear-0.001
Tokenum
Token ID388
Feature activation+1.32e+00

Pos logit contributions
leaf+0.009
 sorts+0.008
vous+0.007
 kinds+0.007
keeping+0.007
Neg logit contributions
pit-0.003
 nicely-0.003
ider-0.003
angers-0.002
 gym-0.002
Token said
Token ID531
Feature activation-2.19e+00

Pos logit contributions
leaf+0.036
 sorts+0.031
keeping+0.029
vous+0.028
 kinds+0.027
Neg logit contributions
pit-0.035
ider-0.035
uddle-0.032
 nicely-0.032
angers-0.030
Token,
Token ID11
Feature activation-8.25e+00

Pos logit contributions
pit+0.025
ider+0.022
uddle+0.019
 Later+0.017
angers+0.016
Neg logit contributions
leaf-0.094
 sorts-0.085
keeping-0.081
 fright-0.079
 kinds-0.078
Token "
Token ID366
Feature activation+4.82e+00

Pos logit contributions
pit+0.128
ider+0.112
uddle+0.110
 camps+0.092
 nicely+0.091
Neg logit contributions
leaf-0.322
 sorts-0.283
keeping-0.271
Isn-0.266
Dear-0.264
TokenLet
Token ID5756
Feature activation+7.93e+00

Pos logit contributions
keeping+0.133
leaf+0.129
 harm+0.119
 sorts+0.117
 outage+0.116
Neg logit contributions
pit-0.118
 One-0.093
 sixty-0.093
ider-0.093
 nicely-0.092
Token's
Token ID338
Feature activation-3.50e-01

Pos logit contributions
leaf+0.341
keeping+0.309
 sorts+0.306
vous+0.277
 kinds+0.274
Neg logit contributions
,-0.105
pit-0.088
 nicely-0.085
uddle-0.076
erve-0.072
Token sit
Token ID1650
Feature activation-3.08e+00

Pos logit contributions
pit+0.006
ider+0.006
 nicely+0.005
,+0.005
uddle+0.005
Neg logit contributions
leaf-0.013
 sorts-0.012
keeping-0.011
vous-0.011
 kinds-0.010
Token down
Token ID866
Feature activation-6.95e+00

Pos logit contributions
pit+0.056
ider+0.052
 nicely+0.048
uddle+0.046
,+0.046
Neg logit contributions
leaf-0.114
 sorts-0.103
keeping-0.099
vous-0.095
 kinds-0.092