TOP ACTIVATIONS
MAX = 14.854

examine it with the magn ifying glass ?" "
and handed him the magn ifying glass . He
, handing him the magn ifying glass . She looked around
When Daisy put the magn ifying glass down , the cater
. Lily had a magn ifying glass that she found in
the necklace and the magn ifying glass and told him that
borrowed her mom 's magn ifying glass to take a closer
the watch with the magn ifying glass . He saw the
cried , dropping the magn ifying glass . "
the necklace and the magn ifying glass and started to examine
. They had a magn ifying glass and a small box
She takes out a magn ifying glass . She looks at
She had a big magn ifying glass that made things look
hat and grabbed her magn ifying glass . She was ready
said , holding the magn ifying glass over a black ant
flashlight , and a magn ifying glass . As
a flashlight and a magn ifying glass to examine their finds
She took out a magn ifying glass and looked very carefully
, next to her magn ifying glass . She looked at
He took out a magn ifying glass and a notebook and

MIN ACTIVATIONS
MIN = -15.433

bouncing around , an unpleasant od our wa ft ed into
six , forty - seven , forty - eight , forty
felt the familiar feeling of impat ience . Suddenly , she
" Me ow ," m ew ed the cat ,
things â col ouring books , stickers and
three , thirty - four , thirty - five , thirty
thirty , thirty - one , thirty - two , thirty
- to - foot in impat ience . Finally
smile . Mary anx iously opened up the pass
six , thirty - seven , thirty - eight , thirty
some broccoli !" Tan ner was confused , but
underwater kingdom ?" Gab ri ella was so excited
three , forty - four , forty - five , forty
about some carrots ?" her m ama replied .
two , thirty - three , thirty - four , thirty
lesson Mary had learnt : impat ience can make things worse
Sam could barely contain his impat ience as he opened the
M m h mm ," m umbled a sleepy Bill from
one , thirty - two , thirty - three , thirty
the rabbit 's family was h uddled together a short distance

INTERVAL 7.282 - 14.854
CONTAINS 0.531%

scared and didn 't know what to do . He sw
asked her mom , " What 's inside ?" Her mom
him that being different is what makes him adorable . Benny
asked the egg , " What are you doing there ?"
all played together happily . <|endoftext|> Once upon a time ,

INTERVAL -0.290 - 7.282
CONTAINS 57.647%

, and the deer was never bored again . <|endoftext|> One
a time , there was a little girl named Lily .
mine . I need it for my games ."
, but he couldn 't find it . He felt sad
remembers how she felt when she lost her favorite doll .

INTERVAL -7.861 - -0.290
CONTAINS 38.176%

something strange it was a big
to the hole . He peek ed inside and saw something
It 's upstairs , Lily . Just go up the stairs
thing . S ara is curious . She takes
it . He felt so happy and content !

INTERVAL -15.433 - -7.861
CONTAINS 3.645%

hugged them . She said , " I 'm glad you
tree . He was so excited that he had to show
. Just then , a little boy named George
to find a new home , but he never found one
. From that day on , he was extra careful when
Token examine
Token ID10716
Feature activation+3.74e-01

Pos logit contributions
tted+0.129
fire+0.122
 will+0.117
 deserve+0.116
Ready+0.116
Neg logit contributions
lee-0.268
ittens-0.218
ogging-0.207
ummy-0.206
atted-0.204
Token it
Token ID340
Feature activation+5.86e+00

Pos logit contributions
 will+0.010
 am+0.010
 belong+0.009
 deserve+0.009
'll+0.009
Neg logit contributions
lee-0.012
ittens-0.009
atted-0.009
omo-0.009
aj-0.009
Token with
Token ID351
Feature activation+3.30e+00

Pos logit contributions
 am+0.189
tted+0.189
 will+0.188
 deserve+0.183
 belong+0.180
Neg logit contributions
lee-0.144
ries-0.102
ittens-0.102
atted-0.099
ogging-0.094
Token the
Token ID262
Feature activation+5.86e+00

Pos logit contributions
 will+0.095
 am+0.093
 deserve+0.089
 belong+0.088
tted+0.087
Neg logit contributions
lee-0.098
ittens-0.074
atted-0.074
ries-0.072
ogging-0.071
Token magn
Token ID7842
Feature activation+1.81e-02

Pos logit contributions
tted+0.161
 deserve+0.149
 am+0.149
 will+0.145
fire+0.140
Neg logit contributions
lee-0.172
ittens-0.134
 oranges-0.132
ries-0.130
atted-0.126
Tokenifying
Token ID4035
Feature activation+1.43e+01

Pos logit contributions
 am+0.000
 will+0.000
tted+0.000
 deserve+0.000
 belong+0.000
Neg logit contributions
lee-0.001
ries-0.001
atted-0.001
ittens-0.001
ummy-0.001
Token glass
Token ID5405
Feature activation+2.51e+00

Pos logit contributions
tted+0.425
 affect+0.406
 will+0.405
 conquer+0.403
 belong+0.402
Neg logit contributions
lee-0.356
ummy-0.277
ries-0.271
 yogurt-0.258
 oranges-0.258
Token?"
Token ID1701
Feature activation+4.27e-01

Pos logit contributions
 am+0.077
 will+0.077
tted+0.075
 deserve+0.073
 belong+0.072
Neg logit contributions
lee-0.067
ries-0.049
ittens-0.048
atted-0.048
ogging-0.045
Token
Token ID198
Feature activation+4.14e+00

Pos logit contributions
 am+0.010
 will+0.009
tted+0.009
'll+0.009
 deserve+0.009
Neg logit contributions
lee-0.015
ries-0.012
atted-0.012
ittens-0.012
ummy-0.012
Token
Token ID198
Feature activation+3.78e+00

Pos logit contributions
 am+0.100
 will+0.100
tted+0.098
 deserve+0.094
 belong+0.091
Neg logit contributions
lee-0.139
ries-0.109
atted-0.108
ittens-0.108
ogging-0.102
Token"
Token ID1
Feature activation+3.15e+00

Pos logit contributions
 will+0.107
 am+0.107
tted+0.105
 deserve+0.102
 belong+0.100
Neg logit contributions
lee-0.112
ries-0.083
ittens-0.083
atted-0.083
ogging-0.078
Token and
Token ID290
Feature activation-6.25e+00

Pos logit contributions
lee+0.129
ries+0.090
atted+0.088
ittens+0.086
aj+0.080
Neg logit contributions
 will-0.209
 am-0.201
 belong-0.192
 deserve-0.192
tted-0.188
Token handed
Token ID10158
Feature activation+2.51e-02

Pos logit contributions
lee+0.174
ittens+0.145
ries+0.132
aj+0.131
ummy+0.127
Neg logit contributions
 will-0.190
 am-0.183
tted-0.182
 deserve-0.179
 make-0.173
Token him
Token ID683
Feature activation+1.90e+00

Pos logit contributions
 will+0.001
 am+0.001
tted+0.001
'll+0.001
 belong+0.001
Neg logit contributions
lee-0.001
ittens-0.001
atted-0.001
aj-0.001
ries-0.001
Token the
Token ID262
Feature activation+5.29e+00

Pos logit contributions
 am+0.053
 will+0.052
tted+0.052
 deserve+0.050
 belong+0.049
Neg logit contributions
lee-0.057
ries-0.043
atted-0.042
ittens-0.042
ogging-0.040
Token magn
Token ID7842
Feature activation-5.26e-01

Pos logit contributions
tted+0.177
fire+0.175
 am+0.167
 deserve+0.163
 conquer+0.162
Neg logit contributions
lee-0.117
 sunglasses-0.083
 oranges-0.082
 strawberries-0.080
ries-0.080
Tokenifying
Token ID4035
Feature activation+1.38e+01

Pos logit contributions
lee+0.021
ries+0.017
atted+0.017
ittens+0.016
ummy+0.016
Neg logit contributions
 am-0.010
 will-0.009
 deserve-0.009
tted-0.009
 belong-0.009
Token glass
Token ID5405
Feature activation+1.95e+00

Pos logit contributions
tted+0.399
 will+0.388
 am+0.384
 belong+0.382
 affect+0.373
Neg logit contributions
lee-0.372
ummy-0.280
ries-0.279
atted-0.264
 oranges-0.260
Token.
Token ID13
Feature activation-5.80e-02

Pos logit contributions
 will+0.060
 am+0.059
tted+0.058
 deserve+0.056
 belong+0.056
Neg logit contributions
lee-0.053
ittens-0.039
ries-0.038
atted-0.038
aj-0.036
Token
Token ID198
Feature activation+4.22e+00

Pos logit contributions
lee+0.002
ries+0.002
atted+0.001
ittens+0.001
ogging+0.001
Neg logit contributions
 am-0.002
 will-0.002
tted-0.001
 deserve-0.001
'll-0.001
Token
Token ID198
Feature activation+3.55e+00

Pos logit contributions
 am+0.092
 will+0.091
tted+0.087
 deserve+0.082
'll+0.079
Neg logit contributions
lee-0.152
ries-0.126
atted-0.124
ittens-0.120
aj-0.115
TokenHe
Token ID1544
Feature activation+1.92e+00

Pos logit contributions
 will+0.084
 am+0.084
tted+0.082
 deserve+0.079
 belong+0.074
Neg logit contributions
lee-0.124
ries-0.098
atted-0.098
ittens-0.095
aj-0.090
Token,
Token ID11
Feature activation-9.74e+00

Pos logit contributions
lee+0.099
ries+0.073
ittens+0.070
atted+0.066
aj+0.066
Neg logit contributions
 will-0.130
 am-0.126
tted-0.121
 belong-0.119
 deserve-0.117
Token handing
Token ID22786
Feature activation-1.80e+00

Pos logit contributions
lee+0.292
ries+0.229
ittens+0.222
atted+0.218
aj+0.215
Neg logit contributions
 will-0.276
 am-0.270
tted-0.256
 make-0.254
 deserve-0.250
Token him
Token ID683
Feature activation+8.59e-01

Pos logit contributions
lee+0.049
ittens+0.036
atted+0.036
aj+0.035
ogging+0.034
Neg logit contributions
 will-0.057
 am-0.055
tted-0.053
 belong-0.052
 deserve-0.052
Token the
Token ID262
Feature activation+4.37e+00

Pos logit contributions
 will+0.024
 am+0.023
tted+0.023
 deserve+0.022
 belong+0.022
Neg logit contributions
lee-0.026
ittens-0.020
atted-0.020
ries-0.020
ogging-0.019
Token magn
Token ID7842
Feature activation-8.03e-01

Pos logit contributions
tted+0.117
 am+0.114
 deserve+0.113
 will+0.113
 belong+0.108
Neg logit contributions
lee-0.128
ries-0.099
ittens-0.099
atted-0.097
 oranges-0.095
Tokenifying
Token ID4035
Feature activation+1.37e+01

Pos logit contributions
lee+0.034
atted+0.028
ries+0.028
ittens+0.028
ummy+0.027
Neg logit contributions
 am-0.013
 will-0.012
 deserve-0.011
tted-0.011
 belong-0.011
Token glass
Token ID5405
Feature activation+1.43e+00

Pos logit contributions
tted+0.385
 will+0.377
 am+0.375
 belong+0.370
 deserve+0.364
Neg logit contributions
lee-0.381
ries-0.294
ummy-0.289
atted-0.282
aj-0.275
Token.
Token ID13
Feature activation-7.42e-03

Pos logit contributions
 will+0.044
 am+0.043
tted+0.042
 deserve+0.041
 belong+0.040
Neg logit contributions
lee-0.040
ittens-0.029
ries-0.029
atted-0.029
ogging-0.027
Token She
Token ID1375
Feature activation+2.89e+00

Pos logit contributions
lee+0.000
ries+0.000
atted+0.000
ya+0.000
illy+0.000
Neg logit contributions
 am-0.000
 will-0.000
tted-0.000
'll-0.000
Ready-0.000
Token looked
Token ID3114
Feature activation-3.99e+00

Pos logit contributions
 will+0.084
 am+0.077
 deserve+0.073
 make+0.072
'll+0.072
Neg logit contributions
lee-0.089
ummy-0.076
ogging-0.075
ittens-0.075
ries-0.073
Token around
Token ID1088
Feature activation+2.04e+00

Pos logit contributions
lee+0.114
ittens+0.106
omo+0.092
ries+0.089
ogging+0.089
Neg logit contributions
 will-0.127
'll-0.108
 am-0.106
 make-0.102
 away-0.099
Token When
Token ID1649
Feature activation+1.46e+00

Pos logit contributions
 am+0.009
 will+0.009
tted+0.009
 deserve+0.008
'll+0.008
Neg logit contributions
lee-0.013
ries-0.011
atted-0.011
ittens-0.011
ogging-0.010
Token Daisy
Token ID40355
Feature activation-2.54e+00

Pos logit contributions
 will+0.051
 am+0.050
tted+0.049
 deserve+0.048
 belong+0.048
Neg logit contributions
lee-0.034
ittens-0.024
atted-0.023
ries-0.023
ogging-0.021
Token put
Token ID1234
Feature activation+3.30e-02

Pos logit contributions
lee+0.083
ittens+0.067
ries+0.065
atted+0.064
ummy+0.061
Neg logit contributions
 will-0.067
 am-0.066
tted-0.062
 deserve-0.060
'll-0.060
Token the
Token ID262
Feature activation+4.37e+00

Pos logit contributions
 will+0.001
 am+0.001
tted+0.001
 belong+0.001
 deserve+0.001
Neg logit contributions
lee-0.001
ittens-0.001
atted-0.001
ogging-0.001
ries-0.001
Token magn
Token ID7842
Feature activation-3.84e-01

Pos logit contributions
 am+0.125
 will+0.124
tted+0.124
 deserve+0.120
 belong+0.117
Neg logit contributions
lee-0.124
ries-0.093
ittens-0.093
atted-0.092
ogging-0.086
Tokenifying
Token ID4035
Feature activation+1.37e+01

Pos logit contributions
lee+0.016
atted+0.013
ries+0.013
ittens+0.013
ummy+0.013
Neg logit contributions
 am-0.006
 will-0.006
 deserve-0.006
tted-0.006
 belong-0.006
Token glass
Token ID5405
Feature activation+1.17e+00

Pos logit contributions
tted+0.388
 will+0.385
 am+0.380
 belong+0.375
 deserve+0.372
Neg logit contributions
lee-0.374
ries-0.287
ummy-0.284
atted-0.272
 oranges-0.271
Token down
Token ID866
Feature activation+3.42e+00

Pos logit contributions
 will+0.041
 am+0.041
tted+0.040
 deserve+0.039
 belong+0.038
Neg logit contributions
lee-0.027
ittens-0.018
atted-0.018
ries-0.018
ogging-0.016
Token,
Token ID11
Feature activation-8.59e+00

Pos logit contributions
 will+0.127
 am+0.127
tted+0.124
 deserve+0.121
 belong+0.120
Neg logit contributions
lee-0.071
ittens-0.046
atted-0.045
ries-0.045
ogging-0.041
Token the
Token ID262
Feature activation+3.43e+00

Pos logit contributions
lee+0.165
ittens+0.102
atted+0.101
ries+0.100
ogging+0.089
Neg logit contributions
 will-0.331
 am-0.331
tted-0.323
 deserve-0.318
 belong-0.315
Token cater
Token ID20825
Feature activation-6.15e+00

Pos logit contributions
 will+0.096
 am+0.096
tted+0.092
 deserve+0.090
 belong+0.089
Neg logit contributions
lee-0.103
atted-0.077
ittens-0.077
ries-0.076
ogging-0.073
Token.
Token ID13
Feature activation+8.33e-01

Pos logit contributions
 will+0.176
 am+0.170
tted+0.166
 deserve+0.162
 belong+0.161
Neg logit contributions
lee-0.128
atted-0.091
ittens-0.089
ries-0.089
aj-0.082
Token Lily
Token ID20037
Feature activation-5.90e-01

Pos logit contributions
 will+0.024
 am+0.024
tted+0.023
 deserve+0.023
 belong+0.022
Neg logit contributions
lee-0.024
ittens-0.018
atted-0.018
ries-0.018
ogging-0.017
Token had
Token ID550
Feature activation+4.93e+00

Pos logit contributions
lee+0.021
ittens+0.018
ries+0.017
ogging+0.017
ummy+0.017
Neg logit contributions
 will-0.015
 am-0.013
'll-0.013
tted-0.013
 deserve-0.012
Token a
Token ID257
Feature activation+3.23e+00

Pos logit contributions
tted+0.133
 am+0.133
 will+0.130
 deserve+0.127
 belong+0.126
Neg logit contributions
lee-0.152
ries-0.109
atted-0.109
ittens-0.109
 oranges-0.106
Token magn
Token ID7842
Feature activation-9.73e-01

Pos logit contributions
 will+0.083
 am+0.082
tted+0.077
 deserve+0.075
 belong+0.074
Neg logit contributions
lee-0.107
ries-0.085
atted-0.084
ittens-0.082
ogging-0.079
Tokenifying
Token ID4035
Feature activation+1.34e+01

Pos logit contributions
lee+0.041
atted+0.034
ries+0.034
ittens+0.034
ummy+0.033
Neg logit contributions
 am-0.015
 will-0.015
 deserve-0.014
tted-0.014
 belong-0.014
Token glass
Token ID5405
Feature activation+1.56e+00

Pos logit contributions
tted+0.399
 will+0.392
 deserve+0.378
'll+0.377
 affect+0.374
Neg logit contributions
lee-0.329
ummy-0.258
omo-0.244
 yogurt-0.236
 shrimp-0.234
Token that
Token ID326
Feature activation+4.55e+00

Pos logit contributions
 will+0.051
 am+0.050
tted+0.049
 deserve+0.048
 belong+0.047
Neg logit contributions
lee-0.040
ittens-0.029
ries-0.028
atted-0.028
ogging-0.026
Token she
Token ID673
Feature activation+3.78e+00

Pos logit contributions
 will+0.150
 am+0.150
tted+0.144
 belong+0.141
 deserve+0.140
Neg logit contributions
lee-0.116
ittens-0.084
atted-0.083
ries-0.083
ogging-0.079
Token found
Token ID1043
Feature activation+2.20e+00

Pos logit contributions
 will+0.098
 am+0.098
tted+0.096
 deserve+0.093
 belong+0.091
Neg logit contributions
lee-0.120
ittens-0.091
ries-0.091
atted-0.091
ogging-0.086
Token in
Token ID287
Feature activation+6.91e+00

Pos logit contributions
 will+0.064
 am+0.064
tted+0.062
 deserve+0.060
 belong+0.060
Neg logit contributions
lee-0.063
ittens-0.047
ries-0.046
atted-0.046
ogging-0.043
Token the
Token ID262
Feature activation+4.32e+00

Pos logit contributions
 will+0.020
 am+0.020
tted+0.019
 belong+0.019
 deserve+0.019
Neg logit contributions
lee-0.019
ittens-0.015
atted-0.014
aj-0.014
ries-0.014
Token necklace
Token ID38987
Feature activation+2.23e+00

Pos logit contributions
tted+0.148
fire+0.146
 am+0.140
 deserve+0.136
'll+0.134
Neg logit contributions
lee-0.096
 sunglasses-0.065
 oranges-0.064
 strawberries-0.061
ries-0.061
Token and
Token ID290
Feature activation-7.17e+00

Pos logit contributions
 will+0.083
 am+0.082
tted+0.080
 belong+0.078
 deserve+0.078
Neg logit contributions
lee-0.048
ittens-0.032
atted-0.031
ries-0.030
aj-0.028
Token the
Token ID262
Feature activation+4.05e+00

Pos logit contributions
lee+0.191
ittens+0.173
ummy+0.158
ogging+0.154
aj+0.146
Neg logit contributions
 will-0.222
 make-0.212
 am-0.211
 deserve-0.201
 belong-0.191
Token magn
Token ID7842
Feature activation-7.93e-01

Pos logit contributions
tted+0.127
 am+0.122
 will+0.119
 deserve+0.118
fire+0.117
Neg logit contributions
lee-0.105
ries-0.073
 oranges-0.072
ittens-0.072
atted-0.071
Tokenifying
Token ID4035
Feature activation+1.32e+01

Pos logit contributions
lee+0.032
atted+0.026
ries+0.025
ittens+0.025
ummy+0.024
Neg logit contributions
 am-0.015
 will-0.014
 deserve-0.014
 belong-0.014
tted-0.013
Token glass
Token ID5405
Feature activation+1.46e+00

Pos logit contributions
tted+0.398
 will+0.390
 am+0.380
 affect+0.377
 belong+0.377
Neg logit contributions
lee-0.337
ummy-0.252
ries-0.251
 shrimp-0.234
 oranges-0.234
Token and
Token ID290
Feature activation-6.41e+00

Pos logit contributions
 will+0.044
 am+0.043
tted+0.042
 deserve+0.041
 belong+0.041
Neg logit contributions
lee-0.041
ittens-0.030
ries-0.029
atted-0.029
ogging-0.028
Token told
Token ID1297
Feature activation-5.88e-01

Pos logit contributions
lee+0.185
ittens+0.159
ogging+0.144
ummy+0.142
aj+0.139
Neg logit contributions
 will-0.185
 am-0.180
 deserve-0.172
 make-0.171
tted-0.167
Token him
Token ID683
Feature activation+9.28e-01

Pos logit contributions
lee+0.019
ittens+0.016
aj+0.016
atted+0.016
ries+0.015
Neg logit contributions
 will-0.015
 am-0.014
 make-0.014
 belong-0.013
 protect-0.013
Token that
Token ID326
Feature activation+3.07e+00

Pos logit contributions
 will+0.029
 am+0.028
tted+0.028
 deserve+0.027
 belong+0.027
Neg logit contributions
lee-0.025
ittens-0.018
ries-0.018
atted-0.018
ogging-0.017
Token borrowed
Token ID22546
Feature activation-4.37e-01

Pos logit contributions
 will+0.066
 am+0.062
'll+0.057
 make+0.056
 deserve+0.055
Neg logit contributions
lee-0.111
ries-0.109
ogging-0.109
ittens-0.105
ummy-0.104
Token her
Token ID607
Feature activation+1.63e+00

Pos logit contributions
lee+0.015
ittens+0.012
aj+0.012
omo+0.012
atted+0.012
Neg logit contributions
 will-0.010
 am-0.010
 deserve-0.010
 belong-0.010
tted-0.009
Token mom
Token ID1995
Feature activation+1.71e-01

Pos logit contributions
 will+0.045
 am+0.045
 belong+0.041
tted+0.041
 deserve+0.040
Neg logit contributions
lee-0.050
ries-0.039
atted-0.039
ittens-0.039
ogging-0.037
Token's
Token ID338
Feature activation+1.52e+00

Pos logit contributions
 will+0.005
 am+0.005
tted+0.005
 deserve+0.005
 belong+0.005
Neg logit contributions
lee-0.005
ittens-0.004
atted-0.003
ries-0.003
ogging-0.003
Token magn
Token ID7842
Feature activation-7.20e-01

Pos logit contributions
 will+0.040
 am+0.040
tted+0.039
 deserve+0.038
 belong+0.037
Neg logit contributions
lee-0.047
ittens-0.036
ries-0.036
atted-0.036
ogging-0.034
Tokenifying
Token ID4035
Feature activation+1.32e+01

Pos logit contributions
lee+0.033
ittens+0.028
ries+0.028
atted+0.027
ogging+0.027
Neg logit contributions
 will-0.008
tted-0.008
 am-0.008
 deserve-0.007
'll-0.007
Token glass
Token ID5405
Feature activation+1.58e+00

Pos logit contributions
 will+0.354
tted+0.354
 am+0.350
 deserve+0.340
 belong+0.338
Neg logit contributions
lee-0.392
ries-0.294
atted-0.291
ummy-0.288
ittens-0.287
Token to
Token ID284
Feature activation+6.03e+00

Pos logit contributions
 will+0.050
 am+0.049
tted+0.048
 deserve+0.046
 belong+0.046
Neg logit contributions
lee-0.042
ittens-0.031
atted-0.030
ries-0.029
aj-0.029
Token take
Token ID1011
Feature activation+3.71e+00

Pos logit contributions
tted+0.137
 will+0.132
 am+0.131
 deserve+0.124
'll+0.121
Neg logit contributions
lee-0.210
ries-0.165
ittens-0.161
 oranges-0.160
atted-0.156
Token a
Token ID257
Feature activation+3.60e+00

Pos logit contributions
 am+0.115
 will+0.115
tted+0.113
 deserve+0.110
 belong+0.108
Neg logit contributions
lee-0.099
ries-0.071
atted-0.070
ittens-0.070
ogging-0.065
Token closer
Token ID5699
Feature activation+7.17e-02

Pos logit contributions
 am+0.081
tted+0.081
 will+0.080
 deserve+0.077
 belong+0.074
Neg logit contributions
lee-0.122
ittens-0.097
atted-0.094
ogging-0.094
ries-0.093
Token the
Token ID262
Feature activation+3.17e+00

Pos logit contributions
 am+0.044
 will+0.044
tted+0.042
 deserve+0.041
 belong+0.041
Neg logit contributions
lee-0.041
ittens-0.030
atted-0.030
ries-0.029
aj-0.029
Token watch
Token ID2342
Feature activation-6.48e-01

Pos logit contributions
tted+0.082
 am+0.079
 will+0.078
 deserve+0.078
 belong+0.076
Neg logit contributions
lee-0.098
ries-0.076
ittens-0.075
atted-0.073
 oranges-0.073
Token with
Token ID351
Feature activation+1.19e+00

Pos logit contributions
lee+0.016
ries+0.012
ittens+0.012
atted+0.011
ogging+0.011
Neg logit contributions
 am-0.021
 will-0.021
tted-0.020
 deserve-0.020
 belong-0.020
Token the
Token ID262
Feature activation+3.96e+00

Pos logit contributions
 am+0.036
 will+0.036
tted+0.036
 deserve+0.035
'll+0.034
Neg logit contributions
lee-0.032
ries-0.024
atted-0.023
ittens-0.022
 oranges-0.022
Token magn
Token ID7842
Feature activation-1.27e+00

Pos logit contributions
tted+0.097
 am+0.093
 will+0.093
 deserve+0.092
 belong+0.089
Neg logit contributions
lee-0.127
ittens-0.100
ries-0.099
atted-0.097
 oranges-0.096
Tokenifying
Token ID4035
Feature activation+1.31e+01

Pos logit contributions
lee+0.053
atted+0.044
ries+0.044
ittens+0.043
ummy+0.042
Neg logit contributions
 am-0.021
 will-0.020
 deserve-0.020
 belong-0.019
tted-0.018
Token glass
Token ID5405
Feature activation+1.10e+00

Pos logit contributions
tted+0.367
 will+0.364
 am+0.358
 belong+0.355
 deserve+0.350
Neg logit contributions
lee-0.363
ries-0.279
ummy-0.275
atted-0.265
 oranges-0.261
Token.
Token ID13
Feature activation-4.81e-01

Pos logit contributions
 will+0.033
 am+0.033
tted+0.032
 deserve+0.031
 belong+0.031
Neg logit contributions
lee-0.031
ittens-0.023
atted-0.022
ries-0.022
aj-0.021
Token He
Token ID679
Feature activation+2.25e+00

Pos logit contributions
lee+0.015
ries+0.012
ittens+0.012
atted+0.012
ogging+0.011
Neg logit contributions
 am-0.012
 will-0.012
tted-0.012
 deserve-0.012
 belong-0.011
Token saw
Token ID2497
Feature activation-1.10e+00

Pos logit contributions
 will+0.051
 am+0.048
'll+0.044
tted+0.042
 make+0.042
Neg logit contributions
lee-0.081
ogging-0.070
ittens-0.070
ries-0.070
ummy-0.069
Token the
Token ID262
Feature activation+3.56e+00

Pos logit contributions
lee+0.038
ittens+0.031
aj+0.030
ogging+0.030
atted+0.030
Neg logit contributions
 will-0.025
 am-0.025
tted-0.023
 belong-0.023
 deserve-0.023
Token cried
Token ID16896
Feature activation-5.38e+00

Pos logit contributions
lee+0.086
atted+0.080
ittens+0.076
ries+0.075
ummy+0.074
Neg logit contributions
 make-0.064
 will-0.064
 am-0.060
'll-0.057
 overcome-0.055
Token,
Token ID11
Feature activation-9.94e+00

Pos logit contributions
lee+0.180
aj+0.145
atted+0.143
ries+0.142
ittens+0.138
Neg logit contributions
 am-0.126
 will-0.123
tted-0.122
 make-0.119
 protect-0.117
Token dropping
Token ID12047
Feature activation-1.81e+00

Pos logit contributions
lee+0.288
ummy+0.230
ittens+0.226
aj+0.224
ogging+0.223
Neg logit contributions
 make-0.271
 protect-0.264
 will-0.263
 am-0.261
fire-0.246
Token the
Token ID262
Feature activation+3.93e+00

Pos logit contributions
lee+0.052
ittens+0.039
atted+0.038
ries+0.038
aj+0.037
Neg logit contributions
 will-0.053
 am-0.053
tted-0.051
 belong-0.050
 deserve-0.050
Token magn
Token ID7842
Feature activation-1.38e+00

Pos logit contributions
 am+0.111
 will+0.111
tted+0.110
 deserve+0.107
 belong+0.105
Neg logit contributions
lee-0.116
ries-0.087
ittens-0.087
atted-0.086
ogging-0.081
Tokenifying
Token ID4035
Feature activation+1.31e+01

Pos logit contributions
lee+0.058
ries+0.048
atted+0.048
ittens+0.047
ummy+0.047
Neg logit contributions
 am-0.023
 will-0.021
 deserve-0.021
 belong-0.020
tted-0.020
Token glass
Token ID5405
Feature activation+1.30e+00

Pos logit contributions
tted+0.369
 will+0.367
 belong+0.365
 am+0.359
 deserve+0.353
Neg logit contributions
lee-0.357
ummy-0.270
ries-0.270
atted-0.256
 oranges-0.251
Token.
Token ID13
Feature activation-8.45e-02

Pos logit contributions
 will+0.039
 am+0.039
tted+0.038
 deserve+0.037
 belong+0.036
Neg logit contributions
lee-0.036
ittens-0.027
atted-0.026
ries-0.026
ogging-0.025
Token
Token ID198
Feature activation+4.02e+00

Pos logit contributions
lee+0.003
ries+0.002
atted+0.002
ittens+0.002
ogging+0.002
Neg logit contributions
 am-0.002
 will-0.002
tted-0.002
 deserve-0.002
'll-0.002
Token
Token ID198
Feature activation+3.55e+00

Pos logit contributions
 am+0.083
 will+0.080
tted+0.079
 deserve+0.071
'll+0.069
Neg logit contributions
lee-0.152
ries-0.127
atted-0.124
ittens-0.118
 oranges-0.118
Token"
Token ID1
Feature activation+3.24e+00

Pos logit contributions
tted+0.089
 am+0.088
 will+0.087
 deserve+0.083
fire+0.078
Neg logit contributions
lee-0.120
ries-0.094
atted-0.094
ittens-0.089
ogging-0.086
Token the
Token ID262
Feature activation+4.06e+00

Pos logit contributions
 will+0.061
 am+0.058
 make+0.055
tted+0.054
 belong+0.054
Neg logit contributions
lee-0.062
ittens-0.050
ogging-0.047
aj-0.046
ummy-0.046
Token necklace
Token ID38987
Feature activation+1.90e+00

Pos logit contributions
tted+0.115
 am+0.115
 will+0.112
 deserve+0.111
'll+0.106
Neg logit contributions
lee-0.115
ittens-0.086
ries-0.086
atted-0.084
aj-0.081
Token and
Token ID290
Feature activation-6.93e+00

Pos logit contributions
 will+0.070
 am+0.068
tted+0.067
 belong+0.066
 deserve+0.065
Neg logit contributions
lee-0.042
atted-0.027
ries-0.026
ittens-0.026
aj-0.024
Token the
Token ID262
Feature activation+4.01e+00

Pos logit contributions
lee+0.223
ittens+0.186
ummy+0.181
ogging+0.177
atted+0.171
Neg logit contributions
 will-0.185
 am-0.174
 make-0.167
 deserve-0.164
 belong-0.163
Token magn
Token ID7842
Feature activation-1.10e+00

Pos logit contributions
tted+0.117
 am+0.116
 will+0.114
 deserve+0.112
 belong+0.109
Neg logit contributions
lee-0.112
ries-0.082
ittens-0.082
atted-0.080
 oranges-0.077
Tokenifying
Token ID4035
Feature activation+1.31e+01

Pos logit contributions
lee+0.043
atted+0.035
ries+0.034
ittens+0.033
ummy+0.033
Neg logit contributions
 am-0.022
 will-0.021
 deserve-0.021
 belong-0.020
tted-0.019
Token glass
Token ID5405
Feature activation+1.03e+00

Pos logit contributions
tted+0.375
 will+0.370
 am+0.365
 belong+0.359
 deserve+0.354
Neg logit contributions
lee-0.357
ries-0.269
ummy-0.267
atted-0.255
 oranges-0.255
Token and
Token ID290
Feature activation-6.46e+00

Pos logit contributions
 will+0.030
 am+0.030
tted+0.029
 deserve+0.028
 belong+0.028
Neg logit contributions
lee-0.029
ittens-0.022
atted-0.021
ries-0.021
ogging-0.020
Token started
Token ID2067
Feature activation+4.95e-01

Pos logit contributions
lee+0.204
ittens+0.164
atted+0.155
ogging+0.155
ries+0.155
Neg logit contributions
 will-0.173
 am-0.169
tted-0.161
 deserve-0.159
 belong-0.157
Token to
Token ID284
Feature activation+4.80e+00

Pos logit contributions
 will+0.012
 am+0.012
tted+0.012
 belong+0.011
 deserve+0.011
Neg logit contributions
lee-0.016
ittens-0.013
ries-0.012
atted-0.012
ummy-0.012
Token examine
Token ID10716
Feature activation-6.70e-01

Pos logit contributions
tted+0.110
 will+0.102
 am+0.101
 deserve+0.098
'll+0.094
Neg logit contributions
lee-0.167
 gigg-0.130
ries-0.128
 oranges-0.126
ittens-0.126
Token.
Token ID13
Feature activation+8.57e-01

Pos logit contributions
 will+0.097
 am+0.097
tted+0.094
 deserve+0.092
 belong+0.091
Neg logit contributions
lee-0.089
ittens-0.065
atted-0.065
ries-0.065
ogging-0.060
Token They
Token ID1119
Feature activation+4.19e+00

Pos logit contributions
 am+0.021
 will+0.021
tted+0.021
 deserve+0.020
 belong+0.019
Neg logit contributions
lee-0.028
ries-0.022
ittens-0.022
atted-0.022
ogging-0.021
Token had
Token ID550
Feature activation+4.79e+00

Pos logit contributions
 will+0.102
 am+0.100
tted+0.095
 belong+0.092
 deserve+0.092
Neg logit contributions
lee-0.142
ittens-0.115
ries-0.113
atted-0.112
ogging-0.110
Token a
Token ID257
Feature activation+2.95e+00

Pos logit contributions
tted+0.133
 am+0.133
 will+0.129
 deserve+0.128
 belong+0.126
Neg logit contributions
lee-0.142
ries-0.103
atted-0.102
 oranges-0.101
ittens-0.100
Token magn
Token ID7842
Feature activation-1.30e+00

Pos logit contributions
 will+0.070
 am+0.069
tted+0.064
 belong+0.061
 deserve+0.061
Neg logit contributions
lee-0.105
ries-0.085
atted-0.084
ittens-0.081
ogging-0.079
Tokenifying
Token ID4035
Feature activation+1.30e+01

Pos logit contributions
lee+0.055
atted+0.045
ries+0.044
ittens+0.044
ummy+0.043
Neg logit contributions
 am-0.021
 deserve-0.020
 will-0.020
 belong-0.019
tted-0.019
Token glass
Token ID5405
Feature activation+1.10e+00

Pos logit contributions
tted+0.376
 will+0.370
 deserve+0.359
'll+0.354
 belong+0.352
Neg logit contributions
lee-0.331
ummy-0.259
omo-0.247
 yogurt-0.242
 oranges-0.241
Token and
Token ID290
Feature activation-6.32e+00

Pos logit contributions
 will+0.033
 am+0.033
tted+0.032
 deserve+0.031
 belong+0.031
Neg logit contributions
lee-0.031
ittens-0.022
ries-0.022
atted-0.022
ogging-0.021
Token a
Token ID257
Feature activation+2.47e+00

Pos logit contributions
lee+0.211
ittens+0.165
atted+0.164
ries+0.163
ogging+0.157
Neg logit contributions
 will-0.155
 am-0.154
tted-0.149
 deserve-0.145
 belong-0.143
Token small
Token ID1402
Feature activation-9.74e-01

Pos logit contributions
 will+0.066
 am+0.065
tted+0.061
 belong+0.059
 deserve+0.059
Neg logit contributions
lee-0.080
ries-0.063
atted-0.062
ittens-0.060
ogging-0.058
Token box
Token ID3091
Feature activation+3.32e+00

Pos logit contributions
lee+0.032
ittens+0.025
atted+0.025
ries+0.025
ogging+0.023
Neg logit contributions
 will-0.025
 am-0.025
tted-0.023
 deserve-0.023
 belong-0.023
Token She
Token ID1375
Feature activation+2.62e+00

Pos logit contributions
lee+0.032
ries+0.026
ittens+0.026
atted+0.026
aj+0.025
Neg logit contributions
 will-0.026
 am-0.026
tted-0.025
'll-0.024
 deserve-0.023
Token takes
Token ID2753
Feature activation+1.13e+00

Pos logit contributions
 will+0.065
 am+0.064
tted+0.061
 deserve+0.060
'll+0.058
Neg logit contributions
lee-0.088
ittens-0.070
ries-0.069
atted-0.068
ogging-0.066
Token out
Token ID503
Feature activation+1.94e+00

Pos logit contributions
 am+0.029
 will+0.028
tted+0.028
 deserve+0.028
 belong+0.027
Neg logit contributions
lee-0.037
ries-0.028
atted-0.028
ittens-0.027
ogging-0.026
Token a
Token ID257
Feature activation+2.19e+00

Pos logit contributions
 will+0.046
 am+0.045
tted+0.043
 deserve+0.041
 belong+0.041
Neg logit contributions
lee-0.065
ittens-0.052
atted-0.050
ries-0.050
ogging-0.049
Token magn
Token ID7842
Feature activation-1.90e+00

Pos logit contributions
tted+0.056
 am+0.055
 deserve+0.055
 will+0.054
 belong+0.053
Neg logit contributions
lee-0.066
ittens-0.050
ries-0.050
atted-0.050
 oranges-0.049
Tokenifying
Token ID4035
Feature activation+1.28e+01

Pos logit contributions
lee+0.077
atted+0.062
ries+0.062
ummy+0.060
ittens+0.060
Neg logit contributions
 am-0.035
 deserve-0.033
 belong-0.032
 will-0.032
tted-0.030
Token glass
Token ID5405
Feature activation+7.99e-01

Pos logit contributions
tted+0.368
 will+0.360
 am+0.357
 belong+0.356
 affect+0.348
Neg logit contributions
lee-0.347
ummy-0.264
ries-0.263
 oranges-0.250
atted-0.248
Token.
Token ID13
Feature activation-3.53e-01

Pos logit contributions
 will+0.025
 am+0.025
tted+0.025
 deserve+0.024
 belong+0.024
Neg logit contributions
lee-0.021
ittens-0.015
ries-0.015
atted-0.015
ogging-0.014
Token She
Token ID1375
Feature activation+3.41e+00

Pos logit contributions
lee+0.011
ries+0.009
atted+0.009
ittens+0.009
ogging+0.008
Neg logit contributions
 am-0.009
 will-0.009
tted-0.009
 deserve-0.008
'll-0.008
Token looks
Token ID3073
Feature activation-3.41e+00

Pos logit contributions
 will+0.085
 am+0.083
tted+0.078
 deserve+0.077
'll+0.074
Neg logit contributions
lee-0.115
ittens-0.093
ries-0.090
atted-0.089
ogging-0.087
Token at
Token ID379
Feature activation+2.04e+00

Pos logit contributions
lee+0.094
ittens+0.081
ries+0.073
ogging+0.071
omo+0.071
Neg logit contributions
 will-0.107
 am-0.100
tted-0.095
'll-0.095
 deserve-0.095
Token She
Token ID1375
Feature activation+3.06e+00

Pos logit contributions
 am+0.009
 will+0.009
tted+0.008
 deserve+0.008
'll+0.008
Neg logit contributions
lee-0.012
ries-0.009
atted-0.009
ittens-0.009
aj-0.009
Token had
Token ID550
Feature activation+4.22e+00

Pos logit contributions
 will+0.077
 am+0.073
'll+0.067
tted+0.066
 deserve+0.065
Neg logit contributions
lee-0.104
ittens-0.087
ries-0.086
ogging-0.085
ummy-0.084
Token a
Token ID257
Feature activation+2.42e+00

Pos logit contributions
 am+0.112
tted+0.111
 will+0.110
 deserve+0.106
 belong+0.105
Neg logit contributions
lee-0.132
ries-0.099
atted-0.098
ittens-0.098
 oranges-0.093
Token big
Token ID1263
Feature activation+2.23e-01

Pos logit contributions
 will+0.064
 am+0.064
tted+0.062
 deserve+0.060
 belong+0.060
Neg logit contributions
lee-0.076
ries-0.058
ittens-0.057
atted-0.057
ogging-0.054
Token magn
Token ID7842
Feature activation-1.63e+00

Pos logit contributions
 will+0.006
 am+0.006
 belong+0.005
 deserve+0.005
'll+0.005
Neg logit contributions
lee-0.007
ries-0.006
ittens-0.006
atted-0.006
ogging-0.005
Tokenifying
Token ID4035
Feature activation+1.27e+01

Pos logit contributions
lee+0.070
atted+0.058
ries+0.057
ittens+0.057
ummy+0.055
Neg logit contributions
 am-0.025
 will-0.024
tted-0.023
 deserve-0.023
 belong-0.022
Token glass
Token ID5405
Feature activation+7.90e-01

Pos logit contributions
tted+0.374
 will+0.363
 deserve+0.351
 am+0.350
 belong+0.349
Neg logit contributions
lee-0.332
ummy-0.261
ries-0.241
omo-0.241
 oranges-0.237
Token that
Token ID326
Feature activation+4.05e+00

Pos logit contributions
 will+0.025
 am+0.025
tted+0.024
 deserve+0.024
 belong+0.023
Neg logit contributions
lee-0.021
ittens-0.015
ries-0.015
atted-0.015
ogging-0.014
Token made
Token ID925
Feature activation+2.71e+00

Pos logit contributions
 will+0.121
 am+0.121
tted+0.119
 deserve+0.116
 belong+0.114
Neg logit contributions
lee-0.112
ries-0.081
ittens-0.081
atted-0.081
ogging-0.075
Token things
Token ID1243
Feature activation+3.81e+00

Pos logit contributions
 am+0.091
 will+0.091
tted+0.089
 deserve+0.087
 belong+0.086
Neg logit contributions
lee-0.065
ries-0.045
ittens-0.044
atted-0.044
ogging-0.040
Token look
Token ID804
Feature activation-9.11e-01

Pos logit contributions
tted+0.089
 am+0.087
 will+0.085
 deserve+0.084
'll+0.079
Neg logit contributions
lee-0.127
ries-0.100
atted-0.096
ittens-0.095
 oranges-0.093
Token hat
Token ID6877
Feature activation+7.94e-01

Pos logit contributions
 will+0.035
 am+0.035
tted+0.032
 belong+0.032
 make+0.031
Neg logit contributions
lee-0.035
atted-0.026
ogging-0.025
ries-0.025
illy-0.024
Token and
Token ID290
Feature activation-6.99e+00

Pos logit contributions
 will+0.030
 am+0.029
tted+0.029
 deserve+0.027
 belong+0.027
Neg logit contributions
lee-0.017
atted-0.011
ries-0.011
ittens-0.011
ogging-0.010
Token grabbed
Token ID13176
Feature activation-1.33e+00

Pos logit contributions
lee+0.210
ummy+0.168
ogging+0.168
ittens+0.163
atted+0.159
Neg logit contributions
 will-0.201
 am-0.190
 make-0.184
 belong-0.177
 protect-0.175
Token her
Token ID607
Feature activation+1.52e+00

Pos logit contributions
lee+0.038
ittens+0.029
atted+0.028
ogging+0.028
aj+0.027
Neg logit contributions
 will-0.039
 am-0.038
tted-0.037
 belong-0.036
 deserve-0.036
Token magn
Token ID7842
Feature activation-1.49e+00

Pos logit contributions
 will+0.045
 am+0.044
tted+0.041
 belong+0.040
 deserve+0.040
Neg logit contributions
lee-0.045
atted-0.034
ogging-0.032
ries-0.032
ittens-0.031
Tokenifying
Token ID4035
Feature activation+1.27e+01

Pos logit contributions
lee+0.067
ittens+0.055
ries+0.055
atted+0.055
ogging+0.053
Neg logit contributions
 am-0.019
 will-0.019
tted-0.018
 deserve-0.017
 belong-0.017
Token glass
Token ID5405
Feature activation+9.22e-01

Pos logit contributions
 will+0.343
 am+0.343
tted+0.340
 deserve+0.329
 belong+0.328
Neg logit contributions
lee-0.380
ries-0.283
atted-0.282
ittens-0.278
ummy-0.275
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
 will+0.027
 am+0.026
tted+0.026
 deserve+0.025
 belong+0.024
Neg logit contributions
lee-0.027
ittens-0.020
atted-0.019
ries-0.019
ogging-0.018
Token She
Token ID1375
Feature activation+3.29e+00

Pos logit contributions
lee+0.005
ries+0.004
atted+0.004
ittens+0.003
ogging+0.003
Neg logit contributions
 am-0.004
 will-0.004
tted-0.004
 deserve-0.004
'll-0.003
Token was
Token ID373
Feature activation-1.96e+00

Pos logit contributions
 will+0.091
 am+0.087
tted+0.083
 deserve+0.081
'll+0.080
Neg logit contributions
lee-0.104
ogging-0.082
ittens-0.082
ries-0.081
ummy-0.081
Token ready
Token ID3492
Feature activation-3.65e+00

Pos logit contributions
lee+0.057
ittens+0.045
ries+0.044
ogging+0.043
atted+0.043
Neg logit contributions
 will-0.056
 am-0.054
tted-0.053
 deserve-0.052
 belong-0.052
Token said
Token ID531
Feature activation-5.44e+00

Pos logit contributions
lee+0.093
ittens+0.081
ries+0.079
ummy+0.075
ogging+0.075
Neg logit contributions
 will-0.101
 am-0.099
 make-0.099
'll-0.092
 deserve-0.091
Token,
Token ID11
Feature activation-1.08e+01

Pos logit contributions
lee+0.152
ittens+0.119
ries+0.118
omo+0.112
aj+0.112
Neg logit contributions
 am-0.158
 will-0.157
tted-0.154
 make-0.148
 belong-0.146
Token holding
Token ID4769
Feature activation-1.23e+00

Pos logit contributions
lee+0.308
ittens+0.256
aj+0.249
ummy+0.245
illy+0.240
Neg logit contributions
 make-0.298
 protect-0.286
tted-0.282
fire-0.281
 am-0.278
Token the
Token ID262
Feature activation+3.20e+00

Pos logit contributions
lee+0.040
ittens+0.032
atted+0.030
ogging+0.030
ries+0.030
Neg logit contributions
 will-0.032
 am-0.031
tted-0.030
 belong-0.029
 deserve-0.028
Token magn
Token ID7842
Feature activation-2.05e+00

Pos logit contributions
tted+0.097
 am+0.096
 will+0.095
 deserve+0.094
 belong+0.091
Neg logit contributions
lee-0.086
ries-0.063
ittens-0.063
atted-0.061
 oranges-0.060
Tokenifying
Token ID4035
Feature activation+1.27e+01

Pos logit contributions
lee+0.088
ries+0.072
atted+0.072
ittens+0.071
ummy+0.070
Neg logit contributions
 am-0.032
 will-0.031
 deserve-0.029
 belong-0.029
tted-0.028
Token glass
Token ID5405
Feature activation+6.16e-01

Pos logit contributions
tted+0.358
 will+0.355
 am+0.348
 belong+0.347
 deserve+0.341
Neg logit contributions
lee-0.349
ummy-0.263
ries-0.261
 oranges-0.250
atted-0.248
Token over
Token ID625
Feature activation+2.15e+00

Pos logit contributions
 will+0.018
 am+0.018
tted+0.018
 deserve+0.017
 belong+0.017
Neg logit contributions
lee-0.017
ittens-0.013
ries-0.013
atted-0.013
aj-0.012
Token a
Token ID257
Feature activation+2.58e+00

Pos logit contributions
 will+0.059
 am+0.057
tted+0.054
 make+0.052
 protect+0.051
Neg logit contributions
lee-0.064
ittens-0.053
aj-0.052
atted-0.049
ogging-0.049
Token black
Token ID2042
Feature activation-8.15e-01

Pos logit contributions
 am+0.067
 will+0.067
tted+0.066
 deserve+0.064
 belong+0.063
Neg logit contributions
lee-0.080
ittens-0.061
ries-0.060
atted-0.059
ogging-0.057
Token ant
Token ID1885
Feature activation+3.69e-01

Pos logit contributions
lee+0.026
ries+0.020
ittens+0.020
atted+0.019
ogging+0.019
Neg logit contributions
 will-0.021
 am-0.021
tted-0.021
 deserve-0.020
 belong-0.020
Token flashlight
Token ID38371
Feature activation-2.79e-01

Pos logit contributions
 will+0.029
 am+0.029
tted+0.028
 deserve+0.027
 belong+0.027
Neg logit contributions
lee-0.033
ries-0.025
atted-0.025
ittens-0.025
ogging-0.023
Token,
Token ID11
Feature activation-9.61e+00

Pos logit contributions
lee+0.007
ries+0.004
atted+0.004
ittens+0.004
ogging+0.004
Neg logit contributions
 am-0.009
tted-0.009
 will-0.009
 deserve-0.009
 belong-0.009
Token and
Token ID290
Feature activation-7.21e+00

Pos logit contributions
lee+0.232
ittens+0.160
atted+0.159
ries+0.159
ogging+0.147
Neg logit contributions
 will-0.324
 am-0.322
tted-0.315
 deserve-0.309
 belong-0.305
Token a
Token ID257
Feature activation+1.02e+00

Pos logit contributions
lee+0.214
ittens+0.160
ries+0.160
atted+0.159
ogging+0.149
Neg logit contributions
 am-0.201
 will-0.201
tted-0.197
 deserve-0.191
 belong-0.188
Token magn
Token ID7842
Feature activation-1.20e+00

Pos logit contributions
 will+0.028
 am+0.028
tted+0.027
 deserve+0.026
 belong+0.026
Neg logit contributions
lee-0.031
ries-0.024
atted-0.024
ittens-0.024
ogging-0.022
Tokenifying
Token ID4035
Feature activation+1.26e+01

Pos logit contributions
lee+0.048
atted+0.039
ries+0.039
ittens+0.038
ummy+0.037
Neg logit contributions
 am-0.022
 deserve-0.020
 will-0.020
tted-0.020
 belong-0.019
Token glass
Token ID5405
Feature activation+6.64e-01

Pos logit contributions
tted+0.357
 will+0.351
 am+0.346
 affect+0.342
 deserve+0.341
Neg logit contributions
lee-0.339
ummy-0.262
ries-0.257
 oranges-0.247
omo-0.247
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
 will+0.020
 am+0.020
tted+0.019
 deserve+0.019
 belong+0.018
Neg logit contributions
lee-0.019
ittens-0.014
atted-0.014
ries-0.014
ogging-0.013
Token
Token ID198
Feature activation+3.90e+00

Pos logit contributions
 am+0.004
 will+0.004
tted+0.004
 deserve+0.004
'll+0.004
Neg logit contributions
lee-0.005
ries-0.004
atted-0.003
ittens-0.003
ogging-0.003
Token
Token ID198
Feature activation+3.16e+00

Pos logit contributions
 am+0.094
 will+0.093
tted+0.091
 deserve+0.087
 belong+0.084
Neg logit contributions
lee-0.132
ries-0.104
atted-0.103
ittens-0.102
ogging-0.097
TokenAs
Token ID1722
Feature activation-3.42e+00

Pos logit contributions
 am+0.083
 will+0.083
tted+0.081
 deserve+0.079
 belong+0.078
Neg logit contributions
lee-0.099
ries-0.076
ittens-0.075
atted-0.075
ogging-0.071
Token a
Token ID257
Feature activation+2.65e+00

Pos logit contributions
tted+0.133
 am+0.130
 affect+0.130
 deserve+0.129
 belong+0.126
Neg logit contributions
lee-0.126
 oranges-0.089
ries-0.087
 gigg-0.086
 strawberries-0.085
Token flashlight
Token ID38371
Feature activation-1.56e-01

Pos logit contributions
 will+0.065
 am+0.064
tted+0.059
 belong+0.058
 deserve+0.057
Neg logit contributions
lee-0.092
ries-0.074
atted-0.074
ittens-0.071
ogging-0.069
Token and
Token ID290
Feature activation-6.79e+00

Pos logit contributions
lee+0.003
ittens+0.002
ries+0.002
atted+0.002
ogging+0.002
Neg logit contributions
 am-0.006
 will-0.006
tted-0.006
 deserve-0.005
 belong-0.005
Token a
Token ID257
Feature activation+1.74e+00

Pos logit contributions
lee+0.215
ries+0.163
ittens+0.163
atted+0.162
 oranges+0.151
Neg logit contributions
 will-0.174
 am-0.174
tted-0.172
 deserve-0.166
 belong-0.161
Token magn
Token ID7842
Feature activation-1.57e+00

Pos logit contributions
 will+0.045
 am+0.045
tted+0.043
 deserve+0.041
 belong+0.041
Neg logit contributions
lee-0.056
ries-0.044
atted-0.043
ittens-0.043
ogging-0.041
Tokenifying
Token ID4035
Feature activation+1.25e+01

Pos logit contributions
lee+0.063
atted+0.051
ittens+0.050
ries+0.050
ummy+0.050
Neg logit contributions
 am-0.029
 deserve-0.028
 belong-0.027
 will-0.027
tted-0.025
Token glass
Token ID5405
Feature activation+6.87e-01

Pos logit contributions
tted+0.377
 will+0.363
 deserve+0.358
 affect+0.357
 belong+0.355
Neg logit contributions
lee-0.307
ummy-0.243
omo-0.228
 yogurt-0.225
 oranges-0.225
Token to
Token ID284
Feature activation+5.20e+00

Pos logit contributions
 am+0.020
 will+0.020
tted+0.020
 deserve+0.020
 belong+0.019
Neg logit contributions
lee-0.019
ittens-0.014
ries-0.014
atted-0.014
ogging-0.013
Token examine
Token ID10716
Feature activation-1.70e-01

Pos logit contributions
tted+0.095
 will+0.095
 deserve+0.093
'll+0.090
 am+0.090
Neg logit contributions
lee-0.193
ittens-0.154
ries-0.151
 oranges-0.151
 gigg-0.146
Token their
Token ID511
Feature activation+2.17e+00

Pos logit contributions
lee+0.005
ittens+0.004
atted+0.004
ries+0.004
ogging+0.003
Neg logit contributions
 will-0.005
 am-0.005
tted-0.005
 deserve-0.005
 belong-0.005
Token finds
Token ID7228
Feature activation+1.88e+00

Pos logit contributions
 am+0.059
 will+0.059
tted+0.058
 deserve+0.056
 belong+0.055
Neg logit contributions
lee-0.066
ittens-0.050
ries-0.050
atted-0.049
ogging-0.046
Token She
Token ID1375
Feature activation+2.59e+00

Pos logit contributions
 will+0.010
 am+0.010
tted+0.010
 deserve+0.010
'll+0.009
Neg logit contributions
lee-0.010
ries-0.007
ittens-0.007
atted-0.007
aj-0.007
Token took
Token ID1718
Feature activation+1.83e-01

Pos logit contributions
 will+0.065
 am+0.062
tted+0.061
 deserve+0.058
'll+0.058
Neg logit contributions
lee-0.086
ittens-0.071
ries-0.070
atted-0.067
ogging-0.067
Token out
Token ID503
Feature activation+2.39e+00

Pos logit contributions
 am+0.005
 will+0.005
tted+0.005
 deserve+0.005
 belong+0.005
Neg logit contributions
lee-0.006
atted-0.004
ries-0.004
ittens-0.004
ogging-0.004
Token a
Token ID257
Feature activation+2.21e+00

Pos logit contributions
 will+0.059
 am+0.058
tted+0.056
 belong+0.054
 deserve+0.054
Neg logit contributions
lee-0.076
ittens-0.062
aj-0.058
ries-0.058
atted-0.058
Token magn
Token ID7842
Feature activation-1.77e+00

Pos logit contributions
tted+0.055
 am+0.055
 will+0.055
 deserve+0.054
 belong+0.052
Neg logit contributions
lee-0.068
ittens-0.052
atted-0.051
ries-0.051
ogging-0.051
Tokenifying
Token ID4035
Feature activation+1.25e+01

Pos logit contributions
lee+0.074
atted+0.061
ries+0.060
ittens+0.059
ummy+0.058
Neg logit contributions
 am-0.030
 will-0.028
 deserve-0.027
 belong-0.027
tted-0.025
Token glass
Token ID5405
Feature activation+7.60e-01

Pos logit contributions
tted+0.353
 will+0.350
 belong+0.347
 am+0.342
 deserve+0.335
Neg logit contributions
lee-0.336
ummy-0.255
ries-0.243
atted-0.237
 oranges-0.235
Token and
Token ID290
Feature activation-6.92e+00

Pos logit contributions
 am+0.026
 will+0.026
tted+0.025
 deserve+0.025
 belong+0.024
Neg logit contributions
lee-0.018
ittens-0.012
ries-0.012
atted-0.012
ogging-0.011
Token looked
Token ID3114
Feature activation-3.29e+00

Pos logit contributions
lee+0.213
ittens+0.164
ries+0.162
atted+0.161
ogging+0.154
Neg logit contributions
 will-0.187
 am-0.185
tted-0.181
 deserve-0.176
 belong-0.173
Token very
Token ID845
Feature activation-5.65e+00

Pos logit contributions
lee+0.104
ittens+0.088
ries+0.083
omo+0.081
atted+0.080
Neg logit contributions
 will-0.086
 am-0.082
tted-0.079
 belong-0.077
 deserve-0.077
Token carefully
Token ID7773
Feature activation-2.03e+00

Pos logit contributions
lee+0.150
ittens+0.125
ries+0.119
atted+0.111
omo+0.109
Neg logit contributions
 will-0.174
 am-0.170
tted-0.164
 deserve-0.161
 belong-0.161
Token,
Token ID11
Feature activation-9.22e+00

Pos logit contributions
 will+0.049
 am+0.049
tted+0.048
 deserve+0.047
 belong+0.046
Neg logit contributions
lee-0.028
ries-0.018
ittens-0.018
atted-0.018
ogging-0.016
Token next
Token ID1306
Feature activation+7.24e-02

Pos logit contributions
lee+0.250
ittens+0.186
ries+0.181
ummy+0.180
ogging+0.178
Neg logit contributions
 will-0.288
 am-0.278
tted-0.267
 deserve-0.264
 make-0.263
Token to
Token ID284
Feature activation+4.27e+00

Pos logit contributions
tted+0.002
 am+0.002
 deserve+0.002
 belong+0.002
 will+0.002
Neg logit contributions
lee-0.002
atted-0.001
ittens-0.001
ummy-0.001
ries-0.001
Token her
Token ID607
Feature activation+5.99e-02

Pos logit contributions
 will+0.147
 am+0.146
tted+0.142
 deserve+0.140
 belong+0.139
Neg logit contributions
lee-0.101
ittens-0.070
atted-0.069
ries-0.069
ogging-0.063
Token magn
Token ID7842
Feature activation-1.18e+00

Pos logit contributions
 will+0.002
 am+0.002
tted+0.002
 belong+0.002
 deserve+0.002
Neg logit contributions
lee-0.001
atted-0.001
ittens-0.001
ries-0.001
ogging-0.001
Tokenifying
Token ID4035
Feature activation+1.25e+01

Pos logit contributions
lee+0.051
atted+0.042
ries+0.042
ittens+0.042
ummy+0.040
Neg logit contributions
 am-0.018
 will-0.017
 deserve-0.016
tted-0.016
 belong-0.016
Token glass
Token ID5405
Feature activation+5.76e-01

Pos logit contributions
tted+0.339
 will+0.337
 am+0.335
 belong+0.328
 deserve+0.324
Neg logit contributions
lee-0.364
ries-0.274
ummy-0.270
atted-0.268
aj-0.264
Token.
Token ID13
Feature activation-2.01e-01

Pos logit contributions
 will+0.017
 am+0.017
tted+0.016
 deserve+0.015
 belong+0.015
Neg logit contributions
lee-0.017
ittens-0.012
atted-0.012
ries-0.012
ogging-0.011
Token She
Token ID1375
Feature activation+2.89e+00

Pos logit contributions
lee+0.006
ries+0.005
ittens+0.005
atted+0.005
ogging+0.004
Neg logit contributions
 will-0.006
 am-0.006
tted-0.005
 deserve-0.005
'll-0.005
Token looked
Token ID3114
Feature activation-4.19e+00

Pos logit contributions
 will+0.080
 am+0.076
tted+0.071
'll+0.070
 deserve+0.069
Neg logit contributions
lee-0.091
ittens-0.075
ries-0.073
ogging-0.071
ummy-0.071
Token at
Token ID379
Feature activation+1.83e+00

Pos logit contributions
lee+0.123
ittens+0.115
omo+0.104
ries+0.100
ogging+0.098
Neg logit contributions
 will-0.123
'll-0.106
 am-0.106
 make-0.101
tted-0.099
Token He
Token ID679
Feature activation+7.66e-01

Pos logit contributions
lee+0.025
ries+0.018
atted+0.018
ittens+0.018
ogging+0.017
Neg logit contributions
 will-0.024
 am-0.024
tted-0.024
 belong-0.023
 deserve-0.023
Token took
Token ID1718
Feature activation-6.78e-01

Pos logit contributions
 will+0.020
 am+0.019
tted+0.019
 deserve+0.017
'll+0.017
Neg logit contributions
lee-0.025
ittens-0.021
ries-0.021
ogging-0.020
ummy-0.020
Token out
Token ID503
Feature activation+1.80e+00

Pos logit contributions
lee+0.017
atted+0.013
ries+0.013
 oranges+0.012
 gigg+0.011
Neg logit contributions
 am-0.021
 affect-0.021
tted-0.020
 will-0.020
 deserve-0.020
Token a
Token ID257
Feature activation+1.64e+00

Pos logit contributions
 will+0.048
 am+0.047
tted+0.046
 deserve+0.044
 belong+0.043
Neg logit contributions
lee-0.055
ittens-0.044
ries-0.042
atted-0.041
ogging-0.040
Token magn
Token ID7842
Feature activation-2.14e+00

Pos logit contributions
 am+0.041
tted+0.041
 will+0.040
 deserve+0.040
 belong+0.038
Neg logit contributions
lee-0.052
ittens-0.040
atted-0.039
ries-0.039
ogging-0.038
Tokenifying
Token ID4035
Feature activation+1.24e+01

Pos logit contributions
lee+0.090
atted+0.074
ries+0.073
ittens+0.072
ummy+0.071
Neg logit contributions
 am-0.036
 deserve-0.033
 will-0.033
 belong-0.033
tted-0.031
Token glass
Token ID5405
Feature activation+3.29e-01

Pos logit contributions
tted+0.353
 belong+0.344
 will+0.343
 am+0.339
 affect+0.336
Neg logit contributions
lee-0.337
ummy-0.257
ries-0.247
aj-0.241
omo-0.240
Token and
Token ID290
Feature activation-7.37e+00

Pos logit contributions
 will+0.011
 am+0.011
tted+0.011
 deserve+0.011
 belong+0.011
Neg logit contributions
lee-0.008
ittens-0.005
ries-0.005
atted-0.005
ogging-0.005
Token a
Token ID257
Feature activation+1.59e+00

Pos logit contributions
lee+0.225
ittens+0.175
ries+0.170
atted+0.170
ogging+0.163
Neg logit contributions
 will-0.203
 am-0.200
tted-0.194
 deserve-0.190
 belong-0.187
Token notebook
Token ID20922
Feature activation+1.82e+00

Pos logit contributions
 am+0.042
 will+0.042
tted+0.042
 deserve+0.041
 belong+0.040
Neg logit contributions
lee-0.048
ittens-0.036
atted-0.035
ries-0.035
aj-0.034
Token and
Token ID290
Feature activation-7.51e+00

Pos logit contributions
 am+0.058
 will+0.057
tted+0.056
 deserve+0.056
 belong+0.055
Neg logit contributions
lee-0.046
atted-0.032
ries-0.032
ittens-0.032
ogging-0.030
Token bouncing
Token ID30713
Feature activation-2.25e+00

Pos logit contributions
lee+0.127
ittens+0.086
ries+0.086
atted+0.085
ogging+0.077
Neg logit contributions
 am-0.188
 will-0.188
tted-0.184
 deserve-0.180
 belong-0.178
Token around
Token ID1088
Feature activation+2.35e+00

Pos logit contributions
lee+0.076
ries+0.059
atted+0.058
ittens+0.058
aj+0.057
Neg logit contributions
 am-0.057
 will-0.056
tted-0.053
 belong-0.052
 protect-0.050
Token,
Token ID11
Feature activation-9.26e+00

Pos logit contributions
 am+0.072
 will+0.071
tted+0.066
 make+0.065
fire+0.065
Neg logit contributions
lee-0.065
atted-0.048
ittens-0.048
aj-0.047
ries-0.046
Token an
Token ID281
Feature activation-1.07e+00

Pos logit contributions
lee+0.250
ries+0.200
atted+0.198
aj+0.195
ittens+0.183
Neg logit contributions
 am-0.283
 will-0.271
 make-0.257
 conquer-0.257
tted-0.256
Token unpleasant
Token ID22029
Feature activation-3.73e+00

Pos logit contributions
aj+0.029
lee+0.028
atted+0.027
ries+0.026
ittens+0.025
Neg logit contributions
 belong-0.027
age-0.026
tted-0.025
 protect-0.025
 deserve-0.024
Token od
Token ID16298
Feature activation-1.54e+01

Pos logit contributions
lee+0.106
ittens+0.078
ries+0.077
atted+0.077
ogging+0.073
Neg logit contributions
 will-0.109
tted-0.108
 am-0.107
 deserve-0.104
 belong-0.102
Tokenour
Token ID454
Feature activation+1.65e+00

Pos logit contributions
lee+0.356
 oranges+0.289
omo+0.284
ries+0.279
ittens+0.270
Neg logit contributions
 will-0.433
tted-0.429
 am-0.419
'll-0.411
 affect-0.405
Token wa
Token ID2082
Feature activation-8.47e+00

Pos logit contributions
 will+0.045
 deserve+0.044
 am+0.044
tted+0.044
 belong+0.042
Neg logit contributions
lee-0.051
ittens-0.039
atted-0.038
ries-0.037
illy-0.034
Tokenft
Token ID701
Feature activation+7.87e-01

Pos logit contributions
lee+0.247
ittens+0.186
atted+0.186
aj+0.179
ries+0.175
Neg logit contributions
 am-0.239
 will-0.233
 deserve-0.231
tted-0.228
 belong-0.225
Tokened
Token ID276
Feature activation-4.64e+00

Pos logit contributions
 am+0.027
 will+0.026
 deserve+0.026
tted+0.025
 belong+0.025
Neg logit contributions
lee-0.019
atted-0.013
ries-0.013
ittens-0.012
ogging-0.012
Token into
Token ID656
Feature activation+1.71e+00

Pos logit contributions
lee+0.147
ittens+0.116
ries+0.111
atted+0.110
ogging+0.107
Neg logit contributions
 will-0.123
 am-0.120
tted-0.117
 belong-0.112
 deserve-0.112
Tokensix
Token ID19412
Feature activation-5.89e+00

Pos logit contributions
lee+0.225
ittens+0.171
ries+0.171
atted+0.171
ogging+0.161
Neg logit contributions
 will-0.188
 am-0.188
tted-0.183
 deserve-0.178
 belong-0.175
Token,
Token ID11
Feature activation-1.05e+01

Pos logit contributions
lee+0.124
ittens+0.080
atted+0.079
ries+0.078
ogging+0.073
Neg logit contributions
 will-0.219
 am-0.216
tted-0.205
 deserve-0.205
 belong-0.203
Token forty
Token ID16571
Feature activation-6.69e+00

Pos logit contributions
lee+0.340
ries+0.263
ittens+0.262
atted+0.261
ogging+0.247
Neg logit contributions
 am-0.262
 will-0.261
tted-0.257
 deserve-0.247
 belong-0.244
Token-
Token ID12
Feature activation-7.02e+00

Pos logit contributions
lee+0.135
atted+0.080
ittens+0.079
ries+0.078
ogging+0.074
Neg logit contributions
 will-0.255
 am-0.252
tted-0.238
 deserve-0.238
 belong-0.233
Tokenseven
Token ID26548
Feature activation-5.97e-01

Pos logit contributions
lee+0.219
ittens+0.166
ries+0.166
atted+0.166
ogging+0.156
Neg logit contributions
 will-0.185
 am-0.185
tted-0.181
 deserve-0.175
 belong-0.173
Token,
Token ID11
Feature activation-1.21e+01

Pos logit contributions
lee+0.012
ittens+0.008
ries+0.008
atted+0.008
ogging+0.007
Neg logit contributions
 am-0.022
 will-0.022
tted-0.021
 deserve-0.021
 belong-0.020
Token forty
Token ID16571
Feature activation-7.94e+00

Pos logit contributions
lee+0.426
atted+0.340
ittens+0.337
ogging+0.329
aj+0.327
Neg logit contributions
 will-0.283
 am-0.278
 deserve-0.256
 make-0.242
'll-0.241
Token-
Token ID12
Feature activation-8.43e+00

Pos logit contributions
lee+0.167
ries+0.101
atted+0.099
ittens+0.098
ogging+0.096
Neg logit contributions
 will-0.295
 am-0.294
tted-0.275
 deserve-0.275
 belong-0.267
Tokeneight
Token ID26022
Feature activation-3.41e+00

Pos logit contributions
lee+0.249
ittens+0.186
ries+0.185
atted+0.185
ogging+0.175
Neg logit contributions
 will-0.237
 am-0.237
tted-0.232
 deserve-0.226
 belong-0.221
Token,
Token ID11
Feature activation-1.14e+01

Pos logit contributions
lee+0.076
ittens+0.048
atted+0.048
ogging+0.046
ries+0.045
Neg logit contributions
 am-0.123
 will-0.123
 deserve-0.113
tted-0.113
 belong-0.111
Token forty
Token ID16571
Feature activation-7.26e+00

Pos logit contributions
lee+0.400
ittens+0.316
atted+0.315
ries+0.304
ogging+0.303
Neg logit contributions
 will-0.267
 am-0.261
 deserve-0.242
tted-0.232
'll-0.230
Token felt
Token ID2936
Feature activation-2.32e+00

Pos logit contributions
lee+0.181
atted+0.129
ries+0.125
ittens+0.123
ogging+0.117
Neg logit contributions
 am-0.219
 will-0.218
tted-0.217
 deserve-0.211
 belong-0.206
Token the
Token ID262
Feature activation+2.88e+00

Pos logit contributions
lee+0.054
atted+0.037
 gigg+0.033
ries+0.033
ittens+0.033
Neg logit contributions
tted-0.077
 will-0.077
 am-0.076
 deserve-0.075
'll-0.073
Token familiar
Token ID5385
Feature activation-3.63e+00

Pos logit contributions
tted+0.082
 am+0.078
 deserve+0.076
 will+0.073
'll+0.071
Neg logit contributions
lee-0.082
 gigg-0.059
 oranges-0.059
 strawberries-0.059
atted-0.058
Token feeling
Token ID4203
Feature activation-1.42e+00

Pos logit contributions
lee+0.119
atted+0.090
ries+0.087
ittens+0.087
ogging+0.085
Neg logit contributions
tted-0.092
 will-0.091
 am-0.090
 deserve-0.088
'll-0.085
Token of
Token ID286
Feature activation+3.67e+00

Pos logit contributions
lee+0.039
atted+0.028
ries+0.028
ittens+0.028
ogging+0.026
Neg logit contributions
 will-0.043
 am-0.042
tted-0.041
 deserve-0.041
 belong-0.040
Token impat
Token ID28189
Feature activation-1.21e+01

Pos logit contributions
 will+0.108
 am+0.108
tted+0.106
 deserve+0.103
 belong+0.100
Neg logit contributions
lee-0.105
atted-0.077
ries-0.077
ittens-0.076
ogging-0.072
Tokenience
Token ID1240
Feature activation-9.57e-01

Pos logit contributions
lee+0.259
ittens+0.173
ries+0.171
atted+0.166
aj+0.154
Neg logit contributions
 will-0.440
 am-0.437
tted-0.431
 deserve-0.423
 belong-0.419
Token.
Token ID13
Feature activation+8.05e-01

Pos logit contributions
lee+0.022
atted+0.015
ries+0.014
ittens+0.014
ogging+0.014
Neg logit contributions
 am-0.032
tted-0.032
 will-0.031
 deserve-0.031
 belong-0.030
Token Suddenly
Token ID24975
Feature activation-3.41e+00

Pos logit contributions
 will+0.024
 am+0.024
tted+0.023
 belong+0.023
 deserve+0.022
Neg logit contributions
lee-0.023
atted-0.017
ries-0.016
ittens-0.016
ummy-0.016
Token,
Token ID11
Feature activation-8.61e+00

Pos logit contributions
lee+0.070
ittens+0.048
atted+0.045
ries+0.045
aj+0.042
Neg logit contributions
 will-0.128
 am-0.127
tted-0.121
 belong-0.120
 deserve-0.119
Token she
Token ID673
Feature activation+7.23e-01

Pos logit contributions
lee+0.205
ries+0.144
atted+0.140
ittens+0.137
ummy+0.127
Neg logit contributions
tted-0.286
 will-0.286
 am-0.286
 deserve-0.278
 belong-0.271
Token
Token ID198
Feature activation+3.49e+00

Pos logit contributions
 am+0.100
 will+0.098
tted+0.097
 deserve+0.092
'll+0.089
Neg logit contributions
lee-0.145
ries-0.115
atted-0.114
ittens-0.114
aj-0.107
Token"
Token ID1
Feature activation+2.53e+00

Pos logit contributions
 am+0.106
tted+0.105
 will+0.103
 deserve+0.101
'll+0.096
Neg logit contributions
lee-0.096
ries-0.074
atted-0.072
ittens-0.072
aj-0.066
TokenMe
Token ID5308
Feature activation-3.35e+00

Pos logit contributions
 will+0.054
 am+0.054
tted+0.052
 deserve+0.050
 belong+0.050
Neg logit contributions
lee-0.091
ries-0.073
ittens-0.073
atted-0.072
ogging-0.069
Tokenow
Token ID322
Feature activation-3.30e+00

Pos logit contributions
lee+0.118
ittens+0.090
atted+0.090
ries+0.089
ogging+0.086
Neg logit contributions
 am-0.079
 will-0.079
tted-0.074
 deserve-0.073
 belong-0.072
Token,"
Token ID553
Feature activation-1.52e+00

Pos logit contributions
lee+0.088
ittens+0.064
ries+0.064
atted+0.063
ogging+0.059
Neg logit contributions
 will-0.102
 am-0.101
tted-0.099
 deserve-0.096
 belong-0.095
Token m
Token ID285
Feature activation-1.21e+01

Pos logit contributions
lee+0.035
ittens+0.028
aj+0.026
ogging+0.026
atted+0.025
Neg logit contributions
 am-0.052
 will-0.050
 belong-0.049
 make-0.049
 deserve-0.047
Tokenew
Token ID413
Feature activation-5.14e+00

Pos logit contributions
lee+0.345
 shrimp+0.275
 oranges+0.274
 strawberries+0.273
 potatoes+0.269
Neg logit contributions
 am-0.339
tted-0.329
age-0.312
 will-0.304
'm-0.290
Tokened
Token ID276
Feature activation-6.49e+00

Pos logit contributions
lee+0.124
ittens+0.102
aj+0.096
omo+0.094
ummy+0.094
Neg logit contributions
 am-0.152
 will-0.148
tted-0.146
 deserve-0.140
 make-0.137
Token the
Token ID262
Feature activation+8.01e-01

Pos logit contributions
lee+0.157
ittens+0.149
aj+0.132
ogging+0.127
omo+0.125
Neg logit contributions
 am-0.186
 make-0.186
 will-0.183
 belong-0.182
 protect-0.175
Token cat
Token ID3797
Feature activation-6.05e-01

Pos logit contributions
 will+0.022
 am+0.022
tted+0.021
 deserve+0.020
 belong+0.020
Neg logit contributions
lee-0.025
atted-0.019
ries-0.019
ittens-0.018
ogging-0.018
Token,
Token ID11
Feature activation-1.04e+01

Pos logit contributions
lee+0.016
ittens+0.012
ries+0.012
ummy+0.012
atted+0.011
Neg logit contributions
 will-0.019
 am-0.019
tted-0.018
 belong-0.017
'll-0.017
Token things
Token ID1243
Feature activation+3.70e+00

Pos logit contributions
lee+0.076
ittens+0.062
atted+0.059
ries+0.057
aj+0.057
Neg logit contributions
 will-0.063
 am-0.061
 deserve-0.057
tted-0.056
 belong-0.056
Tokenâ
Token ID22940
Feature activation-4.43e+00

Pos logit contributions
 will+0.107
 am+0.105
tted+0.101
 deserve+0.099
 belong+0.099
Neg logit contributions
lee-0.110
ittens-0.085
atted-0.083
ries-0.081
aj-0.077
Token
Token ID26391
Feature activation-2.33e+00

Pos logit contributions
lee+0.141
illy+0.112
ittens+0.109
aj+0.109
ummy+0.106
Neg logit contributions
 will-0.104
 am-0.102
 affect-0.096
tted-0.096
'll-0.095
Token
Token ID447
Feature activation+8.18e-01

Pos logit contributions
lee+0.072
ittens+0.055
ries+0.054
atted+0.053
aj+0.052
Neg logit contributions
 will-0.062
 am-0.061
tted-0.061
 belong-0.059
 deserve-0.059
Token
Token ID251
Feature activation+2.68e-01

Pos logit contributions
 am+0.020
 will+0.020
tted+0.020
 deserve+0.019
 belong+0.019
Neg logit contributions
lee-0.027
ittens-0.021
ries-0.021
atted-0.021
ogging-0.019
Tokencol
Token ID4033
Feature activation-1.20e+01

Pos logit contributions
 will+0.008
 am+0.008
tted+0.007
 deserve+0.007
 belong+0.007
Neg logit contributions
lee-0.008
ittens-0.006
ries-0.006
atted-0.006
aj-0.005
Tokenouring
Token ID21823
Feature activation-1.41e+00

Pos logit contributions
lee+0.313
 oranges+0.237
 shrimp+0.237
ittens+0.236
 Uncle+0.231
Neg logit contributions
tted-0.331
 will-0.328
'll-0.322
 am-0.311
 deserve-0.302
Token books
Token ID3835
Feature activation+2.66e+00

Pos logit contributions
lee+0.041
ittens+0.033
ries+0.031
ogging+0.031
atted+0.031
Neg logit contributions
 will-0.039
 am-0.039
tted-0.036
 belong-0.035
 deserve-0.035
Token,
Token ID11
Feature activation-1.01e+01

Pos logit contributions
 am+0.096
 will+0.096
tted+0.094
 deserve+0.092
 belong+0.091
Neg logit contributions
lee-0.057
ittens-0.037
ries-0.037
atted-0.037
ogging-0.034
Token stickers
Token ID28568
Feature activation+3.50e-01

Pos logit contributions
lee+0.253
ittens+0.196
ries+0.192
atted+0.182
ogging+0.181
Neg logit contributions
 will-0.317
 am-0.306
 make-0.299
 deserve-0.297
'll-0.290
Token and
Token ID290
Feature activation-7.55e+00

Pos logit contributions
 am+0.014
 will+0.014
tted+0.013
 deserve+0.013
 belong+0.013
Neg logit contributions
lee-0.007
ittens-0.004
ries-0.004
atted-0.004
ogging-0.003
Tokenthree
Token ID15542
Feature activation-1.48e+00

Pos logit contributions
lee+0.226
ittens+0.171
atted+0.168
ries+0.168
aj+0.161
Neg logit contributions
 will-0.213
 am-0.212
tted-0.207
 deserve-0.203
 belong-0.196
Token,
Token ID11
Feature activation-1.17e+01

Pos logit contributions
lee+0.030
ittens+0.019
ries+0.019
atted+0.019
ogging+0.017
Neg logit contributions
 will-0.055
 am-0.055
tted-0.054
 deserve-0.053
 belong-0.053
Token thirty
Token ID12277
Feature activation-5.90e+00

Pos logit contributions
lee+0.427
ittens+0.354
atted+0.347
ogging+0.340
aj+0.338
Neg logit contributions
 am-0.249
 will-0.246
 deserve-0.224
 make-0.213
tted-0.210
Token-
Token ID12
Feature activation-7.43e+00

Pos logit contributions
lee+0.119
ries+0.073
ittens+0.073
atted+0.071
aj+0.068
Neg logit contributions
 will-0.220
 am-0.220
tted-0.207
 deserve-0.206
 belong-0.202
Tokenfour
Token ID14337
Feature activation-8.57e-01

Pos logit contributions
lee+0.224
ittens+0.170
atted+0.167
ries+0.167
aj+0.159
Neg logit contributions
 will-0.205
 am-0.204
tted-0.199
 deserve-0.194
 belong-0.188
Token,
Token ID11
Feature activation-1.19e+01

Pos logit contributions
lee+0.020
ittens+0.014
ries+0.014
atted+0.014
aj+0.013
Neg logit contributions
 am-0.030
 will-0.029
tted-0.028
 deserve-0.028
 belong-0.027
Token thirty
Token ID12277
Feature activation-6.26e+00

Pos logit contributions
lee+0.412
ittens+0.330
atted+0.323
ries+0.320
ogging+0.310
Neg logit contributions
 will-0.280
 am-0.280
 deserve-0.261
tted-0.256
 belong-0.247
Token-
Token ID12
Feature activation-7.67e+00

Pos logit contributions
lee+0.126
ries+0.076
ittens+0.076
atted+0.075
aj+0.072
Neg logit contributions
 am-0.236
 will-0.235
tted-0.223
 deserve-0.221
 belong-0.217
Tokenfive
Token ID13261
Feature activation-2.36e+00

Pos logit contributions
lee+0.237
ittens+0.180
ries+0.179
atted+0.179
ogging+0.169
Neg logit contributions
 will-0.205
 am-0.205
tted-0.200
 deserve-0.195
 belong-0.191
Token,
Token ID11
Feature activation-1.11e+01

Pos logit contributions
lee+0.051
ittens+0.033
ries+0.033
atted+0.032
ogging+0.030
Neg logit contributions
 am-0.086
 will-0.086
tted-0.083
 deserve-0.082
 belong-0.081
Token thirty
Token ID12277
Feature activation-5.55e+00

Pos logit contributions
lee+0.376
ittens+0.298
atted+0.293
ries+0.290
ogging+0.280
Neg logit contributions
 will-0.269
 am-0.267
 deserve-0.250
tted-0.248
 belong-0.240
Token thirty
Token ID12277
Feature activation-6.02e+00

Pos logit contributions
lee+0.406
aj+0.337
ogging+0.337
atted+0.334
ittens+0.332
Neg logit contributions
 am-0.234
 will-0.230
 make-0.210
 deserve-0.207
 protect-0.191
Token,
Token ID11
Feature activation-1.15e+01

Pos logit contributions
lee+0.128
ries+0.082
ittens+0.080
aj+0.079
atted+0.078
Neg logit contributions
 am-0.220
 will-0.217
tted-0.207
 deserve-0.204
 belong-0.198
Token thirty
Token ID12277
Feature activation-6.12e+00

Pos logit contributions
lee+0.421
aj+0.351
ogging+0.351
atted+0.348
ittens+0.346
Neg logit contributions
 am-0.235
 will-0.229
 make-0.210
 deserve-0.205
 protect-0.190
Token-
Token ID12
Feature activation-7.86e+00

Pos logit contributions
lee+0.129
ries+0.082
ittens+0.080
aj+0.079
atted+0.078
Neg logit contributions
 am-0.225
 will-0.222
tted-0.211
 deserve-0.209
 belong-0.203
Tokenone
Token ID505
Feature activation-3.61e-02

Pos logit contributions
lee+0.237
ittens+0.181
atted+0.178
ries+0.177
aj+0.173
Neg logit contributions
 will-0.214
 am-0.213
tted-0.208
 deserve-0.204
 belong-0.194
Token,
Token ID11
Feature activation-1.19e+01

Pos logit contributions
lee+0.001
atted+0.001
ittens+0.001
ries+0.001
ogging+0.001
Neg logit contributions
 will-0.001
 am-0.001
tted-0.001
 deserve-0.001
 belong-0.001
Token thirty
Token ID12277
Feature activation-6.33e+00

Pos logit contributions
lee+0.438
ogging+0.365
aj+0.364
atted+0.362
ittens+0.362
Neg logit contributions
 am-0.239
 will-0.235
 make-0.212
 deserve-0.209
'll-0.194
Token-
Token ID12
Feature activation-8.04e+00

Pos logit contributions
lee+0.132
ries+0.083
ittens+0.082
aj+0.080
atted+0.079
Neg logit contributions
 am-0.233
 will-0.231
tted-0.219
 deserve-0.217
 belong-0.211
Tokentwo
Token ID11545
Feature activation-2.11e+00

Pos logit contributions
lee+0.241
ittens+0.184
atted+0.181
ries+0.180
aj+0.175
Neg logit contributions
 will-0.221
 am-0.220
tted-0.214
 deserve-0.210
 belong-0.201
Token,
Token ID11
Feature activation-1.17e+01

Pos logit contributions
lee+0.046
ittens+0.031
ries+0.031
atted+0.030
ogging+0.029
Neg logit contributions
 am-0.075
 will-0.075
tted-0.072
 deserve-0.071
 belong-0.069
Token thirty
Token ID12277
Feature activation-6.04e+00

Pos logit contributions
lee+0.428
ittens+0.356
atted+0.351
ogging+0.349
aj+0.348
Neg logit contributions
 am-0.243
 will-0.238
 deserve-0.214
 make-0.211
'll-0.198
Token-
Token ID12
Feature activation-6.25e+00

Pos logit contributions
lee+0.049
ittens+0.036
atted+0.035
ries+0.035
aj+0.034
Neg logit contributions
 will-0.059
 am-0.058
tted-0.057
 deserve-0.054
 belong-0.054
Tokento
Token ID1462
Feature activation+4.64e-01

Pos logit contributions
lee+0.193
ries+0.153
atted+0.148
 oranges+0.142
ittens+0.139
Neg logit contributions
 am-0.166
 belong-0.160
 will-0.159
 deserve-0.155
tted-0.153
Token-
Token ID12
Feature activation-6.16e+00

Pos logit contributions
 am+0.014
 will+0.014
 deserve+0.014
tted+0.014
 belong+0.014
Neg logit contributions
lee-0.012
atted-0.009
ries-0.009
ittens-0.009
ogging-0.008
Tokenfoot
Token ID5898
Feature activation-2.40e+00

Pos logit contributions
lee+0.189
ries+0.151
atted+0.145
 oranges+0.140
ittens+0.137
Neg logit contributions
 am-0.164
 belong-0.159
 will-0.158
 deserve-0.154
tted-0.152
Token in
Token ID287
Feature activation+4.07e+00

Pos logit contributions
lee+0.061
ittens+0.043
atted+0.043
ries+0.043
ogging+0.040
Neg logit contributions
 will-0.077
 am-0.077
tted-0.075
 deserve-0.073
 belong-0.072
Token impat
Token ID28189
Feature activation-1.19e+01

Pos logit contributions
tted+0.127
 deserve+0.122
 am+0.121
 belong+0.121
 will+0.121
Neg logit contributions
lee-0.108
ries-0.077
 oranges-0.073
atted-0.073
 gigg-0.071
Tokenience
Token ID1240
Feature activation-8.79e-01

Pos logit contributions
lee+0.279
ittens+0.207
omo+0.205
ries+0.201
atted+0.190
Neg logit contributions
 will-0.396
 am-0.378
tted-0.365
 deserve-0.362
 belong-0.362
Token.
Token ID13
Feature activation+6.11e-01

Pos logit contributions
lee+0.021
ries+0.013
ittens+0.013
atted+0.013
ogging+0.013
Neg logit contributions
 am-0.030
tted-0.029
 will-0.029
 deserve-0.029
 belong-0.028
Token
Token ID198
Feature activation+3.88e+00

Pos logit contributions
 am+0.016
 will+0.016
tted+0.015
 deserve+0.015
'll+0.014
Neg logit contributions
lee-0.019
ries-0.015
ittens-0.015
atted-0.015
aj-0.014
Token
Token ID198
Feature activation+3.23e+00

Pos logit contributions
 am+0.095
 will+0.095
tted+0.093
 deserve+0.089
 belong+0.087
Neg logit contributions
lee-0.129
ries-0.100
atted-0.100
ittens-0.099
ogging-0.094
TokenFinally
Token ID11158
Feature activation-1.96e+00

Pos logit contributions
 will+0.082
 am+0.081
 belong+0.078
tted+0.077
 deserve+0.076
Neg logit contributions
lee-0.104
ittens-0.080
atted-0.077
ries-0.077
ogging-0.076
Token smile
Token ID8212
Feature activation-2.76e+00

Pos logit contributions
lee+0.039
ittens+0.031
ries+0.030
atted+0.029
aj+0.029
Neg logit contributions
 am-0.033
 will-0.033
 belong-0.030
tted-0.030
 deserve-0.029
Token.
Token ID13
Feature activation-6.37e-01

Pos logit contributions
lee+0.086
ittens+0.068
aj+0.066
atted+0.066
ries+0.066
Neg logit contributions
 will-0.068
 am-0.068
 belong-0.065
age-0.064
tted-0.063
Token
Token ID198
Feature activation+3.36e+00

Pos logit contributions
lee+0.025
ries+0.022
atted+0.021
ittens+0.021
illy+0.020
Neg logit contributions
 am-0.011
 will-0.011
tted-0.010
'll-0.010
 deserve-0.010
Token
Token ID198
Feature activation+2.86e+00

Pos logit contributions
 am+0.063
 will+0.061
tted+0.059
 deserve+0.053
'll+0.052
Neg logit contributions
lee-0.128
ries-0.111
atted-0.107
 oranges-0.103
ittens-0.103
TokenMary
Token ID24119
Feature activation-4.22e+00

Pos logit contributions
tted+0.072
 will+0.072
 am+0.072
 deserve+0.068
fire+0.065
Neg logit contributions
lee-0.095
ries-0.074
atted-0.074
ittens-0.071
ummy-0.068
Token anx
Token ID7296
Feature activation-1.19e+01

Pos logit contributions
lee+0.117
ittens+0.105
ries+0.096
ogging+0.095
ummy+0.092
Neg logit contributions
 will-0.126
 am-0.117
tted-0.114
'll-0.113
 deserve-0.110
Tokeniously
Token ID6819
Feature activation-8.88e+00

Pos logit contributions
lee+0.262
ls+0.214
 shrimp+0.210
ittens+0.209
 strawberries+0.208
Neg logit contributions
 limit-0.345
age-0.334
 protect-0.334
 belong-0.333
 remain-0.327
Token opened
Token ID4721
Feature activation-1.77e+00

Pos logit contributions
lee+0.214
ittens+0.183
aj+0.172
ries+0.164
omo+0.162
Neg logit contributions
 will-0.286
 belong-0.273
 make-0.266
 am-0.262
 deserve-0.262
Token up
Token ID510
Feature activation-4.46e-01

Pos logit contributions
lee+0.056
ittens+0.049
aj+0.046
ogging+0.045
omo+0.044
Neg logit contributions
 will-0.044
 am-0.041
 make-0.040
'll-0.038
tted-0.038
Token the
Token ID262
Feature activation+3.84e+00

Pos logit contributions
lee+0.015
ittens+0.012
aj+0.012
omo+0.011
atted+0.011
Neg logit contributions
 am-0.011
 will-0.011
tted-0.010
 make-0.010
 belong-0.010
Token pass
Token ID1208
Feature activation+1.23e-01

Pos logit contributions
tted+0.097
 am+0.094
 deserve+0.094
 will+0.091
 belong+0.090
Neg logit contributions
lee-0.116
ries-0.093
atted-0.090
 oranges-0.088
 shrimp-0.088
Tokensix
Token ID19412
Feature activation-5.28e+00

Pos logit contributions
lee+0.215
ittens+0.164
ries+0.162
atted+0.162
ogging+0.153
Neg logit contributions
 will-0.183
 am-0.182
tted-0.178
 deserve-0.173
 belong-0.169
Token,
Token ID11
Feature activation-1.02e+01

Pos logit contributions
lee+0.112
ittens+0.073
atted+0.072
ries+0.071
ogging+0.067
Neg logit contributions
 will-0.195
 am-0.193
 deserve-0.180
tted-0.180
 belong-0.179
Token thirty
Token ID12277
Feature activation-5.34e+00

Pos logit contributions
lee+0.345
ittens+0.272
atted+0.268
ries+0.262
ogging+0.255
Neg logit contributions
 will-0.250
 am-0.249
 deserve-0.233
tted-0.232
 belong-0.225
Token-
Token ID12
Feature activation-6.81e+00

Pos logit contributions
lee+0.104
ittens+0.062
atted+0.062
ries+0.061
ogging+0.057
Neg logit contributions
 will-0.205
 am-0.204
tted-0.194
 deserve-0.193
 belong-0.190
Tokenseven
Token ID26548
Feature activation-1.93e-01

Pos logit contributions
lee+0.214
ittens+0.163
ries+0.162
atted+0.162
ogging+0.153
Neg logit contributions
 will-0.179
 am-0.178
tted-0.174
 deserve-0.169
 belong-0.165
Token,
Token ID11
Feature activation-1.19e+01

Pos logit contributions
lee+0.004
ittens+0.003
ries+0.003
atted+0.003
ogging+0.002
Neg logit contributions
 am-0.007
 will-0.007
tted-0.007
 deserve-0.007
 belong-0.006
Token thirty
Token ID12277
Feature activation-6.66e+00

Pos logit contributions
lee+0.412
atted+0.333
ittens+0.332
ogging+0.327
aj+0.325
Neg logit contributions
 will-0.274
 am-0.273
 deserve-0.245
 make-0.239
'll-0.228
Token-
Token ID12
Feature activation-8.21e+00

Pos logit contributions
lee+0.135
ries+0.082
ittens+0.080
atted+0.080
ogging+0.077
Neg logit contributions
 am-0.251
 will-0.250
tted-0.236
 deserve-0.234
 belong-0.230
Tokeneight
Token ID26022
Feature activation-3.28e+00

Pos logit contributions
lee+0.242
ittens+0.182
atted+0.180
ries+0.180
aj+0.171
Neg logit contributions
 will-0.231
 am-0.230
tted-0.225
 deserve-0.220
 belong-0.213
Token,
Token ID11
Feature activation-1.14e+01

Pos logit contributions
lee+0.073
ittens+0.047
atted+0.047
ogging+0.046
aj+0.044
Neg logit contributions
 am-0.118
 will-0.117
tted-0.107
 deserve-0.107
 belong-0.105
Token thirty
Token ID12277
Feature activation-6.07e+00

Pos logit contributions
lee+0.405
ittens+0.325
atted+0.324
ogging+0.315
aj+0.313
Neg logit contributions
 will-0.259
 am-0.255
 deserve-0.232
 make-0.220
'll-0.218
Token some
Token ID617
Feature activation+5.87e+00

Pos logit contributions
 am+0.014
 will+0.014
tted+0.013
 deserve+0.013
 belong+0.013
Neg logit contributions
lee-0.012
ries-0.009
ittens-0.009
atted-0.009
ogging-0.008
Token broccoli
Token ID44653
Feature activation+3.20e+00

Pos logit contributions
 will+0.194
 am+0.190
 deserve+0.184
tted+0.183
 belong+0.181
Neg logit contributions
lee-0.150
ittens-0.106
atted-0.105
ries-0.102
aj-0.098
Token!"
Token ID2474
Feature activation-9.62e-01

Pos logit contributions
 will+0.095
 am+0.094
tted+0.092
 deserve+0.090
 belong+0.089
Neg logit contributions
lee-0.090
ittens-0.067
atted-0.066
ries-0.065
ogging-0.062
Token
Token ID198
Feature activation+2.57e+00

Pos logit contributions
lee+0.034
ries+0.027
atted+0.027
ittens+0.027
aj+0.026
Neg logit contributions
 am-0.022
 will-0.022
tted-0.021
 deserve-0.020
'll-0.019
Token
Token ID198
Feature activation+2.26e+00

Pos logit contributions
 am+0.061
 will+0.060
tted+0.059
 deserve+0.055
'll+0.053
Neg logit contributions
lee-0.088
ries-0.071
atted-0.069
ittens-0.068
 oranges-0.065
TokenTan
Token ID45557
Feature activation-1.19e+01

Pos logit contributions
 will+0.071
 am+0.070
tted+0.069
 deserve+0.067
 belong+0.067
Neg logit contributions
lee-0.060
ittens-0.043
ries-0.043
atted-0.043
ogging-0.040
Tokenner
Token ID1008
Feature activation-7.96e-01

Pos logit contributions
lee+0.361
ittens+0.272
ries+0.271
atted+0.270
ogging+0.253
Neg logit contributions
 am-0.326
 will-0.325
tted-0.317
 deserve-0.308
 belong-0.301
Token was
Token ID373
Feature activation-2.38e+00

Pos logit contributions
lee+0.019
ittens+0.014
ries+0.014
atted+0.013
aj+0.013
Neg logit contributions
 will-0.027
tted-0.026
 am-0.026
 deserve-0.025
 belong-0.025
Token confused
Token ID10416
Feature activation-7.30e+00

Pos logit contributions
lee+0.064
ittens+0.052
ries+0.050
ogging+0.048
ummy+0.048
Neg logit contributions
 am-0.071
tted-0.071
 will-0.070
 deserve-0.066
 belong-0.065
Token,
Token ID11
Feature activation-1.01e+01

Pos logit contributions
lee+0.229
ittens+0.192
ries+0.189
aj+0.180
atted+0.177
Neg logit contributions
 will-0.179
 am-0.177
 make-0.163
 belong-0.159
 deserve-0.157
Token but
Token ID475
Feature activation-3.34e+00

Pos logit contributions
lee+0.283
ries+0.225
ittens+0.218
atted+0.215
aj+0.208
Neg logit contributions
 will-0.297
 am-0.291
 deserve-0.278
tted-0.277
 make-0.270
Token underwater
Token ID21258
Feature activation-2.40e+00

Pos logit contributions
 am+0.044
 will+0.044
tted+0.043
 deserve+0.042
 belong+0.041
Neg logit contributions
lee-0.062
ittens-0.048
ries-0.048
atted-0.048
ogging-0.046
Token kingdom
Token ID13239
Feature activation+2.35e+00

Pos logit contributions
lee+0.084
ittens+0.066
atted+0.065
ries+0.065
ogging+0.064
Neg logit contributions
 am-0.053
tted-0.053
 will-0.052
 deserve-0.050
 belong-0.050
Token?"
Token ID1701
Feature activation-4.36e-01

Pos logit contributions
tted+0.070
 am+0.068
 will+0.066
 deserve+0.065
 belong+0.065
Neg logit contributions
lee-0.063
ries-0.046
ittens-0.046
atted-0.045
aj-0.045
Token
Token ID198
Feature activation+2.79e+00

Pos logit contributions
lee+0.015
atted+0.012
ries+0.012
aj+0.012
ittens+0.012
Neg logit contributions
 am-0.010
 will-0.010
 deserve-0.009
'll-0.009
tted-0.009
Token
Token ID198
Feature activation+2.53e+00

Pos logit contributions
 am+0.068
 will+0.067
tted+0.066
 deserve+0.063
 belong+0.061
Neg logit contributions
lee-0.094
ries-0.075
atted-0.074
ittens-0.073
aj-0.069
TokenGab
Token ID46079
Feature activation-1.19e+01

Pos logit contributions
 am+0.071
 will+0.071
tted+0.070
 deserve+0.068
 belong+0.066
Neg logit contributions
lee-0.075
ries-0.058
atted-0.057
ittens-0.057
aj-0.053
Tokenri
Token ID380
Feature activation-5.65e+00

Pos logit contributions
lee+0.332
ries+0.257
ittens+0.249
atted+0.248
 oranges+0.234
Neg logit contributions
 will-0.345
 am-0.342
tted-0.339
 deserve-0.324
'll-0.318
Tokenella
Token ID12627
Feature activation-3.02e+00

Pos logit contributions
lee+0.201
ries+0.162
ittens+0.161
atted+0.158
ogging+0.154
Neg logit contributions
 am-0.123
 will-0.123
tted-0.121
 deserve-0.114
 belong-0.111
Token was
Token ID373
Feature activation-2.61e+00

Pos logit contributions
lee+0.083
ittens+0.074
ries+0.069
ogging+0.064
ummy+0.064
Neg logit contributions
 will-0.090
 am-0.085
'll-0.082
tted-0.081
 deserve-0.079
Token so
Token ID523
Feature activation-5.76e+00

Pos logit contributions
lee+0.069
ittens+0.056
ries+0.054
atted+0.050
ogging+0.050
Neg logit contributions
 will-0.080
 am-0.080
tted-0.079
 deserve-0.076
'll-0.075
Token excited
Token ID6568
Feature activation-8.71e+00

Pos logit contributions
lee+0.133
ries+0.106
ittens+0.106
atted+0.092
aj+0.088
Neg logit contributions
 am-0.204
 will-0.204
tted-0.196
 deserve-0.193
 belong-0.189
Tokenthree
Token ID15542
Feature activation-9.74e-01

Pos logit contributions
lee+0.212
ittens+0.160
ries+0.160
atted+0.160
ogging+0.151
Neg logit contributions
 will-0.184
 am-0.184
tted-0.180
 deserve-0.174
 belong-0.172
Token,
Token ID11
Feature activation-1.05e+01

Pos logit contributions
lee+0.018
ittens+0.011
ries+0.011
atted+0.011
ogging+0.010
Neg logit contributions
 will-0.038
 am-0.037
tted-0.037
 deserve-0.036
 belong-0.036
Token forty
Token ID16571
Feature activation-6.54e+00

Pos logit contributions
lee+0.344
ries+0.267
ittens+0.265
atted+0.265
ogging+0.251
Neg logit contributions
 am-0.261
 will-0.259
tted-0.257
 deserve-0.246
 belong-0.244
Token-
Token ID12
Feature activation-6.86e+00

Pos logit contributions
lee+0.134
atted+0.081
ittens+0.078
ries+0.078
ogging+0.073
Neg logit contributions
 will-0.247
 am-0.243
tted-0.230
 deserve-0.230
 belong-0.224
Tokenfour
Token ID14337
Feature activation-9.67e-01

Pos logit contributions
lee+0.213
ries+0.162
ittens+0.162
atted+0.162
ogging+0.153
Neg logit contributions
 am-0.181
 will-0.181
tted-0.176
 deserve-0.171
 belong-0.169
Token,
Token ID11
Feature activation-1.18e+01

Pos logit contributions
lee+0.020
ittens+0.013
ries+0.013
atted+0.013
aj+0.012
Neg logit contributions
 am-0.036
 will-0.036
tted-0.034
 deserve-0.034
 belong-0.033
Token forty
Token ID16571
Feature activation-7.57e+00

Pos logit contributions
lee+0.395
ries+0.307
ittens+0.307
atted+0.306
ogging+0.290
Neg logit contributions
 am-0.282
 will-0.281
tted-0.276
 deserve-0.266
 belong-0.262
Token-
Token ID12
Feature activation-8.00e+00

Pos logit contributions
lee+0.156
ries+0.093
atted+0.093
ittens+0.090
aj+0.087
Neg logit contributions
 will-0.284
 am-0.283
tted-0.269
 deserve-0.267
 belong-0.261
Tokenfive
Token ID13261
Feature activation-2.57e+00

Pos logit contributions
lee+0.265
ries+0.205
ittens+0.205
atted+0.205
ogging+0.194
Neg logit contributions
 will-0.195
 am-0.194
tted-0.189
 deserve-0.183
 belong-0.180
Token,
Token ID11
Feature activation-1.11e+01

Pos logit contributions
lee+0.051
ittens+0.032
ries+0.032
atted+0.032
ogging+0.029
Neg logit contributions
 will-0.097
 am-0.097
tted-0.095
 deserve-0.093
 belong-0.092
Token forty
Token ID16571
Feature activation-6.92e+00

Pos logit contributions
lee+0.366
ries+0.283
ittens+0.283
atted+0.282
ogging+0.267
Neg logit contributions
 am-0.273
 will-0.272
tted-0.268
 deserve-0.257
 belong-0.254
Token about
Token ID546
Feature activation+2.59e+00

Pos logit contributions
 am+0.157
 will+0.156
tted+0.155
 deserve+0.150
 belong+0.146
Neg logit contributions
lee-0.196
ittens-0.150
ries-0.149
atted-0.148
ogging-0.141
Token some
Token ID617
Feature activation+5.53e+00

Pos logit contributions
 will+0.071
 am+0.068
 deserve+0.064
 belong+0.063
 make+0.063
Neg logit contributions
lee-0.082
atted-0.065
ittens-0.063
ogging-0.061
aj-0.059
Token carrots
Token ID34397
Feature activation+2.06e+00

Pos logit contributions
 will+0.181
 am+0.178
 belong+0.169
 deserve+0.166
 make+0.160
Neg logit contributions
lee-0.153
atted-0.111
ittens-0.101
aj-0.099
ries-0.097
Token?"
Token ID1701
Feature activation-6.01e-01

Pos logit contributions
tted+0.070
 am+0.068
 will+0.068
 deserve+0.067
 belong+0.065
Neg logit contributions
lee-0.046
ries-0.033
ittens-0.032
atted-0.031
ummy-0.030
Token her
Token ID607
Feature activation-2.66e+00

Pos logit contributions
lee+0.015
ries+0.012
aj+0.012
atted+0.012
ogging+0.012
Neg logit contributions
 am-0.019
 will-0.018
 make-0.018
 protect-0.017
'll-0.017
Token m
Token ID285
Feature activation-1.18e+01

Pos logit contributions
ogging+0.069
lee+0.068
ries+0.063
atted+0.061
affles+0.058
Neg logit contributions
 am-0.061
 will-0.061
 make-0.058
 source-0.055
'll-0.054
Tokenama
Token ID1689
Feature activation-6.61e+00

Pos logit contributions
lee+0.303
 oranges+0.255
 shrimp+0.254
 strawberries+0.244
 beans+0.228
Neg logit contributions
age-0.330
 am-0.321
 will-0.297
tted-0.297
'll-0.289
Token replied
Token ID8712
Feature activation-5.01e+00

Pos logit contributions
lee+0.154
ittens+0.147
aj+0.135
ogging+0.135
ries+0.132
Neg logit contributions
 make-0.180
 will-0.177
age-0.163
'll-0.162
 am-0.160
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
lee+0.173
ittens+0.156
omo+0.152
ries+0.149
aj+0.146
Neg logit contributions
tted-0.095
 am-0.093
 make-0.093
fire-0.093
 will-0.091
Token
Token ID198
Feature activation+2.57e+00

Pos logit contributions
lee+0.050
ries+0.044
atted+0.043
ittens+0.041
ogging+0.041
Neg logit contributions
 am-0.022
tted-0.022
 will-0.021
'll-0.020
 deserve-0.019
Token
Token ID198
Feature activation+2.25e+00

Pos logit contributions
 am+0.054
 will+0.053
tted+0.052
 deserve+0.048
'll+0.047
Neg logit contributions
lee-0.096
ries-0.080
atted-0.078
ittens-0.076
 oranges-0.073
Tokentwo
Token ID11545
Feature activation-2.11e+00

Pos logit contributions
lee+0.241
ittens+0.184
atted+0.181
ries+0.180
aj+0.175
Neg logit contributions
 will-0.221
 am-0.220
tted-0.214
 deserve-0.210
 belong-0.201
Token,
Token ID11
Feature activation-1.17e+01

Pos logit contributions
lee+0.046
ittens+0.031
ries+0.031
atted+0.030
ogging+0.029
Neg logit contributions
 am-0.075
 will-0.075
tted-0.072
 deserve-0.071
 belong-0.069
Token thirty
Token ID12277
Feature activation-6.04e+00

Pos logit contributions
lee+0.428
ittens+0.356
atted+0.351
ogging+0.349
aj+0.348
Neg logit contributions
 am-0.243
 will-0.238
 deserve-0.214
 make-0.211
'll-0.198
Token-
Token ID12
Feature activation-7.63e+00

Pos logit contributions
lee+0.123
ries+0.076
ittens+0.076
atted+0.073
aj+0.072
Neg logit contributions
 am-0.224
 will-0.224
tted-0.211
 deserve-0.209
 belong-0.205
Tokenthree
Token ID15542
Feature activation-1.48e+00

Pos logit contributions
lee+0.226
ittens+0.171
atted+0.168
ries+0.168
aj+0.161
Neg logit contributions
 will-0.213
 am-0.212
tted-0.207
 deserve-0.203
 belong-0.196
Token,
Token ID11
Feature activation-1.17e+01

Pos logit contributions
lee+0.030
ittens+0.019
ries+0.019
atted+0.019
ogging+0.017
Neg logit contributions
 will-0.055
 am-0.055
tted-0.054
 deserve-0.053
 belong-0.053
Token thirty
Token ID12277
Feature activation-5.90e+00

Pos logit contributions
lee+0.427
ittens+0.354
atted+0.347
ogging+0.340
aj+0.338
Neg logit contributions
 am-0.249
 will-0.246
 deserve-0.224
 make-0.213
tted-0.210
Token-
Token ID12
Feature activation-7.43e+00

Pos logit contributions
lee+0.119
ries+0.073
ittens+0.073
atted+0.071
aj+0.068
Neg logit contributions
 will-0.220
 am-0.220
tted-0.207
 deserve-0.206
 belong-0.202
Tokenfour
Token ID14337
Feature activation-8.57e-01

Pos logit contributions
lee+0.224
ittens+0.170
atted+0.167
ries+0.167
aj+0.159
Neg logit contributions
 will-0.205
 am-0.204
tted-0.199
 deserve-0.194
 belong-0.188
Token,
Token ID11
Feature activation-1.19e+01

Pos logit contributions
lee+0.020
ittens+0.014
ries+0.014
atted+0.014
aj+0.013
Neg logit contributions
 am-0.030
 will-0.029
tted-0.028
 deserve-0.028
 belong-0.027
Token thirty
Token ID12277
Feature activation-6.26e+00

Pos logit contributions
lee+0.412
ittens+0.330
atted+0.323
ries+0.320
ogging+0.310
Neg logit contributions
 will-0.280
 am-0.280
 deserve-0.261
tted-0.256
 belong-0.247
Token lesson
Token ID11483
Feature activation-1.85e+00

Pos logit contributions
tted+0.062
 am+0.061
age+0.059
 will+0.058
fire+0.058
Neg logit contributions
lee-0.085
 oranges-0.071
ogging-0.066
 strawberries-0.066
 shrimp-0.065
Token Mary
Token ID5335
Feature activation-5.98e-01

Pos logit contributions
lee+0.045
ittens+0.032
atted+0.031
ries+0.031
aj+0.029
Neg logit contributions
 will-0.061
 am-0.061
tted-0.060
 deserve-0.058
 belong-0.058
Token had
Token ID550
Feature activation+5.74e+00

Pos logit contributions
lee+0.016
ittens+0.012
ries+0.012
atted+0.012
ogging+0.011
Neg logit contributions
 will-0.018
 am-0.018
tted-0.018
 deserve-0.017
 belong-0.017
Token learnt
Token ID26338
Feature activation+1.07e+00

Pos logit contributions
 am+0.146
tted+0.144
 will+0.143
fire+0.137
 deserve+0.136
Neg logit contributions
lee-0.184
atted-0.138
ries-0.133
ittens-0.130
ogging-0.128
Token:
Token ID25
Feature activation-4.46e+00

Pos logit contributions
tted+0.034
 am+0.034
 will+0.033
 deserve+0.032
 belong+0.032
Neg logit contributions
lee-0.028
ries-0.020
atted-0.019
ittens-0.019
ogging-0.018
Token impat
Token ID28189
Feature activation-1.17e+01

Pos logit contributions
lee+0.127
atted+0.093
ries+0.093
ittens+0.093
ogging+0.087
Neg logit contributions
 am-0.129
 will-0.128
tted-0.127
 deserve-0.123
 belong-0.121
Tokenience
Token ID1240
Feature activation-1.49e+00

Pos logit contributions
lee+0.265
ittens+0.192
ries+0.189
omo+0.180
atted+0.176
Neg logit contributions
 will-0.404
 am-0.391
tted-0.387
 deserve-0.382
 belong-0.376
Token can
Token ID460
Feature activation+4.66e+00

Pos logit contributions
lee+0.041
atted+0.030
ries+0.029
ittens+0.028
ogging+0.028
Neg logit contributions
tted-0.042
 am-0.042
 deserve-0.040
 will-0.040
 belong-0.040
Token make
Token ID787
Feature activation+3.25e+00

Pos logit contributions
Ready+0.082
tted+0.081
fire+0.079
Left+0.075
 deserve+0.071
Neg logit contributions
lee-0.160
 sc-0.130
 gigg-0.124
ries-0.123
ittens-0.122
Token things
Token ID1243
Feature activation+3.33e+00

Pos logit contributions
tted+0.112
 am+0.109
 will+0.106
 deserve+0.104
fire+0.102
Neg logit contributions
lee-0.076
ries-0.055
atted-0.050
ittens-0.049
ummy-0.048
Token worse
Token ID4785
Feature activation-1.66e+00

Pos logit contributions
tted+0.081
 am+0.074
fire+0.073
Ready+0.072
 will+0.071
Neg logit contributions
lee-0.103
ries-0.078
ogging-0.078
 gigg-0.077
atted-0.077
Token Sam
Token ID3409
Feature activation-4.70e-01

Pos logit contributions
lee+0.132
ittens+0.104
ries+0.098
atted+0.095
ogging+0.094
Neg logit contributions
 will-0.159
 am-0.155
tted-0.149
 deserve-0.148
 belong-0.147
Token could
Token ID714
Feature activation+6.62e+00

Pos logit contributions
lee+0.013
ittens+0.010
ries+0.010
atted+0.009
ogging+0.009
Neg logit contributions
 will-0.014
 am-0.014
tted-0.014
 deserve-0.013
'll-0.013
Token barely
Token ID8523
Feature activation-1.81e+00

Pos logit contributions
 will+0.148
tted+0.148
 am+0.148
 deserve+0.143
 belong+0.137
Neg logit contributions
lee-0.227
atted-0.176
ittens-0.175
ries-0.172
aj-0.166
Token contain
Token ID3994
Feature activation+1.80e+00

Pos logit contributions
lee+0.064
 gigg+0.052
 shrugged+0.051
atted+0.049
ittens+0.049
Neg logit contributions
 will-0.033
tted-0.032
 am-0.031
'll-0.030
Ready-0.030
Token his
Token ID465
Feature activation-7.41e-01

Pos logit contributions
 am+0.046
 will+0.046
 belong+0.044
 protect+0.042
 deserve+0.042
Neg logit contributions
lee-0.055
ittens-0.046
atted-0.044
ries-0.044
aj-0.043
Token impat
Token ID28189
Feature activation-1.17e+01

Pos logit contributions
lee+0.020
ittens+0.014
atted+0.014
ries+0.014
ogging+0.013
Neg logit contributions
 will-0.023
 am-0.023
tted-0.023
 deserve-0.022
 belong-0.022
Tokenience
Token ID1240
Feature activation-1.23e+00

Pos logit contributions
lee+0.276
ries+0.196
ittens+0.191
atted+0.182
omo+0.178
Neg logit contributions
 will-0.397
 am-0.390
tted-0.382
 belong-0.380
 deserve-0.377
Token as
Token ID355
Feature activation+2.10e+00

Pos logit contributions
lee+0.030
ittens+0.020
atted+0.020
ries+0.020
ogging+0.019
Neg logit contributions
 am-0.040
tted-0.040
 will-0.039
 deserve-0.039
 belong-0.038
Token he
Token ID339
Feature activation+2.36e+00

Pos logit contributions
 will+0.071
 am+0.070
tted+0.069
 deserve+0.067
 belong+0.067
Neg logit contributions
lee-0.051
ittens-0.035
ries-0.035
atted-0.035
ogging-0.032
Token opened
Token ID4721
Feature activation+7.39e-03

Pos logit contributions
 am+0.061
tted+0.060
 will+0.060
 deserve+0.058
 belong+0.058
Neg logit contributions
lee-0.074
atted-0.056
ries-0.055
ittens-0.055
ogging-0.052
Token the
Token ID262
Feature activation+4.82e+00

Pos logit contributions
 will+0.000
 am+0.000
tted+0.000
 deserve+0.000
 belong+0.000
Neg logit contributions
lee-0.000
ittens-0.000
atted-0.000
ries-0.000
ogging-0.000
TokenM
Token ID44
Feature activation-5.86e+00

Pos logit contributions
 will+0.053
 am+0.053
tted+0.051
 deserve+0.049
 belong+0.049
Neg logit contributions
lee-0.097
ittens-0.078
ries-0.078
atted-0.078
ogging-0.074
Tokenm
Token ID76
Feature activation-3.58e+00

Pos logit contributions
lee+0.133
atted+0.092
ries+0.091
 gigg+0.091
 shrugged+0.085
Neg logit contributions
 am-0.202
 will-0.194
tted-0.188
 belong-0.186
 deserve-0.185
Tokenh
Token ID71
Feature activation-5.53e+00

Pos logit contributions
lee+0.099
ries+0.076
atted+0.072
ittens+0.072
ogging+0.071
Neg logit contributions
 am-0.107
 will-0.104
tted-0.100
 belong-0.100
 deserve-0.099
Tokenmm
Token ID3020
Feature activation-4.59e+00

Pos logit contributions
lee+0.110
ries+0.072
ittens+0.068
atted+0.067
 oranges+0.060
Neg logit contributions
 am-0.209
 will-0.208
tted-0.203
 deserve-0.199
 belong-0.197
Token,"
Token ID553
Feature activation-2.07e+00

Pos logit contributions
lee+0.119
atted+0.084
ittens+0.077
ogging+0.076
 shrugged+0.074
Neg logit contributions
 will-0.151
 am-0.137
 belong-0.128
 make-0.126
'll-0.125
Token m
Token ID285
Feature activation-1.17e+01

Pos logit contributions
lee+0.051
aj+0.042
ittens+0.038
atted+0.037
ogging+0.036
Neg logit contributions
 am-0.067
 will-0.065
 make-0.064
 belong-0.062
 protect-0.059
Tokenumbled
Token ID11137
Feature activation-6.13e+00

Pos logit contributions
lee+0.332
 spaghetti+0.265
 potatoes+0.259
 oranges+0.257
 strawberries+0.252
Neg logit contributions
 am-0.327
tted-0.322
age-0.296
 will-0.293
'm-0.284
Token a
Token ID257
Feature activation+2.34e-01

Pos logit contributions
lee+0.142
ittens+0.116
aj+0.110
ummy+0.103
affles+0.100
Neg logit contributions
 will-0.192
 am-0.189
 make-0.188
 belong-0.188
 protect-0.182
Token sleepy
Token ID42023
Feature activation-5.57e+00

Pos logit contributions
 am+0.007
 will+0.007
tted+0.007
 deserve+0.007
 belong+0.006
Neg logit contributions
lee-0.007
ries-0.005
ittens-0.005
atted-0.005
ogging-0.005
Token Bill
Token ID3941
Feature activation-4.12e+00

Pos logit contributions
lee+0.164
ries+0.129
atted+0.125
ittens+0.122
aj+0.119
Neg logit contributions
 will-0.155
 am-0.154
tted-0.149
 deserve-0.147
fire-0.143
Token from
Token ID422
Feature activation-8.43e-01

Pos logit contributions
lee+0.112
ittens+0.082
ries+0.082
atted+0.080
aj+0.075
Neg logit contributions
 will-0.126
 am-0.125
tted-0.122
 deserve-0.119
 belong-0.118
Tokenone
Token ID505
Feature activation-3.61e-02

Pos logit contributions
lee+0.237
ittens+0.181
atted+0.178
ries+0.177
aj+0.173
Neg logit contributions
 will-0.214
 am-0.213
tted-0.208
 deserve-0.204
 belong-0.194
Token,
Token ID11
Feature activation-1.19e+01

Pos logit contributions
lee+0.001
atted+0.001
ittens+0.001
ries+0.001
ogging+0.001
Neg logit contributions
 will-0.001
 am-0.001
tted-0.001
 deserve-0.001
 belong-0.001
Token thirty
Token ID12277
Feature activation-6.33e+00

Pos logit contributions
lee+0.438
ogging+0.365
aj+0.364
atted+0.362
ittens+0.362
Neg logit contributions
 am-0.239
 will-0.235
 make-0.212
 deserve-0.209
'll-0.194
Token-
Token ID12
Feature activation-8.04e+00

Pos logit contributions
lee+0.132
ries+0.083
ittens+0.082
aj+0.080
atted+0.079
Neg logit contributions
 am-0.233
 will-0.231
tted-0.219
 deserve-0.217
 belong-0.211
Tokentwo
Token ID11545
Feature activation-2.11e+00

Pos logit contributions
lee+0.241
ittens+0.184
atted+0.181
ries+0.180
aj+0.175
Neg logit contributions
 will-0.221
 am-0.220
tted-0.214
 deserve-0.210
 belong-0.201
Token,
Token ID11
Feature activation-1.17e+01

Pos logit contributions
lee+0.046
ittens+0.031
ries+0.031
atted+0.030
ogging+0.029
Neg logit contributions
 am-0.075
 will-0.075
tted-0.072
 deserve-0.071
 belong-0.069
Token thirty
Token ID12277
Feature activation-6.04e+00

Pos logit contributions
lee+0.428
ittens+0.356
atted+0.351
ogging+0.349
aj+0.348
Neg logit contributions
 am-0.243
 will-0.238
 deserve-0.214
 make-0.211
'll-0.198
Token-
Token ID12
Feature activation-7.63e+00

Pos logit contributions
lee+0.123
ries+0.076
ittens+0.076
atted+0.073
aj+0.072
Neg logit contributions
 am-0.224
 will-0.224
tted-0.211
 deserve-0.209
 belong-0.205
Tokenthree
Token ID15542
Feature activation-1.48e+00

Pos logit contributions
lee+0.226
ittens+0.171
atted+0.168
ries+0.168
aj+0.161
Neg logit contributions
 will-0.213
 am-0.212
tted-0.207
 deserve-0.203
 belong-0.196
Token,
Token ID11
Feature activation-1.17e+01

Pos logit contributions
lee+0.030
ittens+0.019
ries+0.019
atted+0.019
ogging+0.017
Neg logit contributions
 will-0.055
 am-0.055
tted-0.054
 deserve-0.053
 belong-0.053
Token thirty
Token ID12277
Feature activation-5.90e+00

Pos logit contributions
lee+0.427
ittens+0.354
atted+0.347
ogging+0.340
aj+0.338
Neg logit contributions
 am-0.249
 will-0.246
 deserve-0.224
 make-0.213
tted-0.210
Token the
Token ID262
Feature activation+1.58e+00

Pos logit contributions
lee+0.241
ittens+0.170
atted+0.170
ries+0.170
ogging+0.157
Neg logit contributions
 will-0.295
 am-0.292
tted-0.284
 deserve-0.278
 belong-0.273
Token rabbit
Token ID22746
Feature activation-1.05e+00

Pos logit contributions
 am+0.049
 will+0.048
tted+0.048
 deserve+0.046
 belong+0.046
Neg logit contributions
lee-0.043
ittens-0.031
atted-0.031
ries-0.031
ogging-0.029
Token's
Token ID338
Feature activation-5.31e-01

Pos logit contributions
lee+0.033
ittens+0.026
ries+0.026
atted+0.025
ummy+0.024
Neg logit contributions
 will-0.028
 am-0.027
tted-0.027
 deserve-0.026
'll-0.026
Token family
Token ID1641
Feature activation+3.20e-01

Pos logit contributions
lee+0.014
ries+0.011
ittens+0.011
atted+0.010
ogging+0.010
Neg logit contributions
 will-0.016
 am-0.016
tted-0.016
 deserve-0.015
 belong-0.015
Token was
Token ID373
Feature activation-1.83e+00

Pos logit contributions
 am+0.008
 will+0.008
tted+0.008
 deserve+0.008
 belong+0.008
Neg logit contributions
lee-0.010
atted-0.008
ries-0.008
ittens-0.008
ogging-0.007
Token h
Token ID289
Feature activation-1.17e+01

Pos logit contributions
lee+0.049
atted+0.035
ries+0.035
ittens+0.034
ogging+0.032
Neg logit contributions
 am-0.057
 will-0.056
tted-0.055
 deserve-0.054
 belong-0.053
Tokenuddled
Token ID32745
Feature activation-5.24e+00

Pos logit contributions
lee+0.344
ries+0.259
ittens+0.254
atted+0.254
aj+0.237
Neg logit contributions
tted-0.328
 am-0.325
 will-0.324
 deserve-0.310
 belong-0.306
Token together
Token ID1978
Feature activation-1.44e-01

Pos logit contributions
lee+0.148
ries+0.109
atted+0.109
ittens+0.109
ogging+0.102
Neg logit contributions
 am-0.153
 will-0.152
tted-0.149
 deserve-0.146
 belong-0.144
Token a
Token ID257
Feature activation+1.41e+00

Pos logit contributions
lee+0.003
atted+0.002
ries+0.002
ittens+0.002
ogging+0.002
Neg logit contributions
 am-0.005
tted-0.005
 will-0.005
 deserve-0.005
 belong-0.005
Token short
Token ID1790
Feature activation-1.08e+00

Pos logit contributions
 am+0.039
tted+0.038
 will+0.038
 deserve+0.036
 belong+0.036
Neg logit contributions
lee-0.041
ittens-0.031
atted-0.030
ries-0.030
ogging-0.030
Token distance
Token ID5253
Feature activation-1.67e+00

Pos logit contributions
lee+0.037
ittens+0.029
atted+0.028
ries+0.028
ogging+0.028
Neg logit contributions
 am-0.025
 will-0.025
tted-0.024
 deserve-0.023
 belong-0.023
Token scared
Token ID12008
Feature activation-5.62e+00

Pos logit contributions
lee+0.047
ries+0.036
ittens+0.036
atted+0.035
ogging+0.034
Neg logit contributions
 will-0.047
 am-0.047
tted-0.046
 belong-0.045
 deserve-0.044
Token and
Token ID290
Feature activation-7.08e+00

Pos logit contributions
lee+0.158
aj+0.137
ries+0.125
ittens+0.122
omo+0.121
Neg logit contributions
 will-0.166
 am-0.156
 make-0.153
 belong-0.145
 protect-0.144
Token didn
Token ID1422
Feature activation-8.40e-01

Pos logit contributions
lee+0.203
ittens+0.157
ries+0.156
ogging+0.149
atted+0.148
Neg logit contributions
 will-0.209
 am-0.205
 belong-0.196
 deserve-0.193
tted-0.193
Token't
Token ID470
Feature activation+9.80e-01

Pos logit contributions
lee+0.029
ittens+0.024
ogging+0.023
ummy+0.023
ries+0.023
Neg logit contributions
 will-0.018
 deserve-0.016
 am-0.016
tted-0.016
'll-0.016
Token know
Token ID760
Feature activation+3.74e-01

Pos logit contributions
tted+0.022
 will+0.021
fire+0.020
 surely+0.020
 am+0.019
Neg logit contributions
lee-0.032
ogging-0.025
ittens-0.025
 gigg-0.025
 oranges-0.024
Token what
Token ID644
Feature activation+7.52e+00

Pos logit contributions
 will+0.011
 am+0.011
 belong+0.011
 deserve+0.010
tted+0.010
Neg logit contributions
lee-0.011
ittens-0.008
atted-0.008
ries-0.007
aj-0.007
Token to
Token ID284
Feature activation+5.76e+00

Pos logit contributions
tted+0.221
 deserve+0.214
 will+0.212
 am+0.211
 belong+0.206
Neg logit contributions
lee-0.207
ittens-0.150
atted-0.150
ries-0.149
ogging-0.141
Token do
Token ID466
Feature activation+3.57e+00

Pos logit contributions
tted+0.115
 deserve+0.102
Ready+0.099
fire+0.096
 will+0.096
Neg logit contributions
lee-0.202
ittens-0.178
ries-0.174
ogging-0.169
ummy-0.169
Token.
Token ID13
Feature activation+6.77e-01

Pos logit contributions
 will+0.109
 am+0.108
tted+0.106
 deserve+0.104
 belong+0.102
Neg logit contributions
lee-0.097
ittens-0.070
ries-0.070
atted-0.070
ogging-0.065
Token He
Token ID679
Feature activation+2.16e+00

Pos logit contributions
 will+0.017
 am+0.017
tted+0.017
 deserve+0.016
 belong+0.016
Neg logit contributions
lee-0.022
ries-0.017
ittens-0.017
atted-0.017
ogging-0.016
Token sw
Token ID1509
Feature activation-7.57e+00

Pos logit contributions
 will+0.062
 am+0.059
tted+0.056
'll+0.056
 deserve+0.055
Neg logit contributions
lee-0.065
ries-0.053
ittens-0.051
ummy-0.050
ogging-0.050
Token asked
Token ID1965
Feature activation+6.20e-02

Pos logit contributions
 will+0.062
'll+0.057
 am+0.056
 make+0.056
 deserve+0.053
Neg logit contributions
lee-0.077
ogging-0.076
ries-0.075
ummy-0.071
ittens-0.071
Token her
Token ID607
Feature activation-4.17e-01

Pos logit contributions
 will+0.002
 make+0.002
 am+0.002
 belong+0.002
 protect+0.002
Neg logit contributions
lee-0.002
aj-0.002
ittens-0.002
omo-0.001
atted-0.001
Token mom
Token ID1995
Feature activation-9.32e-01

Pos logit contributions
lee+0.013
ogging+0.011
ittens+0.010
ries+0.010
atted+0.010
Neg logit contributions
 will-0.012
 am-0.011
 make-0.011
'll-0.010
 belong-0.010
Token,
Token ID11
Feature activation-9.15e+00

Pos logit contributions
lee+0.026
ittens+0.020
atted+0.019
ries+0.018
aj+0.018
Neg logit contributions
 will-0.030
 am-0.028
'll-0.027
 make-0.026
 belong-0.026
Token "
Token ID366
Feature activation+3.24e+00

Pos logit contributions
lee+0.308
ries+0.250
atted+0.246
ls+0.239
ittens+0.236
Neg logit contributions
 will-0.229
 am-0.225
 make-0.209
'll-0.208
 deserve-0.197
TokenWhat
Token ID2061
Feature activation+8.50e+00

Pos logit contributions
 am+0.078
 will+0.076
tted+0.076
 belong+0.073
 deserve+0.072
Neg logit contributions
lee-0.108
ries-0.086
ittens-0.085
atted-0.083
ummy-0.080
Token's
Token ID338
Feature activation+1.74e+00

Pos logit contributions
tted+0.170
 am+0.161
 will+0.159
 deserve+0.154
fire+0.150
Neg logit contributions
lee-0.313
ries-0.255
ittens-0.250
atted-0.248
ogging-0.241
Token inside
Token ID2641
Feature activation+4.12e+00

Pos logit contributions
 will+0.049
 am+0.048
 belong+0.046
tted+0.045
 deserve+0.045
Neg logit contributions
lee-0.053
ittens-0.041
atted-0.040
ries-0.039
ogging-0.038
Token?"
Token ID1701
Feature activation-3.87e-01

Pos logit contributions
 am+0.131
 will+0.131
tted+0.129
 deserve+0.126
 belong+0.124
Neg logit contributions
lee-0.106
ries-0.075
ittens-0.075
atted-0.075
ogging-0.069
Token Her
Token ID2332
Feature activation-3.16e+00

Pos logit contributions
lee+0.012
ries+0.010
ls+0.009
atted+0.009
ogging+0.009
Neg logit contributions
 am-0.010
 make-0.010
 will-0.010
age-0.009
'll-0.009
Token mom
Token ID1995
Feature activation-2.16e+00

Pos logit contributions
lee+0.090
ogging+0.077
atted+0.073
ries+0.072
illy+0.071
Neg logit contributions
 will-0.091
 am-0.088
 make-0.080
'll-0.078
 belong-0.076
Token him
Token ID683
Feature activation+3.54e-01

Pos logit contributions
lee+0.031
ittens+0.024
atted+0.024
ries+0.024
aj+0.024
Neg logit contributions
 will-0.028
 am-0.028
 belong-0.027
 deserve-0.027
tted-0.027
Token that
Token ID326
Feature activation+2.49e+00

Pos logit contributions
 will+0.011
 am+0.011
 belong+0.010
tted+0.010
 deserve+0.010
Neg logit contributions
lee-0.009
ittens-0.007
ries-0.007
atted-0.007
aj-0.007
Token being
Token ID852
Feature activation+2.74e+00

Pos logit contributions
 will+0.078
 am+0.076
 belong+0.072
 deserve+0.070
 protect+0.069
Neg logit contributions
lee-0.072
ittens-0.053
atted-0.053
aj-0.050
ries-0.048
Token different
Token ID1180
Feature activation-7.93e-02

Pos logit contributions
 am+0.082
 will+0.082
tted+0.080
 deserve+0.078
 belong+0.077
Neg logit contributions
lee-0.076
atted-0.055
ittens-0.055
ries-0.055
ogging-0.051
Token is
Token ID318
Feature activation+8.74e-01

Pos logit contributions
lee+0.002
ries+0.002
atted+0.002
ittens+0.002
ogging+0.001
Neg logit contributions
 am-0.002
 will-0.002
tted-0.002
 deserve-0.002
 belong-0.002
Token what
Token ID644
Feature activation+7.78e+00

Pos logit contributions
 am+0.024
tted+0.024
 will+0.024
 deserve+0.023
 belong+0.022
Neg logit contributions
lee-0.026
atted-0.019
ittens-0.019
ries-0.019
aj-0.018
Token makes
Token ID1838
Feature activation+4.27e+00

Pos logit contributions
 will+0.252
 am+0.252
tted+0.250
 deserve+0.242
 belong+0.238
Neg logit contributions
lee-0.194
ries-0.136
ittens-0.136
atted-0.135
ogging-0.125
Token him
Token ID683
Feature activation+1.30e+00

Pos logit contributions
 will+0.127
 am+0.127
tted+0.127
 deserve+0.121
'll+0.117
Neg logit contributions
lee-0.118
ries-0.086
ittens-0.085
atted-0.085
ogging-0.080
Token adorable
Token ID29012
Feature activation-2.20e+00

Pos logit contributions
tted+0.039
 am+0.038
 will+0.038
Ready+0.036
 deserve+0.036
Neg logit contributions
lee-0.035
ries-0.025
atted-0.025
ittens-0.024
ogging-0.024
Token.
Token ID13
Feature activation+1.96e-02

Pos logit contributions
lee+0.058
ries+0.041
atted+0.041
ittens+0.041
ogging+0.038
Neg logit contributions
tted-0.069
 am-0.068
 will-0.068
 deserve-0.065
 belong-0.064
Token Benny
Token ID44275
Feature activation-3.49e+00

Pos logit contributions
 will+0.001
tted+0.001
 deserve+0.001
 am+0.001
'll+0.001
Neg logit contributions
lee-0.001
ries-0.000
ittens-0.000
atted-0.000
aj-0.000
Token asked
Token ID1965
Feature activation+1.12e+00

Pos logit contributions
lee+0.212
ittens+0.172
ogging+0.163
ummy+0.162
ries+0.160
Neg logit contributions
 will-0.181
 am-0.173
 deserve-0.162
'll-0.162
 make-0.158
Token the
Token ID262
Feature activation+3.15e+00

Pos logit contributions
 will+0.030
 am+0.029
 belong+0.027
 deserve+0.027
'll+0.026
Neg logit contributions
lee-0.036
ittens-0.029
atted-0.028
aj-0.028
ogging-0.027
Token egg
Token ID5935
Feature activation+1.65e-01

Pos logit contributions
tted+0.109
 am+0.107
 will+0.104
 deserve+0.104
 belong+0.102
Neg logit contributions
lee-0.071
ittens-0.050
ries-0.050
atted-0.048
ummy-0.047
Token,
Token ID11
Feature activation-7.96e+00

Pos logit contributions
 will+0.006
 am+0.006
tted+0.006
 deserve+0.005
 belong+0.005
Neg logit contributions
lee-0.004
ittens-0.003
ries-0.003
atted-0.003
ogging-0.002
Token "
Token ID366
Feature activation+4.06e+00

Pos logit contributions
lee+0.222
atted+0.165
ittens+0.162
ries+0.161
aj+0.155
Neg logit contributions
 will-0.248
 am-0.246
 deserve-0.229
tted-0.228
'll-0.228
TokenWhat
Token ID2061
Feature activation+9.15e+00

Pos logit contributions
tted+0.103
 belong+0.100
 am+0.099
 will+0.098
hips+0.097
Neg logit contributions
lee-0.114
ries-0.099
 oranges-0.094
ittens-0.089
 strawberries-0.088
Token are
Token ID389
Feature activation+2.25e+00

Pos logit contributions
tted+0.181
fire+0.160
age+0.150
 surely+0.150
 deserve+0.149
Neg logit contributions
lee-0.332
ries-0.277
atted-0.268
ittens-0.264
ogging-0.264
Token you
Token ID345
Feature activation+3.88e+00

Pos logit contributions
 will+0.056
 am+0.056
 belong+0.053
tted+0.052
 deserve+0.052
Neg logit contributions
lee-0.076
atted-0.058
ittens-0.057
ries-0.056
ogging-0.056
Token doing
Token ID1804
Feature activation+2.72e+00

Pos logit contributions
 am+0.113
 will+0.111
 belong+0.106
 deserve+0.105
tted+0.104
Neg logit contributions
lee-0.114
atted-0.086
ittens-0.085
ries-0.085
ogging-0.079
Token there
Token ID612
Feature activation+9.82e+00

Pos logit contributions
 am+0.081
 will+0.080
 belong+0.075
tted+0.075
 deserve+0.074
Neg logit contributions
lee-0.080
atted-0.061
ittens-0.057
ries-0.056
ogging-0.053
Token?"
Token ID1701
Feature activation+4.84e-01

Pos logit contributions
 will+0.320
 am+0.320
tted+0.319
 deserve+0.308
 belong+0.303
Neg logit contributions
lee-0.241
ries-0.169
ittens-0.168
atted-0.166
ogging-0.156
Token all
Token ID477
Feature activation+2.50e+00

Pos logit contributions
tted+0.074
 am+0.071
 will+0.069
 deserve+0.069
 belong+0.067
Neg logit contributions
lee-0.093
atted-0.070
ittens-0.068
ries-0.068
 gigg-0.066
Token played
Token ID2826
Feature activation-1.05e+00

Pos logit contributions
tted+0.079
 will+0.074
 am+0.074
fire+0.072
Ready+0.072
Neg logit contributions
lee-0.067
atted-0.047
 gigg-0.044
ries-0.041
 shrugged-0.041
Token together
Token ID1978
Feature activation+1.17e+00

Pos logit contributions
lee+0.022
atted+0.015
ittens+0.015
ries+0.014
 gigg+0.014
Neg logit contributions
 am-0.036
 deserve-0.036
 will-0.036
tted-0.036
 belong-0.035
Token happily
Token ID18177
Feature activation-2.51e+00

Pos logit contributions
tted+0.038
 am+0.038
 will+0.037
 deserve+0.037
 belong+0.036
Neg logit contributions
lee-0.028
ries-0.020
atted-0.020
ittens-0.020
ogging-0.019
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
lee+0.061
ries+0.043
atted+0.043
ittens+0.043
ogging+0.043
Neg logit contributions
tted-0.078
 will-0.075
 deserve-0.075
'll-0.074
 am-0.074
Token<|endoftext|>
Token ID50256
Feature activation+7.45e+00

Pos logit contributions
 will+0.004
tted+0.004
 am+0.004
 belong+0.004
 deserve+0.004
Neg logit contributions
lee-0.005
ittens-0.003
ries-0.003
atted-0.003
ummy-0.003
TokenOnce
Token ID7454
Feature activation-1.70e+00

Pos logit contributions
 belong+0.243
 will+0.231
 deserve+0.231
 affect+0.231
 limit+0.228
Neg logit contributions
lee-0.171
ries-0.120
ittens-0.118
ummy-0.116
ogging-0.114
Token upon
Token ID2402
Feature activation+6.54e+00

Pos logit contributions
lee+0.057
ries+0.045
ittens+0.044
atted+0.044
 oranges+0.042
Neg logit contributions
 will-0.042
 am-0.042
tted-0.038
 belong-0.037
 deserve-0.037
Token a
Token ID257
Feature activation+1.52e+00

Pos logit contributions
tted+0.206
 will+0.200
 limit+0.199
hips+0.196
 affect+0.194
Neg logit contributions
lee-0.146
ittens-0.109
 gigg-0.106
 spaghetti-0.104
ries-0.103
Token time
Token ID640
Feature activation-6.03e-01

Pos logit contributions
 will+0.030
 am+0.030
tted+0.029
 deserve+0.028
 belong+0.027
Neg logit contributions
lee-0.058
ittens-0.046
ries-0.046
atted-0.046
ogging-0.044
Token,
Token ID11
Feature activation-7.45e+00

Pos logit contributions
lee+0.016
ries+0.011
atted+0.010
ittens+0.010
 oranges+0.010
Neg logit contributions
 am-0.021
 will-0.020
 belong-0.018
tted-0.018
 are-0.018
Token,
Token ID11
Feature activation-7.83e+00

Pos logit contributions
 will+0.008
 am+0.008
tted+0.007
 deserve+0.007
 belong+0.007
Neg logit contributions
lee-0.007
atted-0.005
ries-0.005
ittens-0.005
aj-0.005
Token and
Token ID290
Feature activation-6.22e+00

Pos logit contributions
lee+0.164
atted+0.104
ries+0.104
ittens+0.104
ogging+0.094
Neg logit contributions
 am-0.287
 will-0.287
tted-0.283
 deserve-0.277
 belong-0.273
Token the
Token ID262
Feature activation+3.39e+00

Pos logit contributions
lee+0.150
atted+0.100
ries+0.098
ittens+0.094
 oranges+0.091
Neg logit contributions
tted-0.211
 am-0.205
 will-0.203
 deserve-0.200
 belong-0.193
Token deer
Token ID20096
Feature activation+4.69e-01

Pos logit contributions
tted+0.117
 am+0.116
 will+0.113
 deserve+0.110
 belong+0.109
Neg logit contributions
lee-0.075
ittens-0.050
ries-0.049
 oranges-0.049
atted-0.048
Token was
Token ID373
Feature activation-7.47e-01

Pos logit contributions
tted+0.012
Ready+0.012
 deserve+0.012
fire+0.012
 belong+0.011
Neg logit contributions
lee-0.014
 gigg-0.010
atted-0.010
ittens-0.009
 shrugged-0.009
Token never
Token ID1239
Feature activation-1.47e-02

Pos logit contributions
lee+0.021
atted+0.015
 gigg+0.014
ries+0.014
ittens+0.014
Neg logit contributions
tted-0.022
 am-0.021
 will-0.021
 deserve-0.021
fire-0.020
Token bored
Token ID17533
Feature activation-2.06e+00

Pos logit contributions
lee+0.000
atted+0.000
ries+0.000
ittens+0.000
ogging+0.000
Neg logit contributions
 will-0.000
 am-0.000
tted-0.000
 deserve-0.000
 belong-0.000
Token again
Token ID757
Feature activation+1.64e+00

Pos logit contributions
lee+0.062
ittens+0.048
ries+0.047
atted+0.047
aj+0.045
Neg logit contributions
 will-0.057
 am-0.057
tted-0.055
 deserve-0.054
 belong-0.053
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
tted+0.053
 will+0.052
 am+0.052
 deserve+0.050
 belong+0.050
Neg logit contributions
lee-0.042
ries-0.029
ittens-0.029
atted-0.029
ogging-0.028
Token<|endoftext|>
Token ID50256
Feature activation+7.86e+00

Pos logit contributions
 will+0.034
tted+0.034
 belong+0.033
 am+0.033
 deserve+0.032
Neg logit contributions
lee-0.032
ittens-0.022
ummy-0.022
atted-0.022
ries-0.021
TokenOne
Token ID3198
Feature activation+3.78e+00

Pos logit contributions
 belong+0.259
 affect+0.250
 deserve+0.248
 limit+0.246
 will+0.246
Neg logit contributions
lee-0.177
ries-0.118
ittens-0.117
ummy-0.117
ogging-0.115
Token a
Token ID257
Feature activation+1.85e+00

Pos logit contributions
tted+0.203
 will+0.199
 am+0.192
 limit+0.192
 deserve+0.190
Neg logit contributions
lee-0.173
ittens-0.129
ries-0.125
 gigg-0.122
atted-0.120
Token time
Token ID640
Feature activation-1.52e-01

Pos logit contributions
 will+0.033
 am+0.033
tted+0.031
 deserve+0.030
 belong+0.029
Neg logit contributions
lee-0.075
ries-0.062
atted-0.060
ittens-0.059
ogging-0.057
Token,
Token ID11
Feature activation-7.08e+00

Pos logit contributions
lee+0.004
ries+0.003
atted+0.003
ittens+0.003
 oranges+0.003
Neg logit contributions
 am-0.005
 will-0.004
 are-0.004
 belong-0.004
tted-0.004
Token there
Token ID612
Feature activation+6.77e+00

Pos logit contributions
lee+0.175
ries+0.125
atted+0.125
ittens+0.123
ummy+0.114
Neg logit contributions
tted-0.225
 will-0.225
 am-0.222
 deserve-0.221
 belong-0.214
Token was
Token ID373
Feature activation-6.05e-01

Pos logit contributions
 will+0.176
tted+0.175
 am+0.173
 deserve+0.171
 belong+0.166
Neg logit contributions
lee-0.210
ries-0.159
atted-0.159
ittens-0.158
ummy-0.149
Token a
Token ID257
Feature activation+1.40e+00

Pos logit contributions
lee+0.024
atted+0.020
ittens+0.019
ries+0.019
ogging+0.019
Neg logit contributions
 am-0.011
 will-0.011
tted-0.010
 deserve-0.010
 belong-0.010
Token little
Token ID1310
Feature activation-4.78e+00

Pos logit contributions
 will+0.038
 am+0.037
 make+0.034
 deserve+0.033
fire+0.033
Neg logit contributions
lee-0.045
ries-0.038
atted-0.038
ogging-0.033
ittens-0.032
Token girl
Token ID2576
Feature activation-9.14e-02

Pos logit contributions
lee+0.187
atted+0.154
aj+0.149
ries+0.148
ittens+0.139
Neg logit contributions
 will-0.101
 am-0.095
 belong-0.081
 make-0.080
fire-0.080
Token named
Token ID3706
Feature activation-5.05e+00

Pos logit contributions
lee+0.003
ries+0.002
ittens+0.002
aj+0.002
atted+0.002
Neg logit contributions
 am-0.003
 will-0.003
 make-0.002
 protect-0.002
 belong-0.002
Token Lily
Token ID20037
Feature activation-4.73e-02

Pos logit contributions
lee+0.111
atted+0.080
ittens+0.077
ries+0.077
ogging+0.073
Neg logit contributions
 am-0.181
 will-0.181
tted-0.175
 deserve-0.175
 belong-0.173
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
lee+0.001
ries+0.001
ittens+0.001
atted+0.001
ogging+0.001
Neg logit contributions
 will-0.001
 am-0.001
tted-0.001
 deserve-0.001
 belong-0.001
Token mine
Token ID6164
Feature activation+2.82e+00

Pos logit contributions
lee+0.004
atted+0.003
ries+0.003
ittens+0.003
ogging+0.003
Neg logit contributions
 will-0.004
 am-0.004
 deserve-0.003
 belong-0.003
tted-0.003
Token.
Token ID13
Feature activation+2.68e-01

Pos logit contributions
 will+0.083
 am+0.081
tted+0.077
 deserve+0.075
 belong+0.074
Neg logit contributions
lee-0.085
atted-0.063
ittens-0.063
ries-0.061
ogging-0.059
Token I
Token ID314
Feature activation+6.06e+00

Pos logit contributions
 am+0.006
 will+0.006
tted+0.006
 deserve+0.006
 belong+0.006
Neg logit contributions
lee-0.009
ries-0.007
atted-0.007
ittens-0.007
ogging-0.007
Token need
Token ID761
Feature activation+5.01e+00

Pos logit contributions
fire+0.127
hips+0.121
 summit+0.119
tted+0.118
 conquer+0.112
Neg logit contributions
lee-0.155
 said-0.152
 gigg-0.148
 wrink-0.147
 at-0.147
Token it
Token ID340
Feature activation+5.02e+00

Pos logit contributions
 will+0.140
 am+0.139
tted+0.134
 deserve+0.132
 belong+0.130
Neg logit contributions
lee-0.151
ittens-0.114
atted-0.113
ries-0.113
ogging-0.107
Token for
Token ID329
Feature activation+6.81e+00

Pos logit contributions
 will+0.141
 am+0.141
tted+0.138
 deserve+0.134
 belong+0.132
Neg logit contributions
lee-0.149
ries-0.111
ittens-0.111
atted-0.111
ogging-0.104
Token my
Token ID616
Feature activation+2.49e+00

Pos logit contributions
 am+0.170
 will+0.168
 deserve+0.167
 belong+0.163
 protect+0.163
Neg logit contributions
lee-0.219
atted-0.178
ogging-0.165
ries-0.164
ittens-0.161
Token games
Token ID1830
Feature activation+3.03e+00

Pos logit contributions
 will+0.065
 am+0.065
tted+0.061
 belong+0.059
 deserve+0.059
Neg logit contributions
lee-0.081
ries-0.062
atted-0.062
ittens-0.060
ogging-0.059
Token."
Token ID526
Feature activation-2.04e-01

Pos logit contributions
tted+0.095
 am+0.094
 will+0.091
 deserve+0.090
 conquer+0.090
Neg logit contributions
lee-0.072
ittens-0.053
aj-0.050
ries-0.050
atted-0.050
Token
Token ID198
Feature activation+3.30e+00

Pos logit contributions
lee+0.009
ries+0.008
atted+0.007
ya+0.007
ittens+0.007
Neg logit contributions
 am-0.003
 will-0.003
'll-0.002
 make-0.002
tted-0.002
Token
Token ID198
Feature activation+2.89e+00

Pos logit contributions
 am+0.075
 will+0.074
tted+0.070
 deserve+0.067
'll+0.064
Neg logit contributions
lee-0.117
ries-0.094
atted-0.093
ittens-0.091
ogging-0.087
Token,
Token ID11
Feature activation-9.45e+00

Pos logit contributions
lee+0.010
atted+0.007
ries+0.007
ittens+0.007
ogging+0.006
Neg logit contributions
 will-0.018
 am-0.018
tted-0.018
 deserve-0.018
 belong-0.017
Token but
Token ID475
Feature activation-3.01e+00

Pos logit contributions
lee+0.299
atted+0.229
ries+0.222
ittens+0.221
ogging+0.219
Neg logit contributions
 will-0.260
 am-0.250
 make-0.246
 belong-0.239
 protect-0.238
Token he
Token ID339
Feature activation+2.60e-01

Pos logit contributions
lee+0.091
atted+0.068
ogging+0.067
ittens+0.066
ries+0.066
Neg logit contributions
 will-0.090
 am-0.088
 belong-0.085
 surely-0.080
 make-0.079
Token couldn
Token ID3521
Feature activation+4.76e-02

Pos logit contributions
 will+0.007
 am+0.007
'll+0.007
tted+0.006
 belong+0.006
Neg logit contributions
lee-0.008
ries-0.007
ogging-0.006
atted-0.006
ittens-0.006
Token't
Token ID470
Feature activation+1.21e+00

Pos logit contributions
 will+0.001
 am+0.001
tted+0.001
 deserve+0.001
 belong+0.001
Neg logit contributions
lee-0.002
ittens-0.001
ries-0.001
atted-0.001
ummy-0.001
Token find
Token ID1064
Feature activation+2.60e+00

Pos logit contributions
 will+0.017
tted+0.016
 am+0.015
 deserve+0.015
Ready+0.013
Neg logit contributions
lee-0.052
ittens-0.042
ogging-0.042
ries-0.041
ummy-0.041
Token it
Token ID340
Feature activation+4.62e+00

Pos logit contributions
 am+0.082
 will+0.082
tted+0.081
 deserve+0.078
 belong+0.077
Neg logit contributions
lee-0.068
ries-0.049
ittens-0.048
atted-0.048
ogging-0.045
Token.
Token ID13
Feature activation+1.93e-01

Pos logit contributions
 will+0.126
 am+0.124
tted+0.120
 belong+0.118
 deserve+0.116
Neg logit contributions
lee-0.146
atted-0.111
ittens-0.110
ries-0.109
aj-0.103
Token He
Token ID679
Feature activation+1.81e+00

Pos logit contributions
tted+0.004
 will+0.004
 am+0.003
'll+0.003
 deserve+0.003
Neg logit contributions
lee-0.007
ries-0.006
ogging-0.006
ica-0.006
atted-0.006
Token felt
Token ID2936
Feature activation-3.04e+00

Pos logit contributions
 will+0.044
 am+0.040
'll+0.039
 make+0.037
tted+0.037
Neg logit contributions
lee-0.061
ries-0.054
ogging-0.053
ittens-0.053
ummy-0.052
Token sad
Token ID6507
Feature activation-6.83e+00

Pos logit contributions
lee+0.079
ries+0.058
ittens+0.058
atted+0.057
aj+0.054
Neg logit contributions
 am-0.097
 will-0.096
tted-0.094
 deserve-0.092
 belong-0.092
Token remembers
Token ID18140
Feature activation-1.73e+00

Pos logit contributions
 will+0.058
 am+0.054
 deserve+0.049
'll+0.049
tted+0.047
Neg logit contributions
lee-0.086
ittens-0.074
ries-0.073
atted-0.069
ogging-0.069
Token how
Token ID703
Feature activation+2.69e+00

Pos logit contributions
lee+0.053
ittens+0.044
atted+0.041
omo+0.040
aj+0.040
Neg logit contributions
 will-0.047
 am-0.046
 make-0.041
 protect-0.040
 belong-0.040
Token she
Token ID673
Feature activation+2.44e+00

Pos logit contributions
 will+0.084
 am+0.082
 belong+0.078
 deserve+0.075
tted+0.074
Neg logit contributions
lee-0.069
ries-0.056
ittens-0.054
atted-0.053
aj-0.050
Token felt
Token ID2936
Feature activation-1.76e+00

Pos logit contributions
 will+0.059
 am+0.058
tted+0.056
 deserve+0.055
 belong+0.054
Neg logit contributions
lee-0.083
ittens-0.065
ries-0.065
atted-0.064
ogging-0.061
Token when
Token ID618
Feature activation-1.38e+00

Pos logit contributions
lee+0.056
ittens+0.044
ries+0.043
atted+0.042
omo+0.042
Neg logit contributions
 am-0.047
 will-0.047
 belong-0.045
 deserve-0.044
tted-0.044
Token she
Token ID673
Feature activation+1.41e+00

Pos logit contributions
lee+0.032
atted+0.023
ittens+0.022
ries+0.021
aj+0.021
Neg logit contributions
 am-0.048
 will-0.048
 belong-0.045
 deserve-0.044
tted-0.044
Token lost
Token ID2626
Feature activation-2.84e+00

Pos logit contributions
 am+0.038
 will+0.038
tted+0.037
 deserve+0.036
 belong+0.035
Neg logit contributions
lee-0.044
ittens-0.033
ries-0.033
atted-0.033
ogging-0.031
Token her
Token ID607
Feature activation+1.36e+00

Pos logit contributions
lee+0.084
atted+0.067
ittens+0.067
aj+0.066
ogging+0.066
Neg logit contributions
 will-0.077
 am-0.076
 belong-0.071
 make-0.071
 protect-0.070
Token favorite
Token ID4004
Feature activation-5.57e-01

Pos logit contributions
 will+0.036
 am+0.036
tted+0.034
 belong+0.034
 deserve+0.033
Neg logit contributions
lee-0.043
ries-0.033
atted-0.033
ittens-0.032
ogging-0.032
Token doll
Token ID3654
Feature activation+1.11e+00

Pos logit contributions
lee+0.017
atted+0.013
ittens+0.013
ries+0.013
ogging+0.012
Neg logit contributions
 am-0.015
 will-0.015
tted-0.015
 deserve-0.014
 belong-0.014
Token.
Token ID13
Feature activation-4.05e-01

Pos logit contributions
 will+0.035
 am+0.035
tted+0.034
 deserve+0.033
 belong+0.033
Neg logit contributions
lee-0.029
ittens-0.021
ries-0.021
atted-0.021
ogging-0.019
Token something
Token ID1223
Feature activation+2.89e+00

Pos logit contributions
lee+0.034
ittens+0.028
atted+0.027
aj+0.027
ries+0.027
Neg logit contributions
 will-0.020
 am-0.020
tted-0.018
 belong-0.018
 deserve-0.018
Token strange
Token ID6283
Feature activation-3.12e+00

Pos logit contributions
 am+0.072
 will+0.070
 belong+0.068
 make+0.063
tted+0.061
Neg logit contributions
lee-0.101
aj-0.083
atted-0.082
ittens-0.082
omo-0.078
Token
Token ID6184
Feature activation-6.30e+00

Pos logit contributions
lee+0.097
ittens+0.076
atted+0.073
aj+0.071
ries+0.070
Neg logit contributions
 am-0.086
 will-0.086
tted-0.080
 belong-0.079
 deserve-0.077
Token
Token ID95
Feature activation-5.32e+00

Pos logit contributions
lee+0.213
ittens+0.163
ries+0.161
atted+0.158
ogging+0.157
Neg logit contributions
 will-0.153
 am-0.146
tted-0.143
 deserve-0.141
 belong-0.138
Token
Token ID26391
Feature activation-3.95e+00

Pos logit contributions
lee+0.180
ica+0.132
illy+0.130
ittens+0.128
ummy+0.125
Neg logit contributions
 will-0.129
'll-0.114
 am-0.113
 affect-0.111
 belong-0.111
Token
Token ID447
Feature activation-7.20e-01

Pos logit contributions
lee+0.120
ittens+0.090
atted+0.086
ries+0.085
aj+0.085
Neg logit contributions
 will-0.106
tted-0.106
 am-0.105
 belong-0.105
 deserve-0.104
Token
Token ID250
Feature activation+2.84e+00

Pos logit contributions
lee+0.024
ittens+0.019
ries+0.019
atted+0.019
ogging+0.018
Neg logit contributions
 will-0.017
 am-0.017
tted-0.017
 deserve-0.016
 belong-0.016
Token it
Token ID340
Feature activation+2.96e+00

Pos logit contributions
 will+0.068
 am+0.065
 belong+0.063
tted+0.062
 deserve+0.060
Neg logit contributions
lee-0.094
ries-0.075
ittens-0.073
atted-0.070
 oranges-0.070
Token was
Token ID373
Feature activation-1.15e+00

Pos logit contributions
tted+0.073
 am+0.071
 will+0.069
 deserve+0.068
 belong+0.066
Neg logit contributions
lee-0.095
atted-0.072
ries-0.072
ittens-0.068
ummy-0.068
Token a
Token ID257
Feature activation+8.46e-01

Pos logit contributions
lee+0.039
atted+0.032
ittens+0.032
ries+0.031
ogging+0.031
Neg logit contributions
 will-0.028
 am-0.027
 deserve-0.025
 belong-0.025
tted-0.025
Token big
Token ID1263
Feature activation-1.44e+00

Pos logit contributions
 will+0.023
 am+0.023
tted+0.021
 deserve+0.021
 belong+0.021
Neg logit contributions
lee-0.027
ries-0.021
atted-0.021
ittens-0.020
ogging-0.019
Token to
Token ID284
Feature activation+4.46e+00

Pos logit contributions
lee+0.083
atted+0.063
ries+0.063
ittens+0.063
ogging+0.060
Neg logit contributions
 am-0.069
 will-0.068
tted-0.066
 deserve-0.066
 belong-0.064
Token the
Token ID262
Feature activation+3.92e+00

Pos logit contributions
tted+0.160
age+0.154
 will+0.153
 am+0.149
 deserve+0.149
Neg logit contributions
lee-0.094
 Sally-0.063
 Sophie-0.062
 Uncle-0.060
 Dad-0.060
Token hole
Token ID7604
Feature activation+2.84e+00

Pos logit contributions
tted+0.114
 deserve+0.110
 am+0.109
 belong+0.109
 surely+0.104
Neg logit contributions
lee-0.099
 oranges-0.077
 strawberries-0.072
 shrimp-0.071
ls-0.069
Token.
Token ID13
Feature activation-4.09e-01

Pos logit contributions
 will+0.081
 am+0.080
tted+0.078
 deserve+0.074
 belong+0.074
Neg logit contributions
lee-0.083
ittens-0.063
ries-0.061
atted-0.060
aj-0.058
Token He
Token ID679
Feature activation+2.64e+00

Pos logit contributions
lee+0.012
ittens+0.009
atted+0.009
ries+0.009
aj+0.008
Neg logit contributions
 will-0.012
 am-0.011
tted-0.011
 deserve-0.011
 belong-0.011
Token peek
Token ID27185
Feature activation-1.46e+00

Pos logit contributions
 will+0.068
 am+0.066
tted+0.064
 deserve+0.060
'll+0.060
Neg logit contributions
lee-0.087
ittens-0.070
ries-0.070
ummy-0.069
ogging-0.068
Tokened
Token ID276
Feature activation-4.59e+00

Pos logit contributions
lee+0.043
ittens+0.033
ries+0.032
atted+0.031
omo+0.030
Neg logit contributions
 will-0.042
tted-0.042
 am-0.042
 deserve-0.039
 belong-0.039
Token inside
Token ID2641
Feature activation+2.34e+00

Pos logit contributions
lee+0.152
ittens+0.124
ries+0.117
atted+0.116
aj+0.113
Neg logit contributions
 will-0.117
 am-0.113
tted-0.108
 belong-0.104
 deserve-0.104
Token and
Token ID290
Feature activation-6.60e+00

Pos logit contributions
 will+0.069
 am+0.068
 make+0.063
tted+0.063
 deserve+0.061
Neg logit contributions
lee-0.067
ittens-0.053
atted-0.050
ries-0.049
aj-0.049
Token saw
Token ID2497
Feature activation-3.51e-01

Pos logit contributions
lee+0.208
atted+0.158
ries+0.158
ittens+0.156
ogging+0.148
Neg logit contributions
 am-0.172
 will-0.171
tted-0.169
 deserve-0.164
 belong-0.161
Token something
Token ID1223
Feature activation+3.51e+00

Pos logit contributions
lee+0.011
ittens+0.009
atted+0.009
ries+0.009
ogging+0.009
Neg logit contributions
 am-0.009
 will-0.009
tted-0.008
 belong-0.008
 deserve-0.008
TokenIt
Token ID1026
Feature activation+5.74e+00

Pos logit contributions
 am+0.055
 will+0.055
tted+0.055
 belong+0.054
 deserve+0.051
Neg logit contributions
lee-0.087
ittens-0.071
ries-0.069
atted-0.067
omo-0.067
Token's
Token ID338
Feature activation+7.41e-01

Pos logit contributions
tted+0.148
 am+0.147
 will+0.142
 deserve+0.139
 belong+0.137
Neg logit contributions
lee-0.183
atted-0.139
ries-0.139
ittens-0.135
aj-0.129
Token upstairs
Token ID26148
Feature activation+8.51e-01

Pos logit contributions
 will+0.020
 am+0.019
 belong+0.018
 deserve+0.018
tted+0.018
Neg logit contributions
lee-0.024
atted-0.019
ittens-0.018
ogging-0.018
ries-0.018
Token,
Token ID11
Feature activation-8.93e+00

Pos logit contributions
 am+0.028
 will+0.028
tted+0.028
 deserve+0.027
 belong+0.027
Neg logit contributions
lee-0.020
ries-0.014
atted-0.014
ittens-0.014
ogging-0.013
Token Lily
Token ID20037
Feature activation-1.41e+00

Pos logit contributions
lee+0.208
atted+0.148
ries+0.144
ogging+0.144
ittens+0.143
Neg logit contributions
 will-0.324
 am-0.303
 deserve-0.300
 make-0.295
'll-0.292
Token.
Token ID13
Feature activation-3.33e-01

Pos logit contributions
lee+0.039
ittens+0.029
ries+0.029
atted+0.029
ogging+0.027
Neg logit contributions
 will-0.042
 am-0.042
tted-0.041
 deserve-0.040
 belong-0.040
Token Just
Token ID2329
Feature activation-8.05e-01

Pos logit contributions
lee+0.011
atted+0.008
ittens+0.008
ries+0.008
ummy+0.008
Neg logit contributions
 am-0.008
tted-0.008
 will-0.008
 belong-0.008
 deserve-0.008
Token go
Token ID467
Feature activation-2.28e+00

Pos logit contributions
lee+0.029
ries+0.023
atted+0.023
ittens+0.023
 oranges+0.022
Neg logit contributions
tted-0.018
 am-0.016
 will-0.016
 deserve-0.015
fire-0.015
Token up
Token ID510
Feature activation+3.50e-02

Pos logit contributions
lee+0.063
ogging+0.047
atted+0.046
ittens+0.046
ries+0.046
Neg logit contributions
tted-0.065
 am-0.064
 deserve-0.064
 belong-0.063
 will-0.062
Token the
Token ID262
Feature activation+3.80e+00

Pos logit contributions
 will+0.001
 am+0.001
tted+0.001
 deserve+0.001
'll+0.001
Neg logit contributions
lee-0.001
atted-0.001
ittens-0.001
ries-0.001
omo-0.001
Token stairs
Token ID16046
Feature activation+1.16e+00

Pos logit contributions
tted+0.101
 deserve+0.099
 belong+0.097
 will+0.097
 am+0.095
Neg logit contributions
lee-0.107
ittens-0.084
ries-0.083
ogging-0.083
aj-0.082
Token thing
Token ID1517
Feature activation+2.63e+00

Pos logit contributions
lee+0.021
ries+0.017
ittens+0.017
atted+0.017
ogging+0.016
Neg logit contributions
 am-0.014
 will-0.014
tted-0.014
 deserve-0.013
 belong-0.013
Token.
Token ID13
Feature activation-4.61e-01

Pos logit contributions
 will+0.072
 am+0.069
tted+0.066
 belong+0.065
 deserve+0.063
Neg logit contributions
lee-0.085
ittens-0.065
atted-0.062
aj-0.062
ries-0.062
Token
Token ID198
Feature activation+3.70e+00

Pos logit contributions
lee+0.016
ries+0.014
atted+0.013
ittens+0.013
ogging+0.012
Neg logit contributions
 am-0.011
 will-0.010
tted-0.010
 deserve-0.009
'll-0.009
Token
Token ID198
Feature activation+3.06e+00

Pos logit contributions
 am+0.084
 will+0.082
tted+0.079
 deserve+0.075
'll+0.072
Neg logit contributions
lee-0.132
ries-0.107
atted-0.105
ittens-0.104
ogging-0.099
TokenS
Token ID50
Feature activation-6.84e+00

Pos logit contributions
 am+0.096
 will+0.096
tted+0.095
 deserve+0.092
 belong+0.090
Neg logit contributions
lee-0.083
ries-0.059
atted-0.059
ittens-0.058
ogging-0.054
Tokenara
Token ID3301
Feature activation-3.75e+00

Pos logit contributions
lee+0.160
ittens+0.108
atted+0.107
ries+0.106
ogging+0.097
Neg logit contributions
 will-0.238
 am-0.237
tted-0.232
 deserve-0.227
 belong-0.224
Token is
Token ID318
Feature activation-1.25e+00

Pos logit contributions
lee+0.121
ittens+0.103
ries+0.095
illy+0.091
ummy+0.091
Neg logit contributions
 will-0.101
 am-0.097
tted-0.089
'll-0.087
 deserve-0.087
Token curious
Token ID11040
Feature activation-5.34e+00

Pos logit contributions
lee+0.034
ittens+0.028
ries+0.026
ogging+0.025
illy+0.025
Neg logit contributions
 am-0.039
 will-0.038
tted-0.037
 deserve-0.036
 belong-0.036
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
lee+0.177
ittens+0.150
ries+0.141
aj+0.138
atted+0.138
Neg logit contributions
 will-0.134
 am-0.125
 make-0.116
 deserve-0.112
 belong-0.112
Token She
Token ID1375
Feature activation+2.86e+00

Pos logit contributions
lee+0.042
ries+0.036
ittens+0.035
aj+0.035
atted+0.035
Neg logit contributions
 am-0.026
 will-0.024
tted-0.021
'll-0.021
 make-0.021
Token takes
Token ID2753
Feature activation+1.06e+00

Pos logit contributions
 will+0.072
 am+0.068
 deserve+0.063
tted+0.062
'll+0.062
Neg logit contributions
lee-0.098
ittens-0.084
ries-0.081
ummy-0.077
atted-0.077
Token it
Token ID340
Feature activation+5.29e+00

Pos logit contributions
lee+0.040
ries+0.029
atted+0.029
ittens+0.028
ogging+0.027
Neg logit contributions
 am-0.045
tted-0.045
 will-0.045
 deserve-0.043
 belong-0.043
Token.
Token ID13
Feature activation+3.79e-01

Pos logit contributions
 am+0.168
 will+0.166
tted+0.165
 deserve+0.162
 belong+0.158
Neg logit contributions
lee-0.135
ries-0.095
atted-0.095
ittens-0.094
ogging-0.089
Token He
Token ID679
Feature activation+1.99e+00

Pos logit contributions
 am+0.008
 will+0.008
tted+0.008
 deserve+0.008
'll+0.008
Neg logit contributions
lee-0.013
ries-0.011
ittens-0.011
atted-0.011
ogging-0.010
Token felt
Token ID2936
Feature activation-2.62e+00

Pos logit contributions
 will+0.057
 am+0.055
tted+0.052
 deserve+0.052
'll+0.051
Neg logit contributions
lee-0.059
ittens-0.047
ries-0.046
atted-0.044
ummy-0.044
Token so
Token ID523
Feature activation-5.70e+00

Pos logit contributions
lee+0.072
atted+0.052
ries+0.050
ittens+0.050
ogging+0.048
Neg logit contributions
 will-0.079
 am-0.079
tted-0.077
 deserve-0.075
 belong-0.074
Token happy
Token ID3772
Feature activation-6.56e+00

Pos logit contributions
lee+0.138
atted+0.094
ittens+0.090
ries+0.087
ogging+0.085
Neg logit contributions
 will-0.191
 am-0.190
tted-0.187
 deserve-0.182
 belong-0.179
Token and
Token ID290
Feature activation-6.82e+00

Pos logit contributions
lee+0.171
ittens+0.131
ries+0.127
atted+0.121
aj+0.120
Neg logit contributions
 will-0.207
 am-0.203
 deserve-0.190
 belong-0.187
fire-0.186
Token content
Token ID2695
Feature activation-3.88e+00

Pos logit contributions
lee+0.168
ries+0.123
ittens+0.123
atted+0.117
ogging+0.110
Neg logit contributions
 will-0.225
 am-0.223
tted-0.218
 deserve-0.215
 belong-0.213
Token!
Token ID0
Feature activation-1.97e-01

Pos logit contributions
lee+0.097
ittens+0.069
ries+0.068
atted+0.067
ogging+0.063
Neg logit contributions
 am-0.127
 will-0.127
tted-0.124
 deserve-0.121
 belong-0.120
Token
Token ID220
Feature activation+3.28e-01

Pos logit contributions
lee+0.007
ries+0.006
ittens+0.005
atted+0.005
ogging+0.005
Neg logit contributions
 am-0.005
 will-0.004
tted-0.004
 deserve-0.004
'll-0.004
Token
Token ID198
Feature activation+3.66e+00

Pos logit contributions
 will+0.009
 am+0.009
 belong+0.009
tted+0.009
 deserve+0.009
Neg logit contributions
lee-0.010
ittens-0.007
atted-0.007
ries-0.007
aj-0.007
Token hugged
Token ID41130
Feature activation-4.45e+00

Pos logit contributions
lee+0.200
ittens+0.184
ummy+0.167
ries+0.162
aj+0.158
Neg logit contributions
 will-0.240
 am-0.222
 make-0.220
 deserve-0.215
tted-0.211
Token them
Token ID606
Feature activation+4.27e+00

Pos logit contributions
lee+0.138
ittens+0.107
atted+0.105
aj+0.104
ries+0.104
Neg logit contributions
 will-0.120
 am-0.118
tted-0.111
 deserve-0.107
 belong-0.105
Token.
Token ID13
Feature activation-3.91e-01

Pos logit contributions
 will+0.109
tted+0.101
 am+0.100
 deserve+0.096
 make+0.095
Neg logit contributions
lee-0.142
ittens-0.113
ries-0.110
atted-0.108
aj-0.105
Token She
Token ID1375
Feature activation+3.47e+00

Pos logit contributions
lee+0.013
ries+0.012
ittens+0.012
atted+0.011
ogging+0.011
Neg logit contributions
 am-0.009
 will-0.009
tted-0.007
'll-0.007
 deserve-0.007
Token said
Token ID531
Feature activation-3.32e+00

Pos logit contributions
 will+0.093
 am+0.086
tted+0.081
 deserve+0.080
'll+0.079
Neg logit contributions
lee-0.111
ittens-0.098
ries-0.095
ogging-0.095
ummy-0.093
Token,
Token ID11
Feature activation-8.94e+00

Pos logit contributions
lee+0.091
aj+0.082
ittens+0.082
omo+0.082
ries+0.074
Neg logit contributions
 will-0.092
 am-0.086
 are-0.079
 make-0.077
'll-0.071
Token "
Token ID366
Feature activation+3.72e+00

Pos logit contributions
lee+0.281
ries+0.234
ogging+0.222
atted+0.217
ya+0.213
Neg logit contributions
 will-0.232
 am-0.225
 make-0.206
tted-0.201
'll-0.199
TokenI
Token ID40
Feature activation+5.45e+00

Pos logit contributions
 belong+0.078
tted+0.078
 will+0.077
 am+0.076
 deserve+0.071
Neg logit contributions
lee-0.131
ittens-0.106
ries-0.103
atted-0.102
ogging-0.100
Token'm
Token ID1101
Feature activation-2.22e+00

Pos logit contributions
tted+0.087
fire+0.079
 belong+0.076
hips+0.073
 deserve+0.073
Neg logit contributions
lee-0.219
ittens-0.177
atted-0.177
omo-0.175
 gigg-0.174
Token glad
Token ID9675
Feature activation-4.92e+00

Pos logit contributions
lee+0.057
ries+0.042
atted+0.041
ittens+0.041
aj+0.039
Neg logit contributions
 am-0.073
 will-0.072
 deserve-0.069
tted-0.068
 belong-0.068
Token you
Token ID345
Feature activation+3.73e+00

Pos logit contributions
lee+0.192
aj+0.161
ittens+0.159
atted+0.157
omo+0.153
Neg logit contributions
 will-0.095
 am-0.091
 make-0.080
 protect-0.077
 deserve-0.074
Token tree
Token ID5509
Feature activation+2.44e+00

Pos logit contributions
 will+0.120
 am+0.118
tted+0.115
 deserve+0.110
 belong+0.109
Neg logit contributions
lee-0.178
atted-0.145
ittens-0.143
ries-0.140
ogging-0.135
Token.
Token ID13
Feature activation+7.77e-02

Pos logit contributions
 will+0.078
 am+0.078
tted+0.076
 deserve+0.074
 belong+0.073
Neg logit contributions
lee-0.063
ittens-0.045
ries-0.044
atted-0.044
ogging-0.041
Token He
Token ID679
Feature activation+2.06e+00

Pos logit contributions
tted+0.002
 am+0.002
 will+0.002
'll+0.002
 deserve+0.002
Neg logit contributions
lee-0.002
ries-0.002
atted-0.002
ittens-0.002
aj-0.002
Token was
Token ID373
Feature activation-1.83e+00

Pos logit contributions
 will+0.044
 am+0.040
'll+0.040
 make+0.039
'd+0.035
Neg logit contributions
lee-0.071
ries-0.068
ogging-0.067
ittens-0.067
ummy-0.066
Token so
Token ID523
Feature activation-5.50e+00

Pos logit contributions
lee+0.057
ittens+0.049
ries+0.049
aj+0.047
ogging+0.047
Neg logit contributions
 will-0.045
 am-0.044
tted-0.044
 make-0.043
 belong-0.042
Token excited
Token ID6568
Feature activation-8.58e+00

Pos logit contributions
lee+0.155
ries+0.135
ittens+0.124
aj+0.118
atted+0.115
Neg logit contributions
 am-0.169
tted-0.163
 will-0.158
 belong-0.153
 deserve-0.152
Token that
Token ID326
Feature activation+1.71e+00

Pos logit contributions
lee+0.272
ittens+0.223
aj+0.219
atted+0.214
omo+0.213
Neg logit contributions
 will-0.214
 am-0.202
 make-0.191
 belong-0.187
 deserve-0.187
Token he
Token ID339
Feature activation+9.49e-01

Pos logit contributions
 am+0.052
 will+0.052
 belong+0.050
 make+0.049
 deserve+0.047
Neg logit contributions
lee-0.049
ries-0.039
atted-0.038
ittens-0.038
aj-0.037
Token had
Token ID550
Feature activation+4.90e+00

Pos logit contributions
 will+0.025
'll+0.023
 deserve+0.022
tted+0.022
 make+0.022
Neg logit contributions
lee-0.029
ries-0.029
ummy-0.027
ogging-0.026
ittens-0.026
Token to
Token ID284
Feature activation+5.61e+00

Pos logit contributions
 am+0.097
tted+0.095
 will+0.093
 deserve+0.087
 belong+0.086
Neg logit contributions
lee-0.189
atted-0.148
ries-0.146
ittens-0.143
 oranges-0.141
Token show
Token ID905
Feature activation+8.67e-01

Pos logit contributions
 will+0.102
tted+0.100
 am+0.100
 deserve+0.091
'll+0.090
Neg logit contributions
lee-0.217
ittens-0.174
ries-0.170
 oranges-0.168
atted-0.167
Token.
Token ID13
Feature activation+7.50e-02

Pos logit contributions
 will+0.028
 am+0.028
tted+0.026
 belong+0.026
'll+0.026
Neg logit contributions
lee-0.033
ittens-0.025
atted-0.025
ries-0.025
ummy-0.023
Token
Token ID198
Feature activation+3.43e+00

Pos logit contributions
 am+0.002
 will+0.002
tted+0.002
 deserve+0.002
'll+0.001
Neg logit contributions
lee-0.003
ries-0.002
atted-0.002
ittens-0.002
ogging-0.002
Token
Token ID198
Feature activation+2.97e+00

Pos logit contributions
 will+0.092
 am+0.091
tted+0.089
 belong+0.089
 deserve+0.088
Neg logit contributions
lee-0.105
ittens-0.080
atted-0.078
ries-0.077
ogging-0.075
TokenJust
Token ID5703
Feature activation-2.28e+00

Pos logit contributions
 am+0.088
 will+0.087
 belong+0.086
 affect+0.082
tted+0.082
Neg logit contributions
lee-0.080
ittens-0.059
atted-0.055
ogging-0.054
ries-0.054
Token then
Token ID788
Feature activation-2.43e+00

Pos logit contributions
lee+0.045
ittens+0.028
atted+0.028
ries+0.028
ogging+0.024
Neg logit contributions
 am-0.087
 will-0.086
tted-0.086
 deserve-0.084
 belong-0.082
Token,
Token ID11
Feature activation-9.31e+00

Pos logit contributions
lee+0.067
ittens+0.050
ries+0.049
aj+0.049
omo+0.047
Neg logit contributions
 am-0.080
 will-0.077
 protect-0.072
 belong-0.071
 conquer-0.070
Token a
Token ID257
Feature activation-4.15e-02

Pos logit contributions
lee+0.309
ittens+0.239
ries+0.238
atted+0.238
ogging+0.226
Neg logit contributions
 will-0.230
 am-0.230
tted-0.223
 deserve-0.216
 belong-0.213
Token little
Token ID1310
Feature activation-6.23e+00

Pos logit contributions
lee+0.001
ittens+0.001
atted+0.001
ries+0.001
ogging+0.001
Neg logit contributions
 am-0.001
 will-0.001
tted-0.001
 deserve-0.001
 belong-0.001
Token boy
Token ID2933
Feature activation-1.97e+00

Pos logit contributions
lee+0.192
ittens+0.146
ries+0.145
atted+0.144
aj+0.136
Neg logit contributions
 will-0.170
 am-0.169
tted-0.164
 deserve-0.160
 belong-0.158
Token named
Token ID3706
Feature activation-6.23e+00

Pos logit contributions
lee+0.061
ittens+0.051
aj+0.051
ummy+0.049
omo+0.048
Neg logit contributions
 am-0.053
 will-0.052
tted-0.047
 make-0.047
'll-0.046
Token George
Token ID4502
Feature activation-2.44e+00

Pos logit contributions
lee+0.117
ittens+0.074
atted+0.072
ries+0.070
aj+0.063
Neg logit contributions
 am-0.242
 will-0.242
tted-0.236
 deserve-0.233
 belong-0.230
Token to
Token ID284
Feature activation+4.47e+00

Pos logit contributions
lee+0.037
ittens+0.028
ries+0.028
atted+0.026
aj+0.025
Neg logit contributions
 will-0.047
 am-0.046
tted-0.045
 deserve-0.044
 belong-0.043
Token find
Token ID1064
Feature activation+2.61e+00

Pos logit contributions
tted+0.138
 will+0.124
 am+0.124
 deserve+0.120
'll+0.119
Neg logit contributions
lee-0.122
 gigg-0.085
 Nem-0.085
 Sophie-0.085
 Sally-0.085
Token a
Token ID257
Feature activation+2.56e+00

Pos logit contributions
tted+0.084
 am+0.084
 will+0.082
 deserve+0.081
 affect+0.078
Neg logit contributions
lee-0.063
atted-0.045
ries-0.045
ittens-0.042
 oranges-0.041
Token new
Token ID649
Feature activation+1.16e+00

Pos logit contributions
tted+0.076
 am+0.069
 deserve+0.068
hips+0.068
 belong+0.066
Neg logit contributions
lee-0.066
 strawberries-0.049
 shrimp-0.049
 oranges-0.048
ittens-0.048
Token home
Token ID1363
Feature activation-8.09e-01

Pos logit contributions
tted+0.030
 am+0.029
 will+0.029
 deserve+0.029
 belong+0.028
Neg logit contributions
lee-0.036
ittens-0.028
atted-0.028
ries-0.028
ummy-0.027
Token,
Token ID11
Feature activation-9.27e+00

Pos logit contributions
lee+0.018
atted+0.012
ries+0.012
ittens+0.012
ogging+0.011
Neg logit contributions
 am-0.028
 will-0.028
tted-0.028
 deserve-0.027
 belong-0.027
Token but
Token ID475
Feature activation-2.35e+00

Pos logit contributions
lee+0.236
ries+0.169
ittens+0.168
atted+0.166
ogging+0.154
Neg logit contributions
 will-0.301
 am-0.298
tted-0.292
 deserve-0.287
 belong-0.282
Token he
Token ID339
Feature activation+9.63e-01

Pos logit contributions
lee+0.056
atted+0.039
ries+0.038
ittens+0.037
ummy+0.035
Neg logit contributions
tted-0.078
 am-0.077
 will-0.077
 deserve-0.075
 belong-0.073
Token never
Token ID1239
Feature activation-6.83e-01

Pos logit contributions
 am+0.023
 will+0.023
tted+0.023
 deserve+0.022
 belong+0.022
Neg logit contributions
lee-0.032
atted-0.025
ittens-0.025
ries-0.025
ogging-0.023
Token found
Token ID1043
Feature activation+1.92e+00

Pos logit contributions
lee+0.020
atted+0.014
ogging+0.013
 shrugged+0.013
 gigg+0.013
Neg logit contributions
tted-0.020
 am-0.020
 will-0.019
hips-0.019
fire-0.019
Token one
Token ID530
Feature activation+4.34e+00

Pos logit contributions
tted+0.062
 am+0.061
 will+0.058
 deserve+0.058
 affect+0.056
Neg logit contributions
lee-0.048
ries-0.036
atted-0.035
 oranges-0.032
ittens-0.032
Token.
Token ID13
Feature activation+3.68e-02

Pos logit contributions
 am+0.021
 will+0.021
tted+0.021
 deserve+0.021
 belong+0.020
Neg logit contributions
lee-0.018
atted-0.012
ries-0.012
ittens-0.012
ogging-0.011
Token From
Token ID3574
Feature activation-4.00e+00

Pos logit contributions
 am+0.001
tted+0.001
 will+0.001
 deserve+0.001
'll+0.001
Neg logit contributions
lee-0.001
ries-0.001
ittens-0.001
atted-0.001
aj-0.001
Token that
Token ID326
Feature activation+2.19e+00

Pos logit contributions
lee+0.116
ittens+0.097
atted+0.087
ries+0.087
ogging+0.086
Neg logit contributions
 will-0.113
 am-0.105
fire-0.101
 surely-0.100
 belong-0.098
Token day
Token ID1110
Feature activation-1.83e+00

Pos logit contributions
tted+0.051
 am+0.049
 will+0.049
 deserve+0.048
 belong+0.046
Neg logit contributions
lee-0.074
ittens-0.058
atted-0.058
ries-0.057
ogging-0.057
Token on
Token ID319
Feature activation+4.12e+00

Pos logit contributions
lee+0.059
ries+0.045
ittens+0.045
atted+0.045
aj+0.043
Neg logit contributions
 will-0.047
 am-0.047
tted-0.045
 deserve-0.044
 belong-0.043
Token,
Token ID11
Feature activation-8.78e+00

Pos logit contributions
 am+0.150
 will+0.149
tted+0.140
 belong+0.138
 deserve+0.135
Neg logit contributions
lee-0.092
ittens-0.061
atted-0.061
ries-0.059
aj-0.057
Token he
Token ID339
Feature activation+1.30e-01

Pos logit contributions
lee+0.184
ittens+0.120
atted+0.116
ries+0.116
aj+0.107
Neg logit contributions
 will-0.327
 am-0.326
tted-0.318
 deserve-0.314
 belong-0.311
Token was
Token ID373
Feature activation-2.08e+00

Pos logit contributions
tted+0.003
 am+0.003
 will+0.003
 deserve+0.003
 belong+0.003
Neg logit contributions
lee-0.004
atted-0.003
ries-0.003
ittens-0.003
ogging-0.003
Token extra
Token ID3131
Feature activation-4.86e+00

Pos logit contributions
lee+0.070
atted+0.055
ries+0.053
ittens+0.052
ogging+0.050
Neg logit contributions
 am-0.050
tted-0.049
 will-0.049
 deserve-0.047
 belong-0.046
Token careful
Token ID8161
Feature activation-6.13e+00

Pos logit contributions
lee+0.130
ries+0.095
ittens+0.094
atted+0.093
ogging+0.087
Neg logit contributions
 will-0.151
 am-0.151
tted-0.148
 deserve-0.144
 belong-0.142
Token when
Token ID618
Feature activation-3.54e-01

Pos logit contributions
lee+0.156
atted+0.112
ittens+0.110
ries+0.108
ogging+0.103
Neg logit contributions
 am-0.197
tted-0.194
 will-0.194
 deserve-0.189
 belong-0.185