TOP ACTIVATIONS
MAX = 6.009

honest , even in difficult situations . <|endoftext|> Once upon a
see animals in their natural habitat . She ran around the
decorated it with shells and seaw eed . They had a
saw five fish and seven seaw eed . They were happy
like fish and shells and seaw eed . One
buckets and shovel s and seaw eed . Then we can
like fish and shells and seaw eed . But they did
ate it with shells and seaw eed . They find a
W OO F ! W OO F ! W OO F
him about the frog 's habitat , diet and how it
their hair and on their stomach s . They are happy
see shells , rocks , seaw eed , crabs and fish
fruits and vegetables and his stomach never hurt again . <|endoftext|>
W OO F ! W OO F !" Ben
Like shells and rocks and seaw eed ." M
also help you in tough situations . From that day on
butterfly back in its natural habitat . He waved goodbye to
added shells and stones and seaw eed to make it look
was , and used his net to catch it . He
" Yes , your majesty . What do you order

MIN ACTIVATIONS
MIN = -5.568

t help it - he reached out and
something else too - it was a sour
noticed something strange - a big , bright
something was wrong - all the water had
heard a noise - it sounded like someone
was too late - P ete slipped and
back to watching the fire work show in peace . What
was too late - the ice melted and
was too late - his foot had caught
was too late - they had already drifted
was too late - the o at had
They watched the fire work spark le above them in
Can I put the cup cakes in the oven ?" June
a normal penny - it was special .
. They watched the fire work until it was all finished
Her name was É mil ie .
Lily 's mom put lot ion on her arms because they
so they did - sw imming , playing
fun watching the pretty fire work 's . It was the
. They put the cup cakes on a plate and brought

INTERVAL 3.115 - 6.009
CONTAINS 0.168%

. " Oh no !" Sam and Max say
and cou g hed a few times . He felt a
it sailing away in the distance . They could hear the
started to rock back and forth , and Nut ty became
cat looked sad and had no friends to play with .

INTERVAL 0.221 - 3.115
CONTAINS 38.294%

happy . He says , " Wow , thank you ,
to rock back and forth on his four legs while relaxing
way home . From that day on , the bunny and
. They start to climb the hill . It
curtain got even bigger . Tom and Max felt

INTERVAL -2.674 - 0.221
CONTAINS 61.353%

The flower was very happy because it was not lonely
They are happy . <|endoftext|> Tom and Mia are friends .
the bear . The bear was sad too . The red
whistle . Everywhere she went she made people smile and laugh
't believe how much everyone was enjoying his things ! He

INTERVAL -5.568 - -2.674
CONTAINS 0.184%

Mom , can I get this car ?" His mom looked
Alice poured the popcorn into a big bowl .
Ben . They come out of the tube . They are
Lily . They got the boxes and tape and paint from
bending down to snap the box open . As soon as
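
NOTE ON THE INTERVALS

The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX, and each CONTAINS figure looks like the share of sampled tokens whose activation falls in that bin. Neither is stated in the dump itself, so both are assumptions. The sketch below recomputes the bin edges from the reported MIN and MAX under that assumption; the function name `equal_width_edges` is hypothetical and is not part of any tool that produced this output.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins covering [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as reported in the dump.
edges = equal_width_edges(-5.568, 6.009, 4)

# Interval boundaries as reported in the dump (rounded to 3 decimals there).
reported = [-5.568, -2.674, 0.221, 3.115, 6.009]

for computed, shown in zip(edges, reported):
    print(f"computed {computed:.4f}   reported {shown}")

# CONTAINS percentages, lowest interval first; assumed to partition the sample.
shares = [0.184, 61.353, 38.294, 0.168]
print("total share:", sum(shares))
```

If the assumption holds, the computed edges agree with the reported ones to within rounding, and the shares add up to roughly 100%, with the two outer bins holding well under 1% of tokens between them.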
Token honest
Token ID5508
Feature activation-3.46e-01

Pos logit contributions
ulp+0.102
aw+0.088
 tune+0.087
 endings+0.086
OO+0.085
Neg logit contributions
-0.070
 first-0.066
my-0.062
 Hatt-0.060
 Bear-0.059
Token,
Token ID11
Feature activation+5.84e-01

Pos logit contributions
+0.011
 first+0.010
my+0.010
 Hatt+0.009
 Bear+0.009
Neg logit contributions
ulp-0.017
aw-0.015
 tune-0.014
 endings-0.014
OO-0.014
Token even
Token ID772
Feature activation+1.44e+00

Pos logit contributions
ulp+0.028
aw+0.025
 tune+0.024
 endings+0.024
OO+0.023
Neg logit contributions
-0.018
 first-0.017
my-0.016
 Hatt-0.016
 Bear-0.015
Token in
Token ID287
Feature activation+8.67e-01

Pos logit contributions
ulp+0.077
aw+0.066
 endings+0.064
 tune+0.064
OO+0.064
Neg logit contributions
-0.041
 first-0.041
my-0.036
 Hatt-0.034
 Bear-0.033
Token difficult
Token ID2408
Feature activation+1.63e+00

Pos logit contributions
ulp+0.040
aw+0.036
 tune+0.035
OO+0.033
 endings+0.033
Neg logit contributions
-0.029
 first-0.028
my-0.025
 Hatt-0.025
 Bear-0.024
Token situations
Token ID7445
Feature activation+6.01e+00

Pos logit contributions
ulp+0.059
aw+0.045
ater+0.042
OO+0.042
 tune+0.041
Neg logit contributions
-0.075
 first-0.074
my-0.068
 Bear-0.064
 Hatt-0.064
Token.
Token ID13
Feature activation-1.65e-01

Pos logit contributions
ulp+0.248
aw+0.228
 tune+0.221
 endings+0.217
 tunes+0.213
Neg logit contributions
-0.205
 first-0.197
 Hatt-0.190
my-0.185
 Bear-0.179
Token<|endoftext|>
Token ID50256
Feature activation+1.08e+00

Pos logit contributions
 first+0.005
+0.004
 Hatt+0.004
ishly+0.004
my+0.004
Neg logit contributions
ulp-0.008
aw-0.007
 endings-0.007
 tune-0.007
OO-0.007
TokenOnce
Token ID7454
Feature activation+3.47e-02

Pos logit contributions
ulp+0.051
aw+0.048
 endings+0.044
 tune+0.043
OO+0.043
Neg logit contributions
-0.028
 Hatt-0.028
 first-0.028
ishly-0.027
my-0.024
Token upon
Token ID2402
Feature activation+3.33e-01

Pos logit contributions
ulp+0.002
aw+0.001
OO+0.001
 than+0.001
 pounds+0.001
Neg logit contributions
-0.001
ishly-0.001
 Hatt-0.001
 Elephant-0.001
 attendant-0.001
Token a
Token ID257
Feature activation-6.56e-01

Pos logit contributions
ulp+0.020
izing+0.019
aw+0.018
 tune+0.018
 heights+0.018
Neg logit contributions
-0.007
 first-0.006
my-0.005
 Bear-0.004
 -0.004
Token see
Token ID766
Feature activation+1.62e+00

Pos logit contributions
ulp+0.008
aw+0.007
 tune+0.007
 tunes+0.007
 endings+0.007
Neg logit contributions
-0.007
 Hatt-0.007
 first-0.007
my-0.006
 Bear-0.006
Token animals
Token ID4695
Feature activation-5.62e-01

Pos logit contributions
ulp+0.089
 tune+0.079
aw+0.078
 endings+0.077
OO+0.076
Neg logit contributions
-0.040
 first-0.037
 Hatt-0.034
my-0.034
 Bear-0.032
Token in
Token ID287
Feature activation+3.96e-01

Pos logit contributions
+0.020
 Hatt+0.020
 first+0.019
my+0.018
 Bear+0.018
Neg logit contributions
ulp-0.022
aw-0.020
 tune-0.019
 than-0.019
 tunes-0.019
Token their
Token ID511
Feature activation+1.25e+00

Pos logit contributions
ulp+0.020
aw+0.018
 tune+0.018
 endings+0.017
OO+0.017
Neg logit contributions
-0.011
 first-0.011
 Hatt-0.010
my-0.010
 Bear-0.009
Token natural
Token ID3288
Feature activation+1.83e+00

Pos logit contributions
ulp+0.044
aw+0.040
 tune+0.038
 endings+0.036
 tunes+0.036
Neg logit contributions
-0.052
 Hatt-0.048
 first-0.048
 Bear-0.045
my-0.045
Token habitat
Token ID20018
Feature activation+5.94e+00

Pos logit contributions
ulp+0.045
 tune+0.033
aw+0.032
 pounds+0.030
izing+0.030
Neg logit contributions
 first-0.099
-0.098
my-0.094
 Bear-0.090
 Hatt-0.090
Token.
Token ID13
Feature activation+1.07e-01

Pos logit contributions
ulp+0.309
 tune+0.285
 tunes+0.271
OO+0.266
 som+0.265
Neg logit contributions
 first-0.164
-0.155
my-0.147
 Bear-0.118
 Hatt-0.116
Token She
Token ID1375
Feature activation-2.61e-01

Pos logit contributions
ulp+0.006
aw+0.005
 tune+0.005
 endings+0.005
OO+0.005
Neg logit contributions
-0.002
 first-0.002
 Hatt-0.002
my-0.002
 Bear-0.002
Token ran
Token ID4966
Feature activation+7.57e-01

Pos logit contributions
+0.009
 Hatt+0.009
 first+0.009
ishly+0.009
my+0.009
Neg logit contributions
ulp-0.010
aw-0.009
 tune-0.009
 som-0.008
 tunes-0.008
Token around
Token ID1088
Feature activation-4.45e-01

Pos logit contributions
 than+0.027
 tune+0.025
aw+0.025
ulp+0.025
 som+0.024
Neg logit contributions
-0.028
 Hatt-0.027
 Bear-0.025
my-0.025
ishly-0.025
Token the
Token ID262
Feature activation-1.62e-01

Pos logit contributions
+0.016
 Hatt+0.014
my+0.014
 first+0.014
ishly+0.013
Neg logit contributions
ulp-0.018
 tune-0.017
aw-0.016
 tunes-0.016
 heights-0.016
Token decorated
Token ID24789
Feature activation-8.82e-01

Pos logit contributions
ulp+0.033
aw+0.029
 tune+0.029
 tunes+0.028
 endings+0.027
Neg logit contributions
-0.028
 Hatt-0.026
 first-0.026
my-0.025
 Bear-0.024
Token it
Token ID340
Feature activation+5.42e-01

Pos logit contributions
 first+0.029
+0.028
 Bear+0.025
my+0.024
 Hatt+0.024
Neg logit contributions
ulp-0.040
 tune-0.036
OO-0.036
 endings-0.036
aw-0.034
Token with
Token ID351
Feature activation-5.78e-01

Pos logit contributions
ulp+0.023
aw+0.020
 tune+0.019
 than+0.019
 tunes+0.019
Neg logit contributions
-0.019
 Hatt-0.019
my-0.018
 first-0.017
 Bear-0.017
Token shells
Token ID19679
Feature activation+2.07e+00

Pos logit contributions
+0.022
 first+0.021
 Hatt+0.020
my+0.020
 Bear+0.019
Neg logit contributions
ulp-0.023
aw-0.021
 tune-0.020
 endings-0.020
 tunes-0.019
Token and
Token ID290
Feature activation+1.14e+00

Pos logit contributions
ulp+0.085
aw+0.076
 tune+0.072
ater+0.070
 endings+0.069
Neg logit contributions
-0.075
 Hatt-0.071
 first-0.071
my-0.068
 Bear-0.065
Token seaw
Token ID32085
Feature activation+5.72e+00

Pos logit contributions
ulp+0.039
aw+0.034
 tune+0.033
 endings+0.032
 tunes+0.031
Neg logit contributions
-0.049
 first-0.048
 Hatt-0.046
my-0.045
 Bear-0.044
Tokeneed
Token ID2308
Feature activation+3.04e+00

Pos logit contributions
ulp+0.120
 tune+0.105
 tunes+0.093
 endings+0.091
 conclusions+0.090
Neg logit contributions
-0.301
 first-0.296
 Hatt-0.281
my-0.280
 Bear-0.278
Token.
Token ID13
Feature activation+8.28e-01

Pos logit contributions
ulp+0.128
aw+0.114
 tune+0.110
 endings+0.107
 tunes+0.105
Neg logit contributions
-0.108
 first-0.102
 Hatt-0.100
my-0.097
 Bear-0.093
Token They
Token ID1119
Feature activation+3.61e-01

Pos logit contributions
ulp+0.043
aw+0.039
 tune+0.037
 endings+0.037
 tunes+0.036
Neg logit contributions
-0.022
 first-0.021
my-0.019
 Hatt-0.019
 Bear-0.018
Token had
Token ID550
Feature activation+1.47e+00

Pos logit contributions
ulp+0.013
aw+0.013
 som+0.012
ater+0.012
cest+0.012
Neg logit contributions
 Hatt-0.012
-0.012
ishly-0.012
 Ellie-0.012
 Elephant-0.011
Token a
Token ID257
Feature activation+4.31e-01

Pos logit contributions
ulp+0.076
 tune+0.068
izing+0.067
OO+0.066
ater+0.065
Neg logit contributions
-0.043
 first-0.042
my-0.035
 both-0.034
 Bear-0.033
Token saw
Token ID2497
Feature activation+1.36e+00

Pos logit contributions
ulp+0.018
aw+0.017
 tune+0.016
 endings+0.016
 tunes+0.015
Neg logit contributions
-0.012
 first-0.011
 Hatt-0.011
my-0.010
 Bear-0.010
Token five
Token ID1936
Feature activation+6.27e-01

Pos logit contributions
ulp+0.070
 tune+0.065
OO+0.062
ater+0.062
izing+0.060
Neg logit contributions
 first-0.037
-0.037
my-0.030
 Bear-0.029
 both-0.029
Token fish
Token ID5916
Feature activation+3.80e-01

Pos logit contributions
ulp+0.027
 tune+0.024
OO+0.023
 endings+0.022
 tunes+0.022
Neg logit contributions
 first-0.024
-0.023
my-0.019
 both-0.019
 Hatt-0.018
Token and
Token ID290
Feature activation+9.61e-01

Pos logit contributions
ulp+0.017
aw+0.015
 tune+0.015
 endings+0.015
 tunes+0.014
Neg logit contributions
-0.012
 first-0.012
 Hatt-0.011
my-0.011
 Bear-0.010
Token seven
Token ID3598
Feature activation+8.83e-01

Pos logit contributions
ulp+0.042
aw+0.037
 tune+0.036
 endings+0.035
 tunes+0.035
Neg logit contributions
-0.033
 first-0.032
 Hatt-0.029
my-0.029
 Bear-0.028
Token seaw
Token ID32085
Feature activation+5.57e+00

Pos logit contributions
ulp+0.039
 tune+0.037
OO+0.034
 tunes+0.033
ilt+0.033
Neg logit contributions
 first-0.032
-0.030
 both-0.025
my-0.025
 two-0.023
Tokeneed
Token ID2308
Feature activation+2.66e+00

Pos logit contributions
ulp+0.127
 tune+0.110
 endings+0.104
 tunes+0.103
 situations+0.099
Neg logit contributions
 first-0.277
-0.276
 Bear-0.261
 Hatt-0.260
ll-0.259
Token.
Token ID13
Feature activation+6.42e-01

Pos logit contributions
ulp+0.124
 tune+0.109
aw+0.109
 endings+0.108
 tunes+0.104
Neg logit contributions
-0.083
 first-0.082
my-0.073
 Hatt-0.072
 Bear-0.072
Token They
Token ID1119
Feature activation+1.50e-01

Pos logit contributions
ulp+0.031
aw+0.029
 tune+0.027
 endings+0.027
 tunes+0.026
Neg logit contributions
-0.018
 first-0.018
my-0.017
 Hatt-0.016
 Bear-0.015
Token were
Token ID547
Feature activation-2.16e-02

Pos logit contributions
ulp+0.007
aw+0.006
 tune+0.006
 som+0.006
 tunes+0.006
Neg logit contributions
-0.005
 Hatt-0.005
 first-0.004
my-0.004
 Bear-0.004
Token happy
Token ID3772
Feature activation-4.26e-01

Pos logit contributions
+0.001
 Hatt+0.001
my+0.001
ishly+0.001
 first+0.001
Neg logit contributions
ulp-0.001
aw-0.001
 than-0.001
 tune-0.001
 som-0.001
Token like
Token ID588
Feature activation+8.18e-01

Pos logit contributions
ulp+0.036
OO+0.032
 endings+0.032
aw+0.032
 tune+0.031
Neg logit contributions
 first-0.026
-0.025
my-0.022
 Hatt-0.021
 Bear-0.020
Token fish
Token ID5916
Feature activation+1.67e+00

Pos logit contributions
ulp+0.033
 tune+0.029
OO+0.028
aw+0.027
ater+0.027
Neg logit contributions
-0.032
 first-0.032
 Bear-0.027
my-0.027
 Hatt-0.027
Token and
Token ID290
Feature activation+1.24e+00

Pos logit contributions
ulp+0.091
 tune+0.083
aw+0.083
 endings+0.081
 tunes+0.080
Neg logit contributions
-0.040
 first-0.038
my-0.033
 Hatt-0.032
 Bear-0.032
Token shells
Token ID19679
Feature activation+2.25e+00

Pos logit contributions
ulp+0.043
aw+0.037
 tune+0.036
 endings+0.034
OO+0.034
Neg logit contributions
-0.054
 first-0.052
my-0.049
 Hatt-0.049
 Bear-0.048
Token and
Token ID290
Feature activation+1.11e+00

Pos logit contributions
ulp+0.092
aw+0.083
 tune+0.077
 endings+0.075
 tunes+0.075
Neg logit contributions
-0.081
 Hatt-0.077
 first-0.077
my-0.073
 Bear-0.071
Token seaw
Token ID32085
Feature activation+5.56e+00

Pos logit contributions
ulp+0.039
aw+0.033
 tune+0.032
 endings+0.031
OO+0.031
Neg logit contributions
-0.049
 first-0.047
my-0.044
 Hatt-0.044
 Bear-0.043
Tokeneed
Token ID2308
Feature activation+3.02e+00

Pos logit contributions
ulp+0.119
 tune+0.108
 tunes+0.096
 conclusions+0.095
 endings+0.095
Neg logit contributions
 first-0.284
-0.282
ll-0.267
my-0.266
 Hatt-0.264
Token.
Token ID13
Feature activation+7.65e-01

Pos logit contributions
ulp+0.127
aw+0.113
 tune+0.109
 endings+0.106
 tunes+0.105
Neg logit contributions
-0.108
 first-0.102
 Hatt-0.099
my-0.096
 Bear-0.093
Token
Token ID198
Feature activation+6.83e-01

Pos logit contributions
ulp+0.035
aw+0.032
 endings+0.030
 conclusions+0.030
 heights+0.029
Neg logit contributions
-0.021
 first-0.021
 Hatt-0.020
ishly-0.019
my-0.019
Token
Token ID198
Feature activation+3.33e-01

Pos logit contributions
ulp+0.033
aw+0.030
 tune+0.028
OO+0.027
 conclusions+0.027
Neg logit contributions
ishly-0.018
-0.018
 Hatt-0.018
 first-0.017
 Elephant-0.016
TokenOne
Token ID3198
Feature activation-6.68e-01

Pos logit contributions
ulp+0.013
aw+0.012
 tune+0.012
ater+0.011
OO+0.011
Neg logit contributions
 Hatt-0.011
 first-0.011
-0.011
 Bear-0.011
ishly-0.011
Token buckets
Token ID38674
Feature activation+8.97e-01

Pos logit contributions
+0.036
 first+0.035
my+0.032
 Hatt+0.031
 Bear+0.031
Neg logit contributions
ulp-0.045
aw-0.040
 tune-0.039
 endings-0.038
OO-0.038
Token and
Token ID290
Feature activation+2.30e-01

Pos logit contributions
ulp+0.038
aw+0.035
 tune+0.032
 than+0.032
ater+0.032
Neg logit contributions
-0.031
 Hatt-0.030
 first-0.029
 Bear-0.027
my-0.027
Token shovel
Token ID33677
Feature activation+2.66e+00

Pos logit contributions
ulp+0.009
aw+0.007
 tune+0.007
 endings+0.007
 tunes+0.007
Neg logit contributions
-0.009
 first-0.009
 Hatt-0.009
my-0.008
 Bear-0.008
Tokens
Token ID82
Feature activation-1.91e+00

Pos logit contributions
ulp+0.115
aw+0.112
 than+0.105
ater+0.103
 pounds+0.103
Neg logit contributions
-0.084
 Hatt-0.083
 first-0.073
 Elephant-0.072
 Bear-0.071
Token and
Token ID290
Feature activation+4.49e-01

Pos logit contributions
+0.057
 first+0.055
my+0.049
 Hatt+0.047
 Bear+0.047
Neg logit contributions
ulp-0.092
aw-0.083
 tune-0.082
 endings-0.080
 tunes-0.079
Token seaw
Token ID32085
Feature activation+5.55e+00

Pos logit contributions
ulp+0.018
aw+0.016
 tune+0.016
 endings+0.015
 tunes+0.015
Neg logit contributions
-0.016
 first-0.016
 Hatt-0.015
my-0.015
 Bear-0.014
Tokeneed
Token ID2308
Feature activation+2.82e+00

Pos logit contributions
ulp+0.123
 tune+0.114
 tunes+0.104
 endings+0.103
 conclusions+0.103
Neg logit contributions
 first-0.275
-0.275
ll-0.258
my-0.255
 Bear-0.253
Token.
Token ID13
Feature activation+4.06e-01

Pos logit contributions
ulp+0.118
aw+0.106
 tune+0.101
 endings+0.099
ater+0.099
Neg logit contributions
-0.099
 Hatt-0.093
 first-0.092
my-0.089
 Bear-0.085
Token Then
Token ID3244
Feature activation-1.30e+00

Pos logit contributions
ulp+0.015
aw+0.015
 endings+0.014
 tune+0.014
 tunes+0.013
Neg logit contributions
 first-0.014
-0.014
my-0.014
 Hatt-0.014
ll-0.013
Token we
Token ID356
Feature activation-4.26e-01

Pos logit contributions
+0.042
 first+0.039
 Hatt+0.038
my+0.037
 Bear+0.035
Neg logit contributions
ulp-0.059
aw-0.053
 tune-0.051
 endings-0.051
 tunes-0.049
Token can
Token ID460
Feature activation-1.59e+00

Pos logit contributions
 Hatt+0.014
 Ellie+0.014
ishly+0.014
 attendant+0.014
 Kenny+0.013
Neg logit contributions
ulp-0.016
ater-0.015
aw-0.015
 conclusions-0.014
 som-0.014
Token like
Token ID588
Feature activation+8.46e-01

Pos logit contributions
ulp+0.040
OO+0.036
 endings+0.035
aw+0.035
 tune+0.034
Neg logit contributions
 first-0.028
-0.027
my-0.024
 Hatt-0.023
 Bear-0.022
Token fish
Token ID5916
Feature activation+1.65e+00

Pos logit contributions
ulp+0.034
 tune+0.029
OO+0.029
aw+0.028
 endings+0.028
Neg logit contributions
-0.033
 first-0.033
 Bear-0.029
my-0.028
 Hatt-0.028
Token and
Token ID290
Feature activation+1.34e+00

Pos logit contributions
ulp+0.090
aw+0.081
 tune+0.081
 endings+0.079
 tunes+0.078
Neg logit contributions
-0.040
 first-0.037
my-0.033
 Hatt-0.033
 Bear-0.032
Token shells
Token ID19679
Feature activation+2.21e+00

Pos logit contributions
ulp+0.045
aw+0.039
 tune+0.038
OO+0.036
 endings+0.035
Neg logit contributions
-0.060
 first-0.058
my-0.055
 Hatt-0.054
 Bear-0.054
Token and
Token ID290
Feature activation+1.23e+00

Pos logit contributions
ulp+0.088
aw+0.080
 tune+0.073
 than+0.072
 endings+0.072
Neg logit contributions
-0.082
 Hatt-0.078
 first-0.077
my-0.074
 Bear-0.071
Token seaw
Token ID32085
Feature activation+5.55e+00

Pos logit contributions
ulp+0.041
aw+0.035
 tune+0.034
OO+0.032
 endings+0.032
Neg logit contributions
-0.055
 first-0.053
my-0.050
 Hatt-0.050
 Bear-0.049
Tokeneed
Token ID2308
Feature activation+3.07e+00

Pos logit contributions
ulp+0.118
 tune+0.103
 endings+0.092
 conclusions+0.091
 tunes+0.091
Neg logit contributions
 first-0.288
-0.287
my-0.272
 Hatt-0.271
ll-0.269
Token.
Token ID13
Feature activation+9.06e-01

Pos logit contributions
ulp+0.128
aw+0.114
 tune+0.109
 endings+0.106
 tunes+0.105
Neg logit contributions
-0.110
 first-0.104
 Hatt-0.102
my-0.099
 Bear-0.095
Token But
Token ID887
Feature activation-1.23e-01

Pos logit contributions
ulp+0.048
aw+0.043
 tune+0.042
 endings+0.041
 conclusions+0.040
Neg logit contributions
-0.022
 first-0.021
 Hatt-0.020
my-0.019
 Bear-0.018
Token they
Token ID484
Feature activation+5.95e-01

Pos logit contributions
+0.004
 Hatt+0.004
my+0.003
 first+0.003
 Elephant+0.003
Neg logit contributions
ulp-0.006
aw-0.005
 tune-0.005
 endings-0.005
 tunes-0.005
Token did
Token ID750
Feature activation+2.60e-01

Pos logit contributions
ulp+0.028
aw+0.025
 tune+0.024
 endings+0.023
OO+0.023
Neg logit contributions
-0.018
 Hatt-0.017
 first-0.016
my-0.016
 Bear-0.015
Tokenate
Token ID378
Feature activation-1.78e+00

Pos logit contributions
 first+0.066
+0.065
my+0.060
 Hatt+0.058
 Bear+0.056
Neg logit contributions
ulp-0.071
 tune-0.062
OO-0.060
 endings-0.060
aw-0.058
Token it
Token ID340
Feature activation-2.05e-02

Pos logit contributions
+0.055
 first+0.053
 Hatt+0.049
my+0.049
 Bear+0.047
Neg logit contributions
ulp-0.084
aw-0.075
 tune-0.073
 endings-0.072
 tunes-0.071
Token with
Token ID351
Feature activation-4.84e-01

Pos logit contributions
 Hatt+0.001
+0.001
my+0.001
ishly+0.001
 Bear+0.001
Neg logit contributions
ulp-0.001
 than-0.001
aw-0.001
 tune-0.001
 som-0.001
Token shells
Token ID19679
Feature activation+1.96e+00

Pos logit contributions
+0.018
 first+0.017
 Hatt+0.016
my+0.016
 Bear+0.016
Neg logit contributions
ulp-0.020
aw-0.018
 tune-0.017
 endings-0.017
 tunes-0.016
Token and
Token ID290
Feature activation+1.16e+00

Pos logit contributions
ulp+0.079
aw+0.071
 tune+0.066
ater+0.064
 than+0.064
Neg logit contributions
-0.072
 Hatt-0.069
 first-0.068
my-0.065
 Bear-0.063
Token seaw
Token ID32085
Feature activation+5.45e+00

Pos logit contributions
ulp+0.039
aw+0.034
 tune+0.033
 endings+0.032
 tunes+0.031
Neg logit contributions
-0.051
 first-0.049
 Hatt-0.047
my-0.046
 Bear-0.045
Tokeneed
Token ID2308
Feature activation+3.07e+00

Pos logit contributions
ulp+0.113
 tune+0.096
 endings+0.087
 conclusions+0.085
 tunes+0.084
Neg logit contributions
-0.290
 first-0.285
 Hatt-0.273
my-0.271
 Bear-0.268
Token.
Token ID13
Feature activation+7.23e-01

Pos logit contributions
ulp+0.131
aw+0.116
 tune+0.112
 endings+0.109
 tunes+0.107
Neg logit contributions
-0.109
 first-0.103
 Hatt-0.100
my-0.097
 Bear-0.093
Token They
Token ID1119
Feature activation+1.53e-01

Pos logit contributions
ulp+0.037
aw+0.033
 tune+0.032
 endings+0.032
 tunes+0.031
Neg logit contributions
-0.020
 first-0.019
my-0.017
 Hatt-0.017
 Bear-0.016
Token find
Token ID1064
Feature activation+5.78e-01

Pos logit contributions
ulp+0.006
 som+0.006
ater+0.005
aw+0.005
 than+0.005
Neg logit contributions
 Hatt-0.005
 Ellie-0.005
-0.005
ishly-0.005
 Elephant-0.005
Token a
Token ID257
Feature activation-1.13e-01

Pos logit contributions
ulp+0.026
aw+0.024
 tune+0.022
 endings+0.022
 tunes+0.021
Neg logit contributions
-0.020
 first-0.018
 Hatt-0.018
my-0.017
 Bear-0.016
TokenW
Token ID54
Feature activation-4.83e-01

Pos logit contributions
+0.011
 first+0.010
 Hatt+0.009
my+0.009
 Bear+0.009
Neg logit contributions
ulp-0.013
aw-0.011
 tune-0.011
 endings-0.011
OO-0.011
TokenOO
Token ID6684
Feature activation+5.04e+00

Pos logit contributions
my+0.015
+0.015
 first+0.014
 Bear+0.012
ll+0.012
Neg logit contributions
ulp-0.021
 tune-0.020
 endings-0.019
 tunes-0.019
izing-0.019
TokenF
Token ID37
Feature activation+1.80e-01

Pos logit contributions
ulp+0.210
 tune+0.181
aw+0.178
 endings+0.175
 tunes+0.174
Neg logit contributions
-0.191
 first-0.182
my-0.167
 Hatt-0.167
 Bear-0.163
Token!
Token ID0
Feature activation+1.49e+00

Pos logit contributions
ulp+0.007
aw+0.006
 tune+0.006
 endings+0.006
 tunes+0.006
Neg logit contributions
-0.007
 first-0.007
my-0.006
 Hatt-0.006
 Bear-0.006
Token W
Token ID370
Feature activation+5.77e-01

Pos logit contributions
ulp+0.065
aw+0.058
 tune+0.057
 endings+0.055
 tunes+0.054
Neg logit contributions
-0.051
 first-0.048
 Hatt-0.046
my-0.045
 Bear-0.044
TokenOO
Token ID6684
Feature activation+5.35e+00

Pos logit contributions
ulp+0.020
 tune+0.020
 tunes+0.019
 endings+0.019
 conclusions+0.018
Neg logit contributions
 first-0.022
-0.022
ll-0.020
my-0.019
 Hatt-0.019
TokenF
Token ID37
Feature activation+2.43e-01

Pos logit contributions
ulp+0.223
 tune+0.194
aw+0.189
 tunes+0.187
 endings+0.186
Neg logit contributions
-0.202
 first-0.191
my-0.176
 Hatt-0.176
 Bear-0.172
Token!
Token ID0
Feature activation+1.14e+00

Pos logit contributions
ulp+0.010
aw+0.009
 tune+0.009
 endings+0.008
 tunes+0.008
Neg logit contributions
-0.009
 first-0.009
 Hatt-0.008
my-0.008
 Bear-0.008
Token W
Token ID370
Feature activation+2.46e-01

Pos logit contributions
ulp+0.048
aw+0.042
 tune+0.041
 endings+0.040
OO+0.039
Neg logit contributions
-0.040
 first-0.039
 Hatt-0.037
my-0.036
 Bear-0.034
TokenOO
Token ID6684
Feature activation+5.24e+00

Pos logit contributions
ulp+0.009
 tune+0.008
 tunes+0.008
 endings+0.008
 conclusions+0.007
Neg logit contributions
 first-0.010
-0.009
ll-0.008
my-0.008
 Hatt-0.008
TokenF
Token ID37
Feature activation-1.79e-02

Pos logit contributions
ulp+0.218
 tune+0.189
aw+0.186
 endings+0.182
 tunes+0.181
Neg logit contributions
-0.197
 first-0.187
 Hatt-0.174
my-0.173
 Bear-0.169
Token him
Token ID683
Feature activation+2.19e+00

Pos logit contributions
ulp+0.017
aw+0.016
 tune+0.015
 endings+0.015
 tunes+0.015
Neg logit contributions
-0.010
 Hatt-0.009
 first-0.009
my-0.009
 Bear-0.009
Token about
Token ID546
Feature activation+1.07e+00

Pos logit contributions
ulp+0.088
aw+0.077
 tune+0.076
 endings+0.073
 tunes+0.072
Neg logit contributions
-0.082
 first-0.079
 Hatt-0.074
my-0.074
 Bear-0.071
Token the
Token ID262
Feature activation+1.26e+00

Pos logit contributions
ulp+0.049
aw+0.043
 tune+0.042
OO+0.041
ater+0.041
Neg logit contributions
-0.036
 first-0.036
 Bear-0.032
my-0.032
 Hatt-0.031
Token frog
Token ID21264
Feature activation+8.20e-01

Pos logit contributions
ulp+0.046
aw+0.040
 tune+0.040
 endings+0.038
 tunes+0.037
Neg logit contributions
-0.052
 first-0.049
 Hatt-0.048
my-0.047
 Bear-0.045
Token's
Token ID338
Feature activation+5.04e-02

Pos logit contributions
ulp+0.036
aw+0.032
 tune+0.032
 endings+0.031
 tunes+0.030
Neg logit contributions
-0.027
 first-0.026
 Hatt-0.025
my-0.024
 Bear-0.023
Token habitat
Token ID20018
Feature activation+5.33e+00

Pos logit contributions
ulp+0.002
aw+0.002
 tune+0.002
 endings+0.002
 tunes+0.002
Neg logit contributions
-0.002
 first-0.002
my-0.002
 Hatt-0.002
 Bear-0.002
Token,
Token ID11
Feature activation+9.94e-01

Pos logit contributions
ulp+0.268
 tune+0.237
OO+0.231
 endings+0.226
 tunes+0.225
Neg logit contributions
-0.161
 first-0.156
my-0.139
 Hatt-0.121
 Bear-0.118
Token diet
Token ID5496
Feature activation+4.06e+00

Pos logit contributions
ulp+0.044
aw+0.039
 tune+0.038
 endings+0.037
OO+0.036
Neg logit contributions
-0.034
 first-0.032
my-0.030
 Hatt-0.030
 Bear-0.029
Token and
Token ID290
Feature activation+7.93e-01

Pos logit contributions
ulp+0.188
 tune+0.164
aw+0.164
 endings+0.159
OO+0.156
Neg logit contributions
-0.132
 first-0.125
my-0.115
 Hatt-0.113
 Bear-0.110
Token how
Token ID703
Feature activation-1.42e-01

Pos logit contributions
ulp+0.034
aw+0.030
 tune+0.030
 endings+0.029
 tunes+0.028
Neg logit contributions
-0.027
 first-0.026
 Hatt-0.025
my-0.024
 Bear-0.023
Token it
Token ID340
Feature activation-9.71e-02

Pos logit contributions
+0.005
 first+0.004
 Hatt+0.004
my+0.004
 Bear+0.004
Neg logit contributions
ulp-0.007
aw-0.006
 tune-0.006
 endings-0.006
 tunes-0.005
Token their
Token ID511
Feature activation+1.06e+00

Pos logit contributions
ulp+0.011
aw+0.010
 tune+0.010
 endings+0.009
 tunes+0.009
Neg logit contributions
-0.007
 first-0.006
 Hatt-0.006
my-0.006
 Bear-0.005
Token hair
Token ID4190
Feature activation+2.26e+00

Pos logit contributions
ulp+0.045
aw+0.040
 tune+0.040
 pounds+0.039
izing+0.039
Neg logit contributions
-0.037
 first-0.037
my-0.033
 Hatt-0.032
 Bear-0.031
Token and
Token ID290
Feature activation+9.52e-01

Pos logit contributions
ulp+0.112
 tune+0.098
 endings+0.097
aw+0.097
OO+0.094
Neg logit contributions
 first-0.067
-0.066
my-0.060
 Bear-0.055
 Hatt-0.054
Token on
Token ID319
Feature activation+1.96e+00

Pos logit contributions
ulp+0.041
aw+0.037
 tune+0.036
 endings+0.035
OO+0.034
Neg logit contributions
-0.033
 first-0.032
my-0.030
 Hatt-0.029
 Bear-0.029
Token their
Token ID511
Feature activation+1.96e+00

Pos logit contributions
ulp+0.112
OO+0.102
 endings+0.101
aw+0.101
 pounds+0.100
Neg logit contributions
 first-0.044
-0.041
 both-0.034
 Bear-0.033
my-0.033
Token stomach
Token ID11384
Feature activation+5.33e+00

Pos logit contributions
ulp+0.073
aw+0.067
OO+0.063
 endings+0.061
 pounds+0.061
Neg logit contributions
 first-0.078
-0.078
 Hatt-0.070
my-0.069
 Bear-0.068
Tokens
Token ID82
Feature activation-1.51e+00

Pos logit contributions
ulp+0.266
 tune+0.235
aw+0.235
 endings+0.235
OO+0.230
Neg logit contributions
-0.157
 first-0.149
my-0.137
 Bear-0.125
 Hatt-0.122
Token.
Token ID13
Feature activation+1.19e+00

Pos logit contributions
+0.043
 first+0.043
my+0.037
 Bear+0.035
 Hatt+0.034
Neg logit contributions
ulp-0.074
aw-0.068
OO-0.066
 endings-0.066
 tune-0.065
Token They
Token ID1119
Feature activation+5.72e-01

Pos logit contributions
ulp+0.069
 tune+0.063
aw+0.063
 tunes+0.061
 endings+0.061
Neg logit contributions
-0.025
 first-0.022
 Hatt-0.021
 Bear-0.019
my-0.019
Token are
Token ID389
Feature activation-9.60e-02

Pos logit contributions
ulp+0.022
aw+0.021
 som+0.020
 conclusions+0.020
 tune+0.019
Neg logit contributions
 Hatt-0.019
 first-0.018
 Elephant-0.018
 Ellie-0.017
 Barry-0.017
Token happy
Token ID3772
Feature activation-9.12e-02

Pos logit contributions
+0.003
 first+0.003
 Hatt+0.003
my+0.003
 Bear+0.003
Neg logit contributions
ulp-0.004
aw-0.004
 tune-0.004
 tunes-0.004
 endings-0.004
Token see
Token ID766
Feature activation+1.10e+00

Pos logit contributions
ulp+0.003
 conclusions+0.003
 heights+0.003
 som+0.003
aw+0.003
Neg logit contributions
 Hatt-0.004
-0.003
 Ellie-0.003
ishly-0.003
 Elephant-0.003
Token shells
Token ID19679
Feature activation-1.37e-01

Pos logit contributions
ulp+0.051
 tune+0.045
aw+0.044
OO+0.043
 endings+0.043
Neg logit contributions
-0.036
 first-0.035
my-0.032
 Hatt-0.031
 Bear-0.030
Token,
Token ID11
Feature activation+2.54e-01

Pos logit contributions
+0.004
 Hatt+0.004
 first+0.004
my+0.004
 Bear+0.003
Neg logit contributions
ulp-0.006
aw-0.006
 tune-0.005
 than-0.005
 endings-0.005
Token rocks
Token ID12586
Feature activation+9.73e-01

Pos logit contributions
ulp+0.008
aw+0.007
 tune+0.006
 endings+0.006
 tunes+0.006
Neg logit contributions
-0.012
 first-0.012
 Hatt-0.011
my-0.011
 Bear-0.011
Token,
Token ID11
Feature activation+8.88e-01

Pos logit contributions
ulp+0.048
aw+0.044
 tune+0.041
 than+0.041
ater+0.040
Neg logit contributions
-0.027
 Hatt-0.025
 first-0.024
my-0.023
 Bear-0.022
Token seaw
Token ID32085
Feature activation+5.30e+00

Pos logit contributions
ulp+0.034
aw+0.029
 tune+0.029
 endings+0.028
OO+0.028
Neg logit contributions
-0.035
 first-0.035
my-0.032
 Hatt-0.032
 Bear-0.031
Tokeneed
Token ID2308
Feature activation+2.81e+00

Pos logit contributions
ulp+0.124
 tune+0.110
 endings+0.099
 tunes+0.099
 conclusions+0.098
Neg logit contributions
 first-0.267
-0.266
my-0.248
 Hatt-0.245
ll-0.244
Token,
Token ID11
Feature activation+1.18e+00

Pos logit contributions
ulp+0.149
aw+0.135
 tune+0.133
 endings+0.130
 tunes+0.129
Neg logit contributions
-0.070
 first-0.066
 Hatt-0.060
my-0.059
 Bear-0.056
Token crabs
Token ID49172
Feature activation+3.72e+00

Pos logit contributions
ulp+0.054
 tune+0.048
aw+0.047
 endings+0.047
OO+0.046
Neg logit contributions
-0.039
 first-0.038
my-0.034
 Hatt-0.033
 Bear-0.033
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
ulp+0.204
aw+0.186
 tune+0.183
 endings+0.180
 tunes+0.177
Neg logit contributions
-0.087
 first-0.081
 Hatt-0.073
my-0.072
 Bear-0.068
Token fish
Token ID5916
Feature activation+2.41e+00

Pos logit contributions
ulp+0.038
aw+0.032
 tune+0.032
OO+0.030
 endings+0.030
Neg logit contributions
-0.047
 first-0.045
my-0.042
 Hatt-0.042
 Bear-0.041
Token fruits
Token ID15921
Feature activation-7.15e-02

Pos logit contributions
ulp+0.004
 tune+0.003
OO+0.003
aw+0.003
 endings+0.003
Neg logit contributions
-0.004
 first-0.004
my-0.004
 Bear-0.003
 Hatt-0.003
Token and
Token ID290
Feature activation+6.21e-01

Pos logit contributions
+0.002
 Hatt+0.002
 first+0.002
 Bear+0.002
my+0.002
Neg logit contributions
ulp-0.003
aw-0.003
 than-0.003
 tune-0.003
ater-0.002
Token vegetables
Token ID13701
Feature activation+1.53e+00

Pos logit contributions
ulp+0.026
aw+0.023
 tune+0.022
 endings+0.021
OO+0.021
Neg logit contributions
-0.023
 first-0.022
my-0.021
 Hatt-0.020
 Bear-0.020
Token and
Token ID290
Feature activation+9.29e-01

Pos logit contributions
ulp+0.068
aw+0.061
 tune+0.059
 endings+0.058
 tunes+0.057
Neg logit contributions
-0.051
 first-0.049
 Hatt-0.046
my-0.045
 Bear-0.044
Token his
Token ID465
Feature activation+2.57e+00

Pos logit contributions
ulp+0.034
aw+0.030
 tune+0.029
 endings+0.028
 tunes+0.027
Neg logit contributions
-0.038
 first-0.037
 Hatt-0.035
my-0.035
 Bear-0.034
Token stomach
Token ID11384
Feature activation+5.27e+00

Pos logit contributions
ulp+0.110
 tune+0.096
aw+0.096
 endings+0.094
 tunes+0.092
Neg logit contributions
-0.091
 Hatt-0.085
 first-0.081
my-0.079
 Bear-0.075
Token never
Token ID1239
Feature activation+1.24e+00

Pos logit contributions
ulp+0.257
aw+0.229
 endings+0.223
 tune+0.223
 conclusions+0.219
Neg logit contributions
-0.158
 first-0.153
my-0.146
 Bear-0.134
 Hatt-0.131
Token hurt
Token ID5938
Feature activation-7.73e-01

Pos logit contributions
ulp+0.058
aw+0.054
 tunes+0.051
 tune+0.051
izing+0.051
Neg logit contributions
 first-0.040
-0.040
my-0.038
 Bear-0.033
 -0.030
Token again
Token ID757
Feature activation+6.75e-01

Pos logit contributions
+0.027
 first+0.027
my+0.025
 Hatt+0.024
 Bear+0.024
Neg logit contributions
ulp-0.033
aw-0.029
 tune-0.029
 endings-0.028
 tunes-0.028
Token.
Token ID13
Feature activation-4.26e-02

Pos logit contributions
ulp+0.037
aw+0.035
OO+0.034
 endings+0.033
 tune+0.033
Neg logit contributions
-0.017
 first-0.017
my-0.014
 Bear-0.013
 both-0.011
Token<|endoftext|>
Token ID50256
Feature activation+1.11e+00

Pos logit contributions
 first+0.001
 Hatt+0.001
+0.001
ishly+0.001
my+0.001
Neg logit contributions
ulp-0.002
aw-0.002
 tune-0.002
 endings-0.002
OO-0.002
Token W
Token ID370
Feature activation+5.77e-01

Pos logit contributions
ulp+0.065
aw+0.058
 tune+0.057
 endings+0.055
 tunes+0.054
Neg logit contributions
-0.051
 first-0.048
 Hatt-0.046
my-0.045
 Bear-0.044
TokenOO
Token ID6684
Feature activation+5.35e+00

Pos logit contributions
ulp+0.020
 tune+0.020
 tunes+0.019
 endings+0.019
 conclusions+0.018
Neg logit contributions
 first-0.022
-0.022
ll-0.020
my-0.019
 Hatt-0.019
TokenF
Token ID37
Feature activation+2.43e-01

Pos logit contributions
ulp+0.223
 tune+0.194
aw+0.189
 tunes+0.187
 endings+0.186
Neg logit contributions
-0.202
 first-0.191
my-0.176
 Hatt-0.176
 Bear-0.172
Token!
Token ID0
Feature activation+1.14e+00

Pos logit contributions
ulp+0.010
aw+0.009
 tune+0.009
 endings+0.008
 tunes+0.008
Neg logit contributions
-0.009
 first-0.009
 Hatt-0.008
my-0.008
 Bear-0.008
Token W
Token ID370
Feature activation+2.46e-01

Pos logit contributions
ulp+0.048
aw+0.042
 tune+0.041
 endings+0.040
OO+0.039
Neg logit contributions
-0.040
 first-0.039
 Hatt-0.037
my-0.036
 Bear-0.034
TokenOO
Token ID6684
Feature activation+5.24e+00

Pos logit contributions
ulp+0.009
 tune+0.008
 tunes+0.008
 endings+0.008
 conclusions+0.007
Neg logit contributions
 first-0.010
-0.009
ll-0.008
my-0.008
 Hatt-0.008
TokenF
Token ID37
Feature activation-1.79e-02

Pos logit contributions
ulp+0.218
 tune+0.189
aw+0.186
 endings+0.182
 tunes+0.181
Neg logit contributions
-0.197
 first-0.187
 Hatt-0.174
my-0.173
 Bear-0.169
Token!"
Token ID2474
Feature activation+3.40e-01

Pos logit contributions
+0.001
 first+0.001
 Hatt+0.001
my+0.001
 Bear+0.001
Neg logit contributions
ulp-0.001
aw-0.001
 tune-0.001
 endings-0.001
 tunes-0.001
Token
Token ID198
Feature activation-6.00e-01

Pos logit contributions
aw+0.012
ulp+0.012
 som+0.011
 endings+0.011
 tune+0.011
Neg logit contributions
my-0.012
-0.012
ll-0.011
 first-0.011
ishly-0.011
Token
Token ID198
Feature activation-6.12e-01

Pos logit contributions
+0.014
 first+0.014
 Hatt+0.013
my+0.013
ishly+0.012
Neg logit contributions
ulp-0.031
aw-0.029
 tune-0.027
 endings-0.027
 tunes-0.027
TokenBen
Token ID11696
Feature activation+1.12e-01

Pos logit contributions
+0.022
 first+0.020
 Hatt+0.019
my+0.019
 Bear+0.018
Neg logit contributions
ulp-0.027
aw-0.023
 tune-0.023
 endings-0.023
 tunes-0.022
Token Like
Token ID4525
Feature activation-6.52e-01

Pos logit contributions
 first+0.009
+0.008
 Hatt+0.008
my+0.008
ishly+0.008
Neg logit contributions
ulp-0.009
aw-0.008
 endings-0.008
 conclusions-0.008
 tune-0.008
Token shells
Token ID19679
Feature activation+6.75e-01

Pos logit contributions
 first+0.020
+0.020
 Bear+0.017
 Hatt+0.016
my+0.016
Neg logit contributions
ulp-0.032
aw-0.028
 tune-0.028
OO-0.027
 endings-0.027
Token and
Token ID290
Feature activation-8.24e-02

Pos logit contributions
ulp+0.032
aw+0.029
 tune+0.028
 endings+0.027
 tunes+0.027
Neg logit contributions
-0.021
 first-0.019
 Hatt-0.018
my-0.018
 Bear-0.017
Token rocks
Token ID12586
Feature activation+7.42e-01

Pos logit contributions
+0.004
 first+0.003
my+0.003
 Hatt+0.003
 Bear+0.003
Neg logit contributions
ulp-0.003
aw-0.003
 tune-0.002
OO-0.002
 endings-0.002
Token and
Token ID290
Feature activation+4.86e-01

Pos logit contributions
ulp+0.032
aw+0.029
 tune+0.028
 endings+0.027
ater+0.027
Neg logit contributions
-0.025
 first-0.023
 Hatt-0.023
my-0.022
 Bear-0.022
Token seaw
Token ID32085
Feature activation+5.23e+00

Pos logit contributions
ulp+0.018
aw+0.015
 tune+0.015
OO+0.015
 endings+0.014
Neg logit contributions
-0.020
 first-0.020
my-0.018
 Hatt-0.018
 Bear-0.018
Tokeneed
Token ID2308
Feature activation+2.40e+00

Pos logit contributions
ulp+0.113
 tune+0.094
 tunes+0.083
 endings+0.080
aw+0.080
Neg logit contributions
-0.276
 first-0.274
my-0.259
 Hatt-0.256
 Bear-0.254
Token."
Token ID526
Feature activation+4.86e-02

Pos logit contributions
ulp+0.103
aw+0.092
 tune+0.089
 endings+0.087
 tunes+0.086
Neg logit contributions
-0.083
 first-0.078
 Hatt-0.075
my-0.074
 Bear-0.071
Token
Token ID198
Feature activation+2.94e-02

Pos logit contributions
ulp+0.002
aw+0.002
 conclusions+0.002
 tune+0.002
 endings+0.002
Neg logit contributions
-0.001
 first-0.001
 Hatt-0.001
ishly-0.001
my-0.001
Token
Token ID198
Feature activation-4.57e-02

Pos logit contributions
ulp+0.001
aw+0.001
 tune+0.001
 conclusions+0.001
 endings+0.001
Neg logit contributions
-0.001
 first-0.001
ishly-0.001
 Hatt-0.001
my-0.001
TokenM
Token ID44
Feature activation+6.42e-01

Pos logit contributions
 first+0.002
ishly+0.002
 Hatt+0.002
+0.002
 Bear+0.002
Neg logit contributions
aw-0.001
ulp-0.001
 tune-0.001
 conclusions-0.001
ater-0.001
Token also
Token ID635
Feature activation-1.65e+00

Pos logit contributions
 Hatt+0.045
 Grac+0.040
+0.040
my+0.038
ishly+0.038
Neg logit contributions
 som-0.048
ulp-0.048
 tune-0.047
 heights-0.047
 conclusions-0.047
Token help
Token ID1037
Feature activation+1.65e-03

Pos logit contributions
+0.049
 Hatt+0.048
 first+0.047
my+0.045
 Bear+0.043
Neg logit contributions
ulp-0.076
 tune-0.068
aw-0.067
 endings-0.066
 tunes-0.065
Token you
Token ID345
Feature activation-5.08e-01

Pos logit contributions
ulp+0.000
aw+0.000
 tune+0.000
 endings+0.000
OO+0.000
Neg logit contributions
-0.000
 first-0.000
my-0.000
 Hatt-0.000
 Bear-0.000
Token in
Token ID287
Feature activation-1.41e-01

Pos logit contributions
+0.015
 first+0.015
 Hatt+0.014
my+0.014
 Bear+0.013
Neg logit contributions
ulp-0.024
aw-0.021
 tune-0.021
 endings-0.021
 tunes-0.020
Token tough
Token ID5802
Feature activation+1.26e+00

Pos logit contributions
+0.004
 first+0.004
my+0.003
 Bear+0.003
 Hatt+0.003
Neg logit contributions
ulp-0.007
aw-0.007
 tune-0.006
OO-0.006
 tunes-0.006
Token situations
Token ID7445
Feature activation+5.22e+00

Pos logit contributions
ulp+0.035
aw+0.027
OO+0.026
ater+0.025
 tune+0.025
Neg logit contributions
-0.065
 first-0.064
my-0.060
 Bear-0.059
 Hatt-0.055
Token.
Token ID13
Feature activation-5.27e-01

Pos logit contributions
ulp+0.204
aw+0.188
 endings+0.181
 tune+0.181
 than+0.177
Neg logit contributions
-0.182
 Hatt-0.175
 first-0.174
my-0.168
 Bear-0.162
Token From
Token ID3574
Feature activation-1.45e+00

Pos logit contributions
 first+0.016
 Hatt+0.014
+0.014
my+0.013
ishly+0.013
Neg logit contributions
ulp-0.023
 endings-0.021
aw-0.021
 tune-0.020
 conclusions-0.019
Token that
Token ID326
Feature activation+9.98e-02

Pos logit contributions
+0.053
 Hatt+0.050
 first+0.049
my+0.048
 Bear+0.046
Neg logit contributions
ulp-0.058
aw-0.055
 tune-0.051
 endings-0.049
 tunes-0.049
Token day
Token ID1110
Feature activation+1.61e+00

Pos logit contributions
ulp+0.004
 tune+0.003
 tunes+0.003
 endings+0.003
aw+0.003
Neg logit contributions
 Hatt-0.004
-0.003
 first-0.003
ishly-0.003
my-0.003
Token on
Token ID319
Feature activation+9.14e-01

Pos logit contributions
ulp+0.070
aw+0.063
 tune+0.059
OO+0.059
 endings+0.059
Neg logit contributions
-0.048
 first-0.048
 Hatt-0.046
my-0.046
 Bear-0.043
Token butterfly
Token ID35113
Feature activation-8.23e-01

Pos logit contributions
+0.068
 Hatt+0.067
my+0.061
 first+0.061
ll+0.061
Neg logit contributions
ulp-0.063
 tune-0.057
aw-0.054
 tunes-0.052
 conclusions-0.051
Token back
Token ID736
Feature activation-9.58e-01

Pos logit contributions
 Hatt+0.023
+0.021
 first+0.021
my+0.020
 Bear+0.020
Neg logit contributions
ulp-0.039
 tune-0.036
aw-0.035
 tunes-0.034
 som-0.033
Token in
Token ID287
Feature activation-4.99e-01

Pos logit contributions
+0.029
 Hatt+0.028
 first+0.026
 Bear+0.025
my+0.025
Neg logit contributions
ulp-0.044
aw-0.040
 tune-0.039
 than-0.038
 tunes-0.037
Token its
Token ID663
Feature activation+1.34e+00

Pos logit contributions
+0.016
 first+0.015
 Hatt+0.014
my+0.014
 Bear+0.013
Neg logit contributions
ulp-0.023
aw-0.020
 tune-0.020
 endings-0.019
 tunes-0.019
Token natural
Token ID3288
Feature activation+1.17e+00

Pos logit contributions
ulp+0.054
 tune+0.047
aw+0.047
 tunes+0.044
 endings+0.043
Neg logit contributions
-0.050
 Hatt-0.048
 first-0.048
my-0.046
 Bear-0.044
Token habitat
Token ID20018
Feature activation+5.20e+00

Pos logit contributions
ulp+0.045
aw+0.040
 pounds+0.038
OO+0.038
 tune+0.037
Neg logit contributions
-0.046
 first-0.045
my-0.041
 Bear-0.040
 Hatt-0.039
Token.
Token ID13
Feature activation-4.70e-01

Pos logit contributions
ulp+0.252
 tune+0.223
aw+0.218
 endings+0.216
 tunes+0.216
Neg logit contributions
-0.156
 first-0.154
my-0.139
 Hatt-0.129
 Bear-0.125
Token He
Token ID679
Feature activation-1.76e-02

Pos logit contributions
 first+0.013
+0.013
 Hatt+0.012
my+0.012
ll+0.011
Neg logit contributions
ulp-0.022
aw-0.020
 tune-0.019
 endings-0.019
 tunes-0.018
Token waved
Token ID26834
Feature activation+3.29e-01

Pos logit contributions
+0.001
 first+0.001
 Hatt+0.001
my+0.000
 Ellie+0.000
Neg logit contributions
ulp-0.001
aw-0.001
 tune-0.001
 tunes-0.001
 som-0.001
Token goodbye
Token ID24829
Feature activation+1.07e+00

Pos logit contributions
ulp+0.015
aw+0.013
 tune+0.013
 endings+0.013
 tunes+0.012
Neg logit contributions
-0.011
 first-0.010
 Hatt-0.010
my-0.010
 Bear-0.009
Token to
Token ID284
Feature activation+2.77e-01

Pos logit contributions
ulp+0.044
 tune+0.041
aw+0.040
 than+0.038
 tunes+0.038
Neg logit contributions
-0.038
 Hatt-0.035
 first-0.035
my-0.034
 Bear-0.032
Token added
Token ID2087
Feature activation-3.23e-01

Pos logit contributions
ulp+0.008
aw+0.008
 som+0.007
ater+0.007
 tune+0.007
Neg logit contributions
-0.009
 Hatt-0.009
ishly-0.009
 Barry-0.009
 Elephant-0.009
Token shells
Token ID19679
Feature activation+9.15e-01

Pos logit contributions
+0.012
 first+0.011
 Hatt+0.011
my+0.010
 Bear+0.010
Neg logit contributions
ulp-0.014
aw-0.012
 tune-0.012
 endings-0.011
 tunes-0.011
Token and
Token ID290
Feature activation+7.77e-01

Pos logit contributions
ulp+0.033
aw+0.031
 than+0.029
 tune+0.026
ater+0.026
Neg logit contributions
 Hatt-0.036
-0.035
 first-0.033
 Bear-0.032
my-0.032
Token stones
Token ID14966
Feature activation+2.07e+00

Pos logit contributions
ulp+0.025
aw+0.022
 tune+0.021
 endings+0.020
 tunes+0.020
Neg logit contributions
-0.035
 first-0.033
 Hatt-0.033
my-0.032
 Bear-0.031
Token and
Token ID290
Feature activation+9.42e-01

Pos logit contributions
ulp+0.077
aw+0.072
 than+0.069
 tune+0.063
 som+0.062
Neg logit contributions
 Hatt-0.079
-0.078
 first-0.072
 Bear-0.071
my-0.071
Token seaw
Token ID32085
Feature activation+5.20e+00

Pos logit contributions
ulp+0.034
aw+0.030
 tune+0.029
 endings+0.028
 tunes+0.028
Neg logit contributions
-0.039
 first-0.037
 Hatt-0.036
my-0.035
 Bear-0.034
Tokeneed
Token ID2308
Feature activation+2.60e+00

Pos logit contributions
ulp+0.112
 tune+0.090
aw+0.082
 endings+0.081
 tunes+0.079
Neg logit contributions
-0.282
 first-0.277
 Hatt-0.267
my-0.265
 Bear-0.259
Token to
Token ID284
Feature activation+1.36e+00

Pos logit contributions
ulp+0.113
aw+0.102
 tune+0.097
 endings+0.095
 tunes+0.093
Neg logit contributions
-0.089
 first-0.083
 Hatt-0.082
my-0.079
 Bear-0.076
Token make
Token ID787
Feature activation+8.99e-01

Pos logit contributions
ulp+0.064
aw+0.058
 tune+0.056
 endings+0.056
OO+0.054
Neg logit contributions
-0.042
 first-0.041
my-0.037
 Hatt-0.037
 Bear-0.035
Token it
Token ID340
Feature activation-4.46e-01

Pos logit contributions
ulp+0.038
 tune+0.033
OO+0.032
aw+0.032
 endings+0.032
Neg logit contributions
-0.032
 first-0.031
my-0.029
 Hatt-0.028
 Bear-0.027
Token look
Token ID804
Feature activation+9.83e-01

Pos logit contributions
+0.015
 Hatt+0.015
 first+0.014
my+0.013
 Bear+0.013
Neg logit contributions
ulp-0.019
aw-0.017
 tune-0.017
 som-0.016
 endings-0.016
Token was
Token ID373
Feature activation-9.08e-01

Pos logit contributions
+0.022
 Hatt+0.021
 first+0.021
my+0.020
 Bear+0.019
Neg logit contributions
ulp-0.035
aw-0.033
 tune-0.031
 endings-0.030
 tunes-0.030
Token,
Token ID11
Feature activation-1.00e-01

Pos logit contributions
+0.028
 first+0.027
my+0.024
 Bear+0.022
 Hatt+0.021
Neg logit contributions
ulp-0.044
aw-0.040
 tune-0.040
 endings-0.040
 tunes-0.039
Token and
Token ID290
Feature activation+4.34e-02

Pos logit contributions
+0.003
 first+0.003
my+0.003
 Hatt+0.003
 Bear+0.003
Neg logit contributions
ulp-0.004
aw-0.004
 tune-0.004
 endings-0.004
 tunes-0.004
Token used
Token ID973
Feature activation+7.84e-01

Pos logit contributions
ulp+0.002
aw+0.002
 tune+0.002
 som+0.002
 tunes+0.002
Neg logit contributions
-0.001
 first-0.001
 Hatt-0.001
my-0.001
ll-0.001
Token his
Token ID465
Feature activation+2.54e+00

Pos logit contributions
ulp+0.033
aw+0.029
 tune+0.028
 endings+0.027
 tunes+0.027
Neg logit contributions
-0.029
 first-0.027
 Hatt-0.026
my-0.026
 Bear-0.025
Token net
Token ID2010
Feature activation+5.18e+00

Pos logit contributions
ulp+0.103
aw+0.093
 tune+0.088
 endings+0.087
 tunes+0.085
Neg logit contributions
-0.096
 first-0.093
 Hatt-0.084
my-0.084
 Bear-0.083
Token to
Token ID284
Feature activation+1.83e+00

Pos logit contributions
ulp+0.222
aw+0.198
 tune+0.190
 than+0.189
 endings+0.185
Neg logit contributions
-0.174
 Hatt-0.174
 first-0.158
my-0.151
 Bear-0.149
Token catch
Token ID4929
Feature activation+1.62e+00

Pos logit contributions
ulp+0.083
aw+0.075
 endings+0.073
 tune+0.072
OO+0.070
Neg logit contributions
-0.060
 first-0.059
my-0.052
 Hatt-0.052
 Bear-0.051
Token it
Token ID340
Feature activation-2.17e-01

Pos logit contributions
ulp+0.065
aw+0.059
 tune+0.056
 endings+0.054
 tunes+0.054
Neg logit contributions
-0.060
 first-0.058
 Hatt-0.056
my-0.055
 Bear-0.052
Token.
Token ID13
Feature activation+2.80e-01

Pos logit contributions
+0.008
 Hatt+0.008
 first+0.007
my+0.007
 Bear+0.007
Neg logit contributions
ulp-0.009
aw-0.008
 tune-0.007
 endings-0.007
 tunes-0.007
Token He
Token ID679
Feature activation+2.54e-01

Pos logit contributions
ulp+0.014
aw+0.012
 tune+0.012
 endings+0.011
 som+0.011
Neg logit contributions
-0.008
 first-0.008
my-0.007
 Hatt-0.007
ishly-0.006
Token
Token ID198
Feature activation+1.69e-01

Pos logit contributions
ulp+0.019
aw+0.017
 tune+0.016
 conclusions+0.016
 endings+0.016
Neg logit contributions
ishly-0.011
-0.011
 Hatt-0.011
 first-0.010
 Elephant-0.010
Token"
Token ID1
Feature activation-3.97e-01

Pos logit contributions
ulp+0.007
aw+0.006
 tune+0.006
ater+0.006
 tunes+0.006
Neg logit contributions
-0.006
 Hatt-0.006
 first-0.006
my-0.005
 Bear-0.005
TokenYes
Token ID5297
Feature activation-6.24e-01

Pos logit contributions
+0.015
 first+0.015
 Hatt+0.014
my+0.014
 Bear+0.013
Neg logit contributions
ulp-0.016
aw-0.014
 tune-0.013
 endings-0.013
OO-0.013
Token,
Token ID11
Feature activation-2.26e-01

Pos logit contributions
+0.015
 first+0.014
 Hatt+0.014
my+0.013
 Bear+0.012
Neg logit contributions
ulp-0.033
aw-0.030
 tune-0.029
 endings-0.029
OO-0.028
Token your
Token ID534
Feature activation+1.12e+00

Pos logit contributions
+0.007
 first+0.007
 Hatt+0.006
my+0.006
 Bear+0.006
Neg logit contributions
ulp-0.010
aw-0.009
 tune-0.009
 endings-0.009
 tunes-0.009
Token majesty
Token ID48565
Feature activation+5.18e+00

Pos logit contributions
ulp+0.053
aw+0.047
 tune+0.046
 endings+0.045
 tunes+0.045
Neg logit contributions
-0.035
 first-0.033
 Hatt-0.031
my-0.030
 Bear-0.029
Token.
Token ID13
Feature activation+3.47e-01

Pos logit contributions
ulp+0.216
aw+0.205
 endings+0.191
 than+0.187
 pounds+0.184
Neg logit contributions
-0.179
 Hatt-0.168
 first-0.163
 Bear-0.153
 Grac-0.150
Token What
Token ID1867
Feature activation+7.65e-01

Pos logit contributions
ulp+0.011
aw+0.010
 tune+0.009
 endings+0.009
 tunes+0.009
Neg logit contributions
 first-0.014
-0.014
ll-0.014
my-0.013
 Hatt-0.013
Token do
Token ID466
Feature activation+4.26e-01

Pos logit contributions
ulp+0.040
 tune+0.036
aw+0.035
ater+0.034
 conclusions+0.034
Neg logit contributions
-0.024
 first-0.022
my-0.020
 Bear-0.017
 Hatt-0.017
Token you
Token ID345
Feature activation-1.07e+00

Pos logit contributions
ulp+0.020
aw+0.018
 tune+0.017
 endings+0.017
 tunes+0.017
Neg logit contributions
-0.014
 first-0.013
 Hatt-0.012
my-0.012
 Bear-0.011
Token order
Token ID1502
Feature activation-8.13e-01

Pos logit contributions
 Hatt+0.040
+0.039
ishly+0.038
 first+0.038
 Bear+0.037
Neg logit contributions
ulp-0.040
ater-0.035
aw-0.035
 tune-0.034
 som-0.033
Tokent
Token ID83
Feature activation-6.98e-01

Pos logit contributions
ulp+0.051
aw+0.050
OO+0.046
 pounds+0.043
ater+0.043
Neg logit contributions
 Hatt-0.035
-0.033
 Bear-0.033
 Elephant-0.032
 first-0.032
Token help
Token ID1037
Feature activation-5.32e-01

Pos logit contributions
+0.022
 first+0.021
 Hatt+0.019
my+0.019
 Bear+0.018
Neg logit contributions
ulp-0.033
aw-0.030
 tune-0.029
 endings-0.029
 tunes-0.028
Token it
Token ID340
Feature activation-1.35e+00

Pos logit contributions
+0.017
 first+0.017
 Hatt+0.015
my+0.015
 Bear+0.015
Neg logit contributions
ulp-0.025
aw-0.022
 endings-0.022
 tune-0.022
 tunes-0.021
Tokenâ
Token ID22940
Feature activation-7.30e-01

Pos logit contributions
+0.041
 Hatt+0.040
 first+0.040
my+0.037
 Bear+0.036
Neg logit contributions
ulp-0.060
aw-0.054
 tune-0.053
 endings-0.051
 tunes-0.051
Token
Token ID26391
Feature activation-6.44e-01

Pos logit contributions
+0.024
 first+0.023
 Hatt+0.022
my+0.021
 Bear+0.021
Neg logit contributions
ulp-0.032
aw-0.029
 tune-0.029
 endings-0.028
 tunes-0.027
Token
Token ID447
Feature activation-5.33e+00

Pos logit contributions
+0.025
 first+0.023
my+0.022
 Hatt+0.021
 +0.021
Neg logit contributions
ulp-0.027
 tune-0.023
 tunes-0.022
aw-0.022
 endings-0.022
Token
Token ID251
Feature activation+1.09e-01

Pos logit contributions
 first+0.200
+0.197
 Hatt+0.196
my+0.190
 Ellie+0.181
Neg logit contributions
ulp-0.184
aw-0.179
OO-0.157
 tune-0.156
ater-0.155
Tokenhe
Token ID258
Feature activation-2.52e+00

Pos logit contributions
ulp+0.006
aw+0.006
 tune+0.006
 endings+0.005
 tunes+0.005
Neg logit contributions
-0.002
 first-0.002
 Hatt-0.002
my-0.002
 Bear-0.002
Token reached
Token ID4251
Feature activation+8.58e-01

Pos logit contributions
+0.076
 first+0.073
 Hatt+0.067
my+0.066
 Bear+0.064
Neg logit contributions
ulp-0.121
aw-0.108
 tune-0.105
 endings-0.104
 tunes-0.102
Token out
Token ID503
Feature activation-1.11e+00

Pos logit contributions
ulp+0.039
aw+0.032
 som+0.032
 tune+0.031
 heights+0.031
Neg logit contributions
-0.025
 Hatt-0.025
 Elephant-0.024
ll-0.023
my-0.022
Token and
Token ID290
Feature activation-7.06e-01

Pos logit contributions
+0.038
 Hatt+0.037
my+0.036
 first+0.036
ll+0.034
Neg logit contributions
ulp-0.046
aw-0.040
 tune-0.039
 som-0.038
 tunes-0.037
Token something
Token ID1223
Feature activation-3.09e-02

Pos logit contributions
ulp+0.044
 tune+0.039
aw+0.038
 endings+0.037
ater+0.037
Neg logit contributions
-0.028
 first-0.027
my-0.024
 Hatt-0.024
 Bear-0.024
Token else
Token ID2073
Feature activation-2.32e+00

Pos logit contributions
+0.001
 first+0.001
 Hatt+0.001
my+0.001
 Bear+0.001
Neg logit contributions
ulp-0.001
aw-0.001
 tune-0.001
 endings-0.001
 tunes-0.001
Token too
Token ID1165
Feature activation-2.18e+00

Pos logit contributions
+0.077
 first+0.073
 Hatt+0.068
my+0.067
 Bear+0.065
Neg logit contributions
ulp-0.105
aw-0.094
 tune-0.092
 endings-0.090
 tunes-0.088
Tokenâ
Token ID22940
Feature activation-1.20e+00

Pos logit contributions
+0.069
 first+0.065
my+0.056
 Bear+0.054
 Hatt+0.053
Neg logit contributions
ulp-0.107
 tune-0.097
aw-0.095
 endings-0.095
 tunes-0.093
Token
Token ID26391
Feature activation-6.21e-01

Pos logit contributions
+0.040
 first+0.038
 Hatt+0.037
my+0.035
 Bear+0.035
Neg logit contributions
ulp-0.052
aw-0.047
 tune-0.046
 endings-0.045
 tunes-0.044
Token
Token ID447
Feature activation-5.20e+00

Pos logit contributions
+0.028
 first+0.025
my+0.024
 Hatt+0.024
 +0.023
Neg logit contributions
ulp-0.022
 tune-0.019
 tunes-0.018
 conclusions-0.018
aw-0.018
Token
Token ID251
Feature activation+8.77e-02

Pos logit contributions
 first+0.189
 Hatt+0.186
+0.182
my+0.181
 Bear+0.169
Neg logit contributions
ulp-0.195
aw-0.186
OO-0.167
 endings-0.163
izing-0.159
Tokenit
Token ID270
Feature activation-1.76e+00

Pos logit contributions
ulp+0.005
 tune+0.005
 endings+0.005
aw+0.005
ater+0.005
Neg logit contributions
-0.002
 first-0.001
 -0.001
 Bear-0.001
my-0.001
Token was
Token ID373
Feature activation-7.59e-01

Pos logit contributions
+0.052
 first+0.049
 Hatt+0.046
my+0.046
 Bear+0.043
Neg logit contributions
ulp-0.085
aw-0.076
 tune-0.075
 endings-0.073
 tunes-0.072
Token a
Token ID257
Feature activation-1.16e+00

Pos logit contributions
+0.023
 first+0.022
 Hatt+0.021
my+0.021
 Bear+0.019
Neg logit contributions
ulp-0.036
aw-0.032
 tune-0.031
 endings-0.030
 tunes-0.030
Token sour
Token ID11348
Feature activation-3.50e-01

Pos logit contributions
+0.044
 first+0.042
 Hatt+0.041
my+0.040
 Bear+0.038
Neg logit contributions
ulp-0.046
aw-0.040
 tune-0.040
 endings-0.037
 tunes-0.037
Token noticed
Token ID6810
Feature activation+1.81e+00

Pos logit contributions
ulp+0.038
 tune+0.032
aw+0.032
 pounds+0.031
izing+0.031
Neg logit contributions
-0.030
 first-0.028
my-0.027
 Bear-0.026
 Hatt-0.025
Token something
Token ID1223
Feature activation+4.19e-01

Pos logit contributions
ulp+0.095
 tune+0.084
ater+0.084
OO+0.083
 pounds+0.082
Neg logit contributions
-0.050
 first-0.049
my-0.044
 Bear-0.041
 Hatt-0.041
Token strange
Token ID6283
Feature activation-2.87e-01

Pos logit contributions
ulp+0.018
aw+0.016
 tune+0.015
 endings+0.015
OO+0.015
Neg logit contributions
-0.016
 first-0.015
 Hatt-0.014
my-0.013
 Bear-0.013
Tokenâ
Token ID22940
Feature activation-1.33e+00

Pos logit contributions
+0.007
 first+0.006
my+0.005
 Bear+0.005
 over+0.005
Neg logit contributions
ulp-0.017
 conclusions-0.015
ater-0.015
OO-0.015
 tune-0.015
Token
Token ID26391
Feature activation-6.41e-01

Pos logit contributions
+0.044
 first+0.041
my+0.039
 Hatt+0.036
 Bear+0.034
Neg logit contributions
ulp-0.064
 pounds-0.055
aw-0.055
 endings-0.055
 tune-0.052
Token
Token ID447
Feature activation-5.19e+00

Pos logit contributions
+0.029
 +0.024
 first+0.023
my+0.023
ll+0.022
Neg logit contributions
ulp-0.024
 tune-0.021
 conclusions-0.021
 tunes-0.020
 pounds-0.020
Token
Token ID250
Feature activation-4.20e-01

Pos logit contributions
 first+0.176
+0.173
 Hatt+0.167
my+0.161
 Bear+0.157
Neg logit contributions
ulp-0.212
aw-0.197
OO-0.179
 tune-0.179
 endings-0.177
Tokena
Token ID64
Feature activation+6.86e-01

Pos logit contributions
+0.017
 first+0.016
my+0.015
 Hatt+0.014
 Bear+0.014
Neg logit contributions
ulp-0.018
aw-0.015
 endings-0.014
 tune-0.014
 som-0.014
Token big
Token ID1263
Feature activation+6.52e-03

Pos logit contributions
ulp+0.030
 tune+0.026
aw+0.026
 endings+0.025
 tunes+0.025
Neg logit contributions
-0.026
 first-0.023
my-0.021
 Bear-0.021
 Hatt-0.021
Token,
Token ID11
Feature activation-2.99e-02

Pos logit contributions
ulp+0.000
aw+0.000
 tune+0.000
 endings+0.000
 tunes+0.000
Neg logit contributions
-0.000
 first-0.000
 Bear-0.000
my-0.000
 Hatt-0.000
Token bright
Token ID6016
Feature activation-1.17e-01

Pos logit contributions
+0.001
 first+0.001
 Hatt+0.001
my+0.001
 Bear+0.001
Neg logit contributions
ulp-0.001
aw-0.001
 endings-0.001
 tune-0.001
 tunes-0.001
Token something
Token ID1223
Feature activation+5.90e-01

Pos logit contributions
+0.041
 first+0.039
 Hatt+0.037
my+0.036
 Bear+0.035
Neg logit contributions
ulp-0.059
aw-0.053
 tune-0.052
 endings-0.050
 tunes-0.050
Token was
Token ID373
Feature activation-2.30e-01

Pos logit contributions
ulp+0.025
aw+0.022
 tune+0.021
 endings+0.021
OO+0.021
Neg logit contributions
-0.022
 first-0.020
 Hatt-0.019
my-0.019
 Bear-0.018
Token wrong
Token ID2642
Feature activation+3.53e-01

Pos logit contributions
+0.009
 first+0.008
 Hatt+0.008
my+0.008
 Bear+0.008
Neg logit contributions
ulp-0.009
aw-0.008
 tune-0.008
 endings-0.008
 tunes-0.007
Tokenâ
Token ID22940
Feature activation-5.09e-01

Pos logit contributions
ulp+0.017
aw+0.015
 tune+0.015
 endings+0.014
 conclusions+0.014
Neg logit contributions
-0.011
 first-0.010
my-0.009
 Bear-0.009
 Hatt-0.009
Token
Token ID26391
Feature activation-1.14e-01

Pos logit contributions
+0.017
 first+0.016
 Hatt+0.016
my+0.015
 Bear+0.015
Neg logit contributions
ulp-0.022
aw-0.020
 tune-0.020
 endings-0.019
 tunes-0.019
Token
Token ID447
Feature activation-4.96e+00

Pos logit contributions
+0.005
 first+0.004
my+0.004
 Hatt+0.004
 Bear+0.004
Neg logit contributions
ulp-0.005
 tune-0.004
aw-0.004
 tunes-0.004
 endings-0.004
Token
Token ID251
Feature activation+2.83e-01

Pos logit contributions
 first+0.188
 Hatt+0.184
my+0.178
+0.173
 Ellie+0.171
Neg logit contributions
aw-0.170
ulp-0.168
izing-0.147
OO-0.144
ater-0.143
Tokenall
Token ID439
Feature activation-3.03e+00

Pos logit contributions
ulp+0.015
aw+0.014
 tune+0.014
 endings+0.013
 tunes+0.013
Neg logit contributions
-0.007
 first-0.006
 Hatt-0.006
my-0.006
 Bear-0.005
Token the
Token ID262
Feature activation+4.83e-01

Pos logit contributions
+0.084
 Hatt+0.082
my+0.081
 first+0.080
 Ros+0.076
Neg logit contributions
ulp-0.133
aw-0.132
 tune-0.124
izing-0.118
 tunes-0.116
Token water
Token ID1660
Feature activation-3.63e-01

Pos logit contributions
ulp+0.018
aw+0.017
 tune+0.017
 tunes+0.016
 endings+0.015
Neg logit contributions
-0.018
 Hatt-0.017
my-0.017
 first-0.017
ll-0.016
Token had
Token ID550
Feature activation+9.62e-01

Pos logit contributions
+0.011
 first+0.011
 Hatt+0.010
my+0.010
 Bear+0.009
Neg logit contributions
ulp-0.017
aw-0.015
 tune-0.015
 endings-0.014
 tunes-0.014
Token heard
Token ID2982
Feature activation+2.53e+00

Pos logit contributions
+0.002
 first+0.002
my+0.002
 Bear+0.002
 Hatt+0.002
Neg logit contributions
ulp-0.003
 tune-0.002
 endings-0.002
 tunes-0.002
aw-0.002
Token a
Token ID257
Feature activation-1.10e+00

Pos logit contributions
ulp+0.140
ater+0.125
 tune+0.124
aw+0.122
 endings+0.122
Neg logit contributions
-0.060
 first-0.055
 Bear-0.048
my-0.048
 Hatt-0.046
Token noise
Token ID7838
Feature activation-7.22e-01

Pos logit contributions
+0.043
 first+0.041
 Hatt+0.038
my+0.038
 Bear+0.037
Neg logit contributions
ulp-0.043
aw-0.038
 endings-0.036
 tune-0.036
 tunes-0.035
Tokenâ
Token ID22940
Feature activation-1.24e+00

Pos logit contributions
+0.023
 first+0.021
my+0.019
 Bear+0.019
 Ellie+0.017
Neg logit contributions
ulp-0.036
 endings-0.032
 pounds-0.031
aw-0.031
OO-0.030
Token
Token ID26391
Feature activation-6.86e-01

Pos logit contributions
+0.045
 first+0.042
my+0.040
 Hatt+0.038
 Bear+0.037
Neg logit contributions
ulp-0.054
aw-0.047
 endings-0.046
 pounds-0.045
 tune-0.045
Token
Token ID447
Feature activation-4.92e+00

Pos logit contributions
+0.030
 first+0.026
my+0.026
 +0.025
 Bear+0.025
Neg logit contributions
ulp-0.025
 tune-0.022
 tunes-0.021
 conclusions-0.021
aw-0.020
Token
Token ID251
Feature activation+1.85e-01

Pos logit contributions
+0.161
 first+0.160
 Hatt+0.154
my+0.151
 Bear+0.143
Neg logit contributions
ulp-0.208
aw-0.193
 tune-0.179
OO-0.175
 endings-0.174
Tokenit
Token ID270
Feature activation-1.83e+00

Pos logit contributions
ulp+0.010
 endings+0.009
 pounds+0.009
 appearance+0.009
OO+0.009
Neg logit contributions
-0.005
 Bear-0.004
 first-0.004
 -0.004
 Hatt-0.003
Token sounded
Token ID14846
Feature activation-8.61e-01

Pos logit contributions
+0.054
 first+0.049
 Bear+0.045
my+0.044
 Hatt+0.042
Neg logit contributions
ulp-0.093
aw-0.082
 endings-0.082
 conclusions-0.080
 tune-0.080
Token like
Token ID588
Feature activation-2.59e-01

Pos logit contributions
+0.028
 first+0.026
my+0.024
 Bear+0.023
 Hatt+0.023
Neg logit contributions
ulp-0.040
aw-0.036
 endings-0.035
 pounds-0.034
 tune-0.034
Token someone
Token ID2130
Feature activation+8.82e-01

Pos logit contributions
+0.007
 first+0.007
 Hatt+0.006
 Bear+0.006
my+0.006
Neg logit contributions
ulp-0.013
aw-0.012
 tune-0.012
ater-0.011
 endings-0.011
Token was
Token ID373
Feature activation+4.09e-01

Pos logit contributions
ulp+0.048
aw+0.045
 tune+0.043
 som+0.041
 tunes+0.041
Neg logit contributions
 Hatt-0.042
 first-0.038
-0.038
my-0.037
 Grac-0.035
Token too
Token ID1165
Feature activation-1.27e+00

Pos logit contributions
ulp+0.018
aw+0.017
 tune+0.016
 endings+0.016
 tunes+0.016
Neg logit contributions
-0.014
 first-0.013
my-0.012
 Bear-0.012
 Hatt-0.012
Token late
Token ID2739
Feature activation-2.04e+00

Pos logit contributions
+0.043
 first+0.041
 Hatt+0.039
my+0.038
 Bear+0.037
Neg logit contributions
ulp-0.056
aw-0.050
 tune-0.049
 endings-0.048
 tunes-0.047
Tokenâ
Token ID22940
Feature activation-1.43e+00

Pos logit contributions
+0.070
 first+0.066
 Hatt+0.063
my+0.062
 Bear+0.059
Neg logit contributions
ulp-0.089
aw-0.079
 tune-0.077
 endings-0.075
 tunes-0.074
Token
Token ID26391
Feature activation-7.20e-01

Pos logit contributions
+0.047
 first+0.045
my+0.040
 Hatt+0.040
 Bear+0.039
Neg logit contributions
ulp-0.066
aw-0.059
 pounds-0.057
 endings-0.057
 tune-0.056
Token
Token ID447
Feature activation-4.89e+00

Pos logit contributions
+0.032
 first+0.028
 +0.027
my+0.026
ll+0.026
Neg logit contributions
ulp-0.027
 conclusions-0.023
 tune-0.023
 tunes-0.022
 sensation-0.022
Token
Token ID251
Feature activation+3.17e-02

Pos logit contributions
+0.137
 first+0.130
 Hatt+0.119
my+0.116
 Bear+0.113
Neg logit contributions
ulp-0.247
aw-0.220
 tune-0.219
 endings-0.214
 tunes-0.213
TokenP
Token ID47
Feature activation-5.31e-02

Pos logit contributions
ulp+0.002
 pounds+0.002
 heights+0.002
 endings+0.002
ater+0.002
Neg logit contributions
-0.001
 first-0.001
 Bear-0.001
 -0.001
 Hatt-0.001
Tokenete
Token ID14471
Feature activation+2.11e-01

Pos logit contributions
+0.002
 first+0.001
 Hatt+0.001
my+0.001
 Bear+0.001
Neg logit contributions
ulp-0.003
aw-0.002
 tune-0.002
 endings-0.002
 tunes-0.002
Token slipped
Token ID18859
Feature activation+1.07e+00

Pos logit contributions
ulp+0.009
 tune+0.009
aw+0.008
OO+0.008
 tunes+0.008
Neg logit contributions
 Hatt-0.006
-0.006
 first-0.006
my-0.006
 Ellie-0.006
Token and
Token ID290
Feature activation-1.20e+00

Pos logit contributions
ulp+0.054
aw+0.049
 endings+0.048
 tune+0.047
 tunes+0.046
Neg logit contributions
-0.030
 first-0.029
my-0.026
 Hatt-0.025
 Bear-0.025
Token back
Token ID736
Feature activation-8.46e-01

Pos logit contributions
+0.022
 Hatt+0.021
 Bear+0.020
 first+0.018
 Elephant+0.018
Neg logit contributions
ulp-0.041
aw-0.039
 tune-0.037
 heights-0.037
 than-0.037
Token to
Token ID284
Feature activation-3.83e-01

Pos logit contributions
+0.026
 Hatt+0.025
 first+0.023
ishly+0.022
my+0.022
Neg logit contributions
ulp-0.037
aw-0.034
 heights-0.033
 than-0.032
 tune-0.032
Token watching
Token ID4964
Feature activation+5.27e-01

Pos logit contributions
 first+0.010
+0.009
my+0.008
 Bear+0.008
 Ellie+0.007
Neg logit contributions
ulp-0.021
aw-0.020
 pounds-0.019
ater-0.019
 endings-0.018
Token the
Token ID262
Feature activation-1.24e+00

Pos logit contributions
ulp+0.023
 tune+0.021
aw+0.021
 tunes+0.020
 endings+0.020
Neg logit contributions
-0.017
 first-0.016
 Hatt-0.016
my-0.015
ll-0.014
Token fire
Token ID2046
Feature activation-2.55e+00

Pos logit contributions
+0.047
 first+0.044
 Hatt+0.043
my+0.042
 Bear+0.041
Neg logit contributions
ulp-0.049
aw-0.044
 tune-0.043
 endings-0.041
 tunes-0.041
Tokenwork
Token ID1818
Feature activation-4.86e+00

Pos logit contributions
+0.083
 Hatt+0.081
 first+0.075
my+0.070
 Grac+0.068
Neg logit contributions
ulp-0.109
aw-0.101
 tune-0.100
 tunes-0.096
 endings-0.095
Token show
Token ID905
Feature activation-5.75e-01

Pos logit contributions
+0.151
 first+0.144
my+0.131
 Hatt+0.130
 Bear+0.125
Neg logit contributions
ulp-0.230
aw-0.206
 tune-0.200
 endings-0.198
 tunes-0.193
Token in
Token ID287
Feature activation+1.48e+00

Pos logit contributions
+0.018
 first+0.017
 Hatt+0.017
my+0.016
 Bear+0.016
Neg logit contributions
ulp-0.026
aw-0.023
 tune-0.023
 endings-0.022
 tunes-0.022
Token peace
Token ID4167
Feature activation+9.08e-01

Pos logit contributions
ulp+0.073
aw+0.066
 tune+0.064
 endings+0.063
OO+0.062
Neg logit contributions
-0.045
 first-0.043
my-0.038
 Hatt-0.038
 Bear-0.036
Token.
Token ID13
Feature activation-8.39e-02

Pos logit contributions
ulp+0.042
aw+0.037
 tune+0.036
 endings+0.036
OO+0.035
Neg logit contributions
-0.030
 first-0.028
my-0.025
 Hatt-0.025
 Bear-0.025
Token What
Token ID1867
Feature activation+9.00e-01

Pos logit contributions
 first+0.002
+0.002
 Hatt+0.002
my+0.002
ishly+0.002
Neg logit contributions
ulp-0.004
aw-0.004
 endings-0.004
 tune-0.004
OO-0.003
Token was
Token ID373
Feature activation+6.32e-01

Pos logit contributions
aw+0.050
ulp+0.048
 tune+0.045
 som+0.045
 endings+0.043
Neg logit contributions
 Hatt-0.049
 Grac-0.044
 Ellie-0.043
 first-0.043
-0.043
Token too
Token ID1165
Feature activation-1.38e+00

Pos logit contributions
ulp+0.031
aw+0.028
 tune+0.028
 som+0.027
 endings+0.027
Neg logit contributions
-0.017
 Hatt-0.016
 first-0.015
my-0.014
 Grac-0.014
Token late
Token ID2739
Feature activation-1.66e+00

Pos logit contributions
+0.048
 first+0.046
 Hatt+0.044
my+0.043
 Bear+0.041
Neg logit contributions
ulp-0.059
aw-0.053
 tune-0.051
 endings-0.050
 tunes-0.049
Tokenâ
Token ID22940
Feature activation-8.84e-01

Pos logit contributions
+0.058
 first+0.055
 Hatt+0.053
my+0.052
 Bear+0.050
Neg logit contributions
ulp-0.071
aw-0.063
 tune-0.061
 endings-0.060
 tunes-0.059
Token
Token ID26391
Feature activation-1.99e-01

Pos logit contributions
+0.030
 first+0.028
 Hatt+0.027
my+0.026
 Bear+0.026
Neg logit contributions
ulp-0.039
aw-0.035
 tune-0.034
 endings-0.033
 tunes-0.033
Token
Token ID447
Feature activation-4.86e+00

Pos logit contributions
+0.008
 first+0.007
my+0.007
 +0.007
 Hatt+0.007
Neg logit contributions
ulp-0.008
 tune-0.007
 tunes-0.007
 conclusions-0.007
aw-0.007
Token
Token ID251
Feature activation+1.71e-01

Pos logit contributions
+0.175
 first+0.172
my+0.167
 Hatt+0.167
 Bear+0.157
Neg logit contributions
ulp-0.182
aw-0.175
OO-0.156
 tune-0.155
 endings-0.154
Token the
Token ID262
Feature activation-2.54e-01

Pos logit contributions
ulp+0.010
aw+0.009
 tune+0.009
 endings+0.009
 tunes+0.009
Neg logit contributions
-0.003
 first-0.003
 Hatt-0.003
my-0.003
 Bear-0.003
Token ice
Token ID4771
Feature activation-8.73e-01

Pos logit contributions
+0.010
 Hatt+0.009
my+0.009
 first+0.009
ll+0.008
Neg logit contributions
ulp-0.010
aw-0.009
 tune-0.009
 endings-0.008
 tunes-0.008
Token melted
Token ID24178
Feature activation-1.10e+00

Pos logit contributions
+0.034
 first+0.032
 Hatt+0.031
my+0.030
 Bear+0.029
Neg logit contributions
ulp-0.034
aw-0.031
 tune-0.030
 endings-0.029
 tunes-0.028
Token and
Token ID290
Feature activation-4.15e-01

Pos logit contributions
+0.029
 first+0.026
 Hatt+0.025
my+0.024
 Bear+0.023
Neg logit contributions
ulp-0.057
aw-0.052
 tune-0.051
 endings-0.050
 tunes-0.049
Token was
Token ID373
Feature activation-1.13e-01

Pos logit contributions
 Hatt+0.014
 first+0.014
+0.014
my+0.013
ishly+0.012
Neg logit contributions
ulp-0.018
aw-0.017
 tune-0.017
 tunes-0.016
 som-0.015
Token too
Token ID1165
Feature activation-1.74e+00

Pos logit contributions
+0.004
 first+0.003
 Hatt+0.003
my+0.003
 Bear+0.003
Neg logit contributions
ulp-0.005
aw-0.005
 tune-0.005
 endings-0.004
 tunes-0.004
Token late
Token ID2739
Feature activation-2.18e+00

Pos logit contributions
+0.064
 Hatt+0.062
 first+0.061
ishly+0.058
my+0.058
Neg logit contributions
ulp-0.068
aw-0.062
 tune-0.058
ater-0.056
OO-0.056
Tokenâ
Token ID22940
Feature activation-1.39e+00

Pos logit contributions
+0.066
 first+0.063
 Hatt+0.059
my+0.058
 Bear+0.055
Neg logit contributions
ulp-0.104
aw-0.093
 tune-0.091
 endings-0.089
 tunes-0.088
Token
Token ID26391
Feature activation-5.90e-01

Pos logit contributions
+0.046
 first+0.044
 Hatt+0.042
my+0.041
 Bear+0.039
Neg logit contributions
ulp-0.062
aw-0.055
 tune-0.054
 endings-0.053
 tunes-0.052
Token
Token ID447
Feature activation-4.77e+00

Pos logit contributions
+0.025
 first+0.021
 +0.021
my+0.021
 Hatt+0.020
Neg logit contributions
ulp-0.024
 tune-0.020
 conclusions-0.020
 tunes-0.020
ilt-0.019
Token
Token ID250
Feature activation-4.63e-01

Pos logit contributions
+0.148
 first+0.141
 Hatt+0.132
my+0.131
 Bear+0.125
Neg logit contributions
ulp-0.222
aw-0.200
 tune-0.194
 endings-0.190
 tunes-0.186
Tokenhis
Token ID14363
Feature activation-2.93e-01

Pos logit contributions
+0.019
 first+0.017
my+0.017
 Hatt+0.016
 Bear+0.016
Neg logit contributions
ulp-0.019
aw-0.015
 endings-0.015
 tune-0.015
 som-0.015
Token foot
Token ID2366
Feature activation+2.72e+00

Pos logit contributions
 first+0.011
+0.010
 Bear+0.009
my+0.009
 Hatt+0.009
Neg logit contributions
ulp-0.012
aw-0.011
 tune-0.011
 tunes-0.011
 endings-0.011
Token had
Token ID550
Feature activation+7.09e-01

Pos logit contributions
ulp+0.121
 tune+0.103
aw+0.103
 endings+0.102
 tunes+0.100
Neg logit contributions
-0.101
 first-0.092
my-0.086
 Bear-0.080
 Hatt-0.080
Token caught
Token ID4978
Feature activation+1.32e-01

Pos logit contributions
ulp+0.036
aw+0.031
 tune+0.031
 tunes+0.031
 conclusions+0.031
Neg logit contributions
-0.023
 first-0.020
my-0.017
 Bear-0.017
 Brown-0.016
Token was
Token ID373
Feature activation+2.70e-01

Pos logit contributions
ulp+0.042
aw+0.042
 tune+0.041
 som+0.039
 tunes+0.039
Neg logit contributions
 Hatt-0.042
my-0.037
 first-0.037
 First-0.036
 Grac-0.036
Token too
Token ID1165
Feature activation-1.41e+00

Pos logit contributions
ulp+0.013
 tune+0.012
aw+0.012
 than+0.011
 som+0.011
Neg logit contributions
-0.007
 Hatt-0.007
 first-0.007
my-0.006
 Bear-0.006
Token late
Token ID2739
Feature activation-2.22e+00

Pos logit contributions
+0.051
 first+0.048
 Hatt+0.048
my+0.046
 Bear+0.044
Neg logit contributions
ulp-0.057
aw-0.051
 tune-0.049
OO-0.047
ater-0.047
Tokenâ
Token ID22940
Feature activation-1.43e+00

Pos logit contributions
+0.073
 first+0.069
 Hatt+0.067
my+0.066
 Bear+0.063
Neg logit contributions
ulp-0.098
aw-0.088
 tune-0.086
 endings-0.084
 tunes-0.082
Token
Token ID26391
Feature activation-6.69e-01

Pos logit contributions
+0.048
 first+0.046
 Hatt+0.044
my+0.043
 Bear+0.042
Neg logit contributions
ulp-0.062
aw-0.056
 tune-0.055
 endings-0.053
 tunes-0.052
Token
Token ID447
Feature activation-4.75e+00

Pos logit contributions
+0.033
 first+0.029
 Hatt+0.029
my+0.028
 Bear+0.027
Neg logit contributions
ulp-0.023
 tune-0.018
 conclusions-0.017
aw-0.017
 tunes-0.017
Token
Token ID251
Feature activation-3.70e-02

Pos logit contributions
+0.158
 first+0.151
my+0.146
 Hatt+0.144
 Bear+0.136
Neg logit contributions
ulp-0.204
aw-0.188
 tune-0.176
 endings-0.174
OO-0.173
Tokenthey
Token ID9930
Feature activation-1.02e+00

Pos logit contributions
+0.001
 first+0.001
 Hatt+0.001
 Bear+0.001
my+0.001
Neg logit contributions
ulp-0.002
aw-0.002
 tune-0.002
 pounds-0.002
 endings-0.002
Token had
Token ID550
Feature activation+1.30e-01

Pos logit contributions
 Hatt+0.031
 first+0.029
my+0.029
+0.028
 Grac+0.027
Neg logit contributions
ulp-0.044
aw-0.042
 tune-0.041
 som-0.039
OO-0.038
Token already
Token ID1541
Feature activation-1.33e+00

Pos logit contributions
ulp+0.006
aw+0.005
 tune+0.005
 endings+0.005
 tunes+0.005
Neg logit contributions
-0.004
 first-0.004
 Hatt-0.004
my-0.004
 Bear-0.004
Token drifted
Token ID38648
Feature activation-5.97e-02

Pos logit contributions
+0.042
 first+0.040
my+0.036
 Hatt+0.033
 Bear+0.033
Neg logit contributions
ulp-0.063
 tune-0.056
aw-0.056
 endings-0.054
izing-0.054
Token was
Token ID373
Feature activation+2.15e-01

Pos logit contributions
ulp+0.029
aw+0.027
 tune+0.026
 som+0.026
OO+0.025
Neg logit contributions
 Hatt-0.036
 Grac-0.031
 Alfred-0.030
 first-0.030
 First-0.030
Token too
Token ID1165
Feature activation-1.57e+00

Pos logit contributions
ulp+0.011
 tune+0.009
aw+0.009
 pounds+0.009
OO+0.009
Neg logit contributions
-0.006
 Hatt-0.006
 first-0.005
my-0.005
ishly-0.005
Token late
Token ID2739
Feature activation-2.04e+00

Pos logit contributions
+0.058
 Hatt+0.056
 first+0.055
my+0.052
ishly+0.052
Neg logit contributions
ulp-0.063
aw-0.054
 tune-0.053
OO-0.052
ater-0.051
Tokenâ
Token ID22940
Feature activation-1.11e+00

Pos logit contributions
+0.073
 Hatt+0.070
 first+0.069
my+0.068
 Bear+0.064
Neg logit contributions
ulp-0.081
aw-0.073
 tune-0.070
 endings-0.070
OO-0.067
Token
Token ID26391
Feature activation-4.23e-01

Pos logit contributions
+0.038
 first+0.037
 Hatt+0.035
my+0.034
 Bear+0.033
Neg logit contributions
ulp-0.048
aw-0.043
 tune-0.042
 endings-0.041
 tunes-0.040
Token
Token ID447
Feature activation-4.71e+00

Pos logit contributions
+0.020
 first+0.018
my+0.017
 Hatt+0.017
 Bear+0.017
Neg logit contributions
ulp-0.015
 tune-0.012
 tunes-0.012
 conclusions-0.012
aw-0.011
Token
Token ID251
Feature activation+8.34e-02

Pos logit contributions
+0.169
 first+0.167
my+0.163
 Hatt+0.160
 Bear+0.149
Neg logit contributions
ulp-0.182
aw-0.167
OO-0.154
 endings-0.151
 tune-0.150
Tokenthe
Token ID1169
Feature activation-9.70e-01

Pos logit contributions
ulp+0.005
aw+0.004
 tune+0.004
 endings+0.004
 tunes+0.004
Neg logit contributions
-0.002
 first-0.002
 Hatt-0.002
my-0.002
 Bear-0.002
Token o
Token ID267
Feature activation-1.97e+00

Pos logit contributions
+0.037
 first+0.035
 Hatt+0.034
my+0.034
 Bear+0.032
Neg logit contributions
ulp-0.038
aw-0.034
 tune-0.033
 endings-0.032
 tunes-0.032
Tokenat
Token ID265
Feature activation-1.38e+00

Pos logit contributions
+0.080
 first+0.077
 Hatt+0.075
my+0.072
 Bear+0.071
Neg logit contributions
ulp-0.073
aw-0.065
 tune-0.062
 endings-0.059
OO-0.058
Token had
Token ID550
Feature activation+6.29e-01

Pos logit contributions
 Hatt+0.045
+0.044
 first+0.041
 Grac+0.039
my+0.038
Neg logit contributions
ulp-0.058
aw-0.055
 tune-0.053
 tunes-0.049
OO-0.049
Token
Token ID198
Feature activation-1.61e-01

Pos logit contributions
ulp+0.004
aw+0.004
 tune+0.004
 tunes+0.004
 som+0.004
Neg logit contributions
 Hatt-0.002
ishly-0.002
-0.002
 first-0.002
my-0.002
TokenThey
Token ID2990
Feature activation-9.41e-01

Pos logit contributions
+0.007
 first+0.007
 Hatt+0.006
my+0.006
 Bear+0.006
Neg logit contributions
ulp-0.006
aw-0.005
 tune-0.005
 tunes-0.005
 endings-0.005
Token watched
Token ID7342
Feature activation+1.04e-01

Pos logit contributions
+0.030
 Hatt+0.030
 first+0.030
my+0.027
 attendant+0.027
Neg logit contributions
ulp-0.038
aw-0.037
 tune-0.035
 tunes-0.034
 som-0.033
Token the
Token ID262
Feature activation-1.31e+00

Pos logit contributions
ulp+0.005
aw+0.005
 tune+0.004
 endings+0.004
 tunes+0.004
Neg logit contributions
-0.003
 first-0.003
 Hatt-0.003
my-0.003
 Bear-0.003
Token fire
Token ID2046
Feature activation-2.55e+00

Pos logit contributions
+0.058
 Hatt+0.055
 first+0.054
my+0.052
 Bear+0.050
Neg logit contributions
ulp-0.044
 tune-0.039
aw-0.038
 tunes-0.036
 endings-0.035
Tokenwork
Token ID1818
Feature activation-4.70e+00

Pos logit contributions
+0.079
 Hatt+0.077
 first+0.068
 Grac+0.065
 Ellie+0.064
Neg logit contributions
ulp-0.113
 tune-0.107
aw-0.103
 tunes-0.103
izing-0.099
Token spark
Token ID9009
Feature activation-2.11e+00

Pos logit contributions
+0.142
 first+0.134
 Hatt+0.127
my+0.125
 Bear+0.119
Neg logit contributions
ulp-0.223
aw-0.200
 tune-0.197
 endings-0.191
 tunes-0.189
Tokenle
Token ID293
Feature activation-1.22e+00

Pos logit contributions
+0.091
 first+0.090
my+0.082
 Hatt+0.081
 Bear+0.079
Neg logit contributions
ulp-0.077
 endings-0.061
aw-0.060
OO-0.060
 tune-0.059
Token above
Token ID2029
Feature activation-2.97e-01

Pos logit contributions
+0.037
 first+0.036
my+0.031
 Bear+0.030
 Hatt+0.029
Neg logit contributions
ulp-0.060
aw-0.052
 endings-0.052
OO-0.051
 tune-0.050
Token them
Token ID606
Feature activation-1.23e+00

Pos logit contributions
+0.008
 first+0.008
my+0.007
 Bear+0.007
 Hatt+0.007
Neg logit contributions
ulp-0.015
aw-0.013
 endings-0.013
 tune-0.013
OO-0.013
Token in
Token ID287
Feature activation-6.74e-03

Pos logit contributions
+0.041
 Hatt+0.040
 first+0.039
my+0.037
 Bear+0.036
Neg logit contributions
ulp-0.052
 tune-0.047
aw-0.047
 tunes-0.045
 endings-0.044
TokenCan
Token ID6090
Feature activation+2.22e-01

Pos logit contributions
+0.017
 first+0.016
my+0.015
 Hatt+0.015
 Bear+0.014
Neg logit contributions
ulp-0.023
aw-0.020
 endings-0.020
 tune-0.020
OO-0.019
Token I
Token ID314
Feature activation-9.86e-01

Pos logit contributions
ulp+0.007
 tune+0.006
aw+0.006
 endings+0.006
 heights+0.006
Neg logit contributions
 Hatt-0.009
my-0.008
-0.008
sy-0.008
 first-0.008
Token put
Token ID1234
Feature activation-1.65e+00

Pos logit contributions
 Hatt+0.030
+0.030
 first+0.029
my+0.027
ishly+0.027
Neg logit contributions
ulp-0.044
aw-0.038
 tune-0.038
ater-0.037
 som-0.036
Token the
Token ID262
Feature activation-1.51e+00

Pos logit contributions
+0.055
 first+0.051
 Hatt+0.050
my+0.049
 Bear+0.047
Neg logit contributions
ulp-0.072
aw-0.065
 tune-0.062
ater-0.060
 pounds-0.060
Token cup
Token ID6508
Feature activation-1.96e+00

Pos logit contributions
+0.063
 first+0.060
 Hatt+0.058
my+0.057
 Bear+0.056
Neg logit contributions
ulp-0.054
aw-0.047
 tune-0.046
 endings-0.044
 tunes-0.043
Tokencakes
Token ID37263
Feature activation-4.69e+00

Pos logit contributions
 Hatt+0.062
+0.058
 Elephant+0.054
 Barry+0.054
 Alfred+0.053
Neg logit contributions
ulp-0.084
aw-0.077
ater-0.071
 than-0.069
 tune-0.068
Token in
Token ID287
Feature activation-7.03e-01

Pos logit contributions
 Hatt+0.142
+0.138
 Elephant+0.132
 Bear+0.130
ishly+0.128
Neg logit contributions
ulp-0.205
aw-0.190
 than-0.179
ater-0.177
 tune-0.177
Token the
Token ID262
Feature activation+6.23e-01

Pos logit contributions
+0.022
 first+0.021
 Hatt+0.019
my+0.019
 Bear+0.018
Neg logit contributions
ulp-0.033
aw-0.030
 tune-0.029
 endings-0.028
 tunes-0.028
Token oven
Token ID14361
Feature activation-9.48e-01

Pos logit contributions
ulp+0.024
aw+0.021
 tune+0.020
 endings+0.019
ater+0.019
Neg logit contributions
-0.024
 Hatt-0.023
 first-0.023
my-0.022
 Bear-0.021
Token?"
Token ID1701
Feature activation+7.47e-01

Pos logit contributions
+0.035
 Hatt+0.034
 first+0.031
 Bear+0.031
my+0.031
Neg logit contributions
ulp-0.037
aw-0.033
 tune-0.031
ater-0.030
 than-0.030
Token June
Token ID2795
Feature activation+6.50e-01

Pos logit contributions
 tune+0.029
 tunes+0.029
ulp+0.028
aw+0.027
 insect+0.027
Neg logit contributions
ll-0.023
 first-0.020
 Hatt-0.020
ishly-0.020
my-0.018
Token a
Token ID257
Feature activation-3.57e-01

Pos logit contributions
 first+0.011
+0.010
 Bear+0.008
 Brown+0.008
 Ros+0.008
Neg logit contributions
ulp-0.018
aw-0.017
ater-0.016
OO-0.016
 conclusions-0.016
Token normal
Token ID3487
Feature activation+5.63e-02

Pos logit contributions
+0.012
 first+0.012
 Hatt+0.010
 Bear+0.010
my+0.010
Neg logit contributions
ulp-0.017
aw-0.015
 endings-0.014
 heights-0.014
iling-0.014
Token penny
Token ID26369
Feature activation+1.10e-01

Pos logit contributions
ulp+0.003
aw+0.002
cks+0.002
 endings+0.002
iling+0.002
Neg logit contributions
 first-0.002
-0.002
my-0.002
 Hatt-0.002
 Bear-0.001
Tokenâ
Token ID22940
Feature activation-3.23e-01

Pos logit contributions
ulp+0.006
aw+0.005
 tune+0.005
 endings+0.005
OO+0.005
Neg logit contributions
-0.003
 first-0.003
my-0.003
 Hatt-0.003
 Bear-0.003
Token
Token ID26391
Feature activation-2.19e-01

Pos logit contributions
+0.010
 first+0.009
my+0.009
 Hatt+0.008
 Bear+0.008
Neg logit contributions
ulp-0.016
aw-0.014
 pounds-0.014
 tune-0.014
 endings-0.013
Token
Token ID447
Feature activation-4.69e+00

Pos logit contributions
+0.010
 +0.008
my+0.008
 first+0.008
 Hatt+0.008
Neg logit contributions
ulp-0.008
 tune-0.008
 conclusions-0.007
 tunes-0.007
 sensation-0.007
Token
Token ID251
Feature activation+3.48e-01

Pos logit contributions
+0.132
 first+0.124
 Hatt+0.114
my+0.114
 Bear+0.109
Neg logit contributions
ulp-0.237
aw-0.212
 tune-0.211
 endings-0.207
 tunes-0.204
Tokenit
Token ID270
Feature activation-1.38e+00

Pos logit contributions
ulp+0.019
 pounds+0.017
OO+0.017
ater+0.017
 endings+0.017
Neg logit contributions
-0.009
 first-0.008
 -0.008
my-0.007
 Bear-0.007
Token was
Token ID373
Feature activation-4.33e-01

Pos logit contributions
+0.034
 first+0.030
my+0.026
 Bear+0.026
 +0.023
Neg logit contributions
ulp-0.080
 conclusions-0.071
 endings-0.071
 heights-0.070
aw-0.069
Token special
Token ID2041
Feature activation-1.39e+00

Pos logit contributions
+0.012
 first+0.012
 Hatt+0.011
my+0.011
 Bear+0.010
Neg logit contributions
ulp-0.022
aw-0.019
 tune-0.019
 endings-0.019
 tunes-0.018
Token.
Token ID13
Feature activation-8.06e-01

Pos logit contributions
 first+0.037
+0.035
 Bear+0.031
my+0.029
 Brown+0.027
Neg logit contributions
ulp-0.076
aw-0.069
OO-0.067
 conclusions-0.066
 endings-0.065
Token.
Token ID13
Feature activation-4.66e-01

Pos logit contributions
ulp+0.048
aw+0.043
 tune+0.042
 endings+0.041
 tunes+0.040
Neg logit contributions
-0.037
 first-0.035
 Hatt-0.033
my-0.033
 Bear-0.031
Token They
Token ID1119
Feature activation-4.55e-01

Pos logit contributions
 first+0.013
+0.013
my+0.012
 Hatt+0.012
ll+0.011
Neg logit contributions
ulp-0.022
aw-0.020
 tune-0.019
 endings-0.019
OO-0.018
Token watched
Token ID7342
Feature activation-2.18e-02

Pos logit contributions
 Hatt+0.015
 first+0.014
+0.013
 attendant+0.013
 Grac+0.012
Neg logit contributions
aw-0.018
ulp-0.017
 tune-0.016
 tunes-0.016
 som-0.015
Token the
Token ID262
Feature activation-8.96e-01

Pos logit contributions
+0.001
 first+0.001
 Hatt+0.001
my+0.001
 Bear+0.001
Neg logit contributions
ulp-0.001
aw-0.001
 tune-0.001
 endings-0.001
 tunes-0.001
Token fire
Token ID2046
Feature activation-2.33e+00

Pos logit contributions
+0.040
 Hatt+0.039
my+0.036
 first+0.036
 Grac+0.035
Neg logit contributions
ulp-0.030
 tune-0.027
aw-0.025
 tunes-0.025
 endings-0.024
Tokenwork
Token ID1818
Feature activation-4.65e+00

Pos logit contributions
+0.074
 Hatt+0.074
 first+0.064
 Grac+0.062
 Ellie+0.061
Neg logit contributions
ulp-0.100
 tune-0.096
aw-0.093
 tunes-0.091
 than-0.087
Token until
Token ID1566
Feature activation+5.95e-01

Pos logit contributions
+0.144
 first+0.137
 Hatt+0.134
my+0.129
 Bear+0.123
Neg logit contributions
ulp-0.214
aw-0.191
 tune-0.191
 tunes-0.183
 endings-0.182
Token it
Token ID340
Feature activation-3.68e-01

Pos logit contributions
ulp+0.026
aw+0.023
 tune+0.022
 endings+0.022
 tunes+0.021
Neg logit contributions
-0.021
 first-0.020
 Hatt-0.019
my-0.019
 Bear-0.018
Token was
Token ID373
Feature activation-1.06e+00

Pos logit contributions
+0.011
 Hatt+0.010
 first+0.010
my+0.010
 Bear+0.009
Neg logit contributions
ulp-0.017
aw-0.016
 tune-0.015
 endings-0.015
 tunes-0.015
Token all
Token ID477
Feature activation-1.53e+00

Pos logit contributions
+0.043
 first+0.042
 Hatt+0.040
my+0.039
 Bear+0.038
Neg logit contributions
ulp-0.039
aw-0.034
 tune-0.033
 endings-0.032
 tunes-0.031
Token finished
Token ID5201
Feature activation-1.39e+00

Pos logit contributions
+0.056
 Hatt+0.056
 first+0.053
 Bear+0.049
my+0.048
Neg logit contributions
ulp-0.061
aw-0.053
 tune-0.053
 tunes-0.050
 som-0.049
Token Her
Token ID2332
Feature activation-1.10e+00

Pos logit contributions
+0.010
my+0.008
 first+0.008
 Hatt+0.007
 Bear+0.006
Neg logit contributions
ulp-0.054
aw-0.050
 pounds-0.050
 tune-0.049
ater-0.049
Token name
Token ID1438
Feature activation-3.59e-01

Pos logit contributions
+0.033
 first+0.031
 Hatt+0.030
my+0.029
 Bear+0.028
Neg logit contributions
ulp-0.052
aw-0.046
 tune-0.046
 endings-0.044
OO-0.044
Token was
Token ID373
Feature activation-5.23e-01

Pos logit contributions
+0.008
 first+0.007
my+0.007
 Hatt+0.006
 Bear+0.006
Neg logit contributions
ulp-0.021
 tune-0.019
 conclusions-0.019
 endings-0.019
 pounds-0.018
Token
Token ID220
Feature activation-9.98e-01

Pos logit contributions
+0.015
 first+0.014
my+0.013
 Hatt+0.013
ll+0.013
Neg logit contributions
ulp-0.024
aw-0.022
 tune-0.021
 ow-0.021
 endings-0.020
TokenÃ
Token ID5746
Feature activation-1.75e+00

Pos logit contributions
+0.035
 first+0.034
 Hatt+0.033
ishly+0.031
 Bear+0.031
Neg logit contributions
ulp-0.043
aw-0.037
OO-0.035
 tune-0.035
ater-0.034
Token
Token ID447
Feature activation-4.61e+00

Pos logit contributions
+0.063
 first+0.059
 Hatt+0.057
my+0.055
 Bear+0.053
Neg logit contributions
ulp-0.073
aw-0.066
 tune-0.064
 endings-0.061
 tunes-0.061
Token
Token ID108
Feature activation-2.51e+00

Pos logit contributions
+0.129
 first+0.122
 Hatt+0.113
my+0.113
 Bear+0.106
Neg logit contributions
ulp-0.231
aw-0.208
 tune-0.205
 endings-0.201
 tunes-0.198
Tokenmil
Token ID25433
Feature activation-1.10e+00

Pos logit contributions
+0.086
 first+0.083
 Hatt+0.078
my+0.077
 Bear+0.075
Neg logit contributions
ulp-0.107
aw-0.097
 tune-0.094
 endings-0.090
 tunes-0.090
Tokenie
Token ID494
Feature activation+1.48e+00

Pos logit contributions
+0.050
 first+0.048
 Hatt+0.046
my+0.046
 Bear+0.045
Neg logit contributions
ulp-0.036
aw-0.031
 tune-0.030
 endings-0.029
 tunes-0.028
Token.
Token ID13
Feature activation-4.90e-01

Pos logit contributions
ulp+0.056
aw+0.055
 tune+0.053
 tunes+0.050
ater+0.048
Neg logit contributions
-0.052
 Hatt-0.050
 first-0.050
my-0.046
 Grac-0.045
Token
Token ID220
Feature activation-6.61e-01

Pos logit contributions
 first+0.011
+0.011
ll+0.009
 Hatt+0.009
my+0.009
Neg logit contributions
ulp-0.025
aw-0.024
 tune-0.023
 tunes-0.023
 endings-0.022
Token Lily
Token ID20037
Feature activation+1.76e-01

Pos logit contributions
ulp+0.026
aw+0.024
 endings+0.023
 tune+0.023
 pounds+0.023
Neg logit contributions
 first-0.009
-0.009
 Hatt-0.008
my-0.008
 Bear-0.007
Token's
Token ID338
Feature activation-2.91e-01

Pos logit contributions
ulp+0.009
aw+0.008
 tune+0.008
 endings+0.008
 tunes+0.008
Neg logit contributions
-0.005
 first-0.005
my-0.004
 Hatt-0.004
 Bear-0.004
Token mom
Token ID1995
Feature activation+1.09e-01

Pos logit contributions
+0.008
 Hatt+0.007
my+0.006
 Bear+0.006
 first+0.006
Neg logit contributions
ulp-0.015
aw-0.013
 tune-0.013
 endings-0.013
OO-0.013
Token put
Token ID1234
Feature activation-1.42e+00

Pos logit contributions
ulp+0.005
aw+0.005
 tune+0.005
ater+0.005
 than+0.005
Neg logit contributions
 Hatt-0.002
-0.002
ishly-0.002
 first-0.002
 Elephant-0.002
Token lot
Token ID1256
Feature activation-1.48e+00

Pos logit contributions
+0.048
 first+0.043
 Hatt+0.043
ll+0.042
ishly+0.039
Neg logit contributions
ulp-0.061
aw-0.058
 tune-0.051
 tunes-0.049
 than-0.049
Tokenion
Token ID295
Feature activation-4.58e+00

Pos logit contributions
+0.049
 first+0.046
 Hatt+0.044
my+0.043
 Bear+0.041
Neg logit contributions
ulp-0.067
aw-0.060
 tune-0.057
 endings-0.056
 tunes-0.055
Token on
Token ID319
Feature activation+3.87e-01

Pos logit contributions
 Hatt+0.160
+0.156
my+0.143
 Grac+0.142
 Elephant+0.141
Neg logit contributions
aw-0.167
ulp-0.161
 than-0.153
izing-0.147
 tune-0.141
Token her
Token ID607
Feature activation+6.19e-01

Pos logit contributions
ulp+0.021
aw+0.018
 tune+0.018
 endings+0.018
 tunes+0.018
Neg logit contributions
-0.010
 first-0.010
 Hatt-0.009
my-0.009
 Bear-0.008
Token arms
Token ID5101
Feature activation+1.90e+00

Pos logit contributions
ulp+0.026
aw+0.023
 endings+0.022
 tune+0.022
 conclusions+0.021
Neg logit contributions
-0.023
 first-0.022
my-0.020
 Hatt-0.020
 Bear-0.019
Token because
Token ID780
Feature activation+3.68e-02

Pos logit contributions
ulp+0.077
aw+0.070
 tune+0.068
ater+0.064
 tunes+0.064
Neg logit contributions
-0.066
 Hatt-0.064
 first-0.062
 Bear-0.059
 Elephant-0.058
Token they
Token ID484
Feature activation+7.39e-01

Pos logit contributions
ulp+0.001
aw+0.001
 tune+0.001
 endings+0.001
 tunes+0.001
Neg logit contributions
-0.001
 first-0.001
 Hatt-0.001
my-0.001
 Bear-0.001
Token so
Token ID523
Feature activation+9.43e-01

Pos logit contributions
my+0.066
+0.066
 Hatt+0.066
 first+0.063
 Bear+0.061
Neg logit contributions
ulp-0.095
aw-0.089
 tune-0.087
 endings-0.084
 tunes-0.083
Token they
Token ID484
Feature activation+1.07e-01

Pos logit contributions
ulp+0.039
aw+0.038
 endings+0.036
izing+0.035
 than+0.035
Neg logit contributions
 Hatt-0.028
-0.025
 first-0.025
my-0.025
 attendant-0.023
Token did
Token ID750
Feature activation-4.10e-02

Pos logit contributions
aw+0.004
ulp+0.004
 som+0.004
 tune+0.004
 tunes+0.003
Neg logit contributions
 Hatt-0.003
 first-0.003
 attendant-0.003
my-0.003
 manager-0.003
Tokenâ
Token ID22940
Feature activation-7.37e-01

Pos logit contributions
+0.001
 first+0.001
 Hatt+0.001
 Bear+0.001
my+0.001
Neg logit contributions
ulp-0.002
 tune-0.002
aw-0.002
 endings-0.002
ater-0.002
Token
Token ID26391
Feature activation-2.47e-01

Pos logit contributions
+0.024
 first+0.023
 Hatt+0.022
my+0.021
 Bear+0.021
Neg logit contributions
ulp-0.033
aw-0.030
 tune-0.029
 endings-0.028
 tunes-0.028
Token
Token ID447
Feature activation-4.57e+00

Pos logit contributions
+0.012
 first+0.010
 Hatt+0.010
my+0.010
 +0.010
Neg logit contributions
ulp-0.009
 tune-0.007
 conclusions-0.007
 tunes-0.007
 endings-0.007
Token
Token ID251
Feature activation+4.69e-01

Pos logit contributions
+0.155
 first+0.149
my+0.143
 Hatt+0.142
 Bear+0.133
Neg logit contributions
ulp-0.192
aw-0.179
 tune-0.166
OO-0.162
 endings-0.161
Tokensw
Token ID2032
Feature activation-1.42e+00

Pos logit contributions
ulp+0.023
aw+0.022
 tune+0.021
 tunes+0.020
 endings+0.020
Neg logit contributions
 first-0.012
-0.011
my-0.011
 Hatt-0.011
 Bear-0.010
Tokenimming
Token ID27428
Feature activation-5.80e-01

Pos logit contributions
+0.053
 first+0.051
 Hatt+0.049
my+0.047
 Bear+0.047
Neg logit contributions
ulp-0.058
aw-0.052
 tune-0.049
OO-0.047
ater-0.047
Token,
Token ID11
Feature activation-3.30e-01

Pos logit contributions
+0.016
 Hatt+0.015
 first+0.015
my+0.014
 Bear+0.013
Neg logit contributions
ulp-0.028
aw-0.026
 tune-0.026
 tunes-0.025
 som-0.024
Token playing
Token ID2712
Feature activation-3.76e-01

Pos logit contributions
+0.010
 first+0.010
 Hatt+0.009
my+0.009
 Bear+0.009
Neg logit contributions
ulp-0.015
aw-0.014
 tune-0.013
 endings-0.013
 tunes-0.013
Token fun
Token ID1257
Feature activation-1.71e+00

Pos logit contributions
+0.014
 first+0.014
 Hatt+0.013
my+0.013
 Bear+0.012
Neg logit contributions
ulp-0.016
aw-0.014
 tune-0.014
 endings-0.014
 tunes-0.013
Token watching
Token ID4964
Feature activation+5.59e-01

Pos logit contributions
+0.055
 first+0.052
my+0.044
 Hatt+0.043
 Ellie+0.043
Neg logit contributions
ulp-0.080
ater-0.071
aw-0.071
OO-0.071
 conclusions-0.070
Token the
Token ID262
Feature activation-1.09e+00

Pos logit contributions
ulp+0.026
aw+0.023
 tune+0.023
 tunes+0.022
 endings+0.022
Neg logit contributions
-0.018
 first-0.017
 Hatt-0.016
my-0.016
 Bear-0.015
Token pretty
Token ID2495
Feature activation-1.52e+00

Pos logit contributions
+0.047
 Hatt+0.045
 first+0.044
my+0.043
 Bear+0.042
Neg logit contributions
ulp-0.037
aw-0.033
 tune-0.033
 tunes-0.031
 endings-0.031
Token fire
Token ID2046
Feature activation-2.33e+00

Pos logit contributions
+0.062
 first+0.060
 Hatt+0.057
my+0.056
 Bear+0.055
Neg logit contributions
ulp-0.056
aw-0.049
 tune-0.047
 endings-0.046
 tunes-0.045
Tokenwork
Token ID1818
Feature activation-4.56e+00

Pos logit contributions
 Hatt+0.072
+0.071
 first+0.065
my+0.062
 Grac+0.060
Neg logit contributions
ulp-0.100
 tune-0.094
aw-0.093
 tunes-0.091
 than-0.089
Token's
Token ID338
Feature activation-1.43e+00

Pos logit contributions
+0.145
 first+0.137
 Hatt+0.128
my+0.127
 Bear+0.122
Neg logit contributions
ulp-0.211
aw-0.189
 tune-0.184
 endings-0.180
 tunes-0.177
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
+0.058
 first+0.055
 Hatt+0.054
my+0.053
 Bear+0.051
Neg logit contributions
ulp-0.054
 tune-0.046
aw-0.046
 endings-0.044
 tunes-0.044
Token It
Token ID632
Feature activation-3.09e-01

Pos logit contributions
+0.005
 first+0.005
 Hatt+0.004
my+0.004
 Bear+0.004
Neg logit contributions
ulp-0.010
aw-0.009
 tune-0.009
 endings-0.009
OO-0.008
Token was
Token ID373
Feature activation+2.50e-01

Pos logit contributions
 Hatt+0.010
 first+0.010
+0.009
my+0.009
ishly+0.009
Neg logit contributions
ulp-0.012
aw-0.012
 tune-0.012
 tunes-0.011
 som-0.011
Token the
Token ID262
Feature activation-3.32e-02

Pos logit contributions
ulp+0.012
aw+0.011
 tune+0.011
 endings+0.011
 tunes+0.011
Neg logit contributions
-0.007
 first-0.007
my-0.006
 Hatt-0.006
 Bear-0.006
Token.
Token ID13
Feature activation+2.50e-01

Pos logit contributions
+0.032
 first+0.030
my+0.026
 Hatt+0.026
 Bear+0.025
Neg logit contributions
ulp-0.048
 tune-0.043
 endings-0.043
aw-0.042
 tunes-0.041
Token They
Token ID1119
Feature activation-6.04e-02

Pos logit contributions
ulp+0.012
aw+0.011
 tune+0.010
 endings+0.010
OO+0.010
Neg logit contributions
 first-0.007
-0.007
 Hatt-0.006
my-0.006
 Bear-0.005
Token put
Token ID1234
Feature activation-1.35e+00

Pos logit contributions
 Hatt+0.002
+0.002
 first+0.002
 Elephant+0.002
 Barry+0.002
Neg logit contributions
ulp-0.002
aw-0.002
 som-0.002
 tune-0.002
ater-0.002
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
+0.051
 Hatt+0.046
 first+0.045
ll+0.044
my+0.043
Neg logit contributions
ulp-0.052
aw-0.051
 tune-0.043
 pounds-0.043
 than-0.042
Token cup
Token ID6508
Feature activation-1.98e+00

Pos logit contributions
+0.085
 first+0.080
 Hatt+0.079
my+0.077
 Bear+0.076
Neg logit contributions
ulp-0.065
aw-0.056
 tune-0.055
 tunes-0.051
 endings-0.051
Tokencakes
Token ID37263
Feature activation-4.54e+00

Pos logit contributions
 Hatt+0.061
+0.061
 Elephant+0.057
 Barry+0.056
 Bear+0.055
Neg logit contributions
ulp-0.087
aw-0.084
 tune-0.073
ater-0.072
 than-0.071
Token on
Token ID319
Feature activation+1.35e-01

Pos logit contributions
+0.144
 Hatt+0.144
 Elephant+0.138
 Bear+0.136
 Barry+0.131
Neg logit contributions
ulp-0.193
aw-0.185
 than-0.166
 tune-0.165
ater-0.158
Token a
Token ID257
Feature activation+4.96e-02

Pos logit contributions
ulp+0.006
aw+0.006
 tune+0.006
 endings+0.006
 tunes+0.006
Neg logit contributions
-0.004
 first-0.004
 Hatt-0.004
my-0.003
 Bear-0.003
Token plate
Token ID7480
Feature activation+9.91e-01

Pos logit contributions
ulp+0.002
aw+0.002
 endings+0.002
 tune+0.002
 heights+0.002
Neg logit contributions
-0.002
 first-0.002
my-0.002
 Hatt-0.002
 Bear-0.002
Token and
Token ID290
Feature activation+6.35e-01

Pos logit contributions
ulp+0.041
aw+0.037
 tune+0.035
ater+0.034
 than+0.033
Neg logit contributions
-0.035
 Hatt-0.034
 first-0.033
my-0.031
 Bear-0.031
Token brought
Token ID3181
Feature activation+1.32e+00

Pos logit contributions
ulp+0.028
aw+0.025
 tune+0.024
 tunes+0.023
 endings+0.023
Neg logit contributions
-0.021
 first-0.020
 Hatt-0.019
my-0.019
 Elephant-0.018
Token.
Token ID13
Feature activation+8.08e-01

Pos logit contributions
+0.003
 first+0.003
my+0.003
 both+0.002
 Bear+0.002
Neg logit contributions
ulp-0.006
 endings-0.005
 tune-0.005
 tunes-0.005
 conclusions-0.005
Token
Token ID198
Feature activation+7.58e-01

Pos logit contributions
ulp+0.044
aw+0.040
 tune+0.039
 endings+0.038
 tunes+0.038
Neg logit contributions
-0.020
 first-0.018
 Hatt-0.017
my-0.016
 Bear-0.016
Token
Token ID198
Feature activation+4.26e-01

Pos logit contributions
ulp+0.038
aw+0.034
 endings+0.032
 conclusions+0.031
 tune+0.031
Neg logit contributions
-0.020
 first-0.019
 Hatt-0.019
ishly-0.019
my-0.017
Token"
Token ID1
Feature activation-1.61e-01

Pos logit contributions
ulp+0.017
aw+0.015
 tune+0.014
ater+0.014
 endings+0.014
Neg logit contributions
 first-0.016
-0.016
 Hatt-0.015
my-0.014
ishly-0.014
TokenOh
Token ID5812
Feature activation+1.42e+00

Pos logit contributions
+0.006
 first+0.006
my+0.006
 Hatt+0.006
 Bear+0.005
Neg logit contributions
ulp-0.007
aw-0.006
 endings-0.005
 tune-0.005
 pounds-0.005
Token no
Token ID645
Feature activation+3.17e+00

Pos logit contributions
ulp+0.067
 tune+0.060
aw+0.059
 tunes+0.058
 endings+0.057
Neg logit contributions
-0.045
 first-0.043
my-0.039
 Hatt-0.039
 Bear-0.037
Token!"
Token ID2474
Feature activation+1.15e+00

Pos logit contributions
ulp+0.166
aw+0.149
 tune+0.149
 endings+0.146
 tunes+0.146
Neg logit contributions
 first-0.083
-0.082
my-0.074
 Bear-0.067
 Hatt-0.066
Token Sam
Token ID3409
Feature activation+3.59e-01

Pos logit contributions
ulp+0.049
aw+0.046
 endings+0.044
 tune+0.042
 conclusions+0.042
Neg logit contributions
-0.036
 first-0.035
ll-0.035
my-0.034
 Hatt-0.033
Token and
Token ID290
Feature activation-1.18e+00

Pos logit contributions
aw+0.015
ulp+0.014
 heights+0.012
 tune+0.012
 tunes+0.012
Neg logit contributions
 Hatt-0.012
 first-0.011
-0.011
 Grac-0.011
 Barry-0.011
Token Max
Token ID5436
Feature activation-4.11e-01

Pos logit contributions
+0.031
 first+0.030
 Hatt+0.027
my+0.027
 Bear+0.026
Neg logit contributions
ulp-0.061
aw-0.055
 tune-0.054
 endings-0.053
 tunes-0.052
Token say
Token ID910
Feature activation-1.27e+00

Pos logit contributions
 Hatt+0.014
 Grac+0.013
 first+0.013
 Bear+0.012
 Elephant+0.012
Neg logit contributions
aw-0.016
ulp-0.014
 ow-0.014
 tune-0.014
 endings-0.013
Token and
Token ID290
Feature activation-2.00e-01

Pos logit contributions
+0.015
 first+0.014
my+0.013
 Hatt+0.013
 Bear+0.013
Neg logit contributions
ulp-0.024
aw-0.021
 endings-0.021
 tune-0.021
 tunes-0.020
Token cou
Token ID2284
Feature activation-4.53e-02

Pos logit contributions
+0.005
 first+0.005
my+0.005
 Hatt+0.005
 Bear+0.005
Neg logit contributions
ulp-0.010
aw-0.009
 tune-0.009
 endings-0.009
ater-0.009
Tokeng
Token ID70
Feature activation-3.11e+00

Pos logit contributions
+0.001
 first+0.001
 Hatt+0.001
my+0.001
 Bear+0.001
Neg logit contributions
ulp-0.002
aw-0.002
 tune-0.002
 endings-0.002
 tunes-0.002
Tokenhed
Token ID704
Feature activation-2.11e+00

Pos logit contributions
+0.114
 first+0.108
 Hatt+0.102
my+0.102
 Bear+0.097
Neg logit contributions
ulp-0.130
aw-0.113
 tune-0.112
 endings-0.109
 tunes-0.107
Token a
Token ID257
Feature activation-5.68e-01

Pos logit contributions
+0.068
 first+0.065
my+0.060
 Hatt+0.060
 Bear+0.057
Neg logit contributions
ulp-0.098
aw-0.087
 tune-0.085
 endings-0.084
 tunes-0.082
Token few
Token ID1178
Feature activation+3.20e+00

Pos logit contributions
+0.019
 first+0.017
my+0.015
 Bear+0.015
 +0.015
Neg logit contributions
ulp-0.026
 endings-0.026
aw-0.025
 heights-0.024
iling-0.023
Token times
Token ID1661
Feature activation+6.48e-01

Pos logit contributions
ulp+0.113
OO+0.100
 endings+0.093
aw+0.092
 tune+0.092
Neg logit contributions
-0.135
 first-0.130
 Bear-0.117
 Hatt-0.115
my-0.112
Token.
Token ID13
Feature activation+7.87e-02

Pos logit contributions
ulp+0.027
aw+0.024
 tune+0.023
 endings+0.022
 tunes+0.022
Neg logit contributions
-0.023
 first-0.022
 Hatt-0.021
my-0.021
 Bear-0.020
Token He
Token ID679
Feature activation+7.84e-02

Pos logit contributions
ulp+0.004
aw+0.004
 tune+0.004
 endings+0.004
 tunes+0.003
Neg logit contributions
-0.002
 first-0.002
 Hatt-0.002
my-0.002
 Bear-0.002
Token felt
Token ID2936
Feature activation+1.38e+00

Pos logit contributions
ulp+0.003
aw+0.003
 tune+0.003
 tunes+0.003
 endings+0.003
Neg logit contributions
-0.002
 first-0.002
 Hatt-0.002
my-0.002
 Bear-0.002
Token a
Token ID257
Feature activation-7.43e-01

Pos logit contributions
ulp+0.071
aw+0.063
 tune+0.062
 endings+0.062
OO+0.061
Neg logit contributions
-0.040
 first-0.036
 Hatt-0.033
my-0.032
 Bear-0.031
Token it
Token ID340
Feature activation-8.45e-01

Pos logit contributions
ulp+0.067
 tune+0.061
OO+0.059
ater+0.058
 tunes+0.058
Neg logit contributions
 first-0.034
-0.033
my-0.029
 Bear-0.028
 Hatt-0.027
Token sailing
Token ID28499
Feature activation-1.15e+00

Pos logit contributions
+0.026
 first+0.025
my+0.023
 Hatt+0.023
 Bear+0.022
Neg logit contributions
ulp-0.040
aw-0.036
 tune-0.035
 endings-0.035
 tunes-0.034
Token away
Token ID1497
Feature activation+4.84e-01

Pos logit contributions
+0.031
 first+0.030
my+0.027
 Hatt+0.025
 Bear+0.025
Neg logit contributions
ulp-0.060
aw-0.053
 tune-0.053
 endings-0.052
 tunes-0.051
Token in
Token ID287
Feature activation+7.62e-02

Pos logit contributions
ulp+0.025
aw+0.022
 tune+0.022
 endings+0.021
 tunes+0.021
Neg logit contributions
 first-0.013
-0.013
my-0.011
 Bear-0.011
 Hatt-0.011
Token the
Token ID262
Feature activation+1.32e+00

Pos logit contributions
ulp+0.004
aw+0.004
 tune+0.004
OO+0.004
 endings+0.004
Neg logit contributions
 first-0.002
-0.002
my-0.002
 Bear-0.002
 Hatt-0.002
Token distance
Token ID5253
Feature activation+4.07e+00

Pos logit contributions
ulp+0.049
aw+0.042
 tune+0.042
 endings+0.040
 tunes+0.040
Neg logit contributions
-0.054
 first-0.053
my-0.049
 Hatt-0.048
 Bear-0.048
Token.
Token ID13
Feature activation+5.25e-01

Pos logit contributions
ulp+0.195
aw+0.170
 tune+0.168
 endings+0.167
OO+0.165
Neg logit contributions
-0.127
 first-0.123
my-0.112
 Bear-0.107
 Hatt-0.106
Token They
Token ID1119
Feature activation+2.32e-01

Pos logit contributions
ulp+0.028
aw+0.025
 tune+0.025
 endings+0.024
 tunes+0.024
Neg logit contributions
-0.013
 first-0.012
 Hatt-0.011
my-0.011
 Bear-0.011
Token could
Token ID714
Feature activation-6.48e-01

Pos logit contributions
ulp+0.009
aw+0.008
 tune+0.008
 som+0.007
ater+0.007
Neg logit contributions
-0.008
 Hatt-0.008
 first-0.008
my-0.008
 Ellie-0.008
Token hear
Token ID3285
Feature activation+2.08e+00

Pos logit contributions
+0.024
 Hatt+0.023
my+0.022
 first+0.021
 Bear+0.021
Neg logit contributions
ulp-0.026
 tune-0.022
 som-0.021
 heights-0.021
aw-0.021
Token the
Token ID262
Feature activation-3.73e-01

Pos logit contributions
ulp+0.108
ater+0.095
OO+0.095
 tune+0.095
 pounds+0.094
Neg logit contributions
-0.055
 first-0.054
 Bear-0.046
 Hatt-0.046
my-0.045
Token started
Token ID2067
Feature activation+2.62e-01

Pos logit contributions
ulp+0.093
aw+0.085
 tune+0.083
 endings+0.080
 tunes+0.079
Neg logit contributions
-0.059
 first-0.056
 Hatt-0.055
my-0.052
 Bear-0.050
Token to
Token ID284
Feature activation+6.34e-01

Pos logit contributions
ulp+0.014
aw+0.013
 endings+0.013
 tune+0.012
 pounds+0.012
Neg logit contributions
-0.007
 first-0.006
my-0.006
 Bear-0.006
 Hatt-0.005
Token rock
Token ID3881
Feature activation+5.17e-01

Pos logit contributions
ulp+0.027
aw+0.024
 tune+0.023
 endings+0.023
 tunes+0.022
Neg logit contributions
-0.022
 first-0.021
 Hatt-0.020
my-0.020
 Bear-0.019
Token back
Token ID736
Feature activation+3.39e-01

Pos logit contributions
ulp+0.024
aw+0.022
 tune+0.021
 tunes+0.020
 than+0.020
Neg logit contributions
-0.015
 Hatt-0.015
 first-0.014
my-0.013
 Bear-0.013
Token and
Token ID290
Feature activation+4.61e-01

Pos logit contributions
ulp+0.016
aw+0.014
 tune+0.014
 endings+0.014
 tunes+0.013
Neg logit contributions
-0.010
 first-0.010
 Hatt-0.009
my-0.009
 Bear-0.009
Token forth
Token ID6071
Feature activation+3.27e+00

Pos logit contributions
ulp+0.019
 tune+0.018
aw+0.017
 endings+0.017
 tunes+0.017
Neg logit contributions
-0.015
 first-0.015
 Hatt-0.014
my-0.013
 Bear-0.013
Token,
Token ID11
Feature activation+1.27e+00

Pos logit contributions
ulp+0.181
aw+0.162
 endings+0.161
 conclusions+0.158
 tune+0.156
Neg logit contributions
-0.082
 first-0.080
my-0.067
 Bear-0.064
 Hatt-0.061
Token and
Token ID290
Feature activation+5.94e-01

Pos logit contributions
ulp+0.058
aw+0.051
 endings+0.049
 tune+0.049
ater+0.048
Neg logit contributions
-0.043
 first-0.042
my-0.038
 Hatt-0.038
 Bear-0.037
Token Nut
Token ID11959
Feature activation+2.17e+00

Pos logit contributions
ulp+0.030
aw+0.028
 tune+0.027
 endings+0.026
 tunes+0.026
Neg logit contributions
-0.015
 first-0.014
 Hatt-0.014
my-0.013
 Bear-0.012
Tokenty
Token ID774
Feature activation+7.98e-01

Pos logit contributions
ulp+0.110
aw+0.100
 tune+0.098
 endings+0.096
 tunes+0.095
Neg logit contributions
-0.058
 first-0.055
 Hatt-0.051
my-0.050
 Bear-0.047
Token became
Token ID2627
Feature activation+1.26e+00

Pos logit contributions
ulp+0.037
aw+0.034
 tune+0.033
 endings+0.032
 tunes+0.031
Neg logit contributions
-0.025
 first-0.024
 Hatt-0.022
my-0.022
 Bear-0.021
Token cat
Token ID3797
Feature activation+6.23e-01

Pos logit contributions
+0.022
 first+0.021
my+0.019
 Hatt+0.019
 Bear+0.019
Neg logit contributions
ulp-0.026
aw-0.023
 endings-0.022
 tune-0.022
 pounds-0.021
Token looked
Token ID3114
Feature activation+5.23e-01

Pos logit contributions
ulp+0.024
aw+0.024
 tune+0.022
 tunes+0.021
 endings+0.021
Neg logit contributions
 Hatt-0.021
-0.021
 first-0.021
 Grac-0.019
my-0.019
Token sad
Token ID6507
Feature activation+3.06e-01

Pos logit contributions
ulp+0.022
aw+0.020
 tune+0.019
 than+0.018
OO+0.018
Neg logit contributions
-0.018
 Hatt-0.017
 first-0.017
my-0.016
 Bear-0.016
Token and
Token ID290
Feature activation-6.92e-01

Pos logit contributions
ulp+0.014
aw+0.013
 tune+0.013
 endings+0.012
 tunes+0.012
Neg logit contributions
-0.010
 first-0.009
my-0.008
 Hatt-0.008
 Bear-0.008
Token had
Token ID550
Feature activation+1.96e+00

Pos logit contributions
+0.024
 first+0.023
 Hatt+0.022
my+0.021
 Bear+0.020
Neg logit contributions
ulp-0.030
aw-0.026
 tune-0.026
 endings-0.025
 tunes-0.025
Token no
Token ID645
Feature activation+3.62e+00

Pos logit contributions
ulp+0.104
aw+0.092
ater+0.092
izing+0.092
 tune+0.091
Neg logit contributions
-0.052
 first-0.052
my-0.045
 Bear-0.041
 Hatt-0.040
Token friends
Token ID2460
Feature activation-4.63e-01

Pos logit contributions
ulp+0.160
aw+0.142
 tune+0.140
 endings+0.136
 tunes+0.134
Neg logit contributions
-0.123
 first-0.115
 Hatt-0.111
my-0.108
 Bear-0.105
Token to
Token ID284
Feature activation+1.21e+00

Pos logit contributions
+0.011
 first+0.010
my+0.010
 Bear+0.008
ll+0.007
Neg logit contributions
ulp-0.027
OO-0.023
aw-0.023
 tune-0.023
izing-0.023
Token play
Token ID711
Feature activation-7.12e-01

Pos logit contributions
ulp+0.055
 tune+0.050
aw+0.048
 conclusions+0.048
 tunes+0.047
Neg logit contributions
-0.038
 Hatt-0.036
 first-0.034
 Bear-0.033
my-0.032
Token with
Token ID351
Feature activation-1.18e+00

Pos logit contributions
+0.021
 first+0.020
my+0.018
 Hatt+0.018
 Bear+0.017
Neg logit contributions
ulp-0.036
aw-0.032
 tune-0.031
 endings-0.030
 pounds-0.030
Token.
Token ID13
Feature activation+3.15e-01

Pos logit contributions
+0.038
 first+0.036
my+0.033
 Hatt+0.033
 Bear+0.032
Neg logit contributions
ulp-0.056
aw-0.049
 tune-0.048
 endings-0.047
OO-0.046
Token happy
Token ID3772
Feature activation-1.85e-01

Pos logit contributions
+0.029
 Hatt+0.029
my+0.027
 Bear+0.027
 first+0.027
Neg logit contributions
ulp-0.031
aw-0.028
 tune-0.026
ater-0.025
 than-0.025
Token.
Token ID13
Feature activation+1.08e-01

Pos logit contributions
+0.006
 first+0.005
my+0.005
 Bear+0.005
 Hatt+0.004
Neg logit contributions
ulp-0.009
aw-0.008
 tune-0.008
OO-0.008
 conclusions-0.008
Token He
Token ID679
Feature activation+5.40e-02

Pos logit contributions
ulp+0.005
aw+0.005
 tune+0.004
 endings+0.004
 than+0.004
Neg logit contributions
 first-0.003
 Hatt-0.003
-0.003
my-0.003
ll-0.003
Token says
Token ID1139
Feature activation-1.54e+00

Pos logit contributions
ulp+0.002
aw+0.002
 som+0.002
 endings+0.002
 tune+0.002
Neg logit contributions
 Hatt-0.002
 first-0.002
-0.002
my-0.002
 Ellie-0.002
Token,
Token ID11
Feature activation+2.84e-01

Pos logit contributions
+0.044
 first+0.041
 Hatt+0.041
my+0.041
 Bear+0.037
Neg logit contributions
ulp-0.074
aw-0.069
 tune-0.065
 endings-0.065
 som-0.063
Token "
Token ID366
Feature activation+7.67e-01

Pos logit contributions
ulp+0.012
aw+0.010
 tune+0.010
 endings+0.010
 tunes+0.010
Neg logit contributions
-0.010
 first-0.010
 Hatt-0.009
my-0.009
 Bear-0.009
TokenWow
Token ID22017
Feature activation+2.42e-01

Pos logit contributions
ulp+0.030
aw+0.027
 tune+0.026
 endings+0.025
 tunes+0.025
Neg logit contributions
-0.030
 first-0.029
 Hatt-0.028
my-0.027
 Bear-0.026
Token,
Token ID11
Feature activation+7.82e-02

Pos logit contributions
ulp+0.011
aw+0.011
OO+0.010
 endings+0.010
 pounds+0.010
Neg logit contributions
-0.007
 Hatt-0.007
 first-0.006
my-0.006
 Bear-0.006
Token thank
Token ID5875
Feature activation+7.79e-01

Pos logit contributions
ulp+0.003
aw+0.003
 endings+0.003
 tune+0.003
 som+0.002
Neg logit contributions
-0.003
 Hatt-0.003
my-0.003
 first-0.003
ll-0.003
Token you
Token ID345
Feature activation-3.41e-01

Pos logit contributions
ulp+0.027
aw+0.026
 endings+0.024
 tune+0.024
 heights+0.023
Neg logit contributions
-0.030
 Hatt-0.030
my-0.029
 first-0.027
 Bear-0.027
Token,
Token ID11
Feature activation-3.18e-01

Pos logit contributions
 Hatt+0.012
+0.011
my+0.010
ishly+0.010
 Grac+0.010
Neg logit contributions
ulp-0.014
aw-0.013
OO-0.012
 tune-0.012
 endings-0.012
Token to
Token ID284
Feature activation+1.18e+00

Pos logit contributions
ulp+0.036
ater+0.033
aw+0.032
OO+0.032
ilt+0.031
Neg logit contributions
 first-0.011
 Bear-0.010
my-0.009
 Brown-0.009
-0.009
Token rock
Token ID3881
Feature activation+9.94e-01

Pos logit contributions
ulp+0.059
aw+0.055
 endings+0.053
 pounds+0.053
izing+0.052
Neg logit contributions
-0.033
 first-0.032
my-0.028
 Bear-0.028
 Hatt-0.027
Token back
Token ID736
Feature activation+3.57e-01

Pos logit contributions
ulp+0.051
 tune+0.045
aw+0.045
 endings+0.044
OO+0.043
Neg logit contributions
-0.027
 first-0.027
my-0.024
 Hatt-0.023
 Bear-0.023
Token and
Token ID290
Feature activation+3.08e-01

Pos logit contributions
ulp+0.018
aw+0.016
 tune+0.016
 endings+0.015
 tunes+0.015
Neg logit contributions
-0.010
 first-0.010
 Hatt-0.009
my-0.009
 Bear-0.009
Token forth
Token ID6071
Feature activation+3.01e+00

Pos logit contributions
ulp+0.015
aw+0.014
 tune+0.013
 endings+0.013
OO+0.013
Neg logit contributions
-0.009
 first-0.009
my-0.008
 Hatt-0.008
 Bear-0.008
Token on
Token ID319
Feature activation+1.13e+00

Pos logit contributions
ulp+0.151
aw+0.136
 tune+0.131
 endings+0.131
 tunes+0.127
Neg logit contributions
-0.087
 first-0.085
my-0.075
 Hatt-0.075
 Bear-0.072
Token his
Token ID465
Feature activation+1.80e+00

Pos logit contributions
ulp+0.067
aw+0.061
OO+0.060
 tune+0.059
 endings+0.059
Neg logit contributions
 first-0.027
-0.025
 Bear-0.022
my-0.019
 -0.019
Token four
Token ID1440
Feature activation+5.36e-01

Pos logit contributions
ulp+0.077
aw+0.069
 endings+0.065
 tune+0.064
OO+0.064
Neg logit contributions
 first-0.064
-0.064
my-0.058
 Hatt-0.056
 Bear-0.055
Token legs
Token ID7405
Feature activation+4.26e+00

Pos logit contributions
ulp+0.023
OO+0.020
 endings+0.018
 tune+0.018
 disease+0.018
Neg logit contributions
-0.021
 first-0.019
my-0.018
 Bear-0.017
 both-0.017
Token while
Token ID981
Feature activation+3.06e+00

Pos logit contributions
ulp+0.185
aw+0.164
 tune+0.160
 endings+0.156
 tunes+0.154
Neg logit contributions
-0.147
 first-0.139
 Hatt-0.132
my-0.130
 Bear-0.125
Token relaxing
Token ID28175
Feature activation+8.56e-02

Pos logit contributions
ulp+0.153
aw+0.140
 endings+0.134
ater+0.134
 tune+0.133
Neg logit contributions
 first-0.088
-0.087
 Bear-0.078
my-0.076
 Hatt-0.073
Token way
Token ID835
Feature activation+1.29e+00

Pos logit contributions
+0.005
 first+0.005
my+0.005
 Bear+0.005
 Hatt+0.005
Neg logit contributions
ulp-0.006
aw-0.005
 tune-0.005
 endings-0.005
 conclusions-0.004
Token home
Token ID1363
Feature activation-1.68e+00

Pos logit contributions
ulp+0.056
 than+0.049
 tune+0.049
aw+0.048
ater+0.044
Neg logit contributions
 Hatt-0.042
-0.036
 Elephant-0.034
 Grac-0.033
ishly-0.033
Token.
Token ID13
Feature activation-2.47e-01

Pos logit contributions
+0.063
 Hatt+0.059
 first+0.058
my+0.057
 Bear+0.054
Neg logit contributions
ulp-0.067
aw-0.058
 tune-0.057
 endings-0.055
 tunes-0.054
Token From
Token ID3574
Feature activation-1.45e+00

Pos logit contributions
 first+0.007
+0.007
 Hatt+0.006
my+0.006
 Bear+0.005
Neg logit contributions
ulp-0.012
aw-0.011
 tune-0.011
 endings-0.011
 tunes-0.010
Token that
Token ID326
Feature activation+2.54e-01

Pos logit contributions
+0.062
 Hatt+0.060
my+0.056
 first+0.055
 Bear+0.055
Neg logit contributions
ulp-0.048
aw-0.047
 tune-0.042
izing-0.041
 than-0.040
Token day
Token ID1110
Feature activation+1.73e+00

Pos logit contributions
ulp+0.011
 tune+0.010
aw+0.009
 endings+0.009
 tunes+0.009
Neg logit contributions
-0.009
 first-0.008
 Hatt-0.008
my-0.008
 Bear-0.007
Token on
Token ID319
Feature activation+8.67e-01

Pos logit contributions
ulp+0.083
aw+0.076
 tune+0.072
 endings+0.071
OO+0.070
Neg logit contributions
-0.050
 first-0.047
 Hatt-0.045
my-0.044
 Bear-0.042
Token,
Token ID11
Feature activation+8.14e-01

Pos logit contributions
ulp+0.042
aw+0.037
 endings+0.036
 tune+0.035
 than+0.035
Neg logit contributions
-0.024
 Hatt-0.022
 first-0.022
my-0.022
 Grac-0.019
Token the
Token ID262
Feature activation+1.80e+00

Pos logit contributions
ulp+0.035
 tune+0.032
aw+0.032
 endings+0.031
 tunes+0.031
Neg logit contributions
-0.026
 first-0.024
 Hatt-0.024
my-0.023
ishly-0.021
Token bunny
Token ID44915
Feature activation+1.65e+00

Pos logit contributions
ulp+0.075
 tune+0.069
aw+0.066
 tunes+0.065
 endings+0.064
Neg logit contributions
-0.063
 first-0.058
 Hatt-0.058
my-0.055
 Bear-0.052
Token and
Token ID290
Feature activation+5.62e-01

Pos logit contributions
ulp+0.077
aw+0.069
 tune+0.068
 endings+0.066
 tunes+0.065
Neg logit contributions
-0.051
 first-0.049
 Hatt-0.046
my-0.045
 Bear-0.043
Token.
Token ID13
Feature activation+4.33e-01

Pos logit contributions
ulp+0.027
aw+0.025
 tune+0.024
 than+0.023
 endings+0.022
Neg logit contributions
-0.028
 Hatt-0.027
 Bear-0.025
 Grac-0.025
 first-0.025
Token\n
Token ID198
Feature activation+5.96e-01

Pos logit contributions
ulp+0.023
aw+0.022
 endings+0.021
 tune+0.020
 heights+0.020
Neg logit contributions
-0.010
 first-0.010
 Hatt-0.009
my-0.008
 Bear-0.008
Token\n
Token ID198
Feature activation+3.74e-01

Pos logit contributions
ulp+0.030
aw+0.028
 endings+0.026
 tune+0.025
 conclusions+0.025
Neg logit contributions
-0.015
 first-0.015
 Hatt-0.015
ishly-0.013
 Bear-0.013
TokenThey
Token ID2990
Feature activation-5.17e-01

Pos logit contributions
ulp+0.012
aw+0.012
 tune+0.010
 endings+0.010
OO+0.010
Neg logit contributions
 Hatt-0.016
 Bear-0.015
 first-0.015
 Grac-0.015
-0.015
Token start
Token ID923
Feature activation+5.58e-02

Pos logit contributions
 Grac+0.021
 Ros+0.021
 Hatt+0.020
 Ellie+0.020
 Elephant+0.020
Neg logit contributions
 som-0.015
aw-0.015
 conclusions-0.014
 heights-0.014
ulp-0.013
Token to
Token ID284
Feature activation+5.58e-01

Pos logit contributions
ulp+0.003
aw+0.003
 tune+0.003
 endings+0.003
 pounds+0.002
Neg logit contributions
-0.002
 first-0.001
my-0.001
 Hatt-0.001
 Bear-0.001
Token climb
Token ID12080
Feature activation+2.45e-01

Pos logit contributions
ulp+0.024
 tune+0.022
 som+0.022
 conclusions+0.021
 heights+0.021
Neg logit contributions
-0.017
 Hatt-0.017
 first-0.016
my-0.015
 Bear-0.015
Token the
Token ID262
Feature activation-1.47e+00

Pos logit contributions
aw+0.009
ulp+0.009
 heights+0.009
 than+0.008
 conclusions+0.008
Neg logit contributions
 Hatt-0.009
-0.009
my-0.008
ll-0.008
 Grac-0.008
Token hill
Token ID12788
Feature activation-5.16e-01

Pos logit contributions
+0.056
 Hatt+0.055
my+0.053
 first+0.052
 Bear+0.051
Neg logit contributions
ulp-0.053
aw-0.048
 heights-0.046
 tune-0.046
 conclusions-0.045
Token.
Token ID13
Feature activation+4.32e-01

Pos logit contributions
+0.020
 Hatt+0.020
 Grac+0.019
 Bear+0.018
my+0.018
Neg logit contributions
ulp-0.018
aw-0.017
 tune-0.016
 than-0.016
 heights-0.015
Token It
Token ID632
Feature activation-6.40e-01

Pos logit contributions
ulp+0.023
aw+0.021
 tune+0.021
 endings+0.020
 tunes+0.020
Neg logit contributions
-0.011
 first-0.010
 Hatt-0.009
my-0.009
 Bear-0.009
Token curtain
Token ID29461
Feature activation+8.45e-01

Pos logit contributions
ulp+0.023
aw+0.020
 tune+0.020
 endings+0.019
 tunes+0.019
Neg logit contributions
-0.019
 first-0.019
 Hatt-0.018
my-0.017
 Bear-0.017
Token got
Token ID1392
Feature activation+1.13e-01

Pos logit contributions
ulp+0.042
aw+0.038
 tune+0.037
 endings+0.037
 tunes+0.036
Neg logit contributions
-0.023
 first-0.022
 Hatt-0.021
my-0.020
 Bear-0.019
Token even
Token ID772
Feature activation+1.56e-01

Pos logit contributions
ulp+0.004
aw+0.004
izing+0.004
 som+0.004
 pounds+0.004
Neg logit contributions
-0.004
 Hatt-0.004
ishly-0.004
ll-0.003
 first-0.003
Token bigger
Token ID5749
Feature activation-1.29e+00

Pos logit contributions
ulp+0.007
aw+0.006
 tune+0.006
 endings+0.006
 tunes+0.006
Neg logit contributions
-0.005
 first-0.005
 Hatt-0.004
my-0.004
 Bear-0.004
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
 first+0.039
+0.037
my+0.033
 Bear+0.032
 Hatt+0.030
Neg logit contributions
ulp-0.066
 tune-0.055
OO-0.055
aw-0.054
 conclusions-0.054
Token\n
Token ID198
Feature activation+5.31e-01

Pos logit contributions
ulp+0.008
aw+0.007
 tune+0.007
 endings+0.007
OO+0.007
Neg logit contributions
-0.003
 first-0.002
 Hatt-0.002
my-0.002
 Bear-0.002
Token\n
Token ID198
Feature activation+2.46e-01

Pos logit contributions
ulp+0.029
aw+0.027
 tune+0.026
 endings+0.025
 tunes+0.025
Neg logit contributions
-0.012
 first-0.011
 Hatt-0.010
my-0.010
 Bear-0.009
TokenTom
Token ID13787
Feature activation-1.62e-01

Pos logit contributions
ulp+0.011
 endings+0.009
 tune+0.009
 conclusions+0.009
 heights+0.009
Neg logit contributions
-0.009
 first-0.009
my-0.008
 Hatt-0.008
 Ellie-0.007
Token and
Token ID290
Feature activation-7.41e-01

Pos logit contributions
 Hatt+0.005
+0.005
 first+0.005
 Grac+0.005
ishly+0.005
Neg logit contributions
ulp-0.006
aw-0.006
 tune-0.006
ater-0.006
 tunes-0.006
Token Max
Token ID5436
Feature activation+1.25e-01

Pos logit contributions
+0.017
 first+0.015
 Hatt+0.014
my+0.013
 Bear+0.012
Neg logit contributions
ulp-0.040
 tune-0.038
aw-0.037
 tunes-0.036
 endings-0.036
Token felt
Token ID2936
Feature activation+1.46e+00

Pos logit contributions
aw+0.005
ulp+0.005
 tune+0.005
 tunes+0.005
 endings+0.005
Neg logit contributions
 first-0.004
-0.004
 Hatt-0.004
 Grac-0.003
 Bear-0.003
Token\n
Token ID198
Feature activation+1.30e-01

Pos logit contributions
ulp+0.022
aw+0.021
 tune+0.020
 endings+0.019
 tunes+0.019
Neg logit contributions
-0.010
 first-0.009
 Hatt-0.009
my-0.009
ishly-0.008
TokenThe
Token ID464
Feature activation-5.24e-01

Pos logit contributions
ulp+0.006
 pounds+0.005
 endings+0.005
aw+0.005
 appearance+0.005
Neg logit contributions
-0.005
 first-0.004
my-0.004
 Hatt-0.004
 -0.003
Token flower
Token ID15061
Feature activation+1.35e+00

Pos logit contributions
+0.021
 first+0.020
 Hatt+0.020
my+0.019
 Bear+0.019
Neg logit contributions
ulp-0.020
aw-0.017
 tune-0.017
 endings-0.016
 tunes-0.016
Token was
Token ID373
Feature activation+1.79e-01

Pos logit contributions
ulp+0.063
aw+0.057
 tune+0.055
 endings+0.053
 tunes+0.053
Neg logit contributions
-0.042
 first-0.040
 Hatt-0.038
my-0.037
 Bear-0.036
Token very
Token ID845
Feature activation-4.66e-01

Pos logit contributions
ulp+0.010
aw+0.009
 endings+0.009
 tune+0.009
 tunes+0.008
Neg logit contributions
-0.004
 first-0.004
my-0.004
 Hatt-0.004
 Bear-0.004
Token happy
Token ID3772
Feature activation-3.46e-01

Pos logit contributions
+0.015
 first+0.014
 Hatt+0.013
my+0.013
 Bear+0.012
Neg logit contributions
ulp-0.022
aw-0.020
 tune-0.019
 endings-0.019
 tunes-0.018
Token because
Token ID780
Feature activation-4.51e-01

Pos logit contributions
 first+0.007
+0.007
 Bear+0.006
my+0.006
 both+0.005
Neg logit contributions
ulp-0.020
 conclusions-0.019
OO-0.019
 pounds-0.019
aw-0.018
Token it
Token ID340
Feature activation+1.14e+00

Pos logit contributions
+0.019
 first+0.018
 Hatt+0.017
my+0.017
 Bear+0.016
Neg logit contributions
ulp-0.017
aw-0.014
 tune-0.014
 endings-0.014
 tunes-0.013
Token was
Token ID373
Feature activation+5.15e-01

Pos logit contributions
ulp+0.057
aw+0.051
 tune+0.050
 endings+0.049
 pounds+0.048
Neg logit contributions
-0.033
 first-0.031
 Hatt-0.029
my-0.028
 Bear-0.027
Token not
Token ID407
Feature activation+7.63e-01

Pos logit contributions
ulp+0.025
aw+0.023
 endings+0.023
 tune+0.023
OO+0.022
Neg logit contributions
-0.015
 first-0.014
my-0.013
 Hatt-0.013
 Bear-0.012
Token lonely
Token ID21757
Feature activation+5.39e-01

Pos logit contributions
ulp+0.039
aw+0.037
 endings+0.036
 conclusions+0.035
 tunes+0.034
Neg logit contributions
 first-0.021
-0.020
my-0.019
 Bear-0.018
 Hatt-0.017
Token They
Token ID1119
Feature activation+1.53e-01

Pos logit contributions
ulp+0.029
aw+0.027
 tune+0.025
 endings+0.024
 conclusions+0.024
Neg logit contributions
 first-0.020
-0.019
 Hatt-0.018
my-0.018
ll-0.017
Token are
Token ID389
Feature activation-1.56e-01

Pos logit contributions
ulp+0.006
aw+0.006
 tune+0.006
 som+0.005
 conclusions+0.005
Neg logit contributions
 Hatt-0.005
 first-0.005
-0.005
 Elephant-0.005
my-0.004
Token happy
Token ID3772
Feature activation-1.36e-02

Pos logit contributions
+0.005
 Hatt+0.005
 first+0.005
my+0.005
 Elephant+0.004
Neg logit contributions
ulp-0.007
aw-0.006
 tune-0.006
 tunes-0.006
 than-0.006
Token.
Token ID13
Feature activation+6.59e-01

Pos logit contributions
+0.000
 first+0.000
my+0.000
 Bear+0.000
 Hatt+0.000
Neg logit contributions
ulp-0.001
aw-0.001
 tune-0.001
OO-0.001
ater-0.001
Token<|endoftext|>
Token ID50256
Feature activation+1.15e+00

Pos logit contributions
ulp+0.033
aw+0.030
 tune+0.029
 endings+0.029
 tunes+0.028
Neg logit contributions
 first-0.017
-0.016
 Hatt-0.015
my-0.015
ishly-0.014
TokenTom
Token ID13787
Feature activation-1.23e+00

Pos logit contributions
ulp+0.054
aw+0.050
 endings+0.046
 tune+0.046
OO+0.045
Neg logit contributions
-0.033
 first-0.032
 Hatt-0.032
ishly-0.029
my-0.029
Token and
Token ID290
Feature activation-2.08e+00

Pos logit contributions
+0.015
 first+0.013
my+0.011
 Hatt+0.010
 Bear+0.009
Neg logit contributions
ulp-0.082
aw-0.076
 tune-0.075
 endings-0.074
 tunes-0.073
Token Mia
Token ID32189
Feature activation-2.06e-01

Pos logit contributions
 first+0.066
 Hatt+0.065
+0.065
ll+0.059
ishly+0.057
Neg logit contributions
ulp-0.089
aw-0.080
 tune-0.078
 endings-0.075
 tunes-0.072
Token are
Token ID389
Feature activation-2.44e+00

Pos logit contributions
+0.005
 first+0.005
 Hatt+0.005
my+0.005
 Bear+0.005
Neg logit contributions
ulp-0.010
aw-0.010
 tune-0.009
 endings-0.009
 tunes-0.009
Token friends
Token ID2460
Feature activation-1.17e+00

Pos logit contributions
 Hatt+0.100
+0.100
 Elephant+0.095
ishly+0.092
 attendant+0.091
Neg logit contributions
ulp-0.094
aw-0.080
OO-0.068
 endings-0.065
 than-0.064
Token.
Token ID13
Feature activation-3.92e-01

Pos logit contributions
+0.037
 first+0.036
my+0.034
 Hatt+0.032
 Bear+0.031
Neg logit contributions
ulp-0.055
 endings-0.048
aw-0.048
 tune-0.048
OO-0.047
Token the
Token ID262
Feature activation-3.05e-01

Pos logit contributions
ulp+0.022
 tune+0.019
 conclusions+0.018
 som+0.018
 heights+0.018
Neg logit contributions
-0.020
 Hatt-0.019
ll-0.017
my-0.017
ishly-0.017
Token bear
Token ID6842
Feature activation-9.49e-01

Pos logit contributions
+0.012
 Hatt+0.011
ll+0.010
my+0.010
 first+0.010
Neg logit contributions
ulp-0.011
 tune-0.010
aw-0.009
 tunes-0.009
 endings-0.009
Token.
Token ID13
Feature activation+9.03e-02

Pos logit contributions
 Hatt+0.038
+0.036
 first+0.036
 Grac+0.034
 Elephant+0.033
Neg logit contributions
ulp-0.033
aw-0.028
 tune-0.028
ater-0.027
 tunes-0.026
Token The
Token ID383
Feature activation-4.82e-01

Pos logit contributions
ulp+0.005
aw+0.004
 tune+0.004
 endings+0.004
 tunes+0.004
Neg logit contributions
-0.003
 first-0.002
 Hatt-0.002
my-0.002
 Bear-0.002
Token bear
Token ID6842
Feature activation+2.40e-01

Pos logit contributions
+0.022
 Hatt+0.021
ll+0.019
ishly+0.019
 first+0.019
Neg logit contributions
ulp-0.015
 tune-0.013
 tunes-0.012
aw-0.011
 heights-0.011
Token was
Token ID373
Feature activation+6.90e-02

Pos logit contributions
ulp+0.011
aw+0.010
 tune+0.010
 tunes+0.009
 endings+0.009
Neg logit contributions
-0.007
 first-0.007
 Hatt-0.007
my-0.006
 Bear-0.006
Token sad
Token ID6507
Feature activation+5.59e-01

Pos logit contributions
ulp+0.003
aw+0.003
 tune+0.003
 tunes+0.003
 than+0.003
Neg logit contributions
-0.002
 Hatt-0.002
 first-0.002
my-0.002
 Bear-0.002
Token too
Token ID1165
Feature activation-1.52e+00

Pos logit contributions
ulp+0.029
aw+0.028
 tune+0.027
 tunes+0.026
 som+0.026
Neg logit contributions
 first-0.016
-0.014
my-0.013
 Bear-0.013
 both-0.011
Token.
Token ID13
Feature activation+7.30e-02

Pos logit contributions
+0.044
 first+0.043
my+0.037
 Bear+0.036
 Hatt+0.035
Neg logit contributions
ulp-0.077
aw-0.070
 tune-0.068
 endings-0.068
 tunes-0.067
Token The
Token ID383
Feature activation-4.48e-01

Pos logit contributions
ulp+0.004
aw+0.004
 tune+0.004
 endings+0.003
ater+0.003
Neg logit contributions
-0.002
 first-0.002
 Hatt-0.001
my-0.001
 Bear-0.001
Token red
Token ID2266
Feature activation+4.15e-01

Pos logit contributions
+0.020
 Hatt+0.019
ll+0.017
ishly+0.017
 Grac+0.017
Neg logit contributions
ulp-0.014
 tune-0.013
 tunes-0.012
 ow-0.011
 heights-0.011
Token whistle
Token ID16121
Feature activation+1.30e+00

Pos logit contributions
ulp+0.007
aw+0.006
 endings+0.006
 conclusions+0.006
OO+0.006
Neg logit contributions
 first-0.006
-0.006
 Bear-0.005
 -0.005
my-0.005
Token.
Token ID13
Feature activation-1.55e-01

Pos logit contributions
ulp+0.062
aw+0.056
 tune+0.056
 endings+0.054
 tunes+0.054
Neg logit contributions
-0.038
 first-0.036
 Hatt-0.034
my-0.033
 Bear-0.032
Token Everywhere
Token ID48201
Feature activation-3.59e-01

Pos logit contributions
+0.004
 first+0.004
 Hatt+0.003
my+0.003
 Bear+0.003
Neg logit contributions
ulp-0.008
aw-0.007
 tune-0.007
 endings-0.007
 tunes-0.007
Token she
Token ID673
Feature activation-8.51e-01

Pos logit contributions
+0.009
 first+0.009
 Hatt+0.008
my+0.008
 Bear+0.007
Neg logit contributions
ulp-0.019
aw-0.017
 tune-0.017
 endings-0.016
 tunes-0.016
Token went
Token ID1816
Feature activation-3.50e-02

Pos logit contributions
+0.021
 first+0.019
my+0.017
 Hatt+0.017
 Bear+0.016
Neg logit contributions
ulp-0.048
aw-0.042
 endings-0.041
 tune-0.041
OO-0.041
Token she
Token ID673
Feature activation-3.33e-01

Pos logit contributions
+0.001
 first+0.001
 Hatt+0.001
my+0.001
 Bear+0.001
Neg logit contributions
ulp-0.002
aw-0.002
 tune-0.002
 endings-0.002
 tunes-0.002
Token made
Token ID925
Feature activation-6.18e-01

Pos logit contributions
+0.010
 first+0.009
my+0.008
 Hatt+0.008
 Bear+0.008
Neg logit contributions
ulp-0.017
aw-0.015
 tune-0.015
 endings-0.014
OO-0.014
Token people
Token ID661
Feature activation-2.66e-01

Pos logit contributions
+0.019
 first+0.018
my+0.017
 Hatt+0.017
 Bear+0.016
Neg logit contributions
ulp-0.030
aw-0.026
OO-0.025
 endings-0.025
 tune-0.025
Token smile
Token ID8212
Feature activation+2.43e-01

Pos logit contributions
+0.008
 first+0.007
 Hatt+0.007
 Bear+0.007
my+0.007
Neg logit contributions
ulp-0.012
 tune-0.012
aw-0.011
 tunes-0.011
 endings-0.011
Token and
Token ID290
Feature activation+1.01e+00

Pos logit contributions
ulp+0.012
aw+0.010
 endings+0.010
 tune+0.010
 conclusions+0.010
Neg logit contributions
-0.008
 first-0.007
my-0.007
 Hatt-0.007
 Bear-0.006
Token laugh
Token ID6487
Feature activation+2.02e+00

Pos logit contributions
ulp+0.045
 tune+0.042
aw+0.041
 tunes+0.041
 endings+0.040
Neg logit contributions
-0.030
 first-0.029
 Hatt-0.028
my-0.027
 Bear-0.026
Token't
Token ID470
Feature activation-1.03e+00

Pos logit contributions
 Hatt+0.029
my+0.028
 Grac+0.026
 Bear+0.025
 Ros+0.024
Neg logit contributions
aw-0.049
ulp-0.049
 som-0.047
OO-0.044
 conclusions-0.044
Token believe
Token ID1975
Feature activation+3.87e-01

Pos logit contributions
+0.034
my+0.032
 Hatt+0.032
 first+0.032
 Bear+0.030
Neg logit contributions
ulp-0.044
 tune-0.041
 tunes-0.038
 conclusions-0.038
 endings-0.038
Token how
Token ID703
Feature activation-1.29e-01

Pos logit contributions
ulp+0.019
aw+0.017
OO+0.016
 tune+0.016
 endings+0.016
Neg logit contributions
 first-0.012
-0.011
 Bear-0.010
 Ellie-0.010
 both-0.010
Token much
Token ID881
Feature activation-5.37e-01

Pos logit contributions
+0.004
 first+0.004
 Hatt+0.004
my+0.004
 Bear+0.004
Neg logit contributions
ulp-0.006
aw-0.005
 tune-0.005
 endings-0.005
 tunes-0.005
Token everyone
Token ID2506
Feature activation+1.44e-01

Pos logit contributions
+0.019
 first+0.018
 Hatt+0.017
my+0.017
 Bear+0.016
Neg logit contributions
ulp-0.023
aw-0.020
 tune-0.020
 endings-0.019
 tunes-0.019
Token was
Token ID373
Feature activation-1.78e-01

Pos logit contributions
ulp+0.008
aw+0.007
 tune+0.007
 endings+0.007
OO+0.007
Neg logit contributions
-0.004
 first-0.003
my-0.003
 Hatt-0.003
 Bear-0.003
Token enjoying
Token ID13226
Feature activation+1.84e+00

Pos logit contributions
+0.006
 first+0.006
 Hatt+0.006
my+0.006
 Bear+0.005
Neg logit contributions
ulp-0.008
aw-0.007
 tune-0.007
 endings-0.006
 tunes-0.006
Token his
Token ID465
Feature activation+1.40e+00

Pos logit contributions
ulp+0.097
aw+0.086
OO+0.084
 tune+0.083
 endings+0.082
Neg logit contributions
 first-0.050
-0.049
 Hatt-0.042
 Bear-0.041
my-0.040
Token things
Token ID1243
Feature activation+5.76e-02

Pos logit contributions
ulp+0.058
aw+0.052
 tune+0.050
 endings+0.049
 tunes+0.048
Neg logit contributions
-0.051
 first-0.050
my-0.046
 Hatt-0.045
 Bear-0.044
Token!
Token ID0
Feature activation+7.19e-01

Pos logit contributions
ulp+0.002
aw+0.002
 tune+0.002
 tunes+0.002
 endings+0.002
Neg logit contributions
-0.002
 Hatt-0.002
 first-0.002
my-0.002
 Bear-0.002
Token He
Token ID679
Feature activation+1.82e-01

Pos logit contributions
ulp+0.034
aw+0.033
 endings+0.030
OO+0.028
 conclusions+0.028
Neg logit contributions
 first-0.021
 Hatt-0.019
-0.019
my-0.017
ishly-0.017
TokenMom
Token ID29252
Feature activation+6.79e-01

Pos logit contributions
ulp+0.048
aw+0.041
 tune+0.039
 endings+0.039
OO+0.038
Neg logit contributions
-0.043
 first-0.040
my-0.039
 Hatt-0.038
 Bear-0.037
Token,
Token ID11
Feature activation+4.18e-01

Pos logit contributions
ulp+0.031
aw+0.029
OO+0.028
 tune+0.027
 pounds+0.027
Neg logit contributions
 Hatt-0.019
-0.018
 first-0.016
ishly-0.015
 Grac-0.015
Token can
Token ID460
Feature activation-9.92e-01

Pos logit contributions
ulp+0.019
aw+0.017
 tune+0.017
 endings+0.017
 tunes+0.016
Neg logit contributions
-0.013
 first-0.012
 Hatt-0.012
my-0.011
 Bear-0.011
Token I
Token ID314
Feature activation-1.38e-01

Pos logit contributions
 Hatt+0.039
+0.039
my+0.039
 first+0.038
sy+0.038
Neg logit contributions
ulp-0.028
 tune-0.026
 tunes-0.025
 som-0.024
izing-0.024
Token get
Token ID651
Feature activation-6.39e-01

Pos logit contributions
 Hatt+0.005
ishly+0.004
 first+0.004
 Grac+0.004
 Bear+0.004
Neg logit contributions
ulp-0.005
ater-0.005
 tune-0.005
 than-0.004
 som-0.004
Token this
Token ID428
Feature activation-2.79e+00

Pos logit contributions
+0.026
ishly+0.026
ll+0.025
 Hatt+0.025
my+0.024
Neg logit contributions
ulp-0.019
aw-0.018
 tune-0.016
 pounds-0.016
 som-0.016
Token car
Token ID1097
Feature activation-2.60e+00

Pos logit contributions
+0.107
 Hatt+0.103
 first+0.099
my+0.097
 Bear+0.095
Neg logit contributions
ulp-0.105
 tune-0.091
aw-0.089
 tunes-0.085
 endings-0.083
Token?"
Token ID1701
Feature activation-1.50e-01

Pos logit contributions
+0.091
 Hatt+0.088
 Bear+0.079
 first+0.079
 Grac+0.078
Neg logit contributions
ulp-0.102
aw-0.094
 tune-0.093
 than-0.088
ater-0.088
Token His
Token ID2399
Feature activation-7.33e-01

Pos logit contributions
+0.004
 first+0.004
 Hatt+0.004
my+0.004
 Bear+0.003
Neg logit contributions
ulp-0.007
aw-0.007
 tune-0.007
 endings-0.006
 tunes-0.006
Token mom
Token ID1995
Feature activation-7.52e-01

Pos logit contributions
+0.023
 Hatt+0.021
my+0.018
ishly+0.017
 Grac+0.017
Neg logit contributions
ulp-0.036
 tune-0.031
 tunes-0.029
 ow-0.029
 heights-0.029
Token looked
Token ID3114
Feature activation-1.03e-01

Pos logit contributions
 Hatt+0.025
+0.022
 first+0.022
 Barry+0.021
 First+0.021
Neg logit contributions
aw-0.029
ulp-0.028
 tune-0.027
 tunes-0.025
ater-0.024
Token
Token ID198
Feature activation+4.08e-01

Pos logit contributions
+0.002
 first+0.002
 Hatt+0.002
my+0.002
 Bear+0.002
Neg logit contributions
ulp-0.004
aw-0.004
 tune-0.003
 endings-0.003
 tunes-0.003
Token
Token ID198
Feature activation+1.45e-01

Pos logit contributions
ulp+0.021
aw+0.020
 tune+0.019
 endings+0.018
 tunes+0.018
Neg logit contributions
 first-0.009
-0.009
 Hatt-0.009
my-0.008
 Bear-0.008
TokenAlice
Token ID44484
Feature activation+2.14e-01

Pos logit contributions
ulp+0.007
aw+0.006
 tune+0.006
 endings+0.006
 tunes+0.006
Neg logit contributions
-0.005
 first-0.004
 Hatt-0.004
my-0.004
 Bear-0.004
Token poured
Token ID19036
Feature activation-1.11e+00

Pos logit contributions
ulp+0.009
aw+0.009
 tune+0.009
 tunes+0.008
ater+0.008
Neg logit contributions
 first-0.007
 Hatt-0.006
-0.006
my-0.006
 Ellie-0.006
Token the
Token ID262
Feature activation-2.04e+00

Pos logit contributions
+0.037
 Hatt+0.036
 first+0.034
 Elephant+0.033
my+0.032
Neg logit contributions
ulp-0.049
aw-0.044
 tune-0.041
 tunes-0.039
 pounds-0.039
Token popcorn
Token ID38914
Feature activation-2.98e+00

Pos logit contributions
+0.085
 first+0.085
 Bear+0.076
my+0.075
 Hatt+0.072
Neg logit contributions
ulp-0.078
aw-0.069
 endings-0.068
 tune-0.064
 conclusions-0.064
Token into
Token ID656
Feature activation-9.45e-01

Pos logit contributions
+0.080
 Hatt+0.078
my+0.073
 first+0.072
 Elephant+0.067
Neg logit contributions
ulp-0.149
aw-0.134
 tune-0.132
ater-0.126
 tunes-0.126
Token a
Token ID257
Feature activation-9.54e-01

Pos logit contributions
+0.026
 first+0.025
 Hatt+0.023
my+0.022
 Bear+0.022
Neg logit contributions
ulp-0.048
aw-0.044
 tune-0.042
 endings-0.042
 tunes-0.041
Token big
Token ID1263
Feature activation-2.32e-01

Pos logit contributions
+0.034
 first+0.032
 Hatt+0.030
my+0.030
 Bear+0.029
Neg logit contributions
ulp-0.041
aw-0.036
 endings-0.035
 tune-0.035
 tunes-0.034
Token bowl
Token ID9396
Feature activation-3.13e-01

Pos logit contributions
+0.008
 first+0.007
my+0.006
 Bear+0.006
 Hatt+0.006
Neg logit contributions
ulp-0.011
aw-0.010
 endings-0.010
 tune-0.009
 tunes-0.009
Token.
Token ID13
Feature activation-4.21e-01

Pos logit contributions
+0.010
 first+0.010
 Hatt+0.009
my+0.009
 Bear+0.008
Neg logit contributions
ulp-0.014
aw-0.013
 tune-0.013
 endings-0.012
 tunes-0.012
Token Ben
Token ID3932
Feature activation+7.27e-01

Pos logit contributions
ulp+0.046
aw+0.043
 tune+0.040
 endings+0.039
 tunes+0.038
Neg logit contributions
-0.038
 first-0.036
 Hatt-0.035
my-0.034
 Bear-0.033
Token.
Token ID13
Feature activation+3.89e-01

Pos logit contributions
ulp+0.031
aw+0.029
 tune+0.027
 endings+0.026
ater+0.026
Neg logit contributions
-0.024
 first-0.023
 Hatt-0.023
my-0.022
 Bear-0.021
Token They
Token ID1119
Feature activation-1.54e-01

Pos logit contributions
ulp+0.022
aw+0.020
 tune+0.020
 endings+0.019
 tunes+0.019
Neg logit contributions
-0.009
 first-0.008
 Hatt-0.007
my-0.007
 Bear-0.007
Token come
Token ID1282
Feature activation-7.55e-01

Pos logit contributions
 Hatt+0.005
 first+0.005
+0.005
my+0.005
 Bear+0.005
Neg logit contributions
 som-0.005
aw-0.005
ulp-0.005
ater-0.005
 conclusions-0.005
Token out
Token ID503
Feature activation-2.32e+00

Pos logit contributions
 Hatt+0.024
 Story+0.022
+0.022
 Grac+0.021
 Bear+0.021
Neg logit contributions
ulp-0.030
 tune-0.028
 than-0.028
aw-0.027
OO-0.027
Token of
Token ID286
Feature activation-2.77e+00

Pos logit contributions
+0.078
 Hatt+0.073
 first+0.073
my+0.070
 Bear+0.067
Neg logit contributions
ulp-0.100
 tune-0.088
aw-0.087
 tunes-0.083
 endings-0.082
Token the
Token ID262
Feature activation-1.21e+00

Pos logit contributions
+0.080
 first+0.077
 Hatt+0.071
my+0.070
 Bear+0.067
Neg logit contributions
ulp-0.136
aw-0.123
 tune-0.120
 endings-0.118
 tunes-0.116
Token tube
Token ID12403
Feature activation-1.96e+00

Pos logit contributions
+0.039
 first+0.037
 Hatt+0.037
my+0.034
ishly+0.034
Neg logit contributions
ulp-0.053
 tune-0.047
aw-0.047
 endings-0.045
 tunes-0.045
Token.
Token ID13
Feature activation+4.22e-01

Pos logit contributions
 Hatt+0.074
+0.072
 first+0.068
 Grac+0.067
 Bear+0.067
Neg logit contributions
ulp-0.073
aw-0.069
 tune-0.064
 than-0.062
ater-0.061
Token They
Token ID1119
Feature activation+2.07e-03

Pos logit contributions
ulp+0.022
aw+0.020
 tune+0.019
 endings+0.019
 tunes+0.019
Neg logit contributions
-0.011
 first-0.011
my-0.010
 Hatt-0.010
 Bear-0.009
Token are
Token ID389
Feature activation-4.81e-01

Pos logit contributions
 som+0.000
aw+0.000
ater+0.000
 conclusions+0.000
ulp+0.000
Neg logit contributions
 Hatt-0.000
 Bear-0.000
ll-0.000
my-0.000
 Ellie-0.000
Token Lily
Token ID20037
Feature activation+1.07e-01

Pos logit contributions
+0.027
 first+0.024
 Hatt+0.024
my+0.023
 Bear+0.022
Neg logit contributions
ulp-0.035
aw-0.031
 tune-0.030
 tunes-0.029
 endings-0.029
Token.
Token ID13
Feature activation+3.05e-02

Pos logit contributions
ulp+0.004
 tune+0.004
aw+0.004
 than+0.004
 tunes+0.004
Neg logit contributions
 Hatt-0.004
 first-0.003
-0.003
 Grac-0.003
 Bear-0.003
Token They
Token ID1119
Feature activation-1.42e-01

Pos logit contributions
ulp+0.001
aw+0.001
 tune+0.001
 endings+0.001
 som+0.001
Neg logit contributions
-0.001
 Hatt-0.001
 first-0.001
my-0.001
 Elephant-0.001
Token got
Token ID1392
Feature activation+1.63e-02

Pos logit contributions
+0.005
ishly+0.005
 attendant+0.005
 Hatt+0.005
 Barry+0.005
Neg logit contributions
aw-0.004
ulp-0.004
ater-0.004
 som-0.004
 tunes-0.004
Token the
Token ID262
Feature activation-1.56e+00

Pos logit contributions
aw+0.001
ulp+0.001
 som+0.000
 tunes+0.000
 heights+0.000
Neg logit contributions
ishly-0.001
ll-0.001
-0.001
 Elephant-0.001
 Hatt-0.001
Token boxes
Token ID10559
Feature activation-3.32e+00

Pos logit contributions
+0.067
ll+0.057
ishly+0.057
 Barry+0.054
 Bear+0.054
Neg logit contributions
ulp-0.047
 pounds-0.043
 sensation-0.043
 than-0.043
ater-0.043
Token and
Token ID290
Feature activation-4.21e-01

Pos logit contributions
 Hatt+0.145
+0.137
 Bear+0.135
 Elephant+0.134
 Grac+0.133
Neg logit contributions
ulp-0.099
 than-0.093
aw-0.083
 tune-0.081
izing-0.076
Token tape
Token ID9154
Feature activation-8.38e-01

Pos logit contributions
ll+0.015
+0.015
ishly+0.015
 first+0.014
 Elephant+0.014
Neg logit contributions
ulp-0.014
aw-0.013
 tunes-0.013
 som-0.013
 tune-0.013
Token and
Token ID290
Feature activation+1.99e-02

Pos logit contributions
+0.032
 Hatt+0.030
 first+0.028
 Bear+0.028
my+0.027
Neg logit contributions
ulp-0.032
aw-0.029
 tune-0.027
 than-0.027
 tunes-0.026
Token paint
Token ID7521
Feature activation-2.54e-01

Pos logit contributions
ulp+0.001
aw+0.001
 tunes+0.001
 tune+0.001
 som+0.001
Neg logit contributions
ll-0.001
-0.001
ishly-0.001
 Elephant-0.001
 Bear-0.001
Token from
Token ID422
Feature activation-1.28e+00

Pos logit contributions
+0.010
ll+0.009
 Hatt+0.009
 first+0.009
ishly+0.009
Neg logit contributions
aw-0.009
ulp-0.009
 than-0.008
 tunes-0.008
 tune-0.008
Token bending
Token ID30808
Feature activation-1.78e-01

Pos logit contributions
+0.043
 first+0.040
 Hatt+0.039
my+0.038
 Bear+0.036
Neg logit contributions
ulp-0.065
aw-0.058
 tune-0.057
 endings-0.055
 tunes-0.055
Token down
Token ID866
Feature activation+3.79e-01

Pos logit contributions
+0.005
 first+0.005
my+0.005
 Hatt+0.004
 Bear+0.004
Neg logit contributions
ulp-0.009
aw-0.008
 tune-0.008
 endings-0.008
 tunes-0.008
Token to
Token ID284
Feature activation+4.29e-02

Pos logit contributions
ulp+0.017
aw+0.015
 tune+0.015
 endings+0.014
 tunes+0.014
Neg logit contributions
-0.012
 first-0.011
 Hatt-0.011
my-0.011
 Bear-0.010
Token snap
Token ID11495
Feature activation+4.63e-01

Pos logit contributions
ulp+0.002
aw+0.002
 endings+0.002
 tune+0.002
 tunes+0.002
Neg logit contributions
-0.001
 first-0.001
my-0.001
 Bear-0.001
 Hatt-0.001
Token the
Token ID262
Feature activation-1.43e+00

Pos logit contributions
ulp+0.021
aw+0.019
 tune+0.018
 endings+0.018
 tunes+0.018
Neg logit contributions
-0.015
 first-0.014
 Hatt-0.013
my-0.013
 Bear-0.013
Token box
Token ID3091
Feature activation-2.74e+00

Pos logit contributions
+0.046
 Hatt+0.045
 first+0.042
my+0.042
ll+0.041
Neg logit contributions
ulp-0.061
 tune-0.057
aw-0.055
 pounds-0.054
 tunes-0.054
Token open
Token ID1280
Feature activation-1.48e+00

Pos logit contributions
 Hatt+0.095
+0.092
 first+0.088
 Bear+0.087
my+0.087
Neg logit contributions
ulp-0.110
aw-0.099
 tune-0.098
 than-0.092
 pounds-0.091
Token.
Token ID13
Feature activation-4.62e-01

Pos logit contributions
+0.051
 first+0.049
 Hatt+0.048
my+0.046
 Bear+0.045
Neg logit contributions
ulp-0.062
aw-0.056
 tune-0.054
 endings-0.052
 tunes-0.051
Token As
Token ID1081
Feature activation+8.28e-01

Pos logit contributions
 Hatt+0.014
my+0.013
+0.013
 first+0.012
ll+0.012
Neg logit contributions
ulp-0.020
aw-0.019
 endings-0.018
 heights-0.018
 tune-0.017
Token soon
Token ID2582
Feature activation-9.93e-01

Pos logit contributions
ulp+0.034
 tune+0.032
 than+0.032
 heights+0.031
aw+0.031
Neg logit contributions
my-0.025
-0.025
 Hatt-0.025
ishly-0.023
ll-0.021
Token as
Token ID355
Feature activation+1.93e-02

Pos logit contributions
+0.034
 first+0.032
 Hatt+0.030
 Bear+0.028
my+0.028
Neg logit contributions
ulp-0.045
aw-0.039
 tune-0.039
 endings-0.038
ater-0.038