TOP ACTIVATIONS
MAX = 9.057

had learned to appreciate the therm ometer . <|endoftext|> John lay
. She had the strongest therm ometer in the town and
him how to use the therm ometer and soon John knew
he know that the mysterious therm ometer that he had seen
that he had used the m op in an organized way
said it was the best ash tr ay they had ever
checked . She put the therm ometer next to his forehead
. She put on the m ittens and gave Emily a
't too sure what the therm ometer was for . She
and they bought the new therm ometer ! <|endoftext|> Once upon
Red thought about what the m op had said , and
. She loved using the m op and making the floors
They took turns putting the therm ometer in their mouths and
was surprised how harsh the m op was ! He scrub
hug , even though the therm ometer felt cold . The
" Look how generous the fire is , burning and giving
too sure what the therm ometer was for . She wasn
surprised to see that the therm ometer was very high .
worked so hard on the ash tr ay , and now
. Can we bring the m op , too ?"

MIN ACTIVATIONS
MIN = -11.718

there was a girl named Ph oe be who liked to
there were two friends , Ab i and Ben . They
her brown fur . <|endoftext|> Ark ie wanted his own equipment
there were two friends , Ab ig ail and Mat ty
was a little girl named Ab ig ail . She was
moon . <|endoftext|> Once upon a time , there was a
slides . <|endoftext|> Once upon a time , there was a
two little girls , named Ab bie and Daisy . Ab
garden . <|endoftext|> Once upon a time , there was a
castle . <|endoftext|> Once upon a time , there was a
bit of magic . <|endoftext|> Isa ac was preparing breakfast in
book . <|endoftext|> Once upon a time , there was a
soup . <|endoftext|> Once upon a time , there was a
Daddy !" <|endoftext|> Once upon a time , there was a
two friends , Peter and Ab ig ail . They were
there was a girl called Ab bie . She wanted to
Once upon a time , Jack and his sister Mia were
aser . <|endoftext|> Once upon a time , a friendly helicopter
safe . <|endoftext|> Once upon a time , there was a
world . <|endoftext|> Once upon a time , there was a

INTERVAL 3.863 - 9.057
CONTAINS 8.049%

finally got the carrots out . " Thank you , Squirrel
But nothing happens . They do not fly or
fun with Molly 's knot ! <|endoftext|> Once upon a time
Lily was a fork expert . One night
says . " The fire truck is so big and strong

INTERVAL -1.330 - 3.863
CONTAINS 60.000%

it wonderful how nature teaches us all kinds of things ?"
do and she said , she would figure it out soon
The fish let go and he ran away as fast as
were new and playful things to find . The tunnel seemed
a big , juicy banana . He was so excited to

INTERVAL -6.524 - -1.330
CONTAINS 30.152%

home from school when he heard a noise coming from the
Once upon a time , there was a boy named Tim
the secret room . She smiled and went to get breakfast
his toys . Tim my got mad and said , "
Soon the pony got stronger . She would play

INTERVAL -11.718 - -6.524
CONTAINS 1.799%

having fun , Lily ." <|endoftext|> Once upon a time ,
upon a time , there was a boy who had an
adventure . <|endoftext|> Once upon a time there was a lady
. <|endoftext|> Once upon a time , there was a little
rice . <|endoftext|> Once upon a time , there was a
Token had
Token ID550
Feature activation-1.35e+00

Pos logit contributions
 sle+0.084
 roof+0.084
 mast+0.081
tr+0.080
 Dragons+0.080
Neg logit contributions
ited-0.073
ised-0.064
ere-0.061
illed-0.059
ced-0.059
Token learned
Token ID4499
Feature activation-8.85e-01

Pos logit contributions
ited+0.023
ised+0.020
illed+0.018
ere+0.018
ced+0.018
Neg logit contributions
 roof-0.028
 sle-0.027
tr-0.026
 mast-0.026
bul-0.025
Token to
Token ID284
Feature activation-1.19e+00

Pos logit contributions
ited+0.014
ised+0.012
ere+0.011
illed+0.011
ced+0.011
Neg logit contributions
 sle-0.019
 roof-0.019
tr-0.019
 globe-0.018
bul-0.018
Token appreciate
Token ID9144
Feature activation-1.33e+00

Pos logit contributions
ited+0.025
ised+0.024
illed+0.022
ced+0.022
ere+0.021
Neg logit contributions
 roof-0.019
 sle-0.019
tr-0.018
 mast-0.017
 globe-0.017
Token the
Token ID262
Feature activation+3.12e+00

Pos logit contributions
ited+0.020
ised+0.017
 disagreed+0.016
ere+0.015
illed+0.015
Neg logit contributions
 sle-0.030
 roof-0.029
 globe-0.028
tr-0.028
 mast-0.028
Token therm
Token ID21969
Feature activation+9.06e+00

Pos logit contributions
 mast+0.054
 sle+0.054
 Dragons+0.053
 roof+0.053
tr+0.053
Neg logit contributions
ited-0.060
ised-0.054
ere-0.051
ced-0.051
 disagreed-0.051
Tokenometer
Token ID15635
Feature activation+6.06e+00

Pos logit contributions
 mast+0.080
 roof+0.075
 sle+0.071
 Dragons+0.070
 globe+0.067
Neg logit contributions
ised-0.240
ited-0.228
ere-0.222
illed-0.220
ced-0.220
Token.
Token ID13
Feature activation+3.94e+00

Pos logit contributions
 sle+0.133
 roof+0.128
 Dragons+0.128
 mast+0.128
tr+0.127
Neg logit contributions
ited-0.090
ised-0.079
 disagreed-0.074
ced-0.073
ere-0.070
Token<|endoftext|>
Token ID50256
Feature activation-6.09e+00

Pos logit contributions
 sle+0.089
 sy+0.087
 mast+0.086
 roof+0.085
tr+0.083
Neg logit contributions
ised-0.054
ited-0.054
ere-0.049
illed-0.049
 disagreed-0.049
TokenJohn
Token ID7554
Feature activation-5.97e+00

Pos logit contributions
ited+0.130
ised+0.115
 disagreed+0.110
ere+0.106
ced+0.106
Neg logit contributions
 sle-0.098
 globe-0.096
 roof-0.094
tr-0.094
 Dragons-0.092
Token lay
Token ID3830
Feature activation-3.78e+00

Pos logit contributions
ited+0.096
ised+0.093
ced+0.082
illed+0.081
eful+0.079
Neg logit contributions
 sle-0.120
 roof-0.115
tr-0.114
 globe-0.112
 mast-0.112
Token.
Token ID13
Feature activation+4.64e+00

Pos logit contributions
 sle+0.010
 roof+0.010
tr+0.010
 mast+0.010
 Dragons+0.010
Neg logit contributions
ited-0.008
ised-0.007
 disagreed-0.006
ced-0.006
ere-0.006
Token She
Token ID1375
Feature activation+2.13e+00

Pos logit contributions
 mast+0.104
 sle+0.100
 roof+0.099
bul+0.095
tr+0.095
Neg logit contributions
ited-0.065
ere-0.061
ised-0.057
illed-0.057
 disagreed-0.057
Token had
Token ID550
Feature activation-2.20e+00

Pos logit contributions
 sle+0.048
 roof+0.047
tr+0.046
 mast+0.046
 globe+0.045
Neg logit contributions
ited-0.031
ised-0.027
ere-0.024
 disagreed-0.024
ced-0.024
Token the
Token ID262
Feature activation+1.92e+00

Pos logit contributions
ited+0.037
ised+0.037
illed+0.031
eful+0.029
ere+0.029
Neg logit contributions
tr-0.039
 sle-0.038
 roof-0.038
 sy-0.037
bul-0.036
Token strongest
Token ID12841
Feature activation+2.81e+00

Pos logit contributions
 sle+0.034
 roof+0.033
 mast+0.033
 Dragons+0.033
tr+0.032
Neg logit contributions
ited-0.037
ised-0.033
 disagreed-0.031
ced-0.031
ere-0.031
Token therm
Token ID21969
Feature activation+8.93e+00

Pos logit contributions
 sle+0.058
 roof+0.057
tr+0.055
bul+0.054
 mast+0.054
Neg logit contributions
ited-0.046
ised-0.041
ere-0.037
illed-0.037
 disagreed-0.037
Tokenometer
Token ID15635
Feature activation+6.03e+00

Pos logit contributions
 roof+0.074
 Dragons+0.070
 mast+0.069
 sle+0.067
 mant+0.056
Neg logit contributions
ised-0.242
ited-0.235
ced-0.228
illed-0.227
ere-0.227
Token in
Token ID287
Feature activation-5.01e-01

Pos logit contributions
 sle+0.138
 roof+0.134
 mast+0.133
 Dragons+0.133
tr+0.132
Neg logit contributions
ited-0.084
ised-0.073
 disagreed-0.068
ced-0.067
ere-0.066
Token the
Token ID262
Feature activation+2.50e+00

Pos logit contributions
ited+0.007
ised+0.006
 disagreed+0.005
illed+0.005
ced+0.005
Neg logit contributions
 sle-0.012
 roof-0.011
 globe-0.011
bul-0.011
 mast-0.011
Token town
Token ID3240
Feature activation+3.19e+00

Pos logit contributions
 sle+0.044
 roof+0.043
tr+0.042
 mast+0.042
 Dragons+0.042
Neg logit contributions
ited-0.049
ised-0.044
 disagreed-0.041
ere-0.041
ced-0.041
Token and
Token ID290
Feature activation+4.49e+00

Pos logit contributions
 sle+0.069
 roof+0.069
tr+0.066
 globe+0.066
 mast+0.066
Neg logit contributions
ited-0.049
ised-0.043
ere-0.039
illed-0.038
ced-0.037
Token him
Token ID683
Feature activation-5.02e-01

Pos logit contributions
ited+0.048
ised+0.040
 disagreed+0.039
illed+0.038
ced+0.036
Neg logit contributions
 sle-0.052
 roof-0.050
 globe-0.050
bul-0.050
tr-0.049
Token how
Token ID703
Feature activation+1.34e+00

Pos logit contributions
ited+0.008
ised+0.007
illed+0.006
 disagreed+0.006
ere+0.006
Neg logit contributions
 sle-0.011
 roof-0.010
tr-0.010
 Dragons-0.010
 globe-0.010
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
 sle+0.025
 roof+0.024
 Dragons+0.023
 mast+0.023
tr+0.023
Neg logit contributions
ited-0.025
ised-0.022
ere-0.020
ced-0.020
 disagreed-0.020
Token use
Token ID779
Feature activation+3.71e-01

Pos logit contributions
 sle+0.016
 roof+0.015
tr+0.015
 mast+0.015
 Dragons+0.015
Neg logit contributions
ited-0.024
ised-0.022
 disagreed-0.020
ere-0.020
ced-0.020
Token the
Token ID262
Feature activation+4.31e+00

Pos logit contributions
 sle+0.009
tr+0.009
 roof+0.008
 Dragons+0.008
 mast+0.008
Neg logit contributions
ited-0.005
ised-0.004
ced-0.004
ere-0.004
 disagreed-0.004
Token therm
Token ID21969
Feature activation+8.79e+00

Pos logit contributions
 Dragons+0.061
 mast+0.060
 globe+0.059
 sle+0.056
 roof+0.056
Neg logit contributions
ited-0.095
ised-0.089
ced-0.085
ere-0.083
 disagreed-0.083
Tokenometer
Token ID15635
Feature activation+5.45e+00

Pos logit contributions
 roof+0.069
 globe+0.065
 mast+0.065
 sle+0.064
 Dragons+0.064
Neg logit contributions
ised-0.243
ited-0.228
ere-0.225
illed-0.223
ced-0.221
Token and
Token ID290
Feature activation+5.26e+00

Pos logit contributions
 sle+0.124
 Dragons+0.118
 roof+0.118
 mast+0.118
tr+0.117
Neg logit contributions
ited-0.076
ised-0.069
 disagreed-0.063
ced-0.062
illed-0.060
Token soon
Token ID2582
Feature activation+1.72e+00

Pos logit contributions
 sle+0.112
 roof+0.109
tr+0.108
 mast+0.107
 Dragons+0.106
Neg logit contributions
ited-0.083
ised-0.073
 disagreed-0.068
ced-0.067
ere-0.067
Token John
Token ID1757
Feature activation-4.18e-02

Pos logit contributions
 sle+0.033
 roof+0.032
 mast+0.032
 globe+0.030
 Dragons+0.030
Neg logit contributions
ited-0.031
ised-0.028
illed-0.026
ere-0.026
 disagreed-0.025
Token knew
Token ID2993
Feature activation+2.07e-01

Pos logit contributions
ited+0.001
ised+0.001
ere+0.001
illed+0.001
eful+0.001
Neg logit contributions
 sle-0.001
tr-0.001
 roof-0.001
 mast-0.001
 globe-0.001
Token he
Token ID339
Feature activation+1.58e+00

Pos logit contributions
ited+0.043
illed+0.037
ised+0.034
ced+0.034
eful+0.033
Neg logit contributions
 roof-0.032
 sle-0.031
 mast-0.031
 Dragons-0.030
 sail-0.029
Token know
Token ID760
Feature activation-2.38e+00

Pos logit contributions
 sle+0.030
 roof+0.029
tr+0.029
 mast+0.028
 Dragons+0.028
Neg logit contributions
ited-0.029
ised-0.026
 disagreed-0.024
ere-0.024
ced-0.024
Token that
Token ID326
Feature activation+1.17e+00

Pos logit contributions
ited+0.048
illed+0.037
ised+0.037
ere+0.036
 disagreed+0.035
Neg logit contributions
 sle-0.045
 roof-0.045
 mast-0.043
 Dragons-0.041
bul-0.041
Token the
Token ID262
Feature activation+3.61e+00

Pos logit contributions
 sle+0.023
tr+0.023
 roof+0.023
 mast+0.022
 Dragons+0.022
Neg logit contributions
ited-0.020
ised-0.018
 disagreed-0.017
ced-0.016
ere-0.016
Token mysterious
Token ID11428
Feature activation+5.55e+00

Pos logit contributions
 mast+0.058
 Dragons+0.058
tr+0.057
 sle+0.057
 globe+0.057
Neg logit contributions
ited-0.072
ised-0.066
 disagreed-0.063
ere-0.062
ced-0.062
Token therm
Token ID21969
Feature activation+8.75e+00

Pos logit contributions
 sle+0.078
tr+0.076
 Dragons+0.076
 roof+0.076
 mast+0.075
Neg logit contributions
ited-0.123
ised-0.114
 disagreed-0.109
ere-0.108
ced-0.107
Tokenometer
Token ID15635
Feature activation+6.18e+00

Pos logit contributions
 roof+0.070
 mast+0.066
 Dragons+0.065
 sle+0.065
 mant+0.059
Neg logit contributions
ised-0.230
ere-0.218
ited-0.215
illed-0.212
ced-0.212
Token that
Token ID326
Feature activation+4.14e+00

Pos logit contributions
 sle+0.141
 roof+0.137
 mast+0.135
tr+0.135
 Dragons+0.135
Neg logit contributions
ited-0.087
ised-0.075
 disagreed-0.070
ced-0.068
ere-0.068
Token he
Token ID339
Feature activation+3.24e+00

Pos logit contributions
 sle+0.084
 roof+0.080
 mast+0.080
tr+0.080
 Dragons+0.078
Neg logit contributions
ited-0.066
ised-0.061
 disagreed-0.061
ced-0.058
 done-0.057
Token had
Token ID550
Feature activation-1.33e+00

Pos logit contributions
 sle+0.062
tr+0.060
 roof+0.060
 Dragons+0.059
 mast+0.059
Neg logit contributions
ited-0.057
ised-0.052
illed-0.047
ere-0.047
ced-0.046
Token seen
Token ID1775
Feature activation-1.08e+00

Pos logit contributions
ited+0.021
ised+0.019
ere+0.017
illed+0.016
eful+0.016
Neg logit contributions
 roof-0.028
 sle-0.028
 mast-0.027
tr-0.027
bul-0.026
Token that
Token ID326
Feature activation+1.74e+00

Pos logit contributions
ited+0.007
ised+0.006
 disagreed+0.005
ere+0.005
ced+0.005
Neg logit contributions
 sle-0.009
 roof-0.009
tr-0.008
 mast-0.008
 Dragons-0.008
Token he
Token ID339
Feature activation+2.84e+00

Pos logit contributions
 sle+0.035
tr+0.034
 mast+0.034
 roof+0.033
 globe+0.033
Neg logit contributions
ited-0.028
ised-0.025
 disagreed-0.025
ced-0.024
ere-0.024
Token had
Token ID550
Feature activation-1.51e+00

Pos logit contributions
tr+0.051
bul+0.049
 roof+0.049
 globe+0.049
 sle+0.048
Neg logit contributions
ited-0.050
ised-0.047
illed-0.044
ere-0.043
eful-0.041
Token used
Token ID973
Feature activation-6.37e-01

Pos logit contributions
ited+0.022
ised+0.020
ere+0.017
eful+0.017
illed+0.017
Neg logit contributions
 roof-0.033
tr-0.032
bul-0.032
 mast-0.032
 sle-0.031
Token the
Token ID262
Feature activation+3.71e+00

Pos logit contributions
ited+0.011
ised+0.009
 disagreed+0.008
ere+0.008
illed+0.008
Neg logit contributions
 sle-0.013
 roof-0.013
tr-0.013
 mast-0.013
bul-0.012
Token m
Token ID285
Feature activation+8.72e+00

Pos logit contributions
 sle+0.067
 Dragons+0.067
 mast+0.066
 globe+0.065
tr+0.062
Neg logit contributions
ited-0.066
ised-0.061
ced-0.060
 disagreed-0.058
 done-0.057
Tokenop
Token ID404
Feature activation+5.63e+00

Pos logit contributions
 mast+0.217
 globe+0.214
 Dragons+0.206
 sle+0.205
 roof+0.205
Neg logit contributions
ited-0.098
ised-0.095
ere-0.085
illed-0.085
ced-0.076
Token in
Token ID287
Feature activation+3.66e-01

Pos logit contributions
 sle+0.122
 roof+0.118
 mast+0.116
tr+0.116
 Dragons+0.115
Neg logit contributions
ited-0.087
ised-0.077
 disagreed-0.072
ere-0.070
ced-0.070
Token an
Token ID281
Feature activation-6.88e-01

Pos logit contributions
 sle+0.007
 roof+0.007
 globe+0.007
 mast+0.007
bul+0.007
Neg logit contributions
ited-0.006
ised-0.005
 disagreed-0.005
ere-0.005
illed-0.005
Token organized
Token ID8389
Feature activation+2.33e+00

Pos logit contributions
ited+0.013
ised+0.012
ced+0.012
 disagreed+0.011
ere+0.011
Neg logit contributions
 roof-0.012
 sle-0.012
 mast-0.011
bul-0.011
 mant-0.010
Token way
Token ID835
Feature activation+6.90e-01

Pos logit contributions
 sle+0.032
 roof+0.031
 mast+0.030
tr+0.030
 globe+0.029
Neg logit contributions
ited-0.055
ised-0.050
 disagreed-0.047
ere-0.047
ced-0.047
Token said
Token ID531
Feature activation-2.37e+00

Pos logit contributions
ited+0.002
ised+0.002
ere+0.002
 disagreed+0.002
ced+0.002
Neg logit contributions
 sle-0.004
 roof-0.004
tr-0.004
 globe-0.004
 mast-0.004
Token it
Token ID340
Feature activation+2.53e+00

Pos logit contributions
ited+0.048
ised+0.041
ere+0.039
illed+0.038
ced+0.037
Neg logit contributions
 roof-0.042
 sle-0.040
 mast-0.040
 globe-0.039
bul-0.039
Token was
Token ID373
Feature activation-8.90e-01

Pos logit contributions
 sle+0.057
 roof+0.056
tr+0.056
 Dragons+0.054
 mast+0.054
Neg logit contributions
ited-0.037
ised-0.032
ere-0.029
 disagreed-0.029
ced-0.028
Token the
Token ID262
Feature activation+1.45e+00

Pos logit contributions
ited+0.014
ised+0.012
illed+0.010
ced+0.010
ere+0.010
Neg logit contributions
 roof-0.019
 sle-0.019
tr-0.019
bul-0.019
 globe-0.018
Token best
Token ID1266
Feature activation+2.68e+00

Pos logit contributions
 sle+0.031
 Dragons+0.030
 mast+0.029
 sy+0.029
ometer+0.028
Neg logit contributions
ited-0.022
 done-0.019
 disagreed-0.019
ised-0.018
 seen-0.018
Token ash
Token ID12530
Feature activation+8.70e+00

Pos logit contributions
 mast+0.051
 sle+0.051
 Dragons+0.051
 sy+0.047
ometer+0.045
Neg logit contributions
,-0.041
 done-0.040
 seen-0.040
ited-0.039
 time-0.039
Tokentr
Token ID2213
Feature activation+6.91e+00

Pos logit contributions
 mast+0.091
 sle+0.090
 Dragons+0.081
 mant+0.072
 nar+0.065
Neg logit contributions
 did-0.214
 done-0.213
 seen-0.203
,-0.201
 heard-0.200
Tokenay
Token ID323
Feature activation+2.61e+00

Pos logit contributions
 sle+0.160
 mast+0.158
 Dragons+0.156
 roof+0.151
 mant+0.148
Neg logit contributions
ited-0.088
ised-0.086
ced-0.072
pped-0.072
 done-0.072
Token they
Token ID484
Feature activation+3.78e+00

Pos logit contributions
 sle+0.059
 mast+0.057
 roof+0.056
 Dragons+0.056
tr+0.054
Neg logit contributions
ited-0.037
ised-0.033
 disagreed-0.032
ced-0.032
 seen-0.031
Token had
Token ID550
Feature activation-1.40e+00

Pos logit contributions
 sle+0.079
 roof+0.076
tr+0.075
 mast+0.075
 Dragons+0.074
Neg logit contributions
ited-0.062
ised-0.054
 disagreed-0.050
ere-0.050
ced-0.050
Token ever
Token ID1683
Feature activation-9.88e-01

Pos logit contributions
ited+0.032
ised+0.030
ere+0.027
illed+0.027
 disagreed+0.027
Neg logit contributions
 roof-0.019
 sle-0.019
tr-0.019
 globe-0.018
 mast-0.018
Token checked
Token ID10667
Feature activation-1.18e+00

Pos logit contributions
ited+0.018
ised+0.016
illed+0.015
ere+0.015
 disagreed+0.015
Neg logit contributions
 sle-0.017
 sy-0.016
bul-0.016
 roof-0.016
 mast-0.016
Token.
Token ID13
Feature activation+5.50e+00

Pos logit contributions
ited+0.018
ised+0.016
 disagreed+0.015
ere+0.015
ced+0.014
Neg logit contributions
 sle-0.026
 roof-0.025
tr-0.024
 globe-0.024
 mast-0.024
Token She
Token ID1375
Feature activation+2.40e+00

Pos logit contributions
 sle+0.103
 roof+0.101
tr+0.099
 mast+0.099
 globe+0.099
Neg logit contributions
ited-0.098
ised-0.089
ere-0.087
illed-0.086
 disagreed-0.086
Token put
Token ID1234
Feature activation-7.48e-01

Pos logit contributions
 sle+0.053
 roof+0.052
tr+0.052
 mast+0.051
 globe+0.050
Neg logit contributions
ited-0.036
ised-0.031
ere-0.028
illed-0.028
 disagreed-0.028
Token the
Token ID262
Feature activation+2.54e+00

Pos logit contributions
ited+0.011
ised+0.010
 disagreed+0.009
illed+0.009
ere+0.009
Neg logit contributions
 sle-0.016
 globe-0.016
tr-0.015
 roof-0.015
bul-0.015
Token therm
Token ID21969
Feature activation+8.70e+00

Pos logit contributions
 sle+0.036
 roof+0.034
 mast+0.034
 Dragons+0.034
tr+0.034
Neg logit contributions
ited-0.059
ised-0.054
ced-0.051
 disagreed-0.051
ere-0.051
Tokenometer
Token ID15635
Feature activation+4.86e+00

Pos logit contributions
 roof+0.076
 Dragons+0.069
 sle+0.067
 mast+0.067
 flaming+0.059
Neg logit contributions
ised-0.237
ited-0.222
ced-0.217
ere-0.216
illed-0.213
Token next
Token ID1306
Feature activation+2.94e-01

Pos logit contributions
 sle+0.102
 roof+0.099
tr+0.098
 mast+0.097
 globe+0.096
Neg logit contributions
ited-0.078
ised-0.068
 disagreed-0.063
ere-0.063
ced-0.062
Token to
Token ID284
Feature activation+2.83e-01

Pos logit contributions
 sle+0.005
 roof+0.005
tr+0.005
bul+0.005
 mast+0.005
Neg logit contributions
ited-0.006
ised-0.005
illed-0.005
ced-0.005
 disagreed-0.005
Token his
Token ID465
Feature activation+1.73e+00

Pos logit contributions
 sle+0.006
 roof+0.006
tr+0.006
 mast+0.006
 Dragons+0.006
Neg logit contributions
ited-0.004
ised-0.003
 disagreed-0.003
ere-0.003
ced-0.003
Token forehead
Token ID22645
Feature activation+1.46e+00

Pos logit contributions
 Dragons+0.028
 sle+0.028
tr+0.027
 mast+0.027
 globe+0.027
Neg logit contributions
ited-0.033
ised-0.031
 disagreed-0.029
ced-0.029
ere-0.029
Token.
Token ID13
Feature activation+4.51e+00

Pos logit contributions
 roof+0.009
 sle+0.009
tr+0.009
 globe+0.009
 mast+0.009
Neg logit contributions
ited-0.009
ised-0.008
ere-0.007
 disagreed-0.007
ced-0.007
Token She
Token ID1375
Feature activation+2.50e+00

Pos logit contributions
 sle+0.101
 mast+0.094
 roof+0.092
 sy+0.090
 globe+0.089
Neg logit contributions
ited-0.067
ised-0.066
ere-0.061
 disagreed-0.060
illed-0.060
Token put
Token ID1234
Feature activation-1.05e+00

Pos logit contributions
 sle+0.055
 roof+0.053
tr+0.053
 mast+0.052
 globe+0.052
Neg logit contributions
ited-0.038
ised-0.033
ere-0.030
illed-0.030
ced-0.030
Token on
Token ID319
Feature activation-1.71e+00

Pos logit contributions
ited+0.017
ised+0.014
 disagreed+0.014
illed+0.013
ced+0.013
Neg logit contributions
 sle-0.022
 globe-0.022
 roof-0.022
tr-0.021
bul-0.021
Token the
Token ID262
Feature activation+3.78e+00

Pos logit contributions
ited+0.025
ised+0.021
 disagreed+0.020
illed+0.019
ere+0.019
Neg logit contributions
 sle-0.039
 roof-0.038
tr-0.038
 globe-0.037
 mast-0.037
Token m
Token ID285
Feature activation+8.69e+00

Pos logit contributions
 mast+0.062
 Dragons+0.062
 sle+0.061
 globe+0.061
 roof+0.059
Neg logit contributions
ited-0.075
ised-0.067
 disagreed-0.064
ere-0.063
ced-0.063
Tokenittens
Token ID34978
Feature activation+5.81e+00

Pos logit contributions
 mast+0.203
 globe+0.198
 Dragons+0.191
 roof+0.188
 mant+0.188
Neg logit contributions
ited-0.115
ere-0.098
ised-0.097
illed-0.089
pped-0.083
Token and
Token ID290
Feature activation+4.08e+00

Pos logit contributions
 sle+0.141
 roof+0.139
tr+0.137
 mast+0.136
 globe+0.135
Neg logit contributions
ited-0.074
ised-0.062
ere-0.056
 disagreed-0.056
ced-0.055
Token gave
Token ID2921
Feature activation-1.29e+00

Pos logit contributions
 sle+0.087
 roof+0.085
tr+0.084
 mast+0.083
 globe+0.082
Neg logit contributions
ited-0.064
ised-0.056
ere-0.052
 disagreed-0.051
illed-0.051
Token Emily
Token ID17608
Feature activation-1.79e+00

Pos logit contributions
ited+0.025
ised+0.021
illed+0.020
 disagreed+0.020
ere+0.019
Neg logit contributions
 roof-0.023
 sle-0.023
 globe-0.022
tr-0.022
 Dragons-0.022
Token a
Token ID257
Feature activation+1.60e+00

Pos logit contributions
ited+0.026
ised+0.022
 disagreed+0.020
ere+0.020
ced+0.020
Neg logit contributions
 sle-0.041
 roof-0.040
tr-0.040
 globe-0.039
 Dragons-0.038
Token't
Token ID470
Feature activation-3.79e+00

Pos logit contributions
ited+0.087
ised+0.080
illed+0.079
ere+0.078
ced+0.076
Neg logit contributions
 roof-0.033
tr-0.032
 sle-0.032
 remote-0.029
bul-0.028
Token too
Token ID1165
Feature activation-2.66e+00

Pos logit contributions
ited+0.070
ised+0.064
ere+0.057
eful+0.056
pped+0.056
Neg logit contributions
 sle-0.068
 sy-0.067
 roof-0.066
bul-0.065
 remote-0.064
Token sure
Token ID1654
Feature activation-1.87e+00

Pos logit contributions
ited+0.048
ised+0.044
pped+0.043
ere+0.041
ieved+0.040
Neg logit contributions
 sle-0.045
 roof-0.044
ometer-0.043
 sy-0.042
bul-0.040
Token what
Token ID644
Feature activation+5.71e-01

Pos logit contributions
ited+0.035
ised+0.030
ere+0.027
eful+0.026
ivered+0.026
Neg logit contributions
 roof-0.036
 chim-0.032
 globe-0.031
bul-0.031
 lantern-0.031
Token the
Token ID262
Feature activation+3.76e+00

Pos logit contributions
 sle+0.012
 roof+0.012
 mast+0.012
tr+0.012
 Dragons+0.012
Neg logit contributions
ited-0.009
ised-0.008
 disagreed-0.007
ere-0.007
ced-0.007
Token therm
Token ID21969
Feature activation+8.65e+00

Pos logit contributions
 mast+0.065
 Dragons+0.060
 globe+0.056
tr+0.056
 roof+0.055
Neg logit contributions
ited-0.074
ised-0.069
 disagreed-0.067
 done-0.065
ere-0.065
Tokenometer
Token ID15635
Feature activation+8.30e+00

Pos logit contributions
 mast+0.079
 roof+0.077
 sle+0.074
 Dragons+0.069
 flaming+0.064
Neg logit contributions
ised-0.225
ited-0.215
ere-0.214
illed-0.208
ced-0.207
Token was
Token ID373
Feature activation-3.37e-01

Pos logit contributions
 sle+0.200
 roof+0.194
 mast+0.192
 Dragons+0.191
tr+0.191
Neg logit contributions
ited-0.108
ised-0.092
 disagreed-0.086
ere-0.082
ced-0.082
Token for
Token ID329
Feature activation-1.36e+00

Pos logit contributions
ited+0.006
ised+0.005
ere+0.005
eful+0.005
ced+0.005
Neg logit contributions
 roof-0.007
 sle-0.007
tr-0.006
 globe-0.006
 mast-0.006
Token.
Token ID13
Feature activation+4.25e+00

Pos logit contributions
ited+0.023
ised+0.020
ced+0.017
 disagreed+0.017
ere+0.017
Neg logit contributions
 roof-0.028
 sle-0.027
 globe-0.026
 Dragons-0.026
 sy-0.026
Token She
Token ID1375
Feature activation+1.34e+00

Pos logit contributions
 sle+0.075
 mast+0.070
 globe+0.069
tr+0.067
 roof+0.067
Neg logit contributions
ited-0.083
ised-0.078
 disagreed-0.074
ere-0.073
illed-0.073
Token and
Token ID290
Feature activation+2.27e+00

Pos logit contributions
ited+0.043
ised+0.038
ere+0.037
eful+0.034
ced+0.034
Neg logit contributions
 roof-0.037
 sle-0.037
 globe-0.036
 mast-0.035
tr-0.035
Token they
Token ID484
Feature activation-1.75e-01

Pos logit contributions
 sle+0.038
 roof+0.037
tr+0.036
 mast+0.036
 globe+0.035
Neg logit contributions
ited-0.046
ised-0.041
ere-0.039
ced-0.039
illed-0.038
Token bought
Token ID5839
Feature activation-2.10e+00

Pos logit contributions
ited+0.003
ised+0.003
illed+0.003
ced+0.003
ere+0.003
Neg logit contributions
 roof-0.003
 sle-0.003
tr-0.003
 mast-0.003
bul-0.003
Token the
Token ID262
Feature activation+3.31e+00

Pos logit contributions
ited+0.031
ised+0.026
 disagreed+0.023
illed+0.023
ere+0.023
Neg logit contributions
 sle-0.048
 roof-0.046
 globe-0.045
 mast-0.045
tr-0.045
Token new
Token ID649
Feature activation+4.46e+00

Pos logit contributions
 Dragons+0.045
 mast+0.045
 sle+0.043
 roof+0.043
 globe+0.042
Neg logit contributions
ited-0.075
ised-0.070
ced-0.067
 disagreed-0.066
ere-0.066
Token therm
Token ID21969
Feature activation+8.53e+00

Pos logit contributions
 Dragons+0.067
 mast+0.066
tr+0.064
 globe+0.063
 roof+0.063
Neg logit contributions
ited-0.094
ised-0.088
ced-0.084
ere-0.083
 disagreed-0.082
Tokenometer
Token ID15635
Feature activation+4.98e+00

Pos logit contributions
 roof+0.082
 Dragons+0.079
 mast+0.078
 sle+0.070
 flaming+0.068
Neg logit contributions
ised-0.218
ited-0.203
ere-0.201
illed-0.201
ced-0.198
Token!
Token ID0
Feature activation+4.49e+00

Pos logit contributions
 Dragons+0.112
 sle+0.111
 mast+0.108
 roof+0.106
tr+0.106
Neg logit contributions
ited-0.065
ised-0.060
 disagreed-0.058
 seen-0.056
illed-0.055
Token<|endoftext|>
Token ID50256
Feature activation-6.03e+00

Pos logit contributions
 sle+0.094
 mast+0.093
 sy+0.091
 roof+0.091
bul+0.090
Neg logit contributions
ited-0.069
ised-0.066
 disagreed-0.062
ere-0.061
illed-0.060
TokenOnce
Token ID7454
Feature activation-7.03e+00

Pos logit contributions
ited+0.135
ised+0.120
 disagreed+0.114
ced+0.110
illed+0.109
Neg logit contributions
 sle-0.091
 globe-0.090
tr-0.088
 roof-0.087
 Dragons-0.086
Token upon
Token ID2402
Feature activation-6.38e+00

Pos logit contributions
ited+0.114
ised+0.094
ced+0.090
pped+0.090
eful+0.084
Neg logit contributions
 roof-0.148
 sle-0.143
 globe-0.142
ometer-0.138
 Dragons-0.132
TokenRed
Token ID7738
Feature activation-1.01e+00

Pos logit contributions
ited+0.019
ised+0.017
 disagreed+0.017
 seen+0.016
illed+0.016
Neg logit contributions
 sle-0.019
 roof-0.018
 sy-0.018
 Dragons-0.018
tr-0.018
Token thought
Token ID1807
Feature activation-1.67e+00

Pos logit contributions
ited+0.016
ised+0.014
illed+0.014
 seen+0.013
eful+0.013
Neg logit contributions
 sle-0.019
tr-0.019
 mast-0.019
 roof-0.019
 sy-0.019
Token about
Token ID546
Feature activation-4.72e-01

Pos logit contributions
ited+0.029
ised+0.025
 disagreed+0.023
ere+0.022
ced+0.022
Neg logit contributions
 sle-0.034
 roof-0.033
 sy-0.031
bul-0.031
 mast-0.031
Token what
Token ID644
Feature activation+7.82e-01

Pos logit contributions
ited+0.007
ised+0.006
 disagreed+0.005
ere+0.005
ced+0.005
Neg logit contributions
 sle-0.011
 roof-0.011
tr-0.010
 mast-0.010
 Dragons-0.010
Token the
Token ID262
Feature activation+3.93e+00

Pos logit contributions
 sle+0.015
 roof+0.014
tr+0.014
 mast+0.014
 globe+0.014
Neg logit contributions
ited-0.014
ised-0.013
 disagreed-0.012
ere-0.012
ced-0.012
Token m
Token ID285
Feature activation+8.49e+00

Pos logit contributions
 mast+0.079
 Dragons+0.077
 globe+0.076
 sle+0.076
tr+0.075
Neg logit contributions
ited-0.062
ised-0.056
 disagreed-0.055
ced-0.055
 done-0.054
Tokenop
Token ID404
Feature activation+6.98e+00

Pos logit contributions
 mast+0.204
 sle+0.195
 globe+0.195
 roof+0.189
 Dragons+0.187
Neg logit contributions
ited-0.108
ised-0.101
ere-0.094
illed-0.093
 disagreed-0.090
Token had
Token ID550
Feature activation-8.52e-01

Pos logit contributions
 sle+0.149
 roof+0.144
 mast+0.142
tr+0.142
 Dragons+0.140
Neg logit contributions
ited-0.110
ised-0.097
 disagreed-0.092
ere-0.089
ced-0.088
Token said
Token ID531
Feature activation+2.23e-01

Pos logit contributions
ited+0.012
ised+0.010
ere+0.009
eful+0.008
ced+0.008
Neg logit contributions
 roof-0.019
tr-0.019
 sle-0.019
bul-0.019
 mast-0.018
Token,
Token ID11
Feature activation+2.93e+00

Pos logit contributions
 sle+0.004
 roof+0.004
tr+0.004
 mast+0.004
 globe+0.004
Neg logit contributions
ited-0.004
ised-0.004
ere-0.004
 disagreed-0.003
ced-0.003
Token and
Token ID290
Feature activation+3.60e+00

Pos logit contributions
 sle+0.047
tr+0.046
 roof+0.045
 globe+0.045
 mast+0.043
Neg logit contributions
ited-0.059
ised-0.055
 disagreed-0.054
ced-0.052
ere-0.051
Token.
Token ID13
Feature activation+3.88e+00

Pos logit contributions
 sle+0.044
 mast+0.042
 Dragons+0.042
 roof+0.042
tr+0.042
Neg logit contributions
ited-0.033
ised-0.029
 disagreed-0.028
ced-0.027
ere-0.027
Token She
Token ID1375
Feature activation+1.92e+00

Pos logit contributions
 mast+0.112
 sle+0.106
rier+0.102
 remote+0.097
 hull+0.096
Neg logit contributions
ere-0.028
ited-0.025
 did-0.025
,-0.024
ised-0.024
Token loved
Token ID6151
Feature activation-2.44e-01

Pos logit contributions
 sle+0.041
 roof+0.041
tr+0.040
 globe+0.039
 mast+0.039
Neg logit contributions
ited-0.030
ised-0.026
illed-0.024
ere-0.023
ced-0.023
Token using
Token ID1262
Feature activation-2.18e-01

Pos logit contributions
ited+0.004
ised+0.004
 disagreed+0.003
ere+0.003
ced+0.003
Neg logit contributions
 sle-0.005
 roof-0.005
tr-0.005
 Dragons-0.005
 mast-0.005
Token the
Token ID262
Feature activation+3.52e+00

Pos logit contributions
ited+0.004
ised+0.003
 disagreed+0.003
illed+0.003
ere+0.003
Neg logit contributions
 roof-0.005
 sle-0.005
 sy-0.004
bul-0.004
 mast-0.004
Token m
Token ID285
Feature activation+8.48e+00

Pos logit contributions
 sle+0.053
 roof+0.052
tr+0.050
 mast+0.050
 globe+0.049
Neg logit contributions
ited-0.078
ised-0.071
 disagreed-0.067
ere-0.067
ced-0.067
Tokenop
Token ID404
Feature activation+5.22e+00

Pos logit contributions
 mast+0.191
 globe+0.186
 sle+0.186
 Dragons+0.181
 roof+0.180
Neg logit contributions
ited-0.121
ised-0.110
ere-0.103
illed-0.101
 disagreed-0.098
Token and
Token ID290
Feature activation+4.65e+00

Pos logit contributions
 sle+0.114
 roof+0.112
tr+0.110
 mast+0.109
 Dragons+0.109
Neg logit contributions
ited-0.079
ised-0.069
 disagreed-0.064
ere-0.063
ced-0.063
Token making
Token ID1642
Feature activation+1.32e-01

Pos logit contributions
 roof+0.095
 sle+0.094
tr+0.092
 mast+0.090
bul+0.090
Neg logit contributions
ited-0.079
ised-0.069
ere-0.063
illed-0.062
 disagreed-0.062
Token the
Token ID262
Feature activation+3.38e+00

Pos logit contributions
 roof+0.003
 sle+0.003
 mast+0.002
 Dragons+0.002
tr+0.002
Neg logit contributions
ited-0.002
ised-0.002
 disagreed-0.002
illed-0.002
ere-0.002
Token floors
Token ID18570
Feature activation+1.91e+00

Pos logit contributions
 sle+0.055
 mast+0.054
 Dragons+0.054
 globe+0.052
bul+0.052
Neg logit contributions
ited-0.068
ised-0.062
ced-0.059
 disagreed-0.059
ere-0.058
Token They
Token ID1119
Feature activation+2.25e-01

Pos logit contributions
 sle+0.082
bul+0.073
 roof+0.071
 globe+0.070
 mant+0.070
Neg logit contributions
ised-0.045
,-0.043
 did-0.042
 there-0.042
illed-0.038
Token took
Token ID1718
Feature activation-2.51e+00

Pos logit contributions
 roof+0.004
 mast+0.004
tr+0.004
 sle+0.004
 globe+0.004
Neg logit contributions
ited-0.004
ised-0.004
illed-0.003
ere-0.003
eful-0.003
Token turns
Token ID4962
Feature activation-9.02e-01

Pos logit contributions
ited+0.032
ised+0.027
 disagreed+0.024
ere+0.024
ced+0.024
Neg logit contributions
 sle-0.062
 roof-0.061
tr-0.060
 mast-0.060
 Dragons-0.059
Token putting
Token ID5137
Feature activation-6.47e-01

Pos logit contributions
ited+0.017
ised+0.016
ere+0.014
 disagreed+0.014
illed+0.014
Neg logit contributions
 sle-0.016
 roof-0.016
tr-0.016
 mast-0.015
 globe-0.015
Token the
Token ID262
Feature activation+3.72e+00

Pos logit contributions
ited+0.008
ised+0.007
 disagreed+0.006
ere+0.006
ced+0.006
Neg logit contributions
 sle-0.016
 roof-0.016
tr-0.016
 mast-0.016
 Dragons-0.015
Token therm
Token ID21969
Feature activation+8.41e+00

Pos logit contributions
 Dragons+0.052
 mast+0.052
 globe+0.050
 sle+0.048
 roof+0.048
Neg logit contributions
ited-0.083
ised-0.076
ced-0.074
ere-0.073
 disagreed-0.071
Tokenometer
Token ID15635
Feature activation+5.17e+00

Pos logit contributions
 roof+0.079
 Dragons+0.072
 mast+0.072
 sle+0.069
 flow+0.066
Neg logit contributions
ised-0.214
ere-0.195
ited-0.195
ced-0.193
illed-0.192
Token in
Token ID287
Feature activation+8.18e-02

Pos logit contributions
 sle+0.123
 Dragons+0.120
 mast+0.117
 roof+0.117
 globe+0.115
Neg logit contributions
ited-0.063
ised-0.058
 disagreed-0.052
ced-0.051
illed-0.050
Token their
Token ID511
Feature activation+2.57e+00

Pos logit contributions
 sle+0.002
 roof+0.002
 globe+0.002
 mast+0.002
tr+0.002
Neg logit contributions
ited-0.001
ised-0.001
 disagreed-0.001
ere-0.001
illed-0.001
Token mouths
Token ID28552
Feature activation+3.24e+00

Pos logit contributions
bul+0.044
 Dragons+0.043
 sle+0.043
 globe+0.043
tr+0.042
Neg logit contributions
ited-0.048
ised-0.043
ere-0.042
 disagreed-0.041
ced-0.040
Token and
Token ID290
Feature activation+4.45e+00

Pos logit contributions
 sle+0.066
 roof+0.065
tr+0.064
 mast+0.063
 Dragons+0.063
Neg logit contributions
ited-0.054
ised-0.048
ere-0.044
 disagreed-0.044
ced-0.043
Token was
Token ID373
Feature activation-1.56e+00

Pos logit contributions
tr+0.040
 roof+0.040
 sle+0.040
 mast+0.039
bul+0.039
Neg logit contributions
ited-0.031
ised-0.028
illed-0.026
ere-0.026
eful-0.024
Token surprised
Token ID6655
Feature activation-1.22e+00

Pos logit contributions
ited+0.032
ised+0.028
ere+0.027
illed+0.024
eful+0.024
Neg logit contributions
 roof-0.026
 sle-0.026
 remote-0.025
tr-0.025
 mast-0.024
Token how
Token ID703
Feature activation+8.24e-01

Pos logit contributions
ited+0.022
ised+0.018
ced+0.018
ere+0.018
 disagreed+0.017
Neg logit contributions
 sle-0.023
 roof-0.022
 Dragons-0.022
bul-0.022
 mast-0.021
Token harsh
Token ID11859
Feature activation+3.16e+00

Pos logit contributions
 sle+0.016
 roof+0.016
 mast+0.015
 Dragons+0.015
tr+0.015
Neg logit contributions
ited-0.015
ised-0.013
 disagreed-0.012
ere-0.012
ced-0.012
Token the
Token ID262
Feature activation+3.28e+00

Pos logit contributions
 sle+0.070
 roof+0.068
tr+0.067
 mast+0.067
 Dragons+0.067
Neg logit contributions
ited-0.047
ised-0.041
 disagreed-0.038
ere-0.038
ced-0.037
Token m
Token ID285
Feature activation+8.40e+00

Pos logit contributions
 Dragons+0.056
 sle+0.055
 mast+0.055
tr+0.054
 globe+0.054
Neg logit contributions
ited-0.062
ised-0.058
ced-0.056
 disagreed-0.054
ere-0.054
Tokenop
Token ID404
Feature activation+6.71e+00

Pos logit contributions
 mast+0.204
 sle+0.196
 globe+0.194
 roof+0.190
 Dragons+0.189
Neg logit contributions
ited-0.099
ised-0.099
ere-0.089
illed-0.087
ced-0.082
Token was
Token ID373
Feature activation+1.68e+00

Pos logit contributions
 sle+0.151
 roof+0.147
 mast+0.145
tr+0.144
 Dragons+0.143
Neg logit contributions
ited-0.098
ised-0.086
 disagreed-0.079
ere-0.077
ced-0.077
Token!
Token ID0
Feature activation+5.27e+00

Pos logit contributions
 sle+0.032
 roof+0.032
tr+0.031
 mast+0.031
 globe+0.031
Neg logit contributions
ited-0.030
ised-0.027
ere-0.025
 disagreed-0.024
ced-0.024
Token He
Token ID679
Feature activation+2.36e+00

Pos logit contributions
 sle+0.113
 globe+0.105
 mast+0.103
 sy+0.102
 roof+0.099
Neg logit contributions
ised-0.082
ere-0.075
ited-0.075
 disagreed-0.074
illed-0.072
Token scrub
Token ID27268
Feature activation-2.44e+00

Pos logit contributions
 sle+0.050
 roof+0.049
 mast+0.049
tr+0.049
bul+0.047
Neg logit contributions
ited-0.036
ised-0.031
illed-0.030
ere-0.029
ced-0.028
Token hug
Token ID16225
Feature activation+1.01e+00

Pos logit contributions
 Dragons+0.069
 sle+0.069
 mast+0.069
tr+0.069
 roof+0.067
Neg logit contributions
ited-0.093
ised-0.085
ere-0.081
 disagreed-0.080
ced-0.079
Token,
Token ID11
Feature activation+4.12e+00

Pos logit contributions
 sle+0.021
tr+0.021
 roof+0.021
 globe+0.020
 mast+0.020
Neg logit contributions
ited-0.016
ised-0.014
ere-0.013
 disagreed-0.013
ced-0.013
Token even
Token ID772
Feature activation+2.25e+00

Pos logit contributions
 sle+0.096
 roof+0.093
tr+0.092
 mast+0.092
 globe+0.091
Neg logit contributions
ited-0.057
ised-0.049
 disagreed-0.046
ced-0.045
 seen-0.044
Token though
Token ID996
Feature activation+1.84e+00

Pos logit contributions
bul+0.046
 sle+0.046
tr+0.046
 roof+0.044
 sy+0.044
Neg logit contributions
ited-0.036
ised-0.032
ere-0.030
illed-0.030
 disagreed-0.029
Token the
Token ID262
Feature activation+4.15e+00

Pos logit contributions
 roof+0.031
 sle+0.029
 globe+0.029
tr+0.028
 Dragons+0.028
Neg logit contributions
ited-0.039
ised-0.034
ere-0.032
illed-0.031
 disagreed-0.030
Token therm
Token ID21969
Feature activation+8.31e+00

Pos logit contributions
 sle+0.056
 mast+0.056
 Dragons+0.055
 globe+0.055
tr+0.053
Neg logit contributions
ited-0.094
ised-0.086
ced-0.083
 disagreed-0.082
ere-0.081
Tokenometer
Token ID15635
Feature activation+7.10e+00

Pos logit contributions
 roof+0.064
 mast+0.063
 sle+0.059
 Dragons+0.059
 mant+0.058
Neg logit contributions
ised-0.222
ited-0.216
ced-0.208
ere-0.208
illed-0.205
Token felt
Token ID2936
Feature activation-2.23e-01

Pos logit contributions
 sle+0.167
 roof+0.162
 mast+0.160
tr+0.160
 Dragons+0.159
Neg logit contributions
ited-0.096
ised-0.082
 disagreed-0.076
ced-0.074
ere-0.074
Token cold
Token ID4692
Feature activation+1.12e+00

Pos logit contributions
ited+0.005
ised+0.004
ere+0.004
illed+0.004
 disagreed+0.004
Neg logit contributions
 sle-0.004
 roof-0.004
 sy-0.003
tr-0.003
 mast-0.003
Token.
Token ID13
Feature activation+5.61e+00

Pos logit contributions
 sle+0.024
 roof+0.024
tr+0.023
 mast+0.023
 globe+0.023
Neg logit contributions
ited-0.017
ised-0.015
ere-0.014
 disagreed-0.014
ced-0.014
Token The
Token ID383
Feature activation+2.22e+00

Pos logit contributions
 sle+0.104
 mast+0.098
 roof+0.097
 globe+0.095
tr+0.094
Neg logit contributions
ited-0.106
ised-0.096
 disagreed-0.092
ere-0.090
illed-0.089
Token "
Token ID366
Feature activation+1.22e+00

Pos logit contributions
 sle+0.017
 roof+0.016
tr+0.016
 mast+0.016
 Dragons+0.016
Neg logit contributions
ited-0.021
ised-0.020
 disagreed-0.019
ced-0.019
ere-0.018
TokenLook
Token ID8567
Feature activation-3.30e+00

Pos logit contributions
 sle+0.023
 roof+0.022
 mast+0.022
tr+0.022
 Dragons+0.021
Neg logit contributions
ited-0.023
ised-0.020
 disagreed-0.019
ere-0.019
ced-0.019
Token how
Token ID703
Feature activation+6.74e-01

Pos logit contributions
ited+0.050
ised+0.040
 disagreed+0.036
ere+0.036
illed+0.036
Neg logit contributions
 sle-0.074
 roof-0.074
tr-0.072
 globe-0.071
 mast-0.070
Token generous
Token ID14431
Feature activation+1.68e+00

Pos logit contributions
 roof+0.013
 sle+0.012
 mast+0.012
 Dragons+0.012
tr+0.012
Neg logit contributions
ited-0.013
ised-0.011
illed-0.010
ere-0.010
 disagreed-0.010
Token the
Token ID262
Feature activation+3.72e+00

Pos logit contributions
 sle+0.032
 roof+0.031
tr+0.030
 mast+0.030
 Dragons+0.030
Neg logit contributions
ited-0.030
ised-0.027
 disagreed-0.025
ere-0.025
ced-0.025
Token fire
Token ID2046
Feature activation+8.30e+00

Pos logit contributions
 Dragons+0.050
tr+0.049
 sle+0.047
 sy+0.047
 mast+0.046
Neg logit contributions
ited-0.077
ised-0.076
ced-0.071
 done-0.070
 disagreed-0.070
Token is
Token ID318
Feature activation+1.39e+00

Pos logit contributions
 sle+0.159
 mast+0.152
 Dragons+0.149
 roof+0.149
 sy+0.146
Neg logit contributions
ited-0.132
ised-0.125
 disagreed-0.123
 done-0.119
 did-0.118
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
 sle+0.029
 roof+0.028
tr+0.028
 Dragons+0.027
 mast+0.027
Neg logit contributions
ited-0.023
ised-0.020
 disagreed-0.019
ced-0.019
ere-0.018
Token burning
Token ID9482
Feature activation+1.05e+00

Pos logit contributions
 sle+0.073
 roof+0.072
tr+0.070
 mast+0.070
 globe+0.069
Neg logit contributions
ited-0.063
ised-0.056
 disagreed-0.051
ere-0.051
ced-0.051
Token and
Token ID290
Feature activation+5.13e+00

Pos logit contributions
 sle+0.021
 roof+0.020
tr+0.020
 mast+0.020
 globe+0.020
Neg logit contributions
ited-0.018
ised-0.016
 disagreed-0.015
ere-0.015
illed-0.015
Token giving
Token ID3501
Feature activation-1.42e-01

Pos logit contributions
 sle+0.104
tr+0.101
 globe+0.100
 roof+0.100
 Dragons+0.100
Neg logit contributions
ited-0.082
ised-0.075
 disagreed-0.070
ced-0.069
 seen-0.067
Token too
Token ID1165
Feature activation-2.66e+00

Pos logit contributions
ited+0.070
ised+0.064
ere+0.057
eful+0.056
pped+0.056
Neg logit contributions
 sle-0.068
 sy-0.067
 roof-0.066
bul-0.065
 remote-0.064
Token sure
Token ID1654
Feature activation-1.87e+00

Pos logit contributions
ited+0.048
ised+0.044
pped+0.043
ere+0.041
ieved+0.040
Neg logit contributions
 sle-0.045
 roof-0.044
ometer-0.043
 sy-0.042
bul-0.040
Token what
Token ID644
Feature activation+5.71e-01

Pos logit contributions
ited+0.035
ised+0.030
ere+0.027
eful+0.026
ivered+0.026
Neg logit contributions
 roof-0.036
 chim-0.032
 globe-0.031
bul-0.031
 lantern-0.031
Token the
Token ID262
Feature activation+3.76e+00

Pos logit contributions
 sle+0.012
 roof+0.012
 mast+0.012
tr+0.012
 Dragons+0.012
Neg logit contributions
ited-0.009
ised-0.008
 disagreed-0.007
ere-0.007
ced-0.007
Token therm
Token ID21969
Feature activation+8.65e+00

Pos logit contributions
 mast+0.065
 Dragons+0.060
 globe+0.056
tr+0.056
 roof+0.055
Neg logit contributions
ited-0.074
ised-0.069
 disagreed-0.067
 done-0.065
ere-0.065
Tokenometer
Token ID15635
Feature activation+8.30e+00

Pos logit contributions
 mast+0.079
 roof+0.077
 sle+0.074
 Dragons+0.069
 flaming+0.064
Neg logit contributions
ised-0.225
ited-0.215
ere-0.214
illed-0.208
ced-0.207
Token was
Token ID373
Feature activation-3.37e-01

Pos logit contributions
 sle+0.200
 roof+0.194
 mast+0.192
 Dragons+0.191
tr+0.191
Neg logit contributions
ited-0.108
ised-0.092
 disagreed-0.086
ere-0.082
ced-0.082
Token for
Token ID329
Feature activation-1.36e+00

Pos logit contributions
ited+0.006
ised+0.005
ere+0.005
eful+0.005
ced+0.005
Neg logit contributions
 roof-0.007
 sle-0.007
tr-0.006
 globe-0.006
 mast-0.006
Token.
Token ID13
Feature activation+4.25e+00

Pos logit contributions
ited+0.023
ised+0.020
ced+0.017
 disagreed+0.017
ere+0.017
Neg logit contributions
 roof-0.028
 sle-0.027
 globe-0.026
 Dragons-0.026
 sy-0.026
Token She
Token ID1375
Feature activation+1.34e+00

Pos logit contributions
 sle+0.075
 mast+0.070
 globe+0.069
tr+0.067
 roof+0.067
Neg logit contributions
ited-0.083
ised-0.078
 disagreed-0.074
ere-0.073
illed-0.073
Token wasn
Token ID2492
Feature activation-3.06e+00

Pos logit contributions
tr+0.028
 roof+0.027
 sle+0.027
 mast+0.026
bul+0.026
Neg logit contributions
ited-0.021
ised-0.019
illed-0.017
ced-0.017
ere-0.017
Token surprised
Token ID6655
Feature activation-1.67e+00

Pos logit contributions
ited+0.041
ised+0.038
illed+0.033
ere+0.033
 disagreed+0.032
Neg logit contributions
 sle-0.039
 roof-0.036
 sy-0.036
tr-0.036
 remote-0.035
Token to
Token ID284
Feature activation-2.11e+00

Pos logit contributions
ited+0.032
ised+0.028
ced+0.028
illed+0.026
ere+0.026
Neg logit contributions
 sle-0.028
bul-0.027
 roof-0.026
 Dragons-0.026
 sy-0.025
Token see
Token ID766
Feature activation-1.50e+00

Pos logit contributions
ited+0.039
ised+0.035
ere+0.032
ced+0.031
illed+0.031
Neg logit contributions
 sle-0.040
 roof-0.039
 mast-0.038
 Dragons-0.037
tr-0.037
Token that
Token ID326
Feature activation+2.98e+00

Pos logit contributions
ited+0.022
ised+0.019
 disagreed+0.017
illed+0.017
ere+0.017
Neg logit contributions
 sle-0.034
 roof-0.032
tr-0.032
 globe-0.032
 sy-0.032
Token the
Token ID262
Feature activation+2.97e+00

Pos logit contributions
 sle+0.054
 roof+0.053
tr+0.052
 globe+0.051
 mast+0.050
Neg logit contributions
ited-0.057
ised-0.051
ere-0.047
illed-0.047
ced-0.047
Token therm
Token ID21969
Feature activation+8.25e+00

Pos logit contributions
 mast+0.042
 globe+0.042
 Dragons+0.042
 sle+0.041
 roof+0.039
Neg logit contributions
ited-0.065
ised-0.058
ced-0.057
ere-0.057
 disagreed-0.056
Tokenometer
Token ID15635
Feature activation+6.33e+00

Pos logit contributions
 mast+0.068
 roof+0.066
 Dragons+0.060
 sle+0.060
 globe+0.058
Neg logit contributions
ised-0.216
ited-0.209
ere-0.204
ced-0.201
illed-0.200
Token was
Token ID373
Feature activation-3.38e-01

Pos logit contributions
 sle+0.146
 roof+0.141
 mast+0.140
 Dragons+0.140
tr+0.139
Neg logit contributions
ited-0.088
ised-0.075
 disagreed-0.071
ced-0.069
ere-0.068
Token very
Token ID845
Feature activation+7.78e-01

Pos logit contributions
ited+0.006
ised+0.006
ere+0.005
illed+0.005
 disagreed+0.005
Neg logit contributions
 sle-0.006
 roof-0.006
tr-0.006
bul-0.006
 sy-0.006
Token high
Token ID1029
Feature activation+2.15e-01

Pos logit contributions
 sle+0.014
 roof+0.013
tr+0.013
 mast+0.013
 Dragons+0.013
Neg logit contributions
ited-0.015
ised-0.014
ere-0.013
ced-0.013
 disagreed-0.013
Token.
Token ID13
Feature activation+5.35e+00

Pos logit contributions
 sle+0.004
tr+0.004
 roof+0.004
 globe+0.004
bul+0.004
Neg logit contributions
ited-0.004
ised-0.004
ere-0.004
 disagreed-0.004
illed-0.004
Token worked
Token ID3111
Feature activation-1.47e+00

Pos logit contributions
ited+0.032
ised+0.028
illed+0.026
ere+0.026
eful+0.025
Neg logit contributions
 roof-0.036
 mast-0.034
tr-0.034
bul-0.033
 sle-0.033
Token so
Token ID523
Feature activation+1.27e+00

Pos logit contributions
ited+0.032
ere+0.027
illed+0.027
ised+0.026
 disagreed+0.026
Neg logit contributions
bul-0.022
tr-0.020
 globe-0.020
 roof-0.019
 sle-0.019
Token hard
Token ID1327
Feature activation+1.52e+00

Pos logit contributions
 roof+0.020
 Dragons+0.019
bul+0.019
 sle+0.019
 mast+0.019
Neg logit contributions
ited-0.026
ere-0.022
 disagreed-0.022
illed-0.022
 seen-0.021
Token on
Token ID319
Feature activation-8.70e-01

Pos logit contributions
 roof+0.030
 sle+0.030
bul+0.029
tr+0.029
 mast+0.028
Neg logit contributions
ited-0.027
ised-0.022
ere-0.021
illed-0.021
 disagreed-0.021
Token the
Token ID262
Feature activation+3.67e+00

Pos logit contributions
ited+0.012
ised+0.011
 disagreed+0.010
ced+0.010
ere+0.010
Neg logit contributions
 sle-0.020
tr-0.019
 roof-0.019
 Dragons-0.019
 mast-0.019
Token ash
Token ID12530
Feature activation+8.11e+00

Pos logit contributions
 Dragons+0.053
 sle+0.051
 mast+0.048
 sy+0.047
ometer+0.046
Neg logit contributions
ited-0.076
ised-0.071
ced-0.070
 disagreed-0.069
 done-0.069
Tokentr
Token ID2213
Feature activation+6.88e+00

Pos logit contributions
 sle+0.066
 mast+0.056
 Dragons+0.050
 sy+0.043
 remote+0.043
Neg logit contributions
ised-0.210
ited-0.207
,-0.206
 done-0.204
 disagreed-0.201
Tokenay
Token ID323
Feature activation+3.12e+00

Pos logit contributions
 sle+0.156
 mast+0.150
 Dragons+0.145
 globe+0.143
 roof+0.143
Neg logit contributions
ited-0.094
ised-0.092
ere-0.081
illed-0.077
 done-0.077
Token,
Token ID11
Feature activation+4.22e+00

Pos logit contributions
 sle+0.065
 roof+0.064
tr+0.064
 globe+0.063
 Dragons+0.062
Neg logit contributions
ited-0.050
ised-0.044
 disagreed-0.041
ere-0.040
ced-0.040
Token and
Token ID290
Feature activation+4.46e+00

Pos logit contributions
 sle+0.093
tr+0.091
 roof+0.089
 Dragons+0.089
 globe+0.088
Neg logit contributions
ited-0.061
ised-0.055
 disagreed-0.051
ced-0.051
 seen-0.049
Token now
Token ID783
Feature activation+1.73e+00

Pos logit contributions
 sle+0.085
 roof+0.084
 mast+0.082
tr+0.081
 Dragons+0.081
Neg logit contributions
ited-0.081
ised-0.071
ere-0.067
illed-0.066
 disagreed-0.065
Token.
Token ID13
Feature activation+5.12e+00

Pos logit contributions
 sle+0.076
 mast+0.071
 roof+0.071
 Dragons+0.070
tr+0.069
Neg logit contributions
ited-0.062
ised-0.057
 disagreed-0.054
ced-0.053
ere-0.052
Token Can
Token ID1680
Feature activation+1.26e+00

Pos logit contributions
 sle+0.090
bul+0.084
 roof+0.080
 sy+0.079
 globe+0.078
Neg logit contributions
 did-0.081
ised-0.081
illed-0.076
 there-0.075
eful-0.074
Token we
Token ID356
Feature activation-1.02e-01

Pos logit contributions
 sle+0.017
tr+0.015
bul+0.015
 mast+0.015
 sy+0.015
Neg logit contributions
ised-0.028
ited-0.027
 disagreed-0.026
 did-0.026
 there-0.025
Token bring
Token ID2222
Feature activation-8.13e-01

Pos logit contributions
ited+0.002
ised+0.002
 disagreed+0.001
ere+0.001
ced+0.001
Neg logit contributions
 sle-0.002
tr-0.002
 Dragons-0.002
 roof-0.002
 globe-0.002
Token the
Token ID262
Feature activation+2.86e+00

Pos logit contributions
ited+0.013
ised+0.012
 disagreed+0.011
ere+0.011
illed+0.011
Neg logit contributions
 sle-0.017
 roof-0.016
 mast-0.016
tr-0.016
 globe-0.016
Token m
Token ID285
Feature activation+8.11e+00

Pos logit contributions
 sle+0.051
tr+0.049
 Dragons+0.048
 roof+0.047
 mast+0.046
Neg logit contributions
ited-0.054
ised-0.051
ced-0.048
 disagreed-0.047
ere-0.047
Tokenop
Token ID404
Feature activation+4.35e+00

Pos logit contributions
 roof+0.196
 sle+0.194
 globe+0.191
 mast+0.190
 Dragons+0.189
Neg logit contributions
ised-0.091
ited-0.090
eful-0.078
illed-0.077
ere-0.071
Token,
Token ID11
Feature activation+2.82e+00

Pos logit contributions
 sle+0.096
 roof+0.094
tr+0.092
 mast+0.092
 Dragons+0.091
Neg logit contributions
ited-0.065
ised-0.057
 disagreed-0.052
ere-0.052
ced-0.051
Token too
Token ID1165
Feature activation+8.41e-01

Pos logit contributions
 sle+0.048
tr+0.046
 roof+0.046
 globe+0.045
 Dragons+0.045
Neg logit contributions
ited-0.055
ised-0.051
 disagreed-0.048
ere-0.047
ced-0.047
Token?"
Token ID1701
Feature activation+1.93e+00

Pos logit contributions
 sle+0.017
 roof+0.017
tr+0.017
 mast+0.016
 Dragons+0.016
Neg logit contributions
ited-0.015
ised-0.012
ced-0.012
ere-0.012
 disagreed-0.011
Token
Token ID198
Feature activation-2.97e+00

Pos logit contributions
 sle+0.041
tr+0.040
 mast+0.039
bul+0.039
 roof+0.039
Neg logit contributions
ited-0.029
ised-0.027
 disagreed-0.026
ere-0.025
illed-0.025
Token there
Token ID612
Feature activation-5.44e+00

Pos logit contributions
ited+0.127
ere+0.092
 disagreed+0.089
eling+0.088
ised+0.086
Neg logit contributions
 globe-0.139
 roof-0.133
tr-0.127
 We-0.122
 bulb-0.122
Token was
Token ID373
Feature activation-7.44e+00

Pos logit contributions
ited+0.142
ised+0.120
ere+0.110
eful+0.110
ieved+0.108
Neg logit contributions
 globe-0.072
 roof-0.068
tr-0.063
 fireplace-0.062
 chim-0.059
Token a
Token ID257
Feature activation-4.96e+00

Pos logit contributions
ited+0.144
ised+0.121
 disagreed+0.116
ced+0.110
ere+0.110
Neg logit contributions
 roof-0.142
tr-0.137
 moon-0.133
 globe-0.132
 fireplace-0.131
Token girl
Token ID2576
Feature activation-6.26e+00

Pos logit contributions
ited+0.057
ised+0.046
ced+0.044
ere+0.043
 disagreed+0.043
Neg logit contributions
 Dragons-0.119
tr-0.118
 sle-0.118
bul-0.116
 mast-0.115
Token named
Token ID3706
Feature activation-5.00e+00

Pos logit contributions
ited+0.087
ised+0.075
 disagreed+0.068
ere+0.067
ced+0.067
Neg logit contributions
 sle-0.146
 roof-0.143
tr-0.141
 mast-0.139
 Dragons-0.139
Token Ph
Token ID1380
Feature activation-1.17e+01

Pos logit contributions
ited+0.109
ised+0.100
ced+0.093
 disagreed+0.092
ere+0.092
Neg logit contributions
 roof-0.078
tr-0.076
 sle-0.076
 Dragons-0.072
 globe-0.072
Tokenoe
Token ID2577
Feature activation-8.06e+00

Pos logit contributions
ited+0.221
ised+0.206
 disagreed+0.188
ced+0.185
ere+0.183
Neg logit contributions
 sle-0.204
tr-0.204
 roof-0.203
 globe-0.195
 mast-0.190
Tokenbe
Token ID1350
Feature activation-7.08e+00

Pos logit contributions
ited+0.147
ised+0.132
 disagreed+0.123
ere+0.119
ced+0.117
Neg logit contributions
 sle-0.154
 roof-0.149
tr-0.149
bul-0.143
 globe-0.141
Token who
Token ID508
Feature activation-5.88e-01

Pos logit contributions
ited+0.133
ised+0.116
ced+0.109
 disagreed+0.107
ere+0.105
Neg logit contributions
 roof-0.132
 sle-0.128
tr-0.127
 globe-0.122
 Dragons-0.121
Token liked
Token ID8288
Feature activation-2.68e+00

Pos logit contributions
ited+0.006
ised+0.005
 disagreed+0.005
ere+0.004
ced+0.004
Neg logit contributions
 sle-0.016
 mast-0.015
 roof-0.015
tr-0.015
bul-0.015
Token to
Token ID284
Feature activation-1.63e+00

Pos logit contributions
ited+0.025
 disagreed+0.024
 seen+0.023
ere+0.022
ised+0.021
Neg logit contributions
 sle-0.068
 mast-0.067
 globe-0.067
tr-0.066
 Dragons-0.065
Token there
Token ID612
Feature activation-5.79e+00

Pos logit contributions
ited+0.133
ere+0.098
 disagreed+0.096
eling+0.095
ised+0.093
Neg logit contributions
 globe-0.153
 roof-0.148
tr-0.145
 We-0.134
 bulb-0.131
Token were
Token ID547
Feature activation-8.59e+00

Pos logit contributions
ited+0.148
ised+0.124
ere+0.113
eful+0.113
ieved+0.111
Neg logit contributions
 globe-0.083
 roof-0.081
tr-0.074
 fireplace-0.072
 chim-0.071
Token two
Token ID734
Feature activation-6.24e+00

Pos logit contributions
ited+0.161
ised+0.135
ced+0.132
 disagreed+0.126
illed+0.125
Neg logit contributions
 roof-0.164
 sle-0.160
tr-0.151
 mast-0.149
 Dragons-0.149
Token friends
Token ID2460
Feature activation-7.96e+00

Pos logit contributions
ited+0.120
ised+0.108
illed+0.100
ced+0.099
pped+0.099
Neg logit contributions
 sle-0.117
 roof-0.114
 mast-0.105
 Dragons-0.104
tr-0.103
Token,
Token ID11
Feature activation-2.45e+00

Pos logit contributions
ited+0.154
ised+0.125
ced+0.120
 disagreed+0.119
ere+0.115
Neg logit contributions
 roof-0.152
 sle-0.152
 Dragons-0.144
tr-0.142
 mast-0.139
Token Ab
Token ID2275
Feature activation-1.16e+01

Pos logit contributions
ited+0.034
ised+0.030
 disagreed+0.029
ere+0.028
ced+0.028
Neg logit contributions
 sle-0.055
 roof-0.053
tr-0.053
 mast-0.053
 Dragons-0.052
Tokeni
Token ID72
Feature activation-7.15e+00

Pos logit contributions
ised+0.202
ited+0.199
 disagreed+0.175
ced+0.171
eful+0.163
Neg logit contributions
 sle-0.211
 roof-0.207
tr-0.184
 flaming-0.181
 globe-0.180
Token and
Token ID290
Feature activation-1.99e+00

Pos logit contributions
ited+0.082
ised+0.071
ere+0.065
 disagreed+0.064
ced+0.064
Neg logit contributions
 sle-0.178
 roof-0.174
 mast-0.173
tr-0.173
 Dragons-0.172
Token Ben
Token ID3932
Feature activation-7.31e+00

Pos logit contributions
ited+0.036
ised+0.032
ced+0.029
ere+0.029
 disagreed+0.029
Neg logit contributions
 sle-0.039
 roof-0.038
tr-0.037
 mast-0.037
 Dragons-0.037
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
ited+0.142
ised+0.127
ere+0.118
 disagreed+0.117
ced+0.117
Neg logit contributions
 sle-0.131
 roof-0.128
tr-0.126
 globe-0.124
 mast-0.123
Token They
Token ID1119
Feature activation-2.56e-01

Pos logit contributions
 sle+0.017
 roof+0.016
tr+0.016
 mast+0.016
 Dragons+0.016
Neg logit contributions
ited-0.040
ised-0.037
 disagreed-0.036
ere-0.036
illed-0.036
Token her
Token ID607
Feature activation+7.90e-01

Pos logit contributions
ited+0.006
ised+0.005
 disagreed+0.005
illed+0.005
ere+0.005
Neg logit contributions
 sle-0.008
 roof-0.008
 mast-0.008
 Dragons-0.008
tr-0.008
Token brown
Token ID7586
Feature activation+3.23e+00

Pos logit contributions
 sle+0.015
 roof+0.014
tr+0.014
 mast+0.014
 Dragons+0.014
Neg logit contributions
ited-0.014
ised-0.013
 disagreed-0.012
ced-0.012
ere-0.012
Token fur
Token ID9230
Feature activation+3.61e+00

Pos logit contributions
 sle+0.050
tr+0.048
 roof+0.048
 Dragons+0.047
 mast+0.047
Neg logit contributions
ited-0.067
ised-0.062
 disagreed-0.060
ere-0.059
ced-0.058
Token.
Token ID13
Feature activation+5.24e+00

Pos logit contributions
 sle+0.072
 roof+0.069
 mast+0.068
tr+0.068
 Dragons+0.068
Neg logit contributions
ited-0.062
ised-0.055
 disagreed-0.051
ced-0.051
ere-0.050
Token<|endoftext|>
Token ID50256
Feature activation-5.18e+00

Pos logit contributions
 sle+0.100
 roof+0.097
 globe+0.096
 mast+0.095
bul+0.093
Neg logit contributions
ited-0.093
ised-0.085
ere-0.083
 disagreed-0.081
illed-0.080
TokenArk
Token ID42007
Feature activation-1.16e+01

Pos logit contributions
ited+0.116
ised+0.103
 disagreed+0.099
illed+0.094
ced+0.094
Neg logit contributions
 globe-0.077
 sle-0.077
tr-0.073
 Dragons-0.073
 roof-0.073
Tokenie
Token ID494
Feature activation-3.83e+00

Pos logit contributions
ised+0.216
ited+0.198
ced+0.186
 disagreed+0.173
eful+0.172
Neg logit contributions
 sle-0.199
man-0.187
 globe-0.184
bul-0.179
 lantern-0.174
Token wanted
Token ID2227
Feature activation-2.96e+00

Pos logit contributions
ited+0.064
ised+0.057
illed+0.054
ced+0.051
eful+0.050
Neg logit contributions
 sle-0.077
 roof-0.076
 mast-0.073
tr-0.072
 globe-0.071
Token his
Token ID465
Feature activation-4.28e-01

Pos logit contributions
ited+0.048
ised+0.042
 disagreed+0.038
ere+0.038
ced+0.038
Neg logit contributions
 sle-0.063
 roof-0.061
tr-0.060
 mast-0.060
 Dragons-0.060
Token own
Token ID898
Feature activation+9.02e-01

Pos logit contributions
ited+0.008
ised+0.007
 disagreed+0.007
ere+0.007
 seen+0.007
Neg logit contributions
tr-0.007
 sle-0.007
 Dragons-0.007
 roof-0.007
 mast-0.007
Token equipment
Token ID5112
Feature activation+3.27e+00

Pos logit contributions
 sle+0.014
tr+0.013
 Dragons+0.013
 roof+0.013
 mast+0.013
Neg logit contributions
ited-0.019
ised-0.017
 disagreed-0.017
ere-0.017
ced-0.016
Token there
Token ID612
Feature activation-5.51e+00

Pos logit contributions
ited+0.131
 disagreed+0.097
ere+0.095
ised+0.091
eling+0.091
Neg logit contributions
 globe-0.153
 roof-0.147
tr-0.146
 bulb-0.132
 lamp-0.132
Token were
Token ID547
Feature activation-8.25e+00

Pos logit contributions
ited+0.147
ised+0.124
ere+0.114
eful+0.113
ieved+0.113
Neg logit contributions
 globe-0.072
 roof-0.067
tr-0.062
 fireplace-0.061
 chim-0.057
Token two
Token ID734
Feature activation-5.99e+00

Pos logit contributions
ited+0.158
ised+0.131
ced+0.129
illed+0.124
 disagreed+0.123
Neg logit contributions
 roof-0.156
 sle-0.152
 mast-0.141
tr-0.141
 Dragons-0.140
Token friends
Token ID2460
Feature activation-7.64e+00

Pos logit contributions
ited+0.113
ised+0.102
illed+0.094
ced+0.094
 disagreed+0.092
Neg logit contributions
 sle-0.114
 roof-0.111
 mast-0.104
 Dragons-0.102
tr-0.101
Token,
Token ID11
Feature activation-2.37e+00

Pos logit contributions
ited+0.145
ised+0.116
ced+0.112
 disagreed+0.111
ere+0.108
Neg logit contributions
 roof-0.149
 sle-0.149
 Dragons-0.141
tr-0.137
 mast-0.137
Token Ab
Token ID2275
Feature activation-1.15e+01

Pos logit contributions
ited+0.032
ised+0.028
 disagreed+0.027
 seen+0.026
ere+0.026
Neg logit contributions
 sle-0.054
 roof-0.052
tr-0.052
 mast-0.052
 Dragons-0.051
Tokenig
Token ID328
Feature activation-5.76e+00

Pos logit contributions
ised+0.209
ited+0.198
 disagreed+0.180
ced+0.174
eful+0.166
Neg logit contributions
 sle-0.202
 roof-0.187
tr-0.176
 flaming-0.176
 sy-0.169
Tokenail
Token ID603
Feature activation-7.06e+00

Pos logit contributions
ited+0.115
ised+0.100
 disagreed+0.095
ced+0.092
ere+0.090
Neg logit contributions
 sle-0.102
 roof-0.101
tr-0.099
 globe-0.096
 mast-0.095
Token and
Token ID290
Feature activation-2.73e+00

Pos logit contributions
ited+0.106
ised+0.090
ced+0.081
 disagreed+0.080
ere+0.079
Neg logit contributions
 sle-0.159
 roof-0.157
tr-0.153
 mast-0.152
 globe-0.150
Token Mat
Token ID6550
Feature activation-8.33e+00

Pos logit contributions
ited+0.048
ised+0.042
ced+0.038
ere+0.038
 disagreed+0.037
Neg logit contributions
 sle-0.055
 roof-0.054
 mast-0.053
 Dragons-0.052
tr-0.052
Tokenty
Token ID774
Feature activation-5.35e+00

Pos logit contributions
ited+0.124
ised+0.109
 disagreed+0.099
ced+0.097
ere+0.096
Neg logit contributions
 sle-0.184
 roof-0.181
tr-0.177
 mast-0.176
 globe-0.175
Token was
Token ID373
Feature activation-7.97e+00

Pos logit contributions
ited+0.147
ised+0.125
ere+0.115
eful+0.115
ieved+0.112
Neg logit contributions
 globe-0.080
 roof-0.076
tr-0.072
 fireplace-0.069
 chim-0.066
Token a
Token ID257
Feature activation-5.10e+00

Pos logit contributions
ited+0.148
ised+0.124
 disagreed+0.121
ced+0.114
ere+0.112
Neg logit contributions
 roof-0.155
tr-0.151
 sle-0.146
 globe-0.145
 moon-0.144
Token little
Token ID1310
Feature activation-3.82e+00

Pos logit contributions
ited+0.060
ised+0.050
ced+0.047
ere+0.047
 disagreed+0.046
Neg logit contributions
tr-0.120
 Dragons-0.120
 sle-0.119
bul-0.118
 mast-0.116
Token girl
Token ID2576
Feature activation-6.75e+00

Pos logit contributions
ited+0.067
ised+0.061
ere+0.058
 disagreed+0.056
eful+0.054
Neg logit contributions
tr-0.070
 mant-0.068
 Dragons-0.067
 sle-0.066
 mast-0.066
Token named
Token ID3706
Feature activation-5.29e+00

Pos logit contributions
ited+0.094
ised+0.082
 disagreed+0.074
ced+0.073
ere+0.073
Neg logit contributions
 sle-0.157
 roof-0.153
tr-0.152
 mast-0.150
 Dragons-0.149
Token Ab
Token ID2275
Feature activation-1.11e+01

Pos logit contributions
ited+0.112
ised+0.102
ced+0.095
 disagreed+0.095
ere+0.094
Neg logit contributions
 roof-0.085
 sle-0.084
tr-0.083
 Dragons-0.080
 globe-0.080
Tokenig
Token ID328
Feature activation-4.58e+00

Pos logit contributions
ised+0.217
ited+0.206
 disagreed+0.190
ced+0.182
eful+0.177
Neg logit contributions
 sle-0.187
 roof-0.172
 globe-0.164
tr-0.162
 mast-0.157
Tokenail
Token ID603
Feature activation-6.63e+00

Pos logit contributions
ited+0.086
ised+0.076
 disagreed+0.071
ced+0.070
ere+0.069
Neg logit contributions
 sle-0.086
 roof-0.085
tr-0.084
 globe-0.081
 mast-0.081
Token.
Token ID13
Feature activation+8.38e-01

Pos logit contributions
ited+0.105
ised+0.094
 disagreed+0.088
ere+0.088
illed+0.087
Neg logit contributions
 sle-0.138
 mast-0.134
 Dragons-0.132
 roof-0.131
tr-0.131
Token She
Token ID1375
Feature activation-1.26e+00

Pos logit contributions
 sle+0.012
 roof+0.012
tr+0.012
 mast+0.012
 Dragons+0.012
Neg logit contributions
ited-0.019
ised-0.017
ced-0.016
 disagreed-0.016
ere-0.016
Token was
Token ID373
Feature activation-4.13e+00

Pos logit contributions
ited+0.018
ised+0.015
 disagreed+0.014
ere+0.014
ced+0.013
Neg logit contributions
 sle-0.029
 roof-0.029
tr-0.029
 globe-0.028
 Dragons-0.028
Token moon
Token ID8824
Feature activation+2.15e+00

Pos logit contributions
 sle+0.023
tr+0.023
 Dragons+0.023
 mast+0.021
 roof+0.021
Neg logit contributions
ited-0.038
ised-0.036
ere-0.034
ced-0.033
 disagreed-0.033
Token.
Token ID13
Feature activation+3.16e+00

Pos logit contributions
 sle+0.046
 Dragons+0.044
tr+0.044
 roof+0.043
 mast+0.043
Neg logit contributions
ited-0.032
ised-0.029
 disagreed-0.027
ced-0.026
 seen-0.026
Token<|endoftext|>
Token ID50256
Feature activation-7.60e+00

Pos logit contributions
 sle+0.066
 mast+0.062
 sy+0.060
bul+0.060
 roof+0.059
Neg logit contributions
ited-0.052
ised-0.050
 disagreed-0.047
ere-0.046
illed-0.046
TokenOnce
Token ID7454
Feature activation-8.11e+00

Pos logit contributions
ited+0.187
ised+0.166
 disagreed+0.159
ere+0.153
illed+0.153
Neg logit contributions
 globe-0.100
 sle-0.099
tr-0.096
 roof-0.095
 Dragons-0.093
Token upon
Token ID2402
Feature activation-7.10e+00

Pos logit contributions
ited+0.131
ised+0.107
ced+0.103
pped+0.100
ere+0.097
Neg logit contributions
 roof-0.171
 globe-0.165
 sle-0.163
ometer-0.156
tr-0.155
Token a
Token ID257
Feature activation-1.11e+01

Pos logit contributions
ited+0.142
ised+0.126
ere+0.120
 disagreed+0.114
eful+0.112
Neg logit contributions
 roof-0.127
tr-0.123
 globe-0.121
 Dragons-0.120
 moon-0.114
Token time
Token ID640
Feature activation-7.17e+00

Pos logit contributions
ised+0.174
pped+0.166
ited+0.166
ere+0.154
ced+0.151
Neg logit contributions
tr-0.206
 roof-0.189
 sle-0.178
 globe-0.177
 fireplace-0.169
Token,
Token ID11
Feature activation-7.69e+00

Pos logit contributions
ited+0.131
ised+0.099
ere+0.092
eling+0.091
pped+0.089
Neg logit contributions
 globe-0.148
 roof-0.145
tr-0.139
 Dragons-0.138
 sle-0.137
Token there
Token ID612
Feature activation-6.32e+00

Pos logit contributions
ited+0.154
ere+0.117
eling+0.110
 disagreed+0.110
iley+0.107
Neg logit contributions
 globe-0.151
tr-0.144
 roof-0.144
 bulb-0.133
 We-0.131
Token was
Token ID373
Feature activation-8.70e+00

Pos logit contributions
ited+0.172
ised+0.143
eful+0.133
ere+0.133
ieved+0.131
Neg logit contributions
 globe-0.079
 roof-0.071
tr-0.069
 fireplace-0.062
 Bang-0.061
Token a
Token ID257
Feature activation-5.75e+00

Pos logit contributions
ited+0.165
ised+0.138
 disagreed+0.132
ced+0.127
ere+0.124
Neg logit contributions
 roof-0.167
tr-0.162
 globe-0.156
 moon-0.155
 fireplace-0.154
Token slides
Token ID19392
Feature activation+8.06e-01

Pos logit contributions
 sle+0.084
 roof+0.083
tr+0.081
 globe+0.080
 mast+0.079
Neg logit contributions
ited-0.077
ised-0.069
ere-0.063
illed-0.062
ced-0.062
Token.
Token ID13
Feature activation+4.26e+00

Pos logit contributions
 roof+0.014
 sle+0.014
tr+0.014
 globe+0.014
 mast+0.014
Neg logit contributions
ited-0.016
ised-0.014
ere-0.013
 disagreed-0.013
ced-0.013
Token<|endoftext|>
Token ID50256
Feature activation-7.81e+00

Pos logit contributions
 sle+0.094
 mast+0.092
 globe+0.086
 Dragons+0.085
bul+0.085
Neg logit contributions
ited-0.062
ised-0.059
 disagreed-0.054
 done-0.053
 did-0.053
TokenOnce
Token ID7454
Feature activation-8.03e+00

Pos logit contributions
ited+0.189
ised+0.169
 disagreed+0.161
ere+0.156
illed+0.155
Neg logit contributions
 sle-0.106
 globe-0.104
tr-0.102
 roof-0.102
 Dragons-0.098
Token upon
Token ID2402
Feature activation-7.19e+00

Pos logit contributions
ited+0.122
ised+0.102
ced+0.096
pped+0.091
 disagreed+0.091
Neg logit contributions
 roof-0.181
 sle-0.176
 globe-0.171
tr-0.166
ometer-0.164
Token a
Token ID257
Feature activation-1.10e+01

Pos logit contributions
ited+0.136
ised+0.122
ere+0.114
 disagreed+0.108
eful+0.107
Neg logit contributions
 roof-0.137
tr-0.133
 globe-0.129
 Dragons-0.128
 sle-0.125
Token time
Token ID640
Feature activation-7.21e+00

Pos logit contributions
ised+0.163
ited+0.159
pped+0.148
ced+0.143
eful+0.140
Neg logit contributions
tr-0.222
 roof-0.211
 sle-0.204
 globe-0.197
 fireplace-0.191
Token,
Token ID11
Feature activation-7.78e+00

Pos logit contributions
ited+0.125
ised+0.097
ere+0.088
ced+0.086
eling+0.085
Neg logit contributions
 globe-0.155
 roof-0.155
 sle-0.149
tr-0.146
 Dragons-0.144
Token there
Token ID612
Feature activation-6.33e+00

Pos logit contributions
ited+0.147
ere+0.110
ised+0.107
eling+0.106
 disagreed+0.105
Neg logit contributions
 globe-0.161
 roof-0.159
tr-0.153
 We-0.144
 chim-0.140
Token was
Token ID373
Feature activation-8.84e+00

Pos logit contributions
ited+0.168
ised+0.142
eful+0.134
ere+0.130
ieved+0.129
Neg logit contributions
 globe-0.079
 roof-0.077
tr-0.069
 fireplace-0.066
 chim-0.064
Token a
Token ID257
Feature activation-5.51e+00

Pos logit contributions
ited+0.159
ised+0.133
 disagreed+0.126
ced+0.124
ere+0.120
Neg logit contributions
 roof-0.178
tr-0.173
 sle-0.167
 globe-0.167
 fireplace-0.164
Token two
Token ID734
Feature activation-5.11e+00

Pos logit contributions
ited+0.156
ced+0.134
ised+0.133
illed+0.126
 disagreed+0.125
Neg logit contributions
 sle-0.154
 roof-0.153
 Dragons-0.141
 mast-0.140
tr-0.137
Token little
Token ID1310
Feature activation-2.98e+00

Pos logit contributions
ited+0.096
ised+0.087
ced+0.081
illed+0.080
pped+0.079
Neg logit contributions
 sle-0.098
 roof-0.095
 mast-0.088
 Dragons-0.086
tr-0.086
Token girls
Token ID4813
Feature activation-5.77e+00

Pos logit contributions
ited+0.056
ised+0.051
 disagreed+0.048
ere+0.048
ced+0.047
Neg logit contributions
 sle-0.053
tr-0.052
 roof-0.051
 mast-0.050
 Dragons-0.050
Token,
Token ID11
Feature activation-2.09e+00

Pos logit contributions
ited+0.087
ised+0.071
ced+0.065
 disagreed+0.065
ere+0.064
Neg logit contributions
 sle-0.132
 roof-0.131
tr-0.126
 mast-0.124
 Dragons-0.123
Token named
Token ID3706
Feature activation-5.21e+00

Pos logit contributions
ited+0.033
ised+0.029
 disagreed+0.027
ere+0.027
ced+0.027
Neg logit contributions
 sle-0.044
 roof-0.043
tr-0.042
 mast-0.042
 Dragons-0.042
Token Ab
Token ID2275
Feature activation-1.10e+01

Pos logit contributions
ited+0.124
ised+0.110
ced+0.104
ere+0.101
 disagreed+0.101
Neg logit contributions
 roof-0.077
tr-0.072
 sle-0.069
 globe-0.068
 fireplace-0.065
Tokenbie
Token ID12590
Feature activation-6.81e+00

Pos logit contributions
ised+0.196
ited+0.184
 disagreed+0.165
ced+0.161
eful+0.153
Neg logit contributions
 sle-0.200
 roof-0.187
 globe-0.179
tr-0.175
 flaming-0.173
Token and
Token ID290
Feature activation-2.09e+00

Pos logit contributions
ited+0.091
ised+0.078
 disagreed+0.071
ere+0.070
ced+0.070
Neg logit contributions
 sle-0.161
 roof-0.157
tr-0.155
 mast-0.155
 Dragons-0.154
Token Daisy
Token ID40355
Feature activation-6.97e+00

Pos logit contributions
ited+0.033
ised+0.029
ced+0.026
illed+0.026
ere+0.025
Neg logit contributions
 sle-0.046
 roof-0.045
 mast-0.043
 Dragons-0.043
tr-0.043
Token.
Token ID13
Feature activation+1.86e+00

Pos logit contributions
ited+0.127
ised+0.113
 disagreed+0.105
ere+0.105
ced+0.104
Neg logit contributions
 sle-0.132
 roof-0.129
tr-0.127
 mast-0.125
 Dragons-0.124
Token Ab
Token ID2275
Feature activation-7.75e+00

Pos logit contributions
 sle+0.020
 roof+0.019
tr+0.018
 mast+0.018
 Dragons+0.018
Neg logit contributions
ited-0.049
ised-0.046
 disagreed-0.044
ere-0.044
illed-0.043
Token garden
Token ID11376
Feature activation+1.59e+00

Pos logit contributions
 sle+0.054
tr+0.051
 roof+0.051
 mast+0.050
 Dragons+0.050
Neg logit contributions
ited-0.080
ised-0.074
ere-0.070
 disagreed-0.070
ced-0.069
Token.
Token ID13
Feature activation+2.75e+00

Pos logit contributions
 sle+0.038
 mast+0.036
 roof+0.036
bul+0.035
tr+0.035
Neg logit contributions
ited-0.019
ised-0.017
 disagreed-0.016
ere-0.015
ced-0.015
Token<|endoftext|>
Token ID50256
Feature activation-8.92e+00

Pos logit contributions
 mast+0.072
 sle+0.070
bul+0.065
rier+0.065
 roof+0.064
Neg logit contributions
 done-0.026
 did-0.025
ited-0.024
ere-0.023
 seen-0.023
TokenOnce
Token ID7454
Feature activation-8.17e+00

Pos logit contributions
ited+0.229
ised+0.203
 disagreed+0.196
ere+0.188
illed+0.186
Neg logit contributions
 globe-0.111
 sle-0.108
tr-0.107
 roof-0.104
 Dragons-0.102
Token upon
Token ID2402
Feature activation-6.91e+00

Pos logit contributions
ited+0.133
ised+0.111
ced+0.102
 disagreed+0.098
ere+0.097
Neg logit contributions
 roof-0.176
 sle-0.172
 globe-0.169
tr-0.165
ometer-0.162
Token a
Token ID257
Feature activation-1.09e+01

Pos logit contributions
ited+0.130
ised+0.116
ere+0.108
 disagreed+0.104
eful+0.101
Neg logit contributions
 roof-0.132
tr-0.130
 globe-0.128
 Dragons-0.126
 sle-0.122
Token time
Token ID640
Feature activation-7.48e+00

Pos logit contributions
ised+0.167
ited+0.165
pped+0.154
ced+0.147
 disagreed+0.144
Neg logit contributions
tr-0.216
 roof-0.202
 sle-0.196
 globe-0.194
 fireplace-0.181
Token,
Token ID11
Feature activation-7.99e+00

Pos logit contributions
ited+0.136
ised+0.102
eling+0.097
ere+0.094
pped+0.093
Neg logit contributions
 globe-0.155
 roof-0.150
tr-0.147
 sle-0.145
 Dragons-0.143
Token there
Token ID612
Feature activation-6.20e+00

Pos logit contributions
ited+0.152
ere+0.113
 disagreed+0.111
ised+0.109
eling+0.107
Neg logit contributions
 globe-0.163
tr-0.160
 roof-0.155
 bulb-0.142
ometer-0.142
Token was
Token ID373
Feature activation-8.87e+00

Pos logit contributions
ited+0.156
ised+0.132
ere+0.122
eful+0.121
pped+0.116
Neg logit contributions
 globe-0.092
tr-0.082
 roof-0.081
 Bang-0.073
 fireplace-0.073
Token a
Token ID257
Feature activation-5.72e+00

Pos logit contributions
ited+0.160
ised+0.134
 disagreed+0.127
ced+0.124
ere+0.120
Neg logit contributions
 roof-0.178
tr-0.174
 globe-0.168
 sle-0.167
 Dragons-0.164
Token castle
Token ID16669
Feature activation+1.58e+00

Pos logit contributions
 sle+0.022
 roof+0.022
tr+0.021
 mast+0.021
 Dragons+0.021
Neg logit contributions
ited-0.030
ised-0.027
 disagreed-0.026
ere-0.025
ced-0.025
Token.
Token ID13
Feature activation+4.45e+00

Pos logit contributions
 sle+0.029
 roof+0.028
tr+0.028
 mast+0.027
 globe+0.027
Neg logit contributions
ited-0.030
ised-0.027
 disagreed-0.025
ere-0.025
ced-0.025
Token<|endoftext|>
Token ID50256
Feature activation-7.34e+00

Pos logit contributions
 sle+0.104
 mast+0.103
 sy+0.097
bul+0.095
 globe+0.093
Neg logit contributions
ited-0.056
ised-0.053
 disagreed-0.050
 did-0.050
ere-0.049
TokenOnce
Token ID7454
Feature activation-8.03e+00

Pos logit contributions
ited+0.177
ised+0.157
 disagreed+0.151
illed+0.145
ere+0.144
Neg logit contributions
 sle-0.101
 globe-0.099
tr-0.097
 roof-0.096
 Dragons-0.093
Token upon
Token ID2402
Feature activation-7.26e+00

Pos logit contributions
ited+0.124
ised+0.101
ced+0.096
pped+0.095
ere+0.091
Neg logit contributions
 roof-0.179
 sle-0.173
 globe-0.170
ometer-0.164
tr-0.163
Token a
Token ID257
Feature activation-1.09e+01

Pos logit contributions
ited+0.136
ised+0.120
ere+0.113
 disagreed+0.109
eful+0.106
Neg logit contributions
 roof-0.139
tr-0.136
 globe-0.133
 Dragons-0.132
 sle-0.130
Token time
Token ID640
Feature activation-7.08e+00

Pos logit contributions
ised+0.167
ited+0.162
pped+0.155
 disagreed+0.145
ced+0.144
Neg logit contributions
tr-0.215
 roof-0.200
 sle-0.194
 globe-0.187
 fireplace-0.183
Token,
Token ID11
Feature activation-7.81e+00

Pos logit contributions
ited+0.116
ised+0.088
ere+0.079
pped+0.077
ced+0.077
Neg logit contributions
 globe-0.160
 roof-0.159
 sle-0.153
tr-0.152
 Dragons-0.150
Token there
Token ID612
Feature activation-6.28e+00

Pos logit contributions
ited+0.142
ere+0.105
 disagreed+0.104
ised+0.100
eling+0.097
Neg logit contributions
 globe-0.169
 roof-0.163
tr-0.161
bul-0.147
 bulb-0.147
Token was
Token ID373
Feature activation-8.46e+00

Pos logit contributions
ited+0.162
ised+0.136
eful+0.126
ere+0.124
ieved+0.124
Neg logit contributions
 globe-0.089
 roof-0.086
tr-0.080
 fireplace-0.076
 chim-0.074
Token a
Token ID257
Feature activation-5.33e+00

Pos logit contributions
ited+0.151
ised+0.126
 disagreed+0.121
ced+0.116
ere+0.113
Neg logit contributions
 roof-0.171
tr-0.167
 sle-0.164
 globe-0.161
 fireplace-0.159
Token bit
Token ID1643
Feature activation+2.07e+00

Pos logit contributions
 sle+0.075
 Dragons+0.073
tr+0.073
 roof+0.072
 mast+0.071
Neg logit contributions
ited-0.070
ised-0.062
 disagreed-0.059
 seen-0.057
ced-0.057
Token of
Token ID286
Feature activation+2.00e+00

Pos logit contributions
 sle+0.040
 roof+0.040
 mast+0.039
bul+0.038
 Dragons+0.038
Neg logit contributions
ited-0.035
ised-0.031
ere-0.030
ced-0.029
 disagreed-0.028
Token magic
Token ID5536
Feature activation+3.80e+00

Pos logit contributions
 sle+0.037
 roof+0.036
tr+0.036
 mast+0.035
 Dragons+0.035
Neg logit contributions
ited-0.037
ised-0.034
 disagreed-0.032
ced-0.031
ere-0.031
Token.
Token ID13
Feature activation+4.44e+00

Pos logit contributions
 sle+0.076
 roof+0.073
 Dragons+0.072
tr+0.072
 mast+0.072
Neg logit contributions
ited-0.064
ised-0.058
ced-0.054
 disagreed-0.054
ere-0.054
Token<|endoftext|>
Token ID50256
Feature activation-6.20e+00

Pos logit contributions
 roof+0.106
 sle+0.106
 mast+0.105
tr+0.098
 sy+0.098
Neg logit contributions
ited-0.054
ised-0.053
ere-0.050
illed-0.047
 disagreed-0.046
TokenIsa
Token ID39443
Feature activation-1.09e+01

Pos logit contributions
ited+0.138
ised+0.124
 disagreed+0.117
illed+0.113
ced+0.113
Neg logit contributions
 sle-0.094
 globe-0.092
tr-0.090
 roof-0.090
 Dragons-0.088
Tokenac
Token ID330
Feature activation-6.56e+00

Pos logit contributions
ised+0.191
ited+0.188
 disagreed+0.162
ced+0.151
pped+0.147
Neg logit contributions
tr-0.205
 sle-0.201
 globe-0.194
 roof-0.183
bul-0.183
Token was
Token ID373
Feature activation-6.02e+00

Pos logit contributions
ited+0.113
illed+0.099
ced+0.096
 disagreed+0.093
ised+0.090
Neg logit contributions
tr-0.113
 sle-0.112
 moon-0.109
 unicorn-0.109
bul-0.107
Token preparing
Token ID10629
Feature activation-1.16e+00

Pos logit contributions
ited+0.117
ised+0.106
 disagreed+0.097
ced+0.093
ere+0.093
Neg logit contributions
 sle-0.106
 roof-0.101
 remote-0.099
tr-0.097
bul-0.096
Token breakfast
Token ID12607
Feature activation+8.47e-01

Pos logit contributions
ited+0.018
ised+0.015
 disagreed+0.014
ere+0.014
ced+0.014
Neg logit contributions
 sle-0.026
 roof-0.025
tr-0.024
 mast-0.024
 globe-0.024
Token in
Token ID287
Feature activation-1.27e+00

Pos logit contributions
 sle+0.017
 roof+0.017
tr+0.016
 mast+0.016
 Dragons+0.016
Neg logit contributions
ited-0.014
ised-0.013
 disagreed-0.012
ere-0.012
ced-0.012
Token book
Token ID1492
Feature activation+4.28e+00

Pos logit contributions
 sle+0.050
 mast+0.049
bul+0.047
 Dragons+0.047
 roof+0.047
Neg logit contributions
ited-0.060
ised-0.052
ced-0.051
ere-0.051
 disagreed-0.051
Token.
Token ID13
Feature activation+3.67e+00

Pos logit contributions
 roof+0.076
 sle+0.076
tr+0.075
 mast+0.073
 globe+0.073
Neg logit contributions
ited-0.082
ised-0.073
ere-0.068
 disagreed-0.068
ced-0.067
Token<|endoftext|>
Token ID50256
Feature activation-8.69e+00

Pos logit contributions
 sle+0.081
 mast+0.077
 sy+0.076
bul+0.074
 remote+0.072
Neg logit contributions
ised-0.053
ere-0.051
 disagreed-0.050
illed-0.049
ited-0.047
TokenOnce
Token ID7454
Feature activation-8.30e+00

Pos logit contributions
ited+0.230
ised+0.203
 disagreed+0.196
ere+0.187
illed+0.186
Neg logit contributions
 globe-0.105
tr-0.099
 sle-0.099
 roof-0.096
 Dragons-0.094
Token upon
Token ID2402
Feature activation-7.22e+00

Pos logit contributions
ited+0.139
ised+0.114
ced+0.108
pped+0.103
 disagreed+0.101
Neg logit contributions
 roof-0.174
 sle-0.169
 globe-0.169
tr-0.162
ometer-0.160
Token a
Token ID257
Feature activation-1.09e+01

Pos logit contributions
ited+0.140
ised+0.124
ere+0.116
 disagreed+0.112
eful+0.109
Neg logit contributions
 roof-0.134
tr-0.132
 globe-0.130
 Dragons-0.127
 sle-0.123
Token time
Token ID640
Feature activation-7.55e+00

Pos logit contributions
ited+0.166
ised+0.160
pped+0.152
 disagreed+0.143
ced+0.143
Neg logit contributions
tr-0.216
 roof-0.203
 globe-0.197
 sle-0.195
 fireplace-0.184
Token,
Token ID11
Feature activation-8.47e+00

Pos logit contributions
ited+0.132
ised+0.098
eling+0.091
pped+0.090
ere+0.090
Neg logit contributions
 globe-0.161
 roof-0.157
tr-0.156
 Dragons-0.151
 sle-0.151
Token there
Token ID612
Feature activation-6.50e+00

Pos logit contributions
ited+0.172
 disagreed+0.129
ere+0.128
ised+0.125
eling+0.121
Neg logit contributions
tr-0.163
 globe-0.159
 roof-0.154
 bulb-0.140
ometer-0.140
Token was
Token ID373
Feature activation-9.01e+00

Pos logit contributions
ited+0.164
ised+0.138
ere+0.127
eful+0.126
pped+0.124
Neg logit contributions
 globe-0.096
tr-0.088
 roof-0.087
 Bang-0.077
 fireplace-0.077
Token a
Token ID257
Feature activation-5.75e+00

Pos logit contributions
ited+0.168
ised+0.139
 disagreed+0.133
ced+0.129
ere+0.125
Neg logit contributions
 roof-0.177
tr-0.175
 globe-0.168
 sle-0.165
 moon-0.165
Token soup
Token ID17141
Feature activation+5.48e+00

Pos logit contributions
 sle+0.038
tr+0.038
 Dragons+0.036
bul+0.036
 mast+0.036
Neg logit contributions
ited-0.053
 done-0.048
ised-0.048
ced-0.047
 disagreed-0.046
Token.
Token ID13
Feature activation+3.26e+00

Pos logit contributions
 sle+0.116
 mast+0.110
 Dragons+0.110
 roof+0.110
tr+0.108
Neg logit contributions
ited-0.088
ised-0.077
 disagreed-0.075
ced-0.072
 seen-0.070
Token<|endoftext|>
Token ID50256
Feature activation-9.22e+00

Pos logit contributions
 sle+0.073
 mast+0.072
bul+0.070
 sy+0.069
 globe+0.068
Neg logit contributions
ited-0.044
ised-0.042
 disagreed-0.041
 done-0.041
 seen-0.041
TokenOnce
Token ID7454
Feature activation-8.29e+00

Pos logit contributions
ited+0.235
ised+0.211
 disagreed+0.201
ere+0.194
illed+0.192
Neg logit contributions
 globe-0.115
 sle-0.112
tr-0.111
 Dragons-0.107
 roof-0.107
Token upon
Token ID2402
Feature activation-7.38e+00

Pos logit contributions
ited+0.131
ised+0.110
ced+0.101
pped+0.096
ere+0.096
Neg logit contributions
 roof-0.179
 sle-0.177
 globe-0.175
tr-0.169
ometer-0.167
Token a
Token ID257
Feature activation-1.09e+01

Pos logit contributions
ited+0.131
ised+0.116
ere+0.107
 disagreed+0.104
eful+0.099
Neg logit contributions
 roof-0.148
tr-0.145
 globe-0.144
 Dragons-0.144
 sle-0.141
Token time
Token ID640
Feature activation-7.53e+00

Pos logit contributions
ited+0.160
ised+0.159
pped+0.146
ced+0.141
 disagreed+0.137
Neg logit contributions
tr-0.224
 roof-0.211
 sle-0.206
 globe-0.203
 fireplace-0.192
Token,
Token ID11
Feature activation-8.33e+00

Pos logit contributions
ited+0.124
ised+0.094
ere+0.084
ced+0.082
 disagreed+0.081
Neg logit contributions
 globe-0.169
 roof-0.165
tr-0.164
 sle-0.162
 Dragons-0.160
Token there
Token ID612
Feature activation-6.55e+00

Pos logit contributions
ited+0.160
ere+0.119
ised+0.116
 disagreed+0.115
iley+0.110
Neg logit contributions
 globe-0.169
tr-0.166
 roof-0.161
 Dragons-0.151
 Bang-0.148
Token was
Token ID373
Feature activation-9.32e+00

Pos logit contributions
ited+0.160
ised+0.137
ere+0.124
eful+0.123
ieved+0.121
Neg logit contributions
 globe-0.100
tr-0.092
 roof-0.090
 Bang-0.082
 fireplace-0.081
Token a
Token ID257
Feature activation-6.03e+00

Pos logit contributions
ited+0.163
ised+0.134
 disagreed+0.127
ced+0.126
ere+0.120
Neg logit contributions
 roof-0.193
tr-0.188
 globe-0.184
 sle-0.184
 Dragons-0.181
Token Daddy
Token ID26926
Feature activation-2.73e+00

Pos logit contributions
tr+0.051
 sle+0.048
 Dragons+0.048
 globe+0.047
 sy+0.047
Neg logit contributions
ited-0.032
 disagreed-0.031
ised-0.030
 did-0.030
 enough-0.028
Token!"
Token ID2474
Feature activation+1.41e+00

Pos logit contributions
ited+0.046
ised+0.041
 disagreed+0.038
ere+0.037
ced+0.037
Neg logit contributions
 sle-0.055
 roof-0.054
tr-0.053
 mast-0.053
 globe-0.052
Token<|endoftext|>
Token ID50256
Feature activation-7.24e+00

Pos logit contributions
 sle+0.036
 mast+0.034
tr+0.033
 roof+0.033
 remote+0.033
Neg logit contributions
ised-0.015
,-0.013
 disagreed-0.013
illed-0.013
ited-0.013
TokenOnce
Token ID7454
Feature activation-7.38e+00

Pos logit contributions
ited+0.174
ised+0.155
 disagreed+0.149
illed+0.143
ere+0.142
Neg logit contributions
 sle-0.098
 globe-0.097
tr-0.093
 roof-0.092
 Dragons-0.092
Token upon
Token ID2402
Feature activation-7.01e+00

Pos logit contributions
ited+0.114
ised+0.094
ced+0.090
pped+0.086
ere+0.084
Neg logit contributions
 roof-0.162
 sle-0.158
 globe-0.155
ometer-0.149
 Dragons-0.148
Token a
Token ID257
Feature activation-1.09e+01

Pos logit contributions
ited+0.126
ised+0.112
ere+0.104
 disagreed+0.101
eful+0.097
Neg logit contributions
 roof-0.138
tr-0.135
 globe-0.133
 Dragons-0.133
 sle-0.132
Token time
Token ID640
Feature activation-7.22e+00

Pos logit contributions
ised+0.169
ited+0.156
pped+0.153
ced+0.145
eling+0.142
Neg logit contributions
tr-0.213
 roof-0.195
 sle-0.192
 globe-0.187
 fireplace-0.177
Token,
Token ID11
Feature activation-7.09e+00

Pos logit contributions
ited+0.120
ised+0.093
ere+0.083
ced+0.081
pped+0.080
Neg logit contributions
 globe-0.160
 roof-0.157
 sle-0.153
 Dragons-0.150
tr-0.149
Token there
Token ID612
Feature activation-6.38e+00

Pos logit contributions
ited+0.135
ere+0.102
 disagreed+0.098
ised+0.095
ced+0.095
Neg logit contributions
 globe-0.146
 roof-0.137
tr-0.132
 We-0.129
 bulb-0.127
Token was
Token ID373
Feature activation-8.58e+00

Pos logit contributions
ited+0.168
ised+0.141
eful+0.131
ere+0.129
ieved+0.129
Neg logit contributions
 globe-0.084
 roof-0.078
tr-0.072
 fireplace-0.070
 chim-0.067
Token a
Token ID257
Feature activation-5.47e+00

Pos logit contributions
ited+0.146
ised+0.122
 disagreed+0.116
ced+0.114
ere+0.108
Neg logit contributions
 roof-0.178
 sle-0.173
tr-0.172
 globe-0.170
 Dragons-0.166
Token two
Token ID734
Feature activation-6.11e+00

Pos logit contributions
ited+0.158
ised+0.135
ced+0.133
illed+0.127
 disagreed+0.127
Neg logit contributions
 sle-0.156
 roof-0.156
 Dragons-0.144
 mast-0.143
tr-0.143
Token friends
Token ID2460
Feature activation-7.78e+00

Pos logit contributions
ited+0.116
ised+0.106
illed+0.098
ced+0.097
pped+0.096
Neg logit contributions
 sle-0.116
 roof-0.112
 mast-0.105
 Dragons-0.103
 sy-0.100
Token,
Token ID11
Feature activation-2.73e+00

Pos logit contributions
ited+0.147
ised+0.119
ced+0.115
 disagreed+0.115
ere+0.111
Neg logit contributions
 sle-0.152
 roof-0.151
 Dragons-0.143
 mast-0.140
bul-0.138
Token Peter
Token ID5613
Feature activation-8.45e+00

Pos logit contributions
ited+0.034
 seen+0.030
 disagreed+0.030
ised+0.029
ere+0.028
Neg logit contributions
 sle-0.063
tr-0.061
 roof-0.061
 mast-0.061
 Dragons-0.060
Token and
Token ID290
Feature activation-1.30e+00

Pos logit contributions
ited+0.142
ised+0.126
ced+0.112
 disagreed+0.112
ere+0.110
Neg logit contributions
 sle-0.179
 roof-0.169
 globe-0.167
 mast-0.165
tr-0.164
Token Ab
Token ID2275
Feature activation-1.09e+01

Pos logit contributions
ited+0.022
ised+0.019
 disagreed+0.019
ere+0.018
ced+0.018
Neg logit contributions
 sle-0.026
 roof-0.025
tr-0.025
 mast-0.025
 globe-0.024
Tokenig
Token ID328
Feature activation-4.81e+00

Pos logit contributions
ised+0.196
ited+0.192
 disagreed+0.168
ced+0.165
eful+0.158
Neg logit contributions
 sle-0.202
 roof-0.191
tr-0.181
 mast-0.178
bul-0.175
Tokenail
Token ID603
Feature activation-5.93e+00

Pos logit contributions
ited+0.096
ised+0.083
 disagreed+0.078
ced+0.077
ere+0.075
Neg logit contributions
 roof-0.086
 sle-0.085
tr-0.083
 mast-0.081
 globe-0.080
Token.
Token ID13
Feature activation+1.49e+00

Pos logit contributions
ited+0.111
ised+0.097
ced+0.089
ere+0.089
 disagreed+0.087
Neg logit contributions
 roof-0.112
 sle-0.110
tr-0.106
 mast-0.105
 globe-0.103
Token They
Token ID1119
Feature activation-2.88e-01

Pos logit contributions
 sle+0.023
 sy+0.023
 mast+0.022
tr+0.022
 cheek+0.021
Neg logit contributions
ised-0.029
illed-0.028
ere-0.028
ited-0.027
 disagreed-0.027
Token were
Token ID547
Feature activation-3.46e+00

Pos logit contributions
ited+0.005
ised+0.004
ced+0.004
illed+0.004
ere+0.004
Neg logit contributions
 roof-0.006
 globe-0.006
 mast-0.006
tr-0.006
 Dragons-0.005
Token there
Token ID612
Feature activation-5.72e+00

Pos logit contributions
ited+0.130
ere+0.094
 disagreed+0.091
ised+0.090
eling+0.090
Neg logit contributions
 globe-0.149
 roof-0.143
tr-0.143
 bulb-0.131
 We-0.131
Token was
Token ID373
Feature activation-8.04e+00

Pos logit contributions
ited+0.147
ised+0.126
eful+0.115
ere+0.114
ieved+0.114
Neg logit contributions
 globe-0.078
 roof-0.075
tr-0.069
 fireplace-0.067
 chim-0.066
Token a
Token ID257
Feature activation-5.10e+00

Pos logit contributions
ited+0.153
ised+0.130
 disagreed+0.124
ced+0.118
ere+0.116
Neg logit contributions
 roof-0.151
tr-0.149
 sle-0.143
 fireplace-0.142
 moon-0.142
Token girl
Token ID2576
Feature activation-6.58e+00

Pos logit contributions
ited+0.061
ised+0.050
ced+0.048
ere+0.047
 disagreed+0.047
Neg logit contributions
 Dragons-0.120
tr-0.119
 sle-0.119
bul-0.117
 mast-0.116
Token called
Token ID1444
Feature activation-5.18e+00

Pos logit contributions
ited+0.092
ised+0.080
 disagreed+0.073
ced+0.072
ere+0.071
Neg logit contributions
 sle-0.153
 roof-0.149
tr-0.147
 mast-0.145
 Dragons-0.145
Token Ab
Token ID2275
Feature activation-1.08e+01

Pos logit contributions
ited+0.096
ised+0.086
 disagreed+0.079
ced+0.079
ere+0.079
Neg logit contributions
 sle-0.097
 roof-0.095
tr-0.094
 mast-0.092
 Dragons-0.092
Tokenbie
Token ID12590
Feature activation-6.30e+00

Pos logit contributions
ised+0.201
ited+0.190
 disagreed+0.176
ced+0.168
eful+0.162
Neg logit contributions
 sle-0.189
 roof-0.176
 globe-0.170
tr-0.169
bul-0.162
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
ited+0.072
illed+0.061
 disagreed+0.061
ere+0.060
ised+0.060
Neg logit contributions
 mast-0.149
 sle-0.145
 Dragons-0.144
 roof-0.139
 sy-0.138
Token She
Token ID1375
Feature activation-1.14e+00

Pos logit contributions
 sle+0.017
 roof+0.017
tr+0.016
 mast+0.016
 Dragons+0.016
Neg logit contributions
ited-0.025
ised-0.022
 disagreed-0.021
ced-0.021
ere-0.021
Token wanted
Token ID2227
Feature activation-3.24e+00

Pos logit contributions
ited+0.018
ised+0.016
illed+0.014
ere+0.014
ced+0.014
Neg logit contributions
tr-0.024
 roof-0.024
 sle-0.024
 globe-0.024
 Dragons-0.023
Token to
Token ID284
Feature activation-1.48e+00

Pos logit contributions
ited+0.054
ised+0.048
 disagreed+0.044
ere+0.043
ced+0.043
Neg logit contributions
 sle-0.067
 roof-0.066
tr-0.065
 mast-0.064
 Dragons-0.064
TokenOnce
Token ID7454
Feature activation-8.40e+00

Pos logit contributions
ited+0.140
ised+0.125
 disagreed+0.119
ced+0.115
illed+0.115
Neg logit contributions
 sle-0.092
 globe-0.089
tr-0.089
 Dragons-0.086
 roof-0.086
Token upon
Token ID2402
Feature activation-6.93e+00

Pos logit contributions
ited+0.138
ised+0.113
eful+0.104
ced+0.104
 disagreed+0.103
Neg logit contributions
 roof-0.178
 sle-0.171
 globe-0.170
ometer-0.163
tr-0.162
Token a
Token ID257
Feature activation-9.83e+00

Pos logit contributions
ited+0.125
ised+0.109
 disagreed+0.101
ere+0.100
ced+0.095
Neg logit contributions
 roof-0.138
tr-0.135
 Dragons-0.133
 sle-0.132
 globe-0.130
Token time
Token ID640
Feature activation-6.76e+00

Pos logit contributions
ited+0.129
ised+0.128
 disagreed+0.114
pped+0.110
ced+0.110
Neg logit contributions
 sle-0.219
 roof-0.218
tr-0.212
 globe-0.199
 mast-0.199
Token,
Token ID11
Feature activation-7.66e+00

Pos logit contributions
ited+0.102
ised+0.075
 disagreed+0.068
ere+0.067
ced+0.065
Neg logit contributions
 roof-0.162
 globe-0.160
 sle-0.157
tr-0.156
 Dragons-0.151
Token Jack
Token ID3619
Feature activation-1.08e+01

Pos logit contributions
ited+0.131
 disagreed+0.097
ere+0.087
ised+0.087
eling+0.083
Neg logit contributions
 globe-0.176
tr-0.173
 roof-0.170
 sle-0.159
bul-0.159
Token and
Token ID290
Feature activation-4.03e+00

Pos logit contributions
ised+0.209
ited+0.202
ieved+0.177
eful+0.173
 disagreed+0.170
Neg logit contributions
 sle-0.182
 globe-0.180
 roof-0.179
tr-0.176
 moon-0.175
Token his
Token ID465
Feature activation-5.64e+00

Pos logit contributions
ited+0.063
ised+0.056
 disagreed+0.053
ere+0.052
ced+0.051
Neg logit contributions
 sle-0.085
 roof-0.083
tr-0.083
 mast-0.082
 Dragons-0.081
Token sister
Token ID6621
Feature activation-5.62e+00

Pos logit contributions
ited+0.080
ised+0.075
ere+0.074
ced+0.070
 disagreed+0.069
Neg logit contributions
 sle-0.116
bul-0.114
 mast-0.113
 Dragons-0.113
 roof-0.112
Token Mia
Token ID32189
Feature activation-7.13e+00

Pos logit contributions
ited+0.089
ised+0.081
illed+0.069
ced+0.067
eful+0.065
Neg logit contributions
 roof-0.125
 sle-0.123
tr-0.121
 fireplace-0.116
 sy-0.115
Token were
Token ID547
Feature activation-8.50e+00

Pos logit contributions
ited+0.123
ised+0.118
ced+0.099
illed+0.099
eful+0.096
Neg logit contributions
 roof-0.140
 sle-0.136
 globe-0.132
tr-0.132
 mast-0.127
Tokenaser
Token ID6005
Feature activation+3.67e+00

Pos logit contributions
 sle+0.110
 mast+0.106
 roof+0.105
 fireplace+0.100
 mant+0.098
Neg logit contributions
ited-0.072
ere-0.068
ised-0.067
pped-0.066
ced-0.065
Token.
Token ID13
Feature activation+3.82e+00

Pos logit contributions
 sle+0.086
 mast+0.079
 Dragons+0.079
tr+0.079
 roof+0.078
Neg logit contributions
ited-0.049
ised-0.043
 disagreed-0.042
ced-0.042
 seen-0.041
Token<|endoftext|>
Token ID50256
Feature activation-7.69e+00

Pos logit contributions
 sle+0.091
 mast+0.088
 sy+0.087
bul+0.084
rier+0.081
Neg logit contributions
ised-0.047
 did-0.045
 done-0.044
ere-0.044
 seen-0.042
TokenOnce
Token ID7454
Feature activation-8.06e+00

Pos logit contributions
ited+0.188
ised+0.166
 disagreed+0.160
ere+0.153
illed+0.153
Neg logit contributions
 sle-0.103
 globe-0.102
tr-0.100
 roof-0.099
 Dragons-0.097
Token upon
Token ID2402
Feature activation-6.72e+00

Pos logit contributions
ited+0.136
ised+0.112
ced+0.105
pped+0.100
 disagreed+0.099
Neg logit contributions
 roof-0.169
 sle-0.164
 globe-0.164
tr-0.157
ometer-0.156
Token a
Token ID257
Feature activation-1.08e+01

Pos logit contributions
ited+0.131
ised+0.117
ere+0.109
 disagreed+0.104
eful+0.101
Neg logit contributions
 roof-0.124
tr-0.123
 globe-0.121
 Dragons-0.119
 sle-0.114
Token time
Token ID640
Feature activation-7.41e+00

Pos logit contributions
ised+0.163
ited+0.162
pped+0.150
ced+0.141
 disagreed+0.140
Neg logit contributions
tr-0.220
 roof-0.204
 globe-0.197
 sle-0.195
 fireplace-0.185
Token,
Token ID11
Feature activation-8.09e+00

Pos logit contributions
ited+0.127
ised+0.092
ere+0.086
 disagreed+0.084
pped+0.084
Neg logit contributions
 globe-0.161
 roof-0.159
tr-0.155
 Dragons-0.153
 sle-0.152
Token a
Token ID257
Feature activation-8.10e+00

Pos logit contributions
ited+0.160
ere+0.118
 disagreed+0.118
ised+0.116
ced+0.111
Neg logit contributions
 globe-0.159
tr-0.158
 roof-0.153
 Dragons-0.141
 bulb-0.141
Token friendly
Token ID8030
Feature activation-5.29e+00

Pos logit contributions
ited+0.124
ised+0.110
ere+0.102
ced+0.101
 disagreed+0.101
Neg logit contributions
 sle-0.174
 roof-0.168
 mast-0.167
tr-0.166
 Dragons-0.166
Token helicopter
Token ID15392
Feature activation-1.85e+00

Pos logit contributions
ited+0.100
ere+0.094
ised+0.093
 disagreed+0.089
ced+0.086
Neg logit contributions
 sle-0.083
tr-0.083
 mast-0.080
 Dragons-0.080
 roof-0.079
Token safe
Token ID3338
Feature activation+2.31e+00

Pos logit contributions
 sle+0.090
 roof+0.086
tr+0.085
 Dragons+0.085
 mast+0.085
Neg logit contributions
ited-0.090
ised-0.082
 disagreed-0.077
ced-0.076
ere-0.075
Token.
Token ID13
Feature activation+3.87e+00

Pos logit contributions
 sle+0.047
 roof+0.046
 Dragons+0.045
tr+0.045
 mast+0.045
Neg logit contributions
ited-0.038
ised-0.034
 disagreed-0.031
ced-0.031
ere-0.031
Token<|endoftext|>
Token ID50256
Feature activation-7.24e+00

Pos logit contributions
 mast+0.097
 sle+0.096
 remote+0.088
 globe+0.088
 roof+0.087
Neg logit contributions
ised-0.041
ited-0.039
ere-0.038
 did-0.036
illed-0.036
TokenOnce
Token ID7454
Feature activation-7.85e+00

Pos logit contributions
ited+0.172
ised+0.154
 disagreed+0.147
illed+0.141
ere+0.141
Neg logit contributions
 sle-0.101
 globe-0.099
tr-0.097
 roof-0.096
 Dragons-0.094
Token upon
Token ID2402
Feature activation-7.01e+00

Pos logit contributions
ited+0.122
ised+0.101
ced+0.095
pped+0.092
 disagreed+0.089
Neg logit contributions
 roof-0.173
 sle-0.167
 globe-0.165
ometer-0.159
tr-0.158
Token a
Token ID257
Feature activation-1.08e+01

Pos logit contributions
ited+0.131
ised+0.117
ere+0.110
 disagreed+0.106
eful+0.103
Neg logit contributions
 roof-0.133
tr-0.130
 globe-0.127
 Dragons-0.127
 sle-0.124
Token time
Token ID640
Feature activation-7.40e+00

Pos logit contributions
ised+0.167
ited+0.157
pped+0.153
ced+0.144
ere+0.143
Neg logit contributions
tr-0.212
 roof-0.197
 sle-0.191
 globe-0.187
 fireplace-0.177
Token,
Token ID11
Feature activation-7.67e+00

Pos logit contributions
ited+0.123
ised+0.094
ere+0.083
 disagreed+0.083
ced+0.083
Neg logit contributions
 globe-0.165
 roof-0.162
 sle-0.159
tr-0.156
 Dragons-0.155
Token there
Token ID612
Feature activation-6.20e+00

Pos logit contributions
ited+0.137
ere+0.102
 disagreed+0.102
ised+0.100
eling+0.094
Neg logit contributions
 globe-0.166
 roof-0.160
tr-0.160
bul-0.146
 We-0.145
Token was
Token ID373
Feature activation-8.22e+00

Pos logit contributions
ited+0.157
ised+0.134
ere+0.122
eful+0.122
ieved+0.119
Neg logit contributions
 globe-0.089
 roof-0.084
tr-0.080
 fireplace-0.075
 chim-0.072
Token a
Token ID257
Feature activation-5.23e+00

Pos logit contributions
ited+0.147
ised+0.123
 disagreed+0.117
ced+0.114
ere+0.109
Neg logit contributions
 roof-0.166
tr-0.162
 sle-0.158
 globe-0.157
 fireplace-0.154
Token world
Token ID995
Feature activation+1.65e+00

Pos logit contributions
 sle+0.023
 roof+0.022
 globe+0.022
tr+0.021
 mast+0.021
Neg logit contributions
ited-0.027
ised-0.024
 disagreed-0.023
illed-0.023
ced-0.022
Token.
Token ID13
Feature activation+4.31e+00

Pos logit contributions
 sle+0.033
 roof+0.032
 mast+0.032
tr+0.032
 Dragons+0.031
Neg logit contributions
ited-0.028
ised-0.025
 disagreed-0.023
ere-0.023
ced-0.023
Token<|endoftext|>
Token ID50256
Feature activation-8.12e+00

Pos logit contributions
 mast+0.092
 sle+0.090
 mant+0.083
 lantern+0.081
rier+0.081
Neg logit contributions
ere-0.053
 did-0.053
ised-0.052
,-0.051
 enough-0.051
TokenOnce
Token ID7454
Feature activation-7.73e+00

Pos logit contributions
ited+0.203
ised+0.182
 disagreed+0.175
ere+0.168
illed+0.167
Neg logit contributions
 sle-0.104
 globe-0.103
tr-0.099
 roof-0.099
 Dragons-0.096
Token upon
Token ID2402
Feature activation-6.85e+00

Pos logit contributions
ited+0.113
ised+0.093
ced+0.086
 disagreed+0.084
eful+0.081
Neg logit contributions
 roof-0.178
 sle-0.175
 globe-0.170
tr-0.168
ometer-0.163
Token a
Token ID257
Feature activation-1.08e+01

Pos logit contributions
ited+0.128
ised+0.114
ere+0.107
 disagreed+0.103
eful+0.100
Neg logit contributions
 roof-0.133
tr-0.131
 globe-0.126
 Dragons-0.124
 sle-0.123
Token time
Token ID640
Feature activation-7.14e+00

Pos logit contributions
ised+0.159
ited+0.149
pped+0.141
ced+0.138
 disagreed+0.135
Neg logit contributions
tr-0.222
 roof-0.211
 sle-0.205
 globe-0.199
 fireplace-0.187
Token,
Token ID11
Feature activation-7.21e+00

Pos logit contributions
ited+0.125
ised+0.093
ere+0.085
ced+0.084
eling+0.084
Neg logit contributions
 globe-0.154
 roof-0.153
 sle-0.146
tr-0.145
 Dragons-0.143
Token there
Token ID612
Feature activation-5.96e+00

Pos logit contributions
ited+0.122
ere+0.091
 disagreed+0.090
ised+0.088
eful+0.081
Neg logit contributions
 globe-0.162
 roof-0.161
tr-0.157
 Dragons-0.144
 fireplace-0.143
Token was
Token ID373
Feature activation-7.79e+00

Pos logit contributions
ited+0.150
ised+0.125
eful+0.116
ere+0.116
 disagreed+0.113
Neg logit contributions
 globe-0.087
 roof-0.083
tr-0.079
 fireplace-0.074
 chim-0.071
Token a
Token ID257
Feature activation-5.19e+00

Pos logit contributions
ited+0.131
ised+0.111
 disagreed+0.105
ced+0.102
ere+0.099
Neg logit contributions
 roof-0.164
tr-0.160
 sle-0.158
 globe-0.155
 fireplace-0.152
Token finally
Token ID3443
Feature activation-6.05e-01

Pos logit contributions
 roof+0.013
 mast+0.013
tr+0.013
 globe+0.012
 sle+0.012
Neg logit contributions
ited-0.009
ised-0.009
ere-0.008
illed-0.008
eful-0.007
Token got
Token ID1392
Feature activation-2.02e+00

Pos logit contributions
ited+0.009
ised+0.008
ere+0.007
illed+0.007
eful+0.007
Neg logit contributions
 roof-0.013
 mast-0.013
 globe-0.012
 flow-0.012
 sle-0.012
Token the
Token ID262
Feature activation+2.27e+00

Pos logit contributions
ited+0.032
ised+0.028
ere+0.025
ced+0.025
illed+0.025
Neg logit contributions
 sle-0.043
 roof-0.043
 mast-0.042
tr-0.041
 Dragons-0.041
Token carrots
Token ID34397
Feature activation+3.76e+00

Pos logit contributions
 sle+0.033
tr+0.033
 roof+0.032
 Dragons+0.032
 mast+0.031
Neg logit contributions
ited-0.051
ised-0.046
 disagreed-0.044
ere-0.043
ced-0.043
Token out
Token ID503
Feature activation+7.14e-01

Pos logit contributions
 sle+0.081
 Dragons+0.079
tr+0.078
 roof+0.076
 mast+0.076
Neg logit contributions
ited-0.058
ised-0.051
 disagreed-0.049
ced-0.047
 seen-0.047
Token.
Token ID13
Feature activation+4.63e+00

Pos logit contributions
 sle+0.016
 roof+0.015
tr+0.015
 mast+0.015
 Dragons+0.015
Neg logit contributions
ited-0.011
ised-0.010
 disagreed-0.009
ere-0.009
ced-0.009
Token "
Token ID366
Feature activation+1.83e+00

Pos logit contributions
 sle+0.106
 mast+0.100
 globe+0.100
 roof+0.099
 lantern+0.095
Neg logit contributions
ised-0.065
ere-0.048
 there-0.045
,-0.045
pped-0.043
TokenThank
Token ID10449
Feature activation-1.51e+00

Pos logit contributions
 sle+0.037
 roof+0.036
 mast+0.036
 Dragons+0.035
ometer+0.034
Neg logit contributions
ited-0.034
ised-0.030
pped-0.028
ced-0.028
ere-0.027
Token you
Token ID345
Feature activation-2.12e+00

Pos logit contributions
ited+0.026
ised+0.026
ere+0.024
ced+0.023
pped+0.022
Neg logit contributions
 roof-0.029
 sle-0.028
tr-0.028
 mast-0.028
 globe-0.027
Token,
Token ID11
Feature activation+2.61e+00

Pos logit contributions
ited+0.033
ised+0.028
 disagreed+0.025
ced+0.025
ere+0.025
Neg logit contributions
 sle-0.048
 roof-0.046
tr-0.045
 mast-0.045
 Dragons-0.045
Token Squirrel
Token ID44259
Feature activation-3.35e+00

Pos logit contributions
tr+0.047
 mast+0.044
 Dragons+0.044
 roof+0.044
 globe+0.044
Neg logit contributions
ised-0.043
ited-0.039
 did-0.039
ere-0.039
 there-0.038
Token
Token ID198
Feature activation-2.79e+00

Pos logit contributions
 sle+0.096
bul+0.092
 roof+0.090
 mast+0.089
 globe+0.088
Neg logit contributions
ised-0.059
illed-0.056
ited-0.055
 seen-0.053
 disagreed-0.053
Token
Token ID198
Feature activation-2.47e+00

Pos logit contributions
ited+0.053
ised+0.048
 disagreed+0.045
ere+0.045
ced+0.045
Neg logit contributions
 sle-0.050
 roof-0.049
tr-0.048
 mast-0.048
 Dragons-0.047
TokenBut
Token ID1537
Feature activation+1.08e+00

Pos logit contributions
ited+0.048
ised+0.042
 disagreed+0.041
ere+0.040
illed+0.038
Neg logit contributions
 roof-0.043
tr-0.043
 sle-0.042
 mast-0.042
 globe-0.042
Token nothing
Token ID2147
Feature activation-2.93e-01

Pos logit contributions
 sle+0.022
 roof+0.021
tr+0.021
 mast+0.021
 globe+0.021
Neg logit contributions
ited-0.019
ised-0.017
ere-0.016
illed-0.015
 disagreed-0.015
Token happens
Token ID4325
Feature activation+2.40e-01

Pos logit contributions
ited+0.006
ised+0.006
ere+0.005
illed+0.005
 disagreed+0.005
Neg logit contributions
 sle-0.005
 roof-0.005
tr-0.004
 globe-0.004
 mast-0.004
Token.
Token ID13
Feature activation+5.11e+00

Pos logit contributions
 roof+0.004
 sle+0.004
tr+0.004
 globe+0.004
 Dragons+0.003
Neg logit contributions
ited-0.005
ised-0.005
ere-0.004
 disagreed-0.004
ced-0.004
Token They
Token ID1119
Feature activation+6.63e-01

Pos logit contributions
 sle+0.097
 globe+0.089
 roof+0.089
 mast+0.088
bul+0.088
Neg logit contributions
ited-0.085
ised-0.085
illed-0.080
 disagreed-0.078
ere-0.077
Token do
Token ID466
Feature activation-2.38e+00

Pos logit contributions
 roof+0.011
 sle+0.011
 mast+0.011
tr+0.010
 Dragons+0.010
Neg logit contributions
ited-0.013
ised-0.013
ere-0.012
illed-0.012
ced-0.011
Token not
Token ID407
Feature activation-2.44e+00

Pos logit contributions
ited+0.055
ised+0.048
ere+0.047
illed+0.045
eful+0.044
Neg logit contributions
 roof-0.036
tr-0.035
 globe-0.034
 Dragons-0.034
 mast-0.033
Token fly
Token ID6129
Feature activation-4.98e-01

Pos logit contributions
ited+0.036
 seen+0.031
 disagreed+0.030
 done+0.029
ised+0.028
Neg logit contributions
 sle-0.056
tr-0.054
 globe-0.053
bul-0.052
 Dragons-0.052
Token or
Token ID393
Feature activation+3.01e+00

Pos logit contributions
ited+0.010
ised+0.009
ere+0.009
illed+0.008
 disagreed+0.008
Neg logit contributions
 roof-0.008
 globe-0.008
tr-0.008
 sle-0.008
 mast-0.008
Token fun
Token ID1257
Feature activation-1.10e+00

Pos logit contributions
 roof+0.014
 sle+0.013
 mast+0.013
 globe+0.013
 Dragons+0.013
Neg logit contributions
ited-0.015
ised-0.013
pped-0.012
ere-0.012
illed-0.012
Token with
Token ID351
Feature activation-7.64e-01

Pos logit contributions
ited+0.021
 disagreed+0.017
ised+0.017
ere+0.016
illed+0.016
Neg logit contributions
 roof-0.020
 globe-0.020
 mast-0.019
 sle-0.019
 Dragons-0.019
Token Molly
Token ID30236
Feature activation-3.91e-01

Pos logit contributions
ited+0.014
ised+0.011
illed+0.011
ere+0.011
 disagreed+0.010
Neg logit contributions
 roof-0.015
 globe-0.014
 remote-0.014
bul-0.014
 mast-0.014
Token's
Token ID338
Feature activation+1.11e+00

Pos logit contributions
ited+0.006
ised+0.006
ere+0.005
illed+0.005
 disagreed+0.005
Neg logit contributions
 sle-0.008
 roof-0.008
tr-0.008
 globe-0.008
 mast-0.008
Token knot
Token ID29654
Feature activation+2.04e+00

Pos logit contributions
 sle+0.018
 roof+0.018
 mast+0.017
 globe+0.017
tr+0.017
Neg logit contributions
ited-0.024
ised-0.021
 disagreed-0.020
ere-0.020
illed-0.019
Token!
Token ID0
Feature activation+3.92e+00

Pos logit contributions
 roof+0.040
 sle+0.040
tr+0.039
 globe+0.038
 mast+0.038
Neg logit contributions
ited-0.036
ised-0.031
ere-0.029
 disagreed-0.029
ced-0.028
Token<|endoftext|>
Token ID50256
Feature activation-5.83e+00

Pos logit contributions
 sle+0.087
 mast+0.085
 roof+0.083
tr+0.081
bul+0.080
Neg logit contributions
ited-0.058
ised-0.055
ere-0.051
illed-0.050
 disagreed-0.050
TokenOnce
Token ID7454
Feature activation-6.87e+00

Pos logit contributions
ited+0.123
ised+0.110
 disagreed+0.104
ced+0.100
ere+0.100
Neg logit contributions
 sle-0.094
 globe-0.092
 roof-0.091
tr-0.091
 Dragons-0.089
Token upon
Token ID2402
Feature activation-6.10e+00

Pos logit contributions
ited+0.108
ised+0.089
pped+0.087
ced+0.086
ere+0.079
Neg logit contributions
 roof-0.148
 globe-0.143
 sle-0.141
ometer-0.137
bul-0.134
Token a
Token ID257
Feature activation-9.52e+00

Pos logit contributions
ited+0.125
ised+0.111
ere+0.106
 disagreed+0.102
eful+0.098
Neg logit contributions
 roof-0.107
tr-0.101
 globe-0.101
 Dragons-0.101
 moon-0.097
Token time
Token ID640
Feature activation-6.03e+00

Pos logit contributions
ised+0.144
ited+0.137
pped+0.136
 disagreed+0.125
ere+0.124
Neg logit contributions
tr-0.187
 roof-0.173
 sle-0.172
 globe-0.168
 fireplace-0.159
Token Lily
Token ID20037
Feature activation+2.55e-01

Pos logit contributions
 roof+0.006
 mast+0.006
 sle+0.006
 globe+0.006
bul+0.005
Neg logit contributions
ited-0.007
ised-0.006
illed-0.006
ere-0.006
eful-0.006
Token was
Token ID373
Feature activation-2.22e+00

Pos logit contributions
 roof+0.005
tr+0.005
 sle+0.005
 mast+0.005
 Dragons+0.005
Neg logit contributions
ited-0.004
ised-0.004
illed-0.004
ere-0.004
eful-0.003
Token a
Token ID257
Feature activation-1.35e+00

Pos logit contributions
ited+0.041
ised+0.037
ere+0.035
illed+0.033
eful+0.032
Neg logit contributions
 roof-0.040
bul-0.039
 remote-0.038
tr-0.037
 mast-0.037
Token fork
Token ID15563
Feature activation+1.73e+00

Pos logit contributions
ited+0.023
ced+0.019
ised+0.019
 disagreed+0.018
 noticed+0.017
Neg logit contributions
 sle-0.027
 Dragons-0.027
tr-0.027
 globe-0.026
 roof-0.025
Token expert
Token ID5887
Feature activation-9.37e-01

Pos logit contributions
 sle+0.037
 roof+0.036
 Dragons+0.036
 mast+0.036
tr+0.035
Neg logit contributions
ited-0.026
ised-0.023
 disagreed-0.021
ced-0.021
ere-0.021
Token.
Token ID13
Feature activation+4.99e+00

Pos logit contributions
ited+0.015
ised+0.013
 disagreed+0.012
ere+0.012
ced+0.012
Neg logit contributions
 sle-0.020
 roof-0.019
tr-0.019
 mast-0.019
 globe-0.019
Token
Token ID220
Feature activation+1.47e+00

Pos logit contributions
 sle+0.117
 mast+0.112
 globe+0.110
bul+0.105
 lantern+0.104
Neg logit contributions
ised-0.062
ited-0.059
ere-0.059
 disagreed-0.055
illed-0.055
Token
Token ID198
Feature activation-1.18e+00

Pos logit contributions
 sle+0.029
 mast+0.027
 roof+0.027
 globe+0.026
 Dragons+0.026
Neg logit contributions
ited-0.025
ised-0.024
ere-0.023
illed-0.022
ced-0.022
Token
Token ID198
Feature activation-1.32e+00

Pos logit contributions
ited+0.022
ised+0.020
ere+0.019
ced+0.019
 disagreed+0.019
Neg logit contributions
 sle-0.022
 roof-0.021
 mast-0.021
tr-0.020
 Dragons-0.020
TokenOne
Token ID3198
Feature activation-4.05e-01

Pos logit contributions
ited+0.020
ised+0.018
 disagreed+0.016
ere+0.016
ced+0.016
Neg logit contributions
 sle-0.028
 roof-0.028
tr-0.028
 Dragons-0.027
 mast-0.027
Token night
Token ID1755
Feature activation-2.68e-01

Pos logit contributions
ited+0.009
ised+0.009
pped+0.008
ere+0.008
ced+0.008
Neg logit contributions
 sle-0.005
 roof-0.005
tr-0.005
bul-0.005
 globe-0.005
Token says
Token ID1139
Feature activation-4.69e+00

Pos logit contributions
ited+0.071
 disagreed+0.063
ised+0.062
ere+0.060
illed+0.058
Neg logit contributions
 sle-0.092
 roof-0.090
 globe-0.089
 mast-0.089
tr-0.088
Token.
Token ID13
Feature activation+2.12e+00

Pos logit contributions
ited+0.069
ised+0.063
 disagreed+0.061
illed+0.060
ced+0.058
Neg logit contributions
 sle-0.103
tr-0.098
 roof-0.098
 globe-0.097
 mast-0.097
Token "
Token ID366
Feature activation+8.71e-01

Pos logit contributions
 sle+0.038
 globe+0.035
 mant+0.034
 roof+0.034
tr+0.033
Neg logit contributions
ised-0.038
ited-0.038
 disagreed-0.038
 did-0.035
ere-0.035
TokenThe
Token ID464
Feature activation+2.06e+00

Pos logit contributions
 sle+0.020
 sy+0.018
 roof+0.018
 mast+0.018
bul+0.017
Neg logit contributions
ited-0.013
ised-0.011
ced-0.011
 seen-0.011
 heard-0.010
Token fire
Token ID2046
Feature activation+4.82e+00

Pos logit contributions
bul+0.030
 Dragons+0.030
tr+0.029
 sle+0.029
 sy+0.029
Neg logit contributions
ised-0.037
 done-0.037
 did-0.036
ited-0.036
 saw-0.036
Token truck
Token ID7779
Feature activation+3.95e+00

Pos logit contributions
 sle+0.086
 mast+0.084
 roof+0.083
 Dragons+0.083
 sy+0.082
Neg logit contributions
ited-0.079
ised-0.076
ere-0.072
 did-0.072
 disagreed-0.071
Token is
Token ID318
Feature activation-6.46e-01

Pos logit contributions
 sle+0.082
 roof+0.079
 mast+0.078
 Dragons+0.078
tr+0.077
Neg logit contributions
ited-0.061
ised-0.054
 disagreed-0.052
ere-0.050
ced-0.050
Token so
Token ID523
Feature activation+3.84e-02

Pos logit contributions
ited+0.010
ised+0.009
 disagreed+0.008
 done+0.008
 seen+0.008
Neg logit contributions
 sle-0.013
 Dragons-0.013
tr-0.013
 roof-0.013
 globe-0.013
Token big
Token ID1263
Feature activation+4.36e-02

Pos logit contributions
tr+0.001
 sy+0.001
bul+0.001
 Dragons+0.001
 sle+0.001
Neg logit contributions
ited-0.000
ised-0.000
 done-0.000
 did-0.000
 disagreed-0.000
Token and
Token ID290
Feature activation+4.99e+00

Pos logit contributions
 sle+0.001
 roof+0.001
tr+0.001
 mast+0.001
 Dragons+0.001
Neg logit contributions
ited-0.001
ised-0.000
 disagreed-0.000
ere-0.000
ced-0.000
Token strong
Token ID1913
Feature activation+6.08e-01

Pos logit contributions
tr+0.098
 sle+0.098
 Dragons+0.096
 roof+0.095
 sy+0.093
Neg logit contributions
ited-0.076
ised-0.072
 did-0.068
 disagreed-0.068
 done-0.067
Token it
Token ID340
Feature activation+2.87e+00

Pos logit contributions
ited+0.037
ised+0.033
 disagreed+0.031
 seen+0.031
ere+0.030
Neg logit contributions
 sle-0.062
tr-0.062
 mast-0.060
 roof-0.059
 globe-0.059
Token wonderful
Token ID7932
Feature activation+7.61e-01

Pos logit contributions
 sle+0.062
 mast+0.059
tr+0.059
 roof+0.059
 globe+0.058
Neg logit contributions
ited-0.043
ised-0.039
 disagreed-0.037
ere-0.036
 done-0.035
Token how
Token ID703
Feature activation+1.92e+00

Pos logit contributions
 sle+0.019
 roof+0.018
tr+0.018
 mast+0.018
 globe+0.018
Neg logit contributions
ited-0.009
ised-0.008
 disagreed-0.007
ere-0.007
ced-0.007
Token nature
Token ID3450
Feature activation+4.10e+00

Pos logit contributions
 sle+0.034
tr+0.034
 roof+0.033
 mast+0.032
 globe+0.032
Neg logit contributions
ited-0.036
ised-0.034
 disagreed-0.032
ere-0.031
ced-0.031
Token teaches
Token ID17324
Feature activation-3.06e-01

Pos logit contributions
 sle+0.075
 roof+0.073
tr+0.072
 mast+0.071
 Dragons+0.069
Neg logit contributions
ited-0.078
ised-0.070
 disagreed-0.066
ere-0.065
ced-0.065
Token us
Token ID514
Feature activation+5.21e-01

Pos logit contributions
ited+0.006
ised+0.005
illed+0.004
ced+0.004
pped+0.004
Neg logit contributions
 sle-0.006
 Dragons-0.006
tr-0.006
 globe-0.005
bul-0.005
Token all
Token ID477
Feature activation+3.67e-01

Pos logit contributions
 sle+0.011
 roof+0.011
 Dragons+0.011
tr+0.011
 globe+0.011
Neg logit contributions
ited-0.008
ised-0.007
 disagreed-0.006
ced-0.006
ere-0.006
Token kinds
Token ID6982
Feature activation+1.38e-01

Pos logit contributions
 sle+0.008
 roof+0.008
tr+0.008
 globe+0.008
 Dragons+0.008
Neg logit contributions
ited-0.005
ised-0.005
 disagreed-0.004
ere-0.004
ced-0.004
Token of
Token ID286
Feature activation+1.34e+00

Pos logit contributions
 sle+0.003
 roof+0.003
tr+0.003
 mast+0.003
 Dragons+0.003
Neg logit contributions
ited-0.002
ised-0.002
 disagreed-0.002
ere-0.002
ced-0.002
Token things
Token ID1243
Feature activation+2.46e+00

Pos logit contributions
 sle+0.022
 roof+0.021
 Dragons+0.020
 globe+0.020
 mast+0.020
Neg logit contributions
ited-0.028
ised-0.025
 disagreed-0.024
ced-0.023
illed-0.023
Token?"
Token ID1701
Feature activation+1.66e+00

Pos logit contributions
 sle+0.061
 roof+0.059
tr+0.059
 mast+0.058
 Dragons+0.057
Neg logit contributions
ited-0.029
ised-0.027
 disagreed-0.024
ere-0.023
ced-0.023
Token do
Token ID466
Feature activation-1.36e-01

Pos logit contributions
ited+0.002
ised+0.002
 disagreed+0.002
ere+0.002
ced+0.002
Neg logit contributions
 sle-0.003
 roof-0.003
tr-0.003
 mast-0.003
 Dragons-0.003
Token and
Token ID290
Feature activation+2.16e+00

Pos logit contributions
ited+0.003
ised+0.003
ere+0.003
 disagreed+0.002
ced+0.002
Neg logit contributions
 globe-0.002
 Dragons-0.002
 roof-0.002
bul-0.002
tr-0.002
Token she
Token ID673
Feature activation+5.95e-02

Pos logit contributions
 roof+0.042
 sle+0.041
 Dragons+0.040
 mast+0.040
tr+0.040
Neg logit contributions
ited-0.038
ised-0.034
ced-0.031
ere-0.031
illed-0.031
Token said
Token ID531
Feature activation-3.05e+00

Pos logit contributions
tr+0.001
 roof+0.001
 sle+0.001
bul+0.001
 mast+0.001
Neg logit contributions
ited-0.001
ised-0.001
illed-0.001
ced-0.001
ere-0.001
Token,
Token ID11
Feature activation+1.73e+00

Pos logit contributions
ited+0.049
ised+0.040
ere+0.038
 seen+0.036
illed+0.036
Neg logit contributions
 roof-0.066
 sle-0.065
 mast-0.064
 globe-0.064
 Dragons-0.063
Token she
Token ID673
Feature activation+5.14e-01

Pos logit contributions
 sle+0.028
 roof+0.027
tr+0.027
 mast+0.026
 globe+0.026
Neg logit contributions
ited-0.034
ised-0.033
 disagreed-0.032
ced-0.031
illed-0.030
Token would
Token ID561
Feature activation-8.35e-01

Pos logit contributions
tr+0.010
 roof+0.010
bul+0.010
ometer+0.010
 Dragons+0.010
Neg logit contributions
ited-0.009
ised-0.008
illed-0.007
ere-0.007
ced-0.007
Token figure
Token ID3785
Feature activation-3.23e+00

Pos logit contributions
ited+0.016
ised+0.014
 disagreed+0.013
ere+0.013
ced+0.013
Neg logit contributions
 sle-0.015
 roof-0.015
tr-0.014
 mast-0.014
 Dragons-0.014
Token it
Token ID340
Feature activation+3.15e+00

Pos logit contributions
ited+0.053
ised+0.050
ced+0.048
ere+0.047
pped+0.047
Neg logit contributions
 sle-0.063
tr-0.062
 roof-0.062
 Dragons-0.061
 globe-0.061
Token out
Token ID503
Feature activation-2.94e-01

Pos logit contributions
 sle+0.055
 roof+0.054
tr+0.054
 mast+0.052
 globe+0.052
Neg logit contributions
ited-0.062
ised-0.055
 disagreed-0.052
ere-0.052
ced-0.051
Token soon
Token ID2582
Feature activation+1.86e+00

Pos logit contributions
ited+0.006
ere+0.005
 disagreed+0.005
ised+0.005
illed+0.005
Neg logit contributions
bul-0.005
 roof-0.005
 sle-0.005
 Dragons-0.005
tr-0.005
Token The
Token ID383
Feature activation+6.40e-01

Pos logit contributions
 sle+0.052
 roof+0.051
bul+0.050
 remote+0.050
tr+0.049
Neg logit contributions
ited-0.029
ised-0.028
 disagreed-0.026
 did-0.025
ere-0.025
Token fish
Token ID5916
Feature activation+2.11e-02

Pos logit contributions
 sle+0.010
tr+0.009
 roof+0.009
bul+0.009
 Dragons+0.009
Neg logit contributions
ited-0.014
ised-0.013
ced-0.012
 disagreed-0.012
ere-0.012
Token let
Token ID1309
Feature activation-9.07e-01

Pos logit contributions
 sle+0.000
tr+0.000
 roof+0.000
 globe+0.000
 mast+0.000
Neg logit contributions
ited-0.000
ised-0.000
ere-0.000
illed-0.000
ced-0.000
Token go
Token ID467
Feature activation-1.61e+00

Pos logit contributions
ited+0.013
ised+0.011
ere+0.011
 disagreed+0.011
ced+0.010
Neg logit contributions
 sle-0.020
 roof-0.020
tr-0.020
 mast-0.020
 Dragons-0.020
Token and
Token ID290
Feature activation+2.43e+00

Pos logit contributions
ited+0.019
ised+0.016
 seen+0.015
ere+0.015
illed+0.014
Neg logit contributions
tr-0.040
 Dragons-0.040
 sle-0.039
 roof-0.039
 mast-0.039
Token he
Token ID339
Feature activation+6.96e-01

Pos logit contributions
 sle+0.051
 roof+0.049
tr+0.049
 Dragons+0.048
 mast+0.048
Neg logit contributions
ited-0.040
ised-0.035
 disagreed-0.033
ced-0.032
ere-0.032
Token ran
Token ID4966
Feature activation-3.08e+00

Pos logit contributions
 sle+0.015
 roof+0.015
tr+0.015
 mast+0.014
 globe+0.014
Neg logit contributions
ited-0.011
ised-0.010
ere-0.009
illed-0.009
ced-0.009
Token away
Token ID1497
Feature activation-2.04e+00

Pos logit contributions
ited+0.051
ised+0.045
 disagreed+0.042
ere+0.042
ced+0.042
Neg logit contributions
 sle-0.063
 roof-0.062
tr-0.061
 mast-0.060
 globe-0.060
Token as
Token ID355
Feature activation-2.82e-01

Pos logit contributions
ited+0.032
ised+0.028
 disagreed+0.026
ced+0.026
illed+0.025
Neg logit contributions
 sle-0.044
tr-0.043
 roof-0.042
 Dragons-0.042
 mast-0.042
Token fast
Token ID3049
Feature activation+1.03e+00

Pos logit contributions
ited+0.005
ised+0.005
ced+0.005
 disagreed+0.005
ere+0.004
Neg logit contributions
 sle-0.005
tr-0.005
 roof-0.005
 Dragons-0.005
 mast-0.005
Token as
Token ID355
Feature activation+8.43e-01

Pos logit contributions
 sle+0.024
 roof+0.023
tr+0.023
 mast+0.023
 Dragons+0.022
Neg logit contributions
ited-0.015
ised-0.013
 disagreed-0.012
ere-0.011
ced-0.011
Token were
Token ID547
Feature activation-5.65e-03

Pos logit contributions
 roof+0.022
 sle+0.022
 mast+0.021
 globe+0.021
tr+0.021
Neg logit contributions
ited-0.018
ised-0.016
ere-0.015
eful-0.014
 disagreed-0.014
Token new
Token ID649
Feature activation+5.20e-01

Pos logit contributions
ited+0.000
ised+0.000
ere+0.000
illed+0.000
 disagreed+0.000
Neg logit contributions
 sle-0.000
 roof-0.000
 mast-0.000
 globe-0.000
bul-0.000
Token and
Token ID290
Feature activation+5.09e+00

Pos logit contributions
 sle+0.010
 roof+0.010
 mast+0.010
 globe+0.010
tr+0.010
Neg logit contributions
ited-0.009
ised-0.008
ere-0.007
 disagreed-0.007
illed-0.007
Token playful
Token ID34264
Feature activation+1.49e+00

Pos logit contributions
 sle+0.086
tr+0.084
 roof+0.083
 Dragons+0.082
 globe+0.081
Neg logit contributions
ited-0.099
ised-0.092
 disagreed-0.086
ced-0.086
 seen-0.084
Token things
Token ID1243
Feature activation+2.46e+00

Pos logit contributions
 sle+0.029
 roof+0.028
tr+0.028
 mast+0.028
 Dragons+0.027
Neg logit contributions
ited-0.026
ised-0.023
 disagreed-0.022
ere-0.021
ced-0.021
Token to
Token ID284
Feature activation+2.14e+00

Pos logit contributions
 sle+0.053
 Dragons+0.051
 roof+0.051
tr+0.051
 mast+0.050
Neg logit contributions
ited-0.037
ised-0.033
 disagreed-0.031
ced-0.031
 seen-0.030
Token find
Token ID1064
Feature activation-6.54e-01

Pos logit contributions
 roof+0.035
 sle+0.035
 mast+0.034
 globe+0.033
 Dragons+0.033
Neg logit contributions
ited-0.045
ised-0.041
 disagreed-0.038
ere-0.038
illed-0.038
Token.
Token ID13
Feature activation+5.09e+00

Pos logit contributions
ited+0.012
ised+0.010
 disagreed+0.010
ere+0.010
illed+0.010
Neg logit contributions
 sle-0.013
 roof-0.012
 globe-0.012
 mast-0.012
tr-0.012
Token The
Token ID383
Feature activation+1.97e+00

Pos logit contributions
 sle+0.099
 mast+0.091
 roof+0.091
 globe+0.090
tr+0.090
Neg logit contributions
ised-0.085
ited-0.084
ere-0.080
illed-0.078
 disagreed-0.077
Token tunnel
Token ID13275
Feature activation+4.16e+00

Pos logit contributions
 sle+0.030
 roof+0.029
tr+0.029
 mast+0.028
 Dragons+0.028
Neg logit contributions
ited-0.043
ised-0.039
 disagreed-0.037
ced-0.037
ere-0.037
Token seemed
Token ID3947
Feature activation+1.03e+00

Pos logit contributions
 roof+0.089
 sle+0.089
tr+0.087
 mast+0.087
 globe+0.086
Neg logit contributions
ited-0.064
ised-0.055
ere-0.052
illed-0.051
ced-0.050
Token a
Token ID257
Feature activation-6.22e-02

Pos logit contributions
ited+0.041
ised+0.035
 disagreed+0.031
ere+0.031
ced+0.030
Neg logit contributions
 sle-0.107
 roof-0.104
 mast-0.103
tr-0.103
 globe-0.102
Token big
Token ID1263
Feature activation+1.85e-01

Pos logit contributions
ited+0.001
ced+0.001
 disagreed+0.001
ised+0.001
 noticed+0.001
Neg logit contributions
 Dragons-0.001
 sle-0.001
 mast-0.001
bul-0.001
tr-0.001
Token,
Token ID11
Feature activation+3.60e+00

Pos logit contributions
 sle+0.003
 roof+0.003
tr+0.003
 Dragons+0.003
 mast+0.003
Neg logit contributions
ited-0.004
ised-0.004
 disagreed-0.003
ere-0.003
ced-0.003
Token juicy
Token ID35075
Feature activation+2.51e-01

Pos logit contributions
 sle+0.066
 Dragons+0.065
 mast+0.064
 roof+0.063
tr+0.063
Neg logit contributions
ited-0.065
ised-0.057
 disagreed-0.055
ced-0.054
ere-0.053
Token banana
Token ID25996
Feature activation+2.66e+00

Pos logit contributions
 sle+0.004
 roof+0.004
tr+0.003
 globe+0.003
 mast+0.003
Neg logit contributions
ited-0.006
ised-0.005
ced-0.005
 disagreed-0.005
ere-0.005
Token.
Token ID13
Feature activation+3.63e+00

Pos logit contributions
 sle+0.059
 mast+0.058
 Dragons+0.057
bul+0.056
 roof+0.055
Neg logit contributions
ited-0.037
 disagreed-0.032
ised-0.032
 seen-0.030
ced-0.030
Token He
Token ID679
Feature activation+8.88e-01

Pos logit contributions
 sle+0.068
bul+0.065
 roof+0.064
 mast+0.064
 globe+0.062
Neg logit contributions
ited-0.063
ised-0.060
 disagreed-0.058
ere-0.056
 did-0.054
Token was
Token ID373
Feature activation-2.72e+00

Pos logit contributions
tr+0.018
 roof+0.018
 sle+0.018
 Dragons+0.017
 globe+0.017
Neg logit contributions
ited-0.014
ised-0.013
illed-0.012
ere-0.012
eful-0.012
Token so
Token ID523
Feature activation-1.48e+00

Pos logit contributions
ited+0.053
ised+0.048
ere+0.044
ced+0.043
illed+0.042
Neg logit contributions
 sle-0.050
 roof-0.049
tr-0.048
 Dragons-0.046
 mast-0.045
Token excited
Token ID6568
Feature activation-1.55e+00

Pos logit contributions
ited+0.027
ised+0.025
ced+0.023
ere+0.023
pped+0.022
Neg logit contributions
 sle-0.027
 roof-0.027
 Dragons-0.026
 globe-0.026
tr-0.026
Token to
Token ID284
Feature activation-1.10e+00

Pos logit contributions
ited+0.029
ised+0.025
ced+0.024
ere+0.023
illed+0.023
Neg logit contributions
 sle-0.028
 roof-0.027
 Dragons-0.027
tr-0.027
 globe-0.026
Token home
Token ID1363
Feature activation-2.19e+00

Pos logit contributions
ited+0.074
ised+0.064
 disagreed+0.060
ced+0.060
ere+0.059
Neg logit contributions
 sle-0.086
 roof-0.084
 globe-0.082
tr-0.082
 Dragons-0.081
Token from
Token ID422
Feature activation-2.11e+00

Pos logit contributions
ited+0.041
ised+0.033
ere+0.033
 disagreed+0.032
pped+0.031
Neg logit contributions
 roof-0.042
 sle-0.041
tr-0.040
 globe-0.039
 Dragons-0.038
Token school
Token ID1524
Feature activation-2.38e-01

Pos logit contributions
ited+0.039
ised+0.034
 disagreed+0.032
ced+0.032
ere+0.032
Neg logit contributions
 sle-0.040
 roof-0.039
tr-0.038
 mast-0.038
 globe-0.038
Token when
Token ID618
Feature activation+1.48e+00

Pos logit contributions
ited+0.003
ised+0.003
ere+0.003
 disagreed+0.003
ced+0.003
Neg logit contributions
 roof-0.005
 sle-0.005
tr-0.005
 mast-0.005
 Dragons-0.005
Token he
Token ID339
Feature activation-1.57e+00

Pos logit contributions
 sle+0.018
 roof+0.018
 mast+0.017
 globe+0.017
 Dragons+0.017
Neg logit contributions
ited-0.037
ised-0.033
ere-0.033
illed-0.032
 disagreed-0.032
Token heard
Token ID2982
Feature activation-2.75e+00

Pos logit contributions
ised+0.024
ited+0.023
illed+0.022
ere+0.020
eful+0.019
Neg logit contributions
tr-0.030
 roof-0.030
 sle-0.030
 Dragons-0.028
 mast-0.028
Token a
Token ID257
Feature activation-1.34e+00

Pos logit contributions
ited+0.039
ised+0.032
 disagreed+0.030
ere+0.029
ced+0.029
Neg logit contributions
 sle-0.065
 roof-0.064
tr-0.063
 Dragons-0.063
 mast-0.062
Token noise
Token ID7838
Feature activation-8.64e-01

Pos logit contributions
ited+0.018
ised+0.017
ced+0.016
 disagreed+0.016
 heard+0.015
Neg logit contributions
 mast-0.028
 sle-0.028
tr-0.028
 globe-0.028
 Dragons-0.027
Token coming
Token ID2406
Feature activation-1.18e+00

Pos logit contributions
ited+0.013
ised+0.011
ced+0.011
 disagreed+0.011
 seen+0.010
Neg logit contributions
 sle-0.019
 mast-0.018
 roof-0.018
tr-0.018
 globe-0.018
Token from
Token ID422
Feature activation-1.60e+00

Pos logit contributions
ited+0.013
ised+0.013
 heard+0.013
 seen+0.013
illed+0.012
Neg logit contributions
 mast-0.026
tr-0.025
 globe-0.025
 Dragons-0.025
 roof-0.024
Token the
Token ID262
Feature activation+1.87e+00

Pos logit contributions
ited+0.026
ised+0.023
 disagreed+0.021
ere+0.021
ced+0.021
Neg logit contributions
 sle-0.034
 roof-0.033
 globe-0.033
 mast-0.032
tr-0.032
TokenOnce
Token ID7454
Feature activation-7.94e+00

Pos logit contributions
ited+0.140
ised+0.125
 disagreed+0.119
illed+0.115
ced+0.115
Neg logit contributions
 sle-0.091
 globe-0.089
tr-0.087
 roof-0.086
 Dragons-0.086
Token upon
Token ID2402
Feature activation-6.03e+00

Pos logit contributions
ited+0.133
ised+0.109
ced+0.104
pped+0.103
eful+0.099
Neg logit contributions
 roof-0.164
 sle-0.160
 globe-0.160
ometer-0.153
bul-0.149
Token a
Token ID257
Feature activation-9.56e+00

Pos logit contributions
ited+0.112
ised+0.099
 disagreed+0.092
ere+0.090
illed+0.086
Neg logit contributions
 roof-0.116
 Dragons-0.113
tr-0.113
 sle-0.111
 globe-0.110
Token time
Token ID640
Feature activation-6.58e+00

Pos logit contributions
ised+0.138
ited+0.131
pped+0.123
 disagreed+0.118
ced+0.117
Neg logit contributions
 sle-0.196
 roof-0.196
tr-0.194
 globe-0.179
 mast-0.176
Token,
Token ID11
Feature activation-7.09e+00

Pos logit contributions
ited+0.096
ised+0.072
 disagreed+0.064
ere+0.064
ced+0.064
Neg logit contributions
 roof-0.156
 globe-0.156
 sle-0.155
tr-0.152
 Dragons-0.150
Token there
Token ID612
Feature activation-5.75e+00

Pos logit contributions
ited+0.120
 disagreed+0.089
ised+0.083
ere+0.083
eling+0.077
Neg logit contributions
 globe-0.164
tr-0.163
 roof-0.157
 sle-0.147
bul-0.147
Token was
Token ID373
Feature activation-8.07e+00

Pos logit contributions
ited+0.142
ised+0.120
 disagreed+0.108
ere+0.108
eful+0.107
Neg logit contributions
 globe-0.088
 roof-0.084
tr-0.080
 fireplace-0.075
 sle-0.073
Token a
Token ID257
Feature activation-5.02e+00

Pos logit contributions
ited+0.139
ised+0.116
 disagreed+0.111
ced+0.107
ere+0.102
Neg logit contributions
 roof-0.168
tr-0.163
 sle-0.162
 globe-0.158
 Dragons-0.156
Token boy
Token ID2933
Feature activation-6.34e+00

Pos logit contributions
ited+0.061
ised+0.052
ere+0.048
ced+0.048
 disagreed+0.047
Neg logit contributions
tr-0.117
 Dragons-0.117
 sle-0.116
bul-0.115
 mast-0.113
Token named
Token ID3706
Feature activation-5.84e+00

Pos logit contributions
ited+0.078
ised+0.066
 disagreed+0.060
ere+0.060
illed+0.058
Neg logit contributions
 sle-0.156
 roof-0.153
 mast-0.151
tr-0.150
 Dragons-0.149
Token Tim
Token ID5045
Feature activation-8.35e+00

Pos logit contributions
ited+0.078
ised+0.068
 disagreed+0.067
ere+0.067
 seen+0.063
Neg logit contributions
 sle-0.132
bul-0.130
tr-0.127
 sy-0.127
 roof-0.127
Token the
Token ID262
Feature activation+3.42e+00

Pos logit contributions
ited+0.011
ised+0.009
 disagreed+0.009
illed+0.008
ere+0.008
Neg logit contributions
 sle-0.013
 roof-0.013
 globe-0.013
bul-0.013
 mast-0.012
Token secret
Token ID3200
Feature activation+4.29e+00

Pos logit contributions
 sle+0.059
 mast+0.057
 Dragons+0.056
tr+0.056
 roof+0.055
Neg logit contributions
ited-0.067
ised-0.061
ced-0.058
ere-0.057
 disagreed-0.056
Token room
Token ID2119
Feature activation+3.10e+00

Pos logit contributions
 sle+0.073
 roof+0.069
tr+0.069
 mast+0.068
 Dragons+0.068
Neg logit contributions
ited-0.086
ised-0.079
 disagreed-0.074
ced-0.073
ere-0.073
Token.
Token ID13
Feature activation+5.21e+00

Pos logit contributions
 roof+0.052
 sle+0.051
tr+0.050
 globe+0.049
 mast+0.048
Neg logit contributions
ited-0.062
ised-0.056
ere-0.053
 disagreed-0.051
ced-0.051
Token She
Token ID1375
Feature activation+2.20e+00

Pos logit contributions
 mast+0.116
 sle+0.116
 remote+0.109
 globe+0.108
 sy+0.107
Neg logit contributions
ised-0.078
illed-0.068
ere-0.068
ited-0.066
 seen-0.062
Token smiled
Token ID13541
Feature activation-2.52e+00

Pos logit contributions
 sle+0.047
 roof+0.046
tr+0.045
 mast+0.045
bul+0.044
Neg logit contributions
ited-0.034
ised-0.031
ere-0.028
illed-0.028
ced-0.027
Token and
Token ID290
Feature activation+2.69e+00

Pos logit contributions
ited+0.039
ised+0.032
 disagreed+0.030
ere+0.030
ced+0.029
Neg logit contributions
 sle-0.056
 roof-0.055
tr-0.054
 mast-0.054
 Dragons-0.053
Token went
Token ID1816
Feature activation-2.36e+00

Pos logit contributions
 sle+0.058
 roof+0.056
tr+0.055
 mast+0.055
 Dragons+0.054
Neg logit contributions
ited-0.042
ised-0.036
ere-0.034
illed-0.033
ced-0.033
Token to
Token ID284
Feature activation-2.95e-01

Pos logit contributions
ited+0.042
ised+0.038
ere+0.036
ced+0.035
 disagreed+0.035
Neg logit contributions
 sle-0.044
 roof-0.043
tr-0.043
 mast-0.043
 Dragons-0.042
Token get
Token ID651
Feature activation-7.92e-01

Pos logit contributions
ited+0.006
ised+0.005
 disagreed+0.005
ere+0.005
ced+0.005
Neg logit contributions
 sle-0.005
tr-0.005
 roof-0.005
 mast-0.005
 Dragons-0.005
Token breakfast
Token ID12607
Feature activation+1.38e+00

Pos logit contributions
ited+0.012
ised+0.010
 disagreed+0.009
ere+0.009
ced+0.009
Neg logit contributions
 sle-0.018
 roof-0.018
tr-0.017
 mast-0.017
 Dragons-0.017
Token his
Token ID465
Feature activation+2.42e+00

Pos logit contributions
ited+0.027
ised+0.023
 disagreed+0.021
illed+0.021
ced+0.021
Neg logit contributions
 sle-0.040
 roof-0.038
tr-0.038
 mast-0.038
 Dragons-0.037
Token toys
Token ID14958
Feature activation+2.26e+00

Pos logit contributions
 sle+0.042
tr+0.040
 mast+0.040
 roof+0.040
 Dragons+0.040
Neg logit contributions
ited-0.046
ised-0.042
 disagreed-0.040
ere-0.039
 seen-0.039
Token.
Token ID13
Feature activation+4.16e+00

Pos logit contributions
 sle+0.042
 roof+0.042
tr+0.041
 mast+0.041
 globe+0.040
Neg logit contributions
ited-0.041
ised-0.036
ere-0.034
ced-0.034
 disagreed-0.033
Token Tim
Token ID5045
Feature activation-3.81e+00

Pos logit contributions
 sle+0.076
 roof+0.073
 mast+0.072
tr+0.072
 globe+0.071
Neg logit contributions
ited-0.079
ised-0.072
 disagreed-0.068
ere-0.067
ced-0.066
Tokenmy
Token ID1820
Feature activation-1.42e+00

Pos logit contributions
ised+0.094
ited+0.084
ced+0.083
eful+0.082
pped+0.081
Neg logit contributions
tr-0.041
man-0.037
 lamp-0.032
bul-0.031
 bear-0.029
Token got
Token ID1392
Feature activation-3.10e+00

Pos logit contributions
ited+0.025
ised+0.023
ere+0.022
illed+0.022
eful+0.022
Neg logit contributions
tr-0.023
ometer-0.022
 roof-0.022
 sy-0.022
 mast-0.021
Token mad
Token ID8805
Feature activation-3.43e-01

Pos logit contributions
ited+0.053
ised+0.047
ced+0.044
illed+0.043
ere+0.043
Neg logit contributions
 sle-0.061
 sy-0.059
 roof-0.058
tr-0.058
 remote-0.058
Token and
Token ID290
Feature activation+3.05e+00

Pos logit contributions
ited+0.005
ised+0.004
ere+0.004
ced+0.004
illed+0.004
Neg logit contributions
 sle-0.007
 roof-0.007
tr-0.007
 Dragons-0.007
 mast-0.007
Token said
Token ID531
Feature activation-4.30e+00

Pos logit contributions
 sle+0.065
 roof+0.065
 mast+0.064
tr+0.063
 Dragons+0.063
Neg logit contributions
ited-0.046
ised-0.040
ere-0.037
illed-0.037
ced-0.037
Token,
Token ID11
Feature activation+2.70e+00

Pos logit contributions
ited+0.054
ised+0.041
ere+0.038
illed+0.035
 seen+0.034
Neg logit contributions
 roof-0.109
 sle-0.107
tr-0.107
 mast-0.106
 Dragons-0.106
Token "
Token ID366
Feature activation+1.88e+00

Pos logit contributions
 sle+0.055
 globe+0.050
 mast+0.049
 roof+0.049
tr+0.049
Neg logit contributions
 disagreed-0.043
 did-0.041
ited-0.040
ised-0.040
ced-0.039
Token
Token ID198
Feature activation-1.38e+00

Pos logit contributions
 sle+0.023
 mast+0.022
 roof+0.022
tr+0.021
 globe+0.021
Neg logit contributions
ited-0.017
ised-0.017
ere-0.016
illed-0.016
ced-0.015
Token
Token ID198
Feature activation-9.93e-01

Pos logit contributions
ited+0.028
ised+0.025
 disagreed+0.024
ere+0.023
ced+0.023
Neg logit contributions
 sle-0.024
 roof-0.023
tr-0.022
 mast-0.022
 Dragons-0.022
TokenSoon
Token ID28093
Feature activation-9.32e-02

Pos logit contributions
ited+0.013
ised+0.011
ere+0.010
ced+0.010
 disagreed+0.010
Neg logit contributions
 sle-0.025
 roof-0.024
 mast-0.023
tr-0.023
 globe-0.023
Token the
Token ID262
Feature activation+7.28e-01

Pos logit contributions
ited+0.001
ised+0.001
illed+0.001
ere+0.001
ced+0.001
Neg logit contributions
 sle-0.002
 roof-0.002
 mast-0.002
 Dragons-0.002
 globe-0.002
Token pony
Token ID26902
Feature activation+2.02e+00

Pos logit contributions
 sle+0.011
 roof+0.011
tr+0.011
 globe+0.010
 mast+0.010
Neg logit contributions
ited-0.016
ised-0.015
illed-0.014
 disagreed-0.014
ced-0.014
Token got
Token ID1392
Feature activation-1.73e+00

Pos logit contributions
 sle+0.042
tr+0.042
 roof+0.042
 mast+0.040
 globe+0.040
Neg logit contributions
ited-0.032
ised-0.029
illed-0.028
ere-0.026
ced-0.025
Token stronger
Token ID7387
Feature activation+1.29e+00

Pos logit contributions
ited+0.031
ised+0.028
ere+0.026
illed+0.026
ced+0.026
Neg logit contributions
 sle-0.033
 roof-0.032
tr-0.031
 mast-0.031
bul-0.031
Token.
Token ID13
Feature activation+5.50e+00

Pos logit contributions
 sle+0.028
 roof+0.027
tr+0.027
 mast+0.027
 globe+0.027
Neg logit contributions
ited-0.020
ised-0.017
ere-0.016
 disagreed-0.016
ced-0.016
Token She
Token ID1375
Feature activation+2.42e+00

Pos logit contributions
 sle+0.103
 globe+0.102
 mast+0.097
 lantern+0.093
tr+0.093
Neg logit contributions
ited-0.086
ised-0.084
ere-0.084
 disagreed-0.081
illed-0.078
Token would
Token ID561
Feature activation+1.81e-01

Pos logit contributions
 sle+0.053
 roof+0.052
tr+0.051
 mast+0.051
 globe+0.050
Neg logit contributions
ited-0.037
ised-0.033
ere-0.030
illed-0.030
ced-0.029
Token play
Token ID711
Feature activation-1.41e+00

Pos logit contributions
 sle+0.003
tr+0.003
 Dragons+0.003
 roof+0.003
 globe+0.003
Neg logit contributions
ited-0.003
ised-0.003
 disagreed-0.003
ere-0.003
ced-0.003
Token having
Token ID1719
Feature activation-2.16e+00

Pos logit contributions
ited+0.029
ised+0.026
 disagreed+0.024
ere+0.024
ced+0.024
Neg logit contributions
 sle-0.031
 roof-0.031
 mast-0.030
tr-0.030
 globe-0.029
Token fun
Token ID1257
Feature activation-1.54e+00

Pos logit contributions
ited+0.043
illed+0.036
ised+0.036
ced+0.033
ared+0.033
Neg logit contributions
 sle-0.038
 roof-0.037
 mast-0.036
 Dragons-0.035
 fireplace-0.035
Token,
Token ID11
Feature activation+3.82e+00

Pos logit contributions
ited+0.029
ised+0.024
illed+0.023
ere+0.023
 disagreed+0.023
Neg logit contributions
 sle-0.029
 roof-0.028
 globe-0.027
 Dragons-0.027
 mast-0.027
Token Lily
Token ID20037
Feature activation-2.29e+00

Pos logit contributions
 sle+0.073
tr+0.070
 roof+0.069
 mast+0.069
 globe+0.068
Neg logit contributions
ited-0.068
ised-0.062
 disagreed-0.059
ere-0.057
ced-0.056
Token."
Token ID526
Feature activation+2.97e+00

Pos logit contributions
ited+0.046
ised+0.041
 disagreed+0.039
ere+0.039
ced+0.038
Neg logit contributions
 sle-0.039
 roof-0.038
tr-0.037
 mast-0.037
 Dragons-0.037
Token<|endoftext|>
Token ID50256
Feature activation-6.82e+00

Pos logit contributions
 sle+0.069
 mast+0.067
 globe+0.063
bul+0.063
 remote+0.063
Neg logit contributions
ited-0.041
ised-0.039
 disagreed-0.036
illed-0.034
ere-0.034
TokenOnce
Token ID7454
Feature activation-6.83e+00

Pos logit contributions
ited+0.161
ised+0.145
 disagreed+0.138
ere+0.134
ced+0.133
Neg logit contributions
 sle-0.095
 globe-0.092
tr-0.091
 roof-0.090
 Dragons-0.089
Token upon
Token ID2402
Feature activation-6.28e+00

Pos logit contributions
ited+0.103
ised+0.086
ced+0.080
ere+0.077
pped+0.076
Neg logit contributions
 roof-0.152
 sle-0.149
 globe-0.144
tr-0.141
ometer-0.140
Token a
Token ID257
Feature activation-9.99e+00

Pos logit contributions
ited+0.119
ised+0.107
ere+0.100
 disagreed+0.096
eful+0.093
Neg logit contributions
 roof-0.118
tr-0.115
 globe-0.113
 Dragons-0.112
 sle-0.110
Token time
Token ID640
Feature activation-6.19e+00

Pos logit contributions
ised+0.156
pped+0.142
ited+0.140
ced+0.136
ere+0.130
Neg logit contributions
tr-0.195
 sle-0.180
 roof-0.179
 globe-0.166
 fireplace-0.165
Token,
Token ID11
Feature activation-6.19e+00

Pos logit contributions
ited+0.103
ised+0.081
ere+0.073
ced+0.070
 disagreed+0.069
Neg logit contributions
 roof-0.134
 globe-0.133
 sle-0.130
 Dragons-0.126
tr-0.126
Token upon
Token ID2402
Feature activation-6.40e+00

Pos logit contributions
ited+0.117
ised+0.095
ced+0.090
pped+0.089
ere+0.086
Neg logit contributions
 roof-0.168
 sle-0.164
 globe-0.162
ometer-0.155
 Dragons-0.154
Token a
Token ID257
Feature activation-9.86e+00

Pos logit contributions
ited+0.120
ised+0.106
ere+0.100
 disagreed+0.095
eful+0.094
Neg logit contributions
 roof-0.124
tr-0.121
 globe-0.119
 Dragons-0.117
 sle-0.116
Token time
Token ID640
Feature activation-5.87e+00

Pos logit contributions
ised+0.141
ited+0.141
pped+0.131
 disagreed+0.123
ere+0.122
Neg logit contributions
tr-0.204
 roof-0.196
 sle-0.195
 globe-0.184
 fireplace-0.179
Token,
Token ID11
Feature activation-6.77e+00

Pos logit contributions
ited+0.100
ised+0.076
ere+0.068
pped+0.067
eling+0.066
Neg logit contributions
 globe-0.131
 roof-0.128
 sle-0.125
 Dragons-0.122
tr-0.121
Token there
Token ID612
Feature activation-5.49e+00

Pos logit contributions
ited+0.117
ere+0.087
 disagreed+0.084
ised+0.084
pped+0.079
Neg logit contributions
 globe-0.153
 roof-0.147
tr-0.142
 fireplace-0.134
 We-0.133
Token was
Token ID373
Feature activation-8.13e+00

Pos logit contributions
ited+0.142
ised+0.120
eful+0.111
ere+0.111
illed+0.108
Neg logit contributions
 globe-0.076
 roof-0.070
tr-0.068
 fireplace-0.064
 chim-0.062
Token a
Token ID257
Feature activation-5.21e+00

Pos logit contributions
ited+0.150
ised+0.125
 disagreed+0.120
ced+0.115
ere+0.113
Neg logit contributions
 roof-0.161
tr-0.157
 globe-0.153
 sle-0.153
 fireplace-0.150
Token boy
Token ID2933
Feature activation-7.45e+00

Pos logit contributions
ited+0.061
ised+0.050
ced+0.048
 disagreed+0.047
ere+0.047
Neg logit contributions
tr-0.123
 sle-0.123
 Dragons-0.122
bul-0.121
 mast-0.120
Token who
Token ID508
Feature activation-2.13e+00

Pos logit contributions
ited+0.098
ised+0.084
 disagreed+0.076
ere+0.076
ced+0.075
Neg logit contributions
 sle-0.178
 roof-0.174
tr-0.172
 mast-0.172
 Dragons-0.170
Token had
Token ID550
Feature activation-4.72e+00

Pos logit contributions
ited+0.031
ised+0.029
ere+0.025
ced+0.025
illed+0.025
Neg logit contributions
 sle-0.047
tr-0.047
 roof-0.046
 globe-0.046
 Dragons-0.045
Token an
Token ID281
Feature activation-3.77e+00

Pos logit contributions
ited+0.076
ised+0.063
ere+0.057
 disagreed+0.056
ced+0.056
Neg logit contributions
 roof-0.107
 sle-0.105
tr-0.104
 mast-0.100
 Dragons-0.099
Token adventure
Token ID8855
Feature activation+1.41e+00

Pos logit contributions
 sle+0.041
 roof+0.039
tr+0.038
 mast+0.038
 Dragons+0.038
Neg logit contributions
ited-0.070
ised-0.064
 disagreed-0.061
ere-0.060
ced-0.060
Token.
Token ID13
Feature activation+4.44e+00

Pos logit contributions
 sle+0.027
 roof+0.026
 mast+0.026
tr+0.026
 globe+0.026
Neg logit contributions
ited-0.025
ised-0.022
 disagreed-0.021
ere-0.021
ced-0.020
Token<|endoftext|>
Token ID50256
Feature activation-6.07e+00

Pos logit contributions
 roof+0.102
 sle+0.101
tr+0.101
 mast+0.100
 sy+0.098
Neg logit contributions
ere-0.050
ised-0.048
ited-0.046
illed-0.044
 disagreed-0.043
TokenOnce
Token ID7454
Feature activation-7.62e+00

Pos logit contributions
ited+0.126
ised+0.113
 disagreed+0.107
ced+0.103
illed+0.103
Neg logit contributions
 sle-0.101
 globe-0.098
tr-0.096
 roof-0.096
 Dragons-0.096
Token upon
Token ID2402
Feature activation-6.04e+00

Pos logit contributions
ited+0.122
ised+0.099
pped+0.097
ced+0.095
eful+0.088
Neg logit contributions
 roof-0.163
 globe-0.161
 sle-0.158
ometer-0.153
 Dragons-0.151
Token a
Token ID257
Feature activation-9.74e+00

Pos logit contributions
ited+0.115
ised+0.102
ere+0.094
 disagreed+0.094
eful+0.088
Neg logit contributions
 roof-0.113
 Dragons-0.111
tr-0.109
 globe-0.109
 sle-0.108
Token time
Token ID640
Feature activation-6.35e+00

Pos logit contributions
ised+0.144
pped+0.135
ited+0.134
 disagreed+0.124
ced+0.120
Neg logit contributions
tr-0.195
 sle-0.187
 roof-0.184
 globe-0.179
 fireplace-0.170
Token there
Token ID612
Feature activation-5.67e+00

Pos logit contributions
ited+0.102
ised+0.077
ere+0.070
pped+0.070
 disagreed+0.068
Neg logit contributions
 globe-0.145
 roof-0.142
 sle-0.139
tr-0.137
 Dragons-0.136
Token was
Token ID373
Feature activation-8.54e+00

Pos logit contributions
ited+0.133
ised+0.115
eful+0.104
ere+0.104
pped+0.104
Neg logit contributions
 globe-0.088
 roof-0.082
tr-0.079
 fireplace-0.074
 sle-0.073
Token a
Token ID257
Feature activation-5.21e+00

Pos logit contributions
ited+0.142
ised+0.116
 disagreed+0.115
ced+0.109
ere+0.104
Neg logit contributions
 roof-0.181
 sle-0.175
tr-0.173
 globe-0.170
 Dragons-0.167
Token lady
Token ID10846
Feature activation-4.43e+00

Pos logit contributions
ited+0.063
ised+0.054
ced+0.049
ere+0.049
 disagreed+0.048
Neg logit contributions
 sle-0.122
tr-0.122
 Dragons-0.122
bul-0.121
 mast-0.119
Token.
Token ID13
Feature activation+3.92e+00

Pos logit contributions
 sle+0.037
 roof+0.036
tr+0.035
 mast+0.035
 Dragons+0.035
Neg logit contributions
ited-0.033
ised-0.030
 disagreed-0.028
ere-0.027
ced-0.027
Token<|endoftext|>
Token ID50256
Feature activation-7.68e+00

Pos logit contributions
 sle+0.090
 mast+0.089
 roof+0.081
 globe+0.081
 remote+0.081
Neg logit contributions
ited-0.055
ised-0.050
ere-0.047
illed-0.046
 disagreed-0.046
TokenOnce
Token ID7454
Feature activation-7.60e+00

Pos logit contributions
ited+0.186
ised+0.166
 disagreed+0.160
ere+0.153
illed+0.152
Neg logit contributions
 sle-0.103
 globe-0.102
tr-0.099
 roof-0.098
 Dragons-0.095
Token upon
Token ID2402
Feature activation-6.83e+00

Pos logit contributions
ited+0.115
ised+0.094
ced+0.090
pped+0.087
ere+0.085
Neg logit contributions
 roof-0.169
 sle-0.165
 globe-0.162
tr-0.156
ometer-0.156
Token a
Token ID257
Feature activation-1.02e+01

Pos logit contributions
ited+0.129
ised+0.115
ere+0.108
 disagreed+0.104
eful+0.101
Neg logit contributions
 roof-0.129
tr-0.126
 globe-0.124
 Dragons-0.123
 sle-0.120
Token time
Token ID640
Feature activation-6.68e+00

Pos logit contributions
ised+0.153
ited+0.149
pped+0.141
ced+0.134
 disagreed+0.132
Neg logit contributions
tr-0.202
 roof-0.193
 sle-0.189
 globe-0.182
 fireplace-0.174
Token,
Token ID11
Feature activation-7.15e+00

Pos logit contributions
ited+0.113
ised+0.087
ere+0.078
pped+0.077
ced+0.076
Neg logit contributions
 globe-0.148
 roof-0.146
 sle-0.141
tr-0.138
 Dragons-0.137
Token there
Token ID612
Feature activation-5.82e+00

Pos logit contributions
ited+0.127
ere+0.095
 disagreed+0.094
ised+0.092
eling+0.089
Neg logit contributions
 globe-0.156
 roof-0.150
tr-0.149
 We-0.136
 bulb-0.135
Token was
Token ID373
Feature activation-8.40e+00

Pos logit contributions
ited+0.149
ised+0.125
eful+0.115
ere+0.114
ieved+0.112
Neg logit contributions
 globe-0.085
 roof-0.079
tr-0.074
 fireplace-0.070
 chim-0.068
Token a
Token ID257
Feature activation-5.24e+00

Pos logit contributions
ited+0.145
ised+0.121
 disagreed+0.115
ced+0.112
ere+0.108
Neg logit contributions
 roof-0.174
tr-0.168
 sle-0.166
 globe-0.164
 Dragons-0.160
Token little
Token ID1310
Feature activation-3.75e+00

Pos logit contributions
ited+0.061
ised+0.050
ere+0.047
ced+0.047
 disagreed+0.046
Neg logit contributions
tr-0.124
 Dragons-0.124
bul-0.123
 sle-0.123
 mast-0.120
Token rice
Token ID11464
Feature activation+4.84e+00

Pos logit contributions
 sle+0.053
 roof+0.051
tr+0.051
 mast+0.050
 Dragons+0.049
Neg logit contributions
ited-0.068
ised-0.062
 disagreed-0.059
ere-0.058
ced-0.058
Token.
Token ID13
Feature activation+3.02e+00

Pos logit contributions
 sle+0.115
 mast+0.111
 Dragons+0.111
 roof+0.110
tr+0.108
Neg logit contributions
ited-0.064
ised-0.055
 disagreed-0.054
 seen-0.051
 done-0.051
Token<|endoftext|>
Token ID50256
Feature activation-7.80e+00

Pos logit contributions
 mast+0.076
 sle+0.073
 roof+0.067
 remote+0.067
rier+0.066
Neg logit contributions
ited-0.036
ere-0.032
ised-0.032
 seen-0.030
 did-0.030
TokenOnce
Token ID7454
Feature activation-7.85e+00

Pos logit contributions
ited+0.203
ised+0.180
 disagreed+0.175
illed+0.166
eful+0.165
Neg logit contributions
 globe-0.095
 sle-0.092
tr-0.091
 roof-0.087
 Dragons-0.086
Token upon
Token ID2402
Feature activation-6.42e+00

Pos logit contributions
ited+0.121
ised+0.100
ced+0.094
pped+0.092
 disagreed+0.091
Neg logit contributions
 roof-0.171
 sle-0.168
 globe-0.162
tr-0.158
ometer-0.158
Token a
Token ID257
Feature activation-9.80e+00

Pos logit contributions
ited+0.129
ised+0.114
ere+0.108
 disagreed+0.105
eful+0.101
Neg logit contributions
 roof-0.114
tr-0.111
 Dragons-0.110
 globe-0.109
 sle-0.105
Token time
Token ID640
Feature activation-6.48e+00

Pos logit contributions
ised+0.154
ited+0.143
pped+0.141
ced+0.132
 disagreed+0.132
Neg logit contributions
tr-0.184
 sle-0.173
 roof-0.173
 globe-0.159
 fireplace-0.154
Token,
Token ID11
Feature activation-7.04e+00

Pos logit contributions
ited+0.113
ised+0.087
ere+0.079
pped+0.077
eling+0.077
Neg logit contributions
 globe-0.139
 roof-0.135
 sle-0.134
tr-0.130
 Dragons-0.129
Token there
Token ID612
Feature activation-5.50e+00

Pos logit contributions
ited+0.119
 disagreed+0.091
ere+0.090
ised+0.089
eling+0.082
Neg logit contributions
 globe-0.154
tr-0.150
 roof-0.147
bul-0.134
 remote-0.134
Token was
Token ID373
Feature activation-8.12e+00

Pos logit contributions
ited+0.142
ised+0.122
eful+0.109
ere+0.109
ieved+0.108
Neg logit contributions
 globe-0.076
 roof-0.071
tr-0.067
 fireplace-0.064
 chim-0.062
Token a
Token ID257
Feature activation-4.95e+00

Pos logit contributions
ited+0.141
ised+0.119
 disagreed+0.112
ced+0.110
ere+0.105
Neg logit contributions
 roof-0.167
tr-0.161
 sle-0.160
 globe-0.157
 Dragons-0.155