TOP ACTIVATIONS
MAX = 4.448

mum , " What is a jewel ?" M
scissors . " Thank you so much !" Santa
was a bird named Blue . Blue was very happy because
Queen . She was a grand ship , with a tall
was a bird named Blue . Blue was a graceful bird
little fish y named Blue . Blue loved to swim and
was a bird named Blue . Blue was very beautiful with
was a bird named Blue . Blue liked to fly high
was a bird named Blue . Blue had a strong and
was a bird named Blue . Blue loved to soar high
was a bird named Blue . Blue was an ordinary bird
ostr ich named Blue y . Blue y liked to run
a little bird named Blue . Blue was very sad because
was a bird named Blue . Blue loved to fly high
blue cat named Blue y . Blue y was sad because
a little bird named Blue . Blue was very happy because
wrote letters to Santa and put them in a big red
calm little bird named Blue . Blue lived in a tiny
a little bird named Blue . Blue had a peaceful life
gifted little bird named Blue . Blue loved to sing and

MIN ACTIVATIONS
MIN = -4.024

high ! Can you teach me ?" The frog nodded and
asked , " What is that ?" The mom
Mom my took out a spray bottle . She showed Sarah
higher than you !" The small bubble said , " No
it tastes ." L ula could barely contain her
let 's play it . One , two , three ,
said , " Look at that blue circle !" His mom
fun ! Can you teach me how to make music with
my , can you tell me a story ?" His mom
They swim back to the shallow end of the pool ,
the bald player and the girl skipped up and down the
ball ! Can you teach me how to fly ?" The
dancer ! Can you teach me ?" Lucy smiled and said
he found a can of spray paint in the woods .
can have the steak , L ula ". She turned around
like to stay in the shallow end ." Ben
. She showed Sarah how to spray some nice smelling liquid
and the baby walked together to the garden . When they
cat said , " Mar riage is when two animals love
good ! Can you teach me how to play too ?"

INTERVAL 2.330 - 4.448
CONTAINS 0.072%

can have this . It will make you feel better ,"
saw monkeys , elephants , gir aff es , and lions
a je ep . The je ep was weird . So
," Lily said . " Those are not our gifts .
duck . " Our family treasure is hidden !"

INTERVAL 0.212 - 2.330
CONTAINS 37.126%

funny pants . Tim and his mom went back home ,
play . Billy said , " Let 's go
They pretended that the table was a rocket ship and that
bush ! <|endoftext|> Once upon a time , there was a
. He walked away . He did not say thank you

INTERVAL -1.906 - 0.212
CONTAINS 62.412%

had been able to help someone else . She learned that
called Sammy . Sammy was always looking for exciting adventures ,
to look brighter and happier . The friends were
friend in need . <|endoftext|> Once upon a time , a
read books with pictures of animals and cars . One day

INTERVAL -4.024 - -1.906
CONTAINS 0.389%

Tom said to the mosquito , " Hello ,
like fun . Do you mind if I sit here and
, " Let me teach you something young dinosaur !" Terry
. Max tried to catch the bug , but it was
excited that he wanted to do it , too . Mom
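
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX: each is about 2.118 wide, and the reported inner edges (-1.906, 0.212, 2.330) fall at MIN plus one, two, and three steps. A minimal Python sketch of that calculation follows. The function name interval_edges and the choice of four bins are assumptions made for illustration and are not taken from the tool that produced this output.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi (hypothetical helper)."""
    step = (hi - lo) / n_bins
    return [lo + i * step for i in range(n_bins + 1)]


# MIN and MAX as reported in the sections above.
edges = interval_edges(-4.024, 4.448)

# Round to three decimals to compare against the INTERVAL headers.
print([round(e, 3) for e in edges])
```

The four CONTAINS percentages sum to 99.999%, which is consistent with each share having been rounded to three decimal places.
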
Token mum
Token ID25682
Feature activation-9.19e-01

Pos logit contributions
upid+0.008
ours+0.008
ry+0.008
about+0.007
asing+0.007
Neg logit contributions
Brown-0.011
Ell-0.010
 Brown-0.010
 Stretch-0.009
Twe-0.009
Token,
Token ID11
Feature activation-7.70e-01

Pos logit contributions
upid+0.029
ours+0.027
 ankle+0.027
olt+0.027
 bubble+0.026
Neg logit contributions
Brown-0.038
 Brown-0.037
Ell-0.036
 Stretch-0.035
 Barney-0.035
Token "
Token ID366
Feature activation+7.91e-01

Pos logit contributions
upid+0.029
ours+0.028
ry+0.028
olt+0.027
about+0.027
Neg logit contributions
Brown-0.029
Ell-0.028
 Brown-0.027
 Barney-0.026
 Stretch-0.026
TokenWhat
Token ID2061
Feature activation-7.50e-01

Pos logit contributions
Brown+0.035
Ell+0.033
hy+0.033
 Stretch+0.032
Twe+0.031
Neg logit contributions
ry-0.024
ours-0.023
about-0.022
 bubble-0.022
 Sandy-0.021
Token is
Token ID318
Feature activation+3.06e+00

Pos logit contributions
upid+0.024
ry+0.024
ours+0.024
 Sandy+0.023
 shallow+0.023
Neg logit contributions
Brown-0.031
Ell-0.030
 Stretch-0.029
 Brown-0.028
hy-0.028
Token a
Token ID257
Feature activation+4.45e+00

Pos logit contributions
Brown+0.171
Ell+0.167
Ar+0.152
thank+0.152
hy+0.150
Neg logit contributions
 how-0.060
 more-0.058
 Sandy-0.056
ours-0.056
 if-0.054
Token jewel
Token ID16927
Feature activation+2.55e-01

Pos logit contributions
Ell+0.226
Brown+0.219
hy+0.193
ding+0.191
rob+0.189
Neg logit contributions
 shell-0.121
 foot-0.115
 promise-0.115
 eye-0.108
 end-0.106
Token?"
Token ID1701
Feature activation+5.46e-02

Pos logit contributions
Brown+0.013
Ell+0.013
Twe+0.012
Ar+0.012
 Stretch+0.012
Neg logit contributions
ry-0.006
ours-0.006
upid-0.006
 bubble-0.005
 Sandy-0.005
Token
Token ID198
Feature activation+3.55e-01

Pos logit contributions
Brown+0.002
Ell+0.002
Twe+0.002
 Stretch+0.002
Ar+0.002
Neg logit contributions
ours-0.002
ry-0.002
upid-0.002
 Sandy-0.002
 bubble-0.002
Token
Token ID198
Feature activation+1.98e-01

Pos logit contributions
Brown+0.015
Ell+0.014
Twe+0.013
hy+0.013
 Stretch+0.013
Neg logit contributions
ry-0.012
ours-0.012
upid-0.012
 Sandy-0.011
 bubble-0.011
TokenM
Token ID44
Feature activation+3.81e-01

Pos logit contributions
Brown+0.008
Ell+0.008
Twe+0.007
 Stretch+0.007
hy+0.007
Neg logit contributions
ry-0.007
ours-0.007
upid-0.007
 Sandy-0.006
 bubble-0.006
Token scissors
Token ID41786
Feature activation+9.55e-02

Pos logit contributions
Brown+0.054
Ell+0.052
Twe+0.048
 Stretch+0.047
Ar+0.046
Neg logit contributions
 bubble-0.035
ours-0.035
upid-0.034
 foot-0.034
 Sandy-0.033
Token.
Token ID13
Feature activation+2.22e+00

Pos logit contributions
Brown+0.004
Ell+0.004
Twe+0.004
 Stretch+0.004
Ar+0.004
Neg logit contributions
ours-0.003
ry-0.003
upid-0.003
 Sandy-0.002
 bubble-0.002
Token
Token ID198
Feature activation+1.77e+00

Pos logit contributions
Ell+0.087
Brown+0.080
Twe+0.073
ve+0.072
wl+0.071
Neg logit contributions
ours-0.078
ry-0.074
 hand-0.074
 Sandy-0.071
upid-0.071
Token
Token ID198
Feature activation+1.88e+00

Pos logit contributions
hy+0.082
Ell+0.082
Brown+0.082
Twe+0.075
 loot+0.075
Neg logit contributions
upid-0.039
ry-0.038
ours-0.036
 Sandy-0.036
ress-0.035
Token"
Token ID1
Feature activation+2.08e+00

Pos logit contributions
hy+0.064
Ell+0.062
 Brown+0.058
Brown+0.058
 Ellie+0.057
Neg logit contributions
ry-0.061
about-0.061
upid-0.060
ress-0.060
'-0.060
TokenThank
Token ID10449
Feature activation+3.91e+00

Pos logit contributions
hy+0.077
gy+0.076
Ell+0.076
 Brown+0.076
Brown+0.075
Neg logit contributions
about-0.061
 bubble-0.056
ired-0.054
ful-0.054
Make-0.053
Token you
Token ID345
Feature activation+1.55e+00

Pos logit contributions
Ell+0.206
Brown+0.188
gy+0.187
storm+0.181
hy+0.180
Neg logit contributions
ful-0.084
 of-0.080
'-0.079
ry-0.072
 if-0.066
Token so
Token ID523
Feature activation+1.06e+00

Pos logit contributions
Brown+0.072
Ell+0.069
Twe+0.066
Ar+0.066
 Stretch+0.062
Neg logit contributions
ry-0.042
ress-0.040
 Sandy-0.040
upid-0.039
ful-0.039
Token much
Token ID881
Feature activation+2.22e+00

Pos logit contributions
Brown+0.046
Ell+0.045
Ar+0.042
Twe+0.041
hy+0.039
Neg logit contributions
ry-0.033
upid-0.032
ours-0.032
 Sandy-0.031
 foot-0.029
Token!"
Token ID2474
Feature activation+2.58e+00

Pos logit contributions
Ell+0.114
Brown+0.109
Twe+0.101
Mic+0.099
hy+0.097
Neg logit contributions
ry-0.054
 of-0.050
ours-0.048
 if-0.047
ired-0.047
Token Santa
Token ID8909
Feature activation+8.96e-01

Pos logit contributions
Ell+0.108
Brown+0.107
Twe+0.100
Ar+0.097
hy+0.094
Neg logit contributions
ry-0.073
 hand-0.072
 bubble-0.072
 Sandy-0.071
 Mama-0.067
Token was
Token ID373
Feature activation+8.93e-01

Pos logit contributions
ours+0.020
upid+0.020
ry+0.020
asing+0.019
 ankle+0.018
Neg logit contributions
Brown-0.022
Ell-0.021
 Stretch-0.019
 Brown-0.019
Twe-0.019
Token a
Token ID257
Feature activation+2.26e-01

Pos logit contributions
Brown+0.047
Ell+0.044
hy+0.042
Twe+0.042
 Stretch+0.041
Neg logit contributions
 bubble-0.019
 Sandy-0.019
ours-0.019
upid-0.019
 foot-0.018
Token bird
Token ID6512
Feature activation-5.38e-01

Pos logit contributions
Brown+0.011
Ell+0.010
Twe+0.010
 Stretch+0.009
hy+0.009
Neg logit contributions
ours-0.006
ry-0.006
upid-0.006
 bubble-0.006
 Sandy-0.006
Token named
Token ID3706
Feature activation+1.08e+00

Pos logit contributions
ours+0.018
upid+0.018
ry+0.017
 Sandy+0.016
about+0.016
Neg logit contributions
Brown-0.024
Ell-0.022
Twe-0.021
hy-0.020
 Stretch-0.020
Token Blue
Token ID4518
Feature activation+6.26e-02

Pos logit contributions
Brown+0.033
Ell+0.031
Ar+0.031
ve+0.029
hy+0.028
Neg logit contributions
 bubble-0.046
 Sandy-0.046
upid-0.045
ours-0.043
ry-0.043
Token.
Token ID13
Feature activation+3.90e+00

Pos logit contributions
Brown+0.003
Ell+0.003
Twe+0.002
 Stretch+0.002
Ar+0.002
Neg logit contributions
ours-0.002
ry-0.002
upid-0.002
 Sandy-0.002
 bubble-0.002
Token Blue
Token ID4518
Feature activation+1.36e+00

Pos logit contributions
ve+0.178
bel+0.163
ind+0.143
cle+0.141
grand+0.141
Neg logit contributions
 again-0.096
 of-0.093
 if-0.092
 Cater-0.088
 the-0.088
Token was
Token ID373
Feature activation+1.51e+00

Pos logit contributions
Brown+0.058
Ell+0.057
hy+0.054
 Barney+0.053
Twe+0.053
Neg logit contributions
ry-0.042
 bubble-0.037
 foot-0.036
upid-0.036
ours-0.036
Token very
Token ID845
Feature activation+1.95e+00

Pos logit contributions
Brown+0.068
Ell+0.063
Ar+0.062
Twe+0.060
ve+0.059
Neg logit contributions
 Sandy-0.043
 foot-0.042
 bubble-0.042
upid-0.040
 hand-0.039
Token happy
Token ID3772
Feature activation+1.37e-01

Pos logit contributions
Brown+0.092
Ell+0.089
Ar+0.078
 Barney+0.077
 Stretch+0.077
Neg logit contributions
 foot-0.055
 Sandy-0.054
upid-0.054
ours-0.052
 bubble-0.052
Token because
Token ID780
Feature activation+1.28e+00

Pos logit contributions
Brown+0.006
Ell+0.006
Twe+0.005
 Stretch+0.005
Ar+0.005
Neg logit contributions
ry-0.004
ours-0.004
upid-0.004
 Sandy-0.004
 bubble-0.004
Token Queen
Token ID7542
Feature activation+4.19e-02

Pos logit contributions
ours+0.023
upid+0.023
ry+0.021
 ankle+0.020
asing+0.020
Neg logit contributions
Brown-0.025
Ell-0.023
Twe-0.022
 Stretch-0.022
Ar-0.022
Token.
Token ID13
Feature activation+1.84e+00

Pos logit contributions
Brown+0.002
Ell+0.002
Twe+0.002
 Stretch+0.002
Ar+0.002
Neg logit contributions
ours-0.001
ry-0.001
upid-0.001
 Sandy-0.001
 bubble-0.001
Token She
Token ID1375
Feature activation+1.11e+00

Pos logit contributions
Brown+0.088
ve+0.083
Ell+0.081
Twe+0.080
Ar+0.079
Neg logit contributions
 Sandy-0.047
ry-0.046
ours-0.045
upid-0.045
 Jelly-0.044
Token was
Token ID373
Feature activation+1.14e+00

Pos logit contributions
Brown+0.048
Ell+0.048
Mic+0.042
Twe+0.042
 Kirst+0.042
Neg logit contributions
ry-0.034
 bubble-0.033
ours-0.032
 foot-0.031
upid-0.031
Token a
Token ID257
Feature activation+1.50e+00

Pos logit contributions
Brown+0.050
Ell+0.047
Twe+0.043
Mic+0.042
Ar+0.041
Neg logit contributions
 Sandy-0.036
ours-0.035
 foot-0.035
 bubble-0.035
upid-0.034
Token grand
Token ID4490
Feature activation+3.85e+00

Pos logit contributions
Ell+0.063
Brown+0.062
Twe+0.055
 Kirst+0.054
hy+0.053
Neg logit contributions
 bubble-0.050
 foot-0.049
ry-0.047
 dolphin-0.046
 twin-0.045
Token ship
Token ID4074
Feature activation+1.19e+00

Pos logit contributions
Ell+0.192
Brown+0.188
Red+0.179
Twe+0.179
Mic+0.171
Neg logit contributions
ry-0.114
 eye-0.108
 dolphin-0.107
 insect-0.106
ful-0.104
Token,
Token ID11
Feature activation+8.94e-01

Pos logit contributions
Brown+0.057
Ell+0.055
Twe+0.052
Ar+0.051
ve+0.049
Neg logit contributions
ry-0.033
 bubble-0.031
 foot-0.030
ours-0.029
apping-0.029
Token with
Token ID351
Feature activation+1.51e+00

Pos logit contributions
Brown+0.044
Ell+0.042
Twe+0.040
Ar+0.039
ve+0.039
Neg logit contributions
 Sandy-0.022
ry-0.021
ours-0.021
 bubble-0.021
 foot-0.021
Token a
Token ID257
Feature activation-4.89e-02

Pos logit contributions
Brown+0.072
Ell+0.070
Twe+0.068
ve+0.064
 Claus+0.063
Neg logit contributions
upid-0.039
 bubble-0.039
 Sandy-0.038
 foot-0.038
ful-0.037
Token tall
Token ID7331
Feature activation-1.77e-01

Pos logit contributions
ours+0.002
ry+0.002
upid+0.002
 Sandy+0.002
 bubble+0.002
Neg logit contributions
Brown-0.002
Ell-0.002
Twe-0.001
 Stretch-0.001
Ar-0.001
Token was
Token ID373
Feature activation+6.99e-01

Pos logit contributions
ours+0.020
upid+0.020
ry+0.019
asing+0.019
 bubble+0.018
Neg logit contributions
Brown-0.022
Ell-0.020
 Brown-0.019
 Stretch-0.019
Twe-0.019
Token a
Token ID257
Feature activation+2.36e-01

Pos logit contributions
Brown+0.036
Ell+0.034
Twe+0.032
hy+0.032
 Stretch+0.031
Neg logit contributions
ours-0.017
 bubble-0.016
upid-0.016
 Sandy-0.016
 foot-0.015
Token bird
Token ID6512
Feature activation-6.08e-01

Pos logit contributions
Brown+0.011
Ell+0.011
Twe+0.010
 Stretch+0.010
hy+0.010
Neg logit contributions
ours-0.006
ry-0.006
upid-0.006
 bubble-0.006
 Sandy-0.006
Token named
Token ID3706
Feature activation+7.76e-01

Pos logit contributions
upid+0.021
ours+0.021
ry+0.019
 Sandy+0.019
about+0.018
Neg logit contributions
Brown-0.026
Ell-0.025
Twe-0.023
hy-0.022
 Stretch-0.022
Token Blue
Token ID4518
Feature activation+8.18e-02

Pos logit contributions
Brown+0.025
Ell+0.024
Ar+0.022
ve+0.021
hy+0.021
Neg logit contributions
upid-0.032
 Sandy-0.032
 bubble-0.032
ours-0.032
ry-0.031
Token.
Token ID13
Feature activation+3.82e+00

Pos logit contributions
Brown+0.004
Ell+0.003
Twe+0.003
 Stretch+0.003
Ar+0.003
Neg logit contributions
ours-0.003
ry-0.003
upid-0.003
 Sandy-0.002
 bubble-0.002
Token Blue
Token ID4518
Feature activation+1.43e+00

Pos logit contributions
ve+0.172
bel+0.158
Ar+0.139
grand+0.137
cle+0.135
Neg logit contributions
 again-0.092
 Cater-0.092
 if-0.091
 Duck-0.091
 cricket-0.089
Token was
Token ID373
Feature activation+1.54e+00

Pos logit contributions
Brown+0.062
Ell+0.061
hy+0.058
 Barney+0.056
Twe+0.056
Neg logit contributions
ry-0.043
 bubble-0.038
ours-0.037
 foot-0.037
upid-0.037
Token a
Token ID257
Feature activation+1.36e+00

Pos logit contributions
Brown+0.069
Ell+0.065
Ar+0.063
Twe+0.061
ve+0.060
Neg logit contributions
 Sandy-0.044
 foot-0.043
 bubble-0.043
 hand-0.040
upid-0.040
Token graceful
Token ID44363
Feature activation-2.22e-01

Pos logit contributions
Brown+0.056
Ell+0.055
hy+0.049
Twe+0.049
Ar+0.048
Neg logit contributions
 foot-0.044
 bubble-0.044
ry-0.040
 hand-0.039
 twin-0.039
Token bird
Token ID6512
Feature activation-3.77e-01

Pos logit contributions
ours+0.007
upid+0.007
ry+0.007
 Sandy+0.006
asing+0.006
Neg logit contributions
Brown-0.010
Ell-0.010
Twe-0.009
 Stretch-0.009
hy-0.009
Token little
Token ID1310
Feature activation+3.31e-01

Pos logit contributions
Brown+0.011
Ell+0.010
Twe+0.010
 Stretch+0.009
hy+0.009
Neg logit contributions
ours-0.006
ry-0.006
upid-0.006
 bubble-0.006
 Sandy-0.006
Token fish
Token ID5916
Feature activation-9.66e-01

Pos logit contributions
Brown+0.016
Ell+0.015
Twe+0.014
Ar+0.014
 Stretch+0.014
Neg logit contributions
ry-0.009
ours-0.009
upid-0.008
 bubble-0.008
 Sandy-0.008
Tokeny
Token ID88
Feature activation-1.07e+00

Pos logit contributions
upid+0.035
ours+0.035
 Sandy+0.032
ry+0.032
about+0.032
Neg logit contributions
Brown-0.038
Ell-0.035
Twe-0.034
 Stretch-0.033
hy-0.032
Token named
Token ID3706
Feature activation+3.02e-01

Pos logit contributions
upid+0.040
ours+0.039
asing+0.036
ress+0.035
ry+0.035
Neg logit contributions
Brown-0.042
Twe-0.038
 Stretch-0.038
Ell-0.037
 Brown-0.037
Token Blue
Token ID4518
Feature activation-1.37e-02

Pos logit contributions
Brown+0.011
Ell+0.010
Twe+0.009
hy+0.009
Ar+0.009
Neg logit contributions
ours-0.012
upid-0.012
ry-0.012
 Sandy-0.011
 bubble-0.011
Token.
Token ID13
Feature activation+3.78e+00

Pos logit contributions
ours+0.000
ry+0.000
upid+0.000
 Sandy+0.000
 bubble+0.000
Neg logit contributions
Brown-0.001
Ell-0.001
Twe-0.001
 Stretch-0.001
Ar-0.001
Token Blue
Token ID4518
Feature activation+8.60e-01

Pos logit contributions
ve+0.170
bel+0.151
cle+0.142
grand+0.142
ind+0.136
Neg logit contributions
 again-0.102
 if-0.097
 goalie-0.095
 frogs-0.094
 how-0.093
Token loved
Token ID6151
Feature activation+5.55e-01

Pos logit contributions
Brown+0.045
Ell+0.044
hy+0.041
Twe+0.041
 Stretch+0.041
Neg logit contributions
ry-0.020
upid-0.018
ours-0.018
 bubble-0.017
 foot-0.016
Token to
Token ID284
Feature activation-1.55e-01

Pos logit contributions
Brown+0.025
Ell+0.024
Twe+0.022
hy+0.022
 Stretch+0.021
Neg logit contributions
ours-0.017
upid-0.016
 bubble-0.016
ry-0.016
 Sandy-0.016
Token swim
Token ID9422
Feature activation+3.96e-01

Pos logit contributions
ours+0.005
ry+0.005
upid+0.005
 Sandy+0.004
asing+0.004
Neg logit contributions
Brown-0.007
Ell-0.006
Twe-0.006
 Stretch-0.006
Ar-0.006
Token and
Token ID290
Feature activation-5.29e-01

Pos logit contributions
Brown+0.017
Ell+0.016
Twe+0.015
hy+0.014
 Stretch+0.014
Neg logit contributions
ry-0.014
ours-0.013
upid-0.013
 Sandy-0.012
 bubble-0.012
Token was
Token ID373
Feature activation+8.03e-01

Pos logit contributions
ours+0.031
upid+0.030
asing+0.029
ry+0.029
 ankle+0.028
Neg logit contributions
Brown-0.029
Ell-0.027
 Brown-0.026
 Stretch-0.026
Twe-0.025
Token a
Token ID257
Feature activation+7.49e-02

Pos logit contributions
Brown+0.042
Ell+0.039
Twe+0.037
hy+0.037
 Stretch+0.036
Neg logit contributions
 Sandy-0.018
 bubble-0.018
ours-0.018
upid-0.017
 foot-0.017
Token bird
Token ID6512
Feature activation-6.51e-01

Pos logit contributions
Brown+0.003
Ell+0.003
Twe+0.003
 Stretch+0.003
Ar+0.003
Neg logit contributions
ours-0.002
ry-0.002
upid-0.002
 Sandy-0.002
 bubble-0.002
Token named
Token ID3706
Feature activation+7.54e-01

Pos logit contributions
upid+0.023
ours+0.022
ry+0.021
 Sandy+0.020
about+0.020
Neg logit contributions
Brown-0.028
Ell-0.026
Twe-0.025
hy-0.024
 Brown-0.023
Token Blue
Token ID4518
Feature activation+2.80e-02

Pos logit contributions
Brown+0.025
Ell+0.023
Ar+0.022
ve+0.021
hy+0.021
Neg logit contributions
 bubble-0.031
upid-0.031
 Sandy-0.031
ours-0.030
ry-0.030
Token.
Token ID13
Feature activation+3.77e+00

Pos logit contributions
Brown+0.001
Ell+0.001
Twe+0.001
 Stretch+0.001
Ar+0.001
Neg logit contributions
ours-0.001
ry-0.001
upid-0.001
 Sandy-0.001
 bubble-0.001
Token Blue
Token ID4518
Feature activation+1.37e+00

Pos logit contributions
ve+0.172
bel+0.153
ind+0.140
cle+0.138
grand+0.137
Neg logit contributions
 again-0.097
 if-0.093
 goalie-0.090
 of-0.089
 Cater-0.088
Token was
Token ID373
Feature activation+1.42e+00

Pos logit contributions
Brown+0.060
Ell+0.058
hy+0.055
Twe+0.054
 Barney+0.054
Neg logit contributions
ry-0.041
 bubble-0.037
ours-0.035
 foot-0.035
upid-0.035
Token very
Token ID845
Feature activation+1.95e+00

Pos logit contributions
Brown+0.063
Ell+0.059
Ar+0.057
Twe+0.055
ve+0.055
Neg logit contributions
 Sandy-0.041
 foot-0.041
 bubble-0.040
upid-0.038
ours-0.037
Token beautiful
Token ID4950
Feature activation+4.65e-02

Pos logit contributions
Brown+0.091
Ell+0.089
Ar+0.078
 Barney+0.077
 Elliot+0.077
Neg logit contributions
 foot-0.056
upid-0.054
 Sandy-0.054
 bubble-0.052
ours-0.052
Token with
Token ID351
Feature activation+2.03e+00

Pos logit contributions
Brown+0.002
Ell+0.002
Twe+0.002
 Stretch+0.002
Ar+0.002
Neg logit contributions
ours-0.001
ry-0.001
upid-0.001
 Sandy-0.001
 bubble-0.001
Token was
Token ID373
Feature activation+9.04e-01

Pos logit contributions
ours+0.016
upid+0.016
ry+0.016
asing+0.015
 bubble+0.015
Neg logit contributions
Brown-0.019
Ell-0.018
 Stretch-0.017
 Brown-0.017
Twe-0.016
Token a
Token ID257
Feature activation+4.41e-01

Pos logit contributions
Brown+0.047
Ell+0.044
Twe+0.042
hy+0.042
 Stretch+0.041
Neg logit contributions
 bubble-0.020
 Sandy-0.020
ours-0.020
upid-0.019
 foot-0.019
Token bird
Token ID6512
Feature activation-4.07e-01

Pos logit contributions
Brown+0.021
Ell+0.020
Twe+0.019
hy+0.019
 Stretch+0.019
Neg logit contributions
ry-0.012
ours-0.012
upid-0.011
 bubble-0.011
 foot-0.011
Token named
Token ID3706
Feature activation+1.02e+00

Pos logit contributions
ours+0.013
upid+0.013
ry+0.012
 Sandy+0.012
about+0.011
Neg logit contributions
Brown-0.018
Ell-0.017
Twe-0.016
hy-0.016
 Stretch-0.016
Token Blue
Token ID4518
Feature activation+2.50e-01

Pos logit contributions
Brown+0.034
Ell+0.032
Ar+0.031
ve+0.030
hy+0.028
Neg logit contributions
 Sandy-0.041
 bubble-0.041
upid-0.041
ours-0.040
ry-0.039
Token.
Token ID13
Feature activation+3.74e+00

Pos logit contributions
Brown+0.011
Ell+0.011
Twe+0.010
 Stretch+0.010
Ar+0.010
Neg logit contributions
ry-0.008
ours-0.007
upid-0.007
 bubble-0.007
 Sandy-0.007
Token Blue
Token ID4518
Feature activation+1.37e+00

Pos logit contributions
ve+0.159
bel+0.144
ind+0.127
grand+0.127
cle+0.126
Neg logit contributions
 of-0.101
 Duck-0.098
 if-0.097
 again-0.096
 Cater-0.094
Token liked
Token ID8288
Feature activation+1.06e+00

Pos logit contributions
Brown+0.058
Ell+0.057
hy+0.054
Twe+0.053
 Barney+0.053
Neg logit contributions
ry-0.043
 bubble-0.038
ours-0.037
upid-0.037
 foot-0.037
Token to
Token ID284
Feature activation+7.72e-01

Pos logit contributions
Brown+0.052
Ell+0.049
Twe+0.047
ve+0.046
Ar+0.046
Neg logit contributions
 bubble-0.027
ours-0.027
 Sandy-0.026
upid-0.026
 foot-0.025
Token fly
Token ID6129
Feature activation+1.39e+00

Pos logit contributions
Brown+0.036
Ell+0.035
ve+0.032
 Stretch+0.032
Twe+0.032
Neg logit contributions
 bubble-0.022
upid-0.021
 foot-0.020
ours-0.020
ry-0.020
Token high
Token ID1029
Feature activation+2.14e+00

Pos logit contributions
Brown+0.066
Twe+0.059
Ell+0.059
hy+0.058
 Barney+0.057
Neg logit contributions
ry-0.037
 bubble-0.037
 foot-0.036
upid-0.035
 Sandy-0.035
Token was
Token ID373
Feature activation+9.41e-01

Pos logit contributions
ours+0.022
upid+0.022
ry+0.021
asing+0.021
 bubble+0.020
Neg logit contributions
Brown-0.024
Ell-0.022
 Brown-0.021
 Stretch-0.021
Twe-0.021
Token a
Token ID257
Feature activation+2.09e-01

Pos logit contributions
Brown+0.049
Ell+0.046
hy+0.044
Twe+0.044
ve+0.043
Neg logit contributions
 bubble-0.021
 Sandy-0.020
ours-0.020
upid-0.020
 foot-0.019
Token bird
Token ID6512
Feature activation-5.79e-01

Pos logit contributions
Brown+0.010
Ell+0.009
Twe+0.009
 Stretch+0.009
Ar+0.009
Neg logit contributions
ours-0.006
ry-0.006
upid-0.006
 bubble-0.005
 Sandy-0.005
Token named
Token ID3706
Feature activation+9.70e-01

Pos logit contributions
ours+0.019
upid+0.019
ry+0.018
 Sandy+0.018
about+0.017
Neg logit contributions
Brown-0.025
Ell-0.024
Twe-0.022
hy-0.021
 Stretch-0.021
Token Blue
Token ID4518
Feature activation+1.40e-01

Pos logit contributions
Brown+0.032
Ell+0.030
Ar+0.029
ve+0.028
hy+0.027
Neg logit contributions
 Sandy-0.040
 bubble-0.039
upid-0.039
ours-0.038
ry-0.037
Token.
Token ID13
Feature activation+3.71e+00

Pos logit contributions
Brown+0.006
Ell+0.006
Twe+0.006
 Stretch+0.005
Ar+0.005
Neg logit contributions
ry-0.004
ours-0.004
upid-0.004
 Sandy-0.004
 bubble-0.004
Token Blue
Token ID4518
Feature activation+1.21e+00

Pos logit contributions
ve+0.160
bel+0.142
ind+0.131
grand+0.126
cle+0.126
Neg logit contributions
 of-0.098
 again-0.096
 if-0.092
 Duck-0.091
 let-0.090
Token had
Token ID550
Feature activation+1.15e+00

Pos logit contributions
Brown+0.051
Ell+0.049
hy+0.046
Twe+0.045
 Barney+0.045
Neg logit contributions
ry-0.039
 bubble-0.035
ours-0.035
upid-0.035
 foot-0.034
Token a
Token ID257
Feature activation+1.45e+00

Pos logit contributions
Brown+0.058
Ell+0.055
Twe+0.054
ve+0.053
 Stretch+0.053
Neg logit contributions
 foot-0.025
 bubble-0.024
upid-0.023
 Sandy-0.023
 hand-0.023
Token strong
Token ID1913
Feature activation+3.92e-01

Pos logit contributions
Brown+0.061
Ell+0.057
Twe+0.053
hy+0.053
 Stretch+0.052
Neg logit contributions
 bubble-0.048
 foot-0.048
 hand-0.044
upid-0.043
 hoop-0.042
Token and
Token ID290
Feature activation+2.48e-01

Pos logit contributions
Brown+0.020
Ell+0.019
Twe+0.018
 Stretch+0.018
Ar+0.018
Neg logit contributions
upid-0.010
ours-0.009
ry-0.009
 foot-0.009
 bubble-0.009
Token was
Token ID373
Feature activation+6.75e-01

Pos logit contributions
ours+0.031
upid+0.031
asing+0.030
ry+0.030
 ankle+0.028
Neg logit contributions
Brown-0.029
Ell-0.027
 Brown-0.026
 Stretch-0.026
Twe-0.025
Token a
Token ID257
Feature activation+2.05e-01

Pos logit contributions
Brown+0.034
Ell+0.032
Twe+0.030
hy+0.030
 Stretch+0.030
Neg logit contributions
 bubble-0.016
ours-0.016
upid-0.016
 Sandy-0.016
 foot-0.015
Token bird
Token ID6512
Feature activation-5.59e-01

Pos logit contributions
Brown+0.010
Ell+0.009
Twe+0.009
 Stretch+0.008
Ar+0.008
Neg logit contributions
ours-0.006
ry-0.006
upid-0.006
 bubble-0.005
 Sandy-0.005
Token named
Token ID3706
Feature activation+8.44e-01

Pos logit contributions
upid+0.019
ours+0.019
ry+0.017
 Sandy+0.017
about+0.016
Neg logit contributions
Brown-0.024
Ell-0.023
Twe-0.022
hy-0.021
 Stretch-0.021
Token Blue
Token ID4518
Feature activation+5.16e-02

Pos logit contributions
Brown+0.027
Ell+0.025
Ar+0.024
ve+0.023
hy+0.023
Neg logit contributions
upid-0.035
 Sandy-0.035
 bubble-0.035
ours-0.035
ry-0.034
Token.
Token ID13
Feature activation+3.71e+00

Pos logit contributions
Brown+0.002
Ell+0.002
Twe+0.002
 Stretch+0.002
Ar+0.002
Neg logit contributions
ours-0.002
ry-0.002
upid-0.002
 Sandy-0.001
 bubble-0.001
Token Blue
Token ID4518
Feature activation+1.12e+00

Pos logit contributions
ve+0.162
bel+0.144
ind+0.130
cle+0.130
Ar+0.129
Neg logit contributions
 of-0.096
 again-0.095
 if-0.093
 Cater-0.093
 cricket-0.091
Token loved
Token ID6151
Feature activation+1.12e+00

Pos logit contributions
Brown+0.047
Ell+0.045
hy+0.043
Twe+0.042
 Barney+0.042
Neg logit contributions
ry-0.036
ours-0.033
 bubble-0.033
upid-0.032
 foot-0.032
Token to
Token ID284
Feature activation+4.73e-01

Pos logit contributions
Brown+0.054
Ell+0.052
Twe+0.050
hy+0.049
ve+0.049
Neg logit contributions
 bubble-0.029
ours-0.028
 Sandy-0.028
upid-0.027
 foot-0.026
Token soar
Token ID48701
Feature activation+1.85e+00

Pos logit contributions
Brown+0.021
Ell+0.020
Twe+0.019
 Stretch+0.019
Ar+0.019
Neg logit contributions
upid-0.014
 bubble-0.014
ours-0.014
ry-0.014
 foot-0.013
Token high
Token ID1029
Feature activation+2.46e+00

Pos logit contributions
Brown+0.089
Ell+0.081
Twe+0.080
hy+0.079
 Barney+0.076
Neg logit contributions
ry-0.050
 foot-0.048
 bubble-0.048
 shallow-0.046
 Sandy-0.046
Token was
Token ID373
Feature activation+7.10e-01

Pos logit contributions
ours+0.015
upid+0.015
ry+0.015
asing+0.014
 bubble+0.014
Neg logit contributions
Brown-0.018
Ell-0.017
 Stretch-0.016
 Brown-0.016
Twe-0.016
Token a
Token ID257
Feature activation+2.19e-01

Pos logit contributions
Brown+0.036
Ell+0.034
Twe+0.032
hy+0.032
ve+0.031
Neg logit contributions
ours-0.017
 bubble-0.017
upid-0.016
 Sandy-0.016
ry-0.016
Token bird
Token ID6512
Feature activation-4.56e-01

Pos logit contributions
Brown+0.010
Ell+0.010
Twe+0.009
 Stretch+0.009
Ar+0.009
Neg logit contributions
ours-0.006
ry-0.006
upid-0.006
 bubble-0.006
 Sandy-0.006
Token named
Token ID3706
Feature activation+8.76e-01

Pos logit contributions
ours+0.015
upid+0.015
ry+0.014
 Sandy+0.013
about+0.013
Neg logit contributions
Brown-0.020
Ell-0.019
Twe-0.018
 Stretch-0.017
hy-0.017
Token Blue
Token ID4518
Feature activation+8.50e-02

Pos logit contributions
Brown+0.028
Ell+0.026
Ar+0.025
ve+0.024
Twe+0.023
Neg logit contributions
 bubble-0.036
upid-0.036
 Sandy-0.036
ours-0.035
 foot-0.034
Token.
Token ID13
Feature activation+3.70e+00

Pos logit contributions
Brown+0.004
Ell+0.004
Twe+0.003
 Stretch+0.003
Ar+0.003
Neg logit contributions
ours-0.003
ry-0.003
upid-0.003
 Sandy-0.002
 bubble-0.002
Token Blue
Token ID4518
Feature activation+1.38e+00

Pos logit contributions
ve+0.163
bel+0.140
ind+0.139
cle+0.130
grand+0.127
Neg logit contributions
 goalie-0.097
 of-0.096
 again-0.095
 the-0.093
 All-0.093
Token was
Token ID373
Feature activation+1.46e+00

Pos logit contributions
Brown+0.058
Ell+0.056
hy+0.053
 Barney+0.052
Twe+0.052
Neg logit contributions
ry-0.045
 bubble-0.040
ours-0.039
upid-0.039
 foot-0.039
Token an
Token ID281
Feature activation+8.38e-01

Pos logit contributions
Brown+0.066
Ell+0.061
Ar+0.059
Twe+0.057
ve+0.056
Neg logit contributions
 foot-0.042
 bubble-0.042
 Sandy-0.041
upid-0.040
 hand-0.039
Token ordinary
Token ID8850
Feature activation+1.62e-01

Pos logit contributions
Brown+0.032
Ell+0.031
Ar+0.028
 Stretch+0.027
Twe+0.027
Neg logit contributions
upid-0.031
ours-0.030
 foot-0.030
ry-0.030
 bubble-0.029
Token bird
Token ID6512
Feature activation-4.61e-01

Pos logit contributions
Brown+0.008
Ell+0.008
Twe+0.007
 Stretch+0.007
Ar+0.007
Neg logit contributions
ry-0.004
ours-0.004
upid-0.004
 bubble-0.004
 Sandy-0.004
Token ostr
Token ID42387
Feature activation+1.80e+00

Pos logit contributions
upid+0.031
ours+0.031
ry+0.030
asing+0.030
 Sandy+0.028
Neg logit contributions
Brown-0.037
Ell-0.034
Twe-0.033
 Brown-0.033
hy-0.033
Tokenich
Token ID488
Feature activation-9.26e-01

Pos logit contributions
Ar+0.085
Ell+0.083
Brown+0.082
 Barney+0.078
 Stretch+0.077
Neg logit contributions
ry-0.052
apping-0.045
icking-0.045
ours-0.045
ress-0.044
Token named
Token ID3706
Feature activation+1.51e+00

Pos logit contributions
upid+0.036
ours+0.035
 Sandy+0.032
ry+0.032
about+0.031
Neg logit contributions
Brown-0.037
Ell-0.035
Twe-0.033
hy-0.032
 Brown-0.031
Token Blue
Token ID4518
Feature activation+1.34e-01

Pos logit contributions
Brown+0.059
Ell+0.057
Ar+0.054
ve+0.053
thank+0.052
Neg logit contributions
 Sandy-0.054
upid-0.048
 bubble-0.047
ry-0.047
ours-0.046
Tokeny
Token ID88
Feature activation+7.16e-02

Pos logit contributions
Brown+0.006
Ell+0.006
Twe+0.005
 Stretch+0.005
Ar+0.005
Neg logit contributions
ry-0.004
ours-0.004
upid-0.004
 Sandy-0.004
 bubble-0.004
Token.
Token ID13
Feature activation+3.70e+00

Pos logit contributions
Brown+0.003
Ell+0.003
Twe+0.003
 Stretch+0.003
Ar+0.003
Neg logit contributions
ours-0.002
ry-0.002
upid-0.002
 Sandy-0.002
 bubble-0.002
Token Blue
Token ID4518
Feature activation+1.16e+00

Pos logit contributions
ve+0.176
bel+0.151
ind+0.141
cle+0.137
Ar+0.130
Neg logit contributions
 again-0.089
 of-0.089
 All-0.089
 Believe-0.089
 Let-0.088
Tokeny
Token ID88
Feature activation+1.92e+00

Pos logit contributions
Brown+0.065
Ell+0.063
hy+0.061
Twe+0.060
 Stretch+0.060
Neg logit contributions
ry-0.022
upid-0.018
ours-0.018
 bubble-0.017
 foot-0.016
Token liked
Token ID8288
Feature activation+9.28e-01

Pos logit contributions
Brown+0.086
Ell+0.085
Ar+0.080
Mic+0.079
Twe+0.078
Neg logit contributions
ry-0.052
 foot-0.051
 bubble-0.048
 Sandy-0.048
 twin-0.046
Token to
Token ID284
Feature activation+6.60e-01

Pos logit contributions
Brown+0.044
Ell+0.042
Twe+0.040
hy+0.039
ve+0.039
Neg logit contributions
ours-0.024
 bubble-0.024
 Sandy-0.024
upid-0.023
 foot-0.023
Token run
Token ID1057
Feature activation+1.16e+00

Pos logit contributions
Brown+0.031
Ell+0.030
Twe+0.027
ve+0.027
 Stretch+0.027
Neg logit contributions
 bubble-0.019
upid-0.018
ours-0.018
ry-0.017
 foot-0.017
Token a
Token ID257
Feature activation+3.29e-01

Pos logit contributions
Brown+0.031
Ell+0.030
Twe+0.028
hy+0.028
 Stretch+0.028
Neg logit contributions
ours-0.015
upid-0.015
 bubble-0.015
 Sandy-0.015
 foot-0.014
Token little
Token ID1310
Feature activation+1.38e-01

Pos logit contributions
Brown+0.016
Ell+0.015
Twe+0.014
 Stretch+0.014
hy+0.014
Neg logit contributions
ours-0.009
ry-0.009
upid-0.009
 bubble-0.008
 Sandy-0.008
Token bird
Token ID6512
Feature activation-5.94e-01

Pos logit contributions
Brown+0.007
Ell+0.006
Twe+0.006
 Stretch+0.006
Ar+0.006
Neg logit contributions
ours-0.004
ry-0.004
upid-0.004
 bubble-0.003
 Sandy-0.003
Token named
Token ID3706
Feature activation+7.94e-01

Pos logit contributions
upid+0.020
ours+0.020
ry+0.019
 Sandy+0.018
about+0.018
Neg logit contributions
Brown-0.026
Ell-0.024
Twe-0.023
hy-0.022
 Stretch-0.022
Token Blue
Token ID4518
Feature activation+1.26e-01

Pos logit contributions
Brown+0.028
Ell+0.026
Ar+0.025
hy+0.024
ve+0.023
Neg logit contributions
upid-0.031
 Sandy-0.031
 bubble-0.031
ours-0.031
ry-0.030
Token.
Token ID13
Feature activation+3.66e+00

Pos logit contributions
Brown+0.006
Ell+0.005
Twe+0.005
 Stretch+0.005
Ar+0.005
Neg logit contributions
ry-0.004
ours-0.004
upid-0.004
 Sandy-0.004
 bubble-0.004
Token Blue
Token ID4518
Feature activation+1.20e+00

Pos logit contributions
ve+0.160
bel+0.143
cle+0.128
Ar+0.128
grand+0.125
Neg logit contributions
 again-0.094
 Cater-0.093
 Duck-0.092
 of-0.092
 if-0.091
Token was
Token ID373
Feature activation+1.44e+00

Pos logit contributions
Brown+0.050
Ell+0.049
hy+0.046
Twe+0.045
 Barney+0.045
Neg logit contributions
ry-0.039
 bubble-0.035
ours-0.035
upid-0.034
 foot-0.034
Token very
Token ID845
Feature activation+1.83e+00

Pos logit contributions
Brown+0.063
Ell+0.059
Ar+0.057
Twe+0.055
ve+0.054
Neg logit contributions
 foot-0.043
 Sandy-0.042
 bubble-0.042
upid-0.040
ours-0.039
Token sad
Token ID6507
Feature activation-2.50e-01

Pos logit contributions
Brown+0.085
Ell+0.083
Ar+0.073
 Barney+0.071
 Stretch+0.071
Neg logit contributions
 foot-0.053
upid-0.052
 Sandy-0.051
ours-0.051
 bubble-0.050
Token because
Token ID780
Feature activation+1.21e+00

Pos logit contributions
ours+0.009
upid+0.009
ry+0.008
 Sandy+0.008
about+0.008
Neg logit contributions
Brown-0.010
Ell-0.009
 Stretch-0.009
 Brown-0.009
Twe-0.009
Token was
Token ID373
Feature activation+7.89e-01

Pos logit contributions
ours+0.003
upid+0.003
ry+0.003
 bubble+0.002
 Sandy+0.002
Neg logit contributions
Brown-0.004
Ell-0.004
Twe-0.003
 Stretch-0.003
Ar-0.003
Token a
Token ID257
Feature activation+2.60e-01

Pos logit contributions
Brown+0.041
Ell+0.039
Twe+0.036
hy+0.036
ve+0.035
Neg logit contributions
ours-0.018
 bubble-0.018
 Sandy-0.018
upid-0.018
 foot-0.017
Token bird
Token ID6512
Feature activation-3.33e-01

Pos logit contributions
Brown+0.012
Ell+0.011
Twe+0.011
 Stretch+0.010
hy+0.010
Neg logit contributions
ours-0.008
ry-0.008
upid-0.007
 bubble-0.007
 Sandy-0.007
Token named
Token ID3706
Feature activation+8.67e-01

Pos logit contributions
ours+0.010
upid+0.010
ry+0.010
 Sandy+0.009
about+0.009
Neg logit contributions
Brown-0.015
Ell-0.014
Twe-0.013
 Stretch-0.013
hy-0.013
Token Blue
Token ID4518
Feature activation+2.69e-01

Pos logit contributions
Brown+0.030
Ell+0.028
Ar+0.026
ve+0.026
Twe+0.025
Neg logit contributions
 Sandy-0.034
upid-0.034
 bubble-0.034
ours-0.033
 foot-0.032
Token.
Token ID13
Feature activation+3.65e+00

Pos logit contributions
Brown+0.012
Ell+0.011
Twe+0.011
 Stretch+0.011
Ar+0.010
Neg logit contributions
ry-0.008
ours-0.008
upid-0.008
 bubble-0.007
 Sandy-0.007
Token Blue
Token ID4518
Feature activation+1.50e+00

Pos logit contributions
ve+0.158
bel+0.137
ind+0.129
cle+0.128
grand+0.127
Neg logit contributions
 goalie-0.101
 again-0.099
 harder-0.095
 cricket-0.094
 of-0.094
Token loved
Token ID6151
Feature activation+1.20e+00

Pos logit contributions
Brown+0.064
Ell+0.062
hy+0.059
 Barney+0.059
 Stretch+0.058
Neg logit contributions
ry-0.046
 bubble-0.041
 foot-0.040
ours-0.039
upid-0.039
Token to
Token ID284
Feature activation+3.65e-01

Pos logit contributions
Brown+0.060
Ell+0.056
Twe+0.055
ve+0.054
hy+0.054
Neg logit contributions
 bubble-0.030
ours-0.029
 Sandy-0.029
 foot-0.027
upid-0.027
Token fly
Token ID6129
Feature activation+1.48e+00

Pos logit contributions
Brown+0.016
Ell+0.015
Twe+0.014
 Stretch+0.014
Ar+0.014
Neg logit contributions
upid-0.011
ours-0.011
ry-0.011
 bubble-0.011
 Sandy-0.011
Token high
Token ID1029
Feature activation+2.08e+00

Pos logit contributions
Brown+0.070
Ell+0.061
Twe+0.061
 Barney+0.060
hy+0.060
Neg logit contributions
ry-0.041
 bubble-0.040
 foot-0.039
 Sandy-0.038
 shallow-0.038
Token blue
Token ID4171
Feature activation-8.83e-01

Pos logit contributions
Brown+0.008
Ell+0.008
Twe+0.008
 Stretch+0.007
Ar+0.007
Neg logit contributions
ours-0.005
ry-0.005
upid-0.005
 bubble-0.005
 Sandy-0.005
Token cat
Token ID3797
Feature activation-1.36e-01

Pos logit contributions
upid+0.031
ours+0.030
ry+0.029
asing+0.029
 Sandy+0.027
Neg logit contributions
Brown-0.036
Ell-0.033
Twe-0.032
 Brown-0.032
hy-0.032
Token named
Token ID3706
Feature activation-2.51e-01

Pos logit contributions
ours+0.004
upid+0.004
ry+0.004
 Sandy+0.004
 bubble+0.004
Neg logit contributions
Brown-0.006
Ell-0.006
Twe-0.006
 Stretch-0.005
hy-0.005
Token Blue
Token ID4518
Feature activation+3.35e-01

Pos logit contributions
ry+0.010
ours+0.009
upid+0.009
about+0.009
asing+0.009
Neg logit contributions
Brown-0.010
Ell-0.009
Twe-0.008
 Stretch-0.008
 Brown-0.008
Tokeny
Token ID88
Feature activation-1.94e-02

Pos logit contributions
Brown+0.015
Ell+0.014
Twe+0.013
Ar+0.013
 Stretch+0.013
Neg logit contributions
ry-0.010
ours-0.010
upid-0.010
 bubble-0.009
 Sandy-0.009
Token.
Token ID13
Feature activation+3.61e+00

Pos logit contributions
ours+0.001
ry+0.001
upid+0.001
 Sandy+0.001
 bubble+0.001
Neg logit contributions
Brown-0.001
Ell-0.001
Twe-0.001
 Stretch-0.001
hy-0.001
Token Blue
Token ID4518
Feature activation+1.27e+00

Pos logit contributions
ve+0.170
bel+0.146
ind+0.136
cle+0.131
Ar+0.129
Neg logit contributions
 All-0.094
 of-0.092
 cricket-0.092
 goalie-0.090
 bubble-0.089
Tokeny
Token ID88
Feature activation+1.46e+00

Pos logit contributions
Brown+0.060
Ell+0.058
hy+0.055
Twe+0.055
 Stretch+0.055
Neg logit contributions
ry-0.034
upid-0.030
 bubble-0.030
ours-0.029
 foot-0.029
Token was
Token ID373
Feature activation+1.09e+00

Pos logit contributions
Brown+0.062
Ell+0.059
Ar+0.057
Twe+0.056
ve+0.055
Neg logit contributions
ry-0.044
 foot-0.043
 bubble-0.043
 Sandy-0.043
ours-0.041
Token sad
Token ID6507
Feature activation-3.81e-01

Pos logit contributions
Brown+0.048
Ell+0.045
Ar+0.043
Twe+0.042
ve+0.042
Neg logit contributions
 Sandy-0.033
 foot-0.033
 bubble-0.032
ours-0.031
upid-0.031
Token because
Token ID780
Feature activation+1.05e+00

Pos logit contributions
ours+0.014
upid+0.014
ry+0.013
about+0.013
 Sandy+0.013
Neg logit contributions
Brown-0.015
Ell-0.014
 Brown-0.013
 Stretch-0.013
Twe-0.013
Token a
Token ID257
Feature activation+1.82e-01

Pos logit contributions
Brown+0.019
Ell+0.018
Twe+0.017
hy+0.016
 Stretch+0.016
Neg logit contributions
ours-0.011
upid-0.010
ry-0.010
 bubble-0.010
 Sandy-0.010
Token little
Token ID1310
Feature activation+1.35e-01

Pos logit contributions
Brown+0.008
Ell+0.008
Twe+0.007
 Stretch+0.007
Ar+0.007
Neg logit contributions
ours-0.005
ry-0.005
upid-0.005
 bubble-0.005
 Sandy-0.005
Token bird
Token ID6512
Feature activation-6.77e-01

Pos logit contributions
Brown+0.006
Ell+0.006
Twe+0.006
 Stretch+0.006
Ar+0.006
Neg logit contributions
ours-0.004
ry-0.004
upid-0.004
 bubble-0.003
 Sandy-0.003
Token named
Token ID3706
Feature activation+6.63e-01

Pos logit contributions
upid+0.023
ours+0.023
ry+0.021
 Sandy+0.021
about+0.020
Neg logit contributions
Brown-0.029
Ell-0.027
Twe-0.026
hy-0.025
 Stretch-0.024
Token Blue
Token ID4518
Feature activation-1.58e-01

Pos logit contributions
Brown+0.022
Ell+0.020
Ar+0.019
hy+0.018
ve+0.018
Neg logit contributions
upid-0.028
 Sandy-0.028
 bubble-0.027
ours-0.027
ry-0.027
Token.
Token ID13
Feature activation+3.60e+00

Pos logit contributions
ours+0.005
upid+0.005
ry+0.005
 Sandy+0.004
 bubble+0.004
Neg logit contributions
Brown-0.007
Ell-0.007
Twe-0.006
 Stretch-0.006
Ar-0.006
Token Blue
Token ID4518
Feature activation+1.17e+00

Pos logit contributions
ve+0.160
bel+0.141
cle+0.129
Ar+0.126
ind+0.125
Neg logit contributions
 goalie-0.102
 again-0.100
 harder-0.096
 Believe-0.093
 cricket-0.092
Token was
Token ID373
Feature activation+1.24e+00

Pos logit contributions
Brown+0.047
Ell+0.045
hy+0.043
 Stretch+0.043
 Barney+0.042
Neg logit contributions
ry-0.040
upid-0.036
 bubble-0.036
ours-0.036
 foot-0.035
Token very
Token ID845
Feature activation+1.72e+00

Pos logit contributions
Brown+0.054
Ell+0.050
Ar+0.048
Twe+0.046
ve+0.045
Neg logit contributions
 foot-0.039
 Sandy-0.039
 bubble-0.038
upid-0.037
ours-0.037
Token happy
Token ID3772
Feature activation-9.45e-02

Pos logit contributions
Brown+0.080
Ell+0.078
Ar+0.068
 Stretch+0.067
 Barney+0.067
Neg logit contributions
upid-0.051
 foot-0.050
 Sandy-0.050
ours-0.050
 bubble-0.049
Token because
Token ID780
Feature activation+1.04e+00

Pos logit contributions
upid+0.003
ours+0.003
ry+0.003
 Sandy+0.003
 bubble+0.003
Neg logit contributions
Brown-0.004
Ell-0.004
 Stretch-0.004
Twe-0.004
 Brown-0.003
Token wrote
Token ID2630
Feature activation+2.11e+00

Pos logit contributions
Brown+0.059
Ell+0.057
Twe+0.054
 Stretch+0.051
Ar+0.051
Neg logit contributions
ry-0.037
ours-0.036
 bubble-0.033
 Sandy-0.033
 foot-0.033
Token letters
Token ID7475
Feature activation+2.31e+00

Pos logit contributions
Twe+0.108
Ell+0.108
Brown+0.107
Mic+0.098
Ar+0.096
Neg logit contributions
ours-0.050
ry-0.048
ful-0.044
 of-0.043
 how-0.042
Token to
Token ID284
Feature activation+2.32e+00

Pos logit contributions
Ell+0.130
Twe+0.129
Brown+0.128
Mic+0.123
Ar+0.120
Neg logit contributions
ry-0.049
 of-0.042
ours-0.041
ful-0.041
upid-0.038
Token Santa
Token ID8909
Feature activation+2.28e+00

Pos logit contributions
Ell+0.105
Brown+0.099
Twe+0.091
ve+0.089
Ar+0.087
Neg logit contributions
 hand-0.070
 bubble-0.068
 foot-0.068
ours-0.067
 why-0.067
Token and
Token ID290
Feature activation+1.53e+00

Pos logit contributions
Ell+0.058
Brown+0.051
Twe+0.051
Ar+0.049
Mic+0.044
Neg logit contributions
ry-0.124
icking-0.110
ours-0.110
ful-0.107
 twin-0.106
Token put
Token ID1234
Feature activation+3.60e+00

Pos logit contributions
Brown+0.065
Ell+0.065
Twe+0.062
ve+0.059
hy+0.056
Neg logit contributions
ours-0.047
ry-0.046
 bubble-0.046
 hand-0.044
 Sandy-0.044
Token them
Token ID606
Feature activation+2.86e+00

Pos logit contributions
Ell+0.208
Brown+0.197
Twe+0.191
Mic+0.183
hy+0.178
Neg logit contributions
 hand-0.071
ours-0.067
 more-0.064
 how-0.060
 something-0.059
Token in
Token ID287
Feature activation+1.80e+00

Pos logit contributions
Brown+0.161
Twe+0.161
Ell+0.160
Mic+0.150
Ar+0.148
Neg logit contributions
ry-0.052
 hand-0.045
 foot-0.042
ac-0.040
icking-0.040
Token a
Token ID257
Feature activation+1.67e+00

Pos logit contributions
Ell+0.095
Brown+0.094
Twe+0.089
ve+0.087
Mic+0.085
Neg logit contributions
 hand-0.037
 bubble-0.036
 twin-0.033
upid-0.033
ry-0.033
Token big
Token ID1263
Feature activation+8.98e-01

Pos logit contributions
Brown+0.074
Ell+0.071
Twe+0.067
 Stretch+0.064
 Kirst+0.063
Neg logit contributions
 bubble-0.051
ry-0.050
 hand-0.050
 foot-0.049
ours-0.048
Token red
Token ID2266
Feature activation+7.09e-01

Pos logit contributions
Brown+0.044
Ell+0.043
Twe+0.041
Ar+0.039
 Stretch+0.039
Neg logit contributions
ry-0.024
ours-0.024
 bubble-0.023
upid-0.023
 foot-0.022
Token calm
Token ID9480
Feature activation-4.13e-01

Pos logit contributions
Brown+0.010
Ell+0.009
Twe+0.009
 Stretch+0.009
Ar+0.009
Neg logit contributions
ours-0.008
ry-0.008
upid-0.008
 bubble-0.007
 Sandy-0.007
Token little
Token ID1310
Feature activation-3.40e-01

Pos logit contributions
ours+0.014
upid+0.014
ry+0.013
asing+0.012
 Sandy+0.012
Neg logit contributions
Brown-0.018
Ell-0.016
 Stretch-0.015
Twe-0.015
hy-0.015
Token bird
Token ID6512
Feature activation-5.59e-01

Pos logit contributions
upid+0.011
ours+0.011
ry+0.011
asing+0.010
 Sandy+0.010
Neg logit contributions
Brown-0.015
Ell-0.014
 Stretch-0.013
Twe-0.013
 Brown-0.013
Token named
Token ID3706
Feature activation+6.86e-01

Pos logit contributions
upid+0.019
ours+0.019
ry+0.018
 Sandy+0.018
about+0.017
Neg logit contributions
Brown-0.024
Ell-0.022
Twe-0.021
 Stretch-0.020
hy-0.020
Token Blue
Token ID4518
Feature activation+1.80e-02

Pos logit contributions
Brown+0.026
Ell+0.025
Ar+0.023
ve+0.023
Twe+0.023
Neg logit contributions
 Sandy-0.024
upid-0.024
ours-0.024
 bubble-0.024
ry-0.023
Token.
Token ID13
Feature activation+3.60e+00

Pos logit contributions
Brown+0.001
Ell+0.001
Twe+0.001
 Stretch+0.001
Ar+0.001
Neg logit contributions
ours-0.001
ry-0.001
upid-0.001
 Sandy-0.001
 bubble-0.001
Token Blue
Token ID4518
Feature activation+9.95e-01

Pos logit contributions
ve+0.158
bel+0.134
ind+0.130
cle+0.126
grand+0.122
Neg logit contributions
 of-0.104
 goalie-0.100
 All-0.099
 Believe-0.097
 cricket-0.094
Token lived
Token ID5615
Feature activation+2.85e-01

Pos logit contributions
Brown+0.039
Ell+0.038
Twe+0.035
hy+0.035
 Barney+0.035
Neg logit contributions
ry-0.035
ours-0.032
 bubble-0.032
upid-0.032
 foot-0.031
Token in
Token ID287
Feature activation+1.55e+00

Pos logit contributions
Brown+0.011
Ell+0.011
Twe+0.010
hy+0.010
 Stretch+0.010
Neg logit contributions
ours-0.010
ry-0.010
upid-0.010
 Sandy-0.009
 bubble-0.009
Token a
Token ID257
Feature activation+8.93e-02

Pos logit contributions
ve+0.087
Brown+0.085
Ell+0.085
Twe+0.082
hy+0.077
Neg logit contributions
 bubble-0.028
 foot-0.024
upid-0.022
 hand-0.022
 Sandy-0.021
Token tiny
Token ID7009
Feature activation-6.08e-01

Pos logit contributions
Brown+0.003
Ell+0.003
Twe+0.003
 Stretch+0.003
Ar+0.003
Neg logit contributions
ours-0.004
ry-0.004
upid-0.004
 Sandy-0.003
 bubble-0.003
Token a
Token ID257
Feature activation+1.85e-01

Pos logit contributions
Brown+0.041
Ell+0.038
Twe+0.036
hy+0.036
Ar+0.035
Neg logit contributions
 bubble-0.018
ours-0.018
 Sandy-0.018
upid-0.018
ry-0.017
Token little
Token ID1310
Feature activation+1.33e-01

Pos logit contributions
Brown+0.009
Ell+0.008
Twe+0.008
 Stretch+0.008
Ar+0.007
Neg logit contributions
ours-0.005
ry-0.005
upid-0.005
 bubble-0.005
 Sandy-0.005
Token bird
Token ID6512
Feature activation-5.90e-01

Pos logit contributions
Brown+0.006
Ell+0.006
Twe+0.006
 Stretch+0.006
Ar+0.006
Neg logit contributions
ours-0.004
ry-0.004
upid-0.004
 bubble-0.003
 Sandy-0.003
Token named
Token ID3706
Feature activation+7.25e-01

Pos logit contributions
upid+0.020
ours+0.020
ry+0.018
 Sandy+0.018
about+0.018
Neg logit contributions
Brown-0.026
Ell-0.024
Twe-0.023
hy-0.022
 Stretch-0.021
Token Blue
Token ID4518
Feature activation+5.49e-02

Pos logit contributions
Brown+0.025
Ell+0.024
Ar+0.022
hy+0.022
ve+0.022
Neg logit contributions
 Sandy-0.028
upid-0.028
 bubble-0.028
ours-0.028
ry-0.027
Token.
Token ID13
Feature activation+3.59e+00

Pos logit contributions
Brown+0.002
Ell+0.002
Twe+0.002
 Stretch+0.002
Ar+0.002
Neg logit contributions
ours-0.002
ry-0.002
upid-0.002
 Sandy-0.002
 bubble-0.002
Token Blue
Token ID4518
Feature activation+1.27e+00

Pos logit contributions
ve+0.164
bel+0.145
ind+0.134
cle+0.132
grand+0.130
Neg logit contributions
 goalie-0.093
 again-0.092
 harder-0.089
 the-0.087
 cricket-0.087
Token had
Token ID550
Feature activation+1.25e+00

Pos logit contributions
Brown+0.052
Ell+0.050
hy+0.048
Twe+0.047
 Barney+0.047
Neg logit contributions
ry-0.042
 bubble-0.038
ours-0.037
upid-0.037
 foot-0.037
Token a
Token ID257
Feature activation+1.44e+00

Pos logit contributions
Brown+0.065
Ell+0.061
Twe+0.060
ve+0.059
Ar+0.058
Neg logit contributions
 foot-0.026
 bubble-0.025
 Sandy-0.024
upid-0.024
 hand-0.024
Token peaceful
Token ID12309
Feature activation+1.01e+00

Pos logit contributions
Brown+0.060
Ell+0.056
Twe+0.053
hy+0.052
Ar+0.052
Neg logit contributions
 bubble-0.049
 foot-0.047
upid-0.044
 hand-0.043
 twin-0.042
Token life
Token ID1204
Feature activation+1.47e-01

Pos logit contributions
Brown+0.048
Ell+0.046
Twe+0.044
ve+0.042
Ar+0.042
Neg logit contributions
 bubble-0.028
 foot-0.027
ry-0.027
upid-0.026
 hand-0.025
Token gifted
Token ID22527
Feature activation-4.82e-01

Pos logit contributions
Brown+0.000
Ell+0.000
Twe+0.000
 Stretch+0.000
Ar+0.000
Neg logit contributions
ours-0.000
ry-0.000
upid-0.000
 Sandy-0.000
 bubble-0.000
Token little
Token ID1310
Feature activation-1.65e-01

Pos logit contributions
ours+0.016
upid+0.016
ry+0.016
asing+0.014
 Sandy+0.014
Neg logit contributions
Brown-0.021
Ell-0.019
 Stretch-0.018
Twe-0.018
Ar-0.018
Token bird
Token ID6512
Feature activation-7.79e-01

Pos logit contributions
upid+0.005
ours+0.005
ry+0.005
 Sandy+0.004
asing+0.004
Neg logit contributions
Brown-0.007
Ell-0.007
 Stretch-0.007
Twe-0.007
Ar-0.006
Token named
Token ID3706
Feature activation-3.18e-02

Pos logit contributions
upid+0.029
ours+0.029
ry+0.026
 Sandy+0.026
about+0.026
Neg logit contributions
Brown-0.031
Ell-0.029
Twe-0.027
 Brown-0.026
hy-0.026
Token Blue
Token ID4518
Feature activation-7.75e-02

Pos logit contributions
ours+0.001
ry+0.001
upid+0.001
 bubble+0.001
 Sandy+0.001
Neg logit contributions
Brown-0.001
Ell-0.001
Twe-0.001
 Stretch-0.001
Ar-0.001
Token.
Token ID13
Feature activation+3.56e+00

Pos logit contributions
ours+0.002
upid+0.002
ry+0.002
 Sandy+0.002
 bubble+0.002
Neg logit contributions
Brown-0.004
Ell-0.004
Twe-0.003
 Stretch-0.003
Ar-0.003
Token Blue
Token ID4518
Feature activation+7.23e-01

Pos logit contributions
ve+0.152
cle+0.130
ind+0.129
bel+0.125
wl+0.117
Neg logit contributions
 of-0.112
 Believe-0.107
 Keep-0.105
 All-0.102
 Let-0.101
Token loved
Token ID6151
Feature activation+1.07e+00

Pos logit contributions
Brown+0.037
Ell+0.036
Twe+0.034
hy+0.034
 Stretch+0.034
Neg logit contributions
ry-0.017
ours-0.015
upid-0.015
 bubble-0.015
 foot-0.014
Token to
Token ID284
Feature activation+2.90e-01

Pos logit contributions
Brown+0.052
Ell+0.049
Twe+0.048
hy+0.047
ve+0.046
Neg logit contributions
 bubble-0.028
ours-0.027
 Sandy-0.026
upid-0.026
 foot-0.025
Token sing
Token ID1702
Feature activation+6.29e-01

Pos logit contributions
Brown+0.013
Ell+0.012
Twe+0.011
 Stretch+0.011
Ar+0.011
Neg logit contributions
upid-0.009
ours-0.009
ry-0.009
 bubble-0.009
 Sandy-0.008
Token and
Token ID290
Feature activation-9.53e-02

Pos logit contributions
Brown+0.030
Ell+0.029
Twe+0.028
hy+0.027
Ar+0.026
Neg logit contributions
ry-0.016
upid-0.016
 bubble-0.016
ours-0.015
 foot-0.015
Token high
Token ID1029
Feature activation+1.21e-01

Pos logit contributions
ours+0.058
ress+0.057
ry+0.057
upid+0.056
 ankle+0.055
Neg logit contributions
Brown-0.026
Ell-0.023
Twe-0.022
 Stretch-0.022
hy-0.021
Token!
Token ID0
Feature activation-1.99e+00

Pos logit contributions
Brown+0.005
Ell+0.005
Twe+0.005
 Stretch+0.005
Ar+0.005
Neg logit contributions
ours-0.004
ry-0.004
upid-0.004
 Sandy-0.003
 bubble-0.003
Token Can
Token ID1680
Feature activation+6.72e-01

Pos logit contributions
ry+0.092
upid+0.088
ours+0.088
 shallow+0.086
icking+0.086
Neg logit contributions
Brown-0.060
 Stretch-0.054
 Brown-0.054
hy-0.051
Twe-0.051
Token you
Token ID345
Feature activation-2.33e+00

Pos logit contributions
Brown+0.026
Ell+0.025
Ar+0.023
hy+0.022
Twe+0.022
Neg logit contributions
ours-0.024
ry-0.024
upid-0.023
 Sandy-0.023
 bubble-0.023
Token teach
Token ID4545
Feature activation-3.80e-01

Pos logit contributions
ours+0.118
ays+0.108
ress+0.106
ry+0.104
about+0.102
Neg logit contributions
Brown-0.060
Ell-0.059
Twe-0.058
ve-0.052
 Ellie-0.051
Token me
Token ID502
Feature activation-4.02e+00

Pos logit contributions
ry+0.017
ours+0.017
upid+0.017
about+0.016
olt+0.016
Neg logit contributions
Brown-0.012
Ell-0.011
 Stretch-0.010
Twe-0.010
 Brown-0.009
Token?"
Token ID1701
Feature activation+1.41e-01

Pos logit contributions
olt+0.176
 horror+0.173
 pitcher+0.154
idget+0.150
about+0.147
Neg logit contributions
 Christmas-0.109
Brown-0.098
 novels-0.093
 Ellie-0.091
 saf-0.090
Token The
Token ID383
Feature activation-8.87e-01

Pos logit contributions
Brown+0.006
Ell+0.006
Twe+0.005
 Stretch+0.005
Ar+0.005
Neg logit contributions
ours-0.005
ry-0.005
upid-0.005
 Sandy-0.004
 bubble-0.004
Token frog
Token ID21264
Feature activation-6.66e-01

Pos logit contributions
ry+0.029
ours+0.028
upid+0.028
asing+0.026
icking+0.026
Neg logit contributions
Brown-0.039
Ell-0.037
 Stretch-0.036
Ar-0.036
Twe-0.035
Token nodded
Token ID14464
Feature activation+9.55e-02

Pos logit contributions
upid+0.025
ours+0.025
ry+0.024
ress+0.022
 Sandy+0.022
Neg logit contributions
Brown-0.025
Ell-0.024
 Stretch-0.022
Twe-0.022
Ar-0.021
Token and
Token ID290
Feature activation-1.43e+00

Pos logit contributions
Brown+0.004
Ell+0.004
Twe+0.003
 Stretch+0.003
Ar+0.003
Neg logit contributions
ours-0.003
ry-0.003
upid-0.003
 Sandy-0.003
 bubble-0.003
Token asked
Token ID1965
Feature activation+3.24e-02

Pos logit contributions
Brown+0.002
Ell+0.002
Twe+0.002
 Stretch+0.002
Ar+0.002
Neg logit contributions
ours-0.002
ry-0.002
upid-0.002
 Sandy-0.002
 bubble-0.002
Token,
Token ID11
Feature activation-5.56e-01

Pos logit contributions
Brown+0.001
Ell+0.001
Twe+0.001
 Stretch+0.001
Ar+0.001
Neg logit contributions
ours-0.001
ry-0.001
upid-0.001
 bubble-0.001
 Sandy-0.001
Token "
Token ID366
Feature activation-3.04e-01

Pos logit contributions
upid+0.020
ry+0.019
ours+0.019
asing+0.018
olt+0.017
Neg logit contributions
Brown-0.023
Ell-0.021
 Brown-0.020
 Stretch-0.020
Twe-0.020
TokenWhat
Token ID2061
Feature activation-2.02e+00

Pos logit contributions
upid+0.011
ours+0.011
ry+0.011
 Sandy+0.010
 bubble+0.010
Neg logit contributions
Brown-0.012
Ell-0.011
Twe-0.011
Ar-0.010
 Stretch-0.010
Token is
Token ID318
Feature activation-1.25e+00

Pos logit contributions
 shallow+0.083
olt+0.082
 Sandy+0.081
upid+0.081
ry+0.080
Neg logit contributions
Brown-0.061
 Stretch-0.060
 Brown-0.060
 Christmas-0.058
hy-0.057
Token that
Token ID326
Feature activation-3.95e+00

Pos logit contributions
ill+0.050
ress+0.050
ry+0.050
upid+0.049
about+0.048
Neg logit contributions
Brown-0.041
 Brown-0.040
 Stretch-0.039
 Barney-0.038
 Twe-0.037
Token?"
Token ID1701
Feature activation-1.63e-02

Pos logit contributions
ress+0.170
asing+0.170
ill+0.170
omp+0.164
run+0.161
Neg logit contributions
 Brown-0.099
 Twe-0.094
 long-0.091
 Blue-0.089
 Ellie-0.087
Token\n
Token ID198
Feature activation+6.52e-02

Pos logit contributions
ours+0.001
upid+0.001
ry+0.001
 Sandy+0.001
 bubble+0.001
Neg logit contributions
Brown-0.001
Ell-0.001
Twe-0.001
 Stretch-0.001
hy-0.001
Token\n
Token ID198
Feature activation+1.12e-01

Pos logit contributions
Brown+0.003
Ell+0.002
Twe+0.002
 Stretch+0.002
Ar+0.002
Neg logit contributions
ours-0.002
ry-0.002
upid-0.002
 Sandy-0.002
 bubble-0.002
TokenThe
Token ID464
Feature activation+6.92e-02

Pos logit contributions
Brown+0.005
Ell+0.005
Twe+0.005
 Stretch+0.005
Ar+0.005
Neg logit contributions
ours-0.003
ry-0.003
upid-0.003
 Sandy-0.003
 bubble-0.003
Token mom
Token ID1995
Feature activation+1.48e-02

Pos logit contributions
Brown+0.003
Ell+0.003
Twe+0.003
 Stretch+0.003
Ar+0.003
Neg logit contributions
ours-0.002
ry-0.002
upid-0.002
 Sandy-0.002
 bubble-0.002
Token Mom
Token ID11254
Feature activation-3.46e-01

Pos logit contributions
upid+0.020
ours+0.020
ry+0.020
 bubble+0.019
asing+0.018
Neg logit contributions
Brown-0.017
Ell-0.017
 Brown-0.015
 Stretch-0.014
Twe-0.014
Tokenmy
Token ID1820
Feature activation-2.89e-01

Pos logit contributions
upid+0.013
ours+0.012
ry+0.012
 Sandy+0.012
 bubble+0.011
Neg logit contributions
Brown-0.014
Ell-0.013
 Brown-0.012
Twe-0.012
 Stretch-0.011
Token took
Token ID1718
Feature activation+4.18e-01

Pos logit contributions
upid+0.010
ours+0.010
ry+0.010
 Sandy+0.009
 foot+0.009
Neg logit contributions
Brown-0.012
Ell-0.011
Twe-0.010
 Stretch-0.010
 Brown-0.010
Token out
Token ID503
Feature activation-8.90e-01

Pos logit contributions
Brown+0.017
Ell+0.016
Twe+0.015
 Stretch+0.015
Ar+0.015
Neg logit contributions
ours-0.014
upid-0.014
ry-0.014
 Sandy-0.014
 bubble-0.013
Token a
Token ID257
Feature activation-7.82e-01

Pos logit contributions
ry+0.035
upid+0.035
ours+0.034
 ankle+0.032
asing+0.032
Neg logit contributions
Ell-0.031
Brown-0.031
 Brown-0.028
 Kirst-0.026
hy-0.026
Token spray
Token ID11662
Feature activation-3.92e+00

Pos logit contributions
ours+0.028
upid+0.027
ry+0.027
asing+0.027
 Sandy+0.025
Neg logit contributions
Brown-0.031
Ell-0.030
Twe-0.027
Ar-0.027
 Stretch-0.027
Token bottle
Token ID9294
Feature activation-1.69e+00

Pos logit contributions
ress+0.142
upid+0.132
asing+0.132
ours+0.128
omp+0.128
Neg logit contributions
 past-0.119
Ell-0.112
 Brown-0.111
 Ellie-0.110
hy-0.110
Token.
Token ID13
Feature activation-3.25e-01

Pos logit contributions
upid+0.057
ours+0.056
 ankle+0.051
ry+0.051
ress+0.050
Neg logit contributions
Ell-0.064
Brown-0.063
hy-0.060
 Brown-0.059
ve-0.057
Token She
Token ID1375
Feature activation-2.52e+00

Pos logit contributions
ry+0.013
upid+0.013
ours+0.013
asing+0.012
 bubble+0.012
Neg logit contributions
Brown-0.012
Ell-0.011
Twe-0.010
 Brown-0.010
 Stretch-0.010
Token showed
Token ID3751
Feature activation-3.19e-01

Pos logit contributions
upid+0.089
 conclusions+0.082
olt+0.075
 goalie+0.073
iece+0.072
Neg logit contributions
Ell-0.106
Brown-0.104
ve-0.103
 Brown-0.096
Ar-0.096
Token Sarah
Token ID10490
Feature activation-1.68e+00

Pos logit contributions
ry+0.012
ours+0.012
upid+0.011
 shallow+0.011
 bubble+0.011
Neg logit contributions
Brown-0.012
Ell-0.012
 Brown-0.011
Twe-0.011
 Stretch-0.010
Token higher
Token ID2440
Feature activation-1.03e+00

Pos logit contributions
ours+0.040
upid+0.038
ress+0.038
ry+0.037
olt+0.037
Neg logit contributions
Brown-0.030
Ell-0.027
 Stretch-0.026
Ar-0.026
Twe-0.025
Token than
Token ID621
Feature activation-3.07e-01

Pos logit contributions
ours+0.041
upid+0.040
ry+0.038
ress+0.038
olt+0.037
Neg logit contributions
Brown-0.035
Ar-0.032
 Stretch-0.032
Ell-0.031
 Brown-0.030
Token you
Token ID345
Feature activation-9.35e-01

Pos logit contributions
ours+0.014
ry+0.014
upid+0.013
about+0.013
 Sandy+0.013
Neg logit contributions
Brown-0.009
Ell-0.008
Twe-0.008
 Stretch-0.008
Ar-0.008
Token!"
Token ID2474
Feature activation-5.35e-01

Pos logit contributions
ours+0.035
ry+0.033
ress+0.033
 Sandy+0.032
upid+0.032
Neg logit contributions
Brown-0.034
 Stretch-0.031
Ell-0.031
Ar-0.030
ve-0.030
Token The
Token ID383
Feature activation-2.65e+00

Pos logit contributions
upid+0.022
ry+0.022
ours+0.022
asing+0.020
 shallow+0.020
Neg logit contributions
Brown-0.018
Ell-0.017
 Stretch-0.016
 Brown-0.016
Twe-0.016
Token small
Token ID1402
Feature activation-3.91e+00

Pos logit contributions
ress+0.169
ry+0.168
ours+0.163
asing+0.159
ill+0.158
Neg logit contributions
 long-0.043
Ar-0.039
 shiny-0.030
Brown-0.028
 whole-0.026
Token bubble
Token ID14310
Feature activation-1.20e+00

Pos logit contributions
idget+0.143
run+0.142
asing+0.141
ress+0.136
ayed+0.134
Neg logit contributions
 long-0.140
Ar-0.129
 rh-0.120
 shiny-0.116
 whole-0.111
Token said
Token ID531
Feature activation-6.73e-01

Pos logit contributions
ours+0.044
upid+0.044
ry+0.041
ress+0.039
 Sandy+0.039
Neg logit contributions
Brown-0.046
Ell-0.043
Ar-0.042
pe-0.041
 Stretch-0.041
Token,
Token ID11
Feature activation-2.13e-01

Pos logit contributions
ours+0.026
upid+0.026
ry+0.025
 shallow+0.024
 bubble+0.024
Neg logit contributions
Brown-0.024
Ell-0.023
 Stretch-0.022
 Brown-0.021
hy-0.021
Token "
Token ID366
Feature activation+5.18e-01

Pos logit contributions
upid+0.006
ry+0.006
ours+0.006
 Sandy+0.006
 bubble+0.006
Neg logit contributions
Brown-0.010
Ell-0.009
Twe-0.009
 Stretch-0.008
 Brown-0.008
TokenNo
Token ID2949
Feature activation-1.35e+00

Pos logit contributions
Brown+0.021
Ell+0.020
 Brown+0.019
 Barney+0.019
 Stretch+0.019
Neg logit contributions
ry-0.017
upid-0.017
ours-0.017
 bubble-0.016
about-0.016
Token it
Token ID340
Feature activation+6.44e-01

Pos logit contributions
upid+0.027
ry+0.026
ours+0.026
about+0.025
 Sandy+0.025
Neg logit contributions
Brown-0.026
Ell-0.024
Twe-0.023
 Stretch-0.023
 Brown-0.022
Token tastes
Token ID18221
Feature activation-1.12e-01

Pos logit contributions
Brown+0.033
Ell+0.032
Twe+0.030
hy+0.029
Ar+0.029
Neg logit contributions
ours-0.017
ry-0.017
upid-0.016
 bubble-0.015
 Sandy-0.014
Token."
Token ID526
Feature activation-7.87e-01

Pos logit contributions
ours+0.004
ry+0.004
upid+0.004
 Sandy+0.003
 bubble+0.003
Neg logit contributions
Brown-0.005
Ell-0.004
Twe-0.004
 Stretch-0.004
Ar-0.004
Token\n
Token ID198
Feature activation-9.17e-01

Pos logit contributions
upid+0.034
ours+0.033
ry+0.033
 shallow+0.031
asing+0.031
Neg logit contributions
Brown-0.025
Ell-0.024
 Brown-0.022
Twe-0.022
 Stretch-0.021
Token\n
Token ID198
Feature activation-1.09e+00

Pos logit contributions
ours+0.042
ry+0.040
upid+0.040
asing+0.039
about+0.039
Neg logit contributions
Brown-0.028
Ell-0.026
Twe-0.024
Ar-0.023
 Brown-0.023
TokenL
Token ID43
Feature activation-3.80e+00

Pos logit contributions
ours+0.049
upid+0.048
ry+0.047
icking+0.045
 Sandy+0.044
Neg logit contributions
Brown-0.038
Ell-0.035
Twe-0.033
Ar-0.030
Red-0.030
Tokenula
Token ID4712
Feature activation+9.18e-01

Pos logit contributions
 shallow+0.124
ry+0.122
 reel+0.119
 fin+0.117
 net+0.114
Neg logit contributions
ve-0.153
Ell-0.131
Twe-0.126
Red-0.124
Brown-0.117
Token could
Token ID714
Feature activation+1.25e-01

Pos logit contributions
Brown+0.045
Ell+0.043
Ar+0.041
Twe+0.041
 Stretch+0.040
Neg logit contributions
ry-0.025
ours-0.024
upid-0.023
 bubble-0.023
 Sandy-0.022
Token barely
Token ID8523
Feature activation-1.76e-01

Pos logit contributions
Brown+0.005
Ell+0.004
Twe+0.004
 Stretch+0.004
Ar+0.004
Neg logit contributions
ours-0.005
ry-0.005
upid-0.005
 Sandy-0.004
 bubble-0.004
Token contain
Token ID3994
Feature activation-1.49e-01

Pos logit contributions
ours+0.006
ry+0.005
upid+0.005
 Sandy+0.005
 bubble+0.005
Neg logit contributions
Brown-0.008
Ell-0.007
Twe-0.007
 Stretch-0.007
hy-0.007
Token her
Token ID607
Feature activation-2.39e-01

Pos logit contributions
ours+0.007
ry+0.007
upid+0.006
 Sandy+0.006
 bubble+0.006
Neg logit contributions
Brown-0.005
Ell-0.004
Twe-0.004
 Stretch-0.004
 Barney-0.004
Token let
Token ID1309
Feature activation-1.56e+00

Pos logit contributions
Brown+0.002
Ell+0.002
Twe+0.002
 Stretch+0.002
Ar+0.002
Neg logit contributions
ours-0.002
ry-0.002
upid-0.002
 Sandy-0.002
 bubble-0.002
Token's
Token ID338
Feature activation-2.25e+00

Pos logit contributions
ours+0.054
 shallow+0.053
apping+0.053
asing+0.051
upid+0.051
Neg logit contributions
 Brown-0.064
Brown-0.063
 Stretch-0.057
 Barney-0.055
Ell-0.055
Token play
Token ID711
Feature activation-2.46e+00

Pos logit contributions
ours+0.098
ry+0.094
ress+0.093
asing+0.088
upid+0.087
Neg logit contributions
Brown-0.072
 Brown-0.065
 Stretch-0.062
 Christmas-0.058
Ell-0.058
Token it
Token ID340
Feature activation-1.89e-01

Pos logit contributions
upid+0.107
ours+0.105
olt+0.101
icking+0.100
 Sandy+0.098
Neg logit contributions
Brown-0.076
 Stretch-0.070
Yo-0.069
Ell-0.068
Ar-0.068
Token.
Token ID13
Feature activation-9.97e-01

Pos logit contributions
ours+0.006
upid+0.006
ry+0.006
 Sandy+0.006
 bubble+0.006
Neg logit contributions
Brown-0.008
Ell-0.007
Twe-0.007
 Stretch-0.007
Ar-0.007
Token One
Token ID1881
Feature activation-3.75e+00

Pos logit contributions
ry+0.041
ours+0.040
upid+0.040
about+0.038
 shallow+0.038
Neg logit contributions
Brown-0.036
 Stretch-0.032
 Brown-0.031
Ell-0.031
Twe-0.030
Token,
Token ID11
Feature activation-1.80e+00

Pos logit contributions
omp+0.183
 Sandy+0.182
idget+0.179
ill+0.178
ours+0.176
Neg logit contributions
 Christmas-0.078
 long-0.078
 de-0.071
bows-0.071
Brown-0.068
Token two
Token ID734
Feature activation-8.05e-01

Pos logit contributions
upid+0.067
ry+0.064
ours+0.063
olt+0.063
ired+0.060
Neg logit contributions
Brown-0.073
 Brown-0.066
 Stretch-0.064
Ar-0.063
Ell-0.062
Token,
Token ID11
Feature activation-1.96e+00

Pos logit contributions
ours+0.028
upid+0.027
ry+0.027
 Sandy+0.026
ress+0.025
Neg logit contributions
Brown-0.032
Ell-0.029
 Stretch-0.029
hy-0.028
 Brown-0.028
Token three
Token ID1115
Feature activation-7.01e-01

Pos logit contributions
upid+0.072
ry+0.071
ours+0.070
olt+0.070
icking+0.066
Neg logit contributions
Brown-0.078
 Brown-0.071
Ell-0.070
 Stretch-0.069
Ar-0.068
Token,
Token ID11
Feature activation-1.59e+00

Pos logit contributions
upid+0.025
ours+0.025
ry+0.024
 Sandy+0.024
olt+0.023
Neg logit contributions
Brown-0.027
Ell-0.025
 Stretch-0.024
Ar-0.023
hy-0.023
Token said
Token ID531
Feature activation+1.18e-01

Pos logit contributions
upid+0.007
ours+0.007
ry+0.006
 bubble+0.006
 Sandy+0.006
Neg logit contributions
Brown-0.007
Ell-0.006
 Stretch-0.006
Twe-0.006
Ar-0.006
Token,
Token ID11
Feature activation+6.32e-01

Pos logit contributions
Brown+0.005
Ell+0.005
Twe+0.005
 Stretch+0.005
Ar+0.005
Neg logit contributions
ours-0.004
ry-0.004
upid-0.004
 Sandy-0.003
 bubble-0.003
Token "
Token ID366
Feature activation+8.58e-01

Pos logit contributions
Brown+0.033
Ell+0.031
Twe+0.030
Ar+0.029
hy+0.029
Neg logit contributions
ours-0.015
ry-0.015
upid-0.014
 Sandy-0.014
 bubble-0.013
TokenLook
Token ID8567
Feature activation-4.05e-01

Pos logit contributions
Brown+0.028
Ell+0.027
 Stretch+0.026
hy+0.026
ve+0.026
Neg logit contributions
about-0.032
ry-0.032
 bubble-0.032
upid-0.031
ours-0.030
Token at
Token ID379
Feature activation-7.97e-01

Pos logit contributions
ours+0.013
upid+0.013
ry+0.013
 bubble+0.012
 shallow+0.012
Neg logit contributions
Brown-0.017
Ell-0.016
 Stretch-0.015
 Brown-0.015
Twe-0.015
Token that
Token ID326
Feature activation-3.75e+00

Pos logit contributions
ry+0.030
ours+0.030
upid+0.029
olt+0.028
about+0.028
Neg logit contributions
Brown-0.030
 Brown-0.027
 Stretch-0.027
 Barney-0.026
Ell-0.026
Token blue
Token ID4171
Feature activation-2.94e+00

Pos logit contributions
asing+0.151
icking+0.150
idget+0.150
ours+0.149
ion+0.143
Neg logit contributions
 long-0.106
 saf-0.102
 Brown-0.094
 shiny-0.089
 whole-0.085
Token circle
Token ID9197
Feature activation-5.63e-01

Pos logit contributions
 ankle+0.104
icking+0.104
idget+0.103
arre+0.101
asing+0.099
Neg logit contributions
 saf-0.085
 Christmas-0.083
 Brown-0.081
hy-0.079
Ar-0.079
Token!"
Token ID2474
Feature activation-2.00e-02

Pos logit contributions
ours+0.019
upid+0.019
ry+0.019
 Sandy+0.018
 bubble+0.017
Neg logit contributions
Brown-0.022
Ell-0.021
hy-0.020
 Stretch-0.019
Twe-0.019
Token His
Token ID2399
Feature activation-1.52e+00

Pos logit contributions
ours+0.001
ry+0.001
upid+0.001
 Sandy+0.001
 bubble+0.001
Neg logit contributions
Brown-0.001
Ell-0.001
Twe-0.001
 Stretch-0.001
Ar-0.001
Token mom
Token ID1995
Feature activation+1.63e-02

Pos logit contributions
ours+0.058
ry+0.057
ion+0.056
upid+0.054
olt+0.053
Neg logit contributions
Brown-0.051
 Brown-0.049
Ar-0.048
 Stretch-0.047
 saf-0.046
Token fun
Token ID1257
Feature activation+4.51e-01

Pos logit contributions
ours+0.035
ry+0.034
upid+0.033
ress+0.033
 ankle+0.032
Neg logit contributions
Brown-0.020
Ell-0.018
 Stretch-0.017
Ar-0.017
Twe-0.017
Token!
Token ID0
Feature activation-1.51e+00

Pos logit contributions
Brown+0.021
Ell+0.020
Twe+0.019
Ar+0.018
 Stretch+0.018
Neg logit contributions
ry-0.014
ours-0.013
upid-0.013
 Sandy-0.013
 bubble-0.013
Token Can
Token ID1680
Feature activation+7.90e-01

Pos logit contributions
ry+0.061
upid+0.061
ours+0.060
 shallow+0.059
 bubble+0.058
Neg logit contributions
Brown-0.055
 Stretch-0.049
 Brown-0.049
Ell-0.048
 Barney-0.046
Token you
Token ID345
Feature activation-1.57e+00

Pos logit contributions
Brown+0.035
Ell+0.034
Ar+0.032
hy+0.031
Twe+0.031
Neg logit contributions
ry-0.023
ours-0.023
 Sandy-0.022
upid-0.022
 bubble-0.022
Token teach
Token ID4545
Feature activation+3.23e-01

Pos logit contributions
ours+0.069
ry+0.063
ress+0.059
ays+0.058
upid+0.058
Neg logit contributions
Brown-0.055
Ell-0.051
Twe-0.047
 Stretch-0.047
 Brown-0.046
Token me
Token ID502
Feature activation-3.75e+00

Pos logit contributions
Brown+0.012
Ell+0.011
Twe+0.010
hy+0.010
Ar+0.010
Neg logit contributions
ours-0.013
ry-0.013
upid-0.012
 Sandy-0.012
 bubble-0.012
Token how
Token ID703
Feature activation-1.82e+00

Pos logit contributions
olt+0.151
 horror+0.144
ours+0.138
 anth+0.137
 pitcher+0.136
Neg logit contributions
Brown-0.116
 Christmas-0.109
 Brown-0.100
 Stretch-0.098
 novels-0.097
Token to
Token ID284
Feature activation-1.43e+00

Pos logit contributions
icking+0.088
about+0.087
ours+0.087
olt+0.087
ry+0.086
Neg logit contributions
Brown-0.044
 Stretch-0.036
Ell-0.036
 Brown-0.036
Ar-0.035
Token make
Token ID787
Feature activation-1.73e+00

Pos logit contributions
ours+0.058
ry+0.053
 Sandy+0.052
ress+0.051
upid+0.049
Neg logit contributions
Brown-0.055
Ell-0.049
Ar-0.047
 Stretch-0.047
pe-0.046
Token music
Token ID2647
Feature activation-1.90e+00

Pos logit contributions
ry+0.071
ours+0.069
upid+0.067
olt+0.067
ill+0.064
Neg logit contributions
Brown-0.058
 Brown-0.055
 Stretch-0.055
 Barney-0.052
Ell-0.050
Token with
Token ID351
Feature activation-1.07e+00

Pos logit contributions
upid+0.072
ours+0.071
ry+0.068
 Sandy+0.067
olt+0.066
Neg logit contributions
Brown-0.069
Ell-0.062
 Brown-0.060
 Stretch-0.060
Ar-0.058
Tokenmy
Token ID1820
Feature activation+1.26e-01

Pos logit contributions
upid+0.018
ours+0.018
ry+0.018
 bubble+0.017
about+0.017
Neg logit contributions
Brown-0.020
Ell-0.019
 Brown-0.018
Twe-0.017
 Stretch-0.017
Token,
Token ID11
Feature activation-1.41e+00

Pos logit contributions
Brown+0.006
Ell+0.005
Twe+0.005
 Stretch+0.005
Ar+0.005
Neg logit contributions
ours-0.004
ry-0.004
upid-0.004
 Sandy-0.004
 bubble-0.003
Token can
Token ID460
Feature activation+7.05e-01

Pos logit contributions
upid+0.049
ours+0.046
ry+0.046
about+0.046
apping+0.046
Neg logit contributions
Brown-0.056
 Brown-0.054
 Stretch-0.049
 Christmas-0.049
 Barney-0.049
Token you
Token ID345
Feature activation-1.54e+00

Pos logit contributions
Brown+0.029
Ell+0.028
Ar+0.027
hy+0.025
Twe+0.025
Neg logit contributions
ry-0.024
ours-0.024
 bubble-0.023
upid-0.023
 foot-0.022
Token tell
Token ID1560
Feature activation-2.29e-01

Pos logit contributions
ours+0.073
ry+0.070
upid+0.068
 Sandy+0.068
about+0.068
Neg logit contributions
Brown-0.046
Ell-0.040
Twe-0.037
 Brown-0.036
ve-0.034
Token me
Token ID502
Feature activation-3.73e+00

Pos logit contributions
ry+0.009
ours+0.009
upid+0.009
about+0.009
 Sandy+0.009
Neg logit contributions
Brown-0.008
Ell-0.007
Twe-0.007
 Stretch-0.007
 Brown-0.006
Token a
Token ID257
Feature activation-9.17e-01

Pos logit contributions
udge+0.153
olt+0.146
omp+0.146
about+0.145
 goalie+0.142
Neg logit contributions
 Christmas-0.112
Brown-0.103
 Brown-0.101
 Ellie-0.095
 novels-0.086
Token story
Token ID1621
Feature activation-5.84e-01

Pos logit contributions
ours+0.035
ry+0.035
upid+0.034
asing+0.034
ays+0.032
Neg logit contributions
Brown-0.035
Ell-0.033
Twe-0.030
Ar-0.030
 Brown-0.029
Token?"
Token ID1701
Feature activation+5.72e-01

Pos logit contributions
upid+0.016
ours+0.016
ry+0.016
 Sandy+0.015
 bubble+0.015
Neg logit contributions
Brown-0.027
Ell-0.026
 Stretch-0.024
 Brown-0.024
hy-0.024
Token His
Token ID2399
Feature activation-6.57e-02

Pos logit contributions
Brown+0.019
Ell+0.017
Ar+0.017
 Stretch+0.016
Twe+0.016
Neg logit contributions
ours-0.024
ry-0.024
 Sandy-0.023
upid-0.023
 foot-0.023
Token mom
Token ID1995
Feature activation+2.52e-01

Pos logit contributions
ry+0.002
ours+0.002
upid+0.002
 Sandy+0.002
 bubble+0.002
Neg logit contributions
Brown-0.003
Ell-0.003
Twe-0.003
 Stretch-0.003
Ar-0.003
Token They
Token ID1119
Feature activation+3.57e-01

Pos logit contributions
ry+0.021
upid+0.021
ours+0.021
ress+0.020
asing+0.020
Neg logit contributions
Brown-0.013
Ell-0.012
Ar-0.011
 Stretch-0.011
Twe-0.011
Token swim
Token ID9422
Feature activation-6.76e-01

Pos logit contributions
Brown+0.018
Ell+0.017
Twe+0.016
 Stretch+0.016
Ar+0.016
Neg logit contributions
upid-0.009
ours-0.009
ry-0.009
 bubble-0.008
 Sandy-0.008
Token back
Token ID736
Feature activation-1.81e-01

Pos logit contributions
upid+0.026
ours+0.025
ry+0.024
 Sandy+0.024
 bubble+0.024
Neg logit contributions
Brown-0.024
Ar-0.022
Ell-0.022
 Stretch-0.021
 Brown-0.021
Token to
Token ID284
Feature activation-1.04e+00

Pos logit contributions
upid+0.007
ours+0.007
ry+0.006
 Sandy+0.006
 bubble+0.006
Neg logit contributions
Brown-0.007
Ell-0.007
 Stretch-0.006
 Brown-0.006
Twe-0.006
Token the
Token ID262
Feature activation-7.45e-01

Pos logit contributions
ry+0.043
ours+0.042
asing+0.041
upid+0.041
about+0.040
Neg logit contributions
 Brown-0.036
Brown-0.035
 Stretch-0.033
Ell-0.032
 Christmas-0.032
Token shallow
Token ID19337
Feature activation-3.73e+00

Pos logit contributions
ours+0.025
ry+0.025
upid+0.025
asing+0.024
 Sandy+0.023
Neg logit contributions
Brown-0.031
Ell-0.028
Ar-0.028
Twe-0.027
 Stretch-0.027
Token end
Token ID886
Feature activation-1.67e+00

Pos logit contributions
asing+0.172
ours+0.152
run+0.146
 ankle+0.145
okes+0.143
Neg logit contributions
Ar-0.121
 saf-0.108
Yo-0.104
thank-0.102
pe-0.099
Token of
Token ID286
Feature activation-1.04e+00

Pos logit contributions
upid+0.069
 ankle+0.065
ours+0.061
about+0.061
asing+0.060
Neg logit contributions
Brown-0.049
Ell-0.046
Ar-0.044
 Brown-0.044
pe-0.044
Token the
Token ID262
Feature activation-1.11e+00

Pos logit contributions
ry+0.041
asing+0.039
upid+0.039
ours+0.039
about+0.037
Neg logit contributions
Brown-0.038
 Brown-0.037
Ell-0.036
 Stretch-0.035
Ar-0.035
Token pool
Token ID5933
Feature activation-1.55e+00

Pos logit contributions
ours+0.041
ry+0.041
asing+0.041
upid+0.040
ress+0.040
Neg logit contributions
Brown-0.041
Ar-0.041
Ell-0.037
 saf-0.036
 Stretch-0.036
Token,
Token ID11
Feature activation-9.88e-01

Pos logit contributions
upid+0.070
ours+0.067
 ankle+0.065
asing+0.064
 Sandy+0.063
Neg logit contributions
Brown-0.038
pe-0.038
hy-0.037
Ar-0.036
 Brown-0.036
Token the
Token ID262
Feature activation-1.32e+00

Pos logit contributions
ours+0.007
upid+0.007
ry+0.007
 bubble+0.006
about+0.006
Neg logit contributions
Brown-0.009
Ell-0.008
 Stretch-0.008
 Brown-0.008
Twe-0.007
Token bald
Token ID28273
Feature activation-4.78e-01

Pos logit contributions
ry+0.062
ours+0.061
upid+0.061
ress+0.055
 Sandy+0.055
Neg logit contributions
Brown-0.039
 Stretch-0.036
 Brown-0.035
Ar-0.035
Ell-0.034
Token player
Token ID2137
Feature activation-2.42e+00

Pos logit contributions
ours+0.019
upid+0.019
ry+0.019
 Sandy+0.018
 shallow+0.018
Neg logit contributions
Brown-0.017
Ell-0.015
hy-0.014
 Stretch-0.014
Ar-0.014
Token and
Token ID290
Feature activation-1.27e+00

Pos logit contributions
upid+0.103
ours+0.102
 conclusions+0.095
 shallow+0.093
ry+0.092
Neg logit contributions
Brown-0.075
Ell-0.074
ve-0.068
 Stretch-0.067
Ar-0.066
Token the
Token ID262
Feature activation-1.67e+00

Pos logit contributions
ours+0.051
upid+0.049
ry+0.048
icking+0.044
ays+0.043
Neg logit contributions
Ell-0.050
Brown-0.049
 Stretch-0.045
 Brown-0.044
 Ellie-0.043
Token girl
Token ID2576
Feature activation-3.73e+00

Pos logit contributions
ry+0.082
ours+0.080
upid+0.078
ays+0.070
ress+0.069
Neg logit contributions
Brown-0.050
Ar-0.046
 Stretch-0.046
 Brown-0.046
Ell-0.044
Token skipped
Token ID26684
Feature activation-1.76e-01

Pos logit contributions
ours+0.195
upid+0.192
 conclusions+0.192
ainer+0.185
udge+0.182
Neg logit contributions
Ell-0.075
ve-0.068
 Kirst-0.063
hy-0.061
Brown-0.057
Token up
Token ID510
Feature activation-6.57e-01

Pos logit contributions
ours+0.006
ry+0.006
upid+0.006
 bubble+0.005
asing+0.005
Neg logit contributions
Brown-0.007
Ell-0.007
 Stretch-0.006
Ar-0.006
Twe-0.006
Token and
Token ID290
Feature activation-1.67e+00

Pos logit contributions
upid+0.023
ours+0.023
ry+0.022
asing+0.022
ress+0.021
Neg logit contributions
Ell-0.025
Brown-0.025
Ar-0.023
 Stretch-0.023
 Brown-0.023
Token down
Token ID866
Feature activation-1.18e+00

Pos logit contributions
upid+0.073
ours+0.073
ry+0.068
ava+0.067
ired+0.067
Neg logit contributions
Ell-0.054
Brown-0.050
 Brown-0.048
 Kirst-0.046
 Stretch-0.046
Token the
Token ID262
Feature activation-7.54e-01

Pos logit contributions
ours+0.044
upid+0.044
asing+0.042
ry+0.041
 ankle+0.041
Neg logit contributions
Ar-0.039
Ell-0.039
 Stretch-0.038
hy-0.038
Brown-0.037
Token ball
Token ID2613
Feature activation-1.19e+00

Pos logit contributions
ry+0.095
ices+0.088
asing+0.087
ours+0.086
ress+0.084
Neg logit contributions
 Ellie-0.061
 Blue-0.060
 Brown-0.059
 Twe-0.059
 Stretch-0.056
Token!
Token ID0
Feature activation-1.14e+00

Pos logit contributions
ours+0.050
upid+0.049
 ankle+0.046
asing+0.046
ry+0.045
Neg logit contributions
Brown-0.038
Ell-0.037
Twe-0.036
hy-0.034
 Stretch-0.034
Token Can
Token ID1680
Feature activation+1.00e+00

Pos logit contributions
upid+0.045
ry+0.045
ours+0.044
about+0.043
 ankle+0.043
Neg logit contributions
Brown-0.040
Ell-0.037
 Stretch-0.036
hy-0.036
Twe-0.036
Token you
Token ID345
Feature activation-5.72e-01

Pos logit contributions
Brown+0.041
Ell+0.039
Ar+0.038
hy+0.035
ve+0.035
Neg logit contributions
ours-0.033
ry-0.033
upid-0.031
 bubble-0.031
 Sandy-0.031
Token teach
Token ID4545
Feature activation-6.23e-01

Pos logit contributions
ours+0.024
ry+0.023
upid+0.022
about+0.021
 Sandy+0.021
Neg logit contributions
Brown-0.020
Ell-0.019
Twe-0.018
hy-0.017
 Stretch-0.017
Token me
Token ID502
Feature activation-3.69e+00

Pos logit contributions
ry+0.030
ours+0.030
upid+0.029
olt+0.028
about+0.028
Neg logit contributions
Brown-0.017
Ell-0.016
 Stretch-0.015
 Brown-0.014
Twe-0.014
Token how
Token ID703
Feature activation-1.61e+00

Pos logit contributions
olt+0.153
 horror+0.146
idget+0.139
about+0.138
 Sandy+0.137
Neg logit contributions
 Christmas-0.102
Brown-0.097
 Ellie-0.095
 novels-0.093
 Blue-0.092
Token to
Token ID284
Feature activation-5.66e-02

Pos logit contributions
ry+0.077
about+0.076
ours+0.076
ill+0.075
icking+0.075
Neg logit contributions
Brown-0.041
Ell-0.037
Twe-0.034
hy-0.034
 Stretch-0.032
Token fly
Token ID6129
Feature activation-1.41e+00

Pos logit contributions
ours+0.002
ry+0.002
upid+0.002
 Sandy+0.002
 bubble+0.002
Neg logit contributions
Brown-0.002
Ell-0.002
Twe-0.002
 Stretch-0.002
Ar-0.002
Token?"
Token ID1701
Feature activation+1.11e-01

Pos logit contributions
ours+0.056
ry+0.053
ress+0.053
olt+0.052
upid+0.051
Neg logit contributions
Brown-0.048
Ell-0.045
Twe-0.043
 Brown-0.043
 Stretch-0.042
Token The
Token ID383
Feature activation-3.93e-01

Pos logit contributions
Brown+0.005
Ell+0.004
Twe+0.004
 Stretch+0.004
Ar+0.004
Neg logit contributions
ours-0.004
ry-0.004
upid-0.004
 Sandy-0.003
 bubble-0.003
Token dancer
Token ID38619
Feature activation-6.02e-01

Pos logit contributions
Brown+0.000
Ell+0.000
Twe+0.000
 Stretch+0.000
Ar+0.000
Neg logit contributions
ours-0.000
ry-0.000
upid-0.000
 Sandy-0.000
 bubble-0.000
Token!
Token ID0
Feature activation-1.47e+00

Pos logit contributions
ours+0.021
upid+0.021
ry+0.021
 Sandy+0.019
 bubble+0.019
Neg logit contributions
Brown-0.024
Ell-0.022
 Stretch-0.021
hy-0.021
 Brown-0.021
Token Can
Token ID1680
Feature activation+6.81e-01

Pos logit contributions
ry+0.065
ours+0.063
upid+0.063
 shallow+0.062
 bubble+0.061
Neg logit contributions
Brown-0.047
Ell-0.043
 Stretch-0.042
 Brown-0.041
hy-0.041
Token you
Token ID345
Feature activation-1.79e+00

Pos logit contributions
Brown+0.029
Ell+0.027
Ar+0.026
hy+0.025
 Stretch+0.025
Neg logit contributions
ours-0.022
ry-0.022
upid-0.021
 Sandy-0.021
 bubble-0.020
Token teach
Token ID4545
Feature activation+1.31e-01

Pos logit contributions
ours+0.080
ry+0.070
ays+0.070
ress+0.067
about+0.066
Neg logit contributions
Brown-0.061
Ell-0.059
 Stretch-0.054
Twe-0.053
Ar-0.053
Token me
Token ID502
Feature activation-3.66e+00

Pos logit contributions
Brown+0.005
Ell+0.004
Twe+0.004
 Stretch+0.004
Ar+0.004
Neg logit contributions
ours-0.005
ry-0.005
upid-0.005
 Sandy-0.005
 bubble-0.005
Token?"
Token ID1701
Feature activation+1.34e+00

Pos logit contributions
olt+0.139
 horror+0.136
ours+0.125
inter+0.125
 anth+0.122
Neg logit contributions
Brown-0.113
 Christmas-0.111
 novels-0.102
Ell-0.101
 Stretch-0.101
Token Lucy
Token ID22162
Feature activation-5.93e-01

Pos logit contributions
Brown+0.065
Ell+0.060
Ar+0.060
ve+0.058
Twe+0.057
Neg logit contributions
 Sandy-0.037
ours-0.034
about-0.032
ry-0.032
ress-0.031
Token smiled
Token ID13541
Feature activation-2.13e-01

Pos logit contributions
upid+0.021
ours+0.021
ry+0.020
 shallow+0.018
asing+0.018
Neg logit contributions
Brown-0.023
Ell-0.023
 Stretch-0.021
Twe-0.021
 Brown-0.020
Token and
Token ID290
Feature activation-9.18e-01

Pos logit contributions
ours+0.007
upid+0.007
ry+0.007
 bubble+0.006
 Sandy+0.006
Neg logit contributions
Brown-0.009
Ell-0.009
 Stretch-0.008
Twe-0.008
 Brown-0.008
Token said
Token ID531
Feature activation-4.65e-01

Pos logit contributions
upid+0.041
ry+0.039
ours+0.039
 conclusions+0.036
 foot+0.036
Neg logit contributions
Brown-0.028
Ell-0.028
 Brown-0.026
 Stretch-0.026
 Barney-0.026
Token he
Token ID339
Feature activation-4.11e-01

Pos logit contributions
Brown+0.024
Ell+0.022
Ar+0.020
Twe+0.020
hy+0.020
Neg logit contributions
ry-0.026
ours-0.026
 Sandy-0.026
 foot-0.026
 bubble-0.025
Token found
Token ID1043
Feature activation-5.20e-01

Pos logit contributions
upid+0.017
ours+0.017
ry+0.016
asing+0.015
 bubble+0.015
Neg logit contributions
Brown-0.014
Ell-0.014
Twe-0.012
Ar-0.012
 Brown-0.012
Token a
Token ID257
Feature activation-2.71e-01

Pos logit contributions
ry+0.020
ours+0.019
upid+0.019
about+0.018
apping+0.017
Neg logit contributions
Brown-0.020
Ell-0.019
 Brown-0.018
 Stretch-0.018
 Barney-0.018
Token can
Token ID460
Feature activation-1.05e+00

Pos logit contributions
ours+0.011
ry+0.011
upid+0.011
 Sandy+0.011
asing+0.010
Neg logit contributions
Brown-0.009
Ell-0.008
Twe-0.008
 Stretch-0.008
Ar-0.008
Token of
Token ID286
Feature activation-5.98e-01

Pos logit contributions
ours+0.025
upid+0.024
ry+0.021
about+0.020
 shallow+0.020
Neg logit contributions
Brown-0.054
Ell-0.052
hy-0.051
Twe-0.050
 Brown-0.048
Token spray
Token ID11662
Feature activation-3.63e+00

Pos logit contributions
ry+0.025
ours+0.024
upid+0.023
ress+0.022
asing+0.022
Neg logit contributions
Brown-0.021
Ell-0.020
 Stretch-0.019
 Brown-0.018
Twe-0.018
Token paint
Token ID7521
Feature activation-1.50e+00

Pos logit contributions
ress+0.145
ry+0.132
asing+0.131
upid+0.130
ours+0.129
Neg logit contributions
 Twe-0.102
 past-0.099
 Ellie-0.098
hy-0.097
 Brown-0.096
Token in
Token ID287
Feature activation-9.38e-01

Pos logit contributions
upid+0.062
ours+0.060
ry+0.058
ress+0.055
 ankle+0.055
Neg logit contributions
Brown-0.050
Ell-0.048
hy-0.046
 Brown-0.044
Twe-0.043
Token the
Token ID262
Feature activation-1.50e+00

Pos logit contributions
ours+0.035
ry+0.034
upid+0.033
asing+0.032
apping+0.032
Neg logit contributions
Brown-0.038
 Brown-0.035
Ell-0.034
Ar-0.033
Twe-0.033
Token woods
Token ID16479
Feature activation-1.05e+00

Pos logit contributions
ry+0.059
ours+0.059
upid+0.059
ress+0.058
asing+0.056
Neg logit contributions
Brown-0.054
Ell-0.054
Twe-0.051
Red-0.046
Yo-0.045
Token.
Token ID13
Feature activation-1.55e-01

Pos logit contributions
ours+0.045
upid+0.044
ry+0.042
 Sandy+0.041
 bubble+0.040
Neg logit contributions
Brown-0.033
hy-0.031
Ell-0.031
pe-0.029
 Brown-0.029
Token can
Token ID460
Feature activation+8.82e-01

Pos logit contributions
ours+0.003
ry+0.003
upid+0.003
 Sandy+0.002
 bubble+0.002
Neg logit contributions
Brown-0.004
Ell-0.004
Twe-0.004
 Stretch-0.004
hy-0.004
Token have
Token ID423
Feature activation+9.43e-01

Pos logit contributions
Brown+0.038
Ell+0.037
Ar+0.034
Twe+0.033
 Stretch+0.033
Neg logit contributions
upid-0.028
ry-0.027
 bubble-0.026
 foot-0.026
 Sandy-0.026
Token the
Token ID262
Feature activation+1.16e+00

Pos logit contributions
Brown+0.046
Ell+0.043
Twe+0.041
hy+0.041
Ar+0.040
Neg logit contributions
upid-0.024
ours-0.024
 Sandy-0.024
 bubble-0.024
ry-0.023
Token steak
Token ID26320
Feature activation+8.28e-01

Pos logit contributions
Ell+0.049
Brown+0.048
Twe+0.045
 Kirst+0.042
hy+0.040
Neg logit contributions
 foot-0.040
 bubble-0.040
upid-0.039
ours-0.039
 hand-0.039
Token,
Token ID11
Feature activation-1.64e+00

Pos logit contributions
Brown+0.038
Ell+0.036
Twe+0.034
Ar+0.033
 Stretch+0.033
Neg logit contributions
ry-0.025
ours-0.024
upid-0.024
 Sandy-0.023
 foot-0.023
Token L
Token ID406
Feature activation-3.62e+00

Pos logit contributions
ours+0.060
upid+0.058
ry+0.057
about+0.054
ays+0.053
Neg logit contributions
Brown-0.064
 Brown-0.058
Ell-0.057
Twe-0.057
hy-0.055
Tokenula
Token ID4712
Feature activation+7.91e-01

Pos logit contributions
 shallow+0.118
 reel+0.110
 amaz+0.110
 goalie+0.109
 conclusions+0.109
Neg logit contributions
ve-0.162
Twe-0.131
Ell-0.125
Brown-0.116
Red-0.113
Token".
Token ID1911
Feature activation-5.61e-01

Pos logit contributions
Brown+0.037
Ell+0.036
Twe+0.033
Ar+0.032
 Stretch+0.032
Neg logit contributions
ry-0.024
ours-0.023
upid-0.023
 Sandy-0.021
 bubble-0.021
Token She
Token ID1375
Feature activation-1.90e-02

Pos logit contributions
upid+0.019
ours+0.019
ry+0.018
 bubble+0.017
asing+0.017
Neg logit contributions
Brown-0.023
Ell-0.022
Twe-0.020
hy-0.020
 Brown-0.020
Token turned
Token ID2900
Feature activation-2.12e-01

Pos logit contributions
ours+0.001
upid+0.001
ry+0.001
 Sandy+0.001
 bubble+0.001
Neg logit contributions
Brown-0.001
Ell-0.001
Twe-0.001
 Stretch-0.001
Ar-0.001
Token around
Token ID1088
Feature activation+3.54e-01

Pos logit contributions
ours+0.007
ry+0.007
upid+0.007
 Sandy+0.006
 bubble+0.006
Neg logit contributions
Brown-0.009
Ell-0.008
 Stretch-0.008
hy-0.008
Twe-0.008
Token like
Token ID588
Feature activation-1.11e+00

Pos logit contributions
ours+0.020
ry+0.019
upid+0.019
ress+0.018
 Sandy+0.018
Neg logit contributions
Brown-0.023
Ell-0.021
Ar-0.021
Twe-0.020
ve-0.020
Token to
Token ID284
Feature activation-6.24e-01

Pos logit contributions
ry+0.045
upid+0.041
olt+0.040
ours+0.039
asing+0.039
Neg logit contributions
Brown-0.040
 Brown-0.037
 Stretch-0.036
 Christmas-0.035
Ell-0.035
Token stay
Token ID2652
Feature activation-4.76e-01

Pos logit contributions
ours+0.021
ry+0.021
upid+0.019
asing+0.019
ress+0.018
Neg logit contributions
Brown-0.026
Ell-0.024
Twe-0.024
hy-0.024
Ar-0.023
Token in
Token ID287
Feature activation-1.69e+00

Pos logit contributions
ours+0.019
upid+0.018
ry+0.018
about+0.017
ress+0.017
Neg logit contributions
Brown-0.017
Ell-0.016
Ar-0.015
Twe-0.015
 Stretch-0.015
Token the
Token ID262
Feature activation-1.32e+00

Pos logit contributions
ry+0.073
ours+0.071
asing+0.069
icking+0.067
apping+0.066
Neg logit contributions
Brown-0.054
 Brown-0.052
 Christmas-0.051
 Stretch-0.050
Ar-0.049
Token shallow
Token ID19337
Feature activation-3.61e+00

Pos logit contributions
asing+0.054
ry+0.054
ress+0.051
ful+0.050
inda+0.049
Neg logit contributions
Ar-0.043
Brown-0.043
Yo-0.043
 saf-0.042
hy-0.042
Token end
Token ID886
Feature activation-1.69e+00

Pos logit contributions
asing+0.163
ours+0.154
 ankle+0.143
isted+0.141
icking+0.137
Neg logit contributions
Ar-0.110
 saf-0.097
Yo-0.094
pe-0.087
 road-0.085
Token."
Token ID526
Feature activation+3.57e-01

Pos logit contributions
upid+0.071
 ankle+0.070
ours+0.068
asing+0.067
icking+0.065
Neg logit contributions
Brown-0.049
Ell-0.045
Ar-0.043
Twe-0.042
 Brown-0.041
Token
Token ID198
Feature activation-5.63e-02

Pos logit contributions
Brown+0.015
Ell+0.015
Twe+0.014
hy+0.013
Ar+0.013
Neg logit contributions
ours-0.012
ry-0.012
upid-0.011
 Sandy-0.011
 bubble-0.011
Token
Token ID198
Feature activation+1.29e-02

Pos logit contributions
ours+0.002
ry+0.002
upid+0.002
 Sandy+0.002
 bubble+0.002
Neg logit contributions
Brown-0.002
Ell-0.002
Twe-0.002
 Stretch-0.002
Ar-0.002
TokenBen
Token ID11696
Feature activation-1.53e+00

Pos logit contributions
Brown+0.001
Ell+0.001
Twe+0.001
 Stretch+0.000
Ar+0.000
Neg logit contributions
ours-0.000
ry-0.000
upid-0.000
 Sandy-0.000
 bubble-0.000
Token.
Token ID13
Feature activation-3.25e-01

Pos logit contributions
upid+0.057
ours+0.056
 ankle+0.051
ry+0.051
ress+0.050
Neg logit contributions
Ell-0.064
Brown-0.063
hy-0.060
 Brown-0.059
ve-0.057
Token She
Token ID1375
Feature activation-2.52e+00

Pos logit contributions
ry+0.013
upid+0.013
ours+0.013
asing+0.012
 bubble+0.012
Neg logit contributions
Brown-0.012
Ell-0.011
Twe-0.010
 Brown-0.010
 Stretch-0.010
Token showed
Token ID3751
Feature activation-3.19e-01

Pos logit contributions
upid+0.089
 conclusions+0.082
olt+0.075
 goalie+0.073
iece+0.072
Neg logit contributions
Ell-0.106
Brown-0.104
ve-0.103
 Brown-0.096
Ar-0.096
Token Sarah
Token ID10490
Feature activation-1.68e+00

Pos logit contributions
ry+0.012
ours+0.012
upid+0.011
 shallow+0.011
 bubble+0.011
Neg logit contributions
Brown-0.012
Ell-0.012
 Brown-0.011
Twe-0.011
 Stretch-0.010
Token how
Token ID703
Feature activation-1.97e+00

Pos logit contributions
upid+0.054
olt+0.052
ry+0.048
ours+0.048
 goalie+0.048
Neg logit contributions
Brown-0.072
 Brown-0.068
Ell-0.067
 Kirst-0.063
 Stretch-0.062
Token to
Token ID284
Feature activation-3.61e+00

Pos logit contributions
about+0.101
ress+0.099
 goalie+0.097
upid+0.096
ry+0.095
Neg logit contributions
Ell-0.037
ve-0.037
Brown-0.035
Twe-0.033
 long-0.033
Token spray
Token ID11662
Feature activation-3.10e+00

Pos logit contributions
ress+0.146
ired+0.136
apping+0.135
asing+0.134
iece+0.134
Neg logit contributions
Twe-0.115
Brown-0.109
 Twe-0.102
Ell-0.099
 Brown-0.098
Token some
Token ID617
Feature activation-3.41e+00

Pos logit contributions
ress+0.136
apping+0.124
olt+0.123
 goalie+0.122
ired+0.122
Neg logit contributions
 Brown-0.089
 Twe-0.084
 Ellie-0.082
 Blue-0.080
Brown-0.079
Token nice
Token ID3621
Feature activation-3.43e+00

Pos logit contributions
ress+0.142
olt+0.124
ill+0.123
upid+0.121
ours+0.118
Neg logit contributions
 long-0.105
Brown-0.104
Twe-0.098
ve-0.097
 de-0.097
Token smelling
Token ID48090
Feature activation-3.52e+00

Pos logit contributions
ress+0.140
olt+0.140
icking+0.134
 ankle+0.134
 goalie+0.133
Neg logit contributions
 past-0.091
 Brown-0.088
bows-0.087
 de-0.086
Brown-0.083
Token liquid
Token ID8122
Feature activation-1.05e+00

Pos logit contributions
ress+0.139
olt+0.131
 goalie+0.130
ired+0.128
apping+0.127
Neg logit contributions
 de-0.104
 Brown-0.102
 Twe-0.101
Brown-0.097
 Kirst-0.096
Token and
Token ID290
Feature activation+2.62e-02

Pos logit contributions
 conclusions+0.121
upid+0.120
ours+0.110
 ankle+0.103
 shallow+0.102
Neg logit contributions
Brown-0.080
Ell-0.079
ve-0.070
 Brown-0.066
Ar-0.064
Token the
Token ID262
Feature activation-5.18e-02

Pos logit contributions
Brown+0.001
Ell+0.001
Twe+0.001
 Stretch+0.001
Ar+0.001
Neg logit contributions
ours-0.001
ry-0.001
upid-0.001
 Sandy-0.001
 bubble-0.001
Token baby
Token ID5156
Feature activation-2.82e+00

Pos logit contributions
ry+0.002
ours+0.002
upid+0.002
 Sandy+0.001
 bubble+0.001
Neg logit contributions
Brown-0.002
Ell-0.002
Twe-0.002
 Stretch-0.002
Ar-0.002
Token walked
Token ID6807
Feature activation+1.17e-01

Pos logit contributions
upid+0.122
 conclusions+0.107
olt+0.104
ours+0.104
 ankle+0.103
Neg logit contributions
Ell-0.089
Brown-0.086
ve-0.084
 Kirst-0.081
 Brown-0.080
Token together
Token ID1978
Feature activation-1.18e+00

Pos logit contributions
Brown+0.005
Ell+0.005
Twe+0.004
 Stretch+0.004
Ar+0.004
Neg logit contributions
ours-0.004
ry-0.004
upid-0.004
 Sandy-0.004
 bubble-0.003
Token to
Token ID284
Feature activation-3.60e+00

Pos logit contributions
ours+0.054
upid+0.052
asing+0.049
ress+0.048
ry+0.048
Neg logit contributions
Ell-0.031
ve-0.030
Brown-0.030
 Brown-0.029
 Stretch-0.029
Token the
Token ID262
Feature activation-2.38e+00

Pos logit contributions
asing+0.200
ry+0.194
ays+0.192
iness+0.192
ares+0.190
Neg logit contributions
 visit-0.081
 Brown-0.076
 Ellie-0.074
 long-0.071
 Twe-0.070
Token garden
Token ID11376
Feature activation-1.44e+00

Pos logit contributions
ry+0.108
run+0.103
asing+0.103
omp+0.100
ours+0.099
Neg logit contributions
Ell-0.072
Brown-0.068
Twe-0.065
Yo-0.064
 ranch-0.064
Token.
Token ID13
Feature activation-2.59e-01

Pos logit contributions
upid+0.063
ours+0.061
asing+0.056
 ankle+0.055
ry+0.055
Neg logit contributions
Brown-0.041
Ell-0.041
 Brown-0.039
pe-0.038
hy-0.037
Token When
Token ID1649
Feature activation+6.04e-02

Pos logit contributions
ry+0.011
upid+0.011
ours+0.011
 bubble+0.010
asing+0.010
Neg logit contributions
Brown-0.008
Ell-0.008
 Stretch-0.007
 Brown-0.007
Twe-0.007
Token they
Token ID484
Feature activation-2.06e+00

Pos logit contributions
Brown+0.002
Ell+0.002
Twe+0.002
 Stretch+0.002
Ar+0.002
Neg logit contributions
ours-0.002
ry-0.002
upid-0.002
 Sandy-0.002
 bubble-0.002
Token cat
Token ID3797
Feature activation+3.99e-01

Pos logit contributions
ry+0.040
upid+0.038
ours+0.038
ill+0.035
asing+0.035
Neg logit contributions
Brown-0.033
Ell-0.029
hy-0.028
 Brown-0.027
 Stretch-0.027
Token said
Token ID531
Feature activation-3.57e-01

Pos logit contributions
Brown+0.017
Ell+0.016
Twe+0.015
Ar+0.015
 Stretch+0.015
Neg logit contributions
ry-0.013
ours-0.013
upid-0.012
 bubble-0.012
 Sandy-0.012
Token,
Token ID11
Feature activation-1.51e-01

Pos logit contributions
ours+0.013
upid+0.012
ry+0.012
 bubble+0.011
 shallow+0.011
Neg logit contributions
Brown-0.014
Ell-0.013
 Stretch-0.013
 Brown-0.013
hy-0.012
Token "
Token ID366
Feature activation+4.87e-01

Pos logit contributions
upid+0.004
ours+0.004
ry+0.004
 Sandy+0.004
 bubble+0.004
Neg logit contributions
Brown-0.007
Ell-0.007
Twe-0.006
 Stretch-0.006
 Brown-0.006
TokenMar
Token ID7676
Feature activation-7.99e-01

Pos logit contributions
Brown+0.019
Ell+0.018
 Stretch+0.017
 Barney+0.017
 Brown+0.017
Neg logit contributions
ry-0.017
ours-0.017
upid-0.017
 bubble-0.016
 Sandy-0.016
Tokenriage
Token ID4087
Feature activation-3.59e+00

Pos logit contributions
upid+0.029
ours+0.028
 Sandy+0.028
 bubble+0.028
 shallow+0.027
Neg logit contributions
Brown-0.030
Ell-0.027
hy-0.027
Twe-0.027
 Stretch-0.026
Token is
Token ID318
Feature activation-7.08e-01

Pos logit contributions
 shallow+0.163
upid+0.162
olt+0.160
 ankle+0.159
ours+0.158
Neg logit contributions
Brown-0.090
 Brown-0.083
pe-0.079
 Stretch-0.077
Ar-0.077
Token when
Token ID618
Feature activation-6.92e-02

Pos logit contributions
ours+0.022
upid+0.022
ry+0.021
about+0.020
olt+0.019
Neg logit contributions
Brown-0.031
Ell-0.029
Twe-0.028
 Stretch-0.028
 Brown-0.027
Token two
Token ID734
Feature activation+2.54e-01

Pos logit contributions
ours+0.002
ry+0.002
upid+0.002
 Sandy+0.002
 bubble+0.002
Neg logit contributions
Brown-0.003
Ell-0.003
Twe-0.002
 Stretch-0.002
Ar-0.002
Token animals
Token ID4695
Feature activation-2.30e-01

Pos logit contributions
Brown+0.011
Ell+0.011
Twe+0.010
 Stretch+0.010
Ar+0.010
Neg logit contributions
ours-0.008
ry-0.008
upid-0.008
 bubble-0.007
 Sandy-0.007
Token love
Token ID1842
Feature activation+4.17e-01

Pos logit contributions
ours+0.007
ry+0.007
upid+0.007
 Sandy+0.007
 bubble+0.006
Neg logit contributions
Brown-0.010
Ell-0.010
Twe-0.009
 Stretch-0.009
hy-0.009
Token good
Token ID922
Feature activation-4.42e-01

Pos logit contributions
Brown+0.007
Ell+0.007
Twe+0.006
 Stretch+0.006
Ar+0.006
Neg logit contributions
ours-0.008
upid-0.008
ry-0.008
 bubble-0.007
 Sandy-0.007
Token!
Token ID0
Feature activation-1.58e+00

Pos logit contributions
ours+0.015
ry+0.015
upid+0.015
 shallow+0.014
 bubble+0.014
Neg logit contributions
Brown-0.018
Ell-0.017
 Brown-0.016
hy-0.016
 Stretch-0.016
Token Can
Token ID1680
Feature activation+9.86e-01

Pos logit contributions
ry+0.070
upid+0.068
 shallow+0.068
ours+0.066
 bubble+0.066
Neg logit contributions
Brown-0.051
 Brown-0.045
Ell-0.045
Twe-0.044
hy-0.044
Token you
Token ID345
Feature activation-1.47e+00

Pos logit contributions
Brown+0.040
Ell+0.038
Ar+0.037
hy+0.035
ve+0.035
Neg logit contributions
ry-0.032
ours-0.032
upid-0.031
 Sandy-0.030
 foot-0.030
Token teach
Token ID4545
Feature activation+5.40e-01

Pos logit contributions
ours+0.066
ry+0.061
ays+0.059
upid+0.055
ees+0.054
Neg logit contributions
Brown-0.052
Ell-0.049
Twe-0.045
hy-0.044
 Brown-0.043
Token me
Token ID502
Feature activation-3.59e+00

Pos logit contributions
Brown+0.021
Ell+0.020
Ar+0.018
Twe+0.018
hy+0.018
Neg logit contributions
upid-0.019
ours-0.019
ry-0.019
 bubble-0.019
 Sandy-0.019
Token how
Token ID703
Feature activation-1.39e+00

Pos logit contributions
olt+0.137
 horror+0.136
 anth+0.131
 Sandy+0.123
inter+0.123
Neg logit contributions
Brown-0.113
 Christmas-0.111
 novels-0.107
 him-0.099
 Brown-0.098
Token to
Token ID284
Feature activation-3.84e-01

Pos logit contributions
ry+0.064
ours+0.064
 Sandy+0.063
ill+0.063
icking+0.063
Neg logit contributions
Brown-0.039
Ell-0.034
hy-0.031
 Stretch-0.031
Twe-0.030
Token play
Token ID711
Feature activation-1.91e+00

Pos logit contributions
ours+0.014
ry+0.014
upid+0.013
 Sandy+0.013
ress+0.013
Neg logit contributions
Brown-0.015
Ell-0.014
Twe-0.013
 Stretch-0.013
Ar-0.013
Token too
Token ID1165
Feature activation-2.49e+00

Pos logit contributions
upid+0.082
ours+0.080
 Sandy+0.079
ry+0.078
olt+0.077
Neg logit contributions
Brown-0.066
Ell-0.060
 Stretch-0.058
Red-0.057
 Brown-0.056
Token?"
Token ID1701
Feature activation+5.83e-01

Pos logit contributions
 ankle+0.111
asing+0.104
ours+0.104
 Sandy+0.103
upid+0.103
Neg logit contributions
Brown-0.072
 Brown-0.066
 Christmas-0.063
 Stretch-0.063
Ell-0.063
Token can
Token ID460
Feature activation+6.16e-01

Pos logit contributions
about+0.044
ours+0.043
apping+0.043
 shallow+0.042
udge+0.042
Neg logit contributions
Brown-0.048
 Brown-0.047
Ell-0.045
Twe-0.045
ve-0.043
Token have
Token ID423
Feature activation+9.32e-01

Pos logit contributions
Brown+0.030
Ell+0.029
Ar+0.028
 Stretch+0.027
 Barney+0.027
Neg logit contributions
ry-0.017
ours-0.016
upid-0.016
 bubble-0.016
 foot-0.015
Token this
Token ID428
Feature activation+2.07e-01

Pos logit contributions
Brown+0.046
Ell+0.044
hy+0.042
Ar+0.041
ve+0.041
Neg logit contributions
ours-0.023
 bubble-0.023
upid-0.022
 Sandy-0.022
ry-0.022
Token.
Token ID13
Feature activation-7.89e-02

Pos logit contributions
Brown+0.009
Ell+0.009
Twe+0.008
 Stretch+0.008
Ar+0.008
Neg logit contributions
ours-0.007
ry-0.007
upid-0.007
 bubble-0.006
 Sandy-0.006
Token It
Token ID632
Feature activation+1.21e+00

Pos logit contributions
ry+0.003
ours+0.003
upid+0.003
 Sandy+0.003
 bubble+0.003
Neg logit contributions
Brown-0.003
Ell-0.003
Twe-0.003
 Stretch-0.003
hy-0.003
Token will
Token ID481
Feature activation+2.44e+00

Pos logit contributions
Brown+0.069
Ell+0.067
Ar+0.064
Twe+0.064
hy+0.063
Neg logit contributions
ours-0.021
ry-0.021
 bubble-0.020
upid-0.018
 foot-0.017
Token make
Token ID787
Feature activation+8.97e-01

Pos logit contributions
Ell+0.139
Brown+0.138
Charles+0.133
 novels+0.132
 Barney+0.131
Neg logit contributions
 bubble-0.045
 promise-0.043
upid-0.040
 foot-0.037
 pour-0.037
Token you
Token ID345
Feature activation+3.51e-01

Pos logit contributions
Brown+0.033
Ell+0.033
hy+0.029
Ar+0.029
Twe+0.029
Neg logit contributions
upid-0.034
 bubble-0.034
ours-0.034
 Sandy-0.033
ry-0.033
Token feel
Token ID1254
Feature activation-3.41e-01

Pos logit contributions
Brown+0.017
Ell+0.016
Twe+0.015
Ar+0.015
 Stretch+0.015
Neg logit contributions
ry-0.009
upid-0.009
ours-0.009
 bubble-0.009
 foot-0.008
Token better
Token ID1365
Feature activation+8.35e-01

Pos logit contributions
ours+0.010
ry+0.010
upid+0.009
about+0.009
 Sandy+0.009
Neg logit contributions
Brown-0.016
Ell-0.015
Twe-0.015
 Stretch-0.014
Ar-0.014
Token,"
Token ID553
Feature activation+4.73e-01

Pos logit contributions
Brown+0.037
Ell+0.036
Twe+0.033
hy+0.032
 Stretch+0.032
Neg logit contributions
ry-0.028
ours-0.026
 foot-0.025
upid-0.025
ful-0.024
Token saw
Token ID2497
Feature activation-1.21e-02

Pos logit contributions
ours+0.058
upid+0.057
ress+0.055
ry+0.053
 ankle+0.052
Neg logit contributions
Brown-0.046
Ell-0.043
hy-0.043
pe-0.043
 Stretch-0.042
Token monkeys
Token ID26697
Feature activation-1.72e-01

Pos logit contributions
ours+0.000
ry+0.000
upid+0.000
 Sandy+0.000
 bubble+0.000
Neg logit contributions
Brown-0.001
Ell-0.001
Twe-0.000
 Stretch-0.000
hy-0.000
Token,
Token ID11
Feature activation+6.42e-01

Pos logit contributions
ours+0.006
upid+0.006
ry+0.006
 Sandy+0.005
 bubble+0.005
Neg logit contributions
Brown-0.007
Ell-0.007
Twe-0.006
 Stretch-0.006
hy-0.006
Token elephants
Token ID25368
Feature activation+5.96e-01

Pos logit contributions
Brown+0.029
Ell+0.027
Twe+0.025
Ar+0.025
 Stretch+0.025
Neg logit contributions
ours-0.019
ry-0.019
upid-0.019
 Sandy-0.019
 bubble-0.018
Token,
Token ID11
Feature activation+1.28e+00

Pos logit contributions
Brown+0.029
Ell+0.027
Twe+0.026
Ar+0.026
 Stretch+0.025
Neg logit contributions
ry-0.017
ours-0.016
upid-0.016
 Sandy-0.015
 bubble-0.015
Token gir
Token ID37370
Feature activation+2.63e+00

Pos logit contributions
Brown+0.062
Ell+0.059
Twe+0.056
Ar+0.056
 Stretch+0.054
Neg logit contributions
ours-0.034
 Sandy-0.033
ry-0.033
 bubble-0.032
upid-0.031
Tokenaff
Token ID2001
Feature activation+1.20e+00

Pos logit contributions
Brown+0.113
Ell+0.107
Ar+0.099
 Barney+0.095
 Stretch+0.093
Neg logit contributions
ry-0.086
upid-0.085
 Sandy-0.084
ours-0.083
about-0.082
Tokenes
Token ID274
Feature activation+5.14e-01

Pos logit contributions
Brown+0.067
Ell+0.063
Twe+0.060
Ar+0.059
hy+0.059
Neg logit contributions
ry-0.027
ours-0.025
upid-0.024
 Sandy-0.022
asing-0.021
Token,
Token ID11
Feature activation+1.39e+00

Pos logit contributions
Brown+0.024
Ell+0.023
Twe+0.021
Ar+0.021
 Stretch+0.021
Neg logit contributions
ry-0.015
ours-0.015
upid-0.014
 Sandy-0.014
 bubble-0.013
Token and
Token ID290
Feature activation+8.63e-01

Pos logit contributions
Brown+0.068
Ell+0.065
Twe+0.061
Ar+0.061
 Stretch+0.059
Neg logit contributions
ours-0.036
ry-0.035
 bubble-0.035
 Sandy-0.034
upid-0.033
Token lions
Token ID31420
Feature activation+7.66e-01

Pos logit contributions
Brown+0.038
Ell+0.036
Twe+0.034
Ar+0.033
ve+0.032
Neg logit contributions
ours-0.026
ry-0.026
upid-0.026
 Sandy-0.025
 bubble-0.025
Token a
Token ID257
Feature activation-4.49e-01

Pos logit contributions
Brown+0.008
Ell+0.008
Twe+0.007
 Stretch+0.007
Ar+0.007
Neg logit contributions
ours-0.005
ry-0.005
upid-0.005
 Sandy-0.005
 bubble-0.005
Token je
Token ID11223
Feature activation+1.59e+00

Pos logit contributions
upid+0.015
ours+0.014
ry+0.014
asing+0.013
 Sandy+0.013
Neg logit contributions
Brown-0.020
Ell-0.019
Twe-0.018
 Stretch-0.017
Ar-0.017
Tokenep
Token ID538
Feature activation-2.84e-01

Pos logit contributions
Brown+0.062
 Stretch+0.060
Ell+0.060
 Barney+0.059
 Brown+0.058
Neg logit contributions
ry-0.052
 foot-0.048
ress-0.047
upid-0.046
ours-0.046
Token.
Token ID13
Feature activation+5.68e-01

Pos logit contributions
ours+0.010
upid+0.010
ry+0.009
 Sandy+0.009
 bubble+0.009
Neg logit contributions
Brown-0.012
Ell-0.011
Twe-0.010
hy-0.010
 Stretch-0.010
Token The
Token ID383
Feature activation+1.63e+00

Pos logit contributions
Brown+0.024
Ell+0.023
ve+0.021
Twe+0.021
Ar+0.021
Neg logit contributions
ry-0.018
ours-0.018
upid-0.018
 Sandy-0.017
 bubble-0.016
Token je
Token ID11223
Feature activation+2.48e+00

Pos logit contributions
Ell+0.053
Brown+0.051
 Stretch+0.046
Hope+0.045
 Kirst+0.045
Neg logit contributions
 foot-0.066
upid-0.065
 eye-0.064
 hand-0.063
 bubble-0.062
Tokenep
Token ID538
Feature activation-1.34e-01

Pos logit contributions
 Stretch+0.105
 Barney+0.103
 Kirst+0.101
Ell+0.100
Brown+0.100
Neg logit contributions
ry-0.068
ress-0.062
ig-0.062
 foot-0.059
ours-0.057
Token was
Token ID373
Feature activation+1.17e+00

Pos logit contributions
ours+0.005
upid+0.004
ry+0.004
 Sandy+0.004
 bubble+0.004
Neg logit contributions
Brown-0.006
Ell-0.005
Twe-0.005
 Stretch-0.005
hy-0.005
Token weird
Token ID7650
Feature activation-4.63e-01

Pos logit contributions
Brown+0.048
Ell+0.047
Twe+0.042
 Stretch+0.041
Hope+0.041
Neg logit contributions
 foot-0.040
upid-0.038
ours-0.037
 hand-0.037
 bubble-0.037
Token.
Token ID13
Feature activation+9.43e-03

Pos logit contributions
ours+0.017
upid+0.017
ry+0.016
 Sandy+0.016
asing+0.016
Neg logit contributions
Brown-0.018
Ell-0.016
 Brown-0.016
hy-0.016
 Stretch-0.016
Token So
Token ID1406
Feature activation+6.80e-01

Pos logit contributions
Brown+0.000
Ell+0.000
Twe+0.000
 Stretch+0.000
Ar+0.000
Neg logit contributions
ours-0.000
ry-0.000
upid-0.000
 bubble-0.000
 Sandy-0.000
Token,"
Token ID553
Feature activation+1.01e-01

Pos logit contributions
Brown+0.064
Ell+0.061
Twe+0.058
Ar+0.057
 Stretch+0.055
Neg logit contributions
ry-0.029
upid-0.025
ours-0.025
 foot-0.024
 Sandy-0.024
Token Lily
Token ID20037
Feature activation+1.13e-02

Pos logit contributions
Brown+0.005
Ell+0.005
Twe+0.004
 Stretch+0.004
Ar+0.004
Neg logit contributions
ours-0.003
ry-0.003
upid-0.003
 Sandy-0.002
 bubble-0.002
Token said
Token ID531
Feature activation-3.14e-01

Pos logit contributions
Brown+0.000
Ell+0.000
Twe+0.000
 Stretch+0.000
Ar+0.000
Neg logit contributions
ours-0.000
ry-0.000
upid-0.000
 Sandy-0.000
 bubble-0.000
Token.
Token ID13
Feature activation-6.56e-01

Pos logit contributions
upid+0.011
ours+0.011
ry+0.011
 bubble+0.010
about+0.010
Neg logit contributions
Brown-0.012
Ell-0.011
 Brown-0.010
 Stretch-0.010
hy-0.010
Token "
Token ID366
Feature activation+8.17e-01

Pos logit contributions
upid+0.024
ry+0.023
ours+0.023
 bubble+0.022
asing+0.022
Neg logit contributions
Brown-0.025
Ell-0.024
 Brown-0.023
 Stretch-0.023
 Barney-0.022
TokenThose
Token ID9627
Feature activation+2.35e+00

Pos logit contributions
Brown+0.033
Ell+0.033
hy+0.031
 Stretch+0.031
 Brown+0.030
Neg logit contributions
ry-0.026
ours-0.026
upid-0.026
about-0.025
 foot-0.025
Token are
Token ID389
Feature activation+2.01e+00

Pos logit contributions
Brown+0.156
Ell+0.152
Ar+0.150
Twe+0.145
Mic+0.141
Neg logit contributions
ours-0.018
upid-0.018
 hand-0.015
 twin-0.011
 of-0.010
Token not
Token ID407
Feature activation+1.32e+00

Pos logit contributions
Brown+0.094
Ell+0.092
Ar+0.086
Twe+0.085
thank+0.084
Neg logit contributions
ours-0.056
 twin-0.052
upid-0.050
ry-0.049
 Sandy-0.049
Token our
Token ID674
Feature activation+2.85e+00

Pos logit contributions
Brown+0.060
Ell+0.058
Twe+0.054
Ar+0.054
hy+0.053
Neg logit contributions
ours-0.039
ry-0.039
upid-0.039
ful-0.036
 twin-0.036
Token gifts
Token ID13201
Feature activation+7.84e-01

Pos logit contributions
Brown+0.142
Ell+0.137
Ar+0.130
Twe+0.128
hy+0.125
Neg logit contributions
 hand-0.077
 sister-0.074
 twin-0.074
 promise-0.072
 ways-0.070
Token.
Token ID13
Feature activation+1.06e-01

Pos logit contributions
Brown+0.038
Ell+0.037
Twe+0.035
Ar+0.034
 Stretch+0.033
Neg logit contributions
ry-0.022
ours-0.021
upid-0.021
 bubble-0.019
 foot-0.019
Token duck
Token ID22045
Feature activation-7.01e-01

Pos logit contributions
ours+0.046
upid+0.044
 Sandy+0.040
ays+0.039
ry+0.038
Neg logit contributions
Brown-0.065
hy-0.061
 Stretch-0.058
Ell-0.058
 Brown-0.058
Token.
Token ID13
Feature activation-2.20e-01

Pos logit contributions
ours+0.027
upid+0.026
about+0.025
ry+0.024
 bubble+0.024
Neg logit contributions
Brown-0.026
Ell-0.025
Twe-0.023
 Brown-0.022
 Stretch-0.022
Token "
Token ID366
Feature activation+8.59e-01

Pos logit contributions
upid+0.007
ry+0.007
ours+0.007
 bubble+0.006
 Sandy+0.006
Neg logit contributions
Brown-0.010
Ell-0.009
Twe-0.009
 Stretch-0.009
 Brown-0.009
TokenOur
Token ID5122
Feature activation+1.04e+00

Pos logit contributions
Brown+0.036
Ell+0.034
hy+0.034
 Brown+0.034
 Stretch+0.033
Neg logit contributions
ry-0.025
ours-0.025
 bubble-0.025
upid-0.024
about-0.024
Token family
Token ID1641
Feature activation+9.99e-01

Pos logit contributions
Ell+0.046
Brown+0.045
Twe+0.039
 Kirst+0.038
Mic+0.038
Neg logit contributions
 foot-0.035
ours-0.035
 bubble-0.034
ry-0.034
upid-0.033
Token treasure
Token ID14068
Feature activation+2.50e+00

Pos logit contributions
Brown+0.049
Ell+0.047
Twe+0.044
Ar+0.042
hy+0.042
Neg logit contributions
ry-0.028
upid-0.027
ours-0.027
 bubble-0.025
 foot-0.024
Token is
Token ID318
Feature activation+6.05e-01

Pos logit contributions
Ell+0.135
Brown+0.134
Twe+0.124
Mic+0.123
Red+0.116
Neg logit contributions
ry-0.068
ours-0.057
 foot-0.055
upid-0.055
 bubble-0.053
Token hidden
Token ID7104
Feature activation+8.96e-01

Pos logit contributions
Brown+0.025
Ell+0.024
Twe+0.022
hy+0.021
 Stretch+0.021
Neg logit contributions
ry-0.021
upid-0.021
ours-0.021
 bubble-0.020
 foot-0.019
Token!"
Token ID2474
Feature activation+7.15e-01

Pos logit contributions
Brown+0.041
Ell+0.039
hy+0.036
Twe+0.036
Ar+0.035
Neg logit contributions
ry-0.029
ours-0.027
upid-0.026
 foot-0.025
 bubble-0.025
Token
Token ID220
Feature activation+2.35e-01

Pos logit contributions
Brown+0.032
Ell+0.031
Twe+0.029
hy+0.028
Ar+0.028
Neg logit contributions
ry-0.021
ours-0.021
 bubble-0.020
 Sandy-0.020
upid-0.019
Token
Token ID198
Feature activation+5.27e-01

Pos logit contributions
Brown+0.009
Ell+0.009
Twe+0.008
 Stretch+0.008
hy+0.008
Neg logit contributions
ours-0.008
ry-0.008
upid-0.008
 Sandy-0.008
 bubble-0.008
Token funny
Token ID8258
Feature activation-4.37e-01

Pos logit contributions
ry+0.018
ours+0.018
upid+0.017
 Sandy+0.016
ress+0.016
Neg logit contributions
Brown-0.018
Ell-0.017
hy-0.017
Twe-0.016
Ar-0.016
Token pants
Token ID12581
Feature activation+3.12e-01

Pos logit contributions
ours+0.015
ry+0.015
upid+0.015
 Sandy+0.014
 bubble+0.014
Neg logit contributions
Brown-0.017
Ell-0.016
hy-0.015
Twe-0.015
Ar-0.015
Token.
Token ID13
Feature activation+7.57e-02

Pos logit contributions
Brown+0.014
Ell+0.013
Twe+0.012
Ar+0.012
 Stretch+0.012
Neg logit contributions
ry-0.010
ours-0.010
upid-0.010
 Sandy-0.009
 bubble-0.009
Token Tim
Token ID5045
Feature activation-3.72e-01

Pos logit contributions
Brown+0.003
Ell+0.003
Twe+0.002
 Stretch+0.002
Ar+0.002
Neg logit contributions
ours-0.003
ry-0.003
upid-0.003
 Sandy-0.003
 bubble-0.003
Token and
Token ID290
Feature activation-1.78e-01

Pos logit contributions
ours+0.012
upid+0.012
ry+0.012
 Sandy+0.011
 bubble+0.011
Neg logit contributions
Brown-0.016
Ell-0.015
Twe-0.014
 Stretch-0.014
 Brown-0.014
Token his
Token ID465
Feature activation+2.21e-01

Pos logit contributions
ry+0.005
ours+0.005
upid+0.005
 bubble+0.005
about+0.005
Neg logit contributions
Brown-0.008
Ell-0.008
 Stretch-0.007
Twe-0.007
 Brown-0.007
Token mom
Token ID1995
Feature activation-7.51e-01

Pos logit contributions
Brown+0.011
Ell+0.011
Twe+0.010
 Stretch+0.010
Ar+0.010
Neg logit contributions
ours-0.006
ry-0.006
upid-0.005
 bubble-0.005
 Sandy-0.005
Token went
Token ID1816
Feature activation-7.94e-02

Pos logit contributions
ours+0.031
upid+0.031
ry+0.030
 Sandy+0.028
 shallow+0.028
Neg logit contributions
Brown-0.025
Ell-0.024
hy-0.023
 Brown-0.022
Twe-0.021
Token back
Token ID736
Feature activation-2.19e-01

Pos logit contributions
ours+0.003
upid+0.003
ry+0.003
 bubble+0.003
 Sandy+0.003
Neg logit contributions
Brown-0.003
Ell-0.003
Twe-0.003
 Stretch-0.003
Ar-0.003
Token home
Token ID1363
Feature activation+2.31e-01

Pos logit contributions
ours+0.008
upid+0.008
ry+0.008
 bubble+0.008
about+0.008
Neg logit contributions
Brown-0.008
Ell-0.008
 Brown-0.007
 Stretch-0.007
Twe-0.007
Token,
Token ID11
Feature activation+8.03e-01

Pos logit contributions
Brown+0.010
Ell+0.010
Twe+0.009
 Stretch+0.009
Ar+0.009
Neg logit contributions
ry-0.007
ours-0.007
upid-0.007
 Sandy-0.007
 bubble-0.007
Token play
Token ID711
Feature activation-9.65e-01

Pos logit contributions
Brown+0.032
Ell+0.030
hy+0.028
 Stretch+0.028
Twe+0.028
Neg logit contributions
 bubble-0.021
ours-0.021
ry-0.021
upid-0.020
 Sandy-0.020
Token.
Token ID13
Feature activation+5.33e-01

Pos logit contributions
upid+0.043
ours+0.043
ry+0.041
 Sandy+0.039
ress+0.038
Neg logit contributions
Brown-0.031
Ell-0.029
Twe-0.027
 Brown-0.026
 Stretch-0.026
Token
Token ID198
Feature activation+3.80e-02

Pos logit contributions
Brown+0.021
Ell+0.019
Twe+0.018
Ar+0.017
 Stretch+0.017
Neg logit contributions
ours-0.019
upid-0.019
ry-0.019
 Sandy-0.019
 bubble-0.018
Token
Token ID198
Feature activation+5.11e-01

Pos logit contributions
Brown+0.001
Ell+0.001
Twe+0.001
 Stretch+0.001
Ar+0.001
Neg logit contributions
ours-0.001
ry-0.001
upid-0.001
 Sandy-0.001
 bubble-0.001
TokenBilly
Token ID49640
Feature activation+4.61e-01

Pos logit contributions
Brown+0.021
Ell+0.020
 Stretch+0.019
hy+0.019
 Brown+0.018
Neg logit contributions
ry-0.016
upid-0.016
ours-0.016
about-0.015
 Sandy-0.015
Token said
Token ID531
Feature activation+4.01e-01

Pos logit contributions
Brown+0.021
Ell+0.020
Twe+0.019
Ar+0.019
hy+0.018
Neg logit contributions
ry-0.014
ours-0.013
upid-0.013
 bubble-0.013
 Sandy-0.013
Token,
Token ID11
Feature activation+9.56e-01

Pos logit contributions
Brown+0.019
Ell+0.018
Twe+0.017
Ar+0.017
 Stretch+0.017
Neg logit contributions
ours-0.011
ry-0.011
upid-0.011
 Sandy-0.010
 bubble-0.010
Token "
Token ID366
Feature activation+5.07e-01

Pos logit contributions
Brown+0.053
Ell+0.050
Ar+0.049
Twe+0.049
hy+0.047
Neg logit contributions
ours-0.018
ry-0.018
 Sandy-0.017
 bubble-0.016
upid-0.016
TokenLet
Token ID5756
Feature activation+1.54e-01

Pos logit contributions
Brown+0.020
Ell+0.018
 Stretch+0.018
hy+0.018
 Kirst+0.017
Neg logit contributions
ry-0.017
ours-0.017
upid-0.017
 bubble-0.016
 Sandy-0.016
Token's
Token ID338
Feature activation-9.67e-01

Pos logit contributions
Brown+0.009
Ell+0.008
Twe+0.008
 Stretch+0.008
Ar+0.008
Neg logit contributions
ours-0.003
ry-0.003
upid-0.003
 Sandy-0.002
 bubble-0.002
Token go
Token ID467
Feature activation-2.65e-01

Pos logit contributions
ours+0.041
ry+0.040
upid+0.038
ress+0.036
asing+0.036
Neg logit contributions
Brown-0.034
Ell-0.031
 Brown-0.030
Twe-0.030
 Stretch-0.029
TokenThey
Token ID2990
Feature activation-1.92e-01

Pos logit contributions
ours+0.010
ry+0.010
upid+0.010
 bubble+0.009
 Sandy+0.009
Neg logit contributions
Brown-0.011
Ell-0.010
Twe-0.009
Ar-0.009
 Stretch-0.009
Token pretended
Token ID36766
Feature activation+1.84e-01

Pos logit contributions
ours+0.007
upid+0.007
ry+0.007
 Sandy+0.006
 bubble+0.006
Neg logit contributions
Brown-0.008
Ell-0.007
 Stretch-0.007
Ar-0.006
Twe-0.006
Token that
Token ID326
Feature activation-1.14e+00

Pos logit contributions
Brown+0.008
Ell+0.007
Twe+0.007
 Stretch+0.007
Ar+0.007
Neg logit contributions
ours-0.006
ry-0.006
upid-0.006
 Sandy-0.006
 bubble-0.006
Token the
Token ID262
Feature activation+3.05e-01

Pos logit contributions
upid+0.044
ours+0.042
ry+0.041
asing+0.038
olt+0.038
Neg logit contributions
Brown-0.043
Ell-0.041
 Brown-0.040
 Stretch-0.040
 Barney-0.038
Token table
Token ID3084
Feature activation+3.23e-01

Pos logit contributions
Brown+0.012
Ell+0.012
Twe+0.011
 Stretch+0.010
hy+0.010
Neg logit contributions
ours-0.011
ry-0.011
upid-0.011
 bubble-0.010
 foot-0.010
Token was
Token ID373
Feature activation+6.07e-01

Pos logit contributions
Brown+0.015
Ell+0.014
Twe+0.013
 Stretch+0.013
Ar+0.013
Neg logit contributions
ry-0.010
ours-0.010
upid-0.009
 bubble-0.009
 Sandy-0.009
Token a
Token ID257
Feature activation+6.29e-01

Pos logit contributions
Brown+0.027
Ell+0.026
Twe+0.024
 Stretch+0.023
Ar+0.023
Neg logit contributions
ours-0.019
upid-0.018
ry-0.018
 bubble-0.018
 Sandy-0.018
Token rocket
Token ID10701
Feature activation+1.29e+00

Pos logit contributions
Brown+0.024
Ell+0.023
Twe+0.022
 Stretch+0.021
hy+0.020
Neg logit contributions
ry-0.023
upid-0.022
 bubble-0.022
ours-0.022
 foot-0.021
Token ship
Token ID4074
Feature activation+1.57e+00

Pos logit contributions
Brown+0.063
Ell+0.061
Twe+0.060
Ar+0.056
Mic+0.056
Neg logit contributions
ry-0.037
 bubble-0.033
upid-0.032
ours-0.032
 foot-0.031
Token and
Token ID290
Feature activation-7.75e-01

Pos logit contributions
Brown+0.079
Ell+0.077
Twe+0.075
Ar+0.070
Mic+0.070
Neg logit contributions
ry-0.041
 bubble-0.037
upid-0.037
ours-0.035
 foot-0.035
Token that
Token ID326
Feature activation-5.99e-01

Pos logit contributions
upid+0.027
ours+0.027
ry+0.027
olt+0.024
apping+0.024
Neg logit contributions
Brown-0.031
Ell-0.029
 Stretch-0.028
 Brown-0.028
 Barney-0.027
Token bush
Token ID24484
Feature activation+7.55e-01

Pos logit contributions
ours+0.015
ry+0.014
upid+0.014
asing+0.013
 Sandy+0.013
Neg logit contributions
Brown-0.013
Ell-0.013
Ar-0.012
Twe-0.012
 Stretch-0.011
Token!
Token ID0
Feature activation-4.06e-01

Pos logit contributions
Brown+0.035
Ell+0.033
Twe+0.032
Ar+0.031
 Stretch+0.031
Neg logit contributions
ry-0.022
ours-0.021
upid-0.021
 Sandy-0.020
 bubble-0.020
Token<|endoftext|>
Token ID50256
Feature activation-6.38e-01

Pos logit contributions
ours+0.016
ry+0.016
upid+0.016
 bubble+0.015
about+0.015
Neg logit contributions
Brown-0.014
Ell-0.014
 Stretch-0.013
Ar-0.012
 Barney-0.012
TokenOnce
Token ID7454
Feature activation-2.99e-01

Pos logit contributions
ours+0.027
upid+0.027
ry+0.027
asing+0.025
 bubble+0.025
Neg logit contributions
Brown-0.021
Ell-0.020
 Stretch-0.018
Ar-0.018
 Brown-0.018
Token upon
Token ID2402
Feature activation-1.15e-02

Pos logit contributions
ours+0.009
upid+0.009
ry+0.009
 bubble+0.008
about+0.008
Neg logit contributions
Brown-0.013
Ell-0.013
Twe-0.012
 Stretch-0.012
 Brown-0.011
Token a
Token ID257
Feature activation+1.39e+00

Pos logit contributions
ours+0.000
ry+0.000
upid+0.000
 Sandy+0.000
 bubble+0.000
Neg logit contributions
Brown-0.001
Ell-0.000
Twe-0.000
 Stretch-0.000
Ar-0.000
Token time
Token ID640
Feature activation+8.07e-01

Pos logit contributions
Brown+0.069
Ell+0.064
 Stretch+0.061
 Brown+0.061
 Barney+0.061
Neg logit contributions
upid-0.034
ry-0.034
 bubble-0.033
ours-0.033
 foot-0.033
Token,
Token ID11
Feature activation-3.52e-01

Pos logit contributions
Brown+0.041
Ell+0.038
Twe+0.037
Ar+0.037
 Stretch+0.035
Neg logit contributions
ry-0.021
ours-0.020
 Sandy-0.020
upid-0.020
 foot-0.018
Token there
Token ID612
Feature activation-7.08e-01

Pos logit contributions
ours+0.011
upid+0.010
ry+0.010
ress+0.010
 bubble+0.009
Neg logit contributions
Brown-0.016
Ell-0.015
 Stretch-0.014
Twe-0.014
 Brown-0.014
Token was
Token ID373
Feature activation+4.87e-01

Pos logit contributions
ours+0.026
upid+0.026
ry+0.025
asing+0.025
 ankle+0.023
Neg logit contributions
Brown-0.027
Ell-0.025
 Brown-0.024
 Stretch-0.024
Twe-0.023
Token a
Token ID257
Feature activation-5.70e-02

Pos logit contributions
Brown+0.024
Ell+0.022
Twe+0.021
hy+0.021
 Stretch+0.021
Neg logit contributions
ours-0.012
upid-0.012
 Sandy-0.012
 bubble-0.012
ry-0.012
Token.
Token ID13
Feature activation-9.29e-01

Pos logit contributions
Brown+0.024
Ell+0.023
Twe+0.022
Ar+0.021
 Stretch+0.021
Neg logit contributions
ry-0.014
upid-0.014
ours-0.014
 foot-0.013
 Sandy-0.013
Token He
Token ID679
Feature activation+6.32e-01

Pos logit contributions
upid+0.054
asing+0.054
ry+0.054
icking+0.053
ours+0.052
Neg logit contributions
Brown-0.012
 Stretch-0.011
 Brown-0.010
 Barney-0.010
Ar-0.010
Token walked
Token ID6807
Feature activation+8.51e-01

Pos logit contributions
Brown+0.029
Ell+0.027
Twe+0.026
 Stretch+0.025
hy+0.024
Neg logit contributions
ry-0.020
ours-0.019
 bubble-0.018
 Sandy-0.018
upid-0.017
Token away
Token ID1497
Feature activation+1.01e+00

Pos logit contributions
Brown+0.044
Ell+0.041
Twe+0.040
 Stretch+0.038
Mic+0.037
Neg logit contributions
ry-0.023
 Sandy-0.021
ours-0.020
 foot-0.020
upid-0.020
Token.
Token ID13
Feature activation-4.01e-01

Pos logit contributions
Brown+0.054
Ell+0.051
Twe+0.050
Mic+0.048
Ar+0.046
Neg logit contributions
ry-0.025
 Sandy-0.022
 foot-0.022
 bubble-0.021
upid-0.020
Token He
Token ID679
Feature activation+6.17e-01

Pos logit contributions
ry+0.019
upid+0.019
ours+0.019
asing+0.018
 bubble+0.018
Neg logit contributions
Brown-0.011
Ell-0.010
 Stretch-0.009
Twe-0.009
Ar-0.009
Token did
Token ID750
Feature activation+5.13e-01

Pos logit contributions
Brown+0.028
Ell+0.027
Twe+0.025
 Stretch+0.024
 Barney+0.024
Neg logit contributions
ry-0.019
ours-0.019
 bubble-0.017
 Sandy-0.017
upid-0.017
Token not
Token ID407
Feature activation+1.16e+00

Pos logit contributions
Brown+0.020
Ell+0.019
Twe+0.017
Ar+0.017
 Stretch+0.017
Neg logit contributions
upid-0.019
ours-0.019
ry-0.019
 Sandy-0.018
 foot-0.018
Token say
Token ID910
Feature activation-1.55e-01

Pos logit contributions
Brown+0.062
Ell+0.060
 Stretch+0.056
Twe+0.056
 Barney+0.056
Neg logit contributions
upid-0.024
 foot-0.023
 bubble-0.023
ry-0.023
 Sandy-0.022
Token thank
Token ID5875
Feature activation+1.25e+00

Pos logit contributions
ry+0.005
ours+0.005
upid+0.005
 bubble+0.004
about+0.004
Neg logit contributions
Brown-0.007
Ell-0.006
 Stretch-0.006
hy-0.006
Twe-0.006
Token you
Token ID345
Feature activation-4.84e-01

Pos logit contributions
Brown+0.051
Ell+0.050
Ar+0.044
hy+0.043
Twe+0.042
Neg logit contributions
 Sandy-0.047
ry-0.044
ours-0.044
 bubble-0.043
 foot-0.043
Token had
Token ID550
Feature activation-4.82e-01

Pos logit contributions
Brown+0.009
Ell+0.008
Twe+0.008
 Stretch+0.008
Ar+0.008
Neg logit contributions
ry-0.008
ours-0.008
upid-0.008
 Sandy-0.008
 bubble-0.008
Token been
Token ID587
Feature activation-6.13e-01

Pos logit contributions
ours+0.020
ry+0.019
upid+0.019
about+0.018
 bubble+0.017
Neg logit contributions
Brown-0.017
Ell-0.017
Ar-0.015
 Stretch-0.015
Twe-0.015
Token able
Token ID1498
Feature activation+1.14e-01

Pos logit contributions
ours+0.021
upid+0.021
ry+0.021
about+0.020
 bubble+0.019
Neg logit contributions
Brown-0.025
Ell-0.025
Twe-0.022
hy-0.022
 Stretch-0.022
Token to
Token ID284
Feature activation-3.54e-03

Pos logit contributions
Brown+0.005
Ell+0.004
Twe+0.004
 Stretch+0.004
Ar+0.004
Neg logit contributions
ours-0.004
ry-0.004
upid-0.004
 Sandy-0.004
 bubble-0.004
Token help
Token ID1037
Feature activation-5.17e-01

Pos logit contributions
ours+0.000
ry+0.000
upid+0.000
 Sandy+0.000
 bubble+0.000
Neg logit contributions
Brown-0.000
Ell-0.000
Twe-0.000
 Stretch-0.000
Ar-0.000
Token someone
Token ID2130
Feature activation+1.29e-01

Pos logit contributions
ours+0.019
upid+0.018
ry+0.017
about+0.016
 shallow+0.016
Neg logit contributions
Brown-0.021
Ell-0.020
Twe-0.019
 Brown-0.019
 Stretch-0.019
Token else
Token ID2073
Feature activation+1.95e-01

Pos logit contributions
Brown+0.006
Ell+0.005
Twe+0.005
 Stretch+0.005
Ar+0.005
Neg logit contributions
ours-0.004
ry-0.004
upid-0.004
 Sandy-0.004
 bubble-0.004
Token.
Token ID13
Feature activation-2.14e-01

Pos logit contributions
Brown+0.009
Ell+0.008
Twe+0.008
Ar+0.008
 Stretch+0.008
Neg logit contributions
ry-0.006
ours-0.006
upid-0.006
 Sandy-0.006
 bubble-0.006
Token She
Token ID1375
Feature activation-1.48e-01

Pos logit contributions
ours+0.009
ry+0.009
upid+0.009
 bubble+0.009
about+0.009
Neg logit contributions
Brown-0.007
Ell-0.007
Twe-0.006
 Stretch-0.006
 Brown-0.006
Token learned
Token ID4499
Feature activation-2.79e-01

Pos logit contributions
ours+0.005
upid+0.005
ry+0.005
 Sandy+0.005
 bubble+0.005
Neg logit contributions
Brown-0.006
Ell-0.006
Twe-0.005
 Stretch-0.005
Ar-0.005
Token that
Token ID326
Feature activation-6.25e-01

Pos logit contributions
ry+0.010
ours+0.010
upid+0.010
about+0.009
 bubble+0.009
Neg logit contributions
Brown-0.011
Ell-0.011
Twe-0.010
 Stretch-0.010
 Brown-0.010
Token called
Token ID1444
Feature activation-2.84e-01

Pos logit contributions
ours+0.043
upid+0.043
about+0.038
 Sandy+0.038
 ankle+0.038
Neg logit contributions
Brown-0.044
Ell-0.041
Twe-0.040
 Brown-0.036
 Stretch-0.036
Token Sammy
Token ID44067
Feature activation-4.69e-01

Pos logit contributions
ours+0.009
ry+0.009
upid+0.008
about+0.008
asing+0.008
Neg logit contributions
Brown-0.013
Ell-0.012
 Stretch-0.012
Twe-0.012
 Brown-0.012
Token.
Token ID13
Feature activation-6.30e-01

Pos logit contributions
ours+0.018
upid+0.018
ry+0.017
 Sandy+0.016
about+0.016
Neg logit contributions
Brown-0.018
Ell-0.017
Twe-0.016
 Stretch-0.016
 Brown-0.015
Token Sammy
Token ID44067
Feature activation+1.60e+00

Pos logit contributions
ours+0.018
ry+0.017
upid+0.017
ress+0.016
asing+0.016
Neg logit contributions
Brown-0.029
Ell-0.029
 Brown-0.027
 Barney-0.027
Twe-0.027
Token was
Token ID373
Feature activation+8.28e-01

Pos logit contributions
Brown+0.075
Ell+0.073
hy+0.068
Ar+0.067
Twe+0.066
Neg logit contributions
ry-0.043
 bubble-0.042
 Sandy-0.039
 foot-0.039
ours-0.038
Token always
Token ID1464
Feature activation-2.54e-01

Pos logit contributions
Brown+0.035
Ell+0.033
Twe+0.031
Ar+0.030
ve+0.029
Neg logit contributions
 Sandy-0.027
ours-0.026
 foot-0.026
 bubble-0.026
upid-0.026
Token looking
Token ID2045
Feature activation+3.85e-01

Pos logit contributions
ours+0.011
ry+0.011
upid+0.011
about+0.010
 bubble+0.010
Neg logit contributions
Brown-0.009
Ell-0.008
 Stretch-0.007
Twe-0.007
 Brown-0.007
Token for
Token ID329
Feature activation-7.60e-01

Pos logit contributions
Brown+0.018
Ell+0.017
Twe+0.016
Ar+0.015
hy+0.015
Neg logit contributions
ours-0.012
ry-0.012
upid-0.012
 Sandy-0.011
 foot-0.011
Token exciting
Token ID7895
Feature activation+1.07e-01

Pos logit contributions
ours+0.030
upid+0.030
ry+0.030
asing+0.028
olt+0.028
Neg logit contributions
Brown-0.027
Ell-0.025
Twe-0.024
 Brown-0.024
 Stretch-0.024
Token adventures
Token ID17545
Feature activation+1.13e+00

Pos logit contributions
Brown+0.004
Ell+0.004
Twe+0.004
 Stretch+0.003
Ar+0.003
Neg logit contributions
ours-0.004
ry-0.004
upid-0.004
 Sandy-0.004
 bubble-0.004
Token,
Token ID11
Feature activation+5.53e-01

Pos logit contributions
Brown+0.057
Ell+0.053
Twe+0.052
Ar+0.051
Mic+0.047
Neg logit contributions
ry-0.029
 Sandy-0.028
 foot-0.027
ful-0.026
ours-0.026
Token to
Token ID284
Feature activation+1.05e+00

Pos logit contributions
Brown+0.015
Ell+0.014
Twe+0.013
 Stretch+0.013
Ar+0.013
Neg logit contributions
ours-0.011
ry-0.011
upid-0.011
 bubble-0.011
 Sandy-0.011
Token look
Token ID804
Feature activation-3.03e-01

Pos logit contributions
Brown+0.048
Ell+0.046
ve+0.043
Twe+0.043
hy+0.042
Neg logit contributions
 bubble-0.031
upid-0.030
ry-0.029
 shallow-0.028
 foot-0.027
Token brighter
Token ID26083
Feature activation+5.67e-01

Pos logit contributions
ours+0.011
upid+0.011
ry+0.011
 Sandy+0.010
 bubble+0.010
Neg logit contributions
Brown-0.011
Ell-0.011
Twe-0.010
 Stretch-0.010
Ar-0.010
Token and
Token ID290
Feature activation-1.34e-01

Pos logit contributions
Brown+0.027
Ell+0.026
Twe+0.024
hy+0.023
 Stretch+0.023
Neg logit contributions
ry-0.017
ours-0.016
upid-0.016
 bubble-0.015
 Sandy-0.015
Token happier
Token ID23030
Feature activation-5.26e-01

Pos logit contributions
ours+0.004
upid+0.004
ry+0.003
 Sandy+0.003
 bubble+0.003
Neg logit contributions
Brown-0.007
Ell-0.006
Twe-0.006
 Stretch-0.006
Ar-0.006
Token.
Token ID13
Feature activation-1.21e-01

Pos logit contributions
ours+0.021
upid+0.020
ry+0.020
about+0.019
ress+0.019
Neg logit contributions
Brown-0.018
Ell-0.017
 Stretch-0.016
 Brown-0.016
Ar-0.016
Token
Token ID220
Feature activation-5.07e-01

Pos logit contributions
ry+0.004
ours+0.004
upid+0.004
 bubble+0.004
 foot+0.004
Neg logit contributions
Brown-0.005
Ell-0.004
Twe-0.004
 Stretch-0.004
 Brown-0.004
Token
Token ID198
Feature activation+3.37e-01

Pos logit contributions
ry+0.020
ours+0.020
upid+0.020
 bubble+0.019
about+0.019
Neg logit contributions
Brown-0.018
Ell-0.017
Twe-0.015
 Stretch-0.015
Ar-0.015
TokenThe
Token ID464
Feature activation-2.43e-01

Pos logit contributions
Brown+0.015
Ell+0.014
hy+0.013
Twe+0.013
 Stretch+0.013
Neg logit contributions
upid-0.010
ry-0.010
ours-0.010
 bubble-0.009
 Sandy-0.009
Token friends
Token ID2460
Feature activation-1.10e+00

Pos logit contributions
ry+0.006
ours+0.006
upid+0.006
 Sandy+0.005
ress+0.005
Neg logit contributions
Brown-0.012
Ell-0.012
Ar-0.011
 Stretch-0.011
Twe-0.011
Token were
Token ID547
Feature activation-3.91e-01

Pos logit contributions
ours+0.046
upid+0.045
ry+0.043
 Sandy+0.042
 shallow+0.040
Neg logit contributions
Brown-0.037
Ell-0.035
 Stretch-0.032
 Brown-0.032
hy-0.031
Token friend
Token ID1545
Feature activation-7.78e-01

Pos logit contributions
Brown+0.008
Ell+0.008
Twe+0.007
 Stretch+0.007
hy+0.007
Neg logit contributions
ry-0.007
upid-0.007
ours-0.007
 bubble-0.007
 Sandy-0.007
Token in
Token ID287
Feature activation-3.15e-01

Pos logit contributions
ours+0.030
upid+0.029
ry+0.027
about+0.027
 Sandy+0.026
Neg logit contributions
Brown-0.029
Ell-0.028
 Stretch-0.025
hy-0.025
Twe-0.025
Token need
Token ID761
Feature activation+9.68e-02

Pos logit contributions
ours+0.009
ry+0.009
upid+0.009
about+0.008
 Sandy+0.008
Neg logit contributions
Brown-0.015
Ell-0.014
 Stretch-0.013
 Brown-0.013
Twe-0.013
Token.
Token ID13
Feature activation+1.71e-01

Pos logit contributions
Brown+0.004
Ell+0.004
Twe+0.004
 Stretch+0.004
Ar+0.004
Neg logit contributions
ours-0.003
ry-0.003
upid-0.003
 Sandy-0.003
 bubble-0.003
Token<|endoftext|>
Token ID50256
Feature activation-3.75e-01

Pos logit contributions
Brown+0.007
Ell+0.006
Twe+0.006
 Stretch+0.006
Ar+0.006
Neg logit contributions
ours-0.006
ry-0.006
upid-0.006
 Sandy-0.006
 bubble-0.006
TokenOnce
Token ID7454
Feature activation-4.64e-01

Pos logit contributions
ours+0.015
ry+0.015
upid+0.015
asing+0.014
 bubble+0.014
Neg logit contributions
Brown-0.013
Ell-0.012
 Stretch-0.011
Ar-0.011
Twe-0.011
Token upon
Token ID2402
Feature activation-6.40e-02

Pos logit contributions
ours+0.014
upid+0.014
ry+0.014
 bubble+0.013
about+0.013
Neg logit contributions
Brown-0.020
Ell-0.019
Twe-0.018
 Stretch-0.018
 Brown-0.017
Token a
Token ID257
Feature activation+8.90e-01

Pos logit contributions
ours+0.002
ry+0.002
upid+0.002
 Sandy+0.002
 bubble+0.002
Neg logit contributions
Brown-0.003
Ell-0.003
Twe-0.002
 Stretch-0.002
Ar-0.002
Token time
Token ID640
Feature activation+8.31e-01

Pos logit contributions
Brown+0.041
Ell+0.038
 Stretch+0.036
 Brown+0.036
Twe+0.036
Neg logit contributions
upid-0.026
ry-0.025
ours-0.025
 bubble-0.024
 foot-0.024
Token,
Token ID11
Feature activation-1.04e-01

Pos logit contributions
Brown+0.044
Ell+0.041
Twe+0.039
Ar+0.039
hy+0.038
Neg logit contributions
ry-0.021
upid-0.020
ours-0.019
 Sandy-0.019
 foot-0.018
Token a
Token ID257
Feature activation+2.15e-01

Pos logit contributions
ours+0.003
ry+0.003
upid+0.003
 Sandy+0.003
 bubble+0.003
Neg logit contributions
Brown-0.005
Ell-0.005
Twe-0.004
 Stretch-0.004
Ar-0.004
Token read
Token ID1100
Feature activation+1.56e+00

Pos logit contributions
Brown+0.015
Ell+0.015
Twe+0.014
 Stretch+0.013
Ar+0.013
Neg logit contributions
ours-0.012
ry-0.012
upid-0.012
 bubble-0.012
 Sandy-0.011
Token books
Token ID3835
Feature activation+9.68e-01

Pos logit contributions
Brown+0.062
Ell+0.061
Twe+0.058
ve+0.057
Ar+0.053
Neg logit contributions
 bubble-0.052
ry-0.051
ours-0.051
about-0.050
 of-0.048
Token with
Token ID351
Feature activation+7.94e-01

Pos logit contributions
Brown+0.049
Ell+0.047
Twe+0.046
Ar+0.044
 Barney+0.042
Neg logit contributions
ry-0.025
ours-0.023
upid-0.021
 foot-0.021
about-0.021
Token pictures
Token ID5986
Feature activation+1.08e+00

Pos logit contributions
Brown+0.030
Ell+0.029
Twe+0.026
hy+0.026
ve+0.025
Neg logit contributions
ours-0.030
ry-0.029
upid-0.029
 bubble-0.029
 foot-0.028
Token of
Token ID286
Feature activation+1.50e+00

Pos logit contributions
Brown+0.057
Ell+0.055
Twe+0.054
Ar+0.052
ve+0.051
Neg logit contributions
ry-0.023
ours-0.023
 bubble-0.021
upid-0.020
about-0.020
Token animals
Token ID4695
Feature activation+9.32e-03

Pos logit contributions
Brown+0.069
Ell+0.068
Twe+0.062
hy+0.062
ve+0.061
Neg logit contributions
 bubble-0.042
 Sandy-0.041
ours-0.040
 foot-0.039
ry-0.038
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
Brown+0.000
Ell+0.000
Twe+0.000
 Stretch+0.000
Ar+0.000
Neg logit contributions
ours-0.000
ry-0.000
upid-0.000
 Sandy-0.000
 bubble-0.000
Token cars
Token ID5006
Feature activation+1.21e+00

Pos logit contributions
Brown+0.047
Ell+0.046
Twe+0.044
ve+0.043
hy+0.042
Neg logit contributions
 bubble-0.032
ours-0.031
 foot-0.030
ry-0.030
 Sandy-0.030
Token.
Token ID13
Feature activation+2.41e-01

Pos logit contributions
Brown+0.064
Ell+0.061
Twe+0.060
Ar+0.059
 Barney+0.058
Neg logit contributions
ry-0.026
ours-0.025
upid-0.023
about-0.023
 of-0.022
Token One
Token ID1881
Feature activation-4.48e-01

Pos logit contributions
Brown+0.008
Ell+0.007
Twe+0.007
 Stretch+0.006
Ar+0.006
Neg logit contributions
ours-0.010
ry-0.010
upid-0.010
 Sandy-0.010
 bubble-0.010
Token day
Token ID1110
Feature activation+1.63e-01

Pos logit contributions
ours+0.020
upid+0.020
ry+0.019
 Sandy+0.018
asing+0.018
Neg logit contributions
Brown-0.014
Ell-0.013
hy-0.012
Twe-0.012
 Stretch-0.012
Token\n
Token ID198
Feature activation-7.31e-02

Pos logit contributions
ry+0.001
ours+0.001
upid+0.001
 bubble+0.001
 Sandy+0.001
Neg logit contributions
Brown-0.001
Ell-0.001
Twe-0.001
 Stretch-0.001
Ar-0.001
Token\n
Token ID198
Feature activation-7.13e-02

Pos logit contributions
ours+0.003
ry+0.003
upid+0.003
 bubble+0.003
 Sandy+0.003
Neg logit contributions
Brown-0.003
Ell-0.003
Twe-0.002
 Stretch-0.002
Ar-0.002
TokenTom
Token ID13787
Feature activation+2.45e-01

Pos logit contributions
ours+0.003
ry+0.003
upid+0.003
 bubble+0.003
 Sandy+0.003
Neg logit contributions
Brown-0.002
Ell-0.002
Twe-0.002
Ar-0.002
 Stretch-0.002
Token said
Token ID531
Feature activation+3.69e-01

Pos logit contributions
Brown+0.011
Ell+0.010
Twe+0.010
 Stretch+0.010
hy+0.010
Neg logit contributions
ry-0.007
ours-0.007
upid-0.007
 bubble-0.007
 Sandy-0.007
Token to
Token ID284
Feature activation-4.34e-01

Pos logit contributions
Brown+0.018
Ell+0.016
Twe+0.016
Ar+0.015
 Stretch+0.015
Neg logit contributions
ry-0.011
ours-0.011
upid-0.010
 Sandy-0.010
 foot-0.010
Token the
Token ID262
Feature activation-2.29e+00

Pos logit contributions
ry+0.016
ours+0.016
upid+0.015
asing+0.014
apping+0.014
Neg logit contributions
Brown-0.017
Ell-0.016
Twe-0.016
 Stretch-0.015
 Brown-0.015
Token mosquito
Token ID35446
Feature activation-6.77e-01

Pos logit contributions
ry+0.091
ress+0.085
upid+0.085
ours+0.080
asing+0.079
Neg logit contributions
Brown-0.079
 long-0.075
 Raven-0.075
Ell-0.075
 Stretch-0.074
Token,
Token ID11
Feature activation-5.22e-01

Pos logit contributions
ours+0.026
upid+0.026
ry+0.024
about+0.024
olt+0.023
Neg logit contributions
Brown-0.025
Ell-0.024
 Brown-0.023
 Stretch-0.022
 Barney-0.021
Token "
Token ID366
Feature activation+7.26e-01

Pos logit contributions
upid+0.018
ry+0.018
ours+0.018
asing+0.016
olt+0.016
Neg logit contributions
Brown-0.022
Ell-0.020
 Brown-0.020
 Stretch-0.020
 Barney-0.019
TokenHello
Token ID15496
Feature activation-2.23e+00

Pos logit contributions
Brown+0.028
Ell+0.026
hy+0.026
 Stretch+0.026
ve+0.025
Neg logit contributions
ry-0.024
ours-0.023
upid-0.023
 bubble-0.023
about-0.023
Token,
Token ID11
Feature activation-3.46e+00

Pos logit contributions
upid+0.109
asing+0.108
inda+0.103
olt+0.101
ours+0.099
Neg logit contributions
 Brown-0.056
Brown-0.049
 Stretch-0.049
 Christmas-0.048
 Blue-0.045
Token like
Token ID588
Feature activation+3.88e-01

Pos logit contributions
Brown+0.019
Ell+0.017
Twe+0.016
Ar+0.016
 Stretch+0.016
Neg logit contributions
upid-0.016
ours-0.016
ry-0.016
 Sandy-0.015
 bubble-0.015
Token fun
Token ID1257
Feature activation+1.81e-01

Pos logit contributions
Brown+0.015
Ell+0.014
Twe+0.013
Ar+0.012
 Stretch+0.012
Neg logit contributions
ours-0.014
upid-0.014
ry-0.014
 Sandy-0.014
 bubble-0.014
Token.
Token ID13
Feature activation-1.71e+00

Pos logit contributions
Brown+0.008
Ell+0.007
Twe+0.007
 Stretch+0.007
Ar+0.007
Neg logit contributions
ours-0.006
ry-0.006
upid-0.006
 Sandy-0.005
 bubble-0.005
Token Do
Token ID2141
Feature activation+7.28e-01

Pos logit contributions
ry+0.070
ours+0.067
upid+0.067
 shallow+0.064
 bubble+0.064
Neg logit contributions
Brown-0.062
 Stretch-0.057
 Brown-0.055
Ell-0.054
 Barney-0.053
Token you
Token ID345
Feature activation-1.27e+00

Pos logit contributions
Brown+0.025
Ell+0.023
Ar+0.022
Twe+0.021
hy+0.021
Neg logit contributions
upid-0.029
 Sandy-0.028
ours-0.028
ry-0.028
 bubble-0.027
Token mind
Token ID2000
Feature activation-2.70e+00

Pos logit contributions
ours+0.055
ress+0.051
ry+0.051
ays+0.048
about+0.046
Neg logit contributions
Brown-0.039
Ell-0.037
Ar-0.037
hy-0.037
Twe-0.036
Token if
Token ID611
Feature activation-5.35e-01

Pos logit contributions
olt+0.095
ours+0.088
 pitcher+0.087
 horror+0.081
ainer+0.080
Neg logit contributions
Brown-0.104
 Brown-0.097
 Stretch-0.089
Ell-0.087
 Barney-0.086
Token I
Token ID314
Feature activation-1.57e+00

Pos logit contributions
ours+0.025
ry+0.025
upid+0.025
about+0.024
 bubble+0.023
Neg logit contributions
Brown-0.015
Ell-0.013
 Stretch-0.012
Twe-0.012
 Brown-0.012
Token sit
Token ID1650
Feature activation-4.82e-01

Pos logit contributions
ours+0.056
ress+0.051
ry+0.050
about+0.048
upid+0.048
Neg logit contributions
Brown-0.060
Ar-0.057
ve-0.056
 Stretch-0.055
Ell-0.055
Token here
Token ID994
Feature activation-1.16e+00

Pos logit contributions
upid+0.017
ours+0.017
ry+0.016
 bubble+0.016
 Sandy+0.015
Neg logit contributions
Brown-0.019
Ell-0.018
Twe-0.017
 Stretch-0.017
 Brown-0.016
Token and
Token ID290
Feature activation+1.43e-01

Pos logit contributions
ours+0.037
upid+0.036
ry+0.035
 Sandy+0.033
asing+0.032
Neg logit contributions
Brown-0.049
Ell-0.046
 Stretch-0.044
 Brown-0.044
Twe-0.043
Token,
Token ID11
Feature activation+4.13e-02

Pos logit contributions
Brown+0.014
Ell+0.013
Twe+0.013
Ar+0.012
 Stretch+0.012
Neg logit contributions
ry-0.009
ours-0.009
upid-0.009
 Sandy-0.008
 bubble-0.008
Token "
Token ID366
Feature activation+8.00e-01

Pos logit contributions
Brown+0.002
Ell+0.002
Twe+0.002
 Stretch+0.002
Ar+0.002
Neg logit contributions
ours-0.001
ry-0.001
upid-0.001
 Sandy-0.001
 bubble-0.001
TokenLet
Token ID5756
Feature activation+6.80e-01

Pos logit contributions
Brown+0.031
Ell+0.029
hy+0.028
 Brown+0.028
 Stretch+0.028
Neg logit contributions
upid-0.027
ry-0.026
ours-0.026
 Sandy-0.026
 bubble-0.026
Token me
Token ID502
Feature activation-2.13e-01

Pos logit contributions
Brown+0.038
Ell+0.036
Ar+0.034
hy+0.034
Twe+0.034
Neg logit contributions
upid-0.014
ry-0.012
ours-0.012
 bubble-0.012
 Sandy-0.012
Token teach
Token ID4545
Feature activation+4.00e-01

Pos logit contributions
ours+0.008
ry+0.008
upid+0.008
 Sandy+0.007
about+0.007
Neg logit contributions
Brown-0.008
Ell-0.008
Twe-0.007
 Stretch-0.007
hy-0.007
Token you
Token ID345
Feature activation-2.60e+00

Pos logit contributions
Brown+0.013
Ell+0.012
Twe+0.011
Ar+0.011
hy+0.011
Neg logit contributions
upid-0.017
ours-0.017
ry-0.017
 Sandy-0.016
 bubble-0.016
Token something
Token ID1223
Feature activation-5.85e-01

Pos logit contributions
olt+0.083
ours+0.083
ry+0.081
 pitcher+0.079
idget+0.079
Neg logit contributions
Brown-0.096
 Stretch-0.092
 novels-0.091
Ell-0.090
 Ellie-0.088
Token young
Token ID1862
Feature activation-3.34e-01

Pos logit contributions
ours+0.023
upid+0.023
ry+0.022
 Sandy+0.021
about+0.021
Neg logit contributions
Brown-0.021
Ell-0.019
 Stretch-0.018
Twe-0.018
 Brown-0.017
Token dinosaur
Token ID28257
Feature activation+4.12e-01

Pos logit contributions
ours+0.011
ry+0.011
upid+0.011
 Sandy+0.010
 bubble+0.010
Neg logit contributions
Brown-0.014
Ell-0.013
 Stretch-0.012
Twe-0.012
hy-0.012
Token!"
Token ID2474
Feature activation+4.59e-01

Pos logit contributions
Brown+0.018
Ell+0.018
Twe+0.016
Ar+0.016
 Stretch+0.016
Neg logit contributions
ry-0.013
ours-0.013
upid-0.013
 bubble-0.012
 Sandy-0.012
Token Terry
Token ID14286
Feature activation+2.60e-01

Pos logit contributions
Brown+0.016
Ell+0.015
Twe+0.014
Ar+0.013
 Stretch+0.013
Neg logit contributions
upid-0.018
ours-0.018
ry-0.018
 Sandy-0.018
 bubble-0.018
Token.
Token ID13
Feature activation+4.82e-01

Pos logit contributions
ours+0.019
upid+0.018
ry+0.018
about+0.017
ress+0.017
Neg logit contributions
Brown-0.017
Ell-0.016
 Stretch-0.016
 Brown-0.015
Twe-0.015
Token Max
Token ID5436
Feature activation+6.46e-01

Pos logit contributions
Brown+0.017
Ell+0.016
Twe+0.015
Ar+0.014
hy+0.014
Neg logit contributions
ry-0.019
ours-0.018
upid-0.018
 Sandy-0.018
 bubble-0.018
Token tried
Token ID3088
Feature activation-9.29e-01

Pos logit contributions
Brown+0.029
Ell+0.027
hy+0.026
Twe+0.026
 Stretch+0.025
Neg logit contributions
ry-0.019
ours-0.019
 bubble-0.018
 Sandy-0.018
upid-0.018
Token to
Token ID284
Feature activation-4.78e-01

Pos logit contributions
ours+0.043
ry+0.043
upid+0.042
olt+0.039
asing+0.039
Neg logit contributions
Brown-0.027
Ell-0.026
 Brown-0.025
 Stretch-0.023
 Barney-0.022
Token catch
Token ID4929
Feature activation-7.11e-01

Pos logit contributions
ours+0.017
ry+0.016
upid+0.015
asing+0.014
apping+0.014
Neg logit contributions
Brown-0.020
Ell-0.020
Twe-0.019
Ar-0.018
 Stretch-0.018
Token the
Token ID262
Feature activation-2.29e+00

Pos logit contributions
ours+0.023
ry+0.023
upid+0.021
apping+0.021
about+0.020
Neg logit contributions
Brown-0.031
Ell-0.029
 Brown-0.029
 Stretch-0.029
 Barney-0.028
Token bug
Token ID5434
Feature activation-9.05e-01

Pos logit contributions
ours+0.093
ry+0.092
asing+0.080
ather+0.078
ress+0.077
Neg logit contributions
Ell-0.077
ve-0.075
 Ellie-0.071
Brown-0.071
 long-0.070
Token,
Token ID11
Feature activation-9.17e-01

Pos logit contributions
ours+0.036
upid+0.034
ry+0.034
olt+0.032
about+0.032
Neg logit contributions
Brown-0.030
Ell-0.030
 Brown-0.028
 Stretch-0.027
Twe-0.027
Token but
Token ID475
Feature activation-7.43e-01

Pos logit contributions
ry+0.041
ours+0.040
upid+0.039
ress+0.036
asing+0.036
Neg logit contributions
Brown-0.029
Ell-0.028
 Brown-0.026
 Stretch-0.026
 Barney-0.026
Token it
Token ID340
Feature activation-5.11e-01

Pos logit contributions
ours+0.030
ry+0.029
upid+0.028
apping+0.027
asing+0.027
Neg logit contributions
Brown-0.027
 Stretch-0.025
Ell-0.025
 Brown-0.025
 Barney-0.024
Token was
Token ID373
Feature activation-8.03e-01

Pos logit contributions
upid+0.018
ours+0.018
ry+0.017
apping+0.016
 Sandy+0.016
Neg logit contributions
Brown-0.021
Ell-0.020
Twe-0.018
 Stretch-0.018
 Brown-0.018
Token excited
Token ID6568
Feature activation+1.54e+00

Pos logit contributions
Brown+0.019
Ell+0.018
Twe+0.017
 Stretch+0.016
hy+0.016
Neg logit contributions
ours-0.015
ry-0.015
upid-0.015
 Sandy-0.014
 foot-0.014
Token that
Token ID326
Feature activation+4.71e-02

Pos logit contributions
Brown+0.087
Ell+0.082
Ar+0.082
Twe+0.080
Mic+0.078
Neg logit contributions
ry-0.036
 Sandy-0.034
ours-0.030
ired-0.029
 of-0.028
Token he
Token ID339
Feature activation+2.34e-01

Pos logit contributions
Brown+0.001
Ell+0.001
Twe+0.001
 Stretch+0.001
Ar+0.001
Neg logit contributions
ours-0.002
ry-0.002
upid-0.002
 Sandy-0.002
 bubble-0.002
Token wanted
Token ID2227
Feature activation-1.02e+00

Pos logit contributions
Brown+0.010
Ell+0.010
Twe+0.009
 Stretch+0.009
Ar+0.009
Neg logit contributions
ry-0.007
ours-0.007
upid-0.007
 Sandy-0.007
 bubble-0.007
Token to
Token ID284
Feature activation-1.90e+00

Pos logit contributions
upid+0.048
ry+0.048
ours+0.047
olt+0.045
asing+0.044
Neg logit contributions
Brown-0.027
 Brown-0.025
Ell-0.024
 Stretch-0.023
 Barney-0.023
Token do
Token ID466
Feature activation-2.20e+00

Pos logit contributions
ours+0.087
ry+0.077
ays+0.077
asing+0.074
icking+0.070
Neg logit contributions
Brown-0.064
 Stretch-0.059
Ar-0.058
 Brown-0.057
 barely-0.056
Token it
Token ID340
Feature activation-7.85e-01

Pos logit contributions
upid+0.080
ours+0.079
ry+0.078
asing+0.076
icking+0.076
Neg logit contributions
Brown-0.072
 Stretch-0.070
 saf-0.069
 Brown-0.068
ve-0.068
Token,
Token ID11
Feature activation-3.91e-01

Pos logit contributions
ours+0.031
upid+0.029
ry+0.029
ress+0.027
asing+0.026
Neg logit contributions
Brown-0.028
Ell-0.026
 Brown-0.025
 Stretch-0.025
Twe-0.024
Token too
Token ID1165
Feature activation-6.34e-01

Pos logit contributions
ours+0.013
ry+0.013
upid+0.013
 bubble+0.012
asing+0.012
Neg logit contributions
Brown-0.016
Ell-0.015
Twe-0.014
 Stretch-0.014
 Brown-0.014
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
ours+0.025
upid+0.024
ry+0.023
asing+0.023
ress+0.022
Neg logit contributions
Brown-0.023
Ell-0.020
 Stretch-0.020
 Brown-0.019
Twe-0.019
Token Mom
Token ID11254
Feature activation-6.05e-01

Pos logit contributions
Brown+0.041
Ell+0.039
Twe+0.036
Ar+0.035
ve+0.034
Neg logit contributions
ours-0.045
ry-0.045
 foot-0.044
 Sandy-0.044
about-0.044