TOP ACTIVATIONS
MAX = 5.452

. He saw the water boiling and the caul iflower getting
a pot of soup was boiling . Lily bumped into the
. When the soup was boiling , Mama went to get
them in the pot of boiling water and waited for them
The bottle lands in the boiling soup and spl ashes everywhere
added something special - bread cr umbs . The duck sn
in a deep pot of boiling water . "
and grabbed a pot of boiling water . She poured it
with some water and started boiling it down . The smell
olive in a pot of boiling water and after a few
sauce and keep it from boiling over . They are happy
did not hear the cocoa boiling over . They did not
noise . It was a sne e ze . Joe looked
and soon the pot was boiling . He took the pot
It 's just a little sne e ze . It will
She had big pot of boiling water , and a big
a special gift from the sne e ze . They went
closer . It opens its be ak and sl aps Lily
the bag was full of salty peanuts . The little girl
, that was a big sne e ze !"

MIN ACTIVATIONS
MIN = -6.818

sur fer . Whenever the sur fer balanced on the board
wanted to be a good gol fer like his dad .
arm chair . THE <|endoftext|> Once upon a time
before dinner . <|endoftext|> S aman tha was a lazy girl
came back . THE <|endoftext|> Once upon a time
strawberries !" S aman tha and Dan thought it
B ella watched as the sur fer rode on the waves
Peter wasn 't the greatest sur fer , but he loved
again tomorrow . THE <|endoftext|> Once upon a time
hands . The sur fer saw Bella and waved
He was a very successful clim ber . Every day he
he could be a better sur fer . He hurried to
alone , without a cool hairst yle and without a solution
pond ?" The gol fer replied , " I
but she liked watching the sur fer . Whenever the sur
was no . <|endoftext|> S aman tha opened the door and
The town spe ople had to find a
! I 'm the best gol fer in the world and
. You are a great sur fer . We are proud
dolls and pretend they were mer maid s or pirates .

INTERVAL 2.384 - 5.452
CONTAINS 0.897%

counts them every day . One , two , three ,
her strength . One day , Jill was playing
can use this . It 's like paper , but bigger
looked inside . It was also empty . He put it
belt . It was a bit messy , but he liked

INTERVAL -0.683 - 2.384
CONTAINS 78.054%

not good . The house is old and broken . The
to be very careful not to hurt anyone . Bob promised
, she wasn 't hurt but she was a bit scared
she couldn 't open it . She tried and tried but
to get to their homes . They were so busy that

INTERVAL -3.750 - -0.683
CONTAINS 21.029%

, " That 's a great idea , Lily ! Let
way home , but he didn 't know where Pink lived
pepper . Sam said , " Hi Tim ! Do you
head on her pillow and soon fell asleep . <|endoftext|> Once
he can still make a new picture . Tim my felt

INTERVAL -6.818 - -3.750
CONTAINS 0.021%

so he promised to be Beat rice 's friend . From
agreed to take the aer opl ane on a journey .
Lily loved the big Fer ris wheel the most . She
, gems and even golden jew ellery . Molly was so
trunk of the tree and sc ur ried along the ground
Token.
Token ID13
Feature activation+4.67e-01

Pos logit contributions
 chew+0.041
 snowball+0.036
 puck+0.035
 hop+0.035
 skate+0.035
Neg logit contributions
keeper-0.025
iring-0.024
rier-0.023
spe-0.023
bel-0.023
Token He
Token ID679
Feature activation+8.35e-01

Pos logit contributions
 chew+0.012
 pile+0.010
 hop+0.010
 nose+0.010
 spaghetti+0.010
Neg logit contributions
Lay-0.011
ires-0.010
overed-0.010
iring-0.010
Ap-0.010
Token saw
Token ID2497
Feature activation+8.52e-01

Pos logit contributions
 chew+0.021
 hop+0.019
 pile+0.018
 spaghetti+0.018
 snowball+0.018
Neg logit contributions
Lay-0.021
ager-0.020
ve-0.019
la-0.019
leaf-0.019
Token the
Token ID262
Feature activation-4.90e-02

Pos logit contributions
 chew+0.023
 hop+0.021
 pile+0.020
 bisc+0.019
 nose+0.019
Neg logit contributions
Lay-0.019
ires-0.018
iring-0.018
la-0.017
overed-0.017
Token water
Token ID1660
Feature activation+1.30e+00

Pos logit contributions
ve+0.001
 singer+0.001
Lay+0.001
iring+0.001
leaf+0.001
Neg logit contributions
 chew-0.001
 hop-0.001
 jump-0.001
 skate-0.001
 snowball-0.001
Token boiling
Token ID24372
Feature activation+5.45e+00

Pos logit contributions
 chew+0.036
 snowball+0.031
 hop+0.031
 spaghetti+0.031
 bisc+0.031
Neg logit contributions
overed-0.026
iring-0.026
spe-0.024
ires-0.024
bel-0.024
Token and
Token ID290
Feature activation+2.03e+00

Pos logit contributions
 chew+0.177
 skate+0.161
 hop+0.157
 puck+0.157
 stack+0.150
Neg logit contributions
led-0.091
iring-0.087
ed-0.085
ires-0.084
col-0.083
Token the
Token ID262
Feature activation-2.30e-01

Pos logit contributions
 chew+0.049
 hop+0.045
 snowball+0.044
 bone+0.043
 bisc+0.042
Neg logit contributions
bel-0.046
Lay-0.045
overed-0.044
spe-0.044
Ap-0.043
Token caul
Token ID46087
Feature activation-8.53e-03

Pos logit contributions
 singer+0.006
ve+0.006
Lay+0.006
leaf+0.006
iring+0.006
Neg logit contributions
 chew-0.005
 hop-0.004
 skate-0.004
 jump-0.004
 snowball-0.004
Tokeniflower
Token ID42642
Feature activation+3.59e-01

Pos logit contributions
Lay+0.000
leaf+0.000
Mad+0.000
la+0.000
spe+0.000
Neg logit contributions
 chew-0.000
 pile-0.000
 hop-0.000
 spaghetti-0.000
 carrot-0.000
Token getting
Token ID1972
Feature activation+1.54e+00

Pos logit contributions
 chew+0.010
 snowball+0.009
 hop+0.009
 bisc+0.009
 skate+0.009
Neg logit contributions
ve-0.006
overed-0.006
iring-0.006
ager-0.006
ires-0.006
Token a
Token ID257
Feature activation-3.50e-01

Pos logit contributions
 chew+0.019
 hop+0.017
 snowball+0.016
 bisc+0.016
 skate+0.016
Neg logit contributions
Lay-0.014
la-0.014
bel-0.013
spe-0.013
overed-0.013
Token pot
Token ID1787
Feature activation+7.07e-01

Pos logit contributions
 singer+0.008
bel+0.008
spe+0.008
ve+0.008
Lay+0.008
Neg logit contributions
 chew-0.009
 hop-0.008
 spaghetti-0.007
 skate-0.007
 Sammy-0.007
Token of
Token ID286
Feature activation+4.14e-01

Pos logit contributions
 chew+0.019
 snowball+0.016
 spaghetti+0.016
 hop+0.016
 carrot+0.016
Neg logit contributions
Lay-0.016
overed-0.016
ve-0.016
bel-0.016
ager-0.015
Token soup
Token ID17141
Feature activation+1.46e+00

Pos logit contributions
 chew+0.009
 hop+0.008
 skate+0.007
 pile+0.007
 puck+0.007
Neg logit contributions
leaf-0.011
Lay-0.011
 singer-0.011
bel-0.010
iring-0.010
Token was
Token ID373
Feature activation+1.78e+00

Pos logit contributions
 chew+0.038
 hop+0.034
 snowball+0.033
 spaghetti+0.032
 carrot+0.032
Neg logit contributions
Lay-0.033
ve-0.033
overed-0.033
ager-0.033
la-0.033
Token boiling
Token ID24372
Feature activation+5.43e+00

Pos logit contributions
 chew+0.047
 hop+0.043
 skate+0.039
 puck+0.039
 bisc+0.039
Neg logit contributions
iring-0.038
ve-0.038
Lay-0.037
overed-0.037
la-0.037
Token.
Token ID13
Feature activation+6.25e-01

Pos logit contributions
 chew+0.169
 hop+0.156
 skate+0.151
 puck+0.150
 jump+0.141
Neg logit contributions
iring-0.101
led-0.099
col-0.096
ve-0.094
ager-0.094
Token Lily
Token ID20037
Feature activation-5.37e-01

Pos logit contributions
 chew+0.016
 hop+0.014
 pile+0.013
 spaghetti+0.013
 nose+0.013
Neg logit contributions
Lay-0.016
ve-0.015
ager-0.015
la-0.015
leaf-0.015
Token bumped
Token ID38034
Feature activation-1.74e+00

Pos logit contributions
ager+0.012
Lay+0.012
ve+0.012
la+0.011
elia+0.011
Neg logit contributions
 chew-0.015
 hop-0.013
 spaghetti-0.013
 bisc-0.013
 pile-0.012
Token into
Token ID656
Feature activation+4.57e-01

Pos logit contributions
Lay+0.036
ager+0.034
leaf+0.033
spe+0.033
la+0.032
Neg logit contributions
 chew-0.053
 hop-0.047
 spaghetti-0.047
 pile-0.047
 nose-0.047
Token the
Token ID262
Feature activation+1.18e-01

Pos logit contributions
 chew+0.014
 puck+0.013
 hop+0.013
 skate+0.012
 bisc+0.012
Neg logit contributions
Lay-0.008
ve-0.008
iring-0.008
May-0.008
 yourselves-0.008
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
 chew+0.046
 hop+0.040
 snowball+0.040
 bone+0.038
 spaghetti+0.038
Neg logit contributions
Lay-0.037
overed-0.037
la-0.037
ve-0.036
bel-0.036
Token When
Token ID1649
Feature activation-6.63e-01

Pos logit contributions
 chew+0.031
 hop+0.026
 pile+0.026
 snowball+0.026
 spaghetti+0.025
Neg logit contributions
Lay-0.029
la-0.027
overed-0.027
iring-0.026
ires-0.026
Token the
Token ID262
Feature activation-3.19e-01

Pos logit contributions
Lay+0.012
la+0.011
elia+0.011
ve+0.011
 singer+0.011
Neg logit contributions
 chew-0.021
 hop-0.019
 pile-0.019
 snowball-0.019
 nose-0.018
Token soup
Token ID17141
Feature activation+1.80e+00

Pos logit contributions
 singer+0.009
leaf+0.009
ager+0.009
ve+0.009
iring+0.009
Neg logit contributions
 chew-0.006
 hop-0.005
 snowball-0.005
 skate-0.005
 puck-0.005
Token was
Token ID373
Feature activation+8.58e-01

Pos logit contributions
 chew+0.051
 snowball+0.044
 hop+0.044
 bone+0.044
 spaghetti+0.043
Neg logit contributions
Lay-0.036
ve-0.036
overed-0.035
ager-0.035
bel-0.035
Token boiling
Token ID24372
Feature activation+5.29e+00

Pos logit contributions
 chew+0.022
 hop+0.020
 bone+0.019
 snowball+0.019
 bisc+0.018
Neg logit contributions
Lay-0.020
overed-0.020
ve-0.020
iring-0.019
la-0.019
Token,
Token ID11
Feature activation+9.47e-01

Pos logit contributions
 chew+0.142
 skate+0.131
 puck+0.127
 hop+0.122
 bone+0.120
Neg logit contributions
led-0.117
iring-0.113
overed-0.111
ve-0.109
Lay-0.108
Token Mama
Token ID40084
Feature activation+4.89e-01

Pos logit contributions
 chew+0.024
 hop+0.021
 pile+0.021
 snowball+0.020
 spaghetti+0.020
Neg logit contributions
Lay-0.024
la-0.022
iring-0.022
ve-0.022
leaf-0.022
Token went
Token ID1816
Feature activation+7.28e-01

Pos logit contributions
 chew+0.013
 hop+0.011
 pile+0.011
 spaghetti+0.010
 snowball+0.010
Neg logit contributions
Lay-0.012
la-0.012
ve-0.011
ager-0.011
elia-0.011
Token to
Token ID284
Feature activation+2.35e-01

Pos logit contributions
 chew+0.019
 pile+0.017
 hop+0.017
 spaghetti+0.017
 nose+0.016
Neg logit contributions
Lay-0.019
spe-0.017
ager-0.017
ires-0.017
Mad-0.017
Token get
Token ID651
Feature activation+1.10e+00

Pos logit contributions
 chew+0.004
 snowball+0.004
 bone+0.004
 puck+0.004
 hop+0.004
Neg logit contributions
Lay-0.007
leaf-0.007
May-0.006
ager-0.006
ve-0.006
Token them
Token ID606
Feature activation+5.96e-01

Pos logit contributions
 chew+0.054
 hop+0.048
 skate+0.047
 bisc+0.045
 pile+0.045
Neg logit contributions
Lay-0.037
la-0.036
ires-0.036
ve-0.035
yl-0.035
Token in
Token ID287
Feature activation+4.32e-01

Pos logit contributions
 chew+0.020
 hop+0.017
 pile+0.017
 snowball+0.017
 spaghetti+0.017
Neg logit contributions
Lay-0.010
ve-0.010
bel-0.009
la-0.009
overed-0.009
Token the
Token ID262
Feature activation-1.53e-01

Pos logit contributions
 chew+0.014
 hop+0.013
 puck+0.012
 skate+0.012
 bisc+0.012
Neg logit contributions
Lay-0.007
May-0.006
yl-0.006
An-0.006
Ap-0.006
Token pot
Token ID1787
Feature activation+1.71e+00

Pos logit contributions
Lay+0.004
ager+0.004
Ap+0.004
ires+0.004
leaf+0.004
Neg logit contributions
 chew-0.003
 hop-0.002
 snowball-0.002
 jump-0.002
 skate-0.002
Token of
Token ID286
Feature activation+5.92e-01

Pos logit contributions
 chew+0.052
 snowball+0.045
 hop+0.045
 skate+0.044
 puck+0.044
Neg logit contributions
bel-0.032
iring-0.031
led-0.030
but-0.030
overed-0.030
Token boiling
Token ID24372
Feature activation+5.25e+00

Pos logit contributions
 chew+0.014
 hop+0.013
 skate+0.013
 puck+0.013
 snowball+0.013
Neg logit contributions
ires-0.014
leaf-0.013
 yourselves-0.013
 talented-0.012
iring-0.012
Token water
Token ID1660
Feature activation+2.29e+00

Pos logit contributions
 chew+0.129
 hop+0.113
 skate+0.112
 puck+0.107
 jump+0.102
Neg logit contributions
led-0.125
col-0.124
ires-0.123
leaf-0.122
bel-0.122
Token and
Token ID290
Feature activation+1.59e+00

Pos logit contributions
 chew+0.067
 hop+0.059
 snowball+0.057
 skate+0.055
 puck+0.055
Neg logit contributions
iring-0.044
overed-0.043
la-0.043
bel-0.043
ve-0.042
Token waited
Token ID13488
Feature activation+1.41e+00

Pos logit contributions
 chew+0.038
 snowball+0.035
 hop+0.035
 Today+0.035
 bone+0.034
Neg logit contributions
bel-0.036
led-0.034
overed-0.034
Lay-0.033
we-0.033
Token for
Token ID329
Feature activation+1.13e-02

Pos logit contributions
 chew+0.037
 hop+0.032
 snowball+0.031
 bone+0.031
 pile+0.031
Neg logit contributions
ve-0.033
Lay-0.033
overed-0.033
ires-0.032
iring-0.032
Token them
Token ID606
Feature activation-1.87e-01

Pos logit contributions
 chew+0.000
 snowball+0.000
 puck+0.000
bowl+0.000
 hop+0.000
Neg logit contributions
ires-0.000
 yourselves-0.000
essa-0.000
yl-0.000
Lay-0.000
Token The
Token ID383
Feature activation+8.10e-01

Pos logit contributions
 chew+0.004
 hop+0.003
 pile+0.003
 spaghetti+0.003
 nose+0.003
Neg logit contributions
Lay-0.004
la-0.004
ager-0.004
ve-0.004
leaf-0.004
Token bottle
Token ID9294
Feature activation+6.28e-02

Pos logit contributions
 chew+0.018
 hop+0.014
 snowball+0.014
 skate+0.013
 jump+0.013
Neg logit contributions
Lay-0.022
 singer-0.022
ve-0.021
ager-0.021
leaf-0.021
Token lands
Token ID8604
Feature activation+4.97e-01

Pos logit contributions
 chew+0.001
 pile+0.001
 hop+0.001
 jump+0.001
 nose+0.001
Neg logit contributions
Lay-0.002
Mad-0.002
May-0.002
leaf-0.002
otte-0.002
Token in
Token ID287
Feature activation+1.12e+00

Pos logit contributions
 chew+0.014
 hop+0.012
 pile+0.012
 spaghetti+0.012
 carrot+0.012
Neg logit contributions
Lay-0.011
ve-0.011
overed-0.011
la-0.010
bel-0.010
Token the
Token ID262
Feature activation+7.04e-01

Pos logit contributions
 chew+0.035
 hop+0.031
 skate+0.030
 bisc+0.030
 puck+0.030
Neg logit contributions
Lay-0.021
Kat-0.020
 yourselves-0.019
yl-0.019
May-0.019
Token boiling
Token ID24372
Feature activation+5.14e+00

Pos logit contributions
 chew+0.015
 hop+0.012
 snowball+0.011
 jump+0.011
 puck+0.011
Neg logit contributions
 singer-0.019
Lay-0.019
May-0.019
leaf-0.019
ager-0.018
Token soup
Token ID17141
Feature activation+2.20e+00

Pos logit contributions
 chew+0.127
 hop+0.109
 skate+0.106
 puck+0.102
 snowball+0.099
Neg logit contributions
ager-0.123
leaf-0.121
bel-0.119
col-0.119
han-0.118
Token and
Token ID290
Feature activation+1.24e+00

Pos logit contributions
 chew+0.059
 hop+0.051
 snowball+0.050
 spaghetti+0.048
 bisc+0.047
Neg logit contributions
ve-0.046
ager-0.046
iring-0.046
overed-0.046
Lay-0.046
Token spl
Token ID4328
Feature activation+1.13e+00

Pos logit contributions
 chew+0.029
 hop+0.026
 snowball+0.025
 pile+0.025
 spaghetti+0.024
Neg logit contributions
Lay-0.031
la-0.030
overed-0.029
leaf-0.029
ager-0.029
Tokenashes
Token ID7465
Feature activation+2.49e+00

Pos logit contributions
 chew+0.033
 hop+0.030
 pile+0.029
 carrot+0.029
 spaghetti+0.028
Neg logit contributions
ve-0.023
Lay-0.023
overed-0.023
ager-0.023
ires-0.023
Token everywhere
Token ID8347
Feature activation-1.05e+00

Pos logit contributions
 chew+0.070
 hop+0.060
 pile+0.058
 nose+0.058
 snowball+0.058
Neg logit contributions
Lay-0.051
overed-0.050
la-0.049
ve-0.049
leaf-0.049
Token added
Token ID2087
Feature activation+2.19e+00

Pos logit contributions
Lay+0.000
la+0.000
ager+0.000
leaf+0.000
ve+0.000
Neg logit contributions
 chew-0.000
 hop-0.000
 pile-0.000
 spaghetti-0.000
 nose-0.000
Token something
Token ID1223
Feature activation-4.59e-01

Pos logit contributions
 chew+0.053
 hop+0.045
 skate+0.044
 pile+0.043
 nose+0.042
Neg logit contributions
Lay-0.054
leaf-0.053
la-0.052
overed-0.051
iring-0.051
Token special
Token ID2041
Feature activation+5.44e-01

Pos logit contributions
Lay+0.011
ager+0.010
la+0.010
ve+0.010
leaf+0.010
Neg logit contributions
 chew-0.013
 hop-0.011
 pile-0.011
 spaghetti-0.011
 nose-0.011
Token -
Token ID532
Feature activation+2.63e+00

Pos logit contributions
 chew+0.015
 hop+0.013
 pile+0.013
 snowball+0.013
 spaghetti+0.013
Neg logit contributions
Lay-0.013
leaf-0.012
ager-0.012
la-0.012
ve-0.012
Token bread
Token ID8509
Feature activation+3.38e+00

Pos logit contributions
 chew+0.067
 pile+0.057
 hop+0.057
 snowball+0.055
 bisc+0.055
Neg logit contributions
Lay-0.063
leaf-0.061
iring-0.060
la-0.059
overed-0.058
Tokencr
Token ID6098
Feature activation+5.07e+00

Pos logit contributions
 chew+0.088
 snowball+0.077
 hop+0.074
 spaghetti+0.072
 pile+0.071
Neg logit contributions
Lay-0.078
ve-0.077
iring-0.076
la-0.075
overed-0.075
Tokenumbs
Token ID18146
Feature activation+3.82e+00

Pos logit contributions
 chew+0.121
 nose+0.105
 skate+0.102
 bone+0.101
 hop+0.101
Neg logit contributions
ve-0.145
Lay-0.134
iring-0.132
ires-0.128
la-0.127
Token.
Token ID13
Feature activation+3.76e-01

Pos logit contributions
 chew+0.107
 snowball+0.096
 hop+0.094
 skate+0.092
 pile+0.091
Neg logit contributions
ve-0.079
Lay-0.076
overed-0.076
leaf-0.074
la-0.073
Token The
Token ID383
Feature activation+6.97e-01

Pos logit contributions
 chew+0.010
 hop+0.009
 pile+0.009
 jump+0.009
 bisc+0.009
Neg logit contributions
Lay-0.008
May-0.008
overed-0.008
ires-0.008
leaf-0.008
Token duck
Token ID22045
Feature activation+1.06e+00

Pos logit contributions
 chew+0.019
 spaghetti+0.016
 hop+0.016
 skate+0.016
 snowball+0.015
Neg logit contributions
Lay-0.015
iring-0.014
 singer-0.014
ve-0.014
ager-0.014
Token sn
Token ID3013
Feature activation+1.03e+00

Pos logit contributions
 chew+0.029
 spaghetti+0.025
 hop+0.024
 snowball+0.024
 pile+0.024
Neg logit contributions
Lay-0.025
ve-0.024
ager-0.023
spe-0.023
la-0.023
Token in
Token ID287
Feature activation-6.51e-03

Pos logit contributions
 chew+0.020
 hop+0.018
 pile+0.018
 nose+0.018
 spaghetti+0.018
Neg logit contributions
Lay-0.013
Mad-0.012
leaf-0.012
ager-0.012
Ap-0.012
Token a
Token ID257
Feature activation+5.49e-01

Pos logit contributions
Lay+0.000
May+0.000
leaf+0.000
Mad+0.000
ve+0.000
Neg logit contributions
 chew-0.000
 hop-0.000
 puck-0.000
 snowball-0.000
 skate-0.000
Token deep
Token ID2769
Feature activation+1.92e+00

Pos logit contributions
 chew+0.012
 hop+0.010
 puck+0.009
 spaghetti+0.009
 skate+0.009
Neg logit contributions
Lay-0.014
May-0.014
ve-0.014
ager-0.014
 singer-0.013
Token pot
Token ID1787
Feature activation+1.86e+00

Pos logit contributions
 chew+0.039
 spaghetti+0.033
 skate+0.033
 puck+0.033
 hop+0.032
Neg logit contributions
Lay-0.051
han-0.050
ager-0.050
yl-0.047
elia-0.047
Token of
Token ID286
Feature activation+4.40e-01

Pos logit contributions
 chew+0.053
 hop+0.046
 snowball+0.045
 puck+0.044
 skate+0.043
Neg logit contributions
la-0.038
Lay-0.038
bel-0.038
ager-0.038
ve-0.038
Token boiling
Token ID24372
Feature activation+5.06e+00

Pos logit contributions
 chew+0.010
 hop+0.009
 puck+0.009
 skate+0.009
 jump+0.008
Neg logit contributions
Lay-0.011
leaf-0.011
la-0.010
May-0.010
iring-0.010
Token water
Token ID1660
Feature activation+1.84e+00

Pos logit contributions
 chew+0.114
 skate+0.100
 hop+0.099
 puck+0.099
 jump+0.091
Neg logit contributions
leaf-0.130
Lay-0.128
ager-0.127
ve-0.126
iring-0.125
Token.
Token ID13
Feature activation+2.85e-01

Pos logit contributions
 chew+0.049
 hop+0.043
 snowball+0.041
 spaghetti+0.040
 pile+0.040
Neg logit contributions
Lay-0.043
la-0.041
ve-0.041
overed-0.040
ager-0.040
Token
Token ID198
Feature activation+1.48e-01

Pos logit contributions
 chew+0.007
 hop+0.006
 pile+0.006
 spaghetti+0.006
 nose+0.006
Neg logit contributions
Lay-0.007
ager-0.007
ve-0.007
leaf-0.007
la-0.007
Token
Token ID198
Feature activation-6.40e-01

Pos logit contributions
 chew+0.004
 hop+0.003
 pile+0.003
 spaghetti+0.003
 nose+0.003
Neg logit contributions
Lay-0.004
ager-0.004
leaf-0.004
la-0.004
spe-0.004
Token"
Token ID1
Feature activation-1.83e+00

Pos logit contributions
leaf+0.016
Lay+0.016
overed+0.016
ires+0.015
la+0.015
Neg logit contributions
 chew-0.016
 hop-0.015
 pile-0.013
 jump-0.013
 bat-0.013
Token and
Token ID290
Feature activation-2.43e-01

Pos logit contributions
 chew+0.006
 hop+0.005
 snowball+0.005
 carrot+0.005
 bisc+0.005
Neg logit contributions
Lay-0.006
ager-0.006
la-0.005
spe-0.005
overed-0.005
Token grabbed
Token ID13176
Feature activation-4.06e-01

Pos logit contributions
Lay+0.007
ve+0.006
ager+0.006
iring+0.006
Mad+0.006
Neg logit contributions
 chew-0.006
 hop-0.005
 pile-0.005
 spaghetti-0.005
 nose-0.005
Token a
Token ID257
Feature activation-5.11e-01

Pos logit contributions
Lay+0.010
la+0.009
leaf+0.009
ve+0.009
May+0.009
Neg logit contributions
 chew-0.011
 hop-0.009
 pile-0.009
 snowball-0.009
 nose-0.009
Token pot
Token ID1787
Feature activation+1.73e+00

Pos logit contributions
Lay+0.015
ve+0.015
leaf+0.015
ager+0.015
May+0.015
Neg logit contributions
 chew-0.010
 hop-0.008
 spaghetti-0.008
 nose-0.007
 jump-0.007
Token of
Token ID286
Feature activation+2.12e-01

Pos logit contributions
 chew+0.055
 snowball+0.049
 hop+0.048
 puck+0.047
 meat+0.046
Neg logit contributions
Lay-0.029
bel-0.028
la-0.028
iring-0.028
overed-0.028
Token boiling
Token ID24372
Feature activation+5.04e+00

Pos logit contributions
 chew+0.005
 hop+0.004
 puck+0.004
 skate+0.004
 pile+0.004
Neg logit contributions
leaf-0.006
Lay-0.006
ve-0.005
la-0.005
ires-0.005
Token water
Token ID1660
Feature activation+1.85e+00

Pos logit contributions
 chew+0.112
 hop+0.096
 skate+0.096
 puck+0.095
 bone+0.089
Neg logit contributions
leaf-0.132
ve-0.131
ager-0.130
Lay-0.130
han-0.129
Token.
Token ID13
Feature activation+5.67e-01

Pos logit contributions
 chew+0.049
 hop+0.043
 snowball+0.042
 spaghetti+0.040
 pile+0.040
Neg logit contributions
Lay-0.043
la-0.042
ve-0.041
overed-0.041
iring-0.040
Token She
Token ID1375
Feature activation-1.75e-01

Pos logit contributions
 chew+0.014
 hop+0.013
 pile+0.012
 spaghetti+0.012
 nose+0.012
Neg logit contributions
Lay-0.015
ager-0.014
ve-0.014
spe-0.014
la-0.014
Token poured
Token ID19036
Feature activation+1.71e+00

Pos logit contributions
Lay+0.005
ager+0.004
ve+0.004
la+0.004
leaf+0.004
Neg logit contributions
 chew-0.004
 hop-0.004
 pile-0.004
 spaghetti-0.004
 snowball-0.004
Token it
Token ID340
Feature activation+7.63e-01

Pos logit contributions
 chew+0.046
 hop+0.040
 skate+0.039
 pile+0.038
 bone+0.038
Neg logit contributions
Lay-0.039
la-0.037
iring-0.037
ve-0.037
overed-0.036
Token with
Token ID351
Feature activation-5.33e-01

Pos logit contributions
 chew+0.052
 hop+0.044
 snowball+0.043
 puck+0.042
 skate+0.041
Neg logit contributions
bel-0.040
la-0.039
Lay-0.038
ve-0.038
overed-0.038
Token some
Token ID617
Feature activation+3.01e+00

Pos logit contributions
Lay+0.012
leaf+0.012
la+0.012
spe+0.011
bel+0.011
Neg logit contributions
 chew-0.013
 skate-0.012
 hop-0.011
 snowball-0.011
 puck-0.011
Token water
Token ID1660
Feature activation+2.03e+00

Pos logit contributions
 chew+0.069
 hop+0.058
 snowball+0.057
 skate+0.056
 pile+0.056
Neg logit contributions
Lay-0.079
leaf-0.076
ve-0.075
la-0.074
bel-0.073
Token and
Token ID290
Feature activation+9.14e-01

Pos logit contributions
 chew+0.053
 hop+0.045
 snowball+0.044
 spaghetti+0.043
 pile+0.042
Neg logit contributions
Lay-0.048
la-0.047
overed-0.046
ve-0.046
spe-0.046
Token started
Token ID2067
Feature activation-5.76e-02

Pos logit contributions
 chew+0.022
 hop+0.019
 snowball+0.018
 pile+0.018
 bone+0.018
Neg logit contributions
Lay-0.023
leaf-0.022
la-0.022
bel-0.021
spe-0.021
Token boiling
Token ID24372
Feature activation+4.84e+00

Pos logit contributions
Lay+0.001
leaf+0.001
la+0.001
ve+0.001
spe+0.001
Neg logit contributions
 chew-0.002
 hop-0.002
 pile-0.002
 spaghetti-0.002
 snowball-0.002
Token it
Token ID340
Feature activation+3.26e-01

Pos logit contributions
 chew+0.122
 hop+0.106
 skate+0.105
 puck+0.102
 bone+0.100
Neg logit contributions
Lay-0.114
leaf-0.112
la-0.110
ve-0.110
overed-0.108
Token down
Token ID866
Feature activation-4.51e-01

Pos logit contributions
 chew+0.010
 hop+0.008
 snowball+0.008
 puck+0.008
 spaghetti+0.008
Neg logit contributions
Lay-0.006
ve-0.006
spe-0.006
bel-0.006
overed-0.006
Token.
Token ID13
Feature activation+9.58e-01

Pos logit contributions
Lay+0.012
ager+0.012
la+0.012
leaf+0.012
ve+0.012
Neg logit contributions
 chew-0.011
 hop-0.009
 pile-0.009
 spaghetti-0.009
 nose-0.009
Token The
Token ID383
Feature activation+5.47e-01

Pos logit contributions
 chew+0.023
 hop+0.020
 pile+0.019
 spaghetti+0.019
 nose+0.019
Neg logit contributions
Lay-0.026
ager-0.024
la-0.024
ve-0.024
leaf-0.024
Token smell
Token ID8508
Feature activation+6.28e-01

Pos logit contributions
 chew+0.013
 snowball+0.011
 skate+0.010
 hop+0.010
 bone+0.010
Neg logit contributions
Lay-0.013
 singer-0.013
ve-0.013
iring-0.013
essa-0.013
Token olive
Token ID19450
Feature activation+3.60e-01

Pos logit contributions
ve+0.012
Lay+0.011
ager+0.011
iring+0.011
ires+0.011
Neg logit contributions
 chew-0.011
 snowball-0.010
 hop-0.009
 puck-0.009
 skate-0.009
Token in
Token ID287
Feature activation-3.03e-02

Pos logit contributions
 chew+0.010
 hop+0.009
 pile+0.009
 spaghetti+0.009
 nose+0.009
Neg logit contributions
Lay-0.009
la-0.008
Mad-0.008
leaf-0.008
elia-0.008
Token a
Token ID257
Feature activation-2.80e-01

Pos logit contributions
Lay+0.001
Ap+0.001
Mad+0.001
May+0.001
Kay+0.001
Neg logit contributions
 chew-0.001
 hop-0.001
 puck-0.001
 snowball-0.001
 pile-0.001
Token pot
Token ID1787
Feature activation+1.62e+00

Pos logit contributions
ager+0.007
Lay+0.007
ve+0.007
yl+0.007
May+0.007
Neg logit contributions
 chew-0.006
 hop-0.005
 puck-0.005
 spaghetti-0.005
 nose-0.005
Token of
Token ID286
Feature activation+5.27e-01

Pos logit contributions
 chew+0.046
 snowball+0.040
 hop+0.039
 puck+0.038
 skate+0.037
Neg logit contributions
bel-0.035
overed-0.034
iring-0.033
ager-0.033
ve-0.033
Token boiling
Token ID24372
Feature activation+4.79e+00

Pos logit contributions
 chew+0.011
 skate+0.010
 puck+0.010
 snowball+0.010
 hop+0.010
Neg logit contributions
leaf-0.014
Lay-0.013
ires-0.013
ager-0.013
ve-0.012
Token water
Token ID1660
Feature activation+1.67e+00

Pos logit contributions
 chew+0.109
 skate+0.095
 hop+0.093
 puck+0.092
 snowball+0.086
Neg logit contributions
leaf-0.123
ager-0.123
han-0.122
ve-0.120
col-0.120
Token and
Token ID290
Feature activation+9.30e-01

Pos logit contributions
 chew+0.046
 hop+0.040
 snowball+0.039
 spaghetti+0.037
 puck+0.036
Neg logit contributions
Lay-0.037
la-0.037
overed-0.036
ve-0.036
iring-0.036
Token after
Token ID706
Feature activation-1.16e+00

Pos logit contributions
 chew+0.023
 snowball+0.021
 hop+0.020
 bone+0.020
 Today+0.020
Neg logit contributions
Lay-0.021
bel-0.021
ve-0.020
overed-0.020
leaf-0.020
Token a
Token ID257
Feature activation-6.78e-01

Pos logit contributions
overed+0.024
ve+0.024
Lay+0.024
ires+0.023
iring+0.023
Neg logit contributions
 chew-0.031
 pile-0.029
 bisc-0.028
 hop-0.028
 snowball-0.027
Token few
Token ID1178
Feature activation+9.21e-01

Pos logit contributions
Lay+0.017
ager+0.016
ve+0.016
spe+0.016
 singer+0.016
Neg logit contributions
 chew-0.016
 spaghetti-0.014
 hop-0.014
 nose-0.014
 snowball-0.014
Token sauce
Token ID10746
Feature activation+1.70e+00

Pos logit contributions
 chew+0.009
 hop+0.008
 snowball+0.008
 skate+0.008
 fetch+0.007
Neg logit contributions
ve-0.015
bel-0.015
ires-0.014
ager-0.014
iring-0.014
Token and
Token ID290
Feature activation+9.02e-01

Pos logit contributions
 chew+0.050
 snowball+0.043
 bisc+0.043
 hop+0.043
 bone+0.042
Neg logit contributions
overed-0.032
ires-0.032
iring-0.031
bel-0.031
rier-0.031
Token keep
Token ID1394
Feature activation+4.82e-01

Pos logit contributions
 chew+0.013
 bone+0.013
 bisc+0.013
 snowball+0.013
 boot+0.013
Neg logit contributions
bel-0.025
leaf-0.024
 yourselves-0.024
Lay-0.024
Ap-0.024
Token it
Token ID340
Feature activation-4.24e-01

Pos logit contributions
 chew+0.013
 Today+0.013
 hop+0.013
 bisc+0.013
 Yesterday+0.012
Neg logit contributions
ires-0.009
ve-0.009
 yourselves-0.008
elia-0.008
otte-0.008
Token from
Token ID422
Feature activation-1.91e-01

Pos logit contributions
ve+0.007
iring+0.007
bel+0.007
ires+0.007
Lay+0.007
Neg logit contributions
 chew-0.012
 bisc-0.011
 snowball-0.011
 puck-0.011
 hop-0.011
Token boiling
Token ID24372
Feature activation+4.79e+00

Pos logit contributions
ires+0.004
 yourselves+0.004
ind+0.003
otte+0.003
elia+0.003
Neg logit contributions
 chew-0.005
 bisc-0.005
 hop-0.005
 Today-0.005
 snowball-0.004
Token over
Token ID625
Feature activation-2.67e-01

Pos logit contributions
 chew+0.140
 hop+0.128
 skate+0.120
 puck+0.117
 jump+0.117
Neg logit contributions
ager-0.092
ires-0.090
bel-0.090
gr-0.089
leaf-0.087
Token.
Token ID13
Feature activation+2.69e-01

Pos logit contributions
ires+0.005
iring+0.005
Lay+0.005
spe+0.005
bel+0.005
Neg logit contributions
 chew-0.008
 hop-0.007
 bisc-0.007
 nose-0.007
 bone-0.007
Token They
Token ID1119
Feature activation+8.49e-01

Pos logit contributions
 chew+0.007
 hop+0.006
 pile+0.006
 nose+0.006
 snowball+0.006
Neg logit contributions
Lay-0.007
la-0.006
overed-0.006
leaf-0.006
ager-0.006
Token are
Token ID389
Feature activation+4.22e-01

Pos logit contributions
 chew+0.014
 spaghetti+0.012
 snowball+0.012
 bone+0.012
 hop+0.012
Neg logit contributions
Lay-0.027
leaf-0.026
ager-0.026
ve-0.025
la-0.025
Token happy
Token ID3772
Feature activation-1.04e+00

Pos logit contributions
 chew+0.012
 hop+0.011
 bisc+0.010
 pile+0.010
 bone+0.010
Neg logit contributions
iring-0.009
Lay-0.008
ires-0.008
leaf-0.008
ve-0.008
Token did
Token ID750
Feature activation-8.24e-01

Pos logit contributions
 chew+0.016
 spaghetti+0.015
 snowball+0.015
 bisc+0.014
 hop+0.014
Neg logit contributions
Lay-0.014
la-0.014
ve-0.013
ager-0.013
spe-0.013
Token not
Token ID407
Feature activation-2.88e-01

Pos logit contributions
Lay+0.019
la+0.017
leaf+0.017
ager+0.017
ve+0.017
Neg logit contributions
 chew-0.024
 hop-0.021
 pile-0.021
 jump-0.021
 spaghetti-0.020
Token hear
Token ID3285
Feature activation+2.50e-01

Pos logit contributions
Lay+0.008
 singer+0.008
leaf+0.008
ala+0.007
ager+0.007
Neg logit contributions
 chew-0.006
 spaghetti-0.006
 bisc-0.006
 snowball-0.005
 pile-0.005
Token the
Token ID262
Feature activation-4.61e-01

Pos logit contributions
 chew+0.007
 hop+0.006
 pile+0.006
 carrot+0.006
 bisc+0.006
Neg logit contributions
Lay-0.005
la-0.005
 singer-0.005
ires-0.005
ala-0.005
Token cocoa
Token ID35845
Feature activation+9.22e-02

Pos logit contributions
 singer+0.012
leaf+0.012
Lay+0.012
iring+0.012
ve+0.012
Neg logit contributions
 chew-0.011
 hop-0.009
 pile-0.008
 puck-0.008
 skate-0.008
Token boiling
Token ID24372
Feature activation+4.78e+00

Pos logit contributions
 chew+0.003
 hop+0.002
 snowball+0.002
 puck+0.002
 bisc+0.002
Neg logit contributions
 singer-0.002
overed-0.002
iring-0.002
 singers-0.002
we-0.002
Token over
Token ID625
Feature activation-4.35e-03

Pos logit contributions
 chew+0.157
 puck+0.143
 skate+0.138
 bisc+0.138
 hop+0.135
Neg logit contributions
led-0.077
iring-0.072
gr-0.071
ires-0.069
 yourselves-0.069
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
Lay+0.000
iring+0.000
ires+0.000
la+0.000
spe+0.000
Neg logit contributions
 chew-0.000
 hop-0.000
 bisc-0.000
 carrot-0.000
 nose-0.000
Token They
Token ID1119
Feature activation+4.01e-01

Pos logit contributions
 chew+0.005
 hop+0.005
 pile+0.005
 nose+0.004
 carrot+0.004
Neg logit contributions
Lay-0.005
ires-0.004
Ap-0.004
iring-0.004
overed-0.004
Token did
Token ID750
Feature activation-8.91e-01

Pos logit contributions
 chew+0.010
 spaghetti+0.009
 hop+0.009
 pile+0.009
 snowball+0.009
Neg logit contributions
Lay-0.010
la-0.009
ve-0.009
ager-0.009
leaf-0.009
Token not
Token ID407
Feature activation-3.40e-01

Pos logit contributions
Lay+0.022
la+0.021
leaf+0.021
elia+0.021
ager+0.021
Neg logit contributions
 chew-0.025
 hop-0.022
 jump-0.021
 pile-0.021
 spaghetti-0.021
Token noise
Token ID7838
Feature activation+1.44e+00

Pos logit contributions
 chew+0.029
 spaghetti+0.027
 bisc+0.026
 puck+0.025
 pile+0.025
Neg logit contributions
Lay-0.037
 singer-0.035
ve-0.034
overed-0.033
leaf-0.033
Token.
Token ID13
Feature activation+4.37e-01

Pos logit contributions
 chew+0.039
 pile+0.036
 spaghetti+0.035
 carrot+0.035
 nose+0.035
Neg logit contributions
Lay-0.031
 singer-0.030
overed-0.029
leaf-0.029
bel-0.029
Token It
Token ID632
Feature activation+2.81e-01

Pos logit contributions
 chew+0.010
 pile+0.009
 hop+0.009
 spaghetti+0.009
 snowball+0.009
Neg logit contributions
Lay-0.012
overed-0.011
ager-0.011
Ap-0.011
ve-0.011
Token was
Token ID373
Feature activation+1.93e+00

Pos logit contributions
 chew+0.008
 spaghetti+0.007
 hop+0.006
 pile+0.006
 bisc+0.006
Neg logit contributions
Lay-0.007
ve-0.006
spe-0.006
la-0.006
ager-0.006
Token a
Token ID257
Feature activation+3.78e-01

Pos logit contributions
 chew+0.052
 bisc+0.046
 pile+0.045
 hop+0.045
 snack+0.043
Neg logit contributions
Lay-0.043
iring-0.041
Ap-0.039
overed-0.039
 singer-0.038
Token sne
Token ID10505
Feature activation+4.71e+00

Pos logit contributions
 chew+0.009
 spaghetti+0.008
 hop+0.008
 pile+0.008
 bisc+0.008
Neg logit contributions
Lay-0.009
 singer-0.009
spe-0.009
ve-0.008
ager-0.008
Tokene
Token ID68
Feature activation+1.23e+00

Pos logit contributions
 chew+0.124
 spaghetti+0.119
 pile+0.118
 carrot+0.114
 puck+0.111
Neg logit contributions
ager-0.114
Lay-0.106
han-0.103
ve-0.103
leaf-0.102
Tokenze
Token ID2736
Feature activation+1.84e+00

Pos logit contributions
 chew+0.029
 hop+0.025
 pile+0.025
 nose+0.024
 spaghetti+0.024
Neg logit contributions
Lay-0.034
la-0.031
leaf-0.031
ager-0.031
ve-0.031
Token.
Token ID13
Feature activation+5.48e-01

Pos logit contributions
 chew+0.051
 hop+0.043
 puck+0.042
 spaghetti+0.042
 pile+0.042
Neg logit contributions
 singer-0.037
ve-0.036
ager-0.034
Lay-0.034
overed-0.034
Token Joe
Token ID5689
Feature activation-4.95e-01

Pos logit contributions
 chew+0.013
 pile+0.011
 hop+0.011
 spaghetti+0.011
 snack+0.010
Neg logit contributions
Lay-0.015
overed-0.014
Ap-0.014
iring-0.014
ager-0.014
Token looked
Token ID3114
Feature activation-8.30e-01

Pos logit contributions
Lay+0.013
ager+0.012
la+0.012
ve+0.012
spe+0.011
Neg logit contributions
 chew-0.013
 hop-0.011
 pile-0.011
 spaghetti-0.011
 nose-0.010
Token and
Token ID290
Feature activation+7.05e-01

Pos logit contributions
 chew+0.007
 hop+0.007
 bisc+0.007
 snowball+0.006
 bone+0.006
Neg logit contributions
iring-0.005
ires-0.005
bel-0.005
ve-0.005
Lay-0.005
Token soon
Token ID2582
Feature activation-1.06e+00

Pos logit contributions
 chew+0.018
 bone+0.016
 hop+0.016
 snowball+0.016
 nose+0.016
Neg logit contributions
Lay-0.016
Ap-0.015
overed-0.015
May-0.015
ager-0.015
Token the
Token ID262
Feature activation-7.14e-01

Pos logit contributions
Lay+0.023
la+0.023
overed+0.022
spe+0.022
ager+0.022
Neg logit contributions
 chew-0.028
 hop-0.025
 snowball-0.025
 pile-0.024
 bisc-0.024
Token pot
Token ID1787
Feature activation-1.73e-01

Pos logit contributions
 singer+0.020
ve+0.019
bel+0.019
ager+0.019
leaf+0.018
Neg logit contributions
 chew-0.015
 hop-0.012
 fetch-0.011
 skate-0.011
 bone-0.011
Token was
Token ID373
Feature activation+1.51e+00

Pos logit contributions
bel+0.004
overed+0.004
spe+0.003
ve+0.003
led+0.003
Neg logit contributions
 chew-0.005
 hop-0.004
 bone-0.004
 snowball-0.004
 spaghetti-0.004
Token boiling
Token ID24372
Feature activation+4.70e+00

Pos logit contributions
 chew+0.040
 hop+0.036
 bisc+0.034
 bone+0.034
 skate+0.033
Neg logit contributions
overed-0.033
iring-0.033
ve-0.032
Lay-0.032
bel-0.031
Token.
Token ID13
Feature activation+4.65e-02

Pos logit contributions
 chew+0.149
 hop+0.140
 skate+0.136
 puck+0.133
 nose+0.129
Neg logit contributions
led-0.079
iring-0.078
col-0.074
ager-0.073
ed-0.071
Token He
Token ID679
Feature activation+3.82e-01

Pos logit contributions
 chew+0.001
 hop+0.001
 pile+0.001
 bisc+0.001
 snowball+0.001
Neg logit contributions
Lay-0.001
spe-0.001
overed-0.001
ager-0.001
leaf-0.001
Token took
Token ID1718
Feature activation+6.62e-01

Pos logit contributions
 chew+0.010
 hop+0.008
 pile+0.008
 spaghetti+0.008
 bisc+0.008
Neg logit contributions
Lay-0.009
ager-0.009
la-0.009
ve-0.009
spe-0.009
Token the
Token ID262
Feature activation-5.45e-01

Pos logit contributions
 chew+0.019
 pile+0.017
 hop+0.017
 bisc+0.017
 skate+0.017
Neg logit contributions
leaf-0.012
la-0.012
ve-0.012
Lay-0.012
May-0.012
Token pot
Token ID1787
Feature activation+1.01e+00

Pos logit contributions
ve+0.015
leaf+0.014
ager+0.014
 singer+0.014
bel+0.013
Neg logit contributions
 chew-0.012
 skate-0.009
 hop-0.009
 puck-0.009
 bone-0.009
TokenIt
Token ID1026
Feature activation+4.18e-01

Pos logit contributions
la+0.012
elia+0.012
Lay+0.012
ve+0.012
overed+0.012
Neg logit contributions
 chew-0.019
 hop-0.017
 snowball-0.017
 puck-0.016
 jump-0.016
Token's
Token ID338
Feature activation+2.89e+00

Pos logit contributions
 chew+0.008
 hop+0.007
 pile+0.007
 nose+0.007
 jump+0.007
Neg logit contributions
Lay-0.014
overed-0.013
Mad-0.013
essa-0.013
Kat-0.013
Token just
Token ID655
Feature activation+2.16e+00

Pos logit contributions
 chew+0.091
 hop+0.083
 pile+0.081
 bisc+0.080
 spaghetti+0.078
Neg logit contributions
Lay-0.050
iring-0.045
elia-0.045
ve-0.045
la-0.045
Token a
Token ID257
Feature activation-1.46e-01

Pos logit contributions
 chew+0.052
 hop+0.045
 pile+0.044
 spaghetti+0.043
 nose+0.043
Neg logit contributions
Lay-0.058
la-0.054
leaf-0.053
ve-0.053
ager-0.053
Token little
Token ID1310
Feature activation+7.27e-01

Pos logit contributions
Lay+0.003
 singer+0.003
ager+0.003
ve+0.003
spe+0.003
Neg logit contributions
 chew-0.004
 spaghetti-0.003
 hop-0.003
 pile-0.003
 nose-0.003
Token sne
Token ID10505
Feature activation+4.69e+00

Pos logit contributions
 chew+0.018
 spaghetti+0.016
 hop+0.016
 skate+0.015
 pile+0.015
Neg logit contributions
 singer-0.017
Lay-0.016
ager-0.016
spe-0.016
leaf-0.015
Tokene
Token ID68
Feature activation+1.08e+00

Pos logit contributions
 pile+0.125
 chew+0.125
 carrot+0.121
 nose+0.118
 spaghetti+0.117
Neg logit contributions
ager-0.109
gr-0.099
han-0.096
leaf-0.094
ve-0.093
Tokenze
Token ID2736
Feature activation+1.56e+00

Pos logit contributions
 chew+0.027
 hop+0.023
 pile+0.023
 spaghetti+0.023
 nose+0.022
Neg logit contributions
Lay-0.028
ager-0.026
ve-0.026
la-0.026
leaf-0.026
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
 chew+0.046
 hop+0.041
 pile+0.039
 bisc+0.039
 snowball+0.039
Neg logit contributions
 singer-0.030
ve-0.029
la-0.028
Lay-0.028
ager-0.028
Token It
Token ID632
Feature activation+1.56e-01

Pos logit contributions
 chew+0.005
 pile+0.004
 hop+0.004
 spaghetti+0.004
 carrot+0.004
Neg logit contributions
Lay-0.003
May-0.003
leaf-0.003
Mad-0.003
la-0.003
Token will
Token ID481
Feature activation+5.69e-01

Pos logit contributions
 chew+0.003
 hop+0.003
 pile+0.003
 nose+0.003
 snowball+0.003
Neg logit contributions
Lay-0.005
overed-0.005
Mad-0.005
ires-0.005
essa-0.005
Token She
Token ID1375
Feature activation-1.51e-01

Pos logit contributions
Lay+0.005
elia+0.005
May+0.004
la+0.004
iring+0.004
Neg logit contributions
 chew-0.006
 hop-0.005
 pile-0.005
 puck-0.005
 snowball-0.005
Token had
Token ID550
Feature activation-1.59e+00

Pos logit contributions
Lay+0.004
iring+0.004
la+0.004
leaf+0.004
Mad+0.004
Neg logit contributions
 chew-0.004
 hop-0.003
 pile-0.003
 spaghetti-0.003
 nose-0.003
Token big
Token ID1263
Feature activation+1.89e-01

Pos logit contributions
Lay+0.042
ager+0.039
la+0.039
ve+0.039
leaf+0.039
Neg logit contributions
 chew-0.039
 hop-0.034
 pile-0.033
 spaghetti-0.033
 nose-0.032
Token pot
Token ID1787
Feature activation+1.32e+00

Pos logit contributions
 chew+0.004
 hop+0.004
 skate+0.004
 puck+0.004
 snowball+0.004
Neg logit contributions
elia-0.004
ve-0.004
 beings-0.004
leaf-0.004
ires-0.004
Token of
Token ID286
Feature activation-5.68e-01

Pos logit contributions
 chew+0.039
 hop+0.034
 snowball+0.034
 pile+0.033
 puck+0.033
Neg logit contributions
iring-0.027
Lay-0.027
ve-0.027
bel-0.027
la-0.026
Token boiling
Token ID24372
Feature activation+4.64e+00

Pos logit contributions
leaf+0.015
Lay+0.015
iring+0.014
ires+0.014
ve+0.014
Neg logit contributions
 chew-0.013
 hop-0.011
 puck-0.011
 skate-0.011
 snowball-0.010
Token water
Token ID1660
Feature activation+1.87e+00

Pos logit contributions
 chew+0.109
 hop+0.096
 puck+0.093
 skate+0.093
 jump+0.085
Neg logit contributions
leaf-0.118
ve-0.118
iring-0.117
ager-0.116
led-0.116
Token,
Token ID11
Feature activation+1.17e+00

Pos logit contributions
 chew+0.052
 hop+0.045
 snowball+0.044
 puck+0.042
 skate+0.041
Neg logit contributions
ve-0.040
iring-0.040
la-0.040
Lay-0.040
bel-0.039
Token and
Token ID290
Feature activation+2.48e-01

Pos logit contributions
 chew+0.026
 hop+0.023
 pile+0.022
 snowball+0.021
 bone+0.021
Neg logit contributions
Lay-0.032
leaf-0.030
ve-0.030
iring-0.030
ager-0.030
Token a
Token ID257
Feature activation-6.73e-01

Pos logit contributions
 chew+0.006
 hop+0.006
 snowball+0.005
 pile+0.005
 bone+0.005
Neg logit contributions
Lay-0.006
la-0.006
elia-0.006
leaf-0.006
ve-0.006
Token big
Token ID1263
Feature activation+5.20e-01

Pos logit contributions
Lay+0.018
 singer+0.018
ve+0.017
spe+0.017
bel+0.017
Neg logit contributions
 chew-0.015
 hop-0.012
 skate-0.011
 jump-0.011
 snowball-0.011
Token a
Token ID257
Feature activation-6.75e-01

Pos logit contributions
 chew+0.033
 hop+0.029
 pile+0.028
 snowball+0.028
 bisc+0.027
Neg logit contributions
Lay-0.030
la-0.028
iring-0.028
ires-0.028
elia-0.027
Token special
Token ID2041
Feature activation-7.02e-02

Pos logit contributions
Lay+0.019
la+0.018
overed+0.018
iring+0.017
ve+0.017
Neg logit contributions
 chew-0.016
 hop-0.014
 pile-0.014
 spaghetti-0.014
 carrot-0.013
Token gift
Token ID6979
Feature activation-9.21e-01

Pos logit contributions
Lay+0.002
ager+0.002
leaf+0.002
ve+0.002
 singer+0.002
Neg logit contributions
 chew-0.002
 hop-0.002
 pile-0.002
 spaghetti-0.002
 nose-0.002
Token from
Token ID422
Feature activation-2.03e-01

Pos logit contributions
Lay+0.020
la+0.019
 singer+0.019
ager+0.019
leaf+0.019
Neg logit contributions
 chew-0.026
 hop-0.023
 snowball-0.023
 pile-0.022
 spaghetti-0.022
Token the
Token ID262
Feature activation+5.13e-01

Pos logit contributions
Lay+0.005
la+0.005
elia+0.005
ires+0.005
leaf+0.004
Neg logit contributions
 chew-0.005
 hop-0.005
 pile-0.005
 snowball-0.005
 bisc-0.005
Token sne
Token ID10505
Feature activation+4.62e+00

Pos logit contributions
 chew+0.012
 hop+0.011
 snowball+0.011
 spaghetti+0.011
 pile+0.010
Neg logit contributions
Lay-0.013
 singer-0.012
ager-0.012
May-0.012
ve-0.012
Tokene
Token ID68
Feature activation+8.11e-01

Pos logit contributions
 chew+0.118
 pile+0.108
 spaghetti+0.105
 hop+0.103
 carrot+0.103
Neg logit contributions
ager-0.118
Lay-0.105
leaf-0.104
ve-0.102
han-0.102
Tokenze
Token ID2736
Feature activation+1.45e+00

Pos logit contributions
 chew+0.021
 hop+0.019
 nose+0.018
 pile+0.018
 spaghetti+0.018
Neg logit contributions
Lay-0.021
leaf-0.019
Mad-0.019
Ap-0.019
iring-0.019
Token.
Token ID13
Feature activation+2.53e-01

Pos logit contributions
 chew+0.042
 hop+0.037
 snowball+0.037
 pile+0.035
 carrot+0.034
Neg logit contributions
 singer-0.030
la-0.027
ager-0.027
spe-0.026
Lay-0.026
Token They
Token ID1119
Feature activation+4.49e-01

Pos logit contributions
 chew+0.007
 hop+0.006
 pile+0.006
 nose+0.006
 carrot+0.006
Neg logit contributions
Lay-0.006
ires-0.006
overed-0.006
iring-0.006
ager-0.005
Token went
Token ID1816
Feature activation-4.11e-02

Pos logit contributions
 chew+0.012
 spaghetti+0.011
 snowball+0.011
 hop+0.011
 pile+0.010
Neg logit contributions
Lay-0.010
la-0.010
ager-0.010
spe-0.009
ve-0.009
Token closer
Token ID5699
Feature activation-5.74e-01

Pos logit contributions
Lay+0.013
ve+0.012
la+0.012
ager+0.012
iring+0.012
Neg logit contributions
 chew-0.014
 spaghetti-0.012
 pile-0.012
 hop-0.012
 nose-0.011
Token.
Token ID13
Feature activation+1.83e-01

Pos logit contributions
Lay+0.014
la+0.013
overed+0.013
ager+0.013
ve+0.013
Neg logit contributions
 chew-0.015
 hop-0.013
 spaghetti-0.013
 pile-0.013
 snowball-0.013
Token It
Token ID632
Feature activation+2.61e-01

Pos logit contributions
 chew+0.004
 hop+0.004
 spaghetti+0.003
 pile+0.003
 nose+0.003
Neg logit contributions
Lay-0.005
ager-0.005
ve-0.005
la-0.005
spe-0.005
Token opens
Token ID9808
Feature activation-5.52e-01

Pos logit contributions
 chew+0.006
 hop+0.005
 snowball+0.005
 jump+0.005
 pile+0.005
Neg logit contributions
Mad-0.008
essa-0.008
otte-0.008
Kat-0.008
Lay-0.008
Token its
Token ID663
Feature activation+1.74e+00

Pos logit contributions
ager+0.017
ve+0.017
leaf+0.016
Lay+0.016
Mad+0.016
Neg logit contributions
 chew-0.011
 nose-0.011
 snowball-0.010
 carrot-0.010
 hop-0.010
Token be
Token ID307
Feature activation+4.61e+00

Pos logit contributions
 chew+0.025
 spaghetti+0.019
 hop+0.018
 snowball+0.017
 pile+0.017
Neg logit contributions
Lay-0.063
ve-0.059
iring-0.059
ager-0.059
ires-0.059
Tokenak
Token ID461
Feature activation+2.35e+00

Pos logit contributions
 chew+0.093
 hop+0.079
 pile+0.077
 spaghetti+0.075
 nose+0.075
Neg logit contributions
Lay-0.143
ager-0.134
la-0.134
ve-0.134
leaf-0.134
Token and
Token ID290
Feature activation+6.91e-02

Pos logit contributions
 chew+0.064
 spaghetti+0.055
 hop+0.053
 snowball+0.053
 snack+0.052
Neg logit contributions
Lay-0.054
iring-0.054
la-0.052
overed-0.051
ve-0.051
Token sl
Token ID1017
Feature activation-1.06e-01

Pos logit contributions
 chew+0.002
 hop+0.001
 jump+0.001
 pile+0.001
 nose+0.001
Neg logit contributions
Lay-0.002
ve-0.002
Mad-0.002
ager-0.002
iring-0.002
Tokenaps
Token ID1686
Feature activation+1.40e+00

Pos logit contributions
Lay+0.002
ve+0.002
ala+0.002
iring+0.002
ires+0.002
Neg logit contributions
 chew-0.004
 spaghetti-0.003
 pile-0.003
 carrot-0.003
 meat-0.003
Token Lily
Token ID20037
Feature activation+8.05e-01

Pos logit contributions
 chew+0.042
 pile+0.037
 hop+0.037
 spaghetti+0.036
 nose+0.036
Neg logit contributions
Lay-0.030
la-0.027
ve-0.027
ager-0.027
leaf-0.027
Token the
Token ID262
Feature activation-2.35e-02

Pos logit contributions
 chew+0.051
 snowball+0.046
 hop+0.046
 pile+0.045
 nose+0.045
Neg logit contributions
Lay-0.029
spe-0.027
la-0.026
 singer-0.026
Ap-0.026
Token bag
Token ID6131
Feature activation-3.51e-01

Pos logit contributions
Lay+0.001
ve+0.001
ager+0.001
spe+0.001
Ag+0.001
Neg logit contributions
 chew-0.001
 hop-0.000
 snowball-0.000
 spaghetti-0.000
 pile-0.000
Token was
Token ID373
Feature activation+2.30e+00

Pos logit contributions
Lay+0.009
ager+0.008
la+0.008
ve+0.008
leaf+0.008
Neg logit contributions
 chew-0.009
 hop-0.008
 pile-0.008
 spaghetti-0.007
 nose-0.007
Token full
Token ID1336
Feature activation+2.30e+00

Pos logit contributions
 chew+0.061
 hop+0.054
 snowball+0.053
 pile+0.052
 nose+0.051
Neg logit contributions
Lay-0.054
la-0.050
ve-0.050
iring-0.049
spe-0.049
Token of
Token ID286
Feature activation+5.56e-01

Pos logit contributions
 chew+0.077
 snowball+0.073
 hop+0.069
 puck+0.069
 skate+0.069
Neg logit contributions
iring-0.034
spe-0.033
leaf-0.032
Lay-0.032
keeper-0.032
Token salty
Token ID36021
Feature activation+4.61e+00

Pos logit contributions
 chew+0.012
 hop+0.011
 snowball+0.011
 nose+0.011
 pile+0.011
Neg logit contributions
Lay-0.015
leaf-0.014
ve-0.014
Ag-0.014
iring-0.013
Token peanuts
Token ID45612
Feature activation+1.68e+00

Pos logit contributions
 chew+0.103
 hop+0.099
 snowball+0.094
 skate+0.091
 jump+0.090
Neg logit contributions
iring-0.110
ve-0.110
led-0.107
ires-0.104
Lay-0.103
Token.
Token ID13
Feature activation+9.01e-01

Pos logit contributions
 chew+0.045
 snowball+0.041
 hop+0.039
 pile+0.037
 puck+0.037
Neg logit contributions
ve-0.038
Lay-0.036
ager-0.036
leaf-0.035
iring-0.035
Token The
Token ID383
Feature activation+5.95e-01

Pos logit contributions
 chew+0.021
 spaghetti+0.018
 pile+0.018
 hop+0.018
 nose+0.018
Neg logit contributions
Lay-0.025
ager-0.024
ve-0.024
leaf-0.023
la-0.023
Token little
Token ID1310
Feature activation-2.14e-01

Pos logit contributions
 chew+0.018
 hop+0.016
 snowball+0.016
 pile+0.015
 spaghetti+0.015
Neg logit contributions
Lay-0.012
ve-0.011
ager-0.011
elia-0.011
spe-0.011
Token girl
Token ID2576
Feature activation-2.16e-01

Pos logit contributions
Lay+0.005
leaf+0.004
ve+0.004
ager+0.004
Mad+0.004
Neg logit contributions
 chew-0.007
 pile-0.006
 hop-0.006
 spaghetti-0.006
 nose-0.006
Token,
Token ID11
Feature activation-1.57e+00

Pos logit contributions
Lay+0.034
ve+0.034
la+0.033
ager+0.032
iring+0.032
Neg logit contributions
 chew-0.027
 spaghetti-0.021
 hop-0.021
 bisc-0.021
 snowball-0.021
Token that
Token ID326
Feature activation+1.17e+00

Pos logit contributions
Lay+0.039
ve+0.036
la+0.036
ager+0.036
overed+0.036
Neg logit contributions
 chew-0.043
 hop-0.038
 pile-0.036
 nose-0.036
 spaghetti-0.036
Token was
Token ID373
Feature activation+1.16e+00

Pos logit contributions
 chew+0.030
 spaghetti+0.026
 pile+0.025
 bisc+0.025
 hop+0.025
Neg logit contributions
 singer-0.025
ager-0.025
Lay-0.025
Ap-0.024
spe-0.024
Token a
Token ID257
Feature activation-1.39e-01

Pos logit contributions
 chew+0.031
 hop+0.027
 pile+0.026
 bisc+0.025
 nose+0.025
Neg logit contributions
Lay-0.026
iring-0.025
la-0.025
Ap-0.024
leaf-0.024
Token big
Token ID1263
Feature activation+7.09e-01

Pos logit contributions
Lay+0.003
 singer+0.003
May+0.003
 talented+0.003
Ap+0.003
Neg logit contributions
 chew-0.004
 spaghetti-0.003
 hop-0.003
 nose-0.003
 pile-0.003
Token sne
Token ID10505
Feature activation+4.60e+00

Pos logit contributions
 chew+0.016
 spaghetti+0.015
bowl+0.015
 hop+0.014
hot+0.014
Neg logit contributions
 singer-0.017
 angel-0.015
yl-0.014
Lay-0.013
ager-0.013
Tokene
Token ID68
Feature activation+1.01e+00

Pos logit contributions
 chew+0.116
 pile+0.110
 spaghetti+0.109
 carrot+0.107
 nose+0.103
Neg logit contributions
ager-0.115
ve-0.105
leaf-0.101
han-0.101
Lay-0.100
Tokenze
Token ID2736
Feature activation+1.36e+00

Pos logit contributions
 chew+0.026
 hop+0.022
 pile+0.022
 spaghetti+0.022
 nose+0.021
Neg logit contributions
Lay-0.026
ager-0.024
ve-0.024
la-0.024
leaf-0.023
Token!"
Token ID2474
Feature activation+5.65e-01

Pos logit contributions
 chew+0.040
 hop+0.035
 pile+0.034
 bisc+0.033
 nose+0.033
Neg logit contributions
 singer-0.027
ve-0.026
ager-0.026
Lay-0.026
la-0.025
Token
Token ID198
Feature activation+7.23e-01

Pos logit contributions
 chew+0.012
 pile+0.011
 hop+0.011
 spaghetti+0.010
 carrot+0.010
Neg logit contributions
Lay-0.016
ager-0.015
overed-0.015
leaf-0.015
ve-0.015
Token
Token ID198
Feature activation-1.02e-01

Pos logit contributions
 chew+0.020
 pile+0.018
 spaghetti+0.018
 hop+0.018
 nose+0.018
Neg logit contributions
Lay-0.017
ve-0.015
ager-0.015
la-0.015
iring-0.015
Token sur
Token ID969
Feature activation-5.86e+00

Pos logit contributions
Lay+0.006
ager+0.006
la+0.006
ve+0.006
elia+0.006
Neg logit contributions
 chew-0.005
 hop-0.004
 pile-0.004
 spaghetti-0.004
 nose-0.004
Tokenfer
Token ID2232
Feature activation-6.99e-01

Pos logit contributions
Lay+0.113
elia+0.107
ires+0.105
irst+0.099
la+0.099
Neg logit contributions
 chew-0.173
 skate-0.161
 jump-0.156
 hop-0.154
 reel-0.153
Token.
Token ID13
Feature activation+9.92e-01

Pos logit contributions
 singer+0.017
Lay+0.017
spe+0.016
bel+0.016
ager+0.016
Neg logit contributions
 chew-0.015
 spaghetti-0.015
 bisc-0.015
 snowball-0.014
 meat-0.014
Token Whenever
Token ID21326
Feature activation-3.35e-01

Pos logit contributions
 chew+0.025
 hop+0.023
 pile+0.022
 spaghetti+0.022
 nose+0.022
Neg logit contributions
Lay-0.025
ager-0.024
ve-0.024
leaf-0.023
la-0.023
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
Lay+0.006
ager+0.006
leaf+0.006
ve+0.006
la+0.006
Neg logit contributions
 chew-0.011
 hop-0.010
 pile-0.010
 spaghetti-0.010
 nose-0.009
Token sur
Token ID969
Feature activation-6.53e+00

Pos logit contributions
Lay+0.032
elia+0.030
la+0.030
ve+0.030
overed+0.029
Neg logit contributions
 chew-0.029
 hop-0.026
 pile-0.025
 jump-0.025
 nose-0.024
Tokenfer
Token ID2232
Feature activation-1.18e+00

Pos logit contributions
ires+0.142
Lay+0.140
elia+0.139
 Kirst+0.137
atalie+0.135
Neg logit contributions
 chew-0.153
 skate-0.151
 reel-0.143
 jump-0.136
 puck-0.135
Token balanced
Token ID12974
Feature activation-7.15e-01

Pos logit contributions
Lay+0.028
ager+0.027
spe+0.027
la+0.027
 singer+0.026
Neg logit contributions
 chew-0.030
 spaghetti-0.028
 pile-0.027
 snowball-0.027
 hop-0.026
Token on
Token ID319
Feature activation+3.19e-02

Pos logit contributions
Lay+0.018
 singer+0.017
la+0.017
overed+0.017
bel+0.017
Neg logit contributions
 chew-0.018
 spaghetti-0.016
 pile-0.015
 hop-0.015
 snowball-0.015
Token the
Token ID262
Feature activation+1.87e-01

Pos logit contributions
 chew+0.001
 bisc+0.001
 pile+0.001
 spaghetti+0.001
 carrot+0.001
Neg logit contributions
Lay-0.001
la-0.001
spe-0.001
 singer-0.001
yl-0.001
Token board
Token ID3096
Feature activation-6.71e-01

Pos logit contributions
 chew+0.004
 hop+0.004
 spaghetti+0.004
 snowball+0.004
 pile+0.003
Neg logit contributions
Lay-0.005
leaf-0.005
ve-0.005
ager-0.005
la-0.005
Token wanted
Token ID2227
Feature activation+3.46e-01

Pos logit contributions
Lay+0.008
la+0.007
ager+0.007
spe+0.007
ve+0.007
Neg logit contributions
 chew-0.009
 hop-0.008
 spaghetti-0.008
 pile-0.008
 snowball-0.007
Token to
Token ID284
Feature activation-6.43e-01

Pos logit contributions
 chew+0.013
 hop+0.012
 pile+0.012
 snowball+0.012
 bisc+0.012
Neg logit contributions
Lay-0.004
leaf-0.003
la-0.003
 singer-0.003
spe-0.003
Token be
Token ID307
Feature activation+5.05e-01

Pos logit contributions
Lay+0.020
leaf+0.019
 singer+0.019
ager+0.018
Mad+0.018
Neg logit contributions
 spaghetti-0.011
 chew-0.010
 snowball-0.010
 meat-0.010
 carrot-0.010
Token a
Token ID257
Feature activation-1.76e+00

Pos logit contributions
 chew+0.012
 hop+0.011
 pile+0.011
 spaghetti+0.010
 nose+0.010
Neg logit contributions
Lay-0.013
ager-0.012
leaf-0.012
ve-0.012
la-0.012
Token good
Token ID922
Feature activation-1.33e+00

Pos logit contributions
Lay+0.047
la+0.044
ve+0.043
ager+0.043
leaf+0.043
Neg logit contributions
 chew-0.043
 hop-0.038
 pile-0.037
 spaghetti-0.036
 nose-0.036
Token gol
Token ID43791
Feature activation-6.46e+00

Pos logit contributions
Lay+0.033
ager+0.031
ve+0.030
leaf+0.030
la+0.030
Neg logit contributions
 chew-0.034
 hop-0.030
 pile-0.030
 spaghetti-0.029
 nose-0.029
Tokenfer
Token ID2232
Feature activation-1.26e+00

Pos logit contributions
Lay+0.118
elia+0.114
ires+0.113
leaf+0.112
la+0.112
Neg logit contributions
 chew-0.208
 hop-0.194
 nose-0.184
 pile-0.183
 jump-0.182
Token like
Token ID588
Feature activation+1.77e-01

Pos logit contributions
Lay+0.029
 singer+0.029
ager+0.029
la+0.028
May+0.028
Neg logit contributions
 chew-0.033
 pile-0.029
 hop-0.029
 snowball-0.028
 nose-0.028
Token his
Token ID465
Feature activation-1.46e+00

Pos logit contributions
 chew+0.004
 hop+0.004
 pile+0.004
 nose+0.004
 snowball+0.004
Neg logit contributions
Lay-0.005
la-0.004
leaf-0.004
ager-0.004
 singer-0.004
Token dad
Token ID9955
Feature activation+1.57e-01

Pos logit contributions
Lay+0.039
la+0.037
ve+0.036
ager+0.036
leaf+0.036
Neg logit contributions
 chew-0.036
 hop-0.031
 pile-0.031
 nose-0.030
 spaghetti-0.030
Token.
Token ID13
Feature activation-8.76e-02

Pos logit contributions
 chew+0.004
 snowball+0.004
 hop+0.004
 pile+0.004
 spaghetti+0.004
Neg logit contributions
 singer-0.003
la-0.003
Lay-0.003
May-0.003
ve-0.003
Token arm
Token ID3211
Feature activation-6.07e-01

Pos logit contributions
Lay+0.043
la+0.041
ager+0.040
spe+0.040
ve+0.040
Neg logit contributions
 chew-0.038
 hop-0.033
 pile-0.032
 spaghetti-0.031
 snowball-0.031
Tokenchair
Token ID16337
Feature activation-1.93e+00

Pos logit contributions
Lay+0.015
la+0.014
ve+0.014
spe+0.014
ager+0.014
Neg logit contributions
 chew-0.016
 hop-0.014
 pile-0.014
 spaghetti-0.013
 carrot-0.013
Token.
Token ID13
Feature activation-9.56e-01

Pos logit contributions
Lay+0.049
la+0.046
ve+0.046
ager+0.046
spe+0.045
Neg logit contributions
 chew-0.049
 hop-0.043
 pile-0.042
 spaghetti-0.041
 snowball-0.041
Token
Token ID198
Feature activation-3.63e-01

Pos logit contributions
Lay+0.026
ager+0.024
ve+0.024
leaf+0.024
la+0.024
Neg logit contributions
 chew-0.023
 hop-0.019
 pile-0.019
 spaghetti-0.019
 nose-0.019
Token
Token ID198
Feature activation-8.29e-01

Pos logit contributions
Lay+0.009
la+0.009
ager+0.009
ve+0.009
leaf+0.009
Neg logit contributions
 chew-0.009
 hop-0.008
 pile-0.008
 nose-0.008
 spaghetti-0.008
TokenTHE
Token ID10970
Feature activation-6.46e+00

Pos logit contributions
Lay+0.020
iring+0.020
ve+0.020
la+0.020
overed+0.020
Neg logit contributions
 chew-0.021
 hop-0.019
 snowball-0.017
 jump-0.017
 spaghetti-0.017
Token<|endoftext|>
Token ID50256
Feature activation-4.86e-01

Pos logit contributions
iring+0.255
ires+0.246
leaf+0.244
Lay+0.241
onda+0.241
Neg logit contributions
 chew-0.085
 hop-0.072
 jump-0.067
 nose-0.061
 pile-0.055
TokenOnce
Token ID7454
Feature activation-1.38e+00

Pos logit contributions
Lay+0.009
May+0.008
ager+0.008
ve+0.008
la+0.008
Neg logit contributions
 chew-0.016
 nose-0.014
 pile-0.014
 hop-0.014
 spaghetti-0.014
Token upon
Token ID2402
Feature activation-7.48e-01

Pos logit contributions
leaf+0.040
ager+0.038
Mad+0.036
Lay+0.036
beh+0.036
Neg logit contributions
 chew-0.030
 pile-0.028
 hop-0.028
 spaghetti-0.027
 jump-0.027
Token a
Token ID257
Feature activation+5.73e-01

Pos logit contributions
Lay+0.023
la+0.022
spe+0.022
ve+0.022
ager+0.021
Neg logit contributions
 chew-0.016
 hop-0.013
 spaghetti-0.013
 snowball-0.012
 pile-0.012
Token time
Token ID640
Feature activation+7.34e-01

Pos logit contributions
 chew+0.012
 pile+0.011
 nose+0.011
 bisc+0.010
 stack+0.010
Neg logit contributions
ager-0.016
Lay-0.015
May-0.015
elia-0.015
Ap-0.015
Token before
Token ID878
Feature activation-1.69e-01

Pos logit contributions
Lay+0.064
leaf+0.059
ager+0.059
la+0.059
ve+0.059
Neg logit contributions
 chew-0.056
 hop-0.048
 pile-0.048
 spaghetti-0.047
 nose-0.047
Token dinner
Token ID8073
Feature activation+3.17e-01

Pos logit contributions
Lay+0.004
la+0.004
ve+0.004
ager+0.004
Mad+0.004
Neg logit contributions
 chew-0.005
 hop-0.004
 pile-0.004
 snowball-0.004
 nose-0.004
Token.
Token ID13
Feature activation-7.86e-01

Pos logit contributions
 chew+0.008
 hop+0.007
 snowball+0.007
 pile+0.006
 carrot+0.006
Neg logit contributions
Lay-0.008
ve-0.008
la-0.008
ager-0.008
 singer-0.008
Token<|endoftext|>
Token ID50256
Feature activation-7.47e-01

Pos logit contributions
Lay+0.020
ager+0.019
la+0.019
ve+0.019
leaf+0.018
Neg logit contributions
 chew-0.020
 hop-0.017
 pile-0.017
 spaghetti-0.017
 nose-0.017
TokenS
Token ID50
Feature activation-1.04e+00

Pos logit contributions
Lay+0.016
ve+0.014
la+0.014
ager+0.014
May+0.014
Neg logit contributions
 chew-0.022
 hop-0.020
 pile-0.020
 nose-0.019
 spaghetti-0.019
Tokenaman
Token ID10546
Feature activation-6.44e+00

Pos logit contributions
Lay+0.025
leaf+0.025
bel+0.024
ager+0.024
 singer+0.024
Neg logit contributions
 chew-0.028
 pile-0.024
 spaghetti-0.024
 hop-0.024
 carrot-0.024
Tokentha
Token ID12898
Feature activation-1.45e+00

Pos logit contributions
leaf+0.155
Lay+0.136
gr+0.130
Ag+0.126
Ap+0.124
Neg logit contributions
 nose-0.176
 chew-0.168
 pile-0.160
 jump-0.159
 carrot-0.152
Token was
Token ID373
Feature activation+1.40e-01

Pos logit contributions
Lay+0.044
leaf+0.041
ager+0.041
ve+0.041
iring+0.040
Neg logit contributions
 chew-0.031
 hop-0.026
 pile-0.025
 nose-0.025
 spaghetti-0.025
Token a
Token ID257
Feature activation-1.17e+00

Pos logit contributions
 chew+0.004
 hop+0.003
 spaghetti+0.003
 pile+0.003
 nose+0.003
Neg logit contributions
Lay-0.004
spe-0.003
ager-0.003
la-0.003
leaf-0.003
Token lazy
Token ID16931
Feature activation+2.15e-01

Pos logit contributions
Lay+0.026
May+0.024
ager+0.024
elia+0.024
ve+0.024
Neg logit contributions
 chew-0.033
 hop-0.029
 spaghetti-0.028
 pile-0.028
 nose-0.028
Token girl
Token ID2576
Feature activation+2.07e-01

Pos logit contributions
 chew+0.006
 hop+0.005
 bisc+0.005
 spaghetti+0.005
 pile+0.005
Neg logit contributions
 singer-0.005
Lay-0.004
elia-0.004
la-0.004
ager-0.004
Token came
Token ID1625
Feature activation+6.57e-01

Pos logit contributions
Lay+0.021
ager+0.020
la+0.020
ve+0.020
spe+0.020
Neg logit contributions
 chew-0.020
 hop-0.017
 spaghetti-0.017
 pile-0.017
 snowball-0.017
Token back
Token ID736
Feature activation-1.50e-01

Pos logit contributions
 chew+0.017
 hop+0.015
 pile+0.015
 nose+0.015
 jump+0.014
Neg logit contributions
Lay-0.016
ager-0.016
leaf-0.016
ve-0.015
la-0.015
Token.
Token ID13
Feature activation-8.32e-01

Pos logit contributions
Lay+0.004
leaf+0.004
ager+0.004
ve+0.004
Ap+0.004
Neg logit contributions
 chew-0.003
 hop-0.003
 pile-0.003
 nose-0.003
 jump-0.003
Token
Token ID198
Feature activation-4.53e-01

Pos logit contributions
Lay+0.023
ve+0.022
ager+0.022
leaf+0.022
la+0.022
Neg logit contributions
 chew-0.019
 hop-0.016
 pile-0.016
 spaghetti-0.016
 nose-0.015
Token
Token ID198
Feature activation-9.65e-01

Pos logit contributions
Lay+0.014
ager+0.013
la+0.013
ve+0.013
leaf+0.013
Neg logit contributions
 chew-0.009
 hop-0.008
 pile-0.007
 spaghetti-0.007
 snowball-0.007
TokenTHE
Token ID10970
Feature activation-6.23e+00

Pos logit contributions
Lay+0.022
overed+0.022
la+0.022
 singer+0.022
leaf+0.022
Neg logit contributions
 chew-0.026
 hop-0.023
 snowball-0.022
 jump-0.021
 spaghetti-0.021
Token<|endoftext|>
Token ID50256
Feature activation-4.91e-01

Pos logit contributions
iring+0.270
ires+0.263
leaf+0.257
ind+0.256
essa+0.252
Neg logit contributions
 chew-0.059
 hop-0.047
 jump-0.043
 nose-0.037
 pile-0.030
TokenOnce
Token ID7454
Feature activation-1.54e+00

Pos logit contributions
Lay+0.009
May+0.008
ager+0.008
ve+0.008
la+0.008
Neg logit contributions
 chew-0.016
 pile-0.014
 nose-0.014
 hop-0.014
 spaghetti-0.014
Token upon
Token ID2402
Feature activation-6.89e-01

Pos logit contributions
leaf+0.044
ager+0.042
beh+0.039
Mad+0.039
Lay+0.039
Neg logit contributions
 chew-0.034
 pile-0.032
 hop-0.031
 spaghetti-0.030
 jump-0.030
Token a
Token ID257
Feature activation+6.10e-01

Pos logit contributions
Lay+0.022
spe+0.021
la+0.021
ve+0.020
ager+0.020
Neg logit contributions
 chew-0.014
 hop-0.012
 spaghetti-0.011
 snowball-0.011
 carrot-0.011
Token time
Token ID640
Feature activation+6.57e-01

Pos logit contributions
 chew+0.012
 pile+0.011
 nose+0.011
 stack+0.010
 puck+0.010
Neg logit contributions
ager-0.018
Lay-0.017
May-0.016
ve-0.016
elia-0.016
Token strawberries
Token ID36973
Feature activation-9.11e-01

Pos logit contributions
 chew+0.014
 hop+0.012
 snowball+0.012
 pile+0.012
 nose+0.011
Neg logit contributions
Lay-0.014
la-0.013
 singer-0.013
ager-0.013
iring-0.013
Token!"
Token ID2474
Feature activation-7.54e-01

Pos logit contributions
Lay+0.021
ve+0.020
la+0.020
ager+0.020
leaf+0.019
Neg logit contributions
 chew-0.025
 hop-0.022
 snowball-0.022
 pile-0.021
 nose-0.021
Token
Token ID198
Feature activation-5.31e-01

Pos logit contributions
Lay+0.019
ager+0.017
ve+0.017
la+0.017
leaf+0.017
Neg logit contributions
 chew-0.020
 hop-0.017
 pile-0.017
 spaghetti-0.017
 nose-0.017
Token
Token ID198
Feature activation-1.21e+00

Pos logit contributions
Lay+0.014
spe+0.013
leaf+0.013
ager+0.013
la+0.013
Neg logit contributions
 chew-0.013
 hop-0.011
 pile-0.011
 snowball-0.011
 spaghetti-0.011
TokenS
Token ID50
Feature activation-1.63e+00

Pos logit contributions
Lay+0.022
 singer+0.020
leaf+0.020
overed+0.020
spe+0.020
Neg logit contributions
 chew-0.039
 hop-0.035
 pile-0.035
 snowball-0.034
 carrot-0.034
Tokenaman
Token ID10546
Feature activation-6.20e+00

Pos logit contributions
bel+0.036
Lay+0.035
 singer+0.035
spe+0.034
May+0.033
Neg logit contributions
 chew-0.043
 bisc-0.040
 carrot-0.039
 pile-0.038
 snack-0.038
Tokentha
Token ID12898
Feature activation-1.94e+00

Pos logit contributions
leaf+0.130
Lay+0.119
May+0.113
ires+0.107
Mad+0.107
Neg logit contributions
 nose-0.184
 jump-0.166
 chew-0.165
 pile-0.164
 patch-0.157
Token and
Token ID290
Feature activation+9.86e-01

Pos logit contributions
Lay+0.064
Mad+0.062
ires+0.061
Ag+0.061
iring+0.061
Neg logit contributions
 chew-0.034
 hop-0.030
 nose-0.030
 pile-0.029
 jump-0.028
Token Dan
Token ID6035
Feature activation-7.11e-01

Pos logit contributions
 chew+0.023
 pile+0.020
 spaghetti+0.019
 hop+0.019
 snowball+0.019
Neg logit contributions
Lay-0.028
la-0.026
leaf-0.026
ager-0.026
ve-0.025
Token thought
Token ID1807
Feature activation-1.66e+00

Pos logit contributions
Lay+0.019
iring+0.018
Mad+0.018
leaf+0.018
ve+0.018
Neg logit contributions
 chew-0.017
 hop-0.015
 pile-0.015
 nose-0.014
 jump-0.014
Token it
Token ID340
Feature activation+1.24e-01

Pos logit contributions
Lay+0.036
la+0.033
ve+0.033
leaf+0.033
ager+0.033
Neg logit contributions
 chew-0.048
 pile-0.042
 hop-0.042
 nose-0.041
 snowball-0.041
TokenB
Token ID33
Feature activation-4.24e-01

Pos logit contributions
 chew+0.026
 hop+0.025
 pile+0.024
 jump+0.023
 snowball+0.023
Neg logit contributions
Lay-0.024
leaf-0.024
la-0.024
elia-0.023
ires-0.023
Tokenella
Token ID12627
Feature activation-4.72e-01

Pos logit contributions
Lay+0.010
leaf+0.010
Mad+0.010
Ap+0.009
 singer+0.009
Neg logit contributions
 chew-0.011
 hop-0.010
 pile-0.010
 jump-0.010
 nose-0.009
Token watched
Token ID7342
Feature activation+6.52e-01

Pos logit contributions
Lay+0.013
iring+0.013
ires+0.013
leaf+0.013
Mad+0.012
Neg logit contributions
 chew-0.011
 hop-0.010
 pile-0.009
 nose-0.009
 jump-0.009
Token as
Token ID355
Feature activation+6.28e-01

Pos logit contributions
 chew+0.018
 pile+0.016
 hop+0.016
 bisc+0.015
 spaghetti+0.015
Neg logit contributions
Lay-0.015
overed-0.014
ve-0.014
 singer-0.014
leaf-0.014
Token the
Token ID262
Feature activation-1.03e+00

Pos logit contributions
 chew+0.015
 hop+0.014
 jump+0.013
 spaghetti+0.012
 snowball+0.012
Neg logit contributions
ager-0.018
lor-0.017
ve-0.017
Lay-0.017
leaf-0.017
Token sur
Token ID969
Feature activation-6.10e+00

Pos logit contributions
Lay+0.031
la+0.030
elia+0.030
ve+0.030
ager+0.029
Neg logit contributions
 chew-0.021
 hop-0.019
 jump-0.018
 pile-0.018
 nose-0.017
Tokenfer
Token ID2232
Feature activation-6.99e-01

Pos logit contributions
ires+0.139
Lay+0.131
iring+0.130
elia+0.130
 yourselves+0.126
Neg logit contributions
 chew-0.155
 skate-0.144
 reel-0.141
 jump-0.137
 hop-0.135
Token rode
Token ID22075
Feature activation+3.14e-02

Pos logit contributions
Lay+0.016
 singer+0.016
ager+0.016
la+0.016
spe+0.016
Neg logit contributions
 chew-0.017
 spaghetti-0.017
 snowball-0.016
 bisc-0.015
 carrot-0.015
Token on
Token ID319
Feature activation+6.65e-02

Pos logit contributions
 chew+0.001
 hop+0.001
 pile+0.001
 snowball+0.001
 nose+0.001
Neg logit contributions
Lay-0.001
ager-0.001
ve-0.001
leaf-0.001
elia-0.001
Token the
Token ID262
Feature activation-2.51e-01

Pos logit contributions
 chew+0.002
 hop+0.002
 spaghetti+0.002
 bisc+0.002
 nose+0.002
Neg logit contributions
Lay-0.002
la-0.001
spe-0.001
May-0.001
Ag-0.001
Token waves
Token ID9813
Feature activation+1.26e-01

Pos logit contributions
Lay+0.007
leaf+0.007
ager+0.006
la+0.006
ve+0.006
Neg logit contributions
 chew-0.006
 hop-0.005
 spaghetti-0.005
 snowball-0.005
 pile-0.005
Token Peter
Token ID5613
Feature activation-9.59e-01

Pos logit contributions
 chew+0.016
 hop+0.013
 pile+0.013
 spaghetti+0.013
 nose+0.013
Neg logit contributions
Lay-0.028
leaf-0.027
ager-0.027
ve-0.027
la-0.027
Token wasn
Token ID2492
Feature activation-2.26e+00

Pos logit contributions
Lay+0.026
ires+0.024
leaf+0.024
iring+0.024
ager+0.024
Neg logit contributions
 chew-0.023
 hop-0.020
 pile-0.020
 nose-0.020
 jump-0.019
Token't
Token ID470
Feature activation-5.95e-01

Pos logit contributions
ager+0.044
Lay+0.044
elia+0.042
ires+0.042
bel+0.041
Neg logit contributions
 chew-0.070
 hop-0.064
 jump-0.062
 snowball-0.059
 pile-0.059
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
Lay+0.017
ires+0.017
ager+0.017
overed+0.017
iring+0.016
Neg logit contributions
 chew-0.013
 hop-0.011
 jump-0.011
 pile-0.010
 snowball-0.010
Token greatest
Token ID6000
Feature activation-1.01e+00

Pos logit contributions
Lay+0.032
ager+0.030
la+0.030
leaf+0.030
ve+0.030
Neg logit contributions
 chew-0.032
 hop-0.028
 pile-0.028
 spaghetti-0.027
 nose-0.027
Token sur
Token ID969
Feature activation-6.06e+00

Pos logit contributions
Lay+0.025
ager+0.023
la+0.023
ve+0.023
leaf+0.023
Neg logit contributions
 chew-0.026
 hop-0.023
 pile-0.023
 spaghetti-0.022
 nose-0.022
Tokenfer
Token ID2232
Feature activation-1.43e+00

Pos logit contributions
Lay+0.127
ires+0.127
endant+0.125
elia+0.124
iring+0.118
Neg logit contributions
 chew-0.158
 skate-0.150
 jump-0.145
 hop-0.143
 reel-0.140
Token,
Token ID11
Feature activation+3.98e-01

Pos logit contributions
Lay+0.034
leaf+0.031
iring+0.031
ires+0.031
Mad+0.031
Neg logit contributions
 chew-0.039
 hop-0.034
 pile-0.033
 spaghetti-0.033
 nose-0.033
Token but
Token ID475
Feature activation-8.16e-01

Pos logit contributions
 chew+0.009
 hop+0.008
 pile+0.007
 spaghetti+0.007
 nose+0.007
Neg logit contributions
Lay-0.011
ager-0.011
la-0.011
leaf-0.011
ve-0.011
Token he
Token ID339
Feature activation-4.11e-01

Pos logit contributions
Lay+0.025
ve+0.025
la+0.024
ager+0.024
Mad+0.024
Neg logit contributions
 chew-0.017
 hop-0.014
 jump-0.013
 pile-0.013
 carrot-0.012
Token loved
Token ID6151
Feature activation-2.11e-01

Pos logit contributions
Lay+0.012
ires+0.011
iring+0.011
Mad+0.011
leaf+0.011
Neg logit contributions
 chew-0.010
 hop-0.008
 pile-0.008
 jump-0.008
 nose-0.008
Token again
Token ID757
Feature activation-2.16e-01

Pos logit contributions
Lay+0.007
 singer+0.006
overed+0.006
May+0.006
ager+0.006
Neg logit contributions
 chew-0.012
 nose-0.010
 bisc-0.010
 pile-0.010
 carrot-0.010
Token tomorrow
Token ID9439
Feature activation+2.21e-01

Pos logit contributions
Lay+0.005
ve+0.005
la+0.005
ager+0.005
May+0.005
Neg logit contributions
 chew-0.006
 snowball-0.005
 bisc-0.005
 pile-0.005
 spaghetti-0.005
Token.
Token ID13
Feature activation-7.82e-01

Pos logit contributions
 chew+0.005
 hop+0.005
 pile+0.005
 snowball+0.005
 spaghetti+0.004
Neg logit contributions
Lay-0.006
la-0.005
ager-0.005
ve-0.005
spe-0.005
Token
Token ID198
Feature activation+1.50e-01

Pos logit contributions
Lay+0.022
ager+0.020
la+0.020
ve+0.020
leaf+0.020
Neg logit contributions
 chew-0.018
 hop-0.015
 pile-0.015
 spaghetti-0.015
 nose-0.015
Token
Token ID198
Feature activation-3.64e-01

Pos logit contributions
 chew+0.004
 pile+0.003
 hop+0.003
 nose+0.003
 spaghetti+0.003
Neg logit contributions
Lay-0.004
la-0.004
ve-0.004
ager-0.004
May-0.004
TokenTHE
Token ID10970
Feature activation-6.04e+00

Pos logit contributions
Lay+0.008
la+0.008
ve+0.008
leaf+0.008
overed+0.008
Neg logit contributions
 chew-0.010
 hop-0.009
 pile-0.009
 spaghetti-0.009
 snowball-0.009
Token<|endoftext|>
Token ID50256
Feature activation-3.15e-01

Pos logit contributions
iring+0.241
ires+0.233
Lay+0.233
leaf+0.228
elia+0.228
Neg logit contributions
 chew-0.075
 hop-0.060
 jump-0.055
 nose-0.054
 pile-0.047
TokenOnce
Token ID7454
Feature activation-1.43e+00

Pos logit contributions
Lay+0.006
May+0.005
ager+0.005
ve+0.005
la+0.005
Neg logit contributions
 chew-0.010
 nose-0.009
 pile-0.009
 hop-0.009
 spaghetti-0.009
Token upon
Token ID2402
Feature activation-6.53e-01

Pos logit contributions
leaf+0.040
ager+0.037
Lay+0.036
Mad+0.035
beh+0.035
Neg logit contributions
 chew-0.033
 pile-0.031
 hop-0.030
 spaghetti-0.030
 carrot-0.029
Token a
Token ID257
Feature activation+8.67e-01

Pos logit contributions
Lay+0.020
spe+0.019
la+0.019
ve+0.019
ager+0.019
Neg logit contributions
 chew-0.013
 hop-0.011
 spaghetti-0.011
 snowball-0.011
 pile-0.011
Token time
Token ID640
Feature activation+7.04e-01

Pos logit contributions
 chew+0.018
 pile+0.016
 nose+0.016
 stack+0.016
 puck+0.015
Neg logit contributions
ager-0.024
Lay-0.023
May-0.022
elia-0.022
ve-0.022
Token hands
Token ID2832
Feature activation+1.51e+00

Pos logit contributions
 chew+0.007
 hop+0.006
 nose+0.006
 pile+0.005
 jump+0.005
Neg logit contributions
Lay-0.016
Mad-0.015
la-0.015
leaf-0.015
iring-0.015
Token.
Token ID13
Feature activation+1.68e+00

Pos logit contributions
 chew+0.042
 pile+0.037
 snowball+0.037
 meat+0.037
 spaghetti+0.036
Neg logit contributions
Lay-0.032
overed-0.030
ve-0.030
la-0.030
spe-0.030
Token
Token ID198
Feature activation+1.26e+00

Pos logit contributions
 chew+0.040
 hop+0.035
 pile+0.034
 spaghetti+0.033
 nose+0.033
Neg logit contributions
Lay-0.045
ager-0.044
leaf-0.043
ve-0.043
la-0.043
Token
Token ID198
Feature activation+5.70e-01

Pos logit contributions
 chew+0.034
 pile+0.030
 nose+0.030
 hop+0.030
 spaghetti+0.030
Neg logit contributions
Lay-0.031
ve-0.028
ager-0.027
leaf-0.027
May-0.027
TokenThe
Token ID464
Feature activation-4.22e-01

Pos logit contributions
 chew+0.016
 hop+0.015
 pile+0.014
 jump+0.014
 snowball+0.013
Neg logit contributions
Lay-0.013
la-0.013
elia-0.012
leaf-0.012
essa-0.012
Token sur
Token ID969
Feature activation-5.97e+00

Pos logit contributions
Lay+0.011
la+0.011
elia+0.011
ve+0.011
overed+0.011
Neg logit contributions
 chew-0.010
 hop-0.009
 pile-0.009
 jump-0.009
 nose-0.008
Tokenfer
Token ID2232
Feature activation-1.20e+00

Pos logit contributions
ires+0.143
elia+0.137
endant+0.131
atalie+0.130
 Kirst+0.129
Neg logit contributions
 skate-0.128
 chew-0.124
 reel-0.120
 jump-0.113
 hop-0.113
Token saw
Token ID2497
Feature activation+1.54e+00

Pos logit contributions
Lay+0.029
ager+0.027
la+0.027
spe+0.027
ve+0.026
Neg logit contributions
 chew-0.032
 spaghetti-0.028
 hop-0.028
 pile-0.028
 snowball-0.027
Token Bella
Token ID39304
Feature activation+4.67e-01

Pos logit contributions
 chew+0.038
 pile+0.032
 hop+0.032
 nose+0.032
 snowball+0.031
Neg logit contributions
Lay-0.041
la-0.038
iring-0.037
leaf-0.037
ve-0.037
Token and
Token ID290
Feature activation+1.12e+00

Pos logit contributions
 chew+0.011
 bisc+0.011
 spaghetti+0.011
 snowball+0.010
 pile+0.010
Neg logit contributions
Lay-0.011
spe-0.011
 singer-0.010
la-0.010
overed-0.010
Token waved
Token ID26834
Feature activation-6.76e-01

Pos logit contributions
 chew+0.028
 hop+0.025
 pile+0.024
 nose+0.024
 spaghetti+0.023
Neg logit contributions
Lay-0.029
ve-0.028
ager-0.028
leaf-0.027
la-0.027
Token He
Token ID679
Feature activation+2.58e-01

Pos logit contributions
Lay+0.013
spe+0.012
la+0.012
Ap+0.012
ires+0.012
Neg logit contributions
 chew-0.009
 hop-0.008
 nose-0.008
 pile-0.008
 carrot-0.008
Token was
Token ID373
Feature activation+3.97e-02

Pos logit contributions
 chew+0.007
 hop+0.006
 pile+0.006
 spaghetti+0.006
 nose+0.006
Neg logit contributions
Lay-0.006
ager-0.005
la-0.005
ve-0.005
leaf-0.005
Token a
Token ID257
Feature activation-8.73e-01

Pos logit contributions
 chew+0.001
 hop+0.001
 pile+0.001
 spaghetti+0.001
 nose+0.001
Neg logit contributions
Lay-0.001
ager-0.001
la-0.001
ve-0.001
leaf-0.001
Token very
Token ID845
Feature activation+2.66e-01

Pos logit contributions
Lay+0.023
la+0.021
ve+0.021
ager+0.021
leaf+0.021
Neg logit contributions
 chew-0.022
 hop-0.019
 pile-0.018
 spaghetti-0.018
 nose-0.018
Token successful
Token ID4388
Feature activation-1.99e+00

Pos logit contributions
 chew+0.007
 hop+0.006
 pile+0.005
 jump+0.005
 spaghetti+0.005
Neg logit contributions
Lay-0.007
la-0.007
overed-0.007
Mad-0.007
iring-0.007
Token clim
Token ID5424
Feature activation-5.95e+00

Pos logit contributions
Lay+0.047
ager+0.044
la+0.043
ve+0.043
leaf+0.043
Neg logit contributions
 chew-0.054
 hop-0.048
 pile-0.047
 spaghetti-0.046
 nose-0.046
Tokenber
Token ID527
Feature activation-1.11e+00

Pos logit contributions
Lay+0.102
Ap+0.098
 yourselves+0.092
ires+0.092
Ag+0.090
Neg logit contributions
 chew-0.183
 skate-0.178
 jump-0.173
 hop-0.171
 stack-0.169
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
Lay+0.028
ager+0.026
la+0.026
ve+0.026
spe+0.026
Neg logit contributions
 chew-0.028
 hop-0.025
 pile-0.024
 spaghetti-0.024
 snowball-0.024
Token Every
Token ID3887
Feature activation+3.17e-01

Pos logit contributions
Lay+0.006
spe+0.006
Mad+0.006
May+0.006
Ap+0.006
Neg logit contributions
 chew-0.004
 hop-0.004
 nose-0.004
 carrot-0.004
 pile-0.004
Token day
Token ID1110
Feature activation-9.40e-01

Pos logit contributions
 chew+0.008
 bisc+0.007
 pile+0.007
 nose+0.007
 hop+0.007
Neg logit contributions
ager-0.008
 singer-0.008
la-0.008
spe-0.007
Lay-0.007
Token he
Token ID339
Feature activation+3.92e-01

Pos logit contributions
Lay+0.026
spe+0.025
la+0.025
 singer+0.024
Mad+0.024
Neg logit contributions
 chew-0.021
 hop-0.018
 pile-0.018
 snowball-0.018
 spaghetti-0.018
Token he
Token ID339
Feature activation+7.53e-02

Pos logit contributions
 chew+0.026
 hop+0.022
 jump+0.020
 pile+0.020
 snowball+0.020
Neg logit contributions
Lay-0.031
leaf-0.031
ager-0.030
la-0.030
elia-0.029
Token could
Token ID714
Feature activation-5.15e-01

Pos logit contributions
 chew+0.002
 hop+0.002
 pile+0.002
 spaghetti+0.002
 nose+0.002
Neg logit contributions
Lay-0.002
ager-0.002
la-0.002
ve-0.002
leaf-0.002
Token be
Token ID307
Feature activation+4.78e-01

Pos logit contributions
Lay+0.016
 singer+0.015
ve+0.014
Mad+0.014
ager+0.014
Neg logit contributions
 spaghetti-0.009
 chew-0.009
 bisc-0.009
 meat-0.009
 snowball-0.008
Token a
Token ID257
Feature activation-1.20e+00

Pos logit contributions
 chew+0.012
 pile+0.011
 hop+0.011
 spaghetti+0.011
 nose+0.010
Neg logit contributions
Lay-0.012
ager-0.011
la-0.011
ve-0.011
leaf-0.011
Token better
Token ID1365
Feature activation-7.26e-01

Pos logit contributions
Lay+0.031
ager+0.029
ve+0.028
la+0.028
leaf+0.028
Neg logit contributions
 chew-0.030
 hop-0.026
 pile-0.026
 spaghetti-0.026
 nose-0.025
Token sur
Token ID969
Feature activation-5.92e+00

Pos logit contributions
Lay+0.019
la+0.018
leaf+0.018
ve+0.018
ager+0.018
Neg logit contributions
 chew-0.018
 hop-0.016
 pile-0.015
 spaghetti-0.015
 nose-0.015
Tokenfer
Token ID2232
Feature activation-8.38e-01

Pos logit contributions
Lay+0.108
ires+0.104
elia+0.099
iring+0.098
la+0.093
Neg logit contributions
 chew-0.194
 hop-0.175
 skate-0.171
 jump-0.170
 nose-0.165
Token.
Token ID13
Feature activation+5.78e-01

Pos logit contributions
Lay+0.021
ager+0.020
la+0.020
ve+0.020
spe+0.019
Neg logit contributions
 chew-0.021
 pile-0.018
 hop-0.018
 spaghetti-0.018
 snowball-0.018
Token He
Token ID679
Feature activation+7.41e-02

Pos logit contributions
 chew+0.014
 hop+0.012
 pile+0.012
 spaghetti+0.011
 nose+0.011
Neg logit contributions
Lay-0.016
ager-0.015
la-0.015
ve-0.015
leaf-0.015
Token hurried
Token ID31662
Feature activation+9.27e-01

Pos logit contributions
 chew+0.002
 spaghetti+0.002
 hop+0.002
 pile+0.002
 carrot+0.002
Neg logit contributions
Lay-0.002
la-0.002
spe-0.002
ager-0.002
ve-0.002
Token to
Token ID284
Feature activation-5.30e-01

Pos logit contributions
 chew+0.022
 hop+0.020
 pile+0.019
 jump+0.019
 spaghetti+0.019
Neg logit contributions
Lay-0.024
ager-0.023
Ap-0.023
leaf-0.023
ires-0.023
Token alone
Token ID3436
Feature activation+9.97e-01

Pos logit contributions
 chew+0.023
 hop+0.021
 pile+0.020
 spaghetti+0.020
 snowball+0.020
Neg logit contributions
Lay-0.020
iring-0.018
ve-0.018
la-0.018
spe-0.018
Token,
Token ID11
Feature activation+1.16e+00

Pos logit contributions
 chew+0.024
 hop+0.020
 pile+0.020
 spaghetti+0.020
 nose+0.019
Neg logit contributions
Lay-0.027
ve-0.025
ager-0.025
la-0.025
spe-0.025
Token without
Token ID1231
Feature activation-7.33e-01

Pos logit contributions
 chew+0.025
 pile+0.022
 spaghetti+0.022
 hop+0.021
 bisc+0.021
Neg logit contributions
Lay-0.032
spe-0.030
la-0.029
May-0.029
Ap-0.029
Token a
Token ID257
Feature activation-9.12e-01

Pos logit contributions
Lay+0.016
la+0.015
leaf+0.015
May+0.015
elia+0.015
Neg logit contributions
 chew-0.018
 pile-0.018
 bisc-0.017
 hop-0.017
 stack-0.017
Token cool
Token ID3608
Feature activation+3.71e-01

Pos logit contributions
Lay+0.024
spe+0.023
 singer+0.023
ve+0.023
bel+0.023
Neg logit contributions
 chew-0.019
 spaghetti-0.018
 hop-0.016
 meat-0.015
 pile-0.015
Token hairst
Token ID42658
Feature activation-5.88e+00

Pos logit contributions
 chew+0.007
 spaghetti+0.006
 pile+0.006
 hop+0.006
 snowball+0.006
Neg logit contributions
Lay-0.011
 singer-0.010
bel-0.010
spe-0.010
ager-0.010
Tokenyle
Token ID2349
Feature activation-1.24e+00

Pos logit contributions
Lay+0.098
Ag+0.090
Ap+0.090
ager+0.087
atalie+0.085
Neg logit contributions
 chew-0.172
 skate-0.162
 jump-0.154
 nose-0.153
 snowball-0.151
Token and
Token ID290
Feature activation+4.24e-01

Pos logit contributions
Lay+0.037
Ap+0.034
iring+0.034
Mad+0.034
elia+0.034
Neg logit contributions
 chew-0.027
 hop-0.024
 nose-0.022
 jump-0.022
 pile-0.022
Token without
Token ID1231
Feature activation-8.70e-01

Pos logit contributions
 chew+0.010
 hop+0.009
 pile+0.009
 snowball+0.009
 spaghetti+0.009
Neg logit contributions
Lay-0.011
la-0.010
leaf-0.010
ager-0.010
ve-0.010
Token a
Token ID257
Feature activation-8.53e-01

Pos logit contributions
Lay+0.020
la+0.019
May+0.019
leaf+0.018
elia+0.018
Neg logit contributions
 chew-0.022
 pile-0.021
 snowball-0.020
 hop-0.020
 bisc-0.020
Token solution
Token ID4610
Feature activation-1.70e+00

Pos logit contributions
Lay+0.025
ve+0.023
ager+0.023
la+0.023
leaf+0.023
Neg logit contributions
 chew-0.018
 hop-0.015
 spaghetti-0.015
 pile-0.015
 snowball-0.015
Token pond
Token ID16723
Feature activation+1.26e+00

Pos logit contributions
 chew+0.014
 hop+0.011
 snowball+0.011
 spaghetti+0.011
 carrot+0.011
Neg logit contributions
ager-0.018
Lay-0.018
 singer-0.017
elia-0.017
ve-0.017
Token?"
Token ID1701
Feature activation-1.91e-01

Pos logit contributions
 chew+0.042
 snowball+0.038
 hop+0.037
 bisc+0.036
 tumble+0.036
Neg logit contributions
bel-0.018
iring-0.018
spe-0.017
ager-0.017
han-0.017
Token
Token ID198
Feature activation+4.87e-01

Pos logit contributions
Lay+0.005
ager+0.004
leaf+0.004
Ap+0.004
spe+0.004
Neg logit contributions
 chew-0.005
 hop-0.004
 pile-0.004
 spaghetti-0.004
 nose-0.004
Token
Token ID198
Feature activation-3.27e-01

Pos logit contributions
 chew+0.011
 hop+0.010
 snowball+0.009
 pile+0.009
 jump+0.009
Neg logit contributions
Lay-0.014
la-0.013
overed-0.013
ve-0.013
iring-0.013
TokenThe
Token ID464
Feature activation-5.20e-01

Pos logit contributions
Lay+0.008
la+0.008
overed+0.008
ve+0.008
iring+0.008
Neg logit contributions
 chew-0.009
 hop-0.008
 snowball-0.007
 pile-0.007
 jump-0.007
Token gol
Token ID43791
Feature activation-5.87e+00

Pos logit contributions
la+0.011
Lay+0.011
elia+0.010
ve+0.010
overed+0.010
Neg logit contributions
 chew-0.016
 hop-0.014
 pile-0.014
 jump-0.014
 puck-0.014
Tokenfer
Token ID2232
Feature activation-3.37e-01

Pos logit contributions
ires+0.123
la+0.116
iring+0.115
elia+0.114
Lay+0.113
Neg logit contributions
 chew-0.169
 hop-0.157
 puck-0.155
 bone-0.152
 fetch-0.150
Token replied
Token ID8712
Feature activation-4.44e-01

Pos logit contributions
Lay+0.008
ager+0.008
ve+0.008
la+0.007
spe+0.007
Neg logit contributions
 chew-0.009
 spaghetti-0.008
 hop-0.008
 pile-0.008
 snowball-0.008
Token,
Token ID11
Feature activation+9.96e-01

Pos logit contributions
Lay+0.016
leaf+0.015
la+0.015
ager+0.015
ve+0.015
Neg logit contributions
 chew-0.007
 hop-0.006
 nose-0.005
 pile-0.005
 spaghetti-0.005
Token "
Token ID366
Feature activation-1.26e+00

Pos logit contributions
 chew+0.022
 hop+0.018
 nose+0.018
 pile+0.018
 spaghetti+0.018
Neg logit contributions
Lay-0.029
la-0.028
leaf-0.028
ager-0.028
spe-0.027
TokenI
Token ID40
Feature activation-1.25e-02

Pos logit contributions
elia+0.025
ve+0.025
la+0.024
overed+0.024
Lay+0.024
Neg logit contributions
 chew-0.038
 hop-0.034
 puck-0.033
 snowball-0.033
 skate-0.032
Token but
Token ID475
Feature activation-8.17e-01

Pos logit contributions
 chew+0.011
 pile+0.009
 snowball+0.009
 bisc+0.009
 spaghetti+0.009
Neg logit contributions
Lay-0.016
Ap-0.015
la-0.015
Ag-0.015
May-0.015
Token she
Token ID673
Feature activation-3.23e-01

Pos logit contributions
Lay+0.016
ve+0.015
ager+0.015
leaf+0.015
la+0.015
Neg logit contributions
 chew-0.026
 hop-0.023
 pile-0.023
 spaghetti-0.022
 nose-0.022
Token liked
Token ID8288
Feature activation+3.28e-01

Pos logit contributions
Lay+0.008
leaf+0.007
ve+0.007
la+0.007
ager+0.007
Neg logit contributions
 chew-0.009
 hop-0.008
 pile-0.008
 spaghetti-0.007
 nose-0.007
Token watching
Token ID4964
Feature activation+6.19e-02

Pos logit contributions
 chew+0.009
 hop+0.008
 pile+0.008
 nose+0.008
 snowball+0.008
Neg logit contributions
Lay-0.008
la-0.007
leaf-0.007
 singer-0.007
spe-0.007
Token the
Token ID262
Feature activation-2.15e-01

Pos logit contributions
 chew+0.002
 bisc+0.002
 pile+0.002
 hop+0.001
 spaghetti+0.001
Neg logit contributions
Lay-0.001
 singer-0.001
la-0.001
ala-0.001
May-0.001
Token sur
Token ID969
Feature activation-5.86e+00

Pos logit contributions
Lay+0.006
ager+0.006
la+0.006
ve+0.006
elia+0.006
Neg logit contributions
 chew-0.005
 hop-0.004
 pile-0.004
 spaghetti-0.004
 nose-0.004
Tokenfer
Token ID2232
Feature activation-6.99e-01

Pos logit contributions
Lay+0.113
elia+0.107
ires+0.105
irst+0.099
la+0.099
Neg logit contributions
 chew-0.173
 skate-0.161
 jump-0.156
 hop-0.154
 reel-0.153
Token.
Token ID13
Feature activation+9.92e-01

Pos logit contributions
 singer+0.017
Lay+0.017
spe+0.016
bel+0.016
ager+0.016
Neg logit contributions
 chew-0.015
 spaghetti-0.015
 bisc-0.015
 snowball-0.014
 meat-0.014
Token Whenever
Token ID21326
Feature activation-3.35e-01

Pos logit contributions
 chew+0.025
 hop+0.023
 pile+0.022
 spaghetti+0.022
 nose+0.022
Neg logit contributions
Lay-0.025
ager-0.024
ve-0.024
leaf-0.023
la-0.023
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
Lay+0.006
ager+0.006
leaf+0.006
ve+0.006
la+0.006
Neg logit contributions
 chew-0.011
 hop-0.010
 pile-0.010
 spaghetti-0.010
 nose-0.009
Token sur
Token ID969
Feature activation-6.53e+00

Pos logit contributions
Lay+0.032
elia+0.030
la+0.030
ve+0.030
overed+0.029
Neg logit contributions
 chew-0.029
 hop-0.026
 pile-0.025
 jump-0.025
 nose-0.024
Token was
Token ID373
Feature activation+7.67e-01

Pos logit contributions
Lay+0.029
leaf+0.027
ve+0.027
ager+0.027
Mad+0.027
Neg logit contributions
 chew-0.026
 hop-0.023
 pile-0.022
 nose-0.022
 spaghetti-0.021
Token no
Token ID645
Feature activation+9.66e-01

Pos logit contributions
 chew+0.021
 hop+0.018
 pile+0.018
 spaghetti+0.018
 nose+0.017
Neg logit contributions
Lay-0.018
la-0.017
ager-0.017
ve-0.017
leaf-0.017
Token.
Token ID13
Feature activation+1.70e-01

Pos logit contributions
 chew+0.020
 hop+0.017
 pile+0.017
 spaghetti+0.017
 nose+0.016
Neg logit contributions
Lay-0.029
ager-0.027
leaf-0.027
ve-0.027
la-0.027
Token<|endoftext|>
Token ID50256
Feature activation-2.40e-01

Pos logit contributions
 chew+0.004
 spaghetti+0.003
 hop+0.003
 pile+0.003
 nose+0.003
Neg logit contributions
Lay-0.005
ager-0.005
leaf-0.005
ve-0.005
la-0.005
TokenS
Token ID50
Feature activation-1.13e+00

Pos logit contributions
Lay+0.005
ve+0.004
la+0.004
May+0.004
ager+0.004
Neg logit contributions
 chew-0.008
 hop-0.007
 pile-0.007
 nose-0.007
 spaghetti-0.007
Tokenaman
Token ID10546
Feature activation-5.86e+00

Pos logit contributions
Lay+0.027
la+0.025
ager+0.025
ve+0.025
leaf+0.025
Neg logit contributions
 chew-0.030
 hop-0.027
 pile-0.026
 spaghetti-0.026
 nose-0.026
Tokentha
Token ID12898
Feature activation-1.70e+00

Pos logit contributions
leaf+0.120
Lay+0.113
ires+0.109
Ag+0.101
gr+0.099
Neg logit contributions
 nose-0.177
 chew-0.168
 jump-0.163
 pile-0.162
 snack-0.153
Token opened
Token ID4721
Feature activation-1.72e+00

Pos logit contributions
Lay+0.051
leaf+0.048
iring+0.048
ires+0.048
Mad+0.047
Neg logit contributions
 chew-0.036
 hop-0.031
 pile-0.030
 nose-0.030
 spaghetti-0.029
Token the
Token ID262
Feature activation+3.93e-01

Pos logit contributions
ager+0.051
Lay+0.051
ve+0.050
spe+0.050
leaf+0.048
Neg logit contributions
 chew-0.037
 spaghetti-0.034
 carrot-0.033
 pile-0.032
 nose-0.032
Token door
Token ID3420
Feature activation+3.65e-02

Pos logit contributions
 chew+0.010
 hop+0.009
 pile+0.008
 spaghetti+0.008
 nose+0.008
Neg logit contributions
Lay-0.010
la-0.010
ager-0.010
ve-0.009
leaf-0.009
Token and
Token ID290
Feature activation+4.13e-01

Pos logit contributions
 chew+0.001
 hop+0.001
 snowball+0.001
 spaghetti+0.001
 carrot+0.001
Neg logit contributions
Lay-0.001
la-0.001
iring-0.001
overed-0.001
spe-0.001
Token
Token ID220
Feature activation-8.02e-01

Pos logit contributions
Lay+0.026
ve+0.023
Mad+0.021
overed+0.021
May+0.021
Neg logit contributions
 chew-0.083
 pile-0.078
 nose-0.077
 hop-0.076
 spaghetti-0.076
Token
Token ID198
Feature activation-3.26e-01

Pos logit contributions
Lay+0.022
ager+0.021
leaf+0.021
 singer+0.021
la+0.021
Neg logit contributions
 chew-0.018
 hop-0.016
 spaghetti-0.015
 pile-0.015
 nose-0.015
Token
Token ID198
Feature activation-6.58e-01

Pos logit contributions
Lay+0.009
ager+0.008
la+0.008
ve+0.008
leaf+0.008
Neg logit contributions
 chew-0.008
 hop-0.007
 pile-0.007
 spaghetti-0.007
 snowball-0.007
TokenThe
Token ID464
Feature activation+2.97e-01

Pos logit contributions
Lay+0.015
overed+0.015
la+0.015
iring+0.015
leaf+0.015
Neg logit contributions
 chew-0.018
 hop-0.016
 pile-0.015
 spaghetti-0.015
 snowball-0.015
Token town
Token ID3240
Feature activation-2.38e+00

Pos logit contributions
 chew+0.008
 hop+0.007
 pile+0.007
 spaghetti+0.006
 nose+0.006
Neg logit contributions
Lay-0.007
ager-0.007
ve-0.007
la-0.007
leaf-0.007
Tokenspe
Token ID4125
Feature activation-5.85e+00

Pos logit contributions
Lay+0.057
leaf+0.054
Mad+0.053
ires+0.052
May+0.052
Neg logit contributions
 chew-0.061
 pile-0.057
 hop-0.056
 jump-0.054
 nose-0.054
Tokenople
Token ID643
Feature activation-1.18e+00

Pos logit contributions
Lay+0.114
ala+0.104
iring+0.103
Ag+0.103
atalie+0.102
Neg logit contributions
 chew-0.161
 pile-0.150
 stack-0.148
 nose-0.146
 end-0.145
Token had
Token ID550
Feature activation-1.31e+00

Pos logit contributions
Lay+0.035
leaf+0.032
iring+0.032
ager+0.032
ve+0.032
Neg logit contributions
 chew-0.025
 hop-0.022
 pile-0.021
 nose-0.021
 jump-0.020
Token to
Token ID284
Feature activation-3.25e-01

Pos logit contributions
Lay+0.035
ve+0.032
ager+0.032
la+0.032
leaf+0.032
Neg logit contributions
 chew-0.032
 hop-0.028
 pile-0.027
 spaghetti-0.027
 nose-0.027
Token find
Token ID1064
Feature activation-1.00e-01

Pos logit contributions
Lay+0.010
ager+0.010
spe+0.010
Mad+0.010
leaf+0.010
Neg logit contributions
 chew-0.006
 spaghetti-0.005
 snowball-0.005
 hop-0.005
 carrot-0.005
Token a
Token ID257
Feature activation-7.64e-01

Pos logit contributions
Lay+0.003
la+0.002
leaf+0.002
ager+0.002
ve+0.002
Neg logit contributions
 chew-0.002
 hop-0.002
 pile-0.002
 spaghetti-0.002
 nose-0.002
Token!
Token ID0
Feature activation-1.31e-01

Pos logit contributions
 chew+0.012
 spaghetti+0.012
 carrot+0.011
 nose+0.011
 snowball+0.011
Neg logit contributions
Lay-0.015
ve-0.015
ager-0.014
Mad-0.014
leaf-0.014
Token I
Token ID314
Feature activation-4.64e-01

Pos logit contributions
Lay+0.003
la+0.003
ve+0.003
ager+0.003
bel+0.003
Neg logit contributions
 chew-0.004
 hop-0.003
 spaghetti-0.003
 nose-0.003
 pile-0.003
Token'm
Token ID1101
Feature activation-3.59e-01

Pos logit contributions
Lay+0.012
la+0.011
elia+0.011
overed+0.011
iring+0.011
Neg logit contributions
 chew-0.012
 hop-0.010
 pile-0.010
 jump-0.010
 snowball-0.010
Token the
Token ID262
Feature activation-1.49e+00

Pos logit contributions
Lay+0.010
ager+0.010
spe+0.010
bel+0.010
la+0.010
Neg logit contributions
 chew-0.008
 hop-0.007
 jump-0.007
 spaghetti-0.007
 pile-0.007
Token best
Token ID1266
Feature activation-4.60e-01

Pos logit contributions
Lay+0.037
ager+0.034
ve+0.033
la+0.033
leaf+0.033
Neg logit contributions
 chew-0.040
 hop-0.035
 pile-0.034
 spaghetti-0.034
 nose-0.034
Token gol
Token ID43791
Feature activation-5.78e+00

Pos logit contributions
Lay+0.011
la+0.011
elia+0.011
leaf+0.011
iring+0.011
Neg logit contributions
 chew-0.012
 hop-0.011
 pile-0.011
 jump-0.011
 bone-0.010
Tokenfer
Token ID2232
Feature activation-1.42e+00

Pos logit contributions
Lay+0.115
ires+0.114
la+0.114
iring+0.112
elia+0.110
Neg logit contributions
 chew-0.170
 hop-0.161
 fetch-0.152
 jump-0.150
 puck-0.149
Token in
Token ID287
Feature activation-1.07e+00

Pos logit contributions
Lay+0.037
iring+0.034
ires+0.033
overed+0.033
Mad+0.033
Neg logit contributions
 chew-0.037
 hop-0.034
 jump-0.033
 pile-0.032
 nose-0.032
Token the
Token ID262
Feature activation-2.42e-01

Pos logit contributions
Lay+0.024
ager+0.022
la+0.022
spe+0.022
ve+0.021
Neg logit contributions
 chew-0.030
 hop-0.027
 pile-0.026
 nose-0.026
 spaghetti-0.026
Token world
Token ID995
Feature activation-1.61e+00

Pos logit contributions
Lay+0.006
ager+0.005
Ap+0.005
Ag+0.005
spe+0.005
Neg logit contributions
 chew-0.006
 snowball-0.006
 nose-0.005
 spaghetti-0.005
 hop-0.005
Token and
Token ID290
Feature activation-3.59e-01

Pos logit contributions
Lay+0.045
iring+0.041
Mad+0.041
leaf+0.041
Ap+0.040
Neg logit contributions
 chew-0.038
 pile-0.033
 hop-0.033
 spaghetti-0.032
 jump-0.032
Token.
Token ID13
Feature activation-2.48e-01

Pos logit contributions
Lay+0.020
la+0.019
ager+0.018
leaf+0.018
 singer+0.018
Neg logit contributions
 chew-0.022
 hop-0.019
 snowball-0.019
 pile-0.019
 carrot-0.018
Token You
Token ID921
Feature activation-7.05e-01

Pos logit contributions
Lay+0.006
ve+0.005
ager+0.005
la+0.005
leaf+0.005
Neg logit contributions
 chew-0.007
 hop-0.006
 pile-0.006
 spaghetti-0.006
 nose-0.006
Token are
Token ID389
Feature activation-4.87e-01

Pos logit contributions
Lay+0.019
ve+0.018
ager+0.018
la+0.018
leaf+0.018
Neg logit contributions
 chew-0.017
 spaghetti-0.015
 snowball-0.015
 bisc-0.014
 pile-0.014
Token a
Token ID257
Feature activation-1.64e+00

Pos logit contributions
Lay+0.015
spe+0.014
ager+0.014
overed+0.014
ve+0.014
Neg logit contributions
 chew-0.010
 hop-0.009
 pile-0.009
 jump-0.009
 spaghetti-0.009
Token great
Token ID1049
Feature activation-1.63e+00

Pos logit contributions
Lay+0.047
overed+0.046
iring+0.045
la+0.045
ires+0.044
Neg logit contributions
 chew-0.037
 hop-0.033
 pile-0.032
 snowball-0.031
 spaghetti-0.031
Token sur
Token ID969
Feature activation-5.76e+00

Pos logit contributions
Lay+0.036
overed+0.034
endant+0.034
iring+0.033
la+0.032
Neg logit contributions
 chew-0.047
 hop-0.044
 pile-0.043
 jump-0.043
 stack-0.043
Tokenfer
Token ID2232
Feature activation-1.08e+00

Pos logit contributions
Lay+0.114
ires+0.106
iring+0.102
elia+0.101
endant+0.099
Neg logit contributions
 chew-0.170
 skate-0.159
 hop-0.157
 jump-0.157
 reel-0.150
Token.
Token ID13
Feature activation-1.44e-01

Pos logit contributions
Lay+0.030
ve+0.027
leaf+0.027
la+0.027
ager+0.027
Neg logit contributions
 chew-0.025
 hop-0.022
 pile-0.022
 spaghetti-0.021
 nose-0.021
Token We
Token ID775
Feature activation+5.34e-01

Pos logit contributions
Lay+0.003
ve+0.003
spe+0.003
ager+0.003
bel+0.003
Neg logit contributions
 chew-0.004
 hop-0.004
 spaghetti-0.004
 pile-0.004
 nose-0.004
Token are
Token ID389
Feature activation-5.53e-01

Pos logit contributions
 chew+0.012
 spaghetti+0.010
 hop+0.010
 snowball+0.010
 pile+0.010
Neg logit contributions
Lay-0.015
ve-0.015
ager-0.014
leaf-0.014
la-0.014
Token proud
Token ID6613
Feature activation-1.11e+00

Pos logit contributions
Lay+0.017
spe+0.017
ager+0.016
lor+0.016
overed+0.016
Neg logit contributions
 chew-0.010
 jump-0.010
 hop-0.010
 spaghetti-0.009
 pile-0.009
Token dolls
Token ID36062
Feature activation-6.68e-01

Pos logit contributions
 chew+0.012
 hop+0.010
 pile+0.009
 spaghetti+0.009
 snowball+0.009
Neg logit contributions
Lay-0.016
ve-0.015
bel-0.015
la-0.015
iring-0.015
Token and
Token ID290
Feature activation-1.46e-01

Pos logit contributions
Lay+0.013
ve+0.012
la+0.012
bel+0.012
May+0.012
Neg logit contributions
 chew-0.020
 snowball-0.018
 hop-0.018
 spaghetti-0.018
 pile-0.017
Token pretend
Token ID16614
Feature activation-3.21e+00

Pos logit contributions
Lay+0.005
bel+0.004
 singer+0.004
leaf+0.004
spe+0.004
Neg logit contributions
 chew-0.002
 snowball-0.002
 spaghetti-0.002
 pile-0.002
 hop-0.002
Token they
Token ID484
Feature activation+3.51e-01

Pos logit contributions
Lay+0.068
ager+0.063
ve+0.062
Mad+0.061
leaf+0.061
Neg logit contributions
 chew-0.098
 hop-0.089
 spaghetti-0.088
 pile-0.088
 nose-0.086
Token were
Token ID547
Feature activation+8.77e-02

Pos logit contributions
 chew+0.009
 snowball+0.008
 spaghetti+0.008
 bisc+0.008
 bone+0.008
Neg logit contributions
Lay-0.008
la-0.008
ve-0.008
bel-0.008
spe-0.008
Token mer
Token ID4017
Feature activation-5.69e+00

Pos logit contributions
 chew+0.002
 hop+0.002
 pile+0.002
 spaghetti+0.002
 nose+0.002
Neg logit contributions
Lay-0.002
la-0.002
ve-0.002
ager-0.002
leaf-0.002
Tokenmaid
Token ID23151
Feature activation-2.03e+00

Pos logit contributions
leaf+0.118
Lay+0.114
Ap+0.113
ager+0.111
Mad+0.108
Neg logit contributions
 chew-0.159
 nose-0.153
 hop-0.152
 jump-0.148
 pile-0.146
Tokens
Token ID82
Feature activation+9.38e-01

Pos logit contributions
Lay+0.046
ager+0.043
leaf+0.043
Ap+0.043
Mad+0.043
Neg logit contributions
 chew-0.055
 hop-0.052
 nose-0.051
 pile-0.051
 spaghetti-0.050
Token or
Token ID393
Feature activation-4.30e-01

Pos logit contributions
 chew+0.027
 hop+0.023
 bisc+0.022
 snowball+0.022
 pile+0.022
Neg logit contributions
Lay-0.021
la-0.020
ve-0.020
iring-0.019
 singer-0.019
Token pirates
Token ID27516
Feature activation-1.53e+00

Pos logit contributions
Lay+0.012
la+0.011
leaf+0.011
ager+0.011
ve+0.011
Neg logit contributions
 chew-0.009
 hop-0.008
 pile-0.008
 snowball-0.008
 spaghetti-0.008
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
Lay+0.043
ager+0.040
leaf+0.040
ve+0.040
la+0.040
Neg logit contributions
 chew-0.035
 hop-0.030
 pile-0.030
 spaghetti-0.029
 nose-0.029
Token counts
Token ID9853
Feature activation+2.92e-01

Pos logit contributions
Lay+0.001
ve+0.001
iring+0.001
leaf+0.001
ager+0.001
Neg logit contributions
 chew-0.001
 hop-0.001
 pile-0.001
 spaghetti-0.001
 nose-0.001
Token them
Token ID606
Feature activation+1.66e-01

Pos logit contributions
 chew+0.009
 hop+0.008
 bisc+0.008
 snowball+0.008
 pile+0.008
Neg logit contributions
Lay-0.006
la-0.006
iring-0.005
ires-0.005
ve-0.005
Token every
Token ID790
Feature activation+7.98e-01

Pos logit contributions
 chew+0.005
 snowball+0.004
 bisc+0.004
 carrot+0.004
 spaghetti+0.004
Neg logit contributions
Lay-0.004
ires-0.004
la-0.004
ve-0.003
iring-0.003
Token day
Token ID1110
Feature activation+4.59e-01

Pos logit contributions
 chew+0.018
 bisc+0.015
 hop+0.015
 snowball+0.014
 spaghetti+0.014
Neg logit contributions
 singer-0.021
ager-0.021
spe-0.021
Lay-0.020
la-0.020
Token.
Token ID13
Feature activation+7.08e-01

Pos logit contributions
 chew+0.012
 snowball+0.010
 hop+0.010
 pile+0.010
 carrot+0.010
Neg logit contributions
Lay-0.011
la-0.011
May-0.011
ve-0.011
 singer-0.011
Token One
Token ID1881
Feature activation+3.51e+00

Pos logit contributions
 chew+0.016
 hop+0.015
 snowball+0.014
 carrot+0.014
 pile+0.014
Neg logit contributions
Lay-0.019
May-0.018
ires-0.018
Mad-0.017
la-0.017
Token,
Token ID11
Feature activation+2.61e-01

Pos logit contributions
 chew+0.088
 puck+0.079
 bisc+0.079
 snowball+0.077
 carrot+0.074
Neg logit contributions
 singer-0.084
spe-0.083
yl-0.079
iring-0.079
ires-0.078
Token two
Token ID734
Feature activation+1.49e+00

Pos logit contributions
 chew+0.008
 hop+0.007
 pile+0.007
 snowball+0.007
 nose+0.007
Neg logit contributions
Lay-0.005
la-0.005
ve-0.004
leaf-0.004
May-0.004
Token,
Token ID11
Feature activation+8.99e-01

Pos logit contributions
 chew+0.036
 snowball+0.032
 carrot+0.031
 bisc+0.031
 spaghetti+0.031
Neg logit contributions
Lay-0.037
spe-0.037
elia-0.036
iring-0.036
ve-0.036
Token three
Token ID1115
Feature activation+6.26e-01

Pos logit contributions
 chew+0.023
 pile+0.020
 snowball+0.020
 hop+0.019
 spaghetti+0.019
Neg logit contributions
Lay-0.023
May-0.021
la-0.021
leaf-0.021
ve-0.021
Token,
Token ID11
Feature activation+1.17e+00

Pos logit contributions
 chew+0.014
 bisc+0.012
 carrot+0.012
 hop+0.012
 spaghetti+0.012
Neg logit contributions
Lay-0.017
ve-0.017
spe-0.017
elia-0.017
la-0.016
Token her
Token ID607
Feature activation+1.08e+00

Pos logit contributions
Lay+0.025
la+0.023
ve+0.023
leaf+0.023
ager+0.023
Neg logit contributions
 chew-0.033
 hop-0.029
 pile-0.028
 spaghetti-0.028
 snowball-0.028
Token strength
Token ID4202
Feature activation-6.39e-01

Pos logit contributions
 chew+0.027
 hop+0.024
 snowball+0.023
 pile+0.023
 bisc+0.023
Neg logit contributions
Lay-0.027
spe-0.026
iring-0.026
ve-0.026
 singer-0.026
Token.
Token ID13
Feature activation+7.46e-02

Pos logit contributions
Lay+0.018
la+0.017
ve+0.016
spe+0.016
ager+0.016
Neg logit contributions
 chew-0.015
 hop-0.013
 pile-0.012
 spaghetti-0.012
 snowball-0.012
Token
Token ID198
Feature activation+9.18e-02

Pos logit contributions
 chew+0.002
 pile+0.001
 hop+0.001
 spaghetti+0.001
 nose+0.001
Neg logit contributions
Lay-0.002
ve-0.002
ager-0.002
leaf-0.002
la-0.002
Token
Token ID198
Feature activation-8.39e-01

Pos logit contributions
 chew+0.002
 hop+0.002
 pile+0.001
 spaghetti+0.001
 jump+0.001
Neg logit contributions
leaf-0.003
Lay-0.003
ager-0.003
 singer-0.003
elia-0.003
TokenOne
Token ID3198
Feature activation+2.54e+00

Pos logit contributions
leaf+0.026
Lay+0.026
la+0.026
 singer+0.026
ve+0.025
Neg logit contributions
 chew-0.015
 hop-0.012
 pile-0.012
 spaghetti-0.012
 jump-0.012
Token day
Token ID1110
Feature activation-4.24e-01

Pos logit contributions
 chew+0.055
 pile+0.047
 nose+0.045
 hop+0.045
 bisc+0.045
Neg logit contributions
Lay-0.071
ager-0.070
 singer-0.070
spe-0.070
elia-0.068
Token,
Token ID11
Feature activation-4.67e-01

Pos logit contributions
Lay+0.016
leaf+0.016
ager+0.015
May+0.015
onda+0.015
Neg logit contributions
 chew-0.005
 hop-0.004
 pile-0.004
 jump-0.004
 spaghetti-0.004
Token Jill
Token ID22772
Feature activation-7.55e-01

Pos logit contributions
Lay+0.016
lor+0.015
leaf+0.015
ind+0.015
ager+0.015
Neg logit contributions
 chew-0.008
 spaghetti-0.007
 pile-0.007
 jump-0.007
 stack-0.006
Token was
Token ID373
Feature activation-8.77e-03

Pos logit contributions
Lay+0.022
leaf+0.021
ires+0.021
iring+0.021
overed+0.021
Neg logit contributions
 chew-0.016
 hop-0.014
 pile-0.014
 spaghetti-0.013
 nose-0.013
Token playing
Token ID2712
Feature activation+6.71e-01

Pos logit contributions
Lay+0.000
ager+0.000
la+0.000
ve+0.000
leaf+0.000
Neg logit contributions
 chew-0.000
 hop-0.000
 pile-0.000
 spaghetti-0.000
 nose-0.000
Token can
Token ID460
Feature activation+3.62e-01

Pos logit contributions
 chew+0.044
 snowball+0.041
 nose+0.040
 spaghetti+0.040
 meat+0.039
Neg logit contributions
Lay-0.043
ve-0.040
ager-0.040
leaf-0.038
gr-0.038
Token use
Token ID779
Feature activation+3.02e-01

Pos logit contributions
 chew+0.006
 hop+0.005
 spaghetti+0.005
 pile+0.005
 snowball+0.005
Neg logit contributions
Lay-0.012
ve-0.011
ager-0.011
la-0.011
leaf-0.011
Token this
Token ID428
Feature activation+1.25e-01

Pos logit contributions
 chew+0.009
 hop+0.008
 pile+0.007
 nose+0.007
 snowball+0.007
Neg logit contributions
Lay-0.007
la-0.006
ve-0.006
leaf-0.006
ager-0.006
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
 chew+0.003
 hop+0.003
 nose+0.002
 jump+0.002
 spaghetti+0.002
Neg logit contributions
Lay-0.003
ager-0.003
 singer-0.003
ve-0.003
spe-0.003
Token It
Token ID632
Feature activation+9.91e-01

Pos logit contributions
Lay+0.003
leaf+0.003
ager+0.003
May+0.003
la+0.003
Neg logit contributions
 chew-0.004
 pile-0.003
 hop-0.003
 spaghetti-0.003
 snowball-0.003
Token's
Token ID338
Feature activation+2.48e+00

Pos logit contributions
 chew+0.020
 pile+0.018
 hop+0.018
 jump+0.016
 nose+0.016
Neg logit contributions
Lay-0.032
Mad-0.031
leaf-0.031
essa-0.031
Kat-0.030
Token like
Token ID588
Feature activation+1.17e+00

Pos logit contributions
 chew+0.069
 hop+0.061
 bisc+0.059
 snowball+0.058
 skate+0.057
Neg logit contributions
Lay-0.050
ires-0.049
elia-0.047
 yourselves-0.047
la-0.047
Token paper
Token ID3348
Feature activation+5.04e-01

Pos logit contributions
 chew+0.029
 hop+0.026
 pile+0.025
 snowball+0.025
 nose+0.024
Neg logit contributions
Lay-0.029
leaf-0.027
la-0.027
ve-0.026
ager-0.026
Token,
Token ID11
Feature activation+5.31e-01

Pos logit contributions
 chew+0.013
 hop+0.012
 snowball+0.011
 pile+0.011
 spaghetti+0.011
Neg logit contributions
Lay-0.012
 singer-0.011
la-0.011
ve-0.011
ires-0.011
Token but
Token ID475
Feature activation+9.61e-01

Pos logit contributions
 chew+0.013
 snowball+0.011
 pile+0.011
 hop+0.011
 bisc+0.011
Neg logit contributions
Lay-0.012
ires-0.012
 singer-0.012
 yourselves-0.011
leaf-0.011
Token bigger
Token ID5749
Feature activation+1.77e-01

Pos logit contributions
 chew+0.032
 snowball+0.029
 hop+0.028
 spaghetti+0.028
 bisc+0.028
Neg logit contributions
Lay-0.016
ires-0.015
iring-0.015
ve-0.015
la-0.014
Token looked
Token ID3114
Feature activation-6.52e-01

Pos logit contributions
 chew+0.024
 hop+0.021
 pile+0.021
 nose+0.020
 spaghetti+0.020
Neg logit contributions
Lay-0.030
ager-0.028
leaf-0.028
ve-0.028
la-0.027
Token inside
Token ID2641
Feature activation-9.61e-01

Pos logit contributions
Lay+0.017
ala+0.017
Mad+0.017
 singer+0.017
onda+0.016
Neg logit contributions
 chew-0.016
 pile-0.015
 spaghetti-0.014
 hop-0.014
 carrot-0.014
Token.
Token ID13
Feature activation+7.44e-01

Pos logit contributions
Lay+0.022
ve+0.021
iring+0.021
la+0.020
overed+0.020
Neg logit contributions
 chew-0.026
 hop-0.023
 snowball-0.023
 bisc-0.022
 pile-0.022
Token It
Token ID632
Feature activation+4.43e-01

Pos logit contributions
 chew+0.018
 hop+0.015
 pile+0.015
 spaghetti+0.015
 nose+0.015
Neg logit contributions
Lay-0.020
la-0.019
overed-0.019
ve-0.019
ager-0.019
Token was
Token ID373
Feature activation+1.94e+00

Pos logit contributions
 chew+0.014
 spaghetti+0.013
 snowball+0.013
 meat+0.012
 hop+0.012
Neg logit contributions
Lay-0.008
la-0.008
ve-0.007
spe-0.007
yl-0.007
Token also
Token ID635
Feature activation+2.69e+00

Pos logit contributions
 chew+0.053
 hop+0.047
 snowball+0.045
 bisc+0.044
 skate+0.043
Neg logit contributions
Lay-0.043
iring-0.041
la-0.040
overed-0.040
spe-0.038
Token empty
Token ID6565
Feature activation+1.21e+00

Pos logit contributions
 chew+0.068
 hop+0.059
 pile+0.057
 spaghetti+0.056
 nose+0.056
Neg logit contributions
Lay-0.070
la-0.065
ve-0.065
ager-0.064
spe-0.064
Token.
Token ID13
Feature activation+9.41e-01

Pos logit contributions
 chew+0.031
 hop+0.027
 snowball+0.026
 nose+0.025
 carrot+0.025
Neg logit contributions
Lay-0.029
la-0.028
iring-0.028
spe-0.028
ve-0.028
Token He
Token ID679
Feature activation+2.28e-02

Pos logit contributions
 chew+0.024
 pile+0.021
 snowball+0.020
 spaghetti+0.020
 hop+0.020
Neg logit contributions
Lay-0.024
la-0.022
iring-0.022
overed-0.022
ires-0.022
Token put
Token ID1234
Feature activation+1.23e+00

Pos logit contributions
 chew+0.001
 hop+0.001
 pile+0.001
 spaghetti+0.001
 snowball+0.000
Neg logit contributions
Lay-0.001
la-0.001
ager-0.001
ve-0.001
spe-0.001
Token it
Token ID340
Feature activation+1.17e-01

Pos logit contributions
 chew+0.035
 hop+0.031
 pile+0.030
 spaghetti+0.029
 snowball+0.029
Neg logit contributions
Lay-0.027
la-0.026
ve-0.025
overed-0.024
leaf-0.024
Token belt
Token ID10999
Feature activation+3.53e-01

Pos logit contributions
Lay+0.017
leaf+0.017
 singer+0.017
ve+0.016
May+0.016
Neg logit contributions
 chew-0.012
 hop-0.010
 pile-0.010
 spaghetti-0.009
 snowball-0.009
Token.
Token ID13
Feature activation-9.89e-01

Pos logit contributions
 chew+0.009
 hop+0.008
 pile+0.007
 snowball+0.007
 spaghetti+0.007
Neg logit contributions
Lay-0.009
la-0.009
ve-0.009
ager-0.008
spe-0.008
Token It
Token ID632
Feature activation+6.12e-02

Pos logit contributions
Lay+0.025
la+0.023
May+0.023
spe+0.023
Ap+0.023
Neg logit contributions
 chew-0.025
 pile-0.022
 hop-0.022
 nose-0.021
 spaghetti-0.021
Token was
Token ID373
Feature activation+1.49e+00

Pos logit contributions
 chew+0.002
 spaghetti+0.001
 hop+0.001
 pile+0.001
 nose+0.001
Neg logit contributions
Lay-0.002
la-0.001
spe-0.001
ager-0.001
ve-0.001
Token a
Token ID257
Feature activation-3.04e-01

Pos logit contributions
 chew+0.040
 hop+0.036
 pile+0.033
 bisc+0.033
 nose+0.031
Neg logit contributions
Lay-0.032
la-0.032
iring-0.031
ires-0.030
overed-0.029
Token bit
Token ID1643
Feature activation+2.84e+00

Pos logit contributions
Lay+0.007
 singer+0.007
May+0.007
ager+0.007
spe+0.007
Neg logit contributions
 chew-0.007
 spaghetti-0.006
 hop-0.006
 nose-0.006
 meat-0.006
Token messy
Token ID24097
Feature activation+7.17e-01

Pos logit contributions
 chew+0.070
 hop+0.062
 nose+0.062
 bisc+0.060
 skate+0.059
Neg logit contributions
Lay-0.068
iring-0.067
ve-0.064
ires-0.064
spe-0.063
Token,
Token ID11
Feature activation+1.22e+00

Pos logit contributions
 chew+0.018
 hop+0.016
 snowball+0.015
 carrot+0.015
 nose+0.015
Neg logit contributions
Lay-0.018
la-0.017
spe-0.017
ager-0.017
 singer-0.017
Token but
Token ID475
Feature activation+2.50e-02

Pos logit contributions
 chew+0.025
 hop+0.020
 pile+0.020
 carrot+0.019
 snowball+0.019
Neg logit contributions
Lay-0.036
iring-0.034
overed-0.034
ires-0.034
Ap-0.034
Token he
Token ID339
Feature activation+1.19e-01

Pos logit contributions
 chew+0.001
 hop+0.001
 pile+0.001
 snowball+0.001
 bisc+0.001
Neg logit contributions
Lay-0.001
la-0.001
spe-0.001
elia-0.001
overed-0.001
Token liked
Token ID8288
Feature activation-1.60e-01

Pos logit contributions
 chew+0.003
 hop+0.003
 pile+0.003
 spaghetti+0.003
 nose+0.003
Neg logit contributions
Lay-0.003
ager-0.003
la-0.003
ve-0.003
leaf-0.003
Token not
Token ID407
Feature activation+1.49e+00

Pos logit contributions
 chew+0.040
 hop+0.035
 pile+0.033
 spaghetti+0.033
 nose+0.033
Neg logit contributions
Lay-0.037
la-0.035
spe-0.034
ve-0.034
iring-0.034
Token good
Token ID922
Feature activation-1.28e+00

Pos logit contributions
 chew+0.041
 hop+0.035
 bisc+0.033
 snowball+0.033
 spaghetti+0.032
Neg logit contributions
Lay-0.034
ve-0.032
la-0.032
iring-0.032
 singer-0.032
Token.
Token ID13
Feature activation+5.81e-02

Pos logit contributions
Lay+0.040
ager+0.037
leaf+0.037
la+0.037
May+0.037
Neg logit contributions
 chew-0.026
 pile-0.022
 spaghetti-0.022
 nose-0.022
 hop-0.022
Token The
Token ID383
Feature activation-2.33e-01

Pos logit contributions
 chew+0.002
 hop+0.001
 pile+0.001
 carrot+0.001
 nose+0.001
Neg logit contributions
Lay-0.001
ires-0.001
iring-0.001
Ap-0.001
May-0.001
Token house
Token ID2156
Feature activation-6.09e-01

Pos logit contributions
Lay+0.007
la+0.007
ager+0.006
ve+0.006
leaf+0.006
Neg logit contributions
 chew-0.005
 hop-0.004
 pile-0.004
 spaghetti-0.004
 nose-0.004
Token is
Token ID318
Feature activation+1.76e+00

Pos logit contributions
Lay+0.016
leaf+0.015
Mad+0.015
ager+0.015
Ag+0.015
Neg logit contributions
 chew-0.015
 pile-0.014
 hop-0.014
 nose-0.013
 jump-0.013
Token old
Token ID1468
Feature activation-4.79e-01

Pos logit contributions
 chew+0.048
 hop+0.043
 bisc+0.040
 carrot+0.040
 spaghetti+0.040
Neg logit contributions
Lay-0.038
spe-0.037
bel-0.037
iring-0.036
la-0.036
Token and
Token ID290
Feature activation+7.38e-01

Pos logit contributions
Lay+0.015
ve+0.014
leaf+0.014
ager+0.014
la+0.014
Neg logit contributions
 chew-0.010
 pile-0.008
 hop-0.008
 spaghetti-0.008
 nose-0.008
Token broken
Token ID5445
Feature activation+5.09e-02

Pos logit contributions
 chew+0.018
 hop+0.016
 spaghetti+0.015
 snowball+0.015
 pile+0.015
Neg logit contributions
Lay-0.019
spe-0.018
bel-0.017
leaf-0.017
la-0.017
Token.
Token ID13
Feature activation+1.82e-01

Pos logit contributions
 chew+0.001
 hop+0.001
 bisc+0.001
 jump+0.001
 carrot+0.001
Neg logit contributions
Lay-0.001
spe-0.001
la-0.001
iring-0.001
ires-0.001
Token The
Token ID383
Feature activation-2.35e-01

Pos logit contributions
 chew+0.005
 pile+0.004
 hop+0.004
 nose+0.004
 carrot+0.004
Neg logit contributions
Lay-0.004
ires-0.004
iring-0.004
Mad-0.004
May-0.004
Token to
Token ID284
Feature activation+1.08e+00

Pos logit contributions
 chew+0.022
 pile+0.019
 hop+0.019
 snowball+0.018
 spaghetti+0.018
Neg logit contributions
Lay-0.021
ager-0.019
la-0.019
ve-0.019
leaf-0.019
Token be
Token ID307
Feature activation+1.90e+00

Pos logit contributions
 spaghetti+0.020
 chew+0.020
 snowball+0.019
 meat+0.018
 bisc+0.018
Neg logit contributions
Lay-0.032
ager-0.029
ala-0.029
 singer-0.029
la-0.029
Token very
Token ID845
Feature activation+5.43e-01

Pos logit contributions
 chew+0.055
 stack+0.047
 pile+0.046
 spaghetti+0.046
 bisc+0.046
Neg logit contributions
Lay-0.041
gr-0.037
Ag-0.037
Mad-0.037
Ap-0.037
Token careful
Token ID8161
Feature activation-4.73e-01

Pos logit contributions
 chew+0.013
 hop+0.011
 pile+0.011
 spaghetti+0.011
 nose+0.011
Neg logit contributions
Lay-0.015
la-0.014
ager-0.014
leaf-0.014
ve-0.014
Token not
Token ID407
Feature activation+9.88e-01

Pos logit contributions
Lay+0.011
spe+0.010
iring+0.010
overed+0.010
ve+0.010
Neg logit contributions
 chew-0.013
 bisc-0.012
 pile-0.011
 snowball-0.011
 carrot-0.011
Token to
Token ID284
Feature activation+1.69e-01

Pos logit contributions
 chew+0.033
 spaghetti+0.030
 pile+0.029
 snowball+0.029
 hop+0.029
Neg logit contributions
Lay-0.017
ager-0.015
ve-0.015
iring-0.015
spe-0.015
Token hurt
Token ID5938
Feature activation+7.97e-01

Pos logit contributions
 spaghetti+0.004
 bisc+0.004
 meat+0.003
 snowball+0.003
 carrot+0.003
Neg logit contributions
Lay-0.004
ala-0.004
 singer-0.004
 yourselves-0.004
guard-0.004
Token anyone
Token ID2687
Feature activation-7.21e-01

Pos logit contributions
 chew+0.022
 pile+0.020
 bisc+0.020
 spaghetti+0.019
 stack+0.019
Neg logit contributions
Lay-0.018
iring-0.017
ala-0.017
la-0.017
 yourselves-0.017
Token.
Token ID13
Feature activation+6.65e-01

Pos logit contributions
Lay+0.016
spe+0.015
iring+0.015
 singer+0.015
la+0.014
Neg logit contributions
 chew-0.019
 bisc-0.017
 spaghetti-0.017
 pile-0.017
 nose-0.017
Token Bob
Token ID5811
Feature activation-8.79e-01

Pos logit contributions
 chew+0.016
 hop+0.014
 pile+0.014
 spaghetti+0.014
 nose+0.014
Neg logit contributions
Lay-0.017
ager-0.016
overed-0.016
spe-0.016
ires-0.016
Token promised
Token ID8072
Feature activation-6.52e-01

Pos logit contributions
Lay+0.021
ager+0.020
la+0.020
ve+0.020
spe+0.020
Neg logit contributions
 chew-0.023
 hop-0.020
 pile-0.020
 spaghetti-0.020
 nose-0.019
Token,
Token ID11
Feature activation-4.72e-01

Pos logit contributions
 chew+0.007
 snowball+0.006
 spaghetti+0.006
 hop+0.006
 carrot+0.006
Neg logit contributions
Lay-0.009
la-0.009
ve-0.009
spe-0.009
overed-0.008
Token she
Token ID673
Feature activation+6.82e-02

Pos logit contributions
Lay+0.012
ager+0.011
la+0.011
ve+0.011
leaf+0.011
Neg logit contributions
 chew-0.012
 hop-0.010
 pile-0.010
 spaghetti-0.010
 nose-0.010
Token wasn
Token ID2492
Feature activation-1.88e+00

Pos logit contributions
 chew+0.002
 hop+0.001
 pile+0.001
 spaghetti+0.001
 nose+0.001
Neg logit contributions
Lay-0.002
ager-0.002
la-0.002
ve-0.002
leaf-0.002
Token't
Token ID470
Feature activation+7.04e-02

Pos logit contributions
Lay+0.037
ager+0.036
spe+0.036
bel+0.035
ires+0.035
Neg logit contributions
 chew-0.058
 hop-0.054
 jump-0.052
 pile-0.050
 snowball-0.050
Token hurt
Token ID5938
Feature activation+1.29e+00

Pos logit contributions
 chew+0.002
 pile+0.002
 hop+0.002
 spaghetti+0.002
 nose+0.002
Neg logit contributions
Lay-0.002
ve-0.002
la-0.001
Mad-0.001
ager-0.001
Token but
Token ID475
Feature activation-3.13e-01

Pos logit contributions
 chew+0.023
 hop+0.020
 jump+0.018
 pile+0.018
 snowball+0.018
Neg logit contributions
leaf-0.042
ager-0.042
Lay-0.041
bel-0.041
ires-0.040
Token she
Token ID673
Feature activation-2.35e-01

Pos logit contributions
Lay+0.007
ager+0.006
ve+0.006
la+0.006
leaf+0.006
Neg logit contributions
 chew-0.009
 hop-0.008
 pile-0.008
 spaghetti-0.008
 nose-0.008
Token was
Token ID373
Feature activation-4.50e-01

Pos logit contributions
Lay+0.005
ager+0.005
la+0.005
ve+0.005
spe+0.005
Neg logit contributions
 chew-0.006
 spaghetti-0.006
 hop-0.006
 pile-0.006
 snowball-0.006
Token a
Token ID257
Feature activation-5.92e-01

Pos logit contributions
Lay+0.011
la+0.010
ager+0.010
ve+0.010
leaf+0.010
Neg logit contributions
 chew-0.012
 hop-0.010
 pile-0.010
 spaghetti-0.010
 nose-0.010
Token bit
Token ID1643
Feature activation+1.82e+00

Pos logit contributions
Lay+0.014
ager+0.013
ve+0.013
spe+0.013
May+0.013
Neg logit contributions
 chew-0.015
 spaghetti-0.014
 hop-0.013
 pile-0.013
 nose-0.013
Token scared
Token ID12008
Feature activation+9.60e-01

Pos logit contributions
 chew+0.048
 hop+0.042
 pile+0.041
 spaghetti+0.041
 nose+0.041
Neg logit contributions
Lay-0.045
ve-0.041
la-0.041
ager-0.041
spe-0.041
Token she
Token ID673
Feature activation-3.07e-01

Pos logit contributions
Lay+0.001
leaf+0.001
ager+0.001
Mad+0.001
ve+0.001
Neg logit contributions
 chew-0.002
 pile-0.002
 hop-0.002
 spaghetti-0.002
 nose-0.002
Token couldn
Token ID3521
Feature activation-1.88e+00

Pos logit contributions
Lay+0.007
ager+0.007
ve+0.007
la+0.007
elia+0.007
Neg logit contributions
 chew-0.008
 hop-0.007
 pile-0.007
 spaghetti-0.007
 snowball-0.007
Token't
Token ID470
Feature activation+4.00e-02

Pos logit contributions
Lay+0.031
ager+0.028
la+0.028
leaf+0.028
spe+0.027
Neg logit contributions
 chew-0.065
 hop-0.059
 pile-0.058
 spaghetti-0.057
 nose-0.056
Token open
Token ID1280
Feature activation-1.26e+00

Pos logit contributions
 snowball+0.001
 spaghetti+0.001
 nose+0.001
 bisc+0.001
 puck+0.001
Neg logit contributions
ve-0.001
ager-0.001
Lay-0.001
otte-0.001
May-0.001
Token it
Token ID340
Feature activation+7.25e-01

Pos logit contributions
Lay+0.033
ager+0.032
leaf+0.031
ve+0.031
spe+0.031
Neg logit contributions
 chew-0.031
 pile-0.027
 hop-0.026
 spaghetti-0.026
 nose-0.026
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
 snowball+0.024
 chew+0.024
 puck+0.023
 hop+0.022
 bisc+0.022
Neg logit contributions
ve-0.012
iring-0.012
la-0.011
Lay-0.011
ires-0.011
Token She
Token ID1375
Feature activation-3.51e-01

Pos logit contributions
 chew+0.003
 hop+0.003
 pile+0.003
 spaghetti+0.003
 nose+0.003
Neg logit contributions
Lay-0.003
ager-0.003
la-0.003
ve-0.003
leaf-0.003
Token tried
Token ID3088
Feature activation+7.92e-01

Pos logit contributions
Lay+0.009
ager+0.008
la+0.008
ve+0.008
leaf+0.008
Neg logit contributions
 chew-0.009
 hop-0.008
 pile-0.008
 spaghetti-0.008
 nose-0.008
Token and
Token ID290
Feature activation-6.43e-01

Pos logit contributions
 chew+0.028
 hop+0.026
 bisc+0.025
 snowball+0.025
 pile+0.025
Neg logit contributions
Lay-0.012
ve-0.011
yl-0.011
la-0.010
May-0.010
Token tried
Token ID3088
Feature activation+1.16e+00

Pos logit contributions
Lay+0.013
ve+0.012
otte+0.012
yl+0.011
Ap+0.011
Neg logit contributions
 snowball-0.019
 chew-0.018
 carrot-0.018
 bisc-0.017
 spaghetti-0.017
Token but
Token ID475
Feature activation-5.97e-02

Pos logit contributions
 chew+0.031
 bisc+0.029
 hop+0.028
 snowball+0.028
 carrot+0.027
Neg logit contributions
Lay-0.027
ve-0.025
yl-0.024
la-0.023
iring-0.023
Token to
Token ID284
Feature activation-2.51e-01

Pos logit contributions
Lay+0.036
ager+0.033
leaf+0.033
ve+0.033
la+0.033
Neg logit contributions
 chew-0.030
 hop-0.026
 pile-0.026
 spaghetti-0.025
 nose-0.025
Token get
Token ID651
Feature activation-1.95e-01

Pos logit contributions
Lay+0.007
Mad+0.006
leaf+0.006
 singer+0.006
 yourselves+0.006
Neg logit contributions
 chew-0.005
 spaghetti-0.005
 snowball-0.005
 nose-0.005
 bisc-0.005
Token to
Token ID284
Feature activation-4.69e-01

Pos logit contributions
Lay+0.006
ager+0.005
la+0.005
ve+0.005
overed+0.005
Neg logit contributions
 chew-0.004
 hop-0.004
 spaghetti-0.004
 pile-0.004
 nose-0.004
Token their
Token ID511
Feature activation-4.95e-01

Pos logit contributions
Lay+0.010
Mad+0.009
leaf+0.009
 singer+0.009
 yourselves+0.009
Neg logit contributions
 chew-0.012
 snowball-0.011
 nose-0.011
 pile-0.011
 bisc-0.011
Token homes
Token ID5682
Feature activation-1.99e+00

Pos logit contributions
Lay+0.012
elia+0.012
iring+0.012
 singer+0.012
spe+0.012
Neg logit contributions
 chew-0.012
 hop-0.011
 spaghetti-0.010
 snowball-0.010
 bisc-0.010
Token.
Token ID13
Feature activation+2.89e-01

Pos logit contributions
Lay+0.042
spe+0.042
bel+0.041
iring+0.041
la+0.041
Neg logit contributions
 chew-0.056
 hop-0.049
 snowball-0.049
 carrot-0.047
 spaghetti-0.047
Token They
Token ID1119
Feature activation+3.13e-01

Pos logit contributions
 chew+0.007
 hop+0.006
 pile+0.006
 spaghetti+0.006
 nose+0.006
Neg logit contributions
Lay-0.008
ager-0.007
la-0.007
ve-0.007
leaf-0.007
Token were
Token ID547
Feature activation-4.23e-01

Pos logit contributions
 chew+0.009
 snowball+0.008
 spaghetti+0.008
 bisc+0.008
 puck+0.007
Neg logit contributions
Lay-0.007
la-0.007
ve-0.007
spe-0.006
ager-0.006
Token so
Token ID523
Feature activation-6.22e-01

Pos logit contributions
Lay+0.010
ve+0.010
iring+0.010
la+0.010
elia+0.009
Neg logit contributions
 chew-0.011
 bisc-0.010
 hop-0.009
 pile-0.009
 spaghetti-0.009
Token busy
Token ID8179
Feature activation-6.76e-01

Pos logit contributions
Lay+0.018
la+0.017
ager+0.017
leaf+0.017
May+0.017
Neg logit contributions
 chew-0.014
 hop-0.012
 pile-0.011
 jump-0.011
 spaghetti-0.011
Token that
Token ID326
Feature activation+7.77e-01

Pos logit contributions
Lay+0.020
la+0.019
ager+0.019
leaf+0.019
ve+0.019
Neg logit contributions
 chew-0.015
 hop-0.012
 pile-0.012
 spaghetti-0.012
 nose-0.012
Token,
Token ID11
Feature activation+1.49e+00

Pos logit contributions
 chew+0.009
 hop+0.008
 nose+0.007
 spaghetti+0.007
 pile+0.006
Neg logit contributions
Lay-0.029
leaf-0.028
iring-0.028
ires-0.028
ager-0.028
Token "
Token ID366
Feature activation+2.09e-01

Pos logit contributions
 chew+0.034
 hop+0.030
 pile+0.029
 spaghetti+0.029
 nose+0.028
Neg logit contributions
Lay-0.042
ager-0.039
la-0.039
ve-0.039
leaf-0.038
TokenThat
Token ID2504
Feature activation-9.41e-01

Pos logit contributions
 chew+0.006
 hop+0.005
 spaghetti+0.005
 snowball+0.005
 bisc+0.005
Neg logit contributions
ve-0.004
overed-0.004
ires-0.004
iring-0.004
Lay-0.004
Token's
Token ID338
Feature activation+2.07e+00

Pos logit contributions
Lay+0.026
ager+0.025
la+0.025
leaf+0.024
 singer+0.024
Neg logit contributions
 chew-0.021
 hop-0.018
 pile-0.018
 spaghetti-0.017
 snowball-0.017
Token a
Token ID257
Feature activation-3.69e-01

Pos logit contributions
 chew+0.049
 hop+0.042
 pile+0.041
 nose+0.040
 snowball+0.040
Neg logit contributions
Lay-0.057
ager-0.053
la-0.053
ve-0.053
iring-0.052
Token great
Token ID1049
Feature activation-8.54e-01

Pos logit contributions
ager+0.008
Lay+0.008
May+0.008
 singer+0.008
 talented+0.007
Neg logit contributions
 chew-0.010
 hop-0.008
 nose-0.008
 spaghetti-0.008
 pile-0.008
Token idea
Token ID2126
Feature activation-6.71e-01

Pos logit contributions
ager+0.026
 singer+0.025
Lay+0.025
spe+0.024
leaf+0.024
Neg logit contributions
 chew-0.016
 hop-0.013
 spaghetti-0.013
 pile-0.013
 nose-0.012
Token,
Token ID11
Feature activation+2.19e-01

Pos logit contributions
Lay+0.023
Mad+0.021
Ap+0.021
iring+0.021
ires+0.021
Neg logit contributions
 chew-0.012
 pile-0.010
 nose-0.010
 hop-0.010
 spaghetti-0.010
Token Lily
Token ID20037
Feature activation+1.12e+00

Pos logit contributions
 chew+0.005
 hop+0.005
 pile+0.004
 spaghetti+0.004
 nose+0.004
Neg logit contributions
Lay-0.006
ve-0.006
ager-0.006
leaf-0.006
la-0.006
Token!
Token ID0
Feature activation+3.95e-01

Pos logit contributions
 chew+0.023
 pile+0.019
 hop+0.019
 nose+0.019
 spaghetti+0.019
Neg logit contributions
Lay-0.035
leaf-0.032
Mad-0.032
iring-0.032
overed-0.032
Token Let
Token ID3914
Feature activation+1.76e+00

Pos logit contributions
 chew+0.010
 hop+0.009
 spaghetti+0.008
 jump+0.008
 pile+0.008
Neg logit contributions
ve-0.010
Lay-0.010
bel-0.010
onda-0.009
iring-0.009
Token way
Token ID835
Feature activation-5.46e-01

Pos logit contributions
Lay+0.005
ager+0.005
la+0.005
ve+0.005
iring+0.005
Neg logit contributions
 chew-0.003
 hop-0.002
 spaghetti-0.002
 pile-0.002
 snowball-0.002
Token home
Token ID1363
Feature activation-9.29e-01

Pos logit contributions
Lay+0.015
ager+0.015
leaf+0.014
spe+0.014
ve+0.014
Neg logit contributions
 chew-0.013
 hop-0.012
 pile-0.011
 nose-0.011
 jump-0.011
Token,
Token ID11
Feature activation+3.07e-01

Pos logit contributions
Lay+0.020
iring+0.019
bel+0.019
la+0.019
spe+0.018
Neg logit contributions
 chew-0.027
 snowball-0.024
 spaghetti-0.023
 puck-0.023
 carrot-0.023
Token but
Token ID475
Feature activation-1.75e-01

Pos logit contributions
 chew+0.008
 snowball+0.007
 bisc+0.007
 pile+0.007
 puck+0.007
Neg logit contributions
Lay-0.007
la-0.007
but-0.007
iring-0.007
May-0.007
Token he
Token ID339
Feature activation+1.57e-01

Pos logit contributions
leaf+0.005
ager+0.004
Lay+0.004
spe+0.004
ve+0.004
Neg logit contributions
 chew-0.005
 hop-0.004
 carrot-0.004
 nose-0.004
 pile-0.004
Token didn
Token ID1422
Feature activation-1.91e+00

Pos logit contributions
 chew+0.004
 spaghetti+0.004
 pile+0.004
 snowball+0.004
 hop+0.004
Neg logit contributions
Lay-0.004
ve-0.003
ager-0.003
la-0.003
elia-0.003
Token't
Token ID470
Feature activation-2.82e-02

Pos logit contributions
Lay+0.034
ager+0.032
la+0.032
spe+0.032
leaf+0.032
Neg logit contributions
 chew-0.064
 hop-0.058
 jump-0.055
 pile-0.055
 spaghetti-0.055
Token know
Token ID760
Feature activation-6.79e-01

Pos logit contributions
Lay+0.001
ager+0.001
ve+0.001
leaf+0.001
la+0.001
Neg logit contributions
 chew-0.001
 spaghetti-0.000
 snowball-0.000
 pile-0.000
 nose-0.000
Token where
Token ID810
Feature activation-6.10e-01

Pos logit contributions
Lay+0.013
la+0.012
iring+0.012
May+0.011
ires+0.011
Neg logit contributions
 chew-0.021
 hop-0.019
 pile-0.019
 nose-0.019
 spaghetti-0.019
Token Pink
Token ID14657
Feature activation-8.73e-02

Pos logit contributions
ager+0.016
Lay+0.015
ve+0.015
leaf+0.015
iring+0.015
Neg logit contributions
 chew-0.016
 hop-0.015
 jump-0.013
 snack-0.013
 spaghetti-0.013
Token lived
Token ID5615
Feature activation+2.64e-01

Pos logit contributions
Lay+0.002
la+0.002
ve+0.002
ager+0.002
iring+0.002
Neg logit contributions
 chew-0.002
 hop-0.002
 pile-0.002
 spaghetti-0.002
 snowball-0.002
Token pepper
Token ID13385
Feature activation+9.25e-01

Pos logit contributions
 chew+0.016
 afloat+0.015
 Samuel+0.014
 puff+0.013
 snowball+0.013
Neg logit contributions
 singer-0.016
 angel-0.015
rot-0.014
ager-0.014
ala-0.014
Token.
Token ID13
Feature activation+8.14e-01

Pos logit contributions
 chew+0.028
 snowball+0.025
 hop+0.024
 bisc+0.023
 tumble+0.023
Neg logit contributions
ager-0.017
 singer-0.016
iring-0.016
ber-0.016
ala-0.016
Token Sam
Token ID3409
Feature activation-1.37e-01

Pos logit contributions
 chew+0.020
 afloat+0.018
 carrot+0.018
 snowball+0.018
 hop+0.017
Neg logit contributions
ires-0.018
overed-0.017
Lay-0.017
iring-0.017
Ag-0.016
Token said
Token ID531
Feature activation-5.13e-01

Pos logit contributions
Lay+0.003
leaf+0.003
ve+0.003
la+0.003
iring+0.003
Neg logit contributions
 chew-0.004
 hop-0.003
 pile-0.003
 spaghetti-0.003
 nose-0.003
Token,
Token ID11
Feature activation+8.70e-02

Pos logit contributions
Lay+0.017
la+0.016
ager+0.016
ve+0.016
leaf+0.016
Neg logit contributions
 chew-0.009
 hop-0.007
 pile-0.007
 spaghetti-0.007
 nose-0.007
Token "
Token ID366
Feature activation-8.71e-01

Pos logit contributions
 chew+0.002
 pile+0.002
 snowball+0.002
 hop+0.002
 bisc+0.002
Neg logit contributions
Lay-0.002
overed-0.002
ve-0.002
Ap-0.002
May-0.002
TokenHi
Token ID17250
Feature activation+3.07e-01

Pos logit contributions
Lay+0.013
ager+0.011
Ap+0.011
Ag+0.011
leaf+0.011
Neg logit contributions
 chew-0.030
 pile-0.029
 nose-0.028
 bed-0.028
 hop-0.028
Token Tim
Token ID5045
Feature activation+1.63e+00

Pos logit contributions
 chew+0.008
 itch+0.007
 gag+0.007
 bisc+0.006
 meat+0.006
Neg logit contributions
yl-0.007
iring-0.007
elia-0.006
Lay-0.006
ires-0.006
Token!
Token ID0
Feature activation-7.73e-02

Pos logit contributions
 chew+0.044
 hop+0.038
 puck+0.037
 spaghetti+0.036
 skate+0.035
Neg logit contributions
ager-0.036
elia-0.035
Lay-0.035
ve-0.035
han-0.034
Token Do
Token ID2141
Feature activation+3.82e-01

Pos logit contributions
Lay+0.002
leaf+0.001
ager+0.001
spe+0.001
May+0.001
Neg logit contributions
 chew-0.002
 pile-0.002
 hop-0.002
 spaghetti-0.002
 snowball-0.002
Token you
Token ID345
Feature activation+5.85e-01

Pos logit contributions
 chew+0.013
 hop+0.012
 jump+0.012
 spaghetti+0.011
 nose+0.011
Neg logit contributions
Lay-0.007
spe-0.007
la-0.007
overed-0.007
leaf-0.007
Token head
Token ID1182
Feature activation+6.00e-02

Pos logit contributions
 chew+0.001
 hop+0.001
 pile+0.001
 snowball+0.001
 spaghetti+0.001
Neg logit contributions
Lay-0.003
ve-0.003
ager-0.003
la-0.003
iring-0.003
Token on
Token ID319
Feature activation-3.11e-02

Pos logit contributions
 chew+0.002
 pile+0.002
 snowball+0.002
 spaghetti+0.002
 meat+0.002
Neg logit contributions
Lay-0.001
ve-0.001
la-0.001
overed-0.001
bel-0.001
Token her
Token ID607
Feature activation+1.07e+00

Pos logit contributions
Lay+0.001
May+0.001
la+0.001
 singer+0.001
elia+0.001
Neg logit contributions
 chew-0.001
 puck-0.001
 hop-0.001
 pile-0.001
 bisc-0.001
Token pillow
Token ID28774
Feature activation+2.95e-01

Pos logit contributions
 chew+0.019
 hop+0.015
 snowball+0.015
 pile+0.014
 spaghetti+0.014
Neg logit contributions
Lay-0.031
ager-0.031
ve-0.031
bel-0.031
 singer-0.031
Token and
Token ID290
Feature activation+3.51e-01

Pos logit contributions
 chew+0.008
 snowball+0.007
 hop+0.006
 puck+0.006
 bisc+0.006
Neg logit contributions
Lay-0.007
la-0.007
overed-0.007
ve-0.007
ager-0.007
Token soon
Token ID2582
Feature activation-8.62e-01

Pos logit contributions
 chew+0.008
 pile+0.007
 hop+0.007
 snowball+0.007
 spaghetti+0.007
Neg logit contributions
Lay-0.009
la-0.009
ager-0.009
ve-0.009
May-0.008
Token fell
Token ID3214
Feature activation+1.28e+00

Pos logit contributions
Lay+0.021
leaf+0.020
ager+0.019
ve+0.019
ires+0.019
Neg logit contributions
 chew-0.023
 hop-0.021
 pile-0.020
 spaghetti-0.020
 nose-0.020
Token asleep
Token ID16039
Feature activation-1.11e+00

Pos logit contributions
 chew+0.039
 hop+0.035
 pile+0.034
 spaghetti+0.034
 nose+0.033
Neg logit contributions
Lay-0.026
la-0.024
ve-0.024
ager-0.023
overed-0.023
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
Lay+0.031
ager+0.029
leaf+0.029
la+0.029
ve+0.029
Neg logit contributions
 chew-0.026
 hop-0.022
 pile-0.022
 spaghetti-0.021
 nose-0.021
Token<|endoftext|>
Token ID50256
Feature activation-1.20e-01

Pos logit contributions
 chew+0.003
 hop+0.002
 nose+0.002
 pile+0.002
 spaghetti+0.002
Neg logit contributions
Lay-0.003
ager-0.003
ve-0.003
leaf-0.003
la-0.003
TokenOnce
Token ID7454
Feature activation-1.61e+00

Pos logit contributions
Lay+0.002
la+0.002
ve+0.002
ager+0.002
leaf+0.002
Neg logit contributions
 chew-0.004
 hop-0.003
 pile-0.003
 spaghetti-0.003
 nose-0.003
Token he
Token ID339
Feature activation-2.30e-01

Pos logit contributions
 chew+0.019
 bisc+0.018
 snowball+0.018
 pile+0.017
 nose+0.017
Neg logit contributions
Lay-0.012
spe-0.012
Ap-0.011
 singer-0.011
iring-0.010
Token can
Token ID460
Feature activation-1.22e+00

Pos logit contributions
Lay+0.007
overed+0.006
Mad+0.006
Kay+0.006
ala+0.006
Neg logit contributions
 chew-0.005
 hop-0.004
 spaghetti-0.004
 pile-0.004
 nose-0.004
Token still
Token ID991
Feature activation+5.72e-01

Pos logit contributions
Lay+0.035
la+0.033
ager+0.033
leaf+0.033
spe+0.032
Neg logit contributions
 chew-0.028
 hop-0.024
 pile-0.023
 jump-0.023
 nose-0.022
Token make
Token ID787
Feature activation+8.65e-01

Pos logit contributions
 chew+0.009
 spaghetti+0.008
 hop+0.008
 snowball+0.008
 pile+0.008
Neg logit contributions
Lay-0.019
ager-0.018
ve-0.018
leaf-0.018
 singer-0.018
Token a
Token ID257
Feature activation-8.30e-01

Pos logit contributions
 chew+0.023
 hop+0.020
 pile+0.020
 nose+0.020
 bisc+0.019
Neg logit contributions
Lay-0.021
iring-0.019
ires-0.019
ve-0.019
la-0.019
Token new
Token ID649
Feature activation-1.55e+00

Pos logit contributions
Lay+0.023
ager+0.022
ve+0.022
la+0.021
leaf+0.021
Neg logit contributions
 chew-0.019
 hop-0.017
 pile-0.016
 spaghetti-0.016
 nose-0.016
Token picture
Token ID4286
Feature activation-7.57e-01

Pos logit contributions
Lay+0.040
 singer+0.038
ager+0.038
ve+0.037
spe+0.037
Neg logit contributions
 chew-0.038
 hop-0.033
 snowball-0.031
 spaghetti-0.031
 pile-0.031
Token.
Token ID13
Feature activation+9.41e-01

Pos logit contributions
Lay+0.017
 singer+0.017
bel+0.017
ager+0.016
spe+0.016
Neg logit contributions
 chew-0.020
 hop-0.017
 pile-0.017
 snowball-0.017
 nose-0.017
Token Tim
Token ID5045
Feature activation+1.46e-01

Pos logit contributions
 chew+0.023
 hop+0.021
 snowball+0.020
 carrot+0.020
 nose+0.020
Neg logit contributions
Lay-0.024
overed-0.022
ires-0.022
Ap-0.022
spe-0.022
Tokenmy
Token ID1820
Feature activation-1.85e-01

Pos logit contributions
 chew+0.003
 hop+0.002
 bisc+0.002
 spaghetti+0.002
 pile+0.002
Neg logit contributions
ager-0.005
Lay-0.005
elia-0.005
la-0.005
ve-0.004
Token felt
Token ID2936
Feature activation-6.87e-01

Pos logit contributions
Lay+0.005
leaf+0.005
la+0.005
iring+0.005
ve+0.005
Neg logit contributions
 chew-0.004
 hop-0.004
 pile-0.004
 nose-0.004
 spaghetti-0.004
Token so
Token ID523
Feature activation-6.32e-01

Pos logit contributions
 chew+0.028
 hop+0.024
 pile+0.024
 snowball+0.023
 spaghetti+0.023
Neg logit contributions
Lay-0.034
spe-0.031
leaf-0.031
la-0.031
ager-0.031
Token he
Token ID339
Feature activation-4.20e-01

Pos logit contributions
Lay+0.014
ager+0.013
leaf+0.013
ve+0.013
la+0.013
Neg logit contributions
 chew-0.018
 hop-0.016
 pile-0.016
 spaghetti-0.016
 nose-0.016
Token promised
Token ID8072
Feature activation-1.35e+00

Pos logit contributions
Lay+0.012
ires+0.012
iring+0.012
leaf+0.011
Mad+0.011
Neg logit contributions
 chew-0.010
 hop-0.009
 pile-0.008
 jump-0.008
 nose-0.008
Token to
Token ID284
Feature activation-7.74e-01

Pos logit contributions
Lay+0.035
ires+0.035
leaf+0.035
endant+0.034
spe+0.034
Neg logit contributions
 chew-0.033
 hop-0.028
 jump-0.027
 nose-0.027
 snack-0.026
Token be
Token ID307
Feature activation+1.14e+00

Pos logit contributions
Lay+0.024
ager+0.022
la+0.022
ve+0.022
leaf+0.022
Neg logit contributions
 chew-0.016
 hop-0.013
 pile-0.013
 spaghetti-0.013
 nose-0.013
Token Beat
Token ID12568
Feature activation-4.96e+00

Pos logit contributions
 chew+0.024
 hop+0.021
 pile+0.020
 jump+0.019
 spaghetti+0.019
Neg logit contributions
Lay-0.035
ve-0.034
overed-0.033
la-0.033
leaf-0.033
Tokenrice
Token ID20970
Feature activation-7.05e-01

Pos logit contributions
ires+0.086
iring+0.079
Ap+0.069
Lay+0.069
 yourselves+0.066
Neg logit contributions
 chew-0.174
 hop-0.166
 jump-0.160
 nose-0.158
 fetch-0.156
Token's
Token ID338
Feature activation+7.73e-01

Pos logit contributions
Lay+0.017
spe+0.016
la+0.016
ve+0.016
ager+0.016
Neg logit contributions
 chew-0.018
 snowball-0.016
 pile-0.016
 hop-0.016
 spaghetti-0.015
Token friend
Token ID1545
Feature activation+5.30e-01

Pos logit contributions
 chew+0.020
 snowball+0.019
 spaghetti+0.019
 bisc+0.018
 puck+0.018
Neg logit contributions
Lay-0.017
spe-0.017
 singer-0.016
ager-0.016
elia-0.016
Token.
Token ID13
Feature activation+8.66e-02

Pos logit contributions
 chew+0.014
 snowball+0.012
 pile+0.012
 hop+0.012
 spaghetti+0.012
Neg logit contributions
Lay-0.013
la-0.013
May-0.012
ager-0.012
spe-0.012
Token From
Token ID3574
Feature activation-1.53e+00

Pos logit contributions
 chew+0.002
 hop+0.002
 pile+0.002
 bone+0.001
 nose+0.001
Neg logit contributions
ager-0.002
Lay-0.002
leaf-0.002
ve-0.002
iring-0.002
Token agreed
Token ID4987
Feature activation-3.93e-01

Pos logit contributions
 chew+0.028
 hop+0.025
 pile+0.024
 snowball+0.024
 bisc+0.024
Neg logit contributions
Lay-0.024
spe-0.023
la-0.023
Ap-0.023
ager-0.023
Token to
Token ID284
Feature activation-7.25e-01

Pos logit contributions
Lay+0.008
ager+0.007
ve+0.007
overed+0.007
la+0.007
Neg logit contributions
 chew-0.012
 hop-0.011
 pile-0.011
 snowball-0.011
 spaghetti-0.010
Token take
Token ID1011
Feature activation+6.72e-01

Pos logit contributions
Lay+0.023
ager+0.022
la+0.021
leaf+0.021
 singer+0.021
Neg logit contributions
 chew-0.013
 spaghetti-0.011
 snowball-0.011
 pile-0.011
 hop-0.011
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
 chew+0.017
 pile+0.015
 hop+0.015
 spaghetti+0.014
 nose+0.014
Neg logit contributions
Lay-0.017
ager-0.016
leaf-0.016
la-0.016
Mad-0.016
Token aer
Token ID9551
Feature activation-5.83e-01

Pos logit contributions
Lay+0.029
ager+0.026
la+0.026
overed+0.026
leaf+0.026
Neg logit contributions
 chew-0.034
 hop-0.030
 pile-0.030
 nose-0.029
 jump-0.029
Tokenopl
Token ID20106
Feature activation-3.84e+00

Pos logit contributions
Lay+0.018
la+0.016
leaf+0.016
ve+0.016
ager+0.016
Neg logit contributions
 chew-0.012
 hop-0.010
 pile-0.010
 spaghetti-0.010
 nose-0.010
Tokenane
Token ID1531
Feature activation-1.40e+00

Pos logit contributions
elia+0.084
Lay+0.084
Els+0.080
lves+0.079
ager+0.078
Neg logit contributions
 chew-0.088
 skate-0.082
 jump-0.082
 hop-0.081
 puff-0.080
Token on
Token ID319
Feature activation-5.21e-01

Pos logit contributions
Lay+0.035
ager+0.032
ve+0.032
la+0.032
leaf+0.032
Neg logit contributions
 chew-0.037
 hop-0.032
 pile-0.032
 spaghetti-0.031
 nose-0.031
Token a
Token ID257
Feature activation-1.11e+00

Pos logit contributions
Lay+0.013
la+0.012
May+0.011
Mad+0.011
Ap+0.011
Neg logit contributions
 chew-0.014
 hop-0.012
 pile-0.012
 bisc-0.012
 nose-0.012
Token journey
Token ID7002
Feature activation-1.71e+00

Pos logit contributions
Lay+0.028
May+0.026
ager+0.026
spe+0.026
Mad+0.026
Neg logit contributions
 chew-0.026
 spaghetti-0.024
 bisc-0.024
 nose-0.022
 hop-0.022
Token.
Token ID13
Feature activation+9.35e-01

Pos logit contributions
Lay+0.047
ager+0.044
la+0.044
ve+0.044
leaf+0.044
Neg logit contributions
 chew-0.040
 hop-0.034
 pile-0.034
 spaghetti-0.033
 nose-0.033
Token Lily
Token ID20037
Feature activation+4.33e-02

Pos logit contributions
Lay+0.025
May+0.023
la+0.022
Mad+0.022
spe+0.022
Neg logit contributions
 chew-0.022
 pile-0.020
 hop-0.019
 nose-0.019
 spaghetti-0.019
Token loved
Token ID6151
Feature activation-2.85e-01

Pos logit contributions
 chew+0.001
 hop+0.001
 spaghetti+0.001
 pile+0.001
 snowball+0.001
Neg logit contributions
Lay-0.001
la-0.001
ager-0.001
ve-0.001
May-0.001
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
Lay+0.006
la+0.005
leaf+0.005
ager+0.005
ve+0.005
Neg logit contributions
 chew-0.009
 hop-0.008
 pile-0.008
 nose-0.007
 spaghetti-0.007
Token big
Token ID1263
Feature activation+4.57e-03

Pos logit contributions
Lay+0.028
 singer+0.027
leaf+0.026
la+0.026
ager+0.026
Neg logit contributions
 chew-0.024
 hop-0.020
 pile-0.019
 nose-0.019
 spaghetti-0.019
Token Fer
Token ID12880
Feature activation-3.89e+00

Pos logit contributions
 chew+0.000
 afloat+0.000
 reel+0.000
 hop+0.000
 bisc+0.000
Neg logit contributions
 singer-0.000
 angel-0.000
yl-0.000
elia-0.000
 singers-0.000
Tokenris
Token ID2442
Feature activation-4.60e+00

Pos logit contributions
Lay+0.085
leaf+0.076
Ap+0.075
overed+0.074
ve+0.074
Neg logit contributions
 chew-0.113
 pile-0.103
 snowball-0.099
 stack-0.099
 hop-0.098
Token wheel
Token ID7825
Feature activation-8.26e-01

Pos logit contributions
Lay+0.147
iring+0.140
leaf+0.140
ires+0.136
Ag+0.135
Neg logit contributions
 chew-0.088
 pile-0.077
 nose-0.073
 stack-0.072
 hop-0.070
Token the
Token ID262
Feature activation-8.65e-01

Pos logit contributions
Lay+0.022
la+0.021
ager+0.020
ve+0.020
spe+0.020
Neg logit contributions
 chew-0.020
 hop-0.017
 pile-0.017
 spaghetti-0.017
 snowball-0.017
Token most
Token ID749
Feature activation+2.76e-01

Pos logit contributions
Lay+0.023
la+0.022
ve+0.022
spe+0.022
ager+0.022
Neg logit contributions
 chew-0.021
 hop-0.018
 spaghetti-0.017
 pile-0.017
 nose-0.017
Token.
Token ID13
Feature activation-8.78e-02

Pos logit contributions
 chew+0.007
 pile+0.006
 hop+0.006
 spaghetti+0.006
 snowball+0.006
Neg logit contributions
Lay-0.008
la-0.007
overed-0.007
leaf-0.007
ve-0.007
Token She
Token ID1375
Feature activation-4.95e-01

Pos logit contributions
Lay+0.002
la+0.002
May+0.002
Ap+0.002
Mad+0.002
Neg logit contributions
 chew-0.002
 hop-0.002
 pile-0.002
 nose-0.002
 spaghetti-0.002
Token,
Token ID11
Feature activation+8.41e-01

Pos logit contributions
 chew+0.009
 hop+0.007
 pile+0.007
 spaghetti+0.007
 nose+0.007
Neg logit contributions
Lay-0.011
ager-0.010
ve-0.010
la-0.010
leaf-0.010
Token gems
Token ID23685
Feature activation-3.98e-01

Pos logit contributions
 chew+0.020
 hop+0.018
 snowball+0.017
 pile+0.017
 nose+0.017
Neg logit contributions
Lay-0.021
leaf-0.020
spe-0.019
ve-0.019
iring-0.019
Token and
Token ID290
Feature activation+1.12e+00

Pos logit contributions
Lay+0.013
Ap+0.012
Mad+0.012
May+0.012
ager+0.012
Neg logit contributions
 chew-0.007
 pile-0.006
 hop-0.006
 nose-0.006
 carrot-0.006
Token even
Token ID772
Feature activation+7.06e-01

Pos logit contributions
 chew+0.033
 hop+0.029
 snowball+0.029
 pile+0.028
 spaghetti+0.028
Neg logit contributions
Lay-0.023
leaf-0.021
spe-0.021
la-0.021
ve-0.020
Token golden
Token ID10861
Feature activation-2.78e-01

Pos logit contributions
 chew+0.017
 hop+0.015
 pile+0.015
 spaghetti+0.014
 snowball+0.014
Neg logit contributions
Lay-0.019
la-0.017
leaf-0.017
ager-0.017
ve-0.017
Token jew
Token ID12711
Feature activation-4.79e+00

Pos logit contributions
Lay+0.006
Ap+0.006
otte+0.005
Mad+0.005
ager+0.005
Neg logit contributions
 chew-0.008
 pile-0.008
 nose-0.007
 hop-0.007
 carrot-0.007
Tokenellery
Token ID41800
Feature activation-1.46e+00

Pos logit contributions
Ap+0.120
leaf+0.116
Lay+0.115
 singer+0.111
spe+0.111
Neg logit contributions
 chew-0.116
 pile-0.102
 nose-0.101
 carrot-0.098
 spaghetti-0.096
Token.
Token ID13
Feature activation+3.53e-01

Pos logit contributions
Lay+0.050
Ap+0.049
Mad+0.049
otte+0.048
leaf+0.046
Neg logit contributions
 chew-0.023
 pile-0.023
 nose-0.021
 carrot-0.021
 bone-0.020
Token Molly
Token ID30236
Feature activation-7.49e-01

Pos logit contributions
 chew+0.009
 hop+0.008
 pile+0.008
 nose+0.008
 spaghetti+0.008
Neg logit contributions
Lay-0.009
la-0.008
May-0.008
leaf-0.008
spe-0.008
Token was
Token ID373
Feature activation+2.13e-01

Pos logit contributions
Lay+0.021
iring+0.020
leaf+0.020
ires+0.020
Mad+0.020
Neg logit contributions
 chew-0.017
 hop-0.015
 pile-0.015
 nose-0.014
 jump-0.014
Token so
Token ID523
Feature activation-1.63e-01

Pos logit contributions
 chew+0.004
 hop+0.004
 pile+0.004
 nose+0.004
 jump+0.003
Neg logit contributions
Lay-0.007
ager-0.006
ve-0.006
leaf-0.006
ires-0.006
Token trunk
Token ID21427
Feature activation-5.31e-01

Pos logit contributions
 chew+0.008
 hop+0.006
 spaghetti+0.006
 pile+0.006
 nose+0.006
Neg logit contributions
Lay-0.014
ager-0.014
ve-0.014
la-0.013
leaf-0.013
Token of
Token ID286
Feature activation+1.04e+00

Pos logit contributions
Lay+0.013
iring+0.012
ager+0.012
overed+0.012
spe+0.012
Neg logit contributions
 chew-0.014
 hop-0.012
 spaghetti-0.012
 snowball-0.011
 bisc-0.011
Token the
Token ID262
Feature activation+7.79e-01

Pos logit contributions
 chew+0.031
 bisc+0.030
iver+0.029
 spaghetti+0.028
 hop+0.028
Neg logit contributions
Lay-0.017
yl-0.015
 yourselves-0.015
elia-0.014
Ab-0.014
Token tree
Token ID5509
Feature activation-3.69e-01

Pos logit contributions
 chew+0.016
 spaghetti+0.014
 hop+0.014
 bisc+0.013
 nose+0.013
Neg logit contributions
Lay-0.022
ager-0.021
ve-0.021
elia-0.021
iring-0.021
Token and
Token ID290
Feature activation-7.92e-02

Pos logit contributions
overed+0.008
Lay+0.007
ager+0.007
bel+0.007
spe+0.007
Neg logit contributions
 chew-0.010
 snowball-0.009
 spaghetti-0.009
 bisc-0.009
 puck-0.009
Token sc
Token ID629
Feature activation-4.12e+00

Pos logit contributions
Lay+0.002
ager+0.002
la+0.002
ve+0.002
leaf+0.002
Neg logit contributions
 chew-0.002
 hop-0.002
 pile-0.002
 spaghetti-0.002
 nose-0.002
Tokenur
Token ID333
Feature activation-2.35e+00

Pos logit contributions
Lay+0.074
May+0.070
 singer+0.069
Ap+0.068
Mad+0.067
Neg logit contributions
 chew-0.138
 hop-0.126
 jump-0.122
 pile-0.122
 nose-0.119
Tokenried
Token ID2228
Feature activation-1.16e-01

Pos logit contributions
Lay+0.048
Mad+0.044
Ap+0.043
leaf+0.043
May+0.043
Neg logit contributions
 chew-0.071
 hop-0.066
 pile-0.065
 jump-0.063
 nose-0.063
Token along
Token ID1863
Feature activation-1.37e+00

Pos logit contributions
Lay+0.003
onda+0.003
spe+0.003
Mad+0.003
 singer+0.003
Neg logit contributions
 chew-0.003
 hop-0.002
 jump-0.002
 pile-0.002
 nose-0.002
Token the
Token ID262
Feature activation-3.83e-02

Pos logit contributions
Lay+0.034
ager+0.031
ve+0.031
overed+0.031
spe+0.031
Neg logit contributions
 chew-0.035
 pile-0.031
 spaghetti-0.031
 hop-0.031
 snowball-0.030
Token ground
Token ID2323
Feature activation-3.10e-01

Pos logit contributions
Lay+0.001
ve+0.001
ager+0.001
iring+0.001
spe+0.001
Neg logit contributions
 chew-0.001
 spaghetti-0.001
 bisc-0.001
 snowball-0.001
 hop-0.001