TOP ACTIVATIONS
MAX = 4.118

there were two friends , Pau ly and Kar la .
your own pen . Find another pen !" L
showed the bird how to use the compass .
a very pretty dress , Jenny . And it is unique
The bird flew down and helped Bun Bun pull up the
said " Self - destruct activated . Countdown begins ."
" Sure , I can help you . Let 's go
all the blue beads in another pile , and so on
taught him how to attach a string to his jets .
That 's very nice , Jude , but will it really
there were two friends named Walter and L ila . They
- destruct activated . Countdown begins ." Tom and
wise owl who lived in a big tree . The owl
down his big adventures in a big book . He wrote
, the blue paper in another file , and the yellow
wise owl who lived in a tree . Every Tuesday ,
the band , one by one . She made a pattern
They may be harder to find , but they are worth
old owl who lived in a big tree . The owl
put some feathers on the end to make it look friendly

MIN ACTIVATIONS
MIN = -4.497

! Amy had never seen coffee before and she couldn 't
because he had never seen chocolate in the wild before .
never been back to the coffee shop again and lost his
he never wanted to try coffee again ! He never forgot
eager to eat the y ummy cookies . She ate one
They both enjoyed the y ummy dessert . After
because she had never had cocoa before . She wanted to
. Sarah had never tried coffee before and she was very
She never got her delicious chocolate . <|endoftext|> Once there was
When Mitt ens woke up , he saw
, can I eat the chocolate cake now ?"
had never seen a w affle before , and she knew
replied . As they walked upstairs , Lily saw
not yours ." Me gan tried to resist her
comfortable and they enjoyed the sunny day . Suddenly , Lily
" Why are you sad , Ann ?" Ann
He had never seen a lime before and it felt very
. He had never seen rain before .
never tried the sm elly coffee , but they had a
kitchen . They saw a m op in the corner .

INTERVAL 1.964 - 4.118
CONTAINS 0.516%

upon a time , in a small town , there was
ask your neighbor if they found it ." Tim my thought
cut a hole for the door . They tape some paper
She used the peanuts to make a peanut butter sandwich and
day , on a little boat . He cast his line

INTERVAL -0.190 - 1.964
CONTAINS 59.817%

there were two friends called Jack and Sam . They were
after thinking it through , she hugged Amy and said ,
strong enough . He tried to save the cat but he
he asked his dad for help . His dad nodded and
at times , but it 's always right ."

INTERVAL -2.343 - -0.190
CONTAINS 39.467%

I 'm looking for a place to bloom ."
a little girl who loved to search . Everywhere she went
Big drops of water started to pour down , and the
and said hello to the bear . The bear
L ily 's holiday was ruined because she didn 't

INTERVAL -4.497 - -2.343
CONTAINS 0.200%

to hide and Spot liked to find her . When Spot
. Let 's get the m op and clean up this
started walking again . As they walked , Rex noticed that
closed the doors . Tom and Sue sat on their beds
tall and hard . Sara and Tom could not see what
Token there
Token ID612
Feature activation-8.13e-03

Pos logit contributions
round+0.012
Whe+0.011
rob+0.010
cream+0.009
Mike+0.009
Neg logit contributions
 big-0.003
 kind-0.003
 pen-0.003
 small-0.003
 drawer-0.003
Token were
Token ID547
Feature activation+3.26e-01

Pos logit contributions
round+0.000
Whe+0.000
rob+0.000
cream+0.000
Rob+0.000
Neg logit contributions
 big-0.000
 small-0.000
 kind-0.000
 pen-0.000
 drawer-0.000
Token two
Token ID734
Feature activation+5.29e-01

Pos logit contributions
 big+0.006
 small+0.006
 kind+0.006
 true+0.006
 pen+0.006
Neg logit contributions
round-0.014
Whe-0.014
rob-0.013
cream-0.011
Mike-0.011
Token friends
Token ID2460
Feature activation+5.75e-01

Pos logit contributions
 drawer+0.012
 pen+0.011
 true+0.011
 file+0.011
 big+0.011
Neg logit contributions
round-0.020
Whe-0.019
rob-0.017
cream-0.015
 Clown-0.015
Token,
Token ID11
Feature activation+6.80e-01

Pos logit contributions
 big+0.012
 small+0.011
 drawer+0.011
 pen+0.011
 kind+0.011
Neg logit contributions
round-0.023
Whe-0.022
rob-0.020
cream-0.018
Mike-0.018
Token Pau
Token ID37329
Feature activation+4.12e+00

Pos logit contributions
 big+0.014
 small+0.014
 kind+0.014
 pen+0.013
 wise+0.013
Neg logit contributions
round-0.028
Whe-0.027
rob-0.025
cream-0.022
Rob-0.021
Tokenly
Token ID306
Feature activation+1.43e+00

Pos logit contributions
 drawer+0.137
 big+0.136
 small+0.136
 box+0.134
 kind+0.132
Neg logit contributions
round-0.117
Whe-0.089
rob-0.088
cream-0.079
Rob-0.067
Token and
Token ID290
Feature activation+8.81e-01

Pos logit contributions
 big+0.025
 small+0.025
 drawer+0.025
 kind+0.025
 pen+0.024
Neg logit contributions
round-0.062
Whe-0.058
rob-0.054
cream-0.049
Rob-0.047
Token Kar
Token ID9375
Feature activation+1.81e+00

Pos logit contributions
 drawer+0.021
 big+0.021
 pen+0.021
 box+0.020
 small+0.020
Neg logit contributions
round-0.031
Whe-0.030
rob-0.028
 Clown-0.024
Mike-0.024
Tokenla
Token ID5031
Feature activation+1.52e+00

Pos logit contributions
 big+0.049
 kind+0.048
 small+0.048
 wise+0.047
 box+0.045
Neg logit contributions
round-0.063
Whe-0.058
rob-0.054
cream-0.048
Mike-0.046
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
 big+0.018
 small+0.018
 kind+0.017
 drawer+0.017
 pen+0.017
Neg logit contributions
round-0.075
Whe-0.072
rob-0.067
cream-0.062
Rob-0.059
Token your
Token ID534
Feature activation+3.01e+00

Pos logit contributions
 handy+0.017
 wise+0.016
 seed+0.016
 drawer+0.016
 kind+0.016
Neg logit contributions
round-0.040
Whe-0.039
rob-0.034
cream-0.033
Rob-0.032
Token own
Token ID898
Feature activation+2.57e+00

Pos logit contributions
ook+0.055
ft+0.048
 handy+0.045
yt+0.045
ooks+0.044
Neg logit contributions
Whe-0.093
round-0.091
rob-0.085
 Clown-0.080
robat-0.080
Token pen
Token ID3112
Feature activation+9.69e-01

Pos logit contributions
ook+0.038
 handy+0.034
 recycling+0.032
olving+0.031
Jud+0.030
Neg logit contributions
round-0.093
Whe-0.082
rob-0.079
robat-0.076
cream-0.076
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
 small+0.018
 big+0.018
 kind+0.018
 wise+0.018
 drawer+0.017
Neg logit contributions
round-0.040
Whe-0.035
rob-0.032
cream-0.031
seek-0.029
Token Find
Token ID9938
Feature activation+1.62e+00

Pos logit contributions
 big+0.033
 small+0.032
 kind+0.031
 pen+0.031
 drawer+0.031
Neg logit contributions
round-0.045
Whe-0.043
rob-0.037
cream-0.034
 Clown-0.033
Token another
Token ID1194
Feature activation+4.01e+00

Pos logit contributions
 big+0.031
 kind+0.031
 drawer+0.031
 small+0.031
 wise+0.030
Neg logit contributions
round-0.065
Whe-0.062
rob-0.057
cream-0.053
Rob-0.050
Token pen
Token ID3112
Feature activation+1.11e+00

Pos logit contributions
olving+0.055
ook+0.051
iqu+0.045
 maj+0.045
 handy+0.045
Neg logit contributions
round-0.149
rob-0.142
robat-0.133
Whe-0.132
 Clown-0.120
Token!"
Token ID2474
Feature activation+2.70e-01

Pos logit contributions
 small+0.022
 big+0.021
 kind+0.021
 wise+0.021
 true+0.020
Neg logit contributions
round-0.047
Whe-0.042
rob-0.039
cream-0.036
hop-0.034
Token
Token ID198
Feature activation+1.54e-01

Pos logit contributions
 big+0.005
 small+0.005
 kind+0.005
 wise+0.005
 pen+0.005
Neg logit contributions
round-0.012
Whe-0.011
rob-0.010
cream-0.009
icans-0.009
Token
Token ID198
Feature activation+8.42e-01

Pos logit contributions
 drawer+0.004
 pen+0.004
 kind+0.004
 big+0.004
 small+0.003
Neg logit contributions
round-0.006
Whe-0.005
rob-0.005
cream-0.004
Rob-0.004
TokenL
Token ID43
Feature activation-1.00e+00

Pos logit contributions
 pen+0.023
 drawer+0.022
 file+0.022
 big+0.022
 box+0.022
Neg logit contributions
round-0.029
Whe-0.028
rob-0.023
Rob-0.022
Mike-0.022
Token showed
Token ID3751
Feature activation+1.74e+00

Pos logit contributions
 drawer+0.031
 small+0.030
 big+0.030
 file+0.029
 diary+0.029
Neg logit contributions
round-0.039
Whe-0.035
rob-0.033
cream-0.032
hop-0.028
Token the
Token ID262
Feature activation+2.09e+00

Pos logit contributions
 drawer+0.041
 file+0.039
 pen+0.039
 seed+0.038
 kind+0.038
Neg logit contributions
round-0.061
Whe-0.060
rob-0.054
cream-0.051
hop-0.046
Token bird
Token ID6512
Feature activation+7.96e-01

Pos logit contributions
ook+0.051
 drawer+0.047
olving+0.045
ooks+0.045
 Po+0.044
Neg logit contributions
round-0.061
rob-0.056
Whe-0.055
Rob-0.054
robat-0.049
Token how
Token ID703
Feature activation+2.19e+00

Pos logit contributions
 drawer+0.018
 pen+0.017
 file+0.016
 handy+0.016
 blunt+0.016
Neg logit contributions
round-0.029
Whe-0.026
cream-0.024
rob-0.022
hop-0.022
Token to
Token ID284
Feature activation+1.56e+00

Pos logit contributions
 drawer+0.062
 pen+0.055
 box+0.054
 folder+0.052
 pot+0.052
Neg logit contributions
cream-0.063
round-0.059
Whe-0.054
seek-0.050
Nice-0.050
Token use
Token ID779
Feature activation+3.90e+00

Pos logit contributions
 drawer+0.034
 seed+0.032
 small+0.032
 folder+0.031
 diary+0.031
Neg logit contributions
round-0.053
Whe-0.048
rob-0.045
Rob-0.039
cream-0.039
Token the
Token ID262
Feature activation+2.99e+00

Pos logit contributions
 seed+0.103
rick+0.100
 drawer+0.098
 kind+0.098
ror+0.098
Neg logit contributions
round-0.075
rob-0.072
Whe-0.071
 as-0.069
 himself-0.066
Token compass
Token ID31855
Feature activation+2.23e+00

Pos logit contributions
ook+0.073
 Po+0.065
ot+0.065
ooks+0.064
 plant+0.062
Neg logit contributions
round-0.084
rob-0.078
Whe-0.076
robat-0.069
Rob-0.068
Token.
Token ID13
Feature activation+1.56e+00

Pos logit contributions
 kind+0.048
 drawer+0.048
 lav+0.047
 small+0.046
 blunt+0.046
Neg logit contributions
round-0.078
cream-0.060
Whe-0.058
hop-0.057
seek-0.055
Token
Token ID220
Feature activation+1.01e+00

Pos logit contributions
 kind+0.039
 big+0.039
 drawer+0.038
 small+0.038
 pen+0.038
Neg logit contributions
round-0.056
Whe-0.054
rob-0.048
cream-0.044
Rob-0.041
Token
Token ID198
Feature activation+6.28e-01

Pos logit contributions
 big+0.020
 small+0.020
 kind+0.020
 wise+0.019
 pen+0.019
Neg logit contributions
round-0.043
Whe-0.041
rob-0.037
cream-0.033
 Clown-0.032
Token a
Token ID257
Feature activation+5.19e-01

Pos logit contributions
 pen+0.009
 drawer+0.009
 box+0.009
 seed+0.009
 file+0.008
Neg logit contributions
round-0.017
Whe-0.016
rob-0.014
cream-0.013
 Clown-0.013
Token very
Token ID845
Feature activation+2.40e-01

Pos logit contributions
 pen+0.007
 small+0.007
 big+0.006
 drawer+0.006
 kind+0.006
Neg logit contributions
round-0.024
Whe-0.023
rob-0.021
 Clown-0.019
Rob-0.019
Token pretty
Token ID2495
Feature activation-7.09e-01

Pos logit contributions
 pen+0.004
 drawer+0.004
 box+0.004
 diary+0.004
olving+0.004
Neg logit contributions
round-0.010
Whe-0.009
 Clown-0.008
Mike-0.008
rob-0.008
Token dress
Token ID6576
Feature activation+5.06e-01

Pos logit contributions
round+0.031
Whe+0.029
rob+0.026
 Clown+0.023
cream+0.023
Neg logit contributions
 small-0.012
 big-0.012
 kind-0.012
 wise-0.012
 handy-0.011
Token,
Token ID11
Feature activation-5.79e-01

Pos logit contributions
 small+0.008
 big+0.008
 kind+0.008
 drawer+0.007
 pen+0.007
Neg logit contributions
round-0.023
Whe-0.021
rob-0.020
cream-0.018
 Clown-0.018
Token Jenny
Token ID28437
Feature activation+3.82e+00

Pos logit contributions
round+0.020
Whe+0.019
rob+0.015
Mike+0.015
 Clown+0.015
Neg logit contributions
 seed-0.014
 pen-0.014
 big-0.014
 small-0.014
 drawer-0.014
Token.
Token ID13
Feature activation+1.49e+00

Pos logit contributions
 big+0.083
 small+0.082
olving+0.082
 loose+0.082
 solar+0.080
Neg logit contributions
round-0.143
Whe-0.115
rob-0.113
cream-0.109
Rob-0.099
Token And
Token ID843
Feature activation+1.00e+00

Pos logit contributions
 big+0.041
 small+0.040
 kind+0.039
 pen+0.039
 box+0.038
Neg logit contributions
round-0.053
Whe-0.051
rob-0.040
 Clown-0.039
cream-0.038
Token it
Token ID340
Feature activation+1.10e+00

Pos logit contributions
 pen+0.026
 drawer+0.025
 box+0.025
 file+0.024
 big+0.024
Neg logit contributions
Whe-0.030
round-0.030
 Clown-0.023
cream-0.022
Mike-0.021
Token is
Token ID318
Feature activation+1.00e+00

Pos logit contributions
 big+0.014
 small+0.014
 pen+0.013
 kind+0.013
 drawer+0.013
Neg logit contributions
round-0.051
Whe-0.050
rob-0.046
cream-0.042
 Clown-0.041
Token unique
Token ID3748
Feature activation-1.21e+00

Pos logit contributions
 pen+0.021
 file+0.020
 drawer+0.020
 box+0.020
 seed+0.019
Neg logit contributions
round-0.037
Whe-0.035
rob-0.030
cream-0.028
 Clown-0.028
Token The
Token ID383
Feature activation+1.70e+00

Pos logit contributions
 big+0.032
 drawer+0.032
 kind+0.032
 small+0.031
 pen+0.031
Neg logit contributions
round-0.055
Whe-0.052
rob-0.046
cream-0.043
Rob-0.041
Token bird
Token ID6512
Feature activation+1.12e+00

Pos logit contributions
 drawer+0.040
 solar+0.036
olving+0.035
 pen+0.035
 handy+0.034
Neg logit contributions
round-0.057
Whe-0.055
rob-0.047
ican-0.044
 Clown-0.044
Token flew
Token ID13112
Feature activation+1.39e+00

Pos logit contributions
 big+0.025
 small+0.025
 kind+0.024
 drawer+0.024
 pen+0.024
Neg logit contributions
round-0.044
Whe-0.042
rob-0.038
cream-0.034
Rob-0.032
Token down
Token ID866
Feature activation+8.11e-01

Pos logit contributions
 drawer+0.038
 pen+0.037
 file+0.035
 diary+0.034
 kind+0.034
Neg logit contributions
round-0.048
Whe-0.043
rob-0.036
cream-0.034
icans-0.034
Token and
Token ID290
Feature activation+9.97e-01

Pos logit contributions
 drawer+0.017
 pen+0.017
 diary+0.016
 kind+0.016
 file+0.016
Neg logit contributions
round-0.032
Whe-0.029
rob-0.027
cream-0.024
icans-0.024
Token helped
Token ID4193
Feature activation+3.81e+00

Pos logit contributions
 drawer+0.027
 kind+0.026
 big+0.025
 file+0.025
 pen+0.025
Neg logit contributions
round-0.033
Whe-0.032
rob-0.029
cream-0.026
Rob-0.024
Token Bun
Token ID28515
Feature activation+8.78e-01

Pos logit contributions
 drawer+0.127
 box+0.117
 kind+0.116
 seed+0.115
 diary+0.114
Neg logit contributions
round-0.100
Whe-0.091
rob-0.079
cream-0.079
icans-0.068
Token Bun
Token ID28515
Feature activation+4.58e-01

Pos logit contributions
 drawer+0.021
 diary+0.019
 box+0.019
 file+0.019
 big+0.019
Neg logit contributions
round-0.033
Whe-0.030
rob-0.027
cream-0.027
seek-0.026
Token pull
Token ID2834
Feature activation+2.34e+00

Pos logit contributions
 drawer+0.010
 big+0.009
 diary+0.009
 small+0.009
 box+0.009
Neg logit contributions
round-0.019
Whe-0.017
rob-0.015
cream-0.015
seek-0.015
Token up
Token ID510
Feature activation+1.25e+00

Pos logit contributions
 drawer+0.052
 pen+0.049
 kind+0.048
 diary+0.047
 box+0.046
Neg logit contributions
round-0.088
Whe-0.084
rob-0.074
cream-0.073
Rob-0.067
Token the
Token ID262
Feature activation+1.58e+00

Pos logit contributions
 drawer+0.024
 pen+0.023
 kind+0.022
 diary+0.021
 box+0.021
Neg logit contributions
round-0.050
Whe-0.047
cream-0.042
rob-0.041
icans-0.039
Token said
Token ID531
Feature activation-8.53e-02

Pos logit contributions
round+0.009
Whe+0.009
rob+0.008
cream+0.007
Rob+0.007
Neg logit contributions
 big-0.005
 small-0.005
 kind-0.005
 pen-0.005
 drawer-0.005
Token "
Token ID366
Feature activation+3.52e-01

Pos logit contributions
round+0.004
Whe+0.003
rob+0.003
 Clown+0.003
Mike+0.003
Neg logit contributions
 pen-0.001
 drawer-0.001
 file-0.001
 big-0.001
 small-0.001
TokenSelf
Token ID24704
Feature activation+6.57e-01

Pos logit contributions
 true+0.010
 big+0.010
 small+0.009
 pen+0.009
 kind+0.009
Neg logit contributions
round-0.012
Whe-0.010
rob-0.010
cream-0.009
 Clown-0.008
Token-
Token ID12
Feature activation-1.08e+00

Pos logit contributions
 handy+0.018
 big+0.018
 wise+0.018
 kind+0.018
 small+0.018
Neg logit contributions
round-0.022
Whe-0.019
cream-0.017
rob-0.015
ored-0.014
Tokendestruct
Token ID35678
Feature activation+1.23e+00

Pos logit contributions
round+0.039
Whe+0.034
rob+0.031
cream+0.030
Rob+0.027
Neg logit contributions
 drawer-0.030
 big-0.029
 small-0.029
 kind-0.029
 box-0.029
Token activated
Token ID13906
Feature activation+3.74e+00

Pos logit contributions
 pen+0.033
 drawer+0.031
 kind+0.031
 diary+0.031
 file+0.030
Neg logit contributions
round-0.041
Whe-0.034
cream-0.031
rob-0.028
Rob-0.028
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
 diary+0.092
 pen+0.089
 drawer+0.088
 kind+0.088
 file+0.085
Neg logit contributions
round-0.125
Whe-0.102
cream-0.092
Mike-0.087
Rob-0.086
Token Countdown
Token ID47555
Feature activation+9.62e-01

Pos logit contributions
 big+0.005
 kind+0.004
 small+0.004
 pen+0.004
 drawer+0.004
Neg logit contributions
round-0.007
Whe-0.007
rob-0.006
cream-0.005
Rob-0.005
Token begins
Token ID6140
Feature activation+3.63e+00

Pos logit contributions
 pen+0.023
 drawer+0.022
 small+0.022
 kind+0.022
 big+0.022
Neg logit contributions
round-0.036
Whe-0.032
rob-0.027
cream-0.027
 Clown-0.026
Token."
Token ID526
Feature activation-6.11e-01

Pos logit contributions
 drawer+0.079
 pen+0.077
 file+0.076
 diary+0.076
 box+0.073
Neg logit contributions
round-0.132
Whe-0.110
cream-0.101
hop-0.098
seek-0.095
Token
Token ID198
Feature activation-7.39e-01

Pos logit contributions
round+0.028
Whe+0.026
rob+0.025
cream+0.022
icans+0.021
Neg logit contributions
 big-0.012
 small-0.011
 kind-0.011
 wise-0.010
 drawer-0.010
Token "
Token ID366
Feature activation+1.32e+00

Pos logit contributions
 big+0.009
 drawer+0.008
 pen+0.008
 small+0.008
 kind+0.008
Neg logit contributions
round-0.033
Whe-0.032
rob-0.029
cream-0.027
Rob-0.026
TokenSure
Token ID19457
Feature activation+9.92e-01

Pos logit contributions
 big+0.044
 small+0.044
 kind+0.044
 pen+0.043
 drawer+0.043
Neg logit contributions
round-0.038
Whe-0.035
rob-0.030
cream-0.026
Rob-0.024
Token,
Token ID11
Feature activation+1.59e-01

Pos logit contributions
 drawer+0.021
 solar+0.020
 pen+0.020
olving+0.018
 file+0.018
Neg logit contributions
round-0.039
Whe-0.035
rob-0.029
 Clown-0.029
Mike-0.029
Token I
Token ID314
Feature activation+2.06e+00

Pos logit contributions
 drawer+0.004
 pen+0.004
 solar+0.004
 blunt+0.004
 file+0.004
Neg logit contributions
round-0.005
Whe-0.005
cream-0.004
 Clown-0.004
icle-0.004
Token can
Token ID460
Feature activation+2.91e+00

Pos logit contributions
 drawer+0.033
 small+0.032
 pen+0.030
 old+0.030
 envelop+0.029
Neg logit contributions
round-0.080
Whe-0.074
cream-0.064
ored-0.064
rob-0.061
Token help
Token ID1037
Feature activation+3.71e+00

Pos logit contributions
 drawer+0.063
 solar+0.059
 diary+0.058
 small+0.058
 pen+0.058
Neg logit contributions
round-0.103
Whe-0.091
cream-0.085
rob-0.079
whe-0.078
Token you
Token ID345
Feature activation+2.24e+00

Pos logit contributions
 drawer+0.098
 box+0.092
 pen+0.091
 diary+0.090
 big+0.089
Neg logit contributions
round-0.125
Whe-0.114
rob-0.101
cream-0.097
icans-0.090
Token.
Token ID13
Feature activation+1.74e+00

Pos logit contributions
 drawer+0.044
 pen+0.042
 box+0.041
 kind+0.040
 small+0.040
Neg logit contributions
round-0.087
Whe-0.076
cream-0.067
rob-0.065
hop-0.064
Token Let
Token ID3914
Feature activation+2.47e+00

Pos logit contributions
 big+0.046
 small+0.045
 drawer+0.043
 pen+0.043
 kind+0.043
Neg logit contributions
round-0.064
Whe-0.061
rob-0.050
cream-0.048
 Clown-0.045
Token's
Token ID338
Feature activation+3.27e+00

Pos logit contributions
 drawer+0.057
olving+0.055
 handy+0.052
 solar+0.050
 tool+0.050
Neg logit contributions
Whe-0.089
round-0.087
rob-0.078
cream-0.069
hop-0.069
Token go
Token ID467
Feature activation+9.65e-01

Pos logit contributions
 drawer+0.065
 small+0.064
 kind+0.062
 valuable+0.060
 true+0.060
Neg logit contributions
round-0.118
Whe-0.112
cream-0.097
rob-0.092
hop-0.087
Token all
Token ID477
Feature activation+1.95e+00

Pos logit contributions
 true+0.018
 kind+0.018
 file+0.017
 diary+0.017
 wise+0.017
Neg logit contributions
round-0.033
Whe-0.033
rob-0.029
icans-0.026
cream-0.025
Token the
Token ID262
Feature activation+2.01e+00

Pos logit contributions
 diary+0.045
 wise+0.044
 true+0.043
 handy+0.042
 kind+0.041
Neg logit contributions
round-0.067
Whe-0.061
rob-0.050
cream-0.048
mine-0.046
Token blue
Token ID4171
Feature activation+1.79e+00

Pos logit contributions
 handy+0.040
 true+0.040
 wise+0.038
 kind+0.038
olving+0.038
Neg logit contributions
Whe-0.071
round-0.069
rob-0.062
 Clown-0.053
icans-0.052
Token beads
Token ID36116
Feature activation+2.82e+00

Pos logit contributions
 kind+0.041
 wise+0.040
 true+0.039
 big+0.038
 small+0.038
Neg logit contributions
round-0.066
Whe-0.063
rob-0.057
cream-0.050
icans-0.049
Token in
Token ID287
Feature activation+1.93e+00

Pos logit contributions
 true+0.059
 wise+0.058
 handy+0.057
 kind+0.056
 solar+0.055
Neg logit contributions
round-0.105
Whe-0.087
rob-0.084
ored-0.079
cream-0.079
Token another
Token ID1194
Feature activation+3.71e+00

Pos logit contributions
 diary+0.042
 wise+0.042
 kind+0.040
olving+0.040
 seed+0.039
Neg logit contributions
Whe-0.063
round-0.061
Mike-0.052
rob-0.051
 Clown-0.048
Token pile
Token ID14540
Feature activation+2.14e+00

Pos logit contributions
olving+0.069
 true+0.061
 wise+0.059
 handy+0.057
 loose+0.052
Neg logit contributions
round-0.136
Whe-0.133
rob-0.115
hop-0.109
robat-0.108
Token,
Token ID11
Feature activation+9.38e-01

Pos logit contributions
 wise+0.046
 true+0.045
 diary+0.045
 pen+0.044
Ru+0.044
Neg logit contributions
round-0.075
Whe-0.064
Mike-0.058
cream-0.057
rob-0.056
Token and
Token ID290
Feature activation+3.95e-01

Pos logit contributions
 file+0.023
 kind+0.023
 diary+0.023
 true+0.023
 wise+0.022
Neg logit contributions
round-0.030
Whe-0.029
rob-0.025
icans-0.022
cream-0.022
Token so
Token ID523
Feature activation+1.27e+00

Pos logit contributions
 wise+0.009
 kind+0.009
 handy+0.009
 true+0.009
 diary+0.009
Neg logit contributions
Whe-0.013
round-0.013
rob-0.011
cream-0.010
 Clown-0.010
Token on
Token ID319
Feature activation+1.26e+00

Pos logit contributions
 pen+0.025
 wise+0.025
 drawer+0.025
 diary+0.025
 box+0.024
Neg logit contributions
round-0.047
Whe-0.046
rob-0.038
cream-0.037
whe-0.036
Token taught
Token ID7817
Feature activation+9.97e-01

Pos logit contributions
 drawer+0.034
 seed+0.033
 small+0.032
 big+0.031
 box+0.031
Neg logit contributions
round-0.053
Whe-0.050
rob-0.046
cream-0.044
Rob-0.041
Token him
Token ID683
Feature activation+7.66e-01

Pos logit contributions
 big+0.033
 small+0.032
 kind+0.032
 pen+0.032
 drawer+0.031
Neg logit contributions
round-0.029
Whe-0.027
rob-0.024
cream-0.020
Rob-0.019
Token how
Token ID703
Feature activation+1.72e+00

Pos logit contributions
 drawer+0.016
 pen+0.015
 big+0.015
 small+0.015
 box+0.015
Neg logit contributions
round-0.030
Whe-0.028
cream-0.025
rob-0.024
seek-0.022
Token to
Token ID284
Feature activation+1.48e+00

Pos logit contributions
 drawer+0.036
 box+0.033
 pen+0.031
 pot+0.029
 shed+0.028
Neg logit contributions
round-0.069
cream-0.061
Whe-0.061
seek-0.054
Mike-0.053
Token attach
Token ID10199
Feature activation+2.60e+00

Pos logit contributions
 drawer+0.032
 small+0.030
 big+0.030
 kind+0.028
 box+0.028
Neg logit contributions
round-0.058
Whe-0.053
rob-0.047
cream-0.043
Rob-0.042
Token a
Token ID257
Feature activation+3.70e+00

Pos logit contributions
 drawer+0.058
 kind+0.055
 pen+0.055
 wise+0.054
 seed+0.054
Neg logit contributions
round-0.097
Whe-0.093
rob-0.083
cream-0.078
Rob-0.071
Token string
Token ID4731
Feature activation+1.27e+00

Pos logit contributions
olving+0.074
ook+0.070
 Pr+0.066
ror+0.065
 Po+0.065
Neg logit contributions
round-0.137
Whe-0.110
rob-0.099
ding-0.092
 coaster-0.091
Token to
Token ID284
Feature activation+2.37e+00

Pos logit contributions
 kind+0.020
 wise+0.020
 small+0.019
 drawer+0.019
 pen+0.018
Neg logit contributions
round-0.057
Whe-0.049
cream-0.045
rob-0.045
hop-0.041
Token his
Token ID465
Feature activation+2.93e+00

Pos logit contributions
 drawer+0.043
 seed+0.041
 kind+0.040
 box+0.040
 pen+0.039
Neg logit contributions
round-0.095
Whe-0.090
rob-0.079
cream-0.077
hop-0.071
Token jets
Token ID20792
Feature activation+1.10e+00

Pos logit contributions
ook+0.057
 drawer+0.055
 handy+0.055
olving+0.053
 kind+0.053
Neg logit contributions
round-0.105
Whe-0.097
rob-0.084
cream-0.078
bike-0.078
Token.
Token ID13
Feature activation+5.89e-01

Pos logit contributions
 small+0.017
 kind+0.017
 drawer+0.017
 big+0.017
 wise+0.016
Neg logit contributions
round-0.050
Whe-0.047
rob-0.042
cream-0.039
hop-0.037
TokenThat
Token ID2504
Feature activation+2.83e-01

Pos logit contributions
 big+0.006
 small+0.006
 kind+0.006
 pen+0.006
 true+0.006
Neg logit contributions
round-0.008
Whe-0.007
rob-0.006
cream-0.006
 Clown-0.005
Token's
Token ID338
Feature activation+2.24e-01

Pos logit contributions
 small+0.004
 big+0.004
 kind+0.004
 handy+0.004
 wise+0.004
Neg logit contributions
round-0.013
Whe-0.012
rob-0.011
cream-0.011
 Clown-0.010
Token very
Token ID845
Feature activation-2.64e-01

Pos logit contributions
 pen+0.005
 drawer+0.005
 big+0.005
 small+0.004
 box+0.004
Neg logit contributions
round-0.009
Whe-0.008
rob-0.007
cream-0.007
 Clown-0.006
Token nice
Token ID3621
Feature activation-4.98e-02

Pos logit contributions
round+0.011
Whe+0.010
cream+0.010
Mike+0.009
rob+0.009
Neg logit contributions
 pen-0.004
 diary-0.004
 drawer-0.004
 box-0.004
 folder-0.003
Token,
Token ID11
Feature activation+6.31e-01

Pos logit contributions
round+0.002
Whe+0.002
cream+0.002
rob+0.002
 Clown+0.002
Neg logit contributions
 small-0.001
 drawer-0.001
 big-0.001
 blunt-0.001
 pen-0.001
Token Jude
Token ID30044
Feature activation+3.67e+00

Pos logit contributions
 diary+0.016
 drawer+0.015
 seed+0.015
 box+0.015
 file+0.014
Neg logit contributions
round-0.021
Whe-0.019
 Clown-0.016
Mike-0.015
rob-0.015
Token,
Token ID11
Feature activation-7.86e-01

Pos logit contributions
 drawer+0.075
 big+0.075
 diary+0.074
 small+0.074
 handy+0.072
Neg logit contributions
round-0.140
rob-0.119
Whe-0.116
cream-0.114
robat-0.102
Token but
Token ID475
Feature activation+5.10e-01

Pos logit contributions
round+0.029
Whe+0.027
cream+0.022
 Clown+0.021
rob+0.021
Neg logit contributions
 drawer-0.016
 pen-0.016
 small-0.016
 big-0.016
 diary-0.016
Token will
Token ID481
Feature activation+1.09e+00

Pos logit contributions
 big+0.012
 pen+0.011
 small+0.011
 drawer+0.011
 kind+0.011
Neg logit contributions
round-0.017
Whe-0.016
rob-0.014
cream-0.014
 Clown-0.014
Token it
Token ID340
Feature activation+2.73e-01

Pos logit contributions
 big+0.022
 small+0.022
 kind+0.022
 pen+0.022
 drawer+0.022
Neg logit contributions
round-0.043
Whe-0.042
rob-0.037
cream-0.034
 Clown-0.033
Token really
Token ID1107
Feature activation+4.53e-01

Pos logit contributions
 big+0.005
 drawer+0.005
 small+0.005
 pen+0.005
 kind+0.005
Neg logit contributions
round-0.012
Whe-0.011
rob-0.010
cream-0.009
Rob-0.009
Token there
Token ID612
Feature activation-3.74e-01

Pos logit contributions
 pen+0.008
 drawer+0.007
 kind+0.007
 file+0.007
 wise+0.007
Neg logit contributions
round-0.020
Whe-0.020
rob-0.017
Mike-0.016
Jacob-0.016
Token were
Token ID547
Feature activation+4.28e-02

Pos logit contributions
round+0.016
Whe+0.016
rob+0.014
cream+0.013
Rob+0.012
Neg logit contributions
 big-0.007
 small-0.007
 kind-0.007
 pen-0.007
 drawer-0.007
Token two
Token ID734
Feature activation+4.84e-01

Pos logit contributions
 big+0.001
 small+0.001
 kind+0.001
 true+0.001
 pen+0.001
Neg logit contributions
round-0.002
Whe-0.002
rob-0.002
cream-0.002
 Clown-0.001
Token friends
Token ID2460
Feature activation+6.12e-01

Pos logit contributions
 drawer+0.011
 pen+0.011
 true+0.011
 file+0.010
 big+0.010
Neg logit contributions
round-0.018
Whe-0.017
rob-0.015
cream-0.013
 Clown-0.013
Token named
Token ID3706
Feature activation+6.39e-01

Pos logit contributions
 big+0.013
 small+0.012
 kind+0.012
 drawer+0.012
 pen+0.012
Neg logit contributions
round-0.025
Whe-0.024
rob-0.021
cream-0.019
Mike-0.019
Token Walter
Token ID15819
Feature activation+3.66e+00

Pos logit contributions
 kind+0.020
 pen+0.019
 wise+0.019
 drawer+0.019
 small+0.019
Neg logit contributions
round-0.019
Whe-0.018
rob-0.015
cream-0.013
 Clown-0.013
Token and
Token ID290
Feature activation+9.46e-01

Pos logit contributions
 drawer+0.082
 big+0.078
 file+0.077
 small+0.077
 box+0.077
Neg logit contributions
round-0.139
Whe-0.117
rob-0.117
cream-0.109
Rob-0.102
Token L
Token ID406
Feature activation+1.04e+00

Pos logit contributions
 drawer+0.023
 big+0.022
 box+0.022
 small+0.022
 pen+0.022
Neg logit contributions
round-0.032
Whe-0.031
rob-0.029
 Clown-0.025
Rob-0.025
Tokenila
Token ID10102
Feature activation+1.50e+00

Pos logit contributions
 big+0.032
 small+0.032
 kind+0.032
 pen+0.031
 drawer+0.031
Neg logit contributions
round-0.032
Whe-0.030
rob-0.027
cream-0.022
Mike-0.022
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
 drawer+0.019
 big+0.019
 small+0.019
 pen+0.018
 kind+0.018
Neg logit contributions
round-0.072
Whe-0.068
rob-0.063
cream-0.058
Rob-0.056
Token They
Token ID1119
Feature activation+1.07e+00

Pos logit contributions
 kind+0.046
 big+0.044
 drawer+0.044
 diary+0.043
 wise+0.043
Neg logit contributions
round-0.050
Whe-0.046
rob-0.044
cream-0.038
icans-0.037
Token-
Token ID12
Feature activation-1.08e+00

Pos logit contributions
 handy+0.018
 big+0.018
 wise+0.018
 kind+0.018
 small+0.018
Neg logit contributions
round-0.022
Whe-0.019
cream-0.017
rob-0.015
ored-0.014
Tokendestruct
Token ID35678
Feature activation+1.23e+00

Pos logit contributions
round+0.039
Whe+0.034
rob+0.031
cream+0.030
Rob+0.027
Neg logit contributions
 drawer-0.030
 big-0.029
 small-0.029
 kind-0.029
 box-0.029
Token activated
Token ID13906
Feature activation+3.74e+00

Pos logit contributions
 pen+0.033
 drawer+0.031
 kind+0.031
 diary+0.031
 file+0.030
Neg logit contributions
round-0.041
Whe-0.034
cream-0.031
rob-0.028
Rob-0.028
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
 diary+0.092
 pen+0.089
 drawer+0.088
 kind+0.088
 file+0.085
Neg logit contributions
round-0.125
Whe-0.102
cream-0.092
Mike-0.087
Rob-0.086
Token Countdown
Token ID47555
Feature activation+9.62e-01

Pos logit contributions
 big+0.005
 kind+0.004
 small+0.004
 pen+0.004
 drawer+0.004
Neg logit contributions
round-0.007
Whe-0.007
rob-0.006
cream-0.005
Rob-0.005
Token begins
Token ID6140
Feature activation+3.63e+00

Pos logit contributions
 pen+0.023
 drawer+0.022
 small+0.022
 kind+0.022
 big+0.022
Neg logit contributions
round-0.036
Whe-0.032
rob-0.027
cream-0.027
 Clown-0.026
Token."
Token ID526
Feature activation-6.11e-01

Pos logit contributions
 drawer+0.079
 pen+0.077
 file+0.076
 diary+0.076
 box+0.073
Neg logit contributions
round-0.132
Whe-0.110
cream-0.101
hop-0.098
seek-0.095
Token
Token ID198
Feature activation-7.39e-01

Pos logit contributions
round+0.028
Whe+0.026
rob+0.025
cream+0.022
icans+0.021
Neg logit contributions
 big-0.012
 small-0.011
 kind-0.011
 wise-0.010
 drawer-0.010
Token
Token ID198
Feature activation-8.38e-01

Pos logit contributions
round+0.025
Whe+0.024
rob+0.021
Rob+0.019
Mike+0.019
Neg logit contributions
 drawer-0.020
 pen-0.020
 seed-0.019
 file-0.019
 true-0.019
TokenTom
Token ID13787
Feature activation-8.99e-01

Pos logit contributions
round+0.035
Whe+0.033
rob+0.029
Jacob+0.027
Rob+0.027
Neg logit contributions
 pen-0.017
 drawer-0.017
 big-0.016
 box-0.016
 true-0.016
Token and
Token ID290
Feature activation-2.90e+00

Pos logit contributions
round+0.038
Whe+0.037
rob+0.034
cream+0.030
icans+0.029
Neg logit contributions
 big-0.018
 small-0.018
 kind-0.018
 pen-0.017
 wise-0.017
Token wise
Token ID10787
Feature activation+2.69e-01

Pos logit contributions
olving+0.020
 drawer+0.019
 pen+0.018
 true+0.018
 box+0.017
Neg logit contributions
round-0.057
Whe-0.055
rob-0.048
 Clown-0.046
Ste-0.044
Token owl
Token ID39610
Feature activation+1.50e+00

Pos logit contributions
 true+0.004
 drawer+0.004
 will+0.004
 small+0.004
 big+0.004
Neg logit contributions
round-0.012
Whe-0.011
rob-0.010
 Clown-0.009
cream-0.009
Token who
Token ID508
Feature activation+2.22e+00

Pos logit contributions
 drawer+0.039
 big+0.038
 small+0.038
 handy+0.037
 file+0.037
Neg logit contributions
round-0.054
Whe-0.046
rob-0.044
cream-0.040
hop-0.038
Token lived
Token ID5615
Feature activation+1.35e+00

Pos logit contributions
 drawer+0.059
 big+0.057
 small+0.057
 pen+0.057
 box+0.056
Neg logit contributions
round-0.075
Whe-0.069
rob-0.065
cream-0.059
Rob-0.054
Token in
Token ID287
Feature activation+1.72e+00

Pos logit contributions
 pen+0.036
 drawer+0.035
 file+0.034
 can+0.033
 pays+0.033
Neg logit contributions
round-0.041
Whe-0.040
rob-0.035
Ste-0.034
cream-0.031
Token a
Token ID257
Feature activation+3.61e+00

Pos logit contributions
 drawer+0.039
 box+0.037
 pen+0.037
olving+0.037
 file+0.036
Neg logit contributions
Whe-0.060
round-0.055
rob-0.052
cream-0.046
Rob-0.045
Token big
Token ID1263
Feature activation+3.06e+00

Pos logit contributions
olving+0.068
 pays+0.051
ac+0.047
 password+0.046
IVE+0.045
Neg logit contributions
round-0.133
Whe-0.132
rob-0.118
Rob-0.115
Ste-0.114
Token tree
Token ID5509
Feature activation+1.86e+00

Pos logit contributions
olving+0.085
ook+0.075
 password+0.074
ft+0.073
 true+0.072
Neg logit contributions
round-0.087
Whe-0.073
rob-0.071
 Clown-0.059
hop-0.059
Token.
Token ID13
Feature activation+6.93e-01

Pos logit contributions
 true+0.036
 pen+0.035
 file+0.035
 drawer+0.034
 handy+0.034
Neg logit contributions
round-0.075
Whe-0.063
rob-0.060
hop-0.058
cream-0.057
Token The
Token ID383
Feature activation+1.71e+00

Pos logit contributions
 big+0.016
 kind+0.015
 small+0.015
 pen+0.015
 box+0.015
Neg logit contributions
round-0.027
Whe-0.026
rob-0.022
cream-0.020
 Clown-0.020
Token owl
Token ID39610
Feature activation+1.58e+00

Pos logit contributions
 drawer+0.039
rick+0.038
 folder+0.037
ook+0.036
 pen+0.036
Neg logit contributions
round-0.056
Whe-0.055
rob-0.048
cream-0.043
hop-0.042
Token down
Token ID866
Feature activation+8.90e-01

Pos logit contributions
 drawer+0.014
 seed+0.013
 file+0.013
 pen+0.013
 box+0.013
Neg logit contributions
round-0.023
Whe-0.021
rob-0.020
cream-0.019
Rob-0.017
Token his
Token ID465
Feature activation+1.81e+00

Pos logit contributions
 drawer+0.021
 pen+0.021
 handy+0.021
 seed+0.020
 file+0.020
Neg logit contributions
round-0.030
Whe-0.026
rob-0.024
cream-0.024
 Clown-0.021
Token big
Token ID1263
Feature activation+2.31e+00

Pos logit contributions
 kind+0.034
 true+0.033
 drawer+0.033
 small+0.033
 handy+0.032
Neg logit contributions
round-0.072
Whe-0.067
rob-0.062
cream-0.058
 Clown-0.054
Token adventures
Token ID17545
Feature activation+1.43e+00

Pos logit contributions
olving+0.046
 handy+0.045
ook+0.045
 loose+0.044
 true+0.043
Neg logit contributions
round-0.082
Whe-0.069
rob-0.069
 Clown-0.064
 coaster-0.061
Token in
Token ID287
Feature activation+2.02e+00

Pos logit contributions
 small+0.022
 big+0.022
 kind+0.022
 drawer+0.022
 true+0.021
Neg logit contributions
round-0.065
Whe-0.060
rob-0.057
cream-0.053
 Clown-0.049
Token a
Token ID257
Feature activation+3.60e+00

Pos logit contributions
olving+0.062
 seed+0.062
 file+0.059
 drawer+0.059
 will+0.059
Neg logit contributions
Whe-0.048
round-0.044
rob-0.039
 Clown-0.035
hop-0.035
Token big
Token ID1263
Feature activation+3.10e+00

Pos logit contributions
olving+0.081
olve+0.052
 vain+0.051
 unlocks+0.048
 pays+0.048
Neg logit contributions
round-0.140
Whe-0.127
rob-0.127
Rob-0.118
 clown-0.112
Token book
Token ID1492
Feature activation+8.09e-01

Pos logit contributions
olving+0.074
 true+0.060
ook+0.058
 loose+0.058
 valuable+0.055
Neg logit contributions
round-0.116
rob-0.100
Whe-0.100
robat-0.086
 Clown-0.084
Token.
Token ID13
Feature activation+7.17e-01

Pos logit contributions
 true+0.011
 small+0.011
 kind+0.011
 drawer+0.011
 pen+0.011
Neg logit contributions
round-0.038
Whe-0.035
rob-0.033
cream-0.030
seek-0.029
Token He
Token ID679
Feature activation+1.82e+00

Pos logit contributions
 big+0.015
 small+0.015
 kind+0.015
 pen+0.014
 drawer+0.014
Neg logit contributions
round-0.029
Whe-0.028
rob-0.026
cream-0.023
Mike-0.022
Token wrote
Token ID2630
Feature activation-1.98e-01

Pos logit contributions
 big+0.036
 small+0.036
 kind+0.035
 drawer+0.035
 wise+0.034
Neg logit contributions
round-0.075
Whe-0.071
rob-0.066
cream-0.059
Rob-0.057
Token,
Token ID11
Feature activation+4.90e-01

Pos logit contributions
 true+0.025
 wise+0.024
 kind+0.024
 small+0.022
 solar+0.022
Neg logit contributions
round-0.039
Whe-0.033
rob-0.030
cream-0.029
hop-0.028
Token the
Token ID262
Feature activation+1.96e+00

Pos logit contributions
 true+0.012
 kind+0.011
 diary+0.011
 big+0.011
 wise+0.011
Neg logit contributions
round-0.017
Whe-0.016
rob-0.014
cream-0.012
icans-0.012
Token blue
Token ID4171
Feature activation+1.39e+00

Pos logit contributions
ook+0.035
 true+0.033
ool+0.029
 feat+0.029
 handy+0.028
Neg logit contributions
Whe-0.068
round-0.066
 sidelines-0.059
 Clown-0.055
rob-0.055
Token paper
Token ID3348
Feature activation+1.72e+00

Pos logit contributions
 true+0.030
 kind+0.029
 wise+0.029
 small+0.028
 big+0.027
Neg logit contributions
round-0.053
Whe-0.047
rob-0.043
cream-0.040
 Clown-0.039
Token in
Token ID287
Feature activation+1.57e+00

Pos logit contributions
 true+0.037
 wise+0.035
 kind+0.035
 handy+0.031
 feat+0.031
Neg logit contributions
round-0.062
Whe-0.053
rob-0.049
cream-0.047
hop-0.046
Token another
Token ID1194
Feature activation+3.60e+00

Pos logit contributions
 diary+0.036
 wise+0.035
 seed+0.035
 kind+0.034
 pen+0.034
Neg logit contributions
round-0.052
Whe-0.051
rob-0.042
Mike-0.041
Jacob-0.039
Token file
Token ID2393
Feature activation+1.62e+00

Pos logit contributions
olving+0.065
 true+0.062
 wise+0.057
 feat+0.056
 maj+0.056
Neg logit contributions
round-0.126
Whe-0.121
rob-0.104
robat-0.103
 Clown-0.102
Token,
Token ID11
Feature activation+5.04e-01

Pos logit contributions
 true+0.031
 wise+0.029
 kind+0.028
 small+0.028
 big+0.027
Neg logit contributions
round-0.067
Whe-0.059
rob-0.056
cream-0.051
hop-0.050
Token and
Token ID290
Feature activation+1.54e-01

Pos logit contributions
 diary+0.013
 kind+0.013
 true+0.012
 file+0.012
 pen+0.012
Neg logit contributions
round-0.016
Whe-0.015
rob-0.013
cream-0.012
icans-0.012
Token the
Token ID262
Feature activation+1.92e+00

Pos logit contributions
 kind+0.004
 wise+0.004
 big+0.004
 small+0.004
 pen+0.004
Neg logit contributions
round-0.005
Whe-0.005
rob-0.005
cream-0.004
 Clown-0.004
Token yellow
Token ID7872
Feature activation+1.15e+00

Pos logit contributions
ook+0.037
 true+0.033
ool+0.031
 handy+0.031
olving+0.030
Neg logit contributions
Whe-0.064
round-0.061
 sidelines-0.054
rob-0.052
 Clown-0.051
Token wise
Token ID10787
Feature activation+1.95e-01

Pos logit contributions
olving+0.019
 drawer+0.018
 pen+0.017
 true+0.017
 box+0.016
Neg logit contributions
round-0.054
Whe-0.051
rob-0.045
 Clown-0.043
Ste-0.041
Token owl
Token ID39610
Feature activation+1.62e+00

Pos logit contributions
 true+0.003
 drawer+0.003
 small+0.003
 big+0.003
 will+0.003
Neg logit contributions
round-0.009
Whe-0.008
rob-0.007
 Clown-0.006
cream-0.006
Token who
Token ID508
Feature activation+2.22e+00

Pos logit contributions
 big+0.041
 drawer+0.041
 small+0.041
 handy+0.039
 kind+0.039
Neg logit contributions
round-0.059
Whe-0.051
rob-0.049
cream-0.044
hop-0.042
Token lived
Token ID5615
Feature activation+1.40e+00

Pos logit contributions
 drawer+0.064
 handy+0.061
 pen+0.061
 small+0.060
 file+0.060
Neg logit contributions
round-0.069
Whe-0.061
rob-0.059
cream-0.055
Rob-0.049
Token in
Token ID287
Feature activation+1.66e+00

Pos logit contributions
 pen+0.037
 drawer+0.036
 file+0.035
 can+0.033
 pays+0.033
Neg logit contributions
round-0.043
Whe-0.042
rob-0.037
Ste-0.035
cream-0.033
Token a
Token ID257
Feature activation+3.58e+00

Pos logit contributions
 drawer+0.038
 box+0.036
 pen+0.036
olving+0.036
 will+0.035
Neg logit contributions
Whe-0.057
round-0.053
rob-0.049
cream-0.044
Rob-0.043
Token tree
Token ID5509
Feature activation+1.75e+00

Pos logit contributions
olving+0.067
 pays+0.050
ac+0.046
 leads+0.044
 password+0.044
Neg logit contributions
round-0.133
Whe-0.131
rob-0.118
Rob-0.115
Ste-0.114
Token.
Token ID13
Feature activation+6.20e-01

Pos logit contributions
 true+0.033
 pen+0.033
 drawer+0.032
 file+0.032
 handy+0.032
Neg logit contributions
round-0.071
Whe-0.061
rob-0.058
hop-0.055
cream-0.054
Token Every
Token ID3887
Feature activation+8.39e-01

Pos logit contributions
 big+0.015
 pen+0.015
 kind+0.015
 box+0.015
 small+0.014
Neg logit contributions
round-0.023
Whe-0.022
rob-0.019
cream-0.017
 Clown-0.017
Token Tuesday
Token ID3431
Feature activation+1.26e-01

Pos logit contributions
 big+0.019
 small+0.019
 kind+0.019
 drawer+0.019
 pen+0.019
Neg logit contributions
round-0.032
Whe-0.030
rob-0.027
cream-0.024
 Clown-0.023
Token,
Token ID11
Feature activation+1.55e+00

Pos logit contributions
 big+0.001
 small+0.001
 kind+0.001
 pen+0.001
 wise+0.001
Neg logit contributions
round-0.007
Whe-0.006
rob-0.006
cream-0.005
Mike-0.005
Token the
Token ID262
Feature activation+2.10e+00

Pos logit contributions
 kind+0.014
 wise+0.014
 true+0.013
 seed+0.013
 pen+0.013
Neg logit contributions
round-0.029
Whe-0.027
rob-0.024
Mike-0.022
hop-0.022
Token band
Token ID4097
Feature activation+1.09e+00

Pos logit contributions
 true+0.053
 kind+0.050
 wise+0.048
 handy+0.048
olving+0.048
Neg logit contributions
round-0.073
Whe-0.068
rob-0.062
 sidelines-0.053
hop-0.052
Token,
Token ID11
Feature activation+1.09e+00

Pos logit contributions
 small+0.032
 big+0.032
 kind+0.032
 wise+0.032
 true+0.032
Neg logit contributions
round-0.035
Whe-0.031
rob-0.028
cream-0.025
hop-0.024
Token one
Token ID530
Feature activation+3.00e+00

Pos logit contributions
 true+0.025
 kind+0.024
 wise+0.024
 handy+0.023
 diary+0.023
Neg logit contributions
round-0.038
Whe-0.036
rob-0.032
cream-0.029
hop-0.027
Token by
Token ID416
Feature activation+7.46e-01

Pos logit contributions
 wise+0.061
 kind+0.058
 true+0.056
 handy+0.055
 solar+0.053
Neg logit contributions
round-0.114
Whe-0.101
hop-0.087
rob-0.083
cream-0.081
Token one
Token ID530
Feature activation+3.58e+00

Pos logit contributions
 big+0.018
 wise+0.018
 will+0.018
 kind+0.018
 handy+0.018
Neg logit contributions
round-0.025
Whe-0.025
rob-0.021
hop-0.020
Mike-0.019
Token.
Token ID13
Feature activation+4.15e-01

Pos logit contributions
 wise+0.072
 true+0.068
 solar+0.068
 kind+0.067
 valuable+0.067
Neg logit contributions
round-0.127
Whe-0.107
hop-0.099
 Clown-0.091
whe-0.087
Token She
Token ID1375
Feature activation+1.45e+00

Pos logit contributions
 big+0.009
 kind+0.009
 small+0.009
 pen+0.009
 true+0.008
Neg logit contributions
round-0.017
Whe-0.016
rob-0.014
cream-0.013
 Clown-0.012
Token made
Token ID925
Feature activation+1.36e+00

Pos logit contributions
 big+0.028
 small+0.027
 kind+0.027
 wise+0.026
 true+0.026
Neg logit contributions
round-0.061
Whe-0.059
rob-0.053
cream-0.049
Mike-0.046
Token a
Token ID257
Feature activation+2.34e+00

Pos logit contributions
 drawer+0.028
 seed+0.027
 file+0.027
 kind+0.027
 box+0.027
Neg logit contributions
Whe-0.051
round-0.051
rob-0.044
cream-0.042
Mike-0.040
Token pattern
Token ID3912
Feature activation+5.54e-01

Pos logit contributions
olving+0.040
 handy+0.035
 seed+0.033
 true+0.033
 wise+0.033
Neg logit contributions
round-0.090
Whe-0.090
rob-0.075
 sidelines-0.075
 coaster-0.070
Token They
Token ID1119
Feature activation+1.98e+00

Pos logit contributions
 big+0.034
 small+0.032
 pen+0.031
 kind+0.031
 drawer+0.031
Neg logit contributions
round-0.043
Whe-0.042
rob-0.034
cream-0.032
 Clown-0.031
Token may
Token ID743
Feature activation+1.91e+00

Pos logit contributions
 drawer+0.033
 small+0.032
 big+0.031
 kind+0.031
 pen+0.031
Neg logit contributions
round-0.077
Whe-0.074
cream-0.063
rob-0.063
icans-0.060
Token be
Token ID307
Feature activation+1.93e+00

Pos logit contributions
 drawer+0.042
 pen+0.041
 box+0.039
 kind+0.039
 file+0.038
Neg logit contributions
round-0.068
Whe-0.063
rob-0.055
 Clown-0.051
Ste-0.050
Token harder
Token ID7069
Feature activation+2.09e+00

Pos logit contributions
 drawer+0.041
 pen+0.039
 box+0.038
 diary+0.036
 file+0.036
Neg logit contributions
round-0.066
Whe-0.063
rob-0.057
cream-0.055
Ste-0.052
Token to
Token ID284
Feature activation+2.01e+00

Pos logit contributions
 drawer+0.061
 pen+0.057
 diary+0.057
 box+0.057
 kind+0.056
Neg logit contributions
round-0.056
Whe-0.046
rob-0.044
cream-0.039
icans-0.036
Token find
Token ID1064
Feature activation+3.57e+00

Pos logit contributions
 drawer+0.042
 pen+0.040
 box+0.040
 diary+0.039
 kind+0.038
Neg logit contributions
round-0.073
Whe-0.068
rob-0.060
 sidelines-0.055
icans-0.055
Token,
Token ID11
Feature activation-1.49e-01

Pos logit contributions
 drawer+0.095
 pen+0.088
olving+0.085
 handy+0.085
 file+0.083
Neg logit contributions
round-0.106
Whe-0.098
rob-0.090
cream-0.090
icans-0.079
Token but
Token ID475
Feature activation+7.38e-01

Pos logit contributions
round+0.005
Whe+0.004
rob+0.003
icans+0.003
 Clown+0.003
Neg logit contributions
 drawer-0.004
 pen-0.004
 big-0.004
 kind-0.004
 box-0.004
Token they
Token ID484
Feature activation+6.29e-01

Pos logit contributions
 drawer+0.017
 pen+0.017
 big+0.016
 kind+0.016
 file+0.016
Neg logit contributions
round-0.024
Whe-0.023
rob-0.020
 Clown-0.019
cream-0.019
Token are
Token ID389
Feature activation+8.21e-01

Pos logit contributions
 big+0.009
 small+0.009
 kind+0.009
 pen+0.009
 drawer+0.009
Neg logit contributions
round-0.029
Whe-0.028
rob-0.025
cream-0.023
icans-0.022
Token worth
Token ID2861
Feature activation+1.29e+00

Pos logit contributions
 pen+0.018
 drawer+0.017
 file+0.017
 box+0.016
 diary+0.016
Neg logit contributions
round-0.028
Whe-0.027
rob-0.023
cream-0.022
Ste-0.021
Token old
Token ID1468
Feature activation-1.55e-01

Pos logit contributions
 true+0.004
 drawer+0.004
 will+0.004
 pen+0.004
 small+0.004
Neg logit contributions
round-0.011
Whe-0.010
rob-0.009
 Clown-0.008
cream-0.008
Token owl
Token ID39610
Feature activation+1.81e+00

Pos logit contributions
round+0.006
rob+0.005
Whe+0.005
 Clown+0.004
cream+0.004
Neg logit contributions
 true-0.003
olving-0.003
 small-0.003
 big-0.003
 will-0.003
Token who
Token ID508
Feature activation+2.03e+00

Pos logit contributions
 big+0.044
 drawer+0.044
 small+0.043
 handy+0.042
 true+0.042
Neg logit contributions
round-0.068
Whe-0.059
rob-0.057
cream-0.052
hop-0.048
Token lived
Token ID5615
Feature activation+1.50e+00

Pos logit contributions
 drawer+0.056
 small+0.054
 big+0.054
 pen+0.053
 handy+0.053
Neg logit contributions
round-0.065
Whe-0.059
rob-0.056
cream-0.052
Rob-0.046
Token in
Token ID287
Feature activation+1.65e+00

Pos logit contributions
 pen+0.041
 drawer+0.040
 file+0.039
 pays+0.038
 tool+0.037
Neg logit contributions
round-0.043
Whe-0.041
rob-0.036
Ste-0.035
cream-0.032
Token a
Token ID257
Feature activation+3.55e+00

Pos logit contributions
 drawer+0.039
olving+0.037
 box+0.037
 pen+0.037
 will+0.036
Neg logit contributions
Whe-0.055
round-0.050
rob-0.046
cream-0.041
Rob-0.041
Token big
Token ID1263
Feature activation+2.92e+00

Pos logit contributions
olving+0.068
 pays+0.053
ac+0.049
 leads+0.047
IVE+0.047
Neg logit contributions
round-0.128
Whe-0.123
Rob-0.110
rob-0.110
Ste-0.109
Token tree
Token ID5509
Feature activation+1.95e+00

Pos logit contributions
olving+0.080
ft+0.070
ook+0.069
IVE+0.068
 password+0.067
Neg logit contributions
round-0.084
Whe-0.068
rob-0.066
 Clown-0.059
 sidelines-0.058
Token.
Token ID13
Feature activation+2.50e-01

Pos logit contributions
 true+0.037
 pen+0.037
 drawer+0.036
 file+0.036
 handy+0.036
Neg logit contributions
round-0.078
Whe-0.066
rob-0.063
hop-0.060
cream-0.060
Token The
Token ID383
Feature activation+1.99e+00

Pos logit contributions
 big+0.006
 pen+0.006
 kind+0.006
 small+0.005
 drawer+0.005
Neg logit contributions
round-0.010
Whe-0.009
rob-0.008
cream-0.007
 Clown-0.007
Token owl
Token ID39610
Feature activation+1.70e+00

Pos logit contributions
rick+0.050
ook+0.047
 drawer+0.046
 folder+0.045
yt+0.043
Neg logit contributions
round-0.055
Whe-0.055
rob-0.044
hop-0.042
cream-0.042
Token put
Token ID1234
Feature activation+1.66e+00

Pos logit contributions
 drawer+0.017
 big+0.016
 kind+0.016
 pen+0.016
 small+0.016
Neg logit contributions
round-0.029
Whe-0.028
rob-0.025
cream-0.023
Mike-0.021
Token some
Token ID617
Feature activation+1.52e+00

Pos logit contributions
 wise+0.039
 kind+0.038
 drawer+0.038
 diary+0.036
 pen+0.036
Neg logit contributions
round-0.059
Whe-0.058
rob-0.054
cream-0.049
Mike-0.045
Token feathers
Token ID27400
Feature activation+1.22e+00

Pos logit contributions
 drawer+0.024
 kind+0.024
 true+0.024
 wise+0.023
 handy+0.022
Neg logit contributions
round-0.065
Whe-0.061
rob-0.056
cream-0.054
Rob-0.048
Token on
Token ID319
Feature activation+1.01e+00

Pos logit contributions
 kind+0.025
 wise+0.025
 drawer+0.025
 true+0.024
 small+0.024
Neg logit contributions
round-0.050
Whe-0.044
rob-0.041
cream-0.040
icans-0.036
Token the
Token ID262
Feature activation+3.36e+00

Pos logit contributions
 kind+0.023
 wise+0.022
 diary+0.022
 drawer+0.022
 pen+0.021
Neg logit contributions
round-0.038
Whe-0.035
rob-0.032
cream-0.030
Rob-0.029
Token end
Token ID886
Feature activation+3.52e+00

Pos logit contributions
 kind+0.072
ook+0.072
olving+0.070
 handy+0.070
 wise+0.066
Neg logit contributions
round-0.121
Whe-0.108
rob-0.097
cream-0.091
 coaster-0.086
Token to
Token ID284
Feature activation+1.39e+00

Pos logit contributions
 kind+0.089
 wise+0.089
 diary+0.085
 pen+0.080
 true+0.080
Neg logit contributions
round-0.122
Whe-0.090
cream-0.089
icles-0.083
icle-0.082
Token make
Token ID787
Feature activation+2.29e+00

Pos logit contributions
 kind+0.026
 drawer+0.025
 small+0.024
 wise+0.024
 true+0.024
Neg logit contributions
round-0.056
Whe-0.055
rob-0.049
cream-0.045
hop-0.042
Token it
Token ID340
Feature activation+9.78e-01

Pos logit contributions
 drawer+0.051
 seed+0.048
 kind+0.047
 pen+0.047
 box+0.047
Neg logit contributions
Whe-0.080
round-0.080
cream-0.070
rob-0.070
Mike-0.063
Token look
Token ID804
Feature activation+1.74e+00

Pos logit contributions
 drawer+0.018
 pen+0.018
 file+0.017
 box+0.017
 kind+0.017
Neg logit contributions
round-0.037
Whe-0.036
cream-0.031
rob-0.030
whe-0.028
Token friendly
Token ID8030
Feature activation-2.01e-01

Pos logit contributions
 pen+0.036
 drawer+0.035
 shed+0.034
 file+0.034
 diary+0.034
Neg logit contributions
round-0.066
Whe-0.062
cream-0.054
rob-0.051
ding-0.049
Token!
Token ID0
Feature activation-9.85e-01

Pos logit contributions
round+0.115
Whe+0.110
rob+0.101
cream+0.092
Mike+0.088
Neg logit contributions
 big-0.043
 small-0.042
 kind-0.042
 pen-0.040
 wise-0.040
Token Amy
Token ID14235
Feature activation+1.14e-01

Pos logit contributions
round+0.038
Whe+0.036
rob+0.033
cream+0.029
Rob+0.028
Neg logit contributions
 big-0.023
 small-0.023
 kind-0.023
 pen-0.022
 drawer-0.022
Token had
Token ID550
Feature activation-9.04e-02

Pos logit contributions
 big+0.003
 small+0.002
 kind+0.002
 drawer+0.002
 true+0.002
Neg logit contributions
round-0.004
Whe-0.004
rob-0.004
cream-0.003
Rob-0.003
Token never
Token ID1239
Feature activation-2.51e-01

Pos logit contributions
round+0.003
Whe+0.003
cream+0.002
Mike+0.002
rob+0.002
Neg logit contributions
 handy-0.002
 drawer-0.002
 file-0.002
 pen-0.002
 diary-0.002
Token seen
Token ID1775
Feature activation-1.35e+00

Pos logit contributions
round+0.008
Whe+0.008
rob+0.007
cream+0.006
Mike+0.005
Neg logit contributions
 big-0.007
 small-0.007
 kind-0.007
 drawer-0.007
 pen-0.007
Token coffee
Token ID6891
Feature activation-4.50e+00

Pos logit contributions
round+0.055
Whe+0.053
rob+0.048
cream+0.043
Rob+0.041
Neg logit contributions
 big-0.029
 small-0.029
 kind-0.028
 pen-0.028
 drawer-0.027
Token before
Token ID878
Feature activation-2.20e-01

Pos logit contributions
round+0.142
Whe+0.133
rob+0.115
cream+0.101
 Clown+0.095
Neg logit contributions
 big-0.135
 small-0.134
 kind-0.133
 pen-0.131
 drawer-0.130
Token and
Token ID290
Feature activation-1.50e+00

Pos logit contributions
round+0.010
Whe+0.009
rob+0.008
cream+0.008
Mike+0.007
Neg logit contributions
 kind-0.004
 small-0.004
 big-0.004
 pen-0.003
 drawer-0.003
Token she
Token ID673
Feature activation-5.29e-02

Pos logit contributions
round+0.058
Whe+0.055
rob+0.048
cream+0.046
Mike+0.043
Neg logit contributions
 big-0.032
 small-0.032
 kind-0.032
 pen-0.031
 true-0.031
Token couldn
Token ID3521
Feature activation-1.73e+00

Pos logit contributions
round+0.002
Whe+0.002
rob+0.002
cream+0.002
Rob+0.002
Neg logit contributions
 big-0.001
 small-0.001
 kind-0.001
 pen-0.001
 drawer-0.001
Token't
Token ID470
Feature activation-1.31e+00

Pos logit contributions
round+0.057
Whe+0.054
rob+0.047
cream+0.041
Rob+0.039
Neg logit contributions
 big-0.050
 small-0.050
 kind-0.050
 pen-0.048
 drawer-0.048
Token because
Token ID780
Feature activation-6.61e-01

Pos logit contributions
round+0.015
Whe+0.015
rob+0.014
cream+0.012
Rob+0.012
Neg logit contributions
 big-0.006
 small-0.006
 kind-0.006
 pen-0.006
 wise-0.006
Token he
Token ID339
Feature activation-7.23e-02

Pos logit contributions
round+0.026
Whe+0.025
rob+0.022
cream+0.020
Mike+0.019
Neg logit contributions
 big-0.015
 small-0.015
 kind-0.014
 pen-0.014
 drawer-0.014
Token had
Token ID550
Feature activation-6.06e-01

Pos logit contributions
round+0.003
Whe+0.003
rob+0.003
cream+0.002
 Clown+0.002
Neg logit contributions
 big-0.001
 kind-0.001
 small-0.001
 wise-0.001
 pen-0.001
Token never
Token ID1239
Feature activation-5.89e-01

Pos logit contributions
round+0.024
Whe+0.023
rob+0.020
cream+0.019
Rob+0.018
Neg logit contributions
 drawer-0.012
 pen-0.012
 box-0.012
 small-0.012
 kind-0.012
Token seen
Token ID1775
Feature activation-1.62e+00

Pos logit contributions
round+0.020
Whe+0.019
rob+0.017
cream+0.015
Rob+0.014
Neg logit contributions
 big-0.017
 small-0.016
 kind-0.016
 pen-0.016
 drawer-0.016
Token chocolate
Token ID11311
Feature activation-4.45e+00

Pos logit contributions
round+0.074
Whe+0.070
rob+0.064
cream+0.056
Rob+0.056
Neg logit contributions
 big-0.030
 small-0.029
 kind-0.027
 pen-0.026
 drawer-0.026
Token in
Token ID287
Feature activation-1.14e+00

Pos logit contributions
round+0.159
Whe+0.158
rob+0.144
Rob+0.119
cream+0.118
Neg logit contributions
 big-0.118
 small-0.112
 kind-0.108
 drawer-0.107
 long-0.105
Token the
Token ID262
Feature activation+1.34e-02

Pos logit contributions
round+0.052
Whe+0.050
rob+0.045
cream+0.042
 Clown+0.040
Neg logit contributions
 big-0.018
 kind-0.018
 small-0.018
 pen-0.017
 drawer-0.017
Token wild
Token ID4295
Feature activation-1.37e+00

Pos logit contributions
 big+0.000
 small+0.000
 kind+0.000
 pen+0.000
 drawer+0.000
Neg logit contributions
round-0.001
Whe-0.001
rob-0.000
cream-0.000
Rob-0.000
Token before
Token ID878
Feature activation-4.23e-01

Pos logit contributions
round+0.057
Whe+0.055
rob+0.050
cream+0.045
 Clown+0.043
Neg logit contributions
 big-0.027
 small-0.027
 kind-0.027
 pen-0.026
 drawer-0.026
Token.
Token ID13
Feature activation-1.00e-01

Pos logit contributions
round+0.020
Whe+0.019
rob+0.018
cream+0.017
Mike+0.016
Neg logit contributions
 kind-0.005
 big-0.005
 small-0.005
 pen-0.005
 drawer-0.005
Token never
Token ID1239
Feature activation-4.87e-01

Pos logit contributions
 big+0.000
 small+0.000
 kind+0.000
 pen+0.000
 wise+0.000
Neg logit contributions
round-0.000
Whe-0.000
rob-0.000
cream-0.000
 Clown-0.000
Token been
Token ID587
Feature activation-8.67e-01

Pos logit contributions
round+0.017
Whe+0.016
rob+0.014
cream+0.013
seek+0.012
Neg logit contributions
 big-0.012
 small-0.012
 drawer-0.012
 box-0.012
 pen-0.012
Token back
Token ID736
Feature activation-1.89e+00

Pos logit contributions
round+0.036
Whe+0.035
rob+0.032
cream+0.028
 Clown+0.027
Neg logit contributions
 big-0.020
 small-0.020
 kind-0.020
 wise-0.018
 true-0.018
Token to
Token ID284
Feature activation-1.07e+00

Pos logit contributions
round+0.080
Whe+0.077
rob+0.071
cream+0.063
Rob+0.060
Neg logit contributions
 big-0.038
 small-0.037
 kind-0.037
 wise-0.035
 pen-0.035
Token the
Token ID262
Feature activation-1.27e+00

Pos logit contributions
round+0.045
Whe+0.043
rob+0.039
cream+0.035
Mike+0.034
Neg logit contributions
 big-0.021
 small-0.021
 kind-0.020
 pen-0.020
 drawer-0.020
Token coffee
Token ID6891
Feature activation-4.35e+00

Pos logit contributions
round+0.048
Whe+0.045
rob+0.041
cream+0.036
Rob+0.034
Neg logit contributions
 big-0.031
 small-0.031
 kind-0.031
 pen-0.030
 drawer-0.030
Token shop
Token ID6128
Feature activation-1.59e+00

Pos logit contributions
round+0.164
Whe+0.162
rob+0.147
cream+0.132
Rob+0.125
Neg logit contributions
 big-0.107
 drawer-0.105
 box-0.104
 kind-0.103
 small-0.102
Token again
Token ID757
Feature activation+6.31e-01

Pos logit contributions
round+0.054
Whe+0.048
rob+0.041
cream+0.038
Mike+0.037
Neg logit contributions
 big-0.042
 pen-0.041
 small-0.041
 kind-0.041
 seed-0.040
Token and
Token ID290
Feature activation-1.58e+00

Pos logit contributions
 big+0.010
 small+0.010
 kind+0.010
 pen+0.010
 drawer+0.010
Neg logit contributions
round-0.028
Whe-0.027
rob-0.024
cream-0.022
Mike-0.022
Token lost
Token ID2626
Feature activation-8.12e-01

Pos logit contributions
round+0.063
Whe+0.060
rob+0.054
cream+0.047
Rob+0.046
Neg logit contributions
 big-0.037
 kind-0.037
 small-0.036
 drawer-0.035
 wise-0.035
Token his
Token ID465
Feature activation-1.52e+00

Pos logit contributions
round+0.031
Whe+0.030
rob+0.027
cream+0.024
Mike+0.023
Neg logit contributions
 big-0.019
 small-0.018
 kind-0.018
 pen-0.018
 drawer-0.018
Token he
Token ID339
Feature activation+2.89e-01

Pos logit contributions
round+0.045
Whe+0.043
rob+0.040
cream+0.035
seek+0.034
Neg logit contributions
 big-0.027
 kind-0.026
 small-0.026
 wise-0.024
 pen-0.024
Token never
Token ID1239
Feature activation-8.12e-01

Pos logit contributions
 big+0.007
 small+0.006
 kind+0.006
 pen+0.006
 wise+0.006
Neg logit contributions
round-0.011
Whe-0.011
rob-0.010
cream-0.009
Rob-0.008
Token wanted
Token ID2227
Feature activation-8.72e-01

Pos logit contributions
round+0.031
Whe+0.030
rob+0.027
cream+0.024
Rob+0.023
Neg logit contributions
 big-0.019
 kind-0.019
 small-0.019
 pen-0.018
 wise-0.018
Token to
Token ID284
Feature activation-1.82e+00

Pos logit contributions
round+0.046
Whe+0.046
rob+0.043
cream+0.038
Rob+0.037
Neg logit contributions
 big-0.009
 small-0.008
 kind-0.008
 true-0.007
 wise-0.006
Token try
Token ID1949
Feature activation-1.07e+00

Pos logit contributions
round+0.080
Whe+0.079
rob+0.076
cream+0.063
robat+0.062
Neg logit contributions
 big-0.037
 kind-0.034
 small-0.033
 wise-0.031
 true-0.031
Token coffee
Token ID6891
Feature activation-4.35e+00

Pos logit contributions
round+0.049
Whe+0.048
rob+0.043
cream+0.039
Jacob+0.038
Neg logit contributions
 big-0.020
 small-0.019
 kind-0.018
 true-0.017
 pen-0.017
Token again
Token ID757
Feature activation+6.52e-01

Pos logit contributions
Whe+0.174
round+0.174
rob+0.160
cream+0.137
Rob+0.136
Neg logit contributions
 big-0.097
 kind-0.089
 small-0.088
 true-0.085
 drawer-0.083
Token!
Token ID0
Feature activation-5.22e-01

Pos logit contributions
 big+0.009
 small+0.009
 kind+0.008
 pen+0.008
 drawer+0.008
Neg logit contributions
round-0.031
Whe-0.030
rob-0.027
cream-0.025
Mike-0.024
Token He
Token ID679
Feature activation-6.76e-01

Pos logit contributions
round+0.021
Whe+0.020
rob+0.020
icans+0.017
cream+0.017
Neg logit contributions
 big-0.011
 small-0.010
 kind-0.010
 drawer-0.010
 wise-0.010
Token never
Token ID1239
Feature activation-1.13e+00

Pos logit contributions
round+0.029
Whe+0.028
rob+0.026
cream+0.022
icans+0.022
Neg logit contributions
 big-0.014
 kind-0.014
 small-0.014
 pen-0.013
 wise-0.013
Token forgot
Token ID16453
Feature activation-1.52e+00

Pos logit contributions
round+0.044
Whe+0.042
rob+0.039
cream+0.034
Rob+0.033
Neg logit contributions
 big-0.026
 kind-0.026
 small-0.026
 pen-0.025
 wise-0.025
Token eager
Token ID11069
Feature activation-7.83e-01

Pos logit contributions
round+0.026
Whe+0.025
rob+0.022
cream+0.020
Mike+0.020
Neg logit contributions
 pen-0.014
 drawer-0.013
 box-0.013
 big-0.013
 small-0.013
Token to
Token ID284
Feature activation-1.27e+00

Pos logit contributions
round+0.042
Whe+0.040
rob+0.037
cream+0.034
Rob+0.033
Neg logit contributions
 big-0.007
 small-0.007
 kind-0.007
 pen-0.006
 drawer-0.006
Token eat
Token ID4483
Feature activation-1.52e+00

Pos logit contributions
round+0.053
Whe+0.052
rob+0.049
cream+0.042
Rob+0.040
Neg logit contributions
 big-0.028
 kind-0.027
 small-0.026
 wise-0.025
 pen-0.025
Token the
Token ID262
Feature activation-2.87e+00

Pos logit contributions
round+0.070
Whe+0.067
rob+0.062
cream+0.055
Rob+0.054
Neg logit contributions
 big-0.028
 small-0.027
 kind-0.026
 drawer-0.024
 pen-0.023
Token y
Token ID331
Feature activation-2.38e+00

Pos logit contributions
round+0.117
Whe+0.107
rob+0.104
Rob+0.087
Mike+0.085
Neg logit contributions
 big-0.084
 small-0.078
 kind-0.076
 red-0.070
 green-0.069
Tokenummy
Token ID13513
Feature activation-4.20e+00

Pos logit contributions
round+0.108
Whe+0.107
rob+0.098
Rob+0.087
cream+0.086
Neg logit contributions
 big-0.041
 small-0.038
 kind-0.037
 drawer-0.036
 box-0.035
Token cookies
Token ID14746
Feature activation-1.75e+00

Pos logit contributions
round+0.158
Whe+0.151
rob+0.140
ican+0.123
Rob+0.123
Neg logit contributions
 big-0.129
 small-0.120
 kind-0.117
 red-0.114
 box-0.112
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
round+0.086
Whe+0.084
rob+0.077
cream+0.071
Rob+0.069
Neg logit contributions
 big-0.023
 small-0.022
 kind-0.021
 pen-0.020
 drawer-0.020
Token She
Token ID1375
Feature activation+2.15e-01

Pos logit contributions
round+0.058
Whe+0.056
rob+0.055
cream+0.048
robat+0.046
Neg logit contributions
 small-0.024
 big-0.024
 kind-0.023
 wise-0.022
 drawer-0.021
Token ate
Token ID15063
Feature activation-1.90e+00

Pos logit contributions
 big+0.005
 small+0.005
 kind+0.005
 pen+0.004
 drawer+0.004
Neg logit contributions
round-0.009
Whe-0.008
rob-0.008
cream-0.007
 Clown-0.006
Token one
Token ID530
Feature activation-1.75e+00

Pos logit contributions
round+0.089
Whe+0.085
rob+0.079
cream+0.071
Rob+0.069
Neg logit contributions
 big-0.035
 small-0.034
 kind-0.031
 long-0.028
 green-0.028
Token They
Token ID1119
Feature activation-5.16e-01

Pos logit contributions
round+0.004
Whe+0.003
rob+0.003
cream+0.003
 Clown+0.003
Neg logit contributions
 big-0.002
 kind-0.002
 small-0.002
 pen-0.002
 true-0.002
Token both
Token ID1111
Feature activation-1.41e+00

Pos logit contributions
round+0.020
Whe+0.019
rob+0.017
cream+0.015
Rob+0.014
Neg logit contributions
 big-0.012
 small-0.012
 kind-0.012
 pen-0.012
 drawer-0.012
Token enjoyed
Token ID8359
Feature activation-7.94e-01

Pos logit contributions
round+0.051
Whe+0.050
rob+0.045
icans+0.038
cream+0.037
Neg logit contributions
 big-0.038
 kind-0.037
 small-0.037
 pen-0.035
 drawer-0.035
Token the
Token ID262
Feature activation-3.22e+00

Pos logit contributions
round+0.035
Whe+0.034
rob+0.031
cream+0.028
Mike+0.027
Neg logit contributions
 big-0.014
 small-0.014
 kind-0.014
 pen-0.013
 drawer-0.013
Token y
Token ID331
Feature activation-2.21e+00

Pos logit contributions
round+0.149
Whe+0.135
rob+0.129
ican+0.116
icans+0.114
Neg logit contributions
 big-0.080
 kind-0.073
 small-0.071
 new-0.064
 box-0.063
Tokenummy
Token ID13513
Feature activation-4.18e+00

Pos logit contributions
round+0.099
Whe+0.097
rob+0.089
cream+0.079
Rob+0.078
Neg logit contributions
 big-0.039
 small-0.037
 kind-0.037
 drawer-0.035
 pen-0.034
Token dessert
Token ID23084
Feature activation-7.42e-01

Pos logit contributions
round+0.153
Whe+0.144
rob+0.134
ican+0.121
icans+0.116
Neg logit contributions
 big-0.133
 kind-0.127
 small-0.126
 box-0.117
 red-0.116
Token.
Token ID13
Feature activation-7.13e-01

Pos logit contributions
round+0.032
Whe+0.031
rob+0.028
cream+0.026
Rob+0.025
Neg logit contributions
 big-0.014
 small-0.013
 kind-0.013
 pen-0.013
 drawer-0.013
Token
Token ID198
Feature activation-1.33e+00

Pos logit contributions
round+0.034
rob+0.033
Whe+0.033
robat+0.028
icans+0.028
Neg logit contributions
 big-0.011
 small-0.011
 kind-0.010
 box-0.009
 can-0.009
Token
Token ID198
Feature activation-1.27e+00

Pos logit contributions
round+0.062
Whe+0.060
rob+0.055
cream+0.051
icans+0.049
Neg logit contributions
 big-0.024
 small-0.023
 kind-0.021
 wise-0.019
 long-0.018
TokenAfter
Token ID3260
Feature activation-1.61e+00

Pos logit contributions
round+0.051
Whe+0.048
rob+0.048
cream+0.042
 Clown+0.041
Neg logit contributions
 big-0.029
 kind-0.029
 small-0.028
 true-0.027
 wise-0.026
Token because
Token ID780
Feature activation-1.32e+00

Pos logit contributions
round+0.043
Whe+0.042
rob+0.038
cream+0.033
Mike+0.032
Neg logit contributions
 big-0.030
 small-0.030
 kind-0.029
 drawer-0.028
 wise-0.028
Token she
Token ID673
Feature activation-7.29e-01

Pos logit contributions
round+0.063
Whe+0.060
rob+0.056
cream+0.051
Rob+0.049
Neg logit contributions
 big-0.022
 small-0.021
 kind-0.021
 wise-0.019
 drawer-0.019
Token had
Token ID550
Feature activation-6.33e-01

Pos logit contributions
round+0.034
Whe+0.032
rob+0.030
cream+0.026
Mike+0.026
Neg logit contributions
 big-0.014
 kind-0.013
 small-0.013
 pen-0.013
 wise-0.012
Token never
Token ID1239
Feature activation-3.51e-01

Pos logit contributions
round+0.027
Whe+0.027
rob+0.024
cream+0.022
Rob+0.021
Neg logit contributions
 big-0.010
 small-0.010
 kind-0.010
 drawer-0.010
 pen-0.010
Token had
Token ID550
Feature activation-7.70e-01

Pos logit contributions
round+0.012
Whe+0.011
rob+0.010
cream+0.009
Rob+0.008
Neg logit contributions
 big-0.010
 small-0.010
 kind-0.010
 drawer-0.009
 pen-0.009
Token cocoa
Token ID35845
Feature activation-4.17e+00

Pos logit contributions
round+0.033
Whe+0.032
rob+0.029
cream+0.027
Rob+0.025
Neg logit contributions
 big-0.013
 small-0.013
 kind-0.013
 drawer-0.013
 pen-0.013
Token before
Token ID878
Feature activation-1.28e-01

Pos logit contributions
round+0.148
Whe+0.141
rob+0.126
cream+0.111
Rob+0.105
Neg logit contributions
 big-0.110
 small-0.108
 kind-0.107
 pen-0.105
 drawer-0.105
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
round+0.006
Whe+0.006
rob+0.005
cream+0.005
 Clown+0.005
Neg logit contributions
 kind-0.002
 big-0.002
 small-0.002
 pen-0.002
 drawer-0.002
Token She
Token ID1375
Feature activation+4.83e-02

Pos logit contributions
round+0.060
Whe+0.057
rob+0.052
cream+0.047
Rob+0.045
Neg logit contributions
 big-0.028
 small-0.028
 kind-0.027
 pen-0.026
 drawer-0.026
Token wanted
Token ID2227
Feature activation-6.92e-01

Pos logit contributions
 big+0.001
 kind+0.001
 small+0.001
 pen+0.001
 drawer+0.001
Neg logit contributions
round-0.002
Whe-0.002
rob-0.002
cream-0.002
Mike-0.002
Token to
Token ID284
Feature activation-6.08e-01

Pos logit contributions
round+0.041
Whe+0.039
rob+0.038
cream+0.033
Mike+0.033
Neg logit contributions
 big-0.005
 small-0.004
 kind-0.004
 pen-0.003
 wise-0.003
Token.
Token ID13
Feature activation+3.19e-01

Pos logit contributions
 big+0.006
 small+0.006
 kind+0.006
 wise+0.005
 pen+0.005
Neg logit contributions
round-0.023
Whe-0.022
rob-0.020
cream-0.018
 Clown-0.018
Token Sarah
Token ID10490
Feature activation-7.45e-01

Pos logit contributions
 big+0.008
 small+0.007
 kind+0.007
 pen+0.007
 drawer+0.007
Neg logit contributions
round-0.012
Whe-0.012
rob-0.011
cream-0.009
Rob-0.009
Token had
Token ID550
Feature activation-1.16e+00

Pos logit contributions
round+0.032
Whe+0.032
rob+0.029
cream+0.025
 Clown+0.024
Neg logit contributions
 big-0.014
 small-0.014
 kind-0.014
 pen-0.013
 drawer-0.013
Token never
Token ID1239
Feature activation-1.13e+00

Pos logit contributions
round+0.046
Whe+0.044
rob+0.040
cream+0.036
Rob+0.034
Neg logit contributions
 big-0.026
 small-0.025
 kind-0.025
 pen-0.024
 drawer-0.024
Token tried
Token ID3088
Feature activation-8.06e-01

Pos logit contributions
round+0.044
Whe+0.041
rob+0.038
cream+0.033
Rob+0.032
Neg logit contributions
 kind-0.029
 big-0.029
 small-0.028
 wise-0.027
 pen-0.027
Token coffee
Token ID6891
Feature activation-4.15e+00

Pos logit contributions
round+0.042
Whe+0.040
rob+0.038
cream+0.035
Rob+0.033
Neg logit contributions
 big-0.009
 small-0.009
 kind-0.008
 pen-0.008
 true-0.007
Token before
Token ID878
Feature activation-3.03e-01

Pos logit contributions
round+0.157
Whe+0.152
rob+0.139
cream+0.122
Rob+0.117
Neg logit contributions
 big-0.102
 small-0.098
 kind-0.097
 drawer-0.094
 pen-0.092
Token and
Token ID290
Feature activation-1.30e+00

Pos logit contributions
round+0.015
Whe+0.014
rob+0.013
cream+0.012
Rob+0.012
Neg logit contributions
 big-0.004
 small-0.004
 kind-0.004
 pen-0.004
 drawer-0.004
Token she
Token ID673
Feature activation-3.12e-01

Pos logit contributions
round+0.052
Whe+0.050
rob+0.045
cream+0.041
Mike+0.038
Neg logit contributions
 big-0.028
 small-0.028
 kind-0.028
 pen-0.027
 drawer-0.027
Token was
Token ID373
Feature activation-1.43e+00

Pos logit contributions
round+0.013
Whe+0.013
rob+0.012
cream+0.010
icans+0.010
Neg logit contributions
 big-0.006
 kind-0.006
 small-0.006
 pen-0.006
 wise-0.006
Token very
Token ID845
Feature activation-6.48e-01

Pos logit contributions
round+0.059
Whe+0.056
rob+0.053
cream+0.046
icans+0.044
Neg logit contributions
 big-0.033
 kind-0.033
 small-0.033
 wise-0.031
 true-0.030
Token She
Token ID1375
Feature activation-2.87e-01

Pos logit contributions
round+0.030
Whe+0.029
rob+0.028
cream+0.024
robat+0.023
Neg logit contributions
 small-0.016
 big-0.016
 kind-0.015
 wise-0.015
 drawer-0.015
Token never
Token ID1239
Feature activation-4.76e-01

Pos logit contributions
round+0.013
Whe+0.012
rob+0.011
cream+0.010
icans+0.010
Neg logit contributions
 big-0.006
 small-0.005
 kind-0.005
 pen-0.005
 wise-0.005
Token got
Token ID1392
Feature activation-2.67e-01

Pos logit contributions
round+0.018
Whe+0.017
rob+0.015
cream+0.014
Rob+0.013
Neg logit contributions
 big-0.011
 small-0.011
 drawer-0.011
 kind-0.011
 pen-0.011
Token her
Token ID607
Feature activation-2.14e+00

Pos logit contributions
round+0.013
Whe+0.013
rob+0.012
cream+0.011
 Clown+0.011
Neg logit contributions
 big-0.004
 small-0.004
 kind-0.003
 wise-0.003
 true-0.003
Token delicious
Token ID12625
Feature activation-3.36e+00

Pos logit contributions
round+0.077
Whe+0.072
rob+0.069
Rob+0.057
Mike+0.057
Neg logit contributions
 big-0.066
 small-0.062
 kind-0.061
 wise-0.058
 red-0.057
Token chocolate
Token ID11311
Feature activation-4.15e+00

Pos logit contributions
round+0.120
Whe+0.114
rob+0.109
Rob+0.092
Nice+0.088
Neg logit contributions
 big-0.097
 kind-0.094
 small-0.091
 red-0.088
 box-0.086
Token.
Token ID13
Feature activation-8.75e-01

Pos logit contributions
Whe+0.160
round+0.158
rob+0.148
Rob+0.130
icans+0.129
Neg logit contributions
 big-0.100
 box-0.092
 small-0.088
 kind-0.088
 drawer-0.087
Token<|endoftext|>
Token ID50256
Feature activation-8.71e-01

Pos logit contributions
round+0.041
Whe+0.038
rob+0.037
cream+0.032
icans+0.032
Neg logit contributions
 big-0.015
 small-0.015
 kind-0.014
 wise-0.014
 drawer-0.014
TokenOnce
Token ID7454
Feature activation+2.84e-01

Pos logit contributions
round+0.032
Whe+0.030
rob+0.027
cream+0.024
Mike+0.022
Neg logit contributions
 big-0.023
 small-0.022
 kind-0.022
 pen-0.021
 drawer-0.021
Token there
Token ID612
Feature activation-6.59e-01

Pos logit contributions
 pen+0.006
 kind+0.006
 wise+0.006
 true+0.006
 drawer+0.006
Neg logit contributions
round-0.011
Whe-0.010
rob-0.009
Mike-0.008
Jacob-0.008
Token was
Token ID373
Feature activation+1.77e-01

Pos logit contributions
round+0.033
Whe+0.031
rob+0.029
cream+0.027
 sidelines+0.026
Neg logit contributions
 big-0.010
 small-0.009
 kind-0.009
 box-0.008
 can-0.008
Token
Token ID220
Feature activation-6.94e-01

Pos logit contributions
round+0.014
Whe+0.013
rob+0.012
cream+0.010
Rob+0.010
Neg logit contributions
 big-0.009
 small-0.009
 kind-0.009
 pen-0.009
 drawer-0.009
Token
Token ID198
Feature activation-7.69e-01

Pos logit contributions
round+0.032
Whe+0.030
rob+0.028
cream+0.025
 Clown+0.025
Neg logit contributions
 big-0.012
 small-0.012
 kind-0.011
 wise-0.010
 true-0.010
Token
Token ID198
Feature activation-1.93e-01

Pos logit contributions
round+0.034
Whe+0.033
rob+0.030
cream+0.028
icans+0.027
Neg logit contributions
 big-0.014
 small-0.014
 kind-0.013
 wise-0.012
 pen-0.012
TokenWhen
Token ID2215
Feature activation+8.32e-01

Pos logit contributions
round+0.006
Whe+0.006
rob+0.005
cream+0.004
Rob+0.004
Neg logit contributions
 big-0.006
 small-0.006
 kind-0.006
 drawer-0.006
 pen-0.006
Token Mitt
Token ID16627
Feature activation-1.58e+00

Pos logit contributions
 kind+0.021
 drawer+0.020
 true+0.020
 wise+0.020
 pen+0.020
Neg logit contributions
round-0.027
Whe-0.027
rob-0.024
cream-0.022
Mike-0.020
Tokenens
Token ID641
Feature activation-4.08e+00

Pos logit contributions
round+0.055
Whe+0.051
rob+0.045
cream+0.041
Mike+0.039
Neg logit contributions
 big-0.043
 pen-0.042
 kind-0.042
 small-0.042
 wise-0.040
Token woke
Token ID19092
Feature activation-2.00e+00

Pos logit contributions
round+0.182
Whe+0.176
rob+0.161
cream+0.146
icans+0.138
Neg logit contributions
 big-0.079
 small-0.074
 kind-0.072
 box-0.071
 pen-0.069
Token up
Token ID510
Feature activation-1.50e+00

Pos logit contributions
round+0.093
Whe+0.090
rob+0.082
cream+0.074
 Clown+0.073
Neg logit contributions
 big-0.033
 small-0.032
 kind-0.030
 wise-0.029
 true-0.028
Token,
Token ID11
Feature activation+3.53e-01

Pos logit contributions
round+0.077
Whe+0.074
rob+0.068
cream+0.063
Rob+0.061
Neg logit contributions
 big-0.017
 small-0.016
 kind-0.016
 pen-0.015
 drawer-0.015
Token he
Token ID339
Feature activation-5.58e-01

Pos logit contributions
 kind+0.008
 drawer+0.008
 pen+0.008
 true+0.008
 big+0.008
Neg logit contributions
Whe-0.012
round-0.012
rob-0.011
cream-0.010
Mike-0.009
Token saw
Token ID2497
Feature activation-3.20e-01

Pos logit contributions
round+0.023
Whe+0.022
rob+0.020
cream+0.018
Mike+0.017
Neg logit contributions
 big-0.012
 small-0.012
 kind-0.012
 pen-0.011
 drawer-0.011
Token,
Token ID11
Feature activation-2.01e+00

Pos logit contributions
 drawer+0.021
 small+0.020
 true+0.019
 big+0.019
 handy+0.019
Neg logit contributions
round-0.043
Whe-0.040
rob-0.036
 Clown-0.034
cream-0.033
Token can
Token ID460
Feature activation+1.16e+00

Pos logit contributions
round+0.086
Whe+0.084
cream+0.072
rob+0.070
 Clown+0.068
Neg logit contributions
 small-0.031
 pen-0.030
 wise-0.030
 handy-0.030
 big-0.029
Token I
Token ID314
Feature activation-6.11e-01

Pos logit contributions
 big+0.023
 small+0.022
 kind+0.022
 pen+0.021
 wise+0.021
Neg logit contributions
round-0.050
Whe-0.048
rob-0.044
cream-0.039
Mike-0.037
Token eat
Token ID4483
Feature activation-1.91e+00

Pos logit contributions
round+0.027
Whe+0.026
rob+0.023
cream+0.021
Rob+0.020
Neg logit contributions
 big-0.011
 small-0.011
 kind-0.011
 pen-0.010
 drawer-0.010
Token the
Token ID262
Feature activation-2.83e+00

Pos logit contributions
round+0.087
Whe+0.085
rob+0.078
seek+0.071
|+0.068
Neg logit contributions
 big-0.038
 small-0.035
 kind-0.032
 red-0.031
 green-0.030
Token chocolate
Token ID11311
Feature activation-4.06e+00

Pos logit contributions
round+0.110
Whe+0.100
rob+0.094
seek+0.086
icans+0.084
Neg logit contributions
 big-0.083
 small-0.076
 kind-0.072
 red-0.072
 green-0.070
Token cake
Token ID12187
Feature activation-2.03e+00

Pos logit contributions
Whe+0.176
round+0.166
rob+0.152
Nice+0.147
icans+0.146
Neg logit contributions
 big-0.083
 box-0.076
 red-0.071
 small-0.069
 drawer-0.068
Token now
Token ID783
Feature activation-1.08e-02

Pos logit contributions
round+0.089
Whe+0.088
rob+0.079
cream+0.071
Rob+0.071
Neg logit contributions
 big-0.039
 small-0.036
 kind-0.035
 box-0.034
 drawer-0.034
Token?"
Token ID1701
Feature activation+2.10e-01

Pos logit contributions
round+0.000
Whe+0.000
rob+0.000
cream+0.000
Rob+0.000
Neg logit contributions
 small-0.000
 pen-0.000
 kind-0.000
 big-0.000
 drawer-0.000
Token
Token ID198
Feature activation-6.74e-01

Pos logit contributions
 big+0.004
 small+0.004
 kind+0.004
 pen+0.004
 wise+0.004
Neg logit contributions
round-0.009
Whe-0.009
rob-0.008
cream-0.007
icans-0.007
Token
Token ID198
Feature activation-5.10e-01

Pos logit contributions
round+0.026
Whe+0.025
rob+0.023
cream+0.020
Rob+0.020
Neg logit contributions
 big-0.015
 small-0.015
 kind-0.015
 pen-0.015
 drawer-0.014
Token had
Token ID550
Feature activation-7.79e-01

Pos logit contributions
round+0.003
Whe+0.003
rob+0.003
cream+0.003
icans+0.002
Neg logit contributions
 big-0.002
 small-0.001
 kind-0.001
 wise-0.001
 pen-0.001
Token never
Token ID1239
Feature activation-6.67e-01

Pos logit contributions
round+0.027
Whe+0.027
rob+0.023
Rob+0.022
Mike+0.022
Neg logit contributions
 handy-0.017
 drawer-0.017
 file-0.017
 pen-0.017
 seed-0.017
Token seen
Token ID1775
Feature activation-1.96e+00

Pos logit contributions
round+0.022
Whe+0.021
rob+0.019
cream+0.016
Rob+0.015
Neg logit contributions
 big-0.019
 small-0.019
 kind-0.018
 pen-0.018
 drawer-0.018
Token a
Token ID257
Feature activation-1.85e+00

Pos logit contributions
round+0.088
Whe+0.083
rob+0.076
cream+0.068
Rob+0.067
Neg logit contributions
 big-0.038
 small-0.037
 kind-0.035
 pen-0.034
 drawer-0.034
Token w
Token ID266
Feature activation-2.80e+00

Pos logit contributions
round+0.073
Whe+0.070
rob+0.062
cream+0.056
Rob+0.053
Neg logit contributions
 big-0.042
 small-0.042
 kind-0.041
 pen-0.040
 drawer-0.040
Tokenaffle
Token ID30697
Feature activation-4.05e+00

Pos logit contributions
round+0.059
Whe+0.057
rob+0.046
cream+0.036
Rob+0.035
Neg logit contributions
 big-0.116
 kind-0.113
 small-0.112
 drawer-0.111
 wise-0.110
Token before
Token ID878
Feature activation-2.64e-01

Pos logit contributions
round+0.137
Whe+0.131
rob+0.116
cream+0.101
Rob+0.095
Neg logit contributions
 big-0.113
 small-0.111
 kind-0.110
 pen-0.108
 drawer-0.107
Token,
Token ID11
Feature activation-1.17e+00

Pos logit contributions
round+0.012
Whe+0.011
rob+0.010
cream+0.009
 Clown+0.009
Neg logit contributions
 kind-0.004
 small-0.004
 big-0.004
 pen-0.004
 drawer-0.004
Token and
Token ID290
Feature activation-9.00e-01

Pos logit contributions
round+0.039
Whe+0.037
rob+0.032
cream+0.028
 Clown+0.027
Neg logit contributions
 big-0.032
 kind-0.032
 small-0.032
 pen-0.032
 drawer-0.031
Token she
Token ID673
Feature activation+2.85e-02

Pos logit contributions
round+0.035
Whe+0.034
rob+0.029
cream+0.028
Mike+0.026
Neg logit contributions
 big-0.019
 kind-0.019
 small-0.018
 pen-0.018
 true-0.018
Token knew
Token ID2993
Feature activation-2.12e-02

Pos logit contributions
 big+0.001
 kind+0.001
 small+0.001
 wise+0.001
 pen+0.001
Neg logit contributions
round-0.001
Whe-0.001
rob-0.001
cream-0.001
icans-0.001
Token replied
Token ID8712
Feature activation+5.66e-01

Pos logit contributions
round+0.016
Whe+0.015
rob+0.014
cream+0.012
Mike+0.011
Neg logit contributions
 big-0.011
 small-0.011
 drawer-0.010
 pen-0.010
 kind-0.010
Token.
Token ID13
Feature activation-2.55e-01

Pos logit contributions
 pen+0.008
 big+0.008
 drawer+0.008
 small+0.008
 box+0.008
Neg logit contributions
round-0.026
Whe-0.024
rob-0.022
cream-0.021
Mike-0.020
Token
Token ID198
Feature activation-6.93e-01

Pos logit contributions
round+0.010
Whe+0.010
rob+0.009
cream+0.008
Rob+0.008
Neg logit contributions
 big-0.005
 kind-0.005
 small-0.005
 pen-0.005
 drawer-0.005
Token
Token ID198
Feature activation-5.39e-02

Pos logit contributions
round+0.030
Whe+0.029
rob+0.026
cream+0.024
 Clown+0.023
Neg logit contributions
 big-0.014
 small-0.013
 kind-0.013
 wise-0.012
 pen-0.012
TokenAs
Token ID1722
Feature activation+1.49e+00

Pos logit contributions
round+0.002
Whe+0.002
rob+0.002
cream+0.001
Rob+0.001
Neg logit contributions
 big-0.001
 small-0.001
 kind-0.001
 pen-0.001
 drawer-0.001
Token they
Token ID484
Feature activation-3.97e+00

Pos logit contributions
 pen+0.039
 kind+0.039
 drawer+0.038
 wise+0.038
 file+0.038
Neg logit contributions
round-0.048
Whe-0.046
rob-0.038
Mike-0.035
Rob-0.034
Token walked
Token ID6807
Feature activation-1.25e+00

Pos logit contributions
round+0.173
Whe+0.164
rob+0.162
icans+0.142
 Clown+0.138
Neg logit contributions
 big-0.082
 small-0.079
 kind-0.075
 box-0.073
 can-0.071
Token upstairs
Token ID26148
Feature activation-3.61e-01

Pos logit contributions
round+0.056
Whe+0.055
rob+0.051
cream+0.046
 Clown+0.045
Neg logit contributions
 big-0.024
 small-0.023
 kind-0.021
 box-0.021
 wise-0.020
Token,
Token ID11
Feature activation+4.76e-01

Pos logit contributions
round+0.017
Whe+0.016
rob+0.015
cream+0.014
Mike+0.013
Neg logit contributions
 kind-0.005
 true-0.005
 pen-0.005
 big-0.005
 small-0.005
Token Lily
Token ID20037
Feature activation-9.29e-01

Pos logit contributions
 big+0.008
 small+0.008
 long+0.007
 kind+0.007
 drawer+0.007
Neg logit contributions
round-0.024
Whe-0.022
rob-0.021
cream-0.019
robat-0.018
Token saw
Token ID2497
Feature activation-7.23e-01

Pos logit contributions
round+0.038
Whe+0.037
rob+0.033
cream+0.030
Rob+0.029
Neg logit contributions
 big-0.018
 kind-0.018
 small-0.018
 true-0.018
 drawer-0.017
Token not
Token ID407
Feature activation+2.35e-01

Pos logit contributions
round+0.021
Whe+0.020
rob+0.018
cream+0.017
Rob+0.016
Neg logit contributions
 pen-0.010
 drawer-0.010
 kind-0.010
 small-0.010
 big-0.010
Token yours
Token ID12431
Feature activation-2.72e-01

Pos logit contributions
 big+0.005
 small+0.005
 kind+0.005
 pen+0.005
 drawer+0.005
Neg logit contributions
round-0.010
Whe-0.009
rob-0.008
cream-0.007
Rob-0.007
Token."
Token ID526
Feature activation-2.73e+00

Pos logit contributions
round+0.013
Whe+0.012
rob+0.011
cream+0.010
 Clown+0.010
Neg logit contributions
 big-0.004
 small-0.004
 kind-0.004
 pen-0.004
 drawer-0.004
Token
Token ID198
Feature activation-3.20e+00

Pos logit contributions
rob+0.117
round+0.116
Whe+0.106
robat+0.100
cream+0.093
Neg logit contributions
 small-0.058
 big-0.055
 kind-0.052
 "-0.049
 to-0.048
Token
Token ID198
Feature activation-3.15e+00

Pos logit contributions
round+0.176
rob+0.168
Whe+0.164
cream+0.149
icans+0.149
Neg logit contributions
 big-0.038
 small-0.037
 kind-0.031
"-0.025
 wise-0.024
TokenMe
Token ID5308
Feature activation-3.94e+00

Pos logit contributions
rob+0.140
round+0.137
Whe+0.123
icans+0.116
cream+0.115
Neg logit contributions
 kind-0.068
 small-0.067
"-0.064
 big-0.064
 wise-0.057
Tokengan
Token ID1030
Feature activation-8.35e-01

Pos logit contributions
rob+0.151
round+0.142
Whe+0.129
robat+0.118
 sidelines+0.111
Neg logit contributions
 kind-0.122
 small-0.120
 big-0.119
 will-0.109
 hard-0.107
Token tried
Token ID3088
Feature activation+1.23e-01

Pos logit contributions
round+0.036
Whe+0.035
rob+0.032
cream+0.028
icans+0.027
Neg logit contributions
 kind-0.017
 big-0.017
 small-0.017
 wise-0.016
 pen-0.016
Token to
Token ID284
Feature activation-4.47e-01

Pos logit contributions
 big+0.002
 small+0.002
 kind+0.002
 pen+0.002
 drawer+0.002
Neg logit contributions
round-0.006
Whe-0.006
rob-0.005
cream-0.005
Mike-0.005
Token resist
Token ID4180
Feature activation+2.18e-01

Pos logit contributions
round+0.020
Whe+0.019
rob+0.018
cream+0.016
 Clown+0.015
Neg logit contributions
 big-0.009
 kind-0.008
 small-0.008
 wise-0.008
 pen-0.008
Token her
Token ID607
Feature activation-3.05e-02

Pos logit contributions
 big+0.004
 kind+0.004
 small+0.004
 pen+0.004
 drawer+0.004
Neg logit contributions
round-0.009
Whe-0.009
rob-0.008
cream-0.007
Mike-0.007
Token comfortable
Token ID6792
Feature activation-2.35e+00

Pos logit contributions
round+0.055
Whe+0.053
rob+0.047
cream+0.044
Rob+0.042
Neg logit contributions
 big-0.020
 small-0.020
 pen-0.020
 kind-0.020
 drawer-0.019
Token and
Token ID290
Feature activation-3.14e-01

Pos logit contributions
round+0.109
Whe+0.106
rob+0.096
cream+0.088
Rob+0.085
Neg logit contributions
 big-0.038
 small-0.037
 kind-0.036
 box-0.034
 drawer-0.034
Token they
Token ID484
Feature activation-5.34e-01

Pos logit contributions
round+0.013
Whe+0.013
rob+0.011
cream+0.010
Mike+0.010
Neg logit contributions
 pen-0.006
 drawer-0.006
 file-0.006
 box-0.006
 handy-0.006
Token enjoyed
Token ID8359
Feature activation-8.87e-01

Pos logit contributions
round+0.021
Whe+0.020
rob+0.019
icans+0.016
cream+0.016
Neg logit contributions
 big-0.013
 kind-0.012
 small-0.012
 pen-0.011
 box-0.011
Token the
Token ID262
Feature activation-3.14e+00

Pos logit contributions
round+0.037
Whe+0.035
rob+0.032
cream+0.029
Rob+0.028
Neg logit contributions
 big-0.018
 small-0.018
 kind-0.018
 pen-0.017
 wise-0.017
Token sunny
Token ID27737
Feature activation-3.93e+00

Pos logit contributions
round+0.138
rob+0.123
Whe+0.122
cream+0.116
icans+0.112
Neg logit contributions
 big-0.073
 kind-0.065
 small-0.064
 green-0.057
 red-0.057
Token day
Token ID1110
Feature activation-5.12e-01

Pos logit contributions
round+0.191
Whe+0.187
cream+0.175
rob+0.175
Rob+0.168
Neg logit contributions
 big-0.053
 green-0.043
 red-0.043
 small-0.041
 kind-0.040
Token.
Token ID13
Feature activation-2.44e-01

Pos logit contributions
round+0.024
Whe+0.023
rob+0.020
cream+0.019
Mike+0.018
Neg logit contributions
 big-0.008
 kind-0.008
 pen-0.008
 small-0.008
 drawer-0.008
Token Suddenly
Token ID24975
Feature activation+3.69e-01

Pos logit contributions
round+0.011
Whe+0.010
rob+0.010
cream+0.009
Mike+0.008
Neg logit contributions
 big-0.005
 small-0.005
 kind-0.004
 drawer-0.004
 pen-0.004
Token,
Token ID11
Feature activation+5.45e-01

Pos logit contributions
 drawer+0.006
 pen+0.006
 kind+0.006
 wise+0.006
 handy+0.006
Neg logit contributions
round-0.015
Whe-0.015
rob-0.013
Mike-0.012
Rob-0.012
Token Lily
Token ID20037
Feature activation-1.07e+00

Pos logit contributions
 big+0.010
 small+0.010
 kind+0.009
 pen+0.009
 drawer+0.009
Neg logit contributions
round-0.025
Whe-0.024
rob-0.022
cream-0.020
Rob-0.019
Token "
Token ID366
Feature activation-2.38e+00

Pos logit contributions
round+0.028
Whe+0.028
rob+0.025
cream+0.023
 Clown+0.023
Neg logit contributions
 big-0.006
 small-0.006
 pen-0.006
 kind-0.006
 drawer-0.006
TokenWhy
Token ID5195
Feature activation-5.02e-01

Pos logit contributions
round+0.084
Whe+0.075
rob+0.070
cream+0.065
ored+0.058
Neg logit contributions
 big-0.065
 kind-0.065
 small-0.063
 box-0.063
 pen-0.061
Token are
Token ID389
Feature activation-1.36e+00

Pos logit contributions
round+0.017
Whe+0.017
rob+0.015
 Clown+0.014
cream+0.013
Neg logit contributions
 small-0.012
 drawer-0.012
 handy-0.012
 pen-0.011
 big-0.011
Token you
Token ID345
Feature activation-1.16e+00

Pos logit contributions
round+0.056
Whe+0.054
rob+0.049
cream+0.044
Rob+0.042
Neg logit contributions
 big-0.029
 small-0.028
 kind-0.028
 pen-0.026
 drawer-0.026
Token sad
Token ID6507
Feature activation-2.20e+00

Pos logit contributions
round+0.053
Whe+0.051
rob+0.047
cream+0.043
Rob+0.041
Neg logit contributions
 big-0.021
 small-0.021
 kind-0.021
 wise-0.019
 pen-0.019
Token,
Token ID11
Feature activation-3.93e+00

Pos logit contributions
round+0.107
Whe+0.103
rob+0.096
cream+0.088
Rob+0.084
Neg logit contributions
 big-0.032
 small-0.031
 kind-0.030
 wise-0.028
 pen-0.028
Token Ann
Token ID5506
Feature activation-9.22e-01

Pos logit contributions
round+0.138
rob+0.132
Whe+0.132
cream+0.112
seek+0.104
Neg logit contributions
 big-0.112
 kind-0.107
 small-0.107
 wise-0.101
 box-0.098
Token?"
Token ID1701
Feature activation-2.55e+00

Pos logit contributions
round+0.031
Whe+0.027
rob+0.025
cream+0.022
 Clown+0.022
Neg logit contributions
 pen-0.025
 small-0.024
 drawer-0.024
 big-0.024
 loose-0.024
Token
Token ID198
Feature activation-2.43e+00

Pos logit contributions
round+0.129
Whe+0.118
rob+0.118
 sidelines+0.110
cream+0.110
Neg logit contributions
 big-0.032
 small-0.030
 kind-0.029
 wise-0.024
 long-0.023
Token
Token ID198
Feature activation-2.79e+00

Pos logit contributions
round+0.108
Whe+0.104
rob+0.095
cream+0.088
icans+0.083
Neg logit contributions
 big-0.046
 small-0.044
 kind-0.043
 wise-0.040
 true-0.039
TokenAnn
Token ID18858
Feature activation-9.57e-01

Pos logit contributions
round+0.076
Whe+0.070
rob+0.064
cream+0.055
icans+0.049
Neg logit contributions
 big-0.099
 small-0.097
 kind-0.097
 wise-0.093
 true-0.091
Token He
Token ID679
Feature activation-1.25e-02

Pos logit contributions
round+0.005
Whe+0.005
rob+0.004
cream+0.004
Rob+0.004
Neg logit contributions
 big-0.004
 small-0.004
 kind-0.004
 pen-0.004
 drawer-0.003
Token had
Token ID550
Feature activation-1.02e+00

Pos logit contributions
round+0.000
Whe+0.000
rob+0.000
cream+0.000
Mike+0.000
Neg logit contributions
 big-0.000
 small-0.000
 kind-0.000
 pen-0.000
 wise-0.000
Token never
Token ID1239
Feature activation-5.29e-01

Pos logit contributions
round+0.039
Whe+0.037
rob+0.033
cream+0.030
Rob+0.028
Neg logit contributions
 big-0.024
 small-0.023
 kind-0.023
 pen-0.023
 drawer-0.023
Token seen
Token ID1775
Feature activation-1.09e+00

Pos logit contributions
round+0.021
Whe+0.020
rob+0.018
cream+0.016
Rob+0.016
Neg logit contributions
 big-0.013
 kind-0.013
 small-0.013
 wise-0.012
 pen-0.012
Token a
Token ID257
Feature activation+5.11e-02

Pos logit contributions
round+0.047
Whe+0.045
rob+0.041
cream+0.037
Rob+0.035
Neg logit contributions
 big-0.022
 small-0.021
 kind-0.021
 pen-0.020
 wise-0.020
Token lime
Token ID28738
Feature activation-3.92e+00

Pos logit contributions
 drawer+0.001
 handy+0.001
olving+0.001
 box+0.001
 diary+0.001
Neg logit contributions
round-0.002
Whe-0.002
rob-0.001
 Clown-0.001
Ste-0.001
Token before
Token ID878
Feature activation-4.37e-01

Pos logit contributions
round+0.129
Whe+0.124
rob+0.110
cream+0.096
Rob+0.090
Neg logit contributions
 big-0.114
 small-0.112
 kind-0.111
 pen-0.108
 drawer-0.108
Token and
Token ID290
Feature activation-1.40e+00

Pos logit contributions
round+0.021
Whe+0.020
rob+0.018
cream+0.017
Rob+0.016
Neg logit contributions
 big-0.006
 small-0.006
 kind-0.006
 pen-0.006
 drawer-0.006
Token it
Token ID340
Feature activation-3.88e-01

Pos logit contributions
round+0.055
Whe+0.053
rob+0.048
cream+0.043
Rob+0.041
Neg logit contributions
 big-0.031
 small-0.031
 kind-0.031
 pen-0.030
 drawer-0.030
Token felt
Token ID2936
Feature activation-3.31e-02

Pos logit contributions
round+0.017
Whe+0.016
rob+0.015
cream+0.013
Rob+0.013
Neg logit contributions
 big-0.008
 small-0.007
 kind-0.007
 pen-0.007
 wise-0.007
Token very
Token ID845
Feature activation-3.59e-01

Pos logit contributions
round+0.001
Whe+0.001
rob+0.001
cream+0.001
 Clown+0.001
Neg logit contributions
 pen-0.001
 drawer-0.001
 box-0.001
 big-0.001
 small-0.001
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
round+0.021
Whe+0.020
rob+0.018
cream+0.017
 Clown+0.016
Neg logit contributions
 pen-0.007
 drawer-0.007
 kind-0.007
 big-0.007
 small-0.007
Token He
Token ID679
Feature activation-3.21e-01

Pos logit contributions
round+0.082
rob+0.080
Whe+0.077
robat+0.071
 sidelines+0.067
Neg logit contributions
 small-0.029
 big-0.028
 kind-0.025
 wise-0.024
 long-0.023
Token had
Token ID550
Feature activation-3.91e-01

Pos logit contributions
round+0.014
Whe+0.013
rob+0.012
cream+0.011
 Clown+0.010
Neg logit contributions
 big-0.007
 small-0.007
 kind-0.007
 wise-0.006
 pen-0.006
Token never
Token ID1239
Feature activation-5.67e-01

Pos logit contributions
round+0.017
Whe+0.017
rob+0.015
cream+0.014
Rob+0.013
Neg logit contributions
 drawer-0.006
 pen-0.006
 big-0.006
 kind-0.006
 small-0.006
Token seen
Token ID1775
Feature activation-1.68e+00

Pos logit contributions
round+0.021
Whe+0.020
rob+0.018
cream+0.016
Rob+0.015
Neg logit contributions
 big-0.014
 small-0.014
 kind-0.014
 pen-0.013
 drawer-0.013
Token rain
Token ID6290
Feature activation-3.92e+00

Pos logit contributions
round+0.085
Whe+0.078
rob+0.074
Mike+0.066
cream+0.066
Neg logit contributions
 big-0.027
 small-0.025
 kind-0.022
 wise-0.021
 pen-0.021
Token before
Token ID878
Feature activation-3.89e-01

Pos logit contributions
round+0.133
Whe+0.127
rob+0.114
cream+0.100
Rob+0.094
Neg logit contributions
 big-0.110
 small-0.107
 kind-0.107
 drawer-0.103
 wise-0.103
Token.
Token ID13
Feature activation-1.65e+00

Pos logit contributions
round+0.021
Whe+0.020
rob+0.018
cream+0.017
Rob+0.016
Neg logit contributions
 big-0.004
 small-0.004
 kind-0.004
 wise-0.003
 pen-0.003
Token
Token ID220
Feature activation-6.00e-01

Pos logit contributions
round+0.083
rob+0.080
Whe+0.078
robat+0.070
 sidelines+0.069
Neg logit contributions
 small-0.024
 big-0.023
 kind-0.021
 wise-0.019
 long-0.017
Token
Token ID198
Feature activation-1.18e+00

Pos logit contributions
round+0.025
Whe+0.024
rob+0.022
cream+0.019
Rob+0.018
Neg logit contributions
 big-0.012
 small-0.012
 kind-0.012
 pen-0.012
 drawer-0.011
Token
Token ID198
Feature activation-1.92e+00

Pos logit contributions
round+0.055
Whe+0.053
rob+0.048
cream+0.045
icans+0.043
Neg logit contributions
 big-0.021
 small-0.020
 kind-0.018
 wise-0.017
 true-0.016
Token never
Token ID1239
Feature activation-4.62e-01

Pos logit contributions
round+0.027
Whe+0.025
rob+0.023
cream+0.020
Rob+0.019
Neg logit contributions
 big-0.017
 small-0.017
 kind-0.017
 pen-0.016
 drawer-0.016
Token tried
Token ID3088
Feature activation-3.89e-01

Pos logit contributions
round+0.018
Whe+0.018
rob+0.016
cream+0.014
Mike+0.014
Neg logit contributions
 big-0.010
 kind-0.010
 small-0.010
 wise-0.010
 pen-0.010
Token the
Token ID262
Feature activation-1.91e+00

Pos logit contributions
round+0.021
Whe+0.020
rob+0.019
cream+0.017
Rob+0.017
Neg logit contributions
 big-0.004
 small-0.004
 kind-0.004
 true-0.003
 wise-0.003
Token sm
Token ID895
Feature activation-2.71e+00

Pos logit contributions
round+0.070
Whe+0.066
rob+0.059
cream+0.052
Rob+0.050
Neg logit contributions
 big-0.049
 small-0.048
 kind-0.048
 pen-0.047
 drawer-0.046
Tokenelly
Token ID6148
Feature activation-3.12e+00

Pos logit contributions
Whe+0.116
round+0.110
rob+0.109
Rob+0.100
 coaster+0.094
Neg logit contributions
 kind-0.043
 big-0.040
 plant-0.038
 seed-0.038
 drawer-0.038
Token coffee
Token ID6891
Feature activation-3.90e+00

Pos logit contributions
round+0.115
Whe+0.112
rob+0.100
Rob+0.088
cream+0.087
Neg logit contributions
 big-0.084
 kind-0.082
 small-0.081
 box-0.077
 wise-0.076
Token,
Token ID11
Feature activation-9.93e-01

Pos logit contributions
round+0.150
Whe+0.148
rob+0.134
cream+0.118
Rob+0.114
Neg logit contributions
 big-0.092
 small-0.089
 kind-0.088
 drawer-0.088
 box-0.088
Token but
Token ID475
Feature activation-3.07e-01

Pos logit contributions
round+0.035
Whe+0.034
rob+0.029
cream+0.026
 Clown+0.025
Neg logit contributions
 big-0.025
 small-0.025
 kind-0.025
 pen-0.025
 drawer-0.025
Token they
Token ID484
Feature activation-7.61e-01

Pos logit contributions
round+0.013
Whe+0.013
rob+0.011
cream+0.010
Rob+0.010
Neg logit contributions
 big-0.006
 small-0.006
 kind-0.006
 pen-0.006
 wise-0.006
Token had
Token ID550
Feature activation-2.17e-01

Pos logit contributions
round+0.032
Whe+0.031
rob+0.028
cream+0.025
Rob+0.024
Neg logit contributions
 big-0.016
 kind-0.016
 small-0.015
 wise-0.015
 pen-0.015
Token a
Token ID257
Feature activation+6.74e-01

Pos logit contributions
round+0.009
Whe+0.009
rob+0.008
cream+0.008
Mike+0.007
Neg logit contributions
 drawer-0.003
 pen-0.003
 big-0.003
 small-0.003
 box-0.003
Token kitchen
Token ID9592
Feature activation+5.30e-01

Pos logit contributions
 kind+0.027
 true+0.027
 wise+0.026
 pen+0.026
 seed+0.026
Neg logit contributions
Whe-0.038
round-0.037
rob-0.032
cream-0.027
Mike-0.027
Token.
Token ID13
Feature activation-2.14e-01

Pos logit contributions
 true+0.010
 diary+0.009
 pen+0.009
 wise+0.008
 kind+0.008
Neg logit contributions
round-0.022
Whe-0.020
rob-0.019
 Clown-0.017
cream-0.017
Token They
Token ID1119
Feature activation+2.59e-01

Pos logit contributions
round+0.008
Whe+0.008
rob+0.007
cream+0.007
Rob+0.006
Neg logit contributions
 big-0.005
 small-0.005
 kind-0.005
 drawer-0.005
 pen-0.005
Token saw
Token ID2497
Feature activation+1.15e-01

Pos logit contributions
 true+0.006
 pen+0.006
 wise+0.006
 big+0.006
 kind+0.006
Neg logit contributions
round-0.009
Whe-0.009
rob-0.008
cream-0.007
Mike-0.007
Token a
Token ID257
Feature activation-1.06e+00

Pos logit contributions
 big+0.002
 small+0.002
 kind+0.002
 pen+0.002
 drawer+0.002
Neg logit contributions
round-0.005
Whe-0.005
rob-0.005
cream-0.004
Rob-0.004
Token m
Token ID285
Feature activation-3.90e+00

Pos logit contributions
round+0.054
Whe+0.051
rob+0.048
 Clown+0.044
cream+0.044
Neg logit contributions
 true-0.008
olving-0.007
 kind-0.007
 wise-0.006
 pen-0.006
Tokenop
Token ID404
Feature activation-1.62e+00

Pos logit contributions
Whe+0.157
round+0.148
rob+0.139
icans+0.131
 sidelines+0.122
Neg logit contributions
 big-0.085
 kind-0.078
 box-0.077
 small-0.076
 drawer-0.076
Token in
Token ID287
Feature activation+2.34e-01

Pos logit contributions
round+0.070
Whe+0.063
rob+0.060
cream+0.055
 Clown+0.052
Neg logit contributions
 true-0.030
 kind-0.030
 small-0.029
 big-0.029
 wise-0.029
Token the
Token ID262
Feature activation+3.38e-01

Pos logit contributions
 pen+0.004
 kind+0.004
 wise+0.004
 seed+0.004
 true+0.004
Neg logit contributions
round-0.010
Whe-0.010
rob-0.009
Mike-0.008
Jacob-0.008
Token corner
Token ID5228
Feature activation+7.16e-01

Pos logit contributions
 true+0.007
 kind+0.006
 wise+0.006
 small+0.006
 big+0.006
Neg logit contributions
round-0.014
Whe-0.013
rob-0.012
cream-0.011
Jacob-0.010
Token.
Token ID13
Feature activation-6.58e-01

Pos logit contributions
 pen+0.014
 true+0.013
 diary+0.013
 big+0.012
 drawer+0.012
Neg logit contributions
round-0.030
Whe-0.026
rob-0.024
Mike-0.023
cream-0.023
Token upon
Token ID2402
Feature activation-4.72e-01

Pos logit contributions
 pen+0.017
 kind+0.016
 drawer+0.016
 file+0.016
 wise+0.016
Neg logit contributions
round-0.028
Whe-0.026
rob-0.024
Mike-0.022
Jacob-0.021
Token a
Token ID257
Feature activation+1.16e+00

Pos logit contributions
round+0.025
Whe+0.023
rob+0.022
cream+0.020
icans+0.019
Neg logit contributions
 big-0.007
 small-0.006
 kind-0.006
 wise-0.005
 true-0.005
Token time
Token ID640
Feature activation-3.94e-01

Pos logit contributions
 big+0.020
 small+0.019
 kind+0.019
 pen+0.018
 drawer+0.018
Neg logit contributions
round-0.052
Whe-0.050
rob-0.045
cream-0.041
Rob-0.040
Token,
Token ID11
Feature activation+5.79e-01

Pos logit contributions
round+0.019
Whe+0.019
rob+0.017
cream+0.016
Mike+0.016
Neg logit contributions
 big-0.005
 pen-0.004
 kind-0.004
 small-0.004
 drawer-0.004
Token in
Token ID287
Feature activation-2.34e-01

Pos logit contributions
 pen+0.011
 drawer+0.011
 file+0.010
 wise+0.010
 kind+0.010
Neg logit contributions
round-0.023
Whe-0.023
rob-0.020
Jacob-0.018
cream-0.018
Token a
Token ID257
Feature activation+2.78e+00

Pos logit contributions
round+0.010
Whe+0.010
rob+0.009
cream+0.008
Rob+0.008
Neg logit contributions
 big-0.004
 drawer-0.004
 pen-0.004
 kind-0.004
 small-0.004
Token small
Token ID1402
Feature activation+1.15e+00

Pos logit contributions
olving+0.039
 pen+0.035
 drawer+0.032
 file+0.032
 pays+0.029
Neg logit contributions
Whe-0.127
round-0.116
Jacob-0.103
rob-0.101
Rob-0.100
Token town
Token ID3240
Feature activation+2.59e-01

Pos logit contributions
olving+0.028
 true+0.027
 pen+0.026
 loose+0.026
 valuable+0.026
Neg logit contributions
round-0.038
Whe-0.035
rob-0.029
 Clown-0.028
hop-0.026
Token,
Token ID11
Feature activation+5.76e-01

Pos logit contributions
 pen+0.005
 drawer+0.005
 diary+0.004
 needle+0.004
 handy+0.004
Neg logit contributions
round-0.011
Whe-0.010
rob-0.009
 Clown-0.008
Mike-0.008
Token there
Token ID612
Feature activation-4.40e-01

Pos logit contributions
 pen+0.011
 drawer+0.011
 handy+0.010
 file+0.010
 true+0.010
Neg logit contributions
round-0.023
Whe-0.023
rob-0.019
Jacob-0.018
cream-0.018
Token was
Token ID373
Feature activation-3.98e-01

Pos logit contributions
round+0.019
Whe+0.018
rob+0.016
cream+0.015
Rob+0.014
Neg logit contributions
 big-0.009
 small-0.008
 kind-0.008
 pen-0.008
 drawer-0.008
Token ask
Token ID1265
Feature activation+1.54e+00

Pos logit contributions
round+0.018
Whe+0.016
rob+0.015
cream+0.014
Rob+0.013
Neg logit contributions
 kind-0.008
 small-0.008
 big-0.008
 pen-0.008
 drawer-0.008
Token your
Token ID534
Feature activation+3.24e-01

Pos logit contributions
 big+0.029
 small+0.029
 kind+0.028
 drawer+0.028
 pen+0.028
Neg logit contributions
round-0.065
Whe-0.062
rob-0.055
cream-0.052
Rob-0.049
Token neighbor
Token ID4780
Feature activation-2.80e-01

Pos logit contributions
 pen+0.006
 seed+0.006
 file+0.006
 drawer+0.006
 small+0.006
Neg logit contributions
round-0.012
Whe-0.012
 Clown-0.010
rob-0.010
cream-0.010
Token if
Token ID611
Feature activation+1.86e+00

Pos logit contributions
round+0.012
Whe+0.012
rob+0.010
cream+0.010
 Clown+0.009
Neg logit contributions
 big-0.005
 small-0.005
 drawer-0.005
 kind-0.005
 pen-0.005
Token they
Token ID484
Feature activation+9.86e-03

Pos logit contributions
 drawer+0.052
 pen+0.051
 solar+0.049
olving+0.049
 big+0.049
Neg logit contributions
round-0.053
Whe-0.050
cream-0.044
rob-0.043
Rob-0.041
Token found
Token ID1043
Feature activation+2.12e+00

Pos logit contributions
 small+0.000
 drawer+0.000
 big+0.000
 kind+0.000
 pen+0.000
Neg logit contributions
round-0.000
Whe-0.000
rob-0.000
cream-0.000
Rob-0.000
Token it
Token ID340
Feature activation-5.23e-01

Pos logit contributions
 kind+0.039
 big+0.039
 small+0.038
 drawer+0.038
 wise+0.038
Neg logit contributions
round-0.088
Whe-0.084
rob-0.077
cream-0.072
Rob-0.067
Token."
Token ID526
Feature activation+9.77e-01

Pos logit contributions
round+0.021
Whe+0.020
rob+0.018
cream+0.017
Rob+0.016
Neg logit contributions
 big-0.010
 kind-0.010
 small-0.010
 pen-0.010
 drawer-0.010
Token Tim
Token ID5045
Feature activation+6.08e-01

Pos logit contributions
 big+0.019
 small+0.019
 file+0.018
 box+0.018
 kind+0.018
Neg logit contributions
Whe-0.041
round-0.041
rob-0.039
Rob-0.033
robat-0.033
Tokenmy
Token ID1820
Feature activation+1.23e+00

Pos logit contributions
 big+0.014
 small+0.014
 kind+0.014
 pen+0.013
 drawer+0.013
Neg logit contributions
round-0.024
Whe-0.023
rob-0.020
cream-0.018
Rob-0.017
Token thought
Token ID1807
Feature activation+4.03e-01

Pos logit contributions
 big+0.026
 small+0.026
 kind+0.026
 pen+0.025
 drawer+0.025
Neg logit contributions
round-0.050
Whe-0.048
rob-0.043
cream-0.039
Rob-0.037
Token cut
Token ID2005
Feature activation+1.50e+00

Pos logit contributions
 true+0.033
 wise+0.032
 kind+0.032
 small+0.031
 diary+0.031
Neg logit contributions
round-0.061
Whe-0.059
rob-0.051
cream-0.046
Rob-0.045
Token a
Token ID257
Feature activation+2.25e+00

Pos logit contributions
 kind+0.031
 wise+0.031
 diary+0.030
 drawer+0.029
 pen+0.029
Neg logit contributions
round-0.058
Whe-0.056
rob-0.050
cream-0.047
Mike-0.043
Token hole
Token ID7604
Feature activation+1.06e+00

Pos logit contributions
olving+0.033
 wise+0.032
 kind+0.032
 true+0.031
rick+0.028
Neg logit contributions
round-0.097
Whe-0.089
rob-0.078
Rob-0.074
Jacob-0.072
Token for
Token ID329
Feature activation+3.25e-01

Pos logit contributions
 diary+0.028
 pen+0.028
 file+0.027
 true+0.026
 drawer+0.026
Neg logit contributions
round-0.035
Whe-0.028
rob-0.025
cream-0.025
edly-0.023
Token the
Token ID262
Feature activation+6.79e-01

Pos logit contributions
 kind+0.005
 big+0.005
 pen+0.005
 small+0.005
 drawer+0.005
Neg logit contributions
round-0.014
Whe-0.014
rob-0.013
cream-0.012
Mike-0.011
Token door
Token ID3420
Feature activation+2.47e+00

Pos logit contributions
 kind+0.012
 big+0.012
 small+0.012
 wise+0.012
 true+0.012
Neg logit contributions
round-0.028
Whe-0.027
rob-0.024
cream-0.022
 Clown-0.022
Token.
Token ID13
Feature activation+9.80e-01

Pos logit contributions
Ru+0.059
 diary+0.058
 pen+0.056
 solar+0.054
 wise+0.053
Neg logit contributions
round-0.090
Whe-0.068
cream-0.066
ding-0.063
whe-0.061
Token They
Token ID1119
Feature activation+7.30e-01

Pos logit contributions
 kind+0.023
 big+0.023
 small+0.022
 pen+0.022
 true+0.022
Neg logit contributions
round-0.038
Whe-0.037
rob-0.032
cream-0.029
Mike-0.028
Token tape
Token ID9154
Feature activation+1.15e+00

Pos logit contributions
 true+0.016
 kind+0.015
 wise+0.015
 small+0.015
 big+0.014
Neg logit contributions
round-0.028
Whe-0.027
rob-0.023
cream-0.021
Jacob-0.021
Token some
Token ID617
Feature activation+1.44e+00

Pos logit contributions
 wise+0.030
 diary+0.029
 kind+0.028
 seed+0.028
 file+0.027
Neg logit contributions
round-0.037
Whe-0.032
icle-0.029
icles-0.028
cream-0.028
Token paper
Token ID3348
Feature activation+1.58e+00

Pos logit contributions
 true+0.026
 wise+0.025
 kind+0.024
 drawer+0.023
 pen+0.023
Neg logit contributions
round-0.056
Whe-0.055
rob-0.048
cream-0.046
Rob-0.042
Token She
Token ID1375
Feature activation+1.87e+00

Pos logit contributions
 true+0.023
 kind+0.023
 big+0.023
 key+0.022
 pen+0.022
Neg logit contributions
round-0.026
Whe-0.024
cream-0.018
Ste-0.017
 While-0.017
Token used
Token ID973
Feature activation+2.43e+00

Pos logit contributions
 small+0.043
 big+0.043
 true+0.042
olving+0.042
 drawer+0.042
Neg logit contributions
round-0.067
Whe-0.057
rob-0.053
cream-0.052
Rob-0.048
Token the
Token ID262
Feature activation+2.02e+00

Pos logit contributions
 kind+0.053
 wise+0.053
 drawer+0.052
 seed+0.050
 true+0.050
Neg logit contributions
round-0.078
Whe-0.074
cream-0.067
rob-0.066
icans-0.063
Token peanuts
Token ID45612
Feature activation+5.89e-02

Pos logit contributions
ook+0.036
 drawer+0.034
 solar+0.034
 handy+0.033
olving+0.033
Neg logit contributions
round-0.071
rob-0.069
Whe-0.068
rell-0.059
cream-0.059
Token to
Token ID284
Feature activation+1.39e+00

Pos logit contributions
 kind+0.001
 wise+0.001
 drawer+0.001
 small+0.001
 true+0.001
Neg logit contributions
round-0.003
Whe-0.002
cream-0.002
rob-0.002
seek-0.002
Token make
Token ID787
Feature activation+2.08e+00

Pos logit contributions
 drawer+0.027
 diary+0.026
 small+0.026
 kind+0.026
 true+0.026
Neg logit contributions
round-0.051
Whe-0.047
rob-0.041
cream-0.040
 sidelines-0.038
Token a
Token ID257
Feature activation+1.52e+00

Pos logit contributions
 drawer+0.049
 box+0.047
 seed+0.047
 kind+0.046
 pen+0.045
Neg logit contributions
round-0.069
Whe-0.068
rob-0.062
cream-0.059
icans-0.055
Token peanut
Token ID26636
Feature activation+4.17e-01

Pos logit contributions
olving+0.022
 drawer+0.021
 blunt+0.020
Ru+0.019
 kind+0.019
Neg logit contributions
round-0.061
Whe-0.056
 sidelines-0.049
rob-0.049
cream-0.047
Token butter
Token ID9215
Feature activation+7.07e-01

Pos logit contributions
 drawer+0.008
 solar+0.008
 small+0.008
 kind+0.008
 wise+0.008
Neg logit contributions
round-0.016
Whe-0.014
rob-0.013
cream-0.013
whe-0.012
Token sandwich
Token ID20433
Feature activation+3.73e-01

Pos logit contributions
 kind+0.015
 small+0.015
 drawer+0.015
 wise+0.015
 solar+0.014
Neg logit contributions
round-0.027
Whe-0.023
rob-0.022
cream-0.021
whe-0.019
Token and
Token ID290
Feature activation+1.16e+00

Pos logit contributions
 wise+0.008
 small+0.008
 drawer+0.008
 kind+0.007
 pen+0.007
Neg logit contributions
round-0.014
Whe-0.013
rob-0.011
cream-0.011
whe-0.010
Token day
Token ID1110
Feature activation+1.55e+00

Pos logit contributions
 big+0.039
 small+0.038
 kind+0.038
 pen+0.037
 drawer+0.037
Neg logit contributions
round-0.076
Whe-0.072
rob-0.065
cream-0.059
Rob-0.056
Token,
Token ID11
Feature activation+6.67e-01

Pos logit contributions
 drawer+0.026
 true+0.026
 kind+0.025
 diary+0.025
 pen+0.025
Neg logit contributions
round-0.069
Whe-0.062
rob-0.056
cream-0.053
hop-0.051
Token on
Token ID319
Feature activation+4.21e-03

Pos logit contributions
 big+0.015
 kind+0.015
 small+0.015
 drawer+0.015
 pen+0.015
Neg logit contributions
round-0.026
Whe-0.024
rob-0.022
cream-0.020
Rob-0.019
Token a
Token ID257
Feature activation+1.91e+00

Pos logit contributions
 big+0.000
 kind+0.000
 small+0.000
 pen+0.000
 drawer+0.000
Neg logit contributions
round-0.000
Whe-0.000
rob-0.000
cream-0.000
Rob-0.000
Token little
Token ID1310
Feature activation+9.26e-01

Pos logit contributions
 drawer+0.019
 true+0.019
 kind+0.018
 pen+0.018
olving+0.017
Neg logit contributions
round-0.097
Whe-0.091
rob-0.084
cream-0.077
Rob-0.077
Token boat
Token ID8848
Feature activation+2.01e+00

Pos logit contributions
 big+0.022
 small+0.021
 kind+0.021
 pen+0.021
 drawer+0.021
Neg logit contributions
round-0.036
Whe-0.034
rob-0.030
cream-0.027
Rob-0.026
Token.
Token ID13
Feature activation+1.36e-01

Pos logit contributions
 small+0.031
 big+0.031
 drawer+0.031
 kind+0.030
 true+0.030
Neg logit contributions
round-0.093
Whe-0.086
rob-0.081
cream-0.075
hop-0.070
Token He
Token ID679
Feature activation+8.56e-01

Pos logit contributions
 big+0.003
 small+0.003
 kind+0.002
 wise+0.002
 drawer+0.002
Neg logit contributions
round-0.006
Whe-0.006
rob-0.006
cream-0.005
Mike-0.005
Token cast
Token ID3350
Feature activation+7.69e-01

Pos logit contributions
 big+0.017
 small+0.017
 kind+0.017
 pen+0.016
 wise+0.016
Neg logit contributions
round-0.036
Whe-0.035
rob-0.032
cream-0.029
Mike-0.027
Token his
Token ID465
Feature activation+1.05e+00

Pos logit contributions
 drawer+0.017
 pen+0.017
 kind+0.017
 big+0.017
 box+0.017
Neg logit contributions
round-0.029
Whe-0.027
rob-0.025
cream-0.023
Rob-0.021
Token line
Token ID1627
Feature activation+9.22e-01

Pos logit contributions
 big+0.023
 small+0.022
 kind+0.022
 pen+0.022
 drawer+0.021
Neg logit contributions
round-0.042
Whe-0.040
rob-0.036
cream-0.033
Rob-0.031
Token there
Token ID612
Feature activation-4.95e-01

Pos logit contributions
round+0.036
Whe+0.034
rob+0.032
cream+0.029
Rob+0.028
Neg logit contributions
 big-0.007
 small-0.007
 kind-0.007
 pen-0.006
 drawer-0.006
Token were
Token ID547
Feature activation+1.29e-01

Pos logit contributions
round+0.024
Whe+0.022
rob+0.021
cream+0.019
icans+0.018
Neg logit contributions
 big-0.008
 small-0.008
 kind-0.008
 box-0.007
 drawer-0.007
Token two
Token ID734
Feature activation+3.44e-01

Pos logit contributions
 big+0.002
 small+0.002
 kind+0.002
 true+0.002
 wise+0.002
Neg logit contributions
round-0.006
Whe-0.006
rob-0.005
cream-0.005
 Clown-0.005
Token friends
Token ID2460
Feature activation+1.97e-01

Pos logit contributions
 drawer+0.007
 pen+0.007
 true+0.007
 big+0.007
 file+0.007
Neg logit contributions
round-0.013
Whe-0.012
rob-0.011
cream-0.010
 Clown-0.010
Token called
Token ID1444
Feature activation+4.47e-01

Pos logit contributions
 big+0.004
 small+0.004
 kind+0.004
 pen+0.004
 drawer+0.004
Neg logit contributions
round-0.008
Whe-0.008
rob-0.007
cream-0.006
Rob-0.006
Token Jack
Token ID3619
Feature activation+1.79e+00

Pos logit contributions
 big+0.012
 small+0.012
 kind+0.012
 pen+0.012
 drawer+0.011
Neg logit contributions
round-0.016
Whe-0.015
rob-0.013
cream-0.012
Rob-0.011
Token and
Token ID290
Feature activation+3.43e-01

Pos logit contributions
 big+0.034
 small+0.034
 kind+0.033
 drawer+0.033
 pen+0.032
Neg logit contributions
round-0.076
Whe-0.071
rob-0.065
cream-0.060
Rob-0.057
Token Sam
Token ID3409
Feature activation+1.42e+00

Pos logit contributions
 big+0.008
 small+0.008
 drawer+0.008
 kind+0.008
 pen+0.008
Neg logit contributions
round-0.013
Whe-0.012
rob-0.011
cream-0.010
 Clown-0.009
Token.
Token ID13
Feature activation+3.70e-01

Pos logit contributions
 big+0.016
 small+0.015
 kind+0.015
 pen+0.014
 drawer+0.014
Neg logit contributions
round-0.072
Whe-0.070
rob-0.065
cream-0.060
Rob-0.058
Token They
Token ID1119
Feature activation+6.89e-01

Pos logit contributions
 big+0.007
 small+0.007
 pen+0.007
 kind+0.007
 drawer+0.007
Neg logit contributions
round-0.016
Whe-0.015
rob-0.014
cream-0.013
Rob-0.012
Token were
Token ID547
Feature activation+5.11e-01

Pos logit contributions
 big+0.017
 kind+0.016
 small+0.016
 pen+0.015
 wise+0.015
Neg logit contributions
round-0.027
Whe-0.026
rob-0.023
cream-0.021
Rob-0.020
Token after
Token ID706
Feature activation-1.31e+00

Pos logit contributions
round+0.006
Whe+0.006
rob+0.006
cream+0.005
Rob+0.005
Neg logit contributions
 big-0.003
 small-0.003
 kind-0.003
 wise-0.003
 true-0.003
Token thinking
Token ID3612
Feature activation-5.03e-01

Pos logit contributions
round+0.065
Whe+0.063
rob+0.056
cream+0.051
Rob+0.050
Neg logit contributions
 big-0.023
 small-0.022
 long-0.021
 kind-0.020
 true-0.019
Token it
Token ID340
Feature activation-7.14e-01

Pos logit contributions
round+0.017
Whe+0.015
rob+0.013
cream+0.012
Jacob+0.012
Neg logit contributions
 pen-0.013
 drawer-0.013
 small-0.012
 big-0.012
 box-0.012
Token through
Token ID832
Feature activation+3.15e-01

Pos logit contributions
round+0.020
Whe+0.020
rob+0.017
Jacob+0.016
cream+0.016
Neg logit contributions
 drawer-0.019
 pen-0.018
 file-0.018
 seed-0.018
 blunt-0.017
Token,
Token ID11
Feature activation-5.59e-01

Pos logit contributions
 pen+0.006
 kind+0.006
 wise+0.006
 true+0.006
 drawer+0.005
Neg logit contributions
round-0.011
Whe-0.011
rob-0.010
hop-0.009
 Clown-0.009
Token she
Token ID673
Feature activation+4.51e-01

Pos logit contributions
round+0.022
Whe+0.021
rob+0.018
cream+0.017
Jacob+0.016
Neg logit contributions
 big-0.012
 pen-0.012
 drawer-0.012
 small-0.012
 kind-0.012
Token hugged
Token ID41130
Feature activation-3.96e-01

Pos logit contributions
 small+0.010
 big+0.010
 kind+0.010
 drawer+0.010
 file+0.010
Neg logit contributions
round-0.017
Whe-0.016
rob-0.014
cream-0.013
Rob-0.012
Token Amy
Token ID14235
Feature activation-5.58e-01

Pos logit contributions
round+0.018
Whe+0.017
rob+0.015
cream+0.014
 Clown+0.013
Neg logit contributions
 big-0.007
 small-0.007
 kind-0.007
 drawer-0.006
 pen-0.006
Token and
Token ID290
Feature activation-5.96e-01

Pos logit contributions
round+0.022
Whe+0.021
rob+0.018
cream+0.017
hop+0.017
Neg logit contributions
 big-0.011
 small-0.011
 kind-0.011
 drawer-0.011
 pen-0.011
Token said
Token ID531
Feature activation+9.66e-01

Pos logit contributions
round+0.022
Whe+0.021
rob+0.019
cream+0.017
Mike+0.016
Neg logit contributions
 big-0.015
 small-0.014
 kind-0.014
 drawer-0.014
 pen-0.014
Token,
Token ID11
Feature activation-2.07e-01

Pos logit contributions
 pen+0.026
 drawer+0.025
 file+0.024
 box+0.023
 shed+0.023
Neg logit contributions
round-0.030
Whe-0.027
ored-0.024
Mike-0.023
seek-0.023
Token strong
Token ID1913
Feature activation+2.82e-01

Pos logit contributions
 big+0.018
 pen+0.018
 small+0.018
 drawer+0.018
 kind+0.017
Neg logit contributions
round-0.035
Whe-0.033
rob-0.030
cream-0.027
Rob-0.026
Token enough
Token ID1576
Feature activation+5.07e-01

Pos logit contributions
 pen+0.007
 drawer+0.006
 seed+0.006
 small+0.006
 box+0.006
Neg logit contributions
round-0.010
Whe-0.010
rob-0.009
cream-0.008
Mike-0.008
Token.
Token ID13
Feature activation+1.04e-01

Pos logit contributions
 pen+0.009
 drawer+0.008
 true+0.008
 diary+0.008
 solar+0.008
Neg logit contributions
round-0.021
Whe-0.019
cream-0.017
rob-0.017
Mike-0.016
Token He
Token ID679
Feature activation+6.37e-01

Pos logit contributions
 small+0.002
 big+0.002
 kind+0.002
 drawer+0.002
 wise+0.002
Neg logit contributions
round-0.004
Whe-0.004
rob-0.004
cream-0.003
icans-0.003
Token tried
Token ID3088
Feature activation+8.55e-01

Pos logit contributions
 big+0.013
 small+0.013
 kind+0.013
 pen+0.013
 wise+0.012
Neg logit contributions
round-0.027
Whe-0.026
rob-0.024
cream-0.021
 Clown-0.020
Token to
Token ID284
Feature activation+7.55e-01

Pos logit contributions
 big+0.011
 kind+0.011
 small+0.011
 pen+0.011
 drawer+0.010
Neg logit contributions
round-0.041
Whe-0.039
rob-0.036
cream-0.033
Mike-0.032
Token save
Token ID3613
Feature activation+8.06e-01

Pos logit contributions
 big+0.014
 small+0.014
 kind+0.014
 pen+0.014
 drawer+0.013
Neg logit contributions
round-0.033
Whe-0.031
rob-0.029
cream-0.026
Rob-0.025
Token the
Token ID262
Feature activation+3.70e-01

Pos logit contributions
 big+0.015
 small+0.014
 kind+0.014
 pen+0.013
 drawer+0.013
Neg logit contributions
round-0.036
Whe-0.035
rob-0.032
cream-0.028
Rob-0.028
Token cat
Token ID3797
Feature activation-1.39e-01

Pos logit contributions
 drawer+0.008
 true+0.007
 solar+0.007
 pen+0.007
 kind+0.007
Neg logit contributions
Whe-0.014
round-0.014
rob-0.011
 coaster-0.010
Ste-0.010
Token but
Token ID475
Feature activation-4.06e-01

Pos logit contributions
round+0.006
Whe+0.005
rob+0.005
cream+0.005
hop+0.004
Neg logit contributions
 drawer-0.003
 kind-0.002
 small-0.002
 big-0.002
 pen-0.002
Token he
Token ID339
Feature activation+4.38e-01

Pos logit contributions
round+0.016
Whe+0.015
rob+0.013
cream+0.012
Mike+0.012
Neg logit contributions
 big-0.009
 kind-0.009
 small-0.008
 pen-0.008
 drawer-0.008
Token he
Token ID339
Feature activation+6.48e-01

Pos logit contributions
 big+0.004
 small+0.004
 kind+0.004
 long+0.004
 true+0.003
Neg logit contributions
round-0.007
Whe-0.006
rob-0.006
icans-0.005
robat-0.005
Token asked
Token ID1965
Feature activation+4.39e-01

Pos logit contributions
 big+0.014
 small+0.014
 kind+0.014
 pen+0.013
 drawer+0.013
Neg logit contributions
round-0.026
Whe-0.025
rob-0.023
cream-0.020
 Clown-0.020
Token his
Token ID465
Feature activation+3.34e-01

Pos logit contributions
 seed+0.014
 pen+0.014
 drawer+0.013
 blunt+0.013
 box+0.013
Neg logit contributions
round-0.012
Whe-0.011
rob-0.009
cream-0.009
Mike-0.009
Token dad
Token ID9955
Feature activation+9.97e-02

Pos logit contributions
 seed+0.008
olving+0.007
 pen+0.007
 drawer+0.007
 true+0.007
Neg logit contributions
round-0.011
Whe-0.011
 Clown-0.009
cream-0.009
rob-0.009
Token for
Token ID329
Feature activation-1.05e-01

Pos logit contributions
 blunt+0.002
 pen+0.002
 seed+0.002
 solar+0.002
 big+0.002
Neg logit contributions
round-0.004
Whe-0.003
cream-0.003
 Clown-0.003
rob-0.003
Token help
Token ID1037
Feature activation+1.49e+00

Pos logit contributions
round+0.004
Whe+0.004
Mike+0.003
rob+0.003
cream+0.003
Neg logit contributions
 seed-0.002
 drawer-0.002
 solar-0.002
 big-0.002
 pen-0.002
Token.
Token ID13
Feature activation-3.32e-01

Pos logit contributions
 big+0.024
 small+0.024
 seed+0.023
 kind+0.023
 true+0.022
Neg logit contributions
round-0.065
Whe-0.059
rob-0.053
cream-0.052
hop-0.051
Token His
Token ID2399
Feature activation+3.39e-01

Pos logit contributions
round+0.011
Whe+0.011
rob+0.010
cream+0.008
icans+0.008
Neg logit contributions
 big-0.010
 small-0.009
 kind-0.009
 wise-0.009
 drawer-0.009
Token dad
Token ID9955
Feature activation+9.36e-01

Pos logit contributions
 seed+0.007
 small+0.007
 pen+0.006
 big+0.006
 drawer+0.006
Neg logit contributions
round-0.013
Whe-0.013
rob-0.011
cream-0.011
Mike-0.010
Token nodded
Token ID14464
Feature activation+1.29e+00

Pos logit contributions
 big+0.025
 small+0.024
 drawer+0.024
 true+0.024
 pen+0.024
Neg logit contributions
round-0.031
Whe-0.028
rob-0.026
cream-0.025
Mike-0.023
Token and
Token ID290
Feature activation+4.91e-01

Pos logit contributions
 drawer+0.029
 pen+0.028
 file+0.028
 big+0.028
 seed+0.028
Neg logit contributions
round-0.047
Whe-0.044
rob-0.038
cream-0.036
Mike-0.036
Token at
Token ID379
Feature activation-2.41e-01

Pos logit contributions
 drawer+0.007
 pen+0.007
 kind+0.007
 big+0.007
 small+0.007
Neg logit contributions
round-0.017
Whe-0.016
rob-0.015
cream-0.014
Rob-0.013
Token times
Token ID1661
Feature activation+4.92e-01

Pos logit contributions
round+0.011
Whe+0.010
rob+0.009
cream+0.009
Rob+0.008
Neg logit contributions
 big-0.005
 small-0.005
 kind-0.005
 true-0.005
 box-0.004
Token,
Token ID11
Feature activation-6.47e-01

Pos logit contributions
 big+0.009
 pen+0.009
 small+0.009
 kind+0.009
 drawer+0.009
Neg logit contributions
round-0.021
Whe-0.020
rob-0.018
cream-0.016
Rob-0.016
Token but
Token ID475
Feature activation+3.01e-01

Pos logit contributions
round+0.022
Whe+0.021
rob+0.017
cream+0.016
 Clown+0.015
Neg logit contributions
 big-0.017
 drawer-0.017
 pen-0.017
 small-0.017
 kind-0.017
Token it
Token ID340
Feature activation+9.54e-01

Pos logit contributions
 drawer+0.007
 pen+0.007
 big+0.006
 box+0.006
 small+0.006
Neg logit contributions
round-0.011
Whe-0.011
rob-0.009
cream-0.009
 Clown-0.009
Token's
Token ID338
Feature activation+1.21e-02

Pos logit contributions
 big+0.012
 drawer+0.012
 pen+0.012
 small+0.012
 kind+0.011
Neg logit contributions
round-0.044
Whe-0.043
rob-0.039
cream-0.037
 Clown-0.035
Token always
Token ID1464
Feature activation+1.19e+00

Pos logit contributions
 big+0.000
 pen+0.000
 drawer+0.000
 small+0.000
 kind+0.000
Neg logit contributions
round-0.001
Whe-0.001
rob-0.000
cream-0.000
Rob-0.000
Token right
Token ID826
Feature activation+9.16e-01

Pos logit contributions
 drawer+0.021
 pen+0.021
 big+0.019
 box+0.019
 file+0.019
Neg logit contributions
round-0.048
Whe-0.046
rob-0.041
cream-0.039
 Clown-0.036
Token."
Token ID526
Feature activation+5.54e-01

Pos logit contributions
 pen+0.019
 drawer+0.018
 big+0.018
 small+0.018
 file+0.018
Neg logit contributions
round-0.036
Whe-0.032
rob-0.029
cream-0.028
Rob-0.026
Token
Token ID198
Feature activation-2.39e-01

Pos logit contributions
 big+0.011
 small+0.011
 kind+0.011
 pen+0.011
 drawer+0.011
Neg logit contributions
round-0.023
Whe-0.022
rob-0.020
cream-0.018
Rob-0.017
Token
Token ID198
Feature activation+3.82e-01

Pos logit contributions
round+0.009
Whe+0.009
rob+0.008
cream+0.007
Mike+0.007
Neg logit contributions
 big-0.005
 kind-0.005
 small-0.005
 pen-0.005
 drawer-0.005
Token I
Token ID314
Feature activation-9.55e-01

Pos logit contributions
round+0.033
Whe+0.032
rob+0.027
cream+0.025
 Clown+0.025
Neg logit contributions
 big-0.019
 pen-0.018
 small-0.018
 kind-0.018
 drawer-0.018
Token'm
Token ID1101
Feature activation-5.00e-01

Pos logit contributions
round+0.039
Whe+0.037
rob+0.034
cream+0.031
Rob+0.029
Neg logit contributions
 big-0.021
 small-0.020
 kind-0.020
 pen-0.019
 wise-0.019
Token looking
Token ID2045
Feature activation+1.25e+00

Pos logit contributions
round+0.020
Whe+0.019
rob+0.017
cream+0.016
Rob+0.015
Neg logit contributions
 big-0.011
 small-0.010
 kind-0.010
 pen-0.010
 drawer-0.010
Token for
Token ID329
Feature activation-2.48e-01

Pos logit contributions
 pen+0.022
 drawer+0.022
 box+0.021
 kind+0.021
 file+0.020
Neg logit contributions
round-0.053
Whe-0.050
rob-0.043
cream-0.041
 Clown-0.040
Token a
Token ID257
Feature activation+3.26e-01

Pos logit contributions
round+0.011
Whe+0.010
rob+0.009
cream+0.008
Rob+0.008
Neg logit contributions
 big-0.005
 small-0.005
 kind-0.005
 pen-0.005
 wise-0.005
Token place
Token ID1295
Feature activation-2.04e-01

Pos logit contributions
 kind+0.005
 drawer+0.005
 big+0.005
 small+0.005
 pen+0.005
Neg logit contributions
round-0.014
Whe-0.013
rob-0.011
 Clown-0.011
cream-0.011
Token to
Token ID284
Feature activation-8.60e-01

Pos logit contributions
round+0.010
Whe+0.009
rob+0.008
cream+0.008
 Clown+0.007
Neg logit contributions
 small-0.003
 big-0.003
 pen-0.003
 kind-0.003
 wise-0.003
Token bloom
Token ID29955
Feature activation+6.67e-01

Pos logit contributions
round+0.034
Whe+0.033
rob+0.029
cream+0.026
Mike+0.025
Neg logit contributions
 big-0.018
 small-0.018
 pen-0.018
 drawer-0.018
 kind-0.018
Token."
Token ID526
Feature activation+5.01e-01

Pos logit contributions
 pen+0.014
 drawer+0.014
 file+0.014
 small+0.014
 kind+0.013
Neg logit contributions
round-0.027
Whe-0.024
rob-0.022
cream-0.020
ored-0.019
Token
Token ID198
Feature activation+1.43e-01

Pos logit contributions
 small+0.008
 big+0.008
 kind+0.008
 wise+0.008
 pen+0.008
Neg logit contributions
round-0.023
Whe-0.022
rob-0.021
cream-0.018
icans-0.018
Token
Token ID198
Feature activation+5.22e-01

Pos logit contributions
 big+0.003
 small+0.003
 kind+0.003
 pen+0.003
 drawer+0.003
Neg logit contributions
round-0.006
Whe-0.005
rob-0.005
cream-0.004
Mike-0.004
Token a
Token ID257
Feature activation+1.44e+00

Pos logit contributions
 big+0.009
 small+0.009
 kind+0.008
 long+0.007
 true+0.007
Neg logit contributions
round-0.040
Whe-0.036
rob-0.035
cream-0.033
 Clown-0.032
Token little
Token ID1310
Feature activation+2.29e-01

Pos logit contributions
olving+0.022
 drawer+0.021
 pen+0.020
 true+0.020
 box+0.019
Neg logit contributions
round-0.063
Whe-0.060
rob-0.052
 Clown-0.049
Rob-0.049
Token girl
Token ID2576
Feature activation+9.80e-01

Pos logit contributions
 big+0.005
 kind+0.005
 small+0.005
 true+0.005
 drawer+0.005
Neg logit contributions
round-0.009
Whe-0.008
rob-0.008
cream-0.007
 Clown-0.007
Token who
Token ID508
Feature activation+1.61e+00

Pos logit contributions
olving+0.035
 true+0.034
 handy+0.034
 drawer+0.033
 key+0.032
Neg logit contributions
round-0.024
Whe-0.017
rob-0.015
hop-0.015
cream-0.014
Token loved
Token ID6151
Feature activation-2.29e-01

Pos logit contributions
 drawer+0.045
 pen+0.045
 handy+0.045
 box+0.044
 file+0.043
Neg logit contributions
Whe-0.048
round-0.048
rob-0.039
cream-0.036
|-0.034
Token to
Token ID284
Feature activation-4.08e-01

Pos logit contributions
round+0.012
Whe+0.011
rob+0.010
cream+0.010
 Clown+0.009
Neg logit contributions
 big-0.002
 small-0.002
 kind-0.002
 pen-0.002
 drawer-0.002
Token search
Token ID2989
Feature activation+1.27e+00

Pos logit contributions
round+0.015
Whe+0.014
rob+0.012
cream+0.011
 Clown+0.011
Neg logit contributions
 small-0.010
 kind-0.009
 big-0.009
 drawer-0.009
 true-0.009
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 true+0.025
 pen+0.025
olving+0.024
 drawer+0.024
 kind+0.023
Neg logit contributions
round-0.048
Whe-0.043
rob-0.040
cream-0.039
dogs-0.034
Token Everywhere
Token ID48201
Feature activation+9.95e-01

Pos logit contributions
 true+0.023
 big+0.023
 kind+0.023
 pen+0.023
 small+0.023
Neg logit contributions
round-0.037
Whe-0.037
rob-0.027
 Clown-0.027
cream-0.027
Token she
Token ID673
Feature activation+1.80e+00

Pos logit contributions
olving+0.026
 handy+0.025
 pen+0.024
 loose+0.024
 true+0.024
Neg logit contributions
Whe-0.032
round-0.031
rob-0.028
cream-0.024
 Clown-0.024
Token went
Token ID1816
Feature activation+7.55e-01

Pos logit contributions
 true+0.045
olving+0.044
 wise+0.043
 kind+0.043
 handy+0.042
Neg logit contributions
round-0.057
Whe-0.053
rob-0.051
cream-0.049
Rob-0.041
Token Big
Token ID4403
Feature activation+4.96e-01

Pos logit contributions
round+0.028
Whe+0.026
rob+0.026
cream+0.022
robat+0.021
Neg logit contributions
 big-0.012
 small-0.012
 kind-0.011
 wise-0.011
 drawer-0.011
Token drops
Token ID10532
Feature activation-7.93e-01

Pos logit contributions
 big+0.013
 drawer+0.013
 kind+0.013
 small+0.013
 pen+0.013
Neg logit contributions
round-0.017
Whe-0.016
rob-0.014
cream-0.013
Rob-0.012
Token of
Token ID286
Feature activation-1.23e+00

Pos logit contributions
round+0.031
Whe+0.029
rob+0.025
cream+0.023
Mike+0.023
Neg logit contributions
 drawer-0.017
 pen-0.017
 kind-0.017
 true-0.017
 small-0.017
Token water
Token ID1660
Feature activation-8.47e-01

Pos logit contributions
round+0.055
Whe+0.052
rob+0.048
cream+0.043
Rob+0.041
Neg logit contributions
 big-0.023
 small-0.023
 kind-0.022
 pen-0.021
 wise-0.021
Token started
Token ID2067
Feature activation+6.54e-01

Pos logit contributions
round+0.034
Whe+0.032
rob+0.029
cream+0.027
Mike+0.025
Neg logit contributions
 kind-0.017
 big-0.017
 small-0.017
 drawer-0.017
 pen-0.017
Token to
Token ID284
Feature activation-6.31e-01

Pos logit contributions
 drawer+0.009
 pen+0.009
 file+0.009
 kind+0.008
 box+0.008
Neg logit contributions
round-0.030
Whe-0.029
rob-0.026
cream-0.024
icans-0.024
Token pour
Token ID12797
Feature activation+5.22e-02

Pos logit contributions
round+0.027
Whe+0.025
rob+0.023
cream+0.021
Rob+0.020
Neg logit contributions
 big-0.014
 small-0.013
 kind-0.013
 wise-0.013
 true-0.012
Token down
Token ID866
Feature activation-3.54e-01

Pos logit contributions
 big+0.001
 small+0.001
 kind+0.001
 pen+0.001
 drawer+0.001
Neg logit contributions
round-0.002
Whe-0.002
rob-0.002
cream-0.002
Rob-0.002
Token,
Token ID11
Feature activation-9.49e-01

Pos logit contributions
round+0.015
Whe+0.014
rob+0.012
cream+0.012
Mike+0.011
Neg logit contributions
 pen-0.007
 drawer-0.007
 kind-0.007
 small-0.006
 wise-0.006
Token and
Token ID290
Feature activation-6.27e-01

Pos logit contributions
round+0.038
Whe+0.036
rob+0.032
cream+0.029
Rob+0.028
Neg logit contributions
 big-0.021
 small-0.020
 kind-0.020
 pen-0.020
 drawer-0.020
Token the
Token ID262
Feature activation+3.81e-01

Pos logit contributions
round+0.027
Whe+0.026
rob+0.023
cream+0.021
Rob+0.020
Neg logit contributions
 big-0.012
 kind-0.012
 small-0.012
 pen-0.011
 drawer-0.011
Token and
Token ID290
Feature activation+8.55e-02

Pos logit contributions
round+0.019
Whe+0.017
rob+0.015
cream+0.014
hop+0.013
Neg logit contributions
 small-0.009
 kind-0.009
 big-0.009
 drawer-0.009
 wise-0.009
Token said
Token ID531
Feature activation+4.74e-01

Pos logit contributions
 drawer+0.002
 big+0.002
 kind+0.002
 small+0.002
 box+0.002
Neg logit contributions
round-0.003
Whe-0.003
rob-0.003
cream-0.003
Rob-0.002
Token hello
Token ID23748
Feature activation-7.34e-01

Pos logit contributions
 pen+0.010
 drawer+0.010
 file+0.009
 box+0.009
 handy+0.008
Neg logit contributions
round-0.019
Whe-0.018
rob-0.015
 Clown-0.015
cream-0.015
Token to
Token ID284
Feature activation-6.19e-01

Pos logit contributions
round+0.035
Whe+0.033
rob+0.029
cream+0.029
 Clown+0.028
Neg logit contributions
 drawer-0.010
 big-0.010
 pen-0.010
 small-0.010
 kind-0.009
Token the
Token ID262
Feature activation+3.17e-01

Pos logit contributions
round+0.028
Whe+0.026
rob+0.024
cream+0.022
Mike+0.021
Neg logit contributions
 big-0.012
 small-0.011
 kind-0.011
 wise-0.011
 true-0.011
Token bear
Token ID6842
Feature activation-4.52e-01

Pos logit contributions
 big+0.007
 small+0.007
 kind+0.007
 wise+0.007
 pen+0.007
Neg logit contributions
round-0.013
Whe-0.013
rob-0.011
cream-0.010
Mike-0.010
Token.
Token ID13
Feature activation-9.28e-02

Pos logit contributions
round+0.021
Whe+0.019
rob+0.017
cream+0.017
 Clown+0.016
Neg logit contributions
 big-0.007
 small-0.007
 kind-0.007
 drawer-0.007
 pen-0.006
Token
Token ID198
Feature activation-4.76e-01

Pos logit contributions
round+0.004
Whe+0.004
rob+0.004
cream+0.003
robat+0.003
Neg logit contributions
 big-0.002
 small-0.002
 kind-0.002
 wise-0.002
 drawer-0.002
Token
Token ID198
Feature activation+1.87e-01

Pos logit contributions
round+0.019
Whe+0.018
rob+0.017
cream+0.015
Rob+0.014
Neg logit contributions
 big-0.010
 small-0.010
 kind-0.010
 pen-0.010
 drawer-0.010
TokenThe
Token ID464
Feature activation+6.79e-01

Pos logit contributions
 big+0.004
 small+0.004
 kind+0.004
 pen+0.004
 drawer+0.004
Neg logit contributions
round-0.008
Whe-0.007
rob-0.007
cream-0.006
Rob-0.006
Token bear
Token ID6842
Feature activation-3.38e-01

Pos logit contributions
 big+0.015
 small+0.015
 kind+0.015
 wise+0.014
 pen+0.014
Neg logit contributions
round-0.028
Whe-0.027
rob-0.024
cream-0.022
Mike-0.021
Token
Token ID198
Feature activation-6.79e-01

Pos logit contributions
round+0.041
Whe+0.039
rob+0.037
cream+0.034
icans+0.032
Neg logit contributions
 big-0.015
 small-0.015
 kind-0.014
 wise-0.013
 true-0.012
TokenL
Token ID43
Feature activation-1.16e+00

Pos logit contributions
round+0.025
Whe+0.024
rob+0.022
cream+0.019
 Clown+0.018
Neg logit contributions
 big-0.017
 small-0.017
 kind-0.017
 wise-0.016
 pen-0.016
Tokenily
Token ID813
Feature activation-8.41e-01

Pos logit contributions
round+0.051
Whe+0.048
rob+0.046
cream+0.041
 Clown+0.040
Neg logit contributions
 kind-0.022
 big-0.022
 small-0.021
 drawer-0.021
 true-0.020
Token's
Token ID338
Feature activation-1.00e+00

Pos logit contributions
round+0.037
Whe+0.036
rob+0.033
cream+0.030
Rob+0.029
Neg logit contributions
 big-0.015
 small-0.014
 kind-0.014
 pen-0.014
 drawer-0.014
Token holiday
Token ID9912
Feature activation-9.68e-01

Pos logit contributions
round+0.046
Whe+0.045
rob+0.040
cream+0.037
 Clown+0.036
Neg logit contributions
 big-0.015
 small-0.015
 kind-0.015
 pen-0.014
 drawer-0.014
Token was
Token ID373
Feature activation-1.18e+00

Pos logit contributions
round+0.041
Whe+0.039
rob+0.036
cream+0.032
Rob+0.031
Neg logit contributions
 big-0.019
 small-0.019
 kind-0.019
 pen-0.018
 drawer-0.018
Token ruined
Token ID20484
Feature activation-6.02e-01

Pos logit contributions
round+0.049
Whe+0.047
rob+0.042
cream+0.038
Rob+0.036
Neg logit contributions
 big-0.025
 small-0.024
 kind-0.024
 pen-0.023
 drawer-0.023
Token because
Token ID780
Feature activation-1.08e+00

Pos logit contributions
round+0.025
Whe+0.024
rob+0.021
cream+0.019
Mike+0.019
Neg logit contributions
 pen-0.011
 small-0.011
 kind-0.011
 wise-0.011
 big-0.011
Token she
Token ID673
Feature activation-1.08e+00

Pos logit contributions
round+0.050
Whe+0.048
rob+0.044
cream+0.040
Rob+0.038
Neg logit contributions
 big-0.019
 small-0.018
 kind-0.018
 drawer-0.017
 pen-0.017
Token didn
Token ID1422
Feature activation-1.53e+00

Pos logit contributions
round+0.054
Whe+0.052
rob+0.048
cream+0.042
icans+0.042
Neg logit contributions
 big-0.018
 kind-0.017
 small-0.016
 pen-0.015
 can-0.015
Token't
Token ID470
Feature activation-1.25e+00

Pos logit contributions
round+0.060
Whe+0.057
rob+0.053
cream+0.047
icans+0.044
Neg logit contributions
 big-0.039
 small-0.036
 true-0.034
 kind-0.033
 long-0.033
Token to
Token ID284
Feature activation-2.13e+00

Pos logit contributions
round+0.026
Whe+0.025
rob+0.023
cream+0.021
Rob+0.020
Neg logit contributions
 big-0.008
 small-0.008
 kind-0.008
 pen-0.007
 drawer-0.007
Token hide
Token ID7808
Feature activation+3.38e-01

Pos logit contributions
round+0.088
Whe+0.084
rob+0.078
cream+0.070
Rob+0.067
Neg logit contributions
 big-0.047
 kind-0.046
 small-0.045
 wise-0.044
 pen-0.043
Token and
Token ID290
Feature activation-1.70e+00

Pos logit contributions
 big+0.006
 small+0.006
 kind+0.006
 pen+0.006
 drawer+0.006
Neg logit contributions
round-0.015
Whe-0.014
rob-0.013
cream-0.012
Rob-0.011
Token Spot
Token ID15899
Feature activation-1.94e+00

Pos logit contributions
round+0.060
Whe+0.057
rob+0.051
cream+0.045
Rob+0.042
Neg logit contributions
 big-0.046
 small-0.045
 kind-0.045
 pen-0.043
 drawer-0.043
Token liked
Token ID8288
Feature activation-3.58e-01

Pos logit contributions
round+0.088
Whe+0.085
rob+0.078
cream+0.070
Mike+0.068
Neg logit contributions
 big-0.035
 small-0.034
 kind-0.034
 pen-0.032
 wise-0.031
Token to
Token ID284
Feature activation-2.37e+00

Pos logit contributions
round+0.017
Whe+0.016
rob+0.015
cream+0.014
 Clown+0.013
Neg logit contributions
 big-0.005
 small-0.005
 kind-0.005
 pen-0.005
 drawer-0.005
Token find
Token ID1064
Feature activation+9.76e-02

Pos logit contributions
round+0.098
Whe+0.093
rob+0.086
cream+0.078
Rob+0.073
Neg logit contributions
 big-0.053
 kind-0.051
 small-0.050
 wise-0.048
 pen-0.047
Token her
Token ID607
Feature activation-1.70e+00

Pos logit contributions
 big+0.002
 small+0.002
 kind+0.002
 pen+0.002
 drawer+0.002
Neg logit contributions
round-0.004
Whe-0.004
rob-0.003
cream-0.003
Mike-0.003
Token.
Token ID13
Feature activation-4.53e-01

Pos logit contributions
round+0.073
Whe+0.070
rob+0.063
cream+0.057
Rob+0.055
Neg logit contributions
 big-0.034
 small-0.034
 kind-0.033
 pen-0.032
 wise-0.032
Token When
Token ID1649
Feature activation+1.95e-01

Pos logit contributions
round+0.020
Whe+0.019
rob+0.018
Mike+0.016
robat+0.016
Neg logit contributions
 big-0.009
 small-0.008
 kind-0.007
 wise-0.007
 long-0.007
Token Spot
Token ID15899
Feature activation-2.80e+00

Pos logit contributions
 big+0.004
 small+0.004
 kind+0.004
 pen+0.004
 true+0.004
Neg logit contributions
round-0.008
Whe-0.008
rob-0.007
cream-0.006
Mike-0.006
Token.
Token ID13
Feature activation+8.30e-01

Pos logit contributions
round+0.032
Whe+0.029
rob+0.026
cream+0.025
Rob+0.023
Neg logit contributions
 pen-0.012
 kind-0.012
 small-0.012
 drawer-0.012
 big-0.011
Token Let
Token ID3914
Feature activation+1.26e+00

Pos logit contributions
 big+0.022
 small+0.021
 pen+0.020
 kind+0.020
 drawer+0.020
Neg logit contributions
round-0.030
Whe-0.029
rob-0.024
cream-0.023
 Clown-0.022
Token's
Token ID338
Feature activation+4.07e-01

Pos logit contributions
 drawer+0.022
 seed+0.021
 pen+0.021
 kind+0.021
 small+0.020
Neg logit contributions
Whe-0.053
round-0.052
rob-0.046
cream-0.043
Rob-0.042
Token get
Token ID651
Feature activation+1.43e-01

Pos logit contributions
 big+0.007
 small+0.007
 kind+0.007
 pen+0.007
 drawer+0.007
Neg logit contributions
round-0.017
Whe-0.017
rob-0.015
cream-0.014
Rob-0.013
Token the
Token ID262
Feature activation-1.73e+00

Pos logit contributions
 big+0.003
 small+0.002
 kind+0.002
 pen+0.002
 drawer+0.002
Neg logit contributions
round-0.007
Whe-0.006
rob-0.006
cream-0.005
 Clown-0.005
Token m
Token ID285
Feature activation-3.22e+00

Pos logit contributions
round+0.074
Whe+0.072
rob+0.064
cream+0.059
 Clown+0.056
Neg logit contributions
 kind-0.030
 big-0.030
 small-0.030
 pen-0.029
 true-0.029
Tokenop
Token ID404
Feature activation-1.69e+00

Pos logit contributions
round+0.110
Whe+0.109
rob+0.101
icans+0.085
Rob+0.084
Neg logit contributions
 big-0.086
 drawer-0.084
 small-0.082
 pen-0.082
 box-0.082
Token and
Token ID290
Feature activation+1.80e-01

Pos logit contributions
round+0.071
Whe+0.067
rob+0.061
cream+0.055
Rob+0.052
Neg logit contributions
 big-0.034
 small-0.033
 kind-0.033
 pen-0.032
 drawer-0.032
Token clean
Token ID3424
Feature activation-8.07e-01

Pos logit contributions
 big+0.003
 small+0.003
 kind+0.003
 drawer+0.003
 pen+0.003
Neg logit contributions
round-0.007
Whe-0.007
rob-0.006
cream-0.006
Rob-0.006
Token up
Token ID510
Feature activation+9.97e-01

Pos logit contributions
round+0.034
Whe+0.032
rob+0.029
cream+0.027
Mike+0.026
Neg logit contributions
 kind-0.016
 big-0.016
 small-0.015
 pen-0.015
 wise-0.015
Token this
Token ID428
Feature activation-7.99e-01

Pos logit contributions
 pen+0.019
 kind+0.019
 drawer+0.019
 big+0.019
 small+0.019
Neg logit contributions
round-0.042
Whe-0.039
rob-0.035
cream-0.033
Mike-0.031
Token started
Token ID2067
Feature activation+8.95e-01

Pos logit contributions
 big+0.002
 small+0.002
 kind+0.002
 drawer+0.002
 pen+0.002
Neg logit contributions
round-0.003
Whe-0.003
rob-0.003
cream-0.003
Mike-0.002
Token walking
Token ID6155
Feature activation+3.60e-01

Pos logit contributions
 seed+0.019
 drawer+0.018
 pen+0.018
 box+0.018
 file+0.017
Neg logit contributions
round-0.034
Whe-0.032
rob-0.028
cream-0.028
Mike-0.027
Token again
Token ID757
Feature activation+9.22e-01

Pos logit contributions
 big+0.006
 small+0.006
 kind+0.006
 wise+0.006
 pen+0.006
Neg logit contributions
round-0.016
Whe-0.016
rob-0.014
cream-0.013
Rob-0.013
Token.
Token ID13
Feature activation-3.33e-01

Pos logit contributions
 drawer+0.014
 pen+0.014
 kind+0.013
 big+0.013
 box+0.013
Neg logit contributions
round-0.042
Whe-0.040
rob-0.036
cream-0.033
Mike-0.033
Token As
Token ID1081
Feature activation+1.28e+00

Pos logit contributions
round+0.013
rob+0.013
Whe+0.012
robat+0.011
cream+0.010
Neg logit contributions
 small-0.007
 big-0.007
 long-0.007
 kind-0.006
 wise-0.006
Token they
Token ID484
Feature activation-2.41e+00

Pos logit contributions
 big+0.027
 small+0.027
 kind+0.026
 true+0.025
 wise+0.024
Neg logit contributions
round-0.054
Whe-0.051
rob-0.047
cream-0.042
icans-0.040
Token walked
Token ID6807
Feature activation-5.89e-01

Pos logit contributions
round+0.101
Whe+0.099
rob+0.092
ocolate+0.081
icans+0.080
Neg logit contributions
 big-0.049
 small-0.046
 kind-0.045
 can-0.042
 will-0.040
Token,
Token ID11
Feature activation+9.51e-02

Pos logit contributions
round+0.028
Whe+0.028
rob+0.026
cream+0.024
 Clown+0.023
Neg logit contributions
 big-0.009
 small-0.008
 kind-0.008
 wise-0.007
 box-0.007
Token Rex
Token ID17853
Feature activation-1.75e+00

Pos logit contributions
 big+0.002
 small+0.002
 long+0.002
 kind+0.002
 true+0.002
Neg logit contributions
round-0.004
Whe-0.004
rob-0.004
cream-0.003
robat-0.003
Token noticed
Token ID6810
Feature activation-6.90e-01

Pos logit contributions
round+0.071
Whe+0.069
rob+0.063
cream+0.056
Mike+0.052
Neg logit contributions
 big-0.038
 small-0.038
 kind-0.037
 wise-0.035
 pen-0.035
Token that
Token ID326
Feature activation-7.87e-01

Pos logit contributions
round+0.030
Whe+0.029
rob+0.026
cream+0.023
Rob+0.023
Neg logit contributions
 big-0.013
 small-0.013
 kind-0.013
 wise-0.012
 pen-0.012
Token closed
Token ID4838
Feature activation+5.61e-01

Pos logit contributions
round+0.056
Whe+0.053
rob+0.048
cream+0.044
Rob+0.042
Neg logit contributions
 big-0.027
 small-0.026
 kind-0.026
 pen-0.025
 drawer-0.025
Token the
Token ID262
Feature activation-2.23e-01

Pos logit contributions
 kind+0.011
 big+0.011
 pen+0.011
 small+0.010
 wise+0.010
Neg logit contributions
round-0.023
Whe-0.022
rob-0.020
cream-0.019
Mike-0.018
Token doors
Token ID8215
Feature activation+1.69e+00

Pos logit contributions
round+0.008
Whe+0.007
rob+0.007
cream+0.006
 desserts+0.006
Neg logit contributions
 true-0.005
 feat-0.004
rick-0.004
ook-0.004
Ru-0.004
Token.
Token ID13
Feature activation-6.73e-01

Pos logit contributions
 pen+0.038
Ru+0.037
 seed+0.036
 diary+0.036
 Po+0.036
Neg logit contributions
round-0.063
Whe-0.050
cream-0.046
ding-0.045
hop-0.044
Token Tom
Token ID4186
Feature activation-1.04e+00

Pos logit contributions
round+0.031
Whe+0.030
rob+0.029
robat+0.025
cream+0.025
Neg logit contributions
 big-0.012
 small-0.012
 kind-0.011
 wise-0.010
 long-0.010
Token and
Token ID290
Feature activation-2.62e+00

Pos logit contributions
round+0.047
Whe+0.045
rob+0.041
cream+0.037
Mike+0.036
Neg logit contributions
 big-0.018
 small-0.017
 kind-0.017
 pen-0.017
 drawer-0.016
Token Sue
Token ID26089
Feature activation-1.99e+00

Pos logit contributions
round+0.105
Whe+0.102
rob+0.098
robat+0.082
icans+0.080
Neg logit contributions
 big-0.063
 kind-0.063
 small-0.062
 wise-0.056
 box-0.054
Token sat
Token ID3332
Feature activation-4.82e-01

Pos logit contributions
round+0.076
Whe+0.073
rob+0.066
cream+0.058
Mike+0.055
Neg logit contributions
 big-0.049
 kind-0.048
 small-0.048
 pen-0.046
 drawer-0.046
Token on
Token ID319
Feature activation-7.23e-01

Pos logit contributions
round+0.022
Whe+0.021
rob+0.019
cream+0.017
Rob+0.017
Neg logit contributions
 big-0.008
 small-0.008
 kind-0.008
 pen-0.008
 drawer-0.008
Token their
Token ID511
Feature activation-9.08e-01

Pos logit contributions
round+0.033
Whe+0.032
rob+0.029
cream+0.027
Mike+0.026
Neg logit contributions
 big-0.011
 kind-0.011
 small-0.011
 pen-0.010
 drawer-0.010
Token beds
Token ID20237
Feature activation+5.70e-01

Pos logit contributions
round+0.031
Whe+0.029
rob+0.026
cream+0.023
 Clown+0.021
Neg logit contributions
 big-0.025
 kind-0.025
 small-0.024
 pen-0.024
 true-0.024
Token tall
Token ID7331
Feature activation-4.05e-01

Pos logit contributions
 pen+0.006
 drawer+0.006
 diary+0.006
 craft+0.006
 file+0.006
Neg logit contributions
round-0.012
Whe-0.011
Ste-0.009
rob-0.009
cream-0.009
Token and
Token ID290
Feature activation+2.94e-01

Pos logit contributions
round+0.015
Whe+0.013
rob+0.013
cream+0.011
 coaster+0.011
Neg logit contributions
 true-0.009
 pen-0.009
 drawer-0.009
 file-0.009
 small-0.009
Token hard
Token ID1327
Feature activation-5.04e-01

Pos logit contributions
 pen+0.005
 drawer+0.005
 diary+0.005
olving+0.005
 true+0.005
Neg logit contributions
round-0.012
Whe-0.011
rob-0.010
cream-0.010
Mike-0.009
Token.
Token ID13
Feature activation-1.53e+00

Pos logit contributions
round+0.023
Whe+0.021
rob+0.020
cream+0.019
seek+0.018
Neg logit contributions
 kind-0.008
 true-0.007
 wise-0.007
 pen-0.007
 drawer-0.007
Token Sara
Token ID24799
Feature activation-7.12e-01

Pos logit contributions
round+0.065
rob+0.063
Whe+0.059
robat+0.057
cream+0.052
Neg logit contributions
 small-0.034
 big-0.033
 long-0.030
 green-0.028
 kind-0.028
Token and
Token ID290
Feature activation-2.51e+00

Pos logit contributions
round+0.031
Whe+0.030
rob+0.027
cream+0.024
Rob+0.023
Neg logit contributions
 big-0.013
 small-0.013
 kind-0.013
 pen-0.012
 drawer-0.012
Token Tom
Token ID4186
Feature activation-1.08e+00

Pos logit contributions
round+0.105
Whe+0.095
rob+0.092
cream+0.081
Rob+0.078
Neg logit contributions
 big-0.056
 kind-0.054
 small-0.054
 wise-0.049
 long-0.048
Token could
Token ID714
Feature activation-5.34e-01

Pos logit contributions
round+0.045
Whe+0.043
rob+0.040
cream+0.035
 Clown+0.034
Neg logit contributions
 big-0.024
 small-0.023
 kind-0.023
 wise-0.022
 pen-0.022
Token not
Token ID407
Feature activation-1.03e+00

Pos logit contributions
round+0.026
Whe+0.025
rob+0.024
cream+0.022
Rob+0.021
Neg logit contributions
 big-0.008
 small-0.008
 kind-0.007
 wise-0.007
 true-0.006
Token see
Token ID766
Feature activation+7.40e-02

Pos logit contributions
round+0.048
Whe+0.047
rob+0.047
cream+0.042
Mike+0.039
Neg logit contributions
 big-0.017
 small-0.016
 kind-0.015
 wise-0.014
 true-0.014
Token what
Token ID644
Feature activation-3.20e-02

Pos logit contributions
 kind+0.001
 pen+0.001
 big+0.001
 drawer+0.001
 small+0.001
Neg logit contributions
round-0.003
Whe-0.003
rob-0.003
cream-0.003
 Clown-0.003