TOP ACTIVATIONS
MAX = 4.604

a little crab called Sc amp i . Sc amp i
Sc amp i . Sc amp i was unique because it
a young rabbit named Sc amp i . Sc amp i
was a little girl called Sof ia . Sof ia was
ch ubby little fish named Fif i . She lived in
a unique fle a named Fif i . Fif i lived
park . Lily was the seeker and Ben was the h
seek . Tom was the seeker and Lily was the h
Sc amp i . Sc amp i hopped around her garden
a big , fit tr amp oline ! Sarah 's mom
a big , fit tr amp oline ! Sarah 's mom
a strong lizard named T az . He lived happily among
garden . Ben was the seeker and Lily was the h
park . Lily was the seeker and Tom was the h
park . Tom was the seeker and Lily was the h
. " Which locked door ?" uncle Bob asked
seek . Ben was the seeker and Lily was the h
park . Anna was the seeker and Ben was the h
house . Lily was the seeker and Max was the h
park . Lily was the seeker and Max was the h

MIN ACTIVATIONS
MIN = -3.925

red cape and make - bel ieve he had super powers
Emily felt encouraged by her pap a 's words . Every
He would race make - bel ieve cars around the track
she woke up she felt refreshed . She felt happy and
loved to play make - bel ieve . Today she was
too ?" Her pap a smiled and said ,
prevent Katie from writing her poems . They did not understand
. Jan ie shared her creations with all her family and
the key to achieving her dreams . She was no longer
liked to play make - bel ieve . One
her tutor . When her parents found out , they were
was ready for make - bel ieve fun . <|endoftext|> Once
stop him from achieving his dreams . <|endoftext|> It was raining
and they played make - bel ieve . Finally , they
skill . Katie could write poems ! One day someone tried
- up and make - bel ieve . One day ,
He loved playing make - bel ieve with his friends .
slipped away . Emma 's pap a quickly grabbed it .
thank you gift . After one hour , the girl waited
a world of make - bel ieve . She

INTERVAL 2.472 - 4.604
CONTAINS 0.014%

returned to the books he lf . The moral
was a girl named Mag gy . Mag gy was very
played together with Lily 's pink belt . From that day
Sam . He learned that black ink was not for drawing
years old and he loved aer opl anes !

INTERVAL 0.339 - 2.472
CONTAINS 28.339%

you all about jokes . They can make people feel happy
plants in the area were very happy . They were not
� � € � � Hello there ! What â €
a long drive . The motorcycle was happy to be moving
upon a time , there was a brave animal . His

INTERVAL -1.793 - 0.339
CONTAINS 71.365%

room , pretending to fight and chase each other . They
nice . He will be gentle and kind . And the
would take Jack 's carrots and never said please or thank
a lovely cake for dessert . Lily was so excited to
. The elephant noticed Lily and Max and their mom .

INTERVAL -3.925 - -1.793
CONTAINS 0.282%

fog gy and Jessica felt a bit scared . Suddenly ,
adventure ! â € <|endoftext|> Once there was a little boy
His mom smiled . " That 's lovely , Jack
on a farm . Her parents had lots of animals ,
day on , Tim my learned to take better care of
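
The four INTERVAL ranges above look like equal-width bins between MIN = -3.925 and MAX = 4.604. This is inferred from the printed edges, not stated anywhere in the dump. A minimal sketch under that assumption is below; the helper name `equal_width_edges` is hypothetical and not part of any tool that produced this output.

```python
# Sketch only: assumes the INTERVAL ranges are equal-width bins
# spanning the reported MIN and MAX activations.

def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced bin edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# MIN and MAX as printed in the dump.
edges = equal_width_edges(-3.925, 4.604, 4)

# Print each bin in the same "INTERVAL a - b" style the dump uses.
for left, right in zip(edges[:-1], edges[1:]):
    print(f"INTERVAL {left:.3f} - {right:.3f}")

# The CONTAINS percentages, listed from the highest interval to the lowest.
contains = [0.014, 28.339, 71.365, 0.282]
total = sum(contains)
```

Under this reading the interior edges come out within rounding of the printed -1.793, 0.339 and 2.472, and the four CONTAINS percentages sum to roughly 100%, so the intervals appear to cover every sampled activation.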
Token a
Token ID257
Feature activation-3.81e-01

Pos logit contributions
kin+0.013
pan+0.013
aver+0.012
 tray+0.012
 bowl+0.011
Neg logit contributions
years-0.009
but-0.009
What-0.008
 after-0.008
to-0.008
Token little
Token ID1310
Feature activation-7.32e-01

Pos logit contributions
What+0.010
years+0.010
 after+0.010
Ho+0.009
Why+0.009
Neg logit contributions
pan-0.011
kin-0.011
 happens-0.010
aver-0.010
 teaches-0.009
Token crab
Token ID32202
Feature activation+4.05e-01

Pos logit contributions
 after+0.017
years+0.017
What+0.016
 him+0.016
Ho+0.015
Neg logit contributions
pan-0.023
kin-0.023
aver-0.021
 teaches-0.021
 happens-0.021
Token called
Token ID1444
Feature activation+7.08e-01

Pos logit contributions
kin+0.011
pan+0.011
aver+0.010
 tray+0.009
 bowl+0.009
Neg logit contributions
years-0.012
but-0.011
 after-0.011
What-0.011
and-0.011
Token Sc
Token ID1446
Feature activation+2.57e+00

Pos logit contributions
kin+0.024
pan+0.023
aver+0.022
 happens+0.020
 teaches+0.020
Neg logit contributions
 him-0.016
years-0.016
 after-0.015
to-0.015
but-0.014
Tokenamp
Token ID696
Feature activation+4.60e+00

Pos logit contributions
kin+0.082
pan+0.079
aver+0.077
 tray+0.072
 bowl+0.071
Neg logit contributions
years-0.063
but-0.059
What-0.059
 after-0.058
 him-0.056
Tokeni
Token ID72
Feature activation+1.55e+00

Pos logit contributions
kin+0.133
pan+0.129
aver+0.119
 tray+0.115
 bowl+0.114
Neg logit contributions
years-0.129
 him-0.127
but-0.124
 after-0.122
to-0.120
Token.
Token ID13
Feature activation-8.79e-01

Pos logit contributions
kin+0.052
pan+0.051
aver+0.047
 tray+0.046
 happens+0.046
Neg logit contributions
years-0.036
but-0.033
 after-0.033
 him-0.032
and-0.032
Token Sc
Token ID1446
Feature activation+1.82e+00

Pos logit contributions
years+0.023
and+0.020
but+0.020
unct+0.020
to+0.020
Neg logit contributions
aver-0.026
kin-0.026
pan-0.025
 bowl-0.023
 happens-0.023
Tokenamp
Token ID696
Feature activation+4.19e+00

Pos logit contributions
kin+0.058
aver+0.056
pan+0.054
 tray+0.050
 bowl+0.049
Neg logit contributions
years-0.042
What-0.041
 after-0.040
but-0.040
what-0.039
Tokeni
Token ID72
Feature activation+8.53e-01

Pos logit contributions
kin+0.122
pan+0.120
aver+0.112
 tray+0.108
 bowl+0.105
Neg logit contributions
years-0.115
but-0.110
 him-0.109
 after-0.108
to-0.107
Token Sc
Token ID1446
Feature activation+2.57e+00

Pos logit contributions
kin+0.024
pan+0.023
aver+0.022
 happens+0.020
 teaches+0.020
Neg logit contributions
 him-0.016
years-0.016
 after-0.015
to-0.015
but-0.014
Tokenamp
Token ID696
Feature activation+4.60e+00

Pos logit contributions
kin+0.082
pan+0.079
aver+0.077
 tray+0.072
 bowl+0.071
Neg logit contributions
years-0.063
but-0.059
What-0.059
 after-0.058
 him-0.056
Tokeni
Token ID72
Feature activation+1.55e+00

Pos logit contributions
kin+0.133
pan+0.129
aver+0.119
 tray+0.115
 bowl+0.114
Neg logit contributions
years-0.129
 him-0.127
but-0.124
 after-0.122
to-0.120
Token.
Token ID13
Feature activation-8.79e-01

Pos logit contributions
kin+0.052
pan+0.051
aver+0.047
 tray+0.046
 happens+0.046
Neg logit contributions
years-0.036
but-0.033
 after-0.033
 him-0.032
and-0.032
Token Sc
Token ID1446
Feature activation+1.82e+00

Pos logit contributions
years+0.023
and+0.020
but+0.020
unct+0.020
to+0.020
Neg logit contributions
aver-0.026
kin-0.026
pan-0.025
 bowl-0.023
 happens-0.023
Tokenamp
Token ID696
Feature activation+4.19e+00

Pos logit contributions
kin+0.058
aver+0.056
pan+0.054
 tray+0.050
 bowl+0.049
Neg logit contributions
years-0.042
What-0.041
 after-0.040
but-0.040
what-0.039
Tokeni
Token ID72
Feature activation+8.53e-01

Pos logit contributions
kin+0.122
pan+0.120
aver+0.112
 tray+0.108
 bowl+0.105
Neg logit contributions
years-0.115
but-0.110
 him-0.109
 after-0.108
to-0.107
Token was
Token ID373
Feature activation-2.21e-01

Pos logit contributions
pan+0.029
kin+0.027
 tray+0.025
aver+0.024
 bowl+0.023
Neg logit contributions
years-0.020
 him-0.019
 after-0.019
but-0.019
to-0.019
Token unique
Token ID3748
Feature activation+1.08e+00

Pos logit contributions
years+0.004
but+0.004
 afford+0.004
and+0.004
to+0.003
Neg logit contributions
kin-0.009
aver-0.008
pan-0.008
 bowl-0.008
 tray-0.008
Token because
Token ID780
Feature activation+8.33e-02

Pos logit contributions
kin+0.041
pan+0.040
aver+0.039
 tray+0.037
 bowl+0.036
Neg logit contributions
years-0.020
but-0.019
 after-0.019
What-0.018
and-0.018
Token it
Token ID340
Feature activation-2.13e-01

Pos logit contributions
kin+0.003
pan+0.003
aver+0.003
 bowl+0.002
 tray+0.002
Neg logit contributions
years-0.002
but-0.002
What-0.002
to-0.002
 after-0.002
Token a
Token ID257
Feature activation-2.14e-01

Pos logit contributions
kin+0.008
pan+0.008
aver+0.008
 tray+0.007
 bowl+0.007
Neg logit contributions
years-0.006
but-0.006
What-0.006
 after-0.006
to-0.006
Token young
Token ID1862
Feature activation-7.12e-01

Pos logit contributions
years+0.006
What+0.006
 after+0.006
Ho+0.005
but+0.005
Neg logit contributions
pan-0.006
kin-0.006
aver-0.005
 happens-0.005
 teaches-0.005
Token rabbit
Token ID22746
Feature activation+5.06e-01

Pos logit contributions
 after+0.016
years+0.015
What+0.014
 knowing+0.014
Ho+0.014
Neg logit contributions
pan-0.024
kin-0.023
 happens-0.023
 teaches-0.022
aver-0.021
Token named
Token ID3706
Feature activation-2.06e-01

Pos logit contributions
kin+0.016
pan+0.015
aver+0.014
 happens+0.014
 tray+0.013
Neg logit contributions
years-0.013
 after-0.012
but-0.012
 him-0.012
What-0.012
Token Sc
Token ID1446
Feature activation+1.80e+00

Pos logit contributions
years+0.006
but+0.005
What+0.005
 after+0.005
to+0.005
Neg logit contributions
kin-0.006
pan-0.006
aver-0.006
 tray-0.005
 bowl-0.005
Tokenamp
Token ID696
Feature activation+3.88e+00

Pos logit contributions
kin+0.051
aver+0.049
pan+0.048
 tray+0.044
 bowl+0.043
Neg logit contributions
What-0.048
years-0.048
 after-0.048
but-0.046
what-0.046
Tokeni
Token ID72
Feature activation+9.59e-01

Pos logit contributions
kin+0.126
pan+0.124
aver+0.113
 tray+0.109
 bowl+0.108
Neg logit contributions
years-0.095
 him-0.095
but-0.091
 after-0.090
What-0.088
Token.
Token ID13
Feature activation-9.69e-01

Pos logit contributions
kin+0.031
pan+0.030
aver+0.029
 tray+0.027
 bowl+0.027
Neg logit contributions
years-0.023
but-0.022
 after-0.021
What-0.021
to-0.021
Token Sc
Token ID1446
Feature activation+1.20e+00

Pos logit contributions
years+0.024
and+0.022
but+0.022
unct+0.022
 afford+0.022
Neg logit contributions
aver-0.029
kin-0.028
pan-0.028
 bowl-0.026
 snake-0.026
Tokenamp
Token ID696
Feature activation+3.57e+00

Pos logit contributions
kin+0.037
aver+0.036
pan+0.034
ool+0.032
case+0.031
Neg logit contributions
What-0.029
 after-0.028
years-0.028
Why-0.027
what-0.027
Tokeni
Token ID72
Feature activation+5.40e-01

Pos logit contributions
kin+0.099
pan+0.098
aver+0.090
 tray+0.086
 bowl+0.084
Neg logit contributions
years-0.104
 him-0.100
but-0.100
 after-0.098
to-0.097
Token was
Token ID373
Feature activation+3.20e-01

Pos logit contributions
pan+0.005
kin+0.004
aver+0.004
 tray+0.004
 broom+0.004
Neg logit contributions
 him-0.003
What-0.003
 after-0.003
years-0.003
to-0.003
Token a
Token ID257
Feature activation-3.32e-01

Pos logit contributions
kin+0.010
pan+0.010
aver+0.010
 tray+0.009
 bowl+0.009
Neg logit contributions
years-0.008
but-0.007
What-0.007
to-0.007
 after-0.007
Token little
Token ID1310
Feature activation-8.09e-01

Pos logit contributions
years+0.009
What+0.009
 after+0.009
Ho+0.008
but+0.008
Neg logit contributions
pan-0.009
kin-0.009
aver-0.008
 happens-0.008
 teaches-0.008
Token girl
Token ID2576
Feature activation+4.79e-02

Pos logit contributions
years+0.020
 after+0.019
What+0.019
 him+0.018
Ho+0.017
Neg logit contributions
pan-0.025
kin-0.025
aver-0.023
 teaches-0.022
 happens-0.022
Token called
Token ID1444
Feature activation+7.56e-01

Pos logit contributions
kin+0.001
pan+0.001
aver+0.001
 tray+0.001
 happens+0.001
Neg logit contributions
years-0.001
 after-0.001
but-0.001
What-0.001
 him-0.001
Token Sof
Token ID41427
Feature activation+3.84e+00

Pos logit contributions
kin+0.023
pan+0.022
aver+0.021
 teaches+0.019
 happens+0.019
Neg logit contributions
years-0.020
 after-0.019
and-0.019
 him-0.018
but-0.018
Tokenia
Token ID544
Feature activation+1.08e+00

Pos logit contributions
pan+0.077
kin+0.067
 tray+0.058
 bowl+0.058
aver+0.058
Neg logit contributions
years-0.140
to-0.140
and-0.138
 him-0.135
 afford-0.135
Token.
Token ID13
Feature activation-8.78e-01

Pos logit contributions
kin+0.037
pan+0.036
aver+0.034
 tray+0.032
 happens+0.032
Neg logit contributions
years-0.024
but-0.022
 after-0.022
and-0.022
 him-0.022
Token Sof
Token ID41427
Feature activation+2.65e+00

Pos logit contributions
years+0.017
but+0.014
unct+0.014
 afford+0.014
to+0.014
Neg logit contributions
kin-0.032
pan-0.030
aver-0.030
 happens-0.030
 bowl-0.030
Tokenia
Token ID544
Feature activation-6.10e-02

Pos logit contributions
pan+0.052
kin+0.045
aver+0.039
 tray+0.039
 bowl+0.038
Neg logit contributions
years-0.101
to-0.097
 him-0.096
and-0.095
but-0.095
Token was
Token ID373
Feature activation-1.39e-01

Pos logit contributions
years+0.001
 him+0.001
 after+0.001
What+0.001
but+0.001
Neg logit contributions
pan-0.002
kin-0.002
aver-0.002
 tray-0.002
 bowl-0.002
Token ch
Token ID442
Feature activation+4.61e-01

Pos logit contributions
What+0.003
years+0.003
 after+0.003
Ho+0.003
 afford+0.003
Neg logit contributions
pan-0.004
kin-0.004
 happens-0.003
aver-0.003
 teaches-0.003
Tokenubby
Token ID38393
Feature activation+8.53e-01

Pos logit contributions
pan+0.011
 happens+0.010
kin+0.010
 teaches+0.010
 tray+0.010
Neg logit contributions
years-0.015
ind-0.014
away-0.013
ory-0.013
 after-0.013
Token little
Token ID1310
Feature activation-7.27e-01

Pos logit contributions
pan+0.024
 happens+0.022
kin+0.022
 teaches+0.021
bench+0.019
Neg logit contributions
 after-0.023
ory-0.022
Ho-0.022
 knowing-0.022
 him-0.022
Token fish
Token ID5916
Feature activation-4.30e-01

Pos logit contributions
years+0.019
 after+0.019
 him+0.018
What+0.018
Ho+0.017
Neg logit contributions
pan-0.021
kin-0.021
aver-0.019
 teaches-0.018
 happens-0.018
Token named
Token ID3706
Feature activation-2.39e-01

Pos logit contributions
years+0.014
but+0.013
What+0.013
to+0.012
 after+0.012
Neg logit contributions
kin-0.011
aver-0.010
pan-0.010
 bowl-0.009
 tray-0.009
Token Fif
Token ID12830
Feature activation+3.64e+00

Pos logit contributions
 him+0.005
years+0.005
 after+0.005
What+0.004
to+0.004
Neg logit contributions
kin-0.009
pan-0.009
aver-0.008
 happens-0.008
 teaches-0.007
Tokeni
Token ID72
Feature activation+8.66e-01

Pos logit contributions
kin+0.192
pan+0.190
aver+0.180
 teaches+0.177
 tray+0.177
Neg logit contributions
years-0.017
 him-0.009
to-0.009
i-0.008
unct-0.008
Token.
Token ID13
Feature activation-8.71e-01

Pos logit contributions
kin+0.029
pan+0.028
aver+0.026
 tray+0.025
 bowl+0.025
Neg logit contributions
years-0.020
but-0.019
 after-0.019
 him-0.019
to-0.018
Token She
Token ID1375
Feature activation-6.18e-01

Pos logit contributions
years+0.022
 afford+0.020
but+0.020
unct+0.019
to+0.019
Neg logit contributions
kin-0.026
aver-0.025
pan-0.025
 snake-0.025
 bowl-0.025
Token lived
Token ID5615
Feature activation-2.08e-01

Pos logit contributions
years+0.014
 him+0.014
 after+0.013
What+0.013
Ho+0.013
Neg logit contributions
pan-0.022
kin-0.020
 tray-0.019
 bowl-0.018
aver-0.018
Token in
Token ID287
Feature activation-1.43e-01

Pos logit contributions
years+0.004
but+0.003
 afford+0.003
What+0.003
to+0.003
Neg logit contributions
kin-0.008
aver-0.008
pan-0.008
 bowl-0.007
 tray-0.007
Token a
Token ID257
Feature activation-2.31e-01

Pos logit contributions
kin+0.012
pan+0.012
aver+0.011
 tray+0.011
 bowl+0.011
Neg logit contributions
years-0.009
but-0.008
What-0.008
 after-0.008
to-0.008
Token unique
Token ID3748
Feature activation+4.88e-01

Pos logit contributions
years+0.006
What+0.006
 after+0.006
Ho+0.006
but+0.006
Neg logit contributions
pan-0.007
kin-0.006
 happens-0.006
aver-0.006
 teaches-0.006
Token fle
Token ID5104
Feature activation+5.94e-01

Pos logit contributions
pan+0.012
kin+0.011
 happens+0.010
aver+0.010
 teaches+0.010
Neg logit contributions
 after-0.015
What-0.014
 knowing-0.014
years-0.014
 afford-0.014
Tokena
Token ID64
Feature activation+5.27e-02

Pos logit contributions
pan+0.021
kin+0.020
aver+0.019
 tray+0.019
 bowl+0.019
Neg logit contributions
years-0.013
but-0.012
and-0.012
to-0.011
 after-0.011
Token named
Token ID3706
Feature activation-1.57e-02

Pos logit contributions
pan+0.002
kin+0.002
 happens+0.002
 tray+0.002
aver+0.002
Neg logit contributions
 after-0.001
years-0.001
but-0.001
 him-0.001
and-0.001
Token Fif
Token ID12830
Feature activation+3.62e+00

Pos logit contributions
years+0.000
 after+0.000
 him+0.000
What+0.000
but+0.000
Neg logit contributions
kin-0.001
pan-0.001
aver-0.000
 tray-0.000
 teaches-0.000
Tokeni
Token ID72
Feature activation+7.61e-01

Pos logit contributions
kin+0.180
pan+0.180
aver+0.170
 tray+0.166
 teaches+0.165
Neg logit contributions
years-0.026
 him-0.017
to-0.016
but-0.016
 after-0.015
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
kin+0.026
pan+0.026
aver+0.023
 tray+0.023
 happens+0.023
Neg logit contributions
years-0.017
but-0.016
 him-0.016
 after-0.016
to-0.015
Token Fif
Token ID12830
Feature activation+2.58e+00

Pos logit contributions
years+0.025
 afford+0.022
but+0.022
unct+0.022
to+0.021
Neg logit contributions
kin-0.033
pan-0.032
aver-0.032
 snake-0.031
 bowl-0.031
Tokeni
Token ID72
Feature activation+7.85e-02

Pos logit contributions
kin+0.128
pan+0.125
aver+0.122
 tray+0.117
 bowl+0.117
Neg logit contributions
years-0.019
but-0.014
What-0.013
 after-0.012
to-0.012
Token lived
Token ID5615
Feature activation-3.49e-01

Pos logit contributions
pan+0.003
kin+0.002
aver+0.002
 tray+0.002
 bowl+0.002
Neg logit contributions
years-0.002
 him-0.002
 after-0.002
but-0.002
What-0.002
Token park
Token ID3952
Feature activation+1.45e+00

Pos logit contributions
years+0.005
but+0.004
to+0.004
and+0.004
 after+0.004
Neg logit contributions
kin-0.002
pan-0.002
aver-0.002
 bowl-0.002
 tray-0.002
Token.
Token ID13
Feature activation+9.26e-02

Pos logit contributions
kin+0.039
pan+0.034
aver+0.033
 tray+0.032
 bowl+0.032
Neg logit contributions
years-0.046
unct-0.043
to-0.042
What-0.042
but-0.041
Token Lily
Token ID20037
Feature activation+5.60e-01

Pos logit contributions
kin+0.003
pan+0.003
aver+0.003
 tray+0.003
 happens+0.003
Neg logit contributions
years-0.002
What-0.002
but-0.002
 after-0.002
 him-0.002
Token was
Token ID373
Feature activation+1.81e-01

Pos logit contributions
pan+0.020
kin+0.020
aver+0.018
 tray+0.017
 hawk+0.017
Neg logit contributions
years-0.012
 after-0.012
 him-0.011
but-0.011
 quite-0.011
Token the
Token ID262
Feature activation+3.72e-01

Pos logit contributions
kin+0.007
pan+0.006
aver+0.006
 tray+0.006
 bowl+0.006
Neg logit contributions
years-0.004
but-0.003
What-0.003
to-0.003
 after-0.003
Token seeker
Token ID45993
Feature activation+3.59e+00

Pos logit contributions
kin+0.008
aver+0.007
 snake+0.007
 tray+0.007
 bowl+0.006
Neg logit contributions
years-0.014
to-0.014
and-0.013
 afford-0.013
but-0.013
Token and
Token ID290
Feature activation+1.95e+00

Pos logit contributions
 teaches+0.158
 happens+0.157
pan+0.156
kin+0.155
 explodes+0.149
Neg logit contributions
 after-0.045
but-0.037
 when-0.036
 since-0.035
 because-0.032
Token Ben
Token ID3932
Feature activation+1.76e-01

Pos logit contributions
kin+0.053
pan+0.048
aver+0.047
 tray+0.043
 bowl+0.043
Neg logit contributions
years-0.059
but-0.056
What-0.055
to-0.055
and-0.055
Token was
Token ID373
Feature activation+2.98e-01

Pos logit contributions
pan+0.007
kin+0.006
aver+0.006
 hawk+0.006
 tray+0.006
Neg logit contributions
 after-0.003
 him-0.003
 quite-0.003
 really-0.003
 like-0.003
Token the
Token ID262
Feature activation+4.42e-01

Pos logit contributions
kin+0.011
pan+0.011
aver+0.010
 tray+0.010
 bowl+0.010
Neg logit contributions
years-0.007
 after-0.006
What-0.006
but-0.006
 him-0.006
Token h
Token ID289
Feature activation+1.47e+00

Pos logit contributions
kin+0.012
pan+0.011
aver+0.011
 tray+0.010
 bowl+0.010
Neg logit contributions
years-0.014
but-0.013
to-0.013
 afford-0.012
What-0.012
Token seek
Token ID5380
Feature activation+9.45e-01

Pos logit contributions
years+0.001
but+0.001
to+0.001
 int+0.001
 afford+0.001
Neg logit contributions
kin-0.001
pan-0.001
aver-0.001
 bowl-0.001
 tray-0.001
Token.
Token ID13
Feature activation+6.33e-01

Pos logit contributions
kin+0.034
pan+0.034
aver+0.032
 happens+0.030
 bowl+0.030
Neg logit contributions
years-0.019
 after-0.018
but-0.018
What-0.017
 him-0.017
Token Tom
Token ID4186
Feature activation+2.60e-01

Pos logit contributions
kin+0.019
pan+0.019
aver+0.018
 bowl+0.018
 tray+0.017
Neg logit contributions
years-0.017
but-0.015
unct-0.015
to-0.014
and-0.014
Token was
Token ID373
Feature activation+3.76e-01

Pos logit contributions
pan+0.009
kin+0.009
aver+0.008
 tray+0.007
 hawk+0.007
Neg logit contributions
years-0.006
 after-0.006
 him-0.006
to-0.006
What-0.006
Token the
Token ID262
Feature activation+4.85e-01

Pos logit contributions
kin+0.013
aver+0.013
pan+0.013
 tray+0.012
 bowl+0.012
Neg logit contributions
years-0.008
but-0.007
to-0.007
 afford-0.007
and-0.007
Token seeker
Token ID45993
Feature activation+3.59e+00

Pos logit contributions
kin+0.011
aver+0.010
 tray+0.009
 snake+0.009
 bowl+0.009
Neg logit contributions
years-0.018
to-0.017
 afford-0.017
but-0.016
and-0.016
Token and
Token ID290
Feature activation+1.86e+00

Pos logit contributions
pan+0.155
kin+0.155
 teaches+0.152
 happens+0.152
aver+0.146
Neg logit contributions
 after-0.047
but-0.038
 when-0.037
 since-0.036
 him-0.035
Token Lily
Token ID20037
Feature activation+8.51e-01

Pos logit contributions
kin+0.054
pan+0.053
aver+0.049
 tray+0.047
 bowl+0.046
Neg logit contributions
years-0.052
but-0.048
 after-0.047
What-0.047
 him-0.047
Token was
Token ID373
Feature activation+4.69e-01

Pos logit contributions
pan+0.036
kin+0.033
 hawk+0.031
aver+0.029
 tray+0.029
Neg logit contributions
 after-0.016
 him-0.014
 quite-0.014
 really-0.014
 like-0.013
Token the
Token ID262
Feature activation+5.54e-01

Pos logit contributions
kin+0.017
pan+0.017
aver+0.016
 tray+0.015
 bowl+0.015
Neg logit contributions
years-0.010
but-0.009
 after-0.009
What-0.009
 him-0.008
Token h
Token ID289
Feature activation+1.35e+00

Pos logit contributions
kin+0.015
pan+0.014
aver+0.014
 tray+0.013
 bowl+0.013
Neg logit contributions
years-0.017
but-0.016
What-0.015
to-0.015
 after-0.015
Token Sc
Token ID1446
Feature activation+1.80e+00

Pos logit contributions
years+0.006
but+0.005
What+0.005
 after+0.005
to+0.005
Neg logit contributions
kin-0.006
pan-0.006
aver-0.006
 tray-0.005
 bowl-0.005
Tokenamp
Token ID696
Feature activation+3.88e+00

Pos logit contributions
kin+0.051
aver+0.049
pan+0.048
 tray+0.044
 bowl+0.043
Neg logit contributions
What-0.048
years-0.048
 after-0.048
but-0.046
what-0.046
Tokeni
Token ID72
Feature activation+9.59e-01

Pos logit contributions
kin+0.126
pan+0.124
aver+0.113
 tray+0.109
 bowl+0.108
Neg logit contributions
years-0.095
 him-0.095
but-0.091
 after-0.090
What-0.088
Token.
Token ID13
Feature activation-9.69e-01

Pos logit contributions
kin+0.031
pan+0.030
aver+0.029
 tray+0.027
 bowl+0.027
Neg logit contributions
years-0.023
but-0.022
 after-0.021
What-0.021
to-0.021
Token Sc
Token ID1446
Feature activation+1.20e+00

Pos logit contributions
years+0.024
and+0.022
but+0.022
unct+0.022
 afford+0.022
Neg logit contributions
aver-0.029
kin-0.028
pan-0.028
 bowl-0.026
 snake-0.026
Tokenamp
Token ID696
Feature activation+3.57e+00

Pos logit contributions
kin+0.037
aver+0.036
pan+0.034
ool+0.032
case+0.031
Neg logit contributions
What-0.029
 after-0.028
years-0.028
Why-0.027
what-0.027
Tokeni
Token ID72
Feature activation+5.40e-01

Pos logit contributions
kin+0.099
pan+0.098
aver+0.090
 tray+0.086
 bowl+0.084
Neg logit contributions
years-0.104
 him-0.100
but-0.100
 after-0.098
to-0.097
Token hopped
Token ID45230
Feature activation-1.13e+00

Pos logit contributions
kin+0.017
pan+0.017
aver+0.015
 tray+0.015
 bowl+0.015
Neg logit contributions
years-0.014
but-0.013
 after-0.013
 him-0.012
What-0.012
Token around
Token ID1088
Feature activation-6.94e-01

Pos logit contributions
years+0.028
unct+0.026
 Him+0.026
 afford+0.026
What+0.025
Neg logit contributions
kin-0.035
aver-0.033
pan-0.032
 tray-0.031
 bowl-0.030
Token her
Token ID607
Feature activation-2.61e-01

Pos logit contributions
years+0.019
 afford+0.019
beh+0.018
unct+0.018
itating+0.017
Neg logit contributions
kin-0.019
 snake-0.019
pan-0.018
 tray-0.018
 bowl-0.017
Token garden
Token ID11376
Feature activation+2.14e-01

Pos logit contributions
years+0.008
 after+0.008
but+0.008
What+0.008
 him+0.008
Neg logit contributions
pan-0.007
kin-0.007
aver-0.006
 happens-0.006
 teaches-0.006
Token a
Token ID257
Feature activation-4.24e-01

Pos logit contributions
kin+0.010
pan+0.010
aver+0.010
 tray+0.009
 bowl+0.009
Neg logit contributions
years-0.009
but-0.008
and-0.008
to-0.008
unct-0.008
Token big
Token ID1263
Feature activation-3.93e-01

Pos logit contributions
years+0.014
but+0.014
What+0.013
 after+0.013
to+0.013
Neg logit contributions
kin-0.010
pan-0.009
aver-0.009
 tray-0.008
 bowl-0.008
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
 after+0.011
 him+0.010
years+0.010
What+0.010
 afford+0.010
Neg logit contributions
kin-0.011
pan-0.010
aver-0.010
 teaches-0.009
 happens-0.009
Token fit
Token ID4197
Feature activation+7.39e-01

Pos logit contributions
kin+0.028
pan+0.027
aver+0.025
 tray+0.023
 bowl+0.023
Neg logit contributions
years-0.036
but-0.034
What-0.033
 after-0.033
 him-0.033
Token tr
Token ID491
Feature activation+2.85e+00

Pos logit contributions
kin+0.018
pan+0.018
aver+0.017
 happens+0.016
 teaches+0.016
Neg logit contributions
years-0.022
 after-0.022
but-0.021
 him-0.021
What-0.021
Tokenamp
Token ID696
Feature activation+3.55e+00

Pos logit contributions
kin+0.096
pan+0.093
aver+0.090
 tray+0.084
 bowl+0.083
Neg logit contributions
years-0.067
but-0.061
What-0.058
to-0.058
 after-0.058
Tokenoline
Token ID14453
Feature activation+1.28e+00

Pos logit contributions
kin+0.071
aver+0.066
pan+0.066
 bowl+0.057
 teaches+0.056
Neg logit contributions
but-0.127
 after-0.127
 him-0.125
years-0.124
to-0.120
Token!
Token ID0
Feature activation+4.34e-01

Pos logit contributions
kin+0.042
pan+0.041
aver+0.039
 tray+0.037
 bowl+0.037
Neg logit contributions
years-0.030
but-0.028
What-0.027
 after-0.027
to-0.027
Token Sarah
Token ID10490
Feature activation+5.44e-01

Pos logit contributions
kin+0.015
pan+0.014
aver+0.014
 tray+0.013
 bowl+0.013
Neg logit contributions
years-0.010
but-0.009
What-0.009
 after-0.009
to-0.009
Token's
Token ID338
Feature activation-1.02e-01

Pos logit contributions
kin+0.019
pan+0.019
aver+0.017
 tray+0.016
 teaches+0.016
Neg logit contributions
years-0.013
but-0.012
to-0.012
 after-0.012
 him-0.011
Token mom
Token ID1995
Feature activation-8.17e-01

Pos logit contributions
years+0.004
but+0.003
 after+0.003
What+0.003
 him+0.003
Neg logit contributions
kin-0.002
pan-0.002
aver-0.002
 happens-0.002
 tray-0.002
Token a
Token ID257
Feature activation-4.24e-01

Pos logit contributions
kin+0.010
pan+0.010
aver+0.010
 tray+0.009
 bowl+0.009
Neg logit contributions
years-0.009
but-0.008
and-0.008
to-0.008
unct-0.008
Token big
Token ID1263
Feature activation-3.93e-01

Pos logit contributions
years+0.014
but+0.014
What+0.013
 after+0.013
to+0.013
Neg logit contributions
kin-0.010
pan-0.009
aver-0.009
 tray-0.008
 bowl-0.008
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
 after+0.011
 him+0.010
years+0.010
What+0.010
 afford+0.010
Neg logit contributions
kin-0.011
pan-0.010
aver-0.010
 teaches-0.009
 happens-0.009
Token fit
Token ID4197
Feature activation+7.39e-01

Pos logit contributions
kin+0.028
pan+0.027
aver+0.025
 tray+0.023
 bowl+0.023
Neg logit contributions
years-0.036
but-0.034
What-0.033
 after-0.033
 him-0.033
Token tr
Token ID491
Feature activation+2.85e+00

Pos logit contributions
kin+0.018
pan+0.018
aver+0.017
 happens+0.016
 teaches+0.016
Neg logit contributions
years-0.022
 after-0.022
but-0.021
 him-0.021
What-0.021
Tokenamp
Token ID696
Feature activation+3.55e+00

Pos logit contributions
kin+0.096
pan+0.093
aver+0.090
 tray+0.084
 bowl+0.083
Neg logit contributions
years-0.067
but-0.061
What-0.058
to-0.058
 after-0.058
Tokenoline
Token ID14453
Feature activation+1.28e+00

Pos logit contributions
kin+0.071
aver+0.066
pan+0.066
 bowl+0.057
 teaches+0.056
Neg logit contributions
but-0.127
 after-0.127
 him-0.125
years-0.124
to-0.120
Token!
Token ID0
Feature activation+4.34e-01

Pos logit contributions
kin+0.042
pan+0.041
aver+0.039
 tray+0.037
 bowl+0.037
Neg logit contributions
years-0.030
but-0.028
What-0.027
 after-0.027
to-0.027
Token Sarah
Token ID10490
Feature activation+5.44e-01

Pos logit contributions
kin+0.015
pan+0.014
aver+0.014
 tray+0.013
 bowl+0.013
Neg logit contributions
years-0.010
but-0.009
What-0.009
 after-0.009
to-0.009
Token's
Token ID338
Feature activation-1.02e-01

Pos logit contributions
kin+0.019
pan+0.019
aver+0.017
 tray+0.016
 teaches+0.016
Neg logit contributions
years-0.013
but-0.012
to-0.012
 after-0.012
 him-0.011
Token mom
Token ID1995
Feature activation-8.17e-01

Pos logit contributions
years+0.004
but+0.003
 after+0.003
What+0.003
 him+0.003
Neg logit contributions
kin-0.002
pan-0.002
aver-0.002
 happens-0.002
 tray-0.002
Token a
Token ID257
Feature activation-2.69e-01

Pos logit contributions
kin+0.011
pan+0.010
aver+0.010
 tray+0.009
 bowl+0.009
Neg logit contributions
years-0.008
but-0.008
What-0.008
to-0.008
 after-0.008
Token strong
Token ID1913
Feature activation-4.97e-02

Pos logit contributions
What+0.007
years+0.007
 after+0.007
Ho+0.007
Why+0.006
Neg logit contributions
pan-0.008
kin-0.008
 happens-0.007
aver-0.007
 teaches-0.007
Token lizard
Token ID42406
Feature activation+7.44e-01

Pos logit contributions
 after+0.001
 like+0.001
 knowing+0.001
 because+0.001
 quite+0.001
Neg logit contributions
pan-0.001
 happens-0.001
kin-0.001
 teaches-0.001
ves-0.001
Token named
Token ID3706
Feature activation-2.57e-01

Pos logit contributions
kin+0.022
pan+0.022
aver+0.020
 tray+0.019
 bowl+0.019
Neg logit contributions
years-0.020
but-0.019
 after-0.018
What-0.018
 him-0.018
Token T
Token ID309
Feature activation+1.84e+00

Pos logit contributions
years+0.007
 him+0.007
 after+0.007
to+0.007
but+0.007
Neg logit contributions
kin-0.008
pan-0.007
aver-0.007
 teaches-0.006
 tray-0.006
Tokenaz
Token ID1031
Feature activation+3.55e+00

Pos logit contributions
kin+0.062
pan+0.062
aver+0.058
 tray+0.056
 teaches+0.055
Neg logit contributions
 him-0.040
to-0.039
years-0.038
 afford-0.037
but-0.037
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
kin+0.119
pan+0.118
 bowl+0.109
 tray+0.109
aver+0.107
Neg logit contributions
 him-0.080
years-0.080
but-0.080
to-0.077
 after-0.075
Token He
Token ID679
Feature activation-9.17e-01

Pos logit contributions
years+0.028
but+0.028
and+0.027
 afford+0.027
itating+0.026
Neg logit contributions
kin-0.031
 sheet-0.029
 bowl-0.029
aver-0.028
 happens-0.028
Token lived
Token ID5615
Feature activation-5.22e-01

Pos logit contributions
years+0.023
 him+0.021
What+0.021
 after+0.021
to+0.021
Neg logit contributions
pan-0.030
kin-0.029
 tray-0.026
aver-0.026
 bowl-0.025
Token happily
Token ID18177
Feature activation-4.55e-01

Pos logit contributions
years+0.009
but+0.008
What+0.007
 after+0.007
to+0.007
Neg logit contributions
kin-0.021
pan-0.020
aver-0.020
 tray-0.019
 bowl-0.019
Token among
Token ID1871
Feature activation+5.62e-02

Pos logit contributions
What+0.012
 int+0.012
what+0.011
 Him+0.011
 afford+0.011
Neg logit contributions
kin-0.014
aver-0.013
 snake-0.011
 bowl-0.011
pan-0.011
Token garden
Token ID11376
Feature activation+1.35e+00

Pos logit contributions
years+0.001
but+0.001
to+0.000
and+0.000
 afford+0.000
Neg logit contributions
kin-0.000
pan-0.000
aver-0.000
 bowl-0.000
 tray-0.000
Token.
Token ID13
Feature activation+3.90e-01

Pos logit contributions
kin+0.040
pan+0.038
aver+0.037
 tray+0.035
 bowl+0.035
Neg logit contributions
years-0.037
but-0.034
What-0.033
to-0.033
and-0.032
Token Ben
Token ID3932
Feature activation+4.72e-02

Pos logit contributions
kin+0.012
pan+0.012
aver+0.011
 tray+0.010
 bowl+0.010
Neg logit contributions
years-0.010
but-0.009
What-0.009
 after-0.009
to-0.009
Token was
Token ID373
Feature activation+2.77e-01

Pos logit contributions
pan+0.002
kin+0.002
aver+0.001
 hawk+0.001
 tray+0.001
Neg logit contributions
 after-0.001
years-0.001
 him-0.001
 quite-0.001
What-0.001
Token the
Token ID262
Feature activation+4.76e-01

Pos logit contributions
kin+0.009
pan+0.009
aver+0.009
 tray+0.008
 bowl+0.008
Neg logit contributions
years-0.006
but-0.006
to-0.006
 afford-0.006
What-0.006
Token seeker
Token ID45993
Feature activation+3.55e+00

Pos logit contributions
kin+0.010
aver+0.009
 snake+0.009
 tray+0.008
 bowl+0.008
Neg logit contributions
years-0.018
to-0.018
and-0.017
but-0.017
 afford-0.017
Token and
Token ID290
Feature activation+1.92e+00

Pos logit contributions
pan+0.157
 happens+0.155
 teaches+0.154
kin+0.153
 cleans+0.147
Neg logit contributions
 after-0.045
 when-0.035
 since-0.035
but-0.035
 him-0.034
Token Lily
Token ID20037
Feature activation+8.46e-01

Pos logit contributions
pan+0.062
kin+0.061
aver+0.057
 tray+0.055
 bowl+0.054
Neg logit contributions
years-0.048
 after-0.043
but-0.043
What-0.043
 him-0.042
Token was
Token ID373
Feature activation+4.17e-01

Pos logit contributions
pan+0.035
kin+0.033
 hawk+0.031
aver+0.029
 tray+0.029
Neg logit contributions
 after-0.015
 him-0.014
 really-0.014
 quite-0.013
 like-0.013
Token the
Token ID262
Feature activation+5.93e-01

Pos logit contributions
kin+0.015
pan+0.014
aver+0.014
 tray+0.013
 bowl+0.013
Neg logit contributions
years-0.009
but-0.008
 after-0.008
What-0.008
 him-0.008
Token h
Token ID289
Feature activation+1.52e+00

Pos logit contributions
kin+0.015
pan+0.015
aver+0.014
 tray+0.013
 bowl+0.013
Neg logit contributions
years-0.018
but-0.017
to-0.017
What-0.017
and-0.017
Token park
Token ID3952
Feature activation+1.49e+00

Pos logit contributions
kin+0.005
pan+0.004
aver+0.004
 bowl+0.004
 tray+0.004
Neg logit contributions
years-0.008
but-0.008
to-0.008
and-0.008
 afford-0.008
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
kin+0.037
aver+0.032
pan+0.031
 tray+0.030
 bowl+0.030
Neg logit contributions
years-0.050
unct-0.047
to-0.046
What-0.046
Ho-0.045
Token Lily
Token ID20037
Feature activation+5.65e-01

Pos logit contributions
kin+0.007
pan+0.007
aver+0.006
 tray+0.006
 bowl+0.006
Neg logit contributions
years-0.006
but-0.005
What-0.005
 after-0.005
to-0.005
Token was
Token ID373
Feature activation+3.93e-01

Pos logit contributions
pan+0.020
kin+0.020
aver+0.018
 hawk+0.016
 tray+0.016
Neg logit contributions
years-0.013
 after-0.013
 him-0.012
but-0.011
What-0.011
Token the
Token ID262
Feature activation+7.51e-01

Pos logit contributions
kin+0.015
pan+0.014
aver+0.014
 tray+0.013
 bowl+0.013
Neg logit contributions
years-0.008
but-0.007
to-0.007
What-0.007
and-0.007
Token seeker
Token ID45993
Feature activation+3.54e+00

Pos logit contributions
kin+0.016
 snake+0.014
 tray+0.014
aver+0.014
 ball+0.014
Neg logit contributions
to-0.028
years-0.027
and-0.026
but-0.026
 Him-0.026
Token and
Token ID290
Feature activation+2.25e+00

Pos logit contributions
kin+0.148
pan+0.147
 happens+0.145
 teaches+0.144
aver+0.142
Neg logit contributions
 after-0.049
but-0.041
 since-0.039
 when-0.039
 because-0.036
Token Tom
Token ID4186
Feature activation+4.08e-01

Pos logit contributions
kin+0.057
pan+0.053
aver+0.052
 tray+0.047
 bowl+0.047
Neg logit contributions
years-0.071
but-0.067
What-0.065
to-0.065
and-0.065
Token was
Token ID373
Feature activation+4.07e-01

Pos logit contributions
pan+0.015
kin+0.015
aver+0.013
 tray+0.013
 hawk+0.012
Neg logit contributions
 him-0.009
 after-0.008
years-0.008
 quite-0.008
What-0.008
Token the
Token ID262
Feature activation+7.55e-01

Pos logit contributions
kin+0.014
pan+0.014
aver+0.013
 tray+0.013
 bowl+0.013
Neg logit contributions
years-0.009
 after-0.008
but-0.008
What-0.008
 him-0.008
Token h
Token ID289
Feature activation+1.56e+00

Pos logit contributions
kin+0.021
aver+0.019
pan+0.019
 tray+0.018
 bowl+0.018
Neg logit contributions
years-0.022
but-0.021
to-0.021
 afford-0.020
and-0.020
Token park
Token ID3952
Feature activation+1.42e+00

Pos logit contributions
years+0.004
but+0.004
What+0.003
 after+0.003
to+0.003
Neg logit contributions
kin-0.002
pan-0.002
aver-0.002
 bowl-0.002
 tray-0.002
Token.
Token ID13
Feature activation+2.70e-01

Pos logit contributions
kin+0.037
aver+0.033
pan+0.032
 tray+0.031
 bowl+0.031
Neg logit contributions
years-0.047
unct-0.043
to-0.042
but-0.041
Ho-0.041
Token Tom
Token ID4186
Feature activation+1.05e-01

Pos logit contributions
kin+0.009
pan+0.009
aver+0.009
 happens+0.008
 teaches+0.008
Neg logit contributions
What-0.006
 after-0.006
years-0.006
but-0.006
 him-0.006
Token was
Token ID373
Feature activation-1.99e-01

Pos logit contributions
pan+0.004
kin+0.004
aver+0.003
 tray+0.003
 bowl+0.003
Neg logit contributions
 after-0.002
years-0.002
 him-0.002
and-0.002
What-0.002
Token the
Token ID262
Feature activation+3.52e-01

Pos logit contributions
years+0.004
but+0.004
to+0.004
 afford+0.004
What+0.004
Neg logit contributions
kin-0.007
pan-0.007
aver-0.007
 tray-0.006
 bowl-0.006
Token seeker
Token ID45993
Feature activation+3.52e+00

Pos logit contributions
kin+0.006
aver+0.005
 snake+0.005
 tray+0.005
 ball+0.004
Neg logit contributions
years-0.015
to-0.014
 int-0.014
 afford-0.014
and-0.014
Token and
Token ID290
Feature activation+1.52e+00

Pos logit contributions
pan+0.157
 teaches+0.156
kin+0.154
 happens+0.153
aver+0.147
Neg logit contributions
 after-0.043
 when-0.035
but-0.032
 him-0.031
 because-0.030
Token Lily
Token ID20037
Feature activation+7.79e-01

Pos logit contributions
kin+0.046
pan+0.044
aver+0.042
 tray+0.039
 bowl+0.039
Neg logit contributions
years-0.041
but-0.038
What-0.037
 after-0.037
to-0.037
Token was
Token ID373
Feature activation+1.65e-01

Pos logit contributions
pan+0.035
kin+0.031
 hawk+0.029
 tray+0.028
aver+0.028
Neg logit contributions
 after-0.014
 really-0.012
 him-0.011
 quite-0.011
 like-0.011
Token the
Token ID262
Feature activation+4.22e-01

Pos logit contributions
kin+0.006
pan+0.005
aver+0.005
 tray+0.005
 bowl+0.005
Neg logit contributions
years-0.004
but-0.004
What-0.003
 after-0.003
 him-0.003
Token h
Token ID289
Feature activation+1.41e+00

Pos logit contributions
kin+0.010
pan+0.010
aver+0.010
 tray+0.009
 bowl+0.009
Neg logit contributions
years-0.014
but-0.013
to-0.013
 afford-0.012
What-0.012
Token.
Token ID13
Feature activation+7.03e-01

Pos logit contributions
kin+0.005
pan+0.004
aver+0.004
 tray+0.004
 bowl+0.004
Neg logit contributions
years-0.004
but-0.004
What-0.004
 after-0.004
to-0.004
Token
Token ID198
Feature activation+8.15e-01

Pos logit contributions
kin+0.023
pan+0.023
aver+0.022
 bowl+0.020
 tray+0.020
Neg logit contributions
years-0.017
but-0.015
and-0.014
 after-0.014
 afford-0.014
Token
Token ID198
Feature activation+6.36e-01

Pos logit contributions
kin+0.035
pan+0.033
aver+0.031
 tray+0.030
 bowl+0.030
Neg logit contributions
 him-0.013
What-0.012
but-0.012
years-0.012
to-0.012
Token"
Token ID1
Feature activation-9.31e-01

Pos logit contributions
kin+0.017
pan+0.016
aver+0.015
 bowl+0.014
 tray+0.014
Neg logit contributions
years-0.019
What-0.019
but-0.018
 after-0.018
 him-0.018
TokenWhich
Token ID13828
Feature activation-2.79e-01

Pos logit contributions
unct+0.019
years+0.018
i+0.018
 int+0.017
to+0.017
Neg logit contributions
aver-0.031
kin-0.031
pan-0.030
 snake-0.029
 ball-0.029
Token locked
Token ID8970
Feature activation+3.51e+00

Pos logit contributions
 him+0.008
What+0.008
 after+0.008
 her+0.008
Why+0.007
Neg logit contributions
kin-0.008
pan-0.007
aver-0.007
oon-0.006
 shards-0.006
Token door
Token ID3420
Feature activation+1.55e+00

Pos logit contributions
kin+0.094
pan+0.090
aver+0.088
 tray+0.080
 bowl+0.077
Neg logit contributions
 him-0.100
 after-0.096
What-0.094
but-0.094
years-0.093
Token?"
Token ID1701
Feature activation+1.74e-01

Pos logit contributions
kin+0.033
pan+0.031
aver+0.030
 bowl+0.027
 tray+0.027
Neg logit contributions
years-0.055
but-0.052
What-0.051
and-0.050
to-0.050
Token uncle
Token ID7711
Feature activation+1.90e+00

Pos logit contributions
kin+0.003
pan+0.003
aver+0.003
 snake+0.003
 bowl+0.003
Neg logit contributions
years-0.006
and-0.006
but-0.006
to-0.006
what-0.006
Token Bob
Token ID5811
Feature activation+9.78e-01

Pos logit contributions
kin+0.053
pan+0.051
aver+0.049
 bowl+0.045
 tray+0.045
Neg logit contributions
years-0.055
but-0.051
What-0.050
 after-0.049
and-0.049
Token asked
Token ID1965
Feature activation-6.40e-02

Pos logit contributions
kin+0.026
aver+0.025
pan+0.025
 bowl+0.023
 tray+0.023
Neg logit contributions
years-0.029
but-0.027
and-0.026
What-0.026
 afford-0.026
Token seek
Token ID5380
Feature activation+1.05e+00

Pos logit contributions
kin+0.012
pan+0.011
aver+0.010
 bowl+0.010
 tray+0.010
Neg logit contributions
years-0.017
but-0.016
to-0.015
What-0.015
and-0.015
Token.
Token ID13
Feature activation+8.62e-01

Pos logit contributions
pan+0.039
kin+0.039
aver+0.038
 happens+0.037
 teaches+0.036
Neg logit contributions
 after-0.018
years-0.018
but-0.017
 him-0.017
 during-0.017
Token Ben
Token ID3932
Feature activation+1.16e-02

Pos logit contributions
kin+0.027
pan+0.026
aver+0.025
 tray+0.023
 bowl+0.023
Neg logit contributions
years-0.022
but-0.020
 after-0.020
What-0.020
to-0.019
Token was
Token ID373
Feature activation-9.74e-02

Pos logit contributions
pan+0.000
kin+0.000
aver+0.000
 tray+0.000
 bowl+0.000
Neg logit contributions
 after-0.000
years-0.000
but-0.000
 him-0.000
and-0.000
Token the
Token ID262
Feature activation+3.50e-01

Pos logit contributions
years+0.002
 after+0.002
but+0.002
What+0.002
 him+0.002
Neg logit contributions
kin-0.004
pan-0.003
aver-0.003
 bowl-0.003
 tray-0.003
Token seeker
Token ID45993
Feature activation+3.49e+00

Pos logit contributions
kin+0.009
pan+0.008
aver+0.008
 tray+0.008
 bowl+0.007
Neg logit contributions
years-0.011
but-0.011
to-0.010
and-0.010
What-0.010
Token and
Token ID290
Feature activation+1.87e+00

Pos logit contributions
 teaches+0.149
kin+0.146
pan+0.144
 happens+0.144
aver+0.138
Neg logit contributions
 after-0.051
but-0.042
 when-0.041
 him-0.039
 since-0.038
Token Lily
Token ID20037
Feature activation+8.30e-01

Pos logit contributions
pan+0.065
kin+0.064
aver+0.060
 tray+0.058
 bowl+0.057
Neg logit contributions
years-0.041
 after-0.038
 him-0.037
but-0.037
What-0.036
Token was
Token ID373
Feature activation+1.83e-01

Pos logit contributions
pan+0.035
kin+0.033
 hawk+0.030
 tray+0.029
 flame+0.029
Neg logit contributions
 after-0.015
 when-0.013
 quite-0.012
 him-0.012
but-0.012
Token the
Token ID262
Feature activation+3.34e-01

Pos logit contributions
kin+0.007
pan+0.007
aver+0.006
 bowl+0.006
 tray+0.006
Neg logit contributions
years-0.004
 after-0.004
What-0.003
but-0.003
 him-0.003
Token h
Token ID289
Feature activation+1.15e+00

Pos logit contributions
kin+0.009
pan+0.009
aver+0.008
 bowl+0.007
 teaches+0.007
Neg logit contributions
years-0.010
 after-0.010
What-0.009
but-0.009
 him-0.009
Token park
Token ID3952
Feature activation+1.28e+00

Pos logit contributions
years+0.004
but+0.003
to+0.003
and+0.003
 afford+0.003
Neg logit contributions
kin-0.002
pan-0.002
aver-0.002
 bowl-0.002
 tray-0.002
Token.
Token ID13
Feature activation+1.79e-01

Pos logit contributions
kin+0.032
aver+0.027
pan+0.027
 tray+0.027
 bowl+0.026
Neg logit contributions
years-0.044
unct-0.041
to-0.040
What-0.039
Ho-0.039
Token Anna
Token ID11735
Feature activation+6.99e-01

Pos logit contributions
kin+0.007
pan+0.007
aver+0.007
 happens+0.006
 tray+0.006
Neg logit contributions
years-0.003
What-0.003
 after-0.003
 him-0.003
but-0.003
Token was
Token ID373
Feature activation+1.40e-01

Pos logit contributions
pan+0.024
kin+0.023
aver+0.021
 tray+0.020
 bowl+0.019
Neg logit contributions
years-0.017
 after-0.016
but-0.015
 him-0.015
What-0.015
Token the
Token ID262
Feature activation+4.29e-01

Pos logit contributions
kin+0.005
pan+0.005
aver+0.005
 tray+0.004
 bowl+0.004
Neg logit contributions
years-0.003
but-0.003
to-0.003
What-0.003
and-0.003
Token seeker
Token ID45993
Feature activation+3.49e+00

Pos logit contributions
kin+0.008
aver+0.007
 snake+0.007
 tray+0.007
 bowl+0.007
Neg logit contributions
years-0.016
to-0.016
and-0.016
but-0.016
 afford-0.016
Token and
Token ID290
Feature activation+2.05e+00

Pos logit contributions
pan+0.153
 happens+0.151
 teaches+0.151
kin+0.151
 explodes+0.144
Neg logit contributions
 after-0.044
but-0.036
 when-0.036
 since-0.035
 because-0.032
Token Ben
Token ID3932
Feature activation+2.28e-01

Pos logit contributions
kin+0.056
pan+0.052
aver+0.051
 tray+0.047
 bowl+0.046
Neg logit contributions
years-0.061
but-0.058
What-0.056
and-0.056
to-0.056
Token was
Token ID373
Feature activation+2.73e-01

Pos logit contributions
pan+0.009
kin+0.008
aver+0.008
 hawk+0.007
 tray+0.007
Neg logit contributions
 after-0.004
 him-0.004
 quite-0.004
 really-0.004
 like-0.004
Token the
Token ID262
Feature activation+5.06e-01

Pos logit contributions
kin+0.010
pan+0.009
aver+0.009
 tray+0.008
 bowl+0.008
Neg logit contributions
years-0.006
 after-0.006
but-0.006
What-0.006
 him-0.005
Token h
Token ID289
Feature activation+1.42e+00

Pos logit contributions
kin+0.013
pan+0.012
aver+0.012
 tray+0.011
 bowl+0.011
Neg logit contributions
years-0.016
but-0.015
to-0.014
 afford-0.014
What-0.014
Token house
Token ID2156
Feature activation+9.59e-01

Pos logit contributions
kin+0.025
pan+0.024
aver+0.024
ves+0.020
 teaches+0.019
Neg logit contributions
 after-0.025
 him-0.024
years-0.024
What-0.023
 during-0.022
Token.
Token ID13
Feature activation+4.41e-01

Pos logit contributions
kin+0.027
pan+0.026
aver+0.025
 tray+0.024
 bowl+0.023
Neg logit contributions
years-0.028
What-0.025
to-0.025
unct-0.025
but-0.025
Token Lily
Token ID20037
Feature activation+5.32e-01

Pos logit contributions
kin+0.016
pan+0.015
aver+0.014
 happens+0.013
 tray+0.013
Neg logit contributions
years-0.010
What-0.009
 after-0.009
 him-0.009
but-0.009
Token was
Token ID373
Feature activation+2.03e-01

Pos logit contributions
kin+0.018
pan+0.018
aver+0.016
 tray+0.015
 bowl+0.015
Neg logit contributions
years-0.013
 after-0.012
 him-0.012
but-0.011
to-0.011
Token the
Token ID262
Feature activation+6.26e-01

Pos logit contributions
kin+0.008
pan+0.007
aver+0.007
 tray+0.007
 bowl+0.007
Neg logit contributions
years-0.004
but-0.004
to-0.003
What-0.003
 afford-0.003
Token seeker
Token ID45993
Feature activation+3.48e+00

Pos logit contributions
kin+0.013
 tray+0.012
 snake+0.012
aver+0.012
 bowl+0.011
Neg logit contributions
to-0.023
years-0.022
and-0.022
but-0.021
 afford-0.021
Token and
Token ID290
Feature activation+2.13e+00

Pos logit contributions
pan+0.145
 teaches+0.142
kin+0.142
 happens+0.141
aver+0.137
Neg logit contributions
 after-0.049
but-0.041
 when-0.040
 since-0.040
 because-0.038
Token Max
Token ID5436
Feature activation+4.30e-01

Pos logit contributions
pan+0.069
kin+0.067
aver+0.063
 bowl+0.061
 tray+0.060
Neg logit contributions
years-0.055
 him-0.052
 after-0.051
What-0.049
but-0.048
Token was
Token ID373
Feature activation+2.73e-01

Pos logit contributions
pan+0.016
kin+0.015
aver+0.014
 tray+0.013
 bowl+0.013
Neg logit contributions
 after-0.009
 him-0.009
 quite-0.008
years-0.008
What-0.008
Token the
Token ID262
Feature activation+6.82e-01

Pos logit contributions
kin+0.010
pan+0.010
aver+0.009
 tray+0.009
 bowl+0.009
Neg logit contributions
years-0.006
 after-0.005
but-0.005
What-0.005
 him-0.005
Token h
Token ID289
Feature activation+1.66e+00

Pos logit contributions
kin+0.017
pan+0.016
aver+0.016
 tray+0.015
 bowl+0.015
Neg logit contributions
years-0.021
but-0.020
to-0.020
 afford-0.019
What-0.019
Token park
Token ID3952
Feature activation+1.53e+00

Pos logit contributions
years+0.005
but+0.005
to+0.005
and+0.005
 afford+0.005
Neg logit contributions
kin-0.003
pan-0.003
aver-0.003
 bowl-0.002
 tray-0.002
Token.
Token ID13
Feature activation+1.55e-01

Pos logit contributions
kin+0.040
aver+0.035
pan+0.035
 tray+0.034
 bowl+0.034
Neg logit contributions
years-0.050
unct-0.046
to-0.045
What-0.044
but-0.044
Token Lily
Token ID20037
Feature activation+6.12e-01

Pos logit contributions
kin+0.006
pan+0.006
aver+0.005
 happens+0.005
 teaches+0.005
Neg logit contributions
What-0.003
 after-0.003
years-0.003
 him-0.003
what-0.002
Token was
Token ID373
Feature activation+2.43e-01

Pos logit contributions
pan+0.022
kin+0.022
aver+0.019
 tray+0.018
 hawk+0.018
Neg logit contributions
years-0.014
 after-0.014
 him-0.013
but-0.013
 quite-0.012
Token the
Token ID262
Feature activation+4.62e-01

Pos logit contributions
kin+0.009
pan+0.009
aver+0.008
 tray+0.008
 bowl+0.008
Neg logit contributions
years-0.005
but-0.004
What-0.004
 after-0.004
to-0.004
Token seeker
Token ID45993
Feature activation+3.48e+00

Pos logit contributions
kin+0.008
 tray+0.007
 snake+0.007
aver+0.007
 bowl+0.007
Neg logit contributions
years-0.018
to-0.018
and-0.018
but-0.018
 afford-0.017
Token and
Token ID290
Feature activation+2.08e+00

Pos logit contributions
pan+0.151
 teaches+0.151
 happens+0.150
kin+0.150
 explodes+0.142
Neg logit contributions
 after-0.044
 since-0.036
 when-0.036
but-0.035
 because-0.033
Token Max
Token ID5436
Feature activation+6.14e-01

Pos logit contributions
pan+0.061
kin+0.060
aver+0.056
 bowl+0.053
 tray+0.053
Neg logit contributions
years-0.059
 after-0.055
 him-0.055
What-0.054
but-0.054
Token was
Token ID373
Feature activation+3.79e-01

Pos logit contributions
pan+0.023
kin+0.022
aver+0.020
 tray+0.019
 bowl+0.019
Neg logit contributions
 after-0.013
 him-0.012
 quite-0.012
years-0.011
What-0.011
Token the
Token ID262
Feature activation+5.28e-01

Pos logit contributions
kin+0.014
pan+0.013
aver+0.013
 tray+0.012
 bowl+0.012
Neg logit contributions
years-0.009
 after-0.008
What-0.008
but-0.008
 him-0.007
Token h
Token ID289
Feature activation+1.37e+00

Pos logit contributions
kin+0.013
pan+0.012
aver+0.012
 tray+0.011
 bowl+0.011
Neg logit contributions
years-0.017
but-0.016
to-0.016
 afford-0.015
and-0.015
Token red
Token ID2266
Feature activation+1.15e+00

Pos logit contributions
years+0.009
 him+0.009
 after+0.009
but+0.009
What+0.008
Neg logit contributions
pan-0.009
kin-0.009
aver-0.009
 happens-0.008
 teaches-0.008
Token cape
Token ID36023
Feature activation-1.88e+00

Pos logit contributions
kin+0.023
pan+0.022
aver+0.021
 teaches+0.018
 happens+0.018
Neg logit contributions
years-0.041
 him-0.040
 after-0.040
but-0.040
to-0.039
Token and
Token ID290
Feature activation-7.99e-01

Pos logit contributions
years+0.044
What+0.040
to+0.037
what+0.037
unct+0.037
Neg logit contributions
kin-0.065
pan-0.063
aver-0.060
 tray-0.057
 bowl-0.056
Token make
Token ID787
Feature activation-1.72e+00

Pos logit contributions
years+0.020
but+0.019
What+0.018
and+0.018
to+0.018
Neg logit contributions
kin-0.025
pan-0.024
aver-0.023
 bowl-0.022
 tray-0.022
Token-
Token ID12
Feature activation-5.60e-01

Pos logit contributions
years+0.042
but+0.039
What+0.037
 afford+0.037
what+0.036
Neg logit contributions
kin-0.055
pan-0.053
aver-0.052
 happens-0.048
 tray-0.048
Tokenbel
Token ID6667
Feature activation-3.93e+00

Pos logit contributions
years+0.006
 int+0.006
What+0.006
but+0.006
ory+0.006
Neg logit contributions
aver-0.023
pan-0.023
kin-0.022
 happens-0.020
 block-0.020
Tokenieve
Token ID12311
Feature activation-7.24e-01

Pos logit contributions
years+0.106
but+0.100
to+0.098
 after+0.097
What+0.096
Neg logit contributions
kin-0.113
pan-0.108
aver-0.101
 teaches-0.101
 happens-0.096
Token he
Token ID339
Feature activation-7.66e-01

Pos logit contributions
years+0.016
but+0.014
What+0.014
to+0.014
 afford+0.014
Neg logit contributions
kin-0.026
pan-0.024
aver-0.023
 tray-0.022
 bowl-0.022
Token had
Token ID550
Feature activation-1.88e-01

Pos logit contributions
years+0.022
but+0.021
What+0.021
to+0.020
what+0.020
Neg logit contributions
kin-0.021
pan-0.020
aver-0.019
 happens-0.018
 teaches-0.018
Token super
Token ID2208
Feature activation-3.90e-01

Pos logit contributions
years+0.004
but+0.004
What+0.004
unct+0.004
 afford+0.004
Neg logit contributions
kin-0.006
pan-0.006
aver-0.006
 happens-0.005
 tray-0.005
Tokenpowers
Token ID30132
Feature activation-1.38e+00

Pos logit contributions
years+0.010
but+0.009
What+0.009
 after+0.009
to+0.009
Neg logit contributions
kin-0.012
pan-0.012
aver-0.011
 tray-0.011
 bowl-0.011
TokenEmily
Token ID48640
Feature activation+1.00e-01

Pos logit contributions
years+0.002
unct+0.002
 him+0.001
but+0.001
 after+0.001
Neg logit contributions
kin-0.002
aver-0.002
pan-0.002
 teaches-0.002
 sheet-0.002
Token felt
Token ID2936
Feature activation-4.48e-01

Pos logit contributions
kin+0.003
pan+0.003
aver+0.003
 tray+0.003
 bowl+0.003
Neg logit contributions
years-0.002
 after-0.002
but-0.002
to-0.002
 him-0.002
Token encouraged
Token ID10085
Feature activation-1.54e+00

Pos logit contributions
years+0.010
but+0.009
What+0.009
 after+0.009
to+0.009
Neg logit contributions
kin-0.016
pan-0.015
aver-0.015
 tray-0.014
 bowl-0.014
Token by
Token ID416
Feature activation-4.48e-01

Pos logit contributions
ric+0.055
unct+0.053
Ho+0.053
bel+0.052
years+0.052
Neg logit contributions
er-0.022
 tree-0.020
 cat-0.020
 time-0.018
 box-0.018
Token her
Token ID607
Feature activation-4.20e-01

Pos logit contributions
years+0.011
but+0.010
What+0.010
 after+0.010
to+0.009
Neg logit contributions
kin-0.015
pan-0.014
aver-0.014
 tray-0.013
 bowl-0.013
Token pap
Token ID20461
Feature activation-3.37e+00

Pos logit contributions
years+0.014
but+0.013
and+0.012
What+0.012
Ho+0.012
Neg logit contributions
kin-0.010
pan-0.009
 bowl-0.009
 tray-0.009
 sheet-0.009
Tokena
Token ID64
Feature activation-1.36e+00

Pos logit contributions
years+0.094
What+0.089
 after+0.087
 him+0.085
but+0.085
Neg logit contributions
kin-0.103
aver-0.096
pan-0.089
 teaches-0.082
 bowl-0.080
Token's
Token ID338
Feature activation-5.29e-01

Pos logit contributions
years+0.037
What+0.034
but+0.034
Ho+0.032
unct+0.032
Neg logit contributions
kin-0.040
pan-0.039
aver-0.036
 tray-0.036
 bowl-0.035
Token words
Token ID2456
Feature activation-9.10e-01

Pos logit contributions
years+0.015
but+0.014
What+0.014
to+0.014
and+0.013
Neg logit contributions
kin-0.015
pan-0.014
aver-0.013
 tray-0.013
 bowl-0.013
Token.
Token ID13
Feature activation-3.53e-01

Pos logit contributions
years+0.022
What+0.020
but+0.020
to+0.020
unct+0.019
Neg logit contributions
kin-0.030
pan-0.029
aver-0.027
 tray-0.026
 bowl-0.026
Token Every
Token ID3887
Feature activation-1.86e-01

Pos logit contributions
years+0.006
 afford+0.005
unct+0.005
What+0.005
but+0.005
Neg logit contributions
pan-0.014
kin-0.013
 bowl-0.013
aver-0.012
 tray-0.012
Token He
Token ID679
Feature activation-7.51e-02

Pos logit contributions
kin+0.002
pan+0.002
aver+0.002
 tray+0.002
 bowl+0.002
Neg logit contributions
years-0.001
but-0.001
What-0.001
 after-0.001
to-0.001
Token would
Token ID561
Feature activation-9.85e-01

Pos logit contributions
years+0.002
 after+0.002
 him+0.002
 int+0.002
What+0.002
Neg logit contributions
kin-0.002
pan-0.002
aver-0.002
 tray-0.002
 bowl-0.002
Token race
Token ID3234
Feature activation-2.00e-02

Pos logit contributions
years+0.023
but+0.022
 after+0.021
 him+0.021
What+0.021
Neg logit contributions
kin-0.032
pan-0.031
aver-0.030
 tray-0.028
 bowl-0.028
Token make
Token ID787
Feature activation-1.06e+00

Pos logit contributions
years+0.001
unct+0.000
to+0.000
 afford+0.000
but+0.000
Neg logit contributions
kin-0.001
pan-0.001
aver-0.001
 snake-0.001
 happens-0.001
Token-
Token ID12
Feature activation+4.36e-02

Pos logit contributions
years+0.028
but+0.025
What+0.025
 after+0.024
to+0.024
Neg logit contributions
kin-0.033
pan-0.032
aver-0.030
 tray-0.028
 bowl-0.028
Tokenbel
Token ID6667
Feature activation-3.31e+00

Pos logit contributions
aver+0.002
pan+0.001
kin+0.001
 happens+0.001
 snake+0.001
Neg logit contributions
years-0.001
 int-0.001
What-0.001
unct-0.001
but-0.001
Tokenieve
Token ID12311
Feature activation-1.70e-01

Pos logit contributions
years+0.091
but+0.088
to+0.086
What+0.086
 after+0.085
Neg logit contributions
kin-0.091
pan-0.086
aver-0.085
 teaches-0.082
 happens-0.078
Token cars
Token ID5006
Feature activation-7.21e-01

Pos logit contributions
years+0.003
 after+0.003
but+0.003
and+0.003
 him+0.003
Neg logit contributions
kin-0.006
pan-0.006
aver-0.006
 bowl-0.005
 tray-0.005
Token around
Token ID1088
Feature activation-3.44e-01

Pos logit contributions
years+0.018
What+0.017
but+0.016
to+0.016
unct+0.016
Neg logit contributions
kin-0.023
pan-0.022
aver-0.021
 tray-0.020
 teaches-0.020
Token the
Token ID262
Feature activation-2.62e-01

Pos logit contributions
years+0.009
but+0.008
to+0.008
What+0.008
unct+0.008
Neg logit contributions
kin-0.011
pan-0.011
aver-0.010
 tray-0.010
 bowl-0.010
Token track
Token ID2610
Feature activation+3.68e-01

Pos logit contributions
 after+0.008
What+0.007
years+0.007
 him+0.007
 during+0.007
Neg logit contributions
kin-0.007
pan-0.007
aver-0.006
 happens-0.006
 teaches-0.006
Token she
Token ID673
Feature activation-1.14e+00

Pos logit contributions
kin+0.012
pan+0.012
 sheet+0.011
 blanket+0.011
 happens+0.011
Neg logit contributions
years-0.013
and-0.012
but-0.011
to-0.011
minded-0.011
Token woke
Token ID19092
Feature activation-1.40e+00

Pos logit contributions
years+0.037
but+0.035
What+0.034
and+0.034
to+0.033
Neg logit contributions
kin-0.029
pan-0.026
aver-0.025
 bowl-0.024
 tray-0.023
Token up
Token ID510
Feature activation-1.57e+00

Pos logit contributions
years+0.043
and+0.039
but+0.035
yes+0.034
bel+0.034
Neg logit contributions
pan-0.040
kin-0.037
 happens-0.037
 blanket-0.036
 sheet-0.036
Token she
Token ID673
Feature activation-4.38e-01

Pos logit contributions
years+0.069
and+0.063
Ho+0.062
beh+0.061
What+0.061
Neg logit contributions
pan-0.025
kin-0.023
 bowl-0.023
 blanket-0.021
 chair-0.021
Token felt
Token ID2936
Feature activation-1.56e-01

Pos logit contributions
years+0.013
but+0.013
What+0.012
and+0.012
to+0.012
Neg logit contributions
kin-0.012
pan-0.011
aver-0.011
 bowl-0.010
 tray-0.010
Token refreshed
Token ID47193
Feature activation-3.31e+00

Pos logit contributions
years+0.003
but+0.003
What+0.003
to+0.003
and+0.003
Neg logit contributions
kin-0.006
pan-0.006
aver-0.005
 bowl-0.005
 tray-0.005
Token.
Token ID13
Feature activation-3.92e-01

Pos logit contributions
years+0.131
nam+0.115
to+0.108
beh+0.107
What+0.107
Neg logit contributions
pan-0.077
kin-0.075
 bowl-0.061
 blanket-0.060
 sheet-0.060
Token She
Token ID1375
Feature activation+3.27e-01

Pos logit contributions
years+0.008
 afford+0.007
unct+0.007
but+0.007
Ho+0.007
Neg logit contributions
pan-0.014
kin-0.013
 happens-0.013
 bowl-0.013
aver-0.012
Token felt
Token ID2936
Feature activation-8.85e-03

Pos logit contributions
kin+0.010
pan+0.010
aver+0.009
 tray+0.009
 bowl+0.009
Neg logit contributions
years-0.008
 after-0.008
but-0.008
to-0.008
 him-0.008
Token happy
Token ID3772
Feature activation-1.76e+00

Pos logit contributions
years+0.000
but+0.000
to+0.000
What+0.000
and+0.000
Neg logit contributions
kin-0.000
pan-0.000
aver-0.000
 bowl-0.000
 happens-0.000
Token and
Token ID290
Feature activation+1.80e-01

Pos logit contributions
years+0.051
Ho+0.045
unct+0.044
to+0.043
What+0.043
Neg logit contributions
kin-0.052
pan-0.051
 bowl-0.047
 blanket-0.047
 sheet-0.046
Token loved
Token ID6151
Feature activation+1.97e-01

Pos logit contributions
years+0.004
but+0.004
 after+0.003
 him+0.003
to+0.003
Neg logit contributions
kin-0.005
pan-0.005
aver-0.004
 tray-0.004
 bowl-0.004
Token to
Token ID284
Feature activation-5.23e-01

Pos logit contributions
pan+0.006
 bowl+0.006
kin+0.006
 tray+0.006
 snake+0.006
Neg logit contributions
years-0.004
but-0.004
itating-0.004
Ho-0.004
 afford-0.004
Token play
Token ID711
Feature activation+2.58e-01

Pos logit contributions
years+0.010
but+0.009
What+0.009
 after+0.009
 him+0.009
Neg logit contributions
kin-0.020
pan-0.019
aver-0.018
 tray-0.017
 bowl-0.017
Token make
Token ID787
Feature activation-8.69e-01

Pos logit contributions
kin+0.010
pan+0.009
aver+0.009
 tray+0.009
 bowl+0.009
Neg logit contributions
years-0.005
unct-0.005
but-0.005
 afford-0.004
to-0.004
Token-
Token ID12
Feature activation+1.87e-01

Pos logit contributions
years+0.027
but+0.025
 afford+0.024
What+0.024
unct+0.024
Neg logit contributions
kin-0.022
pan-0.021
aver-0.020
 tray-0.020
 bowl-0.020
Tokenbel
Token ID6667
Feature activation-3.30e+00

Pos logit contributions
aver+0.006
kin+0.006
pan+0.006
 bowl+0.005
 block+0.005
Neg logit contributions
years-0.004
What-0.004
 int-0.004
but-0.004
 after-0.004
Tokenieve
Token ID12311
Feature activation+3.96e-02

Pos logit contributions
years+0.090
but+0.087
 after+0.087
What+0.087
ory+0.086
Neg logit contributions
kin-0.088
aver-0.082
pan-0.082
 teaches-0.081
 happens-0.077
Token.
Token ID13
Feature activation-8.55e-01

Pos logit contributions
kin+0.001
pan+0.001
 tray+0.001
aver+0.001
 sheet+0.001
Neg logit contributions
years-0.001
unct-0.001
What-0.001
Ho-0.001
but-0.001
Token Today
Token ID6288
Feature activation-1.95e-01

Pos logit contributions
years+0.027
unct+0.025
bel+0.024
Ho+0.024
 after+0.024
Neg logit contributions
pan-0.020
 bowl-0.019
kin-0.019
case-0.018
 teaches-0.018
Token she
Token ID673
Feature activation-2.92e-01

Pos logit contributions
years+0.007
Ho+0.006
What+0.006
minded+0.006
to+0.006
Neg logit contributions
kin-0.004
pan-0.004
 happens-0.004
 bowl-0.004
 sheet-0.004
Token was
Token ID373
Feature activation-5.20e-01

Pos logit contributions
years+0.007
 after+0.007
 afford+0.007
to+0.007
 him+0.007
Neg logit contributions
kin-0.009
pan-0.009
aver-0.009
 tray-0.008
 bowl-0.008
Token too
Token ID1165
Feature activation+9.98e-01

Pos logit contributions
years+0.014
but+0.013
What+0.012
to+0.012
 after+0.012
Neg logit contributions
kin-0.013
pan-0.013
aver-0.012
 tray-0.011
 bowl-0.011
Token?"
Token ID1701
Feature activation-9.18e-01

Pos logit contributions
kin+0.023
pan+0.022
aver+0.021
 tray+0.019
 bowl+0.019
Neg logit contributions
years-0.034
but-0.032
What-0.031
 after-0.031
to-0.031
Token
Token ID198
Feature activation+6.11e-02

Pos logit contributions
years+0.030
ge+0.029
beh+0.027
ind+0.027
bel+0.027
Neg logit contributions
 bowl-0.022
pan-0.020
kin-0.019
 tree-0.019
 teaches-0.019
Token
Token ID198
Feature activation-9.40e-02

Pos logit contributions
kin+0.002
pan+0.002
aver+0.002
 bowl+0.002
 tray+0.002
Neg logit contributions
years-0.001
but-0.001
 after-0.001
What-0.001
and-0.001
TokenHer
Token ID9360
Feature activation-4.40e-01

Pos logit contributions
years+0.002
unct+0.002
 after+0.002
 him+0.002
but+0.002
Neg logit contributions
kin-0.003
pan-0.003
aver-0.003
 teaches-0.003
 bowl-0.003
Token pap
Token ID20461
Feature activation-3.27e+00

Pos logit contributions
years+0.013
but+0.012
What+0.012
and+0.012
 after+0.012
Neg logit contributions
kin-0.012
pan-0.012
aver-0.011
 bowl-0.010
 tray-0.010
Tokena
Token ID64
Feature activation-3.15e-02

Pos logit contributions
years+0.081
 after+0.076
but+0.075
What+0.075
 him+0.073
Neg logit contributions
kin-0.106
aver-0.099
pan-0.099
 bowl-0.089
 teaches-0.089
Token smiled
Token ID13541
Feature activation-1.57e+00

Pos logit contributions
years+0.001
 after+0.001
 him+0.001
 int+0.001
but+0.001
Neg logit contributions
pan-0.001
kin-0.001
aver-0.001
 tray-0.001
 bowl-0.001
Token and
Token ID290
Feature activation+5.40e-01

Pos logit contributions
years+0.040
but+0.036
and+0.036
to+0.036
 afford+0.035
Neg logit contributions
kin-0.049
pan-0.047
aver-0.045
 bowl-0.043
 tray-0.042
Token said
Token ID531
Feature activation+2.07e-02

Pos logit contributions
kin+0.017
pan+0.016
aver+0.015
 tray+0.015
 bowl+0.014
Neg logit contributions
years-0.014
but-0.013
 after-0.013
What-0.013
to-0.013
Token,
Token ID11
Feature activation+6.23e-01

Pos logit contributions
kin+0.001
aver+0.001
pan+0.001
 tray+0.001
 bowl+0.001
Neg logit contributions
years-0.000
but-0.000
 after-0.000
 him-0.000
to-0.000
Token prevent
Token ID2948
Feature activation-8.23e-01

Pos logit contributions
years+0.000
but+0.000
and+0.000
to+0.000
minded+0.000
Neg logit contributions
kin-0.000
pan-0.000
aver-0.000
 teaches-0.000
 bowl-0.000
Token Katie
Token ID24721
Feature activation-1.26e+00

Pos logit contributions
bel+0.023
years+0.022
but+0.021
and+0.021
Ho+0.020
Neg logit contributions
kin-0.021
 happens-0.021
pan-0.021
 tray-0.019
 iv-0.019
Token from
Token ID422
Feature activation-7.81e-01

Pos logit contributions
years+0.038
What+0.035
Nice+0.033
Really+0.031
Ho+0.031
Neg logit contributions
kin-0.037
pan-0.037
 tray-0.033
 piece-0.033
 bowl-0.033
Token writing
Token ID3597
Feature activation-1.50e+00

Pos logit contributions
but+0.021
and+0.021
years+0.020
Ho+0.019
yes+0.019
Neg logit contributions
kin-0.022
pan-0.022
 bowl-0.020
 tray-0.020
 snake-0.020
Token her
Token ID607
Feature activation-9.66e-01

Pos logit contributions
years+0.036
but+0.032
What+0.031
to+0.030
and+0.030
Neg logit contributions
kin-0.052
pan-0.049
aver-0.046
 tray-0.045
 bowl-0.044
Token poems
Token ID31888
Feature activation-3.27e+00

Pos logit contributions
years+0.035
but+0.033
What+0.032
and+0.031
Ho+0.031
Neg logit contributions
kin-0.022
pan-0.019
 tray-0.018
 teaches-0.018
 happens-0.018
Token.
Token ID13
Feature activation-1.39e-01

Pos logit contributions
years+0.096
What+0.083
Ho+0.081
but+0.080
to+0.080
Neg logit contributions
kin-0.098
pan-0.095
 tray-0.083
aver-0.083
 bowl-0.082
Token They
Token ID1119
Feature activation+1.53e-01

Pos logit contributions
years+0.004
unct+0.003
Ho+0.003
beh+0.003
ge+0.003
Neg logit contributions
pan-0.004
kin-0.004
 bowl-0.004
 tray-0.004
 happens-0.004
Token did
Token ID750
Feature activation-8.88e-01

Pos logit contributions
kin+0.004
pan+0.004
aver+0.004
 bowl+0.004
 teaches+0.004
Neg logit contributions
but-0.004
and-0.004
to-0.004
years-0.004
What-0.004
Token not
Token ID407
Feature activation-6.34e-01

Pos logit contributions
years+0.033
amb+0.030
Ho+0.030
and+0.029
elt+0.029
Neg logit contributions
kin-0.019
pan-0.019
 teaches-0.018
 tray-0.017
 sheet-0.016
Token understand
Token ID1833
Feature activation-5.30e-01

Pos logit contributions
years+0.017
but+0.015
to+0.015
Ho+0.015
What+0.015
Neg logit contributions
kin-0.020
pan-0.019
aver-0.019
 teaches-0.018
 happens-0.017
Token.
Token ID13
Feature activation-8.14e-01

Pos logit contributions
years+0.035
What+0.032
but+0.031
to+0.031
unct+0.030
Neg logit contributions
kin-0.041
pan-0.039
aver-0.036
 tray-0.035
 bowl-0.034
Token Jan
Token ID2365
Feature activation+6.19e-01

Pos logit contributions
years+0.021
unct+0.020
 afford+0.020
 after+0.019
ind+0.019
Neg logit contributions
kin-0.021
 teaches-0.021
aver-0.021
pan-0.021
case-0.020
Tokenie
Token ID494
Feature activation-4.53e-01

Pos logit contributions
kin+0.026
pan+0.025
aver+0.024
 bowl+0.023
 tray+0.023
Neg logit contributions
years-0.010
but-0.009
What-0.008
 after-0.008
to-0.008
Token shared
Token ID4888
Feature activation-1.21e+00

Pos logit contributions
years+0.011
but+0.009
 him+0.009
 after+0.009
to+0.009
Neg logit contributions
kin-0.015
pan-0.015
aver-0.014
 tray-0.014
 happens-0.013
Token her
Token ID607
Feature activation-4.84e-01

Pos logit contributions
years+0.026
unct+0.025
but+0.024
to+0.023
minded+0.022
Neg logit contributions
kin-0.043
pan-0.041
 bowl-0.039
 tray-0.039
aver-0.038
Token creations
Token ID28443
Feature activation-3.22e+00

Pos logit contributions
years+0.012
 after+0.011
What+0.011
but+0.011
 afford+0.011
Neg logit contributions
pan-0.015
aver-0.014
kin-0.014
 happens-0.013
 tray-0.013
Token with
Token ID351
Feature activation-1.13e+00

Pos logit contributions
years+0.086
What+0.076
unct+0.073
to+0.070
Ho+0.070
Neg logit contributions
kin-0.108
pan-0.098
 bowl-0.094
 tray-0.094
aver-0.092
Token all
Token ID477
Feature activation-8.14e-02

Pos logit contributions
unct+0.024
but+0.024
years+0.024
to+0.023
itating+0.022
Neg logit contributions
kin-0.038
pan-0.035
aver-0.034
 teaches-0.034
 bowl-0.033
Token her
Token ID607
Feature activation+4.48e-01

Pos logit contributions
years+0.002
but+0.001
What+0.001
to+0.001
 after+0.001
Neg logit contributions
kin-0.003
pan-0.003
aver-0.003
 tray-0.003
 bowl-0.003
Token family
Token ID1641
Feature activation-1.79e+00

Pos logit contributions
kin+0.012
pan+0.012
aver+0.012
 tray+0.011
 bowl+0.010
Neg logit contributions
years-0.013
but-0.012
 after-0.012
What-0.012
 him-0.012
Token and
Token ID290
Feature activation-1.50e-01

Pos logit contributions
years+0.043
but+0.039
What+0.038
to+0.038
and+0.037
Neg logit contributions
kin-0.060
pan-0.057
aver-0.055
 bowl-0.052
 tray-0.052
Token the
Token ID262
Feature activation+5.53e-01

Pos logit contributions
years+0.005
 after+0.005
but+0.004
What+0.004
 him+0.004
Neg logit contributions
kin-0.007
pan-0.007
aver-0.006
 tray-0.006
 bowl-0.006
Token key
Token ID1994
Feature activation+5.58e-02

Pos logit contributions
kin+0.015
pan+0.014
aver+0.014
 bowl+0.013
 tray+0.013
Neg logit contributions
years-0.017
but-0.016
to-0.015
What-0.015
and-0.015
Token to
Token ID284
Feature activation-7.90e-01

Pos logit contributions
kin+0.002
pan+0.002
aver+0.002
 tray+0.002
 bowl+0.002
Neg logit contributions
years-0.001
but-0.001
 after-0.001
What-0.001
and-0.001
Token achieving
Token ID16937
Feature activation-1.33e+00

Pos logit contributions
years+0.022
but+0.020
and+0.019
to+0.019
What+0.018
Neg logit contributions
kin-0.024
pan-0.023
aver-0.021
 bowl-0.021
 teaches-0.020
Token her
Token ID607
Feature activation-2.19e+00

Pos logit contributions
years+0.032
but+0.030
and+0.028
to+0.028
 afford+0.027
Neg logit contributions
kin-0.046
pan-0.043
 teaches-0.040
 happens-0.040
 tray-0.040
Token dreams
Token ID10625
Feature activation-3.22e+00

Pos logit contributions
years+0.076
but+0.072
and+0.067
to+0.066
What+0.066
Neg logit contributions
kin-0.052
 happens-0.047
pan-0.045
 teaches-0.045
 bowl-0.044
Token.
Token ID13
Feature activation-3.54e-01

Pos logit contributions
years+0.088
but+0.079
What+0.078
to+0.077
 after+0.075
Neg logit contributions
kin-0.098
pan-0.094
aver-0.088
 tray-0.084
 bowl-0.083
Token She
Token ID1375
Feature activation+6.30e-03

Pos logit contributions
years+0.007
unct+0.006
but+0.006
Ho+0.006
 after+0.006
Neg logit contributions
kin-0.012
pan-0.012
aver-0.012
 bowl-0.011
 teaches-0.011
Token was
Token ID373
Feature activation-5.13e-01

Pos logit contributions
kin+0.000
pan+0.000
 tray+0.000
aver+0.000
 bowl+0.000
Neg logit contributions
years-0.000
 after-0.000
but-0.000
 him-0.000
 int-0.000
Token no
Token ID645
Feature activation-4.32e-01

Pos logit contributions
years+0.009
 after+0.008
but+0.008
What+0.008
 him+0.008
Neg logit contributions
kin-0.020
pan-0.020
aver-0.019
 tray-0.019
 bowl-0.018
Token longer
Token ID2392
Feature activation-7.55e-01

Pos logit contributions
years+0.005
 after+0.004
but+0.004
What+0.004
 him+0.003
Neg logit contributions
kin-0.020
pan-0.019
aver-0.019
 tray-0.018
 bowl-0.018
Token liked
Token ID8288
Feature activation+6.61e-01

Pos logit contributions
years+0.005
but+0.005
What+0.004
 after+0.004
to+0.004
Neg logit contributions
kin-0.006
pan-0.006
aver-0.006
 tray-0.005
 bowl-0.005
Token to
Token ID284
Feature activation-1.02e+00

Pos logit contributions
pan+0.021
 bowl+0.020
kin+0.020
 tray+0.019
 snake+0.019
Neg logit contributions
years-0.015
Ho-0.014
unct-0.014
itating-0.013
 afford-0.013
Token play
Token ID711
Feature activation+6.51e-02

Pos logit contributions
years+0.020
 him+0.018
 after+0.018
but+0.018
 afford+0.018
Neg logit contributions
kin-0.038
pan-0.037
aver-0.036
 tray-0.033
 bowl-0.033
Token make
Token ID787
Feature activation-1.08e+00

Pos logit contributions
kin+0.003
pan+0.002
aver+0.002
 tray+0.002
 bowl+0.002
Neg logit contributions
years-0.001
but-0.001
What-0.001
to-0.001
and-0.001
Token-
Token ID12
Feature activation-3.51e-01

Pos logit contributions
years+0.027
but+0.024
What+0.024
 afford+0.023
to+0.023
Neg logit contributions
kin-0.035
pan-0.034
aver-0.032
 tray-0.031
 bowl-0.031
Tokenbel
Token ID6667
Feature activation-3.21e+00

Pos logit contributions
years+0.005
 int+0.005
 him+0.004
What+0.004
but+0.004
Neg logit contributions
aver-0.014
kin-0.014
pan-0.014
 bowl-0.012
 happens-0.012
Tokenieve
Token ID12311
Feature activation-3.82e-01

Pos logit contributions
years+0.082
but+0.076
 after+0.074
What+0.074
to+0.073
Neg logit contributions
kin-0.099
pan-0.095
aver-0.092
 teaches-0.085
 happens-0.084
Token.
Token ID13
Feature activation-9.92e-01

Pos logit contributions
years+0.009
but+0.008
What+0.008
 after+0.008
to+0.007
Neg logit contributions
kin-0.013
pan-0.013
aver-0.012
 tray-0.012
 bowl-0.012
Token\n
Token ID198
Feature activation-8.01e-02

Pos logit contributions
years+0.025
but+0.023
What+0.022
 after+0.022
to+0.022
Neg logit contributions
kin-0.031
pan-0.030
aver-0.029
 tray-0.027
 bowl-0.027
Token\n
Token ID198
Feature activation-2.21e-01

Pos logit contributions
years+0.002
unct+0.002
but+0.002
 after+0.002
minded+0.002
Neg logit contributions
kin-0.002
aver-0.002
pan-0.002
 bowl-0.002
 tray-0.002
TokenOne
Token ID3198
Feature activation+1.55e-02

Pos logit contributions
years+0.004
but+0.004
 him+0.004
unct+0.004
to+0.004
Neg logit contributions
kin-0.008
pan-0.008
aver-0.008
 tray-0.007
 bowl-0.007
Token her
Token ID607
Feature activation-1.11e-01

Pos logit contributions
years+0.008
 him+0.006
 after+0.006
What+0.005
but+0.005
Neg logit contributions
kin-0.043
pan-0.043
aver-0.042
 bowl-0.039
 tray-0.039
Token tutor
Token ID48680
Feature activation-1.78e+00

Pos logit contributions
 after+0.003
years+0.003
but+0.003
 him+0.003
 int+0.003
Neg logit contributions
pan-0.003
aver-0.003
kin-0.003
 happens-0.003
 teaches-0.003
Token.
Token ID13
Feature activation-6.75e-01

Pos logit contributions
years+0.038
 after+0.037
 him+0.035
but+0.035
and+0.034
Neg logit contributions
kin-0.063
pan-0.061
aver-0.059
 bowl-0.054
 tray-0.054
Token When
Token ID1649
Feature activation+8.01e-02

Pos logit contributions
years+0.011
but+0.009
What+0.009
unct+0.009
 after+0.009
Neg logit contributions
kin-0.027
pan-0.027
aver-0.026
 tray-0.025
 bowl-0.025
Token her
Token ID607
Feature activation-1.21e+00

Pos logit contributions
kin+0.003
pan+0.003
 bowl+0.003
 tray+0.003
aver+0.003
Neg logit contributions
years-0.001
but-0.001
and-0.001
to-0.001
What-0.001
Token parents
Token ID3397
Feature activation-3.16e+00

Pos logit contributions
years+0.034
 after+0.034
 him+0.033
but+0.032
to+0.032
Neg logit contributions
pan-0.033
kin-0.033
aver-0.033
 iv-0.027
 bowl-0.026
Token found
Token ID1043
Feature activation-1.06e+00

Pos logit contributions
years+0.092
 after+0.085
 him+0.085
but+0.085
What+0.084
Neg logit contributions
kin-0.089
pan-0.084
aver-0.080
 tray-0.074
 bowl-0.073
Token out
Token ID503
Feature activation-9.78e-01

Pos logit contributions
years+0.016
but+0.014
What+0.014
 after+0.014
 him+0.014
Neg logit contributions
kin-0.044
pan-0.043
aver-0.042
 tray-0.040
 bowl-0.040
Token,
Token ID11
Feature activation-7.01e-02

Pos logit contributions
years+0.023
but+0.021
What+0.021
to+0.021
and+0.021
Neg logit contributions
kin-0.033
pan-0.031
aver-0.030
 tray-0.028
 bowl-0.028
Token they
Token ID484
Feature activation-3.73e-01

Pos logit contributions
years+0.002
but+0.002
to+0.002
and+0.002
What+0.002
Neg logit contributions
kin-0.002
pan-0.002
aver-0.002
 tray-0.002
 bowl-0.002
Token were
Token ID547
Feature activation-3.33e-01

Pos logit contributions
years+0.009
but+0.008
 after+0.008
 him+0.008
What+0.008
Neg logit contributions
kin-0.012
pan-0.012
aver-0.011
 tray-0.011
 bowl-0.010
Token was
Token ID373
Feature activation-3.82e-01

Pos logit contributions
years+0.004
but+0.003
 after+0.003
What+0.003
to+0.003
Neg logit contributions
kin-0.004
pan-0.004
aver-0.004
 tray-0.004
 bowl-0.004
Token ready
Token ID3492
Feature activation-4.63e-01

Pos logit contributions
years+0.007
 after+0.007
 him+0.007
but+0.007
What+0.006
Neg logit contributions
kin-0.014
pan-0.014
 tray-0.013
 happens-0.013
aver-0.013
Token for
Token ID329
Feature activation-7.18e-01

Pos logit contributions
years+0.013
What+0.011
Ho+0.011
nam+0.010
unct+0.010
Neg logit contributions
kin-0.015
pan-0.015
 bowl-0.013
 tray-0.013
aver-0.013
Token make
Token ID787
Feature activation-6.11e-01

Pos logit contributions
years+0.019
but+0.018
to+0.017
What+0.017
and+0.017
Neg logit contributions
kin-0.022
pan-0.021
aver-0.020
 tray-0.019
 bowl-0.019
Token-
Token ID12
Feature activation+1.78e-01

Pos logit contributions
years+0.015
but+0.013
What+0.013
 after+0.013
to+0.013
Neg logit contributions
kin-0.020
pan-0.020
aver-0.019
 tray-0.018
 bowl-0.018
Tokenbel
Token ID6667
Feature activation-3.16e+00

Pos logit contributions
aver+0.006
kin+0.005
pan+0.005
 teaches+0.005
 happens+0.005
Neg logit contributions
years-0.004
What-0.004
but-0.004
Ho-0.004
 int-0.004
Tokenieve
Token ID12311
Feature activation+5.77e-02

Pos logit contributions
years+0.080
but+0.076
What+0.074
to+0.074
 after+0.073
Neg logit contributions
kin-0.097
pan-0.092
aver-0.088
 teaches-0.085
 happens-0.082
Token fun
Token ID1257
Feature activation-2.26e-01

Pos logit contributions
kin+0.002
pan+0.002
aver+0.002
 tray+0.002
 bowl+0.002
Neg logit contributions
years-0.001
but-0.001
 after-0.001
What-0.001
and-0.001
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
years+0.007
but+0.006
What+0.006
unct+0.006
to+0.006
Neg logit contributions
kin-0.007
pan-0.006
aver-0.006
 teaches-0.006
 tray-0.006
Token<|endoftext|>
Token ID50256
Feature activation-2.02e-01

Pos logit contributions
kin+0.009
pan+0.009
aver+0.009
 tray+0.008
 bowl+0.008
Neg logit contributions
years-0.006
but-0.005
What-0.005
 after-0.005
to-0.005
TokenOnce
Token ID7454
Feature activation-1.30e+00

Pos logit contributions
What+0.003
 him+0.003
to+0.003
years+0.003
and+0.003
Neg logit contributions
pan-0.008
kin-0.008
aver-0.008
 tray-0.007
 broom-0.007
Token stop
Token ID2245
Feature activation+4.33e-02

Pos logit contributions
kin+0.013
pan+0.013
aver+0.011
 bowl+0.011
 happens+0.011
Neg logit contributions
years-0.011
What-0.010
but-0.010
Ho-0.010
to-0.009
Token him
Token ID683
Feature activation-7.45e-01

Pos logit contributions
kin+0.001
pan+0.001
 bowl+0.001
 happens+0.001
aver+0.001
Neg logit contributions
years-0.001
What-0.001
but-0.001
to-0.001
and-0.001
Token from
Token ID422
Feature activation-7.47e-01

Pos logit contributions
years+0.026
What+0.023
Ho+0.022
but+0.022
ind+0.021
Neg logit contributions
kin-0.018
pan-0.017
 bowl-0.016
 happens-0.016
 ball-0.015
Token achieving
Token ID16937
Feature activation-1.70e+00

Pos logit contributions
years+0.019
but+0.019
and+0.019
to+0.018
Ho+0.018
Neg logit contributions
kin-0.023
pan-0.022
aver-0.020
 bowl-0.020
 tray-0.020
Token his
Token ID465
Feature activation-1.97e+00

Pos logit contributions
years+0.043
but+0.039
and+0.037
to+0.036
unct+0.036
Neg logit contributions
kin-0.056
pan-0.053
aver-0.050
 teaches-0.049
 bowl-0.049
Token dreams
Token ID10625
Feature activation-3.15e+00

Pos logit contributions
years+0.068
but+0.065
and+0.063
What+0.062
to+0.062
Neg logit contributions
kin-0.045
pan-0.041
aver-0.038
 bowl-0.038
 tray-0.038
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
years+0.090
What+0.081
but+0.080
to+0.078
and+0.077
Neg logit contributions
kin-0.092
pan-0.089
aver-0.083
 bowl-0.079
 tray-0.079
Token<|endoftext|>
Token ID50256
Feature activation+4.31e-01

Pos logit contributions
years+0.023
but+0.021
What+0.021
 after+0.021
to+0.020
Neg logit contributions
kin-0.038
pan-0.037
aver-0.036
 tray-0.033
 bowl-0.033
TokenIt
Token ID1026
Feature activation-3.90e-01

Pos logit contributions
pan+0.021
kin+0.020
aver+0.019
 tray+0.018
 broom+0.018
Neg logit contributions
 him-0.005
What-0.005
and-0.004
to-0.004
 after-0.004
Token was
Token ID373
Feature activation+9.59e-03

Pos logit contributions
years+0.011
but+0.011
What+0.010
 after+0.010
 him+0.010
Neg logit contributions
kin-0.011
pan-0.011
aver-0.010
 tray-0.009
 bowl-0.009
Token raining
Token ID43079
Feature activation+6.44e-02

Pos logit contributions
kin+0.000
pan+0.000
aver+0.000
 tray+0.000
 happens+0.000
Neg logit contributions
years-0.000
 after-0.000
What-0.000
but-0.000
 him-0.000
Token and
Token ID290
Feature activation+4.66e-01

Pos logit contributions
kin+0.014
pan+0.013
aver+0.012
 bowl+0.012
 tray+0.011
Neg logit contributions
years-0.013
What-0.012
to-0.011
but-0.011
unct-0.011
Token they
Token ID484
Feature activation+2.87e-01

Pos logit contributions
kin+0.015
pan+0.014
aver+0.014
 bowl+0.013
 teaches+0.013
Neg logit contributions
years-0.012
but-0.011
What-0.010
to-0.010
and-0.010
Token played
Token ID2826
Feature activation+3.66e-02

Pos logit contributions
kin+0.009
pan+0.008
aver+0.008
 bowl+0.008
 tray+0.008
Neg logit contributions
years-0.007
but-0.007
What-0.007
and-0.007
to-0.007
Token make
Token ID787
Feature activation-2.51e-02

Pos logit contributions
kin+0.001
pan+0.001
aver+0.001
 tray+0.001
 teaches+0.001
Neg logit contributions
years-0.001
unct-0.001
but-0.001
to-0.001
 afford-0.001
Token-
Token ID12
Feature activation+7.28e-01

Pos logit contributions
years+0.001
but+0.000
 afford+0.000
unct+0.000
What+0.000
Neg logit contributions
kin-0.001
pan-0.001
aver-0.001
 teaches-0.001
 tray-0.001
Tokenbel
Token ID6667
Feature activation-3.14e+00

Pos logit contributions
aver+0.028
pan+0.027
kin+0.027
case+0.025
 block+0.025
Neg logit contributions
 int-0.010
unct-0.010
 least-0.009
 curiously-0.009
 after-0.008
Tokenieve
Token ID12311
Feature activation+3.95e-01

Pos logit contributions
years+0.087
but+0.081
to+0.080
 after+0.080
unct+0.079
Neg logit contributions
kin-0.088
pan-0.083
 teaches-0.079
aver-0.078
 happens-0.074
Token.
Token ID13
Feature activation+2.33e-01

Pos logit contributions
kin+0.014
pan+0.013
aver+0.013
 tray+0.012
 bowl+0.012
Neg logit contributions
years-0.009
but-0.008
What-0.008
to-0.008
unct-0.008
Token Finally
Token ID9461
Feature activation-1.06e+00

Pos logit contributions
kin+0.006
pan+0.005
aver+0.005
 bowl+0.005
 tray+0.005
Neg logit contributions
years-0.008
but-0.007
What-0.007
 afford-0.007
and-0.007
Token,
Token ID11
Feature activation+2.01e-01

Pos logit contributions
years+0.030
but+0.028
 after+0.028
What+0.027
 him+0.027
Neg logit contributions
kin-0.030
pan-0.029
aver-0.028
 tray-0.026
 bowl-0.026
Token they
Token ID484
Feature activation+2.26e-01

Pos logit contributions
kin+0.006
 bowl+0.005
 tray+0.004
 net+0.004
 ball+0.004
Neg logit contributions
years-0.007
and-0.007
beh-0.007
ind-0.006
unct-0.006
Token skill
Token ID5032
Feature activation-1.70e+00

Pos logit contributions
years+0.014
 after+0.014
but+0.013
 afford+0.013
What+0.013
Neg logit contributions
kin-0.011
pan-0.010
aver-0.010
 happens-0.009
 teaches-0.009
Token.
Token ID13
Feature activation-3.84e-01

Pos logit contributions
years+0.039
but+0.036
What+0.035
 after+0.035
to+0.035
Neg logit contributions
kin-0.057
pan-0.056
aver-0.053
 tray-0.050
 bowl-0.050
Token Katie
Token ID24721
Feature activation-7.09e-02

Pos logit contributions
years+0.010
but+0.008
unct+0.008
bel+0.008
 after+0.008
Neg logit contributions
kin-0.012
pan-0.011
 bowl-0.011
 tray-0.011
aver-0.011
Token could
Token ID714
Feature activation-1.04e+00

Pos logit contributions
years+0.002
 after+0.001
but+0.001
 him+0.001
to+0.001
Neg logit contributions
kin-0.002
pan-0.002
aver-0.002
 tray-0.002
 bowl-0.002
Token write
Token ID3551
Feature activation-1.51e+00

Pos logit contributions
years+0.029
and+0.027
but+0.025
Ho+0.025
to+0.025
Neg logit contributions
kin-0.031
pan-0.030
aver-0.028
 bowl-0.028
 tray-0.028
Token poems
Token ID31888
Feature activation-3.14e+00

Pos logit contributions
years+0.040
unct+0.037
 afford+0.035
but+0.035
and+0.034
Neg logit contributions
kin-0.047
pan-0.043
 tray-0.042
 teaches-0.042
 happens-0.041
Token!
Token ID0
Feature activation-5.11e-01

Pos logit contributions
years+0.085
unct+0.077
Ho+0.073
What+0.070
ind+0.070
Neg logit contributions
kin-0.102
pan-0.098
case-0.091
 teaches-0.091
 tray-0.089
Token One
Token ID1881
Feature activation-1.07e-01

Pos logit contributions
years+0.009
but+0.008
What+0.008
 after+0.008
to+0.007
Neg logit contributions
kin-0.020
pan-0.019
aver-0.019
 bowl-0.018
 tray-0.018
Token day
Token ID1110
Feature activation+2.55e-01

Pos logit contributions
years+0.003
but+0.003
What+0.003
to+0.003
and+0.003
Neg logit contributions
kin-0.003
aver-0.003
pan-0.003
 bowl-0.003
 tray-0.003
Token someone
Token ID2130
Feature activation-3.73e-01

Pos logit contributions
kin+0.006
pan+0.005
 happens+0.005
 bowl+0.005
aver+0.005
Neg logit contributions
years-0.009
and-0.008
but-0.008
What-0.008
unct-0.008
Token tried
Token ID3088
Feature activation+4.52e-01

Pos logit contributions
years+0.012
Ho+0.011
unct+0.011
 int+0.011
but+0.011
Neg logit contributions
kin-0.009
aver-0.009
 happens-0.009
pan-0.008
 bowl-0.008
Token-
Token ID12
Feature activation-2.06e-03

Pos logit contributions
years+0.025
What+0.021
minded+0.020
 least+0.020
Why+0.020
Neg logit contributions
kin-0.018
pan-0.017
 tray-0.017
 sheet-0.016
ves-0.016
Tokenup
Token ID929
Feature activation-1.41e+00

Pos logit contributions
years+0.000
 int+0.000
Why+0.000
What+0.000
 least+0.000
Neg logit contributions
aver-0.000
kin-0.000
ves-0.000
pan-0.000
case-0.000
Token and
Token ID290
Feature activation-6.58e-01

Pos logit contributions
years+0.039
What+0.036
to+0.035
unct+0.035
Ho+0.033
Neg logit contributions
kin-0.046
pan-0.041
 tray-0.040
aver-0.039
 teaches-0.038
Token make
Token ID787
Feature activation-1.17e+00

Pos logit contributions
years+0.018
but+0.018
to+0.017
and+0.017
minded+0.017
Neg logit contributions
kin-0.020
pan-0.019
aver-0.018
 bowl-0.017
 sheet-0.017
Token-
Token ID12
Feature activation+4.56e-02

Pos logit contributions
years+0.035
but+0.034
 afford+0.033
itating+0.031
unct+0.031
Neg logit contributions
kin-0.030
pan-0.028
 sheet-0.027
aver-0.027
 bowl-0.027
Tokenbel
Token ID6667
Feature activation-3.13e+00

Pos logit contributions
aver+0.001
kin+0.001
pan+0.001
ves+0.001
 block+0.001
Neg logit contributions
 int-0.001
 curiously-0.001
itating-0.001
What-0.001
unct-0.001
Tokenieve
Token ID12311
Feature activation-3.47e-01

Pos logit contributions
years+0.088
but+0.083
to+0.081
 after+0.081
What+0.080
Neg logit contributions
kin-0.087
pan-0.082
aver-0.080
 teaches-0.076
 happens-0.073
Token.
Token ID13
Feature activation-9.93e-01

Pos logit contributions
years+0.008
unct+0.008
but+0.008
What+0.008
to+0.008
Neg logit contributions
kin-0.012
pan-0.011
aver-0.010
 tray-0.010
 bowl-0.010
Token One
Token ID1881
Feature activation-5.97e-02

Pos logit contributions
years+0.022
but+0.021
What+0.020
 after+0.020
 him+0.020
Neg logit contributions
kin-0.034
pan-0.033
aver-0.032
 tray-0.030
 bowl-0.029
Token day
Token ID1110
Feature activation-1.60e-01

Pos logit contributions
years+0.002
but+0.001
What+0.001
 after+0.001
to+0.001
Neg logit contributions
kin-0.002
pan-0.002
aver-0.002
 tray-0.002
 bowl-0.002
Token,
Token ID11
Feature activation-2.74e-01

Pos logit contributions
years+0.005
but+0.004
 after+0.004
What+0.004
to+0.004
Neg logit contributions
kin-0.005
pan-0.004
aver-0.004
 tray-0.004
 bowl-0.004
Token He
Token ID679
Feature activation-9.82e-01

Pos logit contributions
years+0.028
but+0.025
to+0.024
 after+0.024
What+0.024
Neg logit contributions
kin-0.038
pan-0.037
aver-0.036
 bowl-0.034
 tray-0.034
Token loved
Token ID6151
Feature activation-3.56e-02

Pos logit contributions
years+0.030
but+0.029
What+0.028
and+0.028
 after+0.027
Neg logit contributions
kin-0.026
pan-0.024
aver-0.023
 bowl-0.021
 tray-0.021
Token playing
Token ID2712
Feature activation+1.09e-01

Pos logit contributions
years+0.001
but+0.001
itating+0.001
Ho+0.001
unct+0.001
Neg logit contributions
pan-0.001
 bowl-0.001
 snake-0.001
 tray-0.001
kin-0.001
Token make
Token ID787
Feature activation-8.93e-01

Pos logit contributions
kin+0.004
pan+0.004
aver+0.004
 tray+0.004
 bowl+0.004
Neg logit contributions
years-0.002
but-0.002
to-0.002
unct-0.002
What-0.002
Token-
Token ID12
Feature activation+6.46e-02

Pos logit contributions
years+0.025
but+0.023
What+0.023
 afford+0.022
unct+0.022
Neg logit contributions
kin-0.025
pan-0.024
aver-0.024
 tray-0.022
 bowl-0.022
Tokenbel
Token ID6667
Feature activation-3.12e+00

Pos logit contributions
aver+0.002
pan+0.002
kin+0.002
 block+0.002
case+0.002
Neg logit contributions
 after-0.001
 int-0.001
Ho-0.001
unct-0.001
What-0.001
Tokenieve
Token ID12311
Feature activation-1.05e-01

Pos logit contributions
years+0.084
but+0.081
 after+0.081
to+0.080
ory+0.080
Neg logit contributions
kin-0.085
pan-0.079
 teaches-0.079
aver-0.079
 happens-0.072
Token with
Token ID351
Feature activation-5.27e-01

Pos logit contributions
years+0.003
but+0.002
What+0.002
to+0.002
unct+0.002
Neg logit contributions
kin-0.003
pan-0.003
aver-0.003
 tray-0.003
 bowl-0.003
Token his
Token ID465
Feature activation-2.04e-01

Pos logit contributions
years+0.012
but+0.011
What+0.010
to+0.010
 after+0.010
Neg logit contributions
kin-0.018
pan-0.018
aver-0.017
 bowl-0.016
 tray-0.016
Token friends
Token ID2460
Feature activation-1.65e+00

Pos logit contributions
years+0.007
but+0.007
What+0.006
 after+0.006
to+0.006
Neg logit contributions
kin-0.005
pan-0.004
aver-0.004
 tray-0.004
 bowl-0.004
Token.
Token ID13
Feature activation-6.16e-01

Pos logit contributions
years+0.042
but+0.038
What+0.038
to+0.037
unct+0.037
Neg logit contributions
kin-0.052
pan-0.050
aver-0.048
 tray-0.045
 bowl-0.045
Token slipped
Token ID18859
Feature activation-2.21e-01

Pos logit contributions
kin+0.012
pan+0.012
aver+0.011
 tray+0.011
 bowl+0.010
Neg logit contributions
 after-0.010
but-0.010
 him-0.010
years-0.010
 afford-0.009
Token away
Token ID1497
Feature activation-4.43e-01

Pos logit contributions
years+0.004
but+0.004
What+0.004
to+0.003
and+0.003
Neg logit contributions
kin-0.008
pan-0.008
aver-0.008
 tray-0.008
 bowl-0.008
Token.
Token ID13
Feature activation-2.79e-01

Pos logit contributions
years+0.014
to+0.012
What+0.011
minded+0.011
unct+0.011
Neg logit contributions
kin-0.013
pan-0.013
aver-0.011
 bowl-0.011
 tray-0.011
Token Emma
Token ID18966
Feature activation+3.19e-02

Pos logit contributions
years+0.008
but+0.007
What+0.007
 after+0.007
to+0.007
Neg logit contributions
kin-0.008
pan-0.008
aver-0.008
 tray-0.007
 bowl-0.007
Token's
Token ID338
Feature activation-3.52e-01

Pos logit contributions
kin+0.001
pan+0.001
aver+0.001
 tray+0.001
 teaches+0.001
Neg logit contributions
years-0.001
 after-0.001
but-0.001
 him-0.001
to-0.001
Token pap
Token ID20461
Feature activation-3.12e+00

Pos logit contributions
years+0.009
 int+0.009
 him+0.009
 after+0.009
but+0.009
Neg logit contributions
kin-0.010
pan-0.010
aver-0.009
 teaches-0.009
 tray-0.009
Tokena
Token ID64
Feature activation-3.30e-01

Pos logit contributions
years+0.081
 after+0.076
What+0.076
 him+0.075
but+0.075
Neg logit contributions
kin-0.099
aver-0.093
pan-0.090
 teaches-0.082
 bowl-0.080
Token quickly
Token ID2952
Feature activation-1.29e-01

Pos logit contributions
years+0.008
 after+0.008
 him+0.007
but+0.007
to+0.007
Neg logit contributions
kin-0.011
pan-0.010
 tray-0.009
aver-0.009
 teaches-0.009
Token grabbed
Token ID13176
Feature activation-1.05e+00

Pos logit contributions
years+0.003
 after+0.003
 him+0.003
 afford+0.003
to+0.003
Neg logit contributions
kin-0.004
pan-0.004
aver-0.003
 tray-0.003
 teaches-0.003
Token it
Token ID340
Feature activation-9.29e-01

Pos logit contributions
years+0.020
but+0.018
and+0.017
What+0.017
to+0.017
Neg logit contributions
kin-0.040
pan-0.039
aver-0.037
 tray-0.036
 bowl-0.036
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
years+0.022
What+0.020
but+0.019
to+0.019
what+0.019
Neg logit contributions
kin-0.031
pan-0.030
aver-0.029
 bowl-0.028
 tray-0.028
Token thank
Token ID5875
Feature activation-1.16e+00

Pos logit contributions
years+0.022
 after+0.021
but+0.021
What+0.021
to+0.021
Neg logit contributions
kin-0.017
pan-0.016
aver-0.015
 tray-0.013
 happens-0.013
Token you
Token ID345
Feature activation-1.25e+00

Pos logit contributions
years+0.022
but+0.019
What+0.019
and+0.019
 afford+0.019
Neg logit contributions
kin-0.044
pan-0.041
aver-0.040
 happens-0.038
 bowl-0.037
Token gift
Token ID6979
Feature activation-1.10e+00

Pos logit contributions
years+0.033
What+0.032
but+0.031
 afford+0.030
unct+0.029
Neg logit contributions
kin-0.036
pan-0.034
aver-0.033
 tray-0.032
 teaches-0.032
Token.
Token ID13
Feature activation-3.19e-01

Pos logit contributions
years+0.033
What+0.029
unct+0.028
but+0.028
to+0.028
Neg logit contributions
kin-0.032
pan-0.030
aver-0.028
 tray-0.027
 bowl-0.026
Token After
Token ID2293
Feature activation-5.77e-01

Pos logit contributions
years+0.006
but+0.006
What+0.006
 after+0.005
to+0.005
Neg logit contributions
kin-0.012
pan-0.011
aver-0.011
 tray-0.010
 bowl-0.010
Token one
Token ID530
Feature activation-3.12e+00

Pos logit contributions
itating+0.016
and+0.016
but+0.015
to+0.015
minded+0.015
Neg logit contributions
kin-0.016
pan-0.015
 bowl-0.014
aver-0.014
 tray-0.014
Token hour
Token ID1711
Feature activation-1.77e+00

Pos logit contributions
years+0.099
but+0.091
What+0.089
to+0.088
 afford+0.087
Neg logit contributions
kin-0.085
pan-0.076
aver-0.075
 bowl-0.072
 tray-0.072
Token,
Token ID11
Feature activation+1.29e-01

Pos logit contributions
years+0.052
but+0.048
What+0.048
to+0.047
 after+0.047
Neg logit contributions
kin-0.049
pan-0.047
aver-0.045
 bowl-0.041
 tray-0.041
Token the
Token ID262
Feature activation-4.21e-01

Pos logit contributions
kin+0.003
pan+0.003
 bowl+0.003
 tray+0.003
 tree+0.003
Neg logit contributions
years-0.004
and-0.004
to-0.004
ind-0.004
 afford-0.004
Token girl
Token ID2576
Feature activation+4.00e-01

Pos logit contributions
years+0.016
but+0.015
What+0.015
 after+0.015
to+0.015
Neg logit contributions
kin-0.008
pan-0.008
aver-0.007
 tray-0.007
 bowl-0.007
Token waited
Token ID13488
Feature activation-3.42e-01

Pos logit contributions
kin+0.013
pan+0.013
aver+0.012
 tray+0.011
 happens+0.011
Neg logit contributions
years-0.009
 after-0.009
but-0.009
to-0.009
 him-0.009
Token a
Token ID257
Feature activation-6.44e-01

Pos logit contributions
years+0.009
but+0.009
to+0.008
and+0.008
unct+0.008
Neg logit contributions
kin-0.012
pan-0.012
aver-0.011
 tray-0.011
 bowl-0.011
Token world
Token ID995
Feature activation-2.31e-01

Pos logit contributions
years+0.020
but+0.018
What+0.018
 after+0.018
to+0.018
Neg logit contributions
kin-0.017
pan-0.016
aver-0.016
 tray-0.015
 bowl-0.014
Token of
Token ID286
Feature activation-8.51e-01

Pos logit contributions
years+0.005
but+0.005
 after+0.005
and+0.005
What+0.005
Neg logit contributions
kin-0.008
pan-0.007
aver-0.007
 bowl-0.007
 tray-0.007
Token make
Token ID787
Feature activation-1.01e+00

Pos logit contributions
years+0.024
but+0.024
to+0.022
 afford+0.022
unct+0.022
Neg logit contributions
kin-0.024
pan-0.022
 tray-0.021
aver-0.021
 bowl-0.021
Token-
Token ID12
Feature activation-9.74e-02

Pos logit contributions
years+0.026
but+0.025
What+0.024
 afford+0.023
unct+0.023
Neg logit contributions
kin-0.031
pan-0.030
aver-0.028
 tray-0.028
 teaches-0.027
Tokenbel
Token ID6667
Feature activation-3.12e+00

Pos logit contributions
What+0.001
amb+0.001
but+0.001
ory+0.001
 least+0.001
Neg logit contributions
aver-0.003
pan-0.003
kin-0.003
case-0.003
 happens-0.003
Tokenieve
Token ID12311
Feature activation-3.81e-01

Pos logit contributions
years+0.083
but+0.079
to+0.078
 after+0.077
What+0.077
Neg logit contributions
kin-0.089
pan-0.084
aver-0.081
 teaches-0.081
 happens-0.076
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
years+0.010
but+0.009
What+0.009
to+0.009
 after+0.009
Neg logit contributions
kin-0.012
pan-0.011
aver-0.011
 tray-0.010
 bowl-0.010
Token\n
Token ID198
Feature activation-5.41e-01

Pos logit contributions
years+0.018
but+0.016
What+0.015
 after+0.015
and+0.015
Neg logit contributions
kin-0.051
pan-0.049
aver-0.048
 tray-0.045
 bowl-0.044
Token\n
Token ID198
Feature activation-7.88e-01

Pos logit contributions
years+0.012
but+0.011
What+0.010
 after+0.010
to+0.010
Neg logit contributions
kin-0.019
pan-0.019
aver-0.018
 tray-0.017
 bowl-0.017
TokenShe
Token ID3347
Feature activation-3.07e-02

Pos logit contributions
years+0.013
but+0.011
 after+0.010
 him+0.010
to+0.010
Neg logit contributions
kin-0.032
pan-0.031
aver-0.031
 tray-0.029
 bowl-0.029
Token returned
Token ID4504
Feature activation+8.05e-01

Pos logit contributions
kin+0.018
pan+0.017
aver+0.016
 tray+0.016
 teaches+0.016
Neg logit contributions
years-0.016
 after-0.015
but-0.015
 him-0.015
to-0.014
Token to
Token ID284
Feature activation+1.20e+00

Pos logit contributions
kin+0.027
pan+0.027
aver+0.025
 bowl+0.025
 tray+0.024
Neg logit contributions
years-0.019
What-0.016
but-0.016
unct-0.016
Ho-0.016
Token the
Token ID262
Feature activation+3.52e-01

Pos logit contributions
pan+0.031
kin+0.031
 bowl+0.029
 tray+0.029
aver+0.028
Neg logit contributions
years-0.036
but-0.035
and-0.033
to-0.031
Ho-0.031
Token books
Token ID3835
Feature activation+1.02e+00

Pos logit contributions
kin+0.007
 tray+0.007
 bowl+0.007
pan+0.007
 sheet+0.007
Neg logit contributions
years-0.012
but-0.012
to-0.011
and-0.011
itating-0.011
Tokenhe
Token ID258
Feature activation+1.54e+00

Pos logit contributions
kin+0.037
pan+0.035
aver+0.034
 tray+0.032
 bowl+0.032
Neg logit contributions
years-0.022
but-0.019
What-0.019
to-0.019
 after-0.018
Tokenlf
Token ID1652
Feature activation+2.48e+00

Pos logit contributions
kin+0.012
pan+0.009
aver+0.007
 tray+0.005
 bowl+0.005
Neg logit contributions
years-0.078
but-0.074
What-0.074
to-0.073
and-0.072
Token.
Token ID13
Feature activation+6.13e-01

Pos logit contributions
kin+0.084
pan+0.082
aver+0.078
 tray+0.074
 bowl+0.074
Neg logit contributions
years-0.057
What-0.053
but-0.051
to-0.049
what-0.049
Token\n
Token ID198
Feature activation+6.25e-01

Pos logit contributions
kin+0.017
pan+0.017
aver+0.016
 bowl+0.016
 tray+0.015
Neg logit contributions
years-0.017
unct-0.015
but-0.015
 afford-0.015
and-0.015
Token\n
Token ID198
Feature activation+6.03e-01

Pos logit contributions
kin+0.022
pan+0.021
aver+0.020
 tray+0.019
 bowl+0.019
Neg logit contributions
years-0.014
but-0.013
What-0.013
 after-0.012
to-0.012
TokenThe
Token ID464
Feature activation-9.79e-02

Pos logit contributions
kin+0.022
aver+0.022
pan+0.021
 teaches+0.020
 tray+0.020
Neg logit contributions
years-0.011
unct-0.010
 after-0.010
 him-0.010
but-0.010
Token moral
Token ID6573
Feature activation-7.26e-01

Pos logit contributions
years+0.003
but+0.003
to+0.003
and+0.003
 after+0.003
Neg logit contributions
kin-0.003
aver-0.002
pan-0.002
 tray-0.002
 bowl-0.002
Token was
Token ID373
Feature activation+6.61e-01

Pos logit contributions
kin+0.005
pan+0.005
aver+0.005
 tray+0.004
 bowl+0.004
Neg logit contributions
years-0.005
but-0.005
What-0.005
 after-0.005
 him-0.005
Token a
Token ID257
Feature activation+1.49e-01

Pos logit contributions
kin+0.020
pan+0.019
aver+0.019
 bowl+0.018
 tray+0.018
Neg logit contributions
years-0.017
but-0.016
to-0.015
unct-0.015
What-0.015
Token girl
Token ID2576
Feature activation+1.34e-01

Pos logit contributions
pan+0.004
kin+0.004
aver+0.004
 happens+0.004
 teaches+0.004
Neg logit contributions
years-0.004
What-0.004
 after-0.004
but-0.004
Ho-0.004
Token named
Token ID3706
Feature activation-5.70e-01

Pos logit contributions
kin+0.003
pan+0.003
aver+0.003
 tray+0.003
 bowl+0.003
Neg logit contributions
years-0.004
but-0.004
What-0.004
 after-0.004
to-0.004
Token Mag
Token ID2944
Feature activation+1.73e+00

Pos logit contributions
years+0.015
 after+0.014
but+0.014
What+0.014
and+0.014
Neg logit contributions
kin-0.018
pan-0.018
aver-0.017
 happens-0.015
 tray-0.015
Tokengy
Token ID1360
Feature activation+2.56e+00

Pos logit contributions
kin+0.061
pan+0.058
aver+0.056
 tray+0.054
 bowl+0.054
Neg logit contributions
years-0.039
but-0.036
to-0.035
 after-0.035
What-0.035
Token.
Token ID13
Feature activation-7.72e-01

Pos logit contributions
kin+0.064
aver+0.062
pan+0.060
 bowl+0.058
 tray+0.057
Neg logit contributions
years-0.079
but-0.072
to-0.071
What-0.071
and-0.070
Token Mag
Token ID2944
Feature activation+1.34e+00

Pos logit contributions
years+0.014
but+0.012
to+0.011
and+0.011
What+0.011
Neg logit contributions
kin-0.030
pan-0.029
aver-0.028
 happens-0.027
 tray-0.027
Tokengy
Token ID1360
Feature activation+2.73e+00

Pos logit contributions
kin+0.047
pan+0.045
aver+0.043
 tray+0.042
 bowl+0.042
Neg logit contributions
years-0.031
but-0.029
to-0.028
What-0.028
what-0.027
Token was
Token ID373
Feature activation+2.14e-01

Pos logit contributions
kin+0.078
pan+0.074
aver+0.072
 tray+0.066
 bowl+0.066
Neg logit contributions
years-0.077
but-0.072
What-0.070
 after-0.069
to-0.069
Token very
Token ID845
Feature activation+3.49e-01

Pos logit contributions
aver+0.006
kin+0.006
 bowl+0.005
 tray+0.005
pan+0.005
Neg logit contributions
minded-0.006
but-0.006
unct-0.006
to-0.006
and-0.006
Token played
Token ID2826
Feature activation+1.03e+00

Pos logit contributions
kin+0.052
pan+0.051
aver+0.047
 tray+0.046
 bowl+0.045
Neg logit contributions
years-0.039
 after-0.036
 him-0.036
What-0.035
but-0.035
Token together
Token ID1978
Feature activation+3.38e-01

Pos logit contributions
kin+0.049
pan+0.049
aver+0.047
 bowl+0.045
 tray+0.045
Neg logit contributions
years-0.009
but-0.007
 after-0.007
What-0.007
 him-0.007
Token with
Token ID351
Feature activation+1.06e+00

Pos logit contributions
kin+0.013
pan+0.012
aver+0.012
 happens+0.011
 tray+0.011
Neg logit contributions
 after-0.006
years-0.006
but-0.006
 him-0.006
and-0.006
Token Lily
Token ID20037
Feature activation+4.63e-01

Pos logit contributions
kin+0.040
pan+0.039
aver+0.038
 tray+0.036
 bowl+0.036
Neg logit contributions
years-0.020
but-0.018
What-0.018
 after-0.018
 him-0.017
Token's
Token ID338
Feature activation+6.10e-01

Pos logit contributions
kin+0.016
pan+0.015
aver+0.015
 tray+0.014
 happens+0.014
Neg logit contributions
years-0.011
 after-0.010
 him-0.010
and-0.010
but-0.010
Token pink
Token ID11398
Feature activation+2.65e+00

Pos logit contributions
pan+0.016
kin+0.016
aver+0.016
 happens+0.014
 teaches+0.014
Neg logit contributions
years-0.017
 him-0.017
 after-0.017
 afford-0.016
and-0.016
Token belt
Token ID10999
Feature activation+1.28e+00

Pos logit contributions
kin+0.055
aver+0.055
pan+0.052
 happens+0.046
 teaches+0.046
Neg logit contributions
 after-0.090
 him-0.089
years-0.086
 afford-0.086
but-0.085
Token.
Token ID13
Feature activation+5.32e-01

Pos logit contributions
kin+0.047
aver+0.045
pan+0.045
 tray+0.042
 bowl+0.042
Neg logit contributions
 after-0.024
but-0.024
years-0.023
 him-0.023
and-0.022
Token From
Token ID3574
Feature activation+6.17e-01

Pos logit contributions
kin+0.017
pan+0.017
aver+0.016
 teaches+0.016
 bowl+0.015
Neg logit contributions
years-0.013
unct-0.011
but-0.011
and-0.011
What-0.011
Token that
Token ID326
Feature activation-1.74e-01

Pos logit contributions
kin+0.021
pan+0.020
aver+0.020
 bowl+0.019
 tray+0.019
Neg logit contributions
years-0.014
and-0.013
but-0.012
to-0.012
What-0.012
Token day
Token ID1110
Feature activation-5.57e-01

Pos logit contributions
years+0.004
but+0.004
 after+0.004
What+0.004
 him+0.004
Neg logit contributions
kin-0.006
pan-0.005
aver-0.005
 tray-0.005
 bowl-0.005
Token Sam
Token ID3409
Feature activation-6.28e-01

Pos logit contributions
years+0.004
but+0.003
and+0.003
to+0.003
bel+0.003
Neg logit contributions
kin-0.003
aver-0.003
 bowl-0.002
pan-0.002
 teaches-0.002
Token.
Token ID13
Feature activation-1.99e-01

Pos logit contributions
years+0.019
What+0.018
but+0.018
to+0.017
what+0.017
Neg logit contributions
kin-0.017
pan-0.016
aver-0.015
 bowl-0.015
 tray-0.015
Token He
Token ID679
Feature activation+2.22e-01

Pos logit contributions
years+0.005
unct+0.005
 afford+0.005
to+0.005
and+0.005
Neg logit contributions
kin-0.006
aver-0.006
pan-0.006
 bowl-0.005
 teaches-0.005
Token learned
Token ID4499
Feature activation-1.46e+00

Pos logit contributions
kin+0.007
pan+0.007
aver+0.007
 tray+0.006
 bowl+0.006
Neg logit contributions
years-0.005
 after-0.005
What-0.005
 him-0.005
but-0.005
Token that
Token ID326
Feature activation+1.44e-01

Pos logit contributions
years+0.038
but+0.034
unct+0.033
and+0.033
Ho+0.032
Neg logit contributions
pan-0.044
kin-0.044
aver-0.042
 teaches-0.040
 happens-0.040
Token black
Token ID2042
Feature activation+2.55e+00

Pos logit contributions
kin+0.005
pan+0.005
aver+0.005
 tray+0.005
 bowl+0.005
Neg logit contributions
years-0.003
but-0.003
 after-0.003
 him-0.003
What-0.003
Token ink
Token ID16882
Feature activation-1.23e-01

Pos logit contributions
kin+0.081
pan+0.078
aver+0.075
 tray+0.070
 bowl+0.070
Neg logit contributions
years-0.064
but-0.060
What-0.058
 after-0.058
to-0.057
Token was
Token ID373
Feature activation-9.58e-01

Pos logit contributions
 him+0.003
 after+0.003
but+0.003
years+0.003
and+0.003
Neg logit contributions
aver-0.003
pan-0.003
kin-0.003
 tray-0.003
 bowl-0.003
Token not
Token ID407
Feature activation+2.17e-01

Pos logit contributions
years+0.023
but+0.021
What+0.021
 after+0.021
 him+0.021
Neg logit contributions
kin-0.031
pan-0.030
aver-0.029
 tray-0.027
 bowl-0.027
Token for
Token ID329
Feature activation+3.17e-01

Pos logit contributions
kin+0.007
pan+0.007
aver+0.006
 tray+0.006
 bowl+0.006
Neg logit contributions
years-0.006
but-0.005
What-0.005
to-0.005
and-0.005
Token drawing
Token ID8263
Feature activation+2.71e-01

Pos logit contributions
kin+0.009
pan+0.008
aver+0.008
 bowl+0.008
 snake+0.008
Neg logit contributions
but-0.009
years-0.008
to-0.008
What-0.008
itating-0.008
Token years
Token ID812
Feature activation+1.63e+00

Pos logit contributions
pan+0.058
kin+0.058
aver+0.053
 happens+0.052
 tray+0.051
Neg logit contributions
years-0.020
 after-0.017
and-0.016
 him-0.015
but-0.015
Token old
Token ID1468
Feature activation+1.78e+00

Pos logit contributions
kin+0.046
pan+0.043
aver+0.039
 happens+0.037
 teaches+0.036
Neg logit contributions
years-0.049
 after-0.047
and-0.045
 int-0.045
 him-0.044
Token and
Token ID290
Feature activation+2.15e-01

Pos logit contributions
kin+0.050
aver+0.047
pan+0.047
 bowl+0.047
 tray+0.045
Neg logit contributions
years-0.051
to-0.047
unct-0.046
but-0.046
 int-0.045
Token he
Token ID339
Feature activation-9.32e-01

Pos logit contributions
kin+0.007
pan+0.007
aver+0.007
 tray+0.006
 bowl+0.006
Neg logit contributions
years-0.005
but-0.005
What-0.005
 after-0.005
 him-0.005
Token loved
Token ID6151
Feature activation+3.96e-01

Pos logit contributions
years+0.024
but+0.022
What+0.021
 after+0.021
to+0.021
Neg logit contributions
kin-0.030
pan-0.029
aver-0.027
 tray-0.026
 bowl-0.026
Token aer
Token ID9551
Feature activation+2.52e+00

Pos logit contributions
pan+0.013
kin+0.013
 bowl+0.013
aver+0.013
 tray+0.012
Neg logit contributions
years-0.008
but-0.008
itating-0.007
unct-0.007
What-0.007
Tokenopl
Token ID20106
Feature activation+1.30e+00

Pos logit contributions
kin+0.069
pan+0.066
 tray+0.058
aver+0.057
 bowl+0.056
Neg logit contributions
years-0.081
but-0.074
 him-0.074
and-0.074
to-0.072
Tokenanes
Token ID7305
Feature activation+2.58e-01

Pos logit contributions
kin+0.031
pan+0.030
aver+0.028
 bowl+0.025
 tray+0.025
Neg logit contributions
years-0.042
What-0.040
 after-0.039
but-0.039
and-0.038
Token!
Token ID0
Feature activation-5.99e-01

Pos logit contributions
kin+0.007
pan+0.006
aver+0.006
 bowl+0.006
 tray+0.006
Neg logit contributions
years-0.008
What-0.008
to-0.007
but-0.007
unct-0.007
Token\n
Token ID198
Feature activation+2.84e-01

Pos logit contributions
years+0.012
 him+0.012
 after+0.012
What+0.012
but+0.012
Neg logit contributions
kin-0.021
pan-0.021
aver-0.020
 tray-0.018
 happens-0.018
Token\n
Token ID198
Feature activation+1.05e-01

Pos logit contributions
kin+0.009
aver+0.008
pan+0.008
 bowl+0.008
 tray+0.008
Neg logit contributions
years-0.007
unct-0.006
but-0.006
 after-0.006
What-0.006
Token you
Token ID345
Feature activation-5.05e-01

Pos logit contributions
years+0.001
but+0.001
What+0.001
 afford+0.001
to+0.001
Neg logit contributions
kin-0.004
pan-0.004
aver-0.004
 tray-0.004
 teaches-0.004
Token all
Token ID477
Feature activation+2.42e-04

Pos logit contributions
years+0.009
 after+0.009
but+0.009
and+0.009
 him+0.009
Neg logit contributions
kin-0.019
pan-0.019
aver-0.018
 bowl-0.017
 tray-0.017
Token about
Token ID546
Feature activation-1.67e-01

Pos logit contributions
kin+0.000
pan+0.000
aver+0.000
 bowl+0.000
 tray+0.000
Neg logit contributions
years-0.000
but-0.000
 after-0.000
What-0.000
and-0.000
Token jokes
Token ID14532
Feature activation-4.12e-01

Pos logit contributions
years+0.004
but+0.003
What+0.003
 after+0.003
 him+0.003
Neg logit contributions
kin-0.006
pan-0.006
aver-0.005
 tray-0.005
 bowl-0.005
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
years+0.010
but+0.010
 after+0.010
What+0.010
and+0.010
Neg logit contributions
kin-0.013
pan-0.012
aver-0.012
 bowl-0.011
 tray-0.011
Token They
Token ID1119
Feature activation+5.24e-01

Pos logit contributions
kin+0.001
pan+0.001
aver+0.001
 tray+0.001
 bowl+0.001
Neg logit contributions
years-0.000
but-0.000
What-0.000
 after-0.000
to-0.000
Token can
Token ID460
Feature activation+1.73e-01

Pos logit contributions
kin+0.016
pan+0.015
aver+0.015
 tray+0.014
 teaches+0.014
Neg logit contributions
years-0.014
but-0.013
to-0.012
What-0.012
and-0.012
Token make
Token ID787
Feature activation-4.03e-01

Pos logit contributions
kin+0.005
pan+0.005
 teaches+0.005
aver+0.005
 snake+0.005
Neg logit contributions
years-0.005
and-0.004
but-0.004
to-0.004
 after-0.004
Token people
Token ID661
Feature activation-8.10e-01

Pos logit contributions
years+0.010
but+0.010
 afford+0.009
What+0.009
what+0.009
Neg logit contributions
kin-0.012
pan-0.012
 teaches-0.011
 happens-0.011
aver-0.011
Token feel
Token ID1254
Feature activation+2.35e-01

Pos logit contributions
years+0.023
but+0.020
to+0.020
What+0.019
unct+0.019
Neg logit contributions
kin-0.024
pan-0.023
aver-0.022
 teaches-0.021
 bowl-0.021
Token happy
Token ID3772
Feature activation-7.13e-01

Pos logit contributions
kin+0.009
pan+0.008
aver+0.008
 tray+0.008
 bowl+0.007
Neg logit contributions
years-0.005
 after-0.005
but-0.004
What-0.004
 him-0.004
Token plants
Token ID6134
Feature activation-6.71e-01

Pos logit contributions
years+0.008
but+0.008
What+0.008
 after+0.008
to+0.008
Neg logit contributions
kin-0.008
pan-0.008
aver-0.007
 tray-0.007
 bowl-0.007
Token in
Token ID287
Feature activation-9.77e-01

Pos logit contributions
years+0.014
but+0.013
 after+0.013
What+0.013
 him+0.012
Neg logit contributions
kin-0.024
pan-0.023
aver-0.022
 tray-0.021
 bowl-0.021
Token the
Token ID262
Feature activation-5.37e-01

Pos logit contributions
 him+0.017
 after+0.017
years+0.015
What+0.015
Why+0.014
Neg logit contributions
kin-0.038
pan-0.038
aver-0.037
 happens-0.035
ool-0.033
Token area
Token ID1989
Feature activation+3.47e-01

Pos logit contributions
years+0.023
but+0.022
to+0.022
and+0.022
 him+0.021
Neg logit contributions
kin-0.008
 tray-0.007
aver-0.007
 bowl-0.007
pan-0.006
Token were
Token ID547
Feature activation-6.77e-01

Pos logit contributions
kin+0.012
pan+0.012
aver+0.011
 tray+0.011
 bowl+0.010
Neg logit contributions
years-0.008
but-0.007
 after-0.007
 him-0.007
and-0.007
Token very
Token ID845
Feature activation+4.97e-01

Pos logit contributions
years+0.012
 after+0.011
 him+0.010
 quite+0.010
 knowing+0.010
Neg logit contributions
kin-0.027
pan-0.026
aver-0.024
 happens-0.024
 tray-0.024
Token happy
Token ID3772
Feature activation-8.26e-01

Pos logit contributions
kin+0.016
pan+0.015
aver+0.015
 bowl+0.014
 tray+0.014
Neg logit contributions
years-0.013
but-0.012
What-0.011
to-0.011
and-0.011
Token.
Token ID13
Feature activation-3.14e-01

Pos logit contributions
years+0.024
unct+0.021
Ho+0.021
What+0.021
to+0.020
Neg logit contributions
kin-0.024
pan-0.023
aver-0.021
 bowl-0.021
 snake-0.020
Token They
Token ID1119
Feature activation+5.03e-01

Pos logit contributions
years+0.006
but+0.005
 after+0.005
What+0.005
 him+0.005
Neg logit contributions
kin-0.012
pan-0.012
aver-0.011
 tray-0.011
 bowl-0.011
Token were
Token ID547
Feature activation-2.80e-01

Pos logit contributions
kin+0.017
pan+0.016
aver+0.015
 tray+0.015
 bowl+0.015
Neg logit contributions
years-0.012
 after-0.011
 him-0.011
but-0.010
What-0.010
Token not
Token ID407
Feature activation+4.97e-01

Pos logit contributions
years+0.004
 after+0.004
 him+0.004
but+0.004
 knowing+0.003
Neg logit contributions
kin-0.012
pan-0.011
aver-0.011
 happens-0.011
 tray-0.011
Token �
Token ID6184
Feature activation-8.33e-01

Pos logit contributions
kin+0.001
pan+0.001
aver+0.001
 tray+0.001
 bowl+0.001
Neg logit contributions
years-0.001
but-0.001
What-0.001
to-0.001
and-0.001
Token�
Token ID95
Feature activation-1.21e+00

Pos logit contributions
years+0.023
and+0.021
but+0.021
to+0.020
What+0.020
Neg logit contributions
kin-0.026
pan-0.024
aver-0.023
 tray-0.022
 bowl-0.022
Token€
Token ID26391
Feature activation+3.26e-01

Pos logit contributions
years+0.029
but+0.027
What+0.027
 after+0.026
to+0.026
Neg logit contributions
kin-0.040
pan-0.038
aver-0.036
 tray-0.035
 bowl-0.035
Token�
Token ID129
Feature activation-6.72e-01

Pos logit contributions
kin+0.012
pan+0.012
aver+0.011
 tray+0.011
 bowl+0.011
Neg logit contributions
but-0.006
years-0.006
to-0.006
 after-0.006
What-0.005
Token�
Token ID241
Feature activation-8.79e-01

Pos logit contributions
years+0.017
but+0.016
What+0.016
to+0.016
 after+0.016
Neg logit contributions
kin-0.021
pan-0.020
aver-0.019
 bowl-0.018
 tray-0.018
TokenHello
Token ID15496
Feature activation+5.57e-01

Pos logit contributions
years+0.021
unct+0.020
 Him+0.018
minded+0.018
 int+0.018
Neg logit contributions
kin-0.028
aver-0.027
pan-0.025
 teaches-0.024
 happens-0.024
Token there
Token ID612
Feature activation+1.18e-01

Pos logit contributions
kin+0.016
pan+0.016
aver+0.015
 tray+0.014
 bowl+0.014
Neg logit contributions
years-0.015
 after-0.014
What-0.014
but-0.014
 him-0.014
Token!
Token ID0
Feature activation-5.23e-01

Pos logit contributions
kin+0.003
pan+0.003
aver+0.003
 tray+0.003
 bowl+0.003
Neg logit contributions
years-0.003
What-0.003
but-0.003
 him-0.003
 after-0.003
Token What
Token ID1867
Feature activation-4.25e-01

Pos logit contributions
years+0.011
but+0.010
What+0.010
 after+0.010
to+0.010
Neg logit contributions
kin-0.019
pan-0.018
aver-0.017
 tray-0.016
 bowl-0.016
Tokenâ
Token ID22940
Feature activation-1.14e-01

Pos logit contributions
 him+0.009
and+0.009
years+0.009
 after+0.009
What+0.009
Neg logit contributions
pan-0.015
kin-0.014
aver-0.014
 tray-0.013
 bowl-0.012
Token€
Token ID26391
Feature activation+2.17e-01

Pos logit contributions
but+0.003
What+0.002
years+0.002
Why+0.002
 after+0.002
Neg logit contributions
pan-0.004
kin-0.004
aver-0.003
 tray-0.003
 broom-0.003
Token a
Token ID257
Feature activation-6.54e-01

Pos logit contributions
kin+0.023
pan+0.022
aver+0.020
 tray+0.020
 bowl+0.020
Neg logit contributions
years-0.021
but-0.019
unct-0.019
and-0.018
What-0.018
Token long
Token ID890
Feature activation+6.77e-02

Pos logit contributions
years+0.018
 after+0.018
but+0.017
 afford+0.017
What+0.017
Neg logit contributions
kin-0.018
pan-0.017
aver-0.017
 happens-0.016
 teaches-0.015
Token drive
Token ID3708
Feature activation+6.46e-01

Pos logit contributions
kin+0.002
 happens+0.002
pan+0.002
 iv+0.002
aver+0.002
Neg logit contributions
 after-0.002
 him-0.002
 afford-0.002
 int-0.002
away-0.002
Token.
Token ID13
Feature activation+1.25e-03

Pos logit contributions
kin+0.022
pan+0.021
aver+0.020
 tray+0.019
 bowl+0.019
Neg logit contributions
years-0.015
but-0.014
 after-0.014
 him-0.013
What-0.013
Token The
Token ID383
Feature activation-1.95e-01

Pos logit contributions
kin+0.000
pan+0.000
aver+0.000
 tray+0.000
 bowl+0.000
Neg logit contributions
years-0.000
but-0.000
What-0.000
 after-0.000
to-0.000
Token motorcycle
Token ID18757
Feature activation+4.36e-01

Pos logit contributions
What+0.006
 after+0.006
 him+0.006
years+0.006
what+0.006
Neg logit contributions
kin-0.005
pan-0.005
 happens-0.004
oon-0.004
 iv-0.004
Token was
Token ID373
Feature activation-3.20e-01

Pos logit contributions
kin+0.014
pan+0.013
aver+0.013
 tray+0.012
 bowl+0.012
Neg logit contributions
 after-0.010
but-0.010
 him-0.010
years-0.010
 afford-0.010
Token happy
Token ID3772
Feature activation-1.78e-01

Pos logit contributions
years+0.004
 after+0.004
 him+0.004
 quite+0.004
 int+0.004
Neg logit contributions
kin-0.014
pan-0.013
 happens-0.013
aver-0.013
 tray-0.013
Token to
Token ID284
Feature activation-5.39e-01

Pos logit contributions
years+0.004
unct+0.003
What+0.003
but+0.003
Ho+0.003
Neg logit contributions
kin-0.006
pan-0.006
aver-0.006
 bowl-0.006
 tray-0.005
Token be
Token ID307
Feature activation+7.02e-01

Pos logit contributions
years+0.012
but+0.011
 after+0.011
What+0.011
 him+0.011
Neg logit contributions
kin-0.018
pan-0.018
aver-0.017
 tray-0.016
 bowl-0.016
Token moving
Token ID3867
Feature activation+5.20e-01

Pos logit contributions
kin+0.025
pan+0.024
 tray+0.023
 happens+0.022
 snake+0.022
Neg logit contributions
 after-0.016
years-0.016
 him-0.015
but-0.014
 during-0.014
Token upon
Token ID2402
Feature activation+6.65e-01

Pos logit contributions
years+0.008
 him+0.007
but+0.006
 after+0.006
to+0.005
Neg logit contributions
kin-0.064
pan-0.063
aver-0.062
 tray-0.059
 teaches-0.058
Token a
Token ID257
Feature activation+9.01e-01

Pos logit contributions
kin+0.019
aver+0.017
pan+0.017
 bowl+0.016
 tray+0.016
Neg logit contributions
years-0.019
but-0.018
unct-0.018
minded-0.017
 int-0.017
Token time
Token ID640
Feature activation-6.88e-01

Pos logit contributions
pan+0.022
kin+0.021
aver+0.018
 happens+0.017
 bowl+0.016
Neg logit contributions
 after-0.029
years-0.029
and-0.028
to-0.028
What-0.028
Token,
Token ID11
Feature activation-1.38e+00

Pos logit contributions
years+0.018
but+0.017
 after+0.017
 him+0.017
What+0.016
Neg logit contributions
kin-0.021
pan-0.020
aver-0.020
 tray-0.018
 bowl-0.018
Token there
Token ID612
Feature activation+2.83e-01

Pos logit contributions
years+0.041
but+0.038
What+0.038
 after+0.038
to+0.037
Neg logit contributions
kin-0.038
pan-0.036
aver-0.034
 tray-0.032
 bowl-0.032
Token was
Token ID373
Feature activation+5.28e-01

Pos logit contributions
pan+0.010
kin+0.009
aver+0.009
 tray+0.008
 teaches+0.008
Neg logit contributions
 him-0.007
What-0.006
years-0.006
 after-0.006
but-0.006
Token a
Token ID257
Feature activation-8.70e-02

Pos logit contributions
kin+0.016
aver+0.015
pan+0.015
 bowl+0.014
 tray+0.014
Neg logit contributions
years-0.014
but-0.013
unct-0.013
to-0.013
 afford-0.013
Token brave
Token ID14802
Feature activation-5.67e-01

Pos logit contributions
years+0.003
What+0.002
 after+0.002
but+0.002
Ho+0.002
Neg logit contributions
pan-0.002
kin-0.002
aver-0.002
 happens-0.002
 teaches-0.002
Token animal
Token ID5044
Feature activation+5.56e-01

Pos logit contributions
 after+0.016
years+0.014
ory+0.014
Ho+0.014
 quite+0.014
Neg logit contributions
pan-0.016
kin-0.016
 happens-0.014
 teaches-0.013
aver-0.013
Token.
Token ID13
Feature activation-1.32e+00

Pos logit contributions
kin+0.014
pan+0.013
aver+0.013
 tray+0.012
 bowl+0.012
Neg logit contributions
years-0.018
but-0.017
What-0.016
to-0.016
 after-0.016
Token His
Token ID2399
Feature activation+2.29e-01

Pos logit contributions
years+0.030
What+0.027
 after+0.027
but+0.027
 him+0.027
Neg logit contributions
kin-0.045
pan-0.044
aver-0.042
 tray-0.039
 bowl-0.039
Token room
Token ID2119
Feature activation+1.38e+00

Pos logit contributions
years+0.008
What+0.007
but+0.007
 after+0.007
 him+0.007
Neg logit contributions
kin-0.005
pan-0.005
aver-0.005
 tray-0.004
 bowl-0.004
Token,
Token ID11
Feature activation+4.38e-01

Pos logit contributions
kin+0.044
pan+0.042
aver+0.040
 tray+0.038
 bowl+0.038
Neg logit contributions
years-0.035
but-0.032
What-0.032
 after-0.031
to-0.031
Token pretending
Token ID23662
Feature activation+3.64e-01

Pos logit contributions
pan+0.013
kin+0.012
 tray+0.012
 bowl+0.012
aver+0.012
Neg logit contributions
years-0.012
to-0.011
 afford-0.011
and-0.011
unct-0.011
Token to
Token ID284
Feature activation+8.26e-01

Pos logit contributions
kin+0.015
aver+0.015
pan+0.015
 bowl+0.014
 tray+0.014
Neg logit contributions
years-0.005
 after-0.005
 him-0.005
but-0.005
and-0.005
Token fight
Token ID1907
Feature activation+1.03e+00

Pos logit contributions
kin+0.025
pan+0.023
aver+0.023
 bowl+0.022
 tray+0.021
Neg logit contributions
years-0.024
but-0.022
to-0.021
and-0.020
 int-0.020
Token and
Token ID290
Feature activation-8.37e-01

Pos logit contributions
kin+0.034
pan+0.032
aver+0.031
 tray+0.029
 bowl+0.029
Neg logit contributions
years-0.026
but-0.023
What-0.023
to-0.022
 after-0.022
Token chase
Token ID15505
Feature activation+1.89e+00

Pos logit contributions
years+0.023
but+0.021
to+0.020
and+0.020
What+0.020
Neg logit contributions
kin-0.025
pan-0.025
aver-0.022
 tray-0.022
 bowl-0.022
Token each
Token ID1123
Feature activation+5.52e-01

Pos logit contributions
kin+0.058
pan+0.055
aver+0.053
 tray+0.051
 bowl+0.050
Neg logit contributions
years-0.050
but-0.047
to-0.044
What-0.044
unct-0.044
Token other
Token ID584
Feature activation-1.56e+00

Pos logit contributions
kin+0.015
pan+0.014
aver+0.013
 tray+0.012
 bowl+0.012
Neg logit contributions
years-0.017
but-0.016
What-0.016
 after-0.015
to-0.015
Token.
Token ID13
Feature activation+2.69e-01

Pos logit contributions
years+0.042
but+0.039
What+0.038
to+0.037
 afford+0.037
Neg logit contributions
kin-0.048
pan-0.046
aver-0.043
 tray-0.042
 bowl-0.041
Token They
Token ID1119
Feature activation+1.44e-01

Pos logit contributions
kin+0.009
pan+0.009
aver+0.009
 tray+0.008
 bowl+0.008
Neg logit contributions
years-0.006
unct-0.005
but-0.005
to-0.005
 him-0.005
Token nice
Token ID3621
Feature activation+7.51e-02

Pos logit contributions
kin+0.011
pan+0.011
aver+0.010
 tray+0.010
 bowl+0.010
Neg logit contributions
years-0.012
but-0.012
What-0.011
 after-0.011
to-0.011
Token.
Token ID13
Feature activation+2.43e-01

Pos logit contributions
kin+0.003
pan+0.003
aver+0.003
 tray+0.002
 bowl+0.002
Neg logit contributions
years-0.002
 after-0.002
but-0.001
 him-0.001
and-0.001
Token He
Token ID679
Feature activation-1.14e+00

Pos logit contributions
kin+0.009
pan+0.008
aver+0.008
 tray+0.008
 bowl+0.008
Neg logit contributions
years-0.005
but-0.005
What-0.005
 after-0.005
to-0.005
Token will
Token ID481
Feature activation+1.39e-01

Pos logit contributions
years+0.038
but+0.036
 after+0.035
 him+0.035
What+0.035
Neg logit contributions
kin-0.027
pan-0.026
aver-0.024
 tray-0.022
 bowl-0.022
Token be
Token ID307
Feature activation+6.22e-01

Pos logit contributions
kin+0.004
pan+0.004
aver+0.004
 teaches+0.004
 bowl+0.004
Neg logit contributions
years-0.004
and-0.004
but-0.004
to-0.004
What-0.003
Token gentle
Token ID10296
Feature activation-8.12e-02

Pos logit contributions
kin+0.020
pan+0.019
aver+0.018
 bowl+0.017
 tray+0.017
Neg logit contributions
years-0.016
but-0.015
What-0.014
to-0.014
and-0.014
Token and
Token ID290
Feature activation+7.03e-02

Pos logit contributions
 after+0.001
and+0.001
but+0.001
years+0.001
 him+0.001
Neg logit contributions
kin-0.003
aver-0.003
pan-0.003
 tray-0.003
 bowl-0.003
Token kind
Token ID1611
Feature activation+2.98e-01

Pos logit contributions
kin+0.002
pan+0.002
aver+0.002
 bowl+0.002
 teaches+0.002
Neg logit contributions
years-0.002
but-0.002
to-0.002
and-0.002
What-0.002
Token.
Token ID13
Feature activation+4.24e-01

Pos logit contributions
kin+0.009
pan+0.009
aver+0.008
 tray+0.008
 bowl+0.008
Neg logit contributions
years-0.008
but-0.007
What-0.007
to-0.007
 afford-0.007
Token And
Token ID843
Feature activation-9.42e-01

Pos logit contributions
kin+0.015
pan+0.015
aver+0.014
 tray+0.013
 bowl+0.013
Neg logit contributions
years-0.009
but-0.008
What-0.008
 after-0.008
to-0.008
Token the
Token ID262
Feature activation-2.70e-01

Pos logit contributions
years+0.026
but+0.024
to+0.024
unct+0.022
and+0.022
Neg logit contributions
kin-0.028
 teaches-0.026
pan-0.025
 happens-0.025
 bowl-0.024
Token would
Token ID561
Feature activation-8.58e-01

Pos logit contributions
kin+0.000
pan+0.000
aver+0.000
 bowl+0.000
 tray+0.000
Neg logit contributions
years-0.000
but-0.000
What-0.000
and-0.000
to-0.000
Token take
Token ID1011
Feature activation+2.49e-01

Pos logit contributions
years+0.022
but+0.020
and+0.020
What+0.019
to+0.019
Neg logit contributions
kin-0.027
pan-0.027
aver-0.025
 tray-0.024
 bowl-0.024
Token Jack
Token ID3619
Feature activation-6.75e-01

Pos logit contributions
kin+0.008
pan+0.008
aver+0.008
 happens+0.008
 tray+0.007
Neg logit contributions
years-0.006
but-0.005
 afford-0.005
What-0.005
what-0.005
Token's
Token ID338
Feature activation-3.42e-01

Pos logit contributions
years+0.018
What+0.017
but+0.016
Ho+0.015
unct+0.015
Neg logit contributions
kin-0.019
pan-0.019
 tray-0.019
 bowl-0.018
aver-0.018
Token carrots
Token ID34397
Feature activation-1.40e+00

Pos logit contributions
years+0.010
but+0.010
 after+0.010
 him+0.010
What+0.010
Neg logit contributions
kin-0.009
pan-0.009
aver-0.008
 tray-0.007
 happens-0.007
Token and
Token ID290
Feature activation-2.78e-02

Pos logit contributions
years+0.028
What+0.025
but+0.024
to+0.024
and+0.023
Neg logit contributions
kin-0.052
pan-0.051
aver-0.049
 bowl-0.047
 tray-0.047
Token never
Token ID1239
Feature activation-5.06e-01

Pos logit contributions
years+0.001
 after+0.001
 him+0.001
but+0.001
What+0.001
Neg logit contributions
kin-0.001
pan-0.001
aver-0.001
 tray-0.001
 bowl-0.001
Token said
Token ID531
Feature activation+4.76e-02

Pos logit contributions
years+0.014
but+0.013
What+0.012
to+0.012
and+0.012
Neg logit contributions
kin-0.015
pan-0.015
aver-0.014
 bowl-0.014
 happens-0.013
Token please
Token ID3387
Feature activation-1.64e-01

Pos logit contributions
kin+0.002
aver+0.002
pan+0.002
 tray+0.002
 bowl+0.001
Neg logit contributions
years-0.001
 him-0.001
 after-0.001
but-0.001
What-0.001
Token or
Token ID393
Feature activation-8.77e-01

Pos logit contributions
years+0.005
What+0.004
but+0.004
unct+0.004
Ho+0.004
Neg logit contributions
kin-0.005
pan-0.005
aver-0.004
 tray-0.004
 bowl-0.004
Token thank
Token ID5875
Feature activation-1.47e+00

Pos logit contributions
years+0.021
but+0.019
What+0.018
 after+0.018
to+0.018
Neg logit contributions
kin-0.029
pan-0.028
aver-0.027
 bowl-0.026
 tray-0.026
Token a
Token ID257
Feature activation-6.07e-01

Pos logit contributions
years+0.021
but+0.020
What+0.019
 afford+0.019
Ho+0.018
Neg logit contributions
kin-0.028
pan-0.027
aver-0.026
 bowl-0.024
 tray-0.024
Token lovely
Token ID14081
Feature activation-4.28e-01

Pos logit contributions
years+0.022
but+0.020
and+0.019
to+0.019
What+0.019
Neg logit contributions
kin-0.014
pan-0.013
 bowl-0.013
 tray-0.013
aver-0.012
Token cake
Token ID12187
Feature activation-1.33e+00

Pos logit contributions
years+0.013
 after+0.013
but+0.013
What+0.013
 him+0.013
Neg logit contributions
kin-0.010
pan-0.010
aver-0.009
 happens-0.008
 teaches-0.008
Token for
Token ID329
Feature activation-5.28e-01

Pos logit contributions
years+0.039
What+0.033
Ho+0.031
unct+0.031
to+0.031
Neg logit contributions
kin-0.044
pan-0.041
 bowl-0.039
 tray-0.038
aver-0.037
Token dessert
Token ID23084
Feature activation+3.39e-02

Pos logit contributions
years+0.012
but+0.011
to+0.011
and+0.011
What+0.011
Neg logit contributions
kin-0.018
pan-0.017
aver-0.016
 bowl-0.016
 tray-0.016
Token.
Token ID13
Feature activation+5.04e-03

Pos logit contributions
kin+0.001
pan+0.001
aver+0.001
 bowl+0.001
 tray+0.001
Neg logit contributions
years-0.001
What-0.001
but-0.001
to-0.001
and-0.001
Token Lily
Token ID20037
Feature activation+1.63e-01

Pos logit contributions
kin+0.000
pan+0.000
aver+0.000
 tray+0.000
 bowl+0.000
Neg logit contributions
years-0.000
but-0.000
What-0.000
 after-0.000
to-0.000
Token was
Token ID373
Feature activation-5.42e-01

Pos logit contributions
kin+0.005
pan+0.005
aver+0.005
 tray+0.004
 bowl+0.004
Neg logit contributions
years-0.004
 after-0.004
but-0.004
 him-0.004
to-0.004
Token so
Token ID523
Feature activation+4.61e-01

Pos logit contributions
years+0.007
but+0.006
 after+0.005
What+0.005
 him+0.005
Neg logit contributions
kin-0.024
pan-0.024
aver-0.023
 tray-0.022
 happens-0.022
Token excited
Token ID6568
Feature activation-2.22e-01

Pos logit contributions
kin+0.014
aver+0.012
 bowl+0.012
pan+0.011
 tray+0.011
Neg logit contributions
years-0.012
What-0.012
but-0.012
Ho-0.012
what-0.012
Token to
Token ID284
Feature activation+1.78e-01

Pos logit contributions
years+0.008
Ho+0.007
nam+0.007
unct+0.007
What+0.007
Neg logit contributions
 happens-0.004
pan-0.004
 bowl-0.004
kin-0.004
 tray-0.004
Token.
Token ID13
Feature activation+4.48e-01

Pos logit contributions
kin+0.014
pan+0.013
aver+0.013
 tray+0.012
 bowl+0.012
Neg logit contributions
years-0.011
but-0.010
What-0.010
 after-0.010
to-0.010
Token The
Token ID383
Feature activation+5.26e-01

Pos logit contributions
kin+0.014
pan+0.014
aver+0.013
 tray+0.012
 bowl+0.012
Neg logit contributions
years-0.011
but-0.011
What-0.010
 after-0.010
to-0.010
Token elephant
Token ID20950
Feature activation+1.55e+00

Pos logit contributions
kin+0.012
pan+0.011
aver+0.010
 tray+0.009
 bowl+0.009
Neg logit contributions
years-0.018
but-0.017
What-0.017
 after-0.017
 him-0.017
Token noticed
Token ID6810
Feature activation+1.06e-01

Pos logit contributions
kin+0.046
pan+0.045
aver+0.042
 tray+0.039
 bowl+0.038
Neg logit contributions
years-0.042
 after-0.040
but-0.039
 him-0.039
What-0.038
Token Lily
Token ID20037
Feature activation+7.56e-01

Pos logit contributions
kin+0.004
pan+0.004
aver+0.004
 tray+0.004
 bowl+0.004
Neg logit contributions
years-0.002
 him-0.002
What-0.002
 after-0.002
but-0.002
Token and
Token ID290
Feature activation+2.48e-01

Pos logit contributions
kin+0.026
pan+0.025
aver+0.024
 tray+0.023
 bowl+0.023
Neg logit contributions
years-0.017
but-0.015
 after-0.015
What-0.015
to-0.015
Token Max
Token ID5436
Feature activation-1.10e+00

Pos logit contributions
kin+0.008
aver+0.007
pan+0.007
 tray+0.006
 teaches+0.006
Neg logit contributions
years-0.007
 him-0.007
 after-0.007
What-0.006
nam-0.006
Token and
Token ID290
Feature activation-1.77e-01

Pos logit contributions
years+0.032
but+0.030
What+0.029
to+0.028
bel+0.028
Neg logit contributions
kin-0.030
pan-0.029
 bowl-0.027
 tray-0.027
aver-0.026
Token their
Token ID511
Feature activation+1.04e+00

Pos logit contributions
years+0.005
 after+0.004
 him+0.004
What+0.004
but+0.004
Neg logit contributions
kin-0.006
pan-0.005
aver-0.005
 tray-0.005
 bowl-0.005
Token mom
Token ID1995
Feature activation-1.26e+00

Pos logit contributions
kin+0.021
pan+0.019
aver+0.019
 bowl+0.018
 tray+0.018
Neg logit contributions
years-0.038
but-0.037
to-0.036
What-0.036
and-0.035
Token.
Token ID13
Feature activation-9.51e-02

Pos logit contributions
years+0.034
but+0.032
What+0.031
to+0.031
and+0.030
Neg logit contributions
kin-0.037
pan-0.036
aver-0.034
 bowl-0.033
 tray-0.032
Token fog
Token ID19558
Feature activation+1.66e-01

Pos logit contributions
kin+0.033
pan+0.032
aver+0.031
 bowl+0.029
 tray+0.029
Neg logit contributions
years-0.035
but-0.033
to-0.032
What-0.032
what-0.032
Tokengy
Token ID1360
Feature activation+1.12e+00

Pos logit contributions
kin+0.005
pan+0.005
aver+0.005
 tray+0.004
 bowl+0.004
Neg logit contributions
years-0.004
but-0.004
What-0.004
to-0.004
and-0.004
Token and
Token ID290
Feature activation+7.45e-01

Pos logit contributions
kin+0.032
pan+0.032
 tray+0.030
aver+0.030
 bowl+0.029
Neg logit contributions
years-0.032
What-0.029
but-0.028
to-0.028
what-0.027
Token Jessica
Token ID17352
Feature activation+3.72e-01

Pos logit contributions
kin+0.027
aver+0.026
pan+0.026
 bowl+0.024
 tray+0.023
Neg logit contributions
years-0.015
 after-0.015
What-0.014
 him-0.014
Why-0.013
Token felt
Token ID2936
Feature activation+9.00e-02

Pos logit contributions
kin+0.013
pan+0.013
aver+0.012
 tray+0.012
 happens+0.011
Neg logit contributions
years-0.008
 after-0.008
but-0.007
 quite-0.007
 him-0.007
Token a
Token ID257
Feature activation-1.97e+00

Pos logit contributions
kin+0.004
pan+0.003
aver+0.003
 tray+0.003
 bowl+0.003
Neg logit contributions
years-0.002
 after-0.001
but-0.001
What-0.001
 him-0.001
Token bit
Token ID1643
Feature activation+4.06e-01

Pos logit contributions
years+0.057
but+0.053
to+0.051
What+0.051
and+0.051
Neg logit contributions
kin-0.056
pan-0.054
aver-0.051
 bowl-0.048
 tray-0.048
Token scared
Token ID12008
Feature activation+3.26e-01

Pos logit contributions
kin+0.012
pan+0.011
aver+0.011
 bowl+0.010
 tray+0.010
Neg logit contributions
years-0.012
but-0.011
What-0.010
to-0.010
and-0.010
Token.
Token ID13
Feature activation-5.79e-01

Pos logit contributions
pan+0.008
kin+0.007
 happens+0.007
aver+0.007
 iv+0.007
Neg logit contributions
years-0.011
unct-0.010
to-0.010
bel-0.010
Ho-0.010
Token Suddenly
Token ID24975
Feature activation-4.62e-01

Pos logit contributions
years+0.014
unct+0.013
Ho+0.013
ge+0.012
beh+0.012
Neg logit contributions
 teaches-0.017
case-0.017
 happens-0.017
pan-0.016
kin-0.016
Token,
Token ID11
Feature activation-4.53e-01

Pos logit contributions
 after+0.010
 him+0.009
years+0.008
 during+0.008
 her+0.008
Neg logit contributions
aver-0.017
kin-0.017
pan-0.017
 bowl-0.015
 tray-0.015
Token adventure
Token ID8855
Feature activation-3.33e-01

Pos logit contributions
 after+0.021
years+0.021
What+0.020
but+0.020
 afford+0.020
Neg logit contributions
kin-0.020
pan-0.020
aver-0.019
 happens-0.017
 teaches-0.016
Token!
Token ID0
Feature activation-7.06e-01

Pos logit contributions
years+0.008
but+0.007
 after+0.007
What+0.007
 him+0.007
Neg logit contributions
kin-0.011
pan-0.011
aver-0.010
 tray-0.010
 bowl-0.010
Tokenâ
Token ID22940
Feature activation-1.38e+00

Pos logit contributions
years+0.016
but+0.015
What+0.014
 after+0.014
to+0.014
Neg logit contributions
kin-0.024
pan-0.023
aver-0.022
 tray-0.021
 bowl-0.021
Token€
Token ID26391
Feature activation+1.77e-01

Pos logit contributions
years+0.036
but+0.033
What+0.032
 after+0.032
to+0.032
Neg logit contributions
kin-0.043
pan-0.041
aver-0.040
 tray-0.037
 bowl-0.037
Token<|endoftext|>
Token ID50256
Feature activation-3.21e-01

Pos logit contributions
kin+0.007
pan+0.006
aver+0.006
 tray+0.006
 snake+0.006
Neg logit contributions
but-0.004
to-0.003
years-0.003
 after-0.003
What-0.003
TokenOnce
Token ID7454
Feature activation-1.87e+00

Pos logit contributions
What+0.005
 him+0.005
to+0.005
years+0.005
what+0.005
Neg logit contributions
kin-0.013
pan-0.013
aver-0.013
 tray-0.012
 teaches-0.011
Token there
Token ID612
Feature activation+2.81e-02

Pos logit contributions
years+0.009
 him+0.007
 after+0.006
but+0.006
to+0.005
Neg logit contributions
kin-0.096
pan-0.095
aver-0.092
 tray-0.089
 teaches-0.088
Token was
Token ID373
Feature activation+4.57e-01

Pos logit contributions
kin+0.001
pan+0.001
aver+0.001
 tray+0.001
 bowl+0.001
Neg logit contributions
years-0.001
 him-0.001
but-0.001
What-0.001
 after-0.001
Token a
Token ID257
Feature activation+1.34e-01

Pos logit contributions
kin+0.014
aver+0.013
pan+0.013
 bowl+0.012
 tray+0.012
Neg logit contributions
years-0.012
but-0.011
What-0.011
 afford-0.011
to-0.011
Token little
Token ID1310
Feature activation-7.78e-01

Pos logit contributions
pan+0.004
kin+0.004
aver+0.003
 happens+0.003
 teaches+0.003
Neg logit contributions
years-0.004
What-0.004
 after-0.004
but-0.004
Ho-0.004
Token boy
Token ID2933
Feature activation+5.13e-01

Pos logit contributions
years+0.021
 after+0.019
What+0.019
but+0.019
 him+0.019
Neg logit contributions
kin-0.023
pan-0.023
aver-0.021
 teaches-0.020
 happens-0.020
Token\n
Token ID198
Feature activation+7.32e-01

Pos logit contributions
kin+0.031
pan+0.031
aver+0.029
 tray+0.028
 happens+0.028
Neg logit contributions
to-0.013
but-0.013
and-0.013
years-0.012
 him-0.012
TokenHis
Token ID6653
Feature activation-3.01e-02

Pos logit contributions
kin+0.025
pan+0.025
aver+0.024
 tray+0.022
 happens+0.022
Neg logit contributions
years-0.016
but-0.015
What-0.015
 after-0.015
to-0.015
Token mom
Token ID1995
Feature activation-2.23e-01

Pos logit contributions
years+0.001
 after+0.001
What+0.001
 him+0.001
 afford+0.001
Neg logit contributions
pan-0.001
kin-0.001
aver-0.001
oon-0.001
 iv-0.001
Token smiled
Token ID13541
Feature activation-1.56e+00

Pos logit contributions
years+0.006
 him+0.005
What+0.005
but+0.005
 after+0.005
Neg logit contributions
kin-0.007
pan-0.007
aver-0.006
 tray-0.006
 happens-0.006
Token.
Token ID13
Feature activation+5.59e-01

Pos logit contributions
years+0.043
but+0.038
and+0.038
to+0.037
unct+0.037
Neg logit contributions
kin-0.047
pan-0.044
aver-0.042
 bowl-0.041
 tray-0.039
Token "
Token ID366
Feature activation-1.82e+00

Pos logit contributions
kin+0.016
pan+0.015
aver+0.015
 tray+0.013
 happens+0.013
Neg logit contributions
years-0.016
but-0.016
What-0.015
 after-0.015
to-0.015
TokenThat
Token ID2504
Feature activation-5.63e-01

Pos logit contributions
unct+0.050
i+0.043
 Him+0.043
 int+0.041
 curiously+0.041
Neg logit contributions
aver-0.052
kin-0.051
 teaches-0.049
pan-0.046
oon-0.045
Token's
Token ID338
Feature activation-2.53e-01

Pos logit contributions
 after+0.010
 him+0.010
to+0.009
 really+0.008
What+0.008
Neg logit contributions
kin-0.023
pan-0.022
aver-0.021
 tray-0.018
 iv-0.017
Token lovely
Token ID14081
Feature activation+9.82e-01

Pos logit contributions
years+0.007
and+0.007
but+0.007
to+0.006
unct+0.006
Neg logit contributions
kin-0.008
aver-0.007
pan-0.007
 teaches-0.007
 tray-0.006
Token,
Token ID11
Feature activation+1.15e+00

Pos logit contributions
pan+0.036
kin+0.036
aver+0.035
 tray+0.030
 teaches+0.030
Neg logit contributions
 after-0.020
 him-0.018
 during-0.017
 like-0.016
and-0.016
Token Jack
Token ID3619
Feature activation+2.64e-01

Pos logit contributions
pan+0.042
kin+0.042
aver+0.041
 tray+0.036
 hawk+0.036
Neg logit contributions
 after-0.024
 him-0.023
What-0.021
years-0.021
but-0.021
Token on
Token ID319
Feature activation+7.35e-01

Pos logit contributions
years+0.003
but+0.003
What+0.003
 afford+0.003
 int+0.003
Neg logit contributions
kin-0.007
pan-0.007
aver-0.007
 bowl-0.006
 tray-0.006
Token a
Token ID257
Feature activation-5.28e-01

Pos logit contributions
kin+0.023
pan+0.022
aver+0.021
 tray+0.020
 bowl+0.020
Neg logit contributions
years-0.019
but-0.018
and-0.017
to-0.017
What-0.017
Token farm
Token ID5318
Feature activation-3.61e-01

Pos logit contributions
years+0.018
but+0.016
What+0.016
to+0.016
and+0.016
Neg logit contributions
kin-0.013
pan-0.012
aver-0.011
 tray-0.011
 bowl-0.011
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
years+0.009
but+0.008
What+0.008
to+0.008
 after+0.007
Neg logit contributions
kin-0.012
pan-0.011
aver-0.011
 tray-0.011
 bowl-0.010
Token Her
Token ID2332
Feature activation-2.23e-01

Pos logit contributions
years+0.019
but+0.017
unct+0.015
What+0.015
 afford+0.015
Neg logit contributions
kin-0.045
pan-0.044
aver-0.043
 tray-0.041
 bowl-0.041
Token parents
Token ID3397
Feature activation-1.80e+00

Pos logit contributions
 after+0.004
bel+0.004
 afford+0.004
iring+0.004
 int+0.004
Neg logit contributions
pan-0.008
kin-0.008
aver-0.007
 happens-0.007
oon-0.007
Token had
Token ID550
Feature activation-2.88e-01

Pos logit contributions
years+0.042
but+0.039
What+0.038
 after+0.038
 him+0.037
Neg logit contributions
kin-0.060
pan-0.058
aver-0.056
 tray-0.052
 bowl-0.052
Token lots
Token ID6041
Feature activation+6.73e-01

Pos logit contributions
years+0.007
but+0.006
 after+0.006
What+0.006
 him+0.006
Neg logit contributions
kin-0.010
pan-0.010
aver-0.009
 tray-0.009
 bowl-0.009
Token of
Token ID286
Feature activation+6.23e-02

Pos logit contributions
kin+0.020
pan+0.019
aver+0.018
 tray+0.017
 bowl+0.017
Neg logit contributions
years-0.018
but-0.017
 after-0.017
and-0.017
 him-0.017
Token animals
Token ID4695
Feature activation-8.57e-01

Pos logit contributions
kin+0.002
pan+0.002
aver+0.001
 bowl+0.001
 tray+0.001
Neg logit contributions
years-0.002
 after-0.002
 him-0.002
but-0.002
What-0.002
Token,
Token ID11
Feature activation+5.84e-01

Pos logit contributions
years+0.020
unct+0.018
What+0.018
but+0.018
Ho+0.017
Neg logit contributions
kin-0.029
pan-0.028
aver-0.027
 bowl-0.026
 tray-0.026
Token day
Token ID1110
Feature activation-1.26e+00

Pos logit contributions
years+0.023
but+0.022
What+0.021
to+0.021
and+0.021
Neg logit contributions
kin-0.024
pan-0.023
aver-0.022
 bowl-0.021
 tray-0.021
Token on
Token ID319
Feature activation+1.15e-01

Pos logit contributions
years+0.026
but+0.024
 after+0.024
 him+0.023
What+0.023
Neg logit contributions
kin-0.046
pan-0.044
aver-0.043
 tray-0.041
 bowl-0.040
Token,
Token ID11
Feature activation-1.60e-01

Pos logit contributions
kin+0.004
pan+0.003
aver+0.003
 tray+0.003
 bowl+0.003
Neg logit contributions
years-0.003
 after-0.003
 him-0.003
but-0.003
What-0.003
Token Tim
Token ID5045
Feature activation+1.73e-01

Pos logit contributions
years+0.005
unct+0.005
but+0.005
to+0.005
 afford+0.005
Neg logit contributions
kin-0.004
pan-0.004
aver-0.004
 bowl-0.003
 tray-0.003
Tokenmy
Token ID1820
Feature activation-5.13e-01

Pos logit contributions
kin+0.006
aver+0.006
pan+0.006
 bowl+0.006
 teaches+0.005
Neg logit contributions
years-0.004
but-0.003
What-0.003
 afford-0.003
to-0.003
Token learned
Token ID4499
Feature activation-1.93e+00

Pos logit contributions
years+0.012
 after+0.011
but+0.011
 him+0.011
to+0.011
Neg logit contributions
kin-0.017
pan-0.017
aver-0.016
 tray-0.016
 bowl-0.015
Token to
Token ID284
Feature activation-3.61e-01

Pos logit contributions
years+0.057
unct+0.048
Ho+0.047
What+0.046
what+0.045
Neg logit contributions
pan-0.051
 happens-0.050
kin-0.049
aver-0.049
 teaches-0.049
Token take
Token ID1011
Feature activation-1.41e-01

Pos logit contributions
years+0.010
but+0.009
and+0.008
to+0.008
What+0.008
Neg logit contributions
kin-0.012
aver-0.011
pan-0.011
 teaches-0.010
 bowl-0.010
Token better
Token ID1365
Feature activation-2.15e+00

Pos logit contributions
years+0.004
but+0.004
 afford+0.003
itating+0.003
unct+0.003
Neg logit contributions
kin-0.004
 happens-0.004
pan-0.004
aver-0.004
 teaches-0.004
Token care
Token ID1337
Feature activation-6.44e-01

Pos logit contributions
years+0.058
but+0.055
itating+0.055
 int+0.051
What+0.051
Neg logit contributions
kin-0.059
 happens-0.058
aver-0.057
pan-0.057
 broom-0.054
Token of
Token ID286
Feature activation-1.19e+00

Pos logit contributions
years+0.021
What+0.019
but+0.019
unct+0.019
to+0.018
Neg logit contributions
kin-0.016
pan-0.015
aver-0.015
 bowl-0.013
 teaches-0.013