TOP ACTIVATIONS
MAX = 8.865

find out what all the comm otion was about . When
them away from their natural habitats . We should be kind
came to see what the comm otion was about . She
ice cream , and bought souven irs . They
them away from their natural habitats . We should be friendly
can keep it as a souven ir of your adventure .
could keep it as a souven ir . S
" I 'm stuck !" Ans wered the little boy .
to keep them as a souven ir of Frost y .
take it home as a souven ir of their sm elly
in her hand , what shapes were in the sky ,
didn 't understand what the characters were saying . " Mom
take it home as a souven ir from the park .
The motor hum med and wh ir red and it churn
keep the ring as a souven ir . They thanked the
gold coin as a special souven ir of Sarah 's adventure
the gray shell as a souven ir of his adventure .
felt jealous , because the ch ame leon was very colorful
keep the spring as a souven ir of his adventure .
faster than the clouds and wh izz ed past the trees

MIN ACTIVATIONS
MIN = -7.111

bitter ! He quickly spat it out . The
She was able to scoop it up with her net and
her tongue . She spat it out and cried .
at all . He spat it out and made a face
mouth . You should spit it out . The teacher will
unch up . She spat it out and said , "
face twist . She spat it out and felt a pain
mouth hurt . She spit it out and yelled , "
like it . He spat it out and threw it away
He gob bled it up in one bite and
like it . She spit it out and said , "
them cough . They spit it out and make faces .
tasted awful . He spit it out and said , "
her mouth . She spit it out and saw that it
y ucky . She spit it out and rubbed her tongue
mouth , but he spat it out . He tasted something
like food . He spat it out and the paper floated
very sour . They spit it out . Mom
get it . She snatched it from underneath the bench and
hurt . So he spit it out and sw am away

INTERVAL 4.871 - 8.865
CONTAINS 0.099%

a long nose and big ears . " What
from the trap . B amb i was very happy and
. Billy said , " Tr ains are boring . I
just need to appreciate what we already have . <|endoftext|> Once
and strong . All the animals around the tree were happy

INTERVAL 0.877 - 4.871
CONTAINS 27.380%

field to shoot fair arrows at the moon . <|endoftext|> Once
le mons , please ." The woman gave
Jane went to the park to practice with the clay .
The little girl had the most wonderful time at the beach
them back . " You are my sweet cream pies ."

INTERVAL -3.117 - 0.877
CONTAINS 69.403%

was too strong , and it made them sh iver .
helmet and a vest . " Hello , sir
They say they will have a happy birthday for Mom .
smiles and says yes . He gives them a basket and
went inside . It was sm elly , but there was

INTERVAL -7.111 - -3.117
CONTAINS 3.118%

Tim asked . " Not now , sweet ie
, so he decide to go to the park . John
happy and thrilled she had managed to hide the m op
he put the silver coin back on the ground and vowed
. She loved to throw it up in the air and
Token find
Token ID1064
Feature activation-2.22e+00

Pos logit contributions
 Owl+0.032
Da+0.031
lim+0.031
 aback+0.030
 tighter+0.030
Neg logit contributions
inton-0.020
kin-0.020
 ago-0.018
orses-0.017
 toes-0.017
Token out
Token ID503
Feature activation-4.10e-01

Pos logit contributions
 Owl+0.041
lim+0.040
 tighter+0.036
xc+0.035
Da+0.035
Neg logit contributions
kin-0.049
inton-0.048
 toes-0.046
orses-0.045
orse-0.043
Token what
Token ID644
Feature activation-8.56e-02

Pos logit contributions
 Owl+0.007
 again+0.007
 tighter+0.006
 owl+0.006
 berries+0.006
Neg logit contributions
kin-0.011
inton-0.011
 ago-0.010
orses-0.010
orse-0.009
Token all
Token ID477
Feature activation+2.78e+00

Pos logit contributions
 Owl+0.002
 again+0.002
 owl+0.002
 berries+0.002
Da+0.002
Neg logit contributions
inton-0.002
kin-0.002
 ago-0.002
orses-0.002
ippers-0.002
Token the
Token ID262
Feature activation+3.04e+00

Pos logit contributions
inton+0.068
kin+0.068
 ago+0.062
ippers+0.061
orses+0.061
Neg logit contributions
 Owl-0.052
 again-0.049
 berries-0.049
 owl-0.048
 tighter-0.046
Token comm
Token ID725
Feature activation+8.86e+00

Pos logit contributions
inton+0.063
kin+0.063
 ago+0.057
orses+0.056
 toes+0.054
Neg logit contributions
 Owl-0.066
 aback-0.061
 tighter-0.061
 again-0.061
lim-0.060
Tokenotion
Token ID9650
Feature activation+2.44e+00

Pos logit contributions
 unison+0.163
inton+0.157
 laptop+0.154
kin+0.152
 toes+0.150
Neg logit contributions
 again-0.186
 Owl-0.174
 Tweet-0.165
 av-0.164
 berries-0.163
Token was
Token ID373
Feature activation+1.56e+00

Pos logit contributions
inton+0.058
kin+0.056
orses+0.052
ippers+0.052
 ago+0.052
Neg logit contributions
 again-0.047
 Owl-0.045
 berries-0.043
 owl-0.042
 ac-0.041
Token about
Token ID546
Feature activation+1.67e+00

Pos logit contributions
kin+0.037
inton+0.036
orses+0.032
ippers+0.032
 toes+0.032
Neg logit contributions
 Owl-0.031
 again-0.030
 owl-0.029
 berries-0.028
 tighter-0.027
Token.
Token ID13
Feature activation+2.36e-02

Pos logit contributions
kin+0.040
inton+0.039
orses+0.035
orse+0.034
ippers+0.034
Neg logit contributions
 Owl-0.032
 again-0.032
 owl-0.029
 berries-0.029
 tighter-0.029
Token When
Token ID1649
Feature activation+1.77e-01

Pos logit contributions
inton+0.000
kin+0.000
 ago+0.000
orses+0.000
 toes+0.000
Neg logit contributions
 Owl-0.001
 tighter-0.001
 again-0.001
 owl-0.001
Da-0.001
Token them
Token ID606
Feature activation-3.74e+00

Pos logit contributions
 Owl+0.021
Da+0.019
lim+0.019
 tighter+0.019
 owl+0.019
Neg logit contributions
inton-0.023
kin-0.023
 ago-0.021
orses-0.021
 toes-0.020
Token away
Token ID1497
Feature activation-1.32e+00

Pos logit contributions
 Owl+0.045
Da+0.038
 owl+0.037
lim+0.037
 berries+0.037
Neg logit contributions
inton-0.114
kin-0.113
 ago-0.106
orses-0.104
 toes-0.101
Token from
Token ID422
Feature activation+6.11e-01

Pos logit contributions
 Owl+0.026
 again+0.024
 tighter+0.024
Da+0.024
 owl+0.024
Neg logit contributions
kin-0.030
inton-0.030
 ago-0.027
orses-0.026
 toes-0.025
Token their
Token ID511
Feature activation+8.06e-01

Pos logit contributions
inton+0.014
kin+0.014
 ago+0.013
orses+0.013
 toes+0.012
Neg logit contributions
 Owl-0.012
 again-0.011
 tighter-0.011
 owl-0.011
 berries-0.010
Token natural
Token ID3288
Feature activation+1.41e+00

Pos logit contributions
inton+0.019
kin+0.019
 ago+0.017
orses+0.017
 toes+0.016
Neg logit contributions
 Owl-0.015
 again-0.014
 tighter-0.014
 owl-0.014
Da-0.014
Token habitats
Token ID35308
Feature activation+8.03e+00

Pos logit contributions
kin+0.033
inton+0.033
orses+0.030
 ago+0.030
orse+0.029
Neg logit contributions
 Owl-0.027
 again-0.025
 owl-0.024
 berries-0.024
 tighter-0.024
Token.
Token ID13
Feature activation-3.89e-01

Pos logit contributions
inton+0.210
kin+0.197
ippers+0.184
 hardly+0.180
orses+0.180
Neg logit contributions
 again-0.152
 Owl-0.135
 owl-0.125
 tighter-0.118
 berries-0.117
Token We
Token ID775
Feature activation+2.74e+00

Pos logit contributions
 Owl+0.009
 again+0.007
 Tweet+0.007
 aback+0.007
lim+0.007
Neg logit contributions
kin-0.009
inton-0.009
 toes-0.008
orses-0.007
ippers-0.007
Token should
Token ID815
Feature activation+4.55e-01

Pos logit contributions
inton+0.069
kin+0.069
 ago+0.063
 toes+0.063
orses+0.063
Neg logit contributions
 Owl-0.050
 owl-0.045
 again-0.045
 aback-0.043
 berries-0.043
Token be
Token ID307
Feature activation-4.94e-01

Pos logit contributions
kin+0.012
inton+0.011
 toes+0.011
ippers+0.011
orses+0.011
Neg logit contributions
 Owl-0.008
 owl-0.008
 again-0.007
 ac-0.007
 Tweet-0.007
Token kind
Token ID1611
Feature activation-1.55e+00

Pos logit contributions
 Owl+0.011
 berries+0.010
lim+0.010
Da+0.010
 again+0.010
Neg logit contributions
inton-0.010
kin-0.010
orses-0.009
 ago-0.009
orse-0.008
Token came
Token ID1625
Feature activation-3.27e+00

Pos logit contributions
 Owl+0.036
 again+0.034
 tighter+0.033
 owl+0.033
 aback+0.032
Neg logit contributions
kin-0.037
inton-0.036
 ago-0.032
orses-0.032
ippers-0.031
Token to
Token ID284
Feature activation-1.56e+00

Pos logit contributions
 Owl+0.046
 berries+0.042
Da+0.040
lim+0.040
 owl+0.039
Neg logit contributions
inton-0.092
kin-0.087
 ago-0.085
 toes-0.083
orses-0.082
Token see
Token ID766
Feature activation-1.07e+00

Pos logit contributions
 Owl+0.038
lim+0.036
 berries+0.036
Da+0.036
 tighter+0.036
Neg logit contributions
inton-0.026
kin-0.025
 ago-0.025
orses-0.022
 toes-0.022
Token what
Token ID644
Feature activation-7.49e-01

Pos logit contributions
 Owl+0.019
 again+0.018
 tighter+0.017
 owl+0.017
 berries+0.017
Neg logit contributions
kin-0.027
inton-0.027
orses-0.025
 ago-0.024
orse-0.024
Token the
Token ID262
Feature activation+1.69e+00

Pos logit contributions
 Owl+0.014
 again+0.013
 owl+0.013
 berries+0.013
Da+0.013
Neg logit contributions
inton-0.018
kin-0.018
 ago-0.016
orses-0.016
ippers-0.015
Token comm
Token ID725
Feature activation+7.94e+00

Pos logit contributions
kin+0.037
inton+0.037
 ago+0.036
 toes+0.032
orses+0.031
Neg logit contributions
 Owl-0.033
xc-0.033
 again-0.032
lim-0.032
 tighter-0.032
Tokenotion
Token ID9650
Feature activation+1.85e+00

Pos logit contributions
orses+0.155
 toes+0.150
inton+0.150
 unison+0.148
anas+0.146
Neg logit contributions
 again-0.168
 aback-0.143
 Owl-0.138
 further-0.137
lim-0.133
Token was
Token ID373
Feature activation+1.16e+00

Pos logit contributions
inton+0.047
kin+0.045
orses+0.043
ippers+0.042
 ago+0.042
Neg logit contributions
 again-0.033
 Owl-0.032
 berries-0.030
 owl-0.029
 ac-0.029
Token about
Token ID546
Feature activation+1.31e+00

Pos logit contributions
kin+0.027
inton+0.026
orses+0.024
orse+0.023
ippers+0.023
Neg logit contributions
 Owl-0.023
 again-0.022
 owl-0.022
 berries-0.021
 aback-0.021
Token.
Token ID13
Feature activation-2.86e-01

Pos logit contributions
kin+0.031
inton+0.030
orses+0.027
orse+0.026
 ago+0.026
Neg logit contributions
 Owl-0.026
 again-0.025
 owl-0.023
 tighter-0.023
 berries-0.023
Token She
Token ID1375
Feature activation-3.28e-01

Pos logit contributions
 again+0.006
 Owl+0.005
 aback+0.005
Da+0.005
 tighter+0.005
Neg logit contributions
kin-0.007
inton-0.007
orses-0.006
orse-0.006
ilot-0.006
Token ice
Token ID4771
Feature activation-4.86e-01

Pos logit contributions
 Owl+0.003
 aback+0.003
Da+0.003
lim+0.003
 owl+0.003
Neg logit contributions
kin-0.004
inton-0.004
 ago-0.003
 toes-0.003
orses-0.003
Token cream
Token ID8566
Feature activation-5.74e-01

Pos logit contributions
 Owl+0.009
Da+0.008
 aback+0.008
 owl+0.008
lim+0.008
Neg logit contributions
kin-0.013
inton-0.012
orses-0.011
 toes-0.011
 ago-0.011
Token,
Token ID11
Feature activation+1.53e+00

Pos logit contributions
 Owl+0.011
Da+0.010
 aback+0.010
lim+0.010
 owl+0.010
Neg logit contributions
kin-0.014
inton-0.014
 ago-0.012
orses-0.012
 toes-0.012
Token and
Token ID290
Feature activation+9.99e-01

Pos logit contributions
inton+0.043
kin+0.042
 ago+0.039
orses+0.038
ippers+0.037
Neg logit contributions
 Owl-0.023
 again-0.021
 berries-0.021
 tighter-0.021
 owl-0.020
Token bought
Token ID5839
Feature activation-1.78e+00

Pos logit contributions
inton+0.024
kin+0.024
 ago+0.021
orses+0.021
ippers+0.021
Neg logit contributions
 Owl-0.018
 again-0.018
 berries-0.017
 tighter-0.017
 owl-0.017
Token souven
Token ID45043
Feature activation+7.88e+00

Pos logit contributions
 Owl+0.034
Da+0.031
lim+0.030
 aback+0.030
 owl+0.030
Neg logit contributions
inton-0.042
kin-0.041
orses-0.038
 ago-0.038
 toes-0.037
Tokenirs
Token ID17062
Feature activation+1.77e+00

Pos logit contributions
inton+0.159
kin+0.153
 ago+0.130
 toes+0.130
orses+0.129
Neg logit contributions
lim-0.169
 again-0.164
 av-0.155
 Owl-0.153
 berries-0.152
Token.
Token ID13
Feature activation-3.63e-02

Pos logit contributions
inton+0.043
kin+0.042
orses+0.038
orse+0.037
 ago+0.037
Neg logit contributions
 again-0.033
 Owl-0.033
 berries-0.030
 owl-0.030
 tighter-0.029
Token
Token ID198
Feature activation-3.44e-01

Pos logit contributions
 Owl+0.001
 Tweet+0.001
Da+0.001
 Gig+0.001
 Twe+0.001
Neg logit contributions
kin-0.001
inton-0.001
ippers-0.001
 toes-0.001
inations-0.001
Token
Token ID198
Feature activation-1.86e+00

Pos logit contributions
Da+0.006
 Owl+0.006
 Tweet+0.005
 again+0.005
 owl+0.005
Neg logit contributions
kin-0.009
inton-0.009
 toes-0.009
ippers-0.008
 ago-0.008
TokenThey
Token ID2990
Feature activation+1.93e+00

Pos logit contributions
Da+0.033
 Owl+0.028
lim+0.027
Bet+0.026
 tighter+0.025
Neg logit contributions
kin-0.049
inton-0.048
 toes-0.048
 ago-0.045
ippers-0.045
Token them
Token ID606
Feature activation-3.77e+00

Pos logit contributions
 Owl+0.020
Da+0.018
lim+0.018
 owl+0.018
 tighter+0.018
Neg logit contributions
inton-0.022
kin-0.021
 ago-0.020
 toes-0.019
orses-0.019
Token away
Token ID1497
Feature activation-1.56e+00

Pos logit contributions
 Owl+0.046
 again+0.040
 tighter+0.039
 owl+0.038
 berries+0.038
Neg logit contributions
inton-0.115
kin-0.115
 ago-0.106
orses-0.105
 toes-0.102
Token from
Token ID422
Feature activation+4.85e-01

Pos logit contributions
 Owl+0.025
 again+0.024
 tighter+0.023
 owl+0.022
 berries+0.022
Neg logit contributions
inton-0.042
kin-0.041
orses-0.038
 ago-0.037
orse-0.036
Token their
Token ID511
Feature activation+6.64e-01

Pos logit contributions
inton+0.012
kin+0.012
 ago+0.010
orses+0.010
 toes+0.010
Neg logit contributions
 Owl-0.009
 again-0.008
 tighter-0.008
 owl-0.008
 berries-0.008
Token natural
Token ID3288
Feature activation+1.18e+00

Pos logit contributions
kin+0.017
inton+0.016
orses+0.015
 ago+0.015
orse+0.014
Neg logit contributions
 Owl-0.012
 again-0.011
 tighter-0.011
 owl-0.011
 berries-0.011
Token habitats
Token ID35308
Feature activation+7.63e+00

Pos logit contributions
kin+0.030
inton+0.029
orses+0.026
orse+0.026
 ago+0.025
Neg logit contributions
 Owl-0.021
 again-0.020
 berries-0.019
 owl-0.019
lim-0.019
Token.
Token ID13
Feature activation-3.95e-01

Pos logit contributions
inton+0.202
kin+0.192
orses+0.180
ippers+0.178
orse+0.177
Neg logit contributions
 again-0.142
 Owl-0.121
 owl-0.110
 tighter-0.106
 ac-0.105
Token We
Token ID775
Feature activation+2.67e+00

Pos logit contributions
 Owl+0.009
 again+0.009
 owl+0.009
Da+0.009
 tighter+0.009
Neg logit contributions
kin-0.008
inton-0.008
 ago-0.007
orses-0.007
 toes-0.006
Token should
Token ID815
Feature activation-1.70e-02

Pos logit contributions
inton+0.064
kin+0.064
orses+0.057
 ago+0.057
 toes+0.056
Neg logit contributions
 Owl-0.051
 again-0.047
 owl-0.046
 aback-0.046
 tighter-0.045
Token be
Token ID307
Feature activation-5.26e-01

Pos logit contributions
 Owl+0.000
 owl+0.000
 again+0.000
lim+0.000
 ac+0.000
Neg logit contributions
kin-0.000
inton-0.000
orses-0.000
 toes-0.000
ippers-0.000
Token friendly
Token ID8030
Feature activation-2.23e-01

Pos logit contributions
 Owl+0.012
 berries+0.011
lim+0.011
Da+0.011
 again+0.011
Neg logit contributions
inton-0.010
kin-0.010
 ago-0.009
orses-0.009
 wear-0.008
Token can
Token ID460
Feature activation+5.40e-01

Pos logit contributions
inton+0.051
kin+0.051
orses+0.045
 ago+0.045
 toes+0.044
Neg logit contributions
 Owl-0.058
 owl-0.053
 again-0.053
 aback-0.052
 berries-0.052
Token keep
Token ID1394
Feature activation+4.07e-01

Pos logit contributions
kin+0.010
inton+0.010
 ago+0.008
orses+0.008
 toes+0.008
Neg logit contributions
 Owl-0.014
 again-0.013
 owl-0.013
 berries-0.012
lim-0.012
Token it
Token ID340
Feature activation-5.07e+00

Pos logit contributions
inton+0.009
kin+0.009
 ago+0.008
orses+0.008
orse+0.008
Neg logit contributions
 Owl-0.008
Da-0.008
 tighter-0.008
lim-0.007
 aback-0.007
Token as
Token ID355
Feature activation+4.87e-01

Pos logit contributions
 Owl+0.097
Da+0.088
lim+0.087
 again+0.087
 aback+0.087
Neg logit contributions
kin-0.120
inton-0.119
 ago-0.108
orses-0.106
 toes-0.103
Token a
Token ID257
Feature activation+3.20e-01

Pos logit contributions
inton+0.014
kin+0.013
 unison+0.012
orses+0.012
ilot+0.012
Neg logit contributions
 Owl-0.006
 berries-0.006
 again-0.006
 owl-0.006
 Tweet-0.006
Token souven
Token ID45043
Feature activation+7.62e+00

Pos logit contributions
inton+0.007
kin+0.007
 ago+0.006
orses+0.006
 toes+0.006
Neg logit contributions
 Owl-0.006
 again-0.006
 tighter-0.006
 owl-0.006
 berries-0.006
Tokenir
Token ID343
Feature activation+5.56e-01

Pos logit contributions
inton+0.209
kin+0.196
orses+0.185
 toes+0.185
 hardly+0.177
Neg logit contributions
lim-0.108
 again-0.105
 Owl-0.093
xc-0.092
st-0.086
Token of
Token ID286
Feature activation+3.60e+00

Pos logit contributions
inton+0.014
kin+0.014
 toes+0.012
 ago+0.012
orses+0.012
Neg logit contributions
 Owl-0.010
 again-0.010
 owl-0.010
 berries-0.009
lim-0.009
Token your
Token ID534
Feature activation+1.15e+00

Pos logit contributions
inton+0.100
kin+0.097
orses+0.087
ippers+0.086
ilot+0.084
Neg logit contributions
 Owl-0.056
 again-0.054
 owl-0.051
 berries-0.050
 tighter-0.048
Token adventure
Token ID8855
Feature activation+3.65e-01

Pos logit contributions
inton+0.024
kin+0.023
 hardly+0.021
orses+0.020
ana+0.020
Neg logit contributions
 again-0.025
 Owl-0.024
 owl-0.023
 berries-0.023
 ac-0.023
Token.
Token ID13
Feature activation+8.76e-02

Pos logit contributions
inton+0.008
kin+0.008
 ago+0.007
orses+0.007
 toes+0.007
Neg logit contributions
 Owl-0.008
 again-0.007
 tighter-0.007
 owl-0.007
 berries-0.007
Token could
Token ID714
Feature activation+1.05e+00

Pos logit contributions
 Owl+0.006
 again+0.006
 owl+0.005
 tighter+0.005
 berries+0.005
Neg logit contributions
kin-0.005
inton-0.005
ippers-0.004
orses-0.004
 ago-0.004
Token keep
Token ID1394
Feature activation+8.06e-02

Pos logit contributions
kin+0.020
inton+0.020
 ago+0.017
orses+0.017
 toes+0.017
Neg logit contributions
 Owl-0.026
 again-0.025
 owl-0.024
 ac-0.024
 Tweet-0.024
Token it
Token ID340
Feature activation-5.00e+00

Pos logit contributions
inton+0.002
kin+0.002
 ago+0.002
orses+0.002
 toes+0.002
Neg logit contributions
 Owl-0.002
Da-0.001
lim-0.001
 tighter-0.001
 again-0.001
Token as
Token ID355
Feature activation+4.62e-03

Pos logit contributions
 Owl+0.091
Da+0.082
lim+0.082
 aback+0.081
 again+0.081
Neg logit contributions
kin-0.122
inton-0.121
 ago-0.110
orses-0.108
 toes-0.105
Token a
Token ID257
Feature activation+1.50e-02

Pos logit contributions
inton+0.000
kin+0.000
orses+0.000
 ago+0.000
orse+0.000
Neg logit contributions
 berries-0.000
 Owl-0.000
 again-0.000
 av-0.000
 Tweet-0.000
Token souven
Token ID45043
Feature activation+7.61e+00

Pos logit contributions
inton+0.000
kin+0.000
orses+0.000
 ago+0.000
 toes+0.000
Neg logit contributions
 Owl-0.000
 again-0.000
 tighter-0.000
 owl-0.000
 berries-0.000
Tokenir
Token ID343
Feature activation+4.37e-01

Pos logit contributions
inton+0.209
kin+0.195
orses+0.190
 toes+0.188
 hardly+0.182
Neg logit contributions
 again-0.103
lim-0.102
xc-0.090
st-0.083
po-0.080
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
kin+0.011
inton+0.011
 toes+0.009
ippers+0.009
 hardly+0.009
Neg logit contributions
 Owl-0.008
 again-0.008
 owl-0.008
 ac-0.007
 berries-0.007
Token
Token ID198
Feature activation-2.37e-01

Pos logit contributions
 Owl+0.006
 again+0.006
Da+0.006
 berries+0.006
 Tweet+0.006
Neg logit contributions
kin-0.008
inton-0.008
orses-0.007
orse-0.007
inations-0.007
Token
Token ID198
Feature activation-1.82e+00

Pos logit contributions
 Owl+0.004
 again+0.004
Da+0.004
 Tweet+0.004
 berries+0.004
Neg logit contributions
kin-0.006
inton-0.006
 toes-0.005
 ago-0.005
ippers-0.005
TokenS
Token ID50
Feature activation-5.71e-01

Pos logit contributions
 Owl+0.022
Da+0.022
 tighter+0.021
 again+0.021
 berries+0.020
Neg logit contributions
inton-0.054
kin-0.054
 ago-0.050
 toes-0.049
orses-0.048
Token"
Token ID1
Feature activation-4.50e+00

Pos logit contributions
 Owl+0.006
 berries+0.006
 again+0.006
 tighter+0.006
 aback+0.006
Neg logit contributions
inton-0.005
kin-0.005
orses-0.005
 ago-0.005
orse-0.004
TokenI
Token ID40
Feature activation+1.43e+00

Pos logit contributions
 Owl+0.071
Da+0.064
 owl+0.062
 again+0.062
 berries+0.061
Neg logit contributions
kin-0.123
inton-0.122
 ago-0.112
orses-0.110
 toes-0.108
Token'm
Token ID1101
Feature activation+2.81e+00

Pos logit contributions
kin+0.029
inton+0.029
 toes+0.025
orses+0.025
ippers+0.024
Neg logit contributions
 Owl-0.033
 again-0.030
 aback-0.030
 ac-0.029
 owl-0.029
Token stuck
Token ID7819
Feature activation-1.54e+00

Pos logit contributions
kin+0.064
inton+0.064
orses+0.056
 ago+0.056
 toes+0.054
Neg logit contributions
 Owl-0.058
 again-0.052
 owl-0.052
 berries-0.051
 aback-0.051
Token!"
Token ID2474
Feature activation+2.24e+00

Pos logit contributions
Da+0.032
 berries+0.031
 Owl+0.031
lim+0.030
 tighter+0.030
Neg logit contributions
 ago-0.032
kin-0.030
inton-0.029
 toes-0.026
ilot-0.025
Token Ans
Token ID28038
Feature activation+7.61e+00

Pos logit contributions
inton+0.044
kin+0.044
 ago+0.041
orses+0.040
orse+0.039
Neg logit contributions
 Owl-0.047
 berries-0.045
lim-0.045
 tighter-0.045
 again-0.044
Tokenwered
Token ID8279
Feature activation+2.54e+00

Pos logit contributions
inton+0.153
kin+0.153
 toes+0.133
 ago+0.132
orses+0.132
Neg logit contributions
 Owl-0.174
lim-0.162
 owl-0.159
 again-0.158
Da-0.155
Token the
Token ID262
Feature activation-9.84e-02

Pos logit contributions
kin+0.055
inton+0.053
 ago+0.051
 toes+0.047
orses+0.047
Neg logit contributions
 Owl-0.049
Da-0.048
lim-0.046
 tighter-0.046
 aback-0.045
Token little
Token ID1310
Feature activation-2.32e+00

Pos logit contributions
 Owl+0.002
Da+0.002
lim+0.002
 tighter+0.002
 aback+0.002
Neg logit contributions
inton-0.002
kin-0.002
 ago-0.002
orses-0.002
 toes-0.002
Token boy
Token ID2933
Feature activation-2.81e+00

Pos logit contributions
Da+0.051
lim+0.048
xc+0.047
ign+0.046
pr+0.046
Neg logit contributions
 ago-0.042
 toes-0.040
inton-0.039
ilot-0.037
orses-0.037
Token.
Token ID13
Feature activation+6.20e-01

Pos logit contributions
 Owl+0.058
 again+0.056
 owl+0.052
 tighter+0.052
 Tweet+0.049
Neg logit contributions
kin-0.068
inton-0.065
 ago-0.056
ippers-0.056
orses-0.054
Token to
Token ID284
Feature activation+7.12e-01

Pos logit contributions
 Owl+0.027
 tighter+0.022
Da+0.022
 again+0.022
lim+0.022
Neg logit contributions
inton-0.081
kin-0.080
 ago-0.075
orses-0.074
 toes-0.073
Token keep
Token ID1394
Feature activation+1.90e-01

Pos logit contributions
inton+0.014
kin+0.014
 ago+0.013
orses+0.013
orse+0.012
Neg logit contributions
 Owl-0.016
 again-0.015
 owl-0.015
lim-0.015
 Tweet-0.015
Token them
Token ID606
Feature activation-2.70e+00

Pos logit contributions
kin+0.005
inton+0.005
 ago+0.004
orses+0.004
orse+0.004
Neg logit contributions
 Owl-0.004
 again-0.003
 tighter-0.003
 berries-0.003
 owl-0.003
Token as
Token ID355
Feature activation-5.07e-01

Pos logit contributions
 again+0.043
 Owl+0.043
 tighter+0.042
 owl+0.039
 ac+0.038
Neg logit contributions
inton-0.071
kin-0.070
 ago-0.066
orses-0.066
orse-0.063
Token a
Token ID257
Feature activation+1.65e-01

Pos logit contributions
 berries+0.007
 again+0.007
 Owl+0.007
 Tweet+0.006
 tighter+0.006
Neg logit contributions
inton-0.015
kin-0.014
 ago-0.013
orses-0.013
ippers-0.013
Token souven
Token ID45043
Feature activation+7.60e+00

Pos logit contributions
inton+0.004
kin+0.004
 ago+0.003
orses+0.003
 toes+0.003
Neg logit contributions
 Owl-0.003
 again-0.003
 tighter-0.003
 owl-0.003
 berries-0.003
Tokenir
Token ID343
Feature activation+6.62e-01

Pos logit contributions
inton+0.213
kin+0.198
orses+0.191
 toes+0.185
 hardly+0.183
Neg logit contributions
 again-0.104
lim-0.097
xc-0.090
 Owl-0.079
st-0.078
Token of
Token ID286
Feature activation+3.75e+00

Pos logit contributions
kin+0.018
inton+0.018
ippers+0.015
 toes+0.015
 unison+0.015
Neg logit contributions
 Owl-0.011
 again-0.011
 owl-0.011
 ac-0.010
 Tweet-0.009
Token Frost
Token ID15122
Feature activation+1.39e+00

Pos logit contributions
inton+0.098
kin+0.094
orses+0.087
ippers+0.084
orse+0.082
Neg logit contributions
 again-0.069
 Owl-0.066
 tighter-0.062
 berries-0.060
 owl-0.060
Tokeny
Token ID88
Feature activation+7.69e-01

Pos logit contributions
inton+0.041
kin+0.041
 ago+0.038
 hardly+0.037
orses+0.037
Neg logit contributions
 Owl-0.017
 again-0.017
 berries-0.015
 owl-0.015
 tighter-0.015
Token.
Token ID13
Feature activation-5.29e-01

Pos logit contributions
inton+0.018
kin+0.018
orses+0.016
 hardly+0.016
ippers+0.015
Neg logit contributions
 again-0.015
 Owl-0.015
 owl-0.014
 ac-0.014
 berries-0.014
Token take
Token ID1011
Feature activation-1.90e+00

Pos logit contributions
inton+0.007
kin+0.007
 ago+0.006
orses+0.006
ippers+0.006
Neg logit contributions
 Owl-0.007
 berries-0.007
 again-0.007
 owl-0.007
 ac-0.007
Token it
Token ID340
Feature activation-5.70e+00

Pos logit contributions
 Owl+0.038
 owl+0.034
Da+0.034
 tighter+0.034
lim+0.034
Neg logit contributions
inton-0.043
kin-0.043
 ago-0.039
orses-0.038
 toes-0.037
Token home
Token ID1363
Feature activation-7.16e-01

Pos logit contributions
 Owl+0.087
lim+0.077
Da+0.074
 owl+0.074
 aback+0.071
Neg logit contributions
kin-0.156
inton-0.155
 ago-0.144
 toes-0.140
orses-0.137
Token as
Token ID355
Feature activation-3.01e-01

Pos logit contributions
 Owl+0.015
 again+0.014
 tighter+0.014
 owl+0.014
 berries+0.014
Neg logit contributions
inton-0.015
kin-0.015
 ago-0.014
orses-0.013
 toes-0.013
Token a
Token ID257
Feature activation+1.80e-01

Pos logit contributions
 again+0.004
 Owl+0.004
 Tweet+0.004
 berries+0.004
 tighter+0.004
Neg logit contributions
inton-0.009
kin-0.008
ippers-0.008
orses-0.008
 ago-0.008
Token souven
Token ID45043
Feature activation+7.58e+00

Pos logit contributions
kin+0.004
inton+0.004
 ago+0.004
orses+0.004
 toes+0.003
Neg logit contributions
 Owl-0.004
 aback-0.003
 again-0.003
lim-0.003
 tighter-0.003
Tokenir
Token ID343
Feature activation+1.09e+00

Pos logit contributions
inton+0.209
kin+0.197
orses+0.194
 toes+0.190
orse+0.184
Neg logit contributions
 again-0.103
lim-0.101
xc-0.092
Da-0.084
st-0.083
Token of
Token ID286
Feature activation+4.08e+00

Pos logit contributions
kin+0.028
inton+0.027
ippers+0.024
 toes+0.023
orses+0.023
Neg logit contributions
 Owl-0.020
 owl-0.020
 again-0.019
 ac-0.018
 berries-0.018
Token their
Token ID511
Feature activation+1.30e+00

Pos logit contributions
inton+0.108
kin+0.103
ippers+0.095
orses+0.095
ilot+0.092
Neg logit contributions
 again-0.070
 Owl-0.068
 berries-0.065
 tighter-0.063
 owl-0.063
Token sm
Token ID895
Feature activation+1.29e+00

Pos logit contributions
kin+0.032
inton+0.031
 ago+0.028
orses+0.028
orse+0.027
Neg logit contributions
 Owl-0.024
 again-0.023
 berries-0.022
 tighter-0.022
 owl-0.022
Tokenelly
Token ID6148
Feature activation+1.03e+00

Pos logit contributions
inton+0.032
kin+0.032
 ago+0.029
orses+0.028
ippers+0.027
Neg logit contributions
 Owl-0.023
 again-0.021
 tighter-0.021
 owl-0.021
 berries-0.021
Token in
Token ID287
Feature activation+1.45e+00

Pos logit contributions
kin+0.038
inton+0.038
orses+0.034
 ago+0.034
 toes+0.033
Neg logit contributions
 Owl-0.024
 again-0.021
 owl-0.021
 berries-0.021
 aback-0.020
Token her
Token ID607
Feature activation+2.92e+00

Pos logit contributions
inton+0.038
kin+0.037
 ago+0.034
orses+0.034
orse+0.033
Neg logit contributions
 Owl-0.025
 again-0.022
 tighter-0.022
 owl-0.022
 berries-0.022
Token hand
Token ID1021
Feature activation-2.96e-01

Pos logit contributions
inton+0.063
kin+0.063
orses+0.056
 ago+0.055
orse+0.054
Neg logit contributions
 Owl-0.062
 owl-0.058
 again-0.057
lim-0.056
 berries-0.056
Token,
Token ID11
Feature activation+5.98e-01

Pos logit contributions
 Owl+0.005
 again+0.005
 owl+0.005
 berries+0.005
 ac+0.005
Neg logit contributions
kin-0.008
inton-0.008
orses-0.007
orse-0.007
ippers-0.006
Token what
Token ID644
Feature activation+1.16e+00

Pos logit contributions
kin+0.015
inton+0.015
 ago+0.014
orses+0.014
orse+0.013
Neg logit contributions
 Owl-0.010
 again-0.009
 owl-0.009
 berries-0.009
 tighter-0.009
Token shapes
Token ID15268
Feature activation+7.57e+00

Pos logit contributions
kin+0.027
inton+0.027
 ago+0.024
orses+0.024
 toes+0.023
Neg logit contributions
 Owl-0.022
 tighter-0.021
 again-0.021
 aback-0.021
lim-0.020
Token were
Token ID547
Feature activation+1.88e+00

Pos logit contributions
inton+0.150
kin+0.143
orses+0.133
 ago+0.129
ana+0.128
Neg logit contributions
 again-0.166
 Owl-0.164
 berries-0.163
 owl-0.160
 ac-0.157
Token in
Token ID287
Feature activation+2.05e+00

Pos logit contributions
kin+0.049
inton+0.049
orses+0.044
 ago+0.044
orse+0.043
Neg logit contributions
 Owl-0.031
 again-0.028
 owl-0.028
 berries-0.027
 tighter-0.027
Token the
Token ID262
Feature activation+4.99e-01

Pos logit contributions
inton+0.049
kin+0.048
orses+0.043
 ago+0.043
orse+0.042
Neg logit contributions
 Owl-0.039
 again-0.036
 tighter-0.035
 berries-0.035
 owl-0.035
Token sky
Token ID6766
Feature activation+3.23e+00

Pos logit contributions
kin+0.011
inton+0.011
 ago+0.010
orses+0.010
 toes+0.010
Neg logit contributions
 Owl-0.010
 tighter-0.009
 again-0.009
Da-0.009
 aback-0.009
Token,
Token ID11
Feature activation+8.61e-01

Pos logit contributions
inton+0.082
kin+0.082
orses+0.073
ippers+0.072
orse+0.072
Neg logit contributions
 again-0.056
 Owl-0.055
 owl-0.050
 berries-0.050
 ac-0.048
Token didn
Token ID1422
Feature activation+1.53e+00

Pos logit contributions
inton+0.042
kin+0.042
 ago+0.037
orses+0.037
 toes+0.035
Neg logit contributions
 Owl-0.046
 again-0.044
 tighter-0.043
 owl-0.043
 berries-0.042
Token't
Token ID470
Feature activation-1.51e-01

Pos logit contributions
kin+0.026
inton+0.026
 ago+0.023
orses+0.022
 toes+0.021
Neg logit contributions
 Owl-0.039
 again-0.037
 tighter-0.036
lim-0.036
 owl-0.036
Token understand
Token ID1833
Feature activation+4.54e-01

Pos logit contributions
 Owl+0.004
 again+0.004
 owl+0.004
 tighter+0.003
 aback+0.003
Neg logit contributions
kin-0.003
inton-0.003
 ago-0.002
orses-0.002
ippers-0.002
Token what
Token ID644
Feature activation+4.83e-01

Pos logit contributions
inton+0.011
kin+0.011
 ago+0.010
orses+0.010
 toes+0.009
Neg logit contributions
 Owl-0.009
 again-0.008
 tighter-0.008
 owl-0.008
 berries-0.008
Token the
Token ID262
Feature activation+2.33e+00

Pos logit contributions
inton+0.013
kin+0.013
 ago+0.012
orses+0.012
ippers+0.011
Neg logit contributions
 Owl-0.008
 again-0.007
 owl-0.007
Da-0.007
 berries-0.007
Token characters
Token ID3435
Feature activation+7.48e+00

Pos logit contributions
kin+0.048
inton+0.045
orses+0.044
orse+0.043
 ago+0.042
Neg logit contributions
 Owl-0.053
 again-0.052
 owl-0.051
 berries-0.049
 aback-0.049
Token were
Token ID547
Feature activation+1.64e+00

Pos logit contributions
inton+0.165
orses+0.163
ippers+0.152
 ago+0.150
kin+0.150
Neg logit contributions
 again-0.153
 ac-0.133
 berries-0.130
 owl-0.129
 Owl-0.129
Token saying
Token ID2282
Feature activation+2.34e+00

Pos logit contributions
kin+0.043
inton+0.038
 unison+0.037
orses+0.036
ilot+0.036
Neg logit contributions
 again-0.031
 Owl-0.029
 owl-0.029
 ac-0.025
 enough-0.025
Token.
Token ID13
Feature activation-4.44e-01

Pos logit contributions
inton+0.062
kin+0.061
orses+0.058
orse+0.056
ippers+0.056
Neg logit contributions
 again-0.044
 enough-0.033
 Owl-0.033
 ac-0.032
 owl-0.032
Token "
Token ID366
Feature activation-3.37e+00

Pos logit contributions
 Owl+0.011
 aback+0.010
 tighter+0.010
lim+0.010
Da+0.010
Neg logit contributions
inton-0.008
kin-0.007
 ago-0.007
 toes-0.007
orses-0.006
TokenMom
Token ID29252
Feature activation-1.94e+00

Pos logit contributions
 Owl+0.070
 berries+0.065
 again+0.063
 aback+0.063
 owl+0.062
Neg logit contributions
inton-0.076
kin-0.074
orses-0.066
 ago-0.065
 hardly-0.064
Token take
Token ID1011
Feature activation-1.75e+00

Pos logit contributions
inton+0.005
kin+0.005
 ago+0.004
orses+0.004
ippers+0.004
Neg logit contributions
 Owl-0.005
 again-0.005
 tighter-0.005
 owl-0.005
 berries-0.005
Token it
Token ID340
Feature activation-5.86e+00

Pos logit contributions
 Owl+0.037
 owl+0.034
lim+0.033
 berries+0.033
 tighter+0.033
Neg logit contributions
inton-0.037
kin-0.037
 ago-0.033
orses-0.032
 toes-0.032
Token home
Token ID1363
Feature activation-1.13e+00

Pos logit contributions
 Owl+0.085
Da+0.072
 berries+0.071
 owl+0.071
lim+0.070
Neg logit contributions
inton-0.166
kin-0.165
 toes-0.151
 ago-0.150
 wear-0.146
Token as
Token ID355
Feature activation-2.43e-02

Pos logit contributions
 Owl+0.026
 berries+0.024
lim+0.024
Da+0.024
 owl+0.024
Neg logit contributions
kin-0.022
inton-0.022
 ago-0.019
orses-0.019
 toes-0.018
Token a
Token ID257
Feature activation-4.96e-02

Pos logit contributions
 Owl+0.000
 again+0.000
 tighter+0.000
 Tweet+0.000
 berries+0.000
Neg logit contributions
inton-0.001
kin-0.001
orses-0.001
 ago-0.001
orse-0.001
Token souven
Token ID45043
Feature activation+7.45e+00

Pos logit contributions
 Owl+0.001
lim+0.001
 aback+0.001
 again+0.001
 berries+0.001
Neg logit contributions
kin-0.001
inton-0.001
 ago-0.001
orses-0.001
orse-0.001
Tokenir
Token ID343
Feature activation+5.98e-01

Pos logit contributions
inton+0.191
kin+0.181
orses+0.173
 toes+0.167
orse+0.163
Neg logit contributions
 again-0.121
lim-0.119
xc-0.105
 tighter-0.104
Da-0.103
Token from
Token ID422
Feature activation+4.68e-01

Pos logit contributions
kin+0.015
inton+0.015
 ago+0.013
 toes+0.013
ippers+0.013
Neg logit contributions
 Owl-0.011
 again-0.011
 owl-0.011
lim-0.010
 ac-0.010
Token the
Token ID262
Feature activation-6.08e-01

Pos logit contributions
kin+0.011
inton+0.011
 ago+0.010
orses+0.010
 toes+0.010
Neg logit contributions
 Owl-0.009
Da-0.008
 aback-0.008
 tighter-0.008
 berries-0.008
Token park
Token ID3952
Feature activation+3.50e-01

Pos logit contributions
 Owl+0.012
Da+0.011
 aback+0.011
 tighter+0.011
 again+0.011
Neg logit contributions
kin-0.014
inton-0.014
 ago-0.013
orses-0.012
 toes-0.012
Token.
Token ID13
Feature activation+4.62e-01

Pos logit contributions
inton+0.007
kin+0.007
 ago+0.006
orses+0.006
 toes+0.006
Neg logit contributions
 Owl-0.008
 again-0.008
 tighter-0.007
 owl-0.007
 berries-0.007
Token The
Token ID383
Feature activation+1.26e+00

Pos logit contributions
 Owl+0.002
 Tweet+0.002
 owl+0.002
 again+0.002
 aback+0.002
Neg logit contributions
inton-0.002
kin-0.002
orses-0.002
orse-0.002
ippers-0.002
Token motor
Token ID5584
Feature activation+6.49e-01

Pos logit contributions
kin+0.035
 ago+0.035
orses+0.034
inton+0.033
orse+0.033
Neg logit contributions
 owl-0.019
 again-0.019
 tighter-0.018
 Owl-0.018
 ac-0.018
Token hum
Token ID1311
Feature activation+5.11e+00

Pos logit contributions
kin+0.017
inton+0.017
 ago+0.015
ippers+0.015
orses+0.015
Neg logit contributions
 Owl-0.011
 owl-0.010
 again-0.010
 ac-0.010
 Tweet-0.009
Tokenmed
Token ID1150
Feature activation+1.51e+00

Pos logit contributions
inton+0.122
kin+0.112
 ago+0.111
 toes+0.109
ippers+0.106
Neg logit contributions
 again-0.088
 Owl-0.083
 ac-0.080
 owl-0.075
lim-0.075
Token and
Token ID290
Feature activation+3.71e-01

Pos logit contributions
inton+0.039
kin+0.039
 ago+0.035
orses+0.035
 toes+0.034
Neg logit contributions
 Owl-0.025
 aback-0.023
 berries-0.023
lim-0.022
 tighter-0.022
Token wh
Token ID348
Feature activation+7.44e+00

Pos logit contributions
inton+0.010
kin+0.009
ippers+0.009
 ago+0.009
ilot+0.008
Neg logit contributions
 again-0.006
 ac-0.006
 Owl-0.006
 owl-0.006
 Tweet-0.006
Tokenir
Token ID343
Feature activation+2.07e+00

Pos logit contributions
inton+0.143
kin+0.136
 toes+0.133
orses+0.128
 ago+0.127
Neg logit contributions
 again-0.152
lim-0.151
 Owl-0.147
 aback-0.144
Da-0.143
Tokenred
Token ID445
Feature activation+3.55e-01

Pos logit contributions
kin+0.056
inton+0.055
 ago+0.051
orses+0.050
orse+0.049
Neg logit contributions
 Owl-0.032
 again-0.031
lim-0.029
 owl-0.029
 aback-0.029
Token and
Token ID290
Feature activation+7.85e-01

Pos logit contributions
kin+0.009
inton+0.008
 ago+0.008
orses+0.008
orse+0.007
Neg logit contributions
 Owl-0.007
 again-0.006
 tighter-0.006
 owl-0.006
 berries-0.006
Token it
Token ID340
Feature activation-1.68e-01

Pos logit contributions
inton+0.020
kin+0.019
ippers+0.019
 ago+0.018
ilot+0.018
Neg logit contributions
 again-0.014
 Owl-0.013
 Tweet-0.013
 ac-0.012
 owl-0.012
Token churn
Token ID28488
Feature activation+2.27e+00

Pos logit contributions
 Owl+0.003
 again+0.003
 tighter+0.003
 owl+0.003
 ac+0.003
Neg logit contributions
inton-0.004
kin-0.004
 ago-0.004
orses-0.004
ippers-0.004
Token keep
Token ID1394
Feature activation-8.86e-03

Pos logit contributions
kin+0.006
inton+0.006
 ago+0.006
orses+0.005
 toes+0.005
Neg logit contributions
 Owl-0.008
 again-0.007
 owl-0.007
lim-0.007
 berries-0.007
Token the
Token ID262
Feature activation-2.46e-01

Pos logit contributions
 Owl+0.000
 again+0.000
 tighter+0.000
 owl+0.000
 berries+0.000
Neg logit contributions
kin-0.000
inton-0.000
 ago-0.000
orses-0.000
orse-0.000
Token ring
Token ID5858
Feature activation-2.05e+00

Pos logit contributions
 Owl+0.004
Da+0.004
 aback+0.004
 tighter+0.004
 again+0.004
Neg logit contributions
kin-0.006
inton-0.006
 ago-0.006
orses-0.005
 toes-0.005
Token as
Token ID355
Feature activation+4.61e-02

Pos logit contributions
 Owl+0.039
Da+0.036
 berries+0.035
lim+0.035
 aback+0.035
Neg logit contributions
kin-0.049
inton-0.048
 ago-0.044
orses-0.043
 toes-0.042
Token a
Token ID257
Feature activation+3.73e-01

Pos logit contributions
inton+0.001
kin+0.001
orses+0.001
ippers+0.001
ilot+0.001
Neg logit contributions
 again-0.001
 berries-0.001
 Owl-0.001
 tighter-0.001
 closer-0.001
Token souven
Token ID45043
Feature activation+7.38e+00

Pos logit contributions
inton+0.009
kin+0.009
 ago+0.008
orses+0.008
 toes+0.008
Neg logit contributions
 Owl-0.007
 again-0.006
 tighter-0.006
 owl-0.006
 berries-0.006
Tokenir
Token ID343
Feature activation+8.20e-01

Pos logit contributions
inton+0.205
kin+0.189
orses+0.184
 toes+0.180
 hardly+0.175
Neg logit contributions
 again-0.103
lim-0.102
xc-0.090
st-0.086
 Owl-0.083
Token.
Token ID13
Feature activation-1.27e-01

Pos logit contributions
inton+0.020
kin+0.020
 toes+0.018
ippers+0.017
orses+0.017
Neg logit contributions
 Owl-0.015
 owl-0.015
 again-0.014
 ac-0.013
lim-0.013
Token They
Token ID1119
Feature activation+2.33e+00

Pos logit contributions
 again+0.003
 aback+0.002
 Owl+0.002
 Tweet+0.002
 closer+0.002
Neg logit contributions
kin-0.003
inton-0.003
orses-0.002
orse-0.002
ilot-0.002
Token thanked
Token ID26280
Feature activation+1.97e+00

Pos logit contributions
inton+0.054
kin+0.053
 ago+0.048
orses+0.048
ippers+0.046
Neg logit contributions
 Owl-0.045
 again-0.043
 tighter-0.042
 owl-0.041
 berries-0.041
Token the
Token ID262
Feature activation-9.62e-01

Pos logit contributions
inton+0.036
kin+0.036
 ago+0.032
orses+0.031
 toes+0.030
Neg logit contributions
 Owl-0.048
 again-0.045
 tighter-0.044
 owl-0.044
 berries-0.044
Token gold
Token ID3869
Feature activation-1.77e+00

Pos logit contributions
Da+0.014
 Owl+0.014
 tighter+0.012
lim+0.012
 aback+0.012
Neg logit contributions
kin-0.022
inton-0.021
 ago-0.020
orses-0.019
 toes-0.019
Token coin
Token ID10752
Feature activation-3.55e+00

Pos logit contributions
Da+0.037
 Owl+0.035
lim+0.032
 aback+0.032
Bet+0.031
Neg logit contributions
kin-0.044
inton-0.038
 ago-0.035
 toes-0.035
orses-0.035
Token as
Token ID355
Feature activation+5.17e-02

Pos logit contributions
 Owl+0.064
Da+0.059
lim+0.059
 aback+0.058
 berries+0.057
Neg logit contributions
kin-0.089
inton-0.085
 ago-0.078
orses-0.076
 toes-0.075
Token a
Token ID257
Feature activation-1.13e-01

Pos logit contributions
inton+0.001
kin+0.001
orses+0.001
 ago+0.001
orse+0.001
Neg logit contributions
 Owl-0.001
 again-0.001
 berries-0.001
 tighter-0.001
Da-0.001
Token special
Token ID2041
Feature activation+3.86e-01

Pos logit contributions
 Owl+0.002
 aback+0.002
lim+0.002
Da+0.002
 berries+0.002
Neg logit contributions
kin-0.002
inton-0.002
 ago-0.002
orses-0.002
orse-0.002
Token souven
Token ID45043
Feature activation+7.37e+00

Pos logit contributions
kin+0.009
inton+0.008
 ago+0.008
orses+0.007
 toes+0.007
Neg logit contributions
 Owl-0.008
Da-0.007
 tighter-0.007
 again-0.007
 berries-0.007
Tokenir
Token ID343
Feature activation+9.74e-01

Pos logit contributions
inton+0.194
kin+0.183
orses+0.173
 toes+0.170
orse+0.165
Neg logit contributions
lim-0.118
 again-0.116
 Owl-0.105
 av-0.103
xc-0.101
Token of
Token ID286
Feature activation+4.01e+00

Pos logit contributions
kin+0.024
inton+0.023
ippers+0.020
 toes+0.020
orses+0.020
Neg logit contributions
 Owl-0.019
 again-0.018
 owl-0.018
 ac-0.017
lim-0.017
Token Sarah
Token ID10490
Feature activation-7.24e-01

Pos logit contributions
inton+0.104
kin+0.101
orses+0.093
ippers+0.089
ilot+0.088
Neg logit contributions
 Owl-0.070
 again-0.067
 owl-0.064
 tighter-0.064
 berries-0.063
Token's
Token ID338
Feature activation+2.64e+00

Pos logit contributions
 Owl+0.013
 again+0.013
 owl+0.012
 tighter+0.012
 berries+0.012
Neg logit contributions
inton-0.018
kin-0.018
orses-0.016
 ago-0.015
ippers-0.015
Token adventure
Token ID8855
Feature activation+7.44e-01

Pos logit contributions
inton+0.063
kin+0.063
orses+0.056
 ago+0.055
orse+0.053
Neg logit contributions
 Owl-0.050
 again-0.047
 owl-0.046
 berries-0.045
 tighter-0.045
Token the
Token ID262
Feature activation-1.18e+00

Pos logit contributions
 Owl+0.015
 tighter+0.014
 again+0.014
lim+0.014
 owl+0.014
Neg logit contributions
inton-0.014
kin-0.014
 ago-0.012
orses-0.012
orse-0.012
Token gray
Token ID12768
Feature activation+2.00e+00

Pos logit contributions
 Owl+0.022
 again+0.020
Da+0.020
 tighter+0.020
lim+0.020
Neg logit contributions
kin-0.029
inton-0.029
 ago-0.026
orses-0.025
 toes-0.025
Token shell
Token ID7582
Feature activation-3.69e+00

Pos logit contributions
kin+0.050
inton+0.049
 ago+0.045
orses+0.044
 toes+0.043
Neg logit contributions
 Owl-0.037
 tighter-0.033
Da-0.033
lim-0.033
 aback-0.033
Token as
Token ID355
Feature activation-3.68e-01

Pos logit contributions
 Owl+0.056
 again+0.055
 tighter+0.052
 berries+0.051
 owl+0.050
Neg logit contributions
inton-0.100
kin-0.098
 ago-0.092
orses-0.090
orse-0.087
Token a
Token ID257
Feature activation-5.99e-02

Pos logit contributions
 Owl+0.006
 again+0.006
 Tweet+0.006
 tighter+0.005
 owl+0.005
Neg logit contributions
inton-0.010
kin-0.009
 ago-0.009
ippers-0.009
ana-0.009
Token souven
Token ID45043
Feature activation+7.35e+00

Pos logit contributions
 Owl+0.001
 again+0.001
 tighter+0.001
 owl+0.001
 berries+0.001
Neg logit contributions
inton-0.001
kin-0.001
 ago-0.001
orses-0.001
ippers-0.001
Tokenir
Token ID343
Feature activation+6.01e-01

Pos logit contributions
inton+0.197
kin+0.185
orses+0.179
 toes+0.173
orse+0.170
Neg logit contributions
 again-0.114
lim-0.109
Da-0.098
xc-0.096
 tighter-0.096
Token of
Token ID286
Feature activation+3.32e+00

Pos logit contributions
kin+0.015
inton+0.015
ippers+0.013
 toes+0.012
 ago+0.012
Neg logit contributions
 Owl-0.012
 again-0.011
 owl-0.011
 ac-0.010
lim-0.010
Token his
Token ID465
Feature activation+5.51e-01

Pos logit contributions
inton+0.087
kin+0.084
orses+0.077
ippers+0.075
 ago+0.075
Neg logit contributions
 Owl-0.057
 again-0.056
 owl-0.052
 tighter-0.052
 berries-0.052
Token adventure
Token ID8855
Feature activation+1.50e-01

Pos logit contributions
inton+0.012
kin+0.012
 ago+0.011
orses+0.011
ippers+0.010
Neg logit contributions
 Owl-0.011
 again-0.011
 tighter-0.010
 owl-0.010
 berries-0.010
Token.
Token ID13
Feature activation-4.04e-01

Pos logit contributions
inton+0.003
kin+0.003
orses+0.003
 ago+0.003
ippers+0.003
Neg logit contributions
 again-0.003
 Owl-0.003
 tighter-0.003
 owl-0.003
 aback-0.003
Token felt
Token ID2936
Feature activation+5.53e-01

Pos logit contributions
inton+0.002
kin+0.001
orses+0.001
 ago+0.001
orse+0.001
Neg logit contributions
 Owl-0.002
 aback-0.001
 tighter-0.001
lim-0.001
Da-0.001
Token jealous
Token ID19354
Feature activation+1.57e+00

Pos logit contributions
inton+0.015
kin+0.015
 ago+0.014
ippers+0.014
orses+0.014
Neg logit contributions
 Owl-0.009
 again-0.008
 aback-0.008
 ac-0.008
 tighter-0.008
Token,
Token ID11
Feature activation+4.68e-01

Pos logit contributions
kin+0.037
inton+0.037
 ago+0.033
orses+0.033
orse+0.032
Neg logit contributions
 Owl-0.030
 again-0.029
 tighter-0.028
 owl-0.027
 berries-0.027
Token because
Token ID780
Feature activation+2.28e+00

Pos logit contributions
inton+0.012
kin+0.012
 ago+0.011
orses+0.011
 toes+0.010
Neg logit contributions
 Owl-0.008
lim-0.007
 tighter-0.007
 again-0.007
 berries-0.007
Token the
Token ID262
Feature activation+2.24e+00

Pos logit contributions
inton+0.054
kin+0.053
orses+0.047
 ago+0.047
ippers+0.046
Neg logit contributions
 Owl-0.045
 again-0.042
 owl-0.041
 berries-0.040
 tighter-0.040
Token ch
Token ID442
Feature activation+7.34e+00

Pos logit contributions
kin+0.060
inton+0.059
 ago+0.055
orses+0.054
orse+0.053
Neg logit contributions
 Owl-0.034
Da-0.032
 tighter-0.032
 aback-0.032
lim-0.032
Tokename
Token ID480
Feature activation+3.90e+00

Pos logit contributions
inton+0.141
kin+0.141
 ago+0.121
orses+0.118
 toes+0.115
Neg logit contributions
 Owl-0.173
 owl-0.159
lim-0.158
 again-0.157
 berries-0.155
Tokenleon
Token ID38970
Feature activation+3.24e+00

Pos logit contributions
inton+0.107
kin+0.103
 ago+0.095
ippers+0.095
uits+0.094
Neg logit contributions
 Owl-0.063
 owl-0.057
 again-0.056
 Tweet-0.052
 ac-0.051
Token was
Token ID373
Feature activation+1.88e+00

Pos logit contributions
inton+0.078
kin+0.077
 ago+0.070
orses+0.069
ippers+0.069
Neg logit contributions
 Owl-0.061
 again-0.057
 owl-0.055
 berries-0.054
 aback-0.052
Token very
Token ID845
Feature activation+2.73e-01

Pos logit contributions
kin+0.049
inton+0.049
ippers+0.044
orses+0.044
 ago+0.043
Neg logit contributions
 Owl-0.032
 again-0.030
 owl-0.029
 tighter-0.028
 berries-0.028
Token colorful
Token ID20239
Feature activation+2.06e+00

Pos logit contributions
kin+0.008
inton+0.007
ippers+0.007
 ago+0.007
orses+0.007
Neg logit contributions
 Owl-0.004
 ac-0.004
 owl-0.004
 again-0.004
 tighter-0.004
Token keep
Token ID1394
Feature activation-2.82e-03

Pos logit contributions
kin+0.004
inton+0.004
 ago+0.004
orses+0.004
 toes+0.004
Neg logit contributions
 Owl-0.005
 again-0.005
 owl-0.005
lim-0.004
 berries-0.004
Token the
Token ID262
Feature activation-6.88e-01

Pos logit contributions
 Owl+0.000
Da+0.000
lim+0.000
 tighter+0.000
 aback+0.000
Neg logit contributions
inton-0.000
kin-0.000
orses-0.000
 ago-0.000
 toes-0.000
Token spring
Token ID6076
Feature activation-4.40e-01

Pos logit contributions
 Owl+0.011
Da+0.011
 aback+0.010
lim+0.010
 again+0.010
Neg logit contributions
kin-0.018
inton-0.017
 ago-0.016
orses-0.015
 toes-0.015
Token as
Token ID355
Feature activation-8.48e-01

Pos logit contributions
 Owl+0.008
Da+0.007
 aback+0.007
lim+0.007
 tighter+0.007
Neg logit contributions
kin-0.011
inton-0.011
 ago-0.010
orses-0.009
 toes-0.009
Token a
Token ID257
Feature activation+1.22e-02

Pos logit contributions
 Owl+0.014
 again+0.014
 tighter+0.013
 berries+0.013
 owl+0.013
Neg logit contributions
inton-0.022
kin-0.021
 ago-0.020
orses-0.020
 unison-0.019
Token souven
Token ID45043
Feature activation+7.33e+00

Pos logit contributions
inton+0.000
kin+0.000
 ago+0.000
orses+0.000
 toes+0.000
Neg logit contributions
 Owl-0.000
 again-0.000
 tighter-0.000
 owl-0.000
 berries-0.000
Tokenir
Token ID343
Feature activation+8.63e-01

Pos logit contributions
inton+0.204
kin+0.193
orses+0.184
 toes+0.182
ana+0.178
Neg logit contributions
lim-0.105
 again-0.101
 Owl-0.091
xc-0.088
st-0.084
Token of
Token ID286
Feature activation+3.24e+00

Pos logit contributions
kin+0.020
inton+0.020
ippers+0.017
 toes+0.017
 ago+0.017
Neg logit contributions
 Owl-0.018
 again-0.017
 owl-0.017
lim-0.016
 ac-0.016
Token his
Token ID465
Feature activation+1.30e+00

Pos logit contributions
inton+0.085
kin+0.082
orses+0.074
ippers+0.072
ilot+0.072
Neg logit contributions
 Owl-0.059
 again-0.056
 owl-0.053
 ac-0.052
 berries-0.052
Token adventure
Token ID8855
Feature activation+4.54e-01

Pos logit contributions
inton+0.028
kin+0.028
 ago+0.025
orses+0.025
orse+0.023
Neg logit contributions
 Owl-0.027
 again-0.026
 owl-0.025
 berries-0.025
 tighter-0.025
Token.
Token ID13
Feature activation+3.45e-01

Pos logit contributions
inton+0.010
kin+0.010
orses+0.009
 ago+0.009
ippers+0.009
Neg logit contributions
 Owl-0.009
 again-0.009
 tighter-0.008
 owl-0.008
 aback-0.008
Token faster
Token ID5443
Feature activation-4.26e-01

Pos logit contributions
 Owl+0.029
lim+0.026
 berries+0.025
 owl+0.025
 av+0.025
Neg logit contributions
kin-0.028
inton-0.028
 toes-0.026
 ago-0.025
uits-0.025
Token than
Token ID621
Feature activation+9.70e-01

Pos logit contributions
 Owl+0.007
 again+0.006
 tighter+0.006
 owl+0.006
 berries+0.006
Neg logit contributions
inton-0.011
kin-0.011
 ago-0.010
orses-0.010
 toes-0.010
Token the
Token ID262
Feature activation+1.13e+00

Pos logit contributions
inton+0.023
kin+0.023
orses+0.020
 ago+0.020
ippers+0.019
Neg logit contributions
 Owl-0.019
 again-0.018
 owl-0.017
 tighter-0.017
 berries-0.017
Token clouds
Token ID15114
Feature activation+3.72e+00

Pos logit contributions
inton+0.027
kin+0.027
 ago+0.024
orses+0.024
orse+0.023
Neg logit contributions
 Owl-0.022
 again-0.020
 owl-0.020
 tighter-0.020
 berries-0.019
Token and
Token ID290
Feature activation+8.60e-01

Pos logit contributions
inton+0.109
kin+0.107
ippers+0.100
orses+0.097
ana+0.095
Neg logit contributions
 again-0.060
 Owl-0.051
 ac-0.047
 owl-0.045
 further-0.045
Token wh
Token ID348
Feature activation+7.26e+00

Pos logit contributions
inton+0.019
kin+0.019
 ago+0.018
orses+0.017
 toes+0.017
Neg logit contributions
 Owl-0.017
Da-0.016
lim-0.016
 tighter-0.016
 aback-0.016
Tokenizz
Token ID6457
Feature activation+4.01e-01

Pos logit contributions
inton+0.143
 toes+0.139
kin+0.137
 ago+0.133
 hardly+0.129
Neg logit contributions
lim-0.142
 again-0.139
 Owl-0.137
st-0.135
xc-0.135
Tokened
Token ID276
Feature activation-2.19e+00

Pos logit contributions
kin+0.011
inton+0.011
 ago+0.010
orses+0.010
orse+0.010
Neg logit contributions
 Owl-0.006
 again-0.005
 tighter-0.005
 owl-0.005
 berries-0.005
Token past
Token ID1613
Feature activation+3.72e+00

Pos logit contributions
 Owl+0.040
 again+0.037
 tighter+0.036
 owl+0.036
 berries+0.036
Neg logit contributions
kin-0.053
inton-0.053
 ago-0.048
orses-0.048
orse-0.046
Token the
Token ID262
Feature activation-3.99e-01

Pos logit contributions
inton+0.087
kin+0.087
 ago+0.078
orses+0.077
orse+0.074
Neg logit contributions
 Owl-0.072
 again-0.067
 tighter-0.065
 berries-0.065
 owl-0.065
Token trees
Token ID7150
Feature activation+3.41e+00

Pos logit contributions
 Owl+0.008
 again+0.007
 tighter+0.007
Da+0.007
 aback+0.007
Neg logit contributions
inton-0.009
kin-0.009
 ago-0.009
orses-0.008
 toes-0.008
Token bitter
Token ID12922
Feature activation-5.67e-01

Pos logit contributions
kin+0.038
inton+0.037
 ago+0.034
 toes+0.033
orses+0.033
Neg logit contributions
 Owl-0.024
lim-0.023
Da-0.022
 aback-0.022
 owl-0.022
Token!
Token ID0
Feature activation+6.20e-01

Pos logit contributions
 Owl+0.011
Da+0.010
 tighter+0.010
 owl+0.010
lim+0.010
Neg logit contributions
inton-0.013
kin-0.013
 ago-0.012
 toes-0.012
orses-0.012
Token He
Token ID679
Feature activation-3.88e-01

Pos logit contributions
kin+0.012
inton+0.012
 ago+0.011
orses+0.011
 toes+0.010
Neg logit contributions
 Owl-0.015
 again-0.013
Da-0.013
 aback-0.013
 berries-0.013
Token quickly
Token ID2952
Feature activation-1.50e+00

Pos logit contributions
 Owl+0.008
 aback+0.008
lim+0.008
 again+0.008
 tighter+0.008
Neg logit contributions
kin-0.008
inton-0.008
orses-0.007
 ago-0.007
orse-0.007
Token spat
Token ID15246
Feature activation+1.87e+00

Pos logit contributions
 Owl+0.029
 again+0.027
 tighter+0.027
 berries+0.027
 owl+0.027
Neg logit contributions
kin-0.035
inton-0.035
 ago-0.031
orses-0.031
ippers-0.030
Token it
Token ID340
Feature activation-7.11e+00

Pos logit contributions
kin+0.046
inton+0.045
 ago+0.041
orses+0.040
 toes+0.039
Neg logit contributions
 Owl-0.034
 owl-0.031
 tighter-0.031
Da-0.031
lim-0.031
Token out
Token ID503
Feature activation-1.77e+00

Pos logit contributions
 Owl+0.137
Da+0.128
 owl+0.128
lim+0.125
iw+0.116
Neg logit contributions
inton-0.168
kin-0.163
 toes-0.145
 hardly-0.145
 wear-0.143
Token.
Token ID13
Feature activation+6.97e-01

Pos logit contributions
 Owl+0.036
lim+0.033
Da+0.032
 owl+0.032
 berries+0.032
Neg logit contributions
inton-0.040
kin-0.039
 ago-0.036
 toes-0.035
orses-0.034
Token
Token ID198
Feature activation+3.04e-01

Pos logit contributions
kin+0.014
inton+0.014
 ago+0.012
orses+0.012
 toes+0.012
Neg logit contributions
 Owl-0.016
 again-0.015
 aback-0.015
Da-0.014
 berries-0.014
Token
Token ID198
Feature activation-1.33e+00

Pos logit contributions
kin+0.007
inton+0.007
 ago+0.007
 toes+0.007
ippers+0.006
Neg logit contributions
 Owl-0.006
Da-0.006
 again-0.005
 aback-0.005
 Tweet-0.005
TokenThe
Token ID464
Feature activation-5.35e-02

Pos logit contributions
Da+0.022
 Owl+0.022
lim+0.020
 again+0.020
 tighter+0.020
Neg logit contributions
kin-0.034
inton-0.034
 ago-0.033
 toes-0.032
orses-0.030
Token She
Token ID1375
Feature activation-3.78e-01

Pos logit contributions
 Owl+0.007
 again+0.006
 aback+0.006
Da+0.006
 owl+0.006
Neg logit contributions
kin-0.010
inton-0.009
orses-0.008
ippers-0.008
orse-0.008
Token was
Token ID373
Feature activation+8.09e-01

Pos logit contributions
 Owl+0.009
 aback+0.008
lim+0.008
 tighter+0.008
 owl+0.008
Neg logit contributions
inton-0.007
kin-0.007
orses-0.007
orse-0.007
 hardly-0.006
Token able
Token ID1498
Feature activation-2.94e+00

Pos logit contributions
inton+0.023
kin+0.023
 ago+0.021
orses+0.021
 toes+0.020
Neg logit contributions
 Owl-0.012
 again-0.011
 tighter-0.010
 owl-0.010
 berries-0.010
Token to
Token ID284
Feature activation-1.94e-01

Pos logit contributions
 Owl+0.038
lim+0.033
 owl+0.032
 berries+0.031
 tighter+0.031
Neg logit contributions
inton-0.086
kin-0.083
 ago-0.080
orses-0.078
 toes-0.078
Token scoop
Token ID23529
Feature activation-1.00e+00

Pos logit contributions
 Owl+0.004
 again+0.004
 owl+0.004
 tighter+0.004
 berries+0.004
Neg logit contributions
kin-0.004
inton-0.004
 ago-0.004
orses-0.004
 toes-0.004
Token it
Token ID340
Feature activation-7.08e+00

Pos logit contributions
 Owl+0.020
 owl+0.019
lim+0.018
Da+0.017
izable+0.016
Neg logit contributions
inton-0.022
kin-0.022
 toes-0.020
orses-0.018
ippers-0.018
Token up
Token ID510
Feature activation-3.00e+00

Pos logit contributions
 Owl+0.140
 owl+0.128
Da+0.120
iw+0.119
 av+0.118
Neg logit contributions
inton-0.168
kin-0.164
 toes-0.144
 wear-0.142
 hardly-0.141
Token with
Token ID351
Feature activation+4.32e-01

Pos logit contributions
 Owl+0.066
lim+0.063
 owl+0.061
Da+0.060
xc+0.058
Neg logit contributions
kin-0.058
inton-0.057
 toes-0.053
 ago-0.049
 wear-0.048
Token her
Token ID607
Feature activation+8.24e-01

Pos logit contributions
inton+0.011
kin+0.011
 ago+0.010
orses+0.010
 toes+0.010
Neg logit contributions
 Owl-0.007
 again-0.007
 tighter-0.007
lim-0.006
 owl-0.006
Token net
Token ID2010
Feature activation-2.43e+00

Pos logit contributions
inton+0.021
kin+0.021
orses+0.019
 ago+0.019
loe+0.019
Neg logit contributions
 again-0.014
 Owl-0.013
 tighter-0.012
 ac-0.012
 berries-0.012
Token and
Token ID290
Feature activation+3.81e-01

Pos logit contributions
 Owl+0.040
 again+0.037
 tighter+0.036
 berries+0.035
 owl+0.035
Neg logit contributions
inton-0.064
kin-0.063
 ago-0.058
orses-0.058
orse-0.055
Token her
Token ID607
Feature activation+6.04e-01

Pos logit contributions
kin+0.012
inton+0.011
 toes+0.011
 ago+0.010
ana+0.010
Neg logit contributions
 Owl-0.013
lim-0.013
 owl-0.012
Da-0.011
stone-0.011
Token tongue
Token ID11880
Feature activation-9.62e-01

Pos logit contributions
inton+0.011
kin+0.011
orses+0.010
 ago+0.010
orse+0.009
Neg logit contributions
 Owl-0.014
 again-0.014
 berries-0.014
 tighter-0.013
 owl-0.013
Token.
Token ID13
Feature activation+1.30e-01

Pos logit contributions
 Owl+0.018
lim+0.016
Da+0.016
 aback+0.016
 owl+0.016
Neg logit contributions
kin-0.024
inton-0.023
 ago-0.021
orses-0.021
 toes-0.021
Token She
Token ID1375
Feature activation+1.97e-01

Pos logit contributions
inton+0.003
kin+0.003
orses+0.002
 ago+0.002
orse+0.002
Neg logit contributions
 Owl-0.003
 again-0.003
Da-0.003
 berries-0.003
 aback-0.003
Token spat
Token ID15246
Feature activation+1.83e+00

Pos logit contributions
inton+0.004
kin+0.004
 ago+0.004
orses+0.004
ippers+0.004
Neg logit contributions
 Owl-0.004
 again-0.004
 tighter-0.004
 berries-0.004
 owl-0.004
Token it
Token ID340
Feature activation-7.02e+00

Pos logit contributions
inton+0.046
kin+0.046
 ago+0.042
orses+0.041
 toes+0.039
Neg logit contributions
 Owl-0.032
 again-0.030
 tighter-0.029
 berries-0.029
 owl-0.029
Token out
Token ID503
Feature activation-1.71e+00

Pos logit contributions
 Owl+0.148
Da+0.136
 owl+0.136
lim+0.136
 av+0.129
Neg logit contributions
kin-0.156
inton-0.154
 toes-0.136
 wear-0.134
 hardly-0.129
Token and
Token ID290
Feature activation-3.46e-01

Pos logit contributions
 Owl+0.043
lim+0.041
 aback+0.040
Da+0.040
 owl+0.039
Neg logit contributions
kin-0.030
inton-0.029
 ago-0.026
 toes-0.026
orses-0.024
Token cried
Token ID16896
Feature activation+1.73e+00

Pos logit contributions
 Owl+0.007
 again+0.006
lim+0.006
 tighter+0.006
 owl+0.006
Neg logit contributions
kin-0.008
inton-0.008
 ago-0.007
orses-0.007
 toes-0.007
Token.
Token ID13
Feature activation-2.80e-01

Pos logit contributions
kin+0.026
inton+0.026
 toes+0.022
 ago+0.022
orses+0.020
Neg logit contributions
 Owl-0.047
lim-0.045
Da-0.045
 aback-0.044
 owl-0.044
Token
Token ID198
Feature activation-2.76e-01

Pos logit contributions
 Owl+0.006
 again+0.006
Da+0.006
 berries+0.006
 owl+0.006
Neg logit contributions
kin-0.006
inton-0.006
orses-0.005
 ago-0.005
orse-0.005
Token at
Token ID379
Feature activation+2.01e+00

Pos logit contributions
 Owl+0.039
 again+0.038
 berries+0.036
 tighter+0.036
 owl+0.035
Neg logit contributions
kin-0.044
inton-0.044
 ago-0.041
orses-0.040
ippers-0.038
Token all
Token ID477
Feature activation+6.13e-01

Pos logit contributions
kin+0.048
inton+0.046
 ago+0.043
 toes+0.041
 hardly+0.040
Neg logit contributions
 Owl-0.038
Da-0.037
lim-0.036
 aback-0.036
 owl-0.035
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
kin+0.016
inton+0.015
ippers+0.014
ana+0.014
ilot+0.014
Neg logit contributions
 again-0.012
 Owl-0.011
 berries-0.011
 owl-0.010
 ac-0.010
Token He
Token ID679
Feature activation-8.08e-04

Pos logit contributions
kin+0.024
inton+0.024
 ago+0.021
orses+0.021
 toes+0.020
Neg logit contributions
 Owl-0.024
 again-0.023
 aback-0.022
 berries-0.021
Da-0.021
Token spat
Token ID15246
Feature activation+2.05e+00

Pos logit contributions
 Owl+0.000
 aback+0.000
 owl+0.000
lim+0.000
Da+0.000
Neg logit contributions
kin-0.000
inton-0.000
orses-0.000
 hardly-0.000
 toes-0.000
Token it
Token ID340
Feature activation-6.98e+00

Pos logit contributions
kin+0.048
inton+0.047
orses+0.041
 ago+0.041
 toes+0.040
Neg logit contributions
 Owl-0.041
 owl-0.038
lim-0.037
Da-0.037
 tighter-0.036
Token out
Token ID503
Feature activation-1.53e+00

Pos logit contributions
 Owl+0.147
 owl+0.139
lim+0.133
Da+0.132
 av+0.122
Neg logit contributions
inton-0.156
kin-0.154
 wear-0.133
 toes-0.131
 hardly-0.128
Token and
Token ID290
Feature activation-1.12e-01

Pos logit contributions
 Owl+0.033
lim+0.031
Da+0.030
 owl+0.029
xc+0.029
Neg logit contributions
inton-0.032
kin-0.032
 ago-0.029
 toes-0.029
orses-0.027
Token made
Token ID925
Feature activation+9.10e-01

Pos logit contributions
 Owl+0.002
lim+0.002
 owl+0.002
Da+0.002
 aback+0.002
Neg logit contributions
kin-0.002
inton-0.002
orses-0.002
 toes-0.002
 ago-0.002
Token a
Token ID257
Feature activation-1.26e+00

Pos logit contributions
kin+0.024
inton+0.024
 ago+0.023
orses+0.022
 toes+0.021
Neg logit contributions
 Owl-0.014
 again-0.013
 tighter-0.013
 berries-0.013
 owl-0.013
Token face
Token ID1986
Feature activation-7.34e-01

Pos logit contributions
 Owl+0.031
lim+0.029
 aback+0.029
 owl+0.028
Da+0.028
Neg logit contributions
inton-0.023
kin-0.023
orses-0.020
 toes-0.019
orse-0.019
Token mouth
Token ID5422
Feature activation+2.88e-01

Pos logit contributions
inton+0.019
kin+0.019
 ago+0.017
orses+0.017
orse+0.016
Neg logit contributions
 Owl-0.022
 again-0.021
 tighter-0.020
 berries-0.020
 owl-0.020
Token.
Token ID13
Feature activation+1.95e-01

Pos logit contributions
kin+0.006
inton+0.006
 ago+0.006
 toes+0.005
orses+0.005
Neg logit contributions
 Owl-0.006
Da-0.006
lim-0.005
 berries-0.005
 owl-0.005
Token You
Token ID921
Feature activation+3.06e+00

Pos logit contributions
inton+0.004
kin+0.004
 ago+0.003
orses+0.003
 toes+0.003
Neg logit contributions
 Owl-0.004
 again-0.004
 tighter-0.004
 owl-0.004
Da-0.004
Token should
Token ID815
Feature activation+2.99e-01

Pos logit contributions
kin+0.049
inton+0.049
 ago+0.042
orses+0.041
orse+0.038
Neg logit contributions
 Owl-0.081
Da-0.077
 tighter-0.076
 again-0.076
 owl-0.075
Token spit
Token ID27591
Feature activation+7.99e-01

Pos logit contributions
inton+0.006
kin+0.006
 ago+0.006
orses+0.005
ilot+0.005
Neg logit contributions
 Owl-0.007
 berries-0.006
 ac-0.006
 again-0.006
 owl-0.006
Token it
Token ID340
Feature activation-6.97e+00

Pos logit contributions
kin+0.015
inton+0.014
 wear+0.013
ana+0.012
orses+0.012
Neg logit contributions
 Owl-0.019
Da-0.018
lim-0.017
 owl-0.017
xc-0.017
Token out
Token ID503
Feature activation-1.24e+00

Pos logit contributions
 Owl+0.146
Da+0.132
lim+0.131
 owl+0.127
iw+0.124
Neg logit contributions
kin-0.154
inton-0.148
 wear-0.144
 time-0.131
 toes-0.128
Token.
Token ID13
Feature activation+2.79e-01

Pos logit contributions
 Owl+0.024
Da+0.022
lim+0.022
 owl+0.022
 aback+0.021
Neg logit contributions
kin-0.028
inton-0.028
 ago-0.026
orses-0.025
 toes-0.025
Token The
Token ID383
Feature activation+4.96e-01

Pos logit contributions
kin+0.005
inton+0.005
 ago+0.005
orses+0.004
orse+0.004
Neg logit contributions
 Owl-0.006
 again-0.006
 tighter-0.006
Da-0.006
 owl-0.006
Token teacher
Token ID4701
Feature activation-9.69e-02

Pos logit contributions
inton+0.014
kin+0.014
 ago+0.013
orses+0.012
orse+0.012
Neg logit contributions
 Owl-0.008
 again-0.007
 owl-0.007
 berries-0.007
 tighter-0.007
Token will
Token ID481
Feature activation+1.64e+00

Pos logit contributions
 Owl+0.002
 again+0.002
 owl+0.002
 tighter+0.002
 berries+0.002
Neg logit contributions
inton-0.002
kin-0.002
 ago-0.002
orses-0.002
 toes-0.002
Tokenunch
Token ID3316
Feature activation+9.73e-01

Pos logit contributions
inton+0.058
kin+0.058
 ago+0.048
orses+0.048
orse+0.045
Neg logit contributions
 Owl-0.112
 again-0.105
 owl-0.105
 tighter-0.104
 aback-0.104
Token up
Token ID510
Feature activation-2.05e+00

Pos logit contributions
kin+0.024
inton+0.023
 toes+0.019
 ago+0.019
orses+0.019
Neg logit contributions
 Owl-0.019
lim-0.017
Da-0.017
 aback-0.017
 owl-0.016
Token.
Token ID13
Feature activation-4.50e-01

Pos logit contributions
 Owl+0.059
lim+0.056
Da+0.055
 owl+0.054
 aback+0.053
Neg logit contributions
kin-0.030
inton-0.028
 toes-0.025
 ago-0.024
 wear-0.022
Token She
Token ID1375
Feature activation+2.07e-01

Pos logit contributions
 Owl+0.011
 tighter+0.010
 owl+0.010
lim+0.010
Da+0.010
Neg logit contributions
inton-0.008
kin-0.008
 ago-0.008
 toes-0.007
orses-0.007
Token spat
Token ID15246
Feature activation+2.01e+00

Pos logit contributions
kin+0.004
inton+0.004
 hardly+0.003
orse+0.003
 toes+0.003
Neg logit contributions
 Owl-0.005
 aback-0.005
lim-0.005
Da-0.005
 owl-0.004
Token it
Token ID340
Feature activation-6.91e+00

Pos logit contributions
kin+0.045
inton+0.045
orses+0.039
 toes+0.039
 ago+0.038
Neg logit contributions
 Owl-0.041
 owl-0.038
lim-0.037
Da-0.037
 tighter-0.036
Token out
Token ID503
Feature activation-1.61e+00

Pos logit contributions
 Owl+0.154
 owl+0.142
lim+0.141
Da+0.139
iw+0.135
Neg logit contributions
kin-0.143
inton-0.137
 wear-0.128
 toes-0.120
 hardly-0.118
Token and
Token ID290
Feature activation-2.29e-01

Pos logit contributions
 Owl+0.036
lim+0.035
xc+0.033
Da+0.033
 aback+0.033
Neg logit contributions
kin-0.031
inton-0.031
 toes-0.029
 ago-0.029
 wear-0.026
Token said
Token ID531
Feature activation-4.09e-01

Pos logit contributions
 Owl+0.005
lim+0.004
Da+0.004
 aback+0.004
 owl+0.004
Neg logit contributions
kin-0.005
inton-0.005
orses-0.004
 ago-0.004
 toes-0.004
Token,
Token ID11
Feature activation-1.76e+00

Pos logit contributions
 Owl+0.010
 tighter+0.009
 aback+0.009
lim+0.009
 owl+0.009
Neg logit contributions
kin-0.008
inton-0.007
 ago-0.007
orses-0.007
 toes-0.007
Token "
Token ID366
Feature activation-3.49e+00

Pos logit contributions
 Owl+0.043
 tighter+0.040
 again+0.040
lim+0.040
 berries+0.040
Neg logit contributions
inton-0.032
kin-0.032
 ago-0.028
orses-0.027
 toes-0.026
Token face
Token ID1986
Feature activation-9.44e-01

Pos logit contributions
 Owl+0.007
 again+0.007
 tighter+0.007
 berries+0.007
 owl+0.007
Neg logit contributions
inton-0.007
kin-0.007
 ago-0.006
orses-0.006
orse-0.006
Token twist
Token ID14528
Feature activation+5.49e-01

Pos logit contributions
 Owl+0.019
 again+0.018
 tighter+0.018
 berries+0.017
 owl+0.017
Neg logit contributions
inton-0.021
kin-0.021
 ago-0.019
orses-0.019
orse-0.018
Token.
Token ID13
Feature activation-7.67e-02

Pos logit contributions
kin+0.012
inton+0.012
 ago+0.011
 toes+0.011
orses+0.010
Neg logit contributions
 Owl-0.011
lim-0.010
 owl-0.010
Da-0.010
 aback-0.010
Token She
Token ID1375
Feature activation+2.31e-01

Pos logit contributions
 Owl+0.002
 again+0.002
Da+0.001
 berries+0.001
 aback+0.001
Neg logit contributions
kin-0.002
inton-0.002
orses-0.001
 ago-0.001
orse-0.001
Token spat
Token ID15246
Feature activation+1.93e+00

Pos logit contributions
kin+0.005
inton+0.005
 ago+0.004
orses+0.004
 toes+0.004
Neg logit contributions
 Owl-0.005
 again-0.005
lim-0.005
 aback-0.005
 tighter-0.005
Token it
Token ID340
Feature activation-6.90e+00

Pos logit contributions
kin+0.046
inton+0.046
 ago+0.041
orses+0.041
 toes+0.039
Neg logit contributions
 Owl-0.037
 owl-0.033
 tighter-0.033
 again-0.033
lim-0.033
Token out
Token ID503
Feature activation-1.52e+00

Pos logit contributions
 Owl+0.157
 owl+0.147
lim+0.144
Da+0.144
 Eagle+0.134
Neg logit contributions
inton-0.138
kin-0.137
 wear-0.122
 toes-0.118
 hardly-0.117
Token and
Token ID290
Feature activation-2.03e-01

Pos logit contributions
 Owl+0.035
lim+0.034
 aback+0.032
Da+0.032
xc+0.032
Neg logit contributions
kin-0.029
inton-0.029
 toes-0.026
 ago-0.026
ilot-0.023
Token felt
Token ID2936
Feature activation+2.82e-01

Pos logit contributions
 Owl+0.004
lim+0.004
 owl+0.004
Da+0.004
 again+0.004
Neg logit contributions
kin-0.005
inton-0.005
 ago-0.004
orses-0.004
 toes-0.004
Token a
Token ID257
Feature activation+3.27e-01

Pos logit contributions
inton+0.007
kin+0.007
 ago+0.007
orses+0.007
ippers+0.006
Neg logit contributions
 Owl-0.005
 again-0.005
 tighter-0.004
 berries-0.004
 aback-0.004
Token pain
Token ID2356
Feature activation+9.64e-01

Pos logit contributions
inton+0.008
kin+0.008
 ago+0.007
orses+0.007
 toes+0.006
Neg logit contributions
 Owl-0.006
 again-0.006
 tighter-0.006
 berries-0.006
 owl-0.006
Token mouth
Token ID5422
Feature activation-7.07e-01

Pos logit contributions
 again+0.016
 tighter+0.016
 Owl+0.015
 ac+0.015
 berries+0.015
Neg logit contributions
inton-0.017
kin-0.017
 ago-0.017
orses-0.016
ippers-0.015
Token hurt
Token ID5938
Feature activation+1.94e+00

Pos logit contributions
 Owl+0.013
 again+0.013
 tighter+0.012
 berries+0.012
 owl+0.012
Neg logit contributions
inton-0.017
kin-0.016
 ago-0.015
orses-0.015
orse-0.014
Token.
Token ID13
Feature activation+3.69e-02

Pos logit contributions
kin+0.033
inton+0.033
 ago+0.029
 toes+0.027
orses+0.027
Neg logit contributions
 Owl-0.050
lim-0.047
 aback-0.046
Da-0.046
 berries-0.046
Token She
Token ID1375
Feature activation+3.12e-01

Pos logit contributions
inton+0.001
kin+0.001
orses+0.001
orse+0.001
 ago+0.001
Neg logit contributions
 Owl-0.001
 again-0.001
Da-0.001
 berries-0.001
 aback-0.001
Token spit
Token ID27591
Feature activation+4.81e-01

Pos logit contributions
inton+0.007
kin+0.007
 ago+0.006
orses+0.006
ippers+0.006
Neg logit contributions
 Owl-0.006
 again-0.006
 tighter-0.006
 berries-0.006
 owl-0.006
Token it
Token ID340
Feature activation-6.89e+00

Pos logit contributions
kin+0.011
inton+0.011
 ago+0.010
orses+0.010
 toes+0.010
Neg logit contributions
 Owl-0.010
 owl-0.009
lim-0.009
Da-0.008
 berries-0.008
Token out
Token ID503
Feature activation-1.54e+00

Pos logit contributions
 Owl+0.149
lim+0.141
 owl+0.136
Da+0.136
 av+0.131
Neg logit contributions
kin-0.147
inton-0.144
 wear-0.131
 toes-0.123
 have-0.120
Token and
Token ID290
Feature activation-3.95e-01

Pos logit contributions
 Owl+0.034
lim+0.031
Da+0.030
 aback+0.030
 owl+0.030
Neg logit contributions
kin-0.033
inton-0.032
 ago-0.029
 toes-0.028
orses-0.028
Token yelled
Token ID22760
Feature activation+1.20e+00

Pos logit contributions
 Owl+0.008
 again+0.007
 tighter+0.007
 berries+0.007
 owl+0.007
Neg logit contributions
inton-0.009
kin-0.009
 ago-0.008
orses-0.008
orse-0.008
Token,
Token ID11
Feature activation-1.24e+00

Pos logit contributions
inton+0.028
kin+0.028
 ago+0.025
orses+0.025
orse+0.024
Neg logit contributions
 Owl-0.023
 again-0.022
 tighter-0.021
 berries-0.021
 aback-0.021
Token "
Token ID366
Feature activation-3.44e+00

Pos logit contributions
 Owl+0.025
 again+0.024
 owl+0.023
 berries+0.022
 tighter+0.022
Neg logit contributions
kin-0.029
inton-0.029
orses-0.025
ippers-0.025
asts-0.025
Token like
Token ID588
Feature activation+2.89e+00

Pos logit contributions
 Owl+0.019
 again+0.018
 owl+0.018
 tighter+0.018
 berries+0.018
Neg logit contributions
kin-0.014
inton-0.014
 ago-0.013
orses-0.012
 toes-0.012
Token it
Token ID340
Feature activation-1.90e+00

Pos logit contributions
kin+0.075
inton+0.075
 ago+0.067
orses+0.067
ippers+0.066
Neg logit contributions
 Owl-0.049
 again-0.046
 berries-0.044
 owl-0.044
 tighter-0.043
Token.
Token ID13
Feature activation+3.32e-01

Pos logit contributions
 again+0.039
 Owl+0.038
 ac+0.035
 tighter+0.035
 berries+0.034
Neg logit contributions
kin-0.044
inton-0.043
 ago-0.040
orses-0.040
ippers-0.039
Token He
Token ID679
Feature activation+8.75e-02

Pos logit contributions
inton+0.008
kin+0.007
orses+0.007
 ago+0.007
 toes+0.007
Neg logit contributions
 Owl-0.007
 again-0.006
 aback-0.006
Da-0.006
 Tweet-0.006
Token spat
Token ID15246
Feature activation+2.05e+00

Pos logit contributions
kin+0.002
inton+0.002
orses+0.002
 ago+0.002
orse+0.002
Neg logit contributions
 Owl-0.002
 aback-0.002
Da-0.002
lim-0.002
 again-0.002
Token it
Token ID340
Feature activation-6.88e+00

Pos logit contributions
inton+0.053
kin+0.053
 ago+0.048
orses+0.047
 toes+0.046
Neg logit contributions
 Owl-0.034
 again-0.032
 tighter-0.031
 berries-0.031
 owl-0.031
Token out
Token ID503
Feature activation-1.52e+00

Pos logit contributions
 Owl+0.138
Da+0.136
 owl+0.125
lim+0.125
iw+0.121
Neg logit contributions
inton-0.151
kin-0.147
 wear-0.133
 toes-0.127
 have-0.126
Token and
Token ID290
Feature activation-3.22e-01

Pos logit contributions
 Owl+0.035
lim+0.033
Da+0.032
 aback+0.032
 owl+0.032
Neg logit contributions
kin-0.030
inton-0.030
 ago-0.027
 toes-0.026
orses-0.025
Token threw
Token ID9617
Feature activation-9.62e-01

Pos logit contributions
 Owl+0.007
lim+0.006
Da+0.006
 aback+0.006
 owl+0.006
Neg logit contributions
kin-0.007
inton-0.007
orses-0.006
 toes-0.006
 ago-0.006
Token it
Token ID340
Feature activation-4.73e+00

Pos logit contributions
 Owl+0.026
lim+0.024
Da+0.024
 owl+0.023
 god+0.022
Neg logit contributions
kin-0.015
 toes-0.014
inton-0.014
 ago-0.013
 have-0.011
Token away
Token ID1497
Feature activation-6.77e-01

Pos logit contributions
 Owl+0.098
Da+0.090
 owl+0.087
lim+0.085
 berries+0.084
Neg logit contributions
kin-0.102
inton-0.102
 ago-0.094
 toes-0.094
 wear-0.085
Token
Token ID198
Feature activation+1.20e+00

Pos logit contributions
kin+0.031
inton+0.030
 ago+0.027
orses+0.026
ilot+0.025
Neg logit contributions
 Owl-0.032
 again-0.031
Da-0.029
 berries-0.029
 aback-0.029
Token
Token ID198
Feature activation-5.50e-01

Pos logit contributions
kin+0.029
inton+0.029
 ago+0.028
 toes+0.027
ippers+0.025
Neg logit contributions
Da-0.023
 again-0.022
 Owl-0.022
 aback-0.020
 Tweet-0.020
TokenHe
Token ID1544
Feature activation-1.10e+00

Pos logit contributions
Da+0.009
 Owl+0.009
 again+0.009
lim+0.009
 tighter+0.009
Neg logit contributions
kin-0.014
inton-0.014
 ago-0.013
 toes-0.013
orses-0.012
Token gob
Token ID48484
Feature activation+2.13e+00

Pos logit contributions
 Owl+0.028
 aback+0.027
minded+0.025
 owl+0.025
lim+0.024
Neg logit contributions
kin-0.020
inton-0.018
orse-0.018
 hardly-0.018
orses-0.017
Tokenbled
Token ID9342
Feature activation-3.92e+00

Pos logit contributions
kin+0.031
inton+0.029
ippers+0.022
 toes+0.022
orses+0.022
Neg logit contributions
 Owl-0.064
 owl-0.060
Da-0.059
 aback-0.058
 tighter-0.058
Token it
Token ID340
Feature activation-6.87e+00

Pos logit contributions
 Owl+0.087
 owl+0.080
Da+0.078
lim+0.075
Bet+0.070
Neg logit contributions
inton-0.079
kin-0.073
 toes-0.070
asts-0.066
orses-0.065
Token up
Token ID510
Feature activation-1.65e+00

Pos logit contributions
 Owl+0.141
 owl+0.129
Da+0.122
lim+0.113
iw+0.110
Neg logit contributions
kin-0.167
inton-0.166
 toes-0.153
 hardly-0.151
 wear-0.143
Token in
Token ID287
Feature activation+8.33e-01

Pos logit contributions
 Owl+0.037
lim+0.034
 owl+0.034
Da+0.034
 berries+0.033
Neg logit contributions
kin-0.035
inton-0.033
 toes-0.030
 ago-0.028
 hardly-0.027
Token one
Token ID530
Feature activation-1.77e+00

Pos logit contributions
kin+0.019
inton+0.019
 ago+0.018
orses+0.017
 toes+0.017
Neg logit contributions
 Owl-0.016
lim-0.015
 again-0.015
 owl-0.015
 tighter-0.014
Token bite
Token ID13197
Feature activation-2.20e-01

Pos logit contributions
 Owl+0.040
Da+0.036
lim+0.036
 aback+0.035
 owl+0.035
Neg logit contributions
kin-0.038
inton-0.034
 toes-0.032
 ago-0.031
orses-0.031
Token and
Token ID290
Feature activation-4.57e-01

Pos logit contributions
 Owl+0.004
 owl+0.004
lim+0.004
Da+0.004
 tighter+0.004
Neg logit contributions
kin-0.005
inton-0.005
 ago-0.004
orses-0.004
 toes-0.004
Token like
Token ID588
Feature activation+3.26e+00

Pos logit contributions
 Owl+0.013
 again+0.012
 owl+0.012
 berries+0.012
 ac+0.012
Neg logit contributions
inton-0.010
kin-0.010
 ago-0.009
orses-0.009
ippers-0.009
Token it
Token ID340
Feature activation-1.99e+00

Pos logit contributions
inton+0.081
kin+0.081
 ago+0.074
orses+0.072
 toes+0.070
Neg logit contributions
 Owl-0.058
 tighter-0.052
 again-0.052
lim-0.051
 aback-0.051
Token.
Token ID13
Feature activation+8.73e-02

Pos logit contributions
 Owl+0.044
 again+0.042
 tighter+0.040
 berries+0.040
 owl+0.040
Neg logit contributions
inton-0.041
kin-0.041
 ago-0.037
orses-0.036
orse-0.035
Token She
Token ID1375
Feature activation+3.55e-01

Pos logit contributions
inton+0.002
kin+0.002
 ago+0.002
orses+0.002
 toes+0.002
Neg logit contributions
 Owl-0.002
 tighter-0.001
 again-0.001
 owl-0.001
lim-0.001
Token spit
Token ID27591
Feature activation+5.15e-01

Pos logit contributions
kin+0.007
inton+0.007
orses+0.006
 ago+0.006
orse+0.006
Neg logit contributions
 Owl-0.008
 aback-0.008
lim-0.008
 again-0.008
 tighter-0.008
Token it
Token ID340
Feature activation-6.86e+00

Pos logit contributions
kin+0.012
inton+0.012
 toes+0.011
 wear+0.011
 hardly+0.010
Neg logit contributions
 Owl-0.010
lim-0.009
 owl-0.009
Da-0.009
 berries-0.008
Token out
Token ID503
Feature activation-1.54e+00

Pos logit contributions
 Owl+0.162
lim+0.156
 owl+0.149
Da+0.146
 av+0.144
Neg logit contributions
kin-0.131
inton-0.124
 wear-0.115
 have-0.107
 toes-0.107
Token and
Token ID290
Feature activation-3.33e-01

Pos logit contributions
 Owl+0.037
lim+0.036
xc+0.033
Da+0.033
 aback+0.033
Neg logit contributions
kin-0.029
inton-0.027
 toes-0.026
 ago-0.026
 wear-0.023
Token said
Token ID531
Feature activation-4.59e-02

Pos logit contributions
 Owl+0.007
lim+0.007
 aback+0.006
Da+0.006
 owl+0.006
Neg logit contributions
kin-0.007
inton-0.007
orses-0.006
 ago-0.006
 toes-0.006
Token,
Token ID11
Feature activation-1.78e+00

Pos logit contributions
 Owl+0.001
 tighter+0.001
lim+0.001
 aback+0.001
 owl+0.001
Neg logit contributions
kin-0.001
inton-0.001
 ago-0.001
orses-0.001
 toes-0.001
Token "
Token ID366
Feature activation-3.45e+00

Pos logit contributions
 Owl+0.038
 again+0.036
 owl+0.035
 berries+0.034
 tighter+0.034
Neg logit contributions
kin-0.039
inton-0.039
orses-0.034
 ago-0.034
ippers-0.033
Token them
Token ID606
Feature activation-1.76e+00

Pos logit contributions
kin+0.051
inton+0.051
 ago+0.048
orses+0.046
ippers+0.045
Neg logit contributions
 Owl-0.031
 again-0.031
 berries-0.030
 tighter-0.030
 Tweet-0.028
Token cough
Token ID22094
Feature activation+1.01e+00

Pos logit contributions
 again+0.032
 tighter+0.031
 berries+0.030
 ac+0.030
 Owl+0.029
Neg logit contributions
kin-0.042
 ago-0.042
inton-0.041
orses-0.038
ippers-0.037
Token.
Token ID13
Feature activation-2.04e-01

Pos logit contributions
inton+0.020
kin+0.020
 ago+0.017
orses+0.017
 toes+0.016
Neg logit contributions
 Owl-0.023
lim-0.021
 owl-0.021
 again-0.021
Da-0.021
Token They
Token ID1119
Feature activation+2.46e+00

Pos logit contributions
 Owl+0.004
Da+0.004
 Tweet+0.004
 again+0.004
 berries+0.004
Neg logit contributions
kin-0.004
inton-0.004
inations-0.004
orses-0.004
 ago-0.003
Token spit
Token ID27591
Feature activation+5.10e-01

Pos logit contributions
kin+0.050
inton+0.049
 ago+0.045
orses+0.044
ippers+0.043
Neg logit contributions
 Owl-0.055
 again-0.054
 berries-0.054
 tighter-0.053
 ac-0.053
Token it
Token ID340
Feature activation-6.86e+00

Pos logit contributions
kin+0.012
inton+0.012
 ago+0.011
orses+0.011
 toes+0.011
Neg logit contributions
 Owl-0.010
 owl-0.009
lim-0.009
Da-0.008
 tighter-0.008
Token out
Token ID503
Feature activation-1.82e+00

Pos logit contributions
 Owl+0.157
lim+0.149
 owl+0.148
Da+0.138
xc+0.138
Neg logit contributions
inton-0.135
kin-0.134
 wear-0.119
 have-0.114
 hardly-0.111
Token and
Token ID290
Feature activation-1.75e-01

Pos logit contributions
 Owl+0.034
lim+0.031
Da+0.031
 owl+0.031
 aback+0.030
Neg logit contributions
kin-0.044
inton-0.044
 ago-0.040
 toes-0.039
orses-0.038
Token make
Token ID787
Feature activation+1.04e+00

Pos logit contributions
 again+0.004
 Owl+0.004
 tighter+0.004
 berries+0.004
 ac+0.003
Neg logit contributions
inton-0.004
kin-0.004
 ago-0.003
orses-0.003
ippers-0.003
Token faces
Token ID6698
Feature activation+1.41e+00

Pos logit contributions
kin+0.027
inton+0.027
 ago+0.025
orses+0.024
orse+0.023
Neg logit contributions
 Owl-0.017
 again-0.016
 tighter-0.016
 berries-0.016
 owl-0.015
Token.
Token ID13
Feature activation+6.19e-02

Pos logit contributions
kin+0.033
inton+0.032
 ago+0.031
orses+0.030
ippers+0.029
Neg logit contributions
 again-0.029
 Owl-0.026
 tighter-0.025
 berries-0.025
 ac-0.025
Token tasted
Token ID29187
Feature activation-2.36e-01

Pos logit contributions
 Owl+0.007
lim+0.006
 aback+0.006
 tighter+0.006
 owl+0.006
Neg logit contributions
kin-0.007
inton-0.006
 hardly-0.006
orses-0.006
orse-0.005
Token awful
Token ID12659
Feature activation-1.25e+00

Pos logit contributions
 Owl+0.005
 owl+0.004
lim+0.004
Da+0.004
 aback+0.004
Neg logit contributions
inton-0.006
kin-0.006
 toes-0.005
orses-0.005
 ago-0.005
Token.
Token ID13
Feature activation+1.13e-02

Pos logit contributions
 Owl+0.029
Da+0.026
 tighter+0.026
lim+0.026
 aback+0.026
Neg logit contributions
kin-0.025
inton-0.025
 ago-0.022
orses-0.021
 toes-0.021
Token He
Token ID679
Feature activation-2.00e-01

Pos logit contributions
kin+0.000
inton+0.000
orses+0.000
 toes+0.000
 ago+0.000
Neg logit contributions
 Owl-0.000
 again-0.000
 aback-0.000
 Tweet-0.000
Da-0.000
Token spit
Token ID27591
Feature activation+3.13e-01

Pos logit contributions
 Owl+0.005
 aback+0.004
lim+0.004
 tighter+0.004
 again+0.004
Neg logit contributions
kin-0.004
inton-0.004
orses-0.004
 ago-0.003
orse-0.003
Token it
Token ID340
Feature activation-6.86e+00

Pos logit contributions
kin+0.008
inton+0.007
 ago+0.007
orses+0.007
 toes+0.006
Neg logit contributions
 Owl-0.006
 owl-0.005
lim-0.005
 tighter-0.005
Da-0.005
Token out
Token ID503
Feature activation-1.70e+00

Pos logit contributions
 Owl+0.156
lim+0.149
 owl+0.145
Da+0.142
minded+0.136
Neg logit contributions
kin-0.138
inton-0.132
 wear-0.116
 toes-0.112
 have-0.109
Token and
Token ID290
Feature activation-4.73e-01

Pos logit contributions
 Owl+0.033
lim+0.031
Da+0.030
 owl+0.030
 aback+0.029
Neg logit contributions
kin-0.040
inton-0.039
 ago-0.036
 toes-0.035
orses-0.034
Token said
Token ID531
Feature activation-2.20e-01

Pos logit contributions
 Owl+0.010
lim+0.009
Da+0.009
 owl+0.009
 aback+0.009
Neg logit contributions
kin-0.010
inton-0.010
orses-0.009
 ago-0.009
 toes-0.009
Token,
Token ID11
Feature activation-1.65e+00

Pos logit contributions
 Owl+0.005
 tighter+0.005
lim+0.005
 owl+0.005
 aback+0.004
Neg logit contributions
kin-0.005
inton-0.004
 ago-0.004
orses-0.004
orse-0.004
Token "
Token ID366
Feature activation-3.28e+00

Pos logit contributions
 Owl+0.036
 again+0.034
 owl+0.033
 berries+0.032
 tighter+0.032
Neg logit contributions
kin-0.035
inton-0.035
orses-0.030
 ago-0.030
ippers-0.029
Token her
Token ID607
Feature activation+2.49e+00

Pos logit contributions
inton+0.063
kin+0.063
 ago+0.058
orses+0.057
 toes+0.055
Neg logit contributions
 Owl-0.037
 again-0.033
 tighter-0.033
 owl-0.032
 berries-0.032
Token mouth
Token ID5422
Feature activation+1.05e+00

Pos logit contributions
inton+0.051
kin+0.050
 ago+0.045
orses+0.045
orse+0.042
Neg logit contributions
 Owl-0.055
 again-0.053
 owl-0.052
 tighter-0.052
 ac-0.051
Token.
Token ID13
Feature activation-3.61e-01

Pos logit contributions
inton+0.021
kin+0.021
 ago+0.018
orses+0.018
orse+0.017
Neg logit contributions
 Owl-0.024
 again-0.023
 tighter-0.022
 berries-0.022
 owl-0.022
Token She
Token ID1375
Feature activation+5.33e-01

Pos logit contributions
 Owl+0.009
 tighter+0.008
 owl+0.008
lim+0.008
 aback+0.008
Neg logit contributions
inton-0.007
kin-0.006
 ago-0.006
 toes-0.006
ippers-0.005
Token spit
Token ID27591
Feature activation+5.18e-01

Pos logit contributions
kin+0.010
inton+0.010
 toes+0.009
 hardly+0.009
orse+0.009
Neg logit contributions
 Owl-0.013
 aback-0.012
lim-0.012
Da-0.012
 owl-0.011
Token it
Token ID340
Feature activation-6.86e+00

Pos logit contributions
kin+0.014
inton+0.013
 ago+0.012
orses+0.012
 toes+0.012
Neg logit contributions
 Owl-0.009
lim-0.008
 owl-0.008
Da-0.007
 tighter-0.007
Token out
Token ID503
Feature activation-1.68e+00

Pos logit contributions
 Owl+0.167
iw+0.157
 owl+0.154
Da+0.153
lim+0.153
Neg logit contributions
kin-0.119
inton-0.119
 wear-0.112
 hardly-0.107
 toes-0.103
Token and
Token ID290
Feature activation-5.40e-01

Pos logit contributions
 Owl+0.037
lim+0.036
Da+0.033
 owl+0.033
minded+0.033
Neg logit contributions
kin-0.034
inton-0.032
 toes-0.031
 ago-0.030
 hardly-0.028
Token saw
Token ID2497
Feature activation-1.35e+00

Pos logit contributions
 Owl+0.011
lim+0.011
Da+0.010
 aback+0.010
 owl+0.010
Neg logit contributions
kin-0.012
inton-0.012
orses-0.010
 ago-0.010
 toes-0.010
Token that
Token ID326
Feature activation-1.14e+00

Pos logit contributions
 Owl+0.025
lim+0.024
Da+0.023
 tighter+0.023
 aback+0.023
Neg logit contributions
kin-0.031
inton-0.031
 ago-0.028
 toes-0.028
orses-0.027
Token it
Token ID340
Feature activation-1.28e+00

Pos logit contributions
 Owl+0.025
lim+0.024
 aback+0.024
Da+0.023
 tighter+0.022
Neg logit contributions
kin-0.024
inton-0.023
 ago-0.020
 toes-0.020
 wear-0.020
Token y
Token ID331
Feature activation+3.60e+00

Pos logit contributions
kin+0.006
inton+0.006
 ago+0.006
orses+0.006
 toes+0.006
Neg logit contributions
 Owl-0.004
lim-0.004
 again-0.004
 aback-0.004
 owl-0.004
Tokenucky
Token ID5309
Feature activation+8.32e-01

Pos logit contributions
inton+0.083
kin+0.082
 ago+0.074
orses+0.072
 toes+0.072
Neg logit contributions
 Owl-0.072
 owl-0.066
 tighter-0.065
 again-0.065
Da-0.064
Token.
Token ID13
Feature activation+2.05e-01

Pos logit contributions
inton+0.018
kin+0.018
orses+0.016
 ago+0.016
ippers+0.016
Neg logit contributions
 again-0.017
 Owl-0.016
 berries-0.016
 tighter-0.016
 ac-0.015
Token She
Token ID1375
Feature activation+1.87e-01

Pos logit contributions
kin+0.005
inton+0.005
orses+0.005
 ago+0.005
orse+0.004
Neg logit contributions
 Owl-0.004
 again-0.003
Da-0.003
 owl-0.003
 aback-0.003
Token spit
Token ID27591
Feature activation+4.97e-01

Pos logit contributions
inton+0.004
kin+0.004
 ago+0.003
orses+0.003
 toes+0.003
Neg logit contributions
 Owl-0.004
 again-0.004
 tighter-0.004
 berries-0.004
 owl-0.004
Token it
Token ID340
Feature activation-6.84e+00

Pos logit contributions
kin+0.011
inton+0.011
 ago+0.010
orses+0.010
 toes+0.009
Neg logit contributions
 Owl-0.010
 owl-0.009
lim-0.009
Da-0.009
 berries-0.009
Token out
Token ID503
Feature activation-1.53e+00

Pos logit contributions
 Owl+0.158
 owl+0.146
lim+0.145
Da+0.143
xc+0.136
Neg logit contributions
kin-0.142
inton-0.133
 wear-0.121
 toes-0.119
 hardly-0.113
Token and
Token ID290
Feature activation-4.10e-01

Pos logit contributions
 Owl+0.034
lim+0.031
Da+0.031
 aback+0.030
 owl+0.030
Neg logit contributions
kin-0.032
inton-0.031
 ago-0.029
 toes-0.028
orses-0.027
Token rubbed
Token ID31862
Feature activation+2.33e+00

Pos logit contributions
 Owl+0.008
 again+0.007
lim+0.007
 tighter+0.007
 owl+0.007
Neg logit contributions
kin-0.009
inton-0.009
 ago-0.008
orses-0.008
 toes-0.008
Token her
Token ID607
Feature activation-1.28e+00

Pos logit contributions
kin+0.050
inton+0.048
 toes+0.042
 ago+0.042
orses+0.041
Neg logit contributions
 Owl-0.051
lim-0.048
 owl-0.047
Da-0.046
 berries-0.045
Token tongue
Token ID11880
Feature activation-1.80e+00

Pos logit contributions
 Owl+0.033
Da+0.031
lim+0.031
 aback+0.030
 owl+0.030
Neg logit contributions
kin-0.023
inton-0.022
 toes-0.019
orses-0.018
 ago-0.018
Token mouth
Token ID5422
Feature activation-1.12e+00

Pos logit contributions
inton+0.025
kin+0.024
orses+0.022
 ago+0.021
orse+0.020
Neg logit contributions
 Owl-0.031
 again-0.029
 tighter-0.028
 owl-0.028
 berries-0.028
Token,
Token ID11
Feature activation+9.34e-01

Pos logit contributions
 Owl+0.024
lim+0.022
Da+0.022
 owl+0.022
 berries+0.022
Neg logit contributions
kin-0.024
inton-0.023
 ago-0.021
 toes-0.020
orses-0.020
Token but
Token ID475
Feature activation+1.81e+00

Pos logit contributions
inton+0.024
kin+0.024
 ago+0.021
orses+0.021
orse+0.020
Neg logit contributions
 Owl-0.016
 again-0.015
 tighter-0.015
 owl-0.015
 berries-0.015
Token he
Token ID339
Feature activation+1.27e+00

Pos logit contributions
inton+0.042
kin+0.040
 ago+0.037
orses+0.037
ilot+0.036
Neg logit contributions
 Owl-0.036
 again-0.035
 tighter-0.034
 berries-0.033
 owl-0.032
Token spat
Token ID15246
Feature activation+1.90e+00

Pos logit contributions
inton+0.026
kin+0.025
 ago+0.023
orses+0.023
ippers+0.022
Neg logit contributions
 Owl-0.028
 again-0.026
 tighter-0.026
 berries-0.026
 owl-0.025
Token it
Token ID340
Feature activation-6.83e+00

Pos logit contributions
inton+0.047
kin+0.047
 ago+0.043
orses+0.042
orse+0.041
Neg logit contributions
 Owl-0.033
 again-0.032
 tighter-0.031
 berries-0.030
 aback-0.030
Token out
Token ID503
Feature activation-1.35e+00

Pos logit contributions
 Owl+0.130
Da+0.124
 owl+0.119
lim+0.117
 berries+0.108
Neg logit contributions
inton-0.164
kin-0.164
 wear-0.142
 toes-0.139
 hardly-0.136
Token.
Token ID13
Feature activation-1.91e-01

Pos logit contributions
 Owl+0.033
lim+0.031
Da+0.031
 owl+0.031
 berries+0.030
Neg logit contributions
kin-0.024
inton-0.024
 ago-0.021
orses-0.021
 toes-0.020
Token He
Token ID679
Feature activation-3.49e-01

Pos logit contributions
 Owl+0.004
 owl+0.004
 tighter+0.004
lim+0.004
 berries+0.004
Neg logit contributions
inton-0.004
kin-0.004
 ago-0.003
orses-0.003
 toes-0.003
Token tasted
Token ID29187
Feature activation-6.44e-01

Pos logit contributions
 Owl+0.008
 aback+0.007
lim+0.007
 owl+0.007
 again+0.007
Neg logit contributions
kin-0.007
inton-0.007
orses-0.006
 ago-0.006
orse-0.006
Token something
Token ID1223
Feature activation-1.20e+00

Pos logit contributions
 Owl+0.011
 again+0.011
 tighter+0.010
 berries+0.010
 owl+0.010
Neg logit contributions
inton-0.016
kin-0.016
 ago-0.015
orses-0.014
orse-0.014
Token like
Token ID588
Feature activation+2.12e+00

Pos logit contributions
 Owl+0.022
 owl+0.020
Da+0.020
lim+0.020
 tighter+0.019
Neg logit contributions
inton-0.027
kin-0.027
 ago-0.024
 toes-0.024
orses-0.023
Token food
Token ID2057
Feature activation+2.40e+00

Pos logit contributions
inton+0.055
kin+0.055
orses+0.049
 ago+0.048
ippers+0.048
Neg logit contributions
 Owl-0.037
 again-0.035
 berries-0.033
 owl-0.033
 ac-0.032
Token.
Token ID13
Feature activation+5.09e-01

Pos logit contributions
inton+0.059
kin+0.058
orses+0.055
ippers+0.054
ilot+0.053
Neg logit contributions
 again-0.045
 Owl-0.042
 berries-0.041
 ac-0.040
 owl-0.038
Token He
Token ID679
Feature activation-1.27e-01

Pos logit contributions
inton+0.010
kin+0.010
 ago+0.009
orses+0.008
 toes+0.008
Neg logit contributions
 Owl-0.012
 again-0.011
 tighter-0.011
Da-0.011
 owl-0.011
Token spat
Token ID15246
Feature activation+1.88e+00

Pos logit contributions
 Owl+0.003
 aback+0.003
lim+0.003
Da+0.003
 owl+0.003
Neg logit contributions
kin-0.002
inton-0.002
 hardly-0.002
orse-0.002
orses-0.002
Token it
Token ID340
Feature activation-6.80e+00

Pos logit contributions
kin+0.048
inton+0.047
 ago+0.042
orses+0.042
 toes+0.042
Neg logit contributions
 Owl-0.033
 owl-0.030
lim-0.029
Da-0.029
 tighter-0.029
Token out
Token ID503
Feature activation-1.62e+00

Pos logit contributions
 Owl+0.138
Da+0.127
 owl+0.126
lim+0.124
iw+0.121
Neg logit contributions
inton-0.156
kin-0.154
 toes-0.134
 wear-0.133
 hardly-0.132
Token and
Token ID290
Feature activation-3.39e-01

Pos logit contributions
 Owl+0.033
lim+0.031
Da+0.030
 owl+0.030
 berries+0.029
Neg logit contributions
kin-0.035
inton-0.035
 ago-0.032
 toes-0.032
 hardly-0.030
Token the
Token ID262
Feature activation-2.02e-01

Pos logit contributions
 Owl+0.007
lim+0.007
Da+0.007
 aback+0.006
 owl+0.006
Neg logit contributions
kin-0.007
inton-0.007
orses-0.006
 toes-0.006
 ago-0.006
Token paper
Token ID3348
Feature activation+1.13e+00

Pos logit contributions
 Owl+0.004
 again+0.003
 tighter+0.003
 owl+0.003
 berries+0.003
Neg logit contributions
inton-0.005
kin-0.005
 ago-0.005
orses-0.005
orse-0.004
Token floated
Token ID29742
Feature activation-2.53e-01

Pos logit contributions
inton+0.028
kin+0.027
 ago+0.025
orses+0.025
ippers+0.024
Neg logit contributions
 Owl-0.020
 again-0.020
 berries-0.019
 owl-0.018
 tighter-0.018
Token very
Token ID845
Feature activation-2.10e-01

Pos logit contributions
inton+0.034
kin+0.034
 ago+0.031
orses+0.031
 toes+0.030
Neg logit contributions
 Owl-0.022
 again-0.020
 tighter-0.020
 owl-0.020
 berries-0.020
Token sour
Token ID11348
Feature activation+3.20e-01

Pos logit contributions
 Owl+0.003
 again+0.003
 tighter+0.003
 owl+0.003
 berries+0.003
Neg logit contributions
inton-0.006
kin-0.006
 ago-0.005
orses-0.005
 toes-0.005
Token.
Token ID13
Feature activation+2.26e-01

Pos logit contributions
kin+0.006
inton+0.006
 ago+0.005
orses+0.005
 toes+0.005
Neg logit contributions
 Owl-0.008
Da-0.007
lim-0.007
 aback-0.007
 tighter-0.007
Token They
Token ID1119
Feature activation+2.22e+00

Pos logit contributions
inton+0.005
kin+0.005
 ago+0.004
orses+0.004
 toes+0.004
Neg logit contributions
 Owl-0.005
 again-0.005
 berries-0.005
Da-0.005
 Tweet-0.005
Token spit
Token ID27591
Feature activation+4.74e-01

Pos logit contributions
kin+0.047
inton+0.046
 ago+0.043
ippers+0.042
orses+0.041
Neg logit contributions
 again-0.047
 berries-0.046
 ac-0.046
 Owl-0.045
 tighter-0.045
Token it
Token ID340
Feature activation-6.80e+00

Pos logit contributions
kin+0.011
inton+0.011
orses+0.009
 toes+0.009
 ago+0.009
Neg logit contributions
 Owl-0.010
 owl-0.009
lim-0.009
Da-0.009
 tighter-0.009
Token out
Token ID503
Feature activation-1.32e+00

Pos logit contributions
 Owl+0.163
 owl+0.153
lim+0.150
Da+0.149
xc+0.140
Neg logit contributions
kin-0.131
inton-0.130
 wear-0.112
 have-0.111
 toes-0.107
Token.
Token ID13
Feature activation+2.74e-01

Pos logit contributions
 Owl+0.032
lim+0.030
Da+0.029
 owl+0.029
 aback+0.029
Neg logit contributions
kin-0.025
inton-0.025
 ago-0.022
 toes-0.022
orses-0.021
Token
Token ID198
Feature activation-4.99e-01

Pos logit contributions
kin+0.006
inton+0.006
 ago+0.005
orses+0.005
 toes+0.005
Neg logit contributions
 Owl-0.006
 again-0.006
 berries-0.006
 Tweet-0.005
Da-0.005
Token
Token ID198
Feature activation-2.26e+00

Pos logit contributions
 Owl+0.009
 again+0.009
 Tweet+0.008
Da+0.008
 tighter+0.007
Neg logit contributions
kin-0.013
inton-0.013
 toes-0.012
 ago-0.011
ippers-0.011
TokenMom
Token ID29252
Feature activation-2.00e+00

Pos logit contributions
Da+0.030
 tighter+0.030
 again+0.030
 Owl+0.029
 closer+0.027
Neg logit contributions
inton-0.061
kin-0.060
 ago-0.058
 toes-0.055
inations-0.055
Token get
Token ID651
Feature activation-4.58e+00

Pos logit contributions
inton+0.027
kin+0.027
 ago+0.024
orses+0.023
 toes+0.022
Neg logit contributions
 Owl-0.030
 again-0.028
 tighter-0.028
 owl-0.028
 berries-0.028
Token it
Token ID340
Feature activation-5.29e+00

Pos logit contributions
 Owl+0.099
lim+0.093
Da+0.093
 owl+0.090
xc+0.083
Neg logit contributions
inton-0.091
kin-0.090
 wear-0.083
 toes-0.082
orses-0.080
Token.
Token ID13
Feature activation+2.02e-01

Pos logit contributions
 Owl+0.133
lim+0.125
Da+0.124
 owl+0.121
 berries+0.118
Neg logit contributions
kin-0.093
inton-0.091
 ago-0.079
 toes-0.076
 wear-0.076
Token She
Token ID1375
Feature activation+5.02e-01

Pos logit contributions
inton+0.004
 ago+0.004
kin+0.004
 toes+0.004
 hardly+0.003
Neg logit contributions
 Owl-0.004
 tighter-0.004
 again-0.004
Da-0.004
 owl-0.004
Token snatched
Token ID43276
Feature activation-2.06e+00

Pos logit contributions
inton+0.009
kin+0.008
 hardly+0.008
 toes+0.007
 wear+0.007
Neg logit contributions
 Owl-0.012
 aback-0.011
Da-0.011
lim-0.011
 tighter-0.011
Token it
Token ID340
Feature activation-6.78e+00

Pos logit contributions
 Owl+0.062
Da+0.057
 owl+0.056
lim+0.055
 god+0.052
Neg logit contributions
 toes-0.025
 wear-0.024
inton-0.023
kin-0.020
 have-0.020
Token from
Token ID422
Feature activation-7.78e-01

Pos logit contributions
 Owl+0.140
Da+0.132
 av+0.129
lim+0.125
 owl+0.122
Neg logit contributions
inton-0.135
kin-0.131
 wear-0.124
 toes-0.119
 hardly-0.112
Token underneath
Token ID14638
Feature activation+1.58e+00

Pos logit contributions
 Owl+0.017
lim+0.017
 aback+0.017
 tighter+0.016
Da+0.016
Neg logit contributions
inton-0.014
 toes-0.014
kin-0.014
 wear-0.013
 ago-0.013
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
kin+0.037
inton+0.037
 ago+0.034
orses+0.033
 toes+0.032
Neg logit contributions
 Owl-0.030
lim-0.027
 tighter-0.027
 owl-0.026
 again-0.026
Token bench
Token ID7624
Feature activation-5.24e-01

Pos logit contributions
 Owl+0.022
Da+0.020
lim+0.020
 aback+0.020
 tighter+0.019
Neg logit contributions
inton-0.028
kin-0.028
 ago-0.026
 toes-0.025
orses-0.025
Token and
Token ID290
Feature activation+9.76e-02

Pos logit contributions
 Owl+0.010
lim+0.009
Da+0.009
 berries+0.009
 tighter+0.009
Neg logit contributions
inton-0.013
kin-0.013
 ago-0.012
orses-0.011
 toes-0.011
Token hurt
Token ID5938
Feature activation+1.86e+00

Pos logit contributions
inton+0.072
 ago+0.069
kin+0.069
orses+0.066
ippers+0.063
Neg logit contributions
 again-0.058
 ac-0.056
 Owl-0.054
 tighter-0.053
 berries-0.052
Token.
Token ID13
Feature activation+4.24e-01

Pos logit contributions
inton+0.042
kin+0.042
 ago+0.038
orses+0.037
orse+0.036
Neg logit contributions
 Owl-0.037
 again-0.035
 tighter-0.034
 owl-0.034
 berries-0.034
Token So
Token ID1406
Feature activation+1.63e-01

Pos logit contributions
kin+0.009
inton+0.008
 ago+0.007
orses+0.007
ippers+0.007
Neg logit contributions
 Owl-0.010
 again-0.009
 aback-0.009
Da-0.009
 berries-0.009
Token he
Token ID339
Feature activation-1.37e+00

Pos logit contributions
inton+0.004
kin+0.004
 ago+0.003
orses+0.003
 toes+0.003
Neg logit contributions
 Owl-0.003
 again-0.003
 tighter-0.003
 owl-0.003
 berries-0.003
Token spit
Token ID27591
Feature activation+2.08e-01

Pos logit contributions
 Owl+0.032
 aback+0.030
Da+0.029
lim+0.029
 owl+0.028
Neg logit contributions
kin-0.027
inton-0.026
 hardly-0.023
orse-0.023
orses-0.022
Token it
Token ID340
Feature activation-6.78e+00

Pos logit contributions
kin+0.005
inton+0.005
 ago+0.005
orses+0.005
 toes+0.005
Neg logit contributions
 Owl-0.003
 again-0.003
 tighter-0.003
 owl-0.003
lim-0.003
Token out
Token ID503
Feature activation-2.04e+00

Pos logit contributions
 Owl+0.145
lim+0.138
Da+0.138
iw+0.135
 owl+0.135
Neg logit contributions
kin-0.134
inton-0.134
 wear-0.121
 toes-0.116
 hardly-0.116
Token and
Token ID290
Feature activation-7.59e-01

Pos logit contributions
 Owl+0.039
lim+0.035
Da+0.035
 owl+0.035
 berries+0.034
Neg logit contributions
kin-0.048
inton-0.048
 ago-0.044
orses-0.042
 toes-0.042
Token sw
Token ID1509
Feature activation+7.77e-01

Pos logit contributions
 Owl+0.015
lim+0.014
Da+0.014
 owl+0.014
 tighter+0.014
Neg logit contributions
kin-0.017
inton-0.017
 ago-0.015
orses-0.015
 toes-0.015
Tokenam
Token ID321
Feature activation-7.80e-01

Pos logit contributions
inton+0.017
kin+0.017
orses+0.015
 ago+0.015
 toes+0.015
Neg logit contributions
 Owl-0.017
 owl-0.015
 again-0.015
Da-0.015
 tighter-0.015
Token away
Token ID1497
Feature activation-1.44e+00

Pos logit contributions
 Owl+0.006
 again+0.005
 tighter+0.005
 aback+0.004
 berries+0.004
Neg logit contributions
kin-0.028
inton-0.028
 ago-0.026
orses-0.026
orse-0.025
Token a
Token ID257
Feature activation+1.71e+00

Pos logit contributions
inton+0.036
kin+0.035
 ago+0.033
orses+0.032
 toes+0.031
Neg logit contributions
 Owl-0.023
 again-0.021
 tighter-0.021
lim-0.020
 aback-0.020
Token long
Token ID890
Feature activation+2.64e+00

Pos logit contributions
inton+0.037
kin+0.037
 ago+0.033
orses+0.032
ippers+0.031
Neg logit contributions
 Owl-0.036
 owl-0.033
 again-0.033
 berries-0.033
 tighter-0.032
Token nose
Token ID9686
Feature activation+2.47e+00

Pos logit contributions
inton+0.057
kin+0.053
ippers+0.047
orses+0.047
inations+0.045
Neg logit contributions
 Owl-0.056
 owl-0.054
 ac-0.052
 berries-0.051
 again-0.051
Token and
Token ID290
Feature activation+3.18e-01

Pos logit contributions
inton+0.070
kin+0.069
orses+0.063
inations+0.061
ippers+0.061
Neg logit contributions
 again-0.037
 Owl-0.035
 owl-0.033
 ac-0.032
 berries-0.031
Token big
Token ID1263
Feature activation+4.49e-02

Pos logit contributions
inton+0.009
kin+0.009
 ago+0.008
orses+0.008
 toes+0.008
Neg logit contributions
 Owl-0.005
 again-0.004
 tighter-0.004
 berries-0.004
 owl-0.004
Token ears
Token ID11368
Feature activation+4.90e+00

Pos logit contributions
inton+0.001
kin+0.001
 ago+0.001
 toes+0.001
orses+0.001
Neg logit contributions
 Owl-0.001
lim-0.001
Da-0.001
 tighter-0.001
 again-0.001
Token.
Token ID13
Feature activation-2.56e-01

Pos logit contributions
kin+0.128
inton+0.125
orses+0.112
ippers+0.111
inations+0.110
Neg logit contributions
 again-0.090
 Owl-0.083
 owl-0.079
 ac-0.079
 berries-0.076
Token
Token ID198
Feature activation-3.70e-02

Pos logit contributions
 Owl+0.006
 again+0.005
 Tweet+0.005
 berries+0.005
Da+0.005
Neg logit contributions
kin-0.006
inton-0.005
orses-0.005
 ago-0.005
 toes-0.005
Token
Token ID198
Feature activation-1.73e+00

Pos logit contributions
 again+0.001
 Owl+0.001
Da+0.001
 Tweet+0.001
 aback+0.001
Neg logit contributions
kin-0.001
inton-0.001
 toes-0.001
 ago-0.001
ippers-0.001
Token"
Token ID1
Feature activation-4.93e+00

Pos logit contributions
Da+0.027
 tighter+0.026
 Owl+0.025
 again+0.025
lim+0.024
Neg logit contributions
kin-0.046
inton-0.045
 toes-0.043
 ago-0.042
ippers-0.042
TokenWhat
Token ID2061
Feature activation+4.21e-01

Pos logit contributions
 Owl+0.073
 again+0.067
lim+0.065
 tighter+0.064
 aback+0.064
Neg logit contributions
inton-0.136
kin-0.136
 ago-0.124
orses-0.124
orse-0.120
Token from
Token ID422
Feature activation-1.15e+00

Pos logit contributions
 Owl+0.075
Da+0.069
 berries+0.067
lim+0.067
 again+0.067
Neg logit contributions
inton-0.113
kin-0.110
 ago-0.101
orses-0.101
orse-0.097
Token the
Token ID262
Feature activation-1.68e+00

Pos logit contributions
Da+0.022
lim+0.022
 Owl+0.022
 aback+0.021
 again+0.021
Neg logit contributions
inton-0.024
kin-0.023
 ago-0.022
 wear-0.021
 toes-0.021
Token trap
Token ID12840
Feature activation-1.99e+00

Pos logit contributions
xc+0.030
 aback+0.027
lim+0.026
Da+0.026
 again+0.026
Neg logit contributions
 ago-0.038
kin-0.037
inton-0.037
orses-0.035
 arrive-0.034
Token.
Token ID13
Feature activation-5.62e-01

Pos logit contributions
 Owl+0.032
 again+0.030
 tighter+0.028
 owl+0.028
 aback+0.028
Neg logit contributions
inton-0.053
kin-0.053
 ago-0.048
orses-0.048
 toes-0.047
Token B
Token ID347
Feature activation+1.18e+00

Pos logit contributions
 Owl+0.011
 again+0.010
 tighter+0.010
 owl+0.010
 berries+0.010
Neg logit contributions
inton-0.013
kin-0.013
 ago-0.011
orses-0.011
 toes-0.011
Tokenamb
Token ID4131
Feature activation+5.05e+00

Pos logit contributions
inton+0.025
kin+0.024
 ago+0.022
orses+0.022
 toes+0.021
Neg logit contributions
 Owl-0.026
 again-0.024
 tighter-0.023
 berries-0.023
Da-0.023
Tokeni
Token ID72
Feature activation-2.41e-02

Pos logit contributions
inton+0.144
kin+0.144
 ago+0.132
orses+0.131
 toes+0.127
Neg logit contributions
 Owl-0.072
 again-0.064
 tighter-0.063
 owl-0.062
 berries-0.062
Token was
Token ID373
Feature activation+9.10e-01

Pos logit contributions
 tighter+0.001
 Owl+0.000
 aback+0.000
Da+0.000
lim+0.000
Neg logit contributions
inton-0.000
kin-0.000
orses-0.000
orse-0.000
ilot-0.000
Token very
Token ID845
Feature activation-3.80e-01

Pos logit contributions
kin+0.023
inton+0.023
 ago+0.021
orses+0.021
orse+0.020
Neg logit contributions
 Owl-0.015
lim-0.014
 berries-0.014
 tighter-0.014
 owl-0.014
Token happy
Token ID3772
Feature activation+1.14e+00

Pos logit contributions
 Owl+0.009
 tighter+0.008
 again+0.008
 aback+0.008
 owl+0.008
Neg logit contributions
inton-0.008
kin-0.008
 ago-0.007
orses-0.007
 toes-0.007
Token and
Token ID290
Feature activation-9.71e-01

Pos logit contributions
inton+0.027
kin+0.026
 ago+0.025
orses+0.024
orse+0.024
Neg logit contributions
 Owl-0.020
lim-0.019
 tighter-0.019
Da-0.019
 berries-0.019
Token.
Token ID13
Feature activation+1.56e-03

Pos logit contributions
kin+0.078
inton+0.076
orses+0.072
orse+0.070
 ago+0.070
Neg logit contributions
 again-0.056
 Owl-0.050
 ac-0.047
 tighter-0.047
 owl-0.046
Token Billy
Token ID15890
Feature activation-7.50e-01

Pos logit contributions
inton+0.000
kin+0.000
 ago+0.000
orses+0.000
orse+0.000
Neg logit contributions
 tighter-0.000
 Owl-0.000
lim-0.000
 aback-0.000
 berries-0.000
Token said
Token ID531
Feature activation-3.83e-02

Pos logit contributions
 Owl+0.019
 aback+0.018
lim+0.018
iw+0.018
Da+0.018
Neg logit contributions
inton-0.011
kin-0.010
 wear-0.009
orse-0.009
 hardly-0.009
Token,
Token ID11
Feature activation-8.43e-01

Pos logit contributions
 Owl+0.001
 aback+0.001
 tighter+0.001
lim+0.001
 berries+0.001
Neg logit contributions
kin-0.001
inton-0.001
orses-0.001
orse-0.001
 ago-0.001
Token "
Token ID366
Feature activation-3.38e+00

Pos logit contributions
lim+0.022
 aback+0.022
 Owl+0.022
 tighter+0.022
 berries+0.022
Neg logit contributions
inton-0.012
kin-0.011
 ago-0.010
orse-0.010
orses-0.010
TokenTr
Token ID2898
Feature activation+5.15e+00

Pos logit contributions
 Owl+0.068
 berries+0.066
 again+0.063
 aback+0.062
 tighter+0.061
Neg logit contributions
inton-0.077
kin-0.070
orses-0.067
orse-0.064
 hardly-0.064
Tokenains
Token ID1299
Feature activation+3.89e+00

Pos logit contributions
kin+0.095
 hardly+0.094
orses+0.087
 toes+0.087
inton+0.085
Neg logit contributions
lim-0.110
 Owl-0.107
 again-0.105
st-0.104
 aback-0.102
Token are
Token ID389
Feature activation+1.23e+00

Pos logit contributions
kin+0.077
inton+0.072
orses+0.066
orse+0.065
 ago+0.064
Neg logit contributions
 Owl-0.090
 again-0.090
 ac-0.087
 owl-0.086
 berries-0.085
Token boring
Token ID14262
Feature activation+8.76e-01

Pos logit contributions
kin+0.031
inton+0.031
 ago+0.028
orses+0.028
orse+0.027
Neg logit contributions
 Owl-0.021
 again-0.019
 berries-0.019
 tighter-0.019
 owl-0.019
Token.
Token ID13
Feature activation+8.93e-01

Pos logit contributions
kin+0.024
inton+0.023
orses+0.022
orse+0.021
 hardly+0.021
Neg logit contributions
 again-0.014
 Owl-0.014
 owl-0.012
 berries-0.012
 aback-0.012
Token I
Token ID314
Feature activation+2.10e+00

Pos logit contributions
kin+0.019
inton+0.018
orses+0.015
 ago+0.015
 toes+0.015
Neg logit contributions
 Owl-0.021
 again-0.019
 Tweet-0.019
 owl-0.019
lim-0.019
Token just
Token ID655
Feature activation+2.27e+00

Pos logit contributions
kin+0.071
inton+0.069
orses+0.064
 toes+0.061
 ago+0.061
Neg logit contributions
 Owl-0.056
 again-0.052
 aback-0.050
 owl-0.050
lim-0.050
Token need
Token ID761
Feature activation-1.08e+00

Pos logit contributions
kin+0.037
inton+0.037
orses+0.031
ippers+0.030
ilot+0.029
Neg logit contributions
 Owl-0.061
 again-0.059
 owl-0.057
 berries-0.057
lim-0.057
Token to
Token ID284
Feature activation+1.37e+00

Pos logit contributions
 Owl+0.014
Da+0.012
 tighter+0.012
 owl+0.012
lim+0.012
Neg logit contributions
inton-0.031
kin-0.031
 ago-0.030
orses-0.028
 toes-0.028
Token appreciate
Token ID9144
Feature activation+2.36e+00

Pos logit contributions
kin+0.030
inton+0.029
orses+0.027
 ago+0.026
ilot+0.026
Neg logit contributions
 Owl-0.029
 again-0.028
 ac-0.027
 owl-0.027
 berries-0.026
Token what
Token ID644
Feature activation+6.95e-02

Pos logit contributions
inton+0.059
kin+0.059
orses+0.053
 ago+0.052
orse+0.051
Neg logit contributions
 Owl-0.042
 again-0.040
 berries-0.038
 tighter-0.038
 owl-0.037
Token we
Token ID356
Feature activation+4.87e+00

Pos logit contributions
inton+0.001
kin+0.001
 ago+0.001
orses+0.001
ippers+0.001
Neg logit contributions
 Owl-0.001
 again-0.001
 berries-0.001
 owl-0.001
 tighter-0.001
Token already
Token ID1541
Feature activation+1.74e+00

Pos logit contributions
kin+0.091
inton+0.087
orses+0.076
 toes+0.072
ilot+0.072
Neg logit contributions
 Owl-0.122
 again-0.114
 owl-0.113
 aback-0.113
 ac-0.113
Token have
Token ID423
Feature activation-2.11e+00

Pos logit contributions
kin+0.034
inton+0.032
ippers+0.029
ilot+0.029
 toes+0.028
Neg logit contributions
 again-0.040
 Owl-0.038
 ac-0.038
 berries-0.037
 aback-0.036
Token.
Token ID13
Feature activation+8.33e-02

Pos logit contributions
 again+0.037
 Owl+0.035
 aback+0.033
 berries+0.032
 tighter+0.032
Neg logit contributions
kin-0.053
inton-0.052
orses-0.048
orse-0.047
 toes-0.046
Token<|endoftext|>
Token ID50256
Feature activation+1.89e+00

Pos logit contributions
kin+0.002
inton+0.002
 ago+0.001
orses+0.001
 toes+0.001
Neg logit contributions
 Owl-0.002
 again-0.002
Da-0.002
 tighter-0.002
 berries-0.002
TokenOnce
Token ID7454
Feature activation-4.03e-02

Pos logit contributions
inton+0.051
kin+0.050
 toes+0.047
orses+0.045
ippers+0.045
Neg logit contributions
 again-0.034
 Owl-0.030
 aback-0.028
Da-0.028
 Tweet-0.027
Token and
Token ID290
Feature activation+3.48e-01

Pos logit contributions
Da+0.012
xc+0.012
 Owl+0.012
 aback+0.012
lim+0.011
Neg logit contributions
kin-0.011
inton-0.011
 ago-0.011
 toes-0.010
orses-0.009
Token strong
Token ID1913
Feature activation+2.90e-01

Pos logit contributions
kin+0.008
inton+0.007
 ago+0.007
orses+0.007
 toes+0.007
Neg logit contributions
 Owl-0.007
 aback-0.007
 again-0.007
lim-0.006
 tighter-0.006
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
kin+0.006
inton+0.006
orses+0.005
 ago+0.005
orse+0.005
Neg logit contributions
 Owl-0.007
 again-0.006
 owl-0.006
 tighter-0.006
 berries-0.006
Token All
Token ID1439
Feature activation+1.21e+00

Pos logit contributions
kin+0.025
inton+0.025
ippers+0.021
orses+0.021
 ago+0.020
Neg logit contributions
 Owl-0.020
 Tweet-0.018
 again-0.018
 owl-0.018
 berries-0.017
Token the
Token ID262
Feature activation+2.01e+00

Pos logit contributions
inton+0.033
kin+0.032
 ago+0.029
ippers+0.029
ana+0.028
Neg logit contributions
 Owl-0.020
 again-0.019
 owl-0.018
 berries-0.018
 ac-0.018
Token animals
Token ID4695
Feature activation+5.91e+00

Pos logit contributions
inton+0.038
 toes+0.037
kin+0.037
 ago+0.035
orses+0.034
Neg logit contributions
 aback-0.042
xc-0.041
 tighter-0.041
 Owl-0.040
Da-0.040
Token around
Token ID1088
Feature activation+2.66e+00

Pos logit contributions
inton+0.135
kin+0.134
 ago+0.120
orses+0.119
ippers+0.118
Neg logit contributions
 Owl-0.116
 again-0.112
 owl-0.110
 berries-0.107
 ac-0.104
Token the
Token ID262
Feature activation+5.80e-01

Pos logit contributions
inton+0.071
kin+0.070
ippers+0.064
orses+0.060
ana+0.060
Neg logit contributions
 again-0.047
 Owl-0.044
 berries-0.040
 owl-0.040
 Tweet-0.038
Token tree
Token ID5509
Feature activation+8.68e-01

Pos logit contributions
inton+0.013
kin+0.012
 ago+0.012
 toes+0.012
 hardly+0.012
Neg logit contributions
 aback-0.010
 tighter-0.010
xc-0.010
Da-0.009
 again-0.009
Token were
Token ID547
Feature activation+5.33e-01

Pos logit contributions
kin+0.019
inton+0.019
ippers+0.016
orses+0.016
 ago+0.015
Neg logit contributions
 Owl-0.018
 again-0.018
 owl-0.017
 berries-0.017
lim-0.017
Token happy
Token ID3772
Feature activation+1.59e+00

Pos logit contributions
kin+0.014
inton+0.014
orses+0.012
 ago+0.012
ippers+0.012
Neg logit contributions
 Owl-0.009
 owl-0.009
 again-0.008
 berries-0.008
 tighter-0.008
Token field
Token ID2214
Feature activation-4.04e-02

Pos logit contributions
 Owl+0.034
 aback+0.033
Da+0.032
 berries+0.032
lim+0.032
Neg logit contributions
inton-0.030
kin-0.030
 ago-0.028
orses-0.027
 toes-0.026
Token to
Token ID284
Feature activation+9.99e-01

Pos logit contributions
 Owl+0.001
 again+0.001
 tighter+0.001
 owl+0.001
 aback+0.001
Neg logit contributions
inton-0.001
kin-0.001
orses-0.001
 ago-0.001
ippers-0.001
Token shoot
Token ID2686
Feature activation-3.77e-01

Pos logit contributions
inton+0.018
kin+0.018
 ago+0.016
orses+0.015
 toes+0.015
Neg logit contributions
 Owl-0.025
 again-0.024
 owl-0.023
 tighter-0.023
lim-0.023
Token fair
Token ID3148
Feature activation-1.75e+00

Pos logit contributions
 Owl+0.008
Da+0.007
lim+0.007
 owl+0.007
 berries+0.007
Neg logit contributions
kin-0.008
inton-0.008
 ago-0.007
 toes-0.007
orses-0.007
Token arrows
Token ID20507
Feature activation-9.76e-01

Pos logit contributions
 again+0.030
 Owl+0.028
 owl+0.026
 tighter+0.026
 berries+0.026
Neg logit contributions
kin-0.046
inton-0.045
orses-0.041
 ago-0.040
orse-0.040
Token at
Token ID379
Feature activation+1.77e+00

Pos logit contributions
 Owl+0.018
 again+0.017
 tighter+0.016
 owl+0.016
 aback+0.015
Neg logit contributions
inton-0.024
kin-0.024
orses-0.022
 ago-0.022
orse-0.021
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
kin+0.042
inton+0.041
 ago+0.038
orses+0.037
 toes+0.036
Neg logit contributions
 Owl-0.034
 tighter-0.031
Da-0.031
 berries-0.031
 again-0.031
Token moon
Token ID8824
Feature activation+1.50e+00

Pos logit contributions
 Owl+0.026
 aback+0.025
Da+0.025
 berries+0.025
 tighter+0.024
Neg logit contributions
inton-0.023
kin-0.023
 ago-0.022
orses-0.020
 toes-0.020
Token.
Token ID13
Feature activation+1.02e-01

Pos logit contributions
inton+0.034
kin+0.034
orses+0.030
 ago+0.030
ippers+0.029
Neg logit contributions
 Owl-0.031
 again-0.031
 owl-0.028
 tighter-0.027
lim-0.026
Token<|endoftext|>
Token ID50256
Feature activation+1.73e+00

Pos logit contributions
kin+0.003
inton+0.003
orses+0.002
 toes+0.002
 ago+0.002
Neg logit contributions
 Owl-0.002
 aback-0.002
 again-0.002
 Tweet-0.002
 Oak-0.002
TokenOnce
Token ID7454
Feature activation-1.05e-01

Pos logit contributions
inton+0.050
kin+0.049
 toes+0.045
orses+0.045
orse+0.044
Neg logit contributions
 again-0.027
 Owl-0.027
 aback-0.022
Da-0.022
 Tweet-0.022
Token le
Token ID443
Feature activation-1.79e-01

Pos logit contributions
 Owl+0.019
 again+0.017
Da+0.017
 owl+0.017
lim+0.017
Neg logit contributions
inton-0.026
kin-0.026
 ago-0.024
orses-0.024
 toes-0.023
Tokenmons
Token ID11567
Feature activation-1.71e+00

Pos logit contributions
 Owl+0.004
 again+0.004
 tighter+0.004
 owl+0.004
Da+0.004
Neg logit contributions
inton-0.004
kin-0.004
ippers-0.003
orses-0.003
 ago-0.003
Token,
Token ID11
Feature activation+6.26e-01

Pos logit contributions
 Owl+0.028
 again+0.025
 tighter+0.025
 owl+0.025
Da+0.025
Neg logit contributions
kin-0.045
inton-0.045
 ago-0.041
orses-0.040
 toes-0.039
Token please
Token ID3387
Feature activation-3.40e-01

Pos logit contributions
inton+0.014
kin+0.014
 ago+0.013
orses+0.013
 toes+0.013
Neg logit contributions
 Owl-0.012
Da-0.011
lim-0.011
 aback-0.011
 tighter-0.011
Token."
Token ID526
Feature activation+1.52e+00

Pos logit contributions
 Owl+0.007
 again+0.006
 owl+0.006
 tighter+0.006
 berries+0.006
Neg logit contributions
inton-0.008
kin-0.008
orses-0.007
 ago-0.007
 toes-0.007
Token
Token ID220
Feature activation+2.60e+00

Pos logit contributions
inton+0.026
kin+0.025
 ago+0.022
orses+0.022
 toes+0.020
Neg logit contributions
 Owl-0.039
 again-0.037
 tighter-0.037
 owl-0.036
 berries-0.036
Token
Token ID198
Feature activation+7.70e-01

Pos logit contributions
kin+0.062
inton+0.058
 ago+0.052
 toes+0.052
orses+0.051
Neg logit contributions
 Owl-0.055
 again-0.051
 aback-0.049
 Tweet-0.048
Da-0.047
Token
Token ID198
Feature activation-8.91e-01

Pos logit contributions
kin+0.016
inton+0.015
 ago+0.014
 toes+0.013
orses+0.013
Neg logit contributions
 Owl-0.018
 again-0.017
Da-0.017
 aback-0.016
 tighter-0.016
TokenThe
Token ID464
Feature activation-8.39e-02

Pos logit contributions
 Owl+0.019
 again+0.018
 berries+0.018
 owl+0.018
 tighter+0.017
Neg logit contributions
inton-0.019
kin-0.018
orses-0.017
orse-0.016
 ago-0.015
Token woman
Token ID2415
Feature activation+4.21e-01

Pos logit contributions
 Owl+0.002
 again+0.001
 tighter+0.001
 aback+0.001
Da+0.001
Neg logit contributions
inton-0.002
kin-0.002
 ago-0.002
orses-0.002
 toes-0.002
Token gave
Token ID2921
Feature activation-3.51e-01

Pos logit contributions
kin+0.008
inton+0.007
orse+0.006
orses+0.006
 hardly+0.006
Neg logit contributions
 Owl-0.010
Da-0.009
 aback-0.009
 berries-0.009
 owl-0.009
Token Jane
Token ID12091
Feature activation-5.15e-01

Pos logit contributions
 Owl+0.012
lim+0.012
 tighter+0.012
 aback+0.012
 again+0.011
Neg logit contributions
inton-0.013
kin-0.013
 ago-0.011
 toes-0.011
orses-0.011
Token went
Token ID1816
Feature activation-3.83e+00

Pos logit contributions
 Owl+0.009
 again+0.008
 tighter+0.008
 owl+0.008
 berries+0.008
Neg logit contributions
kin-0.013
inton-0.013
 ago-0.012
orses-0.011
 toes-0.011
Token to
Token ID284
Feature activation-1.47e+00

Pos logit contributions
 Owl+0.055
Da+0.052
lim+0.051
 berries+0.050
 tighter+0.047
Neg logit contributions
inton-0.102
kin-0.098
 ago-0.097
 wear-0.092
ippers-0.092
Token the
Token ID262
Feature activation-2.07e+00

Pos logit contributions
 Owl+0.029
 again+0.027
 tighter+0.027
 berries+0.026
 owl+0.026
Neg logit contributions
inton-0.034
kin-0.034
 ago-0.030
orses-0.030
 toes-0.029
Token park
Token ID3952
Feature activation+4.51e-01

Pos logit contributions
 Owl+0.047
 aback+0.047
Da+0.046
 tighter+0.045
lim+0.044
Neg logit contributions
inton-0.038
kin-0.037
 ago-0.033
 toes-0.032
 arrive-0.031
Token to
Token ID284
Feature activation+9.85e-01

Pos logit contributions
inton+0.012
kin+0.012
 ago+0.011
orses+0.011
 toes+0.010
Neg logit contributions
 Owl-0.007
 again-0.006
 tighter-0.006
 berries-0.006
Da-0.006
Token practice
Token ID3357
Feature activation+1.26e+00

Pos logit contributions
kin+0.019
inton+0.019
orses+0.017
 ago+0.016
 toes+0.015
Neg logit contributions
 Owl-0.024
 again-0.023
 owl-0.022
 aback-0.022
 berries-0.022
Token with
Token ID351
Feature activation+8.04e-01

Pos logit contributions
inton+0.030
kin+0.030
orses+0.027
 ago+0.027
orse+0.026
Neg logit contributions
 Owl-0.023
 again-0.022
 tighter-0.021
 owl-0.021
 berries-0.021
Token the
Token ID262
Feature activation+1.22e-01

Pos logit contributions
inton+0.022
kin+0.022
 ago+0.020
orses+0.020
orse+0.020
Neg logit contributions
 Owl-0.012
 again-0.011
 tighter-0.011
 berries-0.011
 owl-0.011
Token clay
Token ID21558
Feature activation+1.23e+00

Pos logit contributions
inton+0.003
kin+0.003
orses+0.003
 ago+0.003
orse+0.003
Neg logit contributions
 Owl-0.002
 again-0.002
 owl-0.002
 berries-0.002
 tighter-0.002
Token.
Token ID13
Feature activation-2.91e-01

Pos logit contributions
kin+0.027
inton+0.026
 ago+0.024
orses+0.023
 toes+0.023
Neg logit contributions
 Owl-0.026
 tighter-0.024
 again-0.024
Da-0.024
lim-0.023
TokenThe
Token ID464
Feature activation+1.49e-01

Pos logit contributions
 Owl+0.019
Da+0.019
lim+0.018
 again+0.018
 tighter+0.018
Neg logit contributions
kin-0.024
inton-0.024
 ago-0.022
 toes-0.021
orses-0.021
Token little
Token ID1310
Feature activation-1.80e+00

Pos logit contributions
inton+0.004
orses+0.004
 ago+0.004
kin+0.004
ippers+0.004
Neg logit contributions
 again-0.002
 berries-0.002
 owl-0.002
 ac-0.002
 closer-0.002
Token girl
Token ID2576
Feature activation-4.63e-01

Pos logit contributions
 Owl+0.037
Da+0.032
lim+0.031
 aback+0.031
Bet+0.030
Neg logit contributions
kin-0.042
inton-0.038
 wear-0.037
 toes-0.037
 ago-0.036
Token had
Token ID550
Feature activation-3.06e+00

Pos logit contributions
 Owl+0.009
 again+0.008
 tighter+0.008
 owl+0.008
 berries+0.008
Neg logit contributions
inton-0.011
kin-0.011
 ago-0.010
orses-0.009
 toes-0.009
Token the
Token ID262
Feature activation-3.65e-01

Pos logit contributions
 Owl+0.054
 again+0.049
 tighter+0.049
 owl+0.048
 berries+0.048
Neg logit contributions
inton-0.077
kin-0.077
 ago-0.070
orses-0.069
 toes-0.066
Token most
Token ID749
Feature activation+2.19e+00

Pos logit contributions
 Owl+0.007
 owl+0.007
 again+0.007
 berries+0.006
 tighter+0.006
Neg logit contributions
inton-0.009
kin-0.008
orses-0.008
 ago-0.008
orse-0.008
Token wonderful
Token ID7932
Feature activation+1.87e+00

Pos logit contributions
inton+0.057
kin+0.056
 ago+0.053
orses+0.052
orse+0.050
Neg logit contributions
 Owl-0.035
 again-0.035
 berries-0.033
 tighter-0.033
 owl-0.033
Token time
Token ID640
Feature activation+2.57e+00

Pos logit contributions
inton+0.040
orses+0.038
kin+0.037
 ago+0.036
orse+0.035
Neg logit contributions
 again-0.038
 berries-0.037
 owl-0.037
 Owl-0.036
 ac-0.036
Token at
Token ID379
Feature activation+2.36e+00

Pos logit contributions
kin+0.067
inton+0.066
ilot+0.058
ippers+0.057
orses+0.057
Neg logit contributions
 again-0.047
 Owl-0.043
 berries-0.041
 owl-0.038
Da-0.038
Token the
Token ID262
Feature activation+8.31e-02

Pos logit contributions
kin+0.054
inton+0.054
 ago+0.049
orses+0.048
 toes+0.046
Neg logit contributions
 Owl-0.046
 tighter-0.043
 again-0.042
Da-0.042
 berries-0.042
Token beach
Token ID10481
Feature activation+1.10e+00

Pos logit contributions
kin+0.002
inton+0.002
orses+0.001
 ago+0.001
orse+0.001
Neg logit contributions
 Owl-0.002
 again-0.002
 tighter-0.002
lim-0.002
 berries-0.002
Token them
Token ID606
Feature activation-1.26e+00

Pos logit contributions
kin+0.030
inton+0.030
 toes+0.027
 ago+0.026
orses+0.025
Neg logit contributions
 Owl-0.039
lim-0.037
 owl-0.036
 tighter-0.036
 aback-0.036
Token back
Token ID736
Feature activation-4.22e+00

Pos logit contributions
 Owl+0.028
 again+0.027
 tighter+0.026
 berries+0.025
 owl+0.025
Neg logit contributions
inton-0.026
kin-0.026
 ago-0.024
orses-0.023
orse-0.022
Token.
Token ID13
Feature activation-6.52e-01

Pos logit contributions
 Owl+0.094
 again+0.090
 tighter+0.088
 owl+0.086
 berries+0.086
Neg logit contributions
inton-0.086
kin-0.086
orses-0.076
 ago-0.075
orse-0.072
Token "
Token ID366
Feature activation-3.15e+00

Pos logit contributions
 Owl+0.014
 again+0.013
Da+0.013
 berries+0.012
 Tweet+0.012
Neg logit contributions
kin-0.015
inton-0.014
orses-0.013
ippers-0.012
 ago-0.012
TokenYou
Token ID1639
Feature activation+2.36e+00

Pos logit contributions
 Owl+0.057
Da+0.052
 again+0.052
 tighter+0.052
 owl+0.051
Neg logit contributions
inton-0.077
kin-0.077
 ago-0.070
orses-0.069
 toes-0.067
Token are
Token ID389
Feature activation+1.12e+00

Pos logit contributions
inton+0.031
kin+0.031
 ago+0.026
orses+0.025
ippers+0.023
Neg logit contributions
 Owl-0.070
 again-0.067
 berries-0.066
 aback-0.066
 owl-0.066
Token my
Token ID616
Feature activation+1.15e+00

Pos logit contributions
inton+0.030
kin+0.030
 ago+0.028
orses+0.028
orse+0.027
Neg logit contributions
 Owl-0.018
 owl-0.016
 again-0.016
 berries-0.015
Da-0.015
Token sweet
Token ID6029
Feature activation+1.03e+00

Pos logit contributions
inton+0.025
kin+0.024
 ago+0.022
orses+0.021
 toes+0.021
Neg logit contributions
 Owl-0.024
 again-0.022
 tighter-0.022
Da-0.022
lim-0.022
Token cream
Token ID8566
Feature activation-1.47e+00

Pos logit contributions
inton+0.026
kin+0.026
 ago+0.024
orses+0.024
orse+0.023
Neg logit contributions
 Owl-0.018
 again-0.016
 berries-0.016
 owl-0.016
 tighter-0.016
Token pies
Token ID47199
Feature activation+8.12e-02

Pos logit contributions
 Owl+0.028
 again+0.026
 tighter+0.026
Da+0.026
lim+0.026
Neg logit contributions
kin-0.034
inton-0.034
 ago-0.031
orses-0.030
 toes-0.029
Token."
Token ID526
Feature activation+1.03e+00

Pos logit contributions
inton+0.002
kin+0.002
orses+0.002
orse+0.002
 hardly+0.002
Neg logit contributions
 again-0.001
 Owl-0.001
 berries-0.001
 tighter-0.001
 owl-0.001
Token was
Token ID373
Feature activation+1.30e+00

Pos logit contributions
inton+0.018
kin+0.018
 ago+0.017
orses+0.016
ippers+0.016
Neg logit contributions
 Owl-0.015
 again-0.014
 tighter-0.014
 owl-0.014
 berries-0.013
Token too
Token ID1165
Feature activation-1.87e-01

Pos logit contributions
inton+0.034
kin+0.033
orses+0.031
 ago+0.030
orse+0.030
Neg logit contributions
 Owl-0.021
 again-0.021
 tighter-0.020
 berries-0.019
 owl-0.019
Token strong
Token ID1913
Feature activation-1.77e+00

Pos logit contributions
 Owl+0.003
 tighter+0.003
 again+0.003
 berries+0.003
Da+0.003
Neg logit contributions
inton-0.005
kin-0.005
 ago-0.004
orses-0.004
orse-0.004
Token,
Token ID11
Feature activation+1.03e+00

Pos logit contributions
 Owl+0.028
 again+0.026
 tighter+0.025
 owl+0.025
 aback+0.025
Neg logit contributions
kin-0.048
inton-0.048
orses-0.043
 ago-0.043
orse-0.042
Token and
Token ID290
Feature activation+1.83e-01

Pos logit contributions
kin+0.028
inton+0.028
orses+0.025
 ago+0.025
ippers+0.025
Neg logit contributions
 Owl-0.016
 again-0.015
 tighter-0.014
 berries-0.014
 owl-0.014
Token it
Token ID340
Feature activation+3.45e-01

Pos logit contributions
kin+0.004
inton+0.004
 ago+0.004
orses+0.004
 toes+0.003
Neg logit contributions
 Owl-0.004
 again-0.004
 tighter-0.003
 owl-0.003
lim-0.003
Token made
Token ID925
Feature activation+1.25e+00

Pos logit contributions
kin+0.008
inton+0.007
orses+0.007
 ago+0.007
 toes+0.006
Neg logit contributions
 Owl-0.007
 aback-0.007
lim-0.007
Da-0.007
 owl-0.007
Token them
Token ID606
Feature activation-1.75e+00

Pos logit contributions
kin+0.033
inton+0.033
 ago+0.030
orses+0.030
orse+0.029
Neg logit contributions
 Owl-0.020
 again-0.019
 tighter-0.019
 berries-0.019
 owl-0.018
Token sh
Token ID427
Feature activation+2.52e+00

Pos logit contributions
 again+0.034
 tighter+0.032
 ac+0.032
 closer+0.031
 worse+0.030
Neg logit contributions
kin-0.042
inton-0.040
 ago-0.040
orses-0.037
ippers-0.036
Tokeniver
Token ID1428
Feature activation+2.38e+00

Pos logit contributions
inton+0.042
kin+0.041
 ago+0.035
orses+0.035
 toes+0.033
Neg logit contributions
 Owl-0.067
 again-0.063
 tighter-0.062
 berries-0.062
 owl-0.062
Token.
Token ID13
Feature activation-3.09e-01

Pos logit contributions
inton+0.049
kin+0.049
 ago+0.044
orses+0.043
 toes+0.041
Neg logit contributions
 Owl-0.052
 again-0.049
 tighter-0.048
 owl-0.048
 berries-0.048
Token helmet
Token ID14335
Feature activation-6.44e-01

Pos logit contributions
inton+0.023
kin+0.023
 ago+0.021
orses+0.021
 toes+0.020
Neg logit contributions
 Owl-0.016
 owl-0.015
 again-0.015
 tighter-0.015
 berries-0.015
Token and
Token ID290
Feature activation+7.17e-01

Pos logit contributions
 Owl+0.010
 again+0.010
 owl+0.009
 tighter+0.009
 ac+0.008
Neg logit contributions
inton-0.018
kin-0.018
orses-0.017
 ago-0.016
orse-0.016
Token a
Token ID257
Feature activation+1.74e+00

Pos logit contributions
inton+0.022
kin+0.022
orses+0.019
 ago+0.019
ippers+0.019
Neg logit contributions
 Owl-0.009
 again-0.008
 owl-0.008
 berries-0.008
 tighter-0.008
Token vest
Token ID19750
Feature activation+1.01e-01

Pos logit contributions
inton+0.041
kin+0.040
 ago+0.037
orses+0.036
 toes+0.034
Neg logit contributions
 Owl-0.033
 owl-0.031
 tighter-0.031
 again-0.031
 berries-0.030
Token.
Token ID13
Feature activation+1.60e-02

Pos logit contributions
inton+0.002
kin+0.002
orses+0.002
ippers+0.002
orse+0.002
Neg logit contributions
 again-0.002
 Owl-0.002
 owl-0.002
 tighter-0.002
 ac-0.002
Token
Token ID198
Feature activation+7.49e-02

Pos logit contributions
kin+0.000
inton+0.000
orses+0.000
 ago+0.000
ippers+0.000
Neg logit contributions
 Owl-0.000
 again-0.000
 aback-0.000
Da-0.000
 berries-0.000
Token
Token ID198
Feature activation-1.53e+00

Pos logit contributions
kin+0.002
inton+0.002
 toes+0.002
 ago+0.002
ippers+0.002
Neg logit contributions
 again-0.001
 Owl-0.001
Da-0.001
 aback-0.001
 tighter-0.001
Token"
Token ID1
Feature activation-4.88e+00

Pos logit contributions
Da+0.026
 again+0.026
 tighter+0.026
 Owl+0.025
 aback+0.024
Neg logit contributions
inton-0.039
kin-0.038
 ago-0.035
 toes-0.035
ippers-0.035
TokenHello
Token ID15496
Feature activation-1.74e-01

Pos logit contributions
 Owl+0.066
 again+0.059
 berries+0.057
lim+0.057
 tighter+0.057
Neg logit contributions
inton-0.140
kin-0.140
 ago-0.128
orses-0.128
orse-0.124
Token,
Token ID11
Feature activation+1.01e+00

Pos logit contributions
 Owl+0.003
 tighter+0.003
Da+0.003
 aback+0.003
lim+0.003
Neg logit contributions
kin-0.004
inton-0.004
 ago-0.004
orses-0.003
orse-0.003
Token sir
Token ID15967
Feature activation-2.40e+00

Pos logit contributions
kin+0.023
inton+0.023
 ago+0.020
orses+0.020
 toes+0.019
Neg logit contributions
 Owl-0.021
 owl-0.020
 again-0.020
 berries-0.019
 tighter-0.019
Token They
Token ID1119
Feature activation+1.45e+00

Pos logit contributions
 Owl+0.014
 Tweet+0.013
 again+0.012
 berries+0.012
Da+0.012
Neg logit contributions
kin-0.019
inton-0.017
inations-0.015
ippers-0.015
iding-0.015
Token say
Token ID910
Feature activation+3.86e-01

Pos logit contributions
kin+0.030
inton+0.029
 ago+0.027
orses+0.027
ippers+0.027
Neg logit contributions
 ac-0.031
 again-0.031
 tighter-0.031
 Owl-0.031
 berries-0.031
Token they
Token ID484
Feature activation-8.08e-01

Pos logit contributions
inton+0.008
kin+0.007
ippers+0.007
 ago+0.006
orses+0.006
Neg logit contributions
 again-0.009
 Owl-0.009
 berries-0.009
 Tweet-0.008
 owl-0.008
Token will
Token ID481
Feature activation+6.67e-01

Pos logit contributions
 again+0.018
 ac+0.016
 berries+0.016
 tighter+0.016
 Owl+0.015
Neg logit contributions
kin-0.018
inton-0.016
orse-0.015
orses-0.015
ippers-0.015
Token have
Token ID423
Feature activation-3.49e+00

Pos logit contributions
kin+0.016
inton+0.016
orses+0.015
ippers+0.014
orse+0.014
Neg logit contributions
 again-0.013
 Owl-0.012
 owl-0.011
 ac-0.011
 tighter-0.011
Token a
Token ID257
Feature activation-5.22e-01

Pos logit contributions
 Owl+0.062
lim+0.054
 aback+0.054
 owl+0.054
Da+0.053
Neg logit contributions
inton-0.089
kin-0.088
orses-0.082
 ago-0.082
 toes-0.080
Token happy
Token ID3772
Feature activation+1.49e-01

Pos logit contributions
 Owl+0.012
 aback+0.011
lim+0.010
Da+0.010
 owl+0.010
Neg logit contributions
kin-0.011
inton-0.011
orses-0.009
 ago-0.009
orse-0.009
Token birthday
Token ID10955
Feature activation+4.54e-01

Pos logit contributions
inton+0.002
kin+0.002
orses+0.002
 ago+0.002
ippers+0.002
Neg logit contributions
 Owl-0.004
 again-0.004
 owl-0.004
 berries-0.004
 tighter-0.004
Token for
Token ID329
Feature activation+1.68e+00

Pos logit contributions
inton+0.011
kin+0.011
 ago+0.010
orses+0.010
orse+0.009
Neg logit contributions
 Owl-0.008
 again-0.008
 tighter-0.008
 owl-0.008
 berries-0.008
Token Mom
Token ID11254
Feature activation-1.59e+00

Pos logit contributions
inton+0.037
kin+0.036
 ago+0.033
orses+0.032
ippers+0.031
Neg logit contributions
 Owl-0.035
 again-0.035
 berries-0.032
 tighter-0.032
 owl-0.032
Token.
Token ID13
Feature activation-6.72e-01

Pos logit contributions
 again+0.026
 Owl+0.025
 tighter+0.023
 owl+0.022
 berries+0.022
Neg logit contributions
kin-0.043
inton-0.042
orses-0.039
 ago-0.038
ippers-0.038
Token smiles
Token ID21845
Feature activation+1.19e+00

Pos logit contributions
 Owl+0.013
Da+0.013
 tighter+0.012
 aback+0.012
 again+0.012
Neg logit contributions
inton-0.012
kin-0.012
 ago-0.011
orses-0.010
orse-0.010
Token and
Token ID290
Feature activation-1.20e+00

Pos logit contributions
kin+0.035
inton+0.035
orses+0.033
 ago+0.032
orse+0.032
Neg logit contributions
 Owl-0.016
 again-0.016
 tighter-0.014
 owl-0.014
 aback-0.013
Token says
Token ID1139
Feature activation+3.88e-01

Pos logit contributions
 Owl+0.025
 again+0.023
 tighter+0.023
 owl+0.022
 berries+0.022
Neg logit contributions
inton-0.026
kin-0.026
 ago-0.024
orses-0.023
 toes-0.022
Token yes
Token ID3763
Feature activation-4.79e-02

Pos logit contributions
inton+0.009
kin+0.009
 ago+0.008
orses+0.008
 toes+0.007
Neg logit contributions
 Owl-0.008
 again-0.007
 berries-0.007
 owl-0.007
 tighter-0.007
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
 again+0.001
 Owl+0.001
 tighter+0.001
 owl+0.001
 berries+0.001
Neg logit contributions
kin-0.001
inton-0.001
orses-0.001
 ago-0.001
orse-0.001
Token He
Token ID679
Feature activation-1.09e+00

Pos logit contributions
 Owl+0.025
 again+0.024
 tighter+0.023
 Tweet+0.023
Da+0.023
Neg logit contributions
kin-0.026
inton-0.026
orses-0.023
 toes-0.022
 ago-0.022
Token gives
Token ID3607
Feature activation-3.96e-01

Pos logit contributions
 Owl+0.025
Da+0.025
 aback+0.024
 tighter+0.023
 again+0.023
Neg logit contributions
inton-0.020
kin-0.019
 ago-0.017
orses-0.017
 wear-0.016
Token them
Token ID606
Feature activation-2.87e+00

Pos logit contributions
 Owl+0.010
lim+0.009
 owl+0.009
Da+0.009
 aback+0.009
Neg logit contributions
kin-0.007
inton-0.006
 ago-0.006
orses-0.006
 toes-0.005
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
 Owl+0.058
lim+0.051
Da+0.051
 aback+0.051
 owl+0.050
Neg logit contributions
inton-0.062
kin-0.058
 toes-0.057
orses-0.053
 wear-0.052
Token basket
Token ID7988
Feature activation-2.20e+00

Pos logit contributions
 Owl+0.021
 again+0.020
lim+0.020
 aback+0.019
 tighter+0.019
Neg logit contributions
kin-0.024
inton-0.023
 ago-0.021
orses-0.021
ilot-0.021
Token and
Token ID290
Feature activation+2.23e-01

Pos logit contributions
 Owl+0.035
 again+0.034
 tighter+0.032
 owl+0.031
 berries+0.031
Neg logit contributions
inton-0.059
kin-0.058
orses-0.053
 ago-0.052
 toes-0.051
Token went
Token ID1816
Feature activation-3.63e+00

Pos logit contributions
inton+0.011
kin+0.011
 ago+0.010
orses+0.010
 toes+0.010
Neg logit contributions
 Owl-0.009
 again-0.008
 tighter-0.008
lim-0.008
 aback-0.008
Token inside
Token ID2641
Feature activation+2.11e-01

Pos logit contributions
 Owl+0.043
 again+0.037
 tighter+0.036
 owl+0.036
 berries+0.036
Neg logit contributions
kin-0.112
inton-0.112
 ago-0.104
orses-0.103
 toes-0.100
Token.
Token ID13
Feature activation-2.33e-02

Pos logit contributions
kin+0.007
inton+0.007
ilot+0.006
orses+0.006
ippers+0.006
Neg logit contributions
 again-0.004
 anyway-0.003
 closer-0.003
 Owl-0.003
 to-0.003
Token It
Token ID632
Feature activation-5.70e-01

Pos logit contributions
 Owl+0.000
 again+0.000
 owl+0.000
 berries+0.000
Da+0.000
Neg logit contributions
kin-0.001
inton-0.001
 ago-0.000
orses-0.000
orse-0.000
Token was
Token ID373
Feature activation+1.14e+00

Pos logit contributions
 Owl+0.014
 aback+0.014
lim+0.014
minded+0.014
 tighter+0.014
Neg logit contributions
kin-0.009
inton-0.009
 hardly-0.008
orse-0.008
orses-0.008
Token sm
Token ID895
Feature activation+5.72e-01

Pos logit contributions
kin+0.029
inton+0.029
 ago+0.027
orses+0.026
 toes+0.025
Neg logit contributions
 Owl-0.019
 again-0.018
lim-0.017
 tighter-0.017
 aback-0.017
Tokenelly
Token ID6148
Feature activation-5.47e-01

Pos logit contributions
inton+0.011
 ago+0.010
kin+0.010
ana+0.009
ippers+0.009
Neg logit contributions
 Owl-0.014
 again-0.013
 owl-0.013
 tighter-0.012
 berries-0.012
Token,
Token ID11
Feature activation+4.66e-01

Pos logit contributions
 Owl+0.011
Da+0.011
lim+0.010
 tighter+0.010
 aback+0.010
Neg logit contributions
inton-0.012
kin-0.011
 ago-0.011
 toes-0.011
orses-0.010
Token but
Token ID475
Feature activation+1.45e+00

Pos logit contributions
inton+0.011
kin+0.011
 ago+0.010
orses+0.010
 toes+0.010
Neg logit contributions
 Owl-0.008
lim-0.008
 tighter-0.007
 again-0.007
 aback-0.007
Token there
Token ID612
Feature activation+1.77e+00

Pos logit contributions
inton+0.035
kin+0.035
 ago+0.032
orses+0.031
orse+0.030
Neg logit contributions
 Owl-0.027
 again-0.025
 tighter-0.024
 berries-0.024
 owl-0.024
Token was
Token ID373
Feature activation+2.40e-01

Pos logit contributions
inton+0.044
kin+0.043
 ago+0.040
orses+0.039
ippers+0.038
Neg logit contributions
 Owl-0.032
 again-0.031
 berries-0.030
 owl-0.029
Da-0.029
Token Tim
Token ID5045
Feature activation-8.82e-01

Pos logit contributions
kin+0.061
inton+0.059
orses+0.053
 ago+0.053
ippers+0.052
Neg logit contributions
 Owl-0.034
 Tweet-0.032
 berries-0.031
Da-0.030
 owl-0.029
Token asked
Token ID1965
Feature activation+1.25e+00

Pos logit contributions
 Owl+0.018
 again+0.016
 berries+0.016
 tighter+0.016
lim+0.016
Neg logit contributions
inton-0.020
kin-0.020
 ago-0.018
orses-0.017
 toes-0.017
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
inton+0.027
kin+0.027
 ago+0.025
orses+0.025
orse+0.023
Neg logit contributions
 Owl-0.026
 again-0.026
 tighter-0.025
 aback-0.024
 owl-0.024
Token
Token ID198
Feature activation+1.35e+00

Pos logit contributions
inton+0.023
kin+0.023
orses+0.020
 ago+0.020
orse+0.018
Neg logit contributions
 Owl-0.023
 again-0.021
Da-0.021
 Tweet-0.021
 aback-0.021
Token
Token ID198
Feature activation-1.94e-01

Pos logit contributions
inton+0.032
kin+0.032
 ago+0.030
 toes+0.029
orses+0.028
Neg logit contributions
 Owl-0.026
 again-0.025
Da-0.024
 Tweet-0.023
 aback-0.023
Token"
Token ID1
Feature activation-4.61e+00

Pos logit contributions
 Owl+0.003
 again+0.003
Da+0.003
 tighter+0.003
lim+0.003
Neg logit contributions
inton-0.005
kin-0.005
 ago-0.005
orses-0.005
 toes-0.004
TokenNot
Token ID3673
Feature activation+1.10e+00

Pos logit contributions
 Owl+0.079
 again+0.070
 owl+0.069
 tighter+0.068
 berries+0.068
Neg logit contributions
inton-0.120
kin-0.119
 ago-0.108
orses-0.106
 toes-0.104
Token now
Token ID783
Feature activation-1.94e+00

Pos logit contributions
inton+0.026
kin+0.026
 ago+0.024
orses+0.024
 toes+0.023
Neg logit contributions
 Owl-0.020
 tighter-0.019
 again-0.018
Da-0.018
lim-0.018
Token,
Token ID11
Feature activation+4.78e-02

Pos logit contributions
 Owl+0.032
 again+0.031
 owl+0.028
 tighter+0.027
 berries+0.026
Neg logit contributions
kin-0.053
inton-0.052
orses-0.048
orse-0.046
 ago-0.046
Token sweet
Token ID6029
Feature activation+5.40e-01

Pos logit contributions
kin+0.001
inton+0.001
 ago+0.001
orses+0.001
ippers+0.001
Neg logit contributions
 Owl-0.001
 berries-0.001
 again-0.001
 owl-0.001
 tighter-0.001
Tokenie
Token ID494
Feature activation+8.64e-02

Pos logit contributions
kin+0.018
inton+0.018
 ago+0.017
orses+0.016
 hardly+0.016
Neg logit contributions
 Owl-0.005
 again-0.005
 berries-0.005
 owl-0.004
lim-0.004
Token,
Token ID11
Feature activation+2.33e+00

Pos logit contributions
 again+0.008
 Owl+0.007
 ac+0.007
 owl+0.007
 berries+0.006
Neg logit contributions
kin-0.011
inton-0.010
orses-0.010
ippers-0.009
orse-0.009
Token so
Token ID523
Feature activation+8.89e-01

Pos logit contributions
inton+0.057
kin+0.057
 ago+0.052
orses+0.051
 toes+0.049
Neg logit contributions
 Owl-0.043
 again-0.039
 berries-0.038
 owl-0.038
 tighter-0.038
Token he
Token ID339
Feature activation-2.99e-01

Pos logit contributions
kin+0.023
inton+0.023
 ago+0.021
ilot+0.020
orses+0.020
Neg logit contributions
 again-0.015
 ac-0.014
 berries-0.014
 Owl-0.014
 tighter-0.014
Token decide
Token ID5409
Feature activation-2.17e+00

Pos logit contributions
 Owl+0.006
 again+0.006
 aback+0.006
 tighter+0.006
lim+0.006
Neg logit contributions
kin-0.006
inton-0.006
 ago-0.006
orses-0.006
orse-0.005
Token to
Token ID284
Feature activation+1.29e+00

Pos logit contributions
 Owl+0.022
 again+0.021
 tighter+0.020
 owl+0.019
 berries+0.019
Neg logit contributions
kin-0.071
inton-0.070
 ago-0.065
orses-0.065
 toes-0.062
Token go
Token ID467
Feature activation-3.19e+00

Pos logit contributions
inton+0.029
kin+0.029
orses+0.028
 ago+0.028
orse+0.026
Neg logit contributions
 Owl-0.026
 owl-0.024
 ac-0.024
lim-0.024
 again-0.023
Token to
Token ID284
Feature activation-6.79e-01

Pos logit contributions
 Owl+0.045
Da+0.040
 tighter+0.039
lim+0.039
 berries+0.039
Neg logit contributions
inton-0.091
kin-0.089
 ago-0.083
orses-0.082
 toes-0.080
Token the
Token ID262
Feature activation-2.10e+00

Pos logit contributions
 Owl+0.010
 again+0.010
 owl+0.009
 berries+0.009
 ac+0.008
Neg logit contributions
kin-0.020
inton-0.020
orses-0.018
ilot-0.018
ippers-0.017
Token park
Token ID3952
Feature activation+8.14e-01

Pos logit contributions
 aback+0.055
 Owl+0.052
Da+0.050
 tighter+0.050
ign+0.050
Neg logit contributions
inton-0.032
kin-0.030
 arrive-0.028
 ago-0.028
 wear-0.027
Token.
Token ID13
Feature activation+7.42e-01

Pos logit contributions
kin+0.018
inton+0.018
orses+0.016
 ago+0.016
orse+0.015
Neg logit contributions
 Owl-0.017
 again-0.017
 tighter-0.015
 owl-0.015
 berries-0.015
Token John
Token ID1757
Feature activation-1.01e+00

Pos logit contributions
kin+0.023
inton+0.020
orses+0.019
orse+0.019
ippers+0.018
Neg logit contributions
 again-0.012
 aback-0.012
 Owl-0.011
 berries-0.009
 tighter-0.009
Token happy
Token ID3772
Feature activation+6.51e-01

Pos logit contributions
 aback+0.005
 closer+0.005
 cal+0.005
 again+0.005
 bigger+0.005
Neg logit contributions
kin-0.010
quin-0.009
asts-0.009
 unison-0.009
 ago-0.009
Token and
Token ID290
Feature activation-9.17e-01

Pos logit contributions
kin+0.018
inton+0.018
orses+0.016
 ago+0.016
ilot+0.015
Neg logit contributions
 again-0.011
 Owl-0.010
 owl-0.009
 berries-0.009
Da-0.009
Token thrilled
Token ID20536
Feature activation+5.79e-01

Pos logit contributions
 again+0.015
 tighter+0.015
 Owl+0.014
 ac+0.014
 Tweet+0.014
Neg logit contributions
kin-0.023
inton-0.023
 ago-0.022
ilot-0.022
asts-0.021
Token she
Token ID673
Feature activation-1.54e+00

Pos logit contributions
inton+0.013
kin+0.013
 ago+0.012
orses+0.011
 toes+0.011
Neg logit contributions
 Owl-0.012
 tighter-0.011
lim-0.010
Da-0.010
 berries-0.010
Token had
Token ID550
Feature activation-2.46e+00

Pos logit contributions
 again+0.030
 Owl+0.028
 ac+0.027
Da+0.026
 owl+0.026
Neg logit contributions
kin-0.035
inton-0.034
 ago-0.033
ippers-0.033
asts-0.031
Token managed
Token ID5257
Feature activation-3.44e+00

Pos logit contributions
 again+0.039
Da+0.038
 Owl+0.037
 aback+0.036
 tighter+0.035
Neg logit contributions
kin-0.065
inton-0.064
 ago-0.060
ippers-0.058
orses-0.058
Token to
Token ID284
Feature activation+5.04e-01

Pos logit contributions
 Owl+0.036
 again+0.031
 tighter+0.030
 owl+0.030
 berries+0.030
Neg logit contributions
inton-0.110
kin-0.110
 ago-0.102
orses-0.101
 toes-0.099
Token hide
Token ID7808
Feature activation+1.52e+00

Pos logit contributions
 ago+0.011
inton+0.011
kin+0.011
orses+0.010
ippers+0.010
Neg logit contributions
 again-0.010
 Owl-0.010
 ac-0.010
 owl-0.010
 berries-0.010
Token the
Token ID262
Feature activation-2.02e+00

Pos logit contributions
kin+0.035
inton+0.034
 toes+0.030
 ago+0.030
ippers+0.029
Neg logit contributions
 Owl-0.031
lim-0.028
Da-0.028
 owl-0.028
 aback-0.027
Token m
Token ID285
Feature activation+9.12e-01

Pos logit contributions
 Owl+0.038
 again+0.035
 tighter+0.034
 owl+0.034
 berries+0.034
Neg logit contributions
inton-0.048
kin-0.048
 ago-0.044
orses-0.043
 toes-0.042
Tokenop
Token ID404
Feature activation-1.13e+00

Pos logit contributions
kin+0.014
inton+0.014
 ago+0.012
orses+0.011
orse+0.010
Neg logit contributions
 Owl-0.025
 again-0.023
 owl-0.023
lim-0.023
 tighter-0.023
Token he
Token ID339
Feature activation-1.23e+00

Pos logit contributions
 Owl+0.005
 again+0.004
 tighter+0.004
 aback+0.004
 owl+0.004
Neg logit contributions
inton-0.006
kin-0.005
 ago-0.005
orses-0.005
ilot-0.005
Token put
Token ID1234
Feature activation-1.18e+00

Pos logit contributions
 Owl+0.026
 aback+0.023
Da+0.023
 berries+0.023
 owl+0.023
Neg logit contributions
kin-0.027
inton-0.027
orses-0.023
 ago-0.023
orse-0.023
Token the
Token ID262
Feature activation-1.84e+00

Pos logit contributions
 Owl+0.025
Da+0.023
lim+0.023
 owl+0.023
 berries+0.022
Neg logit contributions
kin-0.026
inton-0.024
 ago-0.022
orses-0.021
 toes-0.021
Token silver
Token ID8465
Feature activation-4.33e-01

Pos logit contributions
 Owl+0.032
Da+0.029
 again+0.028
lim+0.028
 tighter+0.028
Neg logit contributions
kin-0.047
inton-0.046
 ago-0.043
orses-0.042
 toes-0.041
Token coin
Token ID10752
Feature activation-4.59e+00

Pos logit contributions
Da+0.010
 Owl+0.009
Bet+0.009
 aback+0.008
lim+0.008
Neg logit contributions
kin-0.010
inton-0.009
 ago-0.008
 toes-0.008
ippers-0.008
Token back
Token ID736
Feature activation-4.70e+00

Pos logit contributions
 Owl+0.089
Da+0.084
 aback+0.080
 berries+0.079
lim+0.079
Neg logit contributions
kin-0.112
inton-0.102
 toes-0.095
 ago-0.095
orses-0.090
Token on
Token ID319
Feature activation+2.04e-01

Pos logit contributions
 Owl+0.076
 again+0.069
 tighter+0.068
 owl+0.067
 berries+0.067
Neg logit contributions
inton-0.123
kin-0.123
 ago-0.113
orses-0.111
 toes-0.108
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
inton+0.005
kin+0.005
orses+0.005
 ago+0.005
orse+0.004
Neg logit contributions
 Owl-0.004
 again-0.003
 tighter-0.003
 owl-0.003
 aback-0.003
Token ground
Token ID2323
Feature activation+2.83e-01

Pos logit contributions
 Owl+0.028
Da+0.026
 again+0.026
 aback+0.026
 tighter+0.026
Neg logit contributions
kin-0.032
inton-0.032
 ago-0.029
orses-0.028
 toes-0.028
Token and
Token ID290
Feature activation-5.27e-01

Pos logit contributions
inton+0.008
kin+0.008
orses+0.007
 ago+0.007
orse+0.007
Neg logit contributions
 Owl-0.004
 again-0.004
 tighter-0.004
 owl-0.004
 berries-0.004
Token vowed
Token ID19982
Feature activation-2.12e+00

Pos logit contributions
 Owl+0.011
Da+0.010
lim+0.010
 owl+0.010
 berries+0.010
Neg logit contributions
kin-0.012
inton-0.011
 ago-0.010
orses-0.010
 toes-0.010
Token.
Token ID13
Feature activation+7.15e-01

Pos logit contributions
 Owl+0.020
 again+0.019
 owl+0.018
 tighter+0.018
 berries+0.018
Neg logit contributions
inton-0.023
kin-0.023
 ago-0.020
orses-0.020
 toes-0.019
Token She
Token ID1375
Feature activation+1.03e+00

Pos logit contributions
inton+0.016
kin+0.016
 ago+0.014
orses+0.014
orse+0.014
Neg logit contributions
 Owl-0.014
 again-0.013
 berries-0.013
 owl-0.013
 tighter-0.013
Token loved
Token ID6151
Feature activation+1.08e+00

Pos logit contributions
inton+0.019
kin+0.019
 ago+0.018
orses+0.017
ippers+0.017
Neg logit contributions
 Owl-0.024
 again-0.023
 owl-0.023
lim-0.023
 ac-0.023
Token to
Token ID284
Feature activation+1.13e+00

Pos logit contributions
inton+0.030
kin+0.029
 ago+0.027
orses+0.027
 toes+0.027
Neg logit contributions
 Owl-0.016
 aback-0.015
 tighter-0.015
lim-0.014
 again-0.014
Token throw
Token ID3714
Feature activation-7.46e-01

Pos logit contributions
inton+0.025
kin+0.024
 ago+0.023
orses+0.022
ilot+0.021
Neg logit contributions
 ac-0.024
 Tweet-0.023
 owl-0.023
 Owl-0.023
 berries-0.022
Token it
Token ID340
Feature activation-4.35e+00

Pos logit contributions
 Owl+0.020
lim+0.018
xc+0.018
Da+0.018
ocular+0.017
Neg logit contributions
kin-0.012
inton-0.011
 wear-0.011
 toes-0.010
orses-0.009
Token up
Token ID510
Feature activation-1.16e+00

Pos logit contributions
 Owl+0.104
Da+0.095
 aback+0.094
minded+0.093
xc+0.093
Neg logit contributions
kin-0.081
inton-0.076
 toes-0.073
 wear-0.071
 ago-0.065
Token in
Token ID287
Feature activation+1.14e+00

Pos logit contributions
 Owl+0.026
 aback+0.024
 berries+0.023
lim+0.023
 owl+0.023
Neg logit contributions
kin-0.024
inton-0.022
 ago-0.022
 toes-0.021
 wear-0.019
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
inton+0.029
kin+0.029
orses+0.026
 ago+0.026
orse+0.025
Neg logit contributions
 Owl-0.020
 again-0.018
 tighter-0.018
 owl-0.018
 berries-0.018
Token air
Token ID1633
Feature activation+2.42e+00

Pos logit contributions
 Owl+0.025
 again+0.023
 tighter+0.023
Da+0.022
 owl+0.022
Neg logit contributions
inton-0.026
kin-0.026
 ago-0.023
orses-0.023
 toes-0.022
Token and
Token ID290
Feature activation+5.94e-02

Pos logit contributions
inton+0.063
kin+0.062
orses+0.057
 ago+0.056
orse+0.055
Neg logit contributions
 Owl-0.040
 again-0.038
 tighter-0.036
 owl-0.036
 berries-0.036