TOP ACTIVATIONS
MAX = 0.075

were all broken and her room was a mess . From
her favorite toy and her room . She also made her
happily together and Lily 's room stayed organized . <|endoftext|> Once
at home and everyone was dressed so nice . He didn
best friend . You are a wonderful princess ." <|endoftext|> Tom
loved her bear and his bow . She wanted to fix
old friends and his old room . " Come
you too . You are a clever girl and a good
over the place and her accessories were everywhere . It definitely
her zipper broke and her jacket became too cheap to wear
my started to cry . " I don 't want you
Lily was heart broken . " It 's not fine anymore
with their toys in the living room . They had many
with their toys in the living room . They liked to
but it wouldn 't bud ge . Max
so happy and wore her jacket every day . She never
with their toys in the living room . They liked to
with their toys in the living room . Lily had a
me today . You are a great superhero , a princess
you too . You are a good brother . I forgive

MIN ACTIVATIONS
MIN = -0.080

They see strawberry jam , raspberry jam , blue berry jam
work . She picked the ripe berries one by one and
" You can eat as much as you want !"
! I can eat as much as I want and no
. He pl ucked the ripe l imes from the trees
, there were lots of juicy grapes . A
pond with lots of y ummy bugs and fish ."
perfect bush with lots of juicy berries . Little Tim was
both took some of the juicy berries . But from nowhere
bear brought lots of y ummy food and shared it with
jumped in and drank as much as she wanted . She
. It was full of tasty fish . All day long
, so she ate as much as she wanted .
glad I could help the tad p oles ". <|endoftext|> One
They brought lots of y ummy food and drinks .
" Why hello there blue berry ! What brings you here
little pond . They saw lot of ducks swimming around .
fox gob bled up the juicy worm , feeling much better
said : " Take as much as you like and have
treat . He ate as much of the rain as he

INTERVAL 0.036 - 0.075
CONTAINS 0.331%

No , Mom , please . I am sorry . I
in the corner of the room . One day , a
thanked her for the wonderful presents . They thanked her for
my notebook ! You are a bad friend !"
got to the front , Tim my picked out a book

INTERVAL -0.002 - 0.036
CONTAINS 56.380%

Sometimes they saw big waves that crashed on the shore .
they tried and finally , they were able to carry the
a little boy named Tim my . Tim my loved to
jumped in the water and made a big splash with the
eager to get his power . But then , something bad

INTERVAL -0.041 - -0.002
CONTAINS 43.039%

the park . They skip and run and laugh . They
have a surprise for you , Tom . We have tickets
cage with a big , furry gorilla inside . "
and had lots of fun . One day ,
the swings and went down the slide . After they played

INTERVAL -0.080 - -0.041
CONTAINS 0.249%

of honey . She poured some honey over the mint leaves
Sure , I will add more water and then fold the
. He also saw a lot of things he didn 't
playing tag and having so much fun . Suddenly , Lily
night and wondered what it was like up so high .
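
The four interval boundaries listed above look like an equal-width split of the range between MIN and MAX. The sketch below checks that reading using only the rounded numbers printed in this dump. The four-bin, equal-width layout is an assumption inferred from those numbers, not something the dump states, and all names in the code are illustrative.

```python
# Rounded values copied from the dump above.
ACT_MIN = -0.080
ACT_MAX = 0.075
N_BINS = 4  # assumption: four equal-width intervals

# Interval boundaries as printed, lowest to highest.
PRINTED_EDGES = [-0.080, -0.041, -0.002, 0.036, 0.075]

# "CONTAINS" percentages as printed, in order of appearance (highest interval first).
PRINTED_SHARES = [0.331, 56.380, 43.039, 0.249]


def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced boundaries from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


edges = equal_width_edges(ACT_MIN, ACT_MAX, N_BINS)

# Largest absolute difference between a computed and a printed boundary.
max_edge_gap = max(abs(e - p) for e, p in zip(edges, PRINTED_EDGES))

# The four buckets should account for essentially all tokens.
share_total = sum(PRINTED_SHARES)

print("computed edges:", [round(e, 5) for e in edges])
print("largest gap to printed edges:", round(max_edge_gap, 5))
print("sum of printed shares:", round(share_total, 3))
```

With these inputs the computed boundaries stay within about 0.0005 of the printed ones, which is consistent with three-decimal rounding of MIN, MAX and the boundaries, and the printed shares sum to roughly 99.999%. The two middle buckets hold over 99% of tokens, so the sequences under TOP ACTIVATIONS and MIN ACTIVATIONS are drawn from thin tails.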

Token were
Token ID547
Feature activation+1.35e-02

Pos logit contributions
attered+0.058
 possession+0.057
 favorite+0.055
 favourite+0.054
 forever+0.051
Neg logit contributions
 answers-0.055
 ey-0.055
 say-0.054
useum-0.052
esame-0.052
Token all
Token ID477
Feature activation+1.45e-02

Pos logit contributions
attered+0.016
 possession+0.016
 favorite+0.015
 favourite+0.015
 nobody+0.013
Neg logit contributions
 answers-0.023
 ey-0.023
iles-0.022
 say-0.022
useum-0.022
Token broken
Token ID5445
Feature activation+3.05e-02

Pos logit contributions
attered+0.018
 possession+0.017
 favorite+0.017
 favourite+0.016
 nobody+0.015
Neg logit contributions
 answers-0.025
 ey-0.025
 say-0.024
useum-0.024
iles-0.024
Token and
Token ID290
Feature activation+4.68e-02

Pos logit contributions
attered+0.052
 possession+0.051
 favorite+0.051
 favourite+0.050
 nobody+0.049
Neg logit contributions
 answers-0.036
 ey-0.034
useum-0.033
esame-0.032
iles-0.032
Token her
Token ID607
Feature activation+3.10e-02

Pos logit contributions
attered+0.060
 possession+0.058
 favorite+0.057
 favourite+0.057
 forever+0.051
Neg logit contributions
 answers-0.075
 ey-0.075
 say-0.074
 calls-0.072
 calling-0.070
Token room
Token ID2119
Feature activation+7.51e-02

Pos logit contributions
attered+0.034
 possession+0.031
 favorite+0.029
 nobody+0.028
 forever+0.028
Neg logit contributions
 ey-0.057
 answers-0.057
 say-0.055
iles-0.054
useum-0.054
Token was
Token ID373
Feature activation+2.59e-02

Pos logit contributions
 possession+0.121
attered+0.117
 favourite+0.114
 favorite+0.114
 forever+0.106
Neg logit contributions
 answers-0.096
 calls-0.089
 say-0.089
 ey-0.087
useum-0.085
Token a
Token ID257
Feature activation+4.78e-02

Pos logit contributions
 possession+0.036
attered+0.035
 favourite+0.032
 favorite+0.032
Mid+0.029
Neg logit contributions
 answers-0.038
 calling-0.038
 say-0.037
 ey-0.037
 question-0.036
Token mess
Token ID2085
Feature activation+1.93e-02

Pos logit contributions
attered+0.073
 possession+0.065
 favourite+0.060
 favorite+0.059
organized+0.056
Neg logit contributions
 question-0.073
 sigh-0.069
 say-0.068
 answers-0.068
 ey-0.067
Token.
Token ID13
Feature activation+9.17e-03

Pos logit contributions
 favourite+0.035
 favorite+0.035
 possession+0.035
attered+0.034
 nobody+0.032
Neg logit contributions
 answers-0.022
 say-0.021
 calling-0.020
 calls-0.020
esame-0.020
Token From
Token ID3574
Feature activation-1.21e-02

Pos logit contributions
attered+0.011
 favorite+0.011
 possession+0.011
 nobody+0.010
 favourite+0.010
Neg logit contributions
iles-0.016
 ey-0.016
 answers-0.016
esame-0.016
useum-0.015
Token her
Token ID607
Feature activation+1.55e-02

Pos logit contributions
attered+0.056
 favorite+0.049
 favourite+0.049
 possession+0.048
 nobody+0.045
Neg logit contributions
 ey-0.052
 answers-0.052
 say-0.048
 calls-0.048
 calling-0.047
Token favorite
Token ID4004
Feature activation+2.74e-02

Pos logit contributions
attered+0.019
 possession+0.016
 nobody+0.015
 favorite+0.013
 forever+0.013
Neg logit contributions
 answers-0.028
 ey-0.028
 say-0.027
 question-0.027
 calls-0.026
Token toy
Token ID13373
Feature activation+4.63e-02

Pos logit contributions
attered+0.030
 nobody+0.023
 possession+0.023
 favorite+0.022
 favourite+0.022
Neg logit contributions
 answers-0.053
 ey-0.053
 question-0.052
 say-0.051
 calls-0.050
Token and
Token ID290
Feature activation+3.70e-02

Pos logit contributions
attered+0.087
 possession+0.082
 favourite+0.081
 favorite+0.079
readable+0.076
Neg logit contributions
 calls-0.044
 answers-0.042
 say-0.041
 pe-0.039
 slam-0.037
Token her
Token ID607
Feature activation+2.25e-02

Pos logit contributions
attered+0.059
 favourite+0.052
dates+0.052
 possession+0.051
 favorite+0.051
Neg logit contributions
 say-0.052
 answers-0.051
 says-0.050
 ey-0.049
 calls-0.049
Token room
Token ID2119
Feature activation+6.69e-02

Pos logit contributions
attered+0.029
 possession+0.024
 nobody+0.023
Mid+0.021
 forward+0.021
Neg logit contributions
 answers-0.038
 ey-0.038
 say-0.037
 question-0.036
 calls-0.036
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
 possession+0.110
attered+0.109
 favourite+0.106
 favorite+0.105
Mid+0.098
Neg logit contributions
 answers-0.082
 say-0.076
 calls-0.076
 slam-0.073
 ey-0.073
Token She
Token ID1375
Feature activation+2.20e-02

Pos logit contributions
attered+0.030
 possession+0.029
 favorite+0.028
 favourite+0.028
 forever+0.027
Neg logit contributions
 ey-0.024
 answers-0.023
useum-0.023
 say-0.023
esame-0.022
Token also
Token ID635
Feature activation+2.80e-02

Pos logit contributions
attered+0.033
 possession+0.033
 favorite+0.032
 favourite+0.031
 forever+0.030
Neg logit contributions
 ey-0.032
 answers-0.031
 say-0.031
useum-0.030
 calls-0.030
Token made
Token ID925
Feature activation+2.48e-02

Pos logit contributions
attered+0.043
 possession+0.042
 favorite+0.041
 favourite+0.041
 forever+0.038
Neg logit contributions
 say-0.039
 ey-0.039
 answers-0.039
 explains-0.037
 calls-0.037
Token her
Token ID607
Feature activation+3.26e-02

Pos logit contributions
attered+0.037
 possession+0.035
 forever+0.032
 favorite+0.031
 favourite+0.031
Neg logit contributions
 answers-0.038
 say-0.038
 calls-0.035
 ey-0.035
 explains-0.034
Token happily
Token ID18177
Feature activation-2.71e-03

Pos logit contributions
attered+0.003
 possession+0.003
 favorite+0.003
 favourite+0.002
 forever+0.002
Neg logit contributions
 ey-0.005
 answers-0.005
useum-0.005
iles-0.005
esame-0.005
Token together
Token ID1978
Feature activation+7.61e-03

Pos logit contributions
 ey+0.006
useum+0.006
 answers+0.005
iles+0.005
esame+0.005
Neg logit contributions
attered-0.003
 favorite-0.003
 possession-0.002
 forever-0.002
 favourite-0.002
Token and
Token ID290
Feature activation+2.81e-02

Pos logit contributions
 forever+0.010
 favorite+0.010
 possession+0.009
 favourite+0.009
attered+0.009
Neg logit contributions
 ey-0.014
useum-0.013
iles-0.013
esame-0.013
 answers-0.013
Token Lily
Token ID20037
Feature activation+2.41e-02

Pos logit contributions
attered+0.034
 favorite+0.034
 possession+0.034
 forever+0.033
 favourite+0.032
Neg logit contributions
 ey-0.050
useum-0.049
iles-0.048
 answers-0.048
rig-0.047
Token's
Token ID338
Feature activation+2.62e-02

Pos logit contributions
attered+0.034
 possession+0.033
 favorite+0.032
 favourite+0.031
 nobody+0.030
Neg logit contributions
 answers-0.037
 ey-0.036
 say-0.036
esame-0.036
iles-0.036
Token room
Token ID2119
Feature activation+6.69e-02

Pos logit contributions
 favorite+0.029
 favourite+0.028
attered+0.027
 possession+0.027
 forever+0.025
Neg logit contributions
 ey-0.050
useum-0.049
esame-0.049
iles-0.049
 answers-0.048
Token stayed
Token ID9658
Feature activation+3.29e-02

Pos logit contributions
attered+0.105
 possession+0.105
 favorite+0.094
 favourite+0.093
Mid+0.090
Neg logit contributions
 answers-0.090
 say-0.084
 calls-0.081
rig-0.080
 question-0.080
Token organized
Token ID8389
Feature activation+2.66e-02

Pos logit contributions
attered+0.050
 possession+0.046
Mid+0.041
 nobody+0.040
oun+0.040
Neg logit contributions
 say-0.047
 answers-0.046
 calling-0.046
rig-0.043
 perk-0.042
Token.
Token ID13
Feature activation+1.45e-02

Pos logit contributions
attered+0.043
 possession+0.040
 favorite+0.039
 favourite+0.038
 nobody+0.038
Neg logit contributions
 answers-0.036
 say-0.034
esame-0.034
rig-0.033
 ey-0.033
Token<|endoftext|>
Token ID50256
Feature activation-4.85e-03

Pos logit contributions
attered+0.024
 possession+0.024
 favorite+0.023
 favourite+0.022
 forever+0.022
Neg logit contributions
 ey-0.018
 answers-0.018
useum-0.018
esame-0.018
 say-0.018
TokenOnce
Token ID7454
Feature activation+2.11e-02

Pos logit contributions
 ey+0.008
useum+0.008
iles+0.008
 answers+0.008
rig+0.008
Neg logit contributions
 favorite-0.006
 possession-0.006
 forever-0.006
attered-0.006
 favourite-0.006
Token at
Token ID379
Feature activation+1.19e-02

Pos logit contributions
attered+0.042
 possession+0.041
 favorite+0.040
 favourite+0.039
 forever+0.038
Neg logit contributions
 ey-0.040
 answers-0.039
useum-0.039
iles-0.038
esame-0.038
Token home
Token ID1363
Feature activation-4.68e-04

Pos logit contributions
 favorite+0.013
 possession+0.013
 forever+0.013
 favourite+0.012
attered+0.012
Neg logit contributions
 ey-0.022
useum-0.022
iles-0.022
rig-0.022
 explains-0.021
Token and
Token ID290
Feature activation+3.03e-02

Pos logit contributions
 ey+0.001
useum+0.001
iles+0.001
rig+0.001
esame+0.001
Neg logit contributions
attered-0.001
 favorite-0.001
 possession-0.001
 forever-0.001
 no-0.001
Token everyone
Token ID2506
Feature activation+1.79e-02

Pos logit contributions
attered+0.038
 favorite+0.038
 possession+0.037
 nobody+0.037
 favourite+0.036
Neg logit contributions
 ey-0.052
iles-0.052
useum-0.051
esame-0.050
rig-0.050
Token was
Token ID373
Feature activation+8.35e-03

Pos logit contributions
attered+0.026
 possession+0.025
 favorite+0.025
 favourite+0.024
 forever+0.023
Neg logit contributions
 answers-0.027
 ey-0.027
useum-0.026
 say-0.026
iles-0.026
Token dressed
Token ID12049
Feature activation+6.66e-02

Pos logit contributions
attered+0.011
 possession+0.010
 favorite+0.010
 favourite+0.010
 forever+0.010
Neg logit contributions
 ey-0.014
 answers-0.014
useum-0.013
iles-0.013
esame-0.013
Token so
Token ID523
Feature activation+1.45e-02

Pos logit contributions
 possession+0.104
attered+0.103
Mid+0.100
 favourite+0.092
 nobody+0.092
Neg logit contributions
 answers-0.084
 say-0.077
 questions-0.072
 calling-0.070
 explains-0.068
Token nice
Token ID3621
Feature activation-4.75e-03

Pos logit contributions
 possession+0.015
attered+0.014
 favorite+0.013
 favourite+0.012
 forever+0.012
Neg logit contributions
 answers-0.027
useum-0.027
 ey-0.027
 say-0.026
iles-0.026
Token.
Token ID13
Feature activation+9.32e-03

Pos logit contributions
 ey+0.008
 answers+0.007
useum+0.007
iles+0.007
esame+0.007
Neg logit contributions
attered-0.006
 possession-0.006
 favorite-0.006
 favourite-0.006
 forever-0.006
Token He
Token ID679
Feature activation+2.62e-03

Pos logit contributions
attered+0.014
 possession+0.013
 favorite+0.013
 favourite+0.013
 forever+0.012
Neg logit contributions
 ey-0.014
 answers-0.014
iles-0.014
useum-0.014
esame-0.013
Token didn
Token ID1422
Feature activation+6.32e-04

Pos logit contributions
 favorite+0.003
attered+0.003
 favourite+0.003
 nobody+0.003
 forever+0.003
Neg logit contributions
 ey-0.005
iles-0.004
useum-0.004
esame-0.004
 answers-0.004
Token best
Token ID1266
Feature activation+4.02e-02

Pos logit contributions
attered+0.068
 possession+0.055
 nobody+0.048
organized+0.044
powered+0.042
Neg logit contributions
 answers-0.107
 question-0.103
 say-0.103
 questions-0.099
 calls-0.097
Token friend
Token ID1545
Feature activation+3.44e-02

Pos logit contributions
attered+0.062
 possession+0.053
 nobody+0.048
 favourite+0.045
organized+0.044
Neg logit contributions
 say-0.067
 question-0.066
 answers-0.065
 calls-0.065
 questions-0.061
Token.
Token ID13
Feature activation+3.67e-02

Pos logit contributions
attered+0.072
 possession+0.062
 favourite+0.062
 favorite+0.058
 proper+0.056
Neg logit contributions
 say-0.038
 says-0.037
 answers-0.036
 calls-0.035
 answered-0.031
Token You
Token ID921
Feature activation+8.65e-04

Pos logit contributions
attered+0.057
 possession+0.052
 favorite+0.050
 favourite+0.049
 forever+0.047
Neg logit contributions
 say-0.054
 ey-0.054
 answers-0.054
 calls-0.052
esame-0.052
Token are
Token ID389
Feature activation+3.39e-02

Pos logit contributions
attered+0.002
 possession+0.002
 favorite+0.002
 favourite+0.001
 nobody+0.001
Neg logit contributions
 say-0.001
 answers-0.001
esame-0.001
 explains-0.001
 calls-0.001
Token a
Token ID257
Feature activation+6.58e-02

Pos logit contributions
attered+0.072
 possession+0.061
 favourite+0.052
pool+0.051
oom+0.050
Neg logit contributions
 say-0.043
 answers-0.042
 calling-0.038
 question-0.036
 calls-0.035
Token wonderful
Token ID7932
Feature activation+2.72e-02

Pos logit contributions
attered+0.133
 possession+0.103
ors+0.097
bed+0.096
bled+0.096
Neg logit contributions
 say-0.091
 question-0.089
 cal-0.084
 answers-0.082
 pe-0.081
Token princess
Token ID21752
Feature activation+2.98e-02

Pos logit contributions
attered+0.035
 possession+0.030
 nobody+0.029
 favourite+0.028
 favorite+0.028
Neg logit contributions
 answers-0.051
 question-0.049
 say-0.049
 calls-0.048
 ey-0.047
Token."
Token ID526
Feature activation+6.00e-04

Pos logit contributions
attered+0.061
 favourite+0.052
 possession+0.051
 proper+0.050
 favorite+0.049
Neg logit contributions
 say-0.033
 answers-0.032
 calls-0.031
 says-0.030
iles-0.029
Token<|endoftext|>
Token ID50256
Feature activation-1.76e-02

Pos logit contributions
attered+0.001
 possession+0.001
 favorite+0.001
 favourite+0.001
 nobody+0.001
Neg logit contributions
 ey-0.001
 say-0.001
 answers-0.001
useum-0.001
esame-0.001
TokenTom
Token ID13787
Feature activation+1.48e-02

Pos logit contributions
 ey+0.027
rig+0.027
useum+0.026
 answers+0.025
iles+0.025
Neg logit contributions
 favorite-0.024
attered-0.023
 forever-0.023
 possession-0.023
 whole-0.023
Token loved
Token ID6151
Feature activation-8.22e-03

Pos logit contributions
attered+0.005
 favorite+0.004
 no+0.004
 nobody+0.004
 forever+0.004
Neg logit contributions
esame-0.007
 ey-0.007
iles-0.006
useum-0.006
rig-0.006
Token her
Token ID607
Feature activation+1.22e-02

Pos logit contributions
rig+0.018
useum+0.018
esame+0.017
 ey+0.017
 explains+0.017
Neg logit contributions
 favorite-0.008
 no-0.007
 possession-0.007
 nobody-0.007
 forever-0.006
Token bear
Token ID6842
Feature activation+1.54e-02

Pos logit contributions
 favorite+0.012
 possession+0.011
 favourite+0.011
 forever+0.010
attered+0.010
Neg logit contributions
esame-0.026
useum-0.025
iles-0.025
 ey-0.025
ount-0.025
Token and
Token ID290
Feature activation+2.32e-02

Pos logit contributions
attered+0.023
 possession+0.022
 favorite+0.021
 favourite+0.021
 nobody+0.020
Neg logit contributions
 answers-0.022
 ey-0.022
 say-0.021
esame-0.021
iles-0.021
Token his
Token ID465
Feature activation+2.02e-02

Pos logit contributions
attered+0.030
 possession+0.029
 favorite+0.029
 nobody+0.028
 favourite+0.027
Neg logit contributions
 ey-0.039
useum-0.038
iles-0.038
esame-0.038
 answers-0.038
Token bow
Token ID9563
Feature activation+6.55e-02

Pos logit contributions
 favorite+0.023
 possession+0.022
attered+0.022
 favourite+0.022
 forever+0.020
Neg logit contributions
 ey-0.037
useum-0.037
esame-0.037
 answers-0.036
iles-0.036
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
attered+0.114
 possession+0.110
 favourite+0.104
 favorite+0.100
Mid+0.099
Neg logit contributions
 answers-0.076
 say-0.073
 calls-0.068
 rust-0.067
 says-0.064
Token She
Token ID1375
Feature activation+5.02e-03

Pos logit contributions
attered+0.016
 favorite+0.016
 possession+0.015
 favourite+0.015
 nobody+0.015
Neg logit contributions
 ey-0.021
 answers-0.021
iles-0.021
useum-0.020
esame-0.020
Token wanted
Token ID2227
Feature activation+1.16e-02

Pos logit contributions
 favorite+0.006
attered+0.006
 nobody+0.006
 no+0.006
 possession+0.006
Neg logit contributions
 ey-0.009
esame-0.009
useum-0.009
iles-0.008
 answers-0.008
Token to
Token ID284
Feature activation+7.67e-03

Pos logit contributions
 favorite+0.012
 no+0.012
 nobody+0.011
 possession+0.011
 favourite+0.011
Neg logit contributions
rig-0.025
useum-0.024
 ey-0.024
iles-0.023
rimp-0.023
Token fix
Token ID4259
Feature activation+2.48e-03

Pos logit contributions
 favorite+0.011
 no+0.011
 possession+0.011
 nobody+0.011
 forever+0.011
Neg logit contributions
useum-0.012
 ey-0.011
iles-0.011
esame-0.011
rig-0.011
Token old
Token ID1468
Feature activation+5.96e-03

Pos logit contributions
 favorite+0.013
attered+0.013
 possession+0.013
 favourite+0.012
 forever+0.011
Neg logit contributions
 ey-0.029
esame-0.028
useum-0.028
iles-0.028
 answers-0.028
Token friends
Token ID2460
Feature activation+2.12e-02

Pos logit contributions
 favorite+0.006
 possession+0.006
 favourite+0.006
attered+0.006
 forever+0.005
Neg logit contributions
esame-0.012
 ey-0.012
iles-0.012
useum-0.012
rig-0.012
Token and
Token ID290
Feature activation+2.63e-02

Pos logit contributions
attered+0.036
 possession+0.035
 favorite+0.033
 favourite+0.033
 nobody+0.031
Neg logit contributions
 answers-0.027
 say-0.026
 ey-0.024
 question-0.024
esame-0.024
Token his
Token ID465
Feature activation+1.36e-02

Pos logit contributions
attered+0.036
 favorite+0.035
 possession+0.035
 favourite+0.034
 nobody+0.033
Neg logit contributions
 ey-0.042
useum-0.041
iles-0.041
esame-0.041
 answers-0.041
Token old
Token ID1468
Feature activation+8.51e-03

Pos logit contributions
 favorite+0.014
 possession+0.013
 favourite+0.013
attered+0.013
 forever+0.011
Neg logit contributions
esame-0.027
 ey-0.027
useum-0.027
iles-0.027
rig-0.026
Token room
Token ID2119
Feature activation+6.53e-02

Pos logit contributions
 favorite+0.009
 possession+0.009
attered+0.008
 favourite+0.008
 forever+0.008
Neg logit contributions
 ey-0.017
esame-0.016
useum-0.016
iles-0.016
rig-0.016
Token.
Token ID13
Feature activation-7.09e-04

Pos logit contributions
attered+0.107
 possession+0.105
 favourite+0.097
 favorite+0.096
Mid+0.093
Neg logit contributions
 answers-0.087
 say-0.080
 calls-0.079
 question-0.077
useum-0.076
Token
Token ID198
Feature activation-6.91e-03

Pos logit contributions
 ey+0.001
 answers+0.001
esame+0.001
iles+0.001
useum+0.001
Neg logit contributions
 favorite-0.001
attered-0.001
 favourite-0.001
 possession-0.001
 nobody-0.001
Token
Token ID198
Feature activation-1.67e-03

Pos logit contributions
 ey+0.011
 answers+0.011
useum+0.011
esame+0.011
iles+0.011
Neg logit contributions
 favorite-0.009
attered-0.009
 possession-0.009
 favourite-0.009
 forever-0.009
Token"
Token ID1
Feature activation+3.32e-02

Pos logit contributions
 ey+0.003
 answers+0.003
useum+0.002
iles+0.002
esame+0.002
Neg logit contributions
attered-0.002
 possession-0.002
 favorite-0.002
 favourite-0.002
 forever-0.002
TokenCome
Token ID16773
Feature activation-4.37e-03

Pos logit contributions
attered+0.044
 possession+0.042
 favorite+0.042
 favourite+0.040
 forever+0.039
Neg logit contributions
 ey-0.056
 answers-0.055
useum-0.054
iles-0.054
esame-0.053
Token you
Token ID345
Feature activation+1.15e-02

Pos logit contributions
attered+0.007
 possession+0.007
 favorite+0.006
 favourite+0.006
 nobody+0.006
Neg logit contributions
 ey-0.010
 answers-0.010
esame-0.010
useum-0.010
iles-0.010
Token too
Token ID1165
Feature activation+1.06e-02

Pos logit contributions
attered+0.022
 possession+0.021
 favourite+0.018
 favorite+0.018
 nobody+0.018
Neg logit contributions
 say-0.014
 answers-0.013
 questions-0.013
 calls-0.012
esame-0.012
Token.
Token ID13
Feature activation+3.21e-02

Pos logit contributions
attered+0.017
 possession+0.017
 favourite+0.015
 favorite+0.015
 nobody+0.015
Neg logit contributions
 answers-0.015
useum-0.014
 say-0.014
 calls-0.014
iles-0.014
Token You
Token ID921
Feature activation-4.53e-03

Pos logit contributions
attered+0.050
 possession+0.046
 favorite+0.044
 favourite+0.043
 model+0.041
Neg logit contributions
 say-0.048
 ey-0.047
 answers-0.047
 calls-0.046
esame-0.045
Token are
Token ID389
Feature activation+2.96e-02

Pos logit contributions
 answers+0.006
 say+0.005
 ey+0.005
esame+0.005
useum+0.005
Neg logit contributions
attered-0.008
 possession-0.008
 favorite-0.008
 favourite-0.007
 nobody-0.007
Token a
Token ID257
Feature activation+6.51e-02

Pos logit contributions
attered+0.055
 possession+0.049
 favourite+0.042
 favorite+0.041
powered+0.039
Neg logit contributions
 answers-0.041
 say-0.041
 calling-0.038
 calls-0.036
 question-0.036
Token clever
Token ID14169
Feature activation+3.68e-02

Pos logit contributions
attered+0.135
 possession+0.106
bed+0.103
bled+0.101
ors+0.099
Neg logit contributions
 say-0.087
 question-0.085
 cal-0.082
 sigh-0.081
 answers-0.079
Token girl
Token ID2576
Feature activation+2.30e-02

Pos logit contributions
attered+0.057
 nobody+0.048
 possession+0.047
 favourite+0.046
 favorite+0.044
Neg logit contributions
 answers-0.061
 question-0.060
 say-0.058
 calls-0.056
 answer-0.055
Token and
Token ID290
Feature activation+3.30e-02

Pos logit contributions
attered+0.042
 possession+0.038
 favourite+0.037
 favorite+0.035
 nobody+0.035
Neg logit contributions
 say-0.029
 answers-0.029
 calls-0.028
 says-0.026
 questions-0.025
Token a
Token ID257
Feature activation+5.71e-02

Pos logit contributions
attered+0.049
 possession+0.044
 favorite+0.039
 forever+0.039
 favourite+0.039
Neg logit contributions
 say-0.054
 answers-0.053
 calls-0.050
 says-0.049
 ey-0.049
Token good
Token ID922
Feature activation+3.41e-02

Pos logit contributions
attered+0.095
 possession+0.076
organized+0.066
bled+0.060
bed+0.060
Neg logit contributions
 say-0.096
 question-0.094
 answers-0.089
 sigh-0.088
 calls-0.087
Token over
Token ID625
Feature activation+1.69e-02

Pos logit contributions
 possession+0.024
attered+0.023
 favourite+0.022
 favorite+0.022
 nobody+0.021
Neg logit contributions
 answers-0.030
 ey-0.028
 say-0.028
 calling-0.027
 calls-0.027
Token the
Token ID262
Feature activation+5.31e-03

Pos logit contributions
attered+0.027
 favorite+0.025
 possession+0.025
 favourite+0.025
 nobody+0.024
Neg logit contributions
 answers-0.024
 ey-0.023
esame-0.022
iles-0.022
useum-0.022
Token place
Token ID1295
Feature activation+2.24e-02

Pos logit contributions
attered+0.007
 favorite+0.007
 possession+0.007
 favourite+0.007
 forever+0.006
Neg logit contributions
 ey-0.009
iles-0.009
useum-0.009
esame-0.008
 answers-0.008
Token and
Token ID290
Feature activation+2.47e-02

Pos logit contributions
attered+0.032
 possession+0.031
 favorite+0.031
 favourite+0.030
 forever+0.029
Neg logit contributions
 ey-0.034
 answers-0.033
useum-0.033
iles-0.032
esame-0.032
Token her
Token ID607
Feature activation+4.69e-03

Pos logit contributions
attered+0.027
 favorite+0.026
 possession+0.026
 nobody+0.025
 favourite+0.025
Neg logit contributions
 ey-0.046
iles-0.045
useum-0.045
 answers-0.045
esame-0.045
Token accessories
Token ID18199
Feature activation+6.42e-02

Pos logit contributions
 favorite+0.006
attered+0.006
 possession+0.006
 favourite+0.006
 forever+0.005
Neg logit contributions
iles-0.008
 ey-0.008
useum-0.008
esame-0.008
rig-0.008
Token were
Token ID547
Feature activation+2.52e-02

Pos logit contributions
attered+0.102
 possession+0.101
 favorite+0.097
 favourite+0.095
Mid+0.089
Neg logit contributions
 answers-0.085
 say-0.084
 calls-0.078
 ey-0.077
esame-0.076
Token everywhere
Token ID8347
Feature activation+2.60e-02

Pos logit contributions
 possession+0.032
attered+0.031
 favorite+0.028
 favourite+0.028
Mid+0.025
Neg logit contributions
 answers-0.042
 ey-0.041
 say-0.040
 calling-0.040
useum-0.039
Token.
Token ID13
Feature activation+2.27e-04

Pos logit contributions
attered+0.046
 possession+0.045
 favorite+0.043
 favourite+0.042
 proper+0.040
Neg logit contributions
 say-0.029
 calling-0.029
 answers-0.028
useum-0.027
 ey-0.027
Token It
Token ID632
Feature activation+2.13e-02

Pos logit contributions
attered+0.000
 favorite+0.000
 favourite+0.000
 possession+0.000
 nobody+0.000
Neg logit contributions
iles-0.000
 ey-0.000
 answers-0.000
useum-0.000
 say-0.000
Token definitely
Token ID4753
Feature activation+1.06e-02

Pos logit contributions
attered+0.032
 possession+0.032
 favorite+0.031
 favourite+0.030
 forever+0.029
Neg logit contributions
 answers-0.031
 ey-0.030
 say-0.030
useum-0.029
rig-0.029
Token her
Token ID607
Feature activation+8.73e-03

Pos logit contributions
 say+0.002
 calls+0.002
 says+0.002
 Jen+0.002
 answers+0.002
Neg logit contributions
attered-0.004
 possession-0.003
 forever-0.003
 dearly-0.003
 virtue-0.003
Token zipper
Token ID48992
Feature activation+3.20e-02

Pos logit contributions
attered+0.013
 possession+0.013
 forever+0.012
 dearly+0.012
 nobody+0.012
Neg logit contributions
 ey-0.012
 say-0.012
 question-0.012
 answers-0.012
 calls-0.012
Token broke
Token ID6265
Feature activation+1.70e-02

Pos logit contributions
 possession+0.056
attered+0.053
 favourite+0.052
 dearly+0.051
 favorite+0.051
Neg logit contributions
 say-0.036
 calls-0.036
 crept-0.036
 answers-0.035
 climbs-0.034
Token and
Token ID290
Feature activation+2.97e-02

Pos logit contributions
attered+0.028
 possession+0.028
 favorite+0.027
 favourite+0.027
 nobody+0.026
Neg logit contributions
 ey-0.021
 answers-0.021
esame-0.020
useum-0.020
iles-0.020
Token her
Token ID607
Feature activation+1.75e-02

Pos logit contributions
attered+0.048
 possession+0.047
 favourite+0.044
 favorite+0.043
 forever+0.041
Neg logit contributions
 say-0.040
 answers-0.038
 ey-0.038
 says-0.038
 calls-0.038
Token jacket
Token ID15224
Feature activation+6.35e-02

Pos logit contributions
attered+0.028
 forever+0.025
dates+0.025
 forward+0.024
overs+0.024
Neg logit contributions
 ey-0.024
 say-0.022
 answers-0.022
 question-0.021
 calls-0.021
Token became
Token ID2627
Feature activation+1.55e-02

Pos logit contributions
 possession+0.127
 favourite+0.119
 forever+0.118
 generation+0.118
dates+0.117
Neg logit contributions
 shook-0.055
 calls-0.053
 say-0.053
 shakes-0.052
 answered-0.052
Token too
Token ID1165
Feature activation+1.71e-02

Pos logit contributions
 possession+0.023
attered+0.022
 favourite+0.020
 favorite+0.019
dates+0.019
Neg logit contributions
 ey-0.022
 answers-0.022
 say-0.022
 rust-0.021
 pe-0.021
Token cheap
Token ID7026
Feature activation+6.48e-03

Pos logit contributions
 possession+0.024
 favourite+0.022
attered+0.022
 nobody+0.022
 favorite+0.021
Neg logit contributions
 answers-0.026
useum-0.026
 sigh-0.025
 say-0.024
esame-0.024
Token to
Token ID284
Feature activation+6.75e-03

Pos logit contributions
attered+0.010
 possession+0.010
 favorite+0.010
 favourite+0.010
 nobody+0.009
Neg logit contributions
 answers-0.009
 ey-0.009
useum-0.008
 say-0.008
esame-0.008
Token wear
Token ID5806
Feature activation+3.93e-02

Pos logit contributions
attered+0.012
 possession+0.012
 favorite+0.011
 favourite+0.011
 forever+0.011
Neg logit contributions
 say-0.008
 answers-0.008
 ey-0.008
esame-0.008
useum-0.008
Tokenmy
Token ID1820
Feature activation+2.46e-02

Pos logit contributions
attered+0.062
 possession+0.061
 favorite+0.060
 favourite+0.059
 nobody+0.057
Neg logit contributions
 ey-0.041
 answers-0.041
iles-0.039
useum-0.039
 say-0.039
Token started
Token ID2067
Feature activation+8.58e-03

Pos logit contributions
attered+0.039
 possession+0.038
 favorite+0.038
 favourite+0.036
 forever+0.036
Neg logit contributions
 ey-0.034
 answers-0.033
useum-0.033
iles-0.033
esame-0.032
Token to
Token ID284
Feature activation-2.20e-03

Pos logit contributions
attered+0.010
 favorite+0.010
 possession+0.010
 forever+0.010
 favourite+0.010
Neg logit contributions
iles-0.016
 ey-0.016
useum-0.015
esame-0.015
rig-0.015
Token cry
Token ID3960
Feature activation+2.25e-02

Pos logit contributions
iles+0.004
 ey+0.004
useum+0.004
esame+0.004
nine+0.004
Neg logit contributions
 no-0.003
 forever-0.003
 nobody-0.003
 favorite-0.003
 whole-0.002
Token.
Token ID13
Feature activation+1.90e-02

Pos logit contributions
 possession+0.040
attered+0.039
 favourite+0.037
 favorite+0.036
 nobody+0.035
Neg logit contributions
 say-0.027
 answers-0.026
 calls-0.025
useum-0.025
 calling-0.024
Token "
Token ID366
Feature activation+6.34e-02

Pos logit contributions
attered+0.031
 possession+0.030
 favorite+0.029
 favourite+0.029
 forever+0.028
Neg logit contributions
 ey-0.025
useum-0.025
 answers-0.025
esame-0.024
 say-0.024
TokenI
Token ID40
Feature activation+5.80e-03

Pos logit contributions
 possession+0.101
 favorite+0.098
 favourite+0.095
attered+0.094
 forever+0.090
Neg logit contributions
 say-0.083
 ey-0.081
rimp-0.078
rig-0.078
 explains-0.077
Token don
Token ID836
Feature activation-7.53e-03

Pos logit contributions
attered+0.009
 possession+0.008
 favorite+0.008
 favourite+0.008
 nobody+0.008
Neg logit contributions
 answers-0.008
 say-0.008
 ey-0.008
useum-0.008
rig-0.008
Token't
Token ID470
Feature activation-1.69e-02

Pos logit contributions
esame+0.010
 say+0.010
useum+0.010
 calling+0.010
 explains+0.010
Neg logit contributions
 possession-0.013
attered-0.012
 favorite-0.012
 favourite-0.012
 proper-0.011
Token want
Token ID765
Feature activation+1.34e-02

Pos logit contributions
 ey+0.024
useum+0.023
iles+0.023
esame+0.023
rig+0.022
Neg logit contributions
attered-0.026
 forever-0.026
 possession-0.026
 favorite-0.026
 no-0.025
Token you
Token ID345
Feature activation-1.03e-03

Pos logit contributions
 favorite+0.016
 possession+0.016
attered+0.015
 forever+0.015
 favourite+0.015
Neg logit contributions
 ey-0.024
rig-0.023
useum-0.023
iles-0.023
 answers-0.023
Token Lily
Token ID20037
Feature activation+2.31e-02

Pos logit contributions
 possession+0.020
attered+0.020
 forever+0.018
 favorite+0.018
 favourite+0.018
Neg logit contributions
useum-0.017
rig-0.016
 ey-0.016
 say-0.016
 answers-0.016
Token was
Token ID373
Feature activation+1.98e-02

Pos logit contributions
attered+0.030
 favorite+0.029
 possession+0.028
 forever+0.028
 favourite+0.027
Neg logit contributions
 ey-0.040
useum-0.039
iles-0.039
 answers-0.038
esame-0.038
Token heart
Token ID2612
Feature activation+3.18e-02

Pos logit contributions
attered+0.025
 possession+0.024
 favorite+0.023
 favourite+0.022
 forever+0.020
Neg logit contributions
 answers-0.033
 say-0.033
 ey-0.032
 question-0.031
rig-0.031
Tokenbroken
Token ID25826
Feature activation+1.46e-02

Pos logit contributions
 possession+0.032
attered+0.030
 favorite+0.029
 favourite+0.028
 nobody+0.027
Neg logit contributions
rig-0.058
 say-0.057
esame-0.057
 calling-0.057
 answers-0.056
Token.
Token ID13
Feature activation+1.84e-02

Pos logit contributions
attered+0.027
 possession+0.026
 favorite+0.026
 favourite+0.025
 nobody+0.024
Neg logit contributions
 answers-0.016
 say-0.015
 ey-0.015
esame-0.015
useum-0.015
Token "
Token ID366
Feature activation+6.33e-02

Pos logit contributions
attered+0.032
 possession+0.031
 favorite+0.030
 favourite+0.030
 forever+0.029
Neg logit contributions
 ey-0.022
 answers-0.022
useum-0.022
esame-0.021
 calls-0.021
TokenIt
Token ID1026
Feature activation+1.75e-02

Pos logit contributions
 possession+0.096
attered+0.093
 favorite+0.093
 favourite+0.091
 forever+0.089
Neg logit contributions
 ey-0.087
 say-0.081
rig-0.079
 explains-0.079
 answers-0.078
Token's
Token ID338
Feature activation+3.94e-03

Pos logit contributions
attered+0.025
 possession+0.025
 favourite+0.023
 favorite+0.023
 forever+0.023
Neg logit contributions
 answers-0.027
 say-0.027
 explains-0.026
 calls-0.026
 ey-0.025
Token not
Token ID407
Feature activation+5.34e-03

Pos logit contributions
attered+0.005
 possession+0.005
 favorite+0.004
 favourite+0.004
 forever+0.004
Neg logit contributions
 ey-0.007
 answers-0.007
 say-0.007
useum-0.007
esame-0.007
Token fine
Token ID3734
Feature activation+4.16e-03

Pos logit contributions
attered+0.006
 possession+0.006
 favorite+0.005
 favourite+0.005
 nobody+0.005
Neg logit contributions
 answers-0.010
 say-0.010
 ey-0.009
useum-0.009
esame-0.009
Token anymore
Token ID7471
Feature activation+2.53e-03

Pos logit contributions
 favorite+0.006
attered+0.006
 possession+0.006
 forever+0.006
 favourite+0.006
Neg logit contributions
 ey-0.007
useum-0.006
rig-0.006
iles-0.006
esame-0.006
Token with
Token ID351
Feature activation+4.50e-03

Pos logit contributions
attered+0.053
 possession+0.050
 favorite+0.045
 forever+0.045
 nobody+0.045
Neg logit contributions
 answers-0.046
 say-0.046
 ey-0.045
iles-0.044
esame-0.043
Token their
Token ID511
Feature activation+2.56e-02

Pos logit contributions
attered+0.006
 possession+0.006
 favorite+0.006
 favourite+0.005
 forever+0.005
Neg logit contributions
 ey-0.007
 answers-0.007
iles-0.007
useum-0.007
 say-0.007
Token toys
Token ID14958
Feature activation+1.29e-02

Pos logit contributions
attered+0.029
 forever+0.028
 possession+0.027
 nobody+0.026
 no+0.024
Neg logit contributions
 ey-0.047
 answers-0.046
 say-0.045
iles-0.044
 question-0.043
Token in
Token ID287
Feature activation+2.37e-02

Pos logit contributions
attered+0.018
 possession+0.017
 favorite+0.017
 favourite+0.016
 forever+0.016
Neg logit contributions
 ey-0.020
 answers-0.020
iles-0.020
useum-0.020
esame-0.020
Token the
Token ID262
Feature activation+2.83e-02

Pos logit contributions
attered+0.048
 possession+0.042
 favorite+0.042
 forever+0.041
 favourite+0.040
Neg logit contributions
 ey-0.026
 answers-0.026
 say-0.026
 question-0.024
iles-0.024
Token living
Token ID2877
Feature activation+6.32e-02

Pos logit contributions
attered+0.046
 possession+0.046
 forever+0.044
 nobody+0.043
 favorite+0.042
Neg logit contributions
 ey-0.039
 answers-0.037
 say-0.036
iles-0.034
 question-0.034
Token room
Token ID2119
Feature activation+3.35e-02

Pos logit contributions
Mid+0.114
 dearly+0.112
 forever+0.110
attered+0.110
oun+0.109
Neg logit contributions
 answers-0.072
 ey-0.063
 question-0.061
 says-0.057
 calls-0.056
Token.
Token ID13
Feature activation-5.02e-03

Pos logit contributions
attered+0.052
 possession+0.051
 favorite+0.048
 favourite+0.047
 nobody+0.046
Neg logit contributions
 answers-0.047
useum-0.045
iles-0.045
 ey-0.044
 say-0.044
Token They
Token ID1119
Feature activation+5.65e-03

Pos logit contributions
 ey+0.008
 answers+0.008
iles+0.008
esame+0.008
useum+0.008
Neg logit contributions
attered-0.007
 favorite-0.007
 possession-0.006
 favourite-0.006
 nobody-0.006
Token had
Token ID550
Feature activation+1.65e-02

Pos logit contributions
 possession+0.009
attered+0.009
 favorite+0.008
 favourite+0.008
 forever+0.008
Neg logit contributions
 say-0.008
 answers-0.008
 ey-0.007
 calls-0.007
iles-0.007
Token many
Token ID867
Feature activation-7.30e-06

Pos logit contributions
attered+0.025
 possession+0.025
 forever+0.021
 favorite+0.021
 favourite+0.021
Neg logit contributions
 answers-0.024
 say-0.023
 calls-0.022
 ey-0.022
iles-0.022
Token with
Token ID351
Feature activation+9.55e-03

Pos logit contributions
attered+0.055
 possession+0.052
 favorite+0.045
 dearly+0.045
 forever+0.045
Neg logit contributions
 answers-0.046
 say-0.045
 ey-0.044
iles-0.044
esame-0.043
Token their
Token ID511
Feature activation+2.40e-02

Pos logit contributions
attered+0.014
 possession+0.013
 favorite+0.013
 forever+0.013
 favourite+0.012
Neg logit contributions
 ey-0.015
 answers-0.014
iles-0.014
 say-0.014
esame-0.014
Token toys
Token ID14958
Feature activation+1.35e-02

Pos logit contributions
attered+0.028
 possession+0.026
 forever+0.025
 nobody+0.025
 no+0.023
Neg logit contributions
 ey-0.044
 answers-0.044
 say-0.042
iles-0.042
useum-0.041
Token in
Token ID287
Feature activation+2.46e-02

Pos logit contributions
attered+0.020
 possession+0.019
 favorite+0.018
 favourite+0.018
 nobody+0.017
Neg logit contributions
 answers-0.021
 ey-0.020
iles-0.020
esame-0.020
useum-0.020
Token the
Token ID262
Feature activation+2.68e-02

Pos logit contributions
attered+0.051
 favorite+0.044
 possession+0.044
 forever+0.043
 favourite+0.043
Neg logit contributions
 ey-0.027
 answers-0.026
 say-0.026
 question-0.024
iles-0.023
Token living
Token ID2877
Feature activation+6.27e-02

Pos logit contributions
attered+0.045
 possession+0.044
 forever+0.042
 nobody+0.041
 favorite+0.040
Neg logit contributions
 ey-0.036
 answers-0.034
 say-0.033
 calls-0.032
 question-0.032
Token room
Token ID2119
Feature activation+3.30e-02

Pos logit contributions
 dearly+0.123
Mid+0.121
 forever+0.118
oun+0.117
attered+0.117
Neg logit contributions
 answers-0.062
 ey-0.052
 question-0.051
 says-0.049
 opens-0.049
Token.
Token ID13
Feature activation-4.57e-03

Pos logit contributions
attered+0.052
 possession+0.051
 favorite+0.048
 favourite+0.047
 nobody+0.045
Neg logit contributions
 answers-0.046
useum-0.044
iles-0.043
 say-0.043
 ey-0.043
Token They
Token ID1119
Feature activation+4.92e-03

Pos logit contributions
 ey+0.008
esame+0.008
iles+0.008
rig+0.008
 answers+0.008
Neg logit contributions
 favorite-0.006
attered-0.005
 favourite-0.005
 no-0.005
 nobody-0.005
Token liked
Token ID8288
Feature activation-5.87e-03

Pos logit contributions
 possession+0.008
attered+0.008
 favorite+0.007
 favourite+0.007
 forever+0.007
Neg logit contributions
 say-0.007
 answers-0.007
 ey-0.006
 calls-0.006
iles-0.006
Token to
Token ID284
Feature activation+2.03e-03

Pos logit contributions
 ey+0.010
useum+0.010
esame+0.010
 answers+0.010
iles+0.010
Neg logit contributions
 favorite-0.007
attered-0.007
 possession-0.007
 favourite-0.007
 nobody-0.007
Token but
Token ID475
Feature activation+1.30e-02

Pos logit contributions
iles+0.015
useum+0.015
 ey+0.015
esame+0.014
 answers+0.014
Neg logit contributions
 no-0.015
 favorite-0.014
 forever-0.014
 nobody-0.014
 possession-0.014
Token it
Token ID340
Feature activation+2.24e-02

Pos logit contributions
attered+0.025
 possession+0.022
ini+0.022
 favourite+0.021
 favorite+0.021
Neg logit contributions
 say-0.016
 shakes-0.015
 slam-0.014
 Jen-0.014
 shook-0.014
Token wouldn
Token ID3636
Feature activation+8.93e-03

Pos logit contributions
 possession+0.037
attered+0.035
 favourite+0.034
 favorite+0.033
 forever+0.033
Neg logit contributions
 say-0.028
 answers-0.028
 shakes-0.027
 ey-0.027
 crept-0.027
Token't
Token ID470
Feature activation-1.03e-02

Pos logit contributions
 possession+0.014
attered+0.014
 favorite+0.013
 favourite+0.013
 proper+0.012
Neg logit contributions
 say-0.014
esame-0.013
 calls-0.013
 answers-0.013
 slam-0.012
Token bud
Token ID3600
Feature activation+1.90e-02

Pos logit contributions
 answers+0.008
 say+0.008
 ey+0.008
useum+0.008
esame+0.008
Neg logit contributions
attered-0.023
 possession-0.022
 favorite-0.022
 favourite-0.022
 nobody-0.021
Tokenge
Token ID469
Feature activation+6.26e-02

Pos logit contributions
 possession+0.037
 favorite+0.037
 favourite+0.037
attered+0.037
 nobody+0.036
Neg logit contributions
 answers-0.016
 say-0.016
iles-0.016
rig-0.016
 ey-0.016
Token.
Token ID13
Feature activation+1.73e-03

Pos logit contributions
attered+0.119
 favourite+0.118
 favorite+0.115
 possession+0.112
 nobody+0.107
Neg logit contributions
 answers-0.062
 say-0.054
 calling-0.053
 calls-0.053
esame-0.051
Token
Token ID220
Feature activation+2.97e-03

Pos logit contributions
attered+0.003
 possession+0.003
 favorite+0.003
 favourite+0.003
 forever+0.003
Neg logit contributions
 ey-0.002
 answers-0.002
useum-0.002
esame-0.002
 say-0.002
Token
Token ID198
Feature activation+6.56e-03

Pos logit contributions
attered+0.005
 possession+0.005
 favorite+0.005
 favourite+0.005
 forever+0.005
Neg logit contributions
 answers-0.003
 ey-0.003
useum-0.003
esame-0.003
 say-0.003
Token
Token ID198
Feature activation+1.60e-02

Pos logit contributions
attered+0.011
 possession+0.011
 favorite+0.010
 favourite+0.010
 forever+0.010
Neg logit contributions
 ey-0.008
 answers-0.008
useum-0.008
iles-0.008
esame-0.008
TokenMax
Token ID11518
Feature activation+1.44e-02

Pos logit contributions
 possession+0.022
attered+0.022
 favorite+0.021
 forever+0.020
 favourite+0.020
Neg logit contributions
useum-0.024
rig-0.024
 ey-0.024
esame-0.023
 say-0.023
Token so
Token ID523
Feature activation+2.37e-02

Pos logit contributions
attered+0.009
 possession+0.008
 favorite+0.007
 favourite+0.007
dates+0.007
Neg logit contributions
 say-0.008
 perk-0.007
 answers-0.007
 sigh-0.007
 ey-0.007
Token happy
Token ID3772
Feature activation-1.02e-03

Pos logit contributions
attered+0.037
 possession+0.036
 favorite+0.032
 favourite+0.032
dates+0.031
Neg logit contributions
 say-0.036
 ey-0.034
 perk-0.033
 sigh-0.032
 answers-0.032
Token and
Token ID290
Feature activation+2.23e-02

Pos logit contributions
 ey+0.002
 answers+0.002
useum+0.002
iles+0.002
esame+0.002
Neg logit contributions
attered-0.001
 possession-0.001
 favorite-0.001
 favourite-0.001
 forever-0.001
Token wore
Token ID12408
Feature activation+4.34e-02

Pos logit contributions
attered+0.040
 possession+0.037
 favorite+0.036
 favourite+0.035
dates+0.034
Neg logit contributions
 say-0.031
 says-0.027
 answered-0.025
 calls-0.025
 explains-0.025
Token her
Token ID607
Feature activation+1.37e-02

Pos logit contributions
attered+0.080
 favorite+0.061
 possession+0.060
istered+0.059
 nobody+0.059
Neg logit contributions
 answers-0.058
 say-0.055
 ey-0.053
esame-0.052
 whispers-0.051
Token jacket
Token ID15224
Feature activation+6.26e-02

Pos logit contributions
attered+0.024
 nobody+0.019
 possession+0.018
 generation+0.018
overs+0.018
Neg logit contributions
 ey-0.020
 say-0.019
 answers-0.018
 question-0.017
 sigh-0.017
Token every
Token ID790
Feature activation+2.60e-02

Pos logit contributions
attered+0.094
 possession+0.087
 natural+0.080
 nobody+0.077
istered+0.076
Neg logit contributions
 say-0.080
 answers-0.077
esame-0.074
 calling-0.072
 rust-0.071
Token day
Token ID1110
Feature activation+1.57e-02

Pos logit contributions
attered+0.031
 favorite+0.027
 possession+0.026
 favourite+0.026
 nobody+0.025
Neg logit contributions
 answers-0.047
 say-0.046
 question-0.046
 ey-0.045
rig-0.045
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
attered+0.026
 possession+0.025
 favorite+0.025
 favourite+0.023
 nobody+0.023
Neg logit contributions
 answers-0.019
 say-0.019
 calling-0.018
esame-0.018
 calls-0.017
Token She
Token ID1375
Feature activation+1.29e-02

Pos logit contributions
attered+0.023
 possession+0.022
 forever+0.020
 favorite+0.020
 favourite+0.020
Neg logit contributions
 ey-0.013
 answers-0.013
 say-0.012
useum-0.012
esame-0.012
Token never
Token ID1239
Feature activation+1.08e-02

Pos logit contributions
attered+0.020
 possession+0.019
 favorite+0.018
 favourite+0.017
 nobody+0.017
Neg logit contributions
 say-0.019
 ey-0.019
 answers-0.019
esame-0.018
 explains-0.018
Token with
Token ID351
Feature activation+9.29e-03

Pos logit contributions
attered+0.053
 possession+0.050
 favorite+0.045
 forever+0.045
 nobody+0.044
Neg logit contributions
 answers-0.047
 say-0.046
 ey-0.046
iles-0.045
esame-0.043
Token their
Token ID511
Feature activation+2.17e-02

Pos logit contributions
attered+0.012
 possession+0.012
 favorite+0.011
 favourite+0.011
 forever+0.011
Neg logit contributions
 ey-0.015
 answers-0.015
iles-0.015
useum-0.015
 say-0.015
Token toys
Token ID14958
Feature activation+1.81e-02

Pos logit contributions
attered+0.026
 forever+0.025
 possession+0.024
 nobody+0.024
 dearly+0.022
Neg logit contributions
 ey-0.039
 answers-0.038
 say-0.037
iles-0.036
 question-0.035
Token in
Token ID287
Feature activation+2.43e-02

Pos logit contributions
attered+0.026
 possession+0.025
 favorite+0.024
 favourite+0.024
 nobody+0.023
Neg logit contributions
 answers-0.028
 ey-0.027
iles-0.027
useum-0.027
esame-0.026
Token the
Token ID262
Feature activation+2.63e-02

Pos logit contributions
attered+0.048
 favorite+0.043
 possession+0.042
 favourite+0.042
 forever+0.041
Neg logit contributions
 ey-0.028
 answers-0.027
 say-0.026
 question-0.025
iles-0.025
Token living
Token ID2877
Feature activation+6.24e-02

Pos logit contributions
attered+0.044
 possession+0.044
 forever+0.042
 nobody+0.041
 favorite+0.040
Neg logit contributions
 ey-0.036
 answers-0.033
 say-0.032
 calls-0.031
 question-0.031
Token room
Token ID2119
Feature activation+3.36e-02

Pos logit contributions
 dearly+0.124
oun+0.122
Mid+0.121
attered+0.120
 forever+0.119
Neg logit contributions
 answers-0.064
 ey-0.054
 question-0.051
 calls-0.051
 says-0.051
Token.
Token ID13
Feature activation-8.01e-03

Pos logit contributions
attered+0.054
 possession+0.053
 favorite+0.049
 favourite+0.048
 nobody+0.047
Neg logit contributions
 answers-0.046
useum-0.044
iles-0.044
 say-0.043
 ey-0.042
Token They
Token ID1119
Feature activation-4.98e-04

Pos logit contributions
esame+0.014
 ey+0.014
 answers+0.014
rig+0.014
iles+0.014
Neg logit contributions
 favorite-0.010
attered-0.010
 favourite-0.010
 possession-0.009
 nobody-0.009
Token liked
Token ID8288
Feature activation-2.93e-03

Pos logit contributions
 say+0.001
 answers+0.001
 ey+0.001
iles+0.001
 calls+0.001
Neg logit contributions
 possession-0.001
attered-0.001
 favorite-0.001
 favourite-0.001
 forever-0.001
Token to
Token ID284
Feature activation+4.84e-03

Pos logit contributions
 ey+0.005
useum+0.005
esame+0.005
iles+0.005
 answers+0.005
Neg logit contributions
 favorite-0.004
attered-0.004
 possession-0.004
 favourite-0.003
 nobody-0.003
Token with
Token ID351
Feature activation+7.98e-03

Pos logit contributions
attered+0.053
 possession+0.050
 forever+0.045
 dearly+0.044
 favorite+0.044
Neg logit contributions
 answers-0.045
 say-0.045
 ey-0.044
iles-0.042
esame-0.041
Token their
Token ID511
Feature activation+2.36e-02

Pos logit contributions
attered+0.012
 possession+0.011
 forever+0.011
 favorite+0.010
 nobody+0.010
Neg logit contributions
 ey-0.012
 answers-0.012
iles-0.012
 say-0.012
esame-0.011
Token toys
Token ID14958
Feature activation+1.24e-02

Pos logit contributions
attered+0.029
 forever+0.028
 possession+0.026
 nobody+0.026
 dearly+0.025
Neg logit contributions
 answers-0.040
 ey-0.040
 say-0.039
 question-0.038
iles-0.038
Token in
Token ID287
Feature activation+2.36e-02

Pos logit contributions
attered+0.018
 possession+0.017
 favorite+0.017
 favourite+0.016
 forever+0.016
Neg logit contributions
 ey-0.019
 answers-0.019
useum-0.018
iles-0.018
esame-0.018
Token the
Token ID262
Feature activation+2.44e-02

Pos logit contributions
attered+0.050
 possession+0.043
 favorite+0.043
 forever+0.042
 favourite+0.042
Neg logit contributions
 ey-0.025
 answers-0.025
 say-0.025
 question-0.023
 calls-0.022
Token living
Token ID2877
Feature activation+6.22e-02

Pos logit contributions
attered+0.042
 possession+0.041
 forever+0.040
 nobody+0.038
 favorite+0.038
Neg logit contributions
 ey-0.032
 answers-0.030
 say-0.029
 calls-0.028
 question-0.028
Token room
Token ID2119
Feature activation+3.16e-02

Pos logit contributions
 dearly+0.120
Mid+0.118
attered+0.116
oun+0.116
 forever+0.115
Neg logit contributions
 answers-0.064
 ey-0.055
 question-0.053
 says-0.052
 pe-0.051
Token.
Token ID13
Feature activation-4.91e-03

Pos logit contributions
attered+0.049
 possession+0.048
 favorite+0.045
 favourite+0.044
 nobody+0.043
Neg logit contributions
 answers-0.045
useum-0.043
 ey-0.043
iles-0.043
 say-0.043
Token Lily
Token ID20037
Feature activation-6.12e-03

Pos logit contributions
 ey+0.008
 answers+0.008
esame+0.008
iles+0.008
rig+0.008
Neg logit contributions
attered-0.006
 favorite-0.006
 favourite-0.006
 possession-0.006
 nobody-0.006
Token had
Token ID550
Feature activation+2.59e-02

Pos logit contributions
 ey+0.010
 answers+0.010
useum+0.009
iles+0.009
 say+0.009
Neg logit contributions
attered-0.008
 possession-0.008
 favorite-0.008
 favourite-0.008
 forever-0.007
Token a
Token ID257
Feature activation+2.64e-02

Pos logit contributions
attered+0.045
 possession+0.043
 forever+0.039
eddy+0.037
rier+0.036
Neg logit contributions
 answers-0.032
 say-0.030
 calls-0.029
 Jen-0.028
 question-0.028
Token me
Token ID502
Feature activation-8.14e-03

Pos logit contributions
attered+0.035
 possession+0.032
 favorite+0.032
 favourite+0.031
 forever+0.030
Neg logit contributions
 answers-0.030
 ey-0.030
iles-0.029
esame-0.029
 say-0.029
Token today
Token ID1909
Feature activation+4.19e-03

Pos logit contributions
 ey+0.016
useum+0.015
 answers+0.015
iles+0.015
esame+0.015
Neg logit contributions
 favorite-0.009
attered-0.009
 possession-0.009
 forever-0.008
 favourite-0.008
Token.
Token ID13
Feature activation+3.79e-02

Pos logit contributions
attered+0.005
 possession+0.005
 favorite+0.005
 favourite+0.005
 nobody+0.005
Neg logit contributions
 ey-0.007
 answers-0.007
useum-0.007
iles-0.007
esame-0.007
Token You
Token ID921
Feature activation-4.97e-03

Pos logit contributions
attered+0.056
 possession+0.052
 favorite+0.050
 favourite+0.048
 forever+0.046
Neg logit contributions
 say-0.059
 ey-0.059
 answers-0.057
 calls-0.057
useum-0.057
Token are
Token ID389
Feature activation+3.00e-02

Pos logit contributions
 ey+0.007
 answers+0.006
useum+0.006
iles+0.006
esame+0.006
Neg logit contributions
attered-0.008
 favorite-0.008
 possession-0.008
 favourite-0.008
 forever-0.007
Token a
Token ID257
Feature activation+6.22e-02

Pos logit contributions
attered+0.057
 possession+0.051
 favourite+0.044
 favorite+0.044
istered+0.040
Neg logit contributions
 say-0.040
 answers-0.040
 calling-0.036
 question-0.036
iles-0.035
Token great
Token ID1049
Feature activation+3.15e-02

Pos logit contributions
attered+0.119
 possession+0.092
organized+0.085
bled+0.085
bed+0.084
Neg logit contributions
 say-0.091
 question-0.090
 pe-0.085
 answers-0.085
 cal-0.080
Token superhero
Token ID17343
Feature activation+1.95e-02

Pos logit contributions
attered+0.041
 possession+0.036
 nobody+0.034
 favourite+0.032
 favorite+0.032
Neg logit contributions
 answers-0.058
 say-0.056
 question-0.055
 calls-0.054
 ey-0.054
Token,
Token ID11
Feature activation+1.29e-02

Pos logit contributions
attered+0.032
 possession+0.028
 favourite+0.027
 favorite+0.027
 nobody+0.026
Neg logit contributions
 answers-0.028
 say-0.027
iles-0.026
 calls-0.026
esame-0.026
Token a
Token ID257
Feature activation+4.11e-02

Pos logit contributions
 favorite+0.016
 forever+0.015
 possession+0.015
 no+0.014
 favourite+0.014
Neg logit contributions
 ey-0.023
useum-0.023
iles-0.023
esame-0.022
 answers-0.022
Token princess
Token ID21752
Feature activation+1.77e-02

Pos logit contributions
attered+0.053
 possession+0.046
 favorite+0.040
 favourite+0.039
 nobody+0.038
Neg logit contributions
 say-0.075
 answers-0.074
 question-0.073
 ey-0.070
useum-0.070
Token you
Token ID345
Feature activation+7.65e-03

Pos logit contributions
attered+0.008
 possession+0.008
 favorite+0.008
 favourite+0.007
 forever+0.007
Neg logit contributions
 ey-0.012
 answers-0.011
esame-0.011
useum-0.011
iles-0.011
Token too
Token ID1165
Feature activation+8.35e-03

Pos logit contributions
attered+0.015
 possession+0.015
 favourite+0.013
 nobody+0.013
 favorite+0.013
Neg logit contributions
 say-0.009
 answers-0.008
 questions-0.008
 calls-0.008
 says-0.008
Token.
Token ID13
Feature activation+3.25e-02

Pos logit contributions
attered+0.014
 possession+0.014
 nobody+0.013
 favourite+0.013
 favorite+0.013
Neg logit contributions
 answers-0.011
useum-0.011
 say-0.010
 calls-0.010
iles-0.010
Token You
Token ID921
Feature activation-2.12e-03

Pos logit contributions
attered+0.049
 possession+0.046
 favorite+0.044
 favourite+0.043
 forever+0.041
Neg logit contributions
 ey-0.049
 say-0.049
 answers-0.048
 calls-0.047
esame-0.047
Token are
Token ID389
Feature activation+2.83e-02

Pos logit contributions
 answers+0.002
 say+0.002
 ey+0.002
esame+0.002
iles+0.002
Neg logit contributions
attered-0.004
 possession-0.004
 favorite-0.004
 favourite-0.004
 nobody-0.004
Token a
Token ID257
Feature activation+6.21e-02

Pos logit contributions
attered+0.051
 possession+0.046
 favourite+0.040
 favorite+0.039
 forever+0.037
Neg logit contributions
 answers-0.040
 say-0.039
 ey-0.036
 calling-0.035
 question-0.035
Token good
Token ID922
Feature activation+3.62e-02

Pos logit contributions
attered+0.120
 possession+0.093
bed+0.085
organized+0.083
 favourite+0.081
Neg logit contributions
 question-0.088
 say-0.087
 answers-0.085
 pe-0.082
 answer-0.079
Token brother
Token ID3956
Feature activation+1.09e-02

Pos logit contributions
attered+0.049
 possession+0.044
 nobody+0.044
 dearly+0.040
 favourite+0.040
Neg logit contributions
 answers-0.065
 say-0.063
 question-0.063
 calls-0.060
useum-0.058
Token.
Token ID13
Feature activation+3.32e-02

Pos logit contributions
attered+0.021
 possession+0.019
 favourite+0.019
 favorite+0.018
 nobody+0.017
Neg logit contributions
 say-0.013
 answers-0.013
 calls-0.012
 says-0.012
 explains-0.012
Token I
Token ID314
Feature activation+6.89e-04

Pos logit contributions
attered+0.043
 possession+0.041
 favorite+0.040
 favourite+0.038
 forever+0.037
Neg logit contributions
 ey-0.056
 answers-0.056
 say-0.055
useum-0.054
esame-0.054
Token forgive
Token ID20927
Feature activation-4.93e-03

Pos logit contributions
attered+0.001
 possession+0.001
 favorite+0.001
 favourite+0.001
 nobody+0.001
Neg logit contributions
 answers-0.001
 ey-0.001
 say-0.001
useum-0.001
iles-0.001
Token They
Token ID1119
Feature activation-2.37e-02

Pos logit contributions
 favorite+0.008
 favourite+0.007
 no+0.006
 nobody+0.006
attered+0.006
Neg logit contributions
 ey-0.017
iles-0.017
aches-0.016
rig-0.016
esame-0.016
Token see
Token ID766
Feature activation-9.80e-03

Pos logit contributions
rig+0.041
useum+0.040
esame+0.039
iles+0.038
rimp+0.038
Neg logit contributions
 no-0.033
 favorite-0.032
 forever-0.030
 favourite-0.029
 nobody-0.028
Token strawberry
Token ID41236
Feature activation-4.31e-02

Pos logit contributions
rig+0.019
 ey+0.019
useum+0.018
esame+0.018
rimp+0.018
Neg logit contributions
 favorite-0.012
 no-0.011
 nobody-0.010
 favourite-0.010
attered-0.010
Token jam
Token ID16853
Feature activation-3.28e-02

Pos logit contributions
rig+0.071
 ey+0.067
 respond+0.065
 question+0.063
useum+0.063
Neg logit contributions
 favorite-0.064
 favourite-0.059
 no-0.059
 perfectly-0.058
attered-0.058
Token,
Token ID11
Feature activation-1.96e-02

Pos logit contributions
 ey+0.068
rig+0.066
useum+0.062
 answers+0.061
 question+0.061
Neg logit contributions
 favorite-0.033
 forever-0.032
 no-0.031
 favourite-0.030
attered-0.029
Token raspberry
Token ID38973
Feature activation-7.95e-02

Pos logit contributions
rig+0.032
useum+0.030
 question+0.027
 answers+0.026
utt+0.026
Neg logit contributions
 no-0.034
 favorite-0.032
 favourite-0.029
 nobody-0.028
 toy-0.026
Token jam
Token ID16853
Feature activation-3.75e-02

Pos logit contributions
rig+0.155
 respond+0.135
 blame+0.133
 ey+0.133
 approach+0.132
Neg logit contributions
 favorite-0.100
 no-0.093
 toy-0.089
 favourite-0.089
 best-0.086
Token,
Token ID11
Feature activation-3.39e-02

Pos logit contributions
rig+0.075
 ey+0.075
useum+0.069
 answers+0.068
 approach+0.068
Neg logit contributions
 favorite-0.039
 forever-0.038
 no-0.038
 favourite-0.035
attered-0.035
Token blue
Token ID4171
Feature activation-5.04e-02

Pos logit contributions
rig+0.057
useum+0.054
 ey+0.051
 question+0.050
 answers+0.050
Neg logit contributions
 no-0.053
 favorite-0.052
 favourite-0.047
 forever-0.043
 perfectly-0.043
Tokenberry
Token ID8396
Feature activation-7.03e-02

Pos logit contributions
rig+0.100
 respond+0.083
 slam+0.081
utt+0.080
useum+0.080
Neg logit contributions
 favorite-0.065
 no-0.063
 favourite-0.059
 one-0.058
attered-0.057
Token jam
Token ID16853
Feature activation-3.74e-02

Pos logit contributions
rig+0.141
 respond+0.129
 ey+0.128
 hes+0.126
 hesitation+0.125
Neg logit contributions
 favorite-0.081
 no-0.076
 toy-0.074
 favourite-0.069
 whole-0.069
Token work
Token ID670
Feature activation-7.58e-03

Pos logit contributions
iles+0.022
oke+0.021
ount+0.021
useum+0.020
 ey+0.020
Neg logit contributions
 favorite-0.012
 no-0.012
 toy-0.012
 favourite-0.012
 possession-0.011
Token.
Token ID13
Feature activation-1.32e-02

Pos logit contributions
 ey+0.014
iles+0.013
useum+0.013
 answers+0.013
esame+0.013
Neg logit contributions
 possession-0.009
attered-0.009
 forever-0.009
 favorite-0.009
 favourite-0.009
Token She
Token ID1375
Feature activation-1.14e-02

Pos logit contributions
iles+0.022
 ey+0.022
 answers+0.021
useum+0.020
rig+0.020
Neg logit contributions
attered-0.018
 favorite-0.017
 possession-0.017
 favourite-0.017
 nobody-0.016
Token picked
Token ID6497
Feature activation-1.66e-02

Pos logit contributions
 ey+0.019
useum+0.017
 answers+0.017
iles+0.017
esame+0.017
Neg logit contributions
attered-0.016
 favorite-0.016
 favourite-0.016
 possession-0.015
 nobody-0.015
Token the
Token ID262
Feature activation-3.52e-02

Pos logit contributions
 ey+0.029
 answers+0.028
useum+0.028
iles+0.028
esame+0.028
Neg logit contributions
attered-0.021
 possession-0.020
 favorite-0.020
 favourite-0.019
 forever-0.018
Token ripe
Token ID29036
Feature activation-7.92e-02

Pos logit contributions
 ey+0.039
 answers+0.037
iles+0.037
useum+0.037
esame+0.036
Neg logit contributions
attered-0.066
 favorite-0.065
 possession-0.064
 favourite-0.063
 forever-0.061
Token berries
Token ID36322
Feature activation-4.78e-02

Pos logit contributions
rig+0.088
 ey+0.084
iles+0.083
rimp+0.081
 answers+0.081
Neg logit contributions
 favorite-0.150
attered-0.144
 possession-0.144
 favourite-0.144
 forever-0.137
Token one
Token ID530
Feature activation-2.47e-02

Pos logit contributions
 ey+0.074
useum+0.072
iles+0.072
 answers+0.070
rig+0.069
Neg logit contributions
attered-0.069
 favorite-0.068
 forever-0.068
 possession-0.066
 no-0.065
Token by
Token ID416
Feature activation-1.43e-02

Pos logit contributions
useum+0.042
 ey+0.041
esame+0.039
iles+0.039
ount+0.039
Neg logit contributions
 favorite-0.036
 possession-0.034
 favourite-0.034
 forever-0.033
attered-0.032
Token one
Token ID530
Feature activation-2.13e-02

Pos logit contributions
 ey+0.029
useum+0.029
iles+0.029
rig+0.029
 answers+0.029
Neg logit contributions
 possession-0.014
 favorite-0.013
attered-0.013
 no-0.012
 favourite-0.012
Token and
Token ID290
Feature activation-1.15e-03

Pos logit contributions
ount+0.042
useum+0.042
 ey+0.042
esame+0.041
iles+0.040
Neg logit contributions
 forever-0.023
 favorite-0.023
 possession-0.023
 favourite-0.022
 no-0.021
Token "
Token ID366
Feature activation+1.84e-02

Pos logit contributions
 ey+0.058
iles+0.057
 answers+0.055
rig+0.054
useum+0.054
Neg logit contributions
 favorite-0.025
 favourite-0.024
 nobody-0.024
 no-0.024
 forever-0.023
TokenYou
Token ID1639
Feature activation-1.05e-02

Pos logit contributions
attered+0.022
 favorite+0.021
 possession+0.021
 favourite+0.020
 forever+0.020
Neg logit contributions
 ey-0.032
 answers-0.032
iles-0.031
esame-0.031
useum-0.031
Token can
Token ID460
Feature activation-1.12e-02

Pos logit contributions
 answers+0.015
 ey+0.015
useum+0.015
iles+0.015
 say+0.015
Neg logit contributions
attered-0.016
 possession-0.015
 favorite-0.015
 favourite-0.015
 forever-0.014
Token eat
Token ID4483
Feature activation-4.14e-02

Pos logit contributions
 ey+0.017
useum+0.016
 answers+0.016
rig+0.016
iles+0.015
Neg logit contributions
 favorite-0.017
 forever-0.017
 favourite-0.016
attered-0.016
 no-0.016
Token as
Token ID355
Feature activation-2.28e-02

Pos logit contributions
 ey+0.091
rig+0.083
 explains+0.082
useum+0.081
 answers+0.081
Neg logit contributions
 favorite-0.043
 forever-0.042
 favourite-0.040
 no-0.040
 nobody-0.035
Token much
Token ID881
Feature activation-7.82e-02

Pos logit contributions
 ey+0.050
 answers+0.049
useum+0.048
esame+0.048
iles+0.048
Neg logit contributions
attered-0.018
 possession-0.018
 favorite-0.018
 favourite-0.016
 forever-0.016
Token as
Token ID355
Feature activation-1.98e-02

Pos logit contributions
 ey+0.183
rig+0.172
ount+0.166
 explains+0.166
 Voice+0.165
Neg logit contributions
 favorite-0.063
 favourite-0.055
 no-0.053
 forever-0.052
 one-0.046
Token you
Token ID345
Feature activation-4.86e-02

Pos logit contributions
 ey+0.041
 answers+0.040
esame+0.039
 explains+0.039
rig+0.039
Neg logit contributions
 favorite-0.019
 possession-0.018
 forever-0.018
 favourite-0.017
attered-0.017
Token want
Token ID765
Feature activation-2.41e-02

Pos logit contributions
 ey+0.097
irens+0.088
rig+0.087
 facts+0.085
iren+0.084
Neg logit contributions
 favorite-0.054
 forever-0.052
 no-0.049
 favourite-0.049
 whole-0.047
Token!"
Token ID2474
Feature activation-1.47e-02

Pos logit contributions
 ey+0.053
rig+0.050
useum+0.047
 slam+0.047
 explains+0.047
Neg logit contributions
 forever-0.023
 favorite-0.023
 no-0.022
 favourite-0.021
 possession-0.020
Token
Token ID198
Feature activation-2.84e-03

Pos logit contributions
iles+0.030
 ey+0.030
useum+0.028
 respond+0.028
 answers+0.028
Neg logit contributions
 favorite-0.015
 possession-0.014
 forever-0.014
 no-0.013
attered-0.013
Token!
Token ID0
Feature activation+1.29e-02

Pos logit contributions
 ey+0.064
esame+0.059
useum+0.059
 answers+0.059
 explains+0.059
Neg logit contributions
 favorite-0.028
 possession-0.026
 forever-0.026
attered-0.026
 favourite-0.024
Token I
Token ID314
Feature activation-2.07e-02

Pos logit contributions
attered+0.014
 favorite+0.014
 possession+0.014
 favourite+0.013
 nobody+0.013
Neg logit contributions
 ey-0.024
 answers-0.024
iles-0.023
esame-0.023
rig-0.023
Token can
Token ID460
Feature activation-7.71e-03

Pos logit contributions
 ey+0.030
 answers+0.029
useum+0.029
iles+0.029
esame+0.029
Neg logit contributions
attered-0.031
 possession-0.030
 favorite-0.030
 favourite-0.029
 forever-0.029
Token eat
Token ID4483
Feature activation-3.88e-02

Pos logit contributions
 ey+0.009
rig+0.009
useum+0.009
 answers+0.009
esame+0.009
Neg logit contributions
 favorite-0.014
attered-0.013
 favourite-0.013
 possession-0.013
 nobody-0.013
Token as
Token ID355
Feature activation-7.65e-03

Pos logit contributions
 ey+0.087
rig+0.082
 answers+0.080
 explains+0.079
useum+0.079
Neg logit contributions
 favorite-0.035
 forever-0.032
 favourite-0.031
 no-0.030
 nobody-0.029
Token much
Token ID881
Feature activation-7.77e-02

Pos logit contributions
 answers+0.015
 ey+0.015
useum+0.015
iles+0.015
esame+0.015
Neg logit contributions
attered-0.007
 possession-0.007
 favorite-0.006
 favourite-0.006
 forever-0.006
Token as
Token ID355
Feature activation-1.00e-02

Pos logit contributions
 ey+0.160
rig+0.158
ount+0.144
iki+0.143
esame+0.143
Neg logit contributions
 favorite-0.081
 no-0.070
 favourite-0.070
 toy-0.066
 forever-0.065
Token I
Token ID314
Feature activation-4.82e-02

Pos logit contributions
 ey+0.019
 answers+0.019
useum+0.019
iles+0.019
esame+0.019
Neg logit contributions
attered-0.011
 possession-0.010
 favorite-0.010
 favourite-0.009
 forever-0.009
Token want
Token ID765
Feature activation-1.46e-02

Pos logit contributions
 ey+0.092
 facts+0.085
nine+0.083
iles+0.083
esame+0.083
Neg logit contributions
 favorite-0.061
 favourite-0.054
 whole-0.053
 no-0.052
 forever-0.051
Token and
Token ID290
Feature activation-1.68e-03

Pos logit contributions
 ey+0.030
rig+0.029
 answers+0.028
useum+0.028
iles+0.028
Neg logit contributions
 favorite-0.014
 possession-0.013
attered-0.013
 forever-0.013
 favourite-0.013
Token no
Token ID645
Feature activation-1.28e-03

Pos logit contributions
rig+0.003
iles+0.002
 ey+0.002
useum+0.002
esame+0.002
Neg logit contributions
 no-0.003
 favorite-0.003
 nobody-0.003
 favourite-0.002
 forever-0.002
Token.
Token ID13
Feature activation-3.28e-03

Pos logit contributions
 ey+0.082
useum+0.079
iles+0.077
rig+0.075
 answers+0.075
Neg logit contributions
 forever-0.052
 favorite-0.050
attered-0.049
 no-0.049
 possession-0.049
Token He
Token ID679
Feature activation-9.11e-03

Pos logit contributions
iles+0.006
 ey+0.006
 perk+0.006
 answers+0.006
aches+0.006
Neg logit contributions
 favorite-0.004
attered-0.004
 possession-0.004
 favourite-0.003
 nobody-0.003
Token pl
Token ID458
Feature activation-9.23e-03

Pos logit contributions
 ey+0.017
useum+0.016
iles+0.016
esame+0.016
ount+0.015
Neg logit contributions
 favorite-0.012
 favourite-0.011
 no-0.011
 nobody-0.011
 forever-0.011
Tokenucked
Token ID17758
Feature activation-5.75e-03

Pos logit contributions
 answers+0.020
 ey+0.020
 calls+0.019
useum+0.019
 facts+0.019
Neg logit contributions
attered-0.010
 possession-0.007
 favourite-0.007
 no-0.006
 nobody-0.006
Token the
Token ID262
Feature activation-4.27e-02

Pos logit contributions
 ey+0.010
useum+0.010
 answers+0.010
rig+0.010
iles+0.010
Neg logit contributions
attered-0.007
 favorite-0.007
 possession-0.007
 favourite-0.007
 nobody-0.006
Token ripe
Token ID29036
Feature activation-7.61e-02

Pos logit contributions
 ey+0.062
iles+0.060
useum+0.060
rig+0.058
 explains+0.057
Neg logit contributions
 favorite-0.066
 favourite-0.063
attered-0.062
 possession-0.061
 whole-0.059
Token l
Token ID300
Feature activation-4.90e-02

Pos logit contributions
rig+0.108
 ey+0.104
useum+0.103
 explains+0.100
iles+0.100
Neg logit contributions
 favorite-0.119
attered-0.115
 favourite-0.113
 possession-0.113
 toy-0.107
Tokenimes
Token ID999
Feature activation-5.00e-02

Pos logit contributions
 say+0.044
 ey+0.044
 answers+0.043
useum+0.043
 explains+0.043
Neg logit contributions
attered-0.101
 possession-0.096
 favorite-0.095
 favourite-0.095
 perfectly-0.092
Token from
Token ID422
Feature activation+1.47e-03

Pos logit contributions
 ey+0.080
useum+0.078
iles+0.077
 answers+0.075
rig+0.074
Neg logit contributions
attered-0.070
 forever-0.068
 favorite-0.068
 possession-0.067
 no-0.066
Token the
Token ID262
Feature activation-1.86e-02

Pos logit contributions
 possession+0.002
 favorite+0.002
 nobody+0.002
 no+0.002
 forever+0.002
Neg logit contributions
useum-0.003
 ey-0.003
rig-0.003
ount-0.003
esame-0.003
Token trees
Token ID7150
Feature activation-2.21e-02

Pos logit contributions
iles+0.029
rig+0.029
 explains+0.029
useum+0.028
rimp+0.028
Neg logit contributions
 favorite-0.027
 favourite-0.026
 whole-0.026
 no-0.026
 possession-0.025
Token,
Token ID11
Feature activation-1.67e-02

Pos logit contributions
rig+0.059
useum+0.056
 ey+0.056
esame+0.054
ount+0.053
Neg logit contributions
 no-0.028
 nobody-0.027
 favorite-0.027
 forever-0.025
 favourite-0.024
Token there
Token ID612
Feature activation+4.50e-03

Pos logit contributions
 ey+0.031
rig+0.030
useum+0.030
 answers+0.030
iles+0.030
Neg logit contributions
 favorite-0.018
attered-0.018
 possession-0.018
 nobody-0.017
 favourite-0.017
Token were
Token ID547
Feature activation-3.15e-02

Pos logit contributions
 favorite+0.005
 nobody+0.005
 no+0.004
 favourite+0.004
 possession+0.004
Neg logit contributions
rig-0.009
useum-0.009
 ey-0.009
esame-0.009
iles-0.008
Token lots
Token ID6041
Feature activation-4.69e-02

Pos logit contributions
 ey+0.073
rig+0.073
esame+0.072
useum+0.072
 answers+0.072
Neg logit contributions
 favorite-0.021
attered-0.020
 favourite-0.019
 no-0.019
 nobody-0.018
Token of
Token ID286
Feature activation-5.18e-02

Pos logit contributions
esame+0.098
 explains+0.095
rig+0.095
 respond+0.095
 approach+0.095
Neg logit contributions
 favorite-0.043
 toy-0.036
 favourite-0.036
 possession-0.035
attered-0.034
Token juicy
Token ID35075
Feature activation-7.56e-02

Pos logit contributions
esame+0.095
rig+0.094
rimp+0.093
olo+0.088
ount+0.087
Neg logit contributions
 favorite-0.073
 best-0.068
 favourite-0.066
 toy-0.064
 biggest-0.061
Token grapes
Token ID36656
Feature activation-3.06e-02

Pos logit contributions
rig+0.120
 respond+0.107
 explains+0.104
rimp+0.103
esame+0.100
Neg logit contributions
 favorite-0.124
 favourite-0.113
 toy-0.111
 possession-0.111
attered-0.110
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
 ey+0.056
 answers+0.053
esame+0.053
rig+0.053
useum+0.053
Neg logit contributions
attered-0.036
 favorite-0.035
 possession-0.035
 forever-0.034
 favourite-0.033
Token
Token ID198
Feature activation-6.19e-03

Pos logit contributions
 ey+0.024
rig+0.023
iles+0.023
aches+0.022
 answers+0.022
Neg logit contributions
 favorite-0.010
 favourite-0.010
 nobody-0.009
 no-0.008
 forever-0.008
Token
Token ID198
Feature activation+5.06e-03

Pos logit contributions
 ey+0.010
 answers+0.009
useum+0.009
iles+0.009
esame+0.009
Neg logit contributions
 favorite-0.009
 possession-0.009
attered-0.009
 favourite-0.008
 forever-0.008
TokenA
Token ID32
Feature activation-5.36e-03

Pos logit contributions
attered+0.006
 favorite+0.006
 possession+0.006
 nobody+0.006
 favourite+0.006
Neg logit contributions
 ey-0.009
 answers-0.009
esame-0.009
iles-0.009
 say-0.008
Token pond
Token ID16723
Feature activation-1.87e-02

Pos logit contributions
 ey+0.042
esame+0.042
rig+0.041
useum+0.041
iles+0.041
Neg logit contributions
 forever-0.035
 favorite-0.033
 possession-0.033
 no-0.032
 toy-0.032
Token with
Token ID351
Feature activation-2.30e-02

Pos logit contributions
 ey+0.034
 answers+0.032
esame+0.032
useum+0.032
iles+0.032
Neg logit contributions
attered-0.023
 possession-0.022
 favorite-0.022
 forever-0.022
 favourite-0.021
Token lots
Token ID6041
Feature activation-4.45e-02

Pos logit contributions
 ey+0.041
rig+0.039
esame+0.039
useum+0.039
 answers+0.038
Neg logit contributions
 favorite-0.028
 no-0.027
attered-0.027
 nobody-0.027
 possession-0.027
Token of
Token ID286
Feature activation-5.46e-02

Pos logit contributions
 ey+0.087
esame+0.085
 say+0.084
 explains+0.083
rig+0.083
Neg logit contributions
 forever-0.043
 possession-0.042
attered-0.042
 favorite-0.041
 favourite-0.037
Token y
Token ID331
Feature activation-5.13e-02

Pos logit contributions
esame+0.099
rig+0.095
useum+0.087
 ey+0.086
utt+0.085
Neg logit contributions
 favorite-0.075
 favourite-0.071
 toy-0.070
 lot-0.066
 possession-0.065
Tokenummy
Token ID13513
Feature activation-7.55e-02

Pos logit contributions
 say+0.108
 answers+0.107
 explains+0.107
rig+0.106
 ey+0.105
Neg logit contributions
attered-0.045
 possession-0.038
 favorite-0.037
 forever-0.036
 favourite-0.034
Token bugs
Token ID11316
Feature activation-4.16e-02

Pos logit contributions
rig+0.150
utt+0.140
 slam+0.139
esame+0.134
 ey+0.132
Neg logit contributions
 toy-0.088
 favorite-0.084
 favourite-0.076
 lot-0.075
 whole-0.074
Token and
Token ID290
Feature activation-2.51e-02

Pos logit contributions
 ey+0.079
useum+0.075
esame+0.075
 answers+0.074
rig+0.073
Neg logit contributions
 forever-0.046
attered-0.045
 possession-0.044
 favorite-0.044
 favourite-0.041
Token fish
Token ID5916
Feature activation-2.80e-02

Pos logit contributions
esame+0.034
rig+0.034
 ey+0.033
useum+0.033
 explains+0.032
Neg logit contributions
 favorite-0.042
 no-0.041
 favourite-0.041
 nobody-0.040
 possession-0.039
Token."
Token ID526
Feature activation-6.96e-03

Pos logit contributions
 ey+0.056
esame+0.053
rig+0.052
 answers+0.052
useum+0.052
Neg logit contributions
attered-0.030
 forever-0.029
 favorite-0.029
 possession-0.029
 favourite-0.026
Token
Token ID198
Feature activation-1.94e-03

Pos logit contributions
iles+0.015
 ey+0.015
useum+0.014
rig+0.014
 respond+0.014
Neg logit contributions
 favorite-0.006
 favourite-0.006
 possession-0.006
 no-0.006
 nobody-0.006
Token perfect
Token ID2818
Feature activation-2.30e-02

Pos logit contributions
 ey+0.033
esame+0.032
rig+0.031
useum+0.030
 explains+0.030
Neg logit contributions
 favorite-0.036
 favourite-0.034
 best-0.032
 whole-0.031
 forever-0.031
Token bush
Token ID24484
Feature activation-1.12e-02

Pos logit contributions
esame+0.033
 ey+0.032
 explains+0.031
rig+0.031
useum+0.031
Neg logit contributions
 favorite-0.038
 possession-0.036
 favourite-0.036
 forever-0.035
attered-0.035
Token with
Token ID351
Feature activation-5.85e-03

Pos logit contributions
 ey+0.020
useum+0.019
esame+0.019
 answers+0.019
iles+0.019
Neg logit contributions
attered-0.014
 favorite-0.014
 possession-0.013
 forever-0.013
 favourite-0.013
Token lots
Token ID6041
Feature activation-3.77e-02

Pos logit contributions
 ey+0.010
useum+0.010
esame+0.010
rig+0.010
 answers+0.010
Neg logit contributions
 favorite-0.007
attered-0.007
 possession-0.007
 favourite-0.007
 nobody-0.007
Token of
Token ID286
Feature activation-4.29e-02

Pos logit contributions
 ey+0.071
esame+0.070
useum+0.068
iles+0.068
rig+0.068
Neg logit contributions
 favorite-0.039
 possession-0.039
 forever-0.039
attered-0.038
 favourite-0.037
Token juicy
Token ID35075
Feature activation-7.50e-02

Pos logit contributions
esame+0.068
rig+0.065
useum+0.062
olo+0.062
 ey+0.061
Neg logit contributions
 favorite-0.070
 favourite-0.068
 lot-0.061
 possession-0.061
 toy-0.061
Token berries
Token ID36322
Feature activation-4.08e-02

Pos logit contributions
rig+0.104
 explains+0.090
 slam+0.089
esame+0.089
 ey+0.087
Neg logit contributions
 favorite-0.134
 favourite-0.127
 possession-0.122
attered-0.122
 toy-0.121
Token.
Token ID13
Feature activation+2.92e-03

Pos logit contributions
 ey+0.084
useum+0.078
 answers+0.078
rig+0.077
 question+0.077
Neg logit contributions
attered-0.043
 forever-0.042
 favorite-0.041
 possession-0.039
 favourite-0.038
Token Little
Token ID7703
Feature activation+8.12e-03

Pos logit contributions
attered+0.004
 possession+0.004
 favorite+0.004
 favourite+0.004
 forever+0.004
Neg logit contributions
 ey-0.004
 answers-0.004
useum-0.004
esame-0.004
 say-0.004
Token Tim
Token ID5045
Feature activation+1.39e-02

Pos logit contributions
 favorite+0.010
attered+0.010
 possession+0.009
 favourite+0.009
 no+0.009
Neg logit contributions
 ey-0.015
rig-0.014
 answers-0.014
useum-0.014
esame-0.014
Token was
Token ID373
Feature activation-5.74e-03

Pos logit contributions
 favorite+0.019
attered+0.019
 favourite+0.018
 possession+0.018
 forever+0.018
Neg logit contributions
useum-0.023
 ey-0.023
esame-0.022
 answers-0.022
ount-0.021
Token both
Token ID1111
Feature activation+1.20e-02

Pos logit contributions
attered+0.006
 favorite+0.006
 possession+0.006
 favourite+0.006
 forever+0.006
Neg logit contributions
 ey-0.008
 answers-0.008
useum-0.008
iles-0.008
esame-0.008
Token took
Token ID1718
Feature activation+1.71e-02

Pos logit contributions
attered+0.018
 favorite+0.017
 possession+0.017
 forever+0.016
 favourite+0.016
Neg logit contributions
 ey-0.019
useum-0.018
esame-0.018
 answers-0.018
iles-0.017
Token some
Token ID617
Feature activation-4.00e-02

Pos logit contributions
attered+0.022
 favorite+0.021
 possession+0.021
 favourite+0.020
 forever+0.020
Neg logit contributions
 ey-0.029
useum-0.028
 answers-0.028
iles-0.028
esame-0.027
Token of
Token ID286
Feature activation-1.20e-02

Pos logit contributions
 ey+0.065
avi+0.062
olo+0.060
rimp+0.060
 Dan+0.059
Neg logit contributions
 favorite-0.063
 favourite-0.060
 toy-0.058
 proper-0.054
 whole-0.054
Token the
Token ID262
Feature activation-3.79e-02

Pos logit contributions
 ey+0.022
useum+0.022
rig+0.021
esame+0.021
 answers+0.020
Neg logit contributions
 favorite-0.014
 possession-0.014
 no-0.013
 favourite-0.013
 nobody-0.013
Token juicy
Token ID35075
Feature activation-7.44e-02

Pos logit contributions
 ey+0.035
esame+0.033
rig+0.033
 answers+0.032
iles+0.032
Neg logit contributions
 favorite-0.080
 favourite-0.078
attered-0.076
 possession-0.074
 forever-0.074
Token berries
Token ID36322
Feature activation-3.26e-02

Pos logit contributions
rig+0.079
 ey+0.073
 explains+0.072
esame+0.072
iles+0.070
Neg logit contributions
 favorite-0.150
 favourite-0.144
attered-0.143
 possession-0.140
 toy-0.138
Token.
Token ID13
Feature activation-7.84e-03

Pos logit contributions
 ey+0.054
useum+0.052
 answers+0.052
iles+0.051
esame+0.050
Neg logit contributions
attered-0.045
 possession-0.043
 favorite-0.043
 forever-0.042
 favourite-0.041
Token But
Token ID887
Feature activation-3.19e-03

Pos logit contributions
 ey+0.017
iles+0.016
 answers+0.016
rimp+0.015
 perk+0.015
Neg logit contributions
attered-0.008
 favorite-0.007
 nobody-0.007
 favourite-0.007
 no-0.007
Token from
Token ID422
Feature activation-2.77e-03

Pos logit contributions
 ey+0.006
useum+0.006
 answers+0.006
rig+0.006
rimp+0.006
Neg logit contributions
 nobody-0.004
 no-0.004
 favorite-0.004
 forever-0.003
 possession-0.003
Token nowhere
Token ID12062
Feature activation-3.73e-03

Pos logit contributions
 ey+0.005
useum+0.005
esame+0.005
rimp+0.005
 answers+0.005
Neg logit contributions
 possession-0.004
 favorite-0.003
 nobody-0.003
 favourite-0.003
attered-0.003
Token bear
Token ID6842
Feature activation-1.26e-02

Pos logit contributions
 ey+0.005
 answers+0.005
iles+0.004
rig+0.004
esame+0.004
Neg logit contributions
 favorite-0.003
 favourite-0.003
attered-0.003
 possession-0.003
 forever-0.003
Token brought
Token ID3181
Feature activation+5.17e-03

Pos logit contributions
 ey+0.024
esame+0.022
useum+0.022
 answers+0.022
iles+0.021
Neg logit contributions
 favorite-0.016
attered-0.015
 favourite-0.015
 possession-0.014
 forever-0.014
Token lots
Token ID6041
Feature activation-4.38e-02

Pos logit contributions
attered+0.008
 possession+0.008
 nobody+0.007
 forever+0.007
 favorite+0.007
Neg logit contributions
 answers-0.007
iles-0.007
 say-0.007
esame-0.007
 ey-0.007
Token of
Token ID286
Feature activation-5.03e-02

Pos logit contributions
 ey+0.068
 answers+0.067
useum+0.066
esame+0.065
iles+0.064
Neg logit contributions
attered-0.062
 favorite-0.061
 possession-0.060
 favourite-0.058
 forever-0.056
Token y
Token ID331
Feature activation-4.77e-02

Pos logit contributions
esame+0.091
useum+0.091
 ey+0.091
 slam+0.090
rig+0.090
Neg logit contributions
 favorite-0.067
 favourite-0.061
 toy-0.057
 best-0.053
 proper-0.051
Tokenummy
Token ID13513
Feature activation-7.44e-02

Pos logit contributions
 ey+0.093
 answers+0.092
useum+0.091
esame+0.090
 say+0.090
Neg logit contributions
attered-0.048
 possession-0.046
 favorite-0.045
 favourite-0.043
 forever-0.041
Token food
Token ID2057
Feature activation-3.04e-02

Pos logit contributions
rig+0.129
 ey+0.125
 slam+0.123
 respond+0.120
esame+0.118
Neg logit contributions
 favorite-0.103
 favourite-0.095
 toy-0.093
 whole-0.086
 possession-0.085
Token and
Token ID290
Feature activation-2.33e-03

Pos logit contributions
 ey+0.057
useum+0.054
iles+0.052
rig+0.052
esame+0.052
Neg logit contributions
 favorite-0.038
 forever-0.036
 possession-0.036
 favourite-0.035
attered-0.035
Token shared
Token ID4888
Feature activation-5.52e-03

Pos logit contributions
 ey+0.004
rig+0.003
useum+0.003
 answers+0.003
esame+0.003
Neg logit contributions
 favorite-0.004
 favourite-0.003
attered-0.003
 no-0.003
 possession-0.003
Token it
Token ID340
Feature activation+1.25e-03

Pos logit contributions
 ey+0.011
useum+0.010
rig+0.010
 answers+0.010
esame+0.010
Neg logit contributions
 favorite-0.006
 possession-0.006
 favourite-0.006
attered-0.006
 forever-0.006
Token with
Token ID351
Feature activation+1.57e-02

Pos logit contributions
attered+0.002
 possession+0.002
 nobody+0.002
 favorite+0.002
 favourite+0.002
Neg logit contributions
 answers-0.002
 say-0.002
rimp-0.002
rig-0.002
iles-0.002
Token jumped
Token ID11687
Feature activation-2.16e-02

Pos logit contributions
attered+0.011
 favorite+0.010
 favourite+0.010
 possession+0.010
 forever+0.010
Neg logit contributions
 ey-0.015
useum-0.015
esame-0.015
 answers-0.015
iles-0.015
Token in
Token ID287
Feature activation-1.36e-02

Pos logit contributions
 ey+0.040
iles+0.039
 answers+0.039
useum+0.039
esame+0.038
Neg logit contributions
attered-0.025
 favorite-0.024
 possession-0.023
 favourite-0.023
 forever-0.023
Token and
Token ID290
Feature activation+7.51e-03

Pos logit contributions
iles+0.019
 ey+0.019
useum+0.019
esame+0.018
 answers+0.018
Neg logit contributions
 possession-0.021
 favorite-0.021
attered-0.021
 forever-0.020
 favourite-0.020
Token drank
Token ID24070
Feature activation-1.82e-02

Pos logit contributions
attered+0.013
 possession+0.013
 favorite+0.013
 favourite+0.012
 forever+0.012
Neg logit contributions
 say-0.009
 ey-0.009
 answers-0.008
esame-0.008
useum-0.008
Token as
Token ID355
Feature activation-5.29e-03

Pos logit contributions
 ey+0.032
 answers+0.032
useum+0.032
iles+0.032
esame+0.031
Neg logit contributions
attered-0.022
 favorite-0.021
 possession-0.021
 favourite-0.020
 forever-0.019
Token much
Token ID881
Feature activation-7.42e-02

Pos logit contributions
 answers+0.008
 Jen+0.008
esame+0.008
iles+0.008
 say+0.008
Neg logit contributions
attered-0.007
 possession-0.006
 favourite-0.006
 favorite-0.006
dates-0.006
Token as
Token ID355
Feature activation-6.96e-03

Pos logit contributions
 ey+0.146
rig+0.141
 explains+0.137
iles+0.137
 answers+0.135
Neg logit contributions
 favorite-0.079
 favourite-0.072
 no-0.071
attered-0.068
 forever-0.066
Token she
Token ID673
Feature activation-1.37e-02

Pos logit contributions
 answers+0.010
 say+0.010
esame+0.010
iles+0.010
useum+0.010
Neg logit contributions
attered-0.011
 possession-0.009
 favorite-0.009
 favourite-0.008
 model-0.008
Token wanted
Token ID2227
Feature activation-1.34e-02

Pos logit contributions
 ey+0.025
 answers+0.024
useum+0.024
esame+0.024
rig+0.023
Neg logit contributions
 favorite-0.017
attered-0.017
 favourite-0.016
 forever-0.015
 possession-0.015
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
 ey+0.029
rig+0.029
iles+0.028
useum+0.028
 explains+0.027
Neg logit contributions
 no-0.013
 forever-0.013
 favorite-0.013
 favourite-0.011
 nobody-0.011
Token She
Token ID1375
Feature activation+2.08e-03

Pos logit contributions
attered+0.034
 possession+0.034
 model+0.032
 nobody+0.031
 forever+0.031
Neg logit contributions
 calls-0.015
 answers-0.015
 ey-0.015
useum-0.014
 opens-0.014
Token.
Token ID13
Feature activation+2.84e-03

Pos logit contributions
 ey+0.012
 answers+0.012
useum+0.012
 say+0.012
iles+0.012
Neg logit contributions
attered-0.011
 possession-0.011
 favorite-0.011
 favourite-0.010
 forever-0.010
Token It
Token ID632
Feature activation-7.06e-03

Pos logit contributions
attered+0.003
 favorite+0.003
 no+0.003
 nobody+0.002
 possession+0.002
Neg logit contributions
rig-0.006
 answers-0.006
 ey-0.006
 explains-0.006
iles-0.006
Token was
Token ID373
Feature activation-2.54e-02

Pos logit contributions
 say+0.009
 ey+0.009
iles+0.009
 answers+0.009
 calls+0.008
Neg logit contributions
 possession-0.012
attered-0.012
 forever-0.011
 favorite-0.011
 favourite-0.011
Token full
Token ID1336
Feature activation-2.69e-02

Pos logit contributions
 ey+0.052
esame+0.052
 answers+0.052
useum+0.052
iles+0.051
Neg logit contributions
 favorite-0.024
attered-0.023
 favourite-0.022
 possession-0.022
 no-0.021
Token of
Token ID286
Feature activation-5.59e-02

Pos logit contributions
 ey+0.046
 answers+0.044
esame+0.044
useum+0.043
iles+0.043
Neg logit contributions
 favorite-0.035
attered-0.034
 forever-0.033
 possession-0.033
 favourite-0.033
Token tasty
Token ID25103
Feature activation-7.42e-02

Pos logit contributions
esame+0.101
rig+0.098
useum+0.093
oke+0.092
 explains+0.091
Neg logit contributions
 favorite-0.080
 best-0.073
 toy-0.072
 favourite-0.071
 no-0.069
Token fish
Token ID5916
Feature activation-2.44e-02

Pos logit contributions
rig+0.119
 explains+0.116
 respond+0.115
 slam+0.114
esame+0.114
Neg logit contributions
 favorite-0.113
 toy-0.102
 favourite-0.101
 whole-0.097
 best-0.096
Token.
Token ID13
Feature activation+1.30e-03

Pos logit contributions
 ey+0.039
 answers+0.038
useum+0.038
esame+0.037
iles+0.037
Neg logit contributions
attered-0.034
 possession-0.033
 favorite-0.032
 favourite-0.031
 forever-0.030
Token All
Token ID1439
Feature activation-6.88e-03

Pos logit contributions
 favorite+0.001
 nobody+0.001
 favourite+0.001
 no+0.001
attered+0.001
Neg logit contributions
 ey-0.003
rig-0.003
iles-0.003
 answers-0.003
esame-0.003
Token day
Token ID1110
Feature activation-1.40e-02

Pos logit contributions
 ey+0.009
 answers+0.009
iles+0.009
useum+0.009
 say+0.009
Neg logit contributions
attered-0.011
 possession-0.011
 favorite-0.011
 favourite-0.010
 forever-0.010
Token long
Token ID890
Feature activation-4.58e-02

Pos logit contributions
useum+0.019
 answers+0.019
iles+0.019
 say+0.019
esame+0.019
Neg logit contributions
attered-0.022
 possession-0.021
 favorite-0.020
 favourite-0.020
 nobody-0.019
Token,
Token ID11
Feature activation-2.63e-02

Pos logit contributions
 ey+0.084
useum+0.078
esame+0.077
rig+0.076
ount+0.075
Neg logit contributions
 favorite-0.061
 favourite-0.057
 possession-0.056
 forever-0.056
attered-0.054
Token so
Token ID523
Feature activation-2.18e-03

Pos logit contributions
iles+0.049
useum+0.048
 ey+0.047
esame+0.047
rig+0.046
Neg logit contributions
 favorite-0.034
 no-0.033
 favourite-0.031
 forever-0.029
 nobody-0.029
Token she
Token ID673
Feature activation-6.65e-03

Pos logit contributions
 say+0.003
 answers+0.003
useum+0.003
esame+0.003
 ey+0.003
Neg logit contributions
attered-0.003
 possession-0.003
 favorite-0.003
 favourite-0.003
 forever-0.003
Token ate
Token ID15063
Feature activation-2.34e-02

Pos logit contributions
 ey+0.011
esame+0.010
 answers+0.010
useum+0.010
iles+0.010
Neg logit contributions
attered-0.009
 favorite-0.009
 favourite-0.009
 nobody-0.009
 forever-0.008
Token as
Token ID355
Feature activation-1.08e-02

Pos logit contributions
 ey+0.049
useum+0.048
rig+0.048
iles+0.047
 answers+0.047
Neg logit contributions
 favorite-0.021
attered-0.020
 favourite-0.019
 possession-0.019
 forever-0.019
Token much
Token ID881
Feature activation-7.39e-02

Pos logit contributions
 answers+0.018
iles+0.018
esame+0.018
 ey+0.017
aches+0.017
Neg logit contributions
attered-0.014
 possession-0.012
 favorite-0.011
 favourite-0.011
 forever-0.011
Token as
Token ID355
Feature activation-1.59e-02

Pos logit contributions
 ey+0.154
rig+0.144
ount+0.141
 slam+0.141
useum+0.139
Neg logit contributions
 favorite-0.075
 favourite-0.069
 no-0.063
 forever-0.061
 possession-0.061
Token she
Token ID673
Feature activation-1.72e-02

Pos logit contributions
 answers+0.025
iles+0.025
esame+0.024
 ey+0.024
 say+0.024
Neg logit contributions
attered-0.023
 possession-0.020
 favorite-0.019
 favourite-0.019
 forever-0.018
Token wanted
Token ID2227
Feature activation-1.03e-02

Pos logit contributions
 ey+0.032
 respond+0.030
rig+0.030
 answers+0.030
useum+0.030
Neg logit contributions
attered-0.021
 favorite-0.021
 favourite-0.020
 no-0.019
 forever-0.019
Token.
Token ID13
Feature activation-2.92e-03

Pos logit contributions
 ey+0.022
rig+0.022
useum+0.021
iles+0.021
ount+0.020
Neg logit contributions
 no-0.010
 forever-0.010
 favorite-0.010
 favourite-0.009
 possession-0.009
Token
Token ID198
Feature activation-1.76e-03

Pos logit contributions
 ey+0.004
 answers+0.004
useum+0.004
iles+0.004
esame+0.004
Neg logit contributions
attered-0.005
 possession-0.004
 favorite-0.004
 favourite-0.004
 forever-0.004
Token glad
Token ID9675
Feature activation+5.33e-03

Pos logit contributions
attered+0.001
 possession+0.001
 favorite+0.001
 favourite+0.001
 forever+0.001
Neg logit contributions
 answers-0.001
 ey-0.001
 say-0.001
useum-0.001
esame-0.001
Token I
Token ID314
Feature activation-1.04e-02

Pos logit contributions
attered+0.006
 possession+0.006
 favorite+0.006
 favourite+0.005
 forever+0.005
Neg logit contributions
 ey-0.010
 answers-0.010
useum-0.010
iles-0.010
esame-0.010
Token could
Token ID714
Feature activation-1.20e-02

Pos logit contributions
 ey+0.021
useum+0.020
iles+0.019
esame+0.019
rig+0.019
Neg logit contributions
 forever-0.011
 favorite-0.010
 no-0.010
attered-0.010
 nobody-0.010
Token help
Token ID1037
Feature activation+8.21e-03

Pos logit contributions
 ey+0.020
useum+0.019
iles+0.018
rig+0.017
esame+0.017
Neg logit contributions
 forever-0.019
 no-0.018
 favorite-0.017
 favourite-0.017
 nobody-0.017
Token the
Token ID262
Feature activation+2.79e-03

Pos logit contributions
 possession+0.008
 forever+0.008
 favorite+0.008
attered+0.008
 favourite+0.008
Neg logit contributions
 ey-0.016
useum-0.016
iles-0.015
rig-0.015
 answers-0.015
Token tad
Token ID36264
Feature activation-7.38e-02

Pos logit contributions
 favorite+0.004
 favourite+0.003
 whole+0.003
 possession+0.003
 nobody+0.003
Neg logit contributions
 ey-0.005
rig-0.005
ount-0.004
esame-0.004
iles-0.004
Tokenp
Token ID79
Feature activation-3.23e-02

Pos logit contributions
 ey+0.117
useum+0.106
esame+0.105
iles+0.099
 explains+0.098
Neg logit contributions
 favorite-0.108
attered-0.107
 favourite-0.103
 forever-0.103
 possession-0.103
Tokenoles
Token ID4316
Feature activation-3.65e-02

Pos logit contributions
 ey+0.060
useum+0.055
 answers+0.054
esame+0.054
iles+0.053
Neg logit contributions
 forever-0.039
 no-0.038
 nobody-0.037
 favorite-0.037
 favourite-0.036
Token".
Token ID1911
Feature activation+3.54e-03

Pos logit contributions
 ey+0.072
useum+0.069
iles+0.066
esame+0.066
 answers+0.065
Neg logit contributions
 forever-0.041
 favorite-0.038
 no-0.036
attered-0.036
 favourite-0.036
Token<|endoftext|>
Token ID50256
Feature activation-7.40e-03

Pos logit contributions
attered+0.005
 favorite+0.005
 possession+0.005
 favourite+0.005
 forever+0.005
Neg logit contributions
 ey-0.005
iles-0.005
 answers-0.005
useum-0.005
rig-0.005
TokenOne
Token ID3198
Feature activation+1.00e-02

Pos logit contributions
 ey+0.011
rig+0.011
 answers+0.011
iles+0.010
 facts+0.010
Neg logit contributions
attered-0.010
 forever-0.010
 possession-0.010
 whole-0.010
 favorite-0.010
Token They
Token ID1119
Feature activation-1.78e-02

Pos logit contributions
 ey+0.026
iles+0.025
 answers+0.025
esame+0.024
useum+0.023
Neg logit contributions
 favorite-0.015
attered-0.015
 favourite-0.014
 possession-0.014
 nobody-0.014
Token brought
Token ID3181
Feature activation+4.88e-03

Pos logit contributions
 ey+0.032
useum+0.030
rig+0.030
esame+0.030
 answers+0.029
Neg logit contributions
 favorite-0.023
attered-0.022
 favourite-0.021
 forever-0.021
 no-0.021
Token lots
Token ID6041
Feature activation-4.35e-02

Pos logit contributions
attered+0.008
 possession+0.007
 nobody+0.007
 forever+0.007
 favorite+0.006
Neg logit contributions
 answers-0.007
 say-0.007
iles-0.007
esame-0.006
 ey-0.006
Token of
Token ID286
Feature activation-5.01e-02

Pos logit contributions
 ey+0.069
 answers+0.068
useum+0.067
esame+0.066
iles+0.065
Neg logit contributions
 favorite-0.060
attered-0.059
 possession-0.058
 favourite-0.057
 forever-0.054
Token y
Token ID331
Feature activation-4.68e-02

Pos logit contributions
 ey+0.095
esame+0.094
useum+0.094
 slam+0.093
 respond+0.092
Neg logit contributions
 favorite-0.064
 favourite-0.059
 toy-0.053
 best-0.049
 proper-0.049
Tokenummy
Token ID13513
Feature activation-7.36e-02

Pos logit contributions
 ey+0.091
 answers+0.090
useum+0.088
esame+0.088
 say+0.087
Neg logit contributions
attered-0.048
 possession-0.045
 favorite-0.045
 favourite-0.043
 forever-0.041
Token food
Token ID2057
Feature activation-3.11e-02

Pos logit contributions
rig+0.129
 ey+0.128
 slam+0.126
 respond+0.122
utt+0.121
Neg logit contributions
 favorite-0.100
 favourite-0.092
 toy-0.089
 whole-0.081
 possession-0.080
Token and
Token ID290
Feature activation-4.88e-03

Pos logit contributions
 ey+0.058
useum+0.054
iles+0.052
esame+0.052
 answers+0.051
Neg logit contributions
 favorite-0.041
 forever-0.039
 possession-0.038
 favourite-0.038
attered-0.037
Token drinks
Token ID11758
Feature activation-7.92e-03

Pos logit contributions
 ey+0.008
rig+0.008
useum+0.008
esame+0.007
iles+0.007
Neg logit contributions
 favorite-0.007
 favourite-0.007
 no-0.007
attered-0.007
 nobody-0.006
Token.
Token ID13
Feature activation-9.54e-03

Pos logit contributions
 ey+0.016
iles+0.015
useum+0.015
rig+0.014
esame+0.014
Neg logit contributions
 favorite-0.009
 forever-0.008
 favourite-0.008
attered-0.008
 no-0.008
Token
Token ID198
Feature activation-8.72e-03

Pos logit contributions
 ey+0.018
iles+0.017
 answers+0.017
esame+0.016
useum+0.016
Neg logit contributions
 favorite-0.012
attered-0.012
 favourite-0.011
 possession-0.011
 nobody-0.011
Token "
Token ID366
Feature activation+1.46e-02

Pos logit contributions
 ey+0.062
iles+0.060
 answers+0.058
rig+0.058
esame+0.057
Neg logit contributions
 nobody-0.021
 favorite-0.021
 no-0.021
 favourite-0.019
 forever-0.019
TokenWhy
Token ID5195
Feature activation-3.48e-03

Pos logit contributions
attered+0.022
 possession+0.022
 favorite+0.022
 favourite+0.021
 nobody+0.020
Neg logit contributions
 ey-0.021
 answers-0.021
useum-0.020
iles-0.020
esame-0.020
Token hello
Token ID23748
Feature activation-5.67e-03

Pos logit contributions
 ey+0.006
esame+0.005
rig+0.005
useum+0.005
 answers+0.005
Neg logit contributions
 favorite-0.005
 no-0.005
 nobody-0.005
 forever-0.005
 favourite-0.005
Token there
Token ID612
Feature activation-4.30e-03

Pos logit contributions
 ey+0.011
 answers+0.010
esame+0.010
iles+0.010
useum+0.009
Neg logit contributions
 favorite-0.007
attered-0.007
 favourite-0.007
 possession-0.007
 forever-0.007
Token blue
Token ID4171
Feature activation-4.93e-02

Pos logit contributions
 ey+0.009
useum+0.009
esame+0.008
rig+0.008
nine+0.008
Neg logit contributions
 favorite-0.004
 nobody-0.004
 no-0.004
 forever-0.004
 favourite-0.004
Tokenberry
Token ID8396
Feature activation-7.33e-02

Pos logit contributions
rig+0.080
 slam+0.079
 ey+0.074
 answers+0.073
 til+0.073
Neg logit contributions
 favorite-0.063
 favourite-0.061
attered-0.060
 no-0.059
 forever-0.059
Token!
Token ID0
Feature activation+4.45e-03

Pos logit contributions
 ey+0.136
 answers+0.126
nine+0.125
 hes+0.125
 explains+0.122
Neg logit contributions
 forever-0.083
 favorite-0.082
 no-0.081
 nobody-0.081
time-0.080
Token What
Token ID1867
Feature activation+7.40e-03

Pos logit contributions
 nobody+0.004
 no+0.004
 favorite+0.004
 favourite+0.003
attered+0.003
Neg logit contributions
 ey-0.010
iles-0.009
 answers-0.009
aches-0.009
rig-0.009
Token brings
Token ID6774
Feature activation-1.12e-02

Pos logit contributions
 favorite+0.010
 no+0.010
 nobody+0.009
 forever+0.009
 favourite+0.009
Neg logit contributions
 ey-0.013
esame-0.012
rig-0.012
useum-0.012
ount-0.012
Token you
Token ID345
Feature activation-4.20e-02

Pos logit contributions
 ey+0.026
useum+0.024
rig+0.024
 slam+0.024
esame+0.024
Neg logit contributions
 favorite-0.009
 forever-0.008
 no-0.008
 favourite-0.008
 nobody-0.008
Token here
Token ID994
Feature activation-1.58e-02

Pos logit contributions
 ey+0.083
useum+0.080
esame+0.078
iles+0.078
 answers+0.077
Neg logit contributions
 favorite-0.049
 forever-0.048
 favourite-0.045
 no-0.044
 possession-0.042
Token little
Token ID1310
Feature activation-3.65e-03

Pos logit contributions
 ey+0.042
iles+0.041
 explains+0.039
esame+0.039
useum+0.039
Neg logit contributions
 favorite-0.027
 possession-0.027
 forever-0.027
 favourite-0.026
 toy-0.025
Token pond
Token ID16723
Feature activation-1.63e-02

Pos logit contributions
 ey+0.006
 answers+0.006
iles+0.006
useum+0.006
esame+0.006
Neg logit contributions
attered-0.005
 possession-0.004
 favorite-0.004
 favourite-0.004
 forever-0.004
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
 ey+0.032
iles+0.030
esame+0.030
useum+0.030
 answers+0.029
Neg logit contributions
 possession-0.018
attered-0.018
 favorite-0.018
 forever-0.018
 favourite-0.016
Token They
Token ID1119
Feature activation-1.10e-02

Pos logit contributions
iles+0.024
 ey+0.024
 answers+0.024
aches+0.024
esame+0.023
Neg logit contributions
attered-0.015
 favorite-0.015
 favourite-0.014
 possession-0.014
 forever-0.013
Token saw
Token ID2497
Feature activation-3.56e-02

Pos logit contributions
 ey+0.020
useum+0.019
esame+0.019
 answers+0.019
iles+0.018
Neg logit contributions
 favorite-0.014
 favourite-0.014
attered-0.014
 forever-0.013
 no-0.013
Token lot
Token ID1256
Feature activation-7.31e-02

Pos logit contributions
 ey+0.071
useum+0.071
esame+0.068
rig+0.068
 answers+0.067
Neg logit contributions
 favorite-0.039
 favourite-0.036
 no-0.036
 nobody-0.035
attered-0.034
Token of
Token ID286
Feature activation-5.74e-02

Pos logit contributions
 ey+0.128
useum+0.123
esame+0.121
 say+0.118
 question+0.118
Neg logit contributions
 favorite-0.090
 no-0.085
 favourite-0.085
 possession-0.083
attered-0.081
Token ducks
Token ID39694
Feature activation-4.42e-02

Pos logit contributions
esame+0.127
useum+0.118
 ey+0.117
rimp+0.114
rig+0.114
Neg logit contributions
 favorite-0.058
 favourite-0.052
 best-0.051
 toy-0.051
 forever-0.048
Token swimming
Token ID14899
Feature activation-2.02e-02

Pos logit contributions
 ey+0.093
useum+0.089
esame+0.085
 answers+0.084
rig+0.082
Neg logit contributions
 forever-0.052
 no-0.046
 favorite-0.045
 favourite-0.041
 best-0.040
Token around
Token ID1088
Feature activation-1.22e-02

Pos logit contributions
 ey+0.041
esame+0.038
useum+0.038
 answers+0.038
 explains+0.038
Neg logit contributions
 favorite-0.023
 forever-0.022
attered-0.022
 favourite-0.021
 possession-0.020
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
 ey+0.025
useum+0.024
rig+0.024
esame+0.024
 explains+0.023
Neg logit contributions
 forever-0.014
 no-0.013
 favorite-0.012
 favourite-0.012
 possession-0.012
Token fox
Token ID21831
Feature activation-1.25e-02

Pos logit contributions
 ey+0.015
ount+0.015
esame+0.014
useum+0.014
rig+0.014
Neg logit contributions
 favorite-0.011
 favourite-0.010
 toy-0.010
 possession-0.010
 whole-0.010
Token gob
Token ID48484
Feature activation-4.92e-03

Pos logit contributions
 ey+0.027
useum+0.025
ount+0.024
iles+0.024
esame+0.024
Neg logit contributions
 no-0.014
 forever-0.013
 nobody-0.012
 favorite-0.012
 toy-0.011
Tokenbled
Token ID9342
Feature activation-9.30e-03

Pos logit contributions
 ey+0.011
 answers+0.011
useum+0.011
iles+0.011
esame+0.011
Neg logit contributions
attered-0.003
 possession-0.003
 favorite-0.003
 favourite-0.003
 forever-0.002
Token up
Token ID510
Feature activation-1.96e-02

Pos logit contributions
 ey+0.017
useum+0.017
 answers+0.017
iles+0.017
rig+0.017
Neg logit contributions
attered-0.010
 favorite-0.010
 possession-0.010
 favourite-0.009
 forever-0.009
Token the
Token ID262
Feature activation-2.81e-02

Pos logit contributions
 ey+0.040
rig+0.039
useum+0.039
iles+0.038
rimp+0.037
Neg logit contributions
 favorite-0.022
 no-0.021
 forever-0.020
 favourite-0.019
 nobody-0.019
Token juicy
Token ID35075
Feature activation-7.26e-02

Pos logit contributions
 ey+0.038
useum+0.038
esame+0.037
rig+0.037
iles+0.037
Neg logit contributions
 favorite-0.047
 possession-0.044
attered-0.044
 favourite-0.044
 no-0.042
Token worm
Token ID23849
Feature activation-2.42e-02

Pos logit contributions
rig+0.103
useum+0.098
ount+0.097
esame+0.095
 explains+0.094
Neg logit contributions
 favorite-0.120
 possession-0.117
attered-0.116
 favourite-0.111
 toy-0.110
Token,
Token ID11
Feature activation-1.67e-02

Pos logit contributions
 ey+0.040
useum+0.039
iles+0.038
 answers+0.038
esame+0.038
Neg logit contributions
 favorite-0.032
attered-0.032
 possession-0.032
 favourite-0.031
 forever-0.031
Token feeling
Token ID4203
Feature activation-6.97e-03

Pos logit contributions
iles+0.029
useum+0.028
esame+0.027
 ey+0.026
rig+0.026
Neg logit contributions
 no-0.024
 favorite-0.023
 forever-0.023
 nobody-0.023
 possession-0.021
Token much
Token ID881
Feature activation-2.98e-02

Pos logit contributions
 answers+0.012
 say+0.012
 ey+0.011
 perk+0.011
 explains+0.011
Neg logit contributions
attered-0.009
 possession-0.009
 favorite-0.008
 favourite-0.008
 forever-0.007
Token better
Token ID1365
Feature activation-1.53e-02

Pos logit contributions
iles+0.055
 ey+0.054
useum+0.053
esame+0.052
 answers+0.051
Neg logit contributions
 favorite-0.034
 forever-0.034
attered-0.033
 no-0.032
 possession-0.032
Token said
Token ID531
Feature activation-3.99e-03

Pos logit contributions
useum+0.013
 ey+0.013
esame+0.013
iles+0.013
rig+0.012
Neg logit contributions
attered-0.010
 favorite-0.010
 no-0.010
 favourite-0.009
 nobody-0.009
Token:
Token ID25
Feature activation-2.63e-02

Pos logit contributions
 ey+0.006
 answers+0.006
iles+0.006
useum+0.006
esame+0.006
Neg logit contributions
attered-0.006
 favorite-0.006
 possession-0.006
 favourite-0.005
 forever-0.005
Token "
Token ID366
Feature activation+1.77e-02

Pos logit contributions
iles+0.064
 ey+0.061
rig+0.060
 answers+0.060
 facts+0.057
Neg logit contributions
 no-0.019
 nobody-0.017
 favorite-0.017
 favourite-0.017
 forever-0.016
TokenTake
Token ID12322
Feature activation-1.97e-02

Pos logit contributions
attered+0.017
 possession+0.015
 favorite+0.014
 favourite+0.014
 nobody+0.014
Neg logit contributions
 ey-0.037
 answers-0.036
useum-0.036
 calls-0.035
iles-0.035
Token as
Token ID355
Feature activation-2.30e-02

Pos logit contributions
 ey+0.041
iles+0.039
useum+0.039
rig+0.038
 answers+0.037
Neg logit contributions
 possession-0.018
 no-0.018
 forever-0.018
 favorite-0.018
 favourite-0.017
Token much
Token ID881
Feature activation-7.26e-02

Pos logit contributions
 ey+0.048
 answers+0.047
useum+0.047
iles+0.047
esame+0.046
Neg logit contributions
attered-0.020
 possession-0.020
 favorite-0.020
 favourite-0.019
 forever-0.018
Token as
Token ID355
Feature activation-2.15e-02

Pos logit contributions
 ey+0.156
rig+0.150
ount+0.149
iles+0.149
ric+0.142
Neg logit contributions
 no-0.061
 favorite-0.061
 forever-0.057
 favourite-0.057
 good-0.054
Token you
Token ID345
Feature activation-4.77e-02

Pos logit contributions
 ey+0.046
 answers+0.044
useum+0.044
iles+0.044
rig+0.044
Neg logit contributions
 favorite-0.018
 possession-0.018
attered-0.018
 favourite-0.017
 forever-0.017
Token like
Token ID588
Feature activation-2.47e-02

Pos logit contributions
 ey+0.090
rig+0.082
 facts+0.082
 explains+0.081
useum+0.081
Neg logit contributions
 favorite-0.053
 forever-0.052
attered-0.051
 perfectly-0.050
 favourite-0.049
Token and
Token ID290
Feature activation-1.27e-02

Pos logit contributions
 ey+0.046
 answers+0.044
iles+0.044
useum+0.044
rig+0.044
Neg logit contributions
 possession-0.027
 favorite-0.027
attered-0.027
 forever-0.026
 favourite-0.026
Token have
Token ID423
Feature activation+1.81e-03

Pos logit contributions
iles+0.022
 ey+0.022
useum+0.022
rig+0.021
esame+0.021
Neg logit contributions
 no-0.016
 favorite-0.016
 forever-0.016
 nobody-0.016
 favourite-0.015
Token treat
Token ID2190
Feature activation-6.65e-03

Pos logit contributions
 favorite+0.011
 favourite+0.010
attered+0.010
 forever+0.010
 possession+0.010
Neg logit contributions
iles-0.012
 ey-0.012
esame-0.012
 answers-0.012
 explains-0.011
Token.
Token ID13
Feature activation-5.50e-03

Pos logit contributions
 ey+0.013
rig+0.012
useum+0.012
iles+0.012
esame+0.012
Neg logit contributions
attered-0.007
 favorite-0.007
 forever-0.007
 possession-0.007
 no-0.007
Token He
Token ID679
Feature activation-9.11e-03

Pos logit contributions
 ey+0.010
iles+0.010
 answers+0.010
useum+0.010
rig+0.010
Neg logit contributions
attered-0.006
 favorite-0.006
 favourite-0.006
 possession-0.006
 nobody-0.006
Token ate
Token ID15063
Feature activation-2.60e-02

Pos logit contributions
 ey+0.017
useum+0.017
esame+0.016
iles+0.016
 answers+0.016
Neg logit contributions
 favorite-0.012
 favourite-0.011
attered-0.011
 no-0.011
 forever-0.010
Token as
Token ID355
Feature activation-1.61e-02

Pos logit contributions
useum+0.055
 ey+0.054
iles+0.053
rig+0.053
 question+0.051
Neg logit contributions
 favorite-0.026
 forever-0.025
 no-0.025
 favourite-0.023
 nobody-0.022
Token much
Token ID881
Feature activation-7.23e-02

Pos logit contributions
 ey+0.033
 answers+0.033
iles+0.033
esame+0.033
useum+0.032
Neg logit contributions
attered-0.015
 possession-0.013
 favorite-0.013
 favourite-0.012
 forever-0.011
Token of
Token ID286
Feature activation-3.55e-02

Pos logit contributions
 ey+0.152
rig+0.148
iles+0.148
ount+0.142
avi+0.141
Neg logit contributions
 favorite-0.068
 favourite-0.061
 good-0.061
 no-0.061
 much-0.057
Token the
Token ID262
Feature activation-2.05e-02

Pos logit contributions
useum+0.078
rig+0.077
iles+0.073
 ey+0.073
ount+0.073
Neg logit contributions
 no-0.037
 nobody-0.032
 forever-0.030
 favorite-0.030
 possession-0.029
Token rain
Token ID6290
Feature activation-5.37e-03

Pos logit contributions
iles+0.037
esame+0.037
 ey+0.036
 explains+0.036
olo+0.035
Neg logit contributions
 favorite-0.027
 whole-0.025
attered-0.025
 favourite-0.025
 toy-0.023
Token as
Token ID355
Feature activation-1.48e-02

Pos logit contributions
 ey+0.008
useum+0.008
iles+0.008
 answers+0.007
esame+0.007
Neg logit contributions
attered-0.008
 favorite-0.008
 possession-0.008
 favourite-0.008
 forever-0.008
Token he
Token ID339
Feature activation-1.05e-02

Pos logit contributions
 ey+0.027
 answers+0.026
useum+0.026
esame+0.026
rig+0.025
Neg logit contributions
attered-0.017
 favorite-0.017
 possession-0.017
 favourite-0.016
 forever-0.016
TokenNo
Token ID2949
Feature activation+1.61e-02

Pos logit contributions
 possession+0.054
attered+0.053
 favourite+0.052
 favorite+0.051
 forever+0.051
Neg logit contributions
 say-0.054
 answers-0.051
 ey-0.051
 explains-0.050
rimp-0.050
Token,
Token ID11
Feature activation-3.13e-04

Pos logit contributions
attered+0.026
 possession+0.025
 favorite+0.025
 favourite+0.024
 nobody+0.023
Neg logit contributions
 ey-0.022
 answers-0.022
useum-0.021
iles-0.021
esame-0.021
Token Mom
Token ID11254
Feature activation-7.41e-03

Pos logit contributions
 say+0.001
 answers+0.001
 ey+0.001
 says+0.001
 question+0.000
Neg logit contributions
attered-0.000
 possession-0.000
 favourite-0.000
 favorite-0.000
organized-0.000
Token,
Token ID11
Feature activation-1.63e-03

Pos logit contributions
 ey+0.012
 answers+0.012
useum+0.012
iles+0.011
esame+0.011
Neg logit contributions
attered-0.010
 possession-0.010
 favorite-0.010
 favourite-0.009
 forever-0.009
Token please
Token ID3387
Feature activation-1.34e-02

Pos logit contributions
 say+0.003
 answers+0.003
 ey+0.003
esame+0.003
 question+0.003
Neg logit contributions
attered-0.002
 possession-0.002
 favourite-0.002
 favorite-0.002
 forever-0.002
Token.
Token ID13
Feature activation+4.10e-02

Pos logit contributions
 answers+0.019
 ey+0.019
useum+0.019
 say+0.019
esame+0.018
Neg logit contributions
attered-0.021
 possession-0.020
 favorite-0.020
 favourite-0.019
 forever-0.018
Token I
Token ID314
Feature activation-3.93e-03

Pos logit contributions
 possession+0.059
attered+0.059
 forever+0.056
 favourite+0.055
 favorite+0.054
Neg logit contributions
 say-0.061
 calls-0.060
 ey-0.060
 says-0.058
 slam-0.058
Token am
Token ID716
Feature activation+1.61e-02

Pos logit contributions
 ey+0.006
 answers+0.006
useum+0.006
esame+0.006
iles+0.006
Neg logit contributions
attered-0.006
 favorite-0.006
 possession-0.006
 favourite-0.005
 forever-0.005
Token sorry
Token ID7926
Feature activation+5.69e-03

Pos logit contributions
attered+0.022
 possession+0.021
 favorite+0.020
 favourite+0.019
 forever+0.019
Neg logit contributions
 answers-0.026
 say-0.026
 ey-0.026
useum-0.026
esame-0.025
Token.
Token ID13
Feature activation+4.22e-02

Pos logit contributions
attered+0.011
 possession+0.010
 favourite+0.010
 favorite+0.009
 nobody+0.009
Neg logit contributions
 say-0.007
 answers-0.007
 calls-0.007
useum-0.007
iles-0.006
Token I
Token ID314
Feature activation-2.74e-04

Pos logit contributions
attered+0.063
 possession+0.062
 forever+0.059
 favourite+0.058
 favorite+0.057
Neg logit contributions
 say-0.062
 ey-0.062
 calls-0.061
 answers-0.060
 says-0.060
Token in
Token ID287
Feature activation+1.18e-02

Pos logit contributions
 possession+0.011
 favourite+0.010
Mid+0.010
attered+0.010
 proper+0.010
Neg logit contributions
iles-0.007
 answers-0.007
 ey-0.007
 pe-0.007
esame-0.007
Token the
Token ID262
Feature activation+2.01e-02

Pos logit contributions
attered+0.019
 possession+0.017
 favorite+0.017
 favourite+0.017
 nobody+0.016
Neg logit contributions
 answers-0.017
 ey-0.017
 say-0.016
esame-0.016
useum-0.016
Token corner
Token ID5228
Feature activation+1.31e-02

Pos logit contributions
attered+0.033
 possession+0.032
 forever+0.031
 badly+0.030
 nobody+0.029
Neg logit contributions
 ey-0.027
 answers-0.025
 say-0.024
useum-0.023
esame-0.023
Token of
Token ID286
Feature activation+5.76e-03

Pos logit contributions
attered+0.020
 possession+0.019
 favorite+0.019
 favourite+0.018
 forever+0.018
Neg logit contributions
 answers-0.019
 ey-0.019
useum-0.018
iles-0.018
 say-0.018
Token the
Token ID262
Feature activation+1.10e-02

Pos logit contributions
attered+0.009
 possession+0.009
 favorite+0.009
 favourite+0.008
 forever+0.008
Neg logit contributions
 ey-0.008
 answers-0.008
useum-0.008
esame-0.008
iles-0.008
Token room
Token ID2119
Feature activation+3.79e-02

Pos logit contributions
attered+0.016
 possession+0.015
 nobody+0.015
 forever+0.014
 favorite+0.014
Neg logit contributions
 ey-0.017
 answers-0.017
useum-0.017
 say-0.016
esame-0.016
Token.
Token ID13
Feature activation-1.94e-03

Pos logit contributions
attered+0.062
 possession+0.062
 favorite+0.057
Mid+0.057
 favourite+0.057
Neg logit contributions
 answers-0.048
 say-0.046
useum-0.046
 calls-0.044
rimp-0.043
Token One
Token ID1881
Feature activation+9.24e-03

Pos logit contributions
iles+0.004
 ey+0.004
rig+0.004
 answers+0.004
esame+0.004
Neg logit contributions
 favorite-0.002
 favourite-0.002
 nobody-0.002
 no-0.002
attered-0.002
Token day
Token ID1110
Feature activation-8.10e-03

Pos logit contributions
attered+0.008
 possession+0.008
 favorite+0.008
 favourite+0.007
 forever+0.007
Neg logit contributions
 ey-0.019
 answers-0.019
useum-0.019
iles-0.019
esame-0.019
Token,
Token ID11
Feature activation-8.89e-03

Pos logit contributions
 ey+0.011
 answers+0.010
useum+0.010
esame+0.010
iles+0.010
Neg logit contributions
attered-0.013
 possession-0.013
 favorite-0.013
 favourite-0.012
 forever-0.012
Token a
Token ID257
Feature activation+1.57e-02

Pos logit contributions
 ey+0.014
 answers+0.014
useum+0.014
esame+0.014
iles+0.014
Neg logit contributions
attered-0.012
 possession-0.012
 favorite-0.012
 favourite-0.011
 forever-0.011
Token thanked
Token ID26280
Feature activation+1.02e-02

Pos logit contributions
 favorite+0.029
 possession+0.029
attered+0.028
 forever+0.028
 nobody+0.028
Neg logit contributions
 ey-0.033
iles-0.033
useum-0.032
rig-0.032
esame-0.031
Token her
Token ID607
Feature activation-2.02e-03

Pos logit contributions
 possession+0.010
 favorite+0.010
 forever+0.010
 nobody+0.009
 favourite+0.009
Neg logit contributions
 ey-0.020
rig-0.020
useum-0.020
iles-0.020
 answers-0.019
Token for
Token ID329
Feature activation+1.39e-02

Pos logit contributions
 ey+0.004
iles+0.004
esame+0.004
useum+0.004
rimp+0.004
Neg logit contributions
 favorite-0.002
 favourite-0.002
 forever-0.002
 possession-0.002
attered-0.002
Token the
Token ID262
Feature activation+1.88e-02

Pos logit contributions
attered+0.021
 possession+0.020
 favorite+0.020
 favourite+0.019
 forever+0.018
Neg logit contributions
 ey-0.020
 answers-0.020
 say-0.020
esame-0.020
useum-0.020
Token wonderful
Token ID7932
Feature activation+1.21e-02

Pos logit contributions
attered+0.027
 possession+0.025
 nobody+0.022
 forever+0.022
 favorite+0.022
Neg logit contributions
 answers-0.032
 say-0.031
 ey-0.030
 question-0.029
useum-0.029
Token presents
Token ID10969
Feature activation+4.14e-02

Pos logit contributions
attered+0.018
 nobody+0.016
 possession+0.016
 favorite+0.015
 forever+0.015
Neg logit contributions
 answers-0.019
 say-0.018
 ey-0.018
 question-0.018
useum-0.018
Token.
Token ID13
Feature activation-1.49e-03

Pos logit contributions
attered+0.070
 possession+0.065
 favorite+0.063
 favourite+0.062
 nobody+0.060
Neg logit contributions
 say-0.055
 answers-0.053
 calls-0.049
useum-0.049
 calling-0.048
Token They
Token ID1119
Feature activation+6.88e-03

Pos logit contributions
 ey+0.002
 answers+0.002
useum+0.002
esame+0.002
 say+0.002
Neg logit contributions
attered-0.002
 possession-0.002
 favorite-0.002
 favourite-0.002
 forever-0.002
Token thanked
Token ID26280
Feature activation+1.15e-02

Pos logit contributions
attered+0.008
 possession+0.008
 favorite+0.008
 favourite+0.007
 nobody+0.007
Neg logit contributions
 ey-0.012
 answers-0.012
 say-0.012
iles-0.012
esame-0.012
Token her
Token ID607
Feature activation-6.85e-03

Pos logit contributions
 favorite+0.011
 possession+0.011
 forever+0.011
 nobody+0.011
 favourite+0.010
Neg logit contributions
 ey-0.023
useum-0.022
iles-0.022
rig-0.022
 answers-0.022
Token for
Token ID329
Feature activation+1.29e-02

Pos logit contributions
 ey+0.014
iles+0.013
useum+0.013
esame+0.013
rimp+0.013
Neg logit contributions
 favorite-0.007
 favourite-0.007
 forever-0.007
 possession-0.006
 no-0.006
Token my
Token ID616
Feature activation+4.13e-03

Pos logit contributions
attered+0.001
 possession+0.001
 favourite+0.001
 favorite+0.001
 forever+0.001
Neg logit contributions
 say-0.001
 answers-0.001
esame-0.001
 Jen-0.001
iles-0.001
Token notebook
Token ID20922
Feature activation+1.34e-02

Pos logit contributions
attered+0.007
 nobody+0.006
 possession+0.006
 forever+0.006
 favourite+0.006
Neg logit contributions
 answers-0.006
 say-0.006
 ey-0.006
 question-0.005
 says-0.005
Token!
Token ID0
Feature activation+1.87e-02

Pos logit contributions
attered+0.023
 possession+0.022
 favourite+0.021
 nobody+0.020
 favorite+0.020
Neg logit contributions
 answers-0.017
 say-0.017
esame-0.016
 says-0.015
 ey-0.015
Token You
Token ID921
Feature activation-1.05e-02

Pos logit contributions
 possession+0.031
attered+0.031
 forever+0.030
 favourite+0.029
 favorite+0.029
Neg logit contributions
 calls-0.023
 say-0.023
 ey-0.023
 says-0.023
useum-0.023
Token are
Token ID389
Feature activation+1.02e-02

Pos logit contributions
 say+0.011
 answers+0.011
iles+0.010
 calls+0.010
 says+0.010
Neg logit contributions
 possession-0.021
attered-0.021
 favourite-0.019
 favorite-0.019
 forever-0.018
Token a
Token ID257
Feature activation+4.43e-02

Pos logit contributions
attered+0.015
 possession+0.014
 favourite+0.013
 favorite+0.013
 forever+0.012
Neg logit contributions
 answers-0.016
 say-0.016
 ey-0.015
iles-0.015
 calling-0.015
Token bad
Token ID2089
Feature activation+1.26e-02

Pos logit contributions
attered+0.083
 possession+0.068
organized+0.065
 nobody+0.063
 favourite+0.062
Neg logit contributions
 pe-0.060
 question-0.059
 say-0.058
 answers-0.058
 calls-0.053
Token friend
Token ID1545
Feature activation+5.92e-03

Pos logit contributions
attered+0.017
 possession+0.015
 nobody+0.015
 favourite+0.014
 favorite+0.014
Neg logit contributions
 answers-0.022
 say-0.022
 question-0.021
 ey-0.021
iles-0.021
Token!"
Token ID2474
Feature activation-1.78e-02

Pos logit contributions
attered+0.010
 favourite+0.009
 possession+0.009
 favorite+0.009
 nobody+0.008
Neg logit contributions
 answers-0.008
 say-0.008
iles-0.008
useum-0.008
 calls-0.008
Token
Token ID198
Feature activation-1.45e-02

Pos logit contributions
iles+0.037
rig+0.036
useum+0.035
rimp+0.034
ount+0.034
Neg logit contributions
 favorite-0.017
 no-0.017
 favourite-0.015
 nobody-0.015
 forever-0.015
Token
Token ID198
Feature activation-4.11e-03

Pos logit contributions
useum+0.027
iles+0.027
 answers+0.027
 ey+0.027
ount+0.026
Neg logit contributions
 favorite-0.015
 no-0.014
attered-0.014
 possession-0.014
 favourite-0.014
Token got
Token ID1392
Feature activation-1.37e-04

Pos logit contributions
attered+0.006
 possession+0.006
 favorite+0.006
 favourite+0.006
 forever+0.006
Neg logit contributions
 ey-0.005
 answers-0.005
useum-0.005
iles-0.005
esame-0.005
Token to
Token ID284
Feature activation-3.10e-03

Pos logit contributions
 answers+0.000
 ey+0.000
esame+0.000
useum+0.000
 say+0.000
Neg logit contributions
attered-0.000
 possession-0.000
 favorite-0.000
 favourite-0.000
 nobody-0.000
Token the
Token ID262
Feature activation+8.65e-03

Pos logit contributions
iles+0.005
rig+0.005
 ey+0.005
useum+0.005
rimp+0.005
Neg logit contributions
 possession-0.004
 favorite-0.004
 forever-0.004
 no-0.004
 favourite-0.004
Token front
Token ID2166
Feature activation-1.16e-03

Pos logit contributions
attered+0.015
 possession+0.015
 forever+0.014
 favorite+0.013
 favourite+0.013
Neg logit contributions
 answers-0.011
 say-0.011
useum-0.010
esame-0.010
 ey-0.010
Token,
Token ID11
Feature activation-6.25e-03

Pos logit contributions
 ey+0.002
iles+0.002
rig+0.002
esame+0.002
useum+0.002
Neg logit contributions
attered-0.002
 favorite-0.002
 possession-0.002
 favourite-0.002
 no-0.001
Token Tim
Token ID5045
Feature activation+3.86e-02

Pos logit contributions
 say+0.008
 ey+0.008
 answers+0.008
useum+0.008
esame+0.008
Neg logit contributions
attered-0.010
 possession-0.010
 favorite-0.009
 favourite-0.009
 forever-0.009
Tokenmy
Token ID1820
Feature activation+1.67e-02

Pos logit contributions
attered+0.066
 possession+0.065
 favorite+0.064
 favourite+0.062
 forever+0.061
Neg logit contributions
 ey-0.048
 answers-0.047
useum-0.046
iles-0.046
esame-0.046
Token picked
Token ID6497
Feature activation-3.39e-04

Pos logit contributions
attered+0.023
 favorite+0.023
 possession+0.022
 favourite+0.022
 forever+0.021
Neg logit contributions
 ey-0.026
iles-0.026
useum-0.026
 answers-0.025
esame-0.025
Token out
Token ID503
Feature activation+1.32e-02

Pos logit contributions
 answers+0.001
 say+0.001
esame+0.001
 ey+0.001
 explains+0.001
Neg logit contributions
attered-0.000
 possession-0.000
 forever-0.000
 nobody-0.000
 favorite-0.000
Token a
Token ID257
Feature activation+2.99e-03

Pos logit contributions
attered+0.023
 possession+0.021
 forever+0.020
 favourite+0.019
 favorite+0.019
Neg logit contributions
 answers-0.018
 say-0.017
 ey-0.017
 questions-0.016
esame-0.016
Token book
Token ID1492
Feature activation-8.27e-03

Pos logit contributions
 favorite+0.003
attered+0.003
 possession+0.003
 favourite+0.003
 forever+0.003
Neg logit contributions
 ey-0.006
iles-0.006
useum-0.005
 answers-0.005
esame-0.005
Token Sometimes
Token ID8975
Feature activation-1.24e-03

Pos logit contributions
 favorite+0.003
attered+0.003
 favourite+0.003
 nobody+0.003
 no+0.003
Neg logit contributions
 ey-0.005
 answers-0.005
iles-0.005
esame-0.005
aches-0.005
Token they
Token ID484
Feature activation-7.53e-03

Pos logit contributions
 ey+0.003
rig+0.002
esame+0.002
 answers+0.002
useum+0.002
Neg logit contributions
 favorite-0.001
 possession-0.001
attered-0.001
 nobody-0.001
 favourite-0.001
Token saw
Token ID2497
Feature activation-1.94e-02

Pos logit contributions
 ey+0.014
useum+0.013
esame+0.013
rig+0.013
 answers+0.013
Neg logit contributions
attered-0.009
 favorite-0.009
 forever-0.008
 possession-0.008
 nobody-0.008
Token big
Token ID1263
Feature activation-3.76e-02

Pos logit contributions
rig+0.032
 ey+0.032
esame+0.032
useum+0.032
rimp+0.030
Neg logit contributions
 favorite-0.027
 no-0.025
 nobody-0.025
 favourite-0.025
attered-0.025
Token waves
Token ID9813
Feature activation+3.72e-03

Pos logit contributions
rig+0.054
esame+0.048
rimp+0.047
useum+0.046
utt+0.046
Neg logit contributions
 favorite-0.061
 possession-0.059
 forever-0.057
attered-0.057
 toy-0.057
Token that
Token ID326
Feature activation+3.70e-03

Pos logit contributions
 no+0.005
 forever+0.004
 favorite+0.004
attered+0.004
 nobody+0.004
Neg logit contributions
 ey-0.007
rig-0.007
useum-0.007
esame-0.007
 answers-0.006
Token crashed
Token ID14997
Feature activation+9.21e-03

Pos logit contributions
 favorite+0.004
 nobody+0.004
 no+0.004
attered+0.004
 favourite+0.004
Neg logit contributions
esame-0.007
 ey-0.007
rig-0.007
useum-0.007
iles-0.006
Token on
Token ID319
Feature activation+2.25e-03

Pos logit contributions
attered+0.011
 favorite+0.011
 forever+0.010
 no+0.010
 possession+0.010
Neg logit contributions
rig-0.017
 ey-0.017
 explains-0.016
 say-0.016
useum-0.016
Token the
Token ID262
Feature activation-1.72e-02

Pos logit contributions
 favorite+0.003
attered+0.003
 possession+0.003
 favourite+0.003
 nobody+0.003
Neg logit contributions
 ey-0.004
useum-0.004
 answers-0.004
rig-0.004
esame-0.004
Token shore
Token ID15191
Feature activation-1.25e-02

Pos logit contributions
useum+0.025
 explains+0.025
iles+0.024
esame+0.024
rig+0.024
Neg logit contributions
 favorite-0.026
attered-0.025
 favourite-0.025
 possession-0.025
 no-0.024
Token.
Token ID13
Feature activation+4.57e-03

Pos logit contributions
 ey+0.026
rig+0.026
useum+0.025
 explains+0.025
esame+0.025
Neg logit contributions
 no-0.010
attered-0.010
 forever-0.010
 favorite-0.010
 nobody-0.009
Token they
Token ID484
Feature activation-1.20e-02

Pos logit contributions
 ey+0.032
 answers+0.031
useum+0.031
iles+0.031
rig+0.030
Neg logit contributions
attered-0.028
 favorite-0.027
 possession-0.027
 favourite-0.026
 forever-0.026
Token tried
Token ID3088
Feature activation-1.51e-03

Pos logit contributions
 ey+0.019
 answers+0.018
useum+0.018
iles+0.018
esame+0.018
Neg logit contributions
attered-0.017
 favorite-0.016
 possession-0.016
 favourite-0.016
 forever-0.016
Token and
Token ID290
Feature activation-4.24e-03

Pos logit contributions
 ey+0.003
iles+0.003
useum+0.003
rig+0.003
rimp+0.003
Neg logit contributions
 favorite-0.002
 forever-0.002
 possession-0.002
 favourite-0.002
 no-0.002
Token finally
Token ID3443
Feature activation-7.74e-03

Pos logit contributions
 ey+0.007
iles+0.007
useum+0.007
rig+0.007
 answers+0.007
Neg logit contributions
 forever-0.006
attered-0.005
 no-0.005
 possession-0.005
 nobody-0.005
Token,
Token ID11
Feature activation-2.62e-02

Pos logit contributions
 ey+0.013
useum+0.012
iles+0.012
 answers+0.012
esame+0.012
Neg logit contributions
attered-0.011
 favorite-0.010
 possession-0.010
 forever-0.010
 favourite-0.010
Token they
Token ID484
Feature activation+4.84e-03

Pos logit contributions
 ey+0.044
iles+0.042
useum+0.042
 answers+0.041
rig+0.040
Neg logit contributions
 favorite-0.035
 forever-0.034
 no-0.034
attered-0.034
 possession-0.033
Token were
Token ID547
Feature activation-3.77e-04

Pos logit contributions
attered+0.006
 favorite+0.006
 possession+0.006
 favourite+0.006
 forever+0.006
Neg logit contributions
 ey-0.008
 answers-0.008
useum-0.008
iles-0.008
esame-0.008
Token able
Token ID1498
Feature activation+1.57e-02

Pos logit contributions
 answers+0.001
 ey+0.001
 say+0.001
esame+0.001
iles+0.001
Neg logit contributions
attered-0.000
 possession-0.000
 favorite-0.000
 favourite-0.000
 nobody-0.000
Token to
Token ID284
Feature activation-1.60e-02

Pos logit contributions
attered+0.026
 possession+0.025
 favorite+0.024
 favourite+0.024
 nobody+0.023
Neg logit contributions
 answers-0.020
 say-0.019
useum-0.019
 ey-0.019
esame-0.019
Token carry
Token ID3283
Feature activation+3.65e-03

Pos logit contributions
 ey+0.023
useum+0.022
iles+0.022
 answers+0.022
 explains+0.021
Neg logit contributions
 favorite-0.024
attered-0.024
 forever-0.024
 possession-0.024
 favourite-0.024
Token the
Token ID262
Feature activation-2.48e-03

Pos logit contributions
attered+0.006
 possession+0.006
 favorite+0.006
 favourite+0.006
 nobody+0.005
Neg logit contributions
 answers-0.005
 ey-0.005
esame-0.005
 say-0.005
iles-0.004
Token a
Token ID257
Feature activation+1.78e-02

Pos logit contributions
 say+0.013
 answers+0.012
 ey+0.012
iles+0.012
useum+0.011
Neg logit contributions
attered-0.020
 possession-0.019
 forever-0.017
 nobody-0.017
 perfectly-0.016
Token little
Token ID1310
Feature activation+8.77e-03

Pos logit contributions
attered+0.024
 possession+0.022
 favorite+0.020
 favourite+0.020
 nobody+0.020
Neg logit contributions
 ey-0.029
useum-0.029
 say-0.029
 answers-0.029
 question-0.029
Token boy
Token ID2933
Feature activation+7.11e-03

Pos logit contributions
attered+0.012
 possession+0.011
 nobody+0.011
 favourite+0.011
 favorite+0.011
Neg logit contributions
 ey-0.014
 say-0.014
 answers-0.014
iles-0.013
useum-0.013
Token named
Token ID3706
Feature activation+9.62e-03

Pos logit contributions
attered+0.015
 favourite+0.014
 favorite+0.014
 possession+0.014
 nobody+0.013
Neg logit contributions
 say-0.007
iles-0.006
 calls-0.006
 answers-0.006
 question-0.006
Token Tim
Token ID5045
Feature activation+2.49e-02

Pos logit contributions
attered+0.015
 possession+0.015
 favorite+0.014
 favourite+0.014
 forever+0.014
Neg logit contributions
 ey-0.013
 answers-0.013
useum-0.013
iles-0.013
 say-0.013
Tokenmy
Token ID1820
Feature activation+2.24e-02

Pos logit contributions
attered+0.050
 favourite+0.048
 possession+0.047
 favorite+0.047
 nobody+0.046
Neg logit contributions
 say-0.026
 answers-0.025
iles-0.024
 ey-0.024
 calls-0.022
Token.
Token ID13
Feature activation+5.78e-04

Pos logit contributions
attered+0.047
 favourite+0.046
 perfectly+0.044
dates+0.044
 nobody+0.043
Neg logit contributions
 say-0.022
 calling-0.018
 calls-0.017
 question-0.016
 pe-0.016
Token Tim
Token ID5045
Feature activation+2.32e-02

Pos logit contributions
attered+0.001
 favorite+0.001
 favourite+0.001
 forever+0.001
 possession+0.001
Neg logit contributions
esame-0.001
 ey-0.001
 answers-0.001
rig-0.001
useum-0.001
Tokenmy
Token ID1820
Feature activation+1.98e-02

Pos logit contributions
attered+0.047
 possession+0.045
 favourite+0.045
 nobody+0.045
 favorite+0.044
Neg logit contributions
 say-0.021
 answers-0.021
 ey-0.020
iles-0.020
 explains-0.018
Token loved
Token ID6151
Feature activation-3.25e-03

Pos logit contributions
attered+0.027
rier+0.027
 possession+0.026
 forward+0.025
 nobody+0.024
Neg logit contributions
 say-0.031
 says-0.028
 calls-0.028
 answers-0.027
 answered-0.027
Token to
Token ID284
Feature activation+9.15e-03

Pos logit contributions
 ey+0.006
rig+0.006
useum+0.006
iles+0.006
esame+0.005
Neg logit contributions
 favorite-0.004
 possession-0.004
attered-0.004
 favourite-0.004
 nobody-0.003
Token jumped
Token ID11687
Feature activation-1.88e-02

Pos logit contributions
 ey+0.026
useum+0.025
iles+0.024
rig+0.024
 facts+0.023
Neg logit contributions
 no-0.019
 favorite-0.019
 forever-0.018
 favourite-0.018
 nobody-0.018
Token in
Token ID287
Feature activation-1.27e-02

Pos logit contributions
 ey+0.037
iles+0.036
useum+0.035
 answers+0.034
esame+0.034
Neg logit contributions
 favorite-0.022
attered-0.021
 favourite-0.020
 forever-0.020
 no-0.018
Token the
Token ID262
Feature activation+1.45e-03

Pos logit contributions
useum+0.022
iles+0.022
esame+0.022
 ey+0.022
rimp+0.021
Neg logit contributions
 favorite-0.016
 forever-0.015
 possession-0.015
 no-0.015
 favourite-0.015
Token water
Token ID1660
Feature activation-2.21e-04

Pos logit contributions
attered+0.002
 favorite+0.002
 possession+0.002
 favourite+0.002
 nobody+0.002
Neg logit contributions
useum-0.002
iles-0.002
 answers-0.002
esame-0.002
 ey-0.002
Token and
Token ID290
Feature activation+1.00e-02

Pos logit contributions
 ey+0.000
useum+0.000
esame+0.000
iles+0.000
rig+0.000
Neg logit contributions
attered-0.000
 forever-0.000
 favorite-0.000
 no-0.000
 possession-0.000
Token made
Token ID925
Feature activation+5.19e-03

Pos logit contributions
 favorite+0.014
 no+0.013
attered+0.013
 nobody+0.013
 favourite+0.013
Neg logit contributions
 ey-0.018
useum-0.018
iles-0.017
rig-0.017
esame-0.017
Token a
Token ID257
Feature activation+5.19e-03

Pos logit contributions
 favorite+0.006
attered+0.006
 favourite+0.006
 possession+0.006
 nobody+0.005
Neg logit contributions
 ey-0.009
useum-0.009
esame-0.009
iles-0.009
 answers-0.009
Token big
Token ID1263
Feature activation-2.31e-02

Pos logit contributions
 favorite+0.006
 favourite+0.005
attered+0.005
 possession+0.005
 forever+0.005
Neg logit contributions
esame-0.010
 ey-0.010
iles-0.010
 answers-0.010
useum-0.010
Token splash
Token ID22870
Feature activation-2.39e-03

Pos logit contributions
esame+0.031
iles+0.031
useum+0.031
 answers+0.031
 ey+0.030
Neg logit contributions
 favorite-0.037
 forever-0.036
 possession-0.036
attered-0.036
 favourite-0.035
Token with
Token ID351
Feature activation-6.05e-03

Pos logit contributions
 ey+0.004
 answers+0.004
useum+0.004
esame+0.004
iles+0.004
Neg logit contributions
attered-0.003
 possession-0.003
 favorite-0.003
 favourite-0.003
 forever-0.003
Token the
Token ID262
Feature activation-8.71e-03

Pos logit contributions
 ey+0.009
useum+0.009
 answers+0.009
esame+0.009
iles+0.009
Neg logit contributions
attered-0.009
 favorite-0.008
 possession-0.008
 favourite-0.008
 forever-0.008
Token eager
Token ID11069
Feature activation+7.14e-03

Pos logit contributions
attered+0.005
 possession+0.004
 favorite+0.004
 favourite+0.004
 forever+0.004
Neg logit contributions
 ey-0.007
 answers-0.007
useum-0.007
iles-0.007
esame-0.007
Token to
Token ID284
Feature activation+3.39e-04

Pos logit contributions
 favorite+0.008
 possession+0.008
 forever+0.008
attered+0.008
 favourite+0.008
Neg logit contributions
 ey-0.013
useum-0.013
iles-0.013
rig-0.013
esame-0.012
Token get
Token ID651
Feature activation+4.39e-03

Pos logit contributions
 possession+0.001
 forever+0.001
 favorite+0.001
 favourite+0.001
 nobody+0.001
Neg logit contributions
iles-0.000
useum-0.000
 ey-0.000
ount-0.000
rig-0.000
Token his
Token ID465
Feature activation-6.90e-03

Pos logit contributions
 possession+0.004
 favorite+0.004
attered+0.004
 favourite+0.004
 nobody+0.004
Neg logit contributions
 ey-0.009
useum-0.009
iles-0.009
rig-0.009
esame-0.008
Token power
Token ID1176
Feature activation+6.54e-03

Pos logit contributions
 ey+0.013
ount+0.013
iles+0.013
rimp+0.013
useum+0.013
Neg logit contributions
 favorite-0.008
 favourite-0.008
 possession-0.007
 forever-0.007
 nobody-0.006
Token.
Token ID13
Feature activation-1.50e-03

Pos logit contributions
attered+0.009
 possession+0.009
 favorite+0.009
 favourite+0.009
 forever+0.008
Neg logit contributions
 ey-0.010
 answers-0.010
useum-0.010
iles-0.010
esame-0.010
Token But
Token ID887
Feature activation-3.63e-03

Pos logit contributions
iles+0.003
 ey+0.003
 perk+0.002
 answers+0.002
 say+0.002
Neg logit contributions
attered-0.002
 favorite-0.002
 possession-0.002
 favourite-0.002
 nobody-0.002
Token then
Token ID788
Feature activation-3.16e-03

Pos logit contributions
 ey+0.007
useum+0.007
iles+0.006
 answers+0.006
rig+0.006
Neg logit contributions
 favorite-0.004
 nobody-0.004
 possession-0.004
 no-0.004
 forever-0.004
Token,
Token ID11
Feature activation-6.26e-03

Pos logit contributions
rig+0.006
 ey+0.006
iles+0.006
useum+0.005
rimp+0.005
Neg logit contributions
 no-0.004
 nobody-0.004
 favorite-0.004
 forever-0.004
 possession-0.004
Token something
Token ID1223
Feature activation-2.18e-03

Pos logit contributions
 ey+0.012
iles+0.012
useum+0.012
rig+0.012
 answers+0.011
Neg logit contributions
 nobody-0.007
 no-0.007
 favorite-0.007
 possession-0.007
 forever-0.007
Token bad
Token ID2089
Feature activation+5.89e-03

Pos logit contributions
 ey+0.004
esame+0.004
useum+0.004
iles+0.004
rimp+0.004
Neg logit contributions
 favorite-0.002
attered-0.002
 possession-0.002
 nobody-0.002
 favourite-0.002
Token the
Token ID262
Feature activation+9.85e-03

Pos logit contributions
attered+0.013
 possession+0.013
 favorite+0.013
 favourite+0.013
 forever+0.012
Neg logit contributions
 ey-0.014
 answers-0.013
useum-0.013
iles-0.013
esame-0.013
Token park
Token ID3952
Feature activation+6.40e-03

Pos logit contributions
attered+0.018
 favorite+0.017
 possession+0.017
 favourite+0.017
 forever+0.016
Neg logit contributions
 ey-0.011
 answers-0.011
useum-0.011
iles-0.011
esame-0.011
Token.
Token ID13
Feature activation-7.43e-03

Pos logit contributions
 favorite+0.007
attered+0.007
 favourite+0.007
 possession+0.007
 forever+0.007
Neg logit contributions
 ey-0.013
esame-0.012
useum-0.012
rig-0.012
iles-0.011
Token They
Token ID1119
Feature activation-3.06e-02

Pos logit contributions
esame+0.015
 ey+0.015
iles+0.015
rig+0.014
 answers+0.014
Neg logit contributions
 favorite-0.008
 favourite-0.008
attered-0.007
 no-0.007
 nobody-0.006
Token skip
Token ID14267
Feature activation-7.15e-03

Pos logit contributions
 ey+0.040
useum+0.040
esame+0.039
rig+0.039
iles+0.038
Neg logit contributions
 favorite-0.050
attered-0.049
 favourite-0.048
 possession-0.048
 forever-0.047
Token and
Token ID290
Feature activation-1.69e-02

Pos logit contributions
 ey+0.011
 answers+0.011
useum+0.011
iles+0.011
esame+0.010
Neg logit contributions
attered-0.010
 possession-0.010
 favorite-0.010
 favourite-0.009
 nobody-0.009
Token run
Token ID1057
Feature activation-9.35e-03

Pos logit contributions
useum+0.026
rig+0.024
iles+0.024
esame+0.023
 ey+0.023
Neg logit contributions
 no-0.027
 favorite-0.027
 forever-0.026
 favourite-0.025
 whole-0.024
Token and
Token ID290
Feature activation-1.32e-02

Pos logit contributions
 answers+0.012
 say+0.011
useum+0.011
esame+0.011
rig+0.011
Neg logit contributions
attered-0.016
 possession-0.015
 nobody-0.015
 favorite-0.014
 favourite-0.014
Token laugh
Token ID6487
Feature activation+1.16e-02

Pos logit contributions
useum+0.016
iles+0.016
 ey+0.016
esame+0.015
rig+0.015
Neg logit contributions
 favorite-0.024
 favourite-0.023
 no-0.023
 forever-0.023
attered-0.022
Token.
Token ID13
Feature activation-5.24e-03

Pos logit contributions
attered+0.021
 possession+0.021
 nobody+0.019
 favourite+0.018
 favorite+0.018
Neg logit contributions
 answers-0.014
 say-0.013
useum-0.013
iles-0.013
esame-0.012
Token They
Token ID1119
Feature activation-2.74e-02

Pos logit contributions
 ey+0.010
esame+0.009
iles+0.009
 answers+0.009
rig+0.009
Neg logit contributions
 favorite-0.007
 favourite-0.007
attered-0.006
 no-0.006
 nobody-0.006
Token have
Token ID423
Feature activation+1.14e-02

Pos logit contributions
 ey+0.027
useum+0.026
esame+0.026
rig+0.026
iles+0.025
Neg logit contributions
 favorite-0.024
attered-0.024
 possession-0.023
 forever-0.023
 no-0.023
Token a
Token ID257
Feature activation+9.71e-03

Pos logit contributions
 favorite+0.014
 possession+0.013
attered+0.013
 favourite+0.013
 no+0.013
Neg logit contributions
 ey-0.021
rig-0.020
useum-0.020
iles-0.020
esame-0.020
Token surprise
Token ID5975
Feature activation+3.61e-03

Pos logit contributions
 favorite+0.011
 possession+0.010
attered+0.010
 favourite+0.010
 forever+0.010
Neg logit contributions
 ey-0.018
iles-0.018
esame-0.018
useum-0.017
 answers-0.017
Token for
Token ID329
Feature activation+2.77e-04

Pos logit contributions
 favorite+0.004
 forever+0.004
 possession+0.004
attered+0.004
 favourite+0.004
Neg logit contributions
 ey-0.007
rig-0.007
useum-0.007
esame-0.006
iles-0.006
Token you
Token ID345
Feature activation-1.95e-02

Pos logit contributions
 favorite+0.000
 no+0.000
 nobody+0.000
 forever+0.000
 favourite+0.000
Neg logit contributions
 ey-0.001
rig-0.001
useum-0.001
 answers-0.001
ount-0.001
Token,
Token ID11
Feature activation-1.49e-02

Pos logit contributions
 ey+0.041
rig+0.037
useum+0.036
esame+0.035
 perk+0.035
Neg logit contributions
 forever-0.022
 favorite-0.021
 no-0.021
 favourite-0.019
 possession-0.018
Token Tom
Token ID4186
Feature activation-1.56e-03

Pos logit contributions
 ey+0.025
esame+0.024
useum+0.023
iles+0.023
rig+0.023
Neg logit contributions
 favorite-0.022
 no-0.022
 forever-0.020
 nobody-0.020
 favourite-0.020
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
 ey+0.003
 answers+0.002
useum+0.002
esame+0.002
iles+0.002
Neg logit contributions
attered-0.002
 possession-0.002
 favorite-0.002
 favourite-0.002
 forever-0.002
Token We
Token ID775
Feature activation-2.02e-02

Pos logit contributions
attered+0.013
 favorite+0.012
 possession+0.012
 nobody+0.012
 favourite+0.012
Neg logit contributions
 ey-0.028
iles-0.027
 answers-0.027
rig-0.027
useum-0.026
Token have
Token ID423
Feature activation+1.61e-02

Pos logit contributions
 ey+0.031
 answers+0.030
useum+0.030
esame+0.029
iles+0.029
Neg logit contributions
attered-0.029
 favorite-0.028
 possession-0.028
 favourite-0.027
 forever-0.027
Token tickets
Token ID8587
Feature activation+7.37e-03

Pos logit contributions
 favorite+0.020
attered+0.020
 possession+0.020
 favourite+0.019
 forever+0.018
Neg logit contributions
 ey-0.028
useum-0.027
 answers-0.027
iles-0.027
rig-0.027
Token cage
Token ID17186
Feature activation-1.97e-02

Pos logit contributions
rig+0.032
esame+0.031
rimp+0.031
 respond+0.031
 explains+0.031
Neg logit contributions
 possession-0.027
 favorite-0.026
 model-0.025
 toy-0.025
attered-0.025
Token with
Token ID351
Feature activation-5.58e-03

Pos logit contributions
 ey+0.037
iles+0.037
esame+0.036
 answers+0.036
rig+0.035
Neg logit contributions
attered-0.024
 possession-0.021
 favorite-0.021
 nobody-0.019
 favourite-0.019
Token a
Token ID257
Feature activation-6.29e-03

Pos logit contributions
 ey+0.009
 answers+0.009
 say+0.009
useum+0.009
iles+0.009
Neg logit contributions
attered-0.007
 possession-0.007
 favorite-0.006
 forever-0.006
 favourite-0.006
Token big
Token ID1263
Feature activation-2.61e-02

Pos logit contributions
iles+0.012
esame+0.011
 answers+0.011
 ey+0.011
rig+0.011
Neg logit contributions
 favorite-0.007
 possession-0.007
 favourite-0.007
attered-0.007
 forever-0.007
Token,
Token ID11
Feature activation-3.43e-02

Pos logit contributions
rig+0.045
iles+0.044
esame+0.042
 answers+0.042
rimp+0.041
Neg logit contributions
 possession-0.034
 favorite-0.034
 forever-0.033
attered-0.033
 no-0.032
Token furry
Token ID40857
Feature activation-1.56e-02

Pos logit contributions
iles+0.083
esame+0.078
rimp+0.077
 ey+0.075
rig+0.075
Neg logit contributions
 favorite-0.029
 toy-0.024
 favourite-0.024
 whole-0.023
 possession-0.022
Token gorilla
Token ID45314
Feature activation-2.58e-02

Pos logit contributions
rig+0.027
esame+0.026
 answers+0.026
iles+0.026
 ey+0.026
Neg logit contributions
 favorite-0.020
 possession-0.020
attered-0.019
 favourite-0.019
 forever-0.018
Token inside
Token ID2641
Feature activation+2.18e-02

Pos logit contributions
 ey+0.048
esame+0.047
 answers+0.047
rig+0.047
useum+0.046
Neg logit contributions
attered-0.030
 possession-0.029
 favorite-0.028
 forever-0.026
 favourite-0.026
Token.
Token ID13
Feature activation+1.94e-03

Pos logit contributions
attered+0.026
 possession+0.025
 favorite+0.024
 forever+0.024
 nobody+0.024
Neg logit contributions
 ey-0.039
esame-0.038
rig-0.038
useum-0.038
iles-0.038
Token\n
Token ID198
Feature activation+6.44e-03

Pos logit contributions
 possession+0.003
attered+0.003
 forever+0.003
 favorite+0.003
 favourite+0.003
Neg logit contributions
 answers-0.003
 calls-0.003
 say-0.003
useum-0.003
 explains-0.003
Token"
Token ID1
Feature activation+3.39e-02

Pos logit contributions
attered+0.014
 possession+0.013
 favorite+0.013
 nobody+0.012
 favourite+0.012
Neg logit contributions
 say-0.006
 explains-0.006
 question-0.005
 says-0.005
useum-0.005
Token and
Token ID290
Feature activation+1.76e-02

Pos logit contributions
attered+0.012
 possession+0.011
 favorite+0.010
 favourite+0.010
 nobody+0.010
Neg logit contributions
 answers-0.023
iles-0.023
 say-0.022
 ey-0.022
esame-0.022
Token had
Token ID550
Feature activation+1.71e-02

Pos logit contributions
attered+0.026
 possession+0.024
 favorite+0.023
 favourite+0.023
 forever+0.022
Neg logit contributions
 answers-0.027
 say-0.026
 ey-0.026
iles-0.026
esame-0.025
Token lots
Token ID6041
Feature activation-2.11e-02

Pos logit contributions
attered+0.025
 possession+0.023
 favorite+0.020
 favourite+0.020
 nobody+0.019
Neg logit contributions
 answers-0.027
 say-0.026
iles-0.025
esame-0.025
 facts-0.025
Token of
Token ID286
Feature activation-2.10e-02

Pos logit contributions
 ey+0.033
 answers+0.032
esame+0.032
useum+0.032
iles+0.031
Neg logit contributions
attered-0.030
 favorite-0.030
 possession-0.029
 favourite-0.028
 forever-0.028
Token fun
Token ID1257
Feature activation-1.09e-02

Pos logit contributions
 ey+0.046
esame+0.046
useum+0.045
ount+0.044
rimp+0.044
Neg logit contributions
 favorite-0.020
 favourite-0.018
 possession-0.016
 best-0.016
 toy-0.015
Token.
Token ID13
Feature activation-3.27e-03

Pos logit contributions
iles+0.014
 answers+0.014
useum+0.014
 say+0.013
 facts+0.013
Neg logit contributions
attered-0.018
 possession-0.017
 nobody-0.017
 favorite-0.016
 favourite-0.016
Token\n
Token ID198
Feature activation-6.78e-03

Pos logit contributions
 ey+0.004
 answers+0.004
useum+0.004
iles+0.004
esame+0.004
Neg logit contributions
attered-0.005
 possession-0.005
 favorite-0.005
 favourite-0.005
 forever-0.005
Token\n
Token ID198
Feature activation+4.59e-03

Pos logit contributions
 answers+0.008
 ey+0.008
iles+0.008
useum+0.007
 say+0.007
Neg logit contributions
attered-0.012
 possession-0.012
 favorite-0.011
 favourite-0.011
 nobody-0.011
TokenOne
Token ID3198
Feature activation+5.72e-03

Pos logit contributions
attered+0.009
 possession+0.008
 favorite+0.008
 nobody+0.008
 forever+0.007
Neg logit contributions
iles-0.005
useum-0.005
 answers-0.005
rig-0.005
 ey-0.005
Token day
Token ID1110
Feature activation-7.34e-03

Pos logit contributions
attered+0.005
 possession+0.005
 favorite+0.005
 favourite+0.004
 forever+0.004
Neg logit contributions
 ey-0.012
 answers-0.012
useum-0.012
iles-0.012
esame-0.012
Token,
Token ID11
Feature activation-8.26e-03

Pos logit contributions
useum+0.007
 say+0.007
esame+0.007
iles+0.007
 answers+0.007
Neg logit contributions
attered-0.015
 possession-0.015
 favorite-0.014
 favourite-0.013
 forever-0.013
Token the
Token ID262
Feature activation-2.45e-02

Pos logit contributions
 ey+0.046
 answers+0.045
useum+0.044
iles+0.044
esame+0.044
Neg logit contributions
attered-0.046
 favorite-0.046
 possession-0.046
 favourite-0.044
 forever-0.043
Token swings
Token ID26728
Feature activation-1.31e-02

Pos logit contributions
 ey+0.016
 answers+0.016
 say+0.015
iles+0.015
useum+0.015
Neg logit contributions
attered-0.056
 possession-0.055
 favorite-0.054
 forever-0.053
 favourite-0.053
Token and
Token ID290
Feature activation-1.88e-02

Pos logit contributions
 answers+0.013
 say+0.013
iles+0.013
 ey+0.013
esame+0.013
Neg logit contributions
attered-0.025
 possession-0.025
 favorite-0.024
 favourite-0.024
 nobody-0.023
Token went
Token ID1816
Feature activation-1.64e-02

Pos logit contributions
 ey+0.026
useum+0.026
rig+0.025
 answers+0.025
iles+0.025
Neg logit contributions
 favorite-0.030
attered-0.029
 possession-0.029
 favourite-0.028
 no-0.028
Token down
Token ID866
Feature activation-1.71e-02

Pos logit contributions
 ey+0.030
 answers+0.029
useum+0.029
iles+0.029
esame+0.029
Neg logit contributions
attered-0.019
 favorite-0.018
 possession-0.018
 favourite-0.017
 forever-0.017
Token the
Token ID262
Feature activation-2.47e-02

Pos logit contributions
 answers+0.023
 ey+0.023
useum+0.022
esame+0.022
iles+0.022
Neg logit contributions
attered-0.028
 possession-0.027
 favorite-0.027
 favourite-0.026
 nobody-0.025
Token slide
Token ID10649
Feature activation+1.63e-02

Pos logit contributions
 ey+0.021
 answers+0.021
useum+0.020
iles+0.020
esame+0.020
Neg logit contributions
attered-0.052
 possession-0.051
 favorite-0.051
 favourite-0.049
 forever-0.049
Token.
Token ID13
Feature activation-5.56e-04

Pos logit contributions
 possession+0.031
attered+0.030
 favourite+0.029
Mid+0.029
 nobody+0.028
Neg logit contributions
 say-0.017
 answers-0.015
 says-0.015
 calls-0.015
 shouts-0.015
Token After
Token ID2293
Feature activation-4.70e-02

Pos logit contributions
 say+0.001
 ey+0.001
useum+0.001
 answers+0.001
 calls+0.001
Neg logit contributions
 possession-0.001
attered-0.001
 forever-0.001
 favorite-0.001
 nobody-0.001
Token they
Token ID484
Feature activation-7.48e-03

Pos logit contributions
 ey+0.083
 answers+0.081
useum+0.080
iles+0.080
rig+0.079
Neg logit contributions
attered-0.057
 favorite-0.055
 possession-0.055
 favourite-0.053
 forever-0.052
Token played
Token ID2826
Feature activation-1.27e-02

Pos logit contributions
 ey+0.012
 answers+0.012
useum+0.012
iles+0.012
esame+0.011
Neg logit contributions
attered-0.010
 favorite-0.010
 possession-0.010
 favourite-0.009
 forever-0.009
Token of
Token ID286
Feature activation-4.01e-02

Pos logit contributions
 answers+0.024
 ey+0.024
 say+0.023
esame+0.023
useum+0.023
Neg logit contributions
attered-0.033
 possession-0.033
 favorite-0.032
 favourite-0.031
 nobody-0.030
Token honey
Token ID12498
Feature activation-1.47e-02

Pos logit contributions
iles+0.068
rig+0.067
useum+0.066
 ey+0.063
esame+0.063
Neg logit contributions
 favorite-0.055
 favourite-0.052
 possession-0.051
attered-0.049
 no-0.049
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
 ey+0.025
useum+0.024
iles+0.024
 answers+0.024
rig+0.024
Neg logit contributions
attered-0.019
 favorite-0.019
 possession-0.018
 forever-0.018
 favourite-0.017
Token She
Token ID1375
Feature activation-1.45e-03

Pos logit contributions
attered+0.016
 possession+0.015
 favorite+0.015
 favourite+0.015
 forever+0.014
Neg logit contributions
 ey-0.019
 answers-0.018
iles-0.018
useum-0.018
esame-0.018
Token poured
Token ID19036
Feature activation-2.44e-02

Pos logit contributions
 ey+0.002
useum+0.002
esame+0.002
iles+0.002
rig+0.002
Neg logit contributions
 favorite-0.002
attered-0.002
 favourite-0.002
 no-0.002
 possession-0.002
Token some
Token ID617
Feature activation-5.02e-02

Pos logit contributions
 ey+0.044
useum+0.043
rig+0.043
iles+0.042
 answers+0.042
Neg logit contributions
attered-0.029
 favorite-0.029
 possession-0.027
 favourite-0.027
 no-0.027
Token honey
Token ID12498
Feature activation-2.69e-02

Pos logit contributions
iles+0.080
rig+0.079
 ey+0.079
useum+0.078
rimp+0.075
Neg logit contributions
 favorite-0.073
 favourite-0.070
attered-0.069
 possession-0.064
 toy-0.063
Token over
Token ID625
Feature activation-6.52e-03

Pos logit contributions
 ey+0.046
iles+0.045
useum+0.045
 answers+0.044
rig+0.043
Neg logit contributions
attered-0.035
 favorite-0.034
 forever-0.033
 favourite-0.033
 possession-0.032
Token the
Token ID262
Feature activation-2.62e-02

Pos logit contributions
 ey+0.011
 answers+0.011
useum+0.010
iles+0.010
esame+0.010
Neg logit contributions
attered-0.009
 possession-0.008
 favorite-0.008
 favourite-0.008
 forever-0.008
Token mint
Token ID21043
Feature activation-4.38e-02

Pos logit contributions
iles+0.048
rig+0.046
 explains+0.044
rimp+0.043
ount+0.042
Neg logit contributions
 favorite-0.035
 favourite-0.032
 toy-0.032
 whole-0.032
attered-0.032
Token leaves
Token ID5667
Feature activation-2.30e-02

Pos logit contributions
 ey+0.068
iles+0.066
useum+0.066
 answers+0.065
rig+0.065
Neg logit contributions
attered-0.064
 favorite-0.062
 possession-0.060
 favourite-0.059
 forever-0.057
TokenSure
Token ID19457
Feature activation+4.68e-03

Pos logit contributions
attered+0.031
 possession+0.030
 favorite+0.030
 forever+0.029
 favourite+0.029
Neg logit contributions
 ey-0.027
 answers-0.027
 say-0.027
rig-0.027
useum-0.027
Token,
Token ID11
Feature activation-1.26e-02

Pos logit contributions
attered+0.008
 possession+0.008
 favorite+0.008
 favourite+0.007
 nobody+0.007
Neg logit contributions
 answers-0.006
 ey-0.006
useum-0.006
 say-0.006
esame-0.006
Token I
Token ID314
Feature activation-1.92e-02

Pos logit contributions
 ey+0.022
 answers+0.021
iles+0.021
useum+0.021
esame+0.021
Neg logit contributions
attered-0.016
 favorite-0.016
 possession-0.015
 favourite-0.015
 forever-0.015
Token will
Token ID481
Feature activation-1.41e-02

Pos logit contributions
 answers+0.030
 ey+0.030
 say+0.030
useum+0.030
esame+0.029
Neg logit contributions
attered-0.026
 possession-0.026
 favorite-0.025
 favourite-0.024
 nobody-0.024
Token add
Token ID751
Feature activation-2.20e-02

Pos logit contributions
 ey+0.020
iles+0.020
useum+0.019
 answers+0.019
rig+0.019
Neg logit contributions
attered-0.022
 favorite-0.021
 possession-0.021
 forever-0.021
 favourite-0.021
Token more
Token ID517
Feature activation-4.66e-02

Pos logit contributions
rig+0.045
 ey+0.045
useum+0.044
ount+0.044
iles+0.043
Neg logit contributions
 favorite-0.022
 no-0.020
 favourite-0.020
 forever-0.020
 possession-0.019
Token water
Token ID1660
Feature activation-2.48e-02

Pos logit contributions
ount+0.081
utt+0.079
useum+0.078
 slam+0.077
esame+0.076
Neg logit contributions
 favorite-0.064
 favourite-0.059
 toy-0.058
 possession-0.057
 forever-0.054
Token and
Token ID290
Feature activation-6.01e-03

Pos logit contributions
 ey+0.049
useum+0.046
rig+0.045
 explains+0.044
iles+0.044
Neg logit contributions
 favorite-0.027
attered-0.026
 forever-0.025
 possession-0.025
 favourite-0.024
Token then
Token ID788
Feature activation-1.66e-02

Pos logit contributions
iles+0.010
 ey+0.010
useum+0.010
ount+0.010
rig+0.010
Neg logit contributions
 favorite-0.008
 no-0.008
 favourite-0.008
 nobody-0.008
 forever-0.007
Token fold
Token ID5591
Feature activation+1.19e-02

Pos logit contributions
 ey+0.032
iles+0.031
rig+0.031
useum+0.030
 answers+0.030
Neg logit contributions
 favorite-0.018
 no-0.018
 forever-0.017
 nobody-0.017
 favourite-0.017
Token the
Token ID262
Feature activation-2.13e-02

Pos logit contributions
attered+0.013
 favorite+0.013
 possession+0.012
 favourite+0.012
 forever+0.011
Neg logit contributions
 ey-0.022
useum-0.022
iles-0.022
 answers-0.021
rig-0.021
Token.
Token ID13
Feature activation-9.28e-03

Pos logit contributions
attered+0.005
 favorite+0.005
 forever+0.005
 nobody+0.005
 no+0.005
Neg logit contributions
 ey-0.011
esame-0.010
useum-0.010
rig-0.010
 answers-0.010
Token He
Token ID679
Feature activation-1.09e-02

Pos logit contributions
iles+0.017
 ey+0.017
rimp+0.016
aches+0.016
esame+0.016
Neg logit contributions
 favorite-0.011
attered-0.010
 favourite-0.010
 nobody-0.010
 possession-0.009
Token also
Token ID635
Feature activation+5.22e-03

Pos logit contributions
esame+0.023
 ey+0.023
useum+0.022
rig+0.022
ount+0.022
Neg logit contributions
 favorite-0.011
 nobody-0.010
 favourite-0.010
 no-0.010
attered-0.010
Token saw
Token ID2497
Feature activation-2.19e-02

Pos logit contributions
attered+0.006
 favorite+0.006
 favourite+0.006
 no+0.006
 nobody+0.006
Neg logit contributions
esame-0.010
 ey-0.010
iles-0.010
useum-0.009
rig-0.009
Token a
Token ID257
Feature activation-1.51e-04

Pos logit contributions
 ey+0.045
rig+0.044
useum+0.044
esame+0.043
 answers+0.042
Neg logit contributions
 favorite-0.022
 nobody-0.021
 no-0.020
 favourite-0.020
 possession-0.020
Token lot
Token ID1256
Feature activation-4.84e-02

Pos logit contributions
iles+0.000
esame+0.000
 ey+0.000
 answers+0.000
useum+0.000
Neg logit contributions
 favorite-0.000
 possession-0.000
 favourite-0.000
attered-0.000
 forever-0.000
Token of
Token ID286
Feature activation-4.01e-02

Pos logit contributions
esame+0.096
useum+0.092
 ey+0.092
rig+0.091
 say+0.090
Neg logit contributions
 possession-0.046
 favorite-0.046
attered-0.045
 toy-0.044
 favourite-0.042
Token things
Token ID1243
Feature activation-2.66e-02

Pos logit contributions
esame+0.083
rig+0.079
utt+0.077
rimp+0.076
useum+0.075
Neg logit contributions
 favorite-0.044
 toy-0.042
 favourite-0.041
 possession-0.037
 best-0.037
Token he
Token ID339
Feature activation-1.66e-02

Pos logit contributions
 ey+0.057
esame+0.056
rig+0.055
useum+0.055
iles+0.054
Neg logit contributions
 favorite-0.023
 forever-0.023
attered-0.022
 no-0.022
 favourite-0.021
Token didn
Token ID1422
Feature activation-6.44e-03

Pos logit contributions
esame+0.037
 ey+0.036
useum+0.035
rig+0.033
 perk+0.033
Neg logit contributions
 favorite-0.016
 favourite-0.015
attered-0.013
 nobody-0.013
 no-0.013
Token't
Token ID470
Feature activation-2.92e-02

Pos logit contributions
 answers+0.012
 ey+0.012
useum+0.012
esame+0.012
iles+0.012
Neg logit contributions
attered-0.007
 possession-0.006
 favorite-0.006
 favourite-0.006
 forever-0.006
Token playing
Token ID2712
Feature activation-3.12e-03

Pos logit contributions
 answers+0.011
 say+0.011
iles+0.010
 ey+0.010
 calling+0.010
Neg logit contributions
 possession-0.012
attered-0.012
 favorite-0.011
 favourite-0.011
 forever-0.011
Token tag
Token ID7621
Feature activation-4.31e-03

Pos logit contributions
 answers+0.004
 say+0.004
iles+0.004
 ey+0.004
esame+0.004
Neg logit contributions
attered-0.006
 possession-0.005
 favorite-0.005
 favourite-0.005
 nobody-0.005
Token and
Token ID290
Feature activation-3.81e-03

Pos logit contributions
 ey+0.006
 answers+0.006
useum+0.006
esame+0.005
iles+0.005
Neg logit contributions
attered-0.007
 favorite-0.007
 possession-0.007
 favourite-0.007
 forever-0.007
Token having
Token ID1719
Feature activation-6.74e-03

Pos logit contributions
 ey+0.007
rig+0.007
useum+0.007
esame+0.006
iles+0.006
Neg logit contributions
 no-0.005
 favorite-0.005
 nobody-0.005
 favourite-0.005
 forever-0.005
Token so
Token ID523
Feature activation-1.10e-02

Pos logit contributions
 answers+0.011
 ey+0.011
useum+0.011
iles+0.010
 say+0.010
Neg logit contributions
attered-0.009
 possession-0.009
 favorite-0.008
 favourite-0.008
 nobody-0.008
Token much
Token ID881
Feature activation-4.86e-02

Pos logit contributions
iles+0.019
useum+0.019
 facts+0.018
rig+0.018
 answers+0.018
Neg logit contributions
attered-0.013
 possession-0.012
 favourite-0.011
 favorite-0.011
 nobody-0.011
Token fun
Token ID1257
Feature activation-1.25e-02

Pos logit contributions
 ey+0.112
iles+0.106
esame+0.106
rimp+0.104
rig+0.103
Neg logit contributions
 favorite-0.037
 forever-0.031
 favourite-0.031
 possession-0.031
 toy-0.028
Token.
Token ID13
Feature activation+5.06e-03

Pos logit contributions
 answers+0.013
iles+0.013
useum+0.013
 calls+0.013
 say+0.013
Neg logit contributions
attered-0.024
 favorite-0.022
 possession-0.022
 favourite-0.021
 nobody-0.021
Token Suddenly
Token ID24975
Feature activation-3.90e-03

Pos logit contributions
 possession+0.009
 forever+0.008
attered+0.008
 nobody+0.007
 no+0.007
Neg logit contributions
useum-0.007
 calls-0.006
 answers-0.006
 say-0.006
 ey-0.006
Token,
Token ID11
Feature activation-5.18e-03

Pos logit contributions
esame+0.004
useum+0.004
 answers+0.004
 say+0.004
 ey+0.004
Neg logit contributions
attered-0.008
 possession-0.008
 favorite-0.007
 favourite-0.007
 forever-0.007
Token Lily
Token ID20037
Feature activation+1.19e-02

Pos logit contributions
 say+0.006
 Jen+0.006
useum+0.005
esame+0.005
 answers+0.005
Neg logit contributions
attered-0.010
 possession-0.010
 forever-0.008
 dearly-0.008
eddy-0.008
Token night
Token ID1755
Feature activation+2.36e-03

Pos logit contributions
attered+0.014
 possession+0.013
 favorite+0.013
 favourite+0.012
 forever+0.012
Neg logit contributions
 answers-0.024
 ey-0.024
iles-0.024
 say-0.023
esame-0.023
Token and
Token ID290
Feature activation+1.34e-02

Pos logit contributions
attered+0.003
 favorite+0.003
 no+0.003
 nobody+0.003
 possession+0.003
Neg logit contributions
 ey-0.004
iles-0.004
useum-0.004
rig-0.004
 explains-0.004
Token wondered
Token ID14028
Feature activation+1.59e-03

Pos logit contributions
 favorite+0.015
 nobody+0.015
 no+0.015
 favourite+0.014
 possession+0.014
Neg logit contributions
useum-0.026
 ey-0.026
rig-0.025
esame-0.025
iles-0.025
Token what
Token ID644
Feature activation+1.83e-02

Pos logit contributions
 favorite+0.003
 forever+0.003
 nobody+0.003
 possession+0.003
 no+0.002
Neg logit contributions
 ey-0.002
useum-0.002
rig-0.002
iles-0.002
esame-0.002
Token it
Token ID340
Feature activation-2.13e-02

Pos logit contributions
 possession+0.019
attered+0.019
 favorite+0.019
 favourite+0.017
 nobody+0.017
Neg logit contributions
 ey-0.035
useum-0.034
esame-0.034
 answers-0.034
rig-0.034
Token was
Token ID373
Feature activation-4.16e-02

Pos logit contributions
 ey+0.051
esame+0.050
useum+0.048
rimp+0.048
 hesitation+0.047
Neg logit contributions
 favorite-0.017
 nobody-0.015
 no-0.014
 favourite-0.013
attered-0.012
Token like
Token ID588
Feature activation-6.58e-03

Pos logit contributions
 ey+0.090
esame+0.084
useum+0.083
rig+0.082
iles+0.082
Neg logit contributions
 favorite-0.043
 forever-0.041
 no-0.037
 favourite-0.036
 possession-0.036
Token up
Token ID510
Feature activation+3.03e-03

Pos logit contributions
 ey+0.012
rig+0.011
useum+0.011
esame+0.011
iles+0.011
Neg logit contributions
 favorite-0.008
 possession-0.008
 forever-0.007
attered-0.007
 favourite-0.007
Token so
Token ID523
Feature activation+3.60e-03

Pos logit contributions
attered+0.004
 possession+0.003
 favorite+0.003
 forever+0.003
 favourite+0.003
Neg logit contributions
 ey-0.006
useum-0.005
esame-0.005
 answers-0.005
iles-0.005
Token high
Token ID1029
Feature activation-7.66e-03

Pos logit contributions
attered+0.004
 possession+0.004
 favorite+0.004
 favourite+0.004
 forever+0.004
Neg logit contributions
 ey-0.007
 answers-0.006
useum-0.006
iles-0.006
esame-0.006
Token.
Token ID13
Feature activation-2.99e-03

Pos logit contributions
 ey+0.013
useum+0.013
 answers+0.013
esame+0.013
rig+0.013
Neg logit contributions
attered-0.010
 possession-0.009
 favorite-0.009
 favourite-0.009
 forever-0.009