TOP ACTIVATIONS
MAX = 0.097

to the beach with Mom and Dad . <|endoftext|> L ily
sorry . Mom and dad get up from the
. " Mom and Dad got us a kitten
the concert tonight . Mom and dad say yes . They
they are sorry to Mom and Dad . They say they
them . Mom and dad smiled and hugged Lily
sorry . Mom and dad hug them and say
together . Mom and dad are angry and worried
dad . Mom and dad hear Tom and Mia
what happened . Their mom and dad are proud of Ben
trouble . Mom and dad realized they were in
them what happened . Mom and Dad were very upset and
. They cried for Mom and Dad . But Mom and
rain that way , mom and dad . The rain is
anymore . Mom and Dad kissed them and said
together . Mom and Dad smile and look at
now ?" Mom and dad looked at their plates
. " Mom and dad are here !" Tim
scared and cold . Mom and Dad hug them and say
them what happened . Mom and dad are sorry and give

MIN ACTIVATIONS
MIN = -0.083

toys and games for his 3 year old daughter to enjoy
day , she taught a 3 year old how to count
roof together . It had 3 corners , 7 sides and
daddy . They had a 3 year old girl who was
same house . They were 3 years old and both very
called Isabel . Isabel was 3 years old . One day
all he wanted for his 3 rd birthday was to try
la . They were both 3 years old and were always
special day , she had three puzzles that she wanted to
Ellen . They were both 3 years old and were always
called Paul . Paul was 3 years old and was very
named Tom . Tom was 3 years old and wanted to
fight . They were both 3 years old and were very
Joe . They were both 3 years old , and they
Sil via . She was 3 years old and loved to
baby . The baby was 3 years old and was very
a daddy who had a 3 year old child . Each
a lot of money and three kids .
called Zoe . Zoe was 3 years old and love playing
named Maria . She was 3 years old and she loved

INTERVAL 0.052 - 0.097
CONTAINS 0.203%

a sad sound . Sara and Ben stop playing . They
!" M um and Dad hugged each other and
said , " Thank you , mom and dad . I
the beautiful sight . Tim and his friends were happy to
died . John and Grace were so sad .

INTERVAL 0.007 - 0.052
CONTAINS 31.508%

an avocado , mum ?' Her mum said ' Ok ,
She followed the carrots for a really long time
family smiled and hugged each other , and they shouted with
do it , rabbit !" and " Don 't give up
to Ben . She cried . She was sorry . She

INTERVAL -0.038 - 0.007
CONTAINS 67.198%

of a cone and a scoop of cream on it .
a hat , and a leg . " A
ay , so Jon had to be very careful .
to be healthy too ! Maybe we can exercise together and
, Kitty ! Here is some milk for you ." Kitty

INTERVAL -0.083 - -0.038
CONTAINS 1.091%

and disagreed on who should do certain things . One day
mom asked . " How did you get a dinosaur in
. His favorite thing to do when he was feeling this
sad but was filled with an incredible sense of content ment
of fun . <|endoftext|> Once upon a time , there was
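
The four INTERVAL bands above appear to be equal-width slices of the range between MIN = -0.083 and MAX = 0.097. A minimal Python sketch of that arithmetic follows, assuming equal-width binning; the function name `interval_edges` is hypothetical and is not part of whatever tool produced this dump.

```python
def interval_edges(lo, hi, n_bins):
    """Return n_bins + 1 equally spaced edges from lo to hi, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]

# MIN and MAX as reported in this dump; four bands is an assumption read off the listing.
edges = interval_edges(-0.083, 0.097, 4)

# Pair consecutive edges into (low, high) bands, highest band first as in the listing.
bands = list(zip(edges[:-1], edges[1:]))[::-1]

# CONTAINS percentages as listed, highest band first.
shares = [0.203, 31.508, 67.198, 1.091]
total = sum(shares)
```

Under that assumption, `bands` matches the four listed intervals and `total` comes to about 100, which suggests the bands together cover all sampled tokens.
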
Token to
Token ID284
Feature activation+1.34e-02

Pos logit contributions
rying+0.053
 least+0.050
 forth+0.048
ipping+0.046
ter+0.046
Neg logit contributions
ves-0.046
kered-0.044
ggy-0.039
 lesson-0.037
ork-0.035
Token the
Token ID262
Feature activation+3.33e-02

Pos logit contributions
rying+0.028
 least+0.027
 forth+0.026
ipping+0.026
ter+0.025
Neg logit contributions
ves-0.021
kered-0.019
ggy-0.017
 lesson-0.017
ork-0.016
Token beach
Token ID10481
Feature activation+1.17e-02

Pos logit contributions
rying+0.047
ipping+0.043
umper+0.041
used+0.041
 forth+0.041
Neg logit contributions
ves-0.073
 lesson-0.069
kered-0.068
ggy-0.066
ork-0.063
Token with
Token ID351
Feature activation+1.18e-02

Pos logit contributions
rying+0.025
 least+0.023
 forth+0.023
ter+0.021
ipping+0.021
Neg logit contributions
ves-0.019
kered-0.018
ggy-0.016
 lesson-0.015
ork-0.014
Token Mom
Token ID11254
Feature activation+2.34e-02

Pos logit contributions
 least+0.021
rying+0.019
ter+0.019
 forth+0.018
ory+0.018
Neg logit contributions
ves-0.022
kered-0.021
ggy-0.019
ork-0.018
 lesson-0.018
Token and
Token ID290
Feature activation+9.52e-02

Pos logit contributions
rying+0.053
 least+0.050
 forth+0.049
ter+0.048
ipping+0.048
Neg logit contributions
ves-0.033
kered-0.031
ggy-0.027
 lesson-0.026
ork-0.025
Token Dad
Token ID17415
Feature activation+2.49e-02

Pos logit contributions
rying+0.188
ipping+0.180
oming+0.166
ologist+0.166
 forth+0.165
Neg logit contributions
 lesson-0.138
ves-0.132
kered-0.128
 teeth-0.125
 was-0.124
Token.
Token ID13
Feature activation-4.66e-03

Pos logit contributions
rying+0.058
 least+0.055
 forth+0.054
ipping+0.053
ter+0.052
Neg logit contributions
ves-0.033
kered-0.030
ggy-0.027
 lesson-0.026
ork-0.024
Token<|endoftext|>
Token ID50256
Feature activation+4.48e-02

Pos logit contributions
ves+0.008
kered+0.007
ggy+0.006
 lesson+0.006
ork+0.006
Neg logit contributions
rying-0.009
 least-0.009
ancer-0.008
ter-0.008
 forth-0.008
TokenL
Token ID43
Feature activation+2.29e-03

Pos logit contributions
rying+0.057
 least+0.052
 forth+0.051
ipping+0.048
ter+0.046
Neg logit contributions
ves-0.106
kered-0.102
ggy-0.095
 lesson-0.093
ork-0.090
Tokenily
Token ID813
Feature activation-1.92e-02

Pos logit contributions
rying+0.004
 least+0.004
umper+0.004
ory+0.004
ter+0.003
Neg logit contributions
ves-0.004
kered-0.004
ggy-0.004
 lesson-0.004
 teeth-0.003
Token sorry
Token ID7926
Feature activation+3.73e-03

Pos logit contributions
rying+0.006
 least+0.006
 forth+0.006
ipping+0.006
ter+0.006
Neg logit contributions
ves-0.006
kered-0.006
ggy-0.005
 lesson-0.005
ork-0.005
Token.
Token ID13
Feature activation-2.09e-02

Pos logit contributions
rying+0.008
 least+0.008
 forth+0.008
ter+0.007
ipping+0.007
Neg logit contributions
ves-0.006
kered-0.005
ggy-0.005
 lesson-0.004
ork-0.004
Token\n
Token ID198
Feature activation-1.12e-02

Pos logit contributions
ves+0.036
kered+0.034
ggy+0.030
 lesson+0.028
ork+0.028
Neg logit contributions
rying-0.037
 least-0.037
ter-0.035
 gentlemen-0.035
 forth-0.034
Token\n
Token ID198
Feature activation+8.38e-03

Pos logit contributions
ves+0.019
kered+0.019
ggy+0.017
 lesson+0.016
ork+0.016
Neg logit contributions
rying-0.019
 least-0.018
oming-0.017
 forth-0.017
umper-0.017
TokenMom
Token ID29252
Feature activation+4.50e-05

Pos logit contributions
rying+0.018
 least+0.017
 forth+0.017
ipping+0.016
ter+0.016
Neg logit contributions
ves-0.014
kered-0.013
ggy-0.012
 lesson-0.011
ork-0.011
Token and
Token ID290
Feature activation+9.34e-02

Pos logit contributions
rying+0.000
 least+0.000
 forth+0.000
ipping+0.000
ter+0.000
Neg logit contributions
ves-0.000
kered-0.000
ggy-0.000
 lesson-0.000
ork-0.000
Token dad
Token ID9955
Feature activation+8.43e-03

Pos logit contributions
rying+0.217
oming+0.203
ipping+0.202
ologist+0.201
ode+0.194
Neg logit contributions
 teeth-0.109
 words-0.098
 lesson-0.098
kered-0.093
 Anna-0.091
Token get
Token ID651
Feature activation+2.25e-02

Pos logit contributions
rying+0.019
ter+0.017
ancer+0.017
umper+0.017
 forth+0.017
Neg logit contributions
ves-0.011
kered-0.011
ggy-0.009
 lesson-0.009
 words-0.009
Token up
Token ID510
Feature activation+4.27e-03

Pos logit contributions
rying+0.047
 least+0.045
 forth+0.044
ipping+0.043
ter+0.042
Neg logit contributions
ves-0.034
kered-0.032
ggy-0.029
 lesson-0.029
ork-0.026
Token from
Token ID422
Feature activation+2.13e-02

Pos logit contributions
rying+0.009
 least+0.008
 forth+0.008
ipping+0.008
ter+0.008
Neg logit contributions
ves-0.007
kered-0.007
ggy-0.006
 lesson-0.005
ork-0.005
Token the
Token ID262
Feature activation+2.44e-02

Pos logit contributions
rying+0.053
ologist+0.052
umper+0.052
ode+0.051
ancer+0.051
Neg logit contributions
ves-0.021
kered-0.019
 lesson-0.017
ggy-0.017
 time-0.015
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
rying+0.025
 least+0.024
 forth+0.023
ipping+0.023
ter+0.022
Neg logit contributions
ves-0.014
kered-0.013
ggy-0.011
 lesson-0.011
ork-0.010
Token\n
Token ID198
Feature activation-1.07e-02

Pos logit contributions
ves+0.018
kered+0.017
ggy+0.016
 lesson+0.015
ggie+0.014
Neg logit contributions
 least-0.017
rying-0.017
oming-0.017
ancer-0.017
 gentlemen-0.017
Token\n
Token ID198
Feature activation+1.29e-02

Pos logit contributions
ves+0.019
kered+0.018
 lesson+0.016
ggy+0.016
ork+0.015
Neg logit contributions
rying-0.018
 least-0.017
umper-0.016
oming-0.016
 forth-0.016
Token"
Token ID1
Feature activation+1.83e-02

Pos logit contributions
rying+0.029
 least+0.027
 forth+0.026
ipping+0.026
ancer+0.025
Neg logit contributions
ves-0.020
kered-0.018
ggy-0.017
 lesson-0.015
ork-0.015
TokenMom
Token ID29252
Feature activation-1.82e-03

Pos logit contributions
rying+0.024
 least+0.023
ter+0.022
 forth+0.022
ipping+0.021
Neg logit contributions
ves-0.041
kered-0.039
ggy-0.036
 lesson-0.035
ork-0.035
Token and
Token ID290
Feature activation+9.27e-02

Pos logit contributions
ves+0.004
kered+0.003
 lesson+0.003
ggy+0.003
ork+0.003
Neg logit contributions
rying-0.003
 least-0.003
 forth-0.003
ter-0.003
ipping-0.003
Token Dad
Token ID17415
Feature activation-5.30e-03

Pos logit contributions
rying+0.201
ologist+0.188
ipping+0.182
 least+0.181
oise+0.178
Neg logit contributions
 teeth-0.128
 lesson-0.120
ves-0.118
 words-0.112
 turns-0.110
Token got
Token ID1392
Feature activation+1.27e-02

Pos logit contributions
ves+0.010
kered+0.009
 lesson+0.008
ork+0.008
ggy+0.008
Neg logit contributions
rying-0.010
 least-0.010
 forth-0.010
ter-0.009
ipping-0.009
Token us
Token ID514
Feature activation+7.86e-03

Pos logit contributions
rying+0.022
 least+0.021
ipping+0.020
 forth+0.020
umper+0.019
Neg logit contributions
ves-0.023
kered-0.022
ggy-0.021
 lesson-0.020
ork-0.019
Token a
Token ID257
Feature activation-5.80e-03

Pos logit contributions
rying+0.014
 least+0.013
 forth+0.012
ipping+0.012
ter+0.012
Neg logit contributions
ves-0.015
kered-0.014
ggy-0.013
 lesson-0.013
ork-0.012
Token kitten
Token ID42143
Feature activation+1.44e-02

Pos logit contributions
ves+0.010
kered+0.010
ggy+0.008
 lesson+0.008
ork+0.008
Neg logit contributions
rying-0.011
 least-0.010
 forth-0.010
ipping-0.010
ter-0.010
Token the
Token ID262
Feature activation+1.35e-02

Pos logit contributions
rying+0.004
 least+0.004
 forth+0.003
ter+0.003
ipping+0.003
Neg logit contributions
ves-0.003
kered-0.003
ggy-0.002
 lesson-0.002
ork-0.002
Token concert
Token ID10010
Feature activation-1.28e-02

Pos logit contributions
rying+0.026
 least+0.024
 forth+0.023
umper+0.023
ipping+0.022
Neg logit contributions
ves-0.025
kered-0.023
 lesson-0.021
ggy-0.021
ork-0.019
Token tonight
Token ID9975
Feature activation-4.08e-03

Pos logit contributions
kered+0.021
ves+0.021
ggy+0.018
ork+0.018
 lesson+0.018
Neg logit contributions
rying-0.024
 least-0.023
 forth-0.023
ter-0.022
ancer-0.021
Token.
Token ID13
Feature activation+2.08e-04

Pos logit contributions
kered+0.008
ves+0.008
ggy+0.007
 lesson+0.007
ork+0.006
Neg logit contributions
rying-0.006
 least-0.006
 forth-0.006
ter-0.005
ancer-0.005
Token Mom
Token ID11254
Feature activation-1.83e-02

Pos logit contributions
rying+0.000
 least+0.000
 gentlemen+0.000
ancer+0.000
 forth+0.000
Neg logit contributions
ves-0.000
kered-0.000
ggy-0.000
 lesson-0.000
ork-0.000
Token and
Token ID290
Feature activation+9.15e-02

Pos logit contributions
ves+0.024
kered+0.023
ggy+0.020
 lesson+0.019
ork+0.018
Neg logit contributions
rying-0.042
 least-0.041
 forth-0.040
ipping-0.039
ter-0.038
Token dad
Token ID9955
Feature activation+7.68e-03

Pos logit contributions
rying+0.205
ipping+0.185
ode+0.184
ologist+0.184
oming+0.183
Neg logit contributions
 lesson-0.112
ves-0.112
 words-0.107
 teeth-0.105
kered-0.104
Token say
Token ID910
Feature activation+2.36e-03

Pos logit contributions
rying+0.015
 least+0.014
 forth+0.014
ipping+0.013
ter+0.013
Neg logit contributions
ves-0.013
kered-0.012
ggy-0.011
 lesson-0.011
ork-0.010
Token yes
Token ID3763
Feature activation-5.69e-03

Pos logit contributions
rying+0.003
 least+0.003
ter+0.003
ipping+0.003
 forth+0.003
Neg logit contributions
ves-0.005
kered-0.005
ggy-0.005
ork-0.005
 lesson-0.005
Token.
Token ID13
Feature activation-6.24e-03

Pos logit contributions
kered+0.011
ves+0.011
ggy+0.009
 teeth+0.009
ork+0.008
Neg logit contributions
rying-0.009
 least-0.009
 forth-0.009
ory-0.008
ipping-0.008
Token They
Token ID1119
Feature activation+1.36e-02

Pos logit contributions
ves+0.009
kered+0.009
ggy+0.008
 lesson+0.007
ork+0.007
Neg logit contributions
rying-0.013
 least-0.013
 forth-0.012
ipping-0.012
ter-0.012
Token they
Token ID484
Feature activation+3.25e-03

Pos logit contributions
rying+0.030
 least+0.028
 forth+0.028
ter+0.027
ipping+0.027
Neg logit contributions
ves-0.037
kered-0.035
ggy-0.032
 lesson-0.031
ork-0.030
Token are
Token ID389
Feature activation+1.99e-02

Pos logit contributions
rying+0.006
 least+0.006
 forth+0.006
ipping+0.006
umper+0.006
Neg logit contributions
ves-0.005
kered-0.005
ggy-0.005
 lesson-0.004
ork-0.004
Token sorry
Token ID7926
Feature activation+1.04e-02

Pos logit contributions
rying+0.042
ipping+0.039
 forth+0.038
used+0.038
ancer+0.037
Neg logit contributions
ggy-0.030
ves-0.029
kered-0.028
 lesson-0.026
 teaches-0.025
Token to
Token ID284
Feature activation+2.23e-02

Pos logit contributions
rying+0.026
 least+0.025
 forth+0.024
umper+0.024
ipping+0.024
Neg logit contributions
ves-0.012
kered-0.011
 lesson-0.010
ggy-0.010
 words-0.009
Token Mom
Token ID11254
Feature activation-3.38e-03

Pos logit contributions
rying+0.048
ipping+0.044
 least+0.043
 forth+0.043
oming+0.042
Neg logit contributions
ves-0.034
kered-0.031
ggy-0.029
 lesson-0.028
ork-0.026
Token and
Token ID290
Feature activation+9.13e-02

Pos logit contributions
ves+0.006
kered+0.005
ggy+0.005
ork+0.005
 lesson+0.004
Neg logit contributions
rying-0.007
 least-0.006
 forth-0.006
ter-0.006
ipping-0.006
Token Dad
Token ID17415
Feature activation+2.20e-03

Pos logit contributions
ipping+0.173
rying+0.170
oming+0.161
ologist+0.159
umper+0.154
Neg logit contributions
 lesson-0.153
kered-0.134
 teaches-0.133
ves-0.132
 words-0.129
Token.
Token ID13
Feature activation-1.16e-02

Pos logit contributions
rying+0.005
 least+0.005
 forth+0.005
ipping+0.004
ter+0.004
Neg logit contributions
ves-0.003
kered-0.003
ggy-0.003
 lesson-0.002
ork-0.002
Token They
Token ID1119
Feature activation+1.70e-02

Pos logit contributions
ves+0.020
kered+0.018
ggy+0.017
ork+0.015
ggie+0.015
Neg logit contributions
rying-0.022
 least-0.022
 forth-0.021
ter-0.020
 gentlemen-0.020
Token say
Token ID910
Feature activation+1.60e-02

Pos logit contributions
rying+0.033
 least+0.030
 forth+0.030
ipping+0.029
ter+0.029
Neg logit contributions
ves-0.029
kered-0.028
ggy-0.025
 lesson-0.025
ork-0.023
Token they
Token ID484
Feature activation+5.53e-03

Pos logit contributions
rying+0.028
 least+0.026
 forth+0.026
ipping+0.025
ter+0.025
Neg logit contributions
ves-0.030
kered-0.029
ggy-0.026
 lesson-0.026
ork-0.025
Token them
Token ID606
Feature activation+6.48e-03

Pos logit contributions
kered+0.005
ves+0.005
ggy+0.004
ork+0.004
 lesson+0.004
Neg logit contributions
rying-0.004
 least-0.004
 forth-0.003
ter-0.003
 surfing-0.003
Token.
Token ID13
Feature activation-8.76e-03

Pos logit contributions
rying+0.013
 least+0.012
 forth+0.012
ipping+0.011
ter+0.011
Neg logit contributions
ves-0.011
kered-0.010
ggy-0.009
 lesson-0.009
ork-0.009
Token\n
Token ID198
Feature activation-1.47e-02

Pos logit contributions
ves+0.014
kered+0.013
ggy+0.012
 lesson+0.011
ork+0.011
Neg logit contributions
rying-0.017
 least-0.017
 forth-0.016
 gentlemen-0.016
ter-0.016
Token\n
Token ID198
Feature activation+7.08e-03

Pos logit contributions
kered+0.026
ves+0.025
ggy+0.022
 lesson+0.021
 teeth+0.021
Neg logit contributions
rying-0.024
 least-0.024
oming-0.022
umper-0.022
 forth-0.022
TokenMom
Token ID29252
Feature activation-1.22e-02

Pos logit contributions
rying+0.015
 least+0.014
 forth+0.013
ipping+0.013
ter+0.013
Neg logit contributions
ves-0.012
kered-0.011
ggy-0.010
 lesson-0.010
ork-0.009
Token and
Token ID290
Feature activation+9.13e-02

Pos logit contributions
ves+0.018
kered+0.017
ggy+0.015
 lesson+0.015
ork+0.014
Neg logit contributions
rying-0.026
 least-0.025
 forth-0.024
ipping-0.024
ter-0.024
Token dad
Token ID9955
Feature activation-1.17e-03

Pos logit contributions
rying+0.201
ipping+0.188
used+0.183
 least+0.182
ode+0.182
Neg logit contributions
ves-0.106
 lesson-0.105
 teeth-0.102
 words-0.102
kered-0.101
Token smiled
Token ID13541
Feature activation+1.01e-02

Pos logit contributions
ves+0.002
kered+0.002
ggy+0.001
 lesson+0.001
 teaches+0.001
Neg logit contributions
rying-0.003
 least-0.002
 forth-0.002
ipping-0.002
used-0.002
Token and
Token ID290
Feature activation+1.40e-02

Pos logit contributions
rying+0.026
 least+0.025
 forth+0.024
ipping+0.024
umper+0.024
Neg logit contributions
kered-0.010
ves-0.010
 lesson-0.009
ggy-0.008
 words-0.008
Token hugged
Token ID41130
Feature activation+1.14e-03

Pos logit contributions
rying+0.032
 least+0.030
 forth+0.030
ipping+0.029
used+0.029
Neg logit contributions
ves-0.019
kered-0.018
ggy-0.016
 lesson-0.015
 teaches-0.014
Token Lily
Token ID20037
Feature activation+6.17e-03

Pos logit contributions
rying+0.002
 least+0.002
 forth+0.002
ting+0.002
ter+0.002
Neg logit contributions
ves-0.003
kered-0.002
ggy-0.002
ggie-0.002
ork-0.002
Token sorry
Token ID7926
Feature activation+8.91e-03

Pos logit contributions
rying+0.021
 least+0.019
 forth+0.019
ipping+0.019
ter+0.019
Neg logit contributions
ves-0.033
kered-0.032
ggy-0.029
 lesson-0.029
ork-0.028
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
rying+0.017
 least+0.016
 forth+0.016
ter+0.016
ting+0.015
Neg logit contributions
ves-0.015
kered-0.014
ggy-0.013
ork-0.012
ggie-0.012
Token\n
Token ID198
Feature activation-1.40e-02

Pos logit contributions
ves+0.019
kered+0.018
ggy+0.016
 lesson+0.015
ork+0.014
Neg logit contributions
rying-0.021
 least-0.020
 gentlemen-0.020
 forth-0.020
ter-0.019
Token\n
Token ID198
Feature activation+7.13e-03

Pos logit contributions
ves+0.025
kered+0.024
 lesson+0.021
ggy+0.021
 teeth+0.020
Neg logit contributions
rying-0.023
 least-0.022
 gentlemen-0.021
 forth-0.021
umper-0.021
TokenMom
Token ID29252
Feature activation-7.81e-03

Pos logit contributions
rying+0.014
 least+0.013
 forth+0.013
ipping+0.013
ter+0.012
Neg logit contributions
ves-0.012
kered-0.012
ggy-0.011
 lesson-0.010
ork-0.010
Token and
Token ID290
Feature activation+9.11e-02

Pos logit contributions
ves+0.011
kered+0.010
ggy+0.009
 lesson+0.008
ork+0.008
Neg logit contributions
rying-0.018
 least-0.017
 forth-0.017
ipping-0.016
ter-0.016
Token dad
Token ID9955
Feature activation+4.41e-03

Pos logit contributions
rying+0.206
oming+0.195
ipping+0.194
ologist+0.194
 least+0.191
Neg logit contributions
 teeth-0.100
ves-0.099
 words-0.097
 lesson-0.097
kered-0.095
Token hug
Token ID16225
Feature activation+8.26e-03

Pos logit contributions
rying+0.009
 forth+0.009
 least+0.008
ter+0.008
umper+0.008
Neg logit contributions
ves-0.007
kered-0.006
 lesson-0.005
ggy-0.005
 teaches-0.005
Token them
Token ID606
Feature activation+6.74e-03

Pos logit contributions
rying+0.015
 least+0.015
 forth+0.014
ter+0.014
ipping+0.014
Neg logit contributions
ves-0.015
kered-0.014
ggy-0.013
 lesson-0.012
ork-0.012
Token and
Token ID290
Feature activation+2.87e-02

Pos logit contributions
rying+0.016
 least+0.016
 forth+0.015
ipping+0.015
umper+0.015
Neg logit contributions
ves-0.008
kered-0.008
ggy-0.007
 lesson-0.007
ork-0.006
Token say
Token ID910
Feature activation+4.80e-03

Pos logit contributions
rying+0.052
 least+0.049
 forth+0.048
ipping+0.046
ter+0.045
Neg logit contributions
ves-0.053
kered-0.050
ggy-0.046
 lesson-0.044
ork-0.043
Token together
Token ID1978
Feature activation+2.09e-02

Pos logit contributions
ves+0.034
kered+0.032
ggy+0.031
ggie+0.027
ork+0.027
Neg logit contributions
 least-0.021
 forth-0.021
rying-0.021
ter-0.020
ting-0.020
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
rying+0.041
 least+0.039
 forth+0.039
ter+0.036
ipping+0.035
Neg logit contributions
ves-0.036
kered-0.034
ggy-0.031
ork-0.028
 lesson-0.028
Token\n
Token ID198
Feature activation-1.03e-02

Pos logit contributions
ves+0.022
kered+0.022
ggy+0.019
ork+0.017
 lesson+0.017
Neg logit contributions
 least-0.023
rying-0.022
 gentlemen-0.022
 forth-0.021
 highlight-0.021
Token\n
Token ID198
Feature activation+1.14e-02

Pos logit contributions
ves+0.018
kered+0.018
ggy+0.015
 lesson+0.015
ork+0.014
Neg logit contributions
rying-0.018
 least-0.017
 forth-0.016
umper-0.016
ipping-0.016
TokenMom
Token ID29252
Feature activation-1.09e-02

Pos logit contributions
rying+0.025
 least+0.023
 forth+0.023
ipping+0.022
ter+0.022
Neg logit contributions
ves-0.018
kered-0.016
ggy-0.015
 lesson-0.015
ork-0.014
Token and
Token ID290
Feature activation+9.07e-02

Pos logit contributions
ves+0.014
kered+0.013
ggy+0.011
 lesson+0.010
ork+0.010
Neg logit contributions
rying-0.026
 least-0.025
 forth-0.024
ipping-0.024
ter-0.024
Token dad
Token ID9955
Feature activation+2.75e-03

Pos logit contributions
rying+0.216
ode+0.198
oming+0.198
ipping+0.195
ologist+0.194
Neg logit contributions
 Anna-0.097
 teeth-0.097
 words-0.095
 lesson-0.087
ves-0.086
Token are
Token ID389
Feature activation+3.70e-02

Pos logit contributions
rying+0.006
ter+0.006
ancer+0.006
umper+0.006
 forth+0.006
Neg logit contributions
ves-0.004
kered-0.004
 lesson-0.003
ggy-0.003
 teaches-0.003
Token angry
Token ID7954
Feature activation-2.02e-03

Pos logit contributions
rying+0.078
ancer+0.071
 forth+0.071
 least+0.070
umper+0.069
Neg logit contributions
ves-0.059
ggy-0.055
kered-0.053
 lesson-0.047
ggie-0.046
Token and
Token ID290
Feature activation+1.31e-02

Pos logit contributions
ves+0.002
kered+0.002
ggy+0.002
 lesson+0.002
 words+0.002
Neg logit contributions
rying-0.005
 least-0.005
 forth-0.005
umper-0.005
 highlight-0.005
Token worried
Token ID7960
Feature activation-1.72e-02

Pos logit contributions
rying+0.024
 least+0.023
 forth+0.022
ipping+0.022
umper+0.022
Neg logit contributions
ves-0.023
kered-0.022
 lesson-0.020
ggy-0.020
 teaches-0.019
Token dad
Token ID9955
Feature activation+1.70e-02

Pos logit contributions
rying+0.126
ipping+0.118
oming+0.118
umper+0.110
ancer+0.109
Neg logit contributions
ves-0.131
kered-0.125
 lesson-0.122
ggy-0.120
 teeth-0.119
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
rying+0.037
 least+0.035
 forth+0.034
ipping+0.033
ter+0.033
Neg logit contributions
ves-0.025
kered-0.024
ggy-0.021
 lesson-0.020
ork-0.019
Token\n
Token ID198
Feature activation-1.25e-02

Pos logit contributions
ves+0.030
kered+0.029
ggy+0.025
 lesson+0.023
ggie+0.023
Neg logit contributions
�-0.022
 gentlemen-0.022
 least-0.022
rying-0.021
elt-0.021
Token\n
Token ID198
Feature activation+2.38e-03

Pos logit contributions
kered+0.023
ves+0.023
 lesson+0.019
ggy+0.019
 teeth+0.019
Neg logit contributions
rying-0.019
 least-0.019
 gentlemen-0.017
 forth-0.017
oming-0.017
TokenMom
Token ID29252
Feature activation-2.17e-03

Pos logit contributions
rying+0.005
 least+0.004
 forth+0.004
ipping+0.004
ter+0.004
Neg logit contributions
ves-0.004
kered-0.004
ggy-0.004
 lesson-0.003
ork-0.003
Token and
Token ID290
Feature activation+9.06e-02

Pos logit contributions
ves+0.003
kered+0.003
ggy+0.003
 lesson+0.003
ork+0.002
Neg logit contributions
rying-0.005
 least-0.005
 forth-0.004
ipping-0.004
ting-0.004
Token dad
Token ID9955
Feature activation+9.52e-03

Pos logit contributions
ipping+0.199
oming+0.196
rying+0.195
 least+0.182
umper+0.180
Neg logit contributions
 teeth-0.110
 words-0.108
 lesson-0.103
ves-0.101
 Anna-0.100
Token hear
Token ID3285
Feature activation+6.18e-04

Pos logit contributions
rying+0.020
 least+0.019
ter+0.018
 forth+0.018
umper+0.018
Neg logit contributions
ves-0.014
kered-0.013
ggy-0.012
 lesson-0.011
ork-0.011
Token Tom
Token ID4186
Feature activation-1.20e-02

Pos logit contributions
rying+0.001
 least+0.001
 forth+0.001
ipping+0.001
ter+0.001
Neg logit contributions
ves-0.001
kered-0.001
ggy-0.001
 lesson-0.001
ork-0.001
Token and
Token ID290
Feature activation+4.18e-02

Pos logit contributions
ves+0.017
kered+0.016
ggy+0.014
 lesson+0.014
ork+0.013
Neg logit contributions
rying-0.027
 least-0.025
 forth-0.025
ipping-0.024
ter-0.024
Token Mia
Token ID32189
Feature activation+7.34e-03

Pos logit contributions
rying+0.094
 least+0.089
 forth+0.088
ter+0.084
ipping+0.083
Neg logit contributions
ves-0.061
kered-0.056
ggy-0.049
 lesson-0.047
ork-0.046
Token what
Token ID644
Feature activation+8.20e-03

Pos logit contributions
ves+0.003
kered+0.002
ggy+0.002
 lesson+0.002
ork+0.002
Neg logit contributions
rying-0.002
 least-0.002
 forth-0.002
ter-0.002
ipping-0.002
Token happened
Token ID3022
Feature activation+6.19e-03

Pos logit contributions
 least+0.017
 surfing+0.017
rying+0.017
 forth+0.016
ter+0.016
Neg logit contributions
kered-0.011
ves-0.011
ork-0.008
aron-0.008
ggy-0.008
Token.
Token ID13
Feature activation-2.08e-03

Pos logit contributions
rying+0.014
 least+0.013
 forth+0.013
ipping+0.012
ter+0.012
Neg logit contributions
ves-0.009
kered-0.009
ggy-0.008
 lesson-0.007
ork-0.007
Token Their
Token ID5334
Feature activation+1.87e-02

Pos logit contributions
ves+0.003
kered+0.003
ggy+0.003
 lesson+0.003
ork+0.003
Neg logit contributions
rying-0.004
 least-0.004
 gentlemen-0.004
oming-0.004
ipping-0.004
Token mom
Token ID1995
Feature activation-1.26e-02

Pos logit contributions
rying+0.031
 forth+0.029
 least+0.029
ipping+0.028
ter+0.027
Neg logit contributions
ves-0.036
kered-0.035
ggy-0.031
ork-0.030
 lesson-0.029
Token and
Token ID290
Feature activation+9.05e-02

Pos logit contributions
ves+0.019
kered+0.018
ggy+0.016
 lesson+0.016
ork+0.015
Neg logit contributions
rying-0.027
 least-0.026
 forth-0.025
ipping-0.025
ting-0.024
Token dad
Token ID9955
Feature activation+1.13e-03

Pos logit contributions
rying+0.191
 least+0.184
ipping+0.183
oming+0.182
 highlight+0.180
Neg logit contributions
ves-0.113
 teeth-0.113
 words-0.110
 lesson-0.107
kered-0.101
Token are
Token ID389
Feature activation+3.84e-02

Pos logit contributions
rying+0.002
 least+0.002
 forth+0.002
ter+0.002
ipping+0.002
Neg logit contributions
ves-0.002
kered-0.002
ggy-0.002
 lesson-0.001
ork-0.001
Token proud
Token ID6613
Feature activation+2.15e-02

Pos logit contributions
rying+0.080
ipping+0.073
ancer+0.073
 least+0.073
 forth+0.073
Neg logit contributions
ves-0.059
ggy-0.055
kered-0.054
 lesson-0.048
ggie-0.046
Token of
Token ID286
Feature activation+1.43e-02

Pos logit contributions
rying+0.052
 least+0.050
 forth+0.049
ipping+0.048
used+0.047
Neg logit contributions
ves-0.026
kered-0.024
ggy-0.020
 lesson-0.020
 words-0.018
Token Ben
Token ID3932
Feature activation+2.12e-03

Pos logit contributions
rying+0.027
 least+0.026
 forth+0.025
ipping+0.024
ter+0.024
Neg logit contributions
ves-0.025
kered-0.024
ggy-0.021
 lesson-0.021
ork-0.020
Token trouble
Token ID5876
Feature activation-2.78e-02

Pos logit contributions
ves+0.025
kered+0.024
ggy+0.019
ork+0.018
 lesson+0.017
Neg logit contributions
 least-0.037
rying-0.036
 forth-0.035
ter-0.033
ipping-0.033
Token.
Token ID13
Feature activation-2.81e-02

Pos logit contributions
ves+0.044
kered+0.041
 lesson+0.036
ggy+0.035
ork+0.035
Neg logit contributions
rying-0.059
 least-0.055
 forth-0.054
ter-0.053
ipping-0.052
Token\n
Token ID198
Feature activation-1.11e-02

Pos logit contributions
ves+0.050
kered+0.048
ggy+0.042
 lesson+0.041
ggie+0.038
Neg logit contributions
 least-0.047
rying-0.046
ter-0.044
 gentlemen-0.044
ancer-0.043
Token\n
Token ID198
Feature activation+7.09e-03

Pos logit contributions
kered+0.019
ves+0.019
 lesson+0.017
ggy+0.016
 teeth+0.016
Neg logit contributions
rying-0.018
 least-0.018
 gentlemen-0.016
oming-0.016
umper-0.016
TokenMom
Token ID29252
Feature activation-8.58e-03

Pos logit contributions
rying+0.014
 least+0.013
 forth+0.013
ipping+0.012
ter+0.012
Neg logit contributions
ves-0.013
kered-0.012
ggy-0.011
 lesson-0.010
ork-0.010
Token and
Token ID290
Feature activation+9.04e-02

Pos logit contributions
ves+0.013
kered+0.012
 lesson+0.011
ggy+0.011
 purpose+0.010
Neg logit contributions
rying-0.018
 least-0.017
 forth-0.017
ipping-0.016
ting-0.016
Token dad
Token ID9955
Feature activation-3.43e-04

Pos logit contributions
rying+0.213
ipping+0.203
oming+0.200
ologist+0.198
ode+0.191
Neg logit contributions
 teeth-0.100
 words-0.092
 lesson-0.088
kered-0.088
ves-0.087
Token realized
Token ID6939
Feature activation-5.38e-03

Pos logit contributions
ves+0.000
kered+0.000
ggy+0.000
 lesson+0.000
ork+0.000
Neg logit contributions
rying-0.001
 least-0.001
 forth-0.001
ipping-0.001
ter-0.001
Token they
Token ID484
Feature activation-9.64e-03

Pos logit contributions
ves+0.010
kered+0.010
ggy+0.009
 lesson+0.009
ork+0.008
Neg logit contributions
rying-0.009
 least-0.009
 forth-0.008
ter-0.008
ipping-0.008
Token were
Token ID547
Feature activation+9.42e-03

Pos logit contributions
ves+0.016
kered+0.014
ggy+0.013
 lesson+0.013
ork+0.013
Neg logit contributions
rying-0.019
 least-0.019
 forth-0.018
ter-0.017
ting-0.017
Token in
Token ID287
Feature activation-1.02e-02

Pos logit contributions
rying+0.019
ipping+0.019
ory+0.019
ancer+0.019
 forth+0.018
Neg logit contributions
ggy-0.013
ves-0.013
kered-0.012
ggie-0.011
 lesson-0.011
Token them
Token ID606
Feature activation-9.48e-03

Pos logit contributions
rying+0.013
 least+0.012
 forth+0.012
ter+0.012
ting+0.011
Neg logit contributions
ves-0.019
kered-0.018
ggy-0.016
ork-0.016
 lesson-0.016
Token what
Token ID644
Feature activation+3.19e-03

Pos logit contributions
kered+0.018
ves+0.018
ggy+0.015
ork+0.015
 lesson+0.015
Neg logit contributions
rying-0.016
 least-0.014
ter-0.013
 forth-0.013
ting-0.013
Token happened
Token ID3022
Feature activation+6.48e-03

Pos logit contributions
rying+0.005
 surfing+0.005
 least+0.005
 forth+0.005
ter+0.005
Neg logit contributions
kered-0.006
ves-0.005
ork-0.004
aron-0.004
 teeth-0.004
Token.
Token ID13
Feature activation-9.41e-03

Pos logit contributions
rying+0.014
 least+0.013
 forth+0.013
ter+0.012
ipping+0.012
Neg logit contributions
ves-0.010
kered-0.009
ggy-0.008
 lesson-0.008
ork-0.008
Token Mom
Token ID11254
Feature activation-2.83e-02

Pos logit contributions
ves+0.016
kered+0.015
ggy+0.014
 lesson+0.013
ggie+0.012
Neg logit contributions
rying-0.018
 least-0.017
ipping-0.016
ancer-0.016
ter-0.016
Token and
Token ID290
Feature activation+9.00e-02

Pos logit contributions
ves+0.046
kered+0.044
 lesson+0.040
ggy+0.039
 purpose+0.039
Neg logit contributions
rying-0.056
 least-0.053
 forth-0.052
ter-0.051
ting-0.050
Token Dad
Token ID17415
Feature activation-7.96e-03

Pos logit contributions
rying+0.181
ologist+0.165
ipping+0.163
 least+0.163
ode+0.162
Neg logit contributions
ves-0.129
 teeth-0.122
kered-0.122
 lesson-0.121
 words-0.117
Token were
Token ID547
Feature activation+2.75e-02

Pos logit contributions
ves+0.013
kered+0.012
ggy+0.011
 lesson+0.010
ork+0.010
Neg logit contributions
rying-0.016
 least-0.016
 forth-0.015
ipping-0.015
ter-0.015
Token very
Token ID845
Feature activation+1.74e-02

Pos logit contributions
rying+0.056
 least+0.052
 forth+0.052
ipping+0.051
umper+0.049
Neg logit contributions
ves-0.045
kered-0.042
ggy-0.039
 lesson-0.036
ork-0.034
Token upset
Token ID9247
Feature activation-5.72e-04

Pos logit contributions
rying+0.031
 least+0.029
 forth+0.028
ipping+0.027
ter+0.027
Neg logit contributions
ves-0.032
kered-0.031
ggy-0.027
 lesson-0.027
ork-0.026
Token and
Token ID290
Feature activation-1.80e-03

Pos logit contributions
ves+0.001
kered+0.001
ggy+0.001
 lesson+0.001
ork+0.001
Neg logit contributions
rying-0.001
 least-0.001
 forth-0.001
ipping-0.001
ter-0.001
Token.
Token ID13
Feature activation-2.54e-02

Pos logit contributions
ves+0.044
kered+0.039
ggy+0.035
ork+0.035
 lesson+0.035
Neg logit contributions
rying-0.052
ter-0.049
 least-0.049
 forth-0.048
ting-0.046
Token They
Token ID1119
Feature activation+5.59e-03

Pos logit contributions
ves+0.046
kered+0.042
ggy+0.039
 lesson+0.036
ggie+0.034
Neg logit contributions
rying-0.044
 least-0.043
ter-0.042
ory-0.041
 gentlemen-0.041
Token cried
Token ID16896
Feature activation+5.13e-03

Pos logit contributions
 least+0.010
 forth+0.010
rying+0.009
ter+0.009
ting+0.009
Neg logit contributions
ves-0.010
kered-0.009
 lesson-0.009
 purpose-0.008
ork-0.008
Token for
Token ID329
Feature activation+5.85e-03

Pos logit contributions
rying+0.011
 least+0.010
 forth+0.010
ipping+0.010
ter+0.010
Neg logit contributions
ves-0.008
kered-0.007
ggy-0.006
 lesson-0.006
ork-0.006
Token Mom
Token ID11254
Feature activation-7.20e-04

Pos logit contributions
rying+0.012
 forth+0.011
 least+0.011
ipping+0.011
umper+0.011
Neg logit contributions
ves-0.009
kered-0.009
ggy-0.008
 lesson-0.008
 teeth-0.007
Token and
Token ID290
Feature activation+8.92e-02

Pos logit contributions
ves+0.001
kered+0.001
ggy+0.001
 lesson+0.001
ork+0.001
Neg logit contributions
rying-0.001
 least-0.001
 forth-0.001
ting-0.001
ter-0.001
Token Dad
Token ID17415
Feature activation+5.68e-03

Pos logit contributions
rying+0.177
ipping+0.171
oming+0.164
oise+0.158
ancer+0.156
Neg logit contributions
 teeth-0.128
kered-0.119
 lesson-0.114
ves-0.113
 words-0.112
Token.
Token ID13
Feature activation-5.08e-03

Pos logit contributions
rying+0.012
 least+0.011
 forth+0.011
ter+0.011
ting+0.011
Neg logit contributions
ves-0.009
kered-0.008
ggy-0.007
 lesson-0.007
ork-0.007
Token But
Token ID887
Feature activation+7.97e-03

Pos logit contributions
ves+0.009
kered+0.008
ggy+0.007
 lesson+0.007
ork+0.007
Neg logit contributions
rying-0.009
 least-0.009
 gentlemen-0.008
ter-0.008
 forth-0.008
Token Mom
Token ID11254
Feature activation-2.16e-02

Pos logit contributions
rying+0.018
umper+0.017
ipping+0.017
ologist+0.017
ode+0.017
Neg logit contributions
ves-0.011
kered-0.010
ggy-0.009
 lesson-0.009
 was-0.008
Token and
Token ID290
Feature activation+8.72e-02

Pos logit contributions
ves+0.036
kered+0.036
 purpose+0.034
 lesson+0.033
 common+0.032
Neg logit contributions
rying-0.039
 forth-0.038
 least-0.038
ter-0.036
ting-0.035
Token rain
Token ID6290
Feature activation+4.48e-03

Pos logit contributions
rying+0.048
 least+0.044
 forth+0.044
ipping+0.043
umper+0.043
Neg logit contributions
ves-0.049
kered-0.046
 lesson-0.044
ggy-0.042
ork-0.041
Token that
Token ID326
Feature activation+1.03e-02

Pos logit contributions
rying+0.008
 least+0.008
 forth+0.008
ter+0.007
ipping+0.007
Neg logit contributions
ves-0.008
kered-0.008
ggy-0.007
 lesson-0.007
ork-0.007
Token way
Token ID835
Feature activation+1.45e-02

Pos logit contributions
 least+0.018
 forth+0.018
rying+0.018
ter+0.017
ancer+0.016
Neg logit contributions
ves-0.019
kered-0.018
ggy-0.017
ggie-0.016
ork-0.015
Token,
Token ID11
Feature activation+2.08e-02

Pos logit contributions
rying+0.030
 least+0.029
 forth+0.028
ipping+0.028
ter+0.027
Neg logit contributions
ves-0.022
kered-0.020
ggy-0.018
 lesson-0.018
ork-0.017
Token mom
Token ID1995
Feature activation+8.01e-03

Pos logit contributions
rying+0.039
 least+0.037
 forth+0.036
ter+0.035
ipping+0.034
Neg logit contributions
ves-0.037
kered-0.036
ggy-0.032
 lesson-0.031
ork-0.030
Token and
Token ID290
Feature activation+8.90e-02

Pos logit contributions
rying+0.017
 least+0.016
 forth+0.016
ter+0.015
ipping+0.015
Neg logit contributions
ves-0.013
kered-0.012
ggy-0.011
 lesson-0.011
ork-0.010
Token dad
Token ID9955
Feature activation+2.53e-02

Pos logit contributions
rying+0.191
ipping+0.181
oming+0.174
used+0.171
 forth+0.164
Neg logit contributions
ves-0.120
 lesson-0.111
 words-0.108
kered-0.106
 teeth-0.105
Token.
Token ID13
Feature activation-2.32e-03

Pos logit contributions
rying+0.056
 least+0.053
 forth+0.052
ipping+0.050
ter+0.050
Neg logit contributions
ves-0.037
kered-0.035
ggy-0.030
 lesson-0.029
ork-0.028
Token The
Token ID383
Feature activation+2.62e-02

Pos logit contributions
ves+0.004
kered+0.003
ggy+0.003
 lesson+0.003
ork+0.003
Neg logit contributions
rying-0.005
 least-0.005
 forth-0.004
ter-0.004
ipping-0.004
Token rain
Token ID6290
Feature activation-1.04e-02

Pos logit contributions
rying+0.052
umper+0.048
 least+0.047
 forth+0.047
ipping+0.047
Neg logit contributions
ves-0.044
kered-0.041
 lesson-0.040
ggy-0.038
ork-0.035
Token is
Token ID318
Feature activation+1.45e-02

Pos logit contributions
ves+0.013
kered+0.012
ggy+0.011
 teeth+0.010
ggie+0.010
Neg logit contributions
rying-0.023
 least-0.023
 forth-0.022
ter-0.022
ipping-0.022
Token anymore
Token ID7471
Feature activation-9.61e-03

Pos logit contributions
 least+0.030
rying+0.029
 forth+0.027
ter+0.027
ting+0.026
Neg logit contributions
ves-0.038
kered-0.036
ggy-0.035
ggie-0.032
ork-0.031
Token.
Token ID13
Feature activation-8.62e-03

Pos logit contributions
kered+0.016
ves+0.016
 lesson+0.013
ork+0.013
ggy+0.013
Neg logit contributions
rying-0.017
 forth-0.016
ter-0.016
 least-0.016
ting-0.015
Token
Token ID198
Feature activation-2.41e-03

Pos logit contributions
ves+0.014
kered+0.014
ggy+0.012
 lesson+0.011
ork+0.011
Neg logit contributions
rying-0.016
 least-0.016
ancer-0.015
ter-0.015
 gentlemen-0.015
Token
Token ID198
Feature activation+1.66e-02

Pos logit contributions
ves+0.004
kered+0.004
ggy+0.003
 lesson+0.003
ork+0.003
Neg logit contributions
rying-0.004
 least-0.004
 forth-0.004
oming-0.004
ipping-0.004
TokenMom
Token ID29252
Feature activation-1.64e-02

Pos logit contributions
rying+0.032
 least+0.030
 forth+0.029
ipping+0.028
ter+0.028
Neg logit contributions
ves-0.031
kered-0.028
ggy-0.026
 lesson-0.025
ork-0.024
Token and
Token ID290
Feature activation+8.84e-02

Pos logit contributions
ves+0.027
kered+0.025
ggy+0.023
 lesson+0.023
 purpose+0.022
Neg logit contributions
rying-0.032
 least-0.031
 forth-0.030
ter-0.029
ipping-0.029
Token Dad
Token ID17415
Feature activation-2.62e-03

Pos logit contributions
rying+0.196
oming+0.183
ipping+0.183
ologist+0.180
oise+0.174
Neg logit contributions
 Anna-0.104
 teeth-0.099
 lesson-0.097
 time-0.096
kered-0.095
Token kissed
Token ID27946
Feature activation+1.92e-02

Pos logit contributions
ves+0.004
kered+0.004
ggy+0.003
 lesson+0.003
ork+0.003
Neg logit contributions
rying-0.006
 least-0.005
 forth-0.005
ipping-0.005
ter-0.005
Token them
Token ID606
Feature activation+8.33e-03

Pos logit contributions
rying+0.028
 least+0.028
ter+0.027
 forth+0.027
ting+0.027
Neg logit contributions
ves-0.040
kered-0.038
ggy-0.036
ork-0.033
 lesson-0.033
Token and
Token ID290
Feature activation+2.42e-02

Pos logit contributions
rying+0.018
 least+0.017
 forth+0.016
ipping+0.016
ter+0.016
Neg logit contributions
ves-0.013
kered-0.012
ggy-0.010
 lesson-0.010
ork-0.010
Token said
Token ID531
Feature activation+1.71e-03

Pos logit contributions
rying+0.050
 least+0.049
 forth+0.048
ter+0.045
ipping+0.044
Neg logit contributions
ves-0.038
kered-0.036
ggy-0.032
ork-0.031
 lesson-0.030
Token together
Token ID1978
Feature activation+1.98e-02

Pos logit contributions
ves+0.001
kered+0.001
ggy+0.001
 lesson+0.001
ork+0.001
Neg logit contributions
rying-0.002
 least-0.002
 forth-0.002
ipping-0.002
ter-0.002
Token.
Token ID13
Feature activation-7.58e-03

Pos logit contributions
rying+0.035
 least+0.033
 forth+0.032
ipping+0.030
ter+0.029
Neg logit contributions
ves-0.036
kered-0.034
ggy-0.032
ork-0.030
ggie-0.028
Token
Token ID198
Feature activation-1.17e-02

Pos logit contributions
ves+0.013
kered+0.012
ggy+0.011
 lesson+0.010
ork+0.010
Neg logit contributions
rying-0.014
 least-0.014
 forth-0.013
 gentlemen-0.013
ipping-0.013
Token
Token ID198
Feature activation+8.83e-03

Pos logit contributions
kered+0.021
ves+0.021
ggy+0.018
 lesson+0.018
 teeth+0.017
Neg logit contributions
 least-0.018
rying-0.018
 forth-0.017
 gentlemen-0.017
oming-0.017
TokenMom
Token ID29252
Feature activation-1.11e-02

Pos logit contributions
rying+0.017
 least+0.016
 forth+0.015
ipping+0.015
ter+0.015
Neg logit contributions
ves-0.016
kered-0.015
ggy-0.014
 lesson-0.013
ork-0.012
Token and
Token ID290
Feature activation+8.84e-02

Pos logit contributions
ves+0.013
kered+0.012
ggy+0.010
 lesson+0.010
ork+0.009
Neg logit contributions
rying-0.027
 least-0.026
 forth-0.026
ter-0.025
ipping-0.025
Token Dad
Token ID17415
Feature activation+4.63e-03

Pos logit contributions
rying+0.207
oming+0.187
ipping+0.184
ode+0.182
oise+0.179
Neg logit contributions
 Anna-0.102
 teeth-0.101
 words-0.100
 turns-0.095
 time-0.091
Token smile
Token ID8212
Feature activation+2.67e-02

Pos logit contributions
rying+0.010
ter+0.009
umper+0.009
 least+0.009
oming+0.009
Neg logit contributions
ves-0.007
kered-0.006
 lesson-0.006
ggy-0.006
 teaches-0.005
Token and
Token ID290
Feature activation+9.73e-03

Pos logit contributions
rying+0.062
 least+0.059
 forth+0.058
ipping+0.057
ter+0.056
Neg logit contributions
ves-0.036
kered-0.033
ggy-0.029
 lesson-0.028
ork-0.026
Token look
Token ID804
Feature activation+3.40e-02

Pos logit contributions
rying+0.019
 least+0.017
 forth+0.017
ipping+0.017
umper+0.017
Neg logit contributions
ves-0.016
kered-0.016
ggy-0.014
 lesson-0.014
 teaches-0.013
Token at
Token ID379
Feature activation+1.49e-03

Pos logit contributions
rying+0.042
 least+0.039
 forth+0.037
ipping+0.036
ter+0.035
Neg logit contributions
ves-0.082
kered-0.078
ggy-0.073
 lesson-0.072
ork-0.070
Token now
Token ID783
Feature activation+1.77e-02

Pos logit contributions
rying+0.002
 least+0.001
 forth+0.001
ipping+0.001
ter+0.001
Neg logit contributions
ves-0.002
kered-0.002
ggy-0.001
 lesson-0.001
ork-0.001
Token?"
Token ID1701
Feature activation-7.20e-03

Pos logit contributions
rying+0.031
 least+0.030
 forth+0.029
ipping+0.028
ter+0.028
Neg logit contributions
ves-0.033
kered-0.031
ggy-0.029
 lesson-0.028
ork-0.027
Token
Token ID198
Feature activation-3.07e-03

Pos logit contributions
ves+0.008
kered+0.007
ggy+0.006
 lesson+0.006
ork+0.006
Neg logit contributions
rying-0.019
 least-0.017
 forth-0.017
ter-0.017
ode-0.017
Token
Token ID198
Feature activation+1.63e-02

Pos logit contributions
ves+0.005
kered+0.005
ggy+0.004
 lesson+0.004
ork+0.004
Neg logit contributions
rying-0.006
 least-0.006
 forth-0.006
ipping-0.006
umper-0.005
TokenMom
Token ID29252
Feature activation-1.01e-02

Pos logit contributions
rying+0.035
 least+0.033
 forth+0.032
ter+0.031
ipping+0.031
Neg logit contributions
ves-0.027
kered-0.024
ggy-0.023
 lesson-0.021
ork-0.021
Token and
Token ID290
Feature activation+8.84e-02

Pos logit contributions
ves+0.012
kered+0.011
ggy+0.009
 lesson+0.009
ork+0.008
Neg logit contributions
rying-0.025
 least-0.024
 forth-0.023
ipping-0.023
ter-0.022
Token dad
Token ID9955
Feature activation+1.86e-02

Pos logit contributions
rying+0.190
oming+0.169
ipping+0.169
ologist+0.166
 least+0.166
Neg logit contributions
ves-0.114
 teeth-0.114
 words-0.108
kered-0.107
 Anna-0.105
Token looked
Token ID3114
Feature activation+3.67e-02

Pos logit contributions
rying+0.045
 least+0.043
 forth+0.042
ipping+0.041
oming+0.041
Neg logit contributions
ves-0.021
kered-0.021
 was-0.016
ggy-0.015
 lesson-0.015
Token at
Token ID379
Feature activation+5.48e-03

Pos logit contributions
rying+0.057
 least+0.055
ode+0.051
ancer+0.051
ter+0.051
Neg logit contributions
ves-0.072
kered-0.070
 lesson-0.066
 teeth-0.063
 teaches-0.062
Token their
Token ID511
Feature activation+6.58e-03

Pos logit contributions
 least+0.009
rying+0.009
ter+0.009
ting+0.009
 forth+0.008
Neg logit contributions
ves-0.011
kered-0.010
ggy-0.009
ork-0.009
 lesson-0.009
Token plates
Token ID13375
Feature activation+2.09e-02

Pos logit contributions
rying+0.012
 least+0.012
 forth+0.012
ipping+0.011
ter+0.011
Neg logit contributions
ves-0.012
kered-0.011
ggy-0.010
ork-0.010
 lesson-0.009
Token.
Token ID13
Feature activation-2.36e-02

Pos logit contributions
rying+0.004
 forth+0.004
ter+0.004
 least+0.004
ting+0.004
Neg logit contributions
ves-0.005
kered-0.005
 lesson-0.005
ggy-0.004
ork-0.004
Token
Token ID198
Feature activation-6.45e-03

Pos logit contributions
ves+0.039
kered+0.037
ggy+0.034
 lesson+0.032
ggie+0.030
Neg logit contributions
rying-0.043
 least-0.042
ter-0.041
ancer-0.041
oming-0.041
Token
Token ID198
Feature activation+1.24e-02

Pos logit contributions
kered+0.013
ves+0.012
 lesson+0.011
ggy+0.011
 teeth+0.011
Neg logit contributions
rying-0.008
 least-0.008
oming-0.008
 surfing-0.008
 gentlemen-0.008
Token"
Token ID1
Feature activation+2.37e-02

Pos logit contributions
rying+0.027
 least+0.027
 forth+0.025
ipping+0.025
ter+0.024
Neg logit contributions
ves-0.020
kered-0.018
ggy-0.016
 lesson-0.015
ork-0.014
TokenMom
Token ID29252
Feature activation+8.57e-03

Pos logit contributions
rying+0.044
 least+0.040
ipping+0.039
 forth+0.039
oming+0.037
Neg logit contributions
ves-0.045
kered-0.042
ggy-0.038
 lesson-0.038
ork-0.035
Token and
Token ID290
Feature activation+8.84e-02

Pos logit contributions
rying+0.018
 least+0.017
 forth+0.017
ipping+0.016
ter+0.016
Neg logit contributions
ves-0.014
kered-0.013
ggy-0.011
 lesson-0.011
ork-0.011
Token dad
Token ID9955
Feature activation+9.13e-03

Pos logit contributions
rying+0.192
ipping+0.179
 least+0.172
oming+0.171
umper+0.166
Neg logit contributions
 teeth-0.124
ves-0.117
 lesson-0.112
 words-0.111
kered-0.110
Token are
Token ID389
Feature activation+2.52e-02

Pos logit contributions
rying+0.019
 least+0.018
 forth+0.018
ipping+0.017
ter+0.017
Neg logit contributions
ves-0.014
kered-0.013
ggy-0.012
 lesson-0.012
ork-0.011
Token here
Token ID994
Feature activation+1.98e-02

Pos logit contributions
rying+0.040
umper+0.038
 least+0.036
ipping+0.035
ode+0.034
Neg logit contributions
ves-0.051
ggy-0.049
kered-0.046
 time-0.044
ggie-0.043
Token!"
Token ID2474
Feature activation-1.98e-02

Pos logit contributions
rying+0.046
 least+0.044
 forth+0.043
ter+0.041
ipping+0.041
Neg logit contributions
ves-0.026
kered-0.025
ggy-0.021
 lesson-0.020
ork-0.019
Token Tim
Token ID5045
Feature activation-2.39e-02

Pos logit contributions
ves+0.038
kered+0.037
ggy+0.033
 lesson+0.033
ork+0.032
Neg logit contributions
 least-0.031
 gentlemen-0.030
 forth-0.030
rying-0.030
ter-0.029
Token scared
Token ID12008
Feature activation-3.51e-03

Pos logit contributions
rying+0.042
 least+0.039
 forth+0.038
ipping+0.038
umper+0.036
Neg logit contributions
ves-0.040
kered-0.038
ggy-0.035
 lesson-0.034
ork-0.031
Token and
Token ID290
Feature activation+2.17e-03

Pos logit contributions
ves+0.004
kered+0.004
ggy+0.003
 lesson+0.003
ork+0.003
Neg logit contributions
rying-0.009
 least-0.008
 forth-0.008
ipping-0.008
umper-0.008
Token cold
Token ID4692
Feature activation-4.91e-03

Pos logit contributions
rying+0.003
 forth+0.003
 least+0.003
ter+0.003
ipping+0.003
Neg logit contributions
ves-0.005
kered-0.004
ggy-0.004
ork-0.004
 lesson-0.004
Token.
Token ID13
Feature activation-3.24e-02

Pos logit contributions
ves+0.008
kered+0.008
ggy+0.007
ork+0.007
 lesson+0.007
Neg logit contributions
rying-0.010
 forth-0.009
ter-0.009
 least-0.009
ting-0.009
Token Mom
Token ID11254
Feature activation-2.17e-02

Pos logit contributions
ves+0.066
kered+0.059
ggy+0.056
ggie+0.051
 lesson+0.049
Neg logit contributions
 least-0.049
 gentlemen-0.048
ter-0.047
rying-0.046
elt-0.045
Token and
Token ID290
Feature activation+8.83e-02

Pos logit contributions
ves+0.032
kered+0.030
ggy+0.026
 lesson+0.025
ork+0.024
Neg logit contributions
rying-0.048
 least-0.046
 forth-0.045
ipping-0.044
ter-0.042
Token Dad
Token ID17415
Feature activation-2.50e-03

Pos logit contributions
rying+0.196
ipping+0.182
ologist+0.180
oming+0.178
ancer+0.175
Neg logit contributions
 lesson-0.106
 teeth-0.106
 words-0.104
ves-0.103
kered-0.102
Token hug
Token ID16225
Feature activation+2.77e-03

Pos logit contributions
ves+0.004
kered+0.004
ggy+0.003
 lesson+0.003
ork+0.003
Neg logit contributions
rying-0.005
 least-0.005
 forth-0.005
ter-0.005
ipping-0.005
Token them
Token ID606
Feature activation+5.87e-03

Pos logit contributions
rying+0.005
 least+0.004
 forth+0.004
ter+0.004
ting+0.004
Neg logit contributions
ves-0.006
kered-0.005
ggy-0.005
 lesson-0.004
ork-0.004
Token and
Token ID290
Feature activation+2.95e-02

Pos logit contributions
rying+0.014
 least+0.013
 forth+0.013
ipping+0.013
umper+0.013
Neg logit contributions
ves-0.007
kered-0.007
ggy-0.006
 lesson-0.006
ork-0.005
Token say
Token ID910
Feature activation+3.17e-03

Pos logit contributions
rying+0.051
 least+0.049
 forth+0.048
ipping+0.045
ter+0.045
Neg logit contributions
ves-0.057
kered-0.053
ggy-0.049
 lesson-0.047
ork-0.046
Token them
Token ID606
Feature activation-1.26e-02

Pos logit contributions
rying+0.008
 least+0.007
 forth+0.007
ting+0.007
ter+0.007
Neg logit contributions
ves-0.015
kered-0.014
ork-0.013
ggy-0.013
ggie-0.012
Token what
Token ID644
Feature activation+5.36e-03

Pos logit contributions
ves+0.023
kered+0.023
ggy+0.020
ork+0.019
 lesson+0.018
Neg logit contributions
rying-0.021
 least-0.019
 forth-0.018
ter-0.018
 surfing-0.018
Token happened
Token ID3022
Feature activation+4.04e-03

Pos logit contributions
 surfing+0.011
 least+0.011
rying+0.011
 forth+0.011
ter+0.010
Neg logit contributions
kered-0.007
ves-0.006
aron-0.005
ork-0.005
ggy-0.004
Token.
Token ID13
Feature activation-1.65e-02

Pos logit contributions
rying+0.009
 least+0.008
 forth+0.008
ter+0.008
ipping+0.008
Neg logit contributions
ves-0.006
kered-0.006
ggy-0.005
 lesson-0.005
ork-0.005
Token Mom
Token ID11254
Feature activation-2.02e-02

Pos logit contributions
ves+0.031
kered+0.031
ggy+0.028
ggie+0.026
 lesson+0.025
Neg logit contributions
 gentlemen-0.025
rying-0.024
 least-0.024
�-0.023
elt-0.023
Token and
Token ID290
Feature activation+8.83e-02

Pos logit contributions
ves+0.031
kered+0.029
ggy+0.026
 lesson+0.025
ork+0.024
Neg logit contributions
rying-0.044
 least-0.042
 forth-0.041
ipping-0.040
ting-0.039
Token dad
Token ID9955
Feature activation-1.06e-03

Pos logit contributions
rying+0.169
ipping+0.161
 least+0.160
oming+0.155
 highlight+0.150
Neg logit contributions
ves-0.144
kered-0.133
 teeth-0.129
 lesson-0.129
 words-0.127
Token are
Token ID389
Feature activation+3.53e-02

Pos logit contributions
ves+0.002
kered+0.002
ggy+0.001
 lesson+0.001
ork+0.001
Neg logit contributions
rying-0.002
 least-0.002
 forth-0.002
ipping-0.002
ter-0.002
Token sorry
Token ID7926
Feature activation+1.59e-02

Pos logit contributions
rying+0.075
 least+0.070
 forth+0.069
ipping+0.069
ancer+0.069
Neg logit contributions
ves-0.053
kered-0.049
ggy-0.048
 lesson-0.043
ggie-0.039
Token and
Token ID290
Feature activation+2.95e-02

Pos logit contributions
rying+0.040
 least+0.038
umper+0.038
 forth+0.038
used+0.038
Neg logit contributions
ves-0.017
kered-0.015
 lesson-0.014
ggy-0.013
 words-0.013
Token give
Token ID1577
Feature activation+7.73e-03

Pos logit contributions
rying+0.055
 least+0.052
 forth+0.051
ipping+0.050
ter+0.049
Neg logit contributions
ves-0.052
kered-0.050
ggy-0.045
 lesson-0.044
ork-0.042
Token toys
Token ID14958
Feature activation+1.01e-02

Pos logit contributions
ves+0.047
kered+0.046
ggy+0.042
 lesson+0.039
ggie+0.039
Neg logit contributions
rying-0.047
 forth-0.045
ipping-0.042
 least-0.041
 surfing-0.040
Token and
Token ID290
Feature activation-1.74e-02

Pos logit contributions
rying+0.021
 least+0.020
 forth+0.019
ipping+0.019
ter+0.019
Neg logit contributions
ves-0.016
kered-0.015
ggy-0.013
 lesson-0.013
ork-0.012
Token games
Token ID1830
Feature activation-3.91e-03

Pos logit contributions
ves+0.034
ork+0.031
kered+0.031
ggy+0.030
ggie+0.028
Neg logit contributions
 forth-0.028
nam-0.026
 highlight-0.025
rying-0.024
 surfing-0.024
Token for
Token ID329
Feature activation-2.55e-02

Pos logit contributions
ves+0.006
kered+0.006
ggy+0.005
 lesson+0.005
ork+0.005
Neg logit contributions
rying-0.008
 least-0.007
 forth-0.007
ter-0.007
ipping-0.007
Token his
Token ID465
Feature activation-1.21e-02

Pos logit contributions
kered+0.044
ves+0.043
ggy+0.036
 lesson+0.035
ork+0.035
Neg logit contributions
 least-0.045
rying-0.043
 forth-0.043
 surfing-0.041
years-0.040
Token 3
Token ID513
Feature activation-8.27e-02

Pos logit contributions
ves+0.026
kered+0.025
ggy+0.022
 lesson+0.021
ork+0.021
Neg logit contributions
rying-0.018
 least-0.017
 forth-0.017
ipping-0.016
ter-0.016
Token year
Token ID614
Feature activation-2.16e-02

Pos logit contributions
kered+0.129
ves+0.122
 happens+0.109
 common+0.109
 teaches+0.103
Neg logit contributions
rying-0.136
 forth-0.130
ologist-0.126
 shopping-0.120
ipping-0.119
Token old
Token ID1468
Feature activation-1.19e-02

Pos logit contributions
ves+0.030
kered+0.028
ggy+0.024
 lesson+0.023
ork+0.021
Neg logit contributions
rying-0.049
 least-0.045
 forth-0.045
ipping-0.043
oming-0.043
Token daughter
Token ID4957
Feature activation-6.60e-03

Pos logit contributions
ves+0.024
kered+0.023
ggy+0.021
ork+0.020
 lesson+0.019
Neg logit contributions
rying-0.018
 least-0.018
 forth-0.018
ter-0.017
 gentlemen-0.016
Token to
Token ID284
Feature activation+2.84e-03

Pos logit contributions
ves+0.010
kered+0.010
 lesson+0.008
ggy+0.008
ork+0.008
Neg logit contributions
rying-0.014
 least-0.013
 forth-0.013
ter-0.012
ipping-0.012
Token enjoy
Token ID2883
Feature activation-8.15e-03

Pos logit contributions
 least+0.004
rying+0.004
 forth+0.004
ter+0.003
umper+0.003
Neg logit contributions
ves-0.006
kered-0.006
ork-0.005
ggy-0.005
 lesson-0.005
Token day
Token ID1110
Feature activation+1.00e-03

Pos logit contributions
ves+0.037
kered+0.035
ggy+0.031
 lesson+0.030
ork+0.029
Neg logit contributions
rying-0.043
 least-0.041
 forth-0.040
ipping-0.039
ter-0.039
Token,
Token ID11
Feature activation-3.91e-03

Pos logit contributions
rying+0.002
 least+0.002
 forth+0.002
ipping+0.002
ter+0.002
Neg logit contributions
ves-0.002
kered-0.001
ggy-0.001
 lesson-0.001
ork-0.001
Token she
Token ID673
Feature activation+6.83e-03

Pos logit contributions
ves+0.007
kered+0.007
ggy+0.006
ork+0.006
 lesson+0.006
Neg logit contributions
 forth-0.007
 least-0.007
ter-0.007
rying-0.007
ting-0.006
Token taught
Token ID7817
Feature activation-1.64e-02

Pos logit contributions
rying+0.012
 least+0.012
 forth+0.011
ter+0.011
umper+0.010
Neg logit contributions
ves-0.013
kered-0.012
ggy-0.011
 lesson-0.011
ork-0.010
Token a
Token ID257
Feature activation-2.85e-02

Pos logit contributions
ves+0.034
kered+0.031
ggy+0.028
ork+0.027
 lesson+0.025
Neg logit contributions
 least-0.026
 forth-0.024
ting-0.024
rying-0.024
umper-0.023
Token 3
Token ID513
Feature activation-8.05e-02

Pos logit contributions
ves+0.039
kered+0.039
ggy+0.031
ork+0.028
 teeth+0.027
Neg logit contributions
rying-0.060
 least-0.059
 forth-0.057
 gentlemen-0.055
 highlight-0.055
Token year
Token ID614
Feature activation-2.89e-02

Pos logit contributions
kered+0.117
ves+0.112
 common+0.104
 happens+0.098
 teeth+0.097
Neg logit contributions
rying-0.133
 forth-0.123
ter-0.120
years-0.117
 gentlemen-0.117
Token old
Token ID1468
Feature activation-5.23e-03

Pos logit contributions
ves+0.041
kered+0.039
ggy+0.034
 lesson+0.031
ork+0.029
Neg logit contributions
rying-0.064
 least-0.060
 forth-0.059
ter-0.057
oming-0.057
Token how
Token ID703
Feature activation+2.16e-03

Pos logit contributions
ves+0.010
kered+0.009
ggy+0.008
 lesson+0.008
ork+0.008
Neg logit contributions
rying-0.009
 least-0.009
 forth-0.008
ter-0.008
ipping-0.008
Token to
Token ID284
Feature activation-1.98e-02

Pos logit contributions
rying+0.004
 least+0.004
 forth+0.004
ipping+0.004
ter+0.004
Neg logit contributions
ves-0.003
kered-0.003
ggy-0.003
ork-0.003
 lesson-0.003
Token count
Token ID954
Feature activation-2.76e-02

Pos logit contributions
ves+0.039
kered+0.037
ggy+0.033
 lesson+0.033
ork+0.032
Neg logit contributions
 least-0.031
rying-0.030
 forth-0.029
ter-0.026
ipping-0.026
Token roof
Token ID9753
Feature activation+1.26e-03

Pos logit contributions
ves+0.001
kered+0.001
ggy+0.000
 lesson+0.000
ork+0.000
Neg logit contributions
rying-0.001
 least-0.001
 forth-0.001
ipping-0.000
ter-0.000
Token together
Token ID1978
Feature activation+1.29e-02

Pos logit contributions
rying+0.002
 forth+0.002
 least+0.002
ipping+0.002
ter+0.002
Neg logit contributions
ves-0.002
kered-0.002
ggy-0.002
 lesson-0.002
ork-0.002
Token.
Token ID13
Feature activation-2.97e-03

Pos logit contributions
rying+0.027
 least+0.025
 forth+0.025
ipping+0.024
ter+0.024
Neg logit contributions
ves-0.019
kered-0.018
ggy-0.017
 lesson-0.016
ork-0.015
Token It
Token ID632
Feature activation-5.94e-03

Pos logit contributions
ves+0.004
kered+0.004
ggy+0.003
 lesson+0.003
ork+0.003
Neg logit contributions
rying-0.007
 least-0.006
 forth-0.006
ipping-0.006
ter-0.006
Token had
Token ID550
Feature activation-1.58e-02

Pos logit contributions
ves+0.007
kered+0.007
ggy+0.006
ork+0.005
 lesson+0.005
Neg logit contributions
 forth-0.014
rying-0.014
 least-0.014
ter-0.013
 highlight-0.012
Token 3
Token ID513
Feature activation-8.04e-02

Pos logit contributions
ves+0.032
kered+0.030
ggy+0.029
ork+0.026
 common+0.025
Neg logit contributions
 least-0.024
 gentlemen-0.021
 highlight-0.021
rying-0.021
ode-0.021
Token corners
Token ID14371
Feature activation-1.01e-02

Pos logit contributions
kered+0.136
ves+0.132
 lesson+0.129
 common+0.127
ork+0.119
Neg logit contributions
years-0.124
 forth-0.123
rying-0.122
ipping-0.118
ologist-0.116
Token,
Token ID11
Feature activation-9.18e-03

Pos logit contributions
ves+0.017
kered+0.015
ggy+0.014
 lesson+0.014
ork+0.013
Neg logit contributions
rying-0.020
 forth-0.019
ipping-0.019
 least-0.019
oming-0.018
Token 7
Token ID767
Feature activation-4.40e-02

Pos logit contributions
ves+0.018
kered+0.017
ggy+0.016
ork+0.015
 lesson+0.015
Neg logit contributions
rying-0.015
 forth-0.015
 least-0.015
ter-0.014
ipping-0.013
Token sides
Token ID5389
Feature activation-3.05e-03

Pos logit contributions
kered+0.076
ves+0.074
 lesson+0.069
 common+0.064
ggy+0.064
Neg logit contributions
rying-0.076
 forth-0.073
ipping-0.071
 least-0.069
used-0.068
Token and
Token ID290
Feature activation-2.85e-03

Pos logit contributions
ves+0.005
kered+0.004
ggy+0.004
 lesson+0.004
ork+0.004
Neg logit contributions
rying-0.006
 least-0.006
 forth-0.006
ipping-0.006
ter-0.006
Token daddy
Token ID35822
Feature activation+6.56e-04

Pos logit contributions
rying+0.002
 least+0.002
 forth+0.002
ipping+0.002
umper+0.002
Neg logit contributions
ves-0.001
kered-0.001
 lesson-0.001
ggy-0.001
ork-0.001
Token.
Token ID13
Feature activation+9.42e-03

Pos logit contributions
rying+0.002
 forth+0.002
 least+0.002
ipping+0.002
oming+0.001
Neg logit contributions
ves-0.001
kered-0.001
ggy-0.001
 lesson-0.001
 teaches-0.000
Token They
Token ID1119
Feature activation+1.41e-02

Pos logit contributions
rying+0.021
 least+0.020
 forth+0.020
ipping+0.019
ter+0.019
Neg logit contributions
ves-0.014
kered-0.013
ggy-0.011
 lesson-0.011
ork-0.010
Token had
Token ID550
Feature activation-5.61e-03

Pos logit contributions
rying+0.033
 least+0.031
 forth+0.030
ipping+0.030
oming+0.029
Neg logit contributions
ves-0.018
kered-0.017
ggy-0.015
 lesson-0.014
ork-0.013
Token a
Token ID257
Feature activation-2.36e-02

Pos logit contributions
ves+0.011
kered+0.010
ork+0.009
ggy+0.009
 choice+0.008
Neg logit contributions
 least-0.009
 highlight-0.009
ter-0.009
rying-0.008
 forth-0.008
Token 3
Token ID513
Feature activation-7.86e-02

Pos logit contributions
ves+0.032
kered+0.029
ggy+0.027
 lesson+0.026
ork+0.024
Neg logit contributions
rying-0.055
 least-0.051
 forth-0.050
ipping-0.050
ter-0.049
Token year
Token ID614
Feature activation-2.03e-02

Pos logit contributions
ves+0.121
kered+0.116
 common+0.106
 lesson+0.106
 happens+0.105
Neg logit contributions
rying-0.125
 forth-0.123
ipping-0.118
ish-0.115
years-0.112
Token old
Token ID1468
Feature activation-4.27e-03

Pos logit contributions
ves+0.029
kered+0.027
ggy+0.023
 lesson+0.022
ork+0.021
Neg logit contributions
rying-0.045
 least-0.042
 forth-0.042
ipping-0.041
oming-0.041
Token girl
Token ID2576
Feature activation-1.12e-02

Pos logit contributions
ves+0.009
kered+0.008
ggy+0.008
 lesson+0.008
ork+0.007
Neg logit contributions
rying-0.007
 least-0.006
 forth-0.006
ipping-0.006
ter-0.006
Token who
Token ID508
Feature activation-1.81e-02

Pos logit contributions
ves+0.025
kered+0.024
 lesson+0.022
ggy+0.021
ork+0.020
Neg logit contributions
rying-0.016
 least-0.015
 forth-0.014
ipping-0.014
ter-0.014
Token was
Token ID373
Feature activation+1.03e-02

Pos logit contributions
ves+0.032
kered+0.029
ork+0.028
 lesson+0.027
 purpose+0.027
Neg logit contributions
 least-0.031
rying-0.030
ter-0.028
ancer-0.028
 forth-0.027
Token same
Token ID976
Feature activation+4.81e-03

Pos logit contributions
rying+0.019
 forth+0.017
 least+0.017
ipping+0.016
ter+0.016
Neg logit contributions
ves-0.021
kered-0.020
ggy-0.019
 lesson-0.019
ork-0.018
Token house
Token ID2156
Feature activation-1.24e-02

Pos logit contributions
rying+0.007
 least+0.007
 forth+0.007
ipping+0.007
umper+0.006
Neg logit contributions
ves-0.010
kered-0.010
ggy-0.009
 lesson-0.009
 teaches-0.008
Token.
Token ID13
Feature activation+4.07e-03

Pos logit contributions
ves+0.019
kered+0.018
 lesson+0.016
ggy+0.015
ork+0.015
Neg logit contributions
rying-0.026
 least-0.025
 forth-0.024
ipping-0.024
ter-0.024
Token They
Token ID1119
Feature activation+1.43e-02

Pos logit contributions
 least+0.007
rying+0.007
ter+0.007
umper+0.007
ancer+0.007
Neg logit contributions
ves-0.007
kered-0.007
ggy-0.006
 lesson-0.006
ork-0.005
Token were
Token ID547
Feature activation+1.06e-02

Pos logit contributions
rying+0.031
 least+0.030
 forth+0.029
ter+0.028
ipping+0.028
Neg logit contributions
ves-0.021
kered-0.019
ggy-0.017
 lesson-0.017
ork-0.016
Token 3
Token ID513
Feature activation-7.83e-02

Pos logit contributions
rying+0.018
 least+0.017
 forth+0.017
ter+0.016
ipping+0.016
Neg logit contributions
ves-0.021
kered-0.020
ggy-0.018
 lesson-0.017
ork-0.017
Token years
Token ID812
Feature activation-1.30e-02

Pos logit contributions
ves+0.154
kered+0.151
ork+0.136
 lesson+0.136
 common+0.136
Neg logit contributions
rying-0.109
years-0.103
 forth-0.102
 least-0.096
ipping-0.095
Token old
Token ID1468
Feature activation-1.64e-02

Pos logit contributions
ves+0.016
kered+0.015
ggy+0.013
 lesson+0.012
ork+0.011
Neg logit contributions
rying-0.031
 least-0.030
 forth-0.030
ipping-0.029
ter-0.029
Token and
Token ID290
Feature activation-2.18e-02

Pos logit contributions
ves+0.023
kered+0.021
ggy+0.018
 lesson+0.018
ork+0.017
Neg logit contributions
rying-0.037
 least-0.035
 forth-0.035
ter-0.034
ipping-0.034
Token both
Token ID1111
Feature activation-2.22e-02

Pos logit contributions
ves+0.039
ork+0.038
kered+0.036
 lesson+0.033
its+0.031
Neg logit contributions
ter-0.032
 least-0.031
years-0.030
 forth-0.030
mallow-0.030
Token very
Token ID845
Feature activation-1.47e-02

Pos logit contributions
ves+0.035
kered+0.030
ork+0.028
ggy+0.028
 lesson+0.027
Neg logit contributions
rying-0.045
 least-0.044
ter-0.043
 forth-0.042
ting-0.041
Token called
Token ID1444
Feature activation+1.57e-02

Pos logit contributions
ves+0.019
kered+0.018
ggy+0.016
 lesson+0.016
ork+0.015
Neg logit contributions
rying-0.017
 least-0.016
 forth-0.016
ipping-0.015
ter-0.015
Token Isabel
Token ID41838
Feature activation-1.22e-02

Pos logit contributions
rying+0.030
 least+0.028
ter+0.028
 gentlemen+0.027
ancer+0.027
Neg logit contributions
ves-0.027
kered-0.026
 lesson-0.022
ggy-0.021
ork-0.020
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
ves+0.022
kered+0.022
 lesson+0.019
 purpose+0.019
ggy+0.019
Neg logit contributions
rying-0.021
 least-0.019
ter-0.018
 forth-0.018
umper-0.017
Token Isabel
Token ID41838
Feature activation-4.40e-02

Pos logit contributions
 least+0.019
rying+0.019
 forth+0.018
ancer+0.017
ipping+0.017
Neg logit contributions
ves-0.027
kered-0.025
ggy-0.024
 lesson-0.022
ork-0.021
Token was
Token ID373
Feature activation+1.36e-02

Pos logit contributions
ves+0.063
kered+0.060
 purpose+0.058
 lesson+0.056
 common+0.054
Neg logit contributions
rying-0.083
 least-0.083
 forth-0.082
ter-0.077
ish-0.074
Token 3
Token ID513
Feature activation-7.82e-02

Pos logit contributions
rying+0.027
 least+0.025
 forth+0.025
ipping+0.024
umper+0.024
Neg logit contributions
ves-0.022
kered-0.021
ggy-0.019
 lesson-0.018
ork-0.017
Token years
Token ID812
Feature activation-1.39e-02

Pos logit contributions
ves+0.164
kered+0.157
 common+0.143
 lesson+0.142
ork+0.140
Neg logit contributions
rying-0.112
 forth-0.106
years-0.098
 least-0.096
ting-0.095
Token old
Token ID1468
Feature activation-1.28e-02

Pos logit contributions
ves+0.014
kered+0.013
 lesson+0.011
ggy+0.011
ork+0.010
Neg logit contributions
rying-0.036
 least-0.035
 forth-0.034
ipping-0.034
ter-0.033
Token.
Token ID13
Feature activation+1.81e-03

Pos logit contributions
ves+0.016
kered+0.014
ggy+0.013
 lesson+0.012
ork+0.011
Neg logit contributions
rying-0.031
 least-0.029
 forth-0.029
ipping-0.028
umper-0.028
Token One
Token ID1881
Feature activation-2.80e-02

Pos logit contributions
 least+0.003
rying+0.003
 forth+0.003
ancer+0.002
used+0.002
Neg logit contributions
ves-0.004
kered-0.003
ggy-0.003
 lesson-0.003
ork-0.003
Token day
Token ID1110
Feature activation-7.34e-03

Pos logit contributions
ves+0.044
kered+0.042
ggy+0.037
 lesson+0.037
ork+0.034
Neg logit contributions
rying-0.058
 least-0.055
 forth-0.054
ipping-0.053
ter-0.052
Token all
Token ID477
Feature activation-1.00e-03

Pos logit contributions
rying+0.060
 least+0.058
 forth+0.057
ter+0.055
umper+0.053
Neg logit contributions
ves-0.055
kered-0.052
ggy-0.045
ork-0.045
 lesson-0.044
Token he
Token ID339
Feature activation-1.02e-02

Pos logit contributions
ves+0.001
kered+0.001
ggy+0.001
 lesson+0.001
ork+0.001
Neg logit contributions
rying-0.002
 least-0.002
ipping-0.002
 forth-0.002
ter-0.002
Token wanted
Token ID2227
Feature activation-1.50e-03

Pos logit contributions
ves+0.017
kered+0.015
ork+0.014
 lesson+0.014
 words+0.013
Neg logit contributions
 least-0.020
rying-0.020
ter-0.019
 forth-0.018
umper-0.018
Token for
Token ID329
Feature activation-2.70e-02

Pos logit contributions
ves+0.003
kered+0.003
ggy+0.002
ork+0.002
 lesson+0.002
Neg logit contributions
rying-0.003
 least-0.003
 forth-0.002
 surfing-0.002
ter-0.002
Token his
Token ID465
Feature activation-1.71e-02

Pos logit contributions
ves+0.040
kered+0.038
ggy+0.032
 lesson+0.032
ork+0.031
Neg logit contributions
rying-0.058
 least-0.056
 forth-0.055
ipping-0.053
ter-0.052
Token 3
Token ID513
Feature activation-7.82e-02

Pos logit contributions
ves+0.031
kered+0.030
ggy+0.027
 lesson+0.026
ork+0.025
Neg logit contributions
rying-0.031
 least-0.029
 forth-0.029
ipping-0.028
ter-0.027
Tokenrd
Token ID4372
Feature activation-1.58e-02

Pos logit contributions
ves+0.109
kered+0.109
ork+0.088
 common+0.081
 teaches+0.080
Neg logit contributions
rying-0.157
 forth-0.150
ipping-0.144
ologist-0.143
 least-0.141
Token birthday
Token ID10955
Feature activation+1.21e-04

Pos logit contributions
ves+0.030
kered+0.029
ggy+0.026
 lesson+0.025
ork+0.025
Neg logit contributions
rying-0.027
 least-0.026
 forth-0.025
ipping-0.024
ter-0.024
Token was
Token ID373
Feature activation+9.18e-03

Pos logit contributions
rying+0.000
 least+0.000
 forth+0.000
ipping+0.000
ter+0.000
Neg logit contributions
ves-0.000
kered-0.000
ggy-0.000
 lesson-0.000
ork-0.000
Token to
Token ID284
Feature activation+2.55e-03

Pos logit contributions
 least+0.016
rying+0.016
ter+0.014
 surfing+0.014
 forth+0.014
Neg logit contributions
ves-0.018
kered-0.017
ork-0.014
ggy-0.014
 lesson-0.014
Token try
Token ID1949
Feature activation-1.83e-02

Pos logit contributions
 least+0.004
rying+0.004
 forth+0.004
ter+0.003
 gentlemen+0.003
Neg logit contributions
ves-0.005
kered-0.005
ggy-0.005
ork-0.004
 lesson-0.004
Tokenla
Token ID5031
Feature activation+1.51e-02

Pos logit contributions
 sidelines+0.052
 forth+0.050
 highlight+0.050
 least+0.049
 gentlemen+0.049
Neg logit contributions
kered-0.074
ves-0.071
 was-0.066
ggy-0.064
 time-0.063
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
rying+0.038
 least+0.036
 forth+0.036
ipping+0.035
umper+0.035
Neg logit contributions
ves-0.017
kered-0.016
ggy-0.013
 lesson-0.012
ork-0.011
Token They
Token ID1119
Feature activation+1.74e-02

Pos logit contributions
rying+0.039
 least+0.038
 forth+0.037
ipping+0.036
ter+0.036
Neg logit contributions
ves-0.018
kered-0.016
ggy-0.014
 lesson-0.014
 teeth-0.012
Token were
Token ID547
Feature activation+1.11e-02

Pos logit contributions
rying+0.040
 least+0.038
ipping+0.038
 forth+0.037
oming+0.036
Neg logit contributions
ves-0.023
kered-0.022
ggy-0.019
 lesson-0.018
 teaches-0.016
Token both
Token ID1111
Feature activation+1.26e-03

Pos logit contributions
rying+0.020
 least+0.018
 forth+0.018
ipping+0.018
umper+0.017
Neg logit contributions
ves-0.020
kered-0.019
ggy-0.018
 lesson-0.017
 teeth-0.016
Token 3
Token ID513
Feature activation-7.82e-02

Pos logit contributions
rying+0.002
 least+0.002
 forth+0.002
ter+0.002
ipping+0.002
Neg logit contributions
ves-0.002
kered-0.002
ggy-0.002
 lesson-0.002
ork-0.002
Token years
Token ID812
Feature activation-8.13e-03

Pos logit contributions
ves+0.157
kered+0.156
ork+0.143
 lesson+0.138
 common+0.136
Neg logit contributions
rying-0.110
 forth-0.101
years-0.097
ipping-0.095
 least-0.093
Token old
Token ID1468
Feature activation-6.82e-03

Pos logit contributions
ves+0.010
kered+0.009
ggy+0.008
 lesson+0.008
ork+0.007
Neg logit contributions
rying-0.020
 least-0.019
 forth-0.019
ipping-0.018
ter-0.018
Token and
Token ID290
Feature activation-1.04e-02

Pos logit contributions
ves+0.007
kered+0.007
ggy+0.006
 lesson+0.006
 teeth+0.005
Neg logit contributions
rying-0.017
 least-0.016
 forth-0.016
ipping-0.016
umper-0.016
Token were
Token ID547
Feature activation+5.75e-03

Pos logit contributions
ves+0.016
kered+0.015
ork+0.013
 lesson+0.013
ggy+0.012
Neg logit contributions
rying-0.021
 least-0.021
ter-0.020
 forth-0.020
ipping-0.019
Token always
Token ID1464
Feature activation+1.15e-02

Pos logit contributions
rying+0.011
 least+0.010
 forth+0.010
ipping+0.010
umper+0.010
Neg logit contributions
ves-0.010
kered-0.009
ggy-0.008
 lesson-0.008
ork-0.007
Token special
Token ID2041
Feature activation-2.49e-02

Pos logit contributions
kered+0.022
ves+0.022
ggy+0.019
 teeth+0.017
ggie+0.017
Neg logit contributions
rying-0.024
 least-0.024
 forth-0.023
ipping-0.022
oming-0.021
Token day
Token ID1110
Feature activation-1.40e-02

Pos logit contributions
ves+0.039
kered+0.039
ggy+0.033
 lesson+0.030
ork+0.029
Neg logit contributions
rying-0.049
 forth-0.048
 least-0.047
ipping-0.045
ter-0.044
Token,
Token ID11
Feature activation-6.41e-04

Pos logit contributions
ves+0.023
kered+0.022
 lesson+0.020
ggy+0.020
ork+0.019
Neg logit contributions
rying-0.025
ipping-0.024
 forth-0.024
 least-0.024
ter-0.024
Token she
Token ID673
Feature activation-7.01e-03

Pos logit contributions
ves+0.001
kered+0.001
ggy+0.001
 lesson+0.001
ork+0.001
Neg logit contributions
rying-0.001
 least-0.001
 forth-0.001
ipping-0.001
ter-0.001
Token had
Token ID550
Feature activation-9.04e-03

Pos logit contributions
ves+0.013
kered+0.012
ggy+0.011
 purpose+0.011
 common+0.011
Neg logit contributions
 least-0.012
 forth-0.011
rying-0.011
ode-0.010
years-0.010
Token three
Token ID1115
Feature activation-7.78e-02

Pos logit contributions
ves+0.015
kered+0.015
ggy+0.013
 lesson+0.012
 teaches+0.012
Neg logit contributions
 least-0.017
rying-0.016
years-0.015
 forth-0.015
 gentlemen-0.015
Token puzzles
Token ID24367
Feature activation-1.66e-02

Pos logit contributions
kered+0.129
ves+0.128
 lesson+0.116
ggy+0.111
 common+0.108
Neg logit contributions
rying-0.133
 forth-0.133
ipping-0.126
years-0.125
ologist-0.118
Token that
Token ID326
Feature activation-9.19e-03

Pos logit contributions
ves+0.029
kered+0.028
ggy+0.026
 lesson+0.025
ork+0.024
Neg logit contributions
rying-0.029
 least-0.027
 forth-0.027
ter-0.026
oming-0.025
Token she
Token ID673
Feature activation-2.41e-02

Pos logit contributions
ves+0.019
kered+0.018
ggy+0.017
ork+0.016
 lesson+0.016
Neg logit contributions
 forth-0.013
 least-0.013
rying-0.013
ter-0.012
ancer-0.012
Token wanted
Token ID2227
Feature activation-1.31e-02

Pos logit contributions
ves+0.040
kered+0.038
ggy+0.034
 lesson+0.034
 purpose+0.033
Neg logit contributions
rying-0.045
 least-0.043
 forth-0.042
years-0.041
ter-0.040
Token to
Token ID284
Feature activation-2.26e-02

Pos logit contributions
kered+0.028
ves+0.027
ggy+0.025
aron+0.022
 common+0.022
Neg logit contributions
rying-0.018
 least-0.018
 shopping-0.015
years-0.015
 forth-0.015
Token Ellen
Token ID23811
Feature activation+1.38e-02

Pos logit contributions
 least+0.054
rying+0.054
 forth+0.052
ter+0.051
 gentlemen+0.049
Neg logit contributions
ves-0.071
kered-0.067
ork-0.062
ggy-0.060
 lesson-0.058
Token.
Token ID13
Feature activation+1.82e-02

Pos logit contributions
rying+0.035
ipping+0.033
 forth+0.033
 least+0.033
umper+0.032
Neg logit contributions
ves-0.014
kered-0.014
ggy-0.012
 lesson-0.011
 was-0.009
Token They
Token ID1119
Feature activation+1.86e-02

Pos logit contributions
rying+0.041
 least+0.039
 forth+0.039
ipping+0.038
ter+0.037
Neg logit contributions
ves-0.025
kered-0.023
ggy-0.020
 lesson-0.020
ork-0.019
Token were
Token ID547
Feature activation+1.31e-02

Pos logit contributions
rying+0.041
 least+0.038
ipping+0.038
 forth+0.038
oming+0.036
Neg logit contributions
ves-0.027
kered-0.026
ggy-0.022
 lesson-0.022
 teaches-0.020
Token both
Token ID1111
Feature activation+2.31e-03

Pos logit contributions
rying+0.022
 least+0.020
 forth+0.019
ipping+0.019
umper+0.018
Neg logit contributions
ves-0.026
kered-0.024
ggy-0.023
 lesson-0.022
ork-0.021
Token 3
Token ID513
Feature activation-7.77e-02

Pos logit contributions
rying+0.004
 least+0.004
ter+0.004
 forth+0.004
ipping+0.004
Neg logit contributions
ves-0.004
kered-0.004
 lesson-0.004
ork-0.004
ggy-0.004
Token years
Token ID812
Feature activation-4.67e-03

Pos logit contributions
ves+0.155
kered+0.152
ork+0.139
 lesson+0.132
 common+0.130
Neg logit contributions
rying-0.112
 forth-0.105
years-0.099
ipping-0.098
 least-0.095
Token old
Token ID1468
Feature activation-4.58e-03

Pos logit contributions
ves+0.006
kered+0.005
ggy+0.004
 lesson+0.004
ork+0.004
Neg logit contributions
rying-0.011
 least-0.011
 forth-0.011
ipping-0.010
ter-0.010
Token and
Token ID290
Feature activation-1.07e-02

Pos logit contributions
ves+0.005
kered+0.005
 lesson+0.004
ggy+0.004
 teeth+0.004
Neg logit contributions
rying-0.011
 least-0.011
 forth-0.011
ipping-0.011
umper-0.010
Token were
Token ID547
Feature activation+6.59e-03

Pos logit contributions
ves+0.019
kered+0.018
ork+0.017
ggy+0.016
 lesson+0.016
Neg logit contributions
 least-0.019
 forth-0.018
rying-0.018
ter-0.018
ting-0.017
Token always
Token ID1464
Feature activation+1.19e-02

Pos logit contributions
rying+0.012
 least+0.011
 forth+0.011
ipping+0.010
ter+0.010
Neg logit contributions
ves-0.012
kered-0.012
ggy-0.011
 lesson-0.010
ork-0.010
Token called
Token ID1444
Feature activation+7.15e-03

Pos logit contributions
ves+0.019
kered+0.018
ggy+0.016
 lesson+0.016
ork+0.015
Neg logit contributions
rying-0.017
 least-0.016
 forth-0.016
ipping-0.015
ter-0.015
Token Paul
Token ID3362
Feature activation+4.94e-03

Pos logit contributions
rying+0.012
 least+0.011
ter+0.011
ancer+0.011
 forth+0.011
Neg logit contributions
ves-0.014
kered-0.013
 lesson-0.012
ggy-0.011
ork-0.011
Token.
Token ID13
Feature activation+4.01e-03

Pos logit contributions
rying+0.010
 least+0.010
 forth+0.009
ter+0.009
umper+0.009
Neg logit contributions
ves-0.008
kered-0.008
ggy-0.007
 lesson-0.007
ork-0.006
Token Paul
Token ID3362
Feature activation-2.72e-02

Pos logit contributions
rying+0.008
 least+0.007
 forth+0.007
ipping+0.007
ter+0.007
Neg logit contributions
ves-0.007
kered-0.007
ggy-0.006
 lesson-0.006
ork-0.006
Token was
Token ID373
Feature activation+7.25e-03

Pos logit contributions
ves+0.047
kered+0.043
 lesson+0.039
ork+0.038
 purpose+0.036
Neg logit contributions
 least-0.048
rying-0.047
 forth-0.046
ter-0.045
ish-0.043
Token 3
Token ID513
Feature activation-7.76e-02

Pos logit contributions
rying+0.015
 least+0.014
 forth+0.014
ipping+0.013
ter+0.013
Neg logit contributions
ves-0.012
kered-0.011
ggy-0.010
 lesson-0.010
ork-0.009
Token years
Token ID812
Feature activation-1.63e-02

Pos logit contributions
ves+0.157
kered+0.153
 lesson+0.137
ork+0.135
 common+0.134
Neg logit contributions
rying-0.113
 forth-0.105
years-0.097
ipping-0.096
 least-0.094
Token old
Token ID1468
Feature activation-1.29e-02

Pos logit contributions
ves+0.019
kered+0.017
ggy+0.015
 lesson+0.015
ork+0.013
Neg logit contributions
rying-0.041
 least-0.039
 forth-0.038
ipping-0.037
ter-0.037
Token and
Token ID290
Feature activation-1.50e-02

Pos logit contributions
ves+0.016
kered+0.015
ggy+0.013
 lesson+0.012
ork+0.011
Neg logit contributions
rying-0.031
 least-0.030
 forth-0.029
ipping-0.029
umper-0.028
Token was
Token ID373
Feature activation+4.03e-03

Pos logit contributions
ves+0.027
kered+0.025
ork+0.024
 lesson+0.021
ggy+0.021
Neg logit contributions
 forth-0.025
 least-0.025
rying-0.024
ter-0.024
years-0.024
Token very
Token ID845
Feature activation+1.19e-03

Pos logit contributions
rying+0.008
 least+0.007
 forth+0.007
ipping+0.007
ter+0.007
Neg logit contributions
ves-0.007
kered-0.007
ggy-0.006
 lesson-0.006
ork-0.006
Token named
Token ID3706
Feature activation+2.23e-02

Pos logit contributions
ves+0.028
kered+0.026
ggy+0.024
 lesson+0.024
ork+0.023
Neg logit contributions
rying-0.024
 least-0.023
 forth-0.022
ipping-0.021
ter-0.021
Token Tom
Token ID4186
Feature activation-1.66e-02

Pos logit contributions
rying+0.045
 least+0.043
 forth+0.042
ipping+0.042
ter+0.040
Neg logit contributions
ves-0.036
kered-0.034
ggy-0.031
 lesson-0.030
ork-0.029
Token.
Token ID13
Feature activation+7.28e-03

Pos logit contributions
ves+0.028
kered+0.026
 lesson+0.023
ggy+0.022
ork+0.021
Neg logit contributions
rying-0.034
 least-0.032
 forth-0.031
umper-0.030
ter-0.030
Token Tom
Token ID4186
Feature activation-4.26e-02

Pos logit contributions
 forth+0.012
rying+0.012
 least+0.012
 surfing+0.012
ipping+0.012
Neg logit contributions
ves-0.013
kered-0.012
ggy-0.011
ork-0.010
 lesson-0.010
Token was
Token ID373
Feature activation+9.90e-03

Pos logit contributions
ves+0.069
kered+0.063
 lesson+0.055
ork+0.054
ggy+0.051
Neg logit contributions
 least-0.083
rying-0.082
 forth-0.081
ter-0.077
umper-0.076
Token 3
Token ID513
Feature activation-7.74e-02

Pos logit contributions
rying+0.020
 least+0.019
 forth+0.019
ipping+0.018
ter+0.018
Neg logit contributions
ves-0.016
kered-0.015
ggy-0.013
 lesson-0.013
ork-0.012
Token years
Token ID812
Feature activation-1.45e-02

Pos logit contributions
ves+0.153
kered+0.147
 common+0.134
 lesson+0.133
ork+0.132
Neg logit contributions
rying-0.116
 forth-0.109
years-0.100
ipping-0.099
ter-0.097
Token old
Token ID1468
Feature activation-1.66e-02

Pos logit contributions
ves+0.016
kered+0.015
ggy+0.013
 lesson+0.013
ork+0.012
Neg logit contributions
rying-0.036
 least-0.035
 forth-0.034
ipping-0.034
ter-0.033
Token and
Token ID290
Feature activation-1.09e-02

Pos logit contributions
ves+0.020
kered+0.018
ggy+0.016
 lesson+0.015
ork+0.014
Neg logit contributions
rying-0.040
 least-0.038
 forth-0.038
ipping-0.037
oming-0.036
Token wanted
Token ID2227
Feature activation-5.38e-03

Pos logit contributions
ves+0.019
kered+0.017
ork+0.016
 lesson+0.014
ggy+0.014
Neg logit contributions
 forth-0.019
rying-0.019
 least-0.019
ter-0.019
years-0.018
Token to
Token ID284
Feature activation-2.33e-02

Pos logit contributions
ves+0.009
kered+0.009
ggy+0.007
 lesson+0.007
ork+0.007
Neg logit contributions
rying-0.010
 least-0.010
 forth-0.010
ter-0.009
ipping-0.009
Token fight
Token ID1907
Feature activation-1.16e-02

Pos logit contributions
ves+0.028
kered+0.026
ggy+0.024
 lesson+0.023
ork+0.022
Neg logit contributions
rying-0.045
 least-0.042
 forth-0.042
ipping-0.041
ter-0.040
Token.
Token ID13
Feature activation-9.83e-04

Pos logit contributions
ves+0.019
kered+0.018
 lesson+0.016
ork+0.016
ggy+0.016
Neg logit contributions
rying-0.022
 forth-0.021
 least-0.020
ter-0.020
ipping-0.020
Token They
Token ID1119
Feature activation-1.27e-03

Pos logit contributions
ves+0.002
kered+0.001
ggy+0.001
 lesson+0.001
ork+0.001
Neg logit contributions
rying-0.002
 least-0.002
 forth-0.002
ter-0.002
ipping-0.002
Token were
Token ID547
Feature activation-4.42e-04

Pos logit contributions
ves+0.002
kered+0.002
ork+0.002
ggy+0.002
 lesson+0.002
Neg logit contributions
 forth-0.002
 least-0.002
rying-0.002
ter-0.002
umper-0.002
Token both
Token ID1111
Feature activation-7.66e-03

Pos logit contributions
ves+0.001
kered+0.001
ggy+0.001
 lesson+0.001
ork+0.001
Neg logit contributions
rying-0.001
 least-0.001
 forth-0.001
ipping-0.001
ter-0.001
Token 3
Token ID513
Feature activation-7.74e-02

Pos logit contributions
ves+0.017
kered+0.016
ork+0.014
 lesson+0.014
ggy+0.014
Neg logit contributions
 least-0.011
rying-0.011
 forth-0.011
used-0.010
ter-0.010
Token years
Token ID812
Feature activation-4.58e-03

Pos logit contributions
kered+0.139
ves+0.139
 common+0.125
 lesson+0.123
ork+0.123
Neg logit contributions
rying-0.117
 forth-0.116
ipping-0.110
ting-0.104
years-0.104
Token old
Token ID1468
Feature activation-1.38e-03

Pos logit contributions
ves+0.006
kered+0.005
ggy+0.005
 lesson+0.005
ork+0.004
Neg logit contributions
rying-0.011
 least-0.010
 forth-0.010
ipping-0.010
ter-0.010
Token and
Token ID290
Feature activation-1.90e-03

Pos logit contributions
ves+0.002
kered+0.002
ggy+0.002
 lesson+0.002
ork+0.002
Neg logit contributions
rying-0.003
 least-0.003
 forth-0.003
ter-0.003
ipping-0.003
Token were
Token ID547
Feature activation+7.56e-04

Pos logit contributions
ves+0.004
kered+0.004
ork+0.004
 lesson+0.003
ggy+0.003
Neg logit contributions
rying-0.003
 forth-0.003
ter-0.003
 least-0.003
used-0.002
Token very
Token ID845
Feature activation+3.84e-03

Pos logit contributions
rying+0.001
 least+0.001
 forth+0.001
ipping+0.001
ter+0.001
Neg logit contributions
ves-0.002
kered-0.001
ggy-0.001
 lesson-0.001
ork-0.001
Token Joe
Token ID5689
Feature activation-3.71e-03

Pos logit contributions
rying+0.098
 least+0.092
 forth+0.090
ipping+0.088
ter+0.086
Neg logit contributions
ves-0.086
kered-0.081
ggy-0.074
 lesson-0.071
ork-0.067
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
ves+0.005
kered+0.004
ggy+0.004
 lesson+0.004
ork+0.003
Neg logit contributions
rying-0.009
 least-0.008
 forth-0.008
ipping-0.008
ter-0.008
Token They
Token ID1119
Feature activation+1.57e-02

Pos logit contributions
rying+0.040
 least+0.038
 forth+0.038
ipping+0.037
ter+0.036
Neg logit contributions
ves-0.016
kered-0.015
 lesson-0.012
ggy-0.012
 teeth-0.011
Token were
Token ID547
Feature activation+1.09e-02

Pos logit contributions
rying+0.039
 least+0.036
ipping+0.036
 forth+0.036
oming+0.035
Neg logit contributions
ves-0.019
kered-0.018
ggy-0.015
 lesson-0.015
 teaches-0.013
Token both
Token ID1111
Feature activation+2.01e-03

Pos logit contributions
rying+0.018
 least+0.016
ipping+0.016
 forth+0.016
umper+0.015
Neg logit contributions
ves-0.021
kered-0.021
ggy-0.019
 lesson-0.019
 teeth-0.018
Token 3
Token ID513
Feature activation-7.71e-02

Pos logit contributions
rying+0.004
 least+0.004
 forth+0.003
ter+0.003
ipping+0.003
Neg logit contributions
ves-0.004
kered-0.003
ggy-0.003
 lesson-0.003
ork-0.003
Token years
Token ID812
Feature activation-6.69e-03

Pos logit contributions
ves+0.148
kered+0.145
ork+0.132
 lesson+0.130
 common+0.126
Neg logit contributions
rying-0.118
 forth-0.108
years-0.103
ipping-0.102
 least-0.100
Token old
Token ID1468
Feature activation-6.89e-03

Pos logit contributions
ves+0.008
kered+0.007
ggy+0.006
 lesson+0.006
ork+0.006
Neg logit contributions
rying-0.016
 least-0.016
 forth-0.016
ipping-0.015
ter-0.015
Token,
Token ID11
Feature activation-2.17e-04

Pos logit contributions
ves+0.008
kered+0.007
ggy+0.006
 lesson+0.006
 teeth+0.005
Neg logit contributions
rying-0.017
 least-0.016
 forth-0.016
ipping-0.016
umper-0.016
Token and
Token ID290
Feature activation-1.15e-03

Pos logit contributions
ves+0.000
kered+0.000
ork+0.000
ggy+0.000
 lesson+0.000
Neg logit contributions
rying-0.000
 least-0.000
 forth-0.000
ter-0.000
ipping-0.000
Token they
Token ID484
Feature activation+1.50e-02

Pos logit contributions
ves+0.002
kered+0.002
ork+0.002
 lesson+0.002
ggy+0.002
Neg logit contributions
rying-0.002
 least-0.002
 forth-0.002
ter-0.002
ipping-0.002
Token Sil
Token ID4243
Feature activation+2.53e-02

Pos logit contributions
rying+0.044
 least+0.042
 forth+0.041
ipping+0.040
ter+0.040
Neg logit contributions
ves-0.025
kered-0.023
ggy-0.020
 lesson-0.020
ork-0.018
Tokenvia
Token ID8869
Feature activation-7.48e-03

Pos logit contributions
rying+0.050
 least+0.046
 forth+0.045
ipping+0.044
umper+0.043
Neg logit contributions
ves-0.042
kered-0.040
ggy-0.036
 lesson-0.036
ork-0.033
Token.
Token ID13
Feature activation+2.16e-02

Pos logit contributions
ves+0.014
kered+0.014
 lesson+0.013
ork+0.012
 teeth+0.012
Neg logit contributions
rying-0.013
ter-0.012
 least-0.012
 forth-0.011
ting-0.011
Token She
Token ID1375
Feature activation-1.94e-02

Pos logit contributions
rying+0.042
 least+0.039
 forth+0.038
ipping+0.038
umper+0.036
Neg logit contributions
ves-0.037
kered-0.035
 lesson-0.031
ggy-0.031
ork-0.030
Token was
Token ID373
Feature activation+8.11e-03

Pos logit contributions
ves+0.036
kered+0.035
 choice+0.034
 lesson+0.033
 purpose+0.033
Neg logit contributions
ter-0.026
�-0.025
 least-0.024
 forth-0.023
Playing-0.023
Token 3
Token ID513
Feature activation-7.68e-02

Pos logit contributions
rying+0.016
 least+0.016
 forth+0.015
ter+0.015
ipping+0.014
Neg logit contributions
ves-0.014
kered-0.013
ggy-0.011
 lesson-0.011
ork-0.011
Token years
Token ID812
Feature activation-1.48e-02

Pos logit contributions
ves+0.157
kered+0.151
 lesson+0.139
 common+0.136
ork+0.134
Neg logit contributions
rying-0.108
years-0.100
 forth-0.100
ter-0.094
ipping-0.092
Token old
Token ID1468
Feature activation-2.07e-02

Pos logit contributions
ves+0.014
kered+0.013
 lesson+0.011
ggy+0.011
ork+0.010
Neg logit contributions
rying-0.039
 least-0.038
 forth-0.037
ipping-0.037
 highlight-0.036
Token and
Token ID290
Feature activation-2.18e-02

Pos logit contributions
ves+0.029
kered+0.027
ggy+0.023
 lesson+0.023
ork+0.022
Neg logit contributions
rying-0.046
 least-0.044
 forth-0.043
ter-0.043
ipping-0.042
Token loved
Token ID6151
Feature activation-1.29e-03

Pos logit contributions
ves+0.039
kered+0.036
ork+0.034
 lesson+0.032
 purpose+0.030
Neg logit contributions
 least-0.035
ter-0.035
 forth-0.035
years-0.035
rying-0.034
Token to
Token ID284
Feature activation-1.46e-02

Pos logit contributions
ves+0.002
kered+0.002
 lesson+0.001
ggy+0.001
 teeth+0.001
Neg logit contributions
rying-0.003
 least-0.003
 forth-0.003
ipping-0.003
ter-0.003
Token baby
Token ID5156
Feature activation+1.35e-02

Pos logit contributions
rying+0.009
 least+0.008
 forth+0.008
ipping+0.008
umper+0.008
Neg logit contributions
ves-0.005
kered-0.005
 lesson-0.005
ggy-0.005
ork-0.004
Token.
Token ID13
Feature activation+5.83e-03

Pos logit contributions
rying+0.031
 least+0.028
ipping+0.028
oming+0.028
 forth+0.028
Neg logit contributions
ves-0.017
kered-0.016
ggy-0.015
 lesson-0.014
 was-0.013
Token The
Token ID383
Feature activation+5.40e-03

Pos logit contributions
rying+0.013
 least+0.012
 forth+0.012
ipping+0.012
ter+0.012
Neg logit contributions
ves-0.008
kered-0.008
ggy-0.007
 lesson-0.007
ork-0.006
Token baby
Token ID5156
Feature activation-1.29e-02

Pos logit contributions
rying+0.010
 least+0.009
 forth+0.009
ter+0.009
ipping+0.009
Neg logit contributions
ves-0.010
kered-0.009
ggy-0.009
 lesson-0.008
ork-0.008
Token was
Token ID373
Feature activation+1.27e-02

Pos logit contributions
ves+0.015
kered+0.014
 lesson+0.011
ork+0.011
ggy+0.011
Neg logit contributions
ter-0.031
 forth-0.031
 least-0.030
rying-0.030
flower-0.029
Token 3
Token ID513
Feature activation-7.67e-02

Pos logit contributions
rying+0.022
 least+0.021
ter+0.020
 forth+0.020
ipping+0.019
Neg logit contributions
ves-0.026
kered-0.024
ork-0.021
 lesson-0.021
ggy-0.021
Token years
Token ID812
Feature activation-9.09e-03

Pos logit contributions
ves+0.144
kered+0.140
 common+0.128
ork+0.126
 lesson+0.124
Neg logit contributions
rying-0.114
 forth-0.109
ter-0.097
years-0.097
ipping-0.097
Token old
Token ID1468
Feature activation-1.47e-02

Pos logit contributions
ves+0.011
kered+0.010
ggy+0.008
 lesson+0.008
ork+0.008
Neg logit contributions
rying-0.022
 least-0.022
 forth-0.021
ipping-0.021
ter-0.020
Token and
Token ID290
Feature activation-1.44e-02

Pos logit contributions
ves+0.019
kered+0.018
ggy+0.015
 lesson+0.015
ork+0.014
Neg logit contributions
rying-0.034
 least-0.033
 forth-0.032
ipping-0.032
ter-0.031
Token was
Token ID373
Feature activation+8.93e-03

Pos logit contributions
ves+0.026
kered+0.024
ork+0.024
 lesson+0.021
ggy+0.020
Neg logit contributions
ter-0.024
 forth-0.023
 least-0.023
mallow-0.022
ancer-0.021
Token very
Token ID845
Feature activation+4.82e-03

Pos logit contributions
rying+0.017
 least+0.016
 forth+0.016
ter+0.015
ipping+0.015
Neg logit contributions
ves-0.016
kered-0.015
ggy-0.013
 lesson-0.013
ork-0.013
Token a
Token ID257
Feature activation-5.95e-04

Pos logit contributions
rying+0.080
 least+0.077
 forth+0.075
ipping+0.073
ter+0.072
Neg logit contributions
ves-0.053
kered-0.049
ggy-0.043
 lesson-0.042
ork-0.040
Token daddy
Token ID35822
Feature activation-1.25e-04

Pos logit contributions
ves+0.001
kered+0.001
ggy+0.001
 lesson+0.001
ork+0.001
Neg logit contributions
rying-0.001
 least-0.001
 forth-0.001
ipping-0.001
umper-0.001
Token who
Token ID508
Feature activation-4.80e-03

Pos logit contributions
ves+0.000
kered+0.000
ggy+0.000
 lesson+0.000
ork+0.000
Neg logit contributions
rying-0.000
 forth-0.000
 least-0.000
ipping-0.000
oming-0.000
Token had
Token ID550
Feature activation-2.63e-03

Pos logit contributions
ves+0.009
kered+0.008
ork+0.008
 lesson+0.007
ggy+0.007
Neg logit contributions
 least-0.008
rying-0.008
ter-0.007
 forth-0.007
umper-0.007
Token a
Token ID257
Feature activation-2.13e-02

Pos logit contributions
ves+0.005
kered+0.005
ork+0.004
ggy+0.004
 lesson+0.004
Neg logit contributions
 least-0.005
ter-0.004
rying-0.004
 highlight-0.004
 forth-0.004
Token 3
Token ID513
Feature activation-7.66e-02

Pos logit contributions
ves+0.032
kered+0.029
ggy+0.027
 lesson+0.026
ork+0.025
Neg logit contributions
rying-0.046
 least-0.043
 forth-0.043
ipping-0.042
ter-0.041
Token year
Token ID614
Feature activation-1.74e-02

Pos logit contributions
ves+0.117
kered+0.109
 common+0.100
 lesson+0.098
ork+0.098
Neg logit contributions
rying-0.131
 forth-0.124
ipping-0.121
ish-0.120
years-0.119
Token old
Token ID1468
Feature activation-5.26e-03

Pos logit contributions
ves+0.022
kered+0.020
ggy+0.018
 lesson+0.017
ork+0.016
Neg logit contributions
rying-0.041
 least-0.039
 forth-0.038
ipping-0.038
ter-0.037
Token child
Token ID1200
Feature activation-1.33e-02

Pos logit contributions
ves+0.011
kered+0.010
ggy+0.009
 lesson+0.009
ork+0.009
Neg logit contributions
rying-0.008
 least-0.008
 forth-0.008
ipping-0.007
ter-0.007
Token.
Token ID13
Feature activation+5.77e-03

Pos logit contributions
ves+0.025
kered+0.024
 lesson+0.021
ggy+0.021
ork+0.020
Neg logit contributions
rying-0.023
 least-0.022
 forth-0.021
ipping-0.021
ter-0.021
Token Each
Token ID5501
Feature activation+3.21e-03

Pos logit contributions
rying+0.013
 least+0.012
 forth+0.012
ipping+0.012
ter+0.011
Neg logit contributions
ves-0.008
kered-0.008
ggy-0.007
 lesson-0.007
ork-0.006
Token a
Token ID257
Feature activation-2.36e-02

Pos logit contributions
ves+0.014
kered+0.013
ork+0.011
ggy+0.011
 choice+0.011
Neg logit contributions
 least-0.012
 highlight-0.012
 surfing-0.011
ter-0.011
 harbor-0.011
Token lot
Token ID1256
Feature activation-2.71e-02

Pos logit contributions
ves+0.033
kered+0.030
ggy+0.028
 lesson+0.028
ork+0.026
Neg logit contributions
rying-0.054
 least-0.050
 forth-0.049
ipping-0.049
ter-0.048
Token of
Token ID286
Feature activation-3.11e-02

Pos logit contributions
ves+0.046
kered+0.043
ggy+0.041
 lesson+0.038
ggie+0.037
Neg logit contributions
rying-0.048
 least-0.046
oming-0.044
ipping-0.043
ter-0.042
Token money
Token ID1637
Feature activation+5.24e-03

Pos logit contributions
ves+0.054
kered+0.050
ggy+0.045
ork+0.043
ggie+0.040
Neg logit contributions
rying-0.058
 least-0.057
 forth-0.055
umper-0.052
ter-0.052
Token and
Token ID290
Feature activation-1.69e-02

Pos logit contributions
rying+0.010
 least+0.010
 forth+0.010
ter+0.009
ipping+0.009
Neg logit contributions
ves-0.009
kered-0.008
ggy-0.007
 lesson-0.007
ork-0.007
Token three
Token ID1115
Feature activation-7.65e-02

Pos logit contributions
ves+0.032
kered+0.031
ggy+0.029
ork+0.028
aron+0.027
Neg logit contributions
 least-0.027
nam-0.024
 forth-0.023
years-0.023
 harbor-0.023
Token kids
Token ID3988
Feature activation-7.52e-03

Pos logit contributions
ves+0.124
kered+0.123
ggy+0.110
 lesson+0.109
ork+0.105
Neg logit contributions
rying-0.136
 forth-0.128
ipping-0.125
years-0.121
 least-0.121
Token.
Token ID13
Feature activation-7.64e-03

Pos logit contributions
ves+0.011
kered+0.010
ggy+0.009
 lesson+0.009
ork+0.008
Neg logit contributions
rying-0.016
 least-0.016
 forth-0.015
ipping-0.015
ter-0.015
Token " "
Token ID220
Feature activation-2.11e-02

Pos logit contributions
ves+0.011
kered+0.011
ggy+0.009
 lesson+0.009
ork+0.009
Neg logit contributions
rying-0.017
 least-0.016
 forth-0.016
ipping-0.015
ter-0.015
Token "\n"
Token ID198
Feature activation-1.13e-02

Pos logit contributions
kered+0.035
ves+0.035
ggy+0.031
 lesson+0.030
ork+0.028
Neg logit contributions
rying-0.039
 least-0.036
ancer-0.035
 forth-0.034
oming-0.034
Token "\n"
Token ID198
Feature activation+6.43e-03

Pos logit contributions
kered+0.021
ves+0.020
ggy+0.018
 lesson+0.018
 teeth+0.017
Neg logit contributions
rying-0.019
 least-0.018
ancer-0.017
 forth-0.017
oming-0.017
Token called
Token ID1444
Feature activation+9.38e-03

Pos logit contributions
ves+0.016
kered+0.015
ggy+0.014
 lesson+0.013
ork+0.013
Neg logit contributions
rying-0.017
 least-0.016
 forth-0.015
ipping-0.015
ter-0.015
Token Zoe
Token ID32280
Feature activation+8.48e-03

Pos logit contributions
rying+0.018
 least+0.017
ter+0.017
 forth+0.016
ancer+0.016
Neg logit contributions
ves-0.016
kered-0.015
 lesson-0.013
ggy-0.013
 teaches-0.012
Token.
Token ID13
Feature activation+2.78e-03

Pos logit contributions
rying+0.018
 least+0.017
 forth+0.016
ter+0.016
ipping+0.016
Neg logit contributions
ves-0.013
kered-0.012
 lesson-0.011
ggy-0.011
ork-0.010
Token Zoe
Token ID32280
Feature activation-2.67e-02

Pos logit contributions
rying+0.005
 forth+0.004
 least+0.004
ter+0.004
ipping+0.004
Neg logit contributions
ves-0.005
kered-0.005
ggy-0.005
ork-0.004
 lesson-0.004
Token was
Token ID373
Feature activation+1.16e-02

Pos logit contributions
ves+0.042
kered+0.038
 lesson+0.035
ggy+0.033
ork+0.032
Neg logit contributions
rying-0.053
 least-0.053
 forth-0.051
ter-0.049
ipping-0.047
Token 3
Token ID513
Feature activation-7.64e-02

Pos logit contributions
rying+0.023
 least+0.022
 forth+0.022
ipping+0.021
ter+0.021
Neg logit contributions
ves-0.019
kered-0.018
ggy-0.016
 lesson-0.016
ork-0.015
Token years
Token ID812
Feature activation-1.30e-02

Pos logit contributions
ves+0.153
kered+0.147
 lesson+0.131
ork+0.131
 common+0.128
Neg logit contributions
rying-0.117
 forth-0.111
 least-0.102
ipping-0.101
years-0.100
Token old
Token ID1468
Feature activation-1.18e-02

Pos logit contributions
ves+0.013
kered+0.012
 lesson+0.010
ggy+0.010
ork+0.009
Neg logit contributions
rying-0.034
 least-0.033
 forth-0.032
ipping-0.031
ter-0.031
Token and
Token ID290
Feature activation-1.18e-02

Pos logit contributions
ves+0.014
kered+0.013
ggy+0.011
 lesson+0.011
ork+0.010
Neg logit contributions
rying-0.029
 least-0.027
 forth-0.027
ipping-0.027
umper-0.026
Token love
Token ID1842
Feature activation+5.86e-03

Pos logit contributions
ves+0.019
kered+0.018
ork+0.016
 lesson+0.015
ggy+0.015
Neg logit contributions
 least-0.022
rying-0.022
 forth-0.022
ter-0.022
years-0.021
Token playing
Token ID2712
Feature activation-1.46e-02

Pos logit contributions
rying+0.012
 least+0.011
 forth+0.011
ipping+0.011
ter+0.011
Neg logit contributions
ves-0.010
kered-0.009
ggy-0.008
 lesson-0.008
ork-0.007
Token named
Token ID3706
Feature activation+1.96e-02

Pos logit contributions
ves+0.017
kered+0.016
ggy+0.015
 lesson+0.014
ork+0.014
Neg logit contributions
rying-0.016
 least-0.015
 forth-0.014
ipping-0.014
ter-0.014
Token Maria
Token ID14200
Feature activation-1.10e-02

Pos logit contributions
rying+0.041
 least+0.039
 forth+0.038
ter+0.037
ipping+0.036
Neg logit contributions
ves-0.031
kered-0.029
ggy-0.025
 lesson-0.024
ork-0.022
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
ves+0.018
kered+0.017
 lesson+0.015
ggy+0.014
ork+0.014
Neg logit contributions
rying-0.022
 least-0.021
 forth-0.021
ter-0.020
umper-0.019
Token She
Token ID1375
Feature activation-2.10e-02

Pos logit contributions
rying+0.024
 least+0.022
 forth+0.022
ipping+0.021
ter+0.020
Neg logit contributions
ves-0.028
kered-0.026
ggy-0.025
 lesson-0.024
ork-0.023
Token was
Token ID373
Feature activation+9.53e-03

Pos logit contributions
ves+0.038
kered+0.035
 lesson+0.031
 choice+0.031
ork+0.030
Neg logit contributions
 least-0.032
 forth-0.032
ter-0.032
rying-0.031
�-0.030
Token 3
Token ID513
Feature activation-7.63e-02

Pos logit contributions
rying+0.020
 least+0.018
 forth+0.018
ipping+0.018
umper+0.017
Neg logit contributions
ves-0.015
kered-0.014
ggy-0.013
 lesson-0.012
ork-0.011
Token years
Token ID812
Feature activation-1.21e-02

Pos logit contributions
ves+0.150
kered+0.145
 lesson+0.132
 common+0.128
ork+0.127
Neg logit contributions
rying-0.116
 forth-0.110
years-0.102
ipping-0.101
 least-0.098
Token old
Token ID1468
Feature activation-1.35e-02

Pos logit contributions
ves+0.013
kered+0.011
 lesson+0.010
ggy+0.010
ork+0.009
Neg logit contributions
rying-0.031
 least-0.030
 forth-0.030
ipping-0.029
ter-0.029
Token and
Token ID290
Feature activation-1.06e-02

Pos logit contributions
ves+0.016
kered+0.014
ggy+0.013
 lesson+0.012
ork+0.011
Neg logit contributions
rying-0.034
 least-0.032
 forth-0.031
ipping-0.031
oming-0.030
Token she
Token ID673
Feature activation-2.48e-02

Pos logit contributions
ves+0.019
kered+0.017
ork+0.016
 lesson+0.014
ggy+0.014
Neg logit contributions
 least-0.019
 forth-0.018
rying-0.018
ter-0.018
years-0.018
Token loved
Token ID6151
Feature activation+1.47e-03

Pos logit contributions
ves+0.045
kered+0.040
 lesson+0.036
ork+0.036
 words+0.034
Neg logit contributions
 least-0.043
rying-0.040
 forth-0.040
ter-0.039
years-0.039
Token a
Token ID257
Feature activation-4.10e-02

Pos logit contributions
ves+0.045
 common+0.040
aron+0.040
 teaches+0.039
kered+0.038
Neg logit contributions
 least-0.032
ter-0.029
 forth-0.029
ting-0.027
oming-0.027
Token sad
Token ID6507
Feature activation-2.36e-02

Pos logit contributions
ves+0.044
kered+0.040
ggy+0.033
 lesson+0.030
ork+0.027
Neg logit contributions
rying-0.102
 least-0.100
 forth-0.099
ter-0.095
ipping-0.095
Token sound
Token ID2128
Feature activation-3.20e-02

Pos logit contributions
ves+0.034
kered+0.031
ggy+0.027
 lesson+0.025
ork+0.024
Neg logit contributions
rying-0.051
ter-0.049
 forth-0.049
 least-0.048
ipping-0.047
Token.
Token ID13
Feature activation-2.36e-02

Pos logit contributions
ves+0.056
kered+0.051
 lesson+0.049
ggy+0.047
ork+0.045
Neg logit contributions
rying-0.057
ting-0.055
ter-0.055
 forth-0.055
 least-0.054
Token Sara
Token ID24799
Feature activation-3.73e-02

Pos logit contributions
ves+0.044
kered+0.041
ggy+0.037
 lesson+0.036
 choice+0.034
Neg logit contributions
 least-0.037
ancer-0.036
ipping-0.036
oming-0.036
umper-0.036
Token and
Token ID290
Feature activation+7.18e-02

Pos logit contributions
ves+0.064
kered+0.058
 lesson+0.054
ggy+0.053
 purpose+0.051
Neg logit contributions
 least-0.071
rying-0.070
 forth-0.069
ipping-0.067
ting-0.065
Token Ben
Token ID3932
Feature activation-1.13e-02

Pos logit contributions
rying+0.161
 least+0.144
 forth+0.139
oming+0.139
ipping+0.139
Neg logit contributions
ves-0.100
kered-0.094
 teeth-0.084
ggy-0.084
 lesson-0.084
Token stop
Token ID2245
Feature activation+4.74e-03

Pos logit contributions
ves+0.017
kered+0.016
ggy+0.014
 lesson+0.014
ork+0.013
Neg logit contributions
rying-0.024
 least-0.023
 forth-0.022
ipping-0.022
ter-0.021
Token playing
Token ID2712
Feature activation-1.39e-02

Pos logit contributions
rying+0.010
ancer+0.010
 least+0.010
umper+0.010
 highlight+0.010
Neg logit contributions
ves-0.006
kered-0.006
ggy-0.005
 teeth-0.005
 lesson-0.005
Token.
Token ID13
Feature activation-2.33e-02

Pos logit contributions
ves+0.024
kered+0.023
ggy+0.020
 lesson+0.020
ork+0.019
Neg logit contributions
ipping-0.024
 forth-0.024
rying-0.024
 least-0.023
ting-0.023
Token They
Token ID1119
Feature activation+6.51e-03

Pos logit contributions
ves+0.043
kered+0.040
ggy+0.038
ggie+0.034
 lesson+0.033
Neg logit contributions
 least-0.037
rying-0.036
oming-0.035
ter-0.035
 surfing-0.035
Token!"
Token ID2474
Feature activation-1.26e-02

Pos logit contributions
ves+0.020
kered+0.020
 lesson+0.017
ggy+0.017
ork+0.017
Neg logit contributions
rying-0.021
 least-0.020
 forth-0.020
ter-0.019
ipping-0.019
Token\n
Token ID198
Feature activation-1.02e-02

Pos logit contributions
ves+0.019
kered+0.018
ggy+0.016
 lesson+0.015
ork+0.015
Neg logit contributions
rying-0.027
 least-0.026
 forth-0.025
ipping-0.024
ter-0.024
Token\n
Token ID198
Feature activation+4.86e-03

Pos logit contributions
ves+0.017
kered+0.017
 lesson+0.014
 teeth+0.014
ggy+0.014
Neg logit contributions
rying-0.018
 least-0.018
umper-0.017
ipping-0.017
ter-0.017
TokenM
Token ID44
Feature activation+8.46e-03

Pos logit contributions
rying+0.010
 least+0.009
 forth+0.009
ipping+0.009
ter+0.009
Neg logit contributions
ves-0.008
kered-0.007
ggy-0.007
 lesson-0.007
ork-0.006
Tokenum
Token ID388
Feature activation-2.27e-02

Pos logit contributions
rying+0.015
 least+0.015
ipping+0.014
 forth+0.014
ter+0.014
Neg logit contributions
ves-0.015
kered-0.014
ggy-0.013
 lesson-0.013
ork-0.012
Token and
Token ID290
Feature activation+5.60e-02

Pos logit contributions
ves+0.035
kered+0.032
 lesson+0.029
ggy+0.029
ork+0.028
Neg logit contributions
rying-0.046
 least-0.044
 forth-0.044
ter-0.043
ipping-0.042
Token Dad
Token ID17415
Feature activation-5.15e-03

Pos logit contributions
rying+0.085
 least+0.075
ipping+0.074
 forth+0.072
oming+0.070
Neg logit contributions
ves-0.118
kered-0.115
 lesson-0.105
ggy-0.105
 teeth-0.098
Token hugged
Token ID41130
Feature activation+9.06e-03

Pos logit contributions
ves+0.008
kered+0.007
ggy+0.007
 lesson+0.006
ork+0.006
Neg logit contributions
rying-0.011
 least-0.010
 forth-0.010
ipping-0.010
ter-0.010
Token each
Token ID1123
Feature activation-2.27e-03

Pos logit contributions
rying+0.017
 forth+0.016
ting+0.016
ter+0.016
 least+0.016
Neg logit contributions
ves-0.016
kered-0.015
ggy-0.014
ork-0.013
 lesson-0.013
Token other
Token ID584
Feature activation+1.05e-02

Pos logit contributions
ves+0.004
kered+0.004
ggy+0.004
ork+0.003
 purpose+0.003
Neg logit contributions
rying-0.004
 least-0.004
 forth-0.004
ting-0.004
 highlight-0.003
Token and
Token ID290
Feature activation+1.08e-02

Pos logit contributions
rying+0.023
 least+0.021
 forth+0.021
ipping+0.021
ter+0.020
Neg logit contributions
ves-0.015
kered-0.014
ggy-0.013
 lesson-0.012
ork-0.012
Token said
Token ID531
Feature activation+1.02e-02

Pos logit contributions
 least+0.018
 forth+0.018
rying+0.017
ter+0.017
 gentlemen+0.016
Neg logit contributions
ves-0.021
ork-0.019
kered-0.018
ggy-0.017
 lesson-0.017
Token,
Token ID11
Feature activation+4.18e-02

Pos logit contributions
rying+0.022
 least+0.021
 forth+0.020
ipping+0.020
ter+0.020
Neg logit contributions
ves-0.015
kered-0.014
ggy-0.013
 lesson-0.012
ork-0.012
Token "
Token ID366
Feature activation+2.26e-02

Pos logit contributions
rying+0.105
 least+0.102
 forth+0.096
umper+0.095
ipping+0.094
Neg logit contributions
ves-0.052
kered-0.042
 lesson-0.039
ggy-0.039
 words-0.035
TokenThank
Token ID10449
Feature activation+8.72e-03

Pos logit contributions
rying+0.027
 least+0.026
 forth+0.025
ter+0.024
ipping+0.024
Neg logit contributions
ves-0.054
kered-0.051
ggy-0.048
 lesson-0.047
ork-0.047
Token you
Token ID345
Feature activation-4.02e-03

Pos logit contributions
rying+0.017
 least+0.015
 forth+0.014
 highlight+0.014
 harbour+0.014
Neg logit contributions
ves-0.017
kered-0.015
ggy-0.015
 lesson-0.013
 words-0.013
Token,
Token ID11
Feature activation+5.53e-02

Pos logit contributions
ves+0.005
kered+0.005
ggy+0.004
 lesson+0.004
ork+0.004
Neg logit contributions
rying-0.010
 least-0.009
 forth-0.009
ipping-0.009
umper-0.009
Token mom
Token ID1995
Feature activation-4.29e-03

Pos logit contributions
rying+0.122
ipping+0.110
 least+0.109
 forth+0.109
used+0.106
Neg logit contributions
ves-0.082
kered-0.074
 lesson-0.070
ggy-0.069
 teaches-0.063
Token and
Token ID290
Feature activation+8.14e-02

Pos logit contributions
ves+0.007
kered+0.007
ggy+0.006
 lesson+0.005
ork+0.005
Neg logit contributions
rying-0.009
 least-0.009
 forth-0.008
ter-0.008
umper-0.008
Token dad
Token ID9955
Feature activation-1.16e-02

Pos logit contributions
rying+0.186
ipping+0.167
oming+0.161
ologist+0.160
oise+0.158
Neg logit contributions
 lesson-0.108
ves-0.107
 words-0.097
 teeth-0.095
kered-0.094
Token.
Token ID13
Feature activation+3.09e-04

Pos logit contributions
ves+0.019
kered+0.018
ggy+0.016
ork+0.015
 lesson+0.015
Neg logit contributions
rying-0.023
 least-0.023
ter-0.022
 forth-0.021
ipping-0.021
Token I
Token ID314
Feature activation+4.30e-03

Pos logit contributions
rying+0.000
 least+0.000
 forth+0.000
ipping+0.000
ter+0.000
Neg logit contributions
ves-0.001
kered-0.001
ggy-0.001
 lesson-0.001
ork-0.001
Token the
Token ID262
Feature activation+1.50e-02

Pos logit contributions
ves+0.002
kered+0.002
ggy+0.002
ork+0.002
 lesson+0.002
Neg logit contributions
rying-0.002
 least-0.002
 forth-0.002
ter-0.002
ting-0.002
Token beautiful
Token ID4950
Feature activation+2.15e-02

Pos logit contributions
rying+0.027
 least+0.027
 forth+0.026
ter+0.025
oming+0.025
Neg logit contributions
ves-0.027
kered-0.025
ggy-0.022
ork-0.022
 lesson-0.021
Token sight
Token ID6504
Feature activation-2.33e-03

Pos logit contributions
 forth+0.034
rying+0.033
oming+0.033
ipping+0.032
ting+0.032
Neg logit contributions
ves-0.039
kered-0.039
ggy-0.036
ork-0.033
 lesson-0.032
Token.
Token ID13
Feature activation+5.70e-03

Pos logit contributions
ves+0.003
kered+0.003
ggy+0.003
 lesson+0.002
ork+0.002
Neg logit contributions
rying-0.005
 least-0.005
 forth-0.005
ipping-0.005
ter-0.005
Token Tim
Token ID5045
Feature activation-3.00e-02

Pos logit contributions
rying+0.013
 least+0.012
 forth+0.012
ipping+0.011
ter+0.011
Neg logit contributions
ves-0.008
kered-0.008
ggy-0.007
 lesson-0.007
ork-0.006
Token and
Token ID290
Feature activation+6.14e-02

Pos logit contributions
ves+0.050
kered+0.045
ork+0.042
ggy+0.040
 lesson+0.039
Neg logit contributions
 least-0.062
 forth-0.060
rying-0.058
ter-0.055
umper-0.053
Token his
Token ID465
Feature activation+1.50e-02

Pos logit contributions
rying+0.138
 forth+0.133
 least+0.132
ter+0.127
ipping+0.126
Neg logit contributions
ves-0.085
kered-0.078
ggy-0.069
ork-0.063
 lesson-0.062
Token friends
Token ID2460
Feature activation+1.59e-02

Pos logit contributions
rying+0.021
 least+0.019
 forth+0.019
ipping+0.018
ter+0.018
Neg logit contributions
ves-0.033
kered-0.032
ggy-0.029
 lesson-0.029
ork-0.029
Token were
Token ID547
Feature activation+2.08e-02

Pos logit contributions
rying+0.037
 least+0.034
ipping+0.034
 forth+0.033
umper+0.033
Neg logit contributions
ves-0.021
kered-0.020
ggy-0.017
 lesson-0.017
 was-0.015
Token happy
Token ID3772
Feature activation+2.35e-02

Pos logit contributions
rying+0.041
 least+0.038
 forth+0.038
ipping+0.037
ter+0.036
Neg logit contributions
ves-0.035
kered-0.033
ggy-0.030
 lesson-0.029
ork-0.027
Token to
Token ID284
Feature activation+9.44e-03

Pos logit contributions
rying+0.056
umper+0.054
 forth+0.052
used+0.051
ipping+0.051
Neg logit contributions
ves-0.028
kered-0.026
 lesson-0.026
ggy-0.024
 was-0.023
Token died
Token ID3724
Feature activation+1.08e-03

Pos logit contributions
ves+0.010
kered+0.009
ggy+0.008
aron+0.007
 choice+0.007
Neg logit contributions
 least-0.009
rying-0.008
 highlight-0.007
ter-0.007
 forth-0.007
Token.
Token ID13
Feature activation-3.35e-03

Pos logit contributions
rying+0.002
ter+0.002
 forth+0.002
 least+0.002
ting+0.002
Neg logit contributions
ves-0.002
kered-0.002
ggy-0.002
 teaches-0.002
 lesson-0.002
Token\n
Token ID198
Feature activation-2.47e-03

Pos logit contributions
ves+0.006
kered+0.006
ggy+0.005
 lesson+0.005
ork+0.005
Neg logit contributions
 least-0.006
rying-0.006
umper-0.005
 forth-0.005
ter-0.005
Token\n
Token ID198
Feature activation+1.13e-02

Pos logit contributions
kered+0.005
ves+0.005
 lesson+0.004
 teeth+0.004
ggy+0.004
Neg logit contributions
rying-0.004
 least-0.004
umper-0.003
 gentlemen-0.003
oming-0.003
TokenJohn
Token ID7554
Feature activation-2.66e-02

Pos logit contributions
rying+0.022
 least+0.021
 forth+0.020
ipping+0.020
ter+0.019
Neg logit contributions
ves-0.019
kered-0.018
ggy-0.016
 lesson-0.016
ork-0.015
Token and
Token ID290
Feature activation+5.79e-02

Pos logit contributions
ves+0.046
kered+0.040
 common+0.039
 lesson+0.038
 purpose+0.037
Neg logit contributions
 least-0.054
 forth-0.050
rying-0.050
ter-0.048
ting-0.047
Token Grace
Token ID16156
Feature activation+5.93e-03

Pos logit contributions
rying+0.123
 least+0.120
 forth+0.120
ter+0.114
umper+0.112
Neg logit contributions
ves-0.091
kered-0.083
ggy-0.072
ork-0.068
 lesson-0.067
Token were
Token ID547
Feature activation+1.00e-02

Pos logit contributions
rying+0.011
 least+0.011
 forth+0.011
ter+0.010
ting+0.010
Neg logit contributions
ves-0.010
kered-0.009
 lesson-0.009
ggy-0.008
ork-0.008
Token so
Token ID523
Feature activation+4.10e-03

Pos logit contributions
rying+0.019
 least+0.018
 forth+0.017
ipping+0.017
ter+0.016
Neg logit contributions
ves-0.018
kered-0.017
ggy-0.015
 lesson-0.015
ork-0.014
Token sad
Token ID6507
Feature activation+3.83e-03

Pos logit contributions
rying+0.007
 least+0.007
 forth+0.007
ipping+0.006
ter+0.006
Neg logit contributions
ves-0.008
kered-0.007
ggy-0.007
 lesson-0.006
ork-0.006
Token.
Token ID13
Feature activation-5.32e-03

Pos logit contributions
rying+0.008
 least+0.008
 forth+0.007
ipping+0.007
ter+0.007
Neg logit contributions
ves-0.006
kered-0.006
ggy-0.005
 lesson-0.005
ork-0.005
Token an
Token ID281
Feature activation-2.95e-02

Pos logit contributions
ves+0.011
kered+0.010
ggy+0.009
ork+0.008
 lesson+0.008
Neg logit contributions
 least-0.011
rying-0.011
 forth-0.010
ter-0.010
 highlight-0.010
Token avocado
Token ID40377
Feature activation+7.58e-03

Pos logit contributions
ves+0.046
kered+0.043
ggy+0.039
 lesson+0.038
ork+0.036
Neg logit contributions
rying-0.061
 least-0.058
 forth-0.057
ipping-0.056
ter-0.055
Token,
Token ID11
Feature activation+1.34e-03

Pos logit contributions
rying+0.015
 least+0.014
 forth+0.014
ipping+0.014
ter+0.014
Neg logit contributions
ves-0.012
kered-0.012
ggy-0.010
 lesson-0.010
ork-0.010
Token mum
Token ID25682
Feature activation-3.75e-03

Pos logit contributions
rying+0.002
 least+0.002
 forth+0.002
ter+0.002
ipping+0.002
Neg logit contributions
ves-0.002
kered-0.002
ggy-0.002
 lesson-0.002
ork-0.002
Token?'
Token ID8348
Feature activation-1.28e-02

Pos logit contributions
ves+0.006
kered+0.006
 lesson+0.005
ggy+0.005
ork+0.005
Neg logit contributions
rying-0.008
 least-0.007
 forth-0.007
ter-0.007
ipping-0.007
Token Her
Token ID2332
Feature activation+8.50e-03

Pos logit contributions
ves+0.023
kered+0.021
ggy+0.019
 lesson+0.019
ork+0.018
Neg logit contributions
rying-0.024
 least-0.023
 forth-0.022
ter-0.022
ipping-0.022
Token mum
Token ID25682
Feature activation-1.59e-02

Pos logit contributions
rying+0.010
 least+0.009
 forth+0.009
ipping+0.008
ter+0.008
Neg logit contributions
ves-0.021
kered-0.020
ggy-0.019
 lesson-0.018
ork-0.018
Token said
Token ID531
Feature activation-5.87e-03

Pos logit contributions
ves+0.027
kered+0.025
 lesson+0.023
ggy+0.022
ork+0.022
Neg logit contributions
rying-0.031
 forth-0.029
 least-0.029
ipping-0.028
used-0.028
Token '
Token ID705
Feature activation+1.51e-02

Pos logit contributions
ves+0.010
kered+0.009
 lesson+0.009
ggy+0.009
ork+0.008
Neg logit contributions
rying-0.011
ter-0.010
ipping-0.010
used-0.010
 forth-0.010
TokenOk
Token ID18690
Feature activation-3.73e-03

Pos logit contributions
rying+0.024
 least+0.022
 forth+0.021
ipping+0.021
ter+0.020
Neg logit contributions
ves-0.031
kered-0.030
ggy-0.027
 lesson-0.027
ork-0.026
Token,
Token ID11
Feature activation+1.51e-02

Pos logit contributions
ves+0.007
kered+0.006
ggy+0.006
 lesson+0.005
ork+0.005
Neg logit contributions
rying-0.007
 least-0.007
 forth-0.007
ter-0.006
ipping-0.006
Token\n
Token ID198
Feature activation-9.93e-03

Pos logit contributions
rying+0.005
 least+0.005
 forth+0.005
ipping+0.005
ter+0.005
Neg logit contributions
ves-0.005
kered-0.005
ggy-0.004
 lesson-0.004
ork-0.004
Token\n
Token ID198
Feature activation+1.07e-02

Pos logit contributions
kered+0.020
ves+0.019
 teeth+0.018
 lesson+0.017
ggy+0.017
Neg logit contributions
rying-0.014
 least-0.013
umper-0.013
oming-0.012
ipping-0.012
TokenShe
Token ID3347
Feature activation-9.40e-05

Pos logit contributions
rying+0.020
 least+0.019
 forth+0.018
ipping+0.018
ter+0.018
Neg logit contributions
ves-0.019
kered-0.018
ggy-0.016
 lesson-0.016
ork-0.015
Token followed
Token ID3940
Feature activation-5.48e-03

Pos logit contributions
ves+0.000
kered+0.000
 lesson+0.000
ggy+0.000
ork+0.000
Neg logit contributions
rying-0.000
 forth-0.000
 least-0.000
ter-0.000
ipping-0.000
Token the
Token ID262
Feature activation+9.85e-03

Pos logit contributions
ves+0.009
kered+0.008
ggy+0.007
 lesson+0.007
ork+0.007
Neg logit contributions
rying-0.011
 least-0.011
 forth-0.011
ter-0.010
ipping-0.010
Token carrots
Token ID34397
Feature activation+4.17e-02

Pos logit contributions
rying+0.017
 least+0.017
ipping+0.016
 forth+0.015
umper+0.015
Neg logit contributions
ves-0.018
kered-0.017
 lesson-0.016
ggy-0.016
 words-0.015
Token for
Token ID329
Feature activation-2.11e-02

Pos logit contributions
rying+0.090
 least+0.085
 forth+0.083
ipping+0.083
umper+0.082
Neg logit contributions
ves-0.058
kered-0.054
 lesson-0.046
ggy-0.045
ork-0.042
Token a
Token ID257
Feature activation-1.30e-02

Pos logit contributions
kered+0.036
ves+0.035
ggy+0.030
ork+0.030
aron+0.029
Neg logit contributions
 least-0.037
rying-0.035
 forth-0.034
ting-0.032
 gentlemen-0.032
Token really
Token ID1107
Feature activation-3.68e-02

Pos logit contributions
ves+0.014
kered+0.011
ggy+0.011
 lesson+0.011
ork+0.010
Neg logit contributions
rying-0.034
 least-0.033
ipping-0.032
 forth-0.032
umper-0.032
Token long
Token ID890
Feature activation-1.78e-02

Pos logit contributions
kered+0.050
ves+0.049
 teaches+0.038
ggy+0.038
ork+0.037
Neg logit contributions
rying-0.078
 forth-0.075
 least-0.072
 gentlemen-0.070
ipping-0.068
Token time
Token ID640
Feature activation-6.55e-03

Pos logit contributions
kered+0.017
ves+0.017
ggy+0.012
 teaches+0.011
 lesson+0.011
Neg logit contributions
rying-0.045
 forth-0.045
 least-0.043
ipping-0.042
ode-0.041
Token family
Token ID1641
Feature activation+6.01e-03

Pos logit contributions
rying+0.055
ipping+0.048
ancer+0.048
used+0.048
imb+0.046
Neg logit contributions
 lesson-0.036
ves-0.034
kered-0.032
 time-0.030
ggy-0.029
Token smiled
Token ID13541
Feature activation+1.62e-02

Pos logit contributions
rying+0.014
 least+0.013
 forth+0.013
ipping+0.013
ter+0.013
Neg logit contributions
ves-0.008
kered-0.007
ggy-0.006
 lesson-0.006
ork-0.006
Token and
Token ID290
Feature activation+6.77e-03

Pos logit contributions
rying+0.038
 least+0.036
 forth+0.035
ipping+0.034
umper+0.034
Neg logit contributions
ves-0.021
kered-0.020
ggy-0.017
 lesson-0.017
ork-0.015
Token hugged
Token ID41130
Feature activation+1.22e-03

Pos logit contributions
 least+0.012
rying+0.012
 forth+0.012
ter+0.011
ipping+0.011
Neg logit contributions
ves-0.012
kered-0.011
ork-0.010
ggy-0.010
 lesson-0.010
Token each
Token ID1123
Feature activation-4.92e-03

Pos logit contributions
rying+0.002
 forth+0.002
ting+0.002
 least+0.002
ter+0.002
Neg logit contributions
ves-0.003
kered-0.003
ggy-0.002
ork-0.002
 lesson-0.002
Token other
Token ID584
Feature activation+8.78e-03

Pos logit contributions
ves+0.008
kered+0.008
ggy+0.007
ork+0.007
 teeth+0.006
Neg logit contributions
rying-0.009
 least-0.009
 forth-0.009
ting-0.009
umper-0.008
Token,
Token ID11
Feature activation+2.15e-02

Pos logit contributions
rying+0.018
 least+0.017
 forth+0.017
ipping+0.016
ter+0.016
Neg logit contributions
ves-0.014
kered-0.013
ggy-0.012
 lesson-0.011
ork-0.011
Token and
Token ID290
Feature activation+1.69e-02

Pos logit contributions
rying+0.042
 least+0.041
 forth+0.040
ipping+0.038
ter+0.038
Neg logit contributions
ves-0.036
kered-0.034
ggy-0.031
 lesson-0.029
ork-0.029
Token they
Token ID484
Feature activation+3.54e-02

Pos logit contributions
 forth+0.028
 least+0.028
rying+0.027
ter+0.026
ting+0.025
Neg logit contributions
ves-0.033
kered-0.030
ork-0.029
ggy-0.028
 lesson-0.026
Token shouted
Token ID17293
Feature activation+1.18e-02

Pos logit contributions
rying+0.064
 least+0.058
 forth+0.058
ipping+0.057
ter+0.055
Neg logit contributions
ves-0.065
kered-0.062
ggy-0.056
 lesson-0.055
ork-0.052
Token with
Token ID351
Feature activation-3.31e-03

Pos logit contributions
rying+0.021
 least+0.020
 forth+0.019
ipping+0.019
ter+0.019
Neg logit contributions
ves-0.022
kered-0.020
ggy-0.019
 lesson-0.018
ork-0.017
Token do
Token ID466
Feature activation-2.81e-02

Pos logit contributions
rying+0.001
 least+0.001
 forth+0.001
ipping+0.001
ter+0.001
Neg logit contributions
ves-0.001
kered-0.001
ggy-0.001
 lesson-0.001
ork-0.001
Token it
Token ID340
Feature activation+1.02e-02

Pos logit contributions
kered+0.039
ves+0.038
 lesson+0.030
ork+0.029
ggy+0.029
Neg logit contributions
 least-0.060
rying-0.059
 forth-0.058
ipping-0.056
ter-0.055
Token,
Token ID11
Feature activation+1.28e-02

Pos logit contributions
rying+0.020
 least+0.019
 forth+0.018
ipping+0.018
ter+0.018
Neg logit contributions
ves-0.018
kered-0.017
ggy-0.015
 lesson-0.015
ork-0.014
Token rabbit
Token ID22746
Feature activation+1.02e-02

Pos logit contributions
rying+0.023
 least+0.022
 forth+0.021
ter+0.020
ipping+0.020
Neg logit contributions
ves-0.024
kered-0.023
ggy-0.020
 lesson-0.020
ork-0.019
Token!"
Token ID2474
Feature activation-1.73e-02

Pos logit contributions
rying+0.020
 least+0.019
 forth+0.019
ter+0.018
ipping+0.018
Neg logit contributions
ves-0.017
kered-0.016
 lesson-0.014
ggy-0.014
ork-0.013
Token and
Token ID290
Feature activation+1.89e-02

Pos logit contributions
ves+0.029
kered+0.027
ggy+0.024
 lesson+0.023
ork+0.022
Neg logit contributions
 least-0.033
rying-0.033
 forth-0.031
ipping-0.031
ter-0.031
Token "
Token ID366
Feature activation+2.12e-02

Pos logit contributions
rying+0.032
 forth+0.032
ter+0.030
 least+0.030
ting+0.029
Neg logit contributions
ves-0.036
kered-0.034
ggy-0.031
 lesson-0.031
ork-0.030
TokenDon
Token ID3987
Feature activation-2.84e-03

Pos logit contributions
rying+0.039
 least+0.035
 forth+0.035
ipping+0.034
oming+0.034
Neg logit contributions
ves-0.039
kered-0.037
ggy-0.034
 lesson-0.033
ork-0.031
Token't
Token ID470
Feature activation-9.28e-03

Pos logit contributions
ves+0.004
kered+0.004
ggy+0.004
 lesson+0.004
ork+0.003
Neg logit contributions
rying-0.006
 least-0.006
 forth-0.005
ipping-0.005
ter-0.005
Token give
Token ID1577
Feature activation-2.79e-03

Pos logit contributions
ves+0.015
kered+0.014
ggy+0.013
 lesson+0.013
ork+0.011
Neg logit contributions
rying-0.019
 least-0.018
ipping-0.017
ancer-0.017
oming-0.017
Token up
Token ID510
Feature activation-8.95e-03

Pos logit contributions
ves+0.004
kered+0.003
ggy+0.003
 lesson+0.003
ork+0.003
Neg logit contributions
rying-0.007
 least-0.006
 forth-0.006
ipping-0.006
ter-0.006
Token to
Token ID284
Feature activation+7.20e-03

Pos logit contributions
rying+0.041
 least+0.039
 forth+0.038
ipping+0.037
ter+0.036
Neg logit contributions
ves-0.037
kered-0.035
ggy-0.032
 lesson-0.031
ork-0.030
Token Ben
Token ID3932
Feature activation+1.66e-02

Pos logit contributions
rying+0.014
 least+0.014
 forth+0.013
ter+0.013
ipping+0.013
Neg logit contributions
ves-0.012
kered-0.012
ggy-0.010
 lesson-0.010
ork-0.009
Token.
Token ID13
Feature activation+4.23e-03

Pos logit contributions
rying+0.038
 least+0.036
 forth+0.036
ipping+0.035
ter+0.034
Neg logit contributions
ves-0.023
kered-0.021
ggy-0.018
 lesson-0.018
ork-0.016
Token She
Token ID1375
Feature activation-1.36e-02

Pos logit contributions
rying+0.008
 least+0.008
 forth+0.007
ipping+0.007
ter+0.007
Neg logit contributions
ves-0.007
kered-0.007
ggy-0.006
 lesson-0.006
ork-0.006
Token cried
Token ID16896
Feature activation+1.12e-02

Pos logit contributions
kered+0.029
 lesson+0.027
ves+0.027
 purpose+0.027
ork+0.024
Neg logit contributions
 least-0.019
 forth-0.019
rying-0.019
ter-0.017
ting-0.016
Token.
Token ID13
Feature activation+7.19e-03

Pos logit contributions
rying+0.026
 least+0.025
 forth+0.024
ipping+0.024
ter+0.023
Neg logit contributions
ves-0.015
kered-0.014
ggy-0.012
 lesson-0.012
ork-0.011
Token She
Token ID1375
Feature activation-1.86e-02

Pos logit contributions
rying+0.014
 least+0.014
 forth+0.013
ipping+0.013
ter+0.013
Neg logit contributions
ves-0.012
kered-0.011
ggy-0.010
 lesson-0.010
ork-0.009
Token was
Token ID373
Feature activation+1.92e-02

Pos logit contributions
ves+0.037
 purpose+0.036
kered+0.035
 lesson+0.034
ork+0.032
Neg logit contributions
 least-0.027
 forth-0.025
ter-0.024
ting-0.021
rying-0.021
Token sorry
Token ID7926
Feature activation+1.08e-02

Pos logit contributions
rying+0.036
 least+0.034
 forth+0.033
ipping+0.032
ter+0.031
Neg logit contributions
ves-0.034
kered-0.032
ggy-0.029
 lesson-0.028
ork-0.027
Token.
Token ID13
Feature activation+1.10e-03

Pos logit contributions
rying+0.027
 least+0.025
 forth+0.025
ipping+0.025
umper+0.024
Neg logit contributions
ves-0.013
kered-0.012
 lesson-0.011
ggy-0.010
 teeth-0.009
Token She
Token ID1375
Feature activation-1.09e-02

Pos logit contributions
rying+0.002
 least+0.002
 forth+0.002
ipping+0.002
ter+0.002
Neg logit contributions
ves-0.002
kered-0.002
ggy-0.002
 lesson-0.001
ork-0.001
Token of
Token ID286
Feature activation-7.59e-03

Pos logit contributions
rying+0.041
 least+0.039
 forth+0.039
ipping+0.038
ter+0.038
Neg logit contributions
ves-0.028
kered-0.026
ggy-0.023
 lesson-0.023
ork-0.021
Token a
Token ID257
Feature activation+3.07e-03

Pos logit contributions
ves+0.011
kered+0.011
ggy+0.009
 lesson+0.009
ork+0.009
Neg logit contributions
 least-0.016
rying-0.015
ter-0.015
 forth-0.015
ting-0.014
Token cone
Token ID27763
Feature activation+1.83e-02

Pos logit contributions
rying+0.006
 least+0.006
ipping+0.006
 forth+0.006
umper+0.005
Neg logit contributions
ves-0.005
kered-0.005
ggy-0.004
 lesson-0.004
ork-0.004
Token and
Token ID290
Feature activation-5.83e-03

Pos logit contributions
rying+0.042
 least+0.041
 forth+0.039
umper+0.039
ipping+0.038
Neg logit contributions
ves-0.025
kered-0.023
ggy-0.020
 lesson-0.019
ork-0.018
Token a
Token ID257
Feature activation-5.23e-03

Pos logit contributions
ves+0.011
kered+0.010
ork+0.009
ggy+0.009
gh+0.008
Neg logit contributions
 least-0.010
 gentlemen-0.009
 forth-0.009
ter-0.009
rying-0.009
Token scoop
Token ID23529
Feature activation-3.31e-03

Pos logit contributions
ves+0.009
kered+0.008
ggy+0.007
 lesson+0.007
ork+0.007
Neg logit contributions
rying-0.010
 least-0.010
 forth-0.010
ipping-0.009
ter-0.009
Token of
Token ID286
Feature activation-2.37e-02

Pos logit contributions
ves+0.006
kered+0.005
ggy+0.005
 common+0.005
 lesson+0.005
Neg logit contributions
rying-0.006
ter-0.006
ipping-0.006
ting-0.006
 forth-0.005
Token cream
Token ID8566
Feature activation+2.92e-02

Pos logit contributions
ves+0.045
kered+0.042
ggy+0.038
ork+0.038
 lesson+0.035
Neg logit contributions
 forth-0.040
 least-0.039
rying-0.038
ter-0.036
ting-0.035
Token on
Token ID319
Feature activation-8.57e-03

Pos logit contributions
rying+0.065
umper+0.061
 least+0.061
years+0.060
ipping+0.059
Neg logit contributions
ves-0.042
kered-0.037
ggy-0.034
 lesson-0.034
ork-0.032
Token it
Token ID340
Feature activation+2.04e-02

Pos logit contributions
kered+0.014
ves+0.014
ggy+0.012
 lesson+0.011
 teaches+0.011
Neg logit contributions
 forth-0.015
ting-0.015
ter-0.015
 least-0.015
rying-0.014
Token.
Token ID13
Feature activation-1.94e-02

Pos logit contributions
rying+0.036
 forth+0.035
 least+0.033
ter+0.032
ting+0.031
Neg logit contributions
ves-0.036
kered-0.034
ggy-0.032
 lesson-0.029
ork-0.028
Token a
Token ID257
Feature activation-1.27e-02

Pos logit contributions
ves+0.007
kered+0.007
ggy+0.006
ork+0.006
 lesson+0.006
Neg logit contributions
 least-0.007
rying-0.007
ter-0.007
 forth-0.007
ting-0.006
Token hat
Token ID6877
Feature activation+5.06e-03

Pos logit contributions
ves+0.021
kered+0.020
ggy+0.018
 lesson+0.017
ork+0.016
Neg logit contributions
rying-0.024
 least-0.023
 forth-0.023
ipping-0.022
ter-0.022
Token,
Token ID11
Feature activation-4.04e-03

Pos logit contributions
rying+0.010
 least+0.010
 forth+0.010
ter+0.010
ipping+0.009
Neg logit contributions
ves-0.008
kered-0.008
 lesson-0.007
ggy-0.007
ork-0.006
Token and
Token ID290
Feature activation-4.31e-03

Pos logit contributions
ves+0.007
kered+0.007
ggy+0.007
ork+0.006
 lesson+0.006
Neg logit contributions
 least-0.007
rying-0.007
ter-0.007
 forth-0.007
ting-0.007
Token a
Token ID257
Feature activation-1.51e-02

Pos logit contributions
ves+0.007
kered+0.006
ggy+0.006
ork+0.006
 lesson+0.005
Neg logit contributions
rying-0.009
 least-0.008
 forth-0.008
ter-0.008
 gentlemen-0.008
Token leg
Token ID1232
Feature activation-1.85e-02

Pos logit contributions
ves+0.025
kered+0.024
ggy+0.021
 lesson+0.021
ork+0.020
Neg logit contributions
rying-0.029
 least-0.028
 forth-0.027
ipping-0.026
ter-0.026
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
ves+0.029
kered+0.027
 lesson+0.025
ggy+0.024
ork+0.022
Neg logit contributions
rying-0.037
 least-0.035
 forth-0.035
ter-0.034
ipping-0.034
Token\n
Token ID198
Feature activation-6.10e-03

Pos logit contributions
ves+0.021
kered+0.020
ggy+0.018
 lesson+0.017
ork+0.016
Neg logit contributions
rying-0.032
 least-0.030
 forth-0.030
ipping-0.029
ter-0.029
Token\n
Token ID198
Feature activation+9.96e-03

Pos logit contributions
kered+0.014
ves+0.012
 teeth+0.012
 lesson+0.012
ggy+0.011
Neg logit contributions
�-0.007
oming-0.007
rying-0.006
ory-0.006
 gentlemen-0.006
Token"
Token ID1
Feature activation+2.51e-02

Pos logit contributions
rying+0.020
 least+0.019
 forth+0.018
ipping+0.018
ter+0.017
Neg logit contributions
ves-0.017
kered-0.016
ggy-0.014
 lesson-0.014
ork-0.013
TokenA
Token ID32
Feature activation-7.20e-03

Pos logit contributions
rying+0.048
 least+0.044
ipping+0.043
 forth+0.042
oming+0.041
Neg logit contributions
ves-0.046
kered-0.044
ggy-0.039
 lesson-0.039
 words-0.036
Tokenay
Token ID323
Feature activation+7.52e-03

Pos logit contributions
 least+0.008
rying+0.008
ipping+0.008
 forth+0.007
oming+0.007
Neg logit contributions
ves-0.009
kered-0.008
ggy-0.008
 lesson-0.007
ggie-0.007
Token,
Token ID11
Feature activation+1.25e-02

Pos logit contributions
 least+0.013
rying+0.012
 forth+0.012
ter+0.012
ipping+0.012
Neg logit contributions
ves-0.014
kered-0.013
 lesson-0.012
ggy-0.012
 common-0.011
Token so
Token ID523
Feature activation-2.24e-02

Pos logit contributions
rying+0.022
 least+0.022
 forth+0.021
ter+0.020
ting+0.019
Neg logit contributions
ves-0.024
kered-0.023
ggy-0.021
 lesson-0.019
ork-0.019
Token Jon
Token ID5966
Feature activation-1.03e-02

Pos logit contributions
ves+0.038
kered+0.036
ggy+0.032
ork+0.031
 teaches+0.030
Neg logit contributions
rying-0.042
 forth-0.040
 least-0.039
ter-0.039
used-0.037
Token had
Token ID550
Feature activation+2.21e-04

Pos logit contributions
ves+0.019
kered+0.016
ork+0.015
ggy+0.015
 purpose+0.014
Neg logit contributions
rying-0.019
 least-0.018
ter-0.018
 forth-0.017
ting-0.016
Token to
Token ID284
Feature activation-2.23e-02

Pos logit contributions
 least+0.000
rying+0.000
 forth+0.000
ter+0.000
 highlight+0.000
Neg logit contributions
ves-0.000
kered-0.000
ggy-0.000
ork-0.000
ggie-0.000
Token be
Token ID307
Feature activation+9.44e-03

Pos logit contributions
ves+0.045
kered+0.042
 lesson+0.037
ork+0.036
ggy+0.036
Neg logit contributions
 least-0.037
rying-0.036
 forth-0.035
ter-0.033
ipping-0.031
Token very
Token ID845
Feature activation+5.29e-03

Pos logit contributions
rying+0.016
 forth+0.015
 least+0.015
ipping+0.014
ter+0.014
Neg logit contributions
ves-0.019
kered-0.018
ork-0.016
 lesson-0.016
ggy-0.015
Token careful
Token ID8161
Feature activation-2.34e-02

Pos logit contributions
rying+0.010
 least+0.010
 forth+0.010
ipping+0.009
umper+0.009
Neg logit contributions
ves-0.009
kered-0.008
ggy-0.008
 lesson-0.007
ork-0.007
Token.
Token ID13
Feature activation+3.66e-03

Pos logit contributions
ves+0.038
kered+0.036
ork+0.031
ggy+0.031
 lesson+0.031
Neg logit contributions
rying-0.048
 forth-0.044
 least-0.043
ter-0.042
ipping-0.042
Token\n
Token ID198
Feature activation-1.30e-02

Pos logit contributions
rying+0.008
 least+0.008
 forth+0.008
ipping+0.007
ter+0.007
Neg logit contributions
ves-0.005
kered-0.005
ggy-0.004
 lesson-0.004
ork-0.004
Token to
Token ID284
Feature activation-1.18e-02

Pos logit contributions
rying+0.007
 least+0.007
 forth+0.006
ter+0.006
ipping+0.006
Neg logit contributions
ves-0.006
kered-0.006
ggy-0.005
ork-0.005
 lesson-0.005
Token be
Token ID307
Feature activation+1.16e-02

Pos logit contributions
ves+0.023
kered+0.022
ork+0.019
 lesson+0.019
ggy+0.018
Neg logit contributions
 least-0.020
rying-0.019
 forth-0.019
ipping-0.017
ter-0.017
Token healthy
Token ID5448
Feature activation+2.13e-03

Pos logit contributions
rying+0.022
 least+0.021
 forth+0.020
ipping+0.020
ter+0.020
Neg logit contributions
ves-0.021
kered-0.019
ggy-0.017
 lesson-0.017
ork-0.017
Token too
Token ID1165
Feature activation-8.06e-03

Pos logit contributions
rying+0.004
 least+0.004
 forth+0.004
ter+0.004
ipping+0.004
Neg logit contributions
ves-0.004
kered-0.004
ork-0.003
ggy-0.003
 lesson-0.003
Token!
Token ID0
Feature activation+3.54e-03

Pos logit contributions
ves+0.014
kered+0.013
ggy+0.011
ork+0.011
 lesson+0.011
Neg logit contributions
rying-0.016
 least-0.014
 forth-0.014
used-0.014
ter-0.014
Token Maybe
Token ID6674
Feature activation-6.57e-03

Pos logit contributions
rying+0.005
 least+0.005
 forth+0.005
ter+0.005
ipping+0.005
Neg logit contributions
ves-0.007
kered-0.007
ggy-0.006
 lesson-0.006
ork-0.006
Token we
Token ID356
Feature activation+3.57e-02

Pos logit contributions
ves+0.013
kered+0.012
ggy+0.011
 lesson+0.011
ork+0.011
Neg logit contributions
rying-0.011
 least-0.010
 forth-0.010
ipping-0.009
ter-0.009
Token can
Token ID460
Feature activation+1.55e-03

Pos logit contributions
rying+0.081
ipping+0.077
 forth+0.076
oming+0.073
 least+0.072
Neg logit contributions
ves-0.046
kered-0.044
 lesson-0.040
 teaches-0.037
ggy-0.037
Token exercise
Token ID5517
Feature activation-8.12e-03

Pos logit contributions
rying+0.003
 least+0.003
 forth+0.003
ipping+0.003
ter+0.003
Neg logit contributions
ves-0.003
kered-0.002
ggy-0.002
 lesson-0.002
ork-0.002
Token together
Token ID1978
Feature activation+1.34e-02

Pos logit contributions
ves+0.015
kered+0.015
ggy+0.013
 lesson+0.012
ork+0.012
Neg logit contributions
rying-0.014
 least-0.014
 forth-0.013
ipping-0.013
ter-0.013
Token and
Token ID290
Feature activation+3.88e-03

Pos logit contributions
rying+0.028
 least+0.027
 forth+0.026
ipping+0.025
ter+0.025
Neg logit contributions
ves-0.021
kered-0.020
ggy-0.017
 lesson-0.017
ork-0.016
Token,
Token ID11
Feature activation+6.10e-03

Pos logit contributions
ves+0.025
kered+0.024
ork+0.020
 lesson+0.020
ggy+0.019
Neg logit contributions
rying-0.022
 forth-0.022
ter-0.021
 least-0.021
used-0.020
Token Kitty
Token ID21168
Feature activation-2.04e-02

Pos logit contributions
rying+0.011
 least+0.011
ter+0.011
 forth+0.011
ting+0.010
Neg logit contributions
ves-0.010
kered-0.010
ggy-0.009
ork-0.008
 lesson-0.008
Token!
Token ID0
Feature activation+5.75e-03

Pos logit contributions
ves+0.039
kered+0.037
 lesson+0.033
ork+0.032
ggy+0.030
Neg logit contributions
rying-0.035
ter-0.034
 least-0.033
 forth-0.033
ting-0.031
Token Here
Token ID3423
Feature activation-5.94e-03

Pos logit contributions
rying+0.010
 forth+0.009
 least+0.009
ipping+0.009
umper+0.009
Neg logit contributions
ves-0.011
kered-0.010
ggy-0.009
 lesson-0.009
ork-0.009
Token is
Token ID318
Feature activation+3.86e-04

Pos logit contributions
kered+0.010
ves+0.010
 lesson+0.009
ork+0.008
ggy+0.008
Neg logit contributions
 least-0.011
 forth-0.010
rying-0.010
ter-0.010
ipping-0.010
Token some
Token ID617
Feature activation-1.53e-02

Pos logit contributions
rying+0.001
 least+0.001
 forth+0.001
ter+0.001
ipping+0.001
Neg logit contributions
ves-0.001
kered-0.001
ggy-0.000
 lesson-0.000
ork-0.000
Token milk
Token ID7545
Feature activation+1.58e-02

Pos logit contributions
ves+0.024
kered+0.022
ggy+0.020
ork+0.019
ggie+0.018
Neg logit contributions
rying-0.030
 least-0.030
 forth-0.029
ter-0.029
ipping-0.029
Token for
Token ID329
Feature activation-1.96e-02

Pos logit contributions
rying+0.034
 least+0.032
 forth+0.032
ipping+0.031
ter+0.030
Neg logit contributions
ves-0.024
kered-0.022
ggy-0.020
 lesson-0.019
ork-0.018
Token you
Token ID345
Feature activation+9.23e-03

Pos logit contributions
ves+0.040
kered+0.038
ggy+0.035
 lesson+0.035
ork+0.033
Neg logit contributions
rying-0.031
 least-0.029
 forth-0.028
ipping-0.027
ter-0.027
Token."
Token ID526
Feature activation-1.69e-02

Pos logit contributions
rying+0.021
 least+0.020
 forth+0.019
ter+0.018
ipping+0.018
Neg logit contributions
ves-0.013
kered-0.013
ggy-0.011
 lesson-0.010
ork-0.010
Token Kitty
Token ID21168
Feature activation-3.71e-02

Pos logit contributions
ves+0.031
kered+0.029
ggy+0.026
 lesson+0.026
ork+0.025
Neg logit contributions
rying-0.028
 least-0.027
 forth-0.027
ter-0.027
ting-0.026
Token and
Token ID290
Feature activation-8.47e-04

Pos logit contributions
ves+0.033
kered+0.033
ggy+0.030
ork+0.029
 lesson+0.028
Neg logit contributions
rying-0.023
 forth-0.022
 least-0.021
ter-0.020
ting-0.019
Token disagreed
Token ID27884
Feature activation-2.98e-02

Pos logit contributions
ves+0.002
kered+0.002
ork+0.001
ggy+0.001
 lesson+0.001
Neg logit contributions
 forth-0.001
rying-0.001
 least-0.001
ter-0.001
ting-0.001
Token on
Token ID319
Feature activation-2.67e-02

Pos logit contributions
ves+0.050
kered+0.048
ork+0.044
ggy+0.042
 lesson+0.041
Neg logit contributions
rying-0.052
ter-0.048
 forth-0.048
ipping-0.047
 least-0.046
Token who
Token ID508
Feature activation-2.10e-03

Pos logit contributions
kered+0.054
ves+0.053
ork+0.046
ggy+0.046
 lesson+0.045
Neg logit contributions
rying-0.039
 forth-0.038
 least-0.038
ting-0.036
ter-0.036
Token should
Token ID815
Feature activation-1.70e-02

Pos logit contributions
ves+0.003
kered+0.003
ork+0.003
 lesson+0.003
ggy+0.003
Neg logit contributions
rying-0.004
 forth-0.004
 least-0.004
ter-0.004
ting-0.003
Token do
Token ID466
Feature activation-5.29e-02

Pos logit contributions
kered+0.029
ves+0.029
ork+0.024
 lesson+0.024
 teeth+0.023
Neg logit contributions
rying-0.029
 least-0.029
 forth-0.029
ter-0.027
ipping-0.026
Token certain
Token ID1728
Feature activation-3.79e-02

Pos logit contributions
kered+0.095
ves+0.086
ork+0.076
aron+0.074
 lesson+0.071
Neg logit contributions
rying-0.089
 least-0.089
 forth-0.087
ipping-0.080
ter-0.080
Token things
Token ID1243
Feature activation-2.53e-02

Pos logit contributions
kered+0.065
ves+0.063
ork+0.054
ggy+0.053
 lesson+0.050
Neg logit contributions
rying-0.068
 forth-0.067
 least-0.066
ipping-0.061
ter-0.061
Token.
Token ID13
Feature activation-3.79e-03

Pos logit contributions
ves+0.038
kered+0.036
ggy+0.032
 lesson+0.031
ork+0.030
Neg logit contributions
rying-0.054
 least-0.051
 forth-0.050
ipping-0.048
ter-0.048
Token One
Token ID1881
Feature activation-2.90e-02

Pos logit contributions
ves+0.006
kered+0.005
ggy+0.005
 lesson+0.005
ork+0.004
Neg logit contributions
rying-0.008
 least-0.008
 forth-0.007
ipping-0.007
ter-0.007
Token day
Token ID1110
Feature activation-4.31e-03

Pos logit contributions
ves+0.058
kered+0.055
ork+0.047
ggy+0.047
 teeth+0.045
Neg logit contributions
rying-0.046
 forth-0.044
 least-0.044
ter-0.042
 highlight-0.041
Token mom
Token ID1995
Feature activation-1.02e-02

Pos logit contributions
ves+0.041
kered+0.041
ggy+0.036
 lesson+0.036
ork+0.035
Neg logit contributions
 least-0.026
rying-0.026
 forth-0.025
 gentlemen-0.025
ter-0.024
Token asked
Token ID1965
Feature activation-4.34e-03

Pos logit contributions
ves+0.014
kered+0.013
ggy+0.012
 lesson+0.011
ork+0.010
Neg logit contributions
rying-0.023
 least-0.022
 forth-0.021
ipping-0.021
ter-0.021
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
ves+0.006
kered+0.006
ggy+0.005
 lesson+0.005
ork+0.005
Neg logit contributions
rying-0.010
 least-0.009
 forth-0.009
ipping-0.009
ter-0.009
Token "
Token ID366
Feature activation+1.90e-02

Pos logit contributions
ves+0.022
kered+0.022
ggy+0.019
 lesson+0.019
ggie+0.018
Neg logit contributions
rying-0.016
 least-0.015
ter-0.015
ting-0.015
�-0.015
TokenHow
Token ID2437
Feature activation-1.45e-02

Pos logit contributions
rying+0.029
 least+0.028
 forth+0.027
ipping+0.026
oming+0.025
Neg logit contributions
ves-0.040
kered-0.038
ggy-0.035
 lesson-0.033
ork-0.033
Token did
Token ID750
Feature activation-4.53e-02

Pos logit contributions
kered+0.026
ves+0.024
aron+0.022
 purpose+0.021
ggy+0.021
Neg logit contributions
ancer-0.022
 surfing-0.022
 least-0.022
rying-0.021
 gentlemen-0.019
Token you
Token ID345
Feature activation-5.54e-03

Pos logit contributions
kered+0.102
ves+0.097
 lesson+0.089
ork+0.088
aron+0.088
Neg logit contributions
 least-0.052
 forth-0.051
ter-0.049
 gentlemen-0.048
rying-0.048
Token get
Token ID651
Feature activation-2.07e-04

Pos logit contributions
ves+0.010
kered+0.009
ggy+0.008
 lesson+0.008
ork+0.008
Neg logit contributions
rying-0.010
 least-0.010
 forth-0.010
ter-0.009
ipping-0.009
Token a
Token ID257
Feature activation-1.36e-02

Pos logit contributions
ves+0.000
ggy+0.000
kered+0.000
 lesson+0.000
 teeth+0.000
Neg logit contributions
rying-0.000
ipping-0.000
ancer-0.000
 highlight-0.000
ologist-0.000
Token dinosaur
Token ID28257
Feature activation+6.82e-03

Pos logit contributions
ves+0.023
kered+0.022
ggy+0.019
 lesson+0.018
ork+0.018
Neg logit contributions
rying-0.026
 least-0.025
 forth-0.024
ipping-0.023
ter-0.023
Token in
Token ID287
Feature activation-1.45e-02

Pos logit contributions
rying+0.012
 least+0.011
 forth+0.011
ipping+0.011
ter+0.010
Neg logit contributions
ves-0.013
kered-0.012
ggy-0.011
 lesson-0.011
ork-0.010
Token.
Token ID13
Feature activation+6.32e-03

Pos logit contributions
ves+0.005
kered+0.005
ggy+0.004
 lesson+0.004
ork+0.004
Neg logit contributions
rying-0.007
 least-0.006
ter-0.006
 forth-0.006
ting-0.006
Token His
Token ID2399
Feature activation-8.02e-03

Pos logit contributions
rying+0.014
 least+0.013
 forth+0.013
ipping+0.013
ter+0.013
Neg logit contributions
ves-0.009
kered-0.008
ggy-0.007
 lesson-0.007
ork-0.007
Token favorite
Token ID4004
Feature activation-1.58e-02

Pos logit contributions
ves+0.017
kered+0.016
ggy+0.014
ork+0.014
 teaches+0.013
Neg logit contributions
 least-0.012
rying-0.012
 forth-0.012
ter-0.011
 highlight-0.011
Token thing
Token ID1517
Feature activation-1.21e-02

Pos logit contributions
ves+0.031
kered+0.029
ork+0.024
ggy+0.023
 teaches+0.022
Neg logit contributions
 least-0.025
 forth-0.025
ter-0.024
rying-0.024
 highlight-0.023
Token to
Token ID284
Feature activation-2.18e-03

Pos logit contributions
ves+0.015
kered+0.014
 common+0.011
ork+0.011
ggy+0.011
Neg logit contributions
 forth-0.027
 least-0.027
rying-0.027
ter-0.026
ting-0.024
Token do
Token ID466
Feature activation-4.16e-02

Pos logit contributions
ves+0.004
kered+0.004
ork+0.004
 lesson+0.004
ggy+0.004
Neg logit contributions
 least-0.003
rying-0.003
 forth-0.003
ter-0.003
ancer-0.003
Token when
Token ID618
Feature activation-1.26e-02

Pos logit contributions
kered+0.059
ves+0.054
 common+0.046
 lesson+0.044
ork+0.043
Neg logit contributions
rying-0.083
 least-0.083
ipping-0.079
 surfing-0.076
 exploring-0.075
Token he
Token ID339
Feature activation+1.58e-02

Pos logit contributions
ves+0.025
kered+0.024
ggy+0.021
ork+0.020
 lesson+0.020
Neg logit contributions
rying-0.021
 least-0.020
 forth-0.020
ipping-0.019
ter-0.018
Token was
Token ID373
Feature activation+2.93e-02

Pos logit contributions
 least+0.031
rying+0.030
 forth+0.029
ory+0.027
umper+0.027
Neg logit contributions
ves-0.028
kered-0.025
 lesson-0.022
ggy-0.022
 purpose-0.021
Token feeling
Token ID4203
Feature activation+1.22e-02

Pos logit contributions
rying+0.043
 least+0.040
 forth+0.039
ipping+0.037
ter+0.036
Neg logit contributions
ves-0.065
kered-0.062
ggy-0.056
 lesson-0.056
ork-0.054
Token this
Token ID428
Feature activation-1.78e-04

Pos logit contributions
 least+0.016
rying+0.016
 forth+0.014
oming+0.014
ipping+0.014
Neg logit contributions
ves-0.029
kered-0.027
ggy-0.025
ork-0.024
 lesson-0.023
Token sad
Token ID6507
Feature activation+1.52e-02

Pos logit contributions
rying+0.034
 least+0.033
 forth+0.031
ter+0.030
oming+0.029
Neg logit contributions
ves-0.046
kered-0.043
 lesson-0.039
ork-0.039
ggy-0.037
Token but
Token ID475
Feature activation-3.32e-03

Pos logit contributions
rying+0.029
 least+0.028
 forth+0.027
ipping+0.026
ter+0.026
Neg logit contributions
ves-0.027
kered-0.025
ggy-0.023
 lesson-0.022
ork-0.021
Token was
Token ID373
Feature activation+1.71e-02

Pos logit contributions
ves+0.006
kered+0.006
ggy+0.005
 lesson+0.005
ork+0.005
Neg logit contributions
rying-0.006
 least-0.006
 forth-0.005
ipping-0.005
ter-0.005
Token filled
Token ID5901
Feature activation-2.85e-02

Pos logit contributions
rying+0.031
 least+0.029
 forth+0.028
ter+0.027
ipping+0.027
Neg logit contributions
ves-0.032
kered-0.030
ggy-0.027
 lesson-0.026
ork-0.026
Token with
Token ID351
Feature activation-8.83e-03

Pos logit contributions
ves+0.056
kered+0.054
 teeth+0.048
 lesson+0.045
 happens+0.045
Neg logit contributions
rying-0.048
 forth-0.046
ter-0.044
 least-0.044
ipping-0.043
Token an
Token ID281
Feature activation-3.86e-02

Pos logit contributions
ves+0.018
kered+0.015
ggy+0.015
ork+0.015
ggie+0.013
Neg logit contributions
 least-0.016
 forth-0.014
rying-0.014
 highlight-0.013
ter-0.013
Token incredible
Token ID8082
Feature activation-2.13e-02

Pos logit contributions
ves+0.067
kered+0.062
ggy+0.056
 lesson+0.053
ork+0.053
Neg logit contributions
rying-0.075
 least-0.071
 forth-0.069
ipping-0.067
ter-0.066
Token sense
Token ID2565
Feature activation-2.24e-03

Pos logit contributions
ves+0.036
kered+0.034
ggy+0.030
 lesson+0.029
ork+0.028
Neg logit contributions
rying-0.041
 least-0.039
 forth-0.039
ipping-0.038
ter-0.037
Token of
Token ID286
Feature activation-6.95e-03

Pos logit contributions
ves+0.003
kered+0.003
ggy+0.003
 lesson+0.003
ork+0.003
Neg logit contributions
rying-0.005
 least-0.005
 forth-0.004
ipping-0.004
ter-0.004
Token content
Token ID2695
Feature activation-1.61e-02

Pos logit contributions
ves+0.013
kered+0.012
ggy+0.011
ork+0.011
 lesson+0.010
Neg logit contributions
rying-0.012
 least-0.012
 forth-0.012
ipping-0.011
ter-0.011
Tokenment
Token ID434
Feature activation-1.09e-02

Pos logit contributions
ves+0.027
kered+0.024
ggy+0.023
 lesson+0.021
 teaches+0.021
Neg logit contributions
rying-0.030
ipping-0.028
 forth-0.028
ter-0.028
 least-0.027
Token of
Token ID286
Feature activation-3.64e-02

Pos logit contributions
ves+0.058
kered+0.053
ggy+0.048
ork+0.048
 lesson+0.045
Neg logit contributions
rying-0.064
ipping-0.062
 least-0.060
oming-0.059
 forth-0.059
Token fun
Token ID1257
Feature activation-2.44e-02

Pos logit contributions
ves+0.056
kered+0.054
ork+0.048
ggy+0.045
 lesson+0.041
Neg logit contributions
rying-0.072
 least-0.071
 forth-0.071
ipping-0.068
 surfing-0.065
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
ves+0.037
kered+0.035
ggy+0.031
 lesson+0.030
ork+0.029
Neg logit contributions
rying-0.052
 least-0.049
 forth-0.049
ipping-0.048
ter-0.046
Token<|endoftext|>
Token ID50256
Feature activation+3.65e-02

Pos logit contributions
rying+0.024
 least+0.023
 forth+0.023
ipping+0.022
ter+0.022
Neg logit contributions
ves-0.019
kered-0.018
ggy-0.016
 lesson-0.016
ork-0.015
TokenOnce
Token ID7454
Feature activation+3.72e-02

Pos logit contributions
rying+0.047
 least+0.044
 forth+0.041
ipping+0.041
ter+0.040
Neg logit contributions
ves-0.085
kered-0.081
ggy-0.075
 lesson-0.074
ork-0.071
Token upon
Token ID2402
Feature activation-4.05e-02

Pos logit contributions
rying+0.034
 least+0.030
 forth+0.028
ipping+0.026
ter+0.025
Neg logit contributions
ves-0.102
kered-0.098
ggy-0.092
 lesson-0.091
ork-0.088
Token a
Token ID257
Feature activation-2.26e-02

Pos logit contributions
ves+0.078
kered+0.074
ork+0.066
 teaches+0.064
 lesson+0.063
Neg logit contributions
 least-0.058
rying-0.058
ter-0.056
 surfing-0.055
ancer-0.053
Token time
Token ID640
Feature activation-1.34e-02

Pos logit contributions
kered+0.022
ves+0.021
 lesson+0.013
ggy+0.012
ork+0.012
Neg logit contributions
rying-0.058
 least-0.057
 forth-0.056
ipping-0.055
 highlight-0.053
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
ves+0.018
kered+0.016
ggy+0.015
 lesson+0.014
ork+0.013
Neg logit contributions
rying-0.031
 least-0.029
 forth-0.029
ipping-0.028
ter-0.028
Token there
Token ID612
Feature activation-1.23e-02

Pos logit contributions
rying+0.018
 least+0.017
 forth+0.016
ipping+0.015
ter+0.015
Neg logit contributions
ves-0.030
kered-0.029
ggy-0.027
 lesson-0.026
ork-0.026
Token was
Token ID373
Feature activation+3.90e-03

Pos logit contributions
ves+0.019
kered+0.018
aron+0.016
 lesson+0.015
ork+0.015
Neg logit contributions
 least-0.022
ter-0.022
 forth-0.021
mallow-0.021
ipping-0.020