TOP ACTIVATIONS
MAX = 0.105

back on the zip - line , z ipping all around
crown and playing in the freshly cut grass . It looked
, even to the t ini est of creatures . <|endoftext|>
robe back onto the clothes line . He used all his
the edge of the shore line and just stood there .
as he explored the shore line . One day
keep going beyond the shore line and swim out further in
the edge of the shore line and just stood there .
I fell off the clothes line and I can 't get
far away from the nos y villagers . The elder noted
mom and to the ost ric hes . They learned their
Lily hiding under the t arp . He knew they were
jumped onto the zip - line and z ipped around the
off or play in the sandy ground ! But before he
to the biggest , sh ini est fish in the pond
walked away from the boss y girl . Tim my was
books back on the books he lf , smiling proudly .
h anger from the clothes line . He set the h
when it hugged the shore line again , it noticed a
share stories about the nos y chicken . Everyone loved the
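
Each example above appears to be a fixed window around the highest-activating token: in the first four examples the peak token ("line", "freshly", "ini", "line") has five tokens on each side. The sketch below shows one way such windows could be built. The function name and the window sizes are assumptions inferred from this listing, not the dashboard's actual code; the sample activations are copied from the per-token records for the first example.

```python
def top_activation_windows(tokens, activations, k=3, before=5, after=5):
    """Return (activation, context) pairs for the k highest-activating tokens.

    Hypothetical helper: the 5-before / 5-after window is inferred from the
    listing above, not taken from the tool that produced it.
    """
    # Rank token positions by activation, highest first.
    ranked = sorted(range(len(tokens)), key=lambda i: activations[i], reverse=True)[:k]
    windows = []
    for i in ranked:
        start = max(0, i - before)             # clip at the start of the sequence
        end = min(len(tokens), i + after + 1)  # clip at the end of the sequence
        windows.append((activations[i], " ".join(tokens[start:end])))
    return windows


# First TOP ACTIVATIONS example, with the activations listed in its token records.
tokens = "back on the zip - line , z ipping all around".split()
acts = [-1.35e-02, 2.97e-03, 9.73e-03, 3.40e-02, 2.08e-02, 1.05e-01,
        8.14e-03, 9.07e-03, 4.66e-03, 3.76e-02, 3.58e-02]

for activation, context in top_activation_windows(tokens, acts, k=1):
    print(f"{activation:+.3f}  {context}")
```

Sorting ascending instead (reverse=False) would give the corresponding windows for a MIN ACTIVATIONS list.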

MIN ACTIVATIONS
MIN = -0.106

- eight , twenty - nine , thirty , thirty -
called Daisy , who was nine years old . She lived
mouse named Mickey . " Mic key , help ! The
counter . Mic key wanted some popcorn ,
says , " They are av oc ados . My mom
. He liked to eat math . Every night he went
home safely . Mic key learned that it 's
. She loved to eat av oc ados . One day
the famous chef was making las agna . That is a
said , " I am Doctor Lily . I will listen
. She loved to eat av oc ados because they were
B ongo loved to eat av oc ados . One day
vanished ! Mic key was scared , but
to Mickey . Mic key was very happy and
. " Is math hard ?" asked Jenny .
the price - it was $ 5 . Sally was so
She puts on her p aj amas . She hugs her
stay at home and do math . One day , a
asked him why he chose math . Jack smiled and said
Everyday he would take the av oc ados and scatter them

INTERVAL 0.052 to 0.105
CONTAINS 0.253%

! Go to your room now !" L ily
They learned a hard lesson that day . <|endoftext|> Sam and
excited and happy . The gloomy mood was gone and he
thirsty and went to the water fountain to drink . They
The mom was the boss y one . She told them

INTERVAL -0.001 to 0.052
CONTAINS 48.750%

, but her mom said , " Don 't worry ,
waved good - bye and thanked her for sharing the treats
should be proud of them ." Max thought about
the paper , and no matter what they did , they
the lovely day . The prince smiled , waved goodbye and

INTERVAL -0.053 to -0.001
CONTAINS 50.857%

High ness . I will hide it in a secret place
" No , it 's mine !" Sam says . He
, Sab rina and her mom went to the park .
a time , there was a little bird named Tweet .
her that she will be okay . L ily

INTERVAL -0.106 to -0.053
CONTAINS 0.140%

and use his knowledge to solve difficult problems .
were playing cow boys and robbers in the backyard . Lily
" How do you make things vanish ?" The magician smiled
. Let 's put the pizz as in the oven and
Tim my loved to bake cakes with his mom my .
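
The four INTERVAL blocks above have boundaries consistent with four equal-width bins between MIN = -0.106 and MAX = 0.105 (width about 0.053), and their CONTAINS percentages sum to 100.000%. The sketch below shows how such a breakdown could be computed from a flat list of activations. The equal-width binning is an inference from the numbers in this listing, and the function name is hypothetical.

```python
def interval_percentages(activations, n_bins=4):
    """Split [min, max] into n_bins equal-width intervals and report the
    percentage of activations falling in each one.

    Assumption: equal-width bins, as the interval bounds in the listing suggest.
    """
    lo, hi = min(activations), max(activations)
    if hi == lo:
        # Degenerate case: every activation is identical.
        return [(lo, hi, 100.0)]
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for a in activations:
        # The maximum value would land in bin n_bins, so clamp it into the last bin.
        idx = min(int((a - lo) / width), n_bins - 1)
        counts[idx] += 1
    total = len(activations)
    return [
        (lo + i * width, lo + (i + 1) * width, 100.0 * c / total)
        for i, c in enumerate(counts)
    ]


if __name__ == "__main__":
    # Toy data only; the real activation distribution is not part of this listing.
    for low, high, pct in interval_percentages([0.0, 1.0, 2.0, 3.0, 4.0]):
        print(f"INTERVAL {low:.3f} to {high:.3f} CONTAINS {pct:.3f}%")
```

With most activations near zero, as here, the two inner bins hold almost everything and the outer bins hold the rare strong activations, which matches the 0.253% and 0.140% figures reported for the outermost intervals.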
Token back
Token ID736
Feature activation-1.35e-02

Pos logit contributions
keeping+0.000
con+0.000
ls+0.000
 olds+0.000
 av+0.000
Neg logit contributions
 wish-0.000
 wake-0.000
 stop-0.000
upid-0.000
ee-0.000
Token on
Token ID319
Feature activation+2.97e-03

Pos logit contributions
keeping+0.025
con+0.024
 olds+0.023
ls+0.023
led+0.023
Neg logit contributions
 wish-0.016
 wake-0.016
 stop-0.016
ee-0.016
 watch-0.015
Token the
Token ID262
Feature activation+9.73e-03

Pos logit contributions
 wish+0.003
 wake+0.003
 stop+0.003
 end+0.003
 watch+0.003
Neg logit contributions
keeping-0.006
con-0.006
led-0.006
ls-0.006
 olds-0.006
Token zip
Token ID19974
Feature activation+3.40e-02

Pos logit contributions
 wake+0.012
 wish+0.011
 stop+0.011
 end+0.010
 watch+0.010
Neg logit contributions
keeping-0.019
con-0.019
ls-0.018
led-0.018
 olds-0.018
Token-
Token ID12
Feature activation+2.08e-02

Pos logit contributions
 wish+0.043
 wake+0.042
 stop+0.041
ee+0.040
 end+0.040
Neg logit contributions
keeping-0.062
con-0.061
ls-0.059
 olds-0.058
 av-0.056
Tokenline
Token ID1370
Feature activation+1.05e-01

Pos logit contributions
ee+0.026
 wish+0.026
 wake+0.026
 bet+0.025
 stop+0.025
Neg logit contributions
keeping-0.037
con-0.036
 olds-0.035
ls-0.035
bes-0.034
Token,
Token ID11
Feature activation+8.14e-03

Pos logit contributions
 wish+0.134
 wake+0.132
 stop+0.130
 end+0.125
upid+0.123
Neg logit contributions
keeping-0.189
con-0.188
ls-0.179
 olds-0.175
led-0.170
Token z
Token ID1976
Feature activation+9.07e-03

Pos logit contributions
 wish+0.010
 wake+0.010
upid+0.009
ee+0.009
 stop+0.009
Neg logit contributions
keeping-0.015
con-0.015
ls-0.014
 olds-0.014
 av-0.014
Tokenipping
Token ID4501
Feature activation+4.66e-03

Pos logit contributions
 wish+0.013
 wake+0.013
ee+0.013
 stop+0.013
upid+0.013
Neg logit contributions
keeping-0.015
con-0.015
 olds-0.014
ls-0.014
 av-0.013
Token all
Token ID477
Feature activation+3.76e-02

Pos logit contributions
 wish+0.005
 wake+0.004
 stop+0.004
ee+0.004
 sure+0.004
Neg logit contributions
keeping-0.010
con-0.010
 olds-0.009
ls-0.009
bes-0.009
Token around
Token ID1088
Feature activation+3.58e-02

Pos logit contributions
 wish+0.048
 wake+0.046
 stop+0.043
upid+0.043
 bet+0.042
Neg logit contributions
keeping-0.071
con-0.069
ls-0.066
 olds-0.065
led-0.064
Token crown
Token ID12389
Feature activation-1.44e-02

Pos logit contributions
 wish+0.002
 wake+0.002
 stop+0.002
 end+0.002
upid+0.002
Neg logit contributions
keeping-0.004
con-0.004
ls-0.004
 olds-0.004
led-0.004
Token and
Token ID290
Feature activation+1.67e-02

Pos logit contributions
keeping+0.028
con+0.027
ls+0.026
 olds+0.026
 av+0.025
Neg logit contributions
 wish-0.017
 wake-0.016
 stop-0.016
upid-0.016
ee-0.015
Token playing
Token ID2712
Feature activation+1.13e-02

Pos logit contributions
 wish+0.019
 happy+0.018
 sure+0.017
 watch+0.017
 wake+0.017
Neg logit contributions
keeping-0.034
con-0.033
ls-0.033
 olds-0.032
ados-0.031
Token in
Token ID287
Feature activation+4.02e-03

Pos logit contributions
 wish+0.019
upid+0.018
 wake+0.018
ee+0.018
 bet+0.018
Neg logit contributions
keeping-0.016
con-0.016
ls-0.015
 av-0.015
 olds-0.014
Token the
Token ID262
Feature activation+1.52e-02

Pos logit contributions
 wish+0.006
 wake+0.005
upid+0.005
ee+0.005
 stop+0.005
Neg logit contributions
keeping-0.007
con-0.007
ls-0.006
 av-0.006
 olds-0.006
Token freshly
Token ID29026
Feature activation+1.03e-01

Pos logit contributions
 wish+0.021
 wake+0.020
upid+0.019
 stop+0.019
ee+0.019
Neg logit contributions
keeping-0.027
con-0.027
ls-0.025
 olds-0.025
 av-0.024
Token cut
Token ID2005
Feature activation+4.16e-03

Pos logit contributions
upid+0.164
 wish+0.161
 bet+0.151
 understand+0.147
 stop+0.147
Neg logit contributions
con-0.160
keeping-0.158
led-0.150
bes-0.147
 av-0.143
Token grass
Token ID8701
Feature activation+2.00e-02

Pos logit contributions
 wish+0.006
 wake+0.006
 stop+0.006
 end+0.006
ee+0.006
Neg logit contributions
keeping-0.007
con-0.006
 olds-0.006
ls-0.006
 av-0.006
Token.
Token ID13
Feature activation-3.10e-03

Pos logit contributions
 wake+0.023
 wish+0.023
 stop+0.023
 end+0.022
ee+0.022
Neg logit contributions
keeping-0.038
con-0.037
 olds-0.036
ls-0.035
bes-0.035
Token It
Token ID632
Feature activation-6.61e-03

Pos logit contributions
keeping+0.005
con+0.005
ls+0.004
 olds+0.004
ados+0.004
Neg logit contributions
 wish-0.005
 wake-0.005
 stop-0.005
ee-0.005
upid-0.005
Token looked
Token ID3114
Feature activation-1.14e-02

Pos logit contributions
keeping+0.012
con+0.012
ls+0.011
 olds+0.011
 av+0.011
Neg logit contributions
 wish-0.009
 wake-0.008
 stop-0.008
ee-0.008
 bet-0.008
Token,
Token ID11
Feature activation+2.04e-02

Pos logit contributions
con+0.007
keeping+0.007
 av+0.006
ls+0.006
bes+0.006
Neg logit contributions
 wish-0.004
 wake-0.004
 stop-0.004
upid-0.004
 end-0.004
Token even
Token ID772
Feature activation+3.01e-02

Pos logit contributions
 wish+0.021
 wake+0.020
 stop+0.019
upid+0.019
 bet+0.019
Neg logit contributions
keeping-0.042
con-0.042
ls-0.040
 olds-0.039
led-0.038
Token to
Token ID284
Feature activation+1.78e-02

Pos logit contributions
 wish+0.029
 wake+0.027
 sure+0.025
 stop+0.025
 happy+0.024
Neg logit contributions
keeping-0.069
con-0.067
bes-0.064
 olds-0.063
ls-0.063
Token the
Token ID262
Feature activation+1.83e-02

Pos logit contributions
ee+0.015
 wish+0.015
upid+0.014
 wake+0.014
 bet+0.014
Neg logit contributions
keeping-0.039
con-0.037
 olds-0.035
 av-0.035
ls-0.035
Token t
Token ID256
Feature activation+3.97e-02

Pos logit contributions
 wish+0.018
 wake+0.018
upid+0.017
 stop+0.017
ee+0.017
Neg logit contributions
keeping-0.038
con-0.038
ls-0.036
 olds-0.036
 av-0.035
Tokenini
Token ID5362
Feature activation+1.03e-01

Pos logit contributions
 wish+0.066
 wake+0.065
 stop+0.063
upid+0.061
 bang+0.061
Neg logit contributions
keeping-0.059
con-0.057
ls-0.055
 olds-0.052
led-0.051
Tokenest
Token ID395
Feature activation+5.67e-02

Pos logit contributions
 wish+0.137
 wake+0.130
upid+0.125
 bet+0.123
 stop+0.122
Neg logit contributions
keeping-0.182
con-0.180
ls-0.168
led-0.165
 olds-0.165
Token of
Token ID286
Feature activation+3.16e-02

Pos logit contributions
 wish+0.076
 wake+0.071
 end+0.068
 stop+0.068
 plan+0.066
Neg logit contributions
keeping-0.106
con-0.106
ls-0.098
bes-0.098
 olds-0.097
Token creatures
Token ID8109
Feature activation+3.78e-02

Pos logit contributions
 wish+0.036
 wake+0.034
ee+0.033
upid+0.033
 stop+0.032
Neg logit contributions
keeping-0.062
con-0.061
ls-0.058
 olds-0.058
 av-0.057
Token.
Token ID13
Feature activation-1.96e-03

Pos logit contributions
 wish+0.047
 wake+0.045
 stop+0.043
 bet+0.042
upid+0.042
Neg logit contributions
keeping-0.072
con-0.072
ls-0.067
 olds-0.066
ados-0.065
Token<|endoftext|>
Token ID50256
Feature activation-1.06e-02

Pos logit contributions
keeping+0.004
con+0.004
ls+0.004
 olds+0.004
 av+0.004
Neg logit contributions
 wish-0.002
 wake-0.002
 stop-0.002
upid-0.002
ee-0.002
Tokenrobe
Token ID25481
Feature activation-4.71e-03

Pos logit contributions
 wake+0.030
 wish+0.030
 stop+0.030
 end+0.029
ee+0.028
Neg logit contributions
con-0.033
keeping-0.033
 av-0.030
ls-0.030
 olds-0.030
Token back
Token ID736
Feature activation-5.02e-03

Pos logit contributions
con+0.008
keeping+0.008
ls+0.008
 olds+0.007
 av+0.007
Neg logit contributions
 wish-0.007
 wake-0.006
upid-0.006
ee-0.006
 bet-0.006
Token onto
Token ID4291
Feature activation+2.15e-02

Pos logit contributions
keeping+0.009
con+0.009
 olds+0.008
ls+0.008
 av+0.008
Neg logit contributions
 wish-0.006
 wake-0.006
 stop-0.006
upid-0.006
ee-0.006
Token the
Token ID262
Feature activation+1.07e-02

Pos logit contributions
 wish+0.029
 wake+0.028
upid+0.027
ee+0.026
 bet+0.026
Neg logit contributions
keeping-0.038
con-0.038
ls-0.035
 olds-0.035
 av-0.034
Token clothes
Token ID8242
Feature activation+2.49e-02

Pos logit contributions
 wish+0.011
 wake+0.010
upid+0.010
ee+0.010
 bet+0.010
Neg logit contributions
keeping-0.022
con-0.022
ls-0.021
 olds-0.021
bes-0.020
Tokenline
Token ID1370
Feature activation+1.02e-01

Pos logit contributions
 wish+0.040
 wake+0.037
upid+0.036
 bet+0.036
ee+0.036
Neg logit contributions
keeping-0.038
con-0.038
ls-0.035
 olds-0.034
led-0.033
Token.
Token ID13
Feature activation-4.82e-03

Pos logit contributions
 wish+0.152
 wake+0.145
upid+0.139
ee+0.139
 bet+0.138
Neg logit contributions
keeping-0.168
con-0.164
ls-0.153
 olds-0.152
 av-0.147
Token He
Token ID679
Feature activation-1.56e-02

Pos logit contributions
keeping+0.009
con+0.009
ls+0.009
 olds+0.009
 av+0.008
Neg logit contributions
 wish-0.006
 wake-0.006
 stop-0.005
upid-0.005
ee-0.005
Token used
Token ID973
Feature activation-1.47e-02

Pos logit contributions
keeping+0.028
con+0.027
ls+0.026
 olds+0.025
 av+0.025
Neg logit contributions
 wish-0.021
 wake-0.020
 stop-0.020
upid-0.019
ee-0.019
Token all
Token ID477
Feature activation+2.14e-02

Pos logit contributions
con+0.027
keeping+0.026
ls+0.025
 av+0.024
 olds+0.024
Neg logit contributions
 wish-0.019
 wake-0.019
ee-0.018
 stop-0.017
 bet-0.017
Token his
Token ID465
Feature activation-4.22e-02

Pos logit contributions
 wish+0.032
 wake+0.030
 stop+0.028
 spy+0.027
 rule+0.026
Neg logit contributions
con-0.035
ls-0.034
keeping-0.034
led-0.033
 av-0.032
Token the
Token ID262
Feature activation-1.26e-02

Pos logit contributions
keeping+0.016
con+0.016
 av+0.015
 olds+0.014
ls+0.014
Neg logit contributions
upid-0.018
be-0.018
 wish-0.017
ee-0.017
 bet-0.016
Token edge
Token ID5743
Feature activation+1.27e-02

Pos logit contributions
keeping+0.025
con+0.025
ls+0.024
 olds+0.024
 av+0.023
Neg logit contributions
 wish-0.013
 wake-0.013
upid-0.012
ee-0.012
 bet-0.012
Token of
Token ID286
Feature activation+1.44e-02

Pos logit contributions
 wish+0.012
 wake+0.012
 stop+0.012
ee+0.011
 end+0.011
Neg logit contributions
keeping-0.027
con-0.027
ls-0.026
 olds-0.025
bes-0.025
Token the
Token ID262
Feature activation-7.28e-05

Pos logit contributions
 wish+0.021
upid+0.020
 wake+0.019
 bet+0.019
ee+0.019
Neg logit contributions
keeping-0.025
con-0.023
 av-0.022
ls-0.022
 olds-0.022
Token shore
Token ID15191
Feature activation+5.22e-02

Pos logit contributions
keeping+0.000
con+0.000
ls+0.000
 olds+0.000
 av+0.000
Neg logit contributions
 wish-0.000
 wake-0.000
upid-0.000
 stop-0.000
ee-0.000
Tokenline
Token ID1370
Feature activation+1.01e-01

Pos logit contributions
 wish+0.074
 wake+0.071
upid+0.069
ee+0.068
 stop+0.068
Neg logit contributions
keeping-0.089
con-0.088
ls-0.082
 olds-0.081
 av-0.079
Token and
Token ID290
Feature activation+2.68e-03

Pos logit contributions
 wish+0.154
 wake+0.143
upid+0.143
ee+0.140
 bet+0.139
Neg logit contributions
keeping-0.162
con-0.157
ls-0.147
 olds-0.146
led-0.143
Token just
Token ID655
Feature activation+2.18e-02

Pos logit contributions
 wish+0.004
 wake+0.004
upid+0.003
 stop+0.003
ee+0.003
Neg logit contributions
keeping-0.005
con-0.005
ls-0.004
 olds-0.004
 av-0.004
Token stood
Token ID6204
Feature activation-3.26e-03

Pos logit contributions
 wish+0.031
 wake+0.030
upid+0.030
ee+0.029
 bet+0.029
Neg logit contributions
keeping-0.035
con-0.035
led-0.033
ls-0.033
 olds-0.032
Token there
Token ID612
Feature activation-6.17e-03

Pos logit contributions
keeping+0.006
con+0.006
ls+0.005
 olds+0.005
 av+0.005
Neg logit contributions
 wish-0.004
 wake-0.004
upid-0.004
 stop-0.004
ee-0.004
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
keeping+0.010
con+0.010
ls+0.009
 olds+0.009
 av+0.009
Neg logit contributions
 wish-0.009
 wake-0.009
upid-0.008
ee-0.008
 bet-0.008
Token as
Token ID355
Feature activation+1.30e-02

Pos logit contributions
 wish+0.007
 wake+0.007
 stop+0.006
upid+0.006
ee+0.006
Neg logit contributions
keeping-0.009
con-0.009
ls-0.008
 olds-0.008
 av-0.008
Token he
Token ID339
Feature activation+9.75e-04

Pos logit contributions
 wish+0.016
 wake+0.015
 stop+0.015
ee+0.015
 dare+0.014
Neg logit contributions
keeping-0.026
con-0.025
 olds-0.024
bes-0.023
ados-0.023
Token explored
Token ID18782
Feature activation-1.05e-02

Pos logit contributions
 wish+0.001
 wake+0.001
 stop+0.001
upid+0.001
ee+0.001
Neg logit contributions
keeping-0.002
con-0.002
ls-0.002
 olds-0.002
 av-0.001
Token the
Token ID262
Feature activation-3.96e-03

Pos logit contributions
keeping+0.023
con+0.023
bes+0.021
unn+0.021
 olds+0.021
Neg logit contributions
 wish-0.010
 end-0.010
 wake-0.010
 stop-0.009
 watch-0.009
Token shore
Token ID15191
Feature activation+5.13e-02

Pos logit contributions
keeping+0.009
con+0.008
ls+0.008
 olds+0.008
 av+0.008
Neg logit contributions
 wish-0.004
 wake-0.004
upid-0.003
 stop-0.003
ee-0.003
Tokenline
Token ID1370
Feature activation+1.01e-01

Pos logit contributions
 wish+0.065
 wake+0.065
 stop+0.062
 end+0.060
ee+0.060
Neg logit contributions
keeping-0.093
con-0.093
ls-0.087
 olds-0.086
bes-0.084
Token.
Token ID13
Feature activation-5.56e-03

Pos logit contributions
 wish+0.154
 wake+0.146
upid+0.142
ee+0.140
 bet+0.140
Neg logit contributions
keeping-0.162
con-0.158
ls-0.148
 olds-0.146
 av-0.143
Token\n
Token ID198
Feature activation-3.18e-03

Pos logit contributions
con+0.010
keeping+0.010
 olds+0.009
 av+0.009
ls+0.009
Neg logit contributions
 wish-0.007
 wake-0.007
 stop-0.007
upid-0.007
be-0.007
Token\n
Token ID198
Feature activation-2.22e-03

Pos logit contributions
con+0.005
ls+0.005
keeping+0.005
 olds+0.004
Ell+0.004
Neg logit contributions
 wish-0.005
 wake-0.005
 stop-0.005
upid-0.004
 bang-0.004
TokenOne
Token ID3198
Feature activation-1.02e-02

Pos logit contributions
con+0.003
 av+0.003
 olds+0.003
Ell+0.003
ls+0.003
Neg logit contributions
 wish-0.003
 wake-0.003
upid-0.003
 stop-0.003
ee-0.003
Token day
Token ID1110
Feature activation-2.98e-02

Pos logit contributions
ls+0.018
led+0.017
 olds+0.017
glers+0.016
con+0.016
Neg logit contributions
 bang-0.012
upid-0.012
 wish-0.011
 boo-0.010
 bet-0.010
Token keep
Token ID1394
Feature activation-1.49e-02

Pos logit contributions
 wish+0.000
upid+0.000
ee+0.000
 wake+0.000
be+0.000
Neg logit contributions
keeping-0.002
con-0.001
 av-0.001
 olds-0.001
ls-0.001
Token going
Token ID1016
Feature activation-1.61e-02

Pos logit contributions
keeping+0.026
con+0.026
ls+0.025
 av+0.024
led+0.023
Neg logit contributions
 wish-0.021
upid-0.020
 wake-0.020
 bet-0.019
ee-0.019
Token beyond
Token ID3675
Feature activation-5.86e-04

Pos logit contributions
keeping+0.032
con+0.032
ls+0.030
led+0.030
 olds+0.030
Neg logit contributions
 wake-0.018
 wish-0.017
 stop-0.017
 end-0.016
 watch-0.016
Token the
Token ID262
Feature activation-1.15e-03

Pos logit contributions
keeping+0.001
con+0.001
ls+0.001
 olds+0.001
led+0.001
Neg logit contributions
 wish-0.001
 wake-0.001
 stop-0.001
upid-0.001
ee-0.001
Token shore
Token ID15191
Feature activation+5.27e-02

Pos logit contributions
keeping+0.002
ls+0.002
 olds+0.002
con+0.002
 av+0.002
Neg logit contributions
upid-0.001
 wish-0.001
 bang-0.001
 sure-0.001
 understand-0.001
Tokenline
Token ID1370
Feature activation+9.93e-02

Pos logit contributions
 wish+0.073
 wake+0.071
 stop+0.067
upid+0.067
ee+0.066
Neg logit contributions
keeping-0.092
con-0.090
ls-0.085
 olds-0.084
 av-0.082
Token and
Token ID290
Feature activation+1.08e-02

Pos logit contributions
 wish+0.152
 wake+0.142
upid+0.141
ee+0.140
 bet+0.137
Neg logit contributions
keeping-0.160
con-0.155
ls-0.144
 olds-0.144
 av-0.140
Token swim
Token ID9422
Feature activation+5.15e-03

Pos logit contributions
 wish+0.009
ee+0.009
upid+0.009
 wake+0.009
 bet+0.008
Neg logit contributions
keeping-0.024
con-0.023
 olds-0.022
ls-0.022
 av-0.022
Token out
Token ID503
Feature activation-2.38e-02

Pos logit contributions
 wish+0.006
 stop+0.006
 wake+0.006
 bet+0.006
 end+0.005
Neg logit contributions
con-0.010
keeping-0.010
ados-0.010
 av-0.010
bes-0.009
Token further
Token ID2252
Feature activation-1.17e-02

Pos logit contributions
keeping+0.047
con+0.046
ls+0.043
 olds+0.042
bes+0.042
Neg logit contributions
 wish-0.028
 wake-0.028
 stop-0.026
upid-0.026
 end-0.025
Token in
Token ID287
Feature activation-6.61e-03

Pos logit contributions
con+0.025
cakes+0.024
keeping+0.023
 av+0.023
ados+0.023
Neg logit contributions
 wish-0.012
 wake-0.012
 stop-0.012
 happy-0.011
 end-0.011
Token the
Token ID262
Feature activation-1.31e-02

Pos logit contributions
keeping+0.016
con+0.016
 av+0.015
 olds+0.014
ls+0.014
Neg logit contributions
upid-0.018
be-0.018
 wish-0.018
ee-0.017
 bet-0.017
Token edge
Token ID5743
Feature activation+1.23e-02

Pos logit contributions
keeping+0.026
con+0.026
ls+0.025
 olds+0.025
 av+0.024
Neg logit contributions
 wish-0.014
 wake-0.013
upid-0.013
ee-0.013
 bet-0.012
Token of
Token ID286
Feature activation+1.20e-02

Pos logit contributions
 wish+0.012
 wake+0.012
 stop+0.011
ee+0.011
 end+0.011
Neg logit contributions
keeping-0.027
con-0.026
ls-0.025
 olds-0.024
bes-0.024
Token the
Token ID262
Feature activation-6.21e-04

Pos logit contributions
 wish+0.017
upid+0.016
 wake+0.016
 bet+0.016
ee+0.016
Neg logit contributions
keeping-0.020
con-0.019
 av-0.018
ls-0.018
 olds-0.018
Token shore
Token ID15191
Feature activation+5.28e-02

Pos logit contributions
keeping+0.001
con+0.001
ls+0.001
 olds+0.001
 av+0.001
Neg logit contributions
 wish-0.001
 wake-0.001
upid-0.001
 stop-0.001
ee-0.001
Tokenline
Token ID1370
Feature activation+9.88e-02

Pos logit contributions
 wish+0.074
 wake+0.071
upid+0.068
 stop+0.068
ee+0.067
Neg logit contributions
keeping-0.091
con-0.090
ls-0.085
 olds-0.083
 av-0.081
Token and
Token ID290
Feature activation+1.87e-03

Pos logit contributions
 wish+0.143
 wake+0.135
upid+0.131
ee+0.129
 bet+0.128
Neg logit contributions
keeping-0.167
con-0.163
ls-0.153
 olds-0.151
led-0.148
Token just
Token ID655
Feature activation+2.12e-02

Pos logit contributions
 wish+0.003
 wake+0.002
upid+0.002
 stop+0.002
ee+0.002
Neg logit contributions
keeping-0.003
con-0.003
ls-0.003
 olds-0.003
led-0.003
Token stood
Token ID6204
Feature activation-4.09e-03

Pos logit contributions
 wish+0.031
upid+0.029
 wake+0.029
ee+0.029
 bet+0.028
Neg logit contributions
keeping-0.034
con-0.034
led-0.032
ls-0.031
 olds-0.031
Token there
Token ID612
Feature activation-6.67e-03

Pos logit contributions
keeping+0.007
con+0.007
ls+0.007
 olds+0.007
 av+0.007
Neg logit contributions
 wish-0.005
 wake-0.005
upid-0.005
 stop-0.005
ee-0.005
Token.
Token ID13
Feature activation-1.57e-02

Pos logit contributions
keeping+0.011
con+0.011
ls+0.010
 olds+0.010
 av+0.010
Neg logit contributions
 wish-0.010
 wake-0.009
upid-0.009
ee-0.009
 bet-0.009
TokenI
Token ID40
Feature activation+2.14e-02

Pos logit contributions
keeping+0.016
con+0.015
ls+0.014
 olds+0.014
bes+0.014
Neg logit contributions
 wish-0.015
 wake-0.014
upid-0.014
 stop-0.014
 plan-0.013
Token fell
Token ID3214
Feature activation-5.32e-03

Pos logit contributions
upid+0.019
 crashing+0.019
 end+0.019
 stop+0.018
 alarm+0.018
Neg logit contributions
ls-0.043
con-0.042
keeping-0.041
led-0.040
bes-0.038
Token off
Token ID572
Feature activation-3.12e-02

Pos logit contributions
keeping+0.009
con+0.009
ls+0.008
 olds+0.008
led+0.008
Neg logit contributions
 wish-0.008
 wake-0.007
 bet-0.007
 stop-0.007
ee-0.007
Token the
Token ID262
Feature activation+4.24e-04

Pos logit contributions
keeping+0.061
con+0.061
ls+0.057
 olds+0.056
 av+0.055
Neg logit contributions
 wish-0.036
 wake-0.035
 stop-0.034
upid-0.033
ee-0.033
Token clothes
Token ID8242
Feature activation+2.07e-02

Pos logit contributions
 wish+0.000
 wake+0.000
 stop+0.000
upid+0.000
ee+0.000
Neg logit contributions
keeping-0.001
con-0.001
ls-0.001
 olds-0.001
 av-0.001
Tokenline
Token ID1370
Feature activation+9.67e-02

Pos logit contributions
 wish+0.029
 wake+0.027
 bet+0.027
upid+0.026
ee+0.026
Neg logit contributions
keeping-0.036
con-0.034
ls-0.033
 olds-0.032
 av-0.031
Token and
Token ID290
Feature activation+6.23e-03

Pos logit contributions
 wish+0.121
 wake+0.120
 stop+0.115
upid+0.112
 end+0.110
Neg logit contributions
con-0.178
keeping-0.177
ls-0.169
 olds-0.164
led-0.160
Token I
Token ID314
Feature activation+3.05e-02

Pos logit contributions
 wish+0.007
 wake+0.007
 stop+0.007
 watch+0.006
 go+0.006
Neg logit contributions
con-0.013
keeping-0.012
ls-0.012
 olds-0.012
 av-0.011
Token can
Token ID460
Feature activation-2.04e-02

Pos logit contributions
upid+0.030
 end+0.029
 crashing+0.028
 wake+0.027
 ding+0.027
Neg logit contributions
ls-0.058
keeping-0.057
con-0.057
led-0.056
bes-0.053
Token't
Token ID470
Feature activation+1.13e-02

Pos logit contributions
con+0.035
ls+0.034
keeping+0.034
bes+0.033
hog+0.032
Neg logit contributions
 wish-0.023
upid-0.023
 bet-0.022
ee-0.022
 sure-0.021
Token get
Token ID651
Feature activation-1.56e-02

Pos logit contributions
upid+0.012
ee+0.010
 wish+0.010
 bet+0.010
 ding+0.009
Neg logit contributions
keeping-0.022
con-0.021
ls-0.020
bes-0.020
 av-0.020
Token far
Token ID1290
Feature activation+1.34e-02

Pos logit contributions
 wish+0.004
 wake+0.004
 stop+0.003
 happy+0.003
 watch+0.003
Neg logit contributions
keeping-0.007
con-0.007
ls-0.007
 olds-0.006
led-0.006
Token away
Token ID1497
Feature activation-2.36e-02

Pos logit contributions
 wish+0.015
 wake+0.014
 end+0.014
 stop+0.014
 part+0.013
Neg logit contributions
con-0.028
keeping-0.027
bes-0.027
cakes-0.026
 av-0.025
Token from
Token ID422
Feature activation-6.55e-03

Pos logit contributions
con+0.050
keeping+0.046
bes+0.046
ados+0.045
 av+0.044
Neg logit contributions
 wish-0.026
 wake-0.025
 stop-0.024
 part-0.024
 happy-0.024
Token the
Token ID262
Feature activation+1.42e-02

Pos logit contributions
keeping+0.013
con+0.013
ls+0.012
 olds+0.012
led+0.012
Neg logit contributions
 wish-0.007
 wake-0.007
 stop-0.007
upid-0.006
 end-0.006
Token nos
Token ID43630
Feature activation+3.04e-02

Pos logit contributions
 wish+0.016
 wake+0.015
 stop+0.014
 end+0.014
 bet+0.013
Neg logit contributions
keeping-0.029
con-0.029
ls-0.027
 olds-0.027
led-0.027
Tokeny
Token ID88
Feature activation+9.65e-02

Pos logit contributions
 wish+0.031
ee+0.031
 wake+0.030
upid+0.029
 end+0.028
Neg logit contributions
con-0.063
keeping-0.061
bes-0.060
 olds-0.057
ls-0.057
Token villagers
Token ID25263
Feature activation+3.31e-02

Pos logit contributions
 wish+0.108
 wake+0.106
 stop+0.099
 end+0.099
 watch+0.095
Neg logit contributions
keeping-0.196
con-0.195
ls-0.185
 olds-0.182
bes-0.182
Token.
Token ID13
Feature activation+5.83e-03

Pos logit contributions
 wish+0.041
 wake+0.040
 stop+0.038
 end+0.037
upid+0.036
Neg logit contributions
con-0.062
keeping-0.061
ls-0.058
 olds-0.057
ados-0.057
Token The
Token ID383
Feature activation-9.84e-03

Pos logit contributions
 wish+0.007
 wake+0.006
upid+0.006
 stop+0.006
 bet+0.006
Neg logit contributions
keeping-0.011
con-0.011
ls-0.011
 olds-0.011
led-0.010
Token elder
Token ID10550
Feature activation-1.32e-02

Pos logit contributions
keeping+0.017
con+0.016
ls+0.015
 olds+0.015
 av+0.015
Neg logit contributions
 wish-0.014
 wake-0.014
upid-0.013
 stop-0.013
ee-0.013
Token noted
Token ID4367
Feature activation-1.28e-02

Pos logit contributions
keeping+0.027
con+0.027
bes+0.025
ls+0.025
ados+0.024
Neg logit contributions
 wish-0.015
 end-0.014
 wake-0.014
 sure-0.013
 heart-0.013
Token mom
Token ID1995
Feature activation-5.95e-03

Pos logit contributions
upid+0.012
 wish+0.011
ee+0.011
 bet+0.011
 wake+0.011
Neg logit contributions
con-0.011
keeping-0.011
ls-0.011
 av-0.010
ries-0.010
Token and
Token ID290
Feature activation+4.25e-02

Pos logit contributions
keeping+0.013
con+0.013
bes+0.012
ls+0.012
 olds+0.011
Neg logit contributions
 end-0.006
 wake-0.006
 stop-0.006
 wish-0.006
 happy-0.006
Token to
Token ID284
Feature activation+2.00e-02

Pos logit contributions
 wish+0.055
 wake+0.054
 stop+0.052
 watch+0.049
 end+0.049
Neg logit contributions
keeping-0.079
con-0.079
ls-0.075
 olds-0.073
bes-0.072
Token the
Token ID262
Feature activation+2.04e-02

Pos logit contributions
ee+0.029
 wish+0.029
upid+0.028
 bet+0.028
ack+0.026
Neg logit contributions
keeping-0.031
con-0.030
 av-0.029
 olds-0.028
ados-0.027
Token ost
Token ID23619
Feature activation+3.72e-02

Pos logit contributions
upid+0.036
ee+0.034
 understand+0.031
 bet+0.031
 wish+0.031
Neg logit contributions
ls-0.026
con-0.026
keeping-0.024
 olds-0.024
 bottles-0.023
Tokenric
Token ID1173
Feature activation+9.63e-02

Pos logit contributions
 wish+0.042
 wake+0.038
 sure+0.035
 bet+0.035
 dare+0.035
Neg logit contributions
keeping-0.078
con-0.077
ls-0.074
ries-0.073
 olds-0.072
Tokenhes
Token ID956
Feature activation+3.61e-02

Pos logit contributions
 wish+0.178
 wake+0.164
 bet+0.162
 plan+0.156
 stop+0.153
Neg logit contributions
ls-0.141
led-0.138
keeping-0.130
con-0.127
bes-0.121
Token.
Token ID13
Feature activation-1.49e-03

Pos logit contributions
 wish+0.055
 wake+0.051
 bet+0.051
upid+0.050
ee+0.050
Neg logit contributions
keeping-0.057
con-0.054
ls-0.053
led-0.051
 olds-0.050
Token They
Token ID1119
Feature activation+5.70e-03

Pos logit contributions
con+0.003
keeping+0.003
ls+0.003
led+0.003
 av+0.003
Neg logit contributions
 wish-0.002
 wake-0.002
 stop-0.002
 bet-0.002
 watch-0.002
Token learned
Token ID4499
Feature activation-1.81e-02

Pos logit contributions
 wish+0.008
 wake+0.008
 stop+0.007
 end+0.007
 sure+0.007
Neg logit contributions
keeping-0.011
con-0.010
 olds-0.010
ls-0.010
ados-0.009
Token their
Token ID511
Feature activation-1.17e-02

Pos logit contributions
keeping+0.040
con+0.039
ls+0.037
 olds+0.037
led+0.036
Neg logit contributions
 wish-0.018
 wake-0.016
 stop-0.016
 end-0.015
upid-0.015
Token Lily
Token ID20037
Feature activation+1.09e-02

Pos logit contributions
 wish+0.010
 wake+0.010
 stop+0.009
 happy+0.009
 watch+0.009
Neg logit contributions
keeping-0.020
con-0.020
ls-0.019
 olds-0.019
bes-0.019
Token hiding
Token ID11816
Feature activation+1.73e-02

Pos logit contributions
 wish+0.016
upid+0.016
ee+0.016
 bet+0.016
 dare+0.015
Neg logit contributions
keeping-0.017
con-0.016
ls-0.016
led-0.015
 av-0.015
Token under
Token ID739
Feature activation+9.22e-03

Pos logit contributions
 wish+0.022
 stop+0.022
 part+0.022
 wake+0.021
 happy+0.021
Neg logit contributions
con-0.034
unn-0.031
pill-0.031
keeping-0.030
 olds-0.030
Token the
Token ID262
Feature activation+2.82e-02

Pos logit contributions
 wish+0.010
 wake+0.009
 stop+0.009
upid+0.009
ee+0.009
Neg logit contributions
keeping-0.019
con-0.019
ls-0.018
 olds-0.018
led-0.017
Token t
Token ID256
Feature activation+4.66e-02

Pos logit contributions
 wish+0.032
 wake+0.031
 stop+0.030
 end+0.029
 watch+0.029
Neg logit contributions
keeping-0.057
con-0.056
ls-0.052
 olds-0.052
led-0.052
Tokenarp
Token ID5117
Feature activation+9.61e-02

Pos logit contributions
 wish+0.086
 wake+0.083
 bet+0.081
 sure+0.080
 dare+0.080
Neg logit contributions
keeping-0.066
ls-0.064
con-0.059
ries-0.057
ados-0.056
Token.
Token ID13
Feature activation-4.61e-03

Pos logit contributions
 wish+0.166
 bet+0.149
 wake+0.149
ee+0.148
 dare+0.147
Neg logit contributions
keeping-0.145
con-0.138
ls-0.129
 olds-0.126
led-0.123
Token He
Token ID679
Feature activation-1.72e-02

Pos logit contributions
keeping+0.008
con+0.008
ls+0.007
 olds+0.007
 av+0.007
Neg logit contributions
 wish-0.007
 wake-0.006
 stop-0.006
upid-0.006
ee-0.006
Token knew
Token ID2993
Feature activation-1.93e-03

Pos logit contributions
keeping+0.036
con+0.033
ados+0.033
 olds+0.031
ls+0.030
Neg logit contributions
 wish-0.022
 wake-0.020
 happy-0.019
 stop-0.019
 sure-0.019
Token they
Token ID484
Feature activation+1.16e-02

Pos logit contributions
keeping+0.004
con+0.004
ls+0.004
 olds+0.004
bes+0.004
Neg logit contributions
 wish-0.002
 wake-0.002
 stop-0.002
 sure-0.002
 end-0.002
Token were
Token ID547
Feature activation+1.75e-02

Pos logit contributions
 wish+0.016
 wake+0.015
 stop+0.014
 end+0.014
 bet+0.014
Neg logit contributions
con-0.021
keeping-0.021
 olds-0.020
ls-0.020
ados-0.019
Token jumped
Token ID11687
Feature activation-2.16e-03

Pos logit contributions
 wish+0.004
 wake+0.003
ee+0.003
 stop+0.003
 bet+0.003
Neg logit contributions
keeping-0.005
con-0.005
ls-0.005
 olds-0.005
ados-0.005
Token onto
Token ID4291
Feature activation+1.24e-03

Pos logit contributions
keeping+0.004
con+0.004
ls+0.003
 olds+0.003
led+0.003
Neg logit contributions
 wish-0.003
 wake-0.003
upid-0.003
ee-0.003
 bet-0.003
Token the
Token ID262
Feature activation-4.93e-03

Pos logit contributions
 wish+0.002
 wake+0.002
upid+0.001
ee+0.001
 stop+0.001
Neg logit contributions
keeping-0.002
con-0.002
ls-0.002
 olds-0.002
 av-0.002
Token zip
Token ID19974
Feature activation+2.22e-02

Pos logit contributions
con+0.010
keeping+0.010
ls+0.009
 olds+0.009
bes+0.009
Neg logit contributions
 wish-0.005
 wake-0.005
upid-0.005
 bet-0.005
ee-0.005
Token-
Token ID12
Feature activation+1.25e-02

Pos logit contributions
 wish+0.028
 wake+0.027
 stop+0.026
upid+0.026
ee+0.025
Neg logit contributions
keeping-0.041
con-0.041
ls-0.038
 olds-0.038
 av-0.037
Tokenline
Token ID1370
Feature activation+9.50e-02

Pos logit contributions
 wish+0.018
 wake+0.017
 stop+0.017
upid+0.017
ee+0.016
Neg logit contributions
keeping-0.021
con-0.021
ls-0.020
 olds-0.019
 av-0.019
Token and
Token ID290
Feature activation+3.80e-04

Pos logit contributions
 wish+0.134
 wake+0.127
upid+0.123
ee+0.122
 bet+0.121
Neg logit contributions
keeping-0.163
con-0.160
ls-0.150
 olds-0.149
 av-0.145
Token z
Token ID1976
Feature activation+1.11e-03

Pos logit contributions
 wish+0.000
 wake+0.000
 stop+0.000
 sure+0.000
 watch+0.000
Neg logit contributions
keeping-0.001
con-0.001
 olds-0.001
ados-0.001
 av-0.001
Tokenipped
Token ID3949
Feature activation-1.48e-02

Pos logit contributions
 wish+0.002
 wake+0.002
 stop+0.002
 end+0.002
 bet+0.002
Neg logit contributions
con-0.001
keeping-0.001
ls-0.001
led-0.001
 olds-0.001
Token around
Token ID1088
Feature activation+1.82e-02

Pos logit contributions
keeping+0.032
con+0.031
ls+0.029
 olds+0.029
 av+0.029
Neg logit contributions
 wish-0.015
 wake-0.014
 stop-0.014
 sure-0.013
 end-0.013
Token the
Token ID262
Feature activation-3.08e-04

Pos logit contributions
 wish+0.021
 wake+0.021
 stop+0.020
 happy+0.020
 watch+0.019
Neg logit contributions
keeping-0.036
con-0.036
ls-0.035
led-0.034
 olds-0.034
Token off
Token ID572
Feature activation-1.74e-02

Pos logit contributions
 wish+0.029
 wake+0.028
 stop+0.027
 watch+0.026
 go+0.026
Neg logit contributions
keeping-0.047
con-0.047
 olds-0.043
ls-0.043
bes-0.042
Token or
Token ID393
Feature activation-1.99e-02

Pos logit contributions
keeping+0.039
con+0.038
bes+0.036
 olds+0.036
Ell+0.035
Neg logit contributions
 stop-0.015
 wish-0.015
 wake-0.015
 happy-0.015
 go-0.014
Token play
Token ID711
Feature activation+9.37e-03

Pos logit contributions
con+0.042
keeping+0.042
ls+0.040
 olds+0.039
bes+0.038
Neg logit contributions
 wish-0.021
 wake-0.020
 stop-0.020
 watch-0.018
 end-0.018
Token in
Token ID287
Feature activation-1.15e-03

Pos logit contributions
 wish+0.013
 wake+0.013
 stop+0.012
upid+0.012
ee+0.012
Neg logit contributions
keeping-0.016
con-0.016
ls-0.015
 olds-0.015
 av-0.014
Token the
Token ID262
Feature activation+1.60e-02

Pos logit contributions
keeping+0.002
con+0.002
ls+0.002
 olds+0.002
 av+0.002
Neg logit contributions
 wish-0.001
 wake-0.001
upid-0.001
 stop-0.001
ee-0.001
Token sandy
Token ID44039
Feature activation+9.44e-02

Pos logit contributions
 wish+0.014
upid+0.014
ee+0.013
 wake+0.013
 bang+0.013
Neg logit contributions
keeping-0.034
con-0.034
ls-0.033
 olds-0.033
 av-0.032
Token ground
Token ID2323
Feature activation+3.78e-02

Pos logit contributions
 wish+0.106
 wake+0.102
 stop+0.096
upid+0.095
ee+0.094
Neg logit contributions
keeping-0.189
con-0.187
ls-0.177
 olds-0.174
 av-0.171
Token!
Token ID0
Feature activation+9.67e-03

Pos logit contributions
 wish+0.044
 wake+0.043
 stop+0.042
 end+0.042
 watch+0.040
Neg logit contributions
keeping-0.075
con-0.073
 olds-0.069
ls-0.068
bes-0.067
Token But
Token ID887
Feature activation+2.26e-03

Pos logit contributions
 dare+0.010
 wish+0.010
 stop+0.010
 go+0.010
 sure+0.009
Neg logit contributions
keeping-0.020
ls-0.019
led-0.019
bes-0.019
con-0.018
Token before
Token ID878
Feature activation+1.64e-03

Pos logit contributions
 wish+0.003
 wake+0.003
 stop+0.003
 end+0.003
 dare+0.003
Neg logit contributions
keeping-0.004
con-0.004
ls-0.004
 olds-0.004
bes-0.004
Token he
Token ID339
Feature activation+1.50e-02

Pos logit contributions
 wish+0.002
 stop+0.002
 wake+0.002
 end+0.002
 go+0.002
Neg logit contributions
keeping-0.003
con-0.003
led-0.003
 olds-0.003
bes-0.003
Token to
Token ID284
Feature activation+2.72e-02

Pos logit contributions
con+0.034
keeping+0.033
ls+0.032
bes+0.032
unn+0.031
Neg logit contributions
 stop-0.015
 wish-0.014
 wake-0.014
 part-0.014
 end-0.014
Token the
Token ID262
Feature activation+1.85e-02

Pos logit contributions
ee+0.037
 wish+0.036
be+0.036
upid+0.035
 wake+0.035
Neg logit contributions
keeping-0.046
con-0.044
 av-0.042
 olds-0.042
ls-0.040
Token biggest
Token ID4094
Feature activation+6.59e-02

Pos logit contributions
 wish+0.019
upid+0.019
 wake+0.018
ee+0.018
 bet+0.017
Neg logit contributions
keeping-0.037
con-0.037
 olds-0.035
ls-0.035
 av-0.035
Token,
Token ID11
Feature activation+2.90e-02

Pos logit contributions
 wish+0.086
 wake+0.080
 stop+0.076
 end+0.075
 plan+0.074
Neg logit contributions
con-0.126
keeping-0.125
ls-0.119
 olds-0.117
led-0.117
Token sh
Token ID427
Feature activation+5.49e-02

Pos logit contributions
 wish+0.031
 wake+0.030
 stop+0.028
 bet+0.027
 dare+0.027
Neg logit contributions
keeping-0.060
con-0.059
ls-0.056
 olds-0.055
bes-0.054
Tokenini
Token ID5362
Feature activation+9.39e-02

Pos logit contributions
 wish+0.094
 wake+0.091
 bet+0.088
upid+0.087
 plan+0.087
Neg logit contributions
keeping-0.080
con-0.077
led-0.073
ls-0.071
 olds-0.070
Tokenest
Token ID395
Feature activation+5.01e-02

Pos logit contributions
 wish+0.136
upid+0.130
 wake+0.129
 bet+0.125
 dare+0.123
Neg logit contributions
keeping-0.151
con-0.149
led-0.144
ls-0.138
 av-0.134
Token fish
Token ID5916
Feature activation+3.41e-02

Pos logit contributions
 wish+0.059
 wake+0.053
 stop+0.051
 end+0.051
 watch+0.048
Neg logit contributions
keeping-0.103
con-0.102
ls-0.098
 olds-0.097
bes-0.096
Token in
Token ID287
Feature activation+9.20e-05

Pos logit contributions
 wish+0.043
 wake+0.041
 stop+0.040
 sure+0.039
ee+0.038
Neg logit contributions
con-0.063
keeping-0.063
 olds-0.059
ls-0.058
bes-0.058
Token the
Token ID262
Feature activation+1.96e-02

Pos logit contributions
 wish+0.000
 wake+0.000
 stop+0.000
upid+0.000
ee+0.000
Neg logit contributions
keeping-0.000
con-0.000
ls-0.000
 olds-0.000
 av-0.000
Token pond
Token ID16723
Feature activation+5.26e-02

Pos logit contributions
 wish+0.021
 wake+0.020
 stop+0.018
upid+0.018
ee+0.018
Neg logit contributions
keeping-0.041
con-0.040
ls-0.038
 olds-0.038
 av-0.037
Token walked
Token ID6807
Feature activation-1.44e-02

Pos logit contributions
 wish+0.016
 wake+0.016
upid+0.015
 stop+0.015
ee+0.015
Neg logit contributions
keeping-0.018
con-0.018
ls-0.017
 olds-0.016
 av-0.016
Token away
Token ID1497
Feature activation-3.10e-02

Pos logit contributions
con+0.033
keeping+0.031
 av+0.031
pill+0.031
ados+0.031
Neg logit contributions
 stop-0.013
 wake-0.012
 happy-0.012
 part-0.012
 watch-0.012
Token from
Token ID422
Feature activation-1.54e-02

Pos logit contributions
con+0.062
keeping+0.061
ados+0.058
 av+0.058
pill+0.057
Neg logit contributions
 stop-0.036
 wish-0.036
 happy-0.035
 wake-0.034
 sure-0.033
Token the
Token ID262
Feature activation+1.48e-02

Pos logit contributions
con+0.033
keeping+0.032
ls+0.031
 olds+0.030
bes+0.030
Neg logit contributions
 wish-0.016
 stop-0.015
 wake-0.015
 end-0.015
 watch-0.014
Token boss
Token ID6478
Feature activation+2.53e-02

Pos logit contributions
 wish+0.015
 wake+0.015
upid+0.015
ee+0.014
 bet+0.014
Neg logit contributions
keeping-0.030
con-0.030
ls-0.029
 olds-0.028
 av-0.027
Tokeny
Token ID88
Feature activation+9.38e-02

Pos logit contributions
 wish+0.030
 wake+0.029
ee+0.028
 stop+0.028
 end+0.028
Neg logit contributions
con-0.049
keeping-0.048
ls-0.045
bes-0.045
 olds-0.045
Token girl
Token ID2576
Feature activation+4.62e-02

Pos logit contributions
 wish+0.095
 wake+0.092
 stop+0.087
 end+0.084
ee+0.083
Neg logit contributions
keeping-0.200
con-0.197
ls-0.188
 olds-0.185
 av-0.182
Token.
Token ID13
Feature activation+3.53e-03

Pos logit contributions
 wish+0.049
 wake+0.048
 stop+0.046
 end+0.046
 watch+0.044
Neg logit contributions
con-0.093
keeping-0.092
ls-0.089
 av-0.088
led-0.087
Token Tim
Token ID5045
Feature activation-6.27e-03

Pos logit contributions
 wish+0.004
 bet+0.004
 wake+0.004
 stop+0.004
 sure+0.004
Neg logit contributions
keeping-0.007
con-0.006
ls-0.006
led-0.006
 av-0.006
Tokenmy
Token ID1820
Feature activation+1.43e-02

Pos logit contributions
keeping+0.011
con+0.010
ls+0.010
led+0.009
 olds+0.009
Neg logit contributions
 wish-0.009
 wake-0.009
 stop-0.008
upid-0.008
 dare-0.008
Token was
Token ID373
Feature activation+2.38e-02

Pos logit contributions
 wish+0.020
 wake+0.019
 stop+0.018
ee+0.018
 bet+0.018
Neg logit contributions
keeping-0.026
con-0.025
ls-0.024
 olds-0.023
 av-0.023
Token books
Token ID3835
Feature activation-3.63e-02

Pos logit contributions
keeping+0.032
con+0.032
ls+0.029
 olds+0.028
 av+0.027
Neg logit contributions
 wish-0.051
 wake-0.050
upid-0.048
 stop-0.048
ee-0.048
Token back
Token ID736
Feature activation+1.19e-03

Pos logit contributions
keeping+0.074
con+0.073
ls+0.069
 olds+0.068
 av+0.066
Neg logit contributions
 wish-0.040
 wake-0.038
 stop-0.037
upid-0.036
ee-0.036
Token on
Token ID319
Feature activation+1.31e-02

Pos logit contributions
 wish+0.001
 wake+0.001
 stop+0.001
ee+0.001
upid+0.001
Neg logit contributions
con-0.002
keeping-0.002
 olds-0.002
ls-0.002
 av-0.002
Token the
Token ID262
Feature activation+1.65e-02

Pos logit contributions
 wish+0.012
 end+0.011
 wake+0.011
 stop+0.011
 time+0.011
Neg logit contributions
con-0.029
keeping-0.029
ls-0.028
 olds-0.027
led-0.027
Token books
Token ID3835
Feature activation-8.56e-03

Pos logit contributions
 wish+0.025
 wake+0.024
 stop+0.024
 end+0.023
 watch+0.022
Neg logit contributions
keeping-0.027
con-0.026
ls-0.026
 olds-0.024
led-0.024
Tokenhe
Token ID258
Feature activation+9.32e-02

Pos logit contributions
keeping+0.018
con+0.018
ls+0.017
 olds+0.016
 av+0.016
Neg logit contributions
 wish-0.009
 wake-0.008
upid-0.008
 bet-0.008
 stop-0.008
Tokenlf
Token ID1652
Feature activation+1.20e-02

Pos logit contributions
 wish+0.193
 wake+0.189
 stop+0.183
upid+0.181
ee+0.181
Neg logit contributions
keeping-0.098
con-0.097
ls-0.087
 olds-0.085
 av-0.081
Token,
Token ID11
Feature activation+1.37e-02

Pos logit contributions
 wish+0.014
 stop+0.014
 wake+0.014
 end+0.013
 go+0.013
Neg logit contributions
con-0.022
keeping-0.022
ls-0.022
 olds-0.021
 av-0.021
Token smiling
Token ID16755
Feature activation+4.11e-02

Pos logit contributions
 wish+0.018
 wake+0.018
 stop+0.017
 bet+0.016
upid+0.016
Neg logit contributions
keeping-0.025
con-0.024
ls-0.023
 olds-0.023
led-0.022
Token proudly
Token ID23017
Feature activation+1.63e-05

Pos logit contributions
 wish+0.050
 wake+0.049
 stop+0.047
 sure+0.045
ee+0.045
Neg logit contributions
keeping-0.078
con-0.078
ls-0.073
 olds-0.072
 av-0.070
Token.
Token ID13
Feature activation+4.79e-04

Pos logit contributions
 wish+0.000
 wake+0.000
 stop+0.000
upid+0.000
ee+0.000
Neg logit contributions
keeping-0.000
con-0.000
ls-0.000
 olds-0.000
 av-0.000
Token h
Token ID289
Feature activation-1.04e-02

Pos logit contributions
ls+0.018
con+0.018
 av+0.017
 cores+0.016
hog+0.016
Neg logit contributions
upid-0.015
 wish-0.015
 end-0.015
ass-0.014
 actor-0.014
Tokenanger
Token ID2564
Feature activation+2.04e-03

Pos logit contributions
con+0.017
keeping+0.016
ls+0.015
 olds+0.015
bes+0.015
Neg logit contributions
 wish-0.015
 wake-0.015
 bet-0.014
 plan-0.014
upid-0.014
Token from
Token ID422
Feature activation-2.44e-02

Pos logit contributions
 wish+0.002
 wake+0.002
 stop+0.002
ee+0.002
 end+0.002
Neg logit contributions
keeping-0.004
con-0.004
ls-0.004
 olds-0.004
 av-0.004
Token the
Token ID262
Feature activation+7.07e-03

Pos logit contributions
keeping+0.051
con+0.051
ls+0.048
 olds+0.048
 av+0.047
Neg logit contributions
 wish-0.025
 wake-0.024
 stop-0.023
upid-0.022
ee-0.022
Token clothes
Token ID8242
Feature activation+2.26e-02

Pos logit contributions
 wish+0.008
 wake+0.008
upid+0.007
ee+0.007
 bet+0.007
Neg logit contributions
keeping-0.014
con-0.014
ls-0.013
 olds-0.013
 av-0.013
Tokenline
Token ID1370
Feature activation+9.31e-02

Pos logit contributions
 wish+0.034
 wake+0.031
upid+0.031
 bet+0.031
ee+0.030
Neg logit contributions
keeping-0.038
con-0.036
ls-0.034
 olds-0.033
 av-0.033
Token.
Token ID13
Feature activation-1.46e-02

Pos logit contributions
 wish+0.126
 wake+0.122
 stop+0.117
upid+0.116
ee+0.115
Neg logit contributions
keeping-0.164
con-0.162
ls-0.153
 olds-0.150
 av-0.146
Token He
Token ID679
Feature activation-1.66e-02

Pos logit contributions
keeping+0.027
con+0.026
ls+0.026
led+0.024
 olds+0.024
Neg logit contributions
 wish-0.019
 wake-0.018
 stop-0.017
 bet-0.017
upid-0.017
Token set
Token ID900
Feature activation-3.29e-03

Pos logit contributions
keeping+0.033
con+0.031
ados+0.030
 olds+0.030
ls+0.029
Neg logit contributions
 wish-0.021
 wake-0.021
 sure-0.020
 stop-0.019
ee-0.019
Token the
Token ID262
Feature activation-3.02e-02

Pos logit contributions
keeping+0.006
con+0.006
ls+0.005
 olds+0.005
ados+0.005
Neg logit contributions
 wish-0.005
 wake-0.004
 bet-0.004
upid-0.004
ee-0.004
Token h
Token ID289
Feature activation-1.71e-02

Pos logit contributions
keeping+0.067
con+0.065
ls+0.062
 olds+0.061
 av+0.060
Neg logit contributions
 wish-0.027
upid-0.027
 wake-0.025
 bet-0.025
ee-0.025
Token when
Token ID618
Feature activation+4.20e-03

Pos logit contributions
keeping+0.013
con+0.013
ls+0.013
 olds+0.012
 av+0.012
Neg logit contributions
 wish-0.008
 wake-0.007
 stop-0.007
upid-0.007
ee-0.007
Token it
Token ID340
Feature activation+9.97e-03

Pos logit contributions
 wish+0.004
 wake+0.004
 stop+0.004
 end+0.004
upid+0.004
Neg logit contributions
keeping-0.009
con-0.009
ls-0.008
 olds-0.008
bes-0.008
Token hugged
Token ID41130
Feature activation+3.94e-03

Pos logit contributions
upid+0.017
 wish+0.016
 bet+0.016
 understand+0.016
 rule+0.015
Neg logit contributions
led-0.014
ls-0.014
con-0.012
keeping-0.011
glers-0.011
Token the
Token ID262
Feature activation-1.48e-02

Pos logit contributions
 wish+0.005
 wake+0.004
 stop+0.004
upid+0.004
ee+0.004
Neg logit contributions
keeping-0.008
con-0.008
ls-0.007
 olds-0.007
 av-0.007
Token shore
Token ID15191
Feature activation+4.87e-02

Pos logit contributions
keeping+0.025
 olds+0.024
ls+0.024
 jewels+0.023
con+0.023
Neg logit contributions
upid-0.019
ee-0.017
 wish-0.017
 bet-0.017
 sure-0.017
Tokenline
Token ID1370
Feature activation+9.31e-02

Pos logit contributions
 stop+0.059
 wake+0.058
 wish+0.058
 end+0.057
 happy+0.057
Neg logit contributions
keeping-0.098
con-0.093
bes-0.088
ls-0.085
Ell-0.084
Token again
Token ID757
Feature activation-8.51e-03

Pos logit contributions
 wish+0.134
 wake+0.131
 stop+0.126
 end+0.123
upid+0.123
Neg logit contributions
keeping-0.159
con-0.157
ls-0.146
 olds-0.144
bes-0.139
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
keeping+0.015
con+0.015
 olds+0.014
ls+0.014
bes+0.013
Neg logit contributions
 wish-0.012
 wake-0.012
 stop-0.011
 watch-0.011
ee-0.011
Token it
Token ID340
Feature activation+1.27e-02

Pos logit contributions
 wish+0.010
 wake+0.009
 stop+0.009
upid+0.008
ee+0.008
Neg logit contributions
keeping-0.027
con-0.026
 olds-0.025
ls-0.025
bes-0.024
Token noticed
Token ID6810
Feature activation+1.44e-02

Pos logit contributions
upid+0.019
 wish+0.019
 wake+0.018
ee+0.018
 bet+0.018
Neg logit contributions
con-0.019
ls-0.019
led-0.019
keeping-0.018
 olds-0.017
Token a
Token ID257
Feature activation-3.26e-03

Pos logit contributions
 wish+0.014
 wake+0.013
 stop+0.013
 end+0.012
upid+0.012
Neg logit contributions
keeping-0.031
con-0.031
ls-0.029
 olds-0.029
bes-0.028
Token share
Token ID2648
Feature activation-1.71e-02

Pos logit contributions
ee+0.018
upid+0.018
 wish+0.018
 bet+0.016
be+0.016
Neg logit contributions
keeping-0.035
con-0.034
 av-0.032
bes-0.031
 olds-0.030
Token stories
Token ID3923
Feature activation-3.97e-02

Pos logit contributions
con+0.039
keeping+0.039
 olds+0.037
ls+0.036
led+0.036
Neg logit contributions
 wish-0.015
 wake-0.014
 happy-0.014
 stop-0.013
 sure-0.013
Token about
Token ID546
Feature activation+2.09e-02

Pos logit contributions
keeping+0.073
con+0.072
ls+0.068
 olds+0.067
ados+0.066
Neg logit contributions
 wake-0.049
 wish-0.048
 stop-0.048
 sure-0.047
 happy-0.046
Token the
Token ID262
Feature activation+1.41e-02

Pos logit contributions
 wish+0.022
 wake+0.021
 stop+0.020
 end+0.019
 watch+0.019
Neg logit contributions
keeping-0.043
con-0.043
ls-0.040
 olds-0.040
led-0.039
Token nos
Token ID43630
Feature activation+2.94e-02

Pos logit contributions
 wish+0.017
upid+0.017
 wake+0.016
 stop+0.016
 bet+0.016
Neg logit contributions
con-0.026
keeping-0.026
ls-0.025
 olds-0.024
 av-0.024
Tokeny
Token ID88
Feature activation+9.28e-02

Pos logit contributions
 wish+0.034
 wake+0.034
ee+0.032
upid+0.032
 stop+0.031
Neg logit contributions
keeping-0.057
con-0.056
ls-0.052
 olds-0.052
bes-0.051
Token chicken
Token ID9015
Feature activation+2.80e-02

Pos logit contributions
 wish+0.129
upid+0.123
 wake+0.120
 stop+0.118
 bet+0.118
Neg logit contributions
con-0.159
keeping-0.159
ls-0.151
 olds-0.146
led-0.144
Token.
Token ID13
Feature activation-1.42e-03

Pos logit contributions
 wish+0.036
 wake+0.035
 stop+0.034
 end+0.033
ee+0.032
Neg logit contributions
keeping-0.051
con-0.050
ls-0.047
 olds-0.047
 av-0.046
Token Everyone
Token ID11075
Feature activation+2.17e-02

Pos logit contributions
keeping+0.003
con+0.003
 olds+0.002
ls+0.002
 av+0.002
Neg logit contributions
 wish-0.002
 wake-0.002
 stop-0.002
upid-0.002
ee-0.002
Token loved
Token ID6151
Feature activation+3.24e-02

Pos logit contributions
 wish+0.031
 wake+0.030
upid+0.030
 stop+0.029
 dare+0.029
Neg logit contributions
keeping-0.035
con-0.034
ls-0.034
led-0.033
 olds-0.032
Token the
Token ID262
Feature activation-2.48e-03

Pos logit contributions
 wake+0.034
 wish+0.034
 stop+0.032
 sure+0.032
 watch+0.031
Neg logit contributions
keeping-0.066
con-0.065
ls-0.063
 olds-0.061
led-0.061
Token-
Token ID12
Feature activation-9.84e-03

Pos logit contributions
keeping+0.065
con+0.064
ls+0.059
 olds+0.058
 av+0.057
Neg logit contributions
 wish-0.062
 wake-0.060
 stop-0.057
ee-0.057
upid-0.057
Tokeneight
Token ID26022
Feature activation-5.36e-02

Pos logit contributions
con+0.014
keeping+0.014
 olds+0.013
led+0.013
 av+0.013
Neg logit contributions
 wish-0.017
 wake-0.017
 stop-0.016
upid-0.015
 plan-0.015
Token,
Token ID11
Feature activation-2.22e-02

Pos logit contributions
keeping+0.121
con+0.102
ls+0.100
led+0.099
glers+0.099
Neg logit contributions
 wish-0.055
 end-0.055
 time-0.053
ee-0.053
 happy-0.050
Token twenty
Token ID8208
Feature activation-4.32e-02

Pos logit contributions
con+0.047
keeping+0.046
 olds+0.045
ls+0.045
 av+0.044
Neg logit contributions
 wish-0.021
 wake-0.021
upid-0.020
 stop-0.019
ee-0.019
Token-
Token ID12
Feature activation-1.18e-02

Pos logit contributions
keeping+0.074
con+0.071
ls+0.066
 olds+0.065
 av+0.064
Neg logit contributions
 wish-0.064
 wake-0.060
ee-0.059
 end-0.058
 stop-0.058
Tokennine
Token ID30888
Feature activation-1.06e-01

Pos logit contributions
con+0.017
keeping+0.016
 olds+0.015
 av+0.015
ls+0.015
Neg logit contributions
 wish-0.021
 wake-0.020
 stop-0.019
 plan-0.019
upid-0.019
Token,
Token ID11
Feature activation-2.49e-02

Pos logit contributions
keeping+0.228
 olds+0.193
ls+0.189
con+0.188
cakes+0.186
Neg logit contributions
ee-0.123
 wish-0.120
 sure-0.111
 end-0.110
 stop-0.110
Token thirty
Token ID12277
Feature activation-4.57e-02

Pos logit contributions
con+0.055
keeping+0.055
ls+0.053
 olds+0.052
 av+0.052
Neg logit contributions
 wish-0.022
 wake-0.021
upid-0.019
 stop-0.019
ee-0.019
Token,
Token ID11
Feature activation-2.36e-02

Pos logit contributions
keeping+0.083
con+0.080
ls+0.075
 olds+0.074
 av+0.073
Neg logit contributions
 wish-0.062
 wake-0.058
ee-0.057
 stop-0.056
upid-0.056
Token thirty
Token ID12277
Feature activation-4.38e-02

Pos logit contributions
con+0.053
keeping+0.053
ls+0.051
 olds+0.050
 av+0.050
Neg logit contributions
 wish-0.020
 wake-0.019
upid-0.018
 stop-0.017
ee-0.017
Token-
Token ID12
Feature activation-1.15e-02

Pos logit contributions
keeping+0.079
con+0.077
ls+0.072
 olds+0.071
 av+0.070
Neg logit contributions
 wish-0.059
 wake-0.056
ee-0.055
 stop-0.054
upid-0.054
Token called
Token ID1444
Feature activation-1.35e-03

Pos logit contributions
keeping+0.013
con+0.013
ls+0.012
 olds+0.012
ados+0.011
Neg logit contributions
 wish-0.011
 wake-0.010
 stop-0.010
ee-0.010
 end-0.010
Token Daisy
Token ID40355
Feature activation-2.43e-02

Pos logit contributions
led+0.003
con+0.002
keeping+0.002
ls+0.002
bes+0.002
Neg logit contributions
 wake-0.002
 stop-0.002
 watch-0.002
upid-0.002
 wish-0.002
Token,
Token ID11
Feature activation+3.02e-03

Pos logit contributions
keeping+0.043
con+0.042
ls+0.040
 olds+0.039
 av+0.038
Neg logit contributions
 wish-0.033
 wake-0.032
 stop-0.031
upid-0.030
ee-0.030
Token who
Token ID508
Feature activation+2.13e-02

Pos logit contributions
 wish+0.003
upid+0.003
 wake+0.003
ee+0.003
 stop+0.003
Neg logit contributions
con-0.006
keeping-0.006
ls-0.006
 av-0.005
 olds-0.005
Token was
Token ID373
Feature activation+5.73e-03

Pos logit contributions
upid+0.034
 wish+0.030
 stop+0.030
 bet+0.030
 wake+0.029
Neg logit contributions
keeping-0.032
con-0.030
led-0.029
bes-0.028
ls-0.028
Token nine
Token ID5193
Feature activation-1.03e-01

Pos logit contributions
upid+0.006
 wish+0.006
 bet+0.005
 wake+0.005
ee+0.005
Neg logit contributions
keeping-0.011
 av-0.011
con-0.011
 olds-0.011
ls-0.011
Token years
Token ID812
Feature activation-3.55e-02

Pos logit contributions
keeping+0.243
con+0.219
bes+0.212
led+0.209
Ell+0.203
Neg logit contributions
 wish-0.109
 wake-0.104
 end-0.103
 stop-0.102
 watch-0.101
Token old
Token ID1468
Feature activation-8.42e-03

Pos logit contributions
keeping+0.077
bes+0.073
con+0.072
led+0.068
Ell+0.067
Neg logit contributions
 happy-0.036
 wish-0.035
 wake-0.034
 end-0.033
 stop-0.033
Token.
Token ID13
Feature activation+7.35e-03

Pos logit contributions
keeping+0.020
bes+0.020
con+0.020
ls+0.020
led+0.019
Neg logit contributions
 end-0.007
 wish-0.007
 watch-0.006
 stop-0.006
 m-0.006
Token She
Token ID1375
Feature activation-1.80e-02

Pos logit contributions
 wake+0.009
 bet+0.009
 wish+0.009
 happy+0.009
 dare+0.009
Neg logit contributions
con-0.014
keeping-0.014
led-0.014
ls-0.013
bes-0.013
Token lived
Token ID5615
Feature activation-2.30e-03

Pos logit contributions
keeping+0.033
con+0.033
ls+0.031
 olds+0.031
 av+0.030
Neg logit contributions
 wish-0.023
 wake-0.022
upid-0.021
 stop-0.021
ee-0.021
Token mouse
Token ID10211
Feature activation-2.11e-02

Pos logit contributions
 wish+0.022
upid+0.022
 bet+0.021
ee+0.021
 wake+0.021
Neg logit contributions
con-0.031
keeping-0.030
 av-0.028
 olds-0.028
ls-0.028
Token named
Token ID3706
Feature activation-1.68e-02

Pos logit contributions
keeping+0.037
con+0.037
ls+0.035
 olds+0.034
 av+0.033
Neg logit contributions
 wish-0.029
 wake-0.028
 stop-0.026
upid-0.026
ee-0.026
Token Mickey
Token ID27232
Feature activation-4.60e-02

Pos logit contributions
led+0.026
ls+0.026
keeping+0.024
edly+0.024
cakes+0.024
Neg logit contributions
 wake-0.027
 sure-0.026
 stop-0.026
 wish-0.026
 dare-0.025
Token.
Token ID13
Feature activation-6.89e-03

Pos logit contributions
keeping+0.083
con+0.082
ls+0.077
 olds+0.077
led+0.074
Neg logit contributions
 wish-0.062
 wake-0.060
 stop-0.057
 end-0.056
upid-0.056
Token "
Token ID366
Feature activation-1.27e-02

Pos logit contributions
keeping+0.016
lights+0.015
led+0.015
ls+0.015
bes+0.015
Neg logit contributions
 happy-0.006
 wake-0.006
 sure-0.006
 go-0.006
 try-0.006
TokenMic
Token ID25437
Feature activation-1.00e-01

Pos logit contributions
keeping+0.023
con+0.023
ls+0.022
 olds+0.022
led+0.021
Neg logit contributions
 wish-0.016
 wake-0.016
 bet-0.015
 stop-0.015
 end-0.015
Tokenkey
Token ID2539
Feature activation-4.07e-02

Pos logit contributions
 olds+0.122
keeping+0.122
edly+0.118
lights+0.117
cakes+0.114
Neg logit contributions
ee-0.189
be-0.188
 sure-0.181
ah-0.180
 dare-0.172
Token,
Token ID11
Feature activation-8.75e-03

Pos logit contributions
keeping+0.086
con+0.078
cakes+0.078
edly+0.077
 olds+0.076
Neg logit contributions
 wish-0.048
 wake-0.046
upid-0.044
ee-0.044
 stop-0.043
Token help
Token ID1037
Feature activation-2.01e-02

Pos logit contributions
con+0.018
keeping+0.018
ls+0.017
 av+0.017
 olds+0.017
Neg logit contributions
 wish-0.009
 wake-0.008
upid-0.008
ee-0.008
 bet-0.008
Token!
Token ID0
Feature activation+1.46e-02

Pos logit contributions
keeping+0.050
con+0.049
ls+0.047
edly+0.047
bes+0.047
Neg logit contributions
 wake-0.015
 stop-0.014
 end-0.013
 wish-0.013
upid-0.012
Token The
Token ID383
Feature activation-4.05e-03

Pos logit contributions
 wake+0.016
 stop+0.015
 bet+0.014
 wish+0.014
 please+0.014
Neg logit contributions
keeping-0.030
ls-0.030
led-0.030
con-0.029
bes-0.028
Token counter
Token ID3753
Feature activation+1.32e-02

Pos logit contributions
 wish+0.033
 wake+0.031
upid+0.031
 bet+0.030
ee+0.030
Neg logit contributions
keeping-0.042
con-0.042
ls-0.039
 olds-0.039
 av-0.038
Token.
Token ID13
Feature activation-6.01e-03

Pos logit contributions
 wake+0.015
 wish+0.015
 stop+0.015
 end+0.014
 watch+0.014
Neg logit contributions
con-0.026
keeping-0.025
ls-0.025
 olds-0.024
led-0.024
Token
Token ID220
Feature activation+5.14e-04

Pos logit contributions
keeping+0.015
con+0.014
ls+0.014
 olds+0.014
led+0.014
Neg logit contributions
 sure-0.004
 wish-0.004
 wake-0.003
 happy-0.003
 understand-0.003
Token
Token ID198
Feature activation-3.33e-03

Pos logit contributions
 wish+0.001
 wake+0.001
 stop+0.001
 bang+0.001
upid+0.001
Neg logit contributions
keeping-0.001
con-0.001
ls-0.001
 olds-0.001
bes-0.001
Token
Token ID198
Feature activation-1.70e-03

Pos logit contributions
con+0.007
keeping+0.007
ls+0.006
 olds+0.006
 av+0.006
Neg logit contributions
 wish-0.004
 wake-0.004
 stop-0.003
upid-0.003
ee-0.003
TokenMic
Token ID25437
Feature activation-9.90e-02

Pos logit contributions
keeping+0.002
con+0.002
 olds+0.002
ls+0.002
ries+0.002
Neg logit contributions
 wake-0.003
 end-0.003
 wish-0.003
 sure-0.003
 understand-0.003
Tokenkey
Token ID2539
Feature activation-3.52e-02

Pos logit contributions
 olds+0.134
keeping+0.128
lights+0.126
cakes+0.124
 passages+0.114
Neg logit contributions
 sure-0.181
ee-0.178
be-0.168
ah-0.168
 end-0.165
Token wanted
Token ID2227
Feature activation+1.60e-02

Pos logit contributions
keeping+0.083
con+0.078
ados+0.075
 olds+0.074
ls+0.073
Neg logit contributions
 sure-0.033
 wake-0.032
 wish-0.032
 happy-0.029
 go-0.029
Token some
Token ID617
Feature activation-2.36e-02

Pos logit contributions
 wake+0.020
 wish+0.019
 stop+0.019
 watch+0.018
 sure+0.018
Neg logit contributions
keeping-0.031
con-0.030
 olds-0.029
ls-0.029
led-0.028
Token popcorn
Token ID38914
Feature activation-1.83e-02

Pos logit contributions
con+0.041
ls+0.040
keeping+0.039
 olds+0.039
 av+0.038
Neg logit contributions
 wish-0.030
upid-0.029
 wake-0.029
 bet-0.029
 stop-0.027
Token,
Token ID11
Feature activation-1.42e-02

Pos logit contributions
con+0.035
keeping+0.035
ls+0.033
 olds+0.033
bes+0.032
Neg logit contributions
 wish-0.022
 wake-0.022
 stop-0.021
 end-0.020
ee-0.020
Token says
Token ID1139
Feature activation-3.23e-03

Pos logit contributions
keeping+0.057
con+0.050
bes+0.047
ados+0.047
edly+0.046
Neg logit contributions
 wish-0.025
 wake-0.023
 go-0.022
 bet-0.022
 watch-0.022
Token,
Token ID11
Feature activation-3.00e-03

Pos logit contributions
keeping+0.007
con+0.006
bes+0.006
 olds+0.006
led+0.006
Neg logit contributions
 stop-0.004
 wish-0.004
 wake-0.004
 dare-0.004
 go-0.004
Token "
Token ID366
Feature activation-5.92e-03

Pos logit contributions
keeping+0.006
con+0.006
ls+0.006
 olds+0.006
 av+0.006
Neg logit contributions
 wish-0.003
 wake-0.003
upid-0.003
ee-0.003
 stop-0.003
TokenThey
Token ID2990
Feature activation-2.25e-02

Pos logit contributions
keeping+0.010
con+0.010
ls+0.010
 olds+0.009
 av+0.009
Neg logit contributions
 wish-0.008
 wake-0.008
upid-0.008
 stop-0.008
ee-0.007
Token are
Token ID389
Feature activation-1.88e-02

Pos logit contributions
keeping+0.055
con+0.055
ls+0.053
 olds+0.052
 av+0.051
Neg logit contributions
 wish-0.015
 wake-0.014
upid-0.012
 stop-0.012
ee-0.012
Token av
Token ID1196
Feature activation-9.81e-02

Pos logit contributions
keeping+0.041
con+0.039
led+0.038
ls+0.037
bes+0.036
Neg logit contributions
 wake-0.019
 wish-0.019
 stop-0.018
 happy-0.017
 sure-0.017
Tokenoc
Token ID420
Feature activation-5.36e-02

Pos logit contributions
keeping+0.183
con+0.175
 olds+0.172
ls+0.166
cakes+0.165
Neg logit contributions
ee-0.120
 wish-0.119
 wake-0.113
be-0.113
upid-0.110
Tokenados
Token ID22484
Feature activation-7.62e-02

Pos logit contributions
keeping+0.053
lights+0.047
mary+0.047
bes+0.045
chard+0.045
Neg logit contributions
 stop-0.098
 bang-0.096
upid-0.095
 alarm-0.095
ee-0.093
Token.
Token ID13
Feature activation-1.29e-02

Pos logit contributions
keeping+0.155
con+0.153
led+0.146
 olds+0.145
ls+0.143
Neg logit contributions
 wake-0.082
 stop-0.078
 wish-0.076
 go-0.074
 watch-0.074
Token My
Token ID2011
Feature activation+1.05e-02

Pos logit contributions
keeping+0.024
con+0.024
ls+0.024
led+0.024
bes+0.022
Neg logit contributions
 wake-0.015
 bet-0.015
 wish-0.015
 stop-0.015
 go-0.014
Token mom
Token ID1995
Feature activation-1.46e-02

Pos logit contributions
 wish+0.011
 wake+0.011
 stop+0.010
 dare+0.010
 plan+0.010
Neg logit contributions
keeping-0.022
con-0.021
ls-0.020
led-0.020
 olds-0.020
Token.
Token ID13
Feature activation+1.45e-02

Pos logit contributions
keeping+0.033
con+0.032
ls+0.030
 olds+0.030
ados+0.029
Neg logit contributions
 wish-0.021
 wake-0.020
 stop-0.019
 end-0.019
ee-0.019
Token He
Token ID679
Feature activation-2.59e-02

Pos logit contributions
 wish+0.021
 wake+0.020
 stop+0.019
upid+0.019
ee+0.019
Neg logit contributions
keeping-0.025
con-0.024
ls-0.023
 olds-0.023
 av-0.022
Token liked
Token ID8288
Feature activation+1.60e-02

Pos logit contributions
keeping+0.054
con+0.052
ls+0.050
 olds+0.049
ados+0.049
Neg logit contributions
 wish-0.030
 wake-0.028
ee-0.026
 stop-0.026
 watch-0.026
Token to
Token ID284
Feature activation-5.17e-03

Pos logit contributions
 wish+0.023
 wake+0.022
 stop+0.022
upid+0.021
ee+0.021
Neg logit contributions
keeping-0.027
con-0.026
ls-0.025
 olds-0.024
 av-0.024
Token eat
Token ID4483
Feature activation-2.12e-02

Pos logit contributions
keeping+0.011
con+0.011
 olds+0.011
bes+0.010
 av+0.010
Neg logit contributions
 wish-0.004
upid-0.004
ee-0.004
 wake-0.004
 bet-0.004
Token math
Token ID10688
Feature activation-9.80e-02

Pos logit contributions
keeping+0.037
con+0.036
ls+0.033
 olds+0.033
led+0.032
Neg logit contributions
 wish-0.030
 wake-0.029
 stop-0.028
 end-0.027
ee-0.027
Token.
Token ID13
Feature activation-6.31e-03

Pos logit contributions
keeping+0.191
con+0.179
 olds+0.167
ls+0.166
led+0.163
Neg logit contributions
 wish-0.120
 wake-0.116
 end-0.113
 stop-0.112
 dare-0.109
Token Every
Token ID3887
Feature activation+2.58e-02

Pos logit contributions
con+0.010
 olds+0.009
keeping+0.009
 av+0.009
ados+0.008
Neg logit contributions
 wish-0.010
 wake-0.010
ee-0.009
 stop-0.009
be-0.009
Token night
Token ID1755
Feature activation-1.66e-02

Pos logit contributions
upid+0.027
 wish+0.026
 sure+0.024
 stop+0.024
be+0.024
Neg logit contributions
con-0.048
 mushrooms-0.044
ls-0.044
keeping-0.044
 av-0.044
Token he
Token ID339
Feature activation-6.39e-03

Pos logit contributions
keeping+0.032
con+0.030
ls+0.030
bes+0.029
led+0.028
Neg logit contributions
 wake-0.022
 wish-0.021
 stop-0.021
 watch-0.021
 end-0.020
Token went
Token ID1816
Feature activation-1.67e-02

Pos logit contributions
keeping+0.012
con+0.012
ls+0.011
 olds+0.011
ados+0.011
Neg logit contributions
 wish-0.008
 wake-0.008
 stop-0.007
ee-0.007
 bet-0.007
Token home
Token ID1363
Feature activation-3.95e-04

Pos logit contributions
con+0.018
led+0.018
ados+0.018
cakes+0.018
bes+0.018
Neg logit contributions
 wish-0.004
 sure-0.004
 wake-0.004
 go-0.004
 watch-0.004
Token safely
Token ID11512
Feature activation+1.00e-02

Pos logit contributions
con+0.001
ados+0.001
ls+0.001
pill+0.001
 av+0.001
Neg logit contributions
 happy-0.000
 stop-0.000
 sure-0.000
 wish-0.000
 wake-0.000
Token.
Token ID13
Feature activation+3.73e-03

Pos logit contributions
 wish+0.011
 wake+0.011
 stop+0.011
 go+0.010
 sure+0.010
Neg logit contributions
con-0.020
keeping-0.019
ls-0.019
ados-0.019
 olds-0.019
Token\n
Token ID198
Feature activation+1.32e-02

Pos logit contributions
 wish+0.003
 wake+0.002
 sure+0.002
 bet+0.002
upid+0.002
Neg logit contributions
keeping-0.009
con-0.009
ls-0.009
 olds-0.009
led-0.009
Token\n
Token ID198
Feature activation+1.45e-02

Pos logit contributions
 wish+0.013
 wake+0.013
 stop+0.012
upid+0.012
 bet+0.012
Neg logit contributions
con-0.028
keeping-0.027
ls-0.026
bes-0.025
 olds-0.025
TokenMic
Token ID25437
Feature activation-9.79e-02

Pos logit contributions
 wish+0.022
 wake+0.021
upid+0.021
 stop+0.020
ee+0.020
Neg logit contributions
keeping-0.023
con-0.023
ls-0.021
bes-0.021
 av-0.021
Tokenkey
Token ID2539
Feature activation-2.57e-02

Pos logit contributions
 olds+0.131
lights+0.120
cakes+0.120
keeping+0.116
 passages+0.112
Neg logit contributions
ee-0.185
 sure-0.182
ah-0.179
be-0.178
 end-0.166
Token learned
Token ID4499
Feature activation-1.68e-02

Pos logit contributions
keeping+0.054
con+0.053
ls+0.051
ados+0.051
 av+0.051
Neg logit contributions
 wish-0.028
 wake-0.027
 sure-0.026
 end-0.025
 happy-0.025
Token that
Token ID326
Feature activation+3.37e-02

Pos logit contributions
keeping+0.037
con+0.036
ls+0.034
 olds+0.034
led+0.034
Neg logit contributions
 wish-0.017
 wake-0.015
 stop-0.015
 sure-0.015
upid-0.014
Token it
Token ID340
Feature activation+3.39e-03

Pos logit contributions
 wish+0.038
 wake+0.037
upid+0.037
ee+0.037
 stop+0.036
Neg logit contributions
keeping-0.065
con-0.064
ls-0.060
 av-0.060
 olds-0.059
Token's
Token ID338
Feature activation+3.62e-02

Pos logit contributions
 wish+0.003
 sure+0.003
 wake+0.002
 end+0.002
 dare+0.002
Neg logit contributions
keeping-0.009
con-0.008
 olds-0.008
bes-0.008
 av-0.008
Token.
Token ID13
Feature activation+1.47e-02

Pos logit contributions
 wish+0.005
 wake+0.005
 stop+0.005
 end+0.005
 bet+0.005
Neg logit contributions
keeping-0.008
con-0.008
ls-0.007
 olds-0.007
ados-0.007
Token She
Token ID1375
Feature activation-1.50e-02

Pos logit contributions
 wish+0.025
 wake+0.024
 stop+0.023
ee+0.023
upid+0.023
Neg logit contributions
keeping-0.021
con-0.021
 olds-0.019
ls-0.019
 av-0.019
Token loved
Token ID6151
Feature activation+1.30e-02

Pos logit contributions
keeping+0.035
con+0.033
ados+0.032
ls+0.032
 olds+0.031
Neg logit contributions
 wish-0.015
 wake-0.014
 happy-0.013
 sure-0.013
 end-0.013
Token to
Token ID284
Feature activation-1.44e-02

Pos logit contributions
 wish+0.017
 wake+0.017
 stop+0.016
 watch+0.016
ee+0.016
Neg logit contributions
keeping-0.023
con-0.023
ls-0.022
 olds-0.021
led-0.021
Token eat
Token ID4483
Feature activation-2.11e-02

Pos logit contributions
keeping+0.033
con+0.033
ls+0.031
 olds+0.031
 av+0.030
Neg logit contributions
 wish-0.012
 wake-0.011
upid-0.011
ee-0.010
 stop-0.010
Token av
Token ID1196
Feature activation-9.76e-02

Pos logit contributions
keeping+0.036
con+0.033
 olds+0.032
hog+0.031
bes+0.030
Neg logit contributions
 wake-0.031
 wish-0.031
 happy-0.030
 stop-0.029
 watch-0.029
Tokenoc
Token ID420
Feature activation-5.03e-02

Pos logit contributions
keeping+0.165
con+0.156
 olds+0.155
ls+0.151
bes+0.146
Neg logit contributions
 wish-0.136
ee-0.131
 wake-0.130
 end-0.127
 dare-0.125
Tokenados
Token ID22484
Feature activation-7.03e-02

Pos logit contributions
keeping+0.036
bes+0.034
mary+0.031
ls+0.030
 olds+0.029
Neg logit contributions
upid-0.108
ee-0.106
 dare-0.105
 wish-0.104
 mark-0.104
Token.
Token ID13
Feature activation-5.57e-03

Pos logit contributions
keeping+0.144
con+0.139
 olds+0.138
bes+0.132
ls+0.131
Neg logit contributions
 end-0.071
 wake-0.069
 stop-0.068
 go-0.067
 sure-0.067
Token One
Token ID1881
Feature activation-1.52e-02

Pos logit contributions
con+0.010
keeping+0.010
 olds+0.009
 av+0.009
ados+0.009
Neg logit contributions
 wish-0.007
 wake-0.007
ee-0.007
upid-0.007
 stop-0.007
Token day
Token ID1110
Feature activation-3.64e-02

Pos logit contributions
keeping+0.034
con+0.034
 av+0.034
ls+0.034
led+0.033
Neg logit contributions
 wish-0.012
upid-0.012
 bang-0.011
 bet-0.010
 wake-0.010
Token the
Token ID262
Feature activation-9.51e-03

Pos logit contributions
keeping+0.030
con+0.030
ls+0.029
 olds+0.028
led+0.028
Neg logit contributions
 stop-0.015
 wish-0.015
 sure-0.015
 watch-0.015
 wake-0.015
Token famous
Token ID5863
Feature activation+2.91e-02

Pos logit contributions
keeping+0.020
con+0.020
ls+0.019
 olds+0.019
led+0.019
Neg logit contributions
 wish-0.010
 wake-0.009
 stop-0.009
 end-0.009
ee-0.008
Token chef
Token ID21221
Feature activation-1.99e-02

Pos logit contributions
 wish+0.030
 wake+0.028
 end+0.026
 watch+0.026
 stop+0.026
Neg logit contributions
keeping-0.065
con-0.062
ls-0.061
led-0.059
bes-0.059
Token was
Token ID373
Feature activation+1.65e-02

Pos logit contributions
con+0.040
keeping+0.039
 olds+0.038
ls+0.037
bes+0.037
Neg logit contributions
 wish-0.024
 wake-0.023
ee-0.021
 stop-0.021
 go-0.021
Token making
Token ID1642
Feature activation-1.75e-03

Pos logit contributions
 wish+0.021
 wake+0.019
upid+0.019
ee+0.019
 bet+0.019
Neg logit contributions
keeping-0.031
con-0.030
ls-0.028
 olds-0.028
 av-0.028
Token las
Token ID39990
Feature activation-9.71e-02

Pos logit contributions
keeping+0.003
con+0.003
ls+0.003
 olds+0.003
 av+0.003
Neg logit contributions
 wish-0.002
 wake-0.002
upid-0.002
 bet-0.002
 stop-0.002
Tokenagna
Token ID48669
Feature activation-5.41e-02

Pos logit contributions
keeping+0.144
con+0.141
ls+0.133
 olds+0.129
 av+0.124
Neg logit contributions
 wish-0.159
 wake-0.157
 stop-0.152
ee-0.150
upid-0.148
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
keeping+0.096
con+0.095
ls+0.090
 olds+0.088
 av+0.086
Neg logit contributions
 wish-0.072
 wake-0.070
 stop-0.067
upid-0.066
ee-0.066
Token That
Token ID1320
Feature activation+7.37e-03

Pos logit contributions
keeping+0.027
con+0.026
 olds+0.024
ls+0.024
 av+0.024
Neg logit contributions
 wish-0.023
 wake-0.022
 stop-0.021
upid-0.021
ee-0.020
Token is
Token ID318
Feature activation-5.67e-03

Pos logit contributions
 wish+0.009
upid+0.009
 wake+0.009
 stop+0.009
 bet+0.008
Neg logit contributions
keeping-0.013
con-0.013
ls-0.012
 av-0.012
led-0.012
Token a
Token ID257
Feature activation-3.54e-02

Pos logit contributions
keeping+0.012
con+0.012
ls+0.012
 olds+0.011
led+0.011
Neg logit contributions
 wish-0.006
 wake-0.005
 stop-0.005
upid-0.005
ee-0.005
Token said
Token ID531
Feature activation+1.72e-03

Pos logit contributions
keeping+0.008
con+0.008
 olds+0.008
ados+0.007
ls+0.007
Neg logit contributions
 wish-0.004
 wake-0.003
 watch-0.003
 sure-0.003
 stop-0.003
Token,
Token ID11
Feature activation-3.28e-03

Pos logit contributions
 wish+0.002
 wake+0.002
 stop+0.002
 dare+0.002
 sure+0.002
Neg logit contributions
keeping-0.003
con-0.003
 olds-0.003
bes-0.003
ls-0.003
Token "
Token ID366
Feature activation-2.70e-03

Pos logit contributions
con+0.007
keeping+0.006
ls+0.006
 olds+0.006
 av+0.006
Neg logit contributions
 wish-0.004
 wake-0.004
upid-0.003
ee-0.003
 stop-0.003
TokenI
Token ID40
Feature activation+2.79e-02

Pos logit contributions
keeping+0.004
con+0.004
 olds+0.004
ls+0.004
bes+0.004
Neg logit contributions
 wish-0.004
upid-0.004
 stop-0.004
 wake-0.004
 plan-0.004
Token am
Token ID716
Feature activation-1.96e-02

Pos logit contributions
 crashing+0.028
upid+0.027
 ding+0.023
 end+0.023
 bum+0.023
Neg logit contributions
con-0.052
ls-0.049
keeping-0.049
bes-0.047
led-0.046
Token Doctor
Token ID9356
Feature activation-9.58e-02

Pos logit contributions
con+0.036
keeping+0.036
 olds+0.034
ls+0.034
 av+0.034
Neg logit contributions
 wish-0.023
ee-0.022
upid-0.022
 wake-0.022
 bet-0.021
Token Lily
Token ID20037
Feature activation-1.11e-02

Pos logit contributions
led+0.215
keeping+0.208
bes+0.204
ls+0.200
edly+0.197
Neg logit contributions
 sure-0.090
 stop-0.085
 wake-0.084
 mate-0.084
 watch-0.083
Token.
Token ID13
Feature activation+6.15e-03

Pos logit contributions
con+0.023
keeping+0.023
ls+0.022
 olds+0.021
led+0.021
Neg logit contributions
 wake-0.012
 wish-0.012
 stop-0.012
 sure-0.012
 happy-0.011
Token I
Token ID314
Feature activation+2.70e-02

Pos logit contributions
 wake+0.008
 wish+0.008
 stop+0.007
 bet+0.007
upid+0.007
Neg logit contributions
keeping-0.011
con-0.011
ls-0.011
led-0.011
 av-0.010
Token will
Token ID481
Feature activation-2.36e-02

Pos logit contributions
upid+0.027
 crashing+0.026
 end+0.024
 stop+0.023
 ding+0.022
Neg logit contributions
con-0.051
keeping-0.051
ls-0.049
led-0.049
bes-0.048
Token listen
Token ID6004
Feature activation-1.61e-02

Pos logit contributions
keeping+0.046
con+0.045
bes+0.043
 olds+0.041
 av+0.040
Neg logit contributions
ee-0.023
upid-0.022
 bet-0.021
 wish-0.019
ack-0.019
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
 wish+0.000
 wake+0.000
 stop+0.000
upid+0.000
 bet+0.000
Neg logit contributions
keeping-0.000
con-0.000
ls-0.000
 olds-0.000
ados-0.000
Token She
Token ID1375
Feature activation-1.62e-02

Pos logit contributions
 wish+0.015
 wake+0.015
upid+0.014
 stop+0.014
 bet+0.014
Neg logit contributions
keeping-0.018
con-0.018
ls-0.017
led-0.017
 olds-0.017
Token loved
Token ID6151
Feature activation+1.29e-02

Pos logit contributions
keeping+0.037
con+0.035
ados+0.034
ls+0.034
 olds+0.033
Neg logit contributions
 wish-0.017
 wake-0.015
 happy-0.015
 sure-0.014
 watch-0.014
Token to
Token ID284
Feature activation-1.18e-02

Pos logit contributions
 wish+0.017
 wake+0.017
 stop+0.016
 watch+0.015
 dare+0.015
Neg logit contributions
keeping-0.023
con-0.023
ls-0.022
 olds-0.021
led-0.021
Token eat
Token ID4483
Feature activation-2.13e-02

Pos logit contributions
keeping+0.027
con+0.027
ls+0.026
 olds+0.026
 av+0.025
Neg logit contributions
 wish-0.009
 wake-0.009
upid-0.008
ee-0.008
 stop-0.008
Token av
Token ID1196
Feature activation-9.56e-02

Pos logit contributions
keeping+0.038
con+0.035
 olds+0.033
hog+0.033
bes+0.032
Neg logit contributions
 wake-0.030
 wish-0.029
 happy-0.029
 sure-0.028
 watch-0.028
Tokenoc
Token ID420
Feature activation-5.18e-02

Pos logit contributions
keeping+0.161
con+0.152
 olds+0.151
ls+0.147
bes+0.143
Neg logit contributions
 wish-0.134
ee-0.130
 wake-0.129
 end-0.126
 dare-0.124
Tokenados
Token ID22484
Feature activation-6.97e-02

Pos logit contributions
keeping+0.040
bes+0.040
mary+0.038
lights+0.033
ls+0.033
Neg logit contributions
upid-0.107
ee-0.104
 dare-0.103
 mark-0.102
 wish-0.102
Token because
Token ID780
Feature activation+2.66e-02

Pos logit contributions
keeping+0.143
con+0.138
 olds+0.135
bes+0.131
ls+0.129
Neg logit contributions
 end-0.070
 wake-0.069
 stop-0.068
 sure-0.067
 go-0.067
Token they
Token ID484
Feature activation+3.06e-03

Pos logit contributions
 wish+0.034
 end+0.033
 wake+0.033
 stop+0.033
 happy+0.032
Neg logit contributions
keeping-0.049
con-0.048
bes-0.045
 olds-0.044
ls-0.043
Token were
Token ID547
Feature activation+2.77e-03

Pos logit contributions
 wish+0.004
 wake+0.004
 stop+0.003
 end+0.003
 bet+0.003
Neg logit contributions
keeping-0.006
con-0.006
ls-0.006
 olds-0.005
bes-0.005
Token B
Token ID347
Feature activation-1.44e-02

Pos logit contributions
 wish+0.015
 wake+0.014
 stop+0.013
upid+0.013
 bet+0.013
Neg logit contributions
keeping-0.022
con-0.022
ls-0.021
 olds-0.020
led-0.020
Tokenongo
Token ID25162
Feature activation-2.01e-02

Pos logit contributions
keeping+0.015
con+0.015
ls+0.014
 olds+0.013
 av+0.012
Neg logit contributions
 wish-0.030
 wake-0.029
ee-0.028
upid-0.028
 stop-0.028
Token loved
Token ID6151
Feature activation+1.53e-02

Pos logit contributions
keeping+0.048
con+0.045
 olds+0.045
ados+0.044
cakes+0.042
Neg logit contributions
 wish-0.018
 happy-0.016
 wake-0.016
ee-0.016
 end-0.015
Token to
Token ID284
Feature activation-6.37e-03

Pos logit contributions
 wish+0.023
 wake+0.022
upid+0.021
 stop+0.021
ee+0.021
Neg logit contributions
keeping-0.025
con-0.025
ls-0.023
 olds-0.023
 av-0.022
Token eat
Token ID4483
Feature activation-2.18e-02

Pos logit contributions
keeping+0.014
con+0.014
ls+0.013
 olds+0.013
 av+0.013
Neg logit contributions
 wish-0.006
 wake-0.005
upid-0.005
ee-0.005
 stop-0.005
Token av
Token ID1196
Feature activation-9.50e-02

Pos logit contributions
keeping+0.032
con+0.029
 olds+0.028
bes+0.027
led+0.027
Neg logit contributions
 wish-0.037
 wake-0.037
 stop-0.035
 happy-0.035
 end-0.035
Tokenoc
Token ID420
Feature activation-5.09e-02

Pos logit contributions
keeping+0.165
 olds+0.157
con+0.156
ls+0.151
led+0.146
Neg logit contributions
 wish-0.129
ee-0.124
 wake-0.124
 end-0.120
 dare-0.119
Tokenados
Token ID22484
Feature activation-7.26e-02

Pos logit contributions
keeping+0.044
mary+0.040
bes+0.040
lights+0.039
suits+0.036
Neg logit contributions
upid-0.104
 dare-0.102
 mark-0.100
 stop-0.100
ee-0.098
Token.
Token ID13
Feature activation-2.98e-03

Pos logit contributions
keeping+0.149
con+0.142
 olds+0.138
bes+0.133
led+0.133
Neg logit contributions
 wake-0.076
 wish-0.075
 end-0.075
 stop-0.074
 sure-0.073
Token One
Token ID1881
Feature activation-1.67e-02

Pos logit contributions
con+0.005
keeping+0.004
 olds+0.004
 av+0.004
ados+0.004
Neg logit contributions
 wish-0.005
ee-0.004
upid-0.004
 wake-0.004
 stop-0.004
Token day
Token ID1110
Feature activation-3.68e-02

Pos logit contributions
con+0.032
ls+0.032
 av+0.032
keeping+0.031
led+0.030
Neg logit contributions
upid-0.018
 wish-0.016
 bang-0.016
 bet-0.016
 crashing-0.015
Token vanished
Token ID23717
Feature activation-6.53e-03

Pos logit contributions
keeping+0.077
con+0.076
 olds+0.074
ados+0.071
ls+0.071
Neg logit contributions
 wake-0.046
 wish-0.046
 sure-0.043
 stop-0.043
ee-0.041
Token!
Token ID0
Feature activation+1.12e-02

Pos logit contributions
con+0.013
keeping+0.012
bes+0.012
 olds+0.012
pill+0.012
Neg logit contributions
 stop-0.008
 wake-0.008
 wish-0.008
 part-0.007
 go-0.007
Token
Token ID220
Feature activation+4.17e-03

Pos logit contributions
 It+0.014
 sure+0.013
 happy+0.013
 try+0.013
 bang+0.013
Neg logit contributions
keeping-0.021
ls-0.019
ries-0.019
led-0.019
glers-0.019
Token\n
Token ID198
Feature activation+2.29e-03

Pos logit contributions
 wish+0.004
 wake+0.004
 stop+0.003
upid+0.003
ee+0.003
Neg logit contributions
keeping-0.009
con-0.009
ls-0.009
 olds-0.009
 av-0.008
Token\n
Token ID198
Feature activation+3.52e-03

Pos logit contributions
 wish+0.001
 wake+0.001
 sure+0.001
upid+0.001
 bet+0.001
Neg logit contributions
keeping-0.006
con-0.005
 olds-0.005
ls-0.005
led-0.005
TokenMic
Token ID25437
Feature activation-9.47e-02

Pos logit contributions
 wake+0.006
 end+0.006
 sure+0.006
 wish+0.006
 stop+0.006
Neg logit contributions
keeping-0.005
con-0.005
 olds-0.005
ls-0.004
ries-0.004
Tokenkey
Token ID2539
Feature activation-2.98e-02

Pos logit contributions
 olds+0.130
lights+0.122
keeping+0.122
cakes+0.115
 passages+0.109
Neg logit contributions
ee-0.170
 sure-0.170
be-0.160
ah-0.158
It-0.154
Token was
Token ID373
Feature activation+1.77e-02

Pos logit contributions
keeping+0.068
con+0.066
ados+0.063
 olds+0.061
 av+0.061
Neg logit contributions
 wish-0.030
 wake-0.028
 sure-0.028
 stop-0.026
 happy-0.025
Token scared
Token ID12008
Feature activation-1.94e-03

Pos logit contributions
ee+0.017
upid+0.016
 wish+0.016
 bet+0.016
 wake+0.015
Neg logit contributions
keeping-0.038
con-0.036
 olds-0.035
ls-0.035
led-0.034
Token,
Token ID11
Feature activation-3.26e-04

Pos logit contributions
keeping+0.004
con+0.004
ls+0.003
 olds+0.003
bes+0.003
Neg logit contributions
 wish-0.002
 wake-0.002
 stop-0.002
upid-0.002
ee-0.002
Token but
Token ID475
Feature activation-1.03e-03

Pos logit contributions
keeping+0.001
con+0.001
ls+0.001
 olds+0.001
 av+0.001
Neg logit contributions
 wish-0.000
 wake-0.000
 stop-0.000
 bet-0.000
 end-0.000
Token to
Token ID284
Feature activation+1.36e-02

Pos logit contributions
 wish+0.011
 wake+0.011
 stop+0.010
 sure+0.010
 dare+0.010
Neg logit contributions
keeping-0.018
con-0.018
ls-0.017
 olds-0.017
bes-0.017
Token Mickey
Token ID27232
Feature activation-3.91e-02

Pos logit contributions
 wish+0.025
 wake+0.024
 stop+0.024
 end+0.023
upid+0.023
Neg logit contributions
keeping-0.018
con-0.018
ls-0.017
 olds-0.016
led-0.016
Token.
Token ID13
Feature activation-2.16e-03

Pos logit contributions
keeping+0.077
con+0.076
ls+0.074
 olds+0.073
cakes+0.072
Neg logit contributions
 wish-0.046
 wake-0.044
 sure-0.044
 end-0.042
 stop-0.042
Token\n
Token ID198
Feature activation+4.70e-03

Pos logit contributions
keeping+0.005
ls+0.005
led+0.005
con+0.004
 olds+0.004
Neg logit contributions
 sure-0.002
 happy-0.002
 bet-0.002
 wish-0.002
 wake-0.002
Token\n
Token ID198
Feature activation+6.53e-03

Pos logit contributions
 wish+0.005
 wake+0.004
 stop+0.004
upid+0.004
ee+0.004
Neg logit contributions
keeping-0.010
con-0.010
ls-0.010
 olds-0.009
 av-0.009
TokenMic
Token ID25437
Feature activation-9.46e-02

Pos logit contributions
 wish+0.007
 wake+0.007
 stop+0.007
upid+0.007
 sure+0.007
Neg logit contributions
keeping-0.013
con-0.013
ls-0.012
 olds-0.012
led-0.011
Tokenkey
Token ID2539
Feature activation-2.61e-02

Pos logit contributions
 olds+0.128
keeping+0.123
cakes+0.119
lights+0.118
edly+0.109
Neg logit contributions
ee-0.177
ah-0.176
 sure-0.168
be-0.167
 end-0.152
Token was
Token ID373
Feature activation+1.87e-02

Pos logit contributions
keeping+0.059
ados+0.053
ls+0.053
con+0.052
 olds+0.052
Neg logit contributions
 sure-0.028
 wish-0.027
 wake-0.026
 happy-0.026
 stop-0.025
Token very
Token ID845
Feature activation-2.05e-02

Pos logit contributions
 wish+0.019
 wake+0.018
ee+0.017
upid+0.017
 bet+0.017
Neg logit contributions
keeping-0.039
con-0.038
 olds-0.037
ls-0.036
 av-0.036
Token happy
Token ID3772
Feature activation+2.97e-02

Pos logit contributions
keeping+0.052
con+0.052
ls+0.050
 olds+0.050
 av+0.049
Neg logit contributions
 wish-0.012
 wake-0.011
upid-0.010
 stop-0.010
ee-0.010
Token and
Token ID290
Feature activation-4.81e-03

Pos logit contributions
 wish+0.031
 wake+0.030
 stop+0.029
 sure+0.028
 watch+0.028
Neg logit contributions
con-0.061
keeping-0.061
ls-0.059
 olds-0.058
led-0.057
Token.
Token ID13
Feature activation-1.86e-02

Pos logit contributions
keeping+0.005
con+0.005
ls+0.005
bes+0.005
 av+0.005
Neg logit contributions
 wish-0.004
 wake-0.004
 stop-0.004
 end-0.003
upid-0.003
Token\n
Token ID198
Feature activation-9.53e-03

Pos logit contributions
keeping+0.027
con+0.026
 olds+0.025
ados+0.023
ls+0.023
Neg logit contributions
 wish-0.031
 wake-0.030
ee-0.029
upid-0.029
 stop-0.029
Token\n
Token ID198
Feature activation-7.09e-03

Pos logit contributions
keeping+0.015
con+0.015
ls+0.014
 olds+0.014
Ell+0.013
Neg logit contributions
 wish-0.014
 wake-0.014
 bang-0.013
 stop-0.013
upid-0.013
Token"
Token ID1
Feature activation-3.20e-02

Pos logit contributions
keeping+0.009
 olds+0.009
ways+0.009
ups+0.009
bes+0.009
Neg logit contributions
 wish-0.012
 wake-0.011
 bang-0.011
upid-0.011
 stop-0.011
TokenIs
Token ID3792
Feature activation-1.61e-02

Pos logit contributions
keeping+0.054
con+0.048
Ell+0.046
 av+0.045
bes+0.044
Neg logit contributions
 wish-0.051
upid-0.049
 wake-0.045
 stop-0.044
 plan-0.044
Token math
Token ID10688
Feature activation-9.39e-02

Pos logit contributions
keeping+0.034
con+0.032
 olds+0.031
ls+0.031
bes+0.030
Neg logit contributions
 wish-0.017
 wake-0.016
 stop-0.016
ee-0.015
upid-0.015
Token hard
Token ID1327
Feature activation-2.75e-02

Pos logit contributions
keeping+0.202
con+0.187
ls+0.178
 olds+0.176
led+0.174
Neg logit contributions
 wish-0.104
 wake-0.099
 stop-0.096
 dare-0.094
 end-0.093
Token?"
Token ID1701
Feature activation+5.16e-03

Pos logit contributions
keeping+0.057
con+0.055
ls+0.053
bes+0.051
 olds+0.051
Neg logit contributions
 wish-0.030
 wake-0.030
 stop-0.028
 dare-0.027
 sure-0.027
Token asked
Token ID1965
Feature activation-1.04e-02

Pos logit contributions
 wish+0.008
 wake+0.008
upid+0.008
 stop+0.008
ee+0.008
Neg logit contributions
keeping-0.008
con-0.008
ls-0.007
 olds-0.007
 av-0.007
Token Jenny
Token ID28437
Feature activation+3.65e-03

Pos logit contributions
keeping+0.021
con+0.021
ls+0.019
bes+0.019
 olds+0.019
Neg logit contributions
 wish-0.012
 stop-0.011
 wake-0.011
 sure-0.011
 end-0.011
Token.
Token ID13
Feature activation-1.27e-03

Pos logit contributions
 wish+0.005
 wake+0.005
upid+0.005
ee+0.005
 bet+0.005
Neg logit contributions
keeping-0.006
con-0.006
 olds-0.005
ls-0.005
 av-0.005
Token the
Token ID262
Feature activation-1.49e-02

Pos logit contributions
keeping+0.011
con+0.011
ls+0.011
 olds+0.011
bes+0.010
Neg logit contributions
 sure-0.004
 wake-0.004
 happy-0.004
 end-0.004
 wish-0.004
Token price
Token ID2756
Feature activation-2.90e-02

Pos logit contributions
keeping+0.031
con+0.031
 av+0.029
 olds+0.029
ls+0.029
Neg logit contributions
 wish-0.015
 wake-0.014
upid-0.014
ee-0.014
 stop-0.013
Token -
Token ID532
Feature activation+5.57e-03

Pos logit contributions
con+0.058
keeping+0.057
ls+0.055
bes+0.054
led+0.054
Neg logit contributions
 stop-0.031
 end-0.031
 wish-0.030
 go-0.029
 dare-0.028
Token it
Token ID340
Feature activation-9.56e-03

Pos logit contributions
 wish+0.006
 wake+0.006
 stop+0.005
 sure+0.005
 dare+0.005
Neg logit contributions
keeping-0.012
con-0.012
ls-0.011
 olds-0.011
led-0.011
Token was
Token ID373
Feature activation+2.65e-04

Pos logit contributions
keeping+0.022
con+0.020
ados+0.019
 olds+0.019
edly+0.019
Neg logit contributions
 wish-0.009
 wake-0.009
 stop-0.009
 sure-0.009
 go-0.009
Token $
Token ID720
Feature activation-9.38e-02

Pos logit contributions
 wish+0.000
 wake+0.000
 stop+0.000
 end+0.000
 sure+0.000
Neg logit contributions
keeping-0.001
con-0.001
ls-0.001
 olds-0.001
led-0.001
Token5
Token ID20
Feature activation-6.77e-02

Pos logit contributions
keeping+0.155
con+0.151
ls+0.143
 olds+0.139
led+0.137
Neg logit contributions
 wish-0.136
 wake-0.133
upid-0.130
ee-0.129
 stop-0.129
Token.
Token ID13
Feature activation-7.53e-03

Pos logit contributions
keeping+0.126
con+0.121
bes+0.114
ls+0.114
led+0.113
Neg logit contributions
ee-0.083
 end-0.082
 wish-0.081
 stop-0.080
upid-0.079
Token Sally
Token ID25737
Feature activation-7.22e-03

Pos logit contributions
keeping+0.012
ls+0.012
con+0.012
led+0.012
bes+0.011
Neg logit contributions
 wish-0.011
 bet-0.011
 wake-0.011
 stop-0.010
upid-0.010
Token was
Token ID373
Feature activation+1.04e-02

Pos logit contributions
keeping+0.016
con+0.015
ls+0.015
 olds+0.014
ados+0.014
Neg logit contributions
 wish-0.007
 wake-0.007
 sure-0.007
 happy-0.007
ee-0.007
Token so
Token ID523
Feature activation-1.73e-02

Pos logit contributions
 wish+0.011
upid+0.011
ee+0.011
 wake+0.011
 bet+0.010
Neg logit contributions
keeping-0.020
con-0.019
 olds-0.019
 av-0.019
led-0.018
Token She
Token ID1375
Feature activation-1.41e-02

Pos logit contributions
keeping+0.012
con+0.012
ls+0.011
led+0.011
 olds+0.011
Neg logit contributions
 wish-0.009
 wake-0.009
 stop-0.008
upid-0.008
 bet-0.008
Token puts
Token ID7584
Feature activation-7.92e-03

Pos logit contributions
keeping+0.029
con+0.026
ados+0.025
 olds+0.024
ls+0.024
Neg logit contributions
 wish-0.020
 wake-0.019
 bang-0.017
 watch-0.017
 stop-0.017
Token on
Token ID319
Feature activation-4.30e-02

Pos logit contributions
keeping+0.017
con+0.016
 olds+0.015
ls+0.015
bes+0.015
Neg logit contributions
 wish-0.008
 wake-0.008
 stop-0.008
 end-0.007
ee-0.007
Token her
Token ID607
Feature activation-2.08e-02

Pos logit contributions
keeping+0.094
con+0.091
ls+0.088
 olds+0.087
led+0.086
Neg logit contributions
 wish-0.042
 wake-0.042
 stop-0.039
 watch-0.038
 sure-0.037
Token p
Token ID279
Feature activation+1.07e-02

Pos logit contributions
keeping+0.042
con+0.041
ls+0.039
 olds+0.039
led+0.038
Neg logit contributions
 wish-0.024
 wake-0.023
 stop-0.022
 watch-0.021
 end-0.021
Tokenaj
Token ID1228
Feature activation-9.35e-02

Pos logit contributions
 wish+0.024
 wake+0.024
upid+0.023
ee+0.023
 stop+0.023
Neg logit contributions
keeping-0.009
con-0.008
ls-0.007
 av-0.007
 olds-0.007
Tokenamas
Token ID17485
Feature activation-3.12e-02

Pos logit contributions
keeping+0.128
con+0.128
ls+0.126
 av+0.116
led+0.115
Neg logit contributions
 wake-0.156
 wish-0.149
 stop-0.147
ee-0.145
 bang-0.145
Token.
Token ID13
Feature activation-3.46e-03

Pos logit contributions
keeping+0.050
con+0.048
ls+0.044
 olds+0.044
led+0.043
Neg logit contributions
 wish-0.048
upid-0.045
 bet-0.044
 wake-0.044
 dare-0.044
Token She
Token ID1375
Feature activation-1.52e-02

Pos logit contributions
keeping+0.006
con+0.006
ls+0.006
 olds+0.005
led+0.005
Neg logit contributions
 wish-0.005
 wake-0.005
 stop-0.005
upid-0.004
 bet-0.004
Token hugs
Token ID40657
Feature activation-4.26e-03

Pos logit contributions
keeping+0.028
con+0.026
 olds+0.024
ados+0.024
ls+0.024
Neg logit contributions
 wish-0.023
 wake-0.022
 stop-0.020
 watch-0.020
ee-0.020
Token her
Token ID607
Feature activation-2.51e-02

Pos logit contributions
keeping+0.008
con+0.008
ls+0.008
 olds+0.008
 av+0.008
Neg logit contributions
 wish-0.005
 wake-0.005
 stop-0.004
upid-0.004
ee-0.004
Token stay
Token ID2652
Feature activation-1.79e-02

Pos logit contributions
keeping+0.023
con+0.023
 olds+0.022
ls+0.022
 av+0.021
Neg logit contributions
 wish-0.009
upid-0.009
ee-0.009
 wake-0.008
 bet-0.008
Token at
Token ID379
Feature activation-2.37e-03

Pos logit contributions
keeping+0.035
con+0.035
ls+0.033
 olds+0.032
ados+0.032
Neg logit contributions
 wish-0.021
 wake-0.021
 stop-0.019
ee-0.019
 bet-0.019
Token home
Token ID1363
Feature activation-2.50e-02

Pos logit contributions
keeping+0.004
con+0.004
 av+0.003
 olds+0.003
ados+0.003
Neg logit contributions
 wish-0.004
ee-0.003
be-0.003
 bet-0.003
upid-0.003
Token and
Token ID290
Feature activation+4.26e-03

Pos logit contributions
con+0.045
keeping+0.045
ls+0.042
 olds+0.041
 av+0.040
Neg logit contributions
 wish-0.033
 wake-0.033
 stop-0.031
 end-0.030
ee-0.030
Token do
Token ID466
Feature activation+3.32e-03

Pos logit contributions
upid+0.003
ee+0.003
 wish+0.003
 wake+0.003
 bet+0.003
Neg logit contributions
keeping-0.010
con-0.009
ls-0.009
 olds-0.009
 av-0.009
Token math
Token ID10688
Feature activation-9.35e-02

Pos logit contributions
 wish+0.004
 wake+0.003
ee+0.003
upid+0.003
 bet+0.003
Neg logit contributions
con-0.007
keeping-0.007
ls-0.006
 olds-0.006
 av-0.006
Token.
Token ID13
Feature activation-6.19e-03

Pos logit contributions
keeping+0.182
con+0.176
ls+0.164
 olds+0.162
ados+0.162
Neg logit contributions
 wish-0.111
 wake-0.107
 stop-0.104
 sure-0.104
 end-0.101
Token One
Token ID1881
Feature activation-1.38e-02

Pos logit contributions
con+0.009
keeping+0.009
 olds+0.009
 av+0.009
ways+0.008
Neg logit contributions
 wish-0.010
ee-0.010
 wake-0.009
upid-0.009
be-0.009
Token day
Token ID1110
Feature activation-3.27e-02

Pos logit contributions
con+0.029
keeping+0.028
ls+0.028
 av+0.027
 olds+0.027
Neg logit contributions
upid-0.012
 wish-0.011
 bet-0.011
 bang-0.011
 wake-0.010
Token,
Token ID11
Feature activation-1.29e-02

Pos logit contributions
keeping+0.063
con+0.061
 olds+0.057
bes+0.056
ls+0.056
Neg logit contributions
 stop-0.041
 wish-0.041
 wake-0.041
 watch-0.040
 dare-0.039
Token a
Token ID257
Feature activation-7.36e-03

Pos logit contributions
keeping+0.022
con+0.022
ls+0.021
 olds+0.020
 av+0.020
Neg logit contributions
 wish-0.018
 wake-0.018
 stop-0.017
upid-0.017
ee-0.017
Token asked
Token ID1965
Feature activation-1.00e-02

Pos logit contributions
 wish+0.004
 wake+0.004
 stop+0.003
 end+0.003
 go+0.003
Neg logit contributions
con-0.006
keeping-0.006
ls-0.006
bes-0.006
 olds-0.006
Token him
Token ID683
Feature activation-4.90e-03

Pos logit contributions
con+0.021
keeping+0.021
bes+0.019
ls+0.019
 olds+0.018
Neg logit contributions
 stop-0.011
 end-0.011
 wake-0.011
 wish-0.011
 sure-0.011
Token why
Token ID1521
Feature activation-2.11e-02

Pos logit contributions
keeping+0.011
con+0.010
led+0.010
bes+0.010
ls+0.010
Neg logit contributions
 wake-0.005
 stop-0.005
 sure-0.005
 wish-0.005
 dare-0.005
Token he
Token ID339
Feature activation-9.36e-03

Pos logit contributions
keeping+0.036
con+0.035
ls+0.033
 olds+0.032
led+0.032
Neg logit contributions
 wish-0.031
 wake-0.030
 stop-0.029
upid-0.028
 end-0.028
Token chose
Token ID7690
Feature activation-2.07e-02

Pos logit contributions
keeping+0.019
con+0.018
ls+0.017
 olds+0.017
ados+0.017
Neg logit contributions
 wish-0.011
 wake-0.011
 stop-0.010
 dare-0.010
 end-0.010
Token math
Token ID10688
Feature activation-9.34e-02

Pos logit contributions
keeping+0.047
con+0.046
ls+0.044
bes+0.043
 olds+0.043
Neg logit contributions
 wake-0.017
 stop-0.017
 wish-0.017
 end-0.016
 part-0.016
Token.
Token ID13
Feature activation-1.83e-02

Pos logit contributions
keeping+0.184
con+0.176
ls+0.166
 olds+0.164
led+0.161
Neg logit contributions
 wish-0.113
 wake-0.107
 end-0.105
 stop-0.105
 dare-0.101
Token Jack
Token ID3619
Feature activation-1.37e-02

Pos logit contributions
keeping+0.030
con+0.029
ls+0.028
led+0.026
bes+0.026
Neg logit contributions
 wish-0.028
 wake-0.027
 stop-0.027
 bet-0.026
 dare-0.026
Token smiled
Token ID13541
Feature activation-7.25e-03

Pos logit contributions
keeping+0.027
con+0.026
ls+0.024
ados+0.024
 olds+0.024
Neg logit contributions
 wish-0.017
 wake-0.016
 stop-0.016
 sure-0.016
 plan-0.015
Token and
Token ID290
Feature activation-5.45e-03

Pos logit contributions
keeping+0.013
con+0.012
ls+0.012
 olds+0.012
led+0.012
Neg logit contributions
 wish-0.010
 wake-0.009
upid-0.009
ee-0.009
 bet-0.009
Token said
Token ID531
Feature activation-4.24e-03

Pos logit contributions
keeping+0.011
con+0.010
ls+0.010
bes+0.009
 olds+0.009
Neg logit contributions
 wish-0.007
 sure-0.007
 stop-0.006
 wake-0.006
 watch-0.006
Token Everyday
Token ID48154
Feature activation-1.81e-02

Pos logit contributions
 wish+0.023
 wake+0.022
upid+0.021
 stop+0.020
ee+0.020
Neg logit contributions
keeping-0.034
con-0.034
ls-0.032
 olds-0.032
led-0.031
Token he
Token ID339
Feature activation-5.91e-03

Pos logit contributions
keeping+0.035
con+0.034
 olds+0.032
ls+0.032
bes+0.032
Neg logit contributions
 wish-0.023
 wake-0.022
 watch-0.021
 go-0.021
 stop-0.021
Token would
Token ID561
Feature activation-1.62e-03

Pos logit contributions
keeping+0.012
con+0.011
ls+0.011
 olds+0.011
ados+0.011
Neg logit contributions
 wish-0.007
 wake-0.007
 bet-0.007
 stop-0.007
 dare-0.007
Token take
Token ID1011
Feature activation-1.98e-02

Pos logit contributions
keeping+0.004
con+0.004
 olds+0.003
ls+0.003
 av+0.003
Neg logit contributions
 wish-0.001
upid-0.001
ee-0.001
 wake-0.001
 bet-0.001
Token the
Token ID262
Feature activation-3.91e-02

Pos logit contributions
keeping+0.044
con+0.043
ls+0.042
 olds+0.041
bes+0.040
Neg logit contributions
 wish-0.018
 wake-0.017
 sure-0.016
 stop-0.016
ee-0.016
Token av
Token ID1196
Feature activation-9.33e-02

Pos logit contributions
keeping+0.076
con+0.073
ls+0.071
 olds+0.070
led+0.069
Neg logit contributions
 wish-0.048
 wake-0.046
 stop-0.044
 end-0.042
ee-0.042
Tokenoc
Token ID420
Feature activation-2.03e-02

Pos logit contributions
keeping+0.157
con+0.151
 olds+0.146
ls+0.145
bes+0.140
Neg logit contributions
 wish-0.131
 wake-0.126
ee-0.125
upid-0.123
 bet-0.122
Tokenados
Token ID22484
Feature activation-7.07e-02

Pos logit contributions
keeping+0.012
bes+0.011
ls+0.011
con+0.010
 olds+0.010
Neg logit contributions
 wish-0.047
upid-0.047
ee-0.047
be-0.046
 end-0.046
Token and
Token ID290
Feature activation+1.65e-03

Pos logit contributions
keeping+0.130
con+0.128
 olds+0.121
ls+0.121
bes+0.119
Neg logit contributions
 wish-0.087
 wake-0.086
 stop-0.084
 end-0.082
 bet-0.081
Token scatter
Token ID41058
Feature activation+1.43e-03

Pos logit contributions
 go+0.002
 wish+0.002
 watch+0.002
 plan+0.002
 stop+0.002
Neg logit contributions
keeping-0.003
ls-0.003
con-0.003
bes-0.003
led-0.003
Token them
Token ID606
Feature activation-5.10e-03

Pos logit contributions
 end+0.001
 wish+0.001
 time+0.001
 stop+0.001
 go+0.001
Neg logit contributions
keeping-0.004
con-0.004
bes-0.004
 olds-0.003
edly-0.003
Token!
Token ID0
Feature activation+2.63e-02

Pos logit contributions
 wish+0.017
 stop+0.017
 wake+0.016
ee+0.016
 sure+0.016
Neg logit contributions
con-0.022
keeping-0.022
 olds-0.020
 av-0.019
bes-0.019
Token Go
Token ID1514
Feature activation+1.90e-02

Pos logit contributions
 wish+0.030
 wake+0.029
upid+0.028
ee+0.028
 stop+0.027
Neg logit contributions
con-0.052
keeping-0.051
ls-0.049
 olds-0.048
led-0.047
Token to
Token ID284
Feature activation+1.62e-02

Pos logit contributions
 wish+0.023
 wake+0.021
upid+0.021
 plan+0.020
ee+0.020
Neg logit contributions
keeping-0.037
con-0.034
ls-0.033
 olds-0.032
led-0.031
Token your
Token ID534
Feature activation+7.69e-03

Pos logit contributions
 wish+0.020
ee+0.019
 bet+0.019
 wake+0.019
upid+0.018
Neg logit contributions
keeping-0.030
con-0.030
 olds-0.028
ls-0.028
 av-0.027
Token room
Token ID2119
Feature activation+2.37e-02

Pos logit contributions
 wish+0.008
 wake+0.008
upid+0.008
ee+0.007
 bet+0.007
Neg logit contributions
keeping-0.016
con-0.016
ls-0.015
 olds-0.015
 av-0.014
Token now
Token ID783
Feature activation+5.31e-02

Pos logit contributions
 wish+0.029
 wake+0.028
 stop+0.028
 end+0.027
upid+0.026
Neg logit contributions
con-0.046
keeping-0.044
ls-0.042
 olds-0.042
 av-0.042
Token!"
Token ID2474
Feature activation+1.55e-02

Pos logit contributions
 wish+0.081
 wake+0.079
 stop+0.076
upid+0.075
ee+0.075
Neg logit contributions
keeping-0.085
con-0.084
ls-0.079
 olds-0.078
 av-0.075
Token
Token ID198
Feature activation+2.41e-02

Pos logit contributions
 wish+0.011
 wake+0.010
 stop+0.010
 watch+0.009
 happy+0.009
Neg logit contributions
keeping-0.038
con-0.037
led-0.035
ls-0.035
 olds-0.035
Token
Token ID198
Feature activation+2.16e-02

Pos logit contributions
 wish+0.021
 wake+0.019
 stop+0.018
upid+0.018
ee+0.017
Neg logit contributions
keeping-0.055
con-0.054
ls-0.052
 olds-0.051
 av-0.051
TokenL
Token ID43
Feature activation+3.35e-03

Pos logit contributions
 wish+0.024
 wake+0.023
 stop+0.023
 end+0.022
 bet+0.022
Neg logit contributions
keeping-0.044
con-0.044
ls-0.041
 olds-0.041
led-0.040
Tokenily
Token ID813
Feature activation+2.18e-02

Pos logit contributions
 wish+0.004
 wake+0.004
 bet+0.004
 stop+0.004
upid+0.004
Neg logit contributions
keeping-0.006
con-0.006
ls-0.006
 olds-0.006
led-0.006
Token They
Token ID1119
Feature activation+3.81e-03

Pos logit contributions
keeping+0.007
con+0.007
 olds+0.006
ls+0.006
bes+0.006
Neg logit contributions
 wish-0.005
 wake-0.004
 stop-0.004
ee-0.004
upid-0.004
Token learned
Token ID4499
Feature activation-1.70e-02

Pos logit contributions
upid+0.005
 wake+0.005
 wish+0.005
ee+0.005
 dare+0.005
Neg logit contributions
keeping-0.006
con-0.006
ls-0.006
led-0.006
 olds-0.006
Token a
Token ID257
Feature activation+4.07e-03

Pos logit contributions
keeping+0.035
con+0.035
ls+0.033
 olds+0.032
led+0.032
Neg logit contributions
 wish-0.019
 wake-0.018
 stop-0.017
 end-0.016
 sure-0.016
Token hard
Token ID1327
Feature activation+2.66e-02

Pos logit contributions
ee+0.006
upid+0.006
 bet+0.006
 wake+0.005
ass+0.005
Neg logit contributions
ls-0.006
con-0.006
 olds-0.006
 av-0.005
hog-0.005
Token lesson
Token ID11483
Feature activation-1.66e-03

Pos logit contributions
 bet+0.039
 wish+0.039
upid+0.038
ee+0.038
 mate+0.037
Neg logit contributions
con-0.042
ls-0.037
keeping-0.037
bes-0.036
led-0.033
Token that
Token ID326
Feature activation+5.31e-02

Pos logit contributions
con+0.003
keeping+0.003
 av+0.003
ados+0.003
ls+0.003
Neg logit contributions
 wish-0.002
 end-0.002
 stop-0.002
 wake-0.002
 watch-0.002
Token day
Token ID1110
Feature activation+2.23e-03

Pos logit contributions
upid+0.059
 bet+0.049
ack+0.048
 wish+0.048
ee+0.048
Neg logit contributions
keeping-0.103
con-0.097
 av-0.095
 mushrooms-0.094
ls-0.093
Token.
Token ID13
Feature activation-3.06e-03

Pos logit contributions
 wish+0.003
 wake+0.003
 stop+0.003
 end+0.003
upid+0.002
Neg logit contributions
con-0.004
keeping-0.004
 av-0.004
ls-0.004
 olds-0.004
Token<|endoftext|>
Token ID50256
Feature activation-3.90e-03

Pos logit contributions
keeping+0.006
con+0.005
 olds+0.005
ls+0.005
bes+0.005
Neg logit contributions
 wish-0.004
 wake-0.004
 stop-0.004
ee-0.004
upid-0.004
TokenSam
Token ID16305
Feature activation-1.89e-02

Pos logit contributions
con+0.008
keeping+0.008
 olds+0.007
ls+0.007
ados+0.007
Neg logit contributions
 wake-0.004
 wish-0.004
ee-0.004
 end-0.004
 stop-0.004
Token and
Token ID290
Feature activation+2.11e-02

Pos logit contributions
con+0.042
keeping+0.041
 olds+0.041
ls+0.040
ados+0.040
Neg logit contributions
 wake-0.019
 wish-0.018
ee-0.018
 end-0.017
 go-0.016
Token excited
Token ID6568
Feature activation-2.36e-03

Pos logit contributions
ee+0.018
upid+0.017
 wish+0.017
 wake+0.017
 bang+0.017
Neg logit contributions
keeping-0.033
led-0.032
con-0.032
 olds-0.032
ls-0.031
Token and
Token ID290
Feature activation-1.38e-02

Pos logit contributions
keeping+0.005
con+0.005
ls+0.004
 olds+0.004
bes+0.004
Neg logit contributions
 wish-0.003
 wake-0.003
 stop-0.003
 end-0.002
 sure-0.002
Token happy
Token ID3772
Feature activation+3.06e-02

Pos logit contributions
keeping+0.027
con+0.027
ls+0.025
 olds+0.025
ados+0.025
Neg logit contributions
 wish-0.017
 wake-0.016
 stop-0.015
 sure-0.015
 bet-0.014
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
 wish+0.039
 wake+0.038
 stop+0.036
 end+0.035
 watch+0.035
Neg logit contributions
keeping-0.057
con-0.057
ls-0.054
 olds-0.053
 av-0.052
Token The
Token ID383
Feature activation-1.16e-02

Pos logit contributions
keeping+0.019
con+0.019
ls+0.018
 olds+0.017
led+0.017
Neg logit contributions
 wish-0.014
 wake-0.013
 stop-0.013
upid-0.013
 bet-0.013
Token gloomy
Token ID46400
Feature activation+5.41e-02

Pos logit contributions
keeping+0.023
con+0.023
ls+0.022
 olds+0.021
 av+0.021
Neg logit contributions
 wish-0.013
 wake-0.013
 stop-0.012
upid-0.012
ee-0.012
Token mood
Token ID10038
Feature activation+1.98e-02

Pos logit contributions
 wish+0.053
 wake+0.051
 end+0.048
 stop+0.046
 watch+0.045
Neg logit contributions
con-0.122
keeping-0.122
ls-0.114
ados-0.112
 olds-0.112
Token was
Token ID373
Feature activation+1.47e-02

Pos logit contributions
 wake+0.024
 wish+0.024
 stop+0.024
ee+0.022
 go+0.022
Neg logit contributions
keeping-0.039
con-0.038
ados-0.036
 av-0.036
bes-0.035
Token gone
Token ID3750
Feature activation+3.64e-03

Pos logit contributions
 wish+0.018
 wake+0.017
upid+0.017
ee+0.017
 bet+0.017
Neg logit contributions
keeping-0.027
con-0.026
 olds-0.025
led-0.025
ls-0.025
Token and
Token ID290
Feature activation-8.52e-04

Pos logit contributions
 wish+0.004
 wake+0.004
 stop+0.004
 end+0.004
 sure+0.004
Neg logit contributions
con-0.007
keeping-0.007
ls-0.006
 olds-0.006
bes-0.006
Token he
Token ID339
Feature activation-4.99e-04

Pos logit contributions
con+0.002
keeping+0.002
ls+0.002
 olds+0.002
bes+0.002
Neg logit contributions
 wish-0.001
 wake-0.001
 stop-0.001
 happy-0.001
 sure-0.001
Token thirsty
Token ID47124
Feature activation+2.08e-02

Pos logit contributions
con+0.017
keeping+0.017
bes+0.016
ls+0.016
 olds+0.016
Neg logit contributions
 wish-0.008
 sure-0.008
 stop-0.007
 happy-0.007
 wake-0.007
Token and
Token ID290
Feature activation+4.05e-03

Pos logit contributions
 stop+0.022
 wake+0.022
 end+0.022
 wish+0.021
 go+0.020
Neg logit contributions
con-0.044
keeping-0.043
ls-0.041
bes-0.040
 olds-0.040
Token went
Token ID1816
Feature activation-1.16e-02

Pos logit contributions
 wish+0.005
 happy+0.005
 wake+0.005
 stop+0.005
 go+0.005
Neg logit contributions
con-0.008
keeping-0.008
bes-0.007
ls-0.007
ados-0.007
Token to
Token ID284
Feature activation-3.38e-03

Pos logit contributions
keeping+0.024
con+0.024
ls+0.023
bes+0.022
 olds+0.022
Neg logit contributions
 wake-0.012
 stop-0.011
 wish-0.011
 watch-0.011
 end-0.011
Token the
Token ID262
Feature activation-1.00e-03

Pos logit contributions
keeping+0.006
con+0.006
 olds+0.006
ls+0.006
 av+0.006
Neg logit contributions
 wish-0.004
 wake-0.004
upid-0.004
ee-0.004
 bet-0.004
Token water
Token ID1660
Feature activation+5.69e-02

Pos logit contributions
keeping+0.002
con+0.002
ls+0.002
 olds+0.002
 av+0.002
Neg logit contributions
 wish-0.001
 wake-0.001
upid-0.001
ee-0.001
 stop-0.001
Token fountain
Token ID29420
Feature activation+4.23e-02

Pos logit contributions
 wish+0.080
 wake+0.076
upid+0.073
 stop+0.073
ee+0.072
Neg logit contributions
keeping-0.098
con-0.097
ls-0.091
 olds-0.090
 av-0.087
Token to
Token ID284
Feature activation+2.64e-02

Pos logit contributions
 stop+0.044
 wake+0.044
 go+0.042
 wish+0.042
 end+0.042
Neg logit contributions
con-0.087
keeping-0.084
ls-0.080
ados-0.080
 olds-0.079
Token drink
Token ID4144
Feature activation-3.61e-03

Pos logit contributions
upid+0.024
ee+0.024
 wish+0.024
be+0.023
 bang+0.021
Neg logit contributions
keeping-0.054
con-0.051
 av-0.050
 olds-0.048
bes-0.048
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
con+0.008
keeping+0.008
bes+0.007
 olds+0.007
ls+0.007
Neg logit contributions
 stop-0.004
 end-0.004
 go-0.004
 wish-0.004
 wake-0.003
Token They
Token ID1119
Feature activation-6.78e-04

Pos logit contributions
con+0.018
keeping+0.018
ls+0.017
led+0.016
 olds+0.016
Neg logit contributions
 wish-0.017
 wake-0.016
 bet-0.016
 stop-0.016
upid-0.016
TokenThe
Token ID464
Feature activation-1.05e-02

Pos logit contributions
 wish+0.021
 wake+0.021
 stop+0.020
upid+0.019
 bang+0.018
Neg logit contributions
keeping-0.032
con-0.030
bes-0.029
 olds-0.028
ls-0.028
Token mom
Token ID1995
Feature activation-7.64e-03

Pos logit contributions
keeping+0.023
con+0.022
ls+0.021
 olds+0.021
led+0.021
Neg logit contributions
 wish-0.011
 wake-0.010
 end-0.009
 stop-0.009
ee-0.009
Token was
Token ID373
Feature activation+1.89e-02

Pos logit contributions
keeping+0.015
con+0.014
 olds+0.013
ados+0.013
ls+0.013
Neg logit contributions
 wish-0.010
 wake-0.010
ee-0.009
 stop-0.009
 sure-0.009
Token the
Token ID262
Feature activation-3.64e-02

Pos logit contributions
upid+0.021
ee+0.020
 wish+0.020
 bet+0.019
 wake+0.018
Neg logit contributions
keeping-0.037
 olds-0.034
 av-0.034
con-0.034
bes-0.034
Token boss
Token ID6478
Feature activation-2.98e-03

Pos logit contributions
con+0.076
keeping+0.076
ls+0.071
 olds+0.071
 av+0.071
Neg logit contributions
 wish-0.035
 wake-0.034
upid-0.033
ee-0.032
 bet-0.031
Tokeny
Token ID88
Feature activation+5.23e-02

Pos logit contributions
con+0.006
keeping+0.006
ls+0.006
 olds+0.005
bes+0.005
Neg logit contributions
 wish-0.003
 wake-0.003
ee-0.003
 stop-0.003
 end-0.003
Token one
Token ID530
Feature activation-1.38e-02

Pos logit contributions
 wish+0.057
 wake+0.056
 stop+0.054
 end+0.053
 watch+0.051
Neg logit contributions
keeping-0.107
con-0.105
ls-0.101
 olds-0.098
bes-0.097
Token.
Token ID13
Feature activation-4.27e-04

Pos logit contributions
con+0.024
keeping+0.023
ls+0.023
 olds+0.022
led+0.022
Neg logit contributions
 wish-0.019
 wake-0.018
upid-0.018
ee-0.017
 bet-0.017
Token She
Token ID1375
Feature activation-1.12e-02

Pos logit contributions
keeping+0.001
con+0.001
ls+0.001
led+0.001
 olds+0.001
Neg logit contributions
 wish-0.001
 wake-0.001
upid-0.001
 bet-0.001
 stop-0.001
Token told
Token ID1297
Feature activation+1.38e-03

Pos logit contributions
keeping+0.021
con+0.020
ls+0.019
ados+0.019
 olds+0.019
Neg logit contributions
 wish-0.015
 wake-0.014
 stop-0.014
ee-0.013
 bet-0.013
Token them
Token ID606
Feature activation+1.52e-02

Pos logit contributions
 wish+0.001
 wake+0.001
 stop+0.001
 sure+0.001
 end+0.001
Neg logit contributions
keeping-0.003
con-0.003
ls-0.003
bes-0.003
led-0.003
Token,
Token ID11
Feature activation+3.97e-03

Pos logit contributions
 wish+0.033
 wake+0.032
upid+0.030
ee+0.030
 bet+0.030
Neg logit contributions
keeping-0.043
con-0.042
ls-0.040
 olds-0.039
 av-0.038
Token but
Token ID475
Feature activation+1.02e-02

Pos logit contributions
 wish+0.005
 wake+0.005
upid+0.005
ee+0.005
 stop+0.005
Neg logit contributions
keeping-0.007
con-0.007
ls-0.007
 olds-0.007
 av-0.006
Token her
Token ID607
Feature activation-1.23e-02

Pos logit contributions
 wish+0.014
 wake+0.013
 stop+0.013
 happy+0.013
 sure+0.013
Neg logit contributions
con-0.019
keeping-0.019
ls-0.018
 olds-0.017
bes-0.017
Token mom
Token ID1995
Feature activation-2.38e-03

Pos logit contributions
keeping+0.026
con+0.026
ls+0.025
 olds+0.025
 av+0.024
Neg logit contributions
 wish-0.012
 wake-0.012
upid-0.011
 stop-0.011
ee-0.011
Token said
Token ID531
Feature activation+1.45e-02

Pos logit contributions
keeping+0.005
con+0.005
ls+0.004
 olds+0.004
ados+0.004
Neg logit contributions
 wish-0.003
 wake-0.003
 stop-0.003
 sure-0.003
 end-0.003
Token,
Token ID11
Feature activation+1.24e-02

Pos logit contributions
 stop+0.016
 wish+0.016
 wake+0.016
 sure+0.015
 happy+0.015
Neg logit contributions
keeping-0.031
con-0.030
 olds-0.028
ls-0.028
bes-0.028
Token "
Token ID366
Feature activation-5.50e-04

Pos logit contributions
 wish+0.015
 wake+0.014
upid+0.013
ee+0.013
 bet+0.013
Neg logit contributions
keeping-0.024
con-0.024
ls-0.023
 olds-0.023
 av-0.022
TokenDon
Token ID3987
Feature activation+2.27e-02

Pos logit contributions
keeping+0.001
con+0.001
ls+0.001
 olds+0.001
 av+0.001
Neg logit contributions
 wish-0.001
 wake-0.001
 stop-0.001
upid-0.001
ee-0.001
Token't
Token ID470
Feature activation+2.72e-02

Pos logit contributions
 wish+0.020
 wake+0.020
 stop+0.019
 end+0.018
ee+0.018
Neg logit contributions
keeping-0.051
con-0.050
ls-0.048
 olds-0.047
 av-0.047
Token worry
Token ID5490
Feature activation+2.99e-02

Pos logit contributions
 wish+0.025
ee+0.024
upid+0.024
 bet+0.023
 wake+0.022
Neg logit contributions
keeping-0.058
con-0.056
 olds-0.053
 av-0.053
ls-0.053
Token,
Token ID11
Feature activation+1.97e-02

Pos logit contributions
 stop+0.039
 wake+0.038
 wish+0.037
 end+0.036
 go+0.036
Neg logit contributions
keeping-0.061
con-0.059
bes-0.054
led-0.052
 av-0.051
Token waved
Token ID26834
Feature activation+1.05e-02

Pos logit contributions
 wish+0.004
 wake+0.004
 sure+0.003
 stop+0.003
 happy+0.003
Neg logit contributions
keeping-0.006
con-0.005
ados-0.005
 olds-0.005
ls-0.005
Token good
Token ID922
Feature activation+7.75e-04

Pos logit contributions
 wish+0.013
 wake+0.012
 stop+0.012
upid+0.011
 bet+0.011
Neg logit contributions
keeping-0.020
con-0.020
ls-0.019
 olds-0.018
bes-0.018
Token-
Token ID12
Feature activation+1.35e-02

Pos logit contributions
 wish+0.001
 wake+0.001
upid+0.001
 bet+0.001
ee+0.001
Neg logit contributions
keeping-0.001
con-0.001
ls-0.001
 olds-0.001
led-0.001
Tokenbye
Token ID16390
Feature activation-1.85e-02

Pos logit contributions
 wake+0.018
ee+0.018
 wish+0.018
 stop+0.018
 sure+0.017
Neg logit contributions
keeping-0.023
con-0.022
ls-0.022
 olds-0.022
bes-0.021
Token and
Token ID290
Feature activation+2.03e-02

Pos logit contributions
keeping+0.037
con+0.036
bes+0.036
ls+0.036
 cores+0.034
Neg logit contributions
 wake-0.021
 stop-0.020
 wish-0.019
 sure-0.019
 watch-0.019
Token thanked
Token ID26280
Feature activation+4.09e-05

Pos logit contributions
 wish+0.027
 wake+0.025
 stop+0.025
 sure+0.025
 happy+0.024
Neg logit contributions
keeping-0.039
con-0.038
bes-0.036
ls-0.036
ados-0.036
Token her
Token ID607
Feature activation+2.51e-03

Pos logit contributions
 wake+0.000
 wish+0.000
 stop+0.000
 end+0.000
 sure+0.000
Neg logit contributions
con-0.000
keeping-0.000
ls-0.000
led-0.000
bes-0.000
Token for
Token ID329
Feature activation+3.95e-02

Pos logit contributions
 wish+0.003
 wake+0.003
 stop+0.003
 end+0.003
 happy+0.003
Neg logit contributions
keeping-0.005
con-0.005
led-0.005
ls-0.005
bes-0.005
Token sharing
Token ID7373
Feature activation+7.87e-03

Pos logit contributions
 wish+0.055
upid+0.053
ee+0.052
 bet+0.052
 wake+0.050
Neg logit contributions
keeping-0.071
con-0.066
 av-0.063
 olds-0.061
ados-0.060
Token the
Token ID262
Feature activation-4.97e-03

Pos logit contributions
 wish+0.008
 wake+0.007
 stop+0.007
 end+0.007
 happy+0.007
Neg logit contributions
con-0.017
keeping-0.017
ls-0.016
 olds-0.016
led-0.015
Token treats
Token ID18432
Feature activation+1.33e-03

Pos logit contributions
keeping+0.010
con+0.010
ls+0.009
 olds+0.009
 av+0.009
Neg logit contributions
 wish-0.005
 wake-0.005
upid-0.005
ee-0.005
 stop-0.005
Token should
Token ID815
Feature activation+1.04e-02

Pos logit contributions
upid+0.021
 wish+0.020
 wake+0.020
ee+0.019
 end+0.019
Neg logit contributions
con-0.053
keeping-0.052
ls-0.051
led-0.049
 av-0.049
Token be
Token ID307
Feature activation+1.02e-02

Pos logit contributions
ee+0.008
upid+0.008
 bet+0.008
 wish+0.008
be+0.008
Neg logit contributions
keeping-0.022
con-0.022
bes-0.020
 olds-0.020
ls-0.020
Token proud
Token ID6613
Feature activation+2.28e-02

Pos logit contributions
 wish+0.009
 wake+0.009
upid+0.008
ee+0.008
 stop+0.008
Neg logit contributions
keeping-0.022
con-0.022
ls-0.021
 olds-0.021
led-0.021
Token of
Token ID286
Feature activation-8.28e-03

Pos logit contributions
 wish+0.020
 wake+0.020
 end+0.018
 stop+0.018
 dare+0.017
Neg logit contributions
keeping-0.053
con-0.053
ls-0.049
bes-0.049
led-0.048
Token them
Token ID606
Feature activation+1.57e-02

Pos logit contributions
keeping+0.018
con+0.017
ls+0.017
 olds+0.016
 av+0.016
Neg logit contributions
 wish-0.008
 wake-0.008
 stop-0.007
upid-0.007
ee-0.007
Token."
Token ID526
Feature activation+8.29e-05

Pos logit contributions
 wish+0.018
 wake+0.018
 stop+0.017
 end+0.017
 sure+0.017
Neg logit contributions
keeping-0.031
con-0.031
ls-0.029
 olds-0.028
led-0.028
Token
Token ID198
Feature activation+1.22e-02

Pos logit contributions
 happy+0.000
 wish+0.000
 watch+0.000
 sure+0.000
 bet+0.000
Neg logit contributions
keeping-0.000
led-0.000
con-0.000
glers-0.000
ls-0.000
Token
Token ID198
Feature activation+1.13e-02

Pos logit contributions
 wish+0.012
 wake+0.012
 stop+0.011
upid+0.011
 bet+0.011
Neg logit contributions
keeping-0.026
con-0.026
ls-0.025
 olds-0.024
bes-0.023
TokenMax
Token ID11518
Feature activation-2.63e-02

Pos logit contributions
 wish+0.015
 wake+0.014
 end+0.014
ee+0.014
 stop+0.014
Neg logit contributions
keeping-0.020
con-0.020
ls-0.019
led-0.019
 olds-0.019
Token thought
Token ID1807
Feature activation+9.61e-03

Pos logit contributions
keeping+0.051
con+0.048
ls+0.046
ados+0.045
 olds+0.045
Neg logit contributions
 wake-0.035
 wish-0.035
 stop-0.034
 sure-0.032
 watch-0.032
Token about
Token ID546
Feature activation-6.64e-03

Pos logit contributions
 wish+0.009
 wake+0.009
 stop+0.008
upid+0.008
 end+0.008
Neg logit contributions
keeping-0.021
con-0.021
ls-0.020
 olds-0.020
ados-0.019
Token the
Token ID262
Feature activation+1.90e-02

Pos logit contributions
 wish+0.033
 wake+0.032
 end+0.031
 stop+0.030
upid+0.029
Neg logit contributions
con-0.059
keeping-0.058
ls-0.057
 olds-0.055
led-0.054
Token paper
Token ID3348
Feature activation+1.27e-02

Pos logit contributions
 wish+0.021
 wake+0.020
 stop+0.019
 end+0.019
ee+0.018
Neg logit contributions
keeping-0.040
con-0.039
ls-0.038
 olds-0.037
led-0.036
Token,
Token ID11
Feature activation+9.31e-03

Pos logit contributions
 wish+0.016
 wake+0.016
 stop+0.015
 end+0.015
 watch+0.014
Neg logit contributions
keeping-0.023
con-0.023
ls-0.022
 olds-0.022
 av-0.021
Token and
Token ID290
Feature activation+1.20e-02

Pos logit contributions
 wish+0.010
 wake+0.010
 stop+0.010
 happy+0.009
 sure+0.009
Neg logit contributions
con-0.019
keeping-0.019
ls-0.019
 olds-0.018
ados-0.018
Token no
Token ID645
Feature activation+6.62e-03

Pos logit contributions
 wish+0.017
 sure+0.017
 happy+0.017
 stop+0.017
 wake+0.016
Neg logit contributions
con-0.022
keeping-0.020
unn-0.019
ls-0.019
bes-0.019
Token matter
Token ID2300
Feature activation+3.50e-03

Pos logit contributions
 wish+0.006
 wake+0.005
 stop+0.005
 end+0.005
ee+0.004
Neg logit contributions
keeping-0.016
con-0.015
ls-0.015
 olds-0.014
bes-0.014
Token what
Token ID644
Feature activation-1.84e-02

Pos logit contributions
 wish+0.004
 wake+0.003
 stop+0.003
upid+0.003
ee+0.003
Neg logit contributions
keeping-0.007
con-0.007
ls-0.007
 olds-0.007
 av-0.007
Token they
Token ID484
Feature activation+3.11e-02

Pos logit contributions
con+0.037
keeping+0.037
 olds+0.034
ados+0.033
ls+0.032
Neg logit contributions
 wish-0.023
 wake-0.021
 stop-0.020
 time-0.020
 sure-0.020
Token did
Token ID750
Feature activation+4.85e-03

Pos logit contributions
 wish+0.040
 wake+0.039
upid+0.039
ee+0.037
 bet+0.037
Neg logit contributions
keeping-0.055
con-0.054
ls-0.051
led-0.050
 olds-0.050
Token,
Token ID11
Feature activation+2.18e-02

Pos logit contributions
 wish+0.007
 wake+0.006
 stop+0.006
upid+0.006
 end+0.006
Neg logit contributions
con-0.009
keeping-0.008
ls-0.008
 olds-0.008
 av-0.008
Token they
Token ID484
Feature activation+2.59e-02

Pos logit contributions
 wish+0.032
 happy+0.032
 stop+0.032
 sure+0.032
 wake+0.031
Neg logit contributions
con-0.039
keeping-0.036
unn-0.035
ls-0.034
bes-0.033
Token the
Token ID262
Feature activation+1.39e-02

Pos logit contributions
 wish+0.066
upid+0.064
ee+0.064
 bet+0.063
 dare+0.059
Neg logit contributions
keeping-0.076
con-0.069
 av-0.067
bes-0.064
ados-0.063
Token lovely
Token ID14081
Feature activation+6.37e-02

Pos logit contributions
 wish+0.014
 wake+0.013
upid+0.013
ee+0.013
 bet+0.012
Neg logit contributions
keeping-0.029
con-0.029
ls-0.027
 olds-0.027
 av-0.027
Token day
Token ID1110
Feature activation+6.15e-03

Pos logit contributions
 wish+0.065
 wake+0.064
 watch+0.055
 end+0.055
 plan+0.055
Neg logit contributions
con-0.141
keeping-0.141
ls-0.137
 olds-0.134
led-0.133
Token.
Token ID13
Feature activation-1.53e-05

Pos logit contributions
 wake+0.007
 wish+0.007
 stop+0.007
 sure+0.006
 end+0.006
Neg logit contributions
con-0.012
keeping-0.012
ls-0.011
bes-0.011
ados-0.011
Token The
Token ID383
Feature activation-8.11e-03

Pos logit contributions
con+0.000
led+0.000
ls+0.000
keeping+0.000
bes+0.000
Neg logit contributions
 wish-0.000
 sure-0.000
 wake-0.000
 bet-0.000
 happy-0.000
Token prince
Token ID19716
Feature activation+1.10e-02

Pos logit contributions
keeping+0.016
con+0.016
ls+0.015
 olds+0.014
bes+0.014
Neg logit contributions
 wish-0.011
 wake-0.010
 end-0.010
 stop-0.010
 happy-0.009
Token smiled
Token ID13541
Feature activation+6.85e-03

Pos logit contributions
 wish+0.016
 wake+0.015
 stop+0.014
 happy+0.014
 sure+0.014
Neg logit contributions
keeping-0.020
con-0.019
 olds-0.018
ados-0.018
ls-0.018
Token,
Token ID11
Feature activation+1.02e-02

Pos logit contributions
 wish+0.009
 wake+0.008
upid+0.008
ee+0.008
 dare+0.008
Neg logit contributions
keeping-0.013
con-0.012
ls-0.012
 olds-0.012
led-0.012
Token waved
Token ID26834
Feature activation+9.72e-03

Pos logit contributions
 wish+0.012
upid+0.011
ee+0.011
 wake+0.011
 dare+0.011
Neg logit contributions
con-0.020
keeping-0.020
ls-0.019
 olds-0.019
 av-0.018
Token goodbye
Token ID24829
Feature activation-3.17e-02

Pos logit contributions
 wish+0.012
 wake+0.012
 stop+0.011
 end+0.011
ee+0.011
Neg logit contributions
keeping-0.019
con-0.019
ls-0.017
 olds-0.017
bes-0.017
Token and
Token ID290
Feature activation+1.53e-02

Pos logit contributions
keeping+0.065
con+0.064
ls+0.063
glers+0.061
 av+0.061
Neg logit contributions
 stop-0.034
 happy-0.033
 wake-0.033
 wish-0.033
 sure-0.031
Token High
Token ID3334
Feature activation-8.66e-03

Pos logit contributions
con+0.041
keeping+0.041
ls+0.038
 olds+0.037
 av+0.037
Neg logit contributions
 wake-0.037
 wish-0.037
upid-0.036
ee-0.036
 bet-0.035
Tokenness
Token ID1108
Feature activation+1.63e-02

Pos logit contributions
keeping+0.016
con+0.016
 olds+0.015
ls+0.015
cakes+0.015
Neg logit contributions
 wish-0.011
 wake-0.011
 stop-0.010
ee-0.010
 end-0.010
Token.
Token ID13
Feature activation-3.04e-03

Pos logit contributions
 wish+0.019
 wake+0.019
 stop+0.019
upid+0.018
ee+0.018
Neg logit contributions
keeping-0.032
con-0.031
 av-0.029
ados-0.029
 olds-0.029
Token I
Token ID314
Feature activation+3.12e-02

Pos logit contributions
keeping+0.005
con+0.005
 olds+0.005
ls+0.005
 av+0.005
Neg logit contributions
 wish-0.004
 wake-0.004
 stop-0.004
upid-0.004
ee-0.004
Token will
Token ID481
Feature activation-1.68e-02

Pos logit contributions
upid+0.027
 crashing+0.027
 wake+0.025
 stop+0.025
 bang+0.025
Neg logit contributions
keeping-0.062
con-0.062
ls-0.059
led-0.059
bes-0.058
Token hide
Token ID7808
Feature activation-1.78e-02

Pos logit contributions
keeping+0.037
con+0.036
ls+0.034
 olds+0.034
bes+0.034
Neg logit contributions
 wish-0.013
ee-0.013
upid-0.012
 wake-0.012
 bet-0.012
Token it
Token ID340
Feature activation-3.86e-03

Pos logit contributions
con+0.039
keeping+0.037
cakes+0.036
 olds+0.036
bes+0.035
Neg logit contributions
 wish-0.016
 stop-0.015
ee-0.015
 sure-0.015
 wake-0.015
Token in
Token ID287
Feature activation-1.37e-02

Pos logit contributions
keeping+0.007
con+0.007
 olds+0.007
ls+0.007
 av+0.007
Neg logit contributions
 wish-0.005
 wake-0.004
 stop-0.004
 sure-0.004
 end-0.004
Token a
Token ID257
Feature activation+1.06e-02

Pos logit contributions
keeping+0.026
con+0.026
ls+0.024
 olds+0.024
 av+0.024
Neg logit contributions
 wish-0.016
 wake-0.016
 stop-0.015
upid-0.015
ee-0.015
Token secret
Token ID3200
Feature activation+9.16e-03

Pos logit contributions
upid+0.013
ee+0.012
 wish+0.012
 wake+0.012
 understand+0.011
Neg logit contributions
con-0.019
keeping-0.019
ls-0.018
 av-0.018
 olds-0.017
Token place
Token ID1295
Feature activation+6.93e-03

Pos logit contributions
 wish+0.009
 wake+0.009
 stop+0.008
 end+0.008
 bet+0.008
Neg logit contributions
con-0.020
keeping-0.020
ls-0.019
led-0.019
 olds-0.019
Token"
Token ID1
Feature activation-3.21e-02

Pos logit contributions
keeping+0.004
con+0.004
ls+0.004
 olds+0.004
 av+0.004
Neg logit contributions
 wish-0.003
 wake-0.003
 stop-0.002
upid-0.002
ee-0.002
TokenNo
Token ID2949
Feature activation-7.87e-03

Pos logit contributions
con+0.057
keeping+0.055
ls+0.054
bes+0.051
 olds+0.051
Neg logit contributions
 wish-0.043
 wake-0.042
upid-0.040
 end-0.038
 stop-0.038
Token,
Token ID11
Feature activation-2.74e-03

Pos logit contributions
keeping+0.013
con+0.013
ls+0.012
 olds+0.012
led+0.012
Neg logit contributions
 wish-0.012
 wake-0.011
 stop-0.011
upid-0.011
 end-0.011
Token it
Token ID340
Feature activation+8.61e-03

Pos logit contributions
con+0.005
ls+0.004
keeping+0.004
ados+0.004
 av+0.004
Neg logit contributions
 wish-0.004
ee-0.003
 wake-0.003
 bet-0.003
upid-0.003
Token's
Token ID338
Feature activation+2.80e-02

Pos logit contributions
 wish+0.010
 wake+0.009
upid+0.009
 bet+0.009
 stop+0.009
Neg logit contributions
con-0.017
keeping-0.016
ls-0.016
 olds-0.016
led-0.016
Token mine
Token ID6164
Feature activation-2.52e-02

Pos logit contributions
 wish+0.028
 wake+0.027
 bet+0.026
ee+0.025
upid+0.025
Neg logit contributions
con-0.058
keeping-0.058
ls-0.056
 olds-0.055
 av-0.054
Token!"
Token ID2474
Feature activation+1.47e-02

Pos logit contributions
con+0.049
keeping+0.048
 av+0.044
pill+0.044
 olds+0.044
Neg logit contributions
 stop-0.031
 wish-0.031
 wake-0.029
 sure-0.029
 end-0.028
Token Sam
Token ID3409
Feature activation-1.22e-02

Pos logit contributions
 wish+0.020
 wake+0.019
 stop+0.018
 dare+0.018
 bet+0.018
Neg logit contributions
keeping-0.027
con-0.026
ls-0.025
 olds-0.024
led-0.024
Token says
Token ID1139
Feature activation+1.01e-02

Pos logit contributions
keeping+0.024
con+0.023
ls+0.021
 av+0.021
 olds+0.021
Neg logit contributions
 wish-0.015
 wake-0.015
 stop-0.015
 dare-0.014
 bet-0.014
Token.
Token ID13
Feature activation+4.97e-03

Pos logit contributions
 stop+0.012
 wish+0.011
 wake+0.011
 sure+0.011
 happy+0.011
Neg logit contributions
con-0.021
keeping-0.020
ls-0.019
 av-0.019
 olds-0.018
Token He
Token ID679
Feature activation-1.79e-02

Pos logit contributions
 wake+0.005
 wish+0.005
 stop+0.004
 bet+0.004
 dare+0.004
Neg logit contributions
keeping-0.011
con-0.010
ls-0.010
led-0.010
 av-0.010
Token,
Token ID11
Feature activation-1.51e-02

Pos logit contributions
keeping+0.077
con+0.074
 olds+0.070
bes+0.069
ls+0.067
Neg logit contributions
 stop-0.043
 wish-0.041
 wake-0.041
 watch-0.041
 time-0.041
Token Sab
Token ID9910
Feature activation-3.23e-02

Pos logit contributions
keeping+0.029
con+0.029
ls+0.028
 olds+0.028
led+0.028
Neg logit contributions
 stop-0.018
 wish-0.017
 sure-0.017
 watch-0.017
 happy-0.017
Tokenrina
Token ID22267
Feature activation-6.18e-03

Pos logit contributions
keeping+0.048
con+0.047
ls+0.044
 olds+0.043
 av+0.042
Neg logit contributions
 wish-0.053
 wake-0.052
 stop-0.050
upid-0.050
ee-0.049
Token and
Token ID290
Feature activation+1.30e-03

Pos logit contributions
keeping+0.013
con+0.013
ados+0.012
 olds+0.012
bes+0.012
Neg logit contributions
 wish-0.007
 wake-0.007
 go-0.006
ee-0.006
 sure-0.006
Token her
Token ID607
Feature activation-1.79e-02

Pos logit contributions
 wish+0.001
 wake+0.001
 sure+0.001
 stop+0.001
 end+0.001
Neg logit contributions
con-0.003
keeping-0.003
ls-0.003
 olds-0.003
bes-0.003
Token mom
Token ID1995
Feature activation-7.89e-03

Pos logit contributions
keeping+0.044
con+0.041
 olds+0.040
ls+0.040
bes+0.040
Neg logit contributions
 wish-0.015
 wake-0.013
 end-0.012
 watch-0.012
 stop-0.012
Token went
Token ID1816
Feature activation-2.36e-03

Pos logit contributions
con+0.015
keeping+0.015
ls+0.014
 olds+0.014
bes+0.014
Neg logit contributions
 wish-0.010
 wake-0.009
 stop-0.009
 end-0.009
ee-0.009
Token to
Token ID284
Feature activation-4.43e-03

Pos logit contributions
keeping+0.005
con+0.005
ls+0.004
led+0.004
ados+0.004
Neg logit contributions
 wake-0.003
 wish-0.003
 stop-0.003
 watch-0.002
 go-0.002
Token the
Token ID262
Feature activation-8.74e-03

Pos logit contributions
keeping+0.008
con+0.008
 av+0.007
ls+0.007
 olds+0.007
Neg logit contributions
 wish-0.006
upid-0.006
 wake-0.006
ee-0.005
 bet-0.005
Token park
Token ID3952
Feature activation+1.93e-02

Pos logit contributions
keeping+0.018
con+0.018
ls+0.017
 olds+0.017
 av+0.016
Neg logit contributions
 wish-0.009
upid-0.009
 wake-0.009
ee-0.008
 bet-0.008
Token.
Token ID13
Feature activation-6.62e-03

Pos logit contributions
 wish+0.025
 wake+0.024
 stop+0.024
 watch+0.023
 bet+0.023
Neg logit contributions
con-0.035
keeping-0.035
ls-0.033
 olds-0.032
ados-0.032
Token a
Token ID257
Feature activation+1.69e-02

Pos logit contributions
upid+0.045
 wish+0.043
 bet+0.043
ee+0.042
 wake+0.041
Neg logit contributions
keeping-0.057
con-0.053
 av-0.052
ls-0.051
bes-0.050
Token time
Token ID640
Feature activation-1.01e-02

Pos logit contributions
 wish+0.010
upid+0.010
 wake+0.009
 bet+0.009
 stop+0.009
Neg logit contributions
ls-0.038
con-0.038
bes-0.037
led-0.037
 av-0.037
Token,
Token ID11
Feature activation+7.97e-03

Pos logit contributions
keeping+0.016
con+0.016
ls+0.015
 olds+0.014
 av+0.014
Neg logit contributions
 wish-0.016
 wake-0.015
 stop-0.015
upid-0.014
ee-0.014
Token there
Token ID612
Feature activation+1.95e-02

Pos logit contributions
 wish+0.013
 wake+0.013
upid+0.012
 stop+0.012
 bang+0.012
Neg logit contributions
ls-0.011
led-0.011
con-0.010
keeping-0.010
 av-0.010
Token was
Token ID373
Feature activation+3.12e-02

Pos logit contributions
 wish+0.034
upid+0.034
 wake+0.033
 bet+0.032
 bang+0.032
Neg logit contributions
keeping-0.023
ls-0.023
con-0.023
led-0.022
bes-0.020
Token a
Token ID257
Feature activation-2.05e-02

Pos logit contributions
 wish+0.039
 wake+0.037
upid+0.037
 stop+0.036
 bet+0.035
Neg logit contributions
keeping-0.057
con-0.056
 av-0.054
ls-0.054
led-0.052
Token little
Token ID1310
Feature activation-2.90e-03

Pos logit contributions
 av+0.037
con+0.036
ls+0.036
 olds+0.034
hog+0.033
Neg logit contributions
upid-0.027
 wish-0.022
 bet-0.022
 dare-0.021
ee-0.021
Token bird
Token ID6512
Feature activation-1.55e-03

Pos logit contributions
keeping+0.006
con+0.006
ls+0.005
 olds+0.005
 av+0.005
Neg logit contributions
 wish-0.003
 wake-0.003
upid-0.003
 stop-0.003
 bet-0.003
Token named
Token ID3706
Feature activation-1.89e-02

Pos logit contributions
keeping+0.002
con+0.002
ls+0.002
led+0.002
 av+0.002
Neg logit contributions
 wish-0.003
upid-0.002
 wake-0.002
 bet-0.002
 stop-0.002
Token Tweet
Token ID18752
Feature activation-2.14e-02

Pos logit contributions
con+0.023
keeping+0.022
ls+0.021
 olds+0.020
 av+0.020
Neg logit contributions
 wish-0.037
 wake-0.035
 bet-0.034
upid-0.034
ee-0.034
Token.
Token ID13
Feature activation+1.50e-02

Pos logit contributions
keeping+0.031
con+0.031
ls+0.029
 av+0.028
 olds+0.028
Neg logit contributions
 wish-0.035
 wake-0.034
upid-0.033
 bet-0.033
 stop-0.033
Token her
Token ID607
Feature activation+2.50e-03

Pos logit contributions
 wish+0.023
ee+0.022
 bet+0.022
 wake+0.022
upid+0.022
Neg logit contributions
keeping-0.031
con-0.031
ls-0.030
 olds-0.029
 av-0.029
Token that
Token ID326
Feature activation+5.11e-02

Pos logit contributions
 wish+0.003
 wake+0.003
 stop+0.002
 sure+0.002
 dare+0.002
Neg logit contributions
keeping-0.005
con-0.005
 olds-0.005
ls-0.005
led-0.005
Token she
Token ID673
Feature activation+1.02e-02

Pos logit contributions
 wish+0.046
 wake+0.045
 stop+0.041
 end+0.040
 sure+0.040
Neg logit contributions
keeping-0.115
con-0.113
 olds-0.108
ls-0.107
led-0.105
Token will
Token ID481
Feature activation-2.97e-03

Pos logit contributions
 wish+0.013
 wake+0.012
 sure+0.012
 stop+0.012
 end+0.011
Neg logit contributions
keeping-0.020
con-0.019
 olds-0.018
ados-0.018
ls-0.018
Token be
Token ID307
Feature activation+1.32e-02

Pos logit contributions
keeping+0.006
con+0.006
 olds+0.006
ls+0.006
bes+0.006
Neg logit contributions
ee-0.003
 wish-0.003
upid-0.002
 wake-0.002
 bet-0.002
Token okay
Token ID8788
Feature activation-1.57e-03

Pos logit contributions
 wish+0.011
 wake+0.010
upid+0.009
ee+0.009
 stop+0.009
Neg logit contributions
keeping-0.031
con-0.030
ls-0.029
 olds-0.029
 av-0.028
Token.
Token ID13
Feature activation-3.30e-03

Pos logit contributions
con+0.003
keeping+0.003
ls+0.003
 olds+0.003
led+0.003
Neg logit contributions
 wish-0.002
 wake-0.002
 stop-0.002
 end-0.002
 sure-0.002
Token\n
Token ID198
Feature activation+5.69e-03

Pos logit contributions
keeping+0.006
con+0.006
ls+0.006
led+0.006
 olds+0.006
Neg logit contributions
 wish-0.004
 wake-0.004
 stop-0.004
 sure-0.004
 happy-0.004
Token\n
Token ID198
Feature activation+6.75e-03

Pos logit contributions
 wish+0.007
 wake+0.006
 stop+0.006
upid+0.006
 bet+0.006
Neg logit contributions
con-0.011
keeping-0.011
ls-0.011
 olds-0.010
bes-0.010
TokenL
Token ID43
Feature activation-1.48e-02

Pos logit contributions
 wish+0.010
 wake+0.010
upid+0.010
 stop+0.010
 bet+0.010
Neg logit contributions
keeping-0.010
con-0.010
ls-0.010
 olds-0.010
bes-0.010
Tokenily
Token ID813
Feature activation+8.98e-03

Pos logit contributions
keeping+0.028
con+0.028
ls+0.027
 olds+0.026
led+0.026
Neg logit contributions
 wish-0.018
 wake-0.017
 stop-0.016
upid-0.016
 bet-0.016
Token and
Token ID290
Feature activation+7.88e-03

Pos logit contributions
 wish+0.013
ee+0.013
upid+0.012
 wake+0.012
 bet+0.012
Neg logit contributions
con-0.016
keeping-0.016
ls-0.015
 av-0.015
ados-0.014
Token use
Token ID779
Feature activation-2.35e-02

Pos logit contributions
 wish+0.008
 wake+0.007
 stop+0.007
 watch+0.006
 dare+0.006
Neg logit contributions
keeping-0.018
con-0.018
ls-0.017
 olds-0.017
bes-0.016
Token his
Token ID465
Feature activation-3.07e-02

Pos logit contributions
keeping+0.049
con+0.049
ls+0.047
 av+0.046
 olds+0.046
Neg logit contributions
 wish-0.024
 wake-0.023
 stop-0.021
ee-0.021
upid-0.021
Token knowledge
Token ID3725
Feature activation-3.57e-02

Pos logit contributions
keeping+0.059
con+0.059
ls+0.056
 olds+0.055
 av+0.054
Neg logit contributions
 wish-0.036
 wake-0.035
 stop-0.033
upid-0.033
ee-0.033
Token to
Token ID284
Feature activation+1.76e-02

Pos logit contributions
keeping+0.068
con+0.067
ls+0.064
 olds+0.063
bes+0.063
Neg logit contributions
 wish-0.043
 wake-0.040
 stop-0.040
 end-0.039
 dare-0.038
Token solve
Token ID8494
Feature activation-6.28e-02

Pos logit contributions
ee+0.018
upid+0.017
be+0.017
 wish+0.016
 bet+0.016
Neg logit contributions
keeping-0.035
con-0.034
 av-0.034
 olds-0.032
ls-0.032
Token difficult
Token ID2408
Feature activation-3.22e-03

Pos logit contributions
keeping+0.132
con+0.131
 olds+0.123
ls+0.121
bes+0.120
Neg logit contributions
 wish-0.065
 wake-0.059
upid-0.057
 end-0.056
ee-0.056
Token problems
Token ID2761
Feature activation-4.51e-02

Pos logit contributions
con+0.006
keeping+0.006
cakes+0.005
bes+0.005
Ell+0.005
Neg logit contributions
 wish-0.004
 dare-0.004
 wake-0.004
 plan-0.004
 end-0.004
Token.
Token ID13
Feature activation-7.60e-03

Pos logit contributions
con+0.087
keeping+0.087
ls+0.082
 olds+0.081
ados+0.080
Neg logit contributions
 wish-0.049
 wake-0.048
 stop-0.047
 sure-0.046
 end-0.046
Token\n
Token ID198
Feature activation-5.09e-03

Pos logit contributions
con+0.015
keeping+0.015
 olds+0.014
ls+0.014
 av+0.014
Neg logit contributions
 wish-0.009
 wake-0.008
 stop-0.008
ee-0.008
upid-0.008
Token\n
Token ID198
Feature activation-3.71e-03

Pos logit contributions
con+0.008
ls+0.008
keeping+0.007
Ell+0.007
 olds+0.007
Neg logit contributions
 wish-0.008
 wake-0.007
 stop-0.007
upid-0.007
 wait-0.007
Token were
Token ID547
Feature activation+2.72e-02

Pos logit contributions
con+0.014
keeping+0.014
 olds+0.013
ls+0.013
ados+0.013
Neg logit contributions
 wish-0.007
 wake-0.007
 stop-0.006
 go-0.006
 end-0.006
Token playing
Token ID2712
Feature activation+1.61e-03

Pos logit contributions
 wish+0.032
 wake+0.030
upid+0.029
 stop+0.029
 bet+0.029
Neg logit contributions
keeping-0.051
con-0.051
ls-0.051
 av-0.050
 olds-0.049
Token cow
Token ID9875
Feature activation-5.16e-02

Pos logit contributions
 wish+0.002
 wake+0.002
 stop+0.002
upid+0.002
ee+0.002
Neg logit contributions
keeping-0.003
con-0.003
ls-0.003
 olds-0.002
 av-0.002
Tokenboys
Token ID13202
Feature activation-3.99e-02

Pos logit contributions
keeping+0.115
 olds+0.111
con+0.107
unn+0.103
riet+0.103
Neg logit contributions
 watch-0.048
 go-0.047
 part-0.046
 happy-0.045
 wake-0.044
Token and
Token ID290
Feature activation+1.43e-02

Pos logit contributions
keeping+0.092
con+0.091
 olds+0.088
ls+0.085
cakes+0.084
Neg logit contributions
 wake-0.033
 stop-0.032
 end-0.030
 go-0.030
 watch-0.029
Token robbers
Token ID46988
Feature activation-5.32e-02

Pos logit contributions
 wake+0.021
 watch+0.021
 go+0.020
 happy+0.020
 stop+0.020
Neg logit contributions
keeping-0.026
led-0.025
 olds-0.025
ls-0.025
con-0.024
Token in
Token ID287
Feature activation+5.12e-04

Pos logit contributions
keeping+0.110
con+0.108
 olds+0.106
ls+0.104
cakes+0.102
Neg logit contributions
 end-0.056
 wake-0.054
 stop-0.053
 watch-0.053
 go-0.052
Token the
Token ID262
Feature activation+1.17e-02

Pos logit contributions
 wish+0.001
 wake+0.001
 stop+0.001
upid+0.001
ee+0.001
Neg logit contributions
keeping-0.001
con-0.001
ls-0.001
 olds-0.001
 av-0.001
Token backyard
Token ID24296
Feature activation+1.93e-02

Pos logit contributions
 wish+0.014
 wake+0.013
upid+0.012
 stop+0.012
ee+0.012
Neg logit contributions
keeping-0.023
con-0.023
ls-0.022
 olds-0.021
 av-0.021
Token.
Token ID13
Feature activation+3.13e-03

Pos logit contributions
 wish+0.023
 wake+0.023
 end+0.022
 stop+0.022
 watch+0.022
Neg logit contributions
keeping-0.037
con-0.037
ls-0.034
 olds-0.034
ados-0.033
Token Lily
Token ID20037
Feature activation-3.25e-03

Pos logit contributions
 wake+0.004
 wish+0.004
 bet+0.004
 end+0.004
 stop+0.003
Neg logit contributions
keeping-0.006
con-0.006
ls-0.006
led-0.005
 olds-0.005
Token "
Token ID366
Feature activation+3.10e-03

Pos logit contributions
 wish+0.004
ee+0.004
upid+0.004
 wake+0.004
 dare+0.003
Neg logit contributions
con-0.005
keeping-0.005
 av-0.005
 olds-0.004
ls-0.004
TokenHow
Token ID2437
Feature activation-1.69e-02

Pos logit contributions
 wish+0.004
 wake+0.004
upid+0.004
 stop+0.004
ee+0.004
Neg logit contributions
keeping-0.006
con-0.005
 av-0.005
ls-0.005
 olds-0.005
Token do
Token ID466
Feature activation-4.14e-04

Pos logit contributions
 av+0.031
ls+0.031
con+0.030
 olds+0.029
cakes+0.029
Neg logit contributions
 crashing-0.016
 wake-0.015
 bang-0.015
be-0.015
 bet-0.015
Token you
Token ID345
Feature activation-2.54e-03

Pos logit contributions
keeping+0.001
con+0.001
led+0.001
bes+0.001
ls+0.001
Neg logit contributions
 wish-0.000
 wake-0.000
 stop-0.000
 dare-0.000
 watch-0.000
Token make
Token ID787
Feature activation-1.32e-02

Pos logit contributions
keeping+0.005
con+0.005
 av+0.005
 olds+0.005
led+0.005
Neg logit contributions
ee-0.002
upid-0.002
 wish-0.002
 wake-0.002
 crashing-0.002
Token things
Token ID1243
Feature activation-5.50e-02

Pos logit contributions
con+0.025
keeping+0.025
 av+0.024
 olds+0.024
ls+0.024
Neg logit contributions
 wish-0.015
 wake-0.015
 bet-0.015
upid-0.014
ee-0.014
Token vanish
Token ID39572
Feature activation-1.32e-02

Pos logit contributions
con+0.105
keeping+0.103
ls+0.101
 av+0.101
 olds+0.100
Neg logit contributions
 wish-0.060
ee-0.058
upid-0.057
 wake-0.057
 bet-0.056
Token?"
Token ID1701
Feature activation+5.65e-03

Pos logit contributions
keeping+0.026
con+0.026
ls+0.024
 olds+0.024
bes+0.023
Neg logit contributions
 wish-0.015
 wake-0.015
 stop-0.014
upid-0.014
 end-0.014
Token The
Token ID383
Feature activation-5.30e-03

Pos logit contributions
 wish+0.007
 wake+0.006
 stop+0.006
upid+0.006
ee+0.006
Neg logit contributions
keeping-0.011
con-0.011
ls-0.010
 olds-0.010
 av-0.010
Token magician
Token ID42055
Feature activation-6.36e-03

Pos logit contributions
con+0.009
keeping+0.009
ls+0.008
 av+0.008
 olds+0.008
Neg logit contributions
upid-0.007
 wish-0.007
 wake-0.007
ee-0.007
 bet-0.007
Token smiled
Token ID13541
Feature activation-8.93e-04

Pos logit contributions
keeping+0.011
con+0.010
ls+0.010
 olds+0.010
ados+0.009
Neg logit contributions
 wish-0.010
 wake-0.009
 stop-0.009
ee-0.009
 sure-0.009
Token.
Token ID13
Feature activation+1.41e-03

Pos logit contributions
con+0.040
keeping+0.037
Ell+0.036
bes+0.036
mary+0.035
Neg logit contributions
 wish-0.017
 sure-0.016
 happy-0.015
 want-0.015
 time-0.014
Token Let
Token ID3914
Feature activation+3.44e-03

Pos logit contributions
 wish+0.002
 wake+0.002
 stop+0.002
 bet+0.002
upid+0.002
Neg logit contributions
keeping-0.003
con-0.003
ls-0.003
led-0.002
 olds-0.002
Token's
Token ID338
Feature activation+4.37e-02

Pos logit contributions
 wish+0.004
upid+0.004
 bet+0.004
 wake+0.004
ee+0.004
Neg logit contributions
con-0.006
keeping-0.006
ls-0.006
 av-0.006
 olds-0.006
Token put
Token ID1234
Feature activation+3.65e-03

Pos logit contributions
upid+0.042
ee+0.040
 bet+0.038
 crashing+0.036
 mate+0.035
Neg logit contributions
keeping-0.083
 av-0.082
con-0.080
ados-0.076
 olds-0.075
Token the
Token ID262
Feature activation-3.06e-02

Pos logit contributions
 wish+0.004
 wake+0.004
upid+0.004
 bet+0.004
 dare+0.004
Neg logit contributions
keeping-0.007
con-0.007
ls-0.007
 av-0.007
ados-0.007
Token pizz
Token ID38748
Feature activation-5.74e-02

Pos logit contributions
con+0.042
keeping+0.041
 av+0.041
ls+0.039
ados+0.038
Neg logit contributions
 wish-0.051
upid-0.050
 wake-0.049
 bet-0.049
ee-0.048
Tokenas
Token ID292
Feature activation-6.88e-03

Pos logit contributions
keeping+0.096
con+0.094
ls+0.090
 olds+0.086
led+0.086
Neg logit contributions
 wish-0.085
 wake-0.081
upid-0.077
 stop-0.076
 bet-0.076
Token in
Token ID287
Feature activation-2.00e-02

Pos logit contributions
keeping+0.013
con+0.013
ls+0.013
 olds+0.012
 av+0.012
Neg logit contributions
 wish-0.008
 wake-0.008
upid-0.007
 bet-0.007
 stop-0.007
Token the
Token ID262
Feature activation+2.29e-03

Pos logit contributions
keeping+0.037
con+0.035
ls+0.034
 av+0.034
 olds+0.034
Neg logit contributions
 wish-0.025
 wake-0.025
upid-0.023
 stop-0.023
 bet-0.023
Token oven
Token ID14361
Feature activation-2.90e-02

Pos logit contributions
 wish+0.003
upid+0.003
ee+0.003
 wake+0.003
 sure+0.003
Neg logit contributions
con-0.004
keeping-0.004
 av-0.004
ls-0.004
 olds-0.004
Token and
Token ID290
Feature activation+1.98e-02

Pos logit contributions
con+0.057
keeping+0.056
ls+0.054
 olds+0.052
 av+0.052
Neg logit contributions
 wish-0.033
 wake-0.032
 stop-0.031
 watch-0.030
 end-0.029
Token Tim
Token ID5045
Feature activation-2.47e-02

Pos logit contributions
 wish+0.019
 wake+0.018
 stop+0.018
upid+0.018
ee+0.018
Neg logit contributions
keeping-0.020
con-0.020
ls-0.018
 olds-0.018
 av-0.018
Tokenmy
Token ID1820
Feature activation-1.53e-02

Pos logit contributions
keeping+0.041
con+0.039
led+0.038
ls+0.037
 av+0.036
Neg logit contributions
 wish-0.035
 wake-0.034
upid-0.034
 stop-0.033
 bet-0.033
Token loved
Token ID6151
Feature activation+1.47e-02

Pos logit contributions
keeping+0.029
con+0.028
ls+0.027
 olds+0.026
led+0.026
Neg logit contributions
 wish-0.019
 wake-0.018
upid-0.017
 stop-0.017
 bet-0.017
Token to
Token ID284
Feature activation-8.67e-03

Pos logit contributions
 wish+0.022
 wake+0.021
upid+0.021
 bet+0.020
 stop+0.020
Neg logit contributions
keeping-0.024
con-0.024
ls-0.022
 av-0.022
 olds-0.022
Token bake
Token ID28450
Feature activation-3.37e-02

Pos logit contributions
con+0.018
keeping+0.018
 av+0.017
 olds+0.017
bes+0.017
Neg logit contributions
upid-0.008
 wish-0.008
ee-0.007
 bet-0.007
 end-0.007
Token cakes
Token ID30849
Feature activation-5.47e-02

Pos logit contributions
keeping+0.057
con+0.054
ls+0.050
 olds+0.049
led+0.048
Neg logit contributions
 wish-0.049
 wake-0.049
 stop-0.048
 watch-0.046
 happy-0.046
Token with
Token ID351
Feature activation-8.97e-03

Pos logit contributions
keeping+0.096
con+0.094
ls+0.087
 olds+0.086
led+0.084
Neg logit contributions
 wish-0.075
 wake-0.074
 stop-0.072
 end-0.070
 watch-0.069
Token his
Token ID465
Feature activation-3.76e-02

Pos logit contributions
keeping+0.021
con+0.021
ls+0.020
 olds+0.020
bes+0.019
Neg logit contributions
 wish-0.007
 wake-0.007
 stop-0.006
ee-0.006
 end-0.006
Token mom
Token ID1995
Feature activation-1.73e-02

Pos logit contributions
keeping+0.075
con+0.074
ls+0.070
 olds+0.069
 av+0.068
Neg logit contributions
 wish-0.043
 wake-0.041
upid-0.039
 stop-0.039
ee-0.038
Tokenmy
Token ID1820
Feature activation-4.15e-03

Pos logit contributions
keeping+0.028
con+0.027
 av+0.026
ls+0.025
 olds+0.025
Neg logit contributions
 wish-0.026
upid-0.025
 bet-0.024
 wake-0.024
 stop-0.023
Token.
Token ID13
Feature activation+2.32e-03

Pos logit contributions
keeping+0.007
con+0.007
ls+0.007
 olds+0.006
 av+0.006
Neg logit contributions
 wish-0.006
 wake-0.006
upid-0.005
 stop-0.005
ee-0.005