TOP ACTIVATIONS
MAX = 0.085

" What 's in the tube ?" " It
" What 's in the box ?" she asked her mom
do you have in the sack , Tommy ?"
they were finished , Jack and Jill were both so happy
" What 's in the box ?" he asked .
. " Where are you going ?", asked Jill . "
" What is in the box ?" Tom asks .
they were done , Jack and Jill hopped onto the bed
" What if we look outside ?" Her mom
in the air . Jack and Jill quickly grabbed their rods
" What is in the box ?" Tom asked . "
, what 's in the tube ?" asked Tommy .
, what is in the box ?" Her dad smiled and
" Where are you going ?" asked the flower next
games . Jack and Jill were the best of
Pete . Where are you going ?" " I
do you think you 're going ?! You 're not allowed
, " Where are you going ?". The pear didn 't
Smith ! Where are you going ?" asked Lily .
" Where are we going ?" asked Toby , bouncing

MIN ACTIVATIONS
MIN = -0.090

put the cheese and the ham on the bread . Mom
made their favorite food : mac aron i and cheese .
man with a hat and a scarf . They threw snow
all the popcorn and drank some water . Tim my was
nose and a hat and a scarf . They lay down
some sa us ages and some bread . She gave a
he ate all the ch ocol ates and sweets . Then
man a carrot nose and a hat and a scarf .
a t eddy bear and a glass of water . She
her a box of ch ocol ates . Lily opened the
noticed something . The ch ocol ates were bitter . They
each a warm blanket and a candy cane . He snapped
bled up all the ch ocol ates , and she was
as he opened the ch ocol ates and started eating them
gave her a kiss and a hug . They said good
L ily , these ch ocol ates are bitter ," he
box and share the ch ocol ates . They are happy
loved Mom and they loved mac aron i and cheese .
queen . They wore crown s made of leaves and flowers
They used red juice and white powder and mixed them in

INTERVAL 0.041 - 0.085
CONTAINS 0.675%

Y ay , you did it , Tom !" Lily said
angry . They want to go on the slide too ,
a rabbit . I 'm looking for food ," said the
and Sarah couldn 't catch it . Sarah wanted
to find out what it all meant .

INTERVAL -0.003 - 0.041
CONTAINS 56.341%

to go to the park . When she got there ,
kind friend . You should keep the note . Maybe you
He decided he had to pack everything up and go
the cinema . May was so excited , but a bit
saw a big p uddle . She wanted to test if

INTERVAL -0.046 - -0.003
CONTAINS 42.554%

to be patient . She waited for Ben to come back
a doll that could stir a spoon in a pot and
said , " I don 't feel like playing today ,
always climbing up high things . One day he decided to
sometimes very stupid . He wanted to show everyone how silly

INTERVAL -0.090 - -0.046
CONTAINS 0.431%

ball of yarn and had a lot of fun . <|endoftext|>
them . She gives them a kiss and a cookie .
leave , Lily gave her a big hug and said goodbye
They hugged and said , " We love the ostr ich
, like lions , ze br as , and gir aff
Token "
Token ID366
Feature activation-9.59e-03

Pos logit contributions
 down+0.006
arted+0.006
 up+0.006
ivated+0.006
?!+0.006
Neg logit contributions
T-0.007
paste-0.007
 spe-0.007
Snow-0.007
engu-0.006
TokenWhat
Token ID2061
Feature activation-2.98e-02

Pos logit contributions
engu+0.019
paste+0.018
 spe+0.018
ves+0.018
lan+0.017
Neg logit contributions
 down-0.015
 up-0.014
?!-0.014
 forth-0.013
arted-0.013
Token's
Token ID338
Feature activation+5.82e-03

Pos logit contributions
Snow+0.062
Ar+0.060
paste+0.060
 spe+0.059
lan+0.057
Neg logit contributions
 up-0.042
 down-0.040
?!-0.040
?!"-0.035
!?-0.035
Token in
Token ID287
Feature activation+3.23e-02

Pos logit contributions
 down+0.010
 up+0.010
?!+0.009
 gonna+0.008
arted+0.008
Neg logit contributions
paste-0.010
T-0.010
engu-0.010
Snow-0.010
 spe-0.010
Token the
Token ID262
Feature activation+1.78e-02

Pos logit contributions
 down+0.040
 up+0.038
 alone+0.034
?!+0.033
ivated+0.033
Neg logit contributions
paste-0.073
engu-0.071
Snow-0.070
T-0.070
Ar-0.068
Token tube
Token ID12403
Feature activation+8.54e-02

Pos logit contributions
 down+0.034
 up+0.033
?!+0.032
!?+0.030
 alone+0.030
Neg logit contributions
paste-0.027
Snow-0.026
 spe-0.026
T-0.025
engu-0.025
Token?"
Token ID1701
Feature activation+2.20e-02

Pos logit contributions
arted+0.121
ivated+0.100
 down+0.099
uct+0.096
zie+0.093
Neg logit contributions
paste-0.155
engu-0.153
T-0.149
 spe-0.149
 ra-0.148
Token
Token ID198
Feature activation+3.25e-03

Pos logit contributions
 down+0.046
arted+0.045
 up+0.044
ivated+0.044
?!+0.042
Neg logit contributions
T-0.028
Snow-0.026
paste-0.026
engu-0.025
 spe-0.025
Token
Token ID198
Feature activation+3.12e-02

Pos logit contributions
 down+0.007
 up+0.007
arted+0.007
ivated+0.007
?!+0.006
Neg logit contributions
T-0.004
paste-0.004
Snow-0.004
 spe-0.004
engu-0.004
Token"
Token ID1
Feature activation-2.79e-02

Pos logit contributions
 down+0.070
arted+0.069
 up+0.067
ivated+0.066
?!+0.064
Neg logit contributions
T-0.038
Snow-0.035
paste-0.033
Ar-0.033
engu-0.032
TokenIt
Token ID1026
Feature activation+1.62e-02

Pos logit contributions
paste+0.052
 spe+0.052
 souven+0.051
engu+0.051
bread+0.050
Neg logit contributions
ivated-0.036
 down-0.036
?!-0.035
 up-0.035
!?-0.034
Token "
Token ID366
Feature activation-4.93e-03

Pos logit contributions
arted+0.033
 down+0.033
ivated+0.032
 up+0.032
?!+0.032
Neg logit contributions
T-0.039
Snow-0.038
paste-0.037
 spe-0.037
engu-0.036
TokenWhat
Token ID2061
Feature activation-1.40e-02

Pos logit contributions
paste+0.009
engu+0.008
 spe+0.008
lan+0.008
ves+0.008
Neg logit contributions
 down-0.008
 up-0.008
ivated-0.008
arted-0.007
?!-0.007
Token's
Token ID338
Feature activation+2.47e-02

Pos logit contributions
paste+0.026
Snow+0.026
 spe+0.026
Ar+0.025
T+0.025
Neg logit contributions
 down-0.022
 up-0.021
?!-0.020
!?-0.018
 gonna-0.018
Token in
Token ID287
Feature activation+3.89e-02

Pos logit contributions
arted+0.044
ivated+0.041
 down+0.040
?!+0.038
zie+0.038
Neg logit contributions
 spe-0.039
Snow-0.038
paste-0.037
T-0.036
lan-0.036
Token the
Token ID262
Feature activation+8.02e-03

Pos logit contributions
 down+0.053
 up+0.050
arted+0.047
ivated+0.046
?!+0.046
Neg logit contributions
paste-0.079
T-0.078
Snow-0.078
 spe-0.078
engu-0.077
Token box
Token ID3091
Feature activation+8.44e-02

Pos logit contributions
 down+0.014
 up+0.014
arted+0.013
?!+0.013
ivated+0.013
Neg logit contributions
paste-0.013
 spe-0.013
T-0.013
Snow-0.013
engu-0.013
Token?"
Token ID1701
Feature activation+1.16e-02

Pos logit contributions
arted+0.136
ivated+0.107
zie+0.105
uct+0.104
ags+0.102
Neg logit contributions
 spe-0.143
engu-0.141
paste-0.141
 sticker-0.137
brush-0.136
Token she
Token ID673
Feature activation+7.92e-03

Pos logit contributions
arted+0.022
 down+0.021
ivated+0.020
?!+0.020
 up+0.020
Neg logit contributions
T-0.017
paste-0.017
Snow-0.017
 spe-0.016
 ham-0.015
Token asked
Token ID1965
Feature activation+1.48e-02

Pos logit contributions
 down+0.012
 up+0.011
arted+0.011
ivated+0.010
?!+0.010
Neg logit contributions
T-0.015
paste-0.015
Snow-0.015
 spe-0.015
engu-0.015
Token her
Token ID607
Feature activation-5.34e-03

Pos logit contributions
 down+0.026
 up+0.025
arted+0.024
ivated+0.024
?!+0.023
Neg logit contributions
paste-0.024
T-0.024
 spe-0.024
Snow-0.023
engu-0.023
Token mom
Token ID1995
Feature activation+9.94e-03

Pos logit contributions
paste+0.007
 spe+0.007
Snow+0.007
T+0.007
engu+0.007
Neg logit contributions
 down-0.011
 up-0.011
arted-0.010
ivated-0.010
?!-0.010
Token do
Token ID466
Feature activation+1.16e-02

Pos logit contributions
Snow+0.036
 spe+0.036
paste+0.036
T+0.034
Ar+0.034
Neg logit contributions
 down-0.033
 up-0.032
?!-0.031
!?-0.029
 gonna-0.028
Token you
Token ID345
Feature activation+8.07e-03

Pos logit contributions
 down+0.016
 up+0.015
arted+0.015
?!+0.014
ivated+0.014
Neg logit contributions
paste-0.023
 spe-0.022
T-0.022
Snow-0.022
engu-0.022
Token have
Token ID423
Feature activation+2.19e-02

Pos logit contributions
arted+0.016
 down+0.015
 up+0.013
uct+0.013
 forth+0.013
Neg logit contributions
Ar-0.010
engu-0.010
T-0.010
 spe-0.010
Snow-0.009
Token in
Token ID287
Feature activation+3.57e-02

Pos logit contributions
 down+0.033
arted+0.033
 up+0.031
ivated+0.030
 forth+0.029
Neg logit contributions
 spe-0.039
T-0.039
paste-0.039
Snow-0.039
engu-0.038
Token the
Token ID262
Feature activation+2.45e-02

Pos logit contributions
arted+0.054
 down+0.053
ivated+0.050
 up+0.049
ieved+0.047
Neg logit contributions
 spe-0.065
Snow-0.062
T-0.061
paste-0.060
engu-0.059
Token sack
Token ID23704
Feature activation+8.40e-02

Pos logit contributions
arted+0.047
ivated+0.045
ieved+0.044
rying+0.042
uct+0.041
Neg logit contributions
 ham-0.034
 pill-0.034
Snow-0.032
tr-0.031
 cheese-0.031
Token,
Token ID11
Feature activation+4.11e-02

Pos logit contributions
arted+0.122
ivated+0.100
zie+0.094
 down+0.094
uct+0.092
Neg logit contributions
 spe-0.151
engu-0.150
Snow-0.148
paste-0.148
 sticker-0.147
Token Tommy
Token ID19919
Feature activation+4.40e-02

Pos logit contributions
 down+0.066
arted+0.064
ivated+0.063
 up+0.061
 alone+0.060
Neg logit contributions
T-0.068
engu-0.068
 spe-0.066
Snow-0.066
paste-0.065
Token?"
Token ID1701
Feature activation+2.88e-02

Pos logit contributions
arted+0.065
ivated+0.062
 sidelines+0.054
ags+0.052
 down+0.051
Neg logit contributions
T-0.085
Snow-0.081
lan-0.078
paste-0.078
 ham-0.078
Token
Token ID198
Feature activation+1.71e-02

Pos logit contributions
arted+0.055
 down+0.053
ivated+0.052
rying+0.050
 up+0.050
Neg logit contributions
T-0.040
Snow-0.037
 spe-0.036
paste-0.035
lan-0.035
Token
Token ID198
Feature activation+2.59e-02

Pos logit contributions
 down+0.037
 up+0.035
arted+0.035
ivated+0.034
?!+0.034
Neg logit contributions
paste-0.021
T-0.020
 spe-0.020
Snow-0.020
engu-0.020
Token they
Token ID484
Feature activation-5.75e-03

Pos logit contributions
 down+0.010
arted+0.010
ivated+0.010
 up+0.010
?!+0.010
Neg logit contributions
 spe-0.005
T-0.005
lan-0.005
paste-0.005
By-0.005
Token were
Token ID547
Feature activation+5.83e-03

Pos logit contributions
paste+0.008
Snow+0.008
 spe+0.008
T+0.008
engu+0.008
Neg logit contributions
 down-0.011
 up-0.011
arted-0.010
ivated-0.010
?!-0.010
Token finished
Token ID5201
Feature activation+1.37e-02

Pos logit contributions
 down+0.009
arted+0.008
?!+0.008
 up+0.008
ivated+0.008
Neg logit contributions
paste-0.010
 spe-0.010
engu-0.010
Snow-0.010
T-0.010
Token,
Token ID11
Feature activation+2.66e-02

Pos logit contributions
arted+0.026
 down+0.025
ivated+0.024
 up+0.024
?!+0.024
Neg logit contributions
 spe-0.020
paste-0.019
engu-0.019
T-0.018
lan-0.018
Token Jack
Token ID3619
Feature activation+9.08e-03

Pos logit contributions
arted+0.063
ivering+0.062
zie+0.060
ivated+0.059
rying+0.059
Neg logit contributions
 souven-0.021
T-0.020
lan-0.020
By-0.020
 cr-0.017
Token and
Token ID290
Feature activation+8.25e-02

Pos logit contributions
 down+0.017
 up+0.017
arted+0.016
ivated+0.016
?!+0.016
Neg logit contributions
paste-0.013
 spe-0.013
T-0.013
Snow-0.013
engu-0.013
Token Jill
Token ID22772
Feature activation-6.27e-04

Pos logit contributions
ivering+0.134
ivated+0.132
 gonna+0.125
?!"+0.119
?!+0.118
Neg logit contributions
 souven-0.129
T-0.127
lan-0.126
 cr-0.126
 bead-0.124
Token were
Token ID547
Feature activation+1.26e-02

Pos logit contributions
paste+0.001
Snow+0.001
 spe+0.001
T+0.001
engu+0.001
Neg logit contributions
 down-0.001
 up-0.001
arted-0.001
ivated-0.001
?!-0.001
Token both
Token ID1111
Feature activation+1.11e-02

Pos logit contributions
arted+0.020
 down+0.019
?!+0.019
ivated+0.018
 up+0.018
Neg logit contributions
 spe-0.021
paste-0.021
Snow-0.020
engu-0.020
T-0.020
Token so
Token ID523
Feature activation-3.81e-03

Pos logit contributions
 down+0.016
 up+0.015
arted+0.015
ivated+0.014
?!+0.014
Neg logit contributions
paste-0.021
 spe-0.021
T-0.021
Snow-0.021
engu-0.021
Token happy
Token ID3772
Feature activation-2.77e-03

Pos logit contributions
paste+0.008
Snow+0.008
T+0.008
 spe+0.008
engu+0.008
Neg logit contributions
 down-0.005
 up-0.004
arted-0.004
ivated-0.004
?!-0.004
Token "
Token ID366
Feature activation+5.65e-03

Pos logit contributions
ivated+0.091
ieved+0.088
rying+0.084
 forth+0.084
?!+0.084
Neg logit contributions
T-0.060
 spe-0.060
 ham-0.060
Snow-0.059
paste-0.059
TokenWhat
Token ID2061
Feature activation-1.34e-02

Pos logit contributions
 down+0.012
 up+0.012
arted+0.012
ivated+0.012
?!+0.011
Neg logit contributions
T-0.007
Snow-0.007
paste-0.006
 spe-0.006
By-0.006
Token's
Token ID338
Feature activation+1.22e-02

Pos logit contributions
Snow+0.025
 spe+0.025
paste+0.024
Ar+0.024
T+0.024
Neg logit contributions
 down-0.021
 up-0.020
?!-0.019
!?-0.018
 gonna-0.018
Token in
Token ID287
Feature activation+3.76e-02

Pos logit contributions
arted+0.018
ivated+0.017
 down+0.016
zie+0.016
?!+0.015
Neg logit contributions
 spe-0.022
Snow-0.022
paste-0.022
T-0.021
engu-0.021
Token the
Token ID262
Feature activation+8.16e-03

Pos logit contributions
 down+0.050
 up+0.047
arted+0.045
ivated+0.043
?!+0.043
Neg logit contributions
paste-0.077
Snow-0.076
T-0.076
 spe-0.076
engu-0.076
Token box
Token ID3091
Feature activation+8.18e-02

Pos logit contributions
 down+0.014
 up+0.013
?!+0.012
ivated+0.012
arted+0.012
Neg logit contributions
paste-0.015
 spe-0.014
Snow-0.014
T-0.014
engu-0.014
Token?"
Token ID1701
Feature activation+2.13e-02

Pos logit contributions
arted+0.144
ivated+0.119
uct+0.114
zie+0.111
 sidelines+0.110
Neg logit contributions
paste-0.124
 spe-0.122
 sticker-0.121
engu-0.121
brush-0.120
Token he
Token ID339
Feature activation+1.94e-02

Pos logit contributions
arted+0.043
ivated+0.042
rying+0.041
?!+0.040
ags+0.040
Neg logit contributions
T-0.028
paste-0.027
Snow-0.027
 spe-0.027
 ham-0.024
Token asked
Token ID1965
Feature activation+1.61e-02

Pos logit contributions
arted+0.033
 down+0.033
 up+0.031
ivated+0.031
?!+0.030
Neg logit contributions
T-0.033
Snow-0.031
 spe-0.030
paste-0.029
Ar-0.029
Token.
Token ID13
Feature activation+3.06e-02

Pos logit contributions
arted+0.036
ivated+0.036
 gonna+0.035
?!+0.034
ieved+0.034
Neg logit contributions
T-0.016
paste-0.016
 spe-0.015
Snow-0.014
 souven-0.014
Token
Token ID220
Feature activation+4.45e-03

Pos logit contributions
 forth+0.063
ivated+0.062
arted+0.062
rying+0.061
ieved+0.059
Neg logit contributions
T-0.039
Snow-0.037
 spe-0.037
paste-0.035
 ham-0.035
Token.
Token ID13
Feature activation+3.86e-02

Pos logit contributions
 down+0.046
 up+0.044
arted+0.044
ivated+0.042
?!+0.041
Neg logit contributions
 spe-0.036
paste-0.036
T-0.036
Snow-0.036
engu-0.035
Token "
Token ID366
Feature activation-1.27e-03

Pos logit contributions
ivated+0.087
 forth+0.086
eling+0.085
ivering+0.085
?!+0.081
Neg logit contributions
T-0.039
engu-0.035
ro-0.034
Ar-0.034
lan-0.032
TokenWhere
Token ID8496
Feature activation-2.93e-02

Pos logit contributions
engu+0.002
 spe+0.002
paste+0.002
bread+0.002
ves+0.002
Neg logit contributions
 down-0.002
arted-0.002
 up-0.002
?!-0.002
ivated-0.002
Token are
Token ID389
Feature activation+1.75e-02

Pos logit contributions
 spe+0.042
Snow+0.042
paste+0.042
T+0.041
engu+0.041
Neg logit contributions
 down-0.057
 up-0.055
?!-0.053
arted-0.052
ivated-0.051
Token you
Token ID345
Feature activation+1.12e-02

Pos logit contributions
 down+0.030
arted+0.029
 up+0.029
ivated+0.029
?!+0.028
Neg logit contributions
T-0.028
 spe-0.027
engu-0.027
lan-0.027
Snow-0.027
Token going
Token ID1016
Feature activation+8.16e-02

Pos logit contributions
arted+0.019
 sidelines+0.018
ivated+0.017
uct+0.017
ieved+0.017
Neg logit contributions
T-0.016
 spe-0.016
lan-0.016
Snow-0.016
 souven-0.016
Token?",
Token ID35379
Feature activation+2.39e-02

Pos logit contributions
arted+0.134
uct+0.128
ivated+0.125
ieved+0.122
zie+0.121
Neg logit contributions
lan-0.124
 spe-0.124
Snow-0.121
engu-0.121
 souven-0.119
Token asked
Token ID1965
Feature activation+4.86e-02

Pos logit contributions
arted+0.036
 down+0.034
?!+0.033
ivated+0.033
asc+0.033
Neg logit contributions
T-0.041
 ham-0.040
lan-0.038
 spe-0.038
paste-0.037
Token Jill
Token ID22772
Feature activation+2.05e-02

Pos logit contributions
ivated+0.057
arted+0.052
 down+0.052
 gonna+0.050
?!+0.049
Neg logit contributions
T-0.101
lan-0.096
paste-0.095
engu-0.094
 cr-0.093
Token.
Token ID13
Feature activation+5.21e-02

Pos logit contributions
arted+0.045
ivated+0.043
 gonna+0.043
 sidelines+0.042
 down+0.042
Neg logit contributions
T-0.021
paste-0.020
engu-0.019
 spe-0.019
Snow-0.018
Token "
Token ID366
Feature activation-5.40e-03

Pos logit contributions
 forth+0.111
eling+0.108
ivering+0.107
ivated+0.106
rying+0.103
Neg logit contributions
T-0.065
Ar-0.053
lan-0.052
By-0.051
ro-0.050
Token"
Token ID1
Feature activation-1.67e-02

Pos logit contributions
 down+0.035
 up+0.034
arted+0.033
ivated+0.032
?!+0.032
Neg logit contributions
Snow-0.014
T-0.014
 spe-0.013
paste-0.013
engu-0.013
TokenWhat
Token ID2061
Feature activation-2.51e-02

Pos logit contributions
paste+0.029
engu+0.028
 spe+0.027
bread+0.026
Snow+0.026
Neg logit contributions
 down-0.029
 up-0.026
arted-0.026
ivated-0.026
?!-0.025
Token is
Token ID318
Feature activation+2.86e-02

Pos logit contributions
Snow+0.049
paste+0.049
 spe+0.048
Ar+0.047
bread+0.045
Neg logit contributions
 up-0.038
 down-0.037
?!-0.036
!?-0.033
?!"-0.032
Token in
Token ID287
Feature activation+4.10e-02

Pos logit contributions
 down+0.048
arted+0.046
 up+0.046
ivated+0.045
?!+0.044
Neg logit contributions
 spe-0.047
paste-0.047
Snow-0.046
T-0.046
engu-0.046
Token the
Token ID262
Feature activation+1.49e-02

Pos logit contributions
 down+0.057
 up+0.054
arted+0.051
ivated+0.050
?!+0.049
Neg logit contributions
paste-0.082
T-0.080
Snow-0.080
 spe-0.080
engu-0.080
Token box
Token ID3091
Feature activation+8.09e-02

Pos logit contributions
 down+0.027
 up+0.026
arted+0.025
ivated+0.025
?!+0.024
Neg logit contributions
paste-0.023
 spe-0.023
T-0.023
Snow-0.023
engu-0.023
Token?"
Token ID1701
Feature activation+1.25e-02

Pos logit contributions
arted+0.115
ivated+0.093
 down+0.091
zie+0.088
uct+0.086
Neg logit contributions
engu-0.154
 spe-0.152
paste-0.151
Snow-0.148
 sticker-0.146
Token Tom
Token ID4186
Feature activation+1.58e-02

Pos logit contributions
 down+0.020
arted+0.019
 up+0.019
?!+0.019
ivated+0.018
Neg logit contributions
 spe-0.022
T-0.021
paste-0.021
Snow-0.021
engu-0.021
Token asks
Token ID7893
Feature activation+5.92e-03

Pos logit contributions
ivated+0.023
 sidelines+0.022
arted+0.022
 down+0.021
?!+0.020
Neg logit contributions
T-0.028
engu-0.026
Ar-0.026
paste-0.026
Snow-0.026
Token.
Token ID13
Feature activation+1.82e-02

Pos logit contributions
 down+0.011
 up+0.010
arted+0.010
ivated+0.010
?!+0.010
Neg logit contributions
paste-0.009
 spe-0.009
Snow-0.009
T-0.009
engu-0.009
Token
Token ID198
Feature activation+3.56e-03

Pos logit contributions
 down+0.030
 up+0.029
arted+0.028
ivated+0.027
?!+0.027
Neg logit contributions
paste-0.031
 spe-0.031
T-0.031
Snow-0.031
engu-0.030
Token they
Token ID484
Feature activation-2.98e-03

Pos logit contributions
arted+0.020
 down+0.019
ivated+0.019
 up+0.019
?!+0.018
Neg logit contributions
T-0.009
 spe-0.009
lan-0.009
paste-0.009
Snow-0.009
Token were
Token ID547
Feature activation+5.44e-03

Pos logit contributions
paste+0.004
Snow+0.004
 spe+0.004
engu+0.004
T+0.004
Neg logit contributions
 down-0.006
 up-0.006
arted-0.005
ivated-0.005
?!-0.005
Token done
Token ID1760
Feature activation-2.13e-03

Pos logit contributions
arted+0.009
 down+0.009
?!+0.008
ivated+0.008
 up+0.008
Neg logit contributions
paste-0.009
Snow-0.009
 spe-0.009
T-0.008
engu-0.008
Token,
Token ID11
Feature activation+2.78e-02

Pos logit contributions
 spe+0.003
paste+0.003
T+0.003
Snow+0.003
engu+0.003
Neg logit contributions
arted-0.004
 down-0.004
 up-0.003
ivated-0.003
?!-0.003
Token Jack
Token ID3619
Feature activation-3.11e-03

Pos logit contributions
arted+0.069
ivering+0.067
rying+0.066
ivated+0.066
zie+0.065
Neg logit contributions
T-0.019
 souven-0.018
lan-0.017
By-0.017
 cr-0.016
Token and
Token ID290
Feature activation+8.08e-02

Pos logit contributions
paste+0.005
 spe+0.005
Snow+0.005
engu+0.005
T+0.005
Neg logit contributions
 down-0.006
 up-0.005
arted-0.005
?!-0.005
ivated-0.005
Token Jill
Token ID22772
Feature activation-7.28e-03

Pos logit contributions
ivated+0.133
ivering+0.126
 gonna+0.121
used+0.119
?!+0.119
Neg logit contributions
 cr-0.133
T-0.126
 souven-0.124
lan-0.123
 bead-0.120
Token hopped
Token ID45230
Feature activation-1.54e-02

Pos logit contributions
paste+0.012
Snow+0.012
 spe+0.012
T+0.011
engu+0.011
Neg logit contributions
 down-0.013
 up-0.012
ivated-0.011
?!-0.011
arted-0.011
Token onto
Token ID4291
Feature activation+8.72e-05

Pos logit contributions
paste+0.032
Snow+0.032
T+0.032
 spe+0.032
engu+0.031
Neg logit contributions
 down-0.018
arted-0.018
?!-0.017
ivated-0.017
 up-0.017
Token the
Token ID262
Feature activation-1.10e-02

Pos logit contributions
 down+0.000
 up+0.000
arted+0.000
ivated+0.000
?!+0.000
Neg logit contributions
paste-0.000
Snow-0.000
 spe-0.000
T-0.000
engu-0.000
Token bed
Token ID3996
Feature activation-2.19e-02

Pos logit contributions
engu+0.016
paste+0.016
 spe+0.016
T+0.015
Snow+0.015
Neg logit contributions
 down-0.022
 up-0.021
arted-0.019
ivated-0.019
?!-0.019
Token "
Token ID366
Feature activation-2.15e-02

Pos logit contributions
paste+0.009
 spe+0.009
T+0.009
Snow+0.009
engu+0.009
Neg logit contributions
 down-0.008
 up-0.007
arted-0.007
ivated-0.007
?!-0.007
TokenWhat
Token ID2061
Feature activation-2.76e-02

Pos logit contributions
 spe+0.041
engu+0.041
paste+0.040
ves+0.038
bread+0.038
Neg logit contributions
 down-0.032
?!-0.031
 up-0.029
!?-0.029
arted-0.028
Token if
Token ID611
Feature activation-2.08e-03

Pos logit contributions
Snow+0.048
paste+0.046
 spe+0.046
T+0.046
Ar+0.045
Neg logit contributions
 down-0.047
 up-0.046
?!-0.044
!?-0.041
 gonna-0.041
Token we
Token ID356
Feature activation+5.91e-03

Pos logit contributions
paste+0.004
T+0.004
 spe+0.004
Snow+0.004
engu+0.004
Neg logit contributions
 down-0.003
 up-0.003
arted-0.003
ivated-0.003
?!-0.003
Token look
Token ID804
Feature activation+4.99e-02

Pos logit contributions
 down+0.011
 up+0.011
arted+0.011
ivated+0.011
?!+0.010
Neg logit contributions
 spe-0.008
paste-0.008
engu-0.008
Snow-0.008
T-0.008
Token outside
Token ID2354
Feature activation+8.04e-02

Pos logit contributions
arted+0.082
ivated+0.078
 down+0.078
 up+0.073
?!+0.073
Neg logit contributions
 spe-0.082
paste-0.082
engu-0.078
T-0.078
Snow-0.077
Token?"
Token ID1701
Feature activation+1.42e-02

Pos logit contributions
arted+0.118
ivated+0.106
uct+0.098
 down+0.097
zie+0.093
Neg logit contributions
 spe-0.149
paste-0.145
Snow-0.140
engu-0.139
T-0.137
Token
Token ID198
Feature activation+1.01e-02

Pos logit contributions
 down+0.026
 up+0.025
arted+0.024
ivated+0.024
?!+0.024
Neg logit contributions
paste-0.022
 spe-0.022
Snow-0.022
T-0.021
engu-0.021
Token
Token ID198
Feature activation+1.90e-02

Pos logit contributions
 down+0.019
 up+0.019
?!+0.018
arted+0.018
ivated+0.017
Neg logit contributions
 spe-0.015
paste-0.015
engu-0.014
bread-0.014
Snow-0.014
TokenHer
Token ID9360
Feature activation-4.68e-03

Pos logit contributions
 down+0.036
 up+0.034
ivated+0.034
?!+0.033
arted+0.033
Neg logit contributions
 spe-0.029
paste-0.028
engu-0.027
Snow-0.026
bread-0.026
Token mom
Token ID1995
Feature activation-1.00e-02

Pos logit contributions
Snow+0.007
T+0.007
paste+0.007
 spe+0.007
engu+0.007
Neg logit contributions
 down-0.009
 up-0.009
arted-0.008
?!-0.008
 alone-0.007
Token in
Token ID287
Feature activation+7.78e-03

Pos logit contributions
paste+0.011
T+0.011
Snow+0.011
engu+0.011
 spe+0.011
Neg logit contributions
 down-0.011
 up-0.011
arted-0.010
ivated-0.010
?!-0.010
Token the
Token ID262
Feature activation+1.09e-03

Pos logit contributions
 down+0.011
 up+0.010
arted+0.010
ivated+0.010
?!+0.009
Neg logit contributions
paste-0.016
Snow-0.015
T-0.015
 spe-0.015
engu-0.015
Token air
Token ID1633
Feature activation+1.09e-02

Pos logit contributions
 down+0.002
 up+0.002
arted+0.002
ivated+0.002
?!+0.002
Neg logit contributions
paste-0.002
 spe-0.002
Snow-0.002
T-0.002
engu-0.002
Token.
Token ID13
Feature activation+4.48e-02

Pos logit contributions
 down+0.020
 up+0.020
arted+0.019
ivated+0.019
?!+0.018
Neg logit contributions
paste-0.016
 spe-0.016
T-0.016
Snow-0.016
engu-0.016
Token Jack
Token ID3619
Feature activation+2.25e-02

Pos logit contributions
ivering+0.075
 forth+0.074
rying+0.070
ivated+0.070
 down+0.068
Neg logit contributions
 ham-0.067
engu-0.066
lan-0.066
T-0.066
ro-0.063
Token and
Token ID290
Feature activation+8.03e-02

Pos logit contributions
 down+0.047
arted+0.045
ivated+0.045
 up+0.044
?!+0.044
Neg logit contributions
T-0.028
 spe-0.027
Snow-0.026
paste-0.026
engu-0.026
Token Jill
Token ID22772
Feature activation+1.10e-02

Pos logit contributions
ivated+0.125
ivering+0.121
 down+0.115
 up+0.113
 gonna+0.112
Neg logit contributions
T-0.138
lan-0.130
 cr-0.129
ib-0.123
 bead-0.123
Token quickly
Token ID2952
Feature activation-2.66e-02

Pos logit contributions
 down+0.024
 up+0.023
arted+0.023
?!+0.022
ivated+0.022
Neg logit contributions
T-0.013
 spe-0.013
engu-0.013
Snow-0.012
paste-0.012
Token grabbed
Token ID13176
Feature activation-6.72e-03

Pos logit contributions
paste+0.048
 spe+0.046
Snow+0.046
T+0.046
engu+0.046
Neg logit contributions
 down-0.042
 up-0.040
arted-0.040
ivated-0.037
?!-0.037
Token their
Token ID511
Feature activation-3.15e-02

Pos logit contributions
paste+0.010
 spe+0.010
T+0.010
Snow+0.010
engu+0.010
Neg logit contributions
 down-0.012
 up-0.012
arted-0.012
ivated-0.011
?!-0.011
Token rods
Token ID32858
Feature activation-2.67e-02

Pos logit contributions
paste+0.049
T+0.048
Snow+0.048
 spe+0.047
engu+0.046
Neg logit contributions
 down-0.059
 up-0.057
arted-0.054
ivated-0.052
?!-0.052
Token "
Token ID366
Feature activation-1.46e-02

Pos logit contributions
ivated+0.025
 up+0.025
 forth+0.025
 down+0.025
arted+0.025
Neg logit contributions
 spe-0.023
Snow-0.023
T-0.023
paste-0.023
engu-0.023
TokenWhat
Token ID2061
Feature activation-1.01e-02

Pos logit contributions
engu+0.026
 spe+0.025
paste+0.025
bread+0.024
lan+0.023
Neg logit contributions
 down-0.025
 up-0.023
arted-0.023
?!-0.022
ivated-0.022
Token is
Token ID318
Feature activation+2.93e-02

Pos logit contributions
Snow+0.018
paste+0.018
 spe+0.018
T+0.017
engu+0.017
Neg logit contributions
 down-0.016
 up-0.016
?!-0.015
arted-0.014
ivated-0.014
Token in
Token ID287
Feature activation+4.09e-02

Pos logit contributions
arted+0.057
ivated+0.054
ieved+0.054
zie+0.053
rying+0.052
Neg logit contributions
 spe-0.041
 ham-0.038
Snow-0.038
lan-0.038
 pill-0.037
Token the
Token ID262
Feature activation+1.89e-02

Pos logit contributions
 down+0.053
arted+0.052
ivated+0.050
 up+0.050
?!+0.047
Neg logit contributions
 spe-0.083
Snow-0.080
T-0.080
paste-0.078
engu-0.077
Token box
Token ID3091
Feature activation+7.99e-02

Pos logit contributions
 down+0.027
arted+0.026
 up+0.026
ivated+0.025
?!+0.024
Neg logit contributions
 spe-0.035
paste-0.035
Snow-0.035
T-0.035
engu-0.035
Token?"
Token ID1701
Feature activation+9.82e-03

Pos logit contributions
arted+0.113
ivated+0.095
 down+0.091
uct+0.087
zie+0.086
Neg logit contributions
paste-0.149
 spe-0.148
engu-0.145
Snow-0.145
T-0.142
Token Tom
Token ID4186
Feature activation+7.75e-03

Pos logit contributions
arted+0.018
 down+0.018
ivated+0.017
?!+0.017
 up+0.017
Neg logit contributions
T-0.014
Snow-0.014
paste-0.014
 spe-0.014
engu-0.013
Token asked
Token ID1965
Feature activation+6.34e-03

Pos logit contributions
arted+0.015
ivated+0.015
 down+0.015
 sidelines+0.014
ags+0.014
Neg logit contributions
T-0.011
Snow-0.009
 ham-0.009
 spe-0.009
engu-0.009
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
arted+0.012
ivated+0.012
 down+0.012
 gonna+0.012
?!+0.011
Neg logit contributions
paste-0.008
T-0.008
 spe-0.008
Snow-0.008
engu-0.008
Token "
Token ID366
Feature activation-1.42e-02

Pos logit contributions
 down+0.019
 up+0.018
arted+0.018
ivated+0.018
?!+0.017
Neg logit contributions
paste-0.021
T-0.021
 spe-0.021
Snow-0.020
engu-0.020
Token,
Token ID11
Feature activation+1.18e-02

Pos logit contributions
 down+0.010
 up+0.009
arted+0.009
ivated+0.009
?!+0.009
Neg logit contributions
paste-0.010
 spe-0.010
T-0.010
Snow-0.010
engu-0.010
Token what
Token ID644
Feature activation-1.67e-02

Pos logit contributions
 down+0.019
 up+0.019
arted+0.018
ivated+0.017
?!+0.017
Neg logit contributions
paste-0.020
 spe-0.020
T-0.020
Snow-0.020
engu-0.020
Token's
Token ID338
Feature activation+1.80e-02

Pos logit contributions
 spe+0.029
Snow+0.029
paste+0.028
T+0.028
Ar+0.027
Neg logit contributions
 down-0.028
 up-0.028
?!-0.026
 gonna-0.025
!?-0.024
Token in
Token ID287
Feature activation+4.01e-02

Pos logit contributions
arted+0.034
ivated+0.032
 down+0.031
zie+0.030
?!+0.029
Neg logit contributions
Snow-0.026
 spe-0.025
paste-0.025
lan-0.025
 souven-0.025
Token the
Token ID262
Feature activation+2.27e-02

Pos logit contributions
 down+0.056
 up+0.053
arted+0.052
ivated+0.050
?!+0.049
Neg logit contributions
 spe-0.078
paste-0.078
T-0.077
Snow-0.077
engu-0.076
Token tube
Token ID12403
Feature activation+7.95e-02

Pos logit contributions
 down+0.035
 up+0.033
arted+0.032
ivated+0.031
?!+0.031
Neg logit contributions
paste-0.042
 spe-0.042
T-0.041
Snow-0.041
engu-0.041
Token?"
Token ID1701
Feature activation+6.73e-03

Pos logit contributions
arted+0.106
ivated+0.093
 down+0.089
uct+0.084
 up+0.082
Neg logit contributions
paste-0.153
engu-0.149
Snow-0.148
 spe-0.146
T-0.145
Token asked
Token ID1965
Feature activation+1.42e-02

Pos logit contributions
 down+0.012
arted+0.012
 up+0.012
ivated+0.011
?!+0.011
Neg logit contributions
T-0.010
 spe-0.010
paste-0.010
Snow-0.010
engu-0.010
Token Tommy
Token ID19919
Feature activation-1.24e-02

Pos logit contributions
 down+0.021
arted+0.021
ivated+0.020
 up+0.020
?!+0.019
Neg logit contributions
T-0.026
paste-0.025
 spe-0.025
engu-0.024
Snow-0.024
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
T+0.017
paste+0.016
Snow+0.016
 spe+0.016
engu+0.016
Neg logit contributions
 down-0.024
arted-0.024
 up-0.023
ivated-0.023
?!-0.023
Token
Token ID198
Feature activation+8.58e-04

Pos logit contributions
 down+0.021
 up+0.020
arted+0.020
ivated+0.019
?!+0.019
Neg logit contributions
T-0.015
paste-0.014
Snow-0.014
 spe-0.014
engu-0.014
Token,
Token ID11
Feature activation+1.01e-02

Pos logit contributions
arted+0.052
 down+0.052
ivated+0.050
 up+0.050
?!+0.047
Neg logit contributions
 spe-0.044
engu-0.044
paste-0.043
T-0.043
Snow-0.043
Token what
Token ID644
Feature activation-1.13e-02

Pos logit contributions
 down+0.015
 up+0.015
arted+0.014
ivated+0.014
?!+0.014
Neg logit contributions
paste-0.019
 spe-0.018
T-0.018
Snow-0.018
engu-0.018
Token is
Token ID318
Feature activation+3.04e-02

Pos logit contributions
 spe+0.020
Snow+0.020
paste+0.020
T+0.020
engu+0.019
Neg logit contributions
 down-0.019
 up-0.018
?!-0.017
!?-0.016
 gonna-0.016
Token in
Token ID287
Feature activation+3.71e-02

Pos logit contributions
arted+0.059
ivated+0.055
ieved+0.051
zie+0.051
ivering+0.051
Neg logit contributions
 spe-0.045
lan-0.045
Snow-0.043
 souven-0.043
 ham-0.043
Token the
Token ID262
Feature activation+1.22e-02

Pos logit contributions
 down+0.049
arted+0.046
 up+0.046
ivated+0.044
?!+0.042
Neg logit contributions
 spe-0.076
paste-0.074
T-0.074
Snow-0.074
engu-0.073
Token box
Token ID3091
Feature activation+7.94e-02

Pos logit contributions
 down+0.021
 up+0.020
?!+0.018
ivated+0.018
arted+0.018
Neg logit contributions
paste-0.022
 spe-0.021
T-0.021
Snow-0.021
engu-0.021
Token?"
Token ID1701
Feature activation+1.60e-02

Pos logit contributions
arted+0.124
ivated+0.103
zie+0.093
 sidelines+0.093
uct+0.093
Neg logit contributions
 spe-0.137
paste-0.136
 sticker-0.135
engu-0.134
brush-0.132
Token Her
Token ID2332
Feature activation+7.21e-03

Pos logit contributions
arted+0.028
rying+0.026
?!+0.026
 down+0.026
ivated+0.025
Neg logit contributions
 spe-0.027
T-0.026
Snow-0.026
paste-0.025
engu-0.024
Token dad
Token ID9955
Feature activation+8.37e-03

Pos logit contributions
arted+0.018
ivated+0.017
 down+0.017
roads+0.017
ieved+0.017
Neg logit contributions
T-0.006
paste-0.005
 spe-0.005
Snow-0.005
engu-0.005
Token smiled
Token ID13541
Feature activation+1.32e-02

Pos logit contributions
 down+0.014
 up+0.013
arted+0.013
ivated+0.013
?!+0.013
Neg logit contributions
paste-0.014
T-0.014
 spe-0.014
Snow-0.014
engu-0.014
Token and
Token ID290
Feature activation+5.47e-03

Pos logit contributions
 down+0.026
arted+0.025
 up+0.025
ivated+0.025
?!+0.025
Neg logit contributions
T-0.017
paste-0.017
 spe-0.017
Snow-0.017
engu-0.016
Token
Token ID198
Feature activation-3.74e-04

Pos logit contributions
 spe+0.008
paste+0.008
bread+0.007
engu+0.007
lan+0.007
Neg logit contributions
 down-0.008
 up-0.008
arted-0.008
?!-0.008
ivated-0.008
Token"
Token ID1
Feature activation-3.60e-02

Pos logit contributions
 spe+0.001
paste+0.000
engu+0.000
bread+0.000
lan+0.000
Neg logit contributions
 down-0.001
ivated-0.001
arted-0.001
 up-0.001
?!-0.001
TokenWhere
Token ID8496
Feature activation-1.69e-02

Pos logit contributions
engu+0.067
 spe+0.063
paste+0.062
bread+0.060
lan+0.060
Neg logit contributions
 down-0.053
ivated-0.051
arted-0.050
?!-0.048
 up-0.048
Token are
Token ID389
Feature activation+1.71e-02

Pos logit contributions
paste+0.023
 spe+0.023
T+0.022
Snow+0.022
engu+0.022
Neg logit contributions
 down-0.034
 up-0.033
arted-0.032
ivated-0.031
?!-0.031
Token you
Token ID345
Feature activation+1.48e-02

Pos logit contributions
arted+0.030
 down+0.029
ivated+0.028
 up+0.027
?!+0.027
Neg logit contributions
 spe-0.027
lan-0.026
engu-0.026
Snow-0.026
T-0.026
Token going
Token ID1016
Feature activation+7.94e-02

Pos logit contributions
arted+0.023
 sidelines+0.020
roads+0.020
uct+0.020
zie+0.019
Neg logit contributions
 spe-0.025
 souven-0.023
 cr-0.023
 butter-0.022
lan-0.022
Token?"
Token ID1701
Feature activation+4.36e-03

Pos logit contributions
arted+0.122
ivated+0.116
uct+0.116
ieved+0.109
zie+0.105
Neg logit contributions
 spe-0.131
lan-0.126
engu-0.126
Snow-0.125
paste-0.125
Token asked
Token ID1965
Feature activation+1.21e-02

Pos logit contributions
 down+0.007
 up+0.007
arted+0.007
ivated+0.007
?!+0.007
Neg logit contributions
T-0.007
 spe-0.007
paste-0.007
Snow-0.007
engu-0.007
Token the
Token ID262
Feature activation+2.04e-02

Pos logit contributions
 down+0.017
 up+0.016
arted+0.015
ivated+0.014
?!+0.014
Neg logit contributions
paste-0.025
 spe-0.025
Snow-0.025
engu-0.024
T-0.024
Token flower
Token ID15061
Feature activation+7.50e-03

Pos logit contributions
 down+0.032
 up+0.031
 alone+0.028
arted+0.027
?!+0.027
Neg logit contributions
paste-0.038
T-0.037
Snow-0.036
engu-0.036
 spe-0.036
Token next
Token ID1306
Feature activation-1.06e-02

Pos logit contributions
 down+0.014
 up+0.013
arted+0.013
ivated+0.012
?!+0.012
Neg logit contributions
paste-0.012
 spe-0.012
T-0.012
Snow-0.012
engu-0.011
Token games
Token ID1830
Feature activation-1.17e-02

Pos logit contributions
 down+0.001
 up+0.001
arted+0.001
ivated+0.001
?!+0.001
Neg logit contributions
paste-0.001
 spe-0.001
T-0.001
Snow-0.001
engu-0.001
Token.
Token ID13
Feature activation+1.65e-02

Pos logit contributions
engu+0.020
paste+0.020
Snow+0.020
 spe+0.020
T+0.020
Neg logit contributions
 down-0.020
 up-0.019
?!-0.017
arted-0.017
 alone-0.017
Token
Token ID198
Feature activation+1.22e-03

Pos logit contributions
 down+0.036
ivated+0.034
 up+0.034
arted+0.034
 forth+0.034
Neg logit contributions
T-0.020
engu-0.019
Snow-0.018
lan-0.018
paste-0.017
Token
Token ID198
Feature activation+2.44e-02

Pos logit contributions
 up+0.002
 down+0.002
arted+0.002
?!+0.002
ivated+0.002
Neg logit contributions
 spe-0.002
paste-0.002
bread-0.002
Snow-0.002
engu-0.002
TokenJack
Token ID14295
Feature activation-1.86e-03

Pos logit contributions
 up+0.047
 down+0.047
ivated+0.046
?!+0.046
arted+0.045
Neg logit contributions
 spe-0.035
paste-0.034
engu-0.033
Snow-0.031
bread-0.031
Token and
Token ID290
Feature activation+7.90e-02

Pos logit contributions
paste+0.003
 spe+0.003
Snow+0.003
T+0.003
engu+0.003
Neg logit contributions
 down-0.004
 up-0.003
arted-0.003
?!-0.003
ivated-0.003
Token Jill
Token ID22772
Feature activation-5.34e-03

Pos logit contributions
ivated+0.117
ivering+0.115
 down+0.110
 gonna+0.109
 up+0.108
Neg logit contributions
T-0.135
lan-0.133
 souven-0.131
 cr-0.131
 bead-0.128
Token were
Token ID547
Feature activation+1.35e-02

Pos logit contributions
paste+0.008
 spe+0.007
Snow+0.007
T+0.007
engu+0.007
Neg logit contributions
 down-0.010
 up-0.010
arted-0.010
ivated-0.010
?!-0.009
Token the
Token ID262
Feature activation+2.65e-02

Pos logit contributions
 down+0.024
arted+0.024
?!+0.024
ivated+0.023
 up+0.023
Neg logit contributions
 spe-0.020
Snow-0.019
paste-0.019
T-0.019
By-0.018
Token best
Token ID1266
Feature activation-6.37e-04

Pos logit contributions
 up+0.036
 down+0.036
?!+0.032
 alone+0.031
ivated+0.031
Neg logit contributions
paste-0.055
 spe-0.054
Snow-0.052
engu-0.052
T-0.052
Token of
Token ID286
Feature activation+6.81e-04

Pos logit contributions
Snow+0.001
paste+0.001
 spe+0.001
T+0.001
engu+0.001
Neg logit contributions
 down-0.001
 up-0.001
arted-0.001
?!-0.001
ivated-0.001
Token Pete
Token ID17542
Feature activation+1.49e-02

Pos logit contributions
 down+0.019
arted+0.018
ivated+0.018
 up+0.018
 forth+0.017
Neg logit contributions
T-0.026
 spe-0.026
Snow-0.025
paste-0.025
engu-0.025
Token.
Token ID13
Feature activation-5.44e-03

Pos logit contributions
 down+0.029
arted+0.029
ivated+0.028
 up+0.027
?!+0.027
Neg logit contributions
paste-0.020
Snow-0.020
T-0.020
 spe-0.020
engu-0.019
Token Where
Token ID6350
Feature activation-2.14e-02

Pos logit contributions
T+0.009
Snow+0.009
 spe+0.009
paste+0.009
engu+0.009
Neg logit contributions
 down-0.009
 up-0.008
arted-0.008
ivated-0.008
?!-0.008
Token are
Token ID389
Feature activation+6.48e-03

Pos logit contributions
paste+0.030
 spe+0.029
T+0.029
Snow+0.029
engu+0.029
Neg logit contributions
 down-0.043
 up-0.041
arted-0.040
ivated-0.039
?!-0.039
Token you
Token ID345
Feature activation+1.57e-02

Pos logit contributions
 down+0.010
arted+0.010
 up+0.010
ivated+0.010
?!+0.009
Neg logit contributions
T-0.011
 spe-0.011
Snow-0.011
paste-0.011
engu-0.011
Token going
Token ID1016
Feature activation+7.89e-02

Pos logit contributions
arted+0.027
uct+0.025
 sidelines+0.025
ieved+0.024
 forth+0.023
Neg logit contributions
 spe-0.024
ib-0.022
T-0.022
 pill-0.022
 cr-0.022
Token?"
Token ID1701
Feature activation-5.42e-04

Pos logit contributions
arted+0.125
uct+0.123
ivated+0.117
ieved+0.115
zie+0.110
Neg logit contributions
Snow-0.124
 spe-0.124
lan-0.122
T-0.120
 sticker-0.120
Token
Token ID198
Feature activation-1.84e-02

Pos logit contributions
paste+0.001
 spe+0.001
engu+0.001
Snow+0.001
T+0.001
Neg logit contributions
 down-0.001
 up-0.001
arted-0.001
ivated-0.001
?!-0.001
Token
Token ID198
Feature activation-4.89e-03

Pos logit contributions
paste+0.030
 spe+0.030
engu+0.028
bread+0.028
Snow+0.028
Neg logit contributions
 down-0.032
 up-0.032
?!-0.030
arted-0.029
!?-0.029
Token"
Token ID1
Feature activation-2.67e-02

Pos logit contributions
paste+0.007
 spe+0.007
engu+0.007
 souven+0.006
Snow+0.006
Neg logit contributions
 down-0.010
 up-0.009
?!-0.009
arted-0.009
ivated-0.009
TokenI
Token ID40
Feature activation-9.85e-03

Pos logit contributions
engu+0.049
paste+0.048
 spe+0.047
bread+0.047
lan+0.045
Neg logit contributions
ivated-0.041
 down-0.040
arted-0.039
 up-0.038
?!-0.037
Token do
Token ID466
Feature activation+1.08e-02

Pos logit contributions
paste+0.033
 spe+0.033
Snow+0.032
T+0.032
engu+0.031
Neg logit contributions
 down-0.046
 up-0.044
arted-0.042
?!-0.042
ivated-0.042
Token you
Token ID345
Feature activation+9.82e-03

Pos logit contributions
 down+0.018
arted+0.017
 up+0.017
ivated+0.016
?!+0.015
Neg logit contributions
T-0.019
Snow-0.018
paste-0.018
 spe-0.018
engu-0.018
Token think
Token ID892
Feature activation+3.09e-02

Pos logit contributions
arted+0.019
 down+0.018
ivated+0.017
 sidelines+0.017
 up+0.017
Neg logit contributions
 spe-0.013
engu-0.012
T-0.012
Ar-0.012
Snow-0.011
Token you
Token ID345
Feature activation+7.73e-03

Pos logit contributions
 down+0.046
arted+0.046
ivated+0.044
 up+0.044
?!+0.041
Neg logit contributions
paste-0.055
Snow-0.054
 spe-0.054
T-0.054
engu-0.054
Token're
Token ID821
Feature activation+4.46e-02

Pos logit contributions
 down+0.016
arted+0.016
 up+0.015
ivated+0.015
 forth+0.015
Neg logit contributions
 spe-0.009
T-0.009
engu-0.009
Snow-0.009
paste-0.009
Token going
Token ID1016
Feature activation+7.89e-02

Pos logit contributions
arted+0.066
ivated+0.058
ieved+0.056
 down+0.055
 forth+0.055
Neg logit contributions
 spe-0.078
Snow-0.076
lan-0.074
T-0.074
engu-0.073
Token?!
Token ID12248
Feature activation+2.80e-03

Pos logit contributions
arted+0.128
uct+0.122
ivated+0.121
ieved+0.117
zie+0.112
Neg logit contributions
Snow-0.123
T-0.121
 spe-0.121
lan-0.120
 souven-0.116
Token You
Token ID921
Feature activation-1.54e-03

Pos logit contributions
 down+0.004
arted+0.004
ivated+0.004
 up+0.004
 forth+0.004
Neg logit contributions
T-0.005
paste-0.005
Snow-0.005
engu-0.005
 spe-0.005
Token're
Token ID821
Feature activation+2.71e-02

Pos logit contributions
 spe+0.002
T+0.002
Snow+0.002
engu+0.002
paste+0.002
Neg logit contributions
 down-0.003
arted-0.003
 up-0.003
ivated-0.003
 forth-0.003
Token not
Token ID407
Feature activation+1.82e-02

Pos logit contributions
arted+0.042
ivated+0.040
 down+0.037
 forth+0.036
ican+0.036
Neg logit contributions
 spe-0.047
Snow-0.042
paste-0.042
 souven-0.040
Ar-0.040
Token allowed
Token ID3142
Feature activation+2.73e-02

Pos logit contributions
arted+0.027
ivated+0.026
ieved+0.024
 sidelines+0.024
zie+0.024
Neg logit contributions
 spe-0.032
T-0.031
Snow-0.030
Ar-0.030
 souven-0.029
Token,
Token ID11
Feature activation-5.85e-04

Pos logit contributions
paste+0.001
 spe+0.001
T+0.001
Snow+0.001
engu+0.001
Neg logit contributions
 down-0.001
 up-0.001
arted-0.001
ivated-0.001
?!-0.001
Token "
Token ID366
Feature activation-1.53e-02

Pos logit contributions
paste+0.001
 spe+0.001
T+0.001
Snow+0.001
engu+0.001
Neg logit contributions
 down-0.001
 up-0.001
arted-0.001
ivated-0.001
?!-0.001
TokenWhere
Token ID8496
Feature activation-2.35e-02

Pos logit contributions
engu+0.033
 spe+0.032
paste+0.031
ves+0.031
lan+0.030
Neg logit contributions
 down-0.020
?!-0.019
 up-0.019
ivated-0.018
 forth-0.017
Token are
Token ID389
Feature activation+1.50e-02

Pos logit contributions
Snow+0.035
 spe+0.034
paste+0.034
T+0.034
engu+0.033
Neg logit contributions
 down-0.045
 up-0.044
?!-0.042
arted-0.041
ivated-0.040
Token you
Token ID345
Feature activation+1.94e-02

Pos logit contributions
 down+0.020
 up+0.019
arted+0.019
ivated+0.018
?!+0.018
Neg logit contributions
paste-0.030
 spe-0.030
T-0.030
Snow-0.030
engu-0.029
Token going
Token ID1016
Feature activation+7.88e-02

Pos logit contributions
arted+0.031
 sidelines+0.026
ivated+0.026
roads+0.026
zie+0.026
Neg logit contributions
T-0.032
 butter-0.031
 cr-0.031
 ham-0.031
 spe-0.031
Token?".
Token ID43634
Feature activation+3.67e-04

Pos logit contributions
arted+0.121
ivated+0.115
uct+0.113
ieved+0.109
zie+0.108
Neg logit contributions
 spe-0.126
Snow-0.125
paste-0.123
T-0.123
lan-0.123
Token The
Token ID383
Feature activation+1.56e-02

Pos logit contributions
 down+0.001
 up+0.000
?!+0.000
arted+0.000
 alone+0.000
Neg logit contributions
 spe-0.001
paste-0.001
Snow-0.001
engu-0.001
T-0.001
Token pear
Token ID25286
Feature activation+7.86e-03

Pos logit contributions
 down+0.031
 up+0.030
arted+0.028
 alone+0.028
ivated+0.027
Neg logit contributions
paste-0.023
T-0.023
Snow-0.022
 spe-0.022
engu-0.022
Token didn
Token ID1422
Feature activation+6.18e-03

Pos logit contributions
 down+0.015
arted+0.014
 up+0.014
ivated+0.014
?!+0.013
Neg logit contributions
paste-0.012
 spe-0.011
T-0.011
Snow-0.011
engu-0.011
Token't
Token ID470
Feature activation+1.05e-02

Pos logit contributions
 down+0.011
arted+0.011
 forth+0.010
 up+0.010
ivated+0.010
Neg logit contributions
T-0.009
Snow-0.009
paste-0.009
engu-0.009
 spe-0.009
Token Smith
Token ID4176
Feature activation+2.66e-02

Pos logit contributions
 down+0.039
 up+0.037
arted+0.035
?!+0.035
ivated+0.034
Neg logit contributions
paste-0.038
 spe-0.037
Snow-0.037
T-0.036
engu-0.036
Token!
Token ID0
Feature activation+1.69e-02

Pos logit contributions
arted+0.043
 down+0.042
ivated+0.041
 up+0.041
?!+0.038
Neg logit contributions
 spe-0.046
paste-0.045
engu-0.045
Snow-0.044
T-0.044
Token Where
Token ID6350
Feature activation-1.77e-02

Pos logit contributions
 down+0.026
 up+0.026
arted+0.026
ivated+0.025
 forth+0.024
Neg logit contributions
Snow-0.030
T-0.030
 spe-0.029
paste-0.029
engu-0.029
Token are
Token ID389
Feature activation+1.34e-02

Pos logit contributions
paste+0.025
 spe+0.025
Snow+0.024
T+0.024
engu+0.024
Neg logit contributions
 down-0.035
 up-0.034
arted-0.032
?!-0.032
ivated-0.032
Token you
Token ID345
Feature activation+1.16e-02

Pos logit contributions
 down+0.022
arted+0.021
 up+0.021
ivated+0.021
?!+0.020
Neg logit contributions
T-0.022
Snow-0.022
 spe-0.022
engu-0.022
lan-0.021
Token going
Token ID1016
Feature activation+7.88e-02

Pos logit contributions
arted+0.020
zie+0.018
uct+0.017
 sidelines+0.017
ieved+0.017
Neg logit contributions
 spe-0.018
Snow-0.017
T-0.017
 souven-0.016
 rh-0.016
Token?"
Token ID1701
Feature activation+4.48e-03

Pos logit contributions
arted+0.119
uct+0.112
ivated+0.111
ieved+0.105
zie+0.104
Neg logit contributions
Snow-0.128
 sticker-0.127
 spe-0.126
lan-0.125
T-0.124
Token asked
Token ID1965
Feature activation+1.62e-02

Pos logit contributions
arted+0.008
 down+0.007
 up+0.007
?!+0.007
ivated+0.007
Neg logit contributions
T-0.007
 spe-0.007
Snow-0.007
paste-0.007
 ham-0.007
Token Lily
Token ID20037
Feature activation+9.17e-04

Pos logit contributions
 down+0.024
arted+0.023
 up+0.023
ivated+0.023
?!+0.022
Neg logit contributions
T-0.030
paste-0.029
 spe-0.029
Snow-0.029
engu-0.029
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
arted+0.002
 gonna+0.002
 sidelines+0.002
ivated+0.002
?!+0.002
Neg logit contributions
T-0.001
Snow-0.001
 spe-0.001
paste-0.001
engu-0.001
Token
Token ID198
Feature activation-5.58e-03

Pos logit contributions
 down+0.028
arted+0.027
 up+0.027
ivated+0.026
?!+0.026
Neg logit contributions
T-0.025
Snow-0.025
 spe-0.025
paste-0.025
engu-0.024
Token
Token ID198
Feature activation+4.00e-03

Pos logit contributions
 spe+0.016
paste+0.015
engu+0.014
bread+0.014
Snow+0.014
Neg logit contributions
 down-0.019
 up-0.018
arted-0.017
?!-0.017
ivated-0.017
Token"
Token ID1
Feature activation-2.87e-02

Pos logit contributions
 down+0.008
 up+0.008
arted+0.008
ivated+0.008
?!+0.008
Neg logit contributions
 spe-0.005
paste-0.005
engu-0.005
Snow-0.005
bread-0.005
TokenWhere
Token ID8496
Feature activation-3.32e-02

Pos logit contributions
 spe+0.047
paste+0.045
engu+0.045
bread+0.042
lan+0.042
Neg logit contributions
 down-0.052
 up-0.049
arted-0.048
?!-0.047
ivated-0.046
Token are
Token ID389
Feature activation+5.22e-03

Pos logit contributions
 spe+0.050
Snow+0.050
paste+0.050
T+0.049
engu+0.048
Neg logit contributions
 down-0.062
 up-0.060
?!-0.057
arted-0.055
ivated-0.055
Token we
Token ID356
Feature activation+2.67e-02

Pos logit contributions
 down+0.009
arted+0.009
ivated+0.009
 up+0.009
?!+0.008
Neg logit contributions
T-0.008
Snow-0.008
 spe-0.008
engu-0.008
lan-0.008
Token going
Token ID1016
Feature activation+7.83e-02

Pos logit contributions
arted+0.043
zie+0.037
 sidelines+0.037
ieved+0.037
uct+0.037
Neg logit contributions
 spe-0.044
Snow-0.043
T-0.043
By-0.042
engu-0.042
Token?"
Token ID1701
Feature activation-1.32e-03

Pos logit contributions
arted+0.132
uct+0.128
ivated+0.124
ieved+0.118
zie+0.117
Neg logit contributions
Snow-0.114
 spe-0.113
lan-0.113
 souven-0.110
engu-0.109
Token asked
Token ID1965
Feature activation+8.36e-03

Pos logit contributions
T+0.002
Snow+0.002
paste+0.002
 spe+0.002
engu+0.002
Neg logit contributions
 down-0.002
arted-0.002
 up-0.002
ivated-0.002
?!-0.002
Token Toby
Token ID37578
Feature activation+5.19e-03

Pos logit contributions
 down+0.011
 up+0.011
arted+0.010
ivated+0.010
?!+0.009
Neg logit contributions
 spe-0.017
paste-0.017
Snow-0.017
engu-0.017
T-0.017
Token,
Token ID11
Feature activation+5.25e-03

Pos logit contributions
arted+0.010
 down+0.010
?!+0.010
ivated+0.009
 gonna+0.009
Neg logit contributions
T-0.007
paste-0.007
 spe-0.006
engu-0.006
Snow-0.006
Token bouncing
Token ID30713
Feature activation+7.15e-03

Pos logit contributions
 down+0.007
arted+0.007
 up+0.007
ivated+0.007
?!+0.007
Neg logit contributions
 spe-0.010
T-0.010
Snow-0.010
paste-0.010
engu-0.010
Token put
Token ID1234
Feature activation-4.86e-02

Pos logit contributions
paste+0.050
bread+0.048
Snow+0.048
lan+0.047
engu+0.047
Neg logit contributions
 down-0.033
 up-0.030
arted-0.028
 forth-0.027
ivated-0.027
Token the
Token ID262
Feature activation-4.29e-02

Pos logit contributions
T+0.123
paste+0.121
Snow+0.121
engu+0.117
bread+0.116
Neg logit contributions
 down-0.062
 up-0.059
 alone-0.038
 forth-0.036
 over-0.035
Token cheese
Token ID9891
Feature activation-5.88e-02

Pos logit contributions
T+0.056
Ch+0.052
By+0.050
engu+0.049
 spe+0.048
Neg logit contributions
 up-0.102
 down-0.099
 alone-0.087
 my-0.085
 back-0.081
Token and
Token ID290
Feature activation-4.35e-02

Pos logit contributions
T+0.143
 spe+0.137
Snow+0.134
engu+0.133
 fiber+0.132
Neg logit contributions
 down-0.087
 up-0.082
 alone-0.064
 over-0.056
 back-0.053
Token the
Token ID262
Feature activation-4.92e-02

Pos logit contributions
T+0.085
paste+0.082
engu+0.080
br+0.079
Snow+0.079
Neg logit contributions
 down-0.073
 up-0.065
 forth-0.065
 alone-0.050
 over-0.050
Token ham
Token ID8891
Feature activation-9.04e-02

Pos logit contributions
T+0.068
Ch+0.059
engu+0.056
By+0.055
Snow+0.054
Neg logit contributions
 down-0.113
 up-0.110
 alone-0.103
 my-0.094
 forth-0.093
Token on
Token ID319
Feature activation-1.72e-02

Pos logit contributions
T+0.183
Ch+0.174
Ar+0.173
Snow+0.164
Bl+0.162
Neg logit contributions
 down-0.158
 up-0.145
 back-0.125
 alone-0.113
 into-0.100
Token the
Token ID262
Feature activation-4.72e-02

Pos logit contributions
paste+0.047
By+0.045
engu+0.045
aid+0.044
bread+0.044
Neg logit contributions
 up-0.016
 down-0.015
 over-0.009
 my-0.008
 alone-0.007
Token bread
Token ID8509
Feature activation-6.24e-02

Pos logit contributions
T+0.089
By+0.080
 spe+0.079
Ar+0.078
engu+0.078
Neg logit contributions
 down-0.083
 up-0.081
 alone-0.070
 forth-0.063
?!-0.061
Token.
Token ID13
Feature activation-2.44e-02

Pos logit contributions
T+0.136
Ar+0.135
 spe+0.133
Bl+0.132
Snow+0.131
Neg logit contributions
 down-0.096
 up-0.090
 alone-0.076
 over-0.070
 back-0.062
Token Mom
Token ID11254
Feature activation-2.85e-02

Pos logit contributions
paste+0.050
 spe+0.049
T+0.048
engu+0.048
Snow+0.048
Neg logit contributions
 down-0.035
 up-0.032
 gonna-0.028
arted-0.028
 alone-0.026
Token made
Token ID925
Feature activation-8.01e-03

Pos logit contributions
engu+0.060
paste+0.060
Snow+0.056
bread+0.056
 spe+0.055
Neg logit contributions
 down-0.062
 up-0.061
?!-0.056
 alone-0.056
 gonna-0.056
Token their
Token ID511
Feature activation+1.23e-02

Pos logit contributions
T+0.013
 spe+0.013
Snow+0.013
paste+0.013
Ar+0.013
Neg logit contributions
 up-0.015
 down-0.015
arted-0.013
 alone-0.012
ivated-0.012
Token favorite
Token ID4004
Feature activation-1.92e-02

Pos logit contributions
 down+0.020
 up+0.019
 alone+0.017
 forth+0.017
arted+0.017
Neg logit contributions
T-0.024
paste-0.023
Snow-0.023
 spe-0.022
engu-0.022
Token food
Token ID2057
Feature activation-2.94e-02

Pos logit contributions
paste+0.027
 spe+0.027
Snow+0.026
T+0.026
engu+0.026
Neg logit contributions
 down-0.038
 up-0.036
arted-0.036
ivated-0.035
?!-0.034
Token:
Token ID25
Feature activation-2.85e-02

Pos logit contributions
T+0.056
Snow+0.054
Ar+0.054
 spe+0.053
engu+0.053
Neg logit contributions
 down-0.048
 up-0.047
 alone-0.041
?!-0.039
 forth-0.038
Token mac
Token ID8352
Feature activation-8.93e-02

Pos logit contributions
T+0.047
Ar+0.046
Snow+0.045
engu+0.045
paste+0.044
Neg logit contributions
 down-0.051
 up-0.050
 alone-0.044
 gonna-0.043
arted-0.042
Tokenaron
Token ID8045
Feature activation-2.54e-03

Pos logit contributions
 souven+0.175
Bl+0.169
T+0.168
 spe+0.164
By+0.162
Neg logit contributions
 down-0.133
 up-0.126
 over-0.101
?!-0.099
 alone-0.097
Tokeni
Token ID72
Feature activation-4.64e-02

Pos logit contributions
Ar+0.005
paste+0.005
engu+0.005
Snow+0.005
T+0.005
Neg logit contributions
 down-0.004
 up-0.004
 forth-0.003
arted-0.003
?!-0.003
Token and
Token ID290
Feature activation-3.85e-02

Pos logit contributions
Ar+0.097
Bl+0.095
T+0.095
By+0.092
Snow+0.090
Neg logit contributions
 down-0.069
 up-0.065
 alone-0.053
?!-0.048
 over-0.048
Token cheese
Token ID9891
Feature activation-6.43e-02

Pos logit contributions
T+0.051
engu+0.048
Snow+0.046
Bl+0.044
ves+0.044
Neg logit contributions
 down-0.080
 up-0.074
 my-0.072
 alone-0.068
 forth-0.065
Token.
Token ID13
Feature activation-2.70e-03

Pos logit contributions
T+0.137
Ar+0.132
engu+0.131
Bl+0.131
Snow+0.128
Neg logit contributions
 down-0.093
 up-0.089
 alone-0.079
 over-0.068
?!-0.065
Tokenman
Token ID805
Feature activation-2.86e-02

Pos logit contributions
T+0.083
 fiber+0.079
engu+0.074
paste+0.073
Bl+0.072
Neg logit contributions
 down-0.072
 up-0.070
 forth-0.057
 alone-0.052
 over-0.049
Token with
Token ID351
Feature activation-5.35e-02

Pos logit contributions
T+0.056
 fiber+0.055
paste+0.054
Snow+0.054
 spe+0.053
Neg logit contributions
 down-0.047
 up-0.046
 alone-0.038
 forth-0.036
ivated-0.034
Token a
Token ID257
Feature activation-7.94e-02

Pos logit contributions
paste+0.111
ves+0.111
ro+0.111
T+0.110
comb+0.109
Neg logit contributions
 down-0.078
 up-0.073
 back-0.055
 my-0.054
 alone-0.054
Token hat
Token ID6877
Feature activation-4.79e-02

Pos logit contributions
T+0.131
engu+0.129
lan+0.127
paste+0.125
 fiber+0.121
Neg logit contributions
 down-0.142
 up-0.125
 alone-0.109
 forth-0.109
?!-0.107
Token and
Token ID290
Feature activation-6.23e-02

Pos logit contributions
Ar+0.098
T+0.097
 fiber+0.096
engu+0.095
Bl+0.094
Neg logit contributions
 down-0.076
 up-0.072
 over-0.058
 alone-0.057
arted-0.052
Token a
Token ID257
Feature activation-8.55e-02

Pos logit contributions
engu+0.117
T+0.114
ork+0.113
opp+0.110
Bl+0.110
Neg logit contributions
 down-0.097
 up-0.088
 my-0.070
 forth-0.067
 gonna-0.067
Token scarf
Token ID36953
Feature activation-6.09e-02

Pos logit contributions
T+0.136
engu+0.133
paste+0.125
lan+0.124
 fiber+0.123
Neg logit contributions
 down-0.159
 up-0.141
 alone-0.123
?!-0.121
 forth-0.121
Token.
Token ID13
Feature activation-2.65e-02

Pos logit contributions
T+0.134
 fiber+0.134
Ar+0.131
engu+0.130
Bl+0.128
Neg logit contributions
 down-0.094
 up-0.088
 over-0.069
 alone-0.067
 back-0.061
Token They
Token ID1119
Feature activation-3.44e-02

Pos logit contributions
 spe+0.058
engu+0.056
paste+0.056
Snow+0.054
ves+0.054
Neg logit contributions
 down-0.034
 up-0.030
 Sally-0.025
 gonna-0.024
 Sarah-0.024
Token threw
Token ID9617
Feature activation-4.60e-02

Pos logit contributions
paste+0.066
engu+0.066
ork+0.065
T+0.064
Snow+0.064
Neg logit contributions
 down-0.046
 up-0.041
 alone-0.040
 forth-0.036
 over-0.036
Token snow
Token ID6729
Feature activation-5.04e-02

Pos logit contributions
T+0.101
ves+0.098
paste+0.097
 fiber+0.097
bread+0.096
Neg logit contributions
 down-0.080
 up-0.074
 over-0.055
 forth-0.053
 back-0.053
Token all
Token ID477
Feature activation-6.57e-03

Pos logit contributions
engu+0.091
T+0.089
paste+0.089
Snow+0.088
lan+0.088
Neg logit contributions
 up-0.061
 down-0.060
 over-0.043
 alone-0.042
?!-0.041
Token the
Token ID262
Feature activation-5.26e-02

Pos logit contributions
paste+0.012
 spe+0.012
T+0.012
Snow+0.012
engu+0.012
Neg logit contributions
 down-0.010
 up-0.009
arted-0.009
ivated-0.009
?!-0.009
Token popcorn
Token ID38914
Feature activation-3.55e-02

Pos logit contributions
paste+0.061
 spe+0.061
T+0.060
Snow+0.060
engu+0.059
Neg logit contributions
 down-0.118
 up-0.117
ivated-0.105
?!-0.105
arted-0.103
Token and
Token ID290
Feature activation-1.92e-02

Pos logit contributions
T+0.063
Snow+0.063
 spe+0.062
engu+0.061
paste+0.060
Neg logit contributions
 down-0.062
 up-0.060
 alone-0.051
arted-0.051
?!-0.050
Token drank
Token ID24070
Feature activation-4.59e-02

Pos logit contributions
engu+0.037
T+0.034
paste+0.034
 spe+0.034
Snow+0.034
Neg logit contributions
 down-0.031
 up-0.030
 forth-0.029
 alone-0.025
 Sally-0.024
Token some
Token ID617
Feature activation-8.48e-02

Pos logit contributions
engu+0.099
T+0.095
Ar+0.095
Snow+0.094
 spe+0.093
Neg logit contributions
 down-0.072
 up-0.072
 over-0.055
 alone-0.049
arted-0.046
Token water
Token ID1660
Feature activation-2.46e-02

Pos logit contributions
T+0.145
Snow+0.141
 spe+0.139
engu+0.137
Bl+0.136
Neg logit contributions
 up-0.156
 down-0.155
 over-0.120
?!-0.119
 alone-0.118
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
T+0.044
Snow+0.043
 spe+0.043
engu+0.043
Ar+0.042
Neg logit contributions
 down-0.044
 up-0.042
arted-0.036
 alone-0.036
?!-0.035
Token Tim
Token ID5045
Feature activation+1.86e-02

Pos logit contributions
ivated+0.034
 down+0.034
arted+0.034
 up+0.032
?!+0.032
Neg logit contributions
T-0.018
Snow-0.018
engu-0.017
paste-0.016
Ar-0.016
Tokenmy
Token ID1820
Feature activation-6.50e-03

Pos logit contributions
 down+0.023
 up+0.022
arted+0.021
ivated+0.020
?!+0.020
Neg logit contributions
paste-0.039
 spe-0.039
T-0.039
Snow-0.039
engu-0.039
Token was
Token ID373
Feature activation+8.25e-03

Pos logit contributions
paste+0.012
 spe+0.012
engu+0.011
Snow+0.011
 souven+0.011
Neg logit contributions
 up-0.010
 down-0.010
 alone-0.009
arted-0.009
 forth-0.009
Token nose
Token ID9686
Feature activation-4.98e-02

Pos logit contributions
T+0.139
Bl+0.134
Ar+0.130
Ch+0.123
By+0.123
Neg logit contributions
 down-0.093
 up-0.086
 over-0.064
 back-0.064
 alone-0.057
Token and
Token ID290
Feature activation-4.73e-02

Pos logit contributions
Ar+0.111
Bl+0.107
T+0.106
engu+0.105
 fiber+0.103
Neg logit contributions
 down-0.075
 up-0.073
 over-0.059
 alone-0.050
arted-0.050
Token a
Token ID257
Feature activation-8.38e-02

Pos logit contributions
engu+0.087
ork+0.083
T+0.082
Bl+0.081
Snow+0.081
Neg logit contributions
 down-0.079
 up-0.072
 forth-0.062
 over-0.060
 Sarah-0.056
Token hat
Token ID6877
Feature activation-5.35e-02

Pos logit contributions
T+0.128
engu+0.125
lan+0.118
ves+0.116
paste+0.115
Neg logit contributions
 down-0.162
 up-0.144
 alone-0.127
?!-0.123
 forth-0.123
Token and
Token ID290
Feature activation-4.95e-02

Pos logit contributions
T+0.112
Ar+0.109
Bl+0.108
 fiber+0.108
engu+0.105
Neg logit contributions
 down-0.085
 up-0.080
 over-0.066
 alone-0.061
arted-0.058
Token a
Token ID257
Feature activation-8.40e-02

Pos logit contributions
engu+0.093
T+0.088
ork+0.086
 cents+0.085
opp+0.085
Neg logit contributions
 down-0.081
 up-0.074
 over-0.060
 forth-0.060
 alone-0.058
Token scarf
Token ID36953
Feature activation-6.35e-02

Pos logit contributions
T+0.125
engu+0.123
ves+0.114
lan+0.114
paste+0.112
Neg logit contributions
 down-0.166
 up-0.149
 alone-0.132
?!-0.129
 forth-0.127
Token.
Token ID13
Feature activation-2.35e-02

Pos logit contributions
T+0.142
 fiber+0.139
Bl+0.138
Ar+0.138
engu+0.134
Neg logit contributions
 down-0.096
 up-0.091
 over-0.072
 alone-0.066
 back-0.062
Token They
Token ID1119
Feature activation-3.24e-02

Pos logit contributions
 spe+0.048
paste+0.048
engu+0.046
 bead+0.045
Snow+0.044
Neg logit contributions
 down-0.034
 up-0.030
arted-0.027
 Sally-0.027
 Sarah-0.026
Token lay
Token ID3830
Feature activation-2.47e-02

Pos logit contributions
engu+0.063
paste+0.062
Snow+0.062
bread+0.061
ork+0.061
Neg logit contributions
 down-0.043
 up-0.040
 alone-0.038
 over-0.036
 forth-0.036
Token down
Token ID866
Feature activation-2.34e-02

Pos logit contributions
engu+0.061
T+0.061
paste+0.061
 spe+0.060
Snow+0.059
Neg logit contributions
 down-0.026
 up-0.023
arted-0.020
 forth-0.018
ivated-0.018
Token some
Token ID617
Feature activation-6.03e-02

Pos logit contributions
Ar+0.114
Snow+0.113
T+0.113
By+0.113
Bl+0.112
Neg logit contributions
 up-0.095
 down-0.094
 over-0.075
 around-0.062
arted-0.061
Token sa
Token ID473
Feature activation-6.20e-02

Pos logit contributions
T+0.099
Snow+0.094
Bl+0.092
By+0.092
Ar+0.090
Neg logit contributions
 down-0.118
 up-0.117
 over-0.095
 alone-0.094
 gonna-0.091
Tokenus
Token ID385
Feature activation-5.83e-02

Pos logit contributions
 spe+0.118
Bl+0.112
tr+0.108
By+0.108
 souven+0.108
Neg logit contributions
 down-0.096
 up-0.084
ying-0.078
rying-0.076
arted-0.074
Tokenages
Token ID1095
Feature activation-5.78e-02

Pos logit contributions
 spe+0.077
By+0.073
Ar+0.072
 bead+0.072
Bl+0.070
Neg logit contributions
 down-0.124
 up-0.118
rying-0.098
 alone-0.097
arted-0.095
Token and
Token ID290
Feature activation-5.24e-02

Pos logit contributions
Ar+0.121
 spe+0.119
Bl+0.119
Snow+0.118
T+0.117
Neg logit contributions
 down-0.095
 up-0.090
 over-0.073
 alone-0.070
arted-0.066
Token some
Token ID617
Feature activation-8.39e-02

Pos logit contributions
Bl+0.084
engu+0.082
Snow+0.082
ork+0.081
Ar+0.080
Neg logit contributions
 down-0.091
 up-0.087
 over-0.077
 forth-0.074
 Sally-0.072
Token bread
Token ID8509
Feature activation-7.09e-02

Pos logit contributions
T+0.134
Bl+0.127
Snow+0.126
By+0.124
Ar+0.123
Neg logit contributions
 down-0.158
 up-0.153
 alone-0.127
 over-0.123
 gonna-0.119
Token.
Token ID13
Feature activation-6.85e-03

Pos logit contributions
Bl+0.150
T+0.149
Ar+0.149
Snow+0.143
By+0.141
Neg logit contributions
 down-0.109
 up-0.103
 over-0.084
 alone-0.076
arted-0.074
Token She
Token ID1375
Feature activation-3.71e-02

Pos logit contributions
 spe+0.015
engu+0.015
paste+0.014
Snow+0.014
T+0.014
Neg logit contributions
 down-0.009
 up-0.008
arted-0.007
?!-0.007
ivated-0.007
Token gave
Token ID2921
Feature activation-8.90e-03

Pos logit contributions
Snow+0.070
Bl+0.069
engu+0.069
 souven+0.068
 spe+0.067
Neg logit contributions
 up-0.049
 down-0.044
?!-0.041
 over-0.041
!?-0.038
Token a
Token ID257
Feature activation-6.20e-02

Pos logit contributions
 spe+0.020
engu+0.020
paste+0.020
Snow+0.020
bread+0.020
Neg logit contributions
 down-0.012
 up-0.012
arted-0.009
 forth-0.008
 over-0.008
Token he
Token ID339
Feature activation+2.47e-02

Pos logit contributions
 down+0.032
 up+0.031
arted+0.028
?!+0.028
 alone+0.027
Neg logit contributions
paste-0.031
 spe-0.030
Snow-0.030
T-0.030
engu-0.030
Token ate
Token ID15063
Feature activation-2.20e-02

Pos logit contributions
 down+0.044
 up+0.042
arted+0.042
ivated+0.042
?!+0.040
Neg logit contributions
 spe-0.038
paste-0.038
T-0.038
Snow-0.037
engu-0.036
Token all
Token ID477
Feature activation+2.27e-03

Pos logit contributions
T+0.045
Snow+0.045
paste+0.045
engu+0.044
 spe+0.044
Neg logit contributions
 down-0.032
 up-0.032
arted-0.026
?!-0.025
 alone-0.025
Token the
Token ID262
Feature activation-3.80e-02

Pos logit contributions
ivated+0.004
 down+0.004
arted+0.004
?!+0.003
 gonna+0.003
Neg logit contributions
T-0.004
 spe-0.004
paste-0.004
engu-0.004
Snow-0.004
Token ch
Token ID442
Feature activation-1.63e-02

Pos logit contributions
T+0.065
paste+0.063
Snow+0.062
engu+0.061
 spe+0.060
Neg logit contributions
 up-0.068
 down-0.068
 alone-0.057
?!-0.057
 forth-0.055
Tokenocol
Token ID4668
Feature activation-8.38e-02

Pos logit contributions
T+0.033
Snow+0.033
Ar+0.031
tr+0.031
paste+0.031
Neg logit contributions
arted-0.023
 down-0.023
 up-0.021
ivated-0.020
 alone-0.020
Tokenates
Token ID689
Feature activation-2.66e-02

Pos logit contributions
 spe+0.156
Snow+0.155
T+0.154
 pill+0.148
 souven+0.148
Neg logit contributions
 down-0.122
arted-0.116
ags-0.114
 up-0.113
zie-0.105
Token and
Token ID290
Feature activation-5.50e-03

Pos logit contributions
 spe+0.045
Snow+0.045
T+0.045
paste+0.045
engu+0.044
Neg logit contributions
 down-0.046
 up-0.044
arted-0.040
ivated-0.039
?!-0.039
Token sweets
Token ID42402
Feature activation-4.04e-02

Pos logit contributions
engu+0.010
T+0.010
Snow+0.009
paste+0.009
 spe+0.009
Neg logit contributions
 down-0.010
 up-0.009
 forth-0.008
arted-0.008
 alone-0.008
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
T+0.075
engu+0.074
Snow+0.074
Ar+0.073
 spe+0.072
Neg logit contributions
 down-0.068
 up-0.066
 alone-0.055
?!-0.054
ivated-0.054
Token Then
Token ID3244
Feature activation-3.27e-03

Pos logit contributions
 down+0.024
 up+0.023
ivated+0.023
arted+0.023
?!+0.022
Neg logit contributions
paste-0.026
T-0.026
Snow-0.026
 spe-0.026
engu-0.026
Tokenman
Token ID805
Feature activation-3.06e-02

Pos logit contributions
T+0.090
Bl+0.084
 cents+0.082
 fiber+0.081
Ar+0.081
Neg logit contributions
 down-0.078
 up-0.071
 back-0.056
 over-0.055
 forth-0.050
Token a
Token ID257
Feature activation-6.92e-02

Pos logit contributions
 spe+0.054
Ar+0.053
T+0.052
By+0.052
Bl+0.052
Neg logit contributions
 down-0.056
 up-0.056
 over-0.045
 back-0.042
 alone-0.042
Token carrot
Token ID40562
Feature activation-6.01e-02

Pos logit contributions
T+0.108
lan+0.103
paste+0.103
Snow+0.102
engu+0.102
Neg logit contributions
 down-0.129
 up-0.117
?!-0.104
?!"-0.098
 gonna-0.098
Token nose
Token ID9686
Feature activation-4.98e-02

Pos logit contributions
T+0.139
Bl+0.134
Ar+0.130
Ch+0.123
By+0.123
Neg logit contributions
 down-0.093
 up-0.086
 over-0.064
 back-0.064
 alone-0.057
Token and
Token ID290
Feature activation-4.73e-02

Pos logit contributions
Ar+0.111
Bl+0.107
T+0.106
engu+0.105
 fiber+0.103
Neg logit contributions
 down-0.075
 up-0.073
 over-0.059
 alone-0.050
arted-0.050
Token a
Token ID257
Feature activation-8.38e-02

Pos logit contributions
engu+0.087
ork+0.083
T+0.082
Bl+0.081
Snow+0.081
Neg logit contributions
 down-0.079
 up-0.072
 forth-0.062
 over-0.060
 Sarah-0.056
Token hat
Token ID6877
Feature activation-5.35e-02

Pos logit contributions
T+0.128
engu+0.125
lan+0.118
ves+0.116
paste+0.115
Neg logit contributions
 down-0.162
 up-0.144
 alone-0.127
?!-0.123
 forth-0.123
Token and
Token ID290
Feature activation-4.95e-02

Pos logit contributions
T+0.112
Ar+0.109
Bl+0.108
 fiber+0.108
engu+0.105
Neg logit contributions
 down-0.085
 up-0.080
 over-0.066
 alone-0.061
arted-0.058
Token a
Token ID257
Feature activation-8.40e-02

Pos logit contributions
engu+0.093
T+0.088
ork+0.086
 cents+0.085
opp+0.085
Neg logit contributions
 down-0.081
 up-0.074
 over-0.060
 forth-0.060
 alone-0.058
Token scarf
Token ID36953
Feature activation-6.35e-02

Pos logit contributions
T+0.125
engu+0.123
ves+0.114
lan+0.114
paste+0.112
Neg logit contributions
 down-0.166
 up-0.149
 alone-0.132
?!-0.129
 forth-0.127
Token.
Token ID13
Feature activation-2.35e-02

Pos logit contributions
T+0.142
 fiber+0.139
Bl+0.138
Ar+0.138
engu+0.134
Neg logit contributions
 down-0.096
 up-0.091
 over-0.072
 alone-0.066
 back-0.062
Token a
Token ID257
Feature activation-6.50e-02

Pos logit contributions
Ar+0.048
Bl+0.047
engu+0.047
T+0.047
By+0.046
Neg logit contributions
 down-0.051
 up-0.051
 alone-0.043
?!-0.041
 forth-0.039
Token t
Token ID256
Feature activation-4.31e-02

Pos logit contributions
lan+0.113
 spe+0.112
engu+0.111
Bl+0.110
Snow+0.109
Neg logit contributions
 down-0.110
 up-0.104
?!-0.091
 alone-0.088
!?-0.084
Tokeneddy
Token ID21874
Feature activation-3.80e-02

Pos logit contributions
 spe+0.074
Bl+0.073
Ar+0.070
ves+0.068
engu+0.068
Neg logit contributions
 down-0.075
 up-0.070
arted-0.064
?!-0.062
 alone-0.060
Token bear
Token ID6842
Feature activation-4.58e-02

Pos logit contributions
engu+0.083
 spe+0.079
paste+0.078
T+0.078
Ar+0.078
Neg logit contributions
 down-0.053
 up-0.048
 forth-0.041
 alone-0.040
?!-0.040
Token and
Token ID290
Feature activation-4.21e-02

Pos logit contributions
Ar+0.097
Bl+0.096
 spe+0.093
 fiber+0.093
engu+0.092
Neg logit contributions
 down-0.072
 up-0.068
 alone-0.058
 over-0.056
 back-0.051
Token a
Token ID257
Feature activation-8.36e-02

Pos logit contributions
engu+0.079
Ar+0.074
Bl+0.072
 spe+0.072
 cents+0.070
Neg logit contributions
 down-0.071
 up-0.069
 forth-0.055
 Sarah-0.054
 alone-0.054
Token glass
Token ID5405
Feature activation-6.63e-02

Pos logit contributions
engu+0.146
lan+0.139
ves+0.136
Bl+0.135
T+0.135
Neg logit contributions
 down-0.145
 up-0.133
 alone-0.119
?!-0.117
!?-0.109
Token of
Token ID286
Feature activation-5.56e-02

Pos logit contributions
T+0.132
engu+0.131
Bl+0.130
Ar+0.126
 fiber+0.122
Neg logit contributions
 down-0.110
 up-0.103
 alone-0.097
 over-0.084
 back-0.082
Token water
Token ID1660
Feature activation-4.10e-02

Pos logit contributions
engu+0.098
T+0.098
Bl+0.092
paste+0.091
ves+0.091
Neg logit contributions
 down-0.099
 up-0.097
 alone-0.084
?!-0.078
 my-0.077
Token.
Token ID13
Feature activation-9.35e-03

Pos logit contributions
T+0.078
Ar+0.078
Bl+0.077
engu+0.077
 spe+0.074
Neg logit contributions
 down-0.070
 up-0.066
 alone-0.057
arted-0.053
 over-0.052
Token She
Token ID1375
Feature activation-4.27e-02

Pos logit contributions
 spe+0.021
paste+0.021
engu+0.021
Snow+0.021
Ar+0.020
Neg logit contributions
 down-0.011
 up-0.010
arted-0.009
?!-0.009
ivated-0.009
Token her
Token ID607
Feature activation-1.97e-02

Pos logit contributions
 down+0.008
arted+0.007
 up+0.007
ivated+0.007
?!+0.007
Neg logit contributions
T-0.013
paste-0.013
Snow-0.013
 spe-0.013
engu-0.012
Token a
Token ID257
Feature activation-2.15e-02

Pos logit contributions
paste+0.023
 spe+0.023
Snow+0.022
T+0.022
engu+0.022
Neg logit contributions
 down-0.044
 up-0.042
arted-0.041
ivated-0.040
?!-0.040
Token box
Token ID3091
Feature activation+5.24e-03

Pos logit contributions
paste+0.032
 spe+0.032
T+0.032
engu+0.032
Snow+0.031
Neg logit contributions
 down-0.040
arted-0.038
 up-0.038
ivated-0.037
?!-0.036
Token of
Token ID286
Feature activation-2.22e-02

Pos logit contributions
 down+0.010
 up+0.010
arted+0.009
?!+0.009
 alone+0.009
Neg logit contributions
 spe-0.008
paste-0.008
T-0.008
Snow-0.008
engu-0.008
Token ch
Token ID442
Feature activation-1.40e-02

Pos logit contributions
paste+0.032
Snow+0.032
T+0.031
 spe+0.031
engu+0.031
Neg logit contributions
 down-0.043
 up-0.043
arted-0.038
?!-0.038
 gonna-0.038
Tokenocol
Token ID4668
Feature activation-8.31e-02

Pos logit contributions
Snow+0.027
paste+0.027
T+0.027
 spe+0.027
engu+0.026
Neg logit contributions
 down-0.020
 up-0.019
arted-0.019
ivated-0.018
?!-0.018
Tokenates
Token ID689
Feature activation-3.87e-02

Pos logit contributions
 spe+0.148
Snow+0.144
T+0.141
bread+0.140
 souven+0.139
Neg logit contributions
ags-0.126
 down-0.124
arted-0.119
 up-0.113
 forth-0.109
Token.
Token ID13
Feature activation+2.52e-02

Pos logit contributions
 spe+0.068
Ar+0.067
Snow+0.066
engu+0.066
T+0.065
Neg logit contributions
 down-0.069
 up-0.067
?!-0.058
arted-0.057
 alone-0.056
Token Lily
Token ID20037
Feature activation-7.02e-03

Pos logit contributions
arted+0.044
 forth+0.044
ivated+0.044
?!+0.044
 down+0.043
Neg logit contributions
paste-0.037
 spe-0.037
T-0.037
Snow-0.036
 ham-0.036
Token opened
Token ID4721
Feature activation+1.20e-03

Pos logit contributions
paste+0.011
 spe+0.010
Snow+0.010
T+0.010
engu+0.010
Neg logit contributions
 down-0.013
 up-0.013
arted-0.012
ivated-0.012
?!-0.012
Token the
Token ID262
Feature activation-4.23e-03

Pos logit contributions
 down+0.002
arted+0.002
ivated+0.002
?!+0.002
 up+0.002
Neg logit contributions
 spe-0.002
T-0.002
Snow-0.002
paste-0.002
engu-0.002
Token noticed
Token ID6810
Feature activation-1.77e-02

Pos logit contributions
engu+0.012
 spe+0.012
paste+0.011
 souven+0.011
Snow+0.011
Neg logit contributions
 down-0.010
 up-0.010
arted-0.009
?!-0.009
 alone-0.009
Token something
Token ID1223
Feature activation-1.56e-03

Pos logit contributions
Ar+0.037
 spe+0.036
engu+0.036
bread+0.033
Bl+0.033
Neg logit contributions
 up-0.028
 down-0.025
?!-0.022
!?-0.022
 alone-0.021
Token.
Token ID13
Feature activation-8.47e-03

Pos logit contributions
paste+0.003
 spe+0.003
T+0.003
Snow+0.003
engu+0.003
Neg logit contributions
 down-0.003
 up-0.002
arted-0.002
ivated-0.002
?!-0.002
Token The
Token ID383
Feature activation+6.03e-03

Pos logit contributions
 spe+0.018
Snow+0.017
Ar+0.017
paste+0.017
engu+0.017
Neg logit contributions
 down-0.011
 up-0.010
?!-0.009
arted-0.009
ivated-0.009
Token ch
Token ID442
Feature activation+1.59e-03

Pos logit contributions
 down+0.009
arted+0.009
 up+0.008
ivated+0.008
?!+0.008
Neg logit contributions
 spe-0.011
paste-0.011
Snow-0.011
T-0.011
engu-0.011
Tokenocol
Token ID4668
Feature activation-8.27e-02

Pos logit contributions
 down+0.002
 up+0.002
arted+0.002
ivated+0.002
?!+0.002
Neg logit contributions
paste-0.003
 spe-0.003
T-0.003
Snow-0.003
engu-0.003
Tokenates
Token ID689
Feature activation-2.05e-02

Pos logit contributions
 spe+0.152
 souven+0.150
T+0.144
Snow+0.144
engu+0.140
Neg logit contributions
 down-0.125
ags-0.118
 up-0.112
arted-0.112
zie-0.107
Token were
Token ID547
Feature activation-6.46e-03

Pos logit contributions
Snow+0.028
engu+0.028
T+0.028
 spe+0.027
paste+0.027
Neg logit contributions
 down-0.042
 up-0.041
arted-0.038
ivated-0.038
?!-0.037
Token bitter
Token ID12922
Feature activation-1.18e-02

Pos logit contributions
T+0.013
Snow+0.012
engu+0.012
Ar+0.012
paste+0.012
Neg logit contributions
 down-0.010
 up-0.010
arted-0.009
 alone-0.008
 gonna-0.008
Token.
Token ID13
Feature activation-8.04e-03

Pos logit contributions
T+0.021
engu+0.020
Snow+0.020
Ar+0.020
 spe+0.020
Neg logit contributions
 down-0.021
 up-0.020
arted-0.018
?!-0.018
 alone-0.017
Token They
Token ID1119
Feature activation+1.34e-02

Pos logit contributions
 spe+0.016
Snow+0.016
Ar+0.016
engu+0.016
paste+0.016
Neg logit contributions
 down-0.011
 up-0.010
arted-0.010
?!-0.009
ivated-0.009
Token each
Token ID1123
Feature activation-5.93e-02

Pos logit contributions
 spe+0.057
paste+0.056
T+0.053
 bead+0.052
Ar+0.052
Neg logit contributions
 down-0.062
 up-0.061
arted-0.050
 alone-0.050
 over-0.049
Token a
Token ID257
Feature activation-6.63e-02

Pos logit contributions
paste+0.112
 spe+0.110
engu+0.107
 bead+0.103
Bl+0.102
Neg logit contributions
 down-0.102
 up-0.097
 over-0.076
 alone-0.074
 my-0.070
Token warm
Token ID5814
Feature activation-5.45e-02

Pos logit contributions
paste+0.107
T+0.106
Snow+0.104
 spe+0.103
lan+0.102
Neg logit contributions
 down-0.116
 up-0.111
?!-0.102
?!"-0.097
!?-0.095
Token blanket
Token ID18447
Feature activation-5.09e-02

Pos logit contributions
T+0.102
Bl+0.100
engu+0.100
paste+0.100
lan+0.097
Neg logit contributions
 down-0.091
 up-0.086
?!-0.071
 alone-0.071
?!"-0.068
Token and
Token ID290
Feature activation-4.96e-02

Pos logit contributions
T+0.100
Bl+0.097
engu+0.097
Ar+0.096
Snow+0.094
Neg logit contributions
 down-0.084
 up-0.081
 alone-0.069
 over-0.065
arted-0.062
Token a
Token ID257
Feature activation-8.27e-02

Pos logit contributions
engu+0.087
T+0.085
Bl+0.083
ib+0.080
Snow+0.080
Neg logit contributions
 down-0.084
 up-0.082
 alone-0.064
 gonna-0.064
 forth-0.063
Token candy
Token ID18550
Feature activation-7.71e-02

Pos logit contributions
T+0.137
engu+0.137
paste+0.132
Bl+0.131
lan+0.130
Neg logit contributions
 down-0.140
 up-0.129
?!-0.116
 alone-0.110
!?-0.109
Token cane
Token ID33009
Feature activation-5.58e-02

Pos logit contributions
T+0.167
Bl+0.162
engu+0.157
Ar+0.156
Ch+0.155
Neg logit contributions
 down-0.110
 up-0.102
 alone-0.081
 back-0.076
 outside-0.073
Token.
Token ID13
Feature activation-3.44e-02

Pos logit contributions
T+0.112
engu+0.107
Bl+0.107
Ar+0.107
Snow+0.105
Neg logit contributions
 down-0.087
 up-0.083
 alone-0.072
 over-0.064
?!-0.063
Token He
Token ID679
Feature activation-3.60e-02

Pos logit contributions
 spe+0.064
paste+0.063
T+0.063
engu+0.062
Snow+0.062
Neg logit contributions
 down-0.053
 up-0.050
arted-0.045
?!-0.044
ivated-0.044
Token snapped
Token ID20821
Feature activation-2.80e-02

Pos logit contributions
T+0.068
bread+0.068
lan+0.068
paste+0.068
Snow+0.068
Neg logit contributions
 up-0.047
 down-0.043
 alone-0.038
?!-0.037
 over-0.036
Tokenbled
Token ID9342
Feature activation-1.52e-02

Pos logit contributions
 down+0.011
 up+0.011
arted+0.010
ivated+0.010
?!+0.010
Neg logit contributions
paste-0.014
T-0.014
 spe-0.014
Snow-0.014
engu-0.013
Token up
Token ID510
Feature activation-1.27e-02

Pos logit contributions
paste+0.043
Snow+0.043
T+0.043
 spe+0.043
engu+0.043
Neg logit contributions
 down-0.010
 up-0.010
arted-0.006
ivated-0.005
 forth-0.004
Token all
Token ID477
Feature activation+7.97e-03

Pos logit contributions
paste+0.030
Ar+0.030
engu+0.030
Snow+0.029
T+0.029
Neg logit contributions
 up-0.015
 down-0.015
 alone-0.011
arted-0.011
ivated-0.010
Token the
Token ID262
Feature activation-2.96e-02

Pos logit contributions
 down+0.012
ivated+0.012
arted+0.011
?!+0.011
 gonna+0.011
Neg logit contributions
 spe-0.014
paste-0.014
T-0.014
Snow-0.014
engu-0.014
Token ch
Token ID442
Feature activation-1.36e-02

Pos logit contributions
T+0.045
paste+0.044
Snow+0.043
 spe+0.042
lan+0.041
Neg logit contributions
 down-0.058
 up-0.057
 alone-0.050
 forth-0.049
ivated-0.049
Tokenocol
Token ID4668
Feature activation-8.16e-02

Pos logit contributions
T+0.027
Snow+0.026
paste+0.025
 spe+0.025
Ar+0.024
Neg logit contributions
 down-0.020
arted-0.020
 up-0.019
ivated-0.018
 alone-0.018
Tokenates
Token ID689
Feature activation-3.03e-02

Pos logit contributions
 spe+0.149
Snow+0.145
T+0.144
 souven+0.139
 pill+0.136
Neg logit contributions
 down-0.123
ags-0.118
arted-0.117
 up-0.113
zie-0.107
Token,
Token ID11
Feature activation-1.51e-02

Pos logit contributions
Snow+0.052
T+0.051
 spe+0.051
paste+0.050
Ar+0.050
Neg logit contributions
 down-0.054
 up-0.053
arted-0.046
ivated-0.046
?!-0.045
Token and
Token ID290
Feature activation-1.96e-03

Pos logit contributions
engu+0.027
T+0.027
paste+0.027
Snow+0.027
 spe+0.026
Neg logit contributions
 down-0.025
 up-0.024
arted-0.022
ivated-0.021
 alone-0.021
Token she
Token ID673
Feature activation-8.51e-03

Pos logit contributions
T+0.004
engu+0.004
Snow+0.004
Ar+0.004
paste+0.003
Neg logit contributions
 down-0.003
 up-0.003
 forth-0.003
arted-0.003
 alone-0.003
Token was
Token ID373
Feature activation+1.58e-02

Pos logit contributions
Snow+0.015
paste+0.015
T+0.015
engu+0.015
bread+0.014
Neg logit contributions
 up-0.014
 down-0.014
 alone-0.012
arted-0.012
?!-0.011
Token as
Token ID355
Feature activation-2.04e-02

Pos logit contributions
T+0.033
 spe+0.033
Snow+0.033
paste+0.032
engu+0.032
Neg logit contributions
 down-0.034
 up-0.033
arted-0.030
 alone-0.030
?!-0.030
Token he
Token ID339
Feature activation-5.78e-03

Pos logit contributions
engu+0.035
paste+0.035
Snow+0.034
 spe+0.034
T+0.034
Neg logit contributions
 down-0.035
 up-0.035
?!-0.030
arted-0.030
 alone-0.030
Token opened
Token ID4721
Feature activation+2.38e-03

Pos logit contributions
paste+0.009
Snow+0.009
engu+0.009
 spe+0.009
T+0.009
Neg logit contributions
 down-0.010
 up-0.010
arted-0.010
?!-0.009
 alone-0.009
Token the
Token ID262
Feature activation+6.10e-03

Pos logit contributions
 down+0.003
arted+0.003
ivated+0.003
 up+0.003
?!+0.003
Neg logit contributions
 spe-0.004
paste-0.004
T-0.004
Snow-0.004
engu-0.004
Token ch
Token ID442
Feature activation-7.79e-03

Pos logit contributions
 down+0.008
 up+0.007
arted+0.007
ivated+0.007
?!+0.007
Neg logit contributions
paste-0.013
 spe-0.013
T-0.013
Snow-0.013
engu-0.013
Tokenocol
Token ID4668
Feature activation-8.14e-02

Pos logit contributions
paste+0.013
T+0.013
 spe+0.013
Snow+0.013
engu+0.013
Neg logit contributions
 down-0.013
 up-0.012
arted-0.012
ivated-0.012
?!-0.011
Tokenates
Token ID689
Feature activation-2.09e-02

Pos logit contributions
Snow+0.150
T+0.148
 spe+0.145
 pill+0.139
paste+0.136
Neg logit contributions
 down-0.125
arted-0.121
 up-0.117
ags-0.117
?!-0.110
Token and
Token ID290
Feature activation-3.82e-03

Pos logit contributions
 spe+0.035
Snow+0.035
paste+0.035
T+0.034
engu+0.034
Neg logit contributions
 down-0.037
 up-0.035
arted-0.033
ivated-0.032
?!-0.032
Token started
Token ID2067
Feature activation-2.35e-02

Pos logit contributions
T+0.006
paste+0.006
Snow+0.006
 spe+0.006
engu+0.006
Neg logit contributions
 down-0.006
 up-0.006
arted-0.006
ivated-0.006
?!-0.006
Token eating
Token ID6600
Feature activation-3.68e-02

Pos logit contributions
paste+0.043
Snow+0.042
T+0.042
 spe+0.042
engu+0.042
Neg logit contributions
 down-0.037
 up-0.036
arted-0.034
ivated-0.032
?!-0.032
Token them
Token ID606
Feature activation+3.07e-03

Pos logit contributions
T+0.071
Snow+0.070
paste+0.069
engu+0.068
 spe+0.067
Neg logit contributions
 down-0.060
 up-0.058
 alone-0.047
?!-0.046
arted-0.046
Token gave
Token ID2921
Feature activation-1.22e-02

Pos logit contributions
engu+0.057
bread+0.057
Ar+0.056
 cents+0.055
paste+0.055
Neg logit contributions
 down-0.045
 up-0.043
 alone-0.042
 forth-0.036
 over-0.035
Token her
Token ID607
Feature activation-4.46e-02

Pos logit contributions
bread+0.034
Ar+0.034
engu+0.033
 spe+0.033
T+0.033
Neg logit contributions
 up-0.012
 down-0.012
 alone-0.007
 forth-0.007
 over-0.006
Token a
Token ID257
Feature activation-6.32e-02

Pos logit contributions
Ar+0.088
 spe+0.084
Bl+0.084
By+0.081
engu+0.080
Neg logit contributions
 up-0.078
 down-0.075
 over-0.060
 back-0.060
 alone-0.058
Token kiss
Token ID9245
Feature activation-5.13e-02

Pos logit contributions
T+0.102
Bl+0.101
 spe+0.101
Snow+0.099
engu+0.098
Neg logit contributions
 down-0.109
 up-0.106
?!-0.090
 alone-0.086
!?-0.084
Token and
Token ID290
Feature activation-4.17e-02

Pos logit contributions
engu+0.120
Ar+0.120
 fiber+0.114
Bl+0.114
 spe+0.112
Neg logit contributions
 down-0.078
 up-0.069
 over-0.056
 back-0.053
 alone-0.051
Token a
Token ID257
Feature activation-8.12e-02

Pos logit contributions
engu+0.079
Bl+0.074
Ar+0.074
T+0.073
ib+0.072
Neg logit contributions
 down-0.067
 up-0.066
 Sarah-0.053
 over-0.052
 forth-0.051
Token hug
Token ID16225
Feature activation-4.21e-02

Pos logit contributions
T+0.129
engu+0.127
 spe+0.125
Bl+0.124
Snow+0.123
Neg logit contributions
 down-0.144
 up-0.135
 alone-0.117
?!-0.115
!?-0.109
Token.
Token ID13
Feature activation-1.62e-02

Pos logit contributions
Ar+0.100
engu+0.099
 spe+0.094
 fiber+0.093
Bl+0.093
Neg logit contributions
 down-0.062
 up-0.055
 over-0.044
 alone-0.043
 back-0.039
Token They
Token ID1119
Feature activation-3.36e-02

Pos logit contributions
 spe+0.032
engu+0.031
paste+0.031
Snow+0.030
T+0.030
Neg logit contributions
 down-0.024
 up-0.022
arted-0.020
?!-0.019
 alone-0.019
Token said
Token ID531
Feature activation-4.71e-02

Pos logit contributions
engu+0.065
paste+0.064
Ar+0.063
bread+0.063
 cents+0.063
Neg logit contributions
 down-0.045
 up-0.043
 alone-0.041
 forth-0.036
 over-0.036
Token good
Token ID922
Feature activation-3.49e-02

Pos logit contributions
Ar+0.105
tr+0.097
T+0.096
engu+0.095
Snow+0.094
Neg logit contributions
 down-0.065
 up-0.064
 alone-0.051
 sorry-0.048
 Jane-0.047
TokenL
Token ID43
Feature activation+8.71e-03

Pos logit contributions
engu+0.062
 spe+0.058
lan+0.054
 allowance+0.053
bread+0.053
Neg logit contributions
 down-0.060
?!-0.057
 up-0.057
ivated-0.056
!?-0.055
Tokenily
Token ID813
Feature activation+1.11e-03

Pos logit contributions
 down+0.017
 up+0.016
arted+0.016
ivated+0.015
?!+0.015
Neg logit contributions
 spe-0.013
paste-0.013
Snow-0.013
T-0.013
engu-0.013
Token,
Token ID11
Feature activation+3.25e-03

Pos logit contributions
 down+0.002
 up+0.002
arted+0.002
ivated+0.002
?!+0.002
Neg logit contributions
paste-0.002
 spe-0.002
T-0.002
Snow-0.002
engu-0.002
Token these
Token ID777
Feature activation-1.56e-02

Pos logit contributions
 down+0.005
 up+0.005
arted+0.005
?!+0.005
ivated+0.005
Neg logit contributions
 spe-0.006
paste-0.006
T-0.006
Snow-0.006
engu-0.006
Token ch
Token ID442
Feature activation-1.32e-02

Pos logit contributions
T+0.022
Snow+0.022
 spe+0.022
paste+0.022
engu+0.022
Neg logit contributions
 down-0.030
 up-0.029
arted-0.027
?!-0.027
ivated-0.027
Tokenocol
Token ID4668
Feature activation-8.12e-02

Pos logit contributions
T+0.027
Snow+0.027
 spe+0.026
paste+0.026
By+0.026
Neg logit contributions
 down-0.018
arted-0.017
 up-0.016
?!-0.015
ivated-0.015
Tokenates
Token ID689
Feature activation-8.73e-03

Pos logit contributions
 spe+0.153
 souven+0.149
Snow+0.143
bread+0.141
By+0.140
Neg logit contributions
 down-0.119
ags-0.114
 up-0.108
arted-0.106
asc-0.104
Token are
Token ID389
Feature activation+6.18e-03

Pos logit contributions
paste+0.014
 spe+0.014
Snow+0.014
T+0.014
engu+0.014
Neg logit contributions
 down-0.015
arted-0.014
 up-0.014
ivated-0.014
?!-0.013
Token bitter
Token ID12922
Feature activation-2.15e-03

Pos logit contributions
 down+0.010
 up+0.010
arted+0.009
?!+0.009
ivated+0.009
Neg logit contributions
T-0.011
Snow-0.011
paste-0.011
engu-0.011
 spe-0.010
Token,"
Token ID553
Feature activation-1.19e-02

Pos logit contributions
T+0.004
Snow+0.004
 spe+0.004
engu+0.004
Ar+0.004
Neg logit contributions
 down-0.003
 up-0.003
?!-0.003
arted-0.003
 alone-0.003
Token he
Token ID339
Feature activation-1.50e-02

Pos logit contributions
 spe+0.020
paste+0.019
Snow+0.019
engu+0.019
T+0.019
Neg logit contributions
 down-0.021
 up-0.020
arted-0.019
ivated-0.019
?!-0.018
Token box
Token ID3091
Feature activation-1.55e-03

Pos logit contributions
T+0.001
Snow+0.001
paste+0.001
 spe+0.001
engu+0.001
Neg logit contributions
 down-0.001
 up-0.001
?!-0.001
arted-0.001
 alone-0.001
Token and
Token ID290
Feature activation-8.14e-03

Pos logit contributions
Snow+0.003
paste+0.003
T+0.003
 spe+0.002
engu+0.002
Neg logit contributions
 down-0.003
 up-0.003
arted-0.003
ivated-0.002
 alone-0.002
Token share
Token ID2648
Feature activation-3.97e-02

Pos logit contributions
 spe+0.015
paste+0.015
engu+0.015
Snow+0.015
T+0.015
Neg logit contributions
 down-0.012
arted-0.011
 up-0.011
ivated-0.011
?!-0.011
Token the
Token ID262
Feature activation-1.83e-02

Pos logit contributions
paste+0.099
Snow+0.094
T+0.093
Ar+0.093
 spe+0.092
Neg logit contributions
 up-0.048
 down-0.045
 over-0.030
 alone-0.029
ivated-0.025
Token ch
Token ID442
Feature activation-1.07e-02

Pos logit contributions
paste+0.032
T+0.032
Snow+0.029
engu+0.029
 spe+0.029
Neg logit contributions
 down-0.032
 up-0.031
 alone-0.029
 forth-0.027
?!-0.026
Tokenocol
Token ID4668
Feature activation-8.07e-02

Pos logit contributions
T+0.021
Snow+0.020
 spe+0.020
paste+0.020
By+0.019
Neg logit contributions
 down-0.016
 up-0.015
arted-0.015
?!-0.014
 alone-0.013
Tokenates
Token ID689
Feature activation-5.01e-02

Pos logit contributions
 spe+0.155
T+0.146
Snow+0.144
 souven+0.143
By+0.140
Neg logit contributions
ags-0.117
 down-0.113
 up-0.102
arted-0.100
zie-0.099
Token.
Token ID13
Feature activation-1.88e-02

Pos logit contributions
 spe+0.093
T+0.092
Snow+0.092
paste+0.091
Ar+0.091
Neg logit contributions
 up-0.084
 down-0.082
 alone-0.067
ivated-0.064
 over-0.063
Token They
Token ID1119
Feature activation-1.33e-02

Pos logit contributions
paste+0.037
 spe+0.036
engu+0.035
T+0.034
Snow+0.034
Neg logit contributions
 down-0.029
 up-0.027
arted-0.025
 gonna-0.024
ivated-0.022
Token are
Token ID389
Feature activation-1.13e-02

Pos logit contributions
paste+0.025
T+0.025
Snow+0.025
 spe+0.024
lan+0.023
Neg logit contributions
 down-0.021
 up-0.020
ivated-0.018
?!-0.017
arted-0.017
Token happy
Token ID3772
Feature activation-1.89e-02

Pos logit contributions
 spe+0.021
paste+0.021
Snow+0.020
T+0.020
engu+0.020
Neg logit contributions
 down-0.017
arted-0.016
ivated-0.016
?!-0.016
 up-0.016
Token loved
Token ID6151
Feature activation-1.72e-02

Pos logit contributions
paste+0.014
engu+0.013
Snow+0.013
T+0.013
 spe+0.013
Neg logit contributions
 down-0.013
 up-0.012
 alone-0.011
arted-0.011
?!-0.011
Token Mom
Token ID11254
Feature activation-1.24e-02

Pos logit contributions
paste+0.038
T+0.037
Snow+0.037
Ar+0.036
engu+0.036
Neg logit contributions
 up-0.023
 down-0.022
 alone-0.018
arted-0.017
 forth-0.017
Token and
Token ID290
Feature activation+1.99e-02

Pos logit contributions
T+0.022
Ar+0.022
Snow+0.021
 spe+0.021
engu+0.021
Neg logit contributions
 down-0.021
 up-0.021
 alone-0.018
?!-0.018
arted-0.018
Token they
Token ID484
Feature activation-1.86e-03

Pos logit contributions
 down+0.035
 up+0.033
arted+0.032
ivated+0.031
?!+0.030
Neg logit contributions
paste-0.033
T-0.033
Snow-0.033
 spe-0.033
engu-0.033
Token loved
Token ID6151
Feature activation-2.53e-02

Pos logit contributions
paste+0.003
engu+0.003
T+0.003
 spe+0.003
Snow+0.003
Neg logit contributions
 down-0.004
 up-0.003
arted-0.003
ivated-0.003
 alone-0.003
Token mac
Token ID8352
Feature activation-8.06e-02

Pos logit contributions
paste+0.061
Ar+0.061
engu+0.060
Snow+0.060
T+0.060
Neg logit contributions
 up-0.028
 down-0.027
 alone-0.020
 Jane-0.018
 forth-0.018
Tokenaron
Token ID8045
Feature activation+1.04e-02

Pos logit contributions
 souven+0.154
 spe+0.153
Bl+0.152
engu+0.152
T+0.151
Neg logit contributions
 down-0.121
 up-0.115
 alone-0.104
 over-0.093
?!-0.091
Tokeni
Token ID72
Feature activation-3.06e-02

Pos logit contributions
 down+0.018
 up+0.017
arted+0.016
ivated+0.016
?!+0.016
Neg logit contributions
paste-0.017
 spe-0.017
T-0.017
Snow-0.017
engu-0.017
Token and
Token ID290
Feature activation-2.36e-02

Pos logit contributions
Ar+0.060
T+0.059
Snow+0.057
engu+0.057
Bl+0.057
Neg logit contributions
 down-0.049
 up-0.047
 alone-0.043
 forth-0.037
arted-0.037
Token cheese
Token ID9891
Feature activation-6.30e-02

Pos logit contributions
T+0.036
engu+0.034
Snow+0.032
ves+0.032
Ar+0.031
Neg logit contributions
 down-0.046
 alone-0.042
 up-0.041
 my-0.038
 forth-0.038
Token.
Token ID13
Feature activation-7.31e-03

Pos logit contributions
T+0.139
engu+0.136
Ar+0.131
Bl+0.130
Snow+0.129
Neg logit contributions
 down-0.086
 up-0.083
 alone-0.081
 over-0.063
 forth-0.059
Token queen
Token ID16599
Feature activation-3.49e-02

Pos logit contributions
paste+0.063
T+0.062
engu+0.061
Snow+0.059
 fiber+0.057
Neg logit contributions
 down-0.055
 up-0.050
 alone-0.049
?!-0.045
 forth-0.045
Token.
Token ID13
Feature activation-2.88e-02

Pos logit contributions
T+0.073
 fiber+0.069
engu+0.069
Snow+0.069
Ar+0.068
Neg logit contributions
 down-0.051
 alone-0.047
 up-0.046
 forth-0.039
 over-0.037
Token They
Token ID1119
Feature activation-2.08e-02

Pos logit contributions
paste+0.062
T+0.060
 spe+0.059
Snow+0.057
ves+0.057
Neg logit contributions
 down-0.037
 up-0.034
arted-0.028
 over-0.028
 gonna-0.027
Token wore
Token ID12408
Feature activation-5.18e-02

Pos logit contributions
paste+0.039
T+0.037
Snow+0.037
bread+0.036
lan+0.036
Neg logit contributions
 down-0.031
 up-0.029
 alone-0.028
 forth-0.026
 over-0.025
Token crown
Token ID12389
Feature activation-5.57e-02

Pos logit contributions
paste+0.095
engu+0.092
T+0.091
bread+0.089
Snow+0.088
Neg logit contributions
 up-0.090
 down-0.087
 over-0.066
 alone-0.063
 television-0.062
Tokens
Token ID82
Feature activation-8.04e-02

Pos logit contributions
Ar+0.110
Bl+0.108
 fiber+0.106
 spe+0.105
T+0.105
Neg logit contributions
 down-0.095
 up-0.090
 around-0.069
 alone-0.068
 over-0.068
Token made
Token ID925
Feature activation-3.70e-02

Pos logit contributions
T+0.173
engu+0.170
bread+0.164
Bl+0.164
Ar+0.163
Neg logit contributions
 down-0.113
 up-0.109
 alone-0.100
 over-0.083
 around-0.081
Token of
Token ID286
Feature activation-3.85e-02

Pos logit contributions
T+0.073
Snow+0.071
 spe+0.070
paste+0.069
 fiber+0.068
Neg logit contributions
 up-0.062
 down-0.059
 alone-0.049
arted-0.048
 over-0.045
Token leaves
Token ID5667
Feature activation-5.20e-02

Pos logit contributions
T+0.074
paste+0.073
Snow+0.068
By+0.068
ves+0.067
Neg logit contributions
 down-0.065
 up-0.065
 alone-0.059
 forth-0.053
 television-0.051
Token and
Token ID290
Feature activation-4.93e-02

Pos logit contributions
T+0.121
Ar+0.118
engu+0.115
 fiber+0.115
Bl+0.114
Neg logit contributions
 down-0.074
 up-0.074
 alone-0.063
 over-0.054
 around-0.051
Token flowers
Token ID12734
Feature activation-5.52e-02

Pos logit contributions
T+0.095
engu+0.089
By+0.083
oxy+0.082
itch+0.082
Neg logit contributions
 down-0.083
 up-0.077
 alone-0.072
 forth-0.071
 over-0.065
Token They
Token ID1119
Feature activation-7.38e-03

Pos logit contributions
 down+0.021
 up+0.020
arted+0.020
ivated+0.019
?!+0.019
Neg logit contributions
paste-0.021
 spe-0.021
T-0.020
Snow-0.020
engu-0.020
Token used
Token ID973
Feature activation-6.39e-02

Pos logit contributions
paste+0.014
Snow+0.013
bread+0.012
T+0.012
engu+0.012
Neg logit contributions
 down-0.012
 up-0.011
 alone-0.010
igator-0.010
 gonna-0.010
Token red
Token ID2266
Feature activation-6.79e-02

Pos logit contributions
By+0.146
Snow+0.144
T+0.143
oc+0.142
bread+0.137
Neg logit contributions
 up-0.091
 down-0.086
 over-0.067
 my-0.062
 me-0.054
Token juice
Token ID13135
Feature activation-5.50e-02

Pos logit contributions
tr+0.131
By+0.130
Snow+0.128
paste+0.124
T+0.121
Neg logit contributions
 down-0.109
 up-0.109
 me-0.084
 my-0.083
 over-0.077
Token and
Token ID290
Feature activation-5.32e-02

Pos logit contributions
T+0.119
Snow+0.117
Bl+0.115
 spe+0.114
Ar+0.114
Neg logit contributions
 down-0.084
 up-0.080
 over-0.067
 alone-0.060
arted-0.057
Token white
Token ID2330
Feature activation-8.02e-02

Pos logit contributions
ork+0.091
lor+0.089
tr+0.088
By+0.087
Snow+0.086
Neg logit contributions
 down-0.083
 up-0.079
 my-0.072
 Sally-0.071
 over-0.067
Token powder
Token ID11913
Feature activation-4.01e-02

Pos logit contributions
tr+0.145
By+0.144
Snow+0.133
T+0.130
Co+0.130
Neg logit contributions
 down-0.127
 up-0.121
 me-0.108
 my-0.097
 over-0.092
Token and
Token ID290
Feature activation-4.36e-02

Pos logit contributions
Ar+0.088
T+0.088
Bl+0.086
By+0.084
Snow+0.083
Neg logit contributions
 up-0.064
 down-0.061
 over-0.047
 around-0.043
 alone-0.041
Token mixed
Token ID7668
Feature activation-4.12e-02

Pos logit contributions
lor+0.078
By+0.076
T+0.076
Ar+0.075
ork+0.075
Neg logit contributions
 down-0.069
 up-0.064
 Sally-0.058
 Jane-0.055
 over-0.055
Token them
Token ID606
Feature activation-1.26e-02

Pos logit contributions
paste+0.091
By+0.090
Snow+0.090
T+0.089
engu+0.086
Neg logit contributions
 up-0.062
 down-0.060
 over-0.042
 forth-0.040
?!-0.038
Token in
Token ID287
Feature activation-3.05e-02

Pos logit contributions
T+0.025
Snow+0.024
 spe+0.024
paste+0.024
Ar+0.023
Neg logit contributions
 down-0.020
 up-0.020
 alone-0.016
ivated-0.016
arted-0.015
TokenY
Token ID56
Feature activation+3.40e-03

Pos logit contributions
engu+0.081
lan+0.071
 spe+0.070
bread+0.070
paste+0.069
Neg logit contributions
 down-0.058
ivated-0.054
arted-0.053
 forth-0.052
 up-0.052
Tokenay
Token ID323
Feature activation+2.46e-02

Pos logit contributions
 down+0.006
 up+0.006
arted+0.006
?!+0.006
ivated+0.005
Neg logit contributions
 spe-0.006
paste-0.005
Snow-0.005
T-0.005
Ar-0.005
Token,
Token ID11
Feature activation+2.94e-03

Pos logit contributions
 down+0.040
arted+0.039
ivated+0.038
 up+0.038
?!+0.035
Neg logit contributions
paste-0.042
 spe-0.042
T-0.041
Snow-0.041
engu-0.040
Token you
Token ID345
Feature activation-2.76e-02

Pos logit contributions
 down+0.004
 up+0.004
?!+0.004
 gonna+0.004
ivated+0.004
Neg logit contributions
T-0.006
engu-0.006
Snow-0.006
paste-0.006
 spe-0.006
Token did
Token ID750
Feature activation+2.39e-03

Pos logit contributions
paste+0.039
 spe+0.039
Snow+0.039
T+0.039
engu+0.038
Neg logit contributions
 down-0.054
 up-0.052
arted-0.050
ivated-0.049
?!-0.049
Token it
Token ID340
Feature activation+5.22e-02

Pos logit contributions
 down+0.004
 up+0.004
arted+0.004
ivated+0.004
?!+0.004
Neg logit contributions
paste-0.004
 spe-0.004
T-0.004
Snow-0.004
engu-0.004
Token,
Token ID11
Feature activation+2.29e-02

Pos logit contributions
arted+0.085
ivated+0.080
 down+0.075
 gonna+0.071
 sidelines+0.070
Neg logit contributions
paste-0.090
Snow-0.087
T-0.087
engu-0.084
 spe-0.082
Token Tom
Token ID4186
Feature activation+1.23e-02

Pos logit contributions
 down+0.040
 up+0.038
arted+0.037
ivated+0.036
?!+0.036
Neg logit contributions
paste-0.037
 spe-0.037
Snow-0.037
T-0.037
engu-0.036
Token!"
Token ID2474
Feature activation-3.64e-03

Pos logit contributions
arted+0.019
ivated+0.019
 down+0.019
 up+0.018
?!+0.016
Neg logit contributions
Snow-0.022
paste-0.021
 spe-0.021
T-0.021
engu-0.020
Token Lily
Token ID20037
Feature activation-1.94e-02

Pos logit contributions
paste+0.007
 spe+0.007
T+0.007
Snow+0.007
engu+0.007
Neg logit contributions
 down-0.005
 up-0.005
arted-0.005
ivated-0.005
?!-0.005
Token said
Token ID531
Feature activation-1.21e-02

Pos logit contributions
paste+0.033
 spe+0.033
T+0.032
Snow+0.032
engu+0.032
Neg logit contributions
 down-0.033
 up-0.031
arted-0.030
ivated-0.029
?!-0.029
Token angry
Token ID7954
Feature activation-2.49e-03

Pos logit contributions
?!+0.035
 down+0.034
arted+0.034
!?+0.033
?!"+0.033
Neg logit contributions
 spe-0.076
paste-0.074
Snow-0.072
 ham-0.072
lan-0.071
Token.
Token ID13
Feature activation-2.41e-04

Pos logit contributions
 spe+0.004
Snow+0.004
T+0.004
paste+0.004
engu+0.004
Neg logit contributions
 down-0.004
 up-0.004
arted-0.004
?!-0.004
 alone-0.004
Token They
Token ID1119
Feature activation+1.47e-02

Pos logit contributions
paste+0.000
 spe+0.000
engu+0.000
Snow+0.000
T+0.000
Neg logit contributions
 down-0.000
 up-0.000
arted-0.000
ivated-0.000
?!-0.000
Token want
Token ID765
Feature activation-1.25e-02

Pos logit contributions
 down+0.025
arted+0.024
 up+0.023
?!+0.023
ivated+0.023
Neg logit contributions
 spe-0.023
engu-0.023
Snow-0.023
paste-0.023
T-0.022
Token to
Token ID284
Feature activation+2.97e-02

Pos logit contributions
Snow+0.023
paste+0.023
 spe+0.023
T+0.022
Ar+0.022
Neg logit contributions
 up-0.021
 down-0.020
arted-0.018
 alone-0.018
 over-0.017
Token go
Token ID467
Feature activation+5.09e-02

Pos logit contributions
 down+0.051
arted+0.049
 up+0.048
ivated+0.047
?!+0.045
Neg logit contributions
 spe-0.048
Snow-0.048
engu-0.047
T-0.047
paste-0.046
Token on
Token ID319
Feature activation+2.74e-02

Pos logit contributions
ivated+0.084
uct+0.082
?!+0.079
arted+0.079
!?+0.077
Neg logit contributions
 souven-0.078
 sticker-0.077
Ar-0.074
Snow-0.073
ro-0.073
Token the
Token ID262
Feature activation+3.01e-02

Pos logit contributions
 down+0.042
arted+0.040
 up+0.039
ivated+0.039
?!+0.038
Neg logit contributions
 spe-0.049
T-0.049
Snow-0.048
paste-0.048
engu-0.047
Token slide
Token ID10649
Feature activation+2.87e-02

Pos logit contributions
arted+0.059
ivated+0.058
 gonna+0.055
?!+0.054
 down+0.054
Neg logit contributions
Snow-0.038
 ra-0.037
 sticker-0.037
engu-0.036
 spe-0.036
Token too
Token ID1165
Feature activation-1.71e-02

Pos logit contributions
 down+0.050
 up+0.048
arted+0.045
ivated+0.044
?!+0.044
Neg logit contributions
paste-0.048
 spe-0.047
T-0.047
Snow-0.047
engu-0.047
Token,
Token ID11
Feature activation+2.64e-02

Pos logit contributions
paste+0.031
engu+0.031
T+0.030
Snow+0.030
 spe+0.030
Neg logit contributions
 down-0.029
 up-0.027
 alone-0.025
?!-0.024
arted-0.024
Token a
Token ID257
Feature activation+5.48e-03

Pos logit contributions
 down+0.031
arted+0.030
ivated+0.029
 up+0.029
 forth+0.028
Neg logit contributions
 spe-0.045
paste-0.044
Snow-0.042
T-0.042
engu-0.042
Token rabbit
Token ID22746
Feature activation+3.21e-02

Pos logit contributions
 down+0.009
 up+0.009
?!+0.008
arted+0.008
ivated+0.008
Neg logit contributions
paste-0.009
engu-0.009
T-0.009
Snow-0.009
 spe-0.009
Token.
Token ID13
Feature activation-4.88e-03

Pos logit contributions
 down+0.054
 up+0.052
arted+0.049
ivated+0.048
?!+0.048
Neg logit contributions
paste-0.054
T-0.053
 spe-0.053
Snow-0.053
engu-0.053
Token I
Token ID314
Feature activation-1.46e-02

Pos logit contributions
paste+0.009
 spe+0.009
T+0.009
Snow+0.009
engu+0.009
Neg logit contributions
 down-0.007
 up-0.007
arted-0.006
ivated-0.006
?!-0.006
Token'm
Token ID1101
Feature activation+2.12e-02

Pos logit contributions
paste+0.023
Snow+0.023
T+0.023
 spe+0.023
engu+0.022
Neg logit contributions
 down-0.026
 up-0.025
?!-0.024
arted-0.024
ivated-0.023
Token looking
Token ID2045
Feature activation+4.34e-02

Pos logit contributions
 down+0.030
arted+0.029
ivated+0.028
 up+0.027
 forth+0.027
Neg logit contributions
 spe-0.041
paste-0.040
Snow-0.039
T-0.039
engu-0.038
Token for
Token ID329
Feature activation-4.52e-03

Pos logit contributions
arted+0.070
ivated+0.068
 down+0.067
?!+0.065
 forth+0.064
Neg logit contributions
 spe-0.069
paste-0.069
engu-0.069
T-0.068
 souven-0.067
Token food
Token ID2057
Feature activation-9.28e-03

Pos logit contributions
paste+0.007
 spe+0.007
Snow+0.007
T+0.007
engu+0.007
Neg logit contributions
 down-0.008
 up-0.008
arted-0.008
ivated-0.008
?!-0.008
Token,"
Token ID553
Feature activation-6.30e-03

Pos logit contributions
T+0.017
 spe+0.017
Snow+0.017
engu+0.017
paste+0.017
Neg logit contributions
 down-0.015
 up-0.015
 alone-0.013
?!-0.013
ivated-0.012
Token said
Token ID531
Feature activation+1.92e-03

Pos logit contributions
paste+0.012
 spe+0.012
Snow+0.012
engu+0.011
T+0.011
Neg logit contributions
 down-0.010
 up-0.009
arted-0.009
ivated-0.008
?!-0.008
Token the
Token ID262
Feature activation+1.56e-02

Pos logit contributions
 down+0.003
 up+0.002
arted+0.002
ivated+0.002
?!+0.002
Neg logit contributions
paste-0.004
Snow-0.004
 spe-0.004
engu-0.004
T-0.004
Token and
Token ID290
Feature activation+3.96e-02

Pos logit contributions
 down+0.032
 up+0.031
arted+0.030
ivated+0.029
?!+0.029
Neg logit contributions
paste-0.028
 spe-0.027
T-0.027
Snow-0.027
engu-0.027
Token Sarah
Token ID10490
Feature activation+7.76e-04

Pos logit contributions
 down+0.067
ivated+0.064
arted+0.064
 up+0.064
?!+0.063
Neg logit contributions
 spe-0.065
paste-0.063
T-0.062
Snow-0.061
engu-0.058
Token couldn
Token ID3521
Feature activation+1.12e-02

Pos logit contributions
 down+0.001
 up+0.001
arted+0.001
ivated+0.001
?!+0.001
Neg logit contributions
paste-0.001
Snow-0.001
engu-0.001
 spe-0.001
T-0.001
Token't
Token ID470
Feature activation-4.74e-03

Pos logit contributions
arted+0.022
 down+0.022
 forth+0.022
ivated+0.021
rying+0.020
Neg logit contributions
T-0.015
paste-0.014
Snow-0.014
 spe-0.014
ves-0.013
Token catch
Token ID4929
Feature activation+3.96e-02

Pos logit contributions
 spe+0.008
T+0.007
paste+0.007
Snow+0.007
engu+0.007
Neg logit contributions
 down-0.008
 up-0.008
arted-0.008
ivated-0.008
?!-0.007
Token it
Token ID340
Feature activation+4.99e-02

Pos logit contributions
 down+0.062
arted+0.061
ivated+0.061
?!+0.060
!?+0.057
Neg logit contributions
 spe-0.065
T-0.065
paste-0.064
Snow-0.063
engu-0.060
Token.
Token ID13
Feature activation+3.81e-02

Pos logit contributions
arted+0.107
ivated+0.106
?!+0.100
 down+0.099
 forth+0.099
Neg logit contributions
paste-0.050
T-0.048
 spe-0.047
Snow-0.045
 souven-0.045
Token
Token ID198
Feature activation+1.63e-02

Pos logit contributions
 down+0.076
 up+0.074
ivated+0.073
arted+0.073
 forth+0.071
Neg logit contributions
T-0.050
Snow-0.047
engu-0.046
paste-0.046
 spe-0.045
Token
Token ID198
Feature activation+3.80e-02

Pos logit contributions
 down+0.037
 up+0.035
arted+0.035
ivated+0.034
?!+0.033
Neg logit contributions
T-0.019
Snow-0.018
paste-0.017
engu-0.017
 spe-0.017
TokenSarah
Token ID29284
Feature activation+3.79e-03

Pos logit contributions
 down+0.064
 up+0.061
arted+0.059
ivated+0.057
?!+0.056
Neg logit contributions
T-0.065
Snow-0.064
paste-0.064
 spe-0.063
engu-0.062
Token wanted
Token ID2227
Feature activation+1.31e-02

Pos logit contributions
 down+0.007
 up+0.007
arted+0.006
ivated+0.006
?!+0.006
Neg logit contributions
paste-0.006
 spe-0.006
Snow-0.006
engu-0.006
T-0.006
Token to
Token ID284
Feature activation+3.20e-02

Pos logit contributions
 down+0.041
arted+0.040
ivated+0.040
 up+0.039
?!+0.038
Neg logit contributions
paste-0.025
T-0.025
 spe-0.025
Snow-0.024
engu-0.024
Token find
Token ID1064
Feature activation+1.06e-02

Pos logit contributions
arted+0.059
 down+0.057
ivated+0.055
 up+0.054
rying+0.052
Neg logit contributions
T-0.044
 spe-0.043
 souven-0.042
By-0.042
engu-0.042
Token out
Token ID503
Feature activation+2.08e-02

Pos logit contributions
arted+0.017
ivated+0.017
 down+0.016
?!+0.015
 forth+0.015
Neg logit contributions
paste-0.018
T-0.017
 spe-0.017
engu-0.017
 souven-0.017
Token what
Token ID644
Feature activation+1.48e-02

Pos logit contributions
 down+0.032
arted+0.031
ivated+0.030
 up+0.030
?!+0.029
Neg logit contributions
paste-0.036
 spe-0.036
T-0.036
Snow-0.035
engu-0.035
Token it
Token ID340
Feature activation+4.75e-02

Pos logit contributions
 down+0.025
 up+0.024
arted+0.023
?!+0.023
ivated+0.022
Neg logit contributions
paste-0.025
Snow-0.025
 spe-0.024
engu-0.024
T-0.024
Token all
Token ID477
Feature activation+5.03e-02

Pos logit contributions
 down+0.106
arted+0.105
ivated+0.104
 up+0.101
?!+0.098
Neg logit contributions
 spe-0.050
T-0.049
paste-0.049
Snow-0.048
engu-0.046
Token meant
Token ID4001
Feature activation+9.01e-03

Pos logit contributions
ivated+0.102
arted+0.097
 sidelines+0.092
 gonna+0.091
?!+0.088
Neg logit contributions
 spe-0.061
 would-0.053
 bead-0.053
 souven-0.053
 pill-0.053
Token.
Token ID13
Feature activation+2.56e-02

Pos logit contributions
 down+0.017
 up+0.016
arted+0.016
ivated+0.016
?!+0.015
Neg logit contributions
paste-0.013
 spe-0.013
T-0.013
Snow-0.013
engu-0.013
Token
Token ID220
Feature activation+3.10e-03

Pos logit contributions
 down+0.051
 up+0.049
ivated+0.048
arted+0.048
 forth+0.047
Neg logit contributions
T-0.034
engu-0.034
Snow-0.033
paste-0.032
 spe-0.032
Token
Token ID198
Feature activation+4.54e-04

Pos logit contributions
 down+0.007
 up+0.006
arted+0.006
ivated+0.006
?!+0.006
Neg logit contributions
T-0.004
paste-0.004
Snow-0.004
engu-0.003
 spe-0.003
Token
Token ID198
Feature activation+1.92e-02

Pos logit contributions
 down+0.001
 up+0.001
arted+0.001
ivated+0.001
?!+0.001
Neg logit contributions
 spe-0.001
paste-0.001
Snow-0.001
engu-0.001
T-0.001
Token to
Token ID284
Feature activation+9.08e-03

Pos logit contributions
 down+0.010
 up+0.010
arted+0.009
?!+0.009
 alone+0.009
Neg logit contributions
paste-0.008
Snow-0.008
T-0.008
 spe-0.008
engu-0.008
Token go
Token ID467
Feature activation+3.92e-02

Pos logit contributions
arted+0.018
 down+0.017
ivated+0.016
 up+0.016
ieved+0.015
Neg logit contributions
T-0.013
Snow-0.012
engu-0.012
 spe-0.012
paste-0.012
Token to
Token ID284
Feature activation+1.08e-02

Pos logit contributions
arted+0.092
uct+0.089
ivated+0.088
ieved+0.087
zie+0.081
Neg logit contributions
Snow-0.040
lan-0.039
 pill-0.039
 souven-0.038
T-0.038
Token the
Token ID262
Feature activation+1.57e-02

Pos logit contributions
arted+0.022
ivated+0.020
ieved+0.020
 down+0.019
rying+0.019
Neg logit contributions
T-0.015
Snow-0.014
 pill-0.014
 spe-0.014
engu-0.013
Token park
Token ID3952
Feature activation+3.30e-02

Pos logit contributions
 down+0.021
 up+0.020
arted+0.019
ivated+0.018
?!+0.018
Neg logit contributions
paste-0.032
 spe-0.032
T-0.031
Snow-0.031
engu-0.031
Token.
Token ID13
Feature activation+1.18e-02

Pos logit contributions
arted+0.070
 down+0.066
ivated+0.066
?!+0.065
 gonna+0.065
Neg logit contributions
 spe-0.038
T-0.038
Snow-0.037
lan-0.037
engu-0.036
Token When
Token ID1649
Feature activation-7.74e-03

Pos logit contributions
arted+0.021
 down+0.021
ivated+0.021
 forth+0.021
 up+0.020
Neg logit contributions
T-0.017
Snow-0.017
 spe-0.017
engu-0.017
paste-0.016
Token she
Token ID673
Feature activation+3.10e-03

Pos logit contributions
paste+0.013
 spe+0.013
T+0.013
Snow+0.013
engu+0.013
Neg logit contributions
 down-0.013
 up-0.013
arted-0.012
ivated-0.012
?!-0.012
Token got
Token ID1392
Feature activation+1.55e-02

Pos logit contributions
 down+0.006
 up+0.006
arted+0.006
ivated+0.005
?!+0.005
Neg logit contributions
 spe-0.005
T-0.005
paste-0.005
Snow-0.005
engu-0.004
Token there
Token ID612
Feature activation+4.16e-02

Pos logit contributions
arted+0.027
?!+0.026
ivated+0.026
!?+0.025
 down+0.025
Neg logit contributions
Snow-0.023
 spe-0.023
T-0.023
paste-0.022
lan-0.022
Token,
Token ID11
Feature activation+3.08e-03

Pos logit contributions
 down+0.068
 up+0.065
arted+0.059
ivated+0.059
?!+0.058
Neg logit contributions
paste-0.077
Snow-0.074
 spe-0.074
T-0.074
engu-0.074
Token kind
Token ID1611
Feature activation-2.10e-04

Pos logit contributions
 down+0.035
 up+0.033
arted+0.033
ivated+0.032
?!+0.031
Neg logit contributions
paste-0.034
 spe-0.034
Snow-0.034
T-0.034
engu-0.034
Token friend
Token ID1545
Feature activation+3.72e-03

Pos logit contributions
paste+0.000
Snow+0.000
T+0.000
 spe+0.000
engu+0.000
Neg logit contributions
 down-0.000
 up-0.000
?!-0.000
 alone-0.000
!?-0.000
Token.
Token ID13
Feature activation-2.12e-02

Pos logit contributions
 down+0.006
 up+0.006
?!+0.005
 alone+0.005
arted+0.005
Neg logit contributions
T-0.007
engu-0.006
 spe-0.006
Snow-0.006
paste-0.006
Token You
Token ID921
Feature activation-3.36e-02

Pos logit contributions
paste+0.043
 spe+0.043
engu+0.042
T+0.042
Snow+0.042
Neg logit contributions
 down-0.028
 up-0.026
arted-0.024
?!-0.024
 gonna-0.023
Token should
Token ID815
Feature activation+1.98e-03

Pos logit contributions
paste+0.052
Snow+0.051
 spe+0.051
T+0.050
engu+0.049
Neg logit contributions
 down-0.062
 up-0.060
?!-0.057
ivated-0.057
 gonna-0.055
Token keep
Token ID1394
Feature activation+1.54e-02

Pos logit contributions
 down+0.003
 up+0.003
?!+0.003
ivated+0.003
arted+0.003
Neg logit contributions
paste-0.003
 spe-0.003
T-0.003
engu-0.003
Snow-0.003
Token the
Token ID262
Feature activation+8.52e-03

Pos logit contributions
 down+0.023
arted+0.023
ivated+0.022
 up+0.021
?!+0.021
Neg logit contributions
T-0.028
 spe-0.027
Snow-0.027
paste-0.027
engu-0.026
Token note
Token ID3465
Feature activation+1.73e-02

Pos logit contributions
 up+0.016
 down+0.015
 alone+0.015
?!+0.014
ivated+0.014
Neg logit contributions
paste-0.014
T-0.014
Snow-0.013
engu-0.013
Ar-0.013
Token.
Token ID13
Feature activation-2.29e-02

Pos logit contributions
 down+0.030
 up+0.029
 alone+0.025
?!+0.025
ivated+0.024
Neg logit contributions
 spe-0.031
engu-0.030
T-0.030
paste-0.030
Snow-0.030
Token Maybe
Token ID6674
Feature activation-1.91e-02

Pos logit contributions
 spe+0.048
paste+0.048
engu+0.046
Snow+0.045
T+0.045
Neg logit contributions
 down-0.029
 up-0.027
?!-0.026
 gonna-0.025
arted-0.025
Token you
Token ID345
Feature activation-1.74e-02

Pos logit contributions
paste+0.043
engu+0.042
Snow+0.041
Ar+0.041
Ch+0.041
Neg logit contributions
 up-0.025
 down-0.025
?!-0.021
 gonna-0.019
 forth-0.019
Token
Token ID198
Feature activation+8.78e-03

Pos logit contributions
 spe+0.007
paste+0.007
bread+0.006
engu+0.006
Snow+0.006
Neg logit contributions
 down-0.008
 up-0.008
arted-0.007
?!-0.007
ivated-0.007
TokenHe
Token ID1544
Feature activation+1.14e-02

Pos logit contributions
 down+0.018
 up+0.017
arted+0.017
ivated+0.017
?!+0.017
Neg logit contributions
 spe-0.012
paste-0.011
engu-0.011
bread-0.010
lan-0.010
Token decided
Token ID3066
Feature activation-2.19e-03

Pos logit contributions
 down+0.020
 up+0.019
arted+0.019
ivated+0.018
?!+0.018
Neg logit contributions
paste-0.019
 spe-0.018
T-0.018
Snow-0.018
engu-0.018
Token he
Token ID339
Feature activation+1.09e-02

Pos logit contributions
paste+0.003
T+0.003
Snow+0.003
 spe+0.003
engu+0.003
Neg logit contributions
 down-0.005
 up-0.004
arted-0.004
ivated-0.004
?!-0.004
Token had
Token ID550
Feature activation-1.45e-02

Pos logit contributions
 down+0.024
 up+0.023
arted+0.022
ivated+0.022
?!+0.022
Neg logit contributions
paste-0.013
 spe-0.013
T-0.013
Snow-0.013
engu-0.013
Token to
Token ID284
Feature activation-7.19e-04

Pos logit contributions
paste+0.026
Snow+0.025
engu+0.025
 spe+0.025
T+0.024
Neg logit contributions
 down-0.025
 up-0.025
 alone-0.022
?!-0.022
arted-0.022
Token pack
Token ID2353
Feature activation-6.57e-03

Pos logit contributions
paste+0.001
 spe+0.001
T+0.001
Snow+0.001
engu+0.001
Neg logit contributions
 down-0.001
 up-0.001
arted-0.001
?!-0.001
ivated-0.001
Token everything
Token ID2279
Feature activation+2.15e-02

Pos logit contributions
 spe+0.014
paste+0.014
T+0.014
Snow+0.014
engu+0.013
Neg logit contributions
 down-0.009
 up-0.009
arted-0.008
 alone-0.007
?!-0.007
Token up
Token ID510
Feature activation+1.61e-02

Pos logit contributions
arted+0.035
 down+0.034
ivated+0.034
 up+0.033
?!+0.033
Neg logit contributions
paste-0.035
 spe-0.035
Snow-0.034
T-0.034
engu-0.034
Token and
Token ID290
Feature activation+1.28e-02

Pos logit contributions
 down+0.028
 up+0.027
arted+0.025
ivated+0.025
?!+0.025
Neg logit contributions
paste-0.027
 spe-0.027
Snow-0.027
T-0.026
engu-0.026
Token go
Token ID467
Feature activation+1.99e-02

Pos logit contributions
 down+0.021
arted+0.020
 up+0.020
ivated+0.019
?!+0.019
Neg logit contributions
paste-0.021
 spe-0.021
Snow-0.021
T-0.021
engu-0.021
Token the
Token ID262
Feature activation+6.63e-03

Pos logit contributions
paste+0.008
 spe+0.008
Snow+0.008
T+0.008
engu+0.007
Neg logit contributions
 down-0.007
 up-0.006
arted-0.006
ivated-0.006
?!-0.006
Token cinema
Token ID22041
Feature activation-5.73e-03

Pos logit contributions
 down+0.010
 up+0.009
arted+0.009
ivated+0.009
?!+0.009
Neg logit contributions
paste-0.013
 spe-0.013
T-0.012
Snow-0.012
engu-0.012
Token.
Token ID13
Feature activation-1.62e-02

Pos logit contributions
paste+0.009
 spe+0.009
Snow+0.008
T+0.008
engu+0.008
Neg logit contributions
 down-0.011
 up-0.010
arted-0.010
ivated-0.010
?!-0.010
Token May
Token ID1737
Feature activation-1.70e-02

Pos logit contributions
paste+0.030
 spe+0.030
T+0.030
Snow+0.030
engu+0.030
Neg logit contributions
 down-0.024
 up-0.023
arted-0.022
ivated-0.021
?!-0.021
Token was
Token ID373
Feature activation+1.02e-02

Pos logit contributions
paste+0.027
 spe+0.027
Snow+0.027
T+0.026
engu+0.026
Neg logit contributions
 down-0.031
 up-0.029
arted-0.028
ivated-0.027
?!-0.027
Token so
Token ID523
Feature activation+1.00e-02

Pos logit contributions
arted+0.015
 down+0.015
?!+0.014
 forth+0.014
ivated+0.014
Neg logit contributions
 spe-0.018
paste-0.017
Snow-0.017
T-0.017
engu-0.016
Token excited
Token ID6568
Feature activation+1.60e-02

Pos logit contributions
arted+0.016
 down+0.016
?!+0.015
zie+0.015
 forth+0.015
Neg logit contributions
 spe-0.017
paste-0.016
engu-0.016
T-0.016
Snow-0.016
Token,
Token ID11
Feature activation+3.62e-03

Pos logit contributions
 down+0.031
arted+0.031
ivated+0.030
 up+0.029
?!+0.028
Neg logit contributions
T-0.021
paste-0.021
Snow-0.020
 spe-0.020
engu-0.020
Token but
Token ID475
Feature activation-1.57e-03

Pos logit contributions
 down+0.006
 up+0.006
arted+0.006
ivated+0.006
?!+0.006
Neg logit contributions
paste-0.006
 spe-0.006
T-0.006
Snow-0.006
engu-0.006
Token a
Token ID257
Feature activation-2.23e-02

Pos logit contributions
T+0.003
paste+0.003
 spe+0.003
Snow+0.003
engu+0.002
Neg logit contributions
 down-0.003
arted-0.003
 up-0.003
?!-0.002
ivated-0.002
Token bit
Token ID1643
Feature activation-4.44e-03

Pos logit contributions
paste+0.038
Snow+0.037
engu+0.036
T+0.036
 spe+0.035
Neg logit contributions
 down-0.038
 up-0.036
?!-0.034
ivated-0.033
 gonna-0.033
Token saw
Token ID2497
Feature activation+6.17e-03

Pos logit contributions
 down+0.010
 up+0.009
arted+0.008
?!+0.008
ivated+0.008
Neg logit contributions
paste-0.013
 spe-0.013
engu-0.013
Snow-0.013
T-0.013
Token a
Token ID257
Feature activation+9.99e-04

Pos logit contributions
 down+0.013
 up+0.012
arted+0.012
ivated+0.011
?!+0.011
Neg logit contributions
paste-0.008
 spe-0.008
engu-0.008
Snow-0.008
T-0.008
Token big
Token ID1263
Feature activation+2.73e-02

Pos logit contributions
 down+0.002
 up+0.002
?!+0.001
ivated+0.001
 alone+0.001
Neg logit contributions
paste-0.002
engu-0.002
T-0.002
Snow-0.002
 spe-0.002
Token p
Token ID279
Feature activation+2.43e-02

Pos logit contributions
arted+0.055
ivated+0.053
ieved+0.052
uct+0.051
 down+0.051
Neg logit contributions
 spe-0.039
 ham-0.036
Snow-0.035
paste-0.034
 bead-0.034
Tokenuddle
Token ID24500
Feature activation+1.91e-02

Pos logit contributions
 down+0.042
arted+0.040
 up+0.040
ivated+0.040
?!+0.039
Neg logit contributions
engu-0.038
Snow-0.038
paste-0.037
 spe-0.037
T-0.036
Token.
Token ID13
Feature activation+2.95e-02

Pos logit contributions
ivated+0.037
arted+0.037
 down+0.036
 up+0.035
?!+0.035
Neg logit contributions
paste-0.025
 spe-0.024
Snow-0.023
T-0.023
lan-0.022
Token She
Token ID1375
Feature activation+1.06e-02

Pos logit contributions
 down+0.043
ivated+0.043
?!+0.043
 up+0.043
arted+0.042
Neg logit contributions
 spe-0.053
engu-0.052
paste-0.052
Snow-0.050
T-0.049
Token wanted
Token ID2227
Feature activation+8.48e-04

Pos logit contributions
 down+0.018
 up+0.018
arted+0.017
ivated+0.017
?!+0.017
Neg logit contributions
T-0.017
paste-0.017
 spe-0.017
Snow-0.017
engu-0.017
Token to
Token ID284
Feature activation+2.03e-02

Pos logit contributions
 down+0.002
 up+0.002
arted+0.002
ivated+0.001
?!+0.001
Neg logit contributions
paste-0.001
 spe-0.001
T-0.001
engu-0.001
Snow-0.001
Token test
Token ID1332
Feature activation+1.89e-02

Pos logit contributions
arted+0.041
ivated+0.040
 down+0.039
ieved+0.038
uct+0.038
Neg logit contributions
Snow-0.026
T-0.025
 spe-0.024
 pill-0.024
ro-0.024
Token if
Token ID611
Feature activation+1.94e-02

Pos logit contributions
arted+0.033
ivated+0.033
 down+0.032
?!+0.032
 gonna+0.030
Neg logit contributions
paste-0.029
T-0.028
Snow-0.028
 spe-0.027
engu-0.026
Token to
Token ID284
Feature activation+1.19e-03

Pos logit contributions
 spe+0.014
paste+0.014
T+0.014
engu+0.014
Snow+0.014
Neg logit contributions
 down-0.020
 up-0.019
arted-0.018
ivated-0.018
?!-0.018
Token be
Token ID307
Feature activation+7.77e-03

Pos logit contributions
 up+0.002
 down+0.002
?!+0.001
arted+0.001
ivated+0.001
Neg logit contributions
paste-0.002
 spe-0.002
T-0.002
Snow-0.002
bread-0.002
Token patient
Token ID5827
Feature activation-1.81e-02

Pos logit contributions
arted+0.016
ivated+0.015
 down+0.015
?!+0.015
zie+0.015
Neg logit contributions
paste-0.010
T-0.010
Snow-0.009
 spe-0.009
 souven-0.009
Token.
Token ID13
Feature activation+3.20e-03

Pos logit contributions
engu+0.040
 spe+0.037
paste+0.037
Bl+0.037
Snow+0.037
Neg logit contributions
 down-0.028
 up-0.027
 alone-0.025
 over-0.021
?!-0.021
Token She
Token ID1375
Feature activation-1.63e-02

Pos logit contributions
 down+0.004
 up+0.004
arted+0.004
ivated+0.004
?!+0.003
Neg logit contributions
paste-0.007
 spe-0.007
T-0.007
Snow-0.007
engu-0.007
Token waited
Token ID13488
Feature activation-3.63e-03

Pos logit contributions
engu+0.032
bread+0.031
paste+0.031
lan+0.031
T+0.031
Neg logit contributions
 up-0.022
 down-0.020
 alone-0.019
arted-0.018
?!-0.018
Token for
Token ID329
Feature activation-2.26e-02

Pos logit contributions
paste+0.007
 spe+0.007
Snow+0.007
engu+0.007
T+0.007
Neg logit contributions
 down-0.005
 up-0.005
arted-0.005
ivated-0.005
?!-0.005
Token Ben
Token ID3932
Feature activation-1.46e-02

Pos logit contributions
engu+0.056
paste+0.055
 spe+0.055
bread+0.054
tr+0.054
Neg logit contributions
 up-0.022
 down-0.022
 Sarah-0.019
arted-0.018
 alone-0.018
Token to
Token ID284
Feature activation-1.42e-02

Pos logit contributions
 spe+0.028
Ar+0.026
engu+0.026
Snow+0.026
T+0.025
Neg logit contributions
 down-0.026
 up-0.025
 alone-0.023
arted-0.020
 over-0.020
Token come
Token ID1282
Feature activation+2.27e-02

Pos logit contributions
paste+0.031
engu+0.031
T+0.029
Snow+0.029
 spe+0.029
Neg logit contributions
 down-0.019
 up-0.018
 alone-0.017
?!-0.016
ivated-0.015
Token back
Token ID736
Feature activation+4.63e-03

Pos logit contributions
 down+0.027
 up+0.026
arted+0.024
ivated+0.023
?!+0.023
Neg logit contributions
paste-0.049
 spe-0.049
T-0.049
Snow-0.049
engu-0.048
Token a
Token ID257
Feature activation-1.15e-02

Pos logit contributions
paste+0.028
engu+0.026
Snow+0.026
bread+0.026
 spe+0.025
Neg logit contributions
 down-0.029
 up-0.029
 gonna-0.026
 alone-0.026
arted-0.025
Token doll
Token ID3654
Feature activation-1.62e-03

Pos logit contributions
paste+0.020
Snow+0.019
T+0.019
engu+0.019
lan+0.018
Neg logit contributions
 down-0.020
 up-0.019
?!-0.017
ivated-0.017
 alone-0.017
Token that
Token ID326
Feature activation+2.00e-02

Pos logit contributions
T+0.003
 spe+0.003
Snow+0.003
paste+0.003
engu+0.003
Neg logit contributions
 down-0.003
 up-0.003
?!-0.002
 alone-0.002
arted-0.002
Token could
Token ID714
Feature activation-1.82e-02

Pos logit contributions
 down+0.037
 up+0.036
arted+0.033
?!+0.033
ivated+0.032
Neg logit contributions
paste-0.031
Snow-0.031
T-0.030
 spe-0.030
engu-0.030
Token stir
Token ID11240
Feature activation-1.50e-02

Pos logit contributions
paste+0.032
T+0.032
 spe+0.032
engu+0.032
Snow+0.032
Neg logit contributions
 down-0.029
 up-0.028
arted-0.026
ivated-0.026
?!-0.026
Token a
Token ID257
Feature activation-4.05e-02

Pos logit contributions
T+0.027
 spe+0.027
Snow+0.027
paste+0.027
engu+0.027
Neg logit contributions
 down-0.024
 up-0.024
?!-0.020
arted-0.020
 alone-0.020
Token spoon
Token ID24556
Feature activation-1.76e-02

Pos logit contributions
T+0.063
paste+0.062
Snow+0.061
engu+0.061
 spe+0.060
Neg logit contributions
 down-0.074
 up-0.071
 alone-0.064
?!-0.064
 forth-0.063
Token in
Token ID287
Feature activation-5.85e-03

Pos logit contributions
 spe+0.031
Snow+0.030
T+0.030
Ar+0.030
engu+0.030
Neg logit contributions
 down-0.031
 up-0.029
 alone-0.026
?!-0.026
arted-0.025
Token a
Token ID257
Feature activation-3.29e-02

Pos logit contributions
paste+0.012
engu+0.012
Snow+0.011
bread+0.011
lan+0.011
Neg logit contributions
 down-0.009
 up-0.009
 alone-0.008
?!-0.007
!?-0.007
Token pot
Token ID1787
Feature activation-7.10e-03

Pos logit contributions
paste+0.052
T+0.052
 spe+0.052
Snow+0.052
engu+0.051
Neg logit contributions
 down-0.058
 up-0.056
arted-0.052
ivated-0.052
?!-0.052
Token and
Token ID290
Feature activation-9.41e-03

Pos logit contributions
 spe+0.012
T+0.012
Snow+0.012
engu+0.012
paste+0.012
Neg logit contributions
 down-0.012
 up-0.012
?!-0.011
 alone-0.011
arted-0.011
Token said
Token ID531
Feature activation-1.66e-02

Pos logit contributions
T+0.002
paste+0.002
Snow+0.002
engu+0.002
 spe+0.002
Neg logit contributions
 down-0.003
arted-0.003
 up-0.002
ivated-0.002
?!-0.002
Token,
Token ID11
Feature activation+6.83e-03

Pos logit contributions
paste+0.027
 spe+0.027
T+0.027
Snow+0.027
engu+0.026
Neg logit contributions
 down-0.029
 up-0.027
arted-0.027
ivated-0.026
?!-0.026
Token "
Token ID366
Feature activation-1.33e-02

Pos logit contributions
 down+0.011
arted+0.011
 up+0.011
?!+0.011
ivated+0.010
Neg logit contributions
T-0.011
Snow-0.011
paste-0.011
 spe-0.011
engu-0.011
TokenI
Token ID40
Feature activation-3.60e-02

Pos logit contributions
engu+0.020
paste+0.019
 spe+0.018
bread+0.017
lan+0.017
Neg logit contributions
 down-0.027
 up-0.025
ivated-0.025
arted-0.025
 forth-0.024
Token don
Token ID836
Feature activation-7.04e-04

Pos logit contributions
paste+0.059
 spe+0.059
Snow+0.058
engu+0.058
lan+0.058
Neg logit contributions
 down-0.063
 up-0.063
?!-0.058
 alone-0.055
?!"-0.054
Token't
Token ID470
Feature activation-7.86e-03

Pos logit contributions
T+0.001
Snow+0.001
paste+0.001
By+0.001
ves+0.001
Neg logit contributions
arted-0.001
 down-0.001
zie-0.001
rying-0.001
 up-0.001
Token feel
Token ID1254
Feature activation+2.46e-02

Pos logit contributions
T+0.012
Snow+0.012
paste+0.012
 spe+0.012
engu+0.012
Neg logit contributions
 down-0.014
 up-0.013
arted-0.013
ivated-0.013
?!-0.012
Token like
Token ID588
Feature activation+1.75e-03

Pos logit contributions
arted+0.035
 down+0.035
 up+0.032
zie+0.032
ivated+0.032
Neg logit contributions
paste-0.045
 spe-0.045
T-0.044
Snow-0.043
 souven-0.042
Token playing
Token ID2712
Feature activation+1.67e-02

Pos logit contributions
 down+0.003
 up+0.003
arted+0.003
ivated+0.002
?!+0.002
Neg logit contributions
 spe-0.003
paste-0.003
Snow-0.003
T-0.003
engu-0.003
Token today
Token ID1909
Feature activation+1.10e-02

Pos logit contributions
arted+0.034
zie+0.031
ivated+0.030
uct+0.029
ieved+0.029
Neg logit contributions
Snow-0.023
paste-0.022
T-0.022
 spe-0.022
 pill-0.020
Token,
Token ID11
Feature activation+3.82e-02

Pos logit contributions
 down+0.019
 up+0.018
arted+0.018
ivated+0.017
?!+0.017
Neg logit contributions
paste-0.018
 spe-0.018
T-0.018
Snow-0.018
engu-0.018
Token always
Token ID1464
Feature activation-2.64e-03

Pos logit contributions
 down+0.002
 up+0.002
arted+0.002
ivated+0.002
?!+0.002
Neg logit contributions
paste-0.002
 spe-0.002
T-0.002
Snow-0.002
engu-0.002
Token climbing
Token ID14281
Feature activation+2.71e-03

Pos logit contributions
 spe+0.005
Snow+0.005
T+0.005
paste+0.004
 ham+0.004
Neg logit contributions
arted-0.004
ivated-0.004
?!-0.004
!?-0.004
uct-0.004
Token up
Token ID510
Feature activation+1.53e-03

Pos logit contributions
 down+0.004
 up+0.004
arted+0.004
ivated+0.004
?!+0.004
Neg logit contributions
paste-0.005
 spe-0.005
T-0.005
Snow-0.005
engu-0.005
Token high
Token ID1029
Feature activation+1.33e-02

Pos logit contributions
 down+0.002
 up+0.002
arted+0.002
 alone+0.002
?!+0.002
Neg logit contributions
paste-0.003
T-0.003
Snow-0.003
engu-0.003
 spe-0.003
Token things
Token ID1243
Feature activation+7.72e-03

Pos logit contributions
 down+0.020
 up+0.019
arted+0.018
ivated+0.018
?!+0.017
Neg logit contributions
paste-0.025
 spe-0.025
T-0.025
Snow-0.024
engu-0.024
Token.
Token ID13
Feature activation-7.07e-03

Pos logit contributions
arted+0.013
 down+0.013
ivated+0.012
 up+0.012
?!+0.012
Neg logit contributions
 spe-0.012
paste-0.012
Snow-0.012
T-0.012
engu-0.012
Token One
Token ID1881
Feature activation-1.62e-02

Pos logit contributions
paste+0.013
 spe+0.013
T+0.013
Snow+0.013
engu+0.012
Neg logit contributions
 down-0.011
 up-0.010
arted-0.010
?!-0.010
ivated-0.010
Token day
Token ID1110
Feature activation-1.89e-02

Pos logit contributions
Snow+0.022
 spe+0.022
T+0.022
paste+0.022
engu+0.021
Neg logit contributions
 down-0.032
 up-0.031
arted-0.031
ivated-0.031
?!-0.030
Token he
Token ID339
Feature activation+7.79e-03

Pos logit contributions
paste+0.038
 spe+0.038
Snow+0.037
engu+0.037
T+0.037
Neg logit contributions
 down-0.028
 up-0.026
?!-0.022
arted-0.022
ivated-0.022
Token decided
Token ID3066
Feature activation-1.26e-03

Pos logit contributions
 down+0.012
 up+0.012
arted+0.011
?!+0.010
 alone+0.010
Neg logit contributions
paste-0.015
engu-0.015
 spe-0.014
Snow-0.014
T-0.014
Token to
Token ID284
Feature activation+1.70e-02

Pos logit contributions
T+0.001
Snow+0.001
 spe+0.001
paste+0.001
lan+0.001
Neg logit contributions
arted-0.003
 down-0.003
ivated-0.003
 up-0.003
?!-0.003
Token sometimes
Token ID3360
Feature activation-2.52e-03

Pos logit contributions
 down+0.026
 up+0.025
 alone+0.024
arted+0.023
?!+0.023
Neg logit contributions
engu-0.023
paste-0.022
T-0.022
 spe-0.022
Snow-0.022
Token very
Token ID845
Feature activation-1.45e-03

Pos logit contributions
paste+0.004
 spe+0.004
T+0.004
Snow+0.004
engu+0.004
Neg logit contributions
 down-0.005
 up-0.004
arted-0.004
ivated-0.004
?!-0.004
Token stupid
Token ID8531
Feature activation-5.91e-03

Pos logit contributions
paste+0.003
 spe+0.003
T+0.003
Snow+0.003
engu+0.003
Neg logit contributions
 down-0.002
 up-0.002
arted-0.002
ivated-0.002
?!-0.002
Token.
Token ID13
Feature activation+1.13e-03

Pos logit contributions
 spe+0.009
T+0.009
Snow+0.009
paste+0.009
engu+0.009
Neg logit contributions
 down-0.011
 up-0.010
arted-0.010
?!-0.010
ivated-0.009
Token He
Token ID679
Feature activation+2.78e-03

Pos logit contributions
 down+0.002
 up+0.002
arted+0.002
?!+0.002
 alone+0.002
Neg logit contributions
paste-0.002
 spe-0.002
T-0.002
Snow-0.002
engu-0.002
Token wanted
Token ID2227
Feature activation-1.61e-02

Pos logit contributions
 down+0.005
 up+0.005
 alone+0.004
arted+0.004
?!+0.004
Neg logit contributions
engu-0.005
paste-0.005
bread-0.005
T-0.004
lan-0.004
Token to
Token ID284
Feature activation+8.96e-03

Pos logit contributions
paste+0.029
T+0.028
 spe+0.028
engu+0.028
Snow+0.028
Neg logit contributions
 down-0.028
 up-0.027
 alone-0.024
arted-0.024
 gonna-0.023
Token show
Token ID905
Feature activation+1.44e-03

Pos logit contributions
 down+0.015
arted+0.015
 up+0.014
ivated+0.014
?!+0.013
Neg logit contributions
Snow-0.015
 spe-0.015
T-0.015
paste-0.015
engu-0.014
Token everyone
Token ID2506
Feature activation+2.12e-02

Pos logit contributions
 down+0.002
arted+0.002
ivated+0.002
 up+0.002
?!+0.002
Neg logit contributions
paste-0.003
Snow-0.003
T-0.003
 spe-0.003
engu-0.003
Token how
Token ID703
Feature activation-2.09e-03

Pos logit contributions
arted+0.039
ivated+0.038
 sidelines+0.036
 gonna+0.036
zie+0.035
Neg logit contributions
paste-0.029
engu-0.029
 souven-0.028
T-0.028
 spe-0.028
Token silly
Token ID14397
Feature activation+3.71e-03

Pos logit contributions
paste+0.003
 spe+0.003
T+0.003
Snow+0.003
engu+0.003
Neg logit contributions
 down-0.004
 up-0.004
arted-0.004
ivated-0.004
?!-0.003
Token ball
Token ID2613
Feature activation-1.04e-02

Pos logit contributions
engu+0.089
paste+0.086
lan+0.086
T+0.083
 fiber+0.082
Neg logit contributions
 down-0.077
 up-0.072
 alone-0.069
 forth-0.062
 upset-0.060
Token of
Token ID286
Feature activation-4.14e-02

Pos logit contributions
T+0.021
engu+0.021
Ar+0.021
Snow+0.020
 fiber+0.020
Neg logit contributions
 down-0.016
 up-0.016
 alone-0.015
arted-0.013
 over-0.013
Token yarn
Token ID21181
Feature activation-3.06e-02

Pos logit contributions
engu+0.066
paste+0.065
T+0.064
lan+0.064
 spe+0.063
Neg logit contributions
 down-0.077
 up-0.076
 alone-0.067
arted-0.064
 forth-0.064
Token and
Token ID290
Feature activation-3.94e-02

Pos logit contributions
T+0.064
engu+0.063
Ar+0.062
Snow+0.060
Bl+0.058
Neg logit contributions
 down-0.048
 up-0.046
 alone-0.044
 forth-0.039
arted-0.036
Token had
Token ID550
Feature activation-4.72e-02

Pos logit contributions
engu+0.080
eed+0.076
T+0.074
bread+0.071
Ar+0.070
Neg logit contributions
 down-0.061
 up-0.056
 forth-0.054
 over-0.051
 alone-0.049
Token a
Token ID257
Feature activation-5.06e-02

Pos logit contributions
paste+0.091
engu+0.090
bread+0.085
lan+0.084
 spe+0.083
Neg logit contributions
 up-0.069
 down-0.068
 gonna-0.057
 Sarah-0.056
arted-0.056
Token lot
Token ID1256
Feature activation-4.86e-02

Pos logit contributions
engu+0.087
paste+0.085
lan+0.083
T+0.079
bread+0.078
Neg logit contributions
 down-0.089
 up-0.085
 alone-0.076
?!-0.075
 forth-0.073
Token of
Token ID286
Feature activation-4.69e-02

Pos logit contributions
T+0.095
 spe+0.094
engu+0.092
Snow+0.092
Bl+0.092
Neg logit contributions
 up-0.069
 down-0.068
 alone-0.066
 over-0.063
arted-0.057
Token fun
Token ID1257
Feature activation-3.78e-02

Pos logit contributions
engu+0.090
T+0.089
paste+0.087
 spe+0.086
lan+0.085
Neg logit contributions
 down-0.074
 up-0.073
 alone-0.066
?!-0.062
arted-0.061
Token.
Token ID13
Feature activation-2.64e-02

Pos logit contributions
engu+0.072
T+0.069
Snow+0.068
paste+0.068
 spe+0.067
Neg logit contributions
 down-0.062
 up-0.060
 alone-0.054
?!-0.052
 forth-0.052
Token<|endoftext|>
Token ID50256
Feature activation-2.99e-03

Pos logit contributions
paste+0.053
 spe+0.053
bread+0.052
T+0.052
lan+0.051
Neg logit contributions
 down-0.035
 up-0.033
arted-0.027
 over-0.027
 Sarah-0.027
Token them
Token ID606
Feature activation+3.91e-03

Pos logit contributions
 down+0.016
 up+0.015
arted+0.014
ivated+0.014
?!+0.014
Neg logit contributions
paste-0.022
 spe-0.022
engu-0.022
Snow-0.021
T-0.021
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
 down+0.007
 up+0.007
arted+0.007
ivated+0.007
?!+0.007
Neg logit contributions
paste-0.006
 spe-0.006
T-0.006
Snow-0.006
engu-0.006
Token She
Token ID1375
Feature activation-6.70e-03

Pos logit contributions
 down+0.028
 up+0.027
arted+0.026
ivated+0.026
 forth+0.025
Neg logit contributions
T-0.030
Snow-0.030
paste-0.029
 spe-0.029
engu-0.028
Token gives
Token ID3607
Feature activation+1.35e-02

Pos logit contributions
paste+0.013
engu+0.012
 spe+0.012
Snow+0.012
T+0.012
Neg logit contributions
 down-0.010
arted-0.009
ivated-0.009
 up-0.009
?!-0.009
Token them
Token ID606
Feature activation-1.21e-02

Pos logit contributions
 down+0.020
 up+0.019
arted+0.018
ivated+0.017
?!+0.017
Neg logit contributions
 spe-0.026
paste-0.026
Snow-0.026
T-0.026
engu-0.026
Token a
Token ID257
Feature activation-5.24e-02

Pos logit contributions
 spe+0.019
paste+0.019
Snow+0.018
T+0.018
Ar+0.018
Neg logit contributions
 down-0.023
 up-0.022
arted-0.020
?!-0.019
 alone-0.019
Token kiss
Token ID9245
Feature activation-3.86e-02

Pos logit contributions
 spe+0.087
T+0.086
paste+0.086
Snow+0.085
engu+0.082
Neg logit contributions
 down-0.093
 up-0.088
?!-0.080
!?-0.075
 alone-0.075
Token and
Token ID290
Feature activation-3.17e-02

Pos logit contributions
engu+0.082
 spe+0.080
Ar+0.077
Bl+0.076
 fiber+0.075
Neg logit contributions
 down-0.062
 up-0.056
 over-0.048
 alone-0.046
?!-0.044
Token a
Token ID257
Feature activation-6.85e-02

Pos logit contributions
 spe+0.052
engu+0.051
T+0.050
Snow+0.050
Ar+0.048
Neg logit contributions
 up-0.058
 down-0.058
?!-0.048
 gonna-0.047
arted-0.047
Token cookie
Token ID19751
Feature activation-6.42e-02

Pos logit contributions
 spe+0.104
T+0.104
engu+0.103
paste+0.101
Snow+0.100
Neg logit contributions
 down-0.128
 up-0.118
?!-0.108
 alone-0.106
!?-0.103
Token.
Token ID13
Feature activation+1.46e-03

Pos logit contributions
T+0.137
Bl+0.134
Ar+0.132
engu+0.130
Snow+0.128
Neg logit contributions
 down-0.096
 up-0.088
 alone-0.081
 over-0.072
?!-0.066
Token leave
Token ID2666
Feature activation+1.18e-02

Pos logit contributions
paste+0.011
 spe+0.010
engu+0.010
T+0.010
lan+0.010
Neg logit contributions
 down-0.008
 up-0.007
?!-0.007
ivated-0.006
 alone-0.006
Token,
Token ID11
Feature activation+1.12e-02

Pos logit contributions
 down+0.020
arted+0.020
 up+0.020
ivated+0.019
?!+0.018
Neg logit contributions
 spe-0.019
paste-0.019
T-0.019
Snow-0.019
engu-0.018
Token Lily
Token ID20037
Feature activation-1.17e-02

Pos logit contributions
 down+0.017
 up+0.016
arted+0.015
ivated+0.014
?!+0.014
Neg logit contributions
engu-0.022
paste-0.021
 spe-0.021
Snow-0.021
T-0.021
Token gave
Token ID2921
Feature activation-2.05e-03

Pos logit contributions
paste+0.019
 spe+0.019
T+0.019
Snow+0.019
engu+0.019
Neg logit contributions
 down-0.020
 up-0.020
arted-0.019
ivated-0.018
?!-0.018
Token her
Token ID607
Feature activation-3.24e-02

Pos logit contributions
engu+0.005
 spe+0.005
bread+0.005
paste+0.005
Ar+0.005
Neg logit contributions
 down-0.002
 up-0.002
arted-0.002
 alone-0.001
 forth-0.001
Token a
Token ID257
Feature activation-5.03e-02

Pos logit contributions
 spe+0.048
engu+0.048
Ar+0.047
paste+0.047
Snow+0.047
Neg logit contributions
 down-0.065
 up-0.064
?!-0.057
 alone-0.056
!?-0.053
Token big
Token ID1263
Feature activation-2.66e-02

Pos logit contributions
paste+0.089
 spe+0.089
Snow+0.089
T+0.088
lan+0.088
Neg logit contributions
 down-0.082
 up-0.079
?!-0.070
 alone-0.065
ivated-0.065
Token hug
Token ID16225
Feature activation-3.81e-02

Pos logit contributions
T+0.051
paste+0.051
Snow+0.050
engu+0.049
 spe+0.049
Neg logit contributions
 down-0.040
 up-0.038
?!-0.034
arted-0.034
ivated-0.033
Token and
Token ID290
Feature activation-2.17e-02

Pos logit contributions
engu+0.080
Ar+0.078
 spe+0.078
Bl+0.076
 fiber+0.074
Neg logit contributions
 down-0.063
 up-0.058
 alone-0.048
 over-0.048
?!-0.047
Token said
Token ID531
Feature activation-3.25e-02

Pos logit contributions
engu+0.036
 spe+0.035
T+0.034
Snow+0.034
paste+0.034
Neg logit contributions
 down-0.039
 up-0.038
?!-0.033
 gonna-0.032
 forth-0.032
Token goodbye
Token ID24829
Feature activation-2.34e-02

Pos logit contributions
Ar+0.063
engu+0.062
Snow+0.061
 spe+0.061
paste+0.060
Neg logit contributions
 down-0.050
 up-0.048
?!-0.041
ivated-0.041
 alone-0.040
Token They
Token ID1119
Feature activation-2.23e-02

Pos logit contributions
paste+0.024
 spe+0.024
Snow+0.023
engu+0.023
T+0.023
Neg logit contributions
 down-0.020
 up-0.019
arted-0.019
ivated-0.017
 gonna-0.017
Token hugged
Token ID41130
Feature activation-4.21e-03

Pos logit contributions
paste+0.043
engu+0.042
bread+0.041
lan+0.040
T+0.040
Neg logit contributions
 down-0.031
 up-0.030
 alone-0.029
 forth-0.028
 over-0.026
Token and
Token ID290
Feature activation-2.73e-02

Pos logit contributions
paste+0.009
engu+0.009
Ar+0.008
 spe+0.008
bread+0.008
Neg logit contributions
 down-0.007
 up-0.007
 alone-0.005
 forth-0.005
arted-0.005
Token said
Token ID531
Feature activation-3.39e-02

Pos logit contributions
engu+0.059
Ar+0.055
T+0.055
paste+0.055
Snow+0.055
Neg logit contributions
 down-0.038
 forth-0.036
 up-0.035
 alone-0.030
 Sally-0.028
Token,
Token ID11
Feature activation-1.75e-02

Pos logit contributions
Snow+0.070
T+0.069
Ar+0.069
 spe+0.068
paste+0.067
Neg logit contributions
 down-0.050
 up-0.048
arted-0.039
 alone-0.039
ivated-0.039
Token "
Token ID366
Feature activation-5.05e-02

Pos logit contributions
paste+0.035
 spe+0.035
engu+0.035
T+0.034
Snow+0.034
Neg logit contributions
 down-0.025
 up-0.023
arted-0.022
ivated-0.021
?!-0.020
TokenWe
Token ID1135
Feature activation-4.05e-02

Pos logit contributions
engu+0.118
 allowance+0.106
ork+0.105
paste+0.103
 cents+0.100
Neg logit contributions
ivated-0.052
 down-0.050
 forth-0.045
?!-0.045
uct-0.044
Token love
Token ID1842
Feature activation-8.10e-03

Pos logit contributions
 spe+0.065
paste+0.064
Snow+0.063
bread+0.063
lan+0.062
Neg logit contributions
 down-0.071
 up-0.069
?!-0.067
 alone-0.064
?!"-0.062
Token the
Token ID262
Feature activation+1.50e-02

Pos logit contributions
paste+0.020
T+0.019
Snow+0.019
Ar+0.019
engu+0.019
Neg logit contributions
 down-0.009
 up-0.009
 forth-0.007
?!-0.007
 alone-0.007
Token ostr
Token ID42387
Feature activation+6.06e-03

Pos logit contributions
 up+0.025
 down+0.025
arted+0.023
?!+0.023
 alone+0.022
Neg logit contributions
paste-0.027
T-0.025
engu-0.025
Snow-0.024
 spe-0.023
Tokenich
Token ID488
Feature activation+9.00e-03

Pos logit contributions
 down+0.009
 up+0.009
arted+0.009
ivated+0.008
?!+0.008
Neg logit contributions
paste-0.011
T-0.011
 spe-0.011
engu-0.011
Snow-0.011
Token,
Token ID11
Feature activation-1.36e-02

Pos logit contributions
paste+0.023
T+0.023
Snow+0.023
 spe+0.023
engu+0.022
Neg logit contributions
 down-0.024
 up-0.023
arted-0.022
ivated-0.022
?!-0.021
Token like
Token ID588
Feature activation-3.43e-02

Pos logit contributions
paste+0.028
T+0.027
engu+0.027
 bead+0.026
Snow+0.026
Neg logit contributions
 up-0.020
 down-0.020
 alone-0.017
ivated-0.016
arted-0.015
Token lions
Token ID31420
Feature activation-3.56e-02

Pos logit contributions
paste+0.065
T+0.060
Snow+0.057
engu+0.057
 bead+0.056
Neg logit contributions
 down-0.056
 up-0.055
arted-0.048
 alone-0.047
 gonna-0.046
Token,
Token ID11
Feature activation-3.23e-02

Pos logit contributions
paste+0.073
T+0.072
lan+0.070
 spe+0.070
Snow+0.070
Neg logit contributions
 down-0.050
 up-0.048
 alone-0.042
arted-0.040
ivated-0.040
Token ze
Token ID41271
Feature activation-4.00e-02

Pos logit contributions
paste+0.065
T+0.061
engu+0.060
 bead+0.058
lan+0.058
Neg logit contributions
 down-0.047
 up-0.046
 gonna-0.038
arted-0.038
 alone-0.037
Tokenbr
Token ID1671
Feature activation-6.34e-02

Pos logit contributions
T+0.024
paste+0.024
 spe+0.022
Snow+0.022
lan+0.021
Neg logit contributions
 down-0.107
 up-0.104
arted-0.103
ivated-0.102
?!-0.100
Tokenas
Token ID292
Feature activation-1.57e-02

Pos logit contributions
paste+0.085
T+0.080
 spe+0.079
Snow+0.079
bread+0.072
Neg logit contributions
 down-0.128
arted-0.127
 up-0.124
ags-0.119
zie-0.109
Token,
Token ID11
Feature activation-3.55e-02

Pos logit contributions
paste+0.028
 spe+0.028
T+0.028
Snow+0.027
engu+0.027
Neg logit contributions
 down-0.026
 up-0.025
arted-0.023
ivated-0.022
 alone-0.022
Token and
Token ID290
Feature activation-3.21e-02

Pos logit contributions
paste+0.075
T+0.073
engu+0.072
lan+0.068
Snow+0.067
Neg logit contributions
 down-0.051
 up-0.050
arted-0.039
 alone-0.039
 over-0.038
Token gir
Token ID37370
Feature activation-3.44e-02

Pos logit contributions
paste+0.061
T+0.060
engu+0.058
Snow+0.057
kin+0.057
Neg logit contributions
 down-0.048
 up-0.046
 alone-0.040
 gonna-0.040
arted-0.040
Tokenaff
Token ID2001
Feature activation-3.13e-02

Pos logit contributions
T+0.054
paste+0.054
Snow+0.050
 bead+0.050
 spe+0.049
Neg logit contributions
arted-0.061
 down-0.060
 up-0.058
ivated-0.051
 gonna-0.051