TOP ACTIVATIONS
MAX = 0.099

The cat said , " Mil k is so y ummy
. But suddenly , a booming voice said , " Stop
bigger . Suddenly , a loud voice came from the sky
suddenly , a brightly - col oured belt caught her eye
leaves . Suddenly , a loud voice came from behind her
sun . A mer maid that lived in the
wrink led up . A sm elly smell was coming from
Her mom said , " No , Let icia ,
it . Suddenly , a loud noise came out of the
Her friend said , " Yes , it 's in
Just then , a loud , booming voice said ,
Suddenly , a loud noise scared the bunny and
Suddenly , a loud noise came from the living
Suddenly , a loud noise made them stop and
friends smiled and said , " Yes , you can .
Anna says , " Okay , I will read
Suddenly , a loud noise made the ants scatter
L ily said , " Wow , Max ! You
Her mother said , " Yes , we can .
coat . Suddenly , a loud noise came from the bushes

MIN ACTIVATIONS
MIN = -0.091

cozy fire together . <|endoftext|> Once upon a time , there
together every weekend . <|endoftext|> Once upon a time , there
all day long . <|endoftext|> Once upon a time , there
her mom replied . <|endoftext|> Once there was a boy named
lots of fun . <|endoftext|> Once upon a time , there
had that day . <|endoftext|> Once upon a time , there
of their adventures . <|endoftext|> Once upon a time , there
!" Lily exclaimed . <|endoftext|> Once upon a time , there
and happy again . <|endoftext|> Once upon a time , there
best of friends . <|endoftext|> Once upon a time , there
pictures together often . <|endoftext|> Once upon a time , there
best of friends . <|endoftext|> Once upon a time , there
more adventures together . <|endoftext|> Once upon a time , there
someone in need . <|endoftext|> Once upon a time , there
all day long . <|endoftext|> Once upon a time , there
together every day . <|endoftext|> Once upon a time , in
his mom my . <|endoftext|> Once upon a time , there
story , Lily ." <|endoftext|> Once upon a time , there
in the forest . <|endoftext|> Once upon a time , there
with his friends . <|endoftext|> Once upon a time , there

INTERVAL 0.051 - 0.099
CONTAINS 1.184%

Lily ," Tom said . " We can have some hot
the sw an replied . " If you use too much
and said � � € � � outside â € .
." His dad hugged him and said , "
so it asked again , " What do you mean ?

INTERVAL 0.004 - 0.051
CONTAINS 39.699%

her hand . They watched the parade go by and Lily
They point it at things and make bright lights . They
big difference , and he wanted to share his success with
The cat said , " Let 's clean it together !"
. But she didn 't give up . She tried again

INTERVAL -0.044 - 0.004
CONTAINS 57.831%

made was a magical bridge . When they wanted to travel
, and she ran out to play . Everywhere she looked
to answer them . At the end of the exam ,
rescue Tim my and his family . They put out the
want to be good . They go to their room .

INTERVAL -0.091 - -0.044
CONTAINS 1.287%

and put it away . " Can I have
with her family . <|endoftext|> Once upon a time , there
laughed and cl apped . Together , Tim and
the grass and stones . " Help , help
- year old girl played together all day long . <|endoftext|>
Token The
Token ID383
Feature activation+3.65e-03

Pos logit contributions
 Can+0.048
elt+0.047
 louder+0.047
Stop+0.047
ting+0.046
Neg logit contributions
case-0.046
ggy-0.045
zag-0.040
live-0.040
ves-0.039
Token cat
Token ID3797
Feature activation+9.61e-03

Pos logit contributions
 Can+0.005
elt+0.004
 Please+0.004
 louder+0.004
Stop+0.004
Neg logit contributions
ggy-0.009
live-0.008
zag-0.008
case-0.008
aches-0.008
Token said
Token ID531
Feature activation+2.10e-02

Pos logit contributions
 Can+0.017
elt+0.016
 Please+0.016
 louder+0.015
ted+0.015
Neg logit contributions
ggy-0.018
live-0.017
zag-0.017
aches-0.017
case-0.017
Token,
Token ID11
Feature activation+2.95e-02

Pos logit contributions
 Can+0.047
elt+0.046
Stop+0.043
 Please+0.042
ksh+0.042
Neg logit contributions
ggy-0.032
ves-0.028
aches-0.027
live-0.027
ail-0.026
Token "
Token ID366
Feature activation+7.79e-02

Pos logit contributions
Stop+0.054
vious+0.053
eared+0.053
ened+0.053
elling+0.053
Neg logit contributions
ggy-0.048
ves-0.043
zag-0.043
case-0.041
ail-0.041
TokenMil
Token ID24857
Feature activation+9.86e-02

Pos logit contributions
 eyes+0.136
ened+0.126
 Can+0.126
 hello+0.123
ly+0.120
Neg logit contributions
live-0.151
ggy-0.145
lly-0.132
ves-0.130
aches-0.130
Tokenk
Token ID74
Feature activation+2.70e-02

Pos logit contributions
 Can+0.153
 hello+0.136
ted+0.124
 spec+0.123
elt+0.120
Neg logit contributions
live-0.204
ggy-0.197
aches-0.188
zag-0.187
oppy-0.186
Token is
Token ID318
Feature activation+1.23e-02

Pos logit contributions
 Can+0.053
elt+0.050
 Please+0.048
 louder+0.046
Stop+0.046
Neg logit contributions
ggy-0.048
live-0.045
aches-0.044
zag-0.043
ail-0.043
Token so
Token ID523
Feature activation-2.12e-02

Pos logit contributions
 Can+0.022
elt+0.022
 Please+0.020
Stop+0.020
ly+0.020
Neg logit contributions
ggy-0.023
live-0.022
aches-0.021
zag-0.020
case-0.020
Token y
Token ID331
Feature activation+3.11e-02

Pos logit contributions
ggy+0.041
live+0.039
zag+0.039
case+0.038
ail+0.038
Neg logit contributions
 Can-0.039
elt-0.035
 Please-0.035
 louder-0.032
 ding-0.032
Tokenummy
Token ID13513
Feature activation+1.52e-02

Pos logit contributions
 Can+0.053
elt+0.049
 Please+0.048
 louder+0.045
Stop+0.044
Neg logit contributions
ggy-0.062
live-0.060
zag-0.058
case-0.057
aches-0.057
Token.
Token ID13
Feature activation+1.82e-03

Pos logit contributions
ggy+0.020
zag+0.019
live+0.019
aches+0.018
ail+0.018
Neg logit contributions
 Can-0.016
elt-0.015
 Please-0.015
Stop-0.014
 ding-0.014
Token But
Token ID887
Feature activation-2.36e-02

Pos logit contributions
 Can+0.003
elt+0.003
 Please+0.002
 louder+0.002
Stop+0.002
Neg logit contributions
ggy-0.004
live-0.004
zag-0.004
case-0.004
ail-0.004
Token suddenly
Token ID6451
Feature activation-2.75e-03

Pos logit contributions
ggy+0.053
live+0.053
case+0.051
ail+0.051
aches+0.050
Neg logit contributions
 Can-0.034
 Please-0.030
elt-0.029
 louder-0.029
ted-0.028
Token,
Token ID11
Feature activation-8.64e-03

Pos logit contributions
ggy+0.005
live+0.005
zag+0.004
case+0.004
aches+0.004
Neg logit contributions
 Can-0.005
elt-0.005
 Please-0.005
 louder-0.005
ted-0.005
Token a
Token ID257
Feature activation+4.35e-02

Pos logit contributions
ggy+0.017
live+0.017
zag+0.016
ail+0.016
case+0.016
Neg logit contributions
 Can-0.015
elt-0.014
 Please-0.014
 louder-0.013
ted-0.013
Token booming
Token ID32017
Feature activation+9.70e-02

Pos logit contributions
 Can+0.087
elt+0.085
ted+0.080
 Please+0.079
ened+0.078
Neg logit contributions
ggy-0.075
case-0.067
live-0.066
zag-0.066
ail-0.064
Token voice
Token ID3809
Feature activation+3.23e-02

Pos logit contributions
elt+0.170
 Can+0.170
ksh+0.157
ted+0.155
 Please+0.147
Neg logit contributions
ggy-0.198
zag-0.178
case-0.164
lf-0.157
lly-0.155
Token said
Token ID531
Feature activation+3.26e-02

Pos logit contributions
 Can+0.060
elt+0.056
 Please+0.054
 Look+0.051
Stop+0.051
Neg logit contributions
ggy-0.061
live-0.056
zag-0.055
ail-0.054
ves-0.054
Token,
Token ID11
Feature activation+3.74e-02

Pos logit contributions
 Can+0.071
elt+0.068
 Please+0.065
Stop+0.065
ened+0.063
Neg logit contributions
ggy-0.051
ail-0.045
live-0.045
zag-0.044
case-0.043
Token "
Token ID366
Feature activation+7.60e-02

Pos logit contributions
vious+0.062
 Can+0.062
Stop+0.062
elt+0.062
eared+0.061
Neg logit contributions
ggy-0.066
ail-0.062
zag-0.061
ves-0.060
case-0.059
TokenStop
Token ID19485
Feature activation+2.10e-02

Pos logit contributions
 Can+0.114
elt+0.101
 Please+0.096
 eyes+0.096
 louder+0.096
Neg logit contributions
ggy-0.164
live-0.162
zag-0.154
lly-0.149
aches-0.148
Token bigger
Token ID5749
Feature activation+1.99e-02

Pos logit contributions
 Can+0.028
elt+0.027
 Please+0.025
ted+0.024
ly+0.024
Neg logit contributions
ggy-0.030
aches-0.028
live-0.028
zag-0.028
case-0.028
Token.
Token ID13
Feature activation+5.99e-03

Pos logit contributions
 Can+0.049
elt+0.047
 Please+0.046
Stop+0.045
 Look+0.044
Neg logit contributions
ggy-0.024
zag-0.022
case-0.021
aches-0.021
ves-0.020
Token Suddenly
Token ID24975
Feature activation-1.52e-03

Pos logit contributions
 Can+0.010
elt+0.009
 Please+0.009
 louder+0.008
 ding+0.008
Neg logit contributions
ggy-0.012
case-0.012
live-0.011
zag-0.011
ail-0.011
Token,
Token ID11
Feature activation+1.46e-03

Pos logit contributions
ggy+0.002
live+0.002
zag+0.002
case+0.002
ail+0.002
Neg logit contributions
 Can-0.003
elt-0.003
 Please-0.003
 louder-0.003
Stop-0.003
Token a
Token ID257
Feature activation+4.13e-02

Pos logit contributions
 Can+0.003
elt+0.003
 Please+0.003
ly+0.002
 Look+0.002
Neg logit contributions
ggy-0.003
live-0.002
case-0.002
zag-0.002
aches-0.002
Token loud
Token ID7812
Feature activation+9.59e-02

Pos logit contributions
 Can+0.079
elt+0.078
ted+0.075
ened+0.073
ting+0.073
Neg logit contributions
ggy-0.073
case-0.065
live-0.063
zag-0.062
lly-0.062
Token voice
Token ID3809
Feature activation+3.21e-02

Pos logit contributions
elt+0.183
 Can+0.171
ksh+0.162
ted+0.160
elling+0.156
Neg logit contributions
ggy-0.187
lf-0.165
zag-0.160
case-0.152
ves-0.139
Token came
Token ID1625
Feature activation+2.80e-02

Pos logit contributions
 Can+0.058
elt+0.054
 Please+0.052
 Look+0.049
 louder+0.049
Neg logit contributions
ggy-0.062
live-0.056
zag-0.056
ves-0.055
aches-0.055
Token from
Token ID422
Feature activation+4.91e-03

Pos logit contributions
 Can+0.051
 Please+0.047
elt+0.047
Stop+0.043
ted+0.042
Neg logit contributions
zag-0.049
ggy-0.049
ail-0.047
ics-0.045
case-0.043
Token the
Token ID262
Feature activation-1.54e-03

Pos logit contributions
 Can+0.009
elt+0.008
 Please+0.008
 louder+0.008
Stop+0.007
Neg logit contributions
ggy-0.009
live-0.009
zag-0.009
case-0.009
aches-0.009
Token sky
Token ID6766
Feature activation+1.03e-02

Pos logit contributions
ggy+0.003
live+0.003
zag+0.003
case+0.003
ail+0.003
Neg logit contributions
 Can-0.003
elt-0.003
 Please-0.003
 louder-0.003
ted-0.003
Token suddenly
Token ID6451
Feature activation+1.32e-02

Pos logit contributions
ggy+0.030
live+0.029
zag+0.028
case+0.028
ail+0.027
Neg logit contributions
 Can-0.027
elt-0.025
 Please-0.024
 louder-0.023
Stop-0.022
Token,
Token ID11
Feature activation+1.53e-02

Pos logit contributions
 Can+0.024
elt+0.022
 Please+0.021
 louder+0.020
Stop+0.020
Neg logit contributions
ggy-0.026
live-0.024
zag-0.024
aches-0.023
ail-0.023
Token a
Token ID257
Feature activation+4.79e-02

Pos logit contributions
 Can+0.030
elt+0.028
 Please+0.027
Stop+0.026
ened+0.025
Neg logit contributions
ggy-0.027
live-0.025
aches-0.025
lly-0.024
ail-0.024
Token brightly
Token ID35254
Feature activation+5.66e-02

Pos logit contributions
 Can+0.098
elt+0.097
ted+0.093
ened+0.091
ting+0.091
Neg logit contributions
ggy-0.078
case-0.070
live-0.067
zag-0.067
ail-0.067
Token-
Token ID12
Feature activation+6.41e-02

Pos logit contributions
 Can+0.098
elt+0.090
 Please+0.087
 louder+0.082
 Look+0.081
Neg logit contributions
ggy-0.117
live-0.109
zag-0.108
aches-0.105
case-0.104
Tokencol
Token ID4033
Feature activation+9.49e-02

Pos logit contributions
 Can+0.134
 eyes+0.128
ened+0.125
 Please+0.122
elt+0.118
Neg logit contributions
ggy-0.104
case-0.101
zag-0.092
live-0.092
aches-0.087
Tokenoured
Token ID8167
Feature activation+5.59e-02

Pos logit contributions
 Can+0.198
 Please+0.175
 louder+0.173
Stop+0.169
Go+0.165
Neg logit contributions
ail-0.156
zag-0.141
oppy-0.140
uffy-0.134
live-0.134
Token belt
Token ID10999
Feature activation+9.15e-03

Pos logit contributions
 Can+0.101
elt+0.094
 Please+0.091
 louder+0.088
ksh+0.088
Neg logit contributions
ggy-0.110
ail-0.099
case-0.099
zag-0.099
live-0.098
Token caught
Token ID4978
Feature activation+4.60e-02

Pos logit contributions
 Can+0.017
elt+0.016
 Please+0.015
 louder+0.015
Stop+0.015
Neg logit contributions
ggy-0.017
live-0.016
zag-0.015
case-0.015
aches-0.015
Token her
Token ID607
Feature activation+2.58e-04

Pos logit contributions
 Can+0.097
elt+0.088
 Please+0.087
 Look+0.080
Stop+0.079
Neg logit contributions
ggy-0.076
aches-0.071
live-0.068
ail-0.068
ics-0.067
Token eye
Token ID4151
Feature activation-3.28e-03

Pos logit contributions
 Can+0.000
elt+0.000
 Please+0.000
Stop+0.000
 louder+0.000
Neg logit contributions
ggy-0.001
live-0.000
zag-0.000
case-0.000
ail-0.000
Token leaves
Token ID5667
Feature activation-2.37e-02

Pos logit contributions
ggy+0.016
live+0.015
zag+0.015
case+0.015
aches+0.015
Neg logit contributions
 Can-0.022
elt-0.020
 Please-0.020
 louder-0.019
Stop-0.019
Token.
Token ID13
Feature activation+1.99e-02

Pos logit contributions
ggy+0.037
live+0.035
zag+0.033
case+0.033
aches+0.033
Neg logit contributions
 Can-0.051
elt-0.048
 Please-0.047
 louder-0.045
ted-0.044
Token Suddenly
Token ID24975
Feature activation+2.19e-04

Pos logit contributions
 Can+0.029
 louder+0.028
 ding+0.028
Stop+0.028
elt+0.027
Neg logit contributions
case-0.040
ggy-0.039
live-0.038
ail-0.037
zag-0.036
Token,
Token ID11
Feature activation+4.60e-03

Pos logit contributions
 Can+0.000
elt+0.000
 Please+0.000
Stop+0.000
 ding+0.000
Neg logit contributions
ggy-0.000
live-0.000
ail-0.000
zag-0.000
case-0.000
Token a
Token ID257
Feature activation+4.27e-02

Pos logit contributions
 Can+0.008
elt+0.008
 Please+0.008
 ding+0.007
 Look+0.007
Neg logit contributions
ggy-0.009
live-0.008
case-0.008
aches-0.008
ail-0.008
Token loud
Token ID7812
Feature activation+9.47e-02

Pos logit contributions
 Can+0.085
elt+0.082
ted+0.078
ened+0.076
 Please+0.076
Neg logit contributions
ggy-0.074
case-0.066
live-0.066
zag-0.065
ail-0.064
Token voice
Token ID3809
Feature activation+3.03e-02

Pos logit contributions
elt+0.175
 Can+0.168
ksh+0.155
ted+0.152
eared+0.151
Neg logit contributions
ggy-0.188
zag-0.165
lf-0.156
case-0.155
ves-0.147
Token came
Token ID1625
Feature activation+3.00e-02

Pos logit contributions
 Can+0.057
elt+0.053
 Please+0.052
Stop+0.048
 Look+0.048
Neg logit contributions
ggy-0.056
live-0.051
zag-0.051
ves-0.050
aches-0.050
Token from
Token ID422
Feature activation+5.91e-03

Pos logit contributions
 Can+0.056
 Please+0.050
elt+0.050
Stop+0.047
 ding+0.046
Neg logit contributions
zag-0.053
ggy-0.052
ail-0.050
ics-0.048
live-0.047
Token behind
Token ID2157
Feature activation+2.78e-02

Pos logit contributions
 Can+0.012
elt+0.011
 Please+0.011
 Look+0.010
 ding+0.010
Neg logit contributions
ggy-0.010
live-0.010
zag-0.009
aches-0.009
ail-0.009
Token her
Token ID607
Feature activation-8.21e-03

Pos logit contributions
 Can+0.051
elt+0.049
 Please+0.046
 ding+0.045
ksh+0.043
Neg logit contributions
ggy-0.051
live-0.047
ail-0.044
case-0.044
ves-0.043
Token sun
Token ID4252
Feature activation-9.76e-04

Pos logit contributions
 Can+0.032
elt+0.031
 Please+0.029
ked+0.028
ted+0.028
Neg logit contributions
lly-0.028
ggy-0.027
asis-0.027
ail-0.027
case-0.027
Token.
Token ID13
Feature activation+9.45e-03

Pos logit contributions
ggy+0.001
zag+0.001
live+0.001
ail+0.001
case+0.001
Neg logit contributions
 Can-0.002
 Please-0.002
elt-0.002
Stop-0.002
 louder-0.002
Token
Token ID198
Feature activation-2.77e-02

Pos logit contributions
 Can+0.014
elt+0.014
 louder+0.013
ly+0.013
 Please+0.013
Neg logit contributions
ggy-0.020
case-0.019
live-0.019
zag-0.018
ves-0.018
Token
Token ID198
Feature activation-7.08e-03

Pos logit contributions
aches+0.064
kins+0.063
ail+0.061
oppy+0.061
kin+0.060
Neg logit contributions
 Can-0.042
 Please-0.035
Stop-0.032
 Look-0.032
elt-0.031
TokenA
Token ID32
Feature activation+2.31e-02

Pos logit contributions
ail+0.013
ggy+0.012
aches+0.012
live+0.012
oppy+0.012
Neg logit contributions
 Can-0.014
 Please-0.013
Stop-0.012
elt-0.012
 louder-0.012
Token mer
Token ID4017
Feature activation+9.33e-02

Pos logit contributions
 Can+0.035
elt+0.031
 Please+0.031
 louder+0.029
 ding+0.028
Neg logit contributions
ggy-0.051
live-0.050
aches-0.048
zag-0.048
case-0.048
Tokenmaid
Token ID23151
Feature activation+2.24e-02

Pos logit contributions
 Can+0.188
elt+0.173
Stop+0.170
 Please+0.166
ting+0.163
Neg logit contributions
ggy-0.159
zag-0.157
ail-0.153
lly-0.144
oppy-0.139
Token that
Token ID326
Feature activation-4.38e-03

Pos logit contributions
 Can+0.042
elt+0.039
 Please+0.038
 louder+0.036
Stop+0.036
Neg logit contributions
ggy-0.041
live-0.039
zag-0.038
case-0.037
aches-0.037
Token lived
Token ID5615
Feature activation-4.39e-03

Pos logit contributions
ggy+0.008
live+0.007
zag+0.007
aches+0.007
ail+0.007
Neg logit contributions
 Can-0.009
elt-0.008
 Please-0.008
 Look-0.007
 ding-0.007
Token in
Token ID287
Feature activation+2.68e-03

Pos logit contributions
ggy+0.009
live+0.009
case+0.008
ves+0.008
zag+0.008
Neg logit contributions
 Can-0.007
elt-0.007
 Please-0.007
 louder-0.007
ting-0.006
Token the
Token ID262
Feature activation-7.17e-03

Pos logit contributions
 Can+0.005
elt+0.004
 Please+0.004
 louder+0.004
ly+0.004
Neg logit contributions
ggy-0.005
live-0.005
zag-0.005
case-0.005
aches-0.005
Token wrink
Token ID34866
Feature activation+6.89e-02

Pos logit contributions
 Can+0.033
elt+0.030
 Please+0.030
 louder+0.028
 Look+0.028
Neg logit contributions
ggy-0.027
live-0.026
zag-0.025
ail-0.025
case-0.025
Tokenled
Token ID992
Feature activation+5.06e-02

Pos logit contributions
 Can+0.150
 Please+0.136
 Look+0.128
elt+0.125
 Come+0.123
Neg logit contributions
ggy-0.109
live-0.106
aches-0.105
ail-0.101
kins-0.100
Token up
Token ID510
Feature activation+5.07e-03

Pos logit contributions
 Can+0.112
elt+0.104
Stop+0.098
 Please+0.096
 Look+0.095
Neg logit contributions
ggy-0.072
aches-0.061
ail-0.059
kin-0.057
case-0.057
Token.
Token ID13
Feature activation+3.01e-02

Pos logit contributions
 Can+0.010
elt+0.009
Stop+0.009
 Look+0.009
 Please+0.008
Neg logit contributions
ggy-0.009
zag-0.008
case-0.008
ail-0.008
live-0.008
Token A
Token ID317
Feature activation+1.95e-02

Pos logit contributions
 Can+0.041
Stop+0.041
 louder+0.039
 ding+0.039
ting+0.038
Neg logit contributions
ggy-0.061
case-0.060
ail-0.055
live-0.054
oppy-0.052
Token sm
Token ID895
Feature activation+9.32e-02

Pos logit contributions
 Can+0.036
elt+0.036
ted+0.034
ting+0.033
ened+0.032
Neg logit contributions
ggy-0.036
case-0.032
live-0.031
ail-0.031
zag-0.031
Tokenelly
Token ID6148
Feature activation+5.93e-02

Pos logit contributions
 Can+0.159
 Please+0.151
 louder+0.146
 hello+0.144
 ding+0.142
Neg logit contributions
live-0.181
ggy-0.161
case-0.157
oppy-0.151
ail-0.151
Token smell
Token ID8508
Feature activation+3.42e-02

Pos logit contributions
elt+0.104
 Can+0.103
ksh+0.096
ting+0.092
eared+0.092
Neg logit contributions
ggy-0.119
live-0.101
kin-0.099
lly-0.099
zag-0.099
Token was
Token ID373
Feature activation+1.68e-02

Pos logit contributions
 Can+0.066
elt+0.062
 Please+0.060
 Look+0.057
 louder+0.057
Neg logit contributions
ggy-0.062
live-0.058
case-0.056
ves-0.055
aches-0.055
Token coming
Token ID2406
Feature activation+2.66e-02

Pos logit contributions
 Can+0.030
elt+0.027
 Please+0.026
 Look+0.025
ly+0.025
Neg logit contributions
ggy-0.034
live-0.032
aches-0.031
case-0.030
ail-0.030
Token from
Token ID422
Feature activation+6.35e-03

Pos logit contributions
 Can+0.054
elt+0.050
 Please+0.048
 Look+0.045
Stop+0.045
Neg logit contributions
ggy-0.044
ail-0.042
zag-0.040
case-0.038
live-0.037
Token
Token ID198
Feature activation-6.42e-03

Pos logit contributions
aches+0.083
kin+0.081
kins+0.080
live+0.078
zag+0.078
Neg logit contributions
 Can-0.046
elt-0.040
 Please-0.040
Stop-0.039
 Come-0.038
TokenHer
Token ID9360
Feature activation-1.34e-02

Pos logit contributions
ggy+0.011
live+0.011
zag+0.011
aches+0.011
ail+0.011
Neg logit contributions
 Can-0.012
elt-0.012
 Please-0.011
Stop-0.011
 louder-0.011
Token mom
Token ID1995
Feature activation+3.42e-02

Pos logit contributions
live+0.027
aches+0.027
ggy+0.026
zag+0.025
ves+0.025
Neg logit contributions
 Can-0.022
elt-0.020
 Please-0.020
 eyes-0.020
 louder-0.019
Token said
Token ID531
Feature activation+2.91e-02

Pos logit contributions
 Can+0.069
elt+0.063
 Please+0.062
Stop+0.060
 louder+0.059
Neg logit contributions
ggy-0.059
live-0.057
ail-0.055
zag-0.054
case-0.054
Token,
Token ID11
Feature activation+3.75e-02

Pos logit contributions
 Can+0.068
elt+0.066
Stop+0.063
ksh+0.062
 Please+0.062
Neg logit contributions
ggy-0.042
ves-0.037
ail-0.035
aches-0.034
live-0.034
Token "
Token ID366
Feature activation+9.30e-02

Pos logit contributions
vious+0.064
Stop+0.063
 Can+0.063
eared+0.062
ened+0.062
Neg logit contributions
ggy-0.068
ail-0.063
ves-0.062
zag-0.060
case-0.059
TokenNo
Token ID2949
Feature activation+4.83e-03

Pos logit contributions
 eyes+0.162
 Can+0.146
 hello+0.145
ked+0.134
ened+0.133
Neg logit contributions
live-0.177
lly-0.167
ggy-0.153
ll-0.150
ves-0.150
Token,
Token ID11
Feature activation+4.66e-02

Pos logit contributions
 Can+0.010
elt+0.010
 Please+0.010
 louder+0.009
ted+0.009
Neg logit contributions
ggy-0.008
live-0.007
zag-0.007
ail-0.007
aches-0.007
Token Let
Token ID3914
Feature activation+2.59e-02

Pos logit contributions
elt+0.096
eared+0.078
 Can+0.078
ksh+0.077
ted+0.076
Neg logit contributions
ggy-0.086
ves-0.078
ail-0.077
aches-0.077
live-0.076
Tokenicia
Token ID33577
Feature activation-2.06e-02

Pos logit contributions
 Can+0.043
elt+0.040
 Please+0.039
 louder+0.037
Stop+0.036
Neg logit contributions
ggy-0.053
live-0.051
zag-0.049
case-0.049
aches-0.049
Token,
Token ID11
Feature activation+2.25e-02

Pos logit contributions
ggy+0.036
live+0.035
zag+0.033
aches+0.032
ves+0.032
Neg logit contributions
 Can-0.041
elt-0.038
 Please-0.037
 louder-0.035
 Look-0.035
Token it
Token ID340
Feature activation-2.21e-02

Pos logit contributions
elt+0.030
 Can+0.030
 Please+0.028
ted+0.025
 Look+0.025
Neg logit contributions
ggy-0.031
case-0.029
ves-0.029
live-0.029
zag-0.029
Token.
Token ID13
Feature activation+4.11e-03

Pos logit contributions
ggy+0.034
live+0.032
zag+0.031
aches+0.030
case+0.030
Neg logit contributions
 Can-0.048
elt-0.045
 Please-0.044
 louder-0.042
Stop-0.042
Token Suddenly
Token ID24975
Feature activation-7.49e-04

Pos logit contributions
 Can+0.007
elt+0.006
 Please+0.006
 ding+0.006
 louder+0.006
Neg logit contributions
ggy-0.008
live-0.008
ail-0.008
case-0.008
zag-0.008
Token,
Token ID11
Feature activation+8.24e-04

Pos logit contributions
ggy+0.001
live+0.001
zag+0.001
ail+0.001
case+0.001
Neg logit contributions
 Can-0.002
elt-0.001
 Please-0.001
 louder-0.001
Stop-0.001
Token a
Token ID257
Feature activation+3.79e-02

Pos logit contributions
 Can+0.001
elt+0.001
 Please+0.001
 louder+0.001
Stop+0.001
Neg logit contributions
ggy-0.002
live-0.002
zag-0.002
case-0.002
aches-0.002
Token loud
Token ID7812
Feature activation+9.29e-02

Pos logit contributions
 Can+0.070
elt+0.070
ted+0.067
ting+0.065
ened+0.064
Neg logit contributions
ggy-0.070
case-0.063
zag-0.061
lly-0.060
ail-0.060
Token noise
Token ID7838
Feature activation+2.67e-02

Pos logit contributions
elt+0.178
 Can+0.164
ksh+0.156
ted+0.155
elling+0.148
Neg logit contributions
ggy-0.185
zag-0.160
lf-0.159
case-0.152
ves-0.143
Token came
Token ID1625
Feature activation+3.00e-02

Pos logit contributions
 Can+0.047
 Please+0.043
elt+0.042
 Look+0.040
ksh+0.039
Neg logit contributions
ggy-0.054
zag-0.047
ves-0.047
live-0.047
case-0.047
Token out
Token ID503
Feature activation+7.36e-03

Pos logit contributions
 Can+0.059
 Please+0.054
elt+0.050
 Look+0.047
ksh+0.047
Neg logit contributions
zag-0.054
ggy-0.051
ail-0.049
ics-0.048
case-0.045
Token of
Token ID286
Feature activation+2.19e-02

Pos logit contributions
 Can+0.014
elt+0.013
 Please+0.013
Stop+0.012
 Look+0.012
Neg logit contributions
ggy-0.013
zag-0.012
ail-0.012
case-0.012
live-0.011
Token the
Token ID262
Feature activation-4.66e-03

Pos logit contributions
 Can+0.046
elt+0.046
 Please+0.044
ksh+0.043
ly+0.042
Neg logit contributions
ggy-0.030
live-0.029
lf-0.028
ail-0.028
kins-0.027
Token
Token ID198
Feature activation-1.69e-02

Pos logit contributions
aches+0.098
ggy+0.096
live+0.094
zag+0.092
kin+0.092
Neg logit contributions
 Can-0.070
elt-0.062
 Please-0.061
 Look-0.058
Stop-0.057
TokenHer
Token ID9360
Feature activation-1.23e-02

Pos logit contributions
ggy+0.033
live+0.032
zag+0.031
aches+0.031
ail+0.030
Neg logit contributions
 Can-0.030
elt-0.028
 Please-0.027
 louder-0.026
Stop-0.026
Token friend
Token ID1545
Feature activation+2.97e-02

Pos logit contributions
live+0.023
case+0.022
aches+0.022
ggy+0.022
ves+0.021
Neg logit contributions
 Can-0.020
 louder-0.020
elt-0.020
 hello-0.019
 eyes-0.019
Token said
Token ID531
Feature activation+1.65e-02

Pos logit contributions
 Can+0.058
elt+0.054
 Please+0.053
Stop+0.050
 ding+0.050
Neg logit contributions
ggy-0.053
live-0.049
ail-0.048
zag-0.047
case-0.046
Token,
Token ID11
Feature activation+3.25e-02

Pos logit contributions
 Can+0.039
elt+0.037
Stop+0.036
ksh+0.036
 Please+0.036
Neg logit contributions
ggy-0.023
ail-0.019
ves-0.019
aches-0.018
zag-0.018
Token "
Token ID366
Feature activation+9.27e-02

Pos logit contributions
 Can+0.052
elling+0.050
vious+0.049
elt+0.049
Stop+0.049
Neg logit contributions
ggy-0.065
live-0.057
zag-0.057
ail-0.057
ves-0.056
TokenYes
Token ID5297
Feature activation+1.74e-02

Pos logit contributions
 eyes+0.153
ened+0.151
ly+0.138
rs+0.136
ening+0.135
Neg logit contributions
live-0.166
ggy-0.161
lly-0.155
case-0.153
zag-0.149
Token,
Token ID11
Feature activation+4.91e-02

Pos logit contributions
 Can+0.040
elt+0.040
 Please+0.037
Stop+0.037
ted+0.036
Neg logit contributions
ggy-0.024
ail-0.022
live-0.021
oppy-0.020
zag-0.020
Token it
Token ID340
Feature activation-2.36e-03

Pos logit contributions
elt+0.107
ksh+0.095
ened+0.091
ted+0.090
ting+0.089
Neg logit contributions
ggy-0.081
ves-0.074
case-0.070
aches-0.069
lf-0.067
Token's
Token ID338
Feature activation+1.04e-02

Pos logit contributions
ggy+0.005
live+0.005
aches+0.005
zag+0.005
ail+0.004
Neg logit contributions
 Can-0.004
elt-0.004
 Please-0.003
 louder-0.003
Stop-0.003
Token in
Token ID287
Feature activation+1.38e-03

Pos logit contributions
 Can+0.016
 louder+0.015
 Please+0.014
 ding+0.013
 Look+0.013
Neg logit contributions
live-0.022
ggy-0.021
ail-0.021
ves-0.021
zag-0.021
Token
Token ID198
Feature activation-1.99e-02

Pos logit contributions
kin+0.093
kins+0.092
aches+0.092
zag+0.091
oppy+0.089
Neg logit contributions
 Can-0.049
 Please-0.041
Stop-0.040
 It-0.040
 Come-0.039
TokenJust
Token ID5703
Feature activation+4.75e-03

Pos logit contributions
zag+0.037
oppy+0.036
ggy+0.036
ail+0.036
live+0.034
Neg logit contributions
 Can-0.037
 Please-0.034
Stop-0.033
elt-0.032
 louder-0.031
Token then
Token ID788
Feature activation-1.12e-02

Pos logit contributions
 Can+0.009
elt+0.008
 Please+0.008
 Look+0.007
Stop+0.007
Neg logit contributions
ggy-0.009
live-0.008
aches-0.008
zag-0.008
lly-0.008
Token,
Token ID11
Feature activation-1.32e-02

Pos logit contributions
ggy+0.019
live+0.018
zag+0.018
case+0.017
aches+0.017
Neg logit contributions
 Can-0.023
elt-0.021
 Please-0.021
 louder-0.020
Stop-0.019
Token a
Token ID257
Feature activation+3.90e-02

Pos logit contributions
ggy+0.026
live+0.025
aches+0.024
case+0.024
zag+0.024
Neg logit contributions
 Can-0.023
elt-0.021
 Please-0.020
 louder-0.019
Stop-0.019
Token loud
Token ID7812
Feature activation+9.25e-02

Pos logit contributions
elt+0.076
 Can+0.075
ted+0.071
ened+0.069
ting+0.069
Neg logit contributions
ggy-0.068
case-0.061
live-0.059
ail-0.059
zag-0.059
Token,
Token ID11
Feature activation+4.46e-02

Pos logit contributions
elt+0.184
 Can+0.164
ksh+0.156
ted+0.155
eared+0.153
Neg logit contributions
ggy-0.178
lf-0.160
zag-0.156
case-0.149
ves-0.136
Token booming
Token ID32017
Feature activation+8.79e-02

Pos logit contributions
elt+0.084
 Can+0.081
ted+0.079
ened+0.077
ring+0.076
Neg logit contributions
ggy-0.075
case-0.069
lly-0.067
zag-0.064
live-0.063
Token voice
Token ID3809
Feature activation+3.30e-02

Pos logit contributions
elt+0.168
 Can+0.164
ted+0.151
ksh+0.151
vious+0.143
Neg logit contributions
ggy-0.149
zag-0.137
lf-0.132
case-0.119
ail-0.117
Token said
Token ID531
Feature activation+2.54e-02

Pos logit contributions
 Can+0.060
elt+0.055
 Please+0.054
 Look+0.050
 louder+0.049
Neg logit contributions
ggy-0.064
live-0.059
zag-0.058
ail-0.058
case-0.057
Token,
Token ID11
Feature activation+3.10e-02

Pos logit contributions
 Can+0.053
elt+0.051
 Please+0.048
Stop+0.048
ted+0.047
Neg logit contributions
ggy-0.042
ail-0.037
zag-0.036
live-0.036
case-0.036
Token
Token ID198
Feature activation-3.40e-02

Pos logit contributions
ggy+0.004
live+0.004
aches+0.004
zag+0.004
ail+0.004
Neg logit contributions
 Can-0.005
elt-0.005
 Please-0.004
 Look-0.004
ted-0.004
Token
Token ID198
Feature activation-6.57e-03

Pos logit contributions
aches+0.083
kins+0.083
kin+0.081
live+0.079
ail+0.078
Neg logit contributions
 Can-0.038
 It-0.032
 Please-0.030
Stop-0.029
 hello-0.028
TokenSuddenly
Token ID38582
Feature activation-5.19e-03

Pos logit contributions
ail+0.013
oppy+0.012
ggy+0.012
zag+0.012
live+0.012
Neg logit contributions
 Can-0.012
 Please-0.011
Stop-0.010
Go-0.010
 louder-0.010
Token,
Token ID11
Feature activation-4.66e-03

Pos logit contributions
ggy+0.008
live+0.007
zag+0.007
ail+0.007
ves+0.006
Neg logit contributions
 Can-0.012
elt-0.011
 Please-0.011
Stop-0.010
 ding-0.010
Token a
Token ID257
Feature activation+3.96e-02

Pos logit contributions
ggy+0.009
live+0.008
aches+0.008
zag+0.008
case+0.008
Neg logit contributions
 Can-0.008
elt-0.008
 Please-0.007
ly-0.007
 Look-0.007
Token loud
Token ID7812
Feature activation+9.24e-02

Pos logit contributions
 Can+0.075
elt+0.073
ted+0.068
 Please+0.067
ened+0.067
Neg logit contributions
ggy-0.072
live-0.065
case-0.064
zag-0.064
ail-0.063
Token noise
Token ID7838
Feature activation+2.48e-02

Pos logit contributions
elt+0.174
 Can+0.165
ksh+0.152
 Please+0.149
eared+0.147
Neg logit contributions
ggy-0.181
zag-0.160
lf-0.150
case-0.147
ves-0.142
Token scared
Token ID12008
Feature activation+5.21e-02

Pos logit contributions
 Can+0.041
elt+0.038
 Please+0.037
 Look+0.035
 louder+0.034
Neg logit contributions
ggy-0.051
live-0.047
zag-0.046
ves-0.046
case-0.046
Token the
Token ID262
Feature activation+1.70e-02

Pos logit contributions
 Can+0.106
 Please+0.097
elt+0.097
ksh+0.093
 ding+0.091
Neg logit contributions
ggy-0.085
live-0.067
ail-0.067
lly-0.066
ves-0.065
Token bunny
Token ID44915
Feature activation-6.69e-03

Pos logit contributions
 Can+0.025
elt+0.024
 Please+0.022
 louder+0.021
 ding+0.021
Neg logit contributions
ggy-0.037
live-0.035
case-0.035
zag-0.034
ail-0.034
Token and
Token ID290
Feature activation+1.62e-02

Pos logit contributions
ggy+0.013
live+0.012
aches+0.012
zag+0.012
case+0.012
Neg logit contributions
 Can-0.012
elt-0.011
 Please-0.011
 louder-0.011
ted-0.010
Token
Token ID198
Feature activation-3.92e-02

Pos logit contributions
ggy+0.024
live+0.024
case+0.023
ail+0.023
zag+0.023
Neg logit contributions
 Can-0.031
elt-0.029
 Please-0.029
 louder-0.028
ly-0.028
Token
Token ID198
Feature activation-2.44e-02

Pos logit contributions
aches+0.082
kins+0.078
ggy+0.078
kin+0.076
live+0.075
Neg logit contributions
 Can-0.063
elt-0.057
 Please-0.056
Stop-0.054
 Look-0.052
TokenSuddenly
Token ID38582
Feature activation-4.83e-03

Pos logit contributions
ggy+0.040
live+0.038
zag+0.037
ail+0.037
aches+0.037
Neg logit contributions
 Can-0.051
elt-0.048
 Please-0.047
 louder-0.045
Stop-0.044
Token,
Token ID11
Feature activation-5.62e-03

Pos logit contributions
ggy+0.007
live+0.007
ail+0.006
case+0.006
aches+0.006
Neg logit contributions
 Can-0.011
elt-0.010
 Please-0.010
Stop-0.009
ting-0.009
Token a
Token ID257
Feature activation+3.72e-02

Pos logit contributions
ggy+0.011
live+0.010
aches+0.010
case+0.010
zag+0.010
Neg logit contributions
 Can-0.010
elt-0.009
 Please-0.009
ly-0.008
Stop-0.008
Token loud
Token ID7812
Feature activation+9.23e-02

Pos logit contributions
 Can+0.068
elt+0.067
ted+0.063
ting+0.062
ened+0.061
Neg logit contributions
ggy-0.070
case-0.063
live-0.061
ail-0.061
zag-0.061
Token noise
Token ID7838
Feature activation+2.80e-02

Pos logit contributions
elt+0.172
 Can+0.161
ted+0.149
ksh+0.146
eared+0.145
Neg logit contributions
ggy-0.185
lf-0.162
zag-0.159
case-0.152
ves-0.144
Token came
Token ID1625
Feature activation+2.35e-02

Pos logit contributions
 Can+0.049
elt+0.045
 Please+0.044
 Look+0.042
 louder+0.040
Neg logit contributions
ggy-0.056
live-0.050
case-0.050
zag-0.050
ves-0.050
Token from
Token ID422
Feature activation+2.01e-03

Pos logit contributions
 Can+0.045
 Please+0.040
elt+0.038
Stop+0.036
ted+0.036
Neg logit contributions
zag-0.043
ggy-0.041
ail-0.040
ics-0.039
case-0.038
Token the
Token ID262
Feature activation-5.41e-04

Pos logit contributions
 Can+0.004
elt+0.003
 Please+0.003
 louder+0.003
 Look+0.003
Neg logit contributions
ggy-0.004
live-0.004
zag-0.004
case-0.003
aches-0.003
Token living
Token ID2877
Feature activation+4.02e-02

Pos logit contributions
ggy+0.001
live+0.001
case+0.001
zag+0.001
ail+0.001
Neg logit contributions
 Can-0.001
elt-0.001
 Please-0.001
ted-0.001
ened-0.001
Token
Token ID198
Feature activation-4.60e-02

Pos logit contributions
 Can+0.019
elt+0.018
 Please+0.018
 louder+0.017
ly+0.017
Neg logit contributions
ggy-0.016
live-0.016
zag-0.015
case-0.015
ail-0.015
Token
Token ID198
Feature activation-3.34e-02

Pos logit contributions
aches+0.115
live+0.111
zag+0.110
kins+0.109
kin+0.108
Neg logit contributions
 Can-0.058
 It-0.048
 Please-0.047
 louder-0.044
 Come-0.044
TokenSuddenly
Token ID38582
Feature activation-9.89e-03

Pos logit contributions
aches+0.074
zag+0.074
ics+0.071
ail+0.071
ggy+0.071
Neg logit contributions
 Can-0.050
 Please-0.047
Stop-0.043
 louder-0.043
Go-0.041
Token,
Token ID11
Feature activation-6.86e-03

Pos logit contributions
ggy+0.015
live+0.014
ail+0.013
zag+0.013
case+0.013
Neg logit contributions
 Can-0.022
elt-0.021
 Please-0.020
Stop-0.019
 louder-0.019
Token a
Token ID257
Feature activation+3.73e-02

Pos logit contributions
ggy+0.012
live+0.011
zag+0.011
aches+0.011
case+0.011
Neg logit contributions
 Can-0.014
elt-0.013
 Please-0.012
 louder-0.012
ly-0.012
Token loud
Token ID7812
Feature activation+9.22e-02

Pos logit contributions
 Can+0.072
elt+0.072
ted+0.067
ened+0.066
ting+0.065
Neg logit contributions
ggy-0.066
case-0.059
live-0.058
zag-0.058
lly-0.057
Token noise
Token ID7838
Feature activation+2.21e-02

Pos logit contributions
elt+0.175
 Can+0.166
ksh+0.153
eared+0.150
 Please+0.147
Neg logit contributions
ggy-0.184
zag-0.156
lf-0.153
ves-0.151
case-0.144
Token made
Token ID925
Feature activation+1.50e-02

Pos logit contributions
 Can+0.037
elt+0.035
 Please+0.034
 Look+0.032
 louder+0.031
Neg logit contributions
ggy-0.045
live-0.042
case-0.041
ves-0.041
zag-0.040
Token them
Token ID606
Feature activation-1.51e-02

Pos logit contributions
 Can+0.024
elt+0.022
 Please+0.021
Stop+0.019
ly+0.018
Neg logit contributions
ggy-0.032
ail-0.030
ves-0.030
case-0.029
live-0.029
Token stop
Token ID2245
Feature activation+1.65e-02

Pos logit contributions
ggy+0.030
live+0.030
ail+0.028
aches+0.028
zag+0.028
Neg logit contributions
 Can-0.025
elt-0.024
 Please-0.023
 louder-0.022
 ding-0.022
Token and
Token ID290
Feature activation+2.03e-02

Pos logit contributions
 Can+0.040
elt+0.038
 Please+0.037
Stop+0.035
vious+0.035
Neg logit contributions
ggy-0.021
ail-0.018
zag-0.018
ves-0.017
kins-0.017
Token friends
Token ID2460
Feature activation+8.66e-03

Pos logit contributions
live+0.023
ggy+0.022
case+0.021
aches+0.020
zag+0.020
Neg logit contributions
 Can-0.025
 louder-0.024
elt-0.023
 eyes-0.023
 Please-0.023
Token smiled
Token ID13541
Feature activation+4.61e-02

Pos logit contributions
 Can+0.016
elt+0.015
 Please+0.014
 Look+0.013
 ding+0.013
Neg logit contributions
ggy-0.016
live-0.015
case-0.015
ail-0.015
aches-0.015
Token and
Token ID290
Feature activation+1.34e-02

Pos logit contributions
 Can+0.090
Stop+0.089
elt+0.083
 spec+0.083
 Stop+0.082
Neg logit contributions
case-0.062
ggy-0.060
ail-0.058
zag-0.057
kins-0.056
Token said
Token ID531
Feature activation+9.87e-03

Pos logit contributions
 Can+0.029
 Please+0.026
elt+0.026
Stop+0.025
 Look+0.025
Neg logit contributions
ggy-0.021
live-0.019
lly-0.019
aches-0.018
ail-0.018
Token,
Token ID11
Feature activation+2.66e-02

Pos logit contributions
 Can+0.024
elt+0.023
 Please+0.022
Stop+0.022
 ding+0.021
Neg logit contributions
ggy-0.013
ail-0.012
ves-0.012
aches-0.011
live-0.011
Token "
Token ID366
Feature activation+9.22e-02

Pos logit contributions
 Can+0.041
elt+0.040
Stop+0.038
ened+0.038
elling+0.038
Neg logit contributions
ggy-0.053
zag-0.050
live-0.049
ail-0.049
ves-0.048
TokenYes
Token ID5297
Feature activation+1.63e-02

Pos logit contributions
 eyes+0.167
ened+0.163
ening+0.157
ring+0.144
elling+0.143
Neg logit contributions
live-0.151
ggy-0.147
ll-0.144
lly-0.143
 seek-0.130
Token,
Token ID11
Feature activation+5.51e-02

Pos logit contributions
elt+0.041
 Can+0.040
 Please+0.037
ksh+0.037
Stop+0.036
Neg logit contributions
ggy-0.019
ail-0.018
ves-0.017
live-0.016
case-0.016
Token you
Token ID345
Feature activation+6.42e-03

Pos logit contributions
elt+0.117
ksh+0.110
ened+0.107
 Can+0.103
ening+0.100
Neg logit contributions
ggy-0.097
aches-0.085
ves-0.081
lly-0.079
 solve-0.076
Token can
Token ID460
Feature activation-2.97e-02

Pos logit contributions
 Can+0.009
elt+0.008
 Please+0.008
 louder+0.007
ted+0.007
Neg logit contributions
ggy-0.015
live-0.015
zag-0.014
case-0.014
aches-0.014
Token.
Token ID13
Feature activation+4.67e-02

Pos logit contributions
live+0.057
ggy+0.055
zag+0.053
ail+0.052
ves+0.052
Neg logit contributions
 Can-0.053
 Please-0.049
ly-0.046
 hello-0.046
 louder-0.046
Token
Token ID198
Feature activation-2.18e-02

Pos logit contributions
ggy+0.001
live+0.001
aches+0.001
zag+0.001
ail+0.001
Neg logit contributions
 Can-0.001
 Please-0.001
elt-0.001
 louder-0.001
 Look-0.001
Token
Token ID198
Feature activation-1.61e-02

Pos logit contributions
aches+0.043
zag+0.042
live+0.042
ics+0.040
ail+0.039
Neg logit contributions
 Can-0.039
 Please-0.035
 louder-0.033
Stop-0.033
 hello-0.032
TokenAnna
Token ID31160
Feature activation+2.79e-02

Pos logit contributions
live+0.030
ggy+0.030
zag+0.030
aches+0.030
ail+0.029
Neg logit contributions
 Can-0.029
 Please-0.027
 louder-0.025
elt-0.025
Stop-0.025
Token says
Token ID1139
Feature activation+1.49e-02

Pos logit contributions
 Can+0.047
elt+0.044
 Please+0.040
eared+0.040
ting+0.039
Neg logit contributions
ggy-0.054
case-0.051
lly-0.050
ail-0.050
zag-0.050
Token,
Token ID11
Feature activation+2.98e-02

Pos logit contributions
 Can+0.036
elt+0.035
Stop+0.034
ksh+0.034
 Please+0.034
Neg logit contributions
ggy-0.020
ail-0.017
ves-0.016
zag-0.016
oppy-0.015
Token "
Token ID366
Feature activation+9.21e-02

Pos logit contributions
 Can+0.045
elt+0.044
ly+0.042
eared+0.042
elling+0.042
Neg logit contributions
ggy-0.061
zag-0.056
live-0.056
case-0.055
ail-0.055
TokenOkay
Token ID16454
Feature activation+1.36e-02

Pos logit contributions
ened+0.150
ening+0.137
 eyes+0.137
elling+0.136
eared+0.136
Neg logit contributions
ggy-0.182
live-0.173
lly-0.163
case-0.161
 seek-0.157
Token,
Token ID11
Feature activation+3.14e-02

Pos logit contributions
 Can+0.031
elt+0.031
ted+0.028
Stop+0.028
 Please+0.028
Neg logit contributions
ggy-0.019
ail-0.017
live-0.017
case-0.017
zag-0.017
Token I
Token ID314
Feature activation+4.62e-03

Pos logit contributions
elt+0.061
ened+0.054
eared+0.054
ted+0.053
 Can+0.051
Neg logit contributions
ggy-0.060
case-0.055
lf-0.051
oppy-0.051
 solve-0.050
Token will
Token ID481
Feature activation-6.36e-03

Pos logit contributions
 Can+0.007
 Please+0.006
elt+0.006
 louder+0.006
 Look+0.006
Neg logit contributions
ggy-0.010
live-0.010
aches-0.010
ail-0.010
ves-0.010
Token read
Token ID1100
Feature activation+2.39e-02

Pos logit contributions
live+0.012
ggy+0.012
ail+0.011
zag+0.011
ves+0.011
Neg logit contributions
 Can-0.012
 Please-0.011
elt-0.010
 louder-0.010
 Look-0.010
Token
Token ID198
Feature activation-3.56e-02

Pos logit contributions
ggy+0.003
live+0.003
case+0.003
ail+0.003
zag+0.003
Neg logit contributions
 Can-0.003
elt-0.003
 Please-0.003
 louder-0.003
 ding-0.002
Token
Token ID198
Feature activation-8.82e-03

Pos logit contributions
aches+0.087
kins+0.086
kin+0.085
live+0.079
asis+0.079
Neg logit contributions
 Can-0.043
 Please-0.037
Stop-0.036
 It-0.036
 Look-0.035
TokenSuddenly
Token ID38582
Feature activation-4.57e-03

Pos logit contributions
ggy+0.015
live+0.015
zag+0.014
aches+0.014
ail+0.014
Neg logit contributions
 Can-0.018
elt-0.016
 Please-0.016
 louder-0.015
Stop-0.015
Token,
Token ID11
Feature activation+1.23e-04

Pos logit contributions
ggy+0.007
live+0.007
case+0.006
ail+0.006
zag+0.006
Neg logit contributions
 Can-0.010
elt-0.009
 Please-0.009
Stop-0.009
 louder-0.008
Token a
Token ID257
Feature activation+3.79e-02

Pos logit contributions
 Can+0.000
elt+0.000
 Please+0.000
ly+0.000
 Look+0.000
Neg logit contributions
ggy-0.000
live-0.000
case-0.000
ail-0.000
zag-0.000
Token loud
Token ID7812
Feature activation+9.21e-02

Pos logit contributions
 Can+0.070
elt+0.068
ted+0.064
ened+0.063
ting+0.062
Neg logit contributions
ggy-0.072
case-0.064
live-0.063
zag-0.063
ail-0.062
Token noise
Token ID7838
Feature activation+2.55e-02

Pos logit contributions
elt+0.172
 Can+0.164
ksh+0.151
ted+0.146
eared+0.146
Neg logit contributions
ggy-0.187
zag-0.161
lf-0.155
ves-0.147
case-0.146
Token made
Token ID925
Feature activation+1.74e-02

Pos logit contributions
 Can+0.044
elt+0.040
 Please+0.040
 Look+0.038
 ding+0.036
Neg logit contributions
ggy-0.051
live-0.046
ves-0.046
zag-0.046
case-0.046
Token the
Token ID262
Feature activation+6.72e-03

Pos logit contributions
 Can+0.035
elt+0.033
 Please+0.032
ksh+0.030
 Look+0.029
Neg logit contributions
ggy-0.029
case-0.029
ail-0.028
ves-0.026
ics-0.026
Token ants
Token ID27842
Feature activation-1.79e-02

Pos logit contributions
 Can+0.010
elt+0.010
 Please+0.009
 louder+0.009
Stop+0.009
Neg logit contributions
ggy-0.015
live-0.014
zag-0.014
case-0.014
aches-0.014
Token scatter
Token ID41058
Feature activation+2.74e-02

Pos logit contributions
ggy+0.034
live+0.034
zag+0.031
ail+0.031
case+0.031
Neg logit contributions
 Can-0.032
elt-0.030
 Please-0.029
 louder-0.028
 ding-0.027
Token
Token ID198
Feature activation-2.83e-03

Pos logit contributions
aches+0.070
ggy+0.067
live+0.065
kins+0.065
kin+0.065
Neg logit contributions
 Can-0.053
 Please-0.045
Stop-0.042
elt-0.042
 Look-0.042
TokenL
Token ID43
Feature activation+3.35e-02

Pos logit contributions
ggy+0.005
live+0.005
zag+0.005
aches+0.005
ail+0.005
Neg logit contributions
 Can-0.005
elt-0.005
 Please-0.005
 louder-0.004
Stop-0.004
Tokenily
Token ID813
Feature activation+2.18e-02

Pos logit contributions
 Can+0.061
elt+0.058
ly+0.055
 Please+0.055
Stop+0.053
Neg logit contributions
ggy-0.062
zag-0.058
live-0.058
aches-0.057
ves-0.056
Token said
Token ID531
Feature activation+1.51e-02

Pos logit contributions
 Can+0.040
elt+0.038
 Please+0.036
 louder+0.035
Stop+0.034
Neg logit contributions
ggy-0.040
live-0.039
zag-0.037
case-0.037
ail-0.037
Token,
Token ID11
Feature activation+3.15e-02

Pos logit contributions
 Can+0.033
elt+0.032
Stop+0.031
ksh+0.031
 Please+0.031
Neg logit contributions
ggy-0.023
live-0.019
ail-0.019
zag-0.019
case-0.019
Token "
Token ID366
Feature activation+9.20e-02

Pos logit contributions
vious+0.052
elling+0.051
 Can+0.051
elt+0.051
eared+0.050
Neg logit contributions
ggy-0.059
zag-0.054
ail-0.052
live-0.052
case-0.052
TokenWow
Token ID22017
Feature activation-6.58e-03

Pos logit contributions
 eyes+0.189
elling+0.182
elt+0.179
ened+0.179
rier+0.176
Neg logit contributions
live-0.136
ggy-0.126
case-0.116
 seek-0.110
lf-0.106
Token,
Token ID11
Feature activation+3.50e-02

Pos logit contributions
ggy+0.013
live+0.013
aches+0.012
zag+0.012
ves+0.012
Neg logit contributions
 Can-0.012
 Please-0.011
elt-0.010
 louder-0.010
 Look-0.010
Token Max
Token ID5436
Feature activation+2.56e-03

Pos logit contributions
elt+0.079
ksh+0.070
eared+0.068
ened+0.067
 Can+0.066
Neg logit contributions
ggy-0.060
case-0.052
live-0.049
lf-0.048
 solve-0.047
Token!
Token ID0
Feature activation+5.51e-02

Pos logit contributions
 Can+0.005
elt+0.005
 Please+0.005
 louder+0.005
Stop+0.004
Neg logit contributions
ggy-0.004
live-0.004
zag-0.004
aches-0.004
case-0.004
Token You
Token ID921
Feature activation-1.13e-03

Pos logit contributions
 louder+0.072
ly+0.067
ened+0.065
aw+0.063
ening+0.062
Neg logit contributions
case-0.116
oppy-0.113
ggy-0.111
live-0.110
zag-0.109
Token
Token ID198
Feature activation-2.46e-02

Pos logit contributions
zag+0.071
aches+0.071
live+0.069
case+0.066
ggy+0.066
Neg logit contributions
 Can-0.057
 Please-0.053
elt-0.050
 louder-0.049
Stop-0.048
TokenHer
Token ID9360
Feature activation-4.00e-03

Pos logit contributions
ggy+0.047
live+0.047
zag+0.047
aches+0.046
ail+0.046
Neg logit contributions
 Can-0.043
 Please-0.040
elt-0.038
 louder-0.037
Stop-0.037
Token mother
Token ID2802
Feature activation+3.13e-02

Pos logit contributions
live+0.009
ggy+0.009
case+0.008
aches+0.008
zag+0.008
Neg logit contributions
 Can-0.006
 louder-0.005
elt-0.005
 Please-0.005
 eyes-0.005
Token said
Token ID531
Feature activation+1.78e-02

Pos logit contributions
 Can+0.063
elt+0.057
 Please+0.056
ksh+0.053
 Look+0.052
Neg logit contributions
ggy-0.057
ail-0.052
lly-0.051
live-0.051
case-0.048
Token,
Token ID11
Feature activation+3.13e-02

Pos logit contributions
 Can+0.043
elt+0.041
Stop+0.040
 Please+0.040
ksh+0.039
Neg logit contributions
ggy-0.023
ail-0.021
ves-0.020
aches-0.019
live-0.019
Token "
Token ID366
Feature activation+9.19e-02

Pos logit contributions
 Can+0.047
elt+0.045
elling+0.043
ened+0.043
Stop+0.043
Neg logit contributions
ggy-0.065
live-0.060
zag-0.059
ail-0.058
case-0.058
TokenYes
Token ID5297
Feature activation+1.75e-02

Pos logit contributions
ened+0.161
ening+0.152
 eyes+0.148
icol+0.138
ring+0.138
Neg logit contributions
live-0.166
ggy-0.162
lly-0.151
ll-0.149
 seek-0.146
Token,
Token ID11
Feature activation+5.11e-02

Pos logit contributions
elt+0.043
 Can+0.042
 Please+0.038
ted+0.037
Stop+0.037
Neg logit contributions
ggy-0.023
ail-0.021
live-0.020
ves-0.019
aches-0.019
Token we
Token ID356
Feature activation+2.64e-03

Pos logit contributions
elt+0.107
ted+0.093
 Can+0.093
ened+0.093
ting+0.090
Neg logit contributions
ggy-0.095
ves-0.083
live-0.081
aches-0.081
case-0.080
Token can
Token ID460
Feature activation-2.90e-02

Pos logit contributions
 Can+0.003
elt+0.003
 Please+0.003
 louder+0.003
 Look+0.003
Neg logit contributions
ggy-0.007
live-0.006
zag-0.006
aches-0.006
ail-0.006
Token.
Token ID13
Feature activation+5.05e-02

Pos logit contributions
live+0.059
ggy+0.057
zag+0.055
ves+0.054
kins+0.054
Neg logit contributions
 Can-0.047
 Please-0.044
ly-0.042
elt-0.041
 hello-0.040
Token coat
Token ID13209
Feature activation-2.77e-02

Pos logit contributions
ggy+0.048
live+0.046
zag+0.044
aches+0.043
ves+0.043
Neg logit contributions
 Can-0.056
elt-0.053
 Please-0.052
 louder-0.049
 Look-0.049
Token.
Token ID13
Feature activation+4.82e-04

Pos logit contributions
ggy+0.049
live+0.047
zag+0.045
case+0.045
aches+0.044
Neg logit contributions
 Can-0.054
elt-0.050
 Please-0.049
 louder-0.046
Stop-0.046
Token Suddenly
Token ID24975
Feature activation-2.70e-03

Pos logit contributions
 Can+0.001
elt+0.001
 Please+0.001
 louder+0.001
ted+0.001
Neg logit contributions
ggy-0.001
case-0.001
live-0.001
zag-0.001
ail-0.001
Token,
Token ID11
Feature activation-1.78e-03

Pos logit contributions
ggy+0.004
live+0.004
zag+0.004
case+0.004
ail+0.004
Neg logit contributions
 Can-0.006
elt-0.005
 Please-0.005
 louder-0.005
Stop-0.005
Token a
Token ID257
Feature activation+4.12e-02

Pos logit contributions
ggy+0.003
live+0.003
zag+0.003
case+0.003
aches+0.003
Neg logit contributions
 Can-0.003
elt-0.003
 Please-0.003
 louder-0.003
Stop-0.003
Token loud
Token ID7812
Feature activation+9.19e-02

Pos logit contributions
 Can+0.083
elt+0.081
ted+0.076
ened+0.075
 Please+0.075
Neg logit contributions
ggy-0.070
case-0.063
live-0.062
zag-0.060
lly-0.060
Token noise
Token ID7838
Feature activation+2.74e-02

Pos logit contributions
elt+0.175
 Can+0.164
ksh+0.152
eared+0.152
 Please+0.148
Neg logit contributions
ggy-0.180
zag-0.157
lf-0.156
case-0.151
ves-0.142
Token came
Token ID1625
Feature activation+3.04e-02

Pos logit contributions
 Can+0.046
elt+0.042
 Please+0.042
 Look+0.039
 louder+0.038
Neg logit contributions
ggy-0.056
live-0.052
zag-0.051
case-0.051
ves-0.051
Token from
Token ID422
Feature activation+7.77e-03

Pos logit contributions
 Can+0.056
 Please+0.051
elt+0.047
Stop+0.046
 applause+0.045
Neg logit contributions
zag-0.057
ggy-0.053
ail-0.051
ics-0.051
case-0.049
Token the
Token ID262
Feature activation+1.24e-02

Pos logit contributions
 Can+0.014
elt+0.014
 Please+0.013
 louder+0.012
 Look+0.012
Neg logit contributions
ggy-0.014
live-0.014
zag-0.013
case-0.013
aches-0.013
Token bushes
Token ID37413
Feature activation-7.68e-03

Pos logit contributions
 Can+0.023
elt+0.022
 Please+0.021
 louder+0.020
ted+0.020
Neg logit contributions
ggy-0.023
live-0.022
case-0.021
zag-0.021
ail-0.021
Token cozy
Token ID37438
Feature activation+1.75e-03

Pos logit contributions
ggy+0.009
live+0.009
zag+0.009
case+0.009
aches+0.009
Neg logit contributions
 Can-0.010
elt-0.010
 Please-0.009
 louder-0.009
Stop-0.009
Token fire
Token ID2046
Feature activation-2.78e-02

Pos logit contributions
 Can+0.003
elt+0.003
 Please+0.003
 louder+0.003
ly+0.003
Neg logit contributions
ggy-0.003
live-0.003
zag-0.003
aches-0.003
case-0.003
Token together
Token ID1978
Feature activation-4.27e-02

Pos logit contributions
ggy+0.051
live+0.049
zag+0.048
case+0.047
ail+0.047
Neg logit contributions
 Can-0.052
elt-0.048
 Please-0.047
Stop-0.044
 louder-0.044
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
live+0.079
ggy+0.079
zag+0.075
ail+0.073
aches+0.072
Neg logit contributions
 Can-0.078
elt-0.072
 Please-0.071
 louder-0.071
ly-0.069
Token<|endoftext|>
Token ID50256
Feature activation-2.38e-02

Pos logit contributions
aches+0.038
ggy+0.038
zag+0.036
live+0.035
kins+0.035
Neg logit contributions
 Can-0.032
elt-0.029
 Look-0.026
 Please-0.026
ted-0.025
TokenOnce
Token ID7454
Feature activation-9.12e-02

Pos logit contributions
aches+0.062
ggy+0.056
live+0.055
zag+0.055
oppy+0.055
Neg logit contributions
 Can-0.031
elt-0.026
ted-0.025
 Please-0.025
 Look-0.025
Token upon
Token ID2402
Feature activation-8.18e-03

Pos logit contributions
ggy+0.225
live+0.209
aches+0.208
asis+0.202
bes+0.200
Neg logit contributions
 Can-0.113
elt-0.101
ly-0.096
 eyes-0.095
 hello-0.092
Token a
Token ID257
Feature activation-1.37e-02

Pos logit contributions
live+0.019
ggy+0.019
ail+0.019
zag+0.019
aches+0.018
Neg logit contributions
 Can-0.011
elt-0.010
 Please-0.009
 louder-0.009
 Look-0.008
Token time
Token ID640
Feature activation-2.90e-02

Pos logit contributions
aches+0.023
ggy+0.023
ail+0.022
zag+0.022
live+0.022
Neg logit contributions
 Can-0.025
elt-0.024
 Please-0.024
ly-0.022
 louder-0.022
Token,
Token ID11
Feature activation-1.47e-02

Pos logit contributions
live+0.060
ggy+0.060
aches+0.059
ves+0.056
zag+0.056
Neg logit contributions
 Can-0.047
elt-0.043
 Please-0.041
 hello-0.040
 Look-0.039
Token there
Token ID612
Feature activation-1.55e-02

Pos logit contributions
live+0.032
ggy+0.032
aches+0.032
ail+0.031
ves+0.030
Neg logit contributions
 Can-0.024
 Please-0.021
elt-0.020
 Look-0.020
 louder-0.019
Token together
Token ID1978
Feature activation-2.71e-02

Pos logit contributions
ggy+0.059
live+0.057
zag+0.055
case+0.055
aches+0.054
Neg logit contributions
 Can-0.060
elt-0.055
 Please-0.054
Stop-0.051
 louder-0.051
Token every
Token ID790
Feature activation-2.33e-02

Pos logit contributions
ggy+0.056
live+0.054
zag+0.052
case+0.052
ail+0.052
Neg logit contributions
 Can-0.044
elt-0.041
 Please-0.039
 louder-0.037
ted-0.036
Token weekend
Token ID5041
Feature activation-6.71e-03

Pos logit contributions
ggy+0.029
live+0.027
zag+0.025
case+0.025
aches+0.025
Neg logit contributions
 Can-0.058
elt-0.055
 Please-0.054
 louder-0.052
Stop-0.052
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
ggy+0.012
live+0.011
zag+0.010
ves+0.010
case+0.010
Neg logit contributions
 Can-0.014
elt-0.013
 Please-0.012
Stop-0.012
 louder-0.012
Token<|endoftext|>
Token ID50256
Feature activation-2.55e-02

Pos logit contributions
ggy+0.026
live+0.025
aches+0.024
zag+0.024
ail+0.024
Neg logit contributions
 Can-0.023
elt-0.021
 Look-0.020
 Please-0.019
ted-0.019
TokenOnce
Token ID7454
Feature activation-9.06e-02

Pos logit contributions
aches+0.068
ggy+0.060
zag+0.060
ail+0.060
oppy+0.059
Neg logit contributions
 Can-0.032
 It-0.027
elt-0.025
ted-0.025
 Look-0.025
Token upon
Token ID2402
Feature activation-1.07e-02

Pos logit contributions
ggy+0.217
live+0.202
aches+0.202
bes+0.195
asis+0.194
Neg logit contributions
 Can-0.111
elt-0.101
 eyes-0.096
ly-0.095
 hello-0.091
Token a
Token ID257
Feature activation-1.12e-02

Pos logit contributions
live+0.025
ail+0.025
ggy+0.025
zag+0.025
aches+0.024
Neg logit contributions
 Can-0.014
elt-0.012
 Please-0.012
 louder-0.011
 Look-0.011
Token time
Token ID640
Feature activation-3.10e-02

Pos logit contributions
aches+0.018
ggy+0.018
ail+0.018
zag+0.018
live+0.018
Neg logit contributions
 Can-0.021
elt-0.019
 Please-0.019
ly-0.018
 louder-0.018
Token,
Token ID11
Feature activation-1.77e-02

Pos logit contributions
aches+0.066
live+0.066
ggy+0.065
ves+0.062
zag+0.061
Neg logit contributions
 Can-0.047
elt-0.042
 Please-0.040
 hello-0.040
 Look-0.039
Token there
Token ID612
Feature activation-1.75e-02

Pos logit contributions
live+0.037
ggy+0.037
aches+0.037
ves+0.035
ail+0.035
Neg logit contributions
 Can-0.030
 Please-0.025
elt-0.025
 Look-0.025
 hello-0.024
Token all
Token ID477
Feature activation-2.33e-02

Pos logit contributions
ggy+0.116
live+0.114
aches+0.108
zag+0.105
lly+0.104
Neg logit contributions
 Can-0.096
elt-0.090
 louder-0.088
 Please-0.085
ly-0.085
Token day
Token ID1110
Feature activation-5.58e-02

Pos logit contributions
ggy+0.036
live+0.034
case+0.032
aches+0.032
zag+0.032
Neg logit contributions
 Can-0.052
elt-0.047
 louder-0.047
 Please-0.046
 ding-0.045
Token long
Token ID890
Feature activation-2.08e-02

Pos logit contributions
live+0.124
aches+0.116
ggy+0.115
zag+0.112
lly+0.112
Neg logit contributions
 Can-0.079
 Please-0.077
ly-0.077
 louder-0.075
 eyes-0.071
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
ggy+0.033
live+0.032
zag+0.030
aches+0.030
ail+0.030
Neg logit contributions
 Can-0.044
elt-0.041
 Please-0.040
 louder-0.039
ted-0.039
Token<|endoftext|>
Token ID50256
Feature activation-2.48e-02

Pos logit contributions
ggy+0.034
aches+0.033
zag+0.032
ves+0.031
live+0.031
Neg logit contributions
 Can-0.025
elt-0.023
 Look-0.021
 Please-0.020
ted-0.020
TokenOnce
Token ID7454
Feature activation-9.04e-02

Pos logit contributions
aches+0.065
ggy+0.058
live+0.058
zag+0.058
ail+0.057
Neg logit contributions
 Can-0.033
elt-0.027
Stop-0.026
 Please-0.026
 Look-0.026
Token upon
Token ID2402
Feature activation-5.30e-03

Pos logit contributions
ggy+0.224
aches+0.209
live+0.207
asis+0.205
bes+0.201
Neg logit contributions
 Can-0.109
elt-0.097
 eyes-0.095
 louder-0.087
 hello-0.087
Token a
Token ID257
Feature activation-1.64e-02

Pos logit contributions
live+0.012
ggy+0.012
ail+0.012
zag+0.012
aches+0.012
Neg logit contributions
 Can-0.007
elt-0.006
 Please-0.006
 louder-0.006
 Look-0.005
Token time
Token ID640
Feature activation-2.98e-02

Pos logit contributions
aches+0.029
ggy+0.028
ail+0.028
zag+0.028
ves+0.027
Neg logit contributions
 Can-0.029
elt-0.027
 Please-0.027
 louder-0.025
 Look-0.025
Token,
Token ID11
Feature activation-1.39e-02

Pos logit contributions
aches+0.064
live+0.064
ggy+0.064
ves+0.061
case+0.060
Neg logit contributions
 Can-0.045
elt-0.041
 Please-0.038
 hello-0.037
 Look-0.037
Token there
Token ID612
Feature activation-1.71e-02

Pos logit contributions
ggy+0.030
live+0.030
aches+0.029
ves+0.029
zag+0.029
Neg logit contributions
 Can-0.022
 Please-0.019
elt-0.019
 louder-0.018
 Look-0.018
Token her
Token ID607
Feature activation-1.43e-02

Pos logit contributions
ggy+0.011
live+0.011
aches+0.010
ail+0.010
case+0.010
Neg logit contributions
 Can-0.008
elt-0.008
 Please-0.007
Stop-0.007
 Look-0.007
Token mom
Token ID1995
Feature activation+8.24e-03

Pos logit contributions
live+0.029
zag+0.028
ggy+0.028
aches+0.028
ves+0.027
Neg logit contributions
 Can-0.023
 eyes-0.022
elt-0.021
 louder-0.021
 Please-0.021
Token replied
Token ID8712
Feature activation-6.96e-04

Pos logit contributions
 Can+0.013
elt+0.012
 Please+0.012
 louder+0.011
ly+0.011
Neg logit contributions
ggy-0.017
live-0.017
zag-0.016
case-0.016
aches-0.016
Token.
Token ID13
Feature activation+1.70e-03

Pos logit contributions
ggy+0.001
live+0.001
zag+0.001
case+0.001
ail+0.001
Neg logit contributions
 Can-0.002
elt-0.001
 Please-0.001
 louder-0.001
Stop-0.001
Token<|endoftext|>
Token ID50256
Feature activation-1.37e-02

Pos logit contributions
 Can+0.003
elt+0.003
 Please+0.002
 louder+0.002
Stop+0.002
Neg logit contributions
ggy-0.003
live-0.003
zag-0.003
case-0.003
ail-0.003
TokenOnce
Token ID7454
Feature activation-9.03e-02

Pos logit contributions
aches+0.033
ggy+0.030
live+0.030
zag+0.030
oppy+0.029
Neg logit contributions
 Can-0.021
elt-0.018
 Please-0.018
ted-0.017
 Look-0.017
Token there
Token ID612
Feature activation-3.30e-02

Pos logit contributions
ggy+0.220
live+0.204
aches+0.202
asis+0.200
zag+0.193
Neg logit contributions
 Can-0.123
elt-0.107
 eyes-0.103
ly-0.100
 hello-0.097
Token was
Token ID373
Feature activation+1.31e-02

Pos logit contributions
zag+0.068
ggy+0.067
aches+0.066
live+0.066
ves+0.064
Neg logit contributions
 Can-0.053
elt-0.049
ened-0.046
 Look-0.044
ted-0.044
Token a
Token ID257
Feature activation-2.49e-03

Pos logit contributions
 Can+0.022
elt+0.020
 Please+0.020
 louder+0.019
Stop+0.018
Neg logit contributions
ggy-0.027
live-0.025
case-0.024
aches-0.024
zag-0.024
Token boy
Token ID2933
Feature activation-1.08e-02

Pos logit contributions
live+0.006
ggy+0.006
aches+0.006
zag+0.006
ail+0.006
Neg logit contributions
 Can-0.003
 Please-0.003
elt-0.003
 louder-0.003
 ding-0.003
Token named
Token ID3706
Feature activation-4.38e-03

Pos logit contributions
live+0.018
ggy+0.018
case+0.017
zag+0.017
aches+0.017
Neg logit contributions
 Can-0.022
elt-0.021
 Please-0.020
 louder-0.019
 Look-0.019
Token lots
Token ID6041
Feature activation+6.46e-03

Pos logit contributions
ggy+0.000
live+0.000
zag+0.000
aches+0.000
case+0.000
Neg logit contributions
 Can-0.000
elt-0.000
 Please-0.000
 louder-0.000
Stop-0.000
Token of
Token ID286
Feature activation-7.77e-03

Pos logit contributions
 Can+0.011
elt+0.011
 Please+0.010
 louder+0.010
 Look+0.010
Neg logit contributions
ggy-0.013
live-0.012
case-0.012
zag-0.012
aches-0.012
Token fun
Token ID1257
Feature activation-1.82e-02

Pos logit contributions
zag+0.017
ail+0.017
live+0.016
ggy+0.016
case+0.016
Neg logit contributions
 Can-0.011
elt-0.011
 louder-0.010
 hello-0.010
Stop-0.010
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
live+0.034
ggy+0.033
ail+0.033
zag+0.032
case+0.032
Neg logit contributions
 Can-0.033
elt-0.030
 Please-0.029
ting-0.029
 louder-0.028
Token<|endoftext|>
Token ID50256
Feature activation-2.67e-02

Pos logit contributions
aches+0.043
ggy+0.042
zag+0.040
kins+0.040
ves+0.039
Neg logit contributions
 Can-0.034
elt-0.030
 Look-0.028
 It-0.027
 Come-0.027
TokenOnce
Token ID7454
Feature activation-9.02e-02

Pos logit contributions
aches+0.071
oppy+0.063
live+0.063
ail+0.063
zag+0.062
Neg logit contributions
 Can-0.032
 It-0.026
ted-0.026
elt-0.025
 Look-0.025
Token upon
Token ID2402
Feature activation-1.07e-02

Pos logit contributions
ggy+0.217
aches+0.206
live+0.205
bes+0.201
asis+0.199
Neg logit contributions
 Can-0.105
elt-0.095
 eyes-0.095
 hello-0.090
 louder-0.088
Token a
Token ID257
Feature activation-1.62e-02

Pos logit contributions
ail+0.026
live+0.026
zag+0.026
ggy+0.026
aches+0.025
Neg logit contributions
 Can-0.013
elt-0.012
 Please-0.011
 louder-0.011
 Look-0.010
Token time
Token ID640
Feature activation-3.12e-02

Pos logit contributions
aches+0.028
ggy+0.027
ail+0.027
zag+0.027
live+0.026
Neg logit contributions
 Can-0.029
 Please-0.027
elt-0.027
 louder-0.026
 Look-0.025
Token,
Token ID11
Feature activation-1.69e-02

Pos logit contributions
aches+0.069
live+0.068
ggy+0.066
ves+0.064
case+0.063
Neg logit contributions
 Can-0.045
elt-0.041
 hello-0.039
 Please-0.038
 Look-0.038
Token there
Token ID612
Feature activation-2.04e-02

Pos logit contributions
aches+0.037
live+0.036
ggy+0.036
ves+0.035
ail+0.035
Neg logit contributions
 Can-0.027
elt-0.024
 Please-0.023
 Look-0.023
 louder-0.022
Token had
Token ID550
Feature activation-1.52e-02

Pos logit contributions
ggy+0.017
live+0.016
ail+0.016
zag+0.016
aches+0.015
Neg logit contributions
 Can-0.020
 Please-0.018
 louder-0.017
Stop-0.017
ted-0.017
Token that
Token ID326
Feature activation-3.29e-02

Pos logit contributions
ggy+0.028
live+0.028
case+0.028
ail+0.028
aches+0.028
Neg logit contributions
 Can-0.025
elt-0.025
ted-0.023
 Please-0.023
ting-0.022
Token day
Token ID1110
Feature activation-3.59e-02

Pos logit contributions
ggy+0.049
live+0.048
aches+0.046
ves+0.045
zag+0.044
Neg logit contributions
 Can-0.073
elt-0.068
 Please-0.065
 louder-0.064
 Look-0.063
Token.
Token ID13
Feature activation-2.01e-02

Pos logit contributions
ggy+0.069
live+0.067
zag+0.065
aches+0.064
case+0.064
Neg logit contributions
 Can-0.064
elt-0.060
 Please-0.058
 louder-0.055
ly-0.054
Token<|endoftext|>
Token ID50256
Feature activation-2.43e-02

Pos logit contributions
aches+0.041
ggy+0.040
kins+0.038
zag+0.038
ves+0.038
Neg logit contributions
 Can-0.038
elt-0.034
 Look-0.032
 Please-0.032
 Come-0.031
TokenOnce
Token ID7454
Feature activation-9.00e-02

Pos logit contributions
aches+0.065
live+0.058
zag+0.057
ggy+0.057
oppy+0.056
Neg logit contributions
 Can-0.031
ted-0.025
elt-0.025
 Look-0.025
 It-0.024
Token upon
Token ID2402
Feature activation-7.82e-03

Pos logit contributions
ggy+0.217
aches+0.204
live+0.200
asis+0.197
bes+0.192
Neg logit contributions
 Can-0.112
elt-0.099
 eyes-0.098
ly-0.097
 hello-0.091
Token a
Token ID257
Feature activation-1.50e-02

Pos logit contributions
live+0.019
ail+0.019
ggy+0.019
zag+0.019
aches+0.018
Neg logit contributions
 Can-0.010
elt-0.009
 Please-0.008
 louder-0.008
 Look-0.008
Token time
Token ID640
Feature activation-2.97e-02

Pos logit contributions
aches+0.026
ggy+0.025
ail+0.025
zag+0.025
live+0.024
Neg logit contributions
 Can-0.027
 Please-0.025
elt-0.025
 louder-0.024
ly-0.024
Token,
Token ID11
Feature activation-1.34e-02

Pos logit contributions
live+0.063
aches+0.062
ggy+0.062
ves+0.058
zag+0.057
Neg logit contributions
 Can-0.047
elt-0.042
 Please-0.040
 hello-0.040
 Look-0.039
Token there
Token ID612
Feature activation-1.45e-02

Pos logit contributions
live+0.029
ggy+0.029
aches+0.029
ail+0.028
ves+0.028
Neg logit contributions
 Can-0.022
elt-0.018
 Please-0.018
 Look-0.018
 hello-0.017
Token of
Token ID286
Feature activation-1.49e-02

Pos logit contributions
 Can+0.010
elt+0.010
Stop+0.009
 Please+0.009
 ding+0.009
Neg logit contributions
ggy-0.009
live-0.009
ail-0.008
zag-0.008
case-0.008
Token their
Token ID511
Feature activation-3.63e-02

Pos logit contributions
zag+0.034
case+0.033
live+0.033
ggy+0.033
ail+0.032
Neg logit contributions
 Can-0.022
 louder-0.019
elt-0.019
 Please-0.018
ting-0.018
Token adventures
Token ID17545
Feature activation-3.97e-02

Pos logit contributions
ggy+0.066
live+0.066
case+0.064
zag+0.064
aches+0.061
Neg logit contributions
 Can-0.065
elt-0.062
 Please-0.058
 louder-0.057
Stop-0.055
Token.
Token ID13
Feature activation-2.25e-02

Pos logit contributions
ggy+0.066
live+0.064
zag+0.062
aches+0.061
ail+0.060
Neg logit contributions
 Can-0.080
elt-0.075
 Please-0.073
ly-0.070
 louder-0.070
Token<|endoftext|>
Token ID50256
Feature activation-2.22e-02

Pos logit contributions
aches+0.052
ggy+0.050
zag+0.048
kins+0.048
ves+0.046
Neg logit contributions
 Can-0.038
elt-0.033
 Look-0.031
 Please-0.031
 It-0.030
TokenOnce
Token ID7454
Feature activation-8.96e-02

Pos logit contributions
aches+0.058
live+0.052
ggy+0.052
zag+0.051
oppy+0.051
Neg logit contributions
 Can-0.030
elt-0.024
 Please-0.024
 Look-0.024
ted-0.023
Token upon
Token ID2402
Feature activation-8.73e-03

Pos logit contributions
ggy+0.221
live+0.208
aches+0.207
asis+0.202
bes+0.201
Neg logit contributions
 Can-0.104
 eyes-0.092
elt-0.091
 hello-0.089
ly-0.088
Token a
Token ID257
Feature activation-1.45e-02

Pos logit contributions
live+0.021
ggy+0.020
ail+0.020
zag+0.020
aches+0.020
Neg logit contributions
 Can-0.011
elt-0.010
 Please-0.010
 louder-0.009
 Look-0.009
Token time
Token ID640
Feature activation-3.06e-02

Pos logit contributions
aches+0.025
ggy+0.025
ail+0.025
zag+0.024
kins+0.024
Neg logit contributions
 Can-0.026
 Please-0.024
elt-0.024
 louder-0.023
ly-0.023
Token,
Token ID11
Feature activation-1.58e-02

Pos logit contributions
live+0.065
aches+0.064
ggy+0.064
ves+0.061
case+0.060
Neg logit contributions
 Can-0.048
elt-0.043
 Please-0.041
 hello-0.041
 Look-0.040
Token there
Token ID612
Feature activation-1.56e-02

Pos logit contributions
live+0.034
ggy+0.034
aches+0.033
ves+0.032
zag+0.032
Neg logit contributions
 Can-0.026
elt-0.022
 Please-0.022
 Look-0.022
 louder-0.021
Token!"
Token ID2474
Feature activation-9.37e-03

Pos logit contributions
ggy+0.012
live+0.011
zag+0.011
case+0.011
aches+0.010
Neg logit contributions
 Can-0.016
elt-0.015
Stop-0.015
 louder-0.014
ened-0.014
Token Lily
Token ID20037
Feature activation+1.29e-02

Pos logit contributions
ggy+0.017
live+0.016
ail+0.016
case+0.016
zag+0.016
Neg logit contributions
 Can-0.018
elt-0.016
 Please-0.016
 louder-0.015
Stop-0.015
Token exclaimed
Token ID33552
Feature activation-1.12e-02

Pos logit contributions
 Can+0.023
elt+0.021
 Please+0.021
 louder+0.020
ly+0.019
Neg logit contributions
ggy-0.025
live-0.024
aches-0.023
zag-0.023
case-0.023
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
ggy+0.017
live+0.016
zag+0.015
case+0.015
ail+0.015
Neg logit contributions
 Can-0.025
elt-0.024
 Please-0.023
Stop-0.022
 louder-0.022
Token<|endoftext|>
Token ID50256
Feature activation-1.89e-02

Pos logit contributions
ggy+0.018
live+0.017
zag+0.017
aches+0.017
case+0.017
Neg logit contributions
 Can-0.021
elt-0.020
 Please-0.019
 louder-0.018
 Look-0.018
TokenOnce
Token ID7454
Feature activation-8.93e-02

Pos logit contributions
aches+0.047
ggy+0.043
live+0.042
zag+0.041
ail+0.041
Neg logit contributions
 Can-0.028
elt-0.023
 Please-0.023
ted-0.023
 Look-0.023
Token upon
Token ID2402
Feature activation-2.57e-03

Pos logit contributions
ggy+0.219
live+0.203
aches+0.203
asis+0.199
bes+0.194
Neg logit contributions
 Can-0.114
 eyes-0.098
elt-0.097
ly-0.095
 louder-0.093
Token a
Token ID257
Feature activation-1.50e-02

Pos logit contributions
ggy+0.006
live+0.006
zag+0.005
ail+0.005
aches+0.005
Neg logit contributions
 Can-0.004
elt-0.003
 Please-0.003
 louder-0.003
 Look-0.003
Token time
Token ID640
Feature activation-3.23e-02

Pos logit contributions
ggy+0.025
aches+0.025
zag+0.024
ail+0.024
live+0.023
Neg logit contributions
 Can-0.028
 Please-0.026
elt-0.026
 louder-0.025
 Look-0.025
Token,
Token ID11
Feature activation-1.60e-02

Pos logit contributions
live+0.068
ggy+0.067
aches+0.066
ves+0.062
zag+0.062
Neg logit contributions
 Can-0.053
elt-0.047
 Please-0.046
 Look-0.045
 hello-0.044
Token there
Token ID612
Feature activation-1.64e-02

Pos logit contributions
live+0.034
ggy+0.034
aches+0.033
zag+0.032
ves+0.032
Neg logit contributions
 Can-0.028
 Please-0.023
 Look-0.023
elt-0.023
 louder-0.022
Token and
Token ID290
Feature activation-1.49e-02

Pos logit contributions
 Can+0.070
Stop+0.067
 Please+0.066
OU+0.065
 ding+0.065
Neg logit contributions
zag-0.054
ll-0.052
case-0.052
live-0.052
ggy-0.052
Token happy
Token ID3772
Feature activation+7.28e-03

Pos logit contributions
ggy+0.025
live+0.024
zag+0.023
case+0.022
aches+0.022
Neg logit contributions
 Can-0.031
elt-0.029
 Please-0.028
 louder-0.027
Stop-0.026
Token again
Token ID757
Feature activation-2.80e-02

Pos logit contributions
 Can+0.013
elt+0.012
 Please+0.012
Stop+0.011
 ding+0.011
Neg logit contributions
ggy-0.014
zag-0.013
live-0.013
case-0.012
ail-0.012
Token.
Token ID13
Feature activation-1.60e-02

Pos logit contributions
live+0.050
ggy+0.050
case+0.049
aches+0.048
zag+0.047
Neg logit contributions
 Can-0.053
elt-0.048
 Please-0.048
 louder-0.048
ting-0.047
Token<|endoftext|>
Token ID50256
Feature activation-2.75e-02

Pos logit contributions
aches+0.033
ggy+0.032
live+0.030
zag+0.030
ves+0.030
Neg logit contributions
 Can-0.030
elt-0.027
 Look-0.024
 Please-0.024
ted-0.023
TokenOnce
Token ID7454
Feature activation-8.93e-02

Pos logit contributions
aches+0.074
zag+0.065
live+0.064
ail+0.064
oppy+0.063
Neg logit contributions
 Can-0.034
 It-0.028
ted-0.027
 Please-0.027
 Look-0.027
Token upon
Token ID2402
Feature activation-1.07e-02

Pos logit contributions
ggy+0.224
live+0.210
aches+0.210
bes+0.202
asis+0.202
Neg logit contributions
 Can-0.100
 eyes-0.088
elt-0.087
ly-0.083
 hello-0.082
Token a
Token ID257
Feature activation-1.17e-02

Pos logit contributions
live+0.027
ail+0.026
zag+0.026
ggy+0.026
aches+0.026
Neg logit contributions
 Can-0.013
elt-0.011
 Please-0.011
 louder-0.010
 Look-0.010
Token time
Token ID640
Feature activation-2.94e-02

Pos logit contributions
aches+0.020
ail+0.020
ggy+0.020
zag+0.019
kins+0.019
Neg logit contributions
 Can-0.021
 Please-0.019
elt-0.019
 louder-0.018
ly-0.018
Token,
Token ID11
Feature activation-1.80e-02

Pos logit contributions
aches+0.064
live+0.063
ggy+0.062
ves+0.059
case+0.058
Neg logit contributions
 Can-0.045
elt-0.040
 Please-0.038
 hello-0.038
 Look-0.038
Token there
Token ID612
Feature activation-1.37e-02

Pos logit contributions
aches+0.041
live+0.041
ggy+0.040
ail+0.039
ves+0.039
Neg logit contributions
 Can-0.027
 Please-0.023
 Look-0.022
elt-0.022
 hello-0.021
Token best
Token ID1266
Feature activation-1.15e-02

Pos logit contributions
ggy+0.005
live+0.005
zag+0.005
case+0.004
aches+0.004
Neg logit contributions
 Can-0.016
elt-0.015
 Please-0.015
ened-0.014
ted-0.014
Token of
Token ID286
Feature activation-1.64e-02

Pos logit contributions
ggy+0.023
live+0.022
zag+0.021
case+0.021
ail+0.021
Neg logit contributions
 Can-0.020
elt-0.019
 Please-0.018
 louder-0.017
Stop-0.017
Token friends
Token ID2460
Feature activation-1.47e-02

Pos logit contributions
ggy+0.021
live+0.021
zag+0.021
case+0.020
ail+0.020
Neg logit contributions
 Can-0.039
elt-0.037
 Please-0.036
 louder-0.035
Stop-0.035
Token.
Token ID13
Feature activation-1.77e-02

Pos logit contributions
ggy+0.031
live+0.030
zag+0.029
case+0.028
ail+0.028
Neg logit contributions
 Can-0.024
elt-0.022
 Please-0.021
 louder-0.020
Stop-0.020
Token<|endoftext|>
Token ID50256
Feature activation-2.70e-02

Pos logit contributions
ggy+0.035
aches+0.034
zag+0.033
ves+0.032
live+0.032
Neg logit contributions
 Can-0.032
elt-0.030
 Look-0.027
 Please-0.026
ted-0.026
TokenOnce
Token ID7454
Feature activation-8.92e-02

Pos logit contributions
aches+0.071
zag+0.063
ail+0.063
oppy+0.063
ggy+0.062
Neg logit contributions
 Can-0.033
 It-0.028
elt-0.027
ted-0.027
Stop-0.027
Token upon
Token ID2402
Feature activation-1.10e-02

Pos logit contributions
ggy+0.220
live+0.206
aches+0.206
bes+0.204
asis+0.201
Neg logit contributions
 Can-0.099
elt-0.090
 eyes-0.090
 louder-0.085
 hello-0.084
Token a
Token ID257
Feature activation-1.32e-02

Pos logit contributions
live+0.026
ail+0.026
ggy+0.026
zag+0.026
aches+0.025
Neg logit contributions
 Can-0.014
elt-0.012
 Please-0.012
 louder-0.012
 Look-0.011
Token time
Token ID640
Feature activation-3.22e-02

Pos logit contributions
aches+0.022
ggy+0.022
zag+0.021
ail+0.021
live+0.021
Neg logit contributions
 Can-0.024
 Please-0.023
elt-0.023
 louder-0.022
ly-0.021
Token,
Token ID11
Feature activation-1.87e-02

Pos logit contributions
aches+0.072
live+0.071
ggy+0.069
ves+0.067
case+0.065
Neg logit contributions
 Can-0.047
elt-0.042
 Please-0.040
 hello-0.040
 Look-0.039
Token there
Token ID612
Feature activation-2.06e-02

Pos logit contributions
live+0.041
aches+0.041
ggy+0.041
ves+0.039
ail+0.039
Neg logit contributions
 Can-0.029
elt-0.025
 Please-0.025
 Look-0.024
 louder-0.024
Token pictures
Token ID5986
Feature activation-1.65e-02

Pos logit contributions
 Can+0.020
 Please+0.019
elt+0.019
Stop+0.017
 Look+0.017
Neg logit contributions
ggy-0.016
case-0.015
ves-0.015
kins-0.014
aches-0.014
Token together
Token ID1978
Feature activation-2.77e-02

Pos logit contributions
ggy+0.030
case+0.028
zag+0.028
live+0.028
ves+0.027
Neg logit contributions
 Can-0.032
elt-0.030
 Please-0.029
Stop-0.028
 Look-0.028
Token often
Token ID1690
Feature activation-1.72e-03

Pos logit contributions
ggy+0.045
live+0.043
zag+0.041
case+0.041
aches+0.040
Neg logit contributions
 Can-0.058
elt-0.055
 Please-0.053
 louder-0.050
Stop-0.050
Token.
Token ID13
Feature activation-2.20e-02

Pos logit contributions
ggy+0.003
zag+0.002
case+0.002
live+0.002
aches+0.002
Neg logit contributions
 Can-0.004
elt-0.004
 Please-0.003
Stop-0.003
 Look-0.003
Token<|endoftext|>
Token ID50256
Feature activation-2.94e-02

Pos logit contributions
aches+0.044
ggy+0.044
zag+0.042
kins+0.042
live+0.041
Neg logit contributions
 Can-0.040
elt-0.036
 Look-0.033
 Please-0.032
ted-0.032
TokenOnce
Token ID7454
Feature activation-8.92e-02

Pos logit contributions
aches+0.078
ail+0.069
zag+0.069
oppy+0.069
live+0.068
Neg logit contributions
 Can-0.036
 It-0.030
 Please-0.029
ted-0.029
elt-0.028
Token upon
Token ID2402
Feature activation-9.56e-03

Pos logit contributions
ggy+0.220
aches+0.206
live+0.205
asis+0.200
bes+0.198
Neg logit contributions
 Can-0.104
elt-0.094
 eyes-0.091
ly-0.089
 hello-0.086
Token a
Token ID257
Feature activation-1.50e-02

Pos logit contributions
live+0.023
ail+0.023
ggy+0.023
zag+0.023
aches+0.022
Neg logit contributions
 Can-0.012
elt-0.011
 Please-0.010
 louder-0.010
 Look-0.009
Token time
Token ID640
Feature activation-3.15e-02

Pos logit contributions
aches+0.025
ggy+0.025
ail+0.024
zag+0.024
kins+0.024
Neg logit contributions
 Can-0.027
elt-0.025
 Please-0.025
 louder-0.024
ly-0.024
Token,
Token ID11
Feature activation-1.71e-02

Pos logit contributions
aches+0.067
live+0.067
ggy+0.066
ves+0.063
zag+0.061
Neg logit contributions
 Can-0.048
elt-0.044
 Please-0.042
 hello-0.041
 Look-0.041
Token there
Token ID612
Feature activation-1.72e-02

Pos logit contributions
live+0.038
ggy+0.038
aches+0.037
ail+0.036
ves+0.036
Neg logit contributions
 Can-0.027
 Please-0.023
elt-0.022
 Look-0.022
 louder-0.021
Token best
Token ID1266
Feature activation-1.28e-02

Pos logit contributions
ggy+0.007
live+0.006
zag+0.006
case+0.006
aches+0.006
Neg logit contributions
 Can-0.016
elt-0.015
 Please-0.015
ted-0.014
 louder-0.014
Token of
Token ID286
Feature activation-1.54e-02

Pos logit contributions
ggy+0.027
live+0.026
aches+0.025
zag+0.025
case+0.025
Neg logit contributions
 Can-0.021
elt-0.019
 Please-0.019
 louder-0.018
ly-0.017
Token friends
Token ID2460
Feature activation-1.39e-02

Pos logit contributions
live+0.023
zag+0.022
case+0.022
ggy+0.022
ail+0.021
Neg logit contributions
 Can-0.034
elt-0.032
 Please-0.031
 louder-0.031
Stop-0.030
Token.
Token ID13
Feature activation-8.12e-03

Pos logit contributions
ggy+0.030
live+0.030
aches+0.029
zag+0.029
ail+0.028
Neg logit contributions
 Can-0.021
elt-0.019
 Please-0.019
 louder-0.018
ted-0.017
Token<|endoftext|>
Token ID50256
Feature activation-2.76e-02

Pos logit contributions
ggy+0.016
aches+0.016
live+0.015
zag+0.015
ves+0.015
Neg logit contributions
 Can-0.015
elt-0.013
 Please-0.012
 Look-0.012
ted-0.012
TokenOnce
Token ID7454
Feature activation-8.92e-02

Pos logit contributions
aches+0.074
oppy+0.066
ail+0.066
live+0.065
zag+0.065
Neg logit contributions
 Can-0.033
 It-0.027
elt-0.026
 Please-0.026
ted-0.026
Token upon
Token ID2402
Feature activation-9.17e-03

Pos logit contributions
ggy+0.218
aches+0.205
live+0.201
bes+0.196
asis+0.196
Neg logit contributions
 Can-0.108
elt-0.098
 eyes-0.095
ly-0.090
 louder-0.088
Token a
Token ID257
Feature activation-1.07e-02

Pos logit contributions
live+0.022
ail+0.022
ggy+0.022
zag+0.022
aches+0.021
Neg logit contributions
 Can-0.011
elt-0.010
 Please-0.010
 louder-0.009
 Look-0.009
Token time
Token ID640
Feature activation-2.93e-02

Pos logit contributions
aches+0.018
ggy+0.018
ail+0.018
zag+0.018
live+0.017
Neg logit contributions
 Can-0.019
elt-0.018
 Please-0.018
ly-0.017
 louder-0.017
Token,
Token ID11
Feature activation-1.41e-02

Pos logit contributions
aches+0.065
live+0.064
ggy+0.063
ves+0.060
case+0.060
Neg logit contributions
 Can-0.042
elt-0.038
 Please-0.036
 hello-0.035
 Look-0.035
Token there
Token ID612
Feature activation-1.86e-02

Pos logit contributions
live+0.031
ggy+0.031
aches+0.031
ail+0.030
zag+0.029
Neg logit contributions
 Can-0.023
 Please-0.019
elt-0.019
 Look-0.019
 louder-0.018
Token more
Token ID517
Feature activation-1.50e-02

Pos logit contributions
 Can+0.006
elt+0.006
 Please+0.005
 louder+0.005
Stop+0.005
Neg logit contributions
ggy-0.007
live-0.007
zag-0.007
case-0.007
ail-0.007
Token adventures
Token ID17545
Feature activation-3.08e-02

Pos logit contributions
ggy+0.025
live+0.025
zag+0.024
case+0.023
ail+0.023
Neg logit contributions
 Can-0.030
elt-0.029
 Please-0.028
 louder-0.027
Stop-0.026
Token together
Token ID1978
Feature activation-3.79e-02

Pos logit contributions
ggy+0.054
live+0.051
zag+0.050
case+0.050
aches+0.049
Neg logit contributions
 Can-0.061
elt-0.057
 Please-0.055
Stop-0.053
 Look-0.052
Token.
Token ID13
Feature activation-9.19e-03

Pos logit contributions
ggy+0.061
live+0.059
zag+0.056
ail+0.056
case+0.055
Neg logit contributions
 Can-0.079
elt-0.073
 Please-0.072
 louder-0.069
ly-0.068
Token<|endoftext|>
Token ID50256
Feature activation-2.51e-02

Pos logit contributions
ggy+0.018
aches+0.017
zag+0.016
live+0.016
ves+0.016
Neg logit contributions
 Can-0.018
elt-0.016
 Please-0.016
 Look-0.015
ted-0.015
TokenOnce
Token ID7454
Feature activation-8.91e-02

Pos logit contributions
aches+0.064
ggy+0.058
live+0.058
zag+0.058
ail+0.056
Neg logit contributions
 Can-0.033
ted-0.028
elt-0.028
 Please-0.027
 Look-0.027
Token upon
Token ID2402
Feature activation-4.90e-03

Pos logit contributions
ggy+0.217
live+0.201
aches+0.200
asis+0.199
bes+0.191
Neg logit contributions
 Can-0.110
 eyes-0.098
elt-0.097
ly-0.096
 louder-0.092
Token a
Token ID257
Feature activation-1.61e-02

Pos logit contributions
live+0.011
ggy+0.011
ail+0.011
zag+0.011
aches+0.011
Neg logit contributions
 Can-0.007
elt-0.006
 Please-0.006
 louder-0.005
 Look-0.005
Token time
Token ID640
Feature activation-2.92e-02

Pos logit contributions
ggy+0.027
aches+0.027
zag+0.026
ail+0.026
live+0.026
Neg logit contributions
 Can-0.030
 Please-0.028
elt-0.028
 louder-0.026
ly-0.026
Token,
Token ID11
Feature activation-1.43e-02

Pos logit contributions
live+0.062
ggy+0.062
aches+0.061
ves+0.057
zag+0.057
Neg logit contributions
 Can-0.046
elt-0.041
 Please-0.040
 hello-0.039
 Look-0.039
Token there
Token ID612
Feature activation-1.55e-02

Pos logit contributions
live+0.031
ggy+0.031
aches+0.030
ves+0.029
zag+0.029
Neg logit contributions
 Can-0.023
elt-0.020
 Please-0.020
 louder-0.019
 Look-0.019
Token someone
Token ID2130
Feature activation-3.69e-02

Pos logit contributions
ggy+0.012
aches+0.011
live+0.011
ail+0.011
ves+0.011
Neg logit contributions
 Can-0.011
elt-0.011
 Please-0.010
 ding-0.009
ted-0.009
Token in
Token ID287
Feature activation-1.36e-02

Pos logit contributions
live+0.072
ggy+0.071
zag+0.068
case+0.066
ves+0.065
Neg logit contributions
 Can-0.064
ly-0.059
elt-0.058
 Please-0.058
 louder-0.057
Token need
Token ID761
Feature activation-1.37e-02

Pos logit contributions
ggy+0.025
live+0.024
zag+0.023
case+0.023
aches+0.023
Neg logit contributions
 Can-0.025
elt-0.024
 Please-0.023
 louder-0.022
Stop-0.022
Token.
Token ID13
Feature activation-1.96e-02

Pos logit contributions
ggy+0.021
live+0.020
zag+0.019
case+0.019
aches+0.019
Neg logit contributions
 Can-0.030
elt-0.028
 Please-0.027
 louder-0.026
ly-0.026
Token<|endoftext|>
Token ID50256
Feature activation-2.63e-02

Pos logit contributions
ggy+0.042
aches+0.041
zag+0.039
ves+0.039
live+0.039
Neg logit contributions
 Can-0.034
elt-0.032
 Please-0.029
 Look-0.029
 Come-0.027
TokenOnce
Token ID7454
Feature activation-8.91e-02

Pos logit contributions
aches+0.070
ggy+0.064
zag+0.062
live+0.062
ail+0.060
Neg logit contributions
 Can-0.035
elt-0.028
ted-0.028
 Please-0.027
 It-0.027
Token upon
Token ID2402
Feature activation-6.99e-03

Pos logit contributions
ggy+0.220
live+0.205
aches+0.203
asis+0.198
bes+0.196
Neg logit contributions
 Can-0.111
elt-0.097
 eyes-0.093
ly-0.092
 hello-0.089
Token a
Token ID257
Feature activation-1.76e-02

Pos logit contributions
live+0.017
ail+0.017
ggy+0.017
zag+0.017
aches+0.016
Neg logit contributions
 Can-0.009
elt-0.008
 Please-0.008
 louder-0.007
 Look-0.007
Token time
Token ID640
Feature activation-3.33e-02

Pos logit contributions
aches+0.030
ggy+0.030
ail+0.029
zag+0.029
kins+0.028
Neg logit contributions
 Can-0.032
 Please-0.030
elt-0.029
ly-0.028
 louder-0.028
Token,
Token ID11
Feature activation-1.73e-02

Pos logit contributions
ggy+0.069
live+0.068
aches+0.067
ves+0.064
case+0.063
Neg logit contributions
 Can-0.055
elt-0.050
 Please-0.048
 hello-0.046
 Look-0.046
Token there
Token ID612
Feature activation-1.84e-02

Pos logit contributions
live+0.037
ggy+0.037
aches+0.037
ves+0.036
ail+0.036
Neg logit contributions
 Can-0.028
 Please-0.024
elt-0.024
 Look-0.023
 louder-0.023
Token all
Token ID477
Feature activation-2.00e-02

Pos logit contributions
live+0.093
ggy+0.092
ail+0.089
lly+0.085
aches+0.085
Neg logit contributions
 Can-0.080
 louder-0.074
 Please-0.072
ly-0.069
ting-0.069
Token day
Token ID1110
Feature activation-5.01e-02

Pos logit contributions
ggy+0.031
live+0.029
ail+0.028
case+0.028
aches+0.028
Neg logit contributions
 Can-0.044
elt-0.039
 louder-0.039
 Please-0.039
ly-0.039
Token long
Token ID890
Feature activation-1.26e-02

Pos logit contributions
live+0.108
aches+0.102
ail+0.098
ggy+0.098
lly+0.097
Neg logit contributions
 Can-0.074
 Please-0.073
ly-0.070
 louder-0.068
ting-0.064
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
ggy+0.020
live+0.019
zag+0.018
case+0.018
aches+0.018
Neg logit contributions
 Can-0.027
elt-0.025
 Please-0.025
 louder-0.024
Stop-0.023
Token<|endoftext|>
Token ID50256
Feature activation-3.02e-02

Pos logit contributions
aches+0.039
ggy+0.038
kins+0.037
kin+0.036
zag+0.036
Neg logit contributions
 Can-0.016
elt-0.014
ted-0.013
 It-0.013
 Look-0.012
TokenOnce
Token ID7454
Feature activation-8.91e-02

Pos logit contributions
aches+0.082
ail+0.074
oppy+0.073
live+0.072
zag+0.072
Neg logit contributions
 Can-0.035
 It-0.029
 Please-0.027
Stop-0.027
elt-0.027
Token upon
Token ID2402
Feature activation-7.18e-03

Pos logit contributions
ggy+0.215
bes+0.204
aches+0.203
live+0.201
asis+0.197
Neg logit contributions
 Can-0.103
elt-0.091
 hello-0.091
 eyes-0.089
 louder-0.083
Token a
Token ID257
Feature activation-1.34e-02

Pos logit contributions
ail+0.017
live+0.017
ggy+0.017
zag+0.017
aches+0.017
Neg logit contributions
 Can-0.009
elt-0.008
 Please-0.008
 louder-0.007
Stop-0.007
Token time
Token ID640
Feature activation-3.12e-02

Pos logit contributions
aches+0.025
ail+0.024
zag+0.023
kins+0.023
ggy+0.023
Neg logit contributions
 Can-0.022
 Please-0.020
elt-0.020
aw-0.019
 louder-0.019
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
aches+0.070
live+0.069
ggy+0.067
ves+0.066
case+0.065
Neg logit contributions
 Can-0.043
elt-0.040
 Please-0.037
 hello-0.037
 Look-0.036
Token there
Token ID612
Feature activation-1.88e-02

Pos logit contributions
aches+0.037
live+0.036
ggy+0.036
ves+0.035
ail+0.035
Neg logit contributions
 Can-0.025
elt-0.021
 Please-0.021
 Look-0.020
 louder-0.020
Token together
Token ID1978
Feature activation-5.78e-02

Pos logit contributions
ggy+0.114
live+0.110
aches+0.105
ail+0.104
ves+0.102
Neg logit contributions
 Can-0.088
 louder-0.082
elt-0.081
 Please-0.077
ting-0.077
Token every
Token ID790
Feature activation-3.45e-02

Pos logit contributions
ggy+0.119
live+0.116
ail+0.114
zag+0.111
aches+0.110
Neg logit contributions
 Can-0.089
 louder-0.081
ting-0.080
 Please-0.079
ted-0.079
Token day
Token ID1110
Feature activation-4.75e-02

Pos logit contributions
ggy+0.053
live+0.051
aches+0.050
case+0.049
zag+0.048
Neg logit contributions
 Can-0.076
elt-0.069
 Please-0.068
 louder-0.068
 Look-0.066
Token.
Token ID13
Feature activation-2.17e-02

Pos logit contributions
live+0.085
ggy+0.085
aches+0.082
zag+0.081
ail+0.078
Neg logit contributions
 Can-0.088
elt-0.083
 Please-0.082
ly-0.079
ting-0.079
Token<|endoftext|>
Token ID50256
Feature activation-3.01e-02

Pos logit contributions
aches+0.045
ggy+0.045
zag+0.044
kins+0.042
live+0.041
Neg logit contributions
 Can-0.037
elt-0.033
 Look-0.030
ted-0.030
 Please-0.030
TokenOnce
Token ID7454
Feature activation-8.90e-02

Pos logit contributions
aches+0.081
ail+0.073
live+0.072
zag+0.072
oppy+0.072
Neg logit contributions
 Can-0.035
 It-0.030
 Come-0.026
 Please-0.026
 Look-0.026
Token upon
Token ID2402
Feature activation-9.27e-03

Pos logit contributions
ggy+0.216
live+0.204
aches+0.202
bes+0.199
asis+0.195
Neg logit contributions
 Can-0.103
 eyes-0.091
elt-0.090
 hello-0.089
 louder-0.089
Token a
Token ID257
Feature activation-1.33e-02

Pos logit contributions
live+0.022
ail+0.022
zag+0.022
ggy+0.022
aches+0.022
Neg logit contributions
 Can-0.011
elt-0.010
 Please-0.010
 louder-0.009
 Look-0.009
Token time
Token ID640
Feature activation-3.30e-02

Pos logit contributions
aches+0.023
ggy+0.022
ail+0.022
zag+0.022
live+0.021
Neg logit contributions
 Can-0.024
 Please-0.022
elt-0.022
 louder-0.021
ly-0.021
Token,
Token ID11
Feature activation-1.74e-02

Pos logit contributions
aches+0.072
live+0.072
ggy+0.071
ves+0.068
case+0.066
Neg logit contributions
 Can-0.049
elt-0.044
 hello-0.041
 Please-0.041
 Look-0.040
Token in
Token ID287
Feature activation+7.88e-03

Pos logit contributions
live+0.038
aches+0.038
ggy+0.037
ves+0.036
ail+0.036
Neg logit contributions
 Can-0.028
 Please-0.024
elt-0.023
 Look-0.023
 louder-0.023
Token his
Token ID465
Feature activation-3.66e-02

Pos logit contributions
ggy+0.025
live+0.025
zag+0.024
case+0.024
ail+0.024
Neg logit contributions
 Can-0.018
elt-0.016
 Please-0.016
 louder-0.016
 ding-0.015
Token mom
Token ID1995
Feature activation-1.88e-02

Pos logit contributions
live+0.083
zag+0.078
ggy+0.076
aches+0.076
ves+0.075
Neg logit contributions
 Can-0.055
 eyes-0.051
 Please-0.049
 louder-0.049
 ding-0.045
Tokenmy
Token ID1820
Feature activation-4.00e-02

Pos logit contributions
ggy+0.044
live+0.043
ves+0.041
zag+0.040
case+0.040
Neg logit contributions
 Can-0.027
elt-0.025
 Please-0.023
 louder-0.023
ly-0.022
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
live+0.076
ggy+0.075
ves+0.068
zag+0.067
case+0.067
Neg logit contributions
 Can-0.074
elt-0.070
 louder-0.066
 Please-0.066
ly-0.065
Token<|endoftext|>
Token ID50256
Feature activation-2.59e-02

Pos logit contributions
ggy+0.030
aches+0.029
zag+0.028
live+0.028
ves+0.027
Neg logit contributions
 Can-0.026
elt-0.024
 Please-0.022
 Look-0.022
 Come-0.021
TokenOnce
Token ID7454
Feature activation-8.89e-02

Pos logit contributions
aches+0.069
ggy+0.062
zag+0.061
live+0.061
oppy+0.061
Neg logit contributions
 Can-0.033
elt-0.027
ted-0.026
 Please-0.026
 It-0.026
Token upon
Token ID2402
Feature activation-7.70e-03

Pos logit contributions
ggy+0.223
live+0.207
aches+0.207
bes+0.201
asis+0.199
Neg logit contributions
 Can-0.106
elt-0.094
 eyes-0.090
ly-0.087
 louder-0.085
Token a
Token ID257
Feature activation-1.22e-02

Pos logit contributions
live+0.018
ggy+0.018
ail+0.018
zag+0.018
aches+0.018
Neg logit contributions
 Can-0.010
elt-0.009
 Please-0.009
 louder-0.008
 Look-0.008
Token time
Token ID640
Feature activation-2.94e-02

Pos logit contributions
ggy+0.020
aches+0.020
zag+0.020
ail+0.020
live+0.019
Neg logit contributions
 Can-0.023
elt-0.021
 Please-0.021
 louder-0.020
ly-0.020
Token,
Token ID11
Feature activation-1.70e-02

Pos logit contributions
aches+0.064
live+0.064
ggy+0.063
ves+0.060
case+0.060
Neg logit contributions
 Can-0.043
elt-0.040
 Please-0.037
 hello-0.037
 Look-0.036
Token there
Token ID612
Feature activation-1.67e-02

Pos logit contributions
ggy+0.037
live+0.037
aches+0.036
ves+0.035
ail+0.035
Neg logit contributions
 Can-0.027
elt-0.024
 Please-0.023
 Look-0.022
 louder-0.022
Token story
Token ID1621
Feature activation-3.44e-03

Pos logit contributions
elt+0.052
 Can+0.051
ted+0.049
ting+0.049
Stop+0.047
Neg logit contributions
ggy-0.025
case-0.021
zag-0.020
live-0.020
aches-0.019
Token,
Token ID11
Feature activation+1.71e-02

Pos logit contributions
ggy+0.006
live+0.006
zag+0.005
case+0.005
aches+0.005
Neg logit contributions
 Can-0.007
elt-0.006
 Please-0.006
 louder-0.006
Stop-0.006
Token Lily
Token ID20037
Feature activation-9.80e-03

Pos logit contributions
elt+0.037
ted+0.032
ting+0.032
 Can+0.031
ened+0.031
Neg logit contributions
ggy-0.027
case-0.026
live-0.024
aches-0.024
ves-0.024
Token."
Token ID526
Feature activation-9.79e-03

Pos logit contributions
ggy+0.018
live+0.017
zag+0.016
case+0.016
aches+0.016
Neg logit contributions
 Can-0.019
 Please-0.017
elt-0.017
 louder-0.016
ly-0.016
Token<|endoftext|>
Token ID50256
Feature activation-1.83e-02

Pos logit contributions
ggy+0.019
live+0.018
aches+0.018
zag+0.018
case+0.017
Neg logit contributions
 Can-0.018
elt-0.016
 Please-0.016
 Look-0.015
 louder-0.015
TokenOnce
Token ID7454
Feature activation-8.88e-02

Pos logit contributions
aches+0.045
zag+0.041
ggy+0.041
live+0.041
ail+0.040
Neg logit contributions
 Can-0.026
elt-0.022
 Please-0.021
ted-0.021
 Look-0.021
Token upon
Token ID2402
Feature activation-4.17e-04

Pos logit contributions
ggy+0.217
aches+0.202
live+0.200
asis+0.196
bes+0.191
Neg logit contributions
 Can-0.116
elt-0.102
 eyes-0.098
ly-0.098
 Please-0.092
Token a
Token ID257
Feature activation-9.25e-03

Pos logit contributions
ggy+0.001
live+0.001
zag+0.001
ail+0.001
aches+0.001
Neg logit contributions
 Can-0.001
elt-0.001
 Please-0.001
 louder-0.000
Stop-0.000
Token time
Token ID640
Feature activation-2.80e-02

Pos logit contributions
ggy+0.015
aches+0.015
zag+0.014
ail+0.014
live+0.014
Neg logit contributions
 Can-0.018
elt-0.017
 Please-0.017
 louder-0.016
ly-0.016
Token,
Token ID11
Feature activation-1.47e-02

Pos logit contributions
ggy+0.057
live+0.056
aches+0.056
zag+0.053
case+0.053
Neg logit contributions
 Can-0.047
elt-0.042
 Please-0.040
 Look-0.039
 hello-0.039
Token there
Token ID612
Feature activation-1.42e-02

Pos logit contributions
ggy+0.030
live+0.029
aches+0.029
zag+0.028
ail+0.028
Neg logit contributions
 Can-0.026
 Please-0.023
elt-0.022
 Look-0.022
 louder-0.021
Token in
Token ID287
Feature activation-2.19e-02

Pos logit contributions
ggy+0.057
live+0.056
zag+0.053
ail+0.053
aches+0.053
Neg logit contributions
 Can-0.055
elt-0.050
 Please-0.049
 louder-0.047
ting-0.046
Token the
Token ID262
Feature activation-7.89e-03

Pos logit contributions
ggy+0.048
live+0.047
zag+0.045
case+0.045
ail+0.045
Neg logit contributions
 Can-0.035
elt-0.030
 Please-0.030
 louder-0.029
Stop-0.028
Token forest
Token ID8222
Feature activation-3.46e-02

Pos logit contributions
ggy+0.012
live+0.012
zag+0.011
case+0.011
ail+0.011
Neg logit contributions
 Can-0.017
elt-0.016
 Please-0.016
 louder-0.015
ted-0.015
Token.
Token ID13
Feature activation-1.81e-02

Pos logit contributions
ggy+0.057
live+0.055
zag+0.052
aches+0.052
ail+0.052
Neg logit contributions
 Can-0.070
elt-0.066
 Please-0.064
 louder-0.062
ly-0.060
Token<|endoftext|>
Token ID50256
Feature activation-2.59e-02

Pos logit contributions
ggy+0.035
aches+0.035
live+0.032
zag+0.032
case+0.031
Neg logit contributions
 Can-0.035
elt-0.032
 Please-0.029
 Look-0.029
ted-0.029
TokenOnce
Token ID7454
Feature activation-8.88e-02

Pos logit contributions
aches+0.072
zag+0.062
live+0.062
ggy+0.061
ail+0.061
Neg logit contributions
 Can-0.031
 It-0.025
ted-0.024
 Look-0.024
elt-0.024
Token upon
Token ID2402
Feature activation-9.84e-03

Pos logit contributions
ggy+0.222
aches+0.212
live+0.208
asis+0.201
bes+0.200
Neg logit contributions
 Can-0.100
 eyes-0.090
elt-0.089
 hello-0.082
 louder-0.081
Token a
Token ID257
Feature activation-1.11e-02

Pos logit contributions
live+0.024
ail+0.024
zag+0.023
ggy+0.023
aches+0.023
Neg logit contributions
 Can-0.012
elt-0.011
 Please-0.010
 louder-0.010
 Look-0.009
Token time
Token ID640
Feature activation-2.93e-02

Pos logit contributions
aches+0.020
ail+0.019
ggy+0.019
zag+0.019
kins+0.018
Neg logit contributions
 Can-0.019
 Please-0.018
elt-0.018
 louder-0.017
 Look-0.017
Token,
Token ID11
Feature activation-1.56e-02

Pos logit contributions
aches+0.065
live+0.064
ggy+0.063
ves+0.060
case+0.059
Neg logit contributions
 Can-0.043
elt-0.038
 hello-0.036
 Please-0.036
 Look-0.036
Token there
Token ID612
Feature activation-1.76e-02

Pos logit contributions
live+0.034
aches+0.034
ggy+0.033
ves+0.032
zag+0.032
Neg logit contributions
 Can-0.025
 Please-0.021
elt-0.021
 Look-0.021
 louder-0.020
Token with
Token ID351
Feature activation-2.44e-02

Pos logit contributions
ggy+0.059
live+0.056
zag+0.055
case+0.054
aches+0.054
Neg logit contributions
 Can-0.060
elt-0.056
 Please-0.054
 louder-0.051
Stop-0.051
Token his
Token ID465
Feature activation-4.21e-02

Pos logit contributions
ggy+0.054
live+0.053
zag+0.053
case+0.053
ail+0.051
Neg logit contributions
 Can-0.035
elt-0.032
 louder-0.031
 Please-0.030
ted-0.030
Token friends
Token ID2460
Feature activation-2.91e-02

Pos logit contributions
live+0.076
case+0.074
ggy+0.071
aches+0.069
ves+0.068
Neg logit contributions
 Can-0.073
 eyes-0.073
 hello-0.069
 louder-0.066
elt-0.065
Token.
Token ID13
Feature activation-1.82e-02

Pos logit contributions
ggy+0.054
live+0.052
aches+0.050
ves+0.049
zag+0.049
Neg logit contributions
 Can-0.055
elt-0.050
 Please-0.049
ting-0.048
 louder-0.047
Token<|endoftext|>
Token ID50256
Feature activation-2.52e-02

Pos logit contributions
ggy+0.037
aches+0.036
zag+0.035
live+0.034
ves+0.034
Neg logit contributions
 Can-0.032
elt-0.030
 Please-0.028
 Look-0.027
ted-0.027
TokenOnce
Token ID7454
Feature activation-8.88e-02

Pos logit contributions
aches+0.065
ggy+0.058
zag+0.058
live+0.057
ail+0.057
Neg logit contributions
 Can-0.033
 It-0.027
elt-0.027
ted-0.027
 Come-0.027
Token upon
Token ID2402
Feature activation-6.49e-03

Pos logit contributions
ggy+0.217
live+0.202
aches+0.201
bes+0.199
asis+0.195
Neg logit contributions
 Can-0.104
 hello-0.091
elt-0.090
 eyes-0.089
ly-0.084
Token a
Token ID257
Feature activation-1.56e-02

Pos logit contributions
live+0.015
ggy+0.015
ail+0.015
zag+0.015
aches+0.015
Neg logit contributions
 Can-0.008
elt-0.008
 Please-0.007
 louder-0.007
 Look-0.007
Token time
Token ID640
Feature activation-3.37e-02

Pos logit contributions
aches+0.027
ail+0.027
ggy+0.027
zag+0.026
kins+0.026
Neg logit contributions
 Can-0.027
 Please-0.025
elt-0.025
 louder-0.024
ly-0.024
Token,
Token ID11
Feature activation-1.66e-02

Pos logit contributions
aches+0.073
live+0.072
ggy+0.071
ves+0.068
case+0.067
Neg logit contributions
 Can-0.051
elt-0.046
 hello-0.043
 Please-0.043
 Look-0.042
Token there
Token ID612
Feature activation-1.71e-02

Pos logit contributions
live+0.035
aches+0.035
ggy+0.035
ail+0.034
ves+0.034
Neg logit contributions
 Can-0.027
elt-0.024
 Please-0.023
 Look-0.022
 louder-0.022
Token Lily
Token ID20037
Feature activation+2.51e-03

Pos logit contributions
elt+0.028
 Can+0.026
ened+0.025
ted+0.025
Stop+0.024
Neg logit contributions
ggy-0.025
case-0.022
aches-0.022
ail-0.022
live-0.021
Token,"
Token ID553
Feature activation+1.15e-03

Pos logit contributions
 Can+0.005
 Please+0.004
elt+0.004
ly+0.004
 louder+0.004
Neg logit contributions
ggy-0.005
live-0.005
aches-0.004
zag-0.004
case-0.004
Token Tom
Token ID4186
Feature activation+1.87e-02

Pos logit contributions
 Can+0.002
elt+0.002
 Please+0.002
ting+0.002
 louder+0.002
Neg logit contributions
ggy-0.002
live-0.002
aches-0.002
ail-0.002
lly-0.002
Token said
Token ID531
Feature activation-5.73e-03

Pos logit contributions
 Can+0.034
elt+0.032
 Please+0.031
 louder+0.030
ly+0.029
Neg logit contributions
ggy-0.035
live-0.034
zag-0.033
case-0.032
aches-0.032
Token.
Token ID13
Feature activation+2.04e-02

Pos logit contributions
ggy+0.009
live+0.009
zag+0.008
case+0.008
aches+0.008
Neg logit contributions
 Can-0.012
elt-0.012
 Please-0.011
 louder-0.011
ly-0.011
Token "
Token ID366
Feature activation+6.09e-02

Pos logit contributions
 Can+0.032
elt+0.031
ted+0.028
 Please+0.028
Stop+0.028
Neg logit contributions
ggy-0.042
case-0.040
zag-0.040
live-0.039
ail-0.039
TokenWe
Token ID1135
Feature activation-1.75e-02

Pos logit contributions
 Can+0.107
ened+0.105
 eyes+0.103
ted+0.099
elt+0.097
Neg logit contributions
ggy-0.112
ves-0.102
live-0.101
case-0.101
aches-0.100
Token can
Token ID460
Feature activation-3.72e-02

Pos logit contributions
ggy+0.041
live+0.040
zag+0.039
ail+0.038
ves+0.038
Neg logit contributions
 Can-0.025
 Please-0.022
elt-0.022
Stop-0.020
 Look-0.020
Token have
Token ID423
Feature activation+1.35e-02

Pos logit contributions
ggy+0.073
live+0.072
ves+0.069
zag+0.068
kins+0.068
Neg logit contributions
 Can-0.061
 Please-0.057
elt-0.053
 Look-0.052
 hello-0.052
Token some
Token ID617
Feature activation-6.52e-03

Pos logit contributions
 Can+0.022
elt+0.021
 Please+0.020
 louder+0.019
Stop+0.019
Neg logit contributions
ggy-0.028
live-0.026
aches-0.026
case-0.025
zag-0.025
Token hot
Token ID3024
Feature activation+3.03e-02

Pos logit contributions
ggy+0.012
aches+0.011
live+0.011
case+0.011
ves+0.011
Neg logit contributions
 Can-0.012
elt-0.011
 Please-0.011
ened-0.010
ting-0.010
Token the
Token ID262
Feature activation+9.26e-03

Pos logit contributions
 Can+0.001
elt+0.001
 Please+0.001
ly+0.001
 ding+0.001
Neg logit contributions
ggy-0.001
live-0.001
aches-0.001
lly-0.001
ves-0.001
Token sw
Token ID1509
Feature activation+3.93e-02

Pos logit contributions
 Can+0.015
elt+0.014
 Please+0.014
 louder+0.013
Stop+0.013
Neg logit contributions
ggy-0.020
ail-0.018
live-0.018
lly-0.018
zag-0.018
Tokenan
Token ID272
Feature activation+5.53e-03

Pos logit contributions
 Can+0.082
elt+0.075
 Please+0.074
 louder+0.070
Stop+0.070
Neg logit contributions
ggy-0.067
live-0.063
zag-0.062
ail-0.062
aches-0.061
Token replied
Token ID8712
Feature activation+1.31e-03

Pos logit contributions
 Can+0.010
elt+0.010
 Please+0.009
 louder+0.009
ly+0.009
Neg logit contributions
ggy-0.010
live-0.010
zag-0.009
case-0.009
aches-0.009
Token.
Token ID13
Feature activation+1.70e-02

Pos logit contributions
 Can+0.003
elt+0.003
 Please+0.003
Stop+0.003
 Look+0.003
Neg logit contributions
ggy-0.002
live-0.002
zag-0.002
ail-0.002
case-0.002
Token "
Token ID366
Feature activation+5.58e-02

Pos logit contributions
 Can+0.028
elt+0.027
ly+0.026
Stop+0.026
ened+0.026
Neg logit contributions
ggy-0.032
live-0.030
zag-0.030
ail-0.030
case-0.030
TokenIf
Token ID1532
Feature activation-9.75e-03

Pos logit contributions
 eyes+0.095
 Can+0.094
ened+0.094
ly+0.094
 hello+0.089
Neg logit contributions
ggy-0.104
live-0.100
zag-0.096
case-0.096
ves-0.096
Token you
Token ID345
Feature activation-2.58e-02

Pos logit contributions
ggy+0.021
aches+0.019
ves+0.019
live+0.019
lly+0.019
Neg logit contributions
 Can-0.015
elt-0.015
 Please-0.013
 ding-0.013
ksh-0.013
Token use
Token ID779
Feature activation+1.40e-02

Pos logit contributions
ggy+0.048
live+0.047
zag+0.044
ail+0.044
case+0.044
Neg logit contributions
 Can-0.047
elt-0.044
 Please-0.043
 louder-0.041
Stop-0.040
Token too
Token ID1165
Feature activation-3.56e-02

Pos logit contributions
elt+0.025
 Can+0.025
 Please+0.023
ksh+0.022
ly+0.022
Neg logit contributions
ggy-0.026
aches-0.025
ail-0.024
ves-0.024
zag-0.023
Token much
Token ID881
Feature activation+1.07e-02

Pos logit contributions
ggy+0.058
live+0.056
zag+0.053
ail+0.052
aches+0.051
Neg logit contributions
 Can-0.075
elt-0.069
 Please-0.069
 louder-0.066
Stop-0.064
Token and
Token ID290
Feature activation+1.34e-02

Pos logit contributions
ggy+0.009
case+0.009
zag+0.009
ail+0.008
aches+0.008
Neg logit contributions
 Can-0.008
Stop-0.007
 Please-0.007
elt-0.007
 Look-0.007
Token said
Token ID531
Feature activation+6.11e-03

Pos logit contributions
 Can+0.027
elt+0.025
 Please+0.025
Stop+0.024
 louder+0.023
Neg logit contributions
ggy-0.023
live-0.021
ail-0.020
aches-0.020
zag-0.020
Token �
Token ID6184
Feature activation+4.72e-02

Pos logit contributions
 Can+0.013
elt+0.012
 Please+0.011
Stop+0.011
ksh+0.011
Neg logit contributions
ggy-0.010
aches-0.009
ves-0.009
ail-0.009
zag-0.009
Token�
Token ID95
Feature activation+5.67e-02

Pos logit contributions
 Can+0.089
elt+0.081
 Please+0.080
ly+0.075
 louder+0.074
Neg logit contributions
ggy-0.086
live-0.084
zag-0.081
aches-0.080
ail-0.080
Token€
Token ID26391
Feature activation-2.02e-02

Pos logit contributions
 Can+0.096
elt+0.088
 Please+0.086
 louder+0.079
 Look+0.079
Neg logit contributions
ggy-0.114
live-0.110
aches-0.107
zag-0.107
ail-0.106
Token�
Token ID129
Feature activation+6.71e-02

Pos logit contributions
live+0.036
aches+0.035
ggy+0.035
zag+0.033
ail+0.033
Neg logit contributions
 Can-0.038
 Please-0.036
elt-0.035
 Look-0.033
 ding-0.032
Token�
Token ID241
Feature activation+4.19e-02

Pos logit contributions
 Can+0.123
elt+0.113
 Please+0.112
 louder+0.106
ly+0.104
Neg logit contributions
ggy-0.123
live-0.120
zag-0.115
ail-0.115
aches-0.114
Tokenoutside
Token ID43435
Feature activation+1.69e-03

Pos logit contributions
 Can+0.065
elt+0.062
 louder+0.061
ened+0.059
ted+0.059
Neg logit contributions
ggy-0.087
live-0.080
zag-0.077
case-0.077
lly-0.077
Tokenâ
Token ID22940
Feature activation+4.92e-02

Pos logit contributions
 Can+0.003
elt+0.003
 Please+0.003
 Look+0.003
ly+0.003
Neg logit contributions
ggy-0.003
live-0.003
case-0.003
zag-0.003
aches-0.003
Token€
Token ID26391
Feature activation-1.89e-02

Pos logit contributions
 Can+0.100
elt+0.095
 Please+0.090
 louder+0.089
Stop+0.086
Neg logit contributions
ggy-0.083
live-0.079
zag-0.076
case-0.075
aches-0.074
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
live+0.034
ggy+0.033
aches+0.033
zag+0.032
ail+0.031
Neg logit contributions
 Can-0.036
 Please-0.034
elt-0.033
 Look-0.031
 ding-0.029
Token."
Token ID526
Feature activation+1.17e-03

Pos logit contributions
ggy+0.038
live+0.036
zag+0.034
aches+0.034
ail+0.034
Neg logit contributions
 Can-0.044
elt-0.041
 Please-0.040
 louder-0.039
ly-0.038
Token
Token ID198
Feature activation-3.39e-02

Pos logit contributions
 Can+0.003
elt+0.003
ened+0.003
ting+0.003
ly+0.003
Neg logit contributions
ggy-0.001
live-0.001
case-0.001
ail-0.001
zag-0.001
Token
Token ID198
Feature activation-2.28e-02

Pos logit contributions
live+0.068
aches+0.067
zag+0.066
case+0.064
ggy+0.062
Neg logit contributions
 Can-0.057
 Please-0.053
elt-0.050
 louder-0.049
Stop-0.047
TokenHis
Token ID6653
Feature activation-1.23e-02

Pos logit contributions
live+0.047
zag+0.046
aches+0.046
case+0.046
ggy+0.046
Neg logit contributions
 Can-0.035
 Please-0.033
elt-0.031
 louder-0.031
Stop-0.029
Token dad
Token ID9955
Feature activation+2.40e-02

Pos logit contributions
ggy+0.025
live+0.024
zag+0.024
case+0.023
aches+0.023
Neg logit contributions
 Can-0.021
elt-0.019
 Please-0.018
 louder-0.018
Stop-0.017
Token hugged
Token ID41130
Feature activation+5.68e-02

Pos logit contributions
 Can+0.051
elt+0.045
 Please+0.044
ksh+0.043
 Look+0.042
Neg logit contributions
ggy-0.043
ail-0.037
lly-0.036
ves-0.036
live-0.035
Token him
Token ID683
Feature activation+1.12e-02

Pos logit contributions
elt+0.108
 Can+0.102
 Please+0.091
ksh+0.090
 ding+0.088
Neg logit contributions
ggy-0.096
ail-0.094
oppy-0.092
live-0.086
ves-0.086
Token and
Token ID290
Feature activation+2.60e-02

Pos logit contributions
 Can+0.019
elt+0.018
 Please+0.016
Stop+0.016
 Look+0.015
Neg logit contributions
ggy-0.023
ail-0.022
zag-0.021
case-0.021
live-0.021
Token said
Token ID531
Feature activation+8.05e-03

Pos logit contributions
 Can+0.061
elt+0.056
 Please+0.055
ring+0.053
Stop+0.053
Neg logit contributions
ggy-0.037
lly-0.033
live-0.032
aches-0.032
ail-0.031
Token,
Token ID11
Feature activation+2.21e-02

Pos logit contributions
 Can+0.019
elt+0.018
 Please+0.017
Stop+0.017
ksh+0.017
Neg logit contributions
ggy-0.011
ail-0.010
ves-0.010
aches-0.010
live-0.009
Token "
Token ID366
Feature activation+8.55e-02

Pos logit contributions
 Can+0.034
elt+0.033
ened+0.031
Stop+0.031
eared+0.031
Neg logit contributions
ggy-0.045
zag-0.042
ail-0.041
live-0.041
ves-0.041
Token so
Token ID523
Feature activation-2.38e-02

Pos logit contributions
elt+0.067
elling+0.066
 Can+0.062
eared+0.061
Stop+0.059
Neg logit contributions
ggy-0.050
case-0.044
aches-0.042
ves-0.042
lf-0.042
Token it
Token ID340
Feature activation-1.06e-02

Pos logit contributions
ggy+0.045
live+0.044
zag+0.043
ail+0.042
case+0.042
Neg logit contributions
 Can-0.044
 Please-0.040
elt-0.039
 louder-0.038
 ding-0.037
Token asked
Token ID1965
Feature activation+6.18e-03

Pos logit contributions
ggy+0.019
live+0.017
zag+0.017
case+0.017
aches+0.017
Neg logit contributions
 Can-0.021
elt-0.020
 Please-0.018
 Look-0.018
Stop-0.018
Token again
Token ID757
Feature activation-2.86e-02

Pos logit contributions
 Can+0.012
elt+0.012
 Please+0.011
Stop+0.011
ksh+0.011
Neg logit contributions
ggy-0.010
aches-0.009
ail-0.009
ves-0.009
live-0.009
Token,
Token ID11
Feature activation+5.16e-03

Pos logit contributions
ggy+0.050
live+0.047
zag+0.045
case+0.045
aches+0.045
Neg logit contributions
 Can-0.057
elt-0.053
 Please-0.051
 louder-0.049
Stop-0.049
Token "
Token ID366
Feature activation+6.15e-02

Pos logit contributions
elt+0.009
ened+0.009
elling+0.009
eared+0.009
Stop+0.009
Neg logit contributions
ggy-0.008
zag-0.007
ail-0.007
aches-0.007
ves-0.007
TokenWhat
Token ID2061
Feature activation-1.93e-02

Pos logit contributions
 eyes+0.092
ened+0.089
ly+0.088
 Can+0.088
elt+0.085
Neg logit contributions
ggy-0.123
live-0.122
lly-0.118
zag-0.116
aches-0.113
Token do
Token ID466
Feature activation+1.19e-02

Pos logit contributions
live+0.039
ail+0.038
ggy+0.038
zag+0.038
aches+0.037
Neg logit contributions
 Can-0.033
 Please-0.029
elt-0.028
 hello-0.027
 Look-0.027
Token you
Token ID345
Feature activation+7.74e-03

Pos logit contributions
 Can+0.013
 Please+0.012
elt+0.011
 louder+0.010
Stop+0.010
Neg logit contributions
ggy-0.030
live-0.030
zag-0.029
case-0.029
ail-0.029
Token mean
Token ID1612
Feature activation+3.32e-02

Pos logit contributions
 Can+0.011
 Please+0.010
elt+0.010
 Look+0.009
 louder+0.009
Neg logit contributions
ggy-0.018
live-0.017
ves-0.017
ail-0.017
zag-0.017
Token?
Token ID30
Feature activation+2.62e-02

Pos logit contributions
elt+0.086
ksh+0.083
 Can+0.082
ted+0.079
ened+0.079
Neg logit contributions
ggy-0.037
zag-0.031
case-0.030
live-0.029
aches-0.029
Token her
Token ID607
Feature activation-8.76e-03

Pos logit contributions
 Can+0.051
elt+0.049
 Please+0.044
Stop+0.042
 ding+0.042
Neg logit contributions
ggy-0.051
aches-0.046
ail-0.046
live-0.045
ves-0.044
Token hand
Token ID1021
Feature activation-7.37e-03

Pos logit contributions
ggy+0.017
case+0.016
ail+0.016
aches+0.016
zag+0.015
Neg logit contributions
 Can-0.016
elt-0.015
Stop-0.014
 Please-0.014
ted-0.013
Token.
Token ID13
Feature activation-4.60e-03

Pos logit contributions
ggy+0.012
case+0.011
live+0.011
zag+0.011
ail+0.011
Neg logit contributions
 Can-0.016
elt-0.015
Stop-0.014
 Please-0.014
 Look-0.014
Token They
Token ID1119
Feature activation-1.12e-02

Pos logit contributions
ggy+0.009
live+0.008
case+0.008
ail+0.008
zag+0.008
Neg logit contributions
 Can-0.008
elt-0.007
 Please-0.007
 louder-0.007
Stop-0.007
Token watched
Token ID7342
Feature activation+1.47e-02

Pos logit contributions
ggy+0.023
live+0.022
zag+0.021
case+0.021
ail+0.021
Neg logit contributions
 Can-0.018
elt-0.018
 louder-0.017
ted-0.017
 Please-0.016
Token the
Token ID262
Feature activation+8.71e-03

Pos logit contributions
elt+0.032
 Can+0.030
 Please+0.028
vious+0.027
Stop+0.027
Neg logit contributions
ggy-0.023
ves-0.021
 solve-0.020
lf-0.019
case-0.019
Token parade
Token ID16134
Feature activation-1.61e-02

Pos logit contributions
elt+0.014
 Can+0.013
 Please+0.012
ened+0.011
 Look+0.011
Neg logit contributions
ggy-0.019
case-0.018
zag-0.017
ves-0.017
lly-0.017
Token go
Token ID467
Feature activation-3.00e-03

Pos logit contributions
ggy+0.031
zag+0.031
case+0.030
ail+0.029
ves+0.028
Neg logit contributions
 Can-0.029
elt-0.027
 Please-0.026
Stop-0.024
 Look-0.024
Token by
Token ID416
Feature activation-1.78e-02

Pos logit contributions
zag+0.005
ggy+0.005
case+0.005
ail+0.005
ves+0.005
Neg logit contributions
 Can-0.006
elt-0.006
 Please-0.005
 Look-0.005
Stop-0.005
Token and
Token ID290
Feature activation-1.04e-02

Pos logit contributions
ggy+0.040
live+0.038
zag+0.038
ail+0.037
case+0.037
Neg logit contributions
 Can-0.026
elt-0.024
 Please-0.023
Stop-0.021
 louder-0.021
Token Lily
Token ID20037
Feature activation-8.21e-03

Pos logit contributions
ggy+0.018
live+0.017
zag+0.017
case+0.016
aches+0.016
Neg logit contributions
 Can-0.020
elt-0.020
 Please-0.018
Stop-0.017
ly-0.017
Token They
Token ID1119
Feature activation-4.90e-03

Pos logit contributions
ly+0.062
elt+0.058
ting+0.057
heart+0.057
ened+0.055
Neg logit contributions
case-0.053
ggy-0.051
ves-0.046
zag-0.045
live-0.045
Token point
Token ID966
Feature activation+3.65e-02

Pos logit contributions
ggy+0.009
live+0.009
ail+0.008
zag+0.008
ves+0.008
Neg logit contributions
 Can-0.009
 Please-0.008
elt-0.008
 louder-0.008
Stop-0.008
Token it
Token ID340
Feature activation-1.05e-02

Pos logit contributions
 Can+0.072
elt+0.071
ksh+0.070
 Please+0.068
Uh+0.067
Neg logit contributions
zag-0.051
ggy-0.046
ves-0.045
ail-0.044
kins-0.042
Token at
Token ID379
Feature activation+1.73e-02

Pos logit contributions
zag+0.017
ggy+0.017
case+0.017
ves+0.016
aches+0.016
Neg logit contributions
 Can-0.021
elt-0.021
 Please-0.019
Stop-0.019
 Look-0.018
Token things
Token ID1243
Feature activation-1.19e-02

Pos logit contributions
 Can+0.031
elt+0.030
 Please+0.029
ly+0.025
ted+0.025
Neg logit contributions
ggy-0.034
aches-0.031
live-0.031
ves-0.030
zag-0.030
Token and
Token ID290
Feature activation+8.62e-03

Pos logit contributions
ggy+0.020
live+0.019
zag+0.019
case+0.019
aches+0.019
Neg logit contributions
 Can-0.024
elt-0.023
 Please-0.022
 Look-0.020
Stop-0.020
Token make
Token ID787
Feature activation+1.18e-02

Pos logit contributions
elt+0.020
 Can+0.018
elling+0.017
eared+0.017
ted+0.017
Neg logit contributions
ggy-0.012
aches-0.012
case-0.012
zag-0.011
 solve-0.011
Token bright
Token ID6016
Feature activation+4.44e-02

Pos logit contributions
 Can+0.021
elt+0.021
 Please+0.019
ted+0.018
Stop+0.017
Neg logit contributions
ggy-0.024
ves-0.022
aches-0.022
case-0.021
zag-0.021
Token lights
Token ID7588
Feature activation+1.02e-02

Pos logit contributions
elt+0.089
 Please+0.086
 Can+0.083
OU+0.080
Stop+0.079
Neg logit contributions
ggy-0.061
kins-0.059
ves-0.058
zag-0.057
aches-0.056
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
 Can+0.027
elt+0.026
 Please+0.025
Stop+0.025
ted+0.024
Neg logit contributions
zag-0.010
ggy-0.010
case-0.010
ves-0.009
aches-0.009
Token They
Token ID1119
Feature activation+8.79e-05

Pos logit contributions
 Can+0.025
elt+0.025
ly+0.024
ened+0.024
ting+0.023
Neg logit contributions
ggy-0.030
case-0.029
live-0.028
ves-0.027
zag-0.027
Token big
Token ID1263
Feature activation+1.59e-02

Pos logit contributions
ggy+0.028
live+0.026
zag+0.025
case+0.025
ail+0.025
Neg logit contributions
 Can-0.036
elt-0.034
 Please-0.032
ted-0.031
 louder-0.031
Token difference
Token ID3580
Feature activation-3.95e-03

Pos logit contributions
elt+0.034
 Can+0.034
ted+0.032
 Please+0.031
Stop+0.031
Neg logit contributions
ggy-0.024
case-0.023
zag-0.022
ail-0.022
live-0.021
Token,
Token ID11
Feature activation+2.80e-02

Pos logit contributions
ggy+0.007
live+0.006
zag+0.006
case+0.006
aches+0.006
Neg logit contributions
 Can-0.008
elt-0.008
 Please-0.007
Stop-0.007
 louder-0.007
Token and
Token ID290
Feature activation-2.15e-03

Pos logit contributions
Stop+0.054
elt+0.054
ted+0.051
ksh+0.050
ly+0.050
Neg logit contributions
case-0.041
ggy-0.040
 solve-0.039
 overcome-0.038
aches-0.038
Token he
Token ID339
Feature activation-7.68e-03

Pos logit contributions
ggy+0.004
live+0.003
aches+0.003
case+0.003
zag+0.003
Neg logit contributions
 Can-0.004
elt-0.004
 Please-0.004
Stop-0.004
 ding-0.004
Token wanted
Token ID2227
Feature activation+5.80e-03

Pos logit contributions
ggy+0.013
live+0.013
zag+0.012
aches+0.012
case+0.012
Neg logit contributions
 Can-0.015
elt-0.014
 Please-0.014
 louder-0.013
ted-0.013
Token to
Token ID284
Feature activation-1.84e-02

Pos logit contributions
 Can+0.009
elt+0.009
 Please+0.008
 ding+0.008
Stop+0.008
Neg logit contributions
ggy-0.012
aches-0.011
ail-0.011
live-0.011
case-0.011
Token share
Token ID2648
Feature activation+5.10e-03

Pos logit contributions
live+0.035
ggy+0.034
zag+0.032
ves+0.032
ail+0.032
Neg logit contributions
 Can-0.034
 Please-0.030
elt-0.029
 Look-0.028
Stop-0.028
Token his
Token ID465
Feature activation-2.90e-02

Pos logit contributions
 Can+0.009
elt+0.009
 Please+0.008
Stop+0.008
ted+0.008
Neg logit contributions
ggy-0.010
aches-0.009
live-0.009
case-0.009
ves-0.009
Token success
Token ID1943
Feature activation-4.25e-02

Pos logit contributions
ggy+0.051
live+0.050
zag+0.048
case+0.047
aches+0.047
Neg logit contributions
 Can-0.056
elt-0.052
 Please-0.051
 louder-0.049
Stop-0.048
Token with
Token ID351
Feature activation-1.61e-02

Pos logit contributions
live+0.084
ggy+0.084
zag+0.079
ail+0.078
aches+0.077
Neg logit contributions
 Can-0.071
 Please-0.064
elt-0.063
ly-0.062
 louder-0.061
Token The
Token ID383
Feature activation+1.44e-03

Pos logit contributions
 Can+0.011
ly+0.011
elt+0.011
ting+0.011
ened+0.010
Neg logit contributions
ggy-0.011
case-0.010
live-0.010
zag-0.010
ail-0.010
Token cat
Token ID3797
Feature activation+4.34e-03

Pos logit contributions
 Can+0.002
elt+0.002
 Please+0.002
 louder+0.002
Stop+0.002
Neg logit contributions
ggy-0.003
live-0.003
zag-0.003
aches-0.003
ail-0.003
Token said
Token ID531
Feature activation+1.80e-02

Pos logit contributions
 Can+0.008
elt+0.007
 Please+0.007
 louder+0.007
ted+0.007
Neg logit contributions
ggy-0.008
live-0.008
zag-0.008
aches-0.007
case-0.007
Token,
Token ID11
Feature activation+1.88e-02

Pos logit contributions
 Can+0.035
elt+0.035
ksh+0.034
Stop+0.034
ting+0.033
Neg logit contributions
ggy-0.031
ves-0.028
aches-0.027
ail-0.026
live-0.026
Token "
Token ID366
Feature activation+6.88e-02

Pos logit contributions
Stop+0.034
elt+0.034
ened+0.033
eared+0.033
elling+0.033
Neg logit contributions
ggy-0.030
ves-0.028
case-0.028
ail-0.027
zag-0.027
TokenLet
Token ID5756
Feature activation+3.06e-02

Pos logit contributions
 eyes+0.126
ened+0.120
 hello+0.118
ening+0.117
 Can+0.115
Neg logit contributions
live-0.121
ggy-0.115
lly-0.103
aches-0.101
ves-0.100
Token's
Token ID338
Feature activation-1.65e-02

Pos logit contributions
 Can+0.058
elt+0.056
 Please+0.050
 louder+0.048
ted+0.048
Neg logit contributions
ggy-0.058
live-0.054
ail-0.053
aches-0.052
case-0.051
Token clean
Token ID3424
Feature activation+3.40e-02

Pos logit contributions
ggy+0.030
live+0.029
zag+0.028
ves+0.028
ail+0.027
Neg logit contributions
 Can-0.031
 Please-0.029
elt-0.028
 louder-0.027
 Look-0.027
Token it
Token ID340
Feature activation-4.71e-02

Pos logit contributions
elt+0.065
 Can+0.065
 Please+0.060
Stop+0.055
 Look+0.055
Neg logit contributions
ggy-0.061
ail-0.056
aches-0.056
live-0.056
case-0.054
Token together
Token ID1978
Feature activation-3.84e-02

Pos logit contributions
ggy+0.112
live+0.108
ves+0.101
aches+0.100
oppy+0.098
Neg logit contributions
 Can-0.065
 louder-0.058
 Please-0.055
ly-0.055
elt-0.054
Token!"
Token ID2474
Feature activation-1.53e-03

Pos logit contributions
ggy+0.075
live+0.073
zag+0.069
aches+0.068
ail+0.068
Neg logit contributions
 Can-0.069
 Please-0.062
elt-0.061
 louder-0.059
 Look-0.057
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
ggy+0.003
live+0.003
zag+0.003
case+0.003
ail+0.003
Neg logit contributions
 Can-0.004
elt-0.004
 Please-0.004
Stop-0.004
 Look-0.004
Token But
Token ID887
Feature activation-1.46e-02

Pos logit contributions
 Can+0.015
elt+0.014
 ding+0.014
 louder+0.014
ting+0.014
Neg logit contributions
case-0.023
ggy-0.022
live-0.022
ail-0.021
zag-0.020
Token she
Token ID673
Feature activation-7.43e-03

Pos logit contributions
ggy+0.033
live+0.032
zag+0.031
case+0.030
aches+0.030
Neg logit contributions
 Can-0.022
elt-0.020
 Please-0.019
 louder-0.018
Stop-0.017
Token didn
Token ID1422
Feature activation-2.41e-02

Pos logit contributions
ggy+0.015
zag+0.015
live+0.014
aches+0.014
ves+0.014
Neg logit contributions
 Can-0.012
 Please-0.011
elt-0.011
 louder-0.011
ly-0.010
Token't
Token ID470
Feature activation-1.55e-03

Pos logit contributions
live+0.042
ggy+0.041
aches+0.041
kins+0.040
ail+0.038
Neg logit contributions
 Can-0.047
 Please-0.043
 louder-0.040
elt-0.040
Stop-0.040
Token give
Token ID1577
Feature activation+6.16e-03

Pos logit contributions
ggy+0.003
live+0.003
zag+0.003
ail+0.003
ves+0.003
Neg logit contributions
 Can-0.003
 Please-0.003
elt-0.003
 louder-0.003
ly-0.002
Token up
Token ID510
Feature activation+2.81e-04

Pos logit contributions
 Can+0.011
elt+0.010
 Please+0.010
 louder+0.009
Stop+0.009
Neg logit contributions
ggy-0.012
live-0.012
zag-0.011
aches-0.011
ail-0.011
Token.
Token ID13
Feature activation+7.07e-03

Pos logit contributions
 Can+0.001
elt+0.001
 Please+0.001
Stop+0.001
 Look+0.001
Neg logit contributions
ggy-0.000
live-0.000
aches-0.000
case-0.000
ail-0.000
Token She
Token ID1375
Feature activation-2.99e-03

Pos logit contributions
 Can+0.010
ting+0.010
elt+0.010
 ding+0.010
Stop+0.010
Neg logit contributions
case-0.014
ail-0.013
ggy-0.013
live-0.013
oppy-0.013
Token tried
Token ID3088
Feature activation+1.01e-03

Pos logit contributions
ggy+0.006
live+0.005
zag+0.005
aches+0.005
ail+0.005
Neg logit contributions
 Can-0.006
elt-0.005
 Please-0.005
 louder-0.005
ly-0.005
Token again
Token ID757
Feature activation-2.47e-02

Pos logit contributions
 Can+0.002
elt+0.002
 Please+0.001
ly+0.001
ted+0.001
Neg logit contributions
ggy-0.002
zag-0.002
live-0.002
ail-0.002
aches-0.002
Token made
Token ID925
Feature activation+2.51e-04

Pos logit contributions
ggy+0.035
live+0.035
zag+0.033
ail+0.033
case+0.032
Neg logit contributions
 Can-0.036
 Please-0.034
elt-0.033
 louder-0.032
ly-0.031
Token was
Token ID373
Feature activation-8.09e-03

Pos logit contributions
 Can+0.000
elt+0.000
 Please+0.000
Stop+0.000
ted+0.000
Neg logit contributions
ggy-0.001
aches-0.000
ail-0.000
ves-0.000
live-0.000
Token a
Token ID257
Feature activation-2.10e-02

Pos logit contributions
ggy+0.015
live+0.014
zag+0.014
case+0.014
aches+0.014
Neg logit contributions
 Can-0.015
elt-0.014
 Please-0.014
 louder-0.013
Stop-0.013
Token magical
Token ID10883
Feature activation+1.35e-02

Pos logit contributions
ggy+0.035
live+0.032
zag+0.032
case+0.032
ail+0.031
Neg logit contributions
 Can-0.043
elt-0.041
 Please-0.039
ted-0.038
ting-0.037
Token bridge
Token ID7696
Feature activation-1.66e-02

Pos logit contributions
elt+0.029
 Can+0.028
Stop+0.026
ted+0.026
 Please+0.026
Neg logit contributions
ggy-0.022
zag-0.020
case-0.020
ail-0.019
live-0.019
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
ggy+0.027
zag+0.025
case+0.025
live+0.025
aches+0.024
Neg logit contributions
 Can-0.035
elt-0.033
 Please-0.032
Stop-0.031
 Look-0.030
Token When
Token ID1649
Feature activation-2.75e-02

Pos logit contributions
ggy+0.021
live+0.020
case+0.019
zag+0.019
ail+0.019
Neg logit contributions
 Can-0.019
elt-0.018
 Please-0.017
 louder-0.017
ly-0.017
Token they
Token ID484
Feature activation-1.02e-02

Pos logit contributions
ggy+0.050
live+0.048
zag+0.046
case+0.046
aches+0.046
Neg logit contributions
 Can-0.052
elt-0.049
 Please-0.047
 louder-0.045
Stop-0.044
Token wanted
Token ID2227
Feature activation-1.15e-03

Pos logit contributions
ggy+0.019
live+0.018
zag+0.018
ail+0.017
case+0.017
Neg logit contributions
 Can-0.019
elt-0.018
 Please-0.017
ted-0.016
ened-0.016
Token to
Token ID284
Feature activation-1.33e-02

Pos logit contributions
ggy+0.002
zag+0.002
ail+0.002
case+0.002
ves+0.002
Neg logit contributions
 Can-0.002
elt-0.002
 Please-0.002
Stop-0.002
ksh-0.002
Token travel
Token ID3067
Feature activation-7.86e-03

Pos logit contributions
live+0.026
ggy+0.025
ail+0.023
zag+0.023
case+0.023
Neg logit contributions
 Can-0.024
 Please-0.021
elt-0.021
 louder-0.021
 ding-0.020
Token,
Token ID11
Feature activation+1.61e-02

Pos logit contributions
 Can+0.021
elt+0.020
 Please+0.018
Stop+0.018
ted+0.018
Neg logit contributions
ggy-0.017
case-0.016
zag-0.015
live-0.015
aches-0.015
Token and
Token ID290
Feature activation+6.54e-03

Pos logit contributions
 Can+0.027
elt+0.027
elling+0.025
ened+0.025
 Please+0.025
Neg logit contributions
ggy-0.030
case-0.027
zag-0.026
lly-0.026
live-0.026
Token she
Token ID673
Feature activation-8.71e-04

Pos logit contributions
 Can+0.012
elt+0.012
 Please+0.011
Stop+0.010
elling+0.010
Neg logit contributions
ggy-0.012
live-0.011
aches-0.011
lly-0.011
case-0.010
Token ran
Token ID4966
Feature activation+1.45e-02

Pos logit contributions
ggy+0.002
live+0.001
zag+0.001
case+0.001
ail+0.001
Neg logit contributions
 Can-0.002
elt-0.002
 Please-0.002
 louder-0.001
Stop-0.001
Token out
Token ID503
Feature activation-1.90e-02

Pos logit contributions
 Can+0.025
elt+0.023
 Please+0.023
Stop+0.021
 Look+0.021
Neg logit contributions
ggy-0.028
zag-0.027
case-0.027
ail-0.027
live-0.027
Token to
Token ID284
Feature activation-1.31e-02

Pos logit contributions
ggy+0.037
zag+0.036
ail+0.035
live+0.035
case+0.035
Neg logit contributions
 Can-0.033
elt-0.030
 Please-0.029
Stop-0.028
 Look-0.027
Token play
Token ID711
Feature activation-6.38e-03

Pos logit contributions
ggy+0.025
live+0.024
zag+0.023
aches+0.023
case+0.023
Neg logit contributions
 Can-0.024
elt-0.022
 Please-0.021
 louder-0.020
Stop-0.020
Token.
Token ID13
Feature activation+6.98e-03

Pos logit contributions
ggy+0.011
live+0.011
zag+0.010
ail+0.010
case+0.010
Neg logit contributions
 Can-0.013
elt-0.012
 Please-0.012
 louder-0.011
ted-0.011
Token Everywhere
Token ID48201
Feature activation-1.34e-02

Pos logit contributions
 Can+0.010
 louder+0.009
 ding+0.009
elt+0.009
ened+0.009
Neg logit contributions
case-0.015
ggy-0.014
live-0.014
ail-0.013
zag-0.013
Token she
Token ID673
Feature activation-1.33e-02

Pos logit contributions
ggy+0.025
live+0.023
case+0.022
ail+0.022
zag+0.022
Neg logit contributions
 Can-0.025
elt-0.024
 Please-0.023
 louder-0.022
 ding-0.022
Token looked
Token ID3114
Feature activation+8.53e-03

Pos logit contributions
ggy+0.022
live+0.021
lly+0.021
ail+0.021
case+0.021
Neg logit contributions
 Can-0.027
elt-0.025
 Please-0.024
 louder-0.024
 ding-0.023
Token to
Token ID284
Feature activation-1.93e-02

Pos logit contributions
ggy+0.038
live+0.037
zag+0.036
case+0.035
aches+0.035
Neg logit contributions
 Can-0.039
elt-0.036
 Please-0.035
 louder-0.033
ly-0.033
Token answer
Token ID3280
Feature activation+1.83e-02

Pos logit contributions
live+0.034
ggy+0.033
ail+0.031
zag+0.030
ves+0.030
Neg logit contributions
 Can-0.037
 Please-0.034
elt-0.033
 louder-0.033
 ding-0.033
Token them
Token ID606
Feature activation-2.45e-02

Pos logit contributions
elt+0.035
 Can+0.035
 Please+0.032
 Look+0.030
ksh+0.030
Neg logit contributions
ggy-0.032
aches-0.030
ves-0.029
ail-0.029
case-0.029
Token.
Token ID13
Feature activation+6.64e-03

Pos logit contributions
ggy+0.041
live+0.039
zag+0.038
case+0.037
aches+0.037
Neg logit contributions
 Can-0.050
elt-0.047
 Please-0.045
 louder-0.043
Stop-0.043
Token At
Token ID1629
Feature activation-1.90e-02

Pos logit contributions
 Can+0.009
elt+0.009
 louder+0.008
 Please+0.008
 ding+0.008
Neg logit contributions
ggy-0.015
live-0.014
case-0.014
zag-0.014
ail-0.014
Token the
Token ID262
Feature activation+2.16e-03

Pos logit contributions
ggy+0.028
live+0.027
zag+0.026
ail+0.025
aches+0.025
Neg logit contributions
 Can-0.043
elt-0.040
 Please-0.039
 louder-0.037
 Look-0.037
Token end
Token ID886
Feature activation-1.29e-02

Pos logit contributions
 Can+0.005
elt+0.005
 Please+0.005
ted+0.004
ened+0.004
Neg logit contributions
ggy-0.003
live-0.003
case-0.003
zag-0.003
aches-0.003
Token of
Token ID286
Feature activation-2.15e-02

Pos logit contributions
ggy+0.026
live+0.026
aches+0.025
zag+0.025
ail+0.024
Neg logit contributions
 Can-0.022
elt-0.020
 Please-0.019
 louder-0.018
 ding-0.018
Token the
Token ID262
Feature activation-1.02e-02

Pos logit contributions
ggy+0.047
live+0.046
aches+0.045
zag+0.045
case+0.044
Neg logit contributions
 Can-0.033
elt-0.029
 Please-0.028
 louder-0.027
Stop-0.027
Token exam
Token ID2814
Feature activation-3.02e-02

Pos logit contributions
ggy+0.017
live+0.017
zag+0.016
case+0.016
aches+0.016
Neg logit contributions
 Can-0.021
elt-0.019
 Please-0.019
 louder-0.018
Stop-0.018
Token,
Token ID11
Feature activation-9.67e-03

Pos logit contributions
ggy+0.050
live+0.047
case+0.046
zag+0.046
ail+0.045
Neg logit contributions
 Can-0.063
elt-0.059
 Please-0.057
 louder-0.055
Stop-0.054
Token rescue
Token ID9992
Feature activation+2.13e-02

Pos logit contributions
elt+0.028
 Can+0.028
 Please+0.025
ted+0.025
ly+0.025
Neg logit contributions
ggy-0.018
aches-0.017
 solve-0.016
lly-0.015
ail-0.015
Token Tim
Token ID5045
Feature activation-2.73e-03

Pos logit contributions
 Can+0.037
elt+0.036
 Please+0.033
 Look+0.031
 louder+0.031
Neg logit contributions
ggy-0.042
live-0.039
aches-0.039
ail-0.038
ves-0.038
Tokenmy
Token ID1820
Feature activation-1.88e-02

Pos logit contributions
ggy+0.006
live+0.006
zag+0.005
case+0.005
aches+0.005
Neg logit contributions
 Can-0.004
elt-0.004
 Please-0.004
 louder-0.004
ly-0.004
Token and
Token ID290
Feature activation+9.85e-03

Pos logit contributions
ggy+0.037
live+0.036
zag+0.034
ves+0.034
aches+0.034
Neg logit contributions
 Can-0.032
elt-0.030
 Please-0.029
ly-0.028
 louder-0.028
Token his
Token ID465
Feature activation-1.51e-02

Pos logit contributions
 Can+0.020
elt+0.019
 Please+0.018
Stop+0.017
ted+0.017
Neg logit contributions
ggy-0.017
live-0.016
aches-0.015
ail-0.015
case-0.015
Token family
Token ID1641
Feature activation-2.81e-02

Pos logit contributions
ggy+0.027
live+0.026
zag+0.025
case+0.025
ail+0.025
Neg logit contributions
 Can-0.029
elt-0.027
 Please-0.026
 louder-0.025
Stop-0.025
Token.
Token ID13
Feature activation+2.13e-03

Pos logit contributions
live+0.051
ggy+0.050
zag+0.047
aches+0.046
ves+0.045
Neg logit contributions
 Can-0.051
elt-0.049
 Please-0.048
 louder-0.047
ly-0.047
Token They
Token ID1119
Feature activation-9.64e-03

Pos logit contributions
 Can+0.003
elt+0.003
 Please+0.003
 louder+0.003
Stop+0.003
Neg logit contributions
ggy-0.005
live-0.004
zag-0.004
case-0.004
ail-0.004
Token put
Token ID1234
Feature activation+1.72e-02

Pos logit contributions
ggy+0.019
live+0.019
zag+0.018
ves+0.018
aches+0.017
Neg logit contributions
 Can-0.016
 louder-0.015
elt-0.015
ted-0.014
 Please-0.014
Token out
Token ID503
Feature activation+3.12e-03

Pos logit contributions
 Can+0.034
 Please+0.032
elt+0.031
 Look+0.028
Stop+0.028
Neg logit contributions
ggy-0.028
aches-0.028
ail-0.028
kins-0.026
live-0.026
Token the
Token ID262
Feature activation-3.05e-03

Pos logit contributions
 Can+0.007
 Please+0.006
elt+0.006
Stop+0.006
 Look+0.005
Neg logit contributions
ggy-0.005
ail-0.005
aches-0.005
case-0.005
live-0.005
Token want
Token ID765
Feature activation+8.89e-03

Pos logit contributions
ggy+0.008
live+0.007
zag+0.007
case+0.007
ail+0.007
Neg logit contributions
 Can-0.006
elt-0.006
 Please-0.006
 louder-0.005
Stop-0.005
Token to
Token ID284
Feature activation-2.19e-02

Pos logit contributions
 Can+0.014
elt+0.013
 Please+0.013
ting+0.012
Stop+0.012
Neg logit contributions
ggy-0.019
aches-0.017
live-0.017
case-0.017
ail-0.017
Token be
Token ID307
Feature activation-1.43e-03

Pos logit contributions
live+0.043
ggy+0.042
zag+0.041
ail+0.041
ves+0.039
Neg logit contributions
 Can-0.038
 Please-0.035
elt-0.034
Stop-0.032
 ding-0.032
Token good
Token ID922
Feature activation+1.59e-02

Pos logit contributions
ggy+0.003
live+0.003
zag+0.002
case+0.002
ail+0.002
Neg logit contributions
 Can-0.003
elt-0.003
 Please-0.002
 louder-0.002
Stop-0.002
Token.
Token ID13
Feature activation+2.32e-02

Pos logit contributions
 Can+0.036
elt+0.033
Stop+0.032
 Please+0.032
 ding+0.031
Neg logit contributions
ggy-0.023
aches-0.020
live-0.020
zag-0.020
ail-0.020
Token They
Token ID1119
Feature activation-1.79e-03

Pos logit contributions
elt+0.040
ting+0.039
ened+0.038
ted+0.038
ly+0.037
Neg logit contributions
ggy-0.042
case-0.040
live-0.038
ves-0.037
zag-0.035
Token go
Token ID467
Feature activation-4.34e-03

Pos logit contributions
ggy+0.004
live+0.004
aches+0.003
case+0.003
zag+0.003
Neg logit contributions
 Can-0.003
elt-0.003
 Please-0.003
 louder-0.002
ted-0.002
Token to
Token ID284
Feature activation-2.58e-02

Pos logit contributions
ggy+0.011
live+0.011
case+0.010
aches+0.010
ves+0.010
Neg logit contributions
 Can-0.005
elt-0.005
 Please-0.004
 louder-0.004
ted-0.004
Token their
Token ID511
Feature activation-3.78e-02

Pos logit contributions
live+0.063
ggy+0.062
zag+0.060
ves+0.060
case+0.060
Neg logit contributions
 Can-0.034
 louder-0.030
 Please-0.030
elt-0.029
 ding-0.028
Token room
Token ID2119
Feature activation-1.71e-02

Pos logit contributions
live+0.070
ggy+0.070
ves+0.066
aches+0.065
zag+0.065
Neg logit contributions
 Can-0.071
 Please-0.064
 eyes-0.063
elt-0.062
 ding-0.060
Token.
Token ID13
Feature activation+1.83e-02

Pos logit contributions
ggy+0.035
live+0.033
zag+0.033
case+0.032
aches+0.032
Neg logit contributions
 Can-0.028
elt-0.026
 Please-0.025
 louder-0.024
Stop-0.024
Token and
Token ID290
Feature activation-1.82e-02

Pos logit contributions
live+0.102
ggy+0.100
aches+0.094
oppy+0.093
ves+0.092
Neg logit contributions
 Can-0.049
elt-0.048
 louder-0.045
 Please-0.045
ly-0.043
Token put
Token ID1234
Feature activation+4.12e-03

Pos logit contributions
ggy+0.034
live+0.033
zag+0.032
aches+0.032
case+0.032
Neg logit contributions
 Can-0.033
elt-0.031
 Please-0.030
 louder-0.029
ted-0.028
Token it
Token ID340
Feature activation-3.92e-02

Pos logit contributions
 Can+0.007
elt+0.006
 Please+0.006
 louder+0.005
Stop+0.005
Neg logit contributions
ggy-0.009
live-0.008
aches-0.008
ail-0.008
case-0.008
Token away
Token ID1497
Feature activation-3.04e-02

Pos logit contributions
ggy+0.079
live+0.077
zag+0.071
ves+0.071
case+0.071
Neg logit contributions
 Can-0.064
elt-0.062
 louder-0.059
 Please-0.058
ly-0.057
Token.
Token ID13
Feature activation-1.89e-02

Pos logit contributions
ggy+0.055
live+0.053
zag+0.050
aches+0.050
case+0.050
Neg logit contributions
 Can-0.057
elt-0.053
 Please-0.052
 louder-0.051
ly-0.050
Token
Token ID198
Feature activation-5.27e-02

Pos logit contributions
ggy+0.033
live+0.032
zag+0.030
case+0.030
ail+0.030
Neg logit contributions
 Can-0.037
elt-0.035
 Please-0.034
 louder-0.032
Stop-0.032
Token
Token ID198
Feature activation-3.78e-02

Pos logit contributions
aches+0.124
ggy+0.117
live+0.116
zag+0.115
asis+0.113
Neg logit contributions
 Can-0.076
 Please-0.067
elt-0.065
 Look-0.062
 Come-0.058
Token"
Token ID1
Feature activation+2.91e-02

Pos logit contributions
ggy+0.072
live+0.070
aches+0.069
zag+0.069
ail+0.067
Neg logit contributions
 Can-0.070
 Please-0.065
elt-0.063
 louder-0.059
 Look-0.059
TokenCan
Token ID6090
Feature activation-5.07e-03

Pos logit contributions
ly+0.047
ened+0.046
 eyes+0.046
 Can+0.045
elt+0.043
Neg logit contributions
live-0.056
ggy-0.053
lly-0.051
ves-0.048
case-0.048
Token I
Token ID314
Feature activation-1.77e-02

Pos logit contributions
ggy+0.010
live+0.010
case+0.010
zag+0.010
ail+0.010
Neg logit contributions
 Can-0.008
elt-0.008
 louder-0.007
 Please-0.007
ted-0.007
Token have
Token ID423
Feature activation+1.61e-02

Pos logit contributions
live+0.030
ggy+0.030
zag+0.028
ves+0.028
ail+0.028
Neg logit contributions
 Can-0.036
 Please-0.033
elt-0.033
 Look-0.031
ly-0.030
Token with
Token ID351
Feature activation-2.70e-02

Pos logit contributions
live+0.140
ggy+0.136
zag+0.131
ail+0.128
aches+0.126
Neg logit contributions
 Can-0.111
 Please-0.103
elt-0.101
ly-0.097
 louder-0.095
Token her
Token ID607
Feature activation-3.23e-02

Pos logit contributions
ggy+0.065
zag+0.065
live+0.064
case+0.062
ail+0.062
Neg logit contributions
 Can-0.034
 Please-0.029
elt-0.029
 louder-0.028
ted-0.027
Token family
Token ID1641
Feature activation-3.36e-02

Pos logit contributions
live+0.056
ggy+0.055
aches+0.053
zag+0.052
case+0.051
Neg logit contributions
 Can-0.064
 Please-0.057
elt-0.056
 louder-0.056
 eyes-0.055
Token.
Token ID13
Feature activation-2.36e-02

Pos logit contributions
live+0.061
ggy+0.060
zag+0.058
aches+0.058
ves+0.057
Neg logit contributions
 Can-0.061
 Please-0.055
elt-0.055
ly-0.054
 louder-0.054
Token<|endoftext|>
Token ID50256
Feature activation-1.32e-02

Pos logit contributions
ggy+0.044
aches+0.043
zag+0.041
live+0.040
ves+0.040
Neg logit contributions
 Can-0.046
elt-0.043
 Please-0.041
 Look-0.039
ted-0.038
TokenOnce
Token ID7454
Feature activation-7.56e-02

Pos logit contributions
aches+0.029
ggy+0.028
live+0.026
zag+0.026
ail+0.026
Neg logit contributions
 Can-0.023
elt-0.020
ted-0.019
 Please-0.019
 Look-0.019
Token upon
Token ID2402
Feature activation-1.12e-03

Pos logit contributions
ggy+0.181
live+0.167
aches+0.164
ves+0.160
bes+0.160
Neg logit contributions
 Can-0.099
elt-0.086
 hello-0.086
ly-0.084
 Please-0.080
Token a
Token ID257
Feature activation-1.10e-02

Pos logit contributions
ggy+0.003
live+0.002
zag+0.002
ail+0.002
aches+0.002
Neg logit contributions
 Can-0.002
elt-0.001
 Please-0.001
 louder-0.001
Stop-0.001
Token time
Token ID640
Feature activation-3.11e-02

Pos logit contributions
ggy+0.018
aches+0.017
ail+0.017
zag+0.017
live+0.017
Neg logit contributions
 Can-0.021
 Please-0.020
elt-0.020
 louder-0.019
ly-0.019
Token,
Token ID11
Feature activation-1.46e-02

Pos logit contributions
ggy+0.060
live+0.059
aches+0.058
zag+0.055
ves+0.055
Neg logit contributions
 Can-0.056
elt-0.051
 Please-0.049
 Look-0.047
 hello-0.047
Token there
Token ID612
Feature activation-1.51e-02

Pos logit contributions
aches+0.030
ggy+0.030
live+0.030
ves+0.029
ail+0.028
Neg logit contributions
 Can-0.025
elt-0.022
 Please-0.022
 Look-0.021
 louder-0.021
Token laughed
Token ID13818
Feature activation+4.23e-02

Pos logit contributions
 Can+0.018
elt+0.017
 Please+0.016
 louder+0.016
ted+0.016
Neg logit contributions
ggy-0.021
live-0.020
aches-0.019
zag-0.019
ail-0.019
Token and
Token ID290
Feature activation+1.38e-02

Pos logit contributions
 Can+0.076
elt+0.076
Stop+0.071
 spec+0.070
elling+0.070
Neg logit contributions
ggy-0.074
case-0.063
zag-0.062
ves-0.061
ail-0.059
Token cl
Token ID537
Feature activation+4.02e-02

Pos logit contributions
 Can+0.024
elt+0.023
 Please+0.022
 louder+0.021
ted+0.021
Neg logit contributions
ggy-0.027
live-0.026
case-0.025
zag-0.025
ail-0.025
Tokenapped
Token ID6320
Feature activation+3.29e-02

Pos logit contributions
 Can+0.062
elt+0.055
 Please+0.054
 louder+0.052
Stop+0.051
Neg logit contributions
ggy-0.088
live-0.084
zag-0.082
ail-0.081
case-0.080
Token.
Token ID13
Feature activation+9.79e-03

Pos logit contributions
elt+0.081
 Can+0.078
Stop+0.073
elling+0.073
ted+0.072
Neg logit contributions
ggy-0.045
ves-0.036
aches-0.036
zag-0.035
kins-0.034
Token
Token ID198
Feature activation-4.51e-02

Pos logit contributions
 Can+0.017
elt+0.016
 Please+0.014
 Look+0.014
 louder+0.013
Neg logit contributions
ggy-0.020
live-0.020
aches-0.020
case-0.019
ail-0.019
Token
Token ID198
Feature activation-1.29e-02

Pos logit contributions
aches+0.111
kins+0.108
kin+0.108
live+0.105
zag+0.102
Neg logit contributions
 Can-0.050
 It-0.044
 Come-0.039
Stop-0.039
elt-0.038
TokenTogether
Token ID41631
Feature activation-9.31e-03

Pos logit contributions
ail+0.024
aches+0.024
ggy+0.024
oppy+0.024
live+0.023
Neg logit contributions
 Can-0.023
Stop-0.021
 Please-0.021
elt-0.020
 ding-0.019
Token,
Token ID11
Feature activation-9.40e-03

Pos logit contributions
ggy+0.017
live+0.016
zag+0.015
aches+0.015
case+0.015
Neg logit contributions
 Can-0.018
elt-0.017
 Please-0.016
 louder-0.016
ly-0.016
Token Tim
Token ID5045
Feature activation+1.82e-03

Pos logit contributions
ggy+0.018
live+0.017
zag+0.017
case+0.017
ail+0.017
Neg logit contributions
 Can-0.017
elt-0.016
 Please-0.015
 louder-0.015
ted-0.014
Token and
Token ID290
Feature activation+1.54e-02

Pos logit contributions
 Can+0.002
elt+0.002
 Please+0.002
 louder+0.002
Stop+0.002
Neg logit contributions
ggy-0.004
live-0.004
zag-0.004
case-0.004
aches-0.004
Token the
Token ID262
Feature activation-7.55e-03

Pos logit contributions
 Can+0.013
elt+0.012
 Please+0.012
 ding+0.011
ly+0.011
Neg logit contributions
ggy-0.013
live-0.011
aches-0.011
ail-0.011
zag-0.011
Token grass
Token ID8701
Feature activation-1.67e-02

Pos logit contributions
ggy+0.016
live+0.016
zag+0.015
case+0.015
aches+0.015
Neg logit contributions
 Can-0.012
elt-0.011
 Please-0.011
 louder-0.010
Stop-0.010
Token and
Token ID290
Feature activation+4.65e-03

Pos logit contributions
ggy+0.023
zag+0.023
case+0.022
live+0.022
ail+0.021
Neg logit contributions
 Can-0.039
elt-0.036
 Please-0.036
 Look-0.034
Stop-0.034
Token stones
Token ID14966
Feature activation-3.25e-02

Pos logit contributions
 Can+0.010
elt+0.009
 Please+0.009
ly+0.008
elling+0.008
Neg logit contributions
ggy-0.008
live-0.007
aches-0.007
case-0.007
zag-0.007
Token.
Token ID13
Feature activation+7.39e-03

Pos logit contributions
ggy+0.053
live+0.051
zag+0.049
case+0.048
aches+0.048
Neg logit contributions
 Can-0.067
elt-0.063
 Please-0.061
 louder-0.059
Stop-0.058
Token
Token ID198
Feature activation-4.48e-02

Pos logit contributions
 Can+0.012
elt+0.011
 Please+0.011
 louder+0.010
ly+0.010
Neg logit contributions
ggy-0.015
live-0.015
case-0.014
zag-0.014
ail-0.014
Token
Token ID198
Feature activation-2.99e-02

Pos logit contributions
aches+0.110
asis+0.106
live+0.105
zag+0.104
ics+0.102
Neg logit contributions
 Can-0.061
 Please-0.053
elt-0.049
 louder-0.049
 Come-0.048
Token"
Token ID1
Feature activation+3.90e-02

Pos logit contributions
zag+0.062
aches+0.062
case+0.061
ggy+0.061
live+0.060
Neg logit contributions
 Can-0.048
 Please-0.045
Stop-0.042
 louder-0.041
elt-0.041
TokenHelp
Token ID22087
Feature activation+2.93e-02

Pos logit contributions
 Can+0.053
elt+0.051
ly+0.048
 Please+0.045
 hello+0.044
Neg logit contributions
ggy-0.093
live-0.088
ves-0.084
aches-0.082
lly-0.081
Token,
Token ID11
Feature activation+1.04e-03

Pos logit contributions
 Can+0.064
elt+0.062
 Please+0.058
ksh+0.055
 ding+0.055
Neg logit contributions
ggy-0.045
ail-0.041
live-0.040
oppy-0.038
aches-0.038
Token help
Token ID1037
Feature activation-1.64e-02

Pos logit contributions
 Can+0.002
elt+0.002
 Please+0.002
ened+0.002
ly+0.002
Neg logit contributions
ggy-0.002
live-0.002
aches-0.002
ail-0.002
zag-0.001
Token-
Token ID12
Feature activation+1.55e-02

Pos logit contributions
 Can+0.033
elt+0.031
 Please+0.030
 louder+0.029
Stop+0.028
Neg logit contributions
ggy-0.022
live-0.021
zag-0.020
aches-0.020
case-0.020
Tokenyear
Token ID1941
Feature activation+2.88e-02

Pos logit contributions
 Can+0.024
 Please+0.021
elt+0.020
 Look+0.019
 louder+0.019
Neg logit contributions
ggy-0.035
live-0.034
aches-0.033
zag-0.033
ail-0.032
Token old
Token ID1468
Feature activation+7.53e-03

Pos logit contributions
 Can+0.047
elt+0.042
 Please+0.041
Stop+0.039
 Look+0.039
Neg logit contributions
ggy-0.063
aches-0.059
live-0.059
ail-0.058
zag-0.058
Token girl
Token ID2576
Feature activation-2.23e-02

Pos logit contributions
 Can+0.011
elt+0.010
 Please+0.009
 Look+0.009
 ding+0.009
Neg logit contributions
ggy-0.017
live-0.016
aches-0.016
ail-0.016
zag-0.016
Token played
Token ID2826
Feature activation-1.49e-02

Pos logit contributions
ggy+0.040
live+0.038
zag+0.036
case+0.036
ves+0.035
Neg logit contributions
 Can-0.041
elt-0.040
 Please-0.037
ted-0.037
ened-0.036
Token together
Token ID1978
Feature activation-4.68e-02

Pos logit contributions
ggy+0.027
live+0.026
zag+0.025
case+0.025
ail+0.025
Neg logit contributions
 Can-0.028
elt-0.026
 Please-0.025
 louder-0.024
Stop-0.024
Token all
Token ID477
Feature activation-1.83e-02

Pos logit contributions
ggy+0.085
live+0.083
ail+0.079
zag+0.078
case+0.078
Neg logit contributions
 Can-0.086
elt-0.079
 Please-0.077
 louder-0.076
ly-0.074
Token day
Token ID1110
Feature activation-5.00e-02

Pos logit contributions
ggy+0.028
live+0.026
case+0.026
ail+0.025
aches+0.025
Neg logit contributions
 Can-0.040
elt-0.036
 louder-0.036
ly-0.035
 Please-0.035
Token long
Token ID890
Feature activation-1.13e-02

Pos logit contributions
live+0.097
ggy+0.092
aches+0.091
ail+0.088
zag+0.087
Neg logit contributions
 Can-0.079
ly-0.077
 Please-0.076
 louder-0.073
ting-0.072
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
ggy+0.019
live+0.017
ves+0.017
aches+0.016
case+0.016
Neg logit contributions
 Can-0.024
elt-0.023
 Please-0.022
 Look-0.021
 ding-0.021
Token<|endoftext|>
Token ID50256
Feature activation-1.55e-02

Pos logit contributions
ggy+0.029
aches+0.027
live+0.027
zag+0.026
ail+0.026
Neg logit contributions
 Can-0.026
elt-0.024
 Please-0.022
 Look-0.021
ted-0.021