TOP ACTIVATIONS
MAX = 0.150

. The merry - go - round went faster and faster
. The merry - go - round spins . Jill and
was a beautiful pink c asser ole . The
to the merry - go - round . They climbed on
. They ran to the officer and hugged Spot . They
Rachel then went through the security gate . She was so
much ." The officer laughed and pet ted Spot
. He kept throwing a tant rum . His mom was
his feet and threw a tant rum . "
stubborn . She throws a tant rum . She kicks and
brave and for winning the fight . <|endoftext|> Once there were
, the merry - go - round . They think .
tickets and went to the security gate . At
stopped the merry - go - round and helped them get
surprise . He threw a tant rum . He kicked and
stopped the merry - go - round . Tom and Lily
to the merry - go - round . They
said . The officer shook his head . He
listen . She threw a tant rum in the store .
and the merry - go - round . They have fun

MIN ACTIVATIONS
MIN = -0.158

thing she always hated was vegetables . Every day , when
ier ? You are y umm ier . You are my
said , " That is soil , Daisy . It may
so many beautiful things : vegetables and fruits , a big
a . Ell a loved vegetables ! Every day she went
said , " I love vegetables ! They 're so delicious
them out . They were vegetables ! They were carrots ,
was a family who liked vegetables . They especially liked caul
the materials he needed : soil and pe b bles .
saw many interesting things like vegetables and fruits .
full of all types of vegetables . All the vegetables were
sees tomatoes . She likes vegetables . She likes to eat
loved seeing what fruits and vegetables were in season , and
herbs . " Those are herbs , Lily ," she said
. Lily didn 't like vegetables , but she wanted to
healthy food . There were vegetables , fruits and nuts .
bananas . There were also vegetables , like carrots and peas
old , and he loved vegetables . Every day he would
. It was even y umm ier than the first card
with pictures of fruits , vegetables and flowers to help remind

INTERVAL 0.073 - 0.150
CONTAINS 0.634%

the girl and sw am away back into the sea .
" The second friend said ,
. And they learned that not all flowers and pets were
was wrong . I should not have taken your bike .
. They walk to the train station , following the steam

INTERVAL -0.004 - 0.073
CONTAINS 54.702%

play with Lily and Ben anymore . He runs back to
!" cried Tim my . " My knee hurts !"
she went to her grand parent 's house . She chose
in the air . He looked around and noticed a bird
that day on , he was a much happier man .

INTERVAL -0.081 - -0.004
CONTAINS 44.358%

be so spoiled anymore ! <|endoftext|> Jim was a strong boy
pants ! They will help me mail letters !"
saw planes flying in the sky . Lily felt aw ed
as he could . When he had had
" Sorry , sweet ie , we don 't

INTERVAL -0.158 - -0.081
CONTAINS 0.306%

He knows a lot about stones and metals . He will
The bear loved to eat honey from the be eh ive
it , like apples and carrots . One
made sure he remembered his mum 's lesson - never disturb
stories about hopping little b unn ies who came to visit
Token.
Token ID13
Feature activation+2.41e-02

Pos logit contributions
arl+0.047
ags+0.040
ps+0.039
agged+0.039
 stern+0.039
Neg logit contributions
-0.021
light-0.020
 whenever-0.016
pen-0.015
 Tues-0.014
Token The
Token ID383
Feature activation+2.83e-02

Pos logit contributions
ps+0.043
chie+0.042
arl+0.042
outing+0.039
 stern+0.038
Neg logit contributions
-0.036
 whenever-0.032
light-0.029
 Tues-0.028
 nap-0.027
Token merry
Token ID44570
Feature activation+1.05e-01

Pos logit contributions
arl+0.051
ps+0.045
ags+0.045
agged+0.045
uzz+0.043
Neg logit contributions
-0.045
 nap-0.040
 night-0.038
 bal-0.036
 dream-0.035
Token-
Token ID12
Feature activation+1.21e-01

Pos logit contributions
arl+0.204
ps+0.173
 stern+0.169
ags+0.166
uzz+0.161
Neg logit contributions
light-0.148
-0.146
 whenever-0.128
 awoke-0.126
 nap-0.121
Tokengo
Token ID2188
Feature activation+7.44e-02

Pos logit contributions
arl+0.243
ps+0.220
ags+0.217
ses+0.211
 stern+0.208
Neg logit contributions
-0.167
light-0.151
 bal-0.146
 whenever-0.130
on-0.125
Token-
Token ID12
Feature activation+1.50e-01

Pos logit contributions
arl+0.139
ps+0.117
ags+0.115
 stern+0.114
agged+0.109
Neg logit contributions
-0.115
light-0.114
 awoke-0.096
 whenever-0.095
pen-0.094
Tokenround
Token ID744
Feature activation+4.66e-02

Pos logit contributions
arl+0.259
ps+0.229
ags+0.223
 stern+0.219
ses+0.216
Neg logit contributions
-0.261
light-0.240
 whenever-0.222
 bal-0.219
 Tues-0.198
Token went
Token ID1816
Feature activation+8.41e-03

Pos logit contributions
arl+0.105
ps+0.093
ags+0.090
 stern+0.089
agged+0.087
Neg logit contributions
-0.052
light-0.050
 awoke-0.040
?'-0.039
 whenever-0.039
Token faster
Token ID5443
Feature activation+6.88e-02

Pos logit contributions
arl+0.016
ses+0.015
ps+0.014
 apology+0.014
agged+0.014
Neg logit contributions
-0.012
 whenever-0.012
 night-0.010
 when-0.010
light-0.010
Token and
Token ID290
Feature activation+3.66e-02

Pos logit contributions
arl+0.165
ps+0.144
ags+0.142
 stern+0.142
agged+0.140
Neg logit contributions
-0.068
light-0.064
 whenever-0.054
pen-0.048
 awoke-0.047
Token faster
Token ID5443
Feature activation+7.07e-02

Pos logit contributions
arl+0.057
ags+0.045
ps+0.043
agged+0.043
uzz+0.042
Neg logit contributions
-0.075
light-0.067
 whenever-0.065
 tucked-0.061
 since-0.060
Token.
Token ID13
Feature activation+8.36e-03

Pos logit contributions
arl+0.063
ps+0.056
ags+0.054
 stern+0.053
agged+0.052
Neg logit contributions
-0.042
light-0.041
 awoke-0.034
?'-0.034
pen-0.033
Token The
Token ID383
Feature activation+2.31e-02

Pos logit contributions
arl+0.015
ps+0.014
 stern+0.013
outing+0.013
ags+0.013
Neg logit contributions
-0.013
light-0.012
 whenever-0.011
?'-0.010
 awoke-0.010
Token merry
Token ID44570
Feature activation+1.01e-01

Pos logit contributions
arl+0.041
uzz+0.035
ps+0.034
agged+0.034
 stern+0.034
Neg logit contributions
-0.037
light-0.034
 awoke-0.030
 nap-0.030
 whenever-0.030
Token-
Token ID12
Feature activation+1.17e-01

Pos logit contributions
arl+0.193
 stern+0.164
ps+0.161
uzz+0.158
outing+0.155
Neg logit contributions
light-0.139
-0.137
 awoke-0.121
 whenever-0.120
 nap-0.117
Tokengo
Token ID2188
Feature activation+7.18e-02

Pos logit contributions
arl+0.254
ses+0.226
ags+0.226
ps+0.221
 stern+0.220
Neg logit contributions
-0.139
light-0.129
 bal-0.119
 whenever-0.110
me-0.102
Token-
Token ID12
Feature activation+1.49e-01

Pos logit contributions
arl+0.136
 stern+0.115
ps+0.113
ags+0.112
ses+0.109
Neg logit contributions
-0.108
light-0.106
 awoke-0.089
 whenever-0.089
pen-0.086
Tokenround
Token ID744
Feature activation+4.56e-02

Pos logit contributions
arl+0.260
ps+0.222
ags+0.221
 stern+0.217
ses+0.216
Neg logit contributions
-0.258
light-0.244
 whenever-0.223
 bal-0.213
 Tues-0.203
Token spins
Token ID37621
Feature activation-2.92e-03

Pos logit contributions
arl+0.092
ps+0.079
ags+0.078
 stern+0.077
agged+0.075
Neg logit contributions
-0.061
light-0.059
 awoke-0.050
 whenever-0.048
pen-0.048
Token.
Token ID13
Feature activation+4.71e-03

Pos logit contributions
+0.003
light+0.003
 whenever+0.003
 Tues+0.002
 since+0.002
Neg logit contributions
arl-0.007
 apology-0.006
ses-0.006
ags-0.006
agged-0.006
Token Jill
Token ID22772
Feature activation+2.78e-03

Pos logit contributions
arl+0.009
ps+0.008
 stern+0.008
outing+0.007
ags+0.007
Neg logit contributions
-0.007
light-0.006
 whenever-0.006
?'-0.006
 awoke-0.005
Token and
Token ID290
Feature activation+1.08e-02

Pos logit contributions
arl+0.005
ps+0.004
ks+0.004
 stern+0.004
outs+0.004
Neg logit contributions
light-0.004
-0.004
 supper-0.004
?'-0.004
umm-0.004
Token was
Token ID373
Feature activation-8.45e-03

Pos logit contributions
ps+0.021
 stern+0.021
arl+0.019
Their+0.018
 innocent+0.018
Neg logit contributions
light-0.017
pen-0.017
-0.016
?'-0.016
 Tues-0.015
Token a
Token ID257
Feature activation-7.04e-03

Pos logit contributions
light+0.015
+0.015
pen+0.014
 awoke+0.014
?'+0.014
Neg logit contributions
arl-0.011
ps-0.011
 stern-0.011
ags-0.009
Their-0.009
Token beautiful
Token ID4950
Feature activation-4.75e-02

Pos logit contributions
+0.013
light+0.013
pen+0.011
 awoke+0.011
 whenever+0.011
Neg logit contributions
arl-0.010
 stern-0.009
ps-0.008
Their-0.008
ses-0.007
Token pink
Token ID11398
Feature activation-4.31e-02

Pos logit contributions
+0.079
light+0.072
pen+0.067
 awoke+0.064
 whenever+0.063
Neg logit contributions
arl-0.073
 stern-0.069
ps-0.064
 apology-0.059
ses-0.059
Token c
Token ID269
Feature activation-1.31e-02

Pos logit contributions
+0.070
light+0.063
pen+0.058
 awoke+0.057
 Tues+0.056
Neg logit contributions
arl-0.067
ps-0.060
 stern-0.059
ses-0.055
sed-0.054
Tokenasser
Token ID24929
Feature activation+1.46e-01

Pos logit contributions
+0.020
light+0.020
 awoke+0.017
?'+0.017
 whenever+0.016
Neg logit contributions
arl-0.024
ps-0.021
ags-0.020
 stern-0.019
agged-0.019
Tokenole
Token ID2305
Feature activation-2.02e-03

Pos logit contributions
arl+0.272
ps+0.235
ags+0.231
ses+0.225
 stern+0.225
Neg logit contributions
-0.218
light-0.209
 whenever-0.178
?'-0.177
 awoke-0.173
Token.
Token ID13
Feature activation-1.50e-02

Pos logit contributions
light+0.003
 awoke+0.003
 recite+0.003
?'+0.002
pen+0.002
Neg logit contributions
arl-0.003
 stern-0.003
ps-0.003
 greedy-0.003
 worker-0.003
Token
Token ID198
Feature activation-2.02e-02

Pos logit contributions
light+0.027
pen+0.025
 awoke+0.024
+0.023
 recite+0.023
Neg logit contributions
arl-0.022
 stern-0.019
Their-0.018
ags-0.017
 Their-0.017
Token
Token ID198
Feature activation-1.75e-02

Pos logit contributions
light+0.033
+0.029
 awoke+0.028
pen+0.027
 tucked+0.026
Neg logit contributions
arl-0.035
Their-0.031
 stern-0.029
uzz-0.028
ags-0.028
TokenThe
Token ID464
Feature activation+1.18e-02

Pos logit contributions
light+0.030
 awoke+0.027
pen+0.027
ridge+0.026
 laid+0.025
Neg logit contributions
arl-0.027
Their-0.026
 stern-0.022
outing-0.021
uzz-0.020
Token to
Token ID284
Feature activation+3.57e-02

Pos logit contributions
arl+0.085
ps+0.074
outing+0.073
ses+0.072
ags+0.072
Neg logit contributions
-0.047
 whenever-0.041
light-0.039
 nap-0.036
 night-0.036
Token the
Token ID262
Feature activation+2.09e-02

Pos logit contributions
arl+0.085
agged+0.075
ps+0.073
outing+0.073
ags+0.071
Neg logit contributions
-0.040
 nap-0.030
light-0.029
 whenever-0.029
?'-0.028
Token merry
Token ID44570
Feature activation+1.19e-01

Pos logit contributions
arl+0.039
ps+0.033
ags+0.032
 stern+0.032
ses+0.031
Neg logit contributions
light-0.031
-0.031
 awoke-0.026
?'-0.026
pen-0.026
Token-
Token ID12
Feature activation+1.22e-01

Pos logit contributions
arl+0.204
ps+0.174
ags+0.169
 stern+0.163
agged+0.161
Neg logit contributions
-0.199
light-0.190
 awoke-0.165
?'-0.164
pen-0.160
Tokengo
Token ID2188
Feature activation+7.23e-02

Pos logit contributions
arl+0.207
ps+0.186
ags+0.180
ses+0.177
 stern+0.173
Neg logit contributions
-0.209
light-0.192
 bal-0.175
 whenever-0.169
pen-0.165
Token-
Token ID12
Feature activation+1.44e-01

Pos logit contributions
arl+0.121
ps+0.103
ags+0.099
 stern+0.096
orts+0.095
Neg logit contributions
-0.122
light-0.118
 awoke-0.105
?'-0.103
pen-0.100
Tokenround
Token ID744
Feature activation+4.98e-02

Pos logit contributions
arl+0.225
ps+0.202
ags+0.192
ses+0.191
 stern+0.189
Neg logit contributions
-0.271
light-0.249
 whenever-0.229
 bal-0.223
pen-0.219
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
arl+0.117
ps+0.101
ags+0.099
agged+0.098
 stern+0.097
Neg logit contributions
-0.054
light-0.054
 whenever-0.045
pen-0.041
 awoke-0.040
Token They
Token ID1119
Feature activation+2.90e-02

Pos logit contributions
ps+0.023
arl+0.022
chie+0.022
outing+0.021
ags+0.020
Neg logit contributions
-0.019
 whenever-0.016
light-0.015
 nap-0.014
 Tues-0.014
Token climbed
Token ID19952
Feature activation-1.04e-02

Pos logit contributions
arl+0.059
ps+0.052
ags+0.050
 stern+0.050
agged+0.049
Neg logit contributions
-0.038
light-0.037
 awoke-0.031
?'-0.030
pen-0.030
Token on
Token ID319
Feature activation+4.69e-02

Pos logit contributions
+0.014
light+0.012
 whenever+0.012
 awoke+0.011
 Tues+0.011
Neg logit contributions
arl-0.021
 stern-0.018
ps-0.018
outing-0.018
 apology-0.017
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
arl+0.126
agged+0.113
outing+0.111
ags+0.111
ps+0.109
Neg logit contributions
-0.046
 whenever-0.037
light-0.035
 nap-0.029
 night-0.029
Token They
Token ID1119
Feature activation+3.04e-02

Pos logit contributions
arl+0.019
ps+0.018
ags+0.017
agged+0.016
outing+0.016
Neg logit contributions
-0.019
light-0.017
 whenever-0.017
 Tues-0.015
 nap-0.015
Token ran
Token ID4966
Feature activation+4.22e-02

Pos logit contributions
arl+0.058
ps+0.052
 stern+0.050
ags+0.049
Their+0.048
Neg logit contributions
-0.041
light-0.041
?'-0.035
 awoke-0.033
 Tues-0.033
Token to
Token ID284
Feature activation+3.87e-02

Pos logit contributions
arl+0.100
ps+0.088
ags+0.087
outing+0.086
agged+0.086
Neg logit contributions
-0.047
 whenever-0.038
light-0.037
 nap-0.030
 awoke-0.030
Token the
Token ID262
Feature activation+2.27e-02

Pos logit contributions
arl+0.100
agged+0.092
outing+0.087
ps+0.086
outed+0.085
Neg logit contributions
-0.035
 nap-0.028
 whenever-0.027
 prepare-0.026
 visit-0.026
Token officer
Token ID3818
Feature activation+1.44e-01

Pos logit contributions
arl+0.040
agged+0.037
ags+0.034
ps+0.034
uzz+0.034
Neg logit contributions
-0.039
light-0.033
 nap-0.032
 tucked-0.030
's-0.030
Token and
Token ID290
Feature activation+9.02e-03

Pos logit contributions
arl+0.289
ps+0.260
 stern+0.255
ags+0.244
uzz+0.241
Neg logit contributions
light-0.175
-0.173
 awoke-0.149
?'-0.148
 Tues-0.141
Token hugged
Token ID41130
Feature activation+2.06e-02

Pos logit contributions
arl+0.020
ps+0.017
ags+0.017
agged+0.016
 stern+0.016
Neg logit contributions
-0.011
light-0.010
 awoke-0.009
pen-0.008
 whenever-0.008
Token Spot
Token ID15899
Feature activation+3.38e-02

Pos logit contributions
arl+0.051
agged+0.045
ps+0.045
ses+0.044
ags+0.044
Neg logit contributions
-0.022
light-0.017
 whenever-0.017
pen-0.014
?'-0.013
Token.
Token ID13
Feature activation+7.71e-03

Pos logit contributions
arl+0.081
ps+0.069
ags+0.068
agged+0.068
orts+0.066
Neg logit contributions
-0.039
light-0.034
 whenever-0.029
pen-0.026
's-0.025
Token They
Token ID1119
Feature activation+2.70e-02

Pos logit contributions
arl+0.015
ps+0.014
ags+0.013
agged+0.013
 stern+0.013
Neg logit contributions
-0.011
light-0.010
 whenever-0.009
 awoke-0.008
?'-0.008
TokenRachel
Token ID44045
Feature activation-2.54e-02

Pos logit contributions
light+0.018
+0.018
 awoke+0.016
pen+0.016
?'+0.015
Neg logit contributions
arl-0.022
ps-0.019
Their-0.019
 stern-0.019
ags-0.019
Token then
Token ID788
Feature activation-3.24e-02

Pos logit contributions
umm+0.037
light+0.034
 avail+0.034
 Tues+0.033
?'+0.033
Neg logit contributions
ps-0.039
 stern-0.037
ks-0.036
Their-0.034
outed-0.034
Token went
Token ID1816
Feature activation-1.09e-02

Pos logit contributions
umm+0.046
light+0.046
 Tues+0.044
+0.042
ridge+0.041
Neg logit contributions
 stern-0.051
ks-0.044
 greedy-0.043
 angry-0.043
 mad-0.043
Token through
Token ID832
Feature activation+1.74e-02

Pos logit contributions
+0.014
light+0.013
 awoke+0.011
 whenever+0.011
 tucked+0.011
Neg logit contributions
arl-0.023
ps-0.020
ses-0.020
ags-0.019
agged-0.019
Token the
Token ID262
Feature activation+2.05e-03

Pos logit contributions
arl+0.030
ps+0.028
 stern+0.027
ags+0.026
Their+0.025
Neg logit contributions
-0.026
light-0.025
 awoke-0.022
?'-0.021
 bal-0.021
Token security
Token ID2324
Feature activation+1.43e-01

Pos logit contributions
arl+0.004
ps+0.003
 stern+0.003
ags+0.003
Their+0.003
Neg logit contributions
-0.003
light-0.003
?'-0.003
 awoke-0.003
pen-0.003
Token gate
Token ID8946
Feature activation+6.47e-02

Pos logit contributions
arl+0.256
ps+0.214
ags+0.210
agged+0.203
 stern+0.203
Neg logit contributions
-0.231
light-0.225
 awoke-0.194
 whenever-0.190
pen-0.188
Token.
Token ID13
Feature activation-7.74e-03

Pos logit contributions
arl+0.132
ps+0.116
 stern+0.113
ags+0.113
agged+0.109
Neg logit contributions
-0.083
light-0.079
 awoke-0.067
?'-0.066
pen-0.064
Token She
Token ID1375
Feature activation-1.11e-02

Pos logit contributions
light+0.012
+0.012
pen+0.011
 awoke+0.010
 bal+0.010
Neg logit contributions
arl-0.013
ps-0.011
 stern-0.011
ags-0.011
Their-0.010
Token was
Token ID373
Feature activation-8.10e-03

Pos logit contributions
umm+0.018
light+0.018
+0.017
 masse+0.017
?'+0.016
Neg logit contributions
 stern-0.014
ps-0.014
outed-0.014
ks-0.014
arl-0.013
Token so
Token ID523
Feature activation-1.27e-02

Pos logit contributions
+0.014
light+0.013
 awoke+0.012
 whenever+0.011
?'+0.011
Neg logit contributions
arl-0.014
ps-0.011
ags-0.011
orts-0.011
agged-0.011
Token much
Token ID881
Feature activation+1.46e-02

Pos logit contributions
arl+0.113
ags+0.101
ps+0.100
orts+0.098
agged+0.098
Neg logit contributions
-0.060
light-0.049
 whenever-0.045
 since-0.041
 awoke-0.041
Token."
Token ID526
Feature activation+3.41e-02

Pos logit contributions
arl+0.026
ps+0.023
ags+0.021
 stern+0.020
orts+0.020
Neg logit contributions
-0.023
light-0.023
 awoke-0.020
?'-0.020
pen-0.020
Token
Token ID198
Feature activation+2.03e-02

Pos logit contributions
arl+0.076
ps+0.069
ags+0.066
 stern+0.064
agged+0.064
Neg logit contributions
-0.040
light-0.036
 whenever-0.030
?'-0.029
 awoke-0.029
Token
Token ID198
Feature activation+1.79e-02

Pos logit contributions
arl+0.046
ps+0.042
ags+0.040
 stern+0.039
agged+0.039
Neg logit contributions
-0.023
light-0.020
?'-0.016
 whenever-0.016
 awoke-0.016
TokenThe
Token ID464
Feature activation+4.08e-02

Pos logit contributions
arl+0.038
ps+0.036
ses+0.035
ags+0.034
agged+0.033
Neg logit contributions
-0.024
light-0.020
pen-0.016
 whenever-0.016
?'-0.016
Token officer
Token ID3818
Feature activation+1.43e-01

Pos logit contributions
arl+0.057
ps+0.051
uzz+0.050
outing+0.048
orts+0.048
Neg logit contributions
-0.077
 whenever-0.063
 nap-0.063
 masse-0.063
light-0.063
Token laughed
Token ID13818
Feature activation+2.45e-02

Pos logit contributions
arl+0.334
ags+0.285
ps+0.270
orts+0.263
agged+0.262
Neg logit contributions
-0.180
light-0.166
 awoke-0.146
 tucked-0.142
 laid-0.136
Token and
Token ID290
Feature activation+2.05e-02

Pos logit contributions
arl+0.058
ps+0.052
ags+0.050
orts+0.050
ses+0.050
Neg logit contributions
-0.025
light-0.022
 whenever-0.019
 awoke-0.017
 tucked-0.016
Token pet
Token ID4273
Feature activation+1.64e-02

Pos logit contributions
arl+0.044
ags+0.038
ps+0.038
uzz+0.036
agged+0.036
Neg logit contributions
-0.026
light-0.024
 awoke-0.021
 tucked-0.021
 laid-0.020
Tokented
Token ID1513
Feature activation+8.55e-03

Pos logit contributions
arl+0.035
ps+0.030
ags+0.030
 stern+0.028
ses+0.028
Neg logit contributions
-0.021
light-0.020
 awoke-0.017
 whenever-0.017
pen-0.017
Token Spot
Token ID15899
Feature activation+5.63e-02

Pos logit contributions
arl+0.020
outing+0.016
ses+0.016
ags+0.016
ps+0.016
Neg logit contributions
-0.010
 whenever-0.008
light-0.008
pen-0.007
 awoke-0.007
Token.
Token ID13
Feature activation+1.69e-02

Pos logit contributions
arl+0.011
ags+0.010
ses+0.010
ps+0.010
agged+0.010
Neg logit contributions
-0.005
light-0.004
 whenever-0.004
 since-0.003
?'-0.003
Token He
Token ID679
Feature activation+2.21e-02

Pos logit contributions
arl+0.039
ps+0.037
ags+0.034
outing+0.034
agged+0.034
Neg logit contributions
-0.017
light-0.013
 whenever-0.012
 Tues-0.011
?'-0.011
Token kept
Token ID4030
Feature activation-5.06e-02

Pos logit contributions
arl+0.042
ps+0.037
ags+0.036
 stern+0.035
agged+0.035
Neg logit contributions
-0.031
light-0.031
 awoke-0.026
?'-0.025
 whenever-0.025
Token throwing
Token ID9644
Feature activation+8.06e-03

Pos logit contributions
+0.061
light+0.051
 pol+0.048
 Tues+0.046
pen+0.046
Neg logit contributions
arl-0.110
agged-0.092
uzz-0.090
ags-0.090
outing-0.089
Token a
Token ID257
Feature activation+2.78e-02

Pos logit contributions
arl+0.016
ps+0.013
agged+0.013
ags+0.013
uzz+0.013
Neg logit contributions
-0.012
light-0.011
 whenever-0.009
pen-0.009
 awoke-0.009
Token tant
Token ID24246
Feature activation+1.43e-01

Pos logit contributions
arl+0.047
ps+0.043
ags+0.042
agged+0.041
uzz+0.039
Neg logit contributions
-0.046
light-0.043
 nap-0.038
 awoke-0.038
pen-0.038
Tokenrum
Token ID6582
Feature activation+3.89e-02

Pos logit contributions
arl+0.248
ses+0.202
ags+0.200
sed+0.191
 stern+0.188
Neg logit contributions
light-0.219
-0.215
pen-0.194
?'-0.193
 bal-0.183
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
arl+0.087
ags+0.075
ps+0.075
orts+0.073
agged+0.073
Neg logit contributions
-0.046
light-0.043
 whenever-0.036
 awoke-0.033
?'-0.033
Token His
Token ID2399
Feature activation+2.94e-02

Pos logit contributions
arl+0.024
ps+0.023
chie+0.021
ags+0.021
outing+0.021
Neg logit contributions
-0.012
light-0.010
 whenever-0.009
 Tues-0.009
?'-0.008
Token mom
Token ID1995
Feature activation-2.65e-02

Pos logit contributions
arl+0.056
ps+0.049
ags+0.047
agged+0.047
orts+0.046
Neg logit contributions
-0.042
light-0.040
 nap-0.034
 awoke-0.034
 whenever-0.033
Token was
Token ID373
Feature activation-5.74e-03

Pos logit contributions
+0.033
light+0.032
 awoke+0.026
?'+0.026
 whenever+0.025
Neg logit contributions
arl-0.056
ps-0.049
ags-0.048
 stern-0.047
agged-0.046
Token his
Token ID465
Feature activation+6.43e-02

Pos logit contributions
arl+0.091
ses+0.085
sed+0.081
outing+0.079
itates+0.077
Neg logit contributions
-0.025
 whenever-0.021
light-0.020
 awoke-0.018
 when-0.018
Token feet
Token ID3625
Feature activation-2.10e-02

Pos logit contributions
arl+0.111
ps+0.092
ags+0.090
agged+0.088
 stern+0.088
Neg logit contributions
-0.106
light-0.101
 awoke-0.087
 nap-0.086
 whenever-0.085
Token and
Token ID290
Feature activation+1.24e-02

Pos logit contributions
+0.024
light+0.022
 whenever+0.018
 awoke+0.018
pen+0.017
Neg logit contributions
arl-0.047
ps-0.041
ags-0.041
agged-0.040
 stern-0.040
Token threw
Token ID9617
Feature activation+2.17e-02

Pos logit contributions
arl+0.024
ags+0.020
ps+0.020
agged+0.019
uzz+0.019
Neg logit contributions
-0.018
light-0.018
 awoke-0.015
 tucked-0.015
 whenever-0.014
Token a
Token ID257
Feature activation+2.65e-02

Pos logit contributions
arl+0.046
outing+0.039
agged+0.038
itates+0.037
ses+0.037
Neg logit contributions
-0.029
light-0.024
 whenever-0.022
 nap-0.021
 awoke-0.021
Token tant
Token ID24246
Feature activation+1.43e-01

Pos logit contributions
arl+0.044
ps+0.040
ags+0.039
agged+0.038
uzz+0.037
Neg logit contributions
-0.045
light-0.041
 nap-0.037
 awoke-0.036
pen-0.036
Tokenrum
Token ID6582
Feature activation+4.55e-02

Pos logit contributions
arl+0.234
 stern+0.183
ags+0.179
ses+0.177
ps+0.170
Neg logit contributions
light-0.234
-0.230
pen-0.211
?'-0.205
 whenever-0.198
Token.
Token ID13
Feature activation+5.29e-03

Pos logit contributions
arl+0.100
ags+0.086
ps+0.085
 stern+0.083
agged+0.082
Neg logit contributions
-0.055
light-0.052
 whenever-0.042
 awoke-0.041
?'-0.041
Token
Token ID198
Feature activation-3.33e-03

Pos logit contributions
arl+0.012
ps+0.011
ags+0.011
agged+0.010
chie+0.010
Neg logit contributions
-0.006
light-0.005
 whenever-0.004
?'-0.004
 Tues-0.004
Token
Token ID198
Feature activation-2.41e-03

Pos logit contributions
+0.004
light+0.003
?'+0.002
 whenever+0.002
 awoke+0.002
Neg logit contributions
arl-0.008
ps-0.007
ags-0.007
ses-0.007
agged-0.007
Token"
Token ID1
Feature activation+4.34e-02

Pos logit contributions
+0.003
light+0.003
?'+0.003
 whenever+0.003
 awoke+0.003
Neg logit contributions
arl-0.005
ps-0.004
ags-0.004
ses-0.004
agged-0.004
Token stubborn
Token ID24444
Feature activation-2.01e-02

Pos logit contributions
arl+0.034
orts+0.026
uzz+0.026
chie+0.026
Their+0.025
Neg logit contributions
-0.040
 tucked-0.035
 laid-0.033
 pol-0.033
 nap-0.032
Token.
Token ID13
Feature activation+9.50e-03

Pos logit contributions
+0.028
light+0.028
 awoke+0.024
?'+0.023
pen+0.023
Neg logit contributions
arl-0.039
ps-0.034
 stern-0.033
ags-0.032
agged-0.031
Token She
Token ID1375
Feature activation+3.90e-04

Pos logit contributions
arl+0.019
ps+0.016
ags+0.016
 stern+0.015
agged+0.015
Neg logit contributions
-0.013
light-0.013
 awoke-0.011
pen-0.011
?'-0.011
Token throws
Token ID12542
Feature activation+3.46e-02

Pos logit contributions
arl+0.001
ps+0.000
ags+0.000
 stern+0.000
agged+0.000
Neg logit contributions
light-0.001
-0.001
 awoke-0.001
?'-0.001
 whenever-0.001
Token a
Token ID257
Feature activation+3.47e-02

Pos logit contributions
arl+0.078
agged+0.066
outing+0.065
ses+0.065
uzz+0.065
Neg logit contributions
-0.041
light-0.033
 whenever-0.032
 nap-0.030
pen-0.029
Token tant
Token ID24246
Feature activation+1.43e-01

Pos logit contributions
arl+0.050
ps+0.046
ags+0.044
agged+0.043
uzz+0.041
Neg logit contributions
-0.065
light-0.060
 nap-0.055
 bal-0.054
pen-0.054
Tokenrum
Token ID6582
Feature activation+4.40e-02

Pos logit contributions
arl+0.245
ags+0.191
ses+0.191
 stern+0.188
ps+0.181
Neg logit contributions
light-0.226
-0.221
pen-0.199
?'-0.198
 whenever-0.193
Token.
Token ID13
Feature activation+8.94e-03

Pos logit contributions
arl+0.096
ps+0.082
ags+0.082
 stern+0.081
agged+0.080
Neg logit contributions
-0.054
light-0.051
 whenever-0.041
 awoke-0.040
?'-0.040
Token She
Token ID1375
Feature activation+1.77e-03

Pos logit contributions
arl+0.018
ps+0.016
ags+0.015
 stern+0.015
agged+0.015
Neg logit contributions
-0.012
light-0.011
 awoke-0.009
?'-0.009
 whenever-0.009
Token kicks
Token ID17364
Feature activation+4.13e-02

Pos logit contributions
arl+0.003
ps+0.002
 stern+0.002
ags+0.002
orts+0.002
Neg logit contributions
light-0.003
-0.003
 awoke-0.003
?'-0.003
 whenever-0.003
Token and
Token ID290
Feature activation+2.91e-02

Pos logit contributions
arl+0.096
agged+0.083
ps+0.081
ses+0.081
ags+0.080
Neg logit contributions
-0.047
light-0.037
 whenever-0.037
pen-0.032
 nap-0.032
Token brave
Token ID14802
Feature activation-2.05e-02

Pos logit contributions
arl+0.003
ags+0.002
ses+0.002
ps+0.002
agged+0.002
Neg logit contributions
-0.003
light-0.003
 whenever-0.003
 tucked-0.002
 masse-0.002
Token and
Token ID290
Feature activation+4.30e-03

Pos logit contributions
light+0.029
+0.029
 awoke+0.025
?'+0.025
 recite+0.024
Neg logit contributions
arl-0.037
ps-0.033
 stern-0.031
ks-0.030
ags-0.030
Token for
Token ID329
Feature activation-2.93e-04

Pos logit contributions
arl+0.007
ps+0.006
ags+0.006
 stern+0.006
agged+0.006
Neg logit contributions
-0.007
light-0.007
 awoke-0.006
?'-0.006
 whenever-0.006
Token winning
Token ID5442
Feature activation+6.22e-02

Pos logit contributions
+0.000
light+0.000
 awoke+0.000
 whenever+0.000
?'+0.000
Neg logit contributions
arl-0.001
ps-0.001
ags-0.000
 stern-0.000
agged-0.000
Token the
Token ID262
Feature activation+1.16e-02

Pos logit contributions
arl+0.131
ps+0.114
ags+0.111
 stern+0.110
agged+0.108
Neg logit contributions
-0.080
light-0.075
 awoke-0.062
?'-0.061
 whenever-0.061
Token fight
Token ID1907
Feature activation+1.42e-01

Pos logit contributions
arl+0.021
ps+0.018
ags+0.018
agged+0.017
 stern+0.017
Neg logit contributions
-0.018
light-0.017
 awoke-0.015
?'-0.015
 whenever-0.015
Token.
Token ID13
Feature activation+1.63e-04

Pos logit contributions
arl+0.258
ps+0.234
 stern+0.218
ags+0.214
Their+0.209
Neg logit contributions
light-0.202
-0.193
 awoke-0.174
?'-0.167
pen-0.164
Token<|endoftext|>
Token ID50256
Feature activation-8.64e-03

Pos logit contributions
arl+0.000
ps+0.000
ags+0.000
 stern+0.000
agged+0.000
Neg logit contributions
-0.000
light-0.000
 awoke-0.000
?'-0.000
pen-0.000
TokenOnce
Token ID7454
Feature activation+2.76e-03

Pos logit contributions
light+0.011
+0.011
 awoke+0.009
pen+0.009
?'+0.009
Neg logit contributions
arl-0.017
ps-0.015
ags-0.015
agged-0.014
 stern-0.014
Token there
Token ID612
Feature activation+7.29e-03

Pos logit contributions
arl+0.006
ps+0.005
ags+0.005
 stern+0.005
agged+0.005
Neg logit contributions
-0.003
light-0.003
 whenever-0.003
?'-0.003
 awoke-0.003
Token were
Token ID547
Feature activation-1.97e-02

Pos logit contributions
arl+0.012
uzz+0.012
 stern+0.011
Their+0.011
ps+0.011
Neg logit contributions
light-0.010
 masse-0.009
ridge-0.009
-0.009
 recite-0.008
Token,
Token ID11
Feature activation-3.61e-04

Pos logit contributions
arl+0.083
ags+0.075
agged+0.074
ps+0.073
 apology+0.072
Neg logit contributions
-0.030
light-0.027
 whenever-0.020
 awoke-0.018
pen-0.018
Token the
Token ID262
Feature activation-2.67e-03

Pos logit contributions
+0.000
light+0.000
 whenever+0.000
 awoke+0.000
 tucked+0.000
Neg logit contributions
arl-0.001
ags-0.001
ps-0.001
agged-0.001
 stern-0.001
Token merry
Token ID44570
Feature activation+9.65e-02

Pos logit contributions
+0.004
light+0.004
 nap+0.003
 awoke+0.003
 whenever+0.003
Neg logit contributions
arl-0.005
ps-0.004
agged-0.004
ags-0.004
Their-0.004
Token-
Token ID12
Feature activation+1.06e-01

Pos logit contributions
arl+0.204
 stern+0.177
ps+0.169
outing+0.168
itates+0.168
Neg logit contributions
light-0.112
-0.100
 whenever-0.098
 nap-0.094
 night-0.093
Tokengo
Token ID2188
Feature activation+6.87e-02

Pos logit contributions
arl+0.219
 stern+0.192
ses+0.189
ps+0.188
ags+0.185
Neg logit contributions
-0.140
light-0.137
 bal-0.120
 whenever-0.107
 laid-0.102
Token-
Token ID12
Feature activation+1.42e-01

Pos logit contributions
arl+0.143
 stern+0.122
ps+0.117
ags+0.116
agged+0.115
Neg logit contributions
light-0.096
-0.093
 whenever-0.076
 nap-0.073
 awoke-0.072
Tokenround
Token ID744
Feature activation+4.03e-02

Pos logit contributions
arl+0.254
 stern+0.219
ps+0.212
ags+0.212
ses+0.210
Neg logit contributions
-0.242
light-0.226
 whenever-0.207
 bal-0.200
's-0.184
Token.
Token ID13
Feature activation+5.59e-03

Pos logit contributions
arl+0.089
ps+0.077
ags+0.076
 stern+0.075
agged+0.075
Neg logit contributions
-0.048
light-0.046
 whenever-0.036
 awoke-0.036
pen-0.036
Token They
Token ID1119
Feature activation+2.23e-02

Pos logit contributions
arl+0.009
ps+0.008
ags+0.007
 stern+0.007
outing+0.007
Neg logit contributions
-0.010
light-0.009
 whenever-0.008
 Tues-0.008
 awoke-0.008
Token think
Token ID892
Feature activation+1.86e-02

Pos logit contributions
arl+0.042
ps+0.036
ags+0.035
 stern+0.034
agged+0.033
Neg logit contributions
-0.033
light-0.032
 awoke-0.028
?'-0.027
pen-0.027
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
arl+0.037
ps+0.032
ags+0.031
 stern+0.031
agged+0.030
Neg logit contributions
-0.026
light-0.025
 awoke-0.021
 whenever-0.021
?'-0.020
Token tickets
Token ID8587
Feature activation+4.11e-02

Pos logit contributions
arl+0.046
ps+0.039
ags+0.038
agged+0.037
orts+0.037
Neg logit contributions
-0.047
light-0.045
 whenever-0.039
 awoke-0.039
?'-0.038
Token and
Token ID290
Feature activation-5.18e-03

Pos logit contributions
arl+0.078
ps+0.070
 stern+0.067
ags+0.066
Their+0.063
Neg logit contributions
-0.057
light-0.055
?'-0.048
 awoke-0.047
pen-0.047
Token went
Token ID1816
Feature activation-1.53e-03

Pos logit contributions
+0.007
light+0.007
?'+0.007
pen+0.006
 awoke+0.006
Neg logit contributions
arl-0.009
ps-0.008
 stern-0.008
Their-0.008
ags-0.008
Token to
Token ID284
Feature activation+2.72e-02

Pos logit contributions
+0.002
light+0.001
 whenever+0.001
 tucked+0.001
 nap+0.001
Neg logit contributions
arl-0.003
ses-0.003
ps-0.003
outing-0.003
 apology-0.003
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
arl+0.061
ps+0.053
agged+0.053
ags+0.052
outing+0.051
Neg logit contributions
-0.033
light-0.029
 whenever-0.025
?'-0.025
 nap-0.024
Token security
Token ID2324
Feature activation+1.42e-01

Pos logit contributions
arl+0.023
ps+0.020
ags+0.019
 stern+0.019
Their+0.019
Neg logit contributions
-0.016
light-0.015
 awoke-0.013
?'-0.013
pen-0.012
Token gate
Token ID8946
Feature activation+5.88e-02

Pos logit contributions
arl+0.252
ps+0.210
ags+0.208
agged+0.203
 stern+0.197
Neg logit contributions
-0.233
light-0.225
 awoke-0.196
 whenever-0.193
?'-0.189
Token.
Token ID13
Feature activation+1.52e-03

Pos logit contributions
arl+0.125
ps+0.109
ags+0.107
agged+0.104
 stern+0.104
Neg logit contributions
-0.075
light-0.071
 awoke-0.058
 whenever-0.058
?'-0.057
Token
Token ID198
Feature activation-7.60e-03

Pos logit contributions
arl+0.003
ps+0.003
ags+0.003
 stern+0.003
ses+0.003
Neg logit contributions
-0.002
light-0.001
 whenever-0.001
?'-0.001
 awoke-0.001
Token
Token ID198
Feature activation-5.57e-03

Pos logit contributions
+0.009
light+0.009
 awoke+0.007
?'+0.007
 whenever+0.007
Neg logit contributions
arl-0.016
ps-0.014
ags-0.014
 stern-0.014
agged-0.014
TokenAt
Token ID2953
Feature activation-1.14e-02

Pos logit contributions
+0.007
light+0.007
 awoke+0.006
?'+0.006
pen+0.006
Neg logit contributions
arl-0.011
ps-0.010
ags-0.010
 stern-0.009
agged-0.009
Token stopped
Token ID5025
Feature activation+3.79e-02

Pos logit contributions
light+0.074
+0.073
?'+0.065
 whenever+0.065
 Tues+0.064
Neg logit contributions
arl-0.074
ps-0.066
 stern-0.064
ags-0.059
Their-0.058
Token the
Token ID262
Feature activation+7.85e-03

Pos logit contributions
arl+0.075
ps+0.065
ags+0.063
 stern+0.062
Their+0.061
Neg logit contributions
-0.053
light-0.053
 awoke-0.045
?'-0.043
pen-0.043
Token merry
Token ID44570
Feature activation+1.10e-01

Pos logit contributions
arl+0.013
ps+0.011
ags+0.010
agged+0.010
 stern+0.010
Neg logit contributions
-0.014
light-0.013
 awoke-0.012
?'-0.011
 whenever-0.011
Token-
Token ID12
Feature activation+1.16e-01

Pos logit contributions
arl+0.184
ps+0.157
ags+0.151
 stern+0.146
agged+0.144
Neg logit contributions
-0.189
light-0.180
 awoke-0.157
?'-0.156
pen-0.153
Tokengo
Token ID2188
Feature activation+7.18e-02

Pos logit contributions
arl+0.216
ps+0.191
ags+0.189
ses+0.186
sed+0.178
Neg logit contributions
-0.182
light-0.172
 bal-0.152
 whenever-0.148
 tucked-0.134
Token-
Token ID12
Feature activation+1.41e-01

Pos logit contributions
arl+0.120
ps+0.103
ags+0.098
 stern+0.096
agged+0.094
Neg logit contributions
-0.121
light-0.117
 awoke-0.103
?'-0.101
pen-0.100
Tokenround
Token ID744
Feature activation+4.58e-02

Pos logit contributions
arl+0.234
ps+0.204
ags+0.201
ses+0.196
 stern+0.194
Neg logit contributions
-0.250
light-0.235
 whenever-0.215
 bal-0.206
 Tues-0.196
Token and
Token ID290
Feature activation+8.09e-03

Pos logit contributions
arl+0.098
ps+0.087
ags+0.084
 stern+0.083
agged+0.082
Neg logit contributions
-0.055
light-0.053
 awoke-0.043
?'-0.042
pen-0.041
Token helped
Token ID4193
Feature activation+4.33e-03

Pos logit contributions
arl+0.014
ps+0.012
 stern+0.012
ags+0.012
orts+0.011
Neg logit contributions
-0.012
light-0.012
?'-0.011
pen-0.010
 awoke-0.010
Token them
Token ID606
Feature activation+6.46e-02

Pos logit contributions
arl+0.006
ps+0.005
ags+0.005
 stern+0.004
Their+0.004
Neg logit contributions
-0.009
light-0.009
 awoke-0.008
?'-0.007
pen-0.007
Token get
Token ID651
Feature activation+2.64e-02

Pos logit contributions
arl+0.121
ps+0.111
ags+0.104
Their+0.100
uzz+0.100
Neg logit contributions
light-0.091
-0.083
 awoke-0.080
umm-0.077
 nights-0.074
Token surprise
Token ID5975
Feature activation+2.77e-02

Pos logit contributions
+0.017
light+0.016
 awoke+0.014
?'+0.014
 whenever+0.014
Neg logit contributions
arl-0.020
ps-0.017
ags-0.017
agged-0.017
Their-0.017
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
arl+0.051
ps+0.045
 stern+0.043
ags+0.042
Their+0.041
Neg logit contributions
-0.040
light-0.039
 awoke-0.034
?'-0.033
pen-0.033
Token He
Token ID679
Feature activation+1.50e-02

Pos logit contributions
arl+0.022
ps+0.021
ags+0.019
 stern+0.019
outing+0.019
Neg logit contributions
-0.012
light-0.010
 whenever-0.009
?'-0.009
 awoke-0.008
Token threw
Token ID9617
Feature activation+1.49e-02

Pos logit contributions
arl+0.031
ags+0.026
ps+0.026
 stern+0.025
agged+0.024
Neg logit contributions
-0.021
light-0.019
 awoke-0.017
 tucked-0.016
 laid-0.016
Token a
Token ID257
Feature activation+2.72e-02

Pos logit contributions
arl+0.033
agged+0.028
ps+0.028
outing+0.027
ags+0.027
Neg logit contributions
-0.018
light-0.016
 whenever-0.014
pen-0.013
 awoke-0.013
Token tant
Token ID24246
Feature activation+1.41e-01

Pos logit contributions
arl+0.048
ps+0.045
ags+0.044
agged+0.043
uzz+0.042
Neg logit contributions
-0.042
light-0.039
 nap-0.035
 awoke-0.034
pen-0.034
Tokenrum
Token ID6582
Feature activation+4.18e-02

Pos logit contributions
arl+0.241
ags+0.192
ses+0.192
 stern+0.186
ps+0.181
Neg logit contributions
light-0.223
-0.215
pen-0.196
?'-0.193
 whenever-0.188
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
arl+0.093
ags+0.079
ps+0.079
 stern+0.077
orts+0.076
Neg logit contributions
-0.050
light-0.048
 whenever-0.039
 awoke-0.038
?'-0.037
Token He
Token ID679
Feature activation+1.84e-02

Pos logit contributions
arl+0.022
ps+0.020
ags+0.019
 stern+0.019
outing+0.018
Neg logit contributions
-0.013
light-0.012
 whenever-0.010
?'-0.010
 awoke-0.010
Token kicked
Token ID12165
Feature activation+2.80e-02

Pos logit contributions
arl+0.036
ps+0.031
ags+0.030
 stern+0.030
agged+0.029
Neg logit contributions
-0.026
light-0.025
 awoke-0.021
?'-0.021
pen-0.020
Token and
Token ID290
Feature activation+2.29e-02

Pos logit contributions
arl+0.065
agged+0.056
ags+0.055
ps+0.055
outing+0.055
Neg logit contributions
-0.032
light-0.026
 whenever-0.024
pen-0.023
 nap-0.022
Token stopped
Token ID5025
Feature activation+3.98e-02

Pos logit contributions
arl+0.020
ags+0.018
agged+0.017
ps+0.016
orts+0.016
Neg logit contributions
-0.009
light-0.008
 awoke-0.007
 tucked-0.007
 laid-0.007
Token the
Token ID262
Feature activation+1.22e-02

Pos logit contributions
arl+0.074
ps+0.064
ags+0.062
 stern+0.061
Their+0.059
Neg logit contributions
-0.060
light-0.059
 awoke-0.051
?'-0.049
pen-0.049
Token merry
Token ID44570
Feature activation+1.15e-01

Pos logit contributions
arl+0.021
agged+0.018
ps+0.018
ags+0.018
uzz+0.017
Neg logit contributions
-0.020
light-0.018
 nap-0.016
 awoke-0.016
?'-0.015
Token-
Token ID12
Feature activation+1.19e-01

Pos logit contributions
arl+0.223
ps+0.192
 stern+0.186
ags+0.185
agged+0.181
Neg logit contributions
light-0.158
-0.156
 whenever-0.135
 nap-0.132
 awoke-0.130
Tokengo
Token ID2188
Feature activation+7.27e-02

Pos logit contributions
arl+0.223
ags+0.203
ses+0.196
ps+0.190
sed+0.188
Neg logit contributions
-0.189
light-0.174
 bal-0.153
 whenever-0.143
on-0.142
Token-
Token ID12
Feature activation+1.41e-01

Pos logit contributions
arl+0.147
ags+0.124
 stern+0.123
ps+0.122
ses+0.121
Neg logit contributions
light-0.105
-0.104
 awoke-0.081
 whenever-0.081
pen-0.080
Tokenround
Token ID744
Feature activation+4.43e-02

Pos logit contributions
arl+0.249
ags+0.225
ses+0.219
ps+0.213
sed+0.212
Neg logit contributions
-0.239
light-0.217
 whenever-0.193
 bal-0.189
's-0.177
Token.
Token ID13
Feature activation+9.05e-03

Pos logit contributions
arl+0.100
ags+0.086
ps+0.086
agged+0.084
 stern+0.083
Neg logit contributions
-0.052
light-0.051
 whenever-0.040
 awoke-0.039
pen-0.039
Token Tom
Token ID4186
Feature activation+3.82e-02

Pos logit contributions
ps+0.021
arl+0.020
chie+0.020
kind+0.019
outing+0.019
Neg logit contributions
-0.010
 whenever-0.006
 Little-0.006
light-0.006
?'-0.005
Token and
Token ID290
Feature activation+1.29e-02

Pos logit contributions
arl+0.065
ps+0.063
 stern+0.061
ks+0.056
outing+0.055
Neg logit contributions
light-0.056
-0.052
?'-0.050
 Tues-0.048
 supper-0.047
Token Lily
Token ID20037
Feature activation+1.01e-02

Pos logit contributions
arl+0.030
ags+0.028
agged+0.026
outing+0.026
ps+0.026
Neg logit contributions
-0.015
 lay-0.013
 laid-0.013
 tucked-0.012
 chew-0.012
Token to
Token ID284
Feature activation+3.89e-02

Pos logit contributions
arl+0.097
outing+0.090
ses+0.089
agged+0.088
ps+0.088
Neg logit contributions
-0.038
 whenever-0.029
light-0.022
 bal-0.022
 Tues-0.021
Token the
Token ID262
Feature activation+2.25e-02

Pos logit contributions
arl+0.103
agged+0.097
outing+0.091
outed+0.090
itates+0.089
Neg logit contributions
-0.032
 nap-0.027
 visit-0.025
 prepare-0.025
 whenever-0.024
Token merry
Token ID44570
Feature activation+1.16e-01

Pos logit contributions
arl+0.037
uzz+0.032
agged+0.032
ps+0.032
ags+0.031
Neg logit contributions
-0.039
light-0.035
 awoke-0.032
 nap-0.032
 tucked-0.032
Token-
Token ID12
Feature activation+1.20e-01

Pos logit contributions
arl+0.224
ps+0.192
 stern+0.188
ags+0.182
ses+0.179
Neg logit contributions
light-0.161
-0.161
 whenever-0.143
 awoke-0.139
 Tues-0.136
Tokengo
Token ID2188
Feature activation+6.75e-02

Pos logit contributions
arl+0.231
ses+0.207
ps+0.203
ags+0.198
 stern+0.197
Neg logit contributions
-0.181
light-0.162
 bal-0.149
 whenever-0.148
's-0.132
Token-
Token ID12
Feature activation+1.41e-01

Pos logit contributions
arl+0.132
ps+0.112
 stern+0.111
ses+0.109
ags+0.108
Neg logit contributions
-0.100
light-0.096
 whenever-0.082
 awoke-0.078
pen-0.077
Tokenround
Token ID744
Feature activation+4.83e-02

Pos logit contributions
arl+0.241
ps+0.213
ses+0.211
 stern+0.208
ags+0.203
Neg logit contributions
-0.244
light-0.221
 whenever-0.209
 bal-0.198
 Tues-0.189
Token.
Token ID13
Feature activation+8.09e-03

Pos logit contributions
arl+0.112
ps+0.097
agged+0.096
ags+0.096
 stern+0.096
Neg logit contributions
-0.053
light-0.051
 whenever-0.042
 awoke-0.038
pen-0.038
Token
Token ID198
Feature activation+4.78e-04

Pos logit contributions
arl+0.015
ps+0.015
chie+0.014
 stern+0.013
outing+0.013
Neg logit contributions
-0.012
 whenever-0.010
light-0.010
?'-0.009
 Little-0.009
Token
Token ID198
Feature activation+3.30e-03

Pos logit contributions
arl+0.001
ps+0.001
ags+0.001
 stern+0.001
agged+0.001
Neg logit contributions
-0.001
light-0.001
 awoke-0.000
?'-0.000
 whenever-0.000
TokenThey
Token ID2990
Feature activation+3.16e-02

Pos logit contributions
arl+0.006
ps+0.005
ags+0.005
 stern+0.005
Their+0.005
Neg logit contributions
-0.005
light-0.005
 awoke-0.005
?'-0.004
pen-0.004
Token said
Token ID531
Feature activation+1.51e-02

Pos logit contributions
arl+0.081
ps+0.071
ags+0.068
 stern+0.068
agged+0.066
Neg logit contributions
-0.056
light-0.055
 awoke-0.046
?'-0.045
 whenever-0.045
Token.
Token ID13
Feature activation+3.23e-02

Pos logit contributions
arl+0.035
ps+0.030
ags+0.030
agged+0.029
ses+0.029
Neg logit contributions
-0.017
light-0.016
 whenever-0.013
pen-0.012
?'-0.012
Token
Token ID198
Feature activation+2.02e-02

Pos logit contributions
arl+0.071
ps+0.065
ags+0.062
agged+0.061
 stern+0.060
Neg logit contributions
-0.038
light-0.034
 whenever-0.028
?'-0.028
 awoke-0.027
Token
Token ID198
Feature activation+1.85e-02

Pos logit contributions
arl+0.043
ps+0.038
ags+0.037
 stern+0.037
agged+0.036
Neg logit contributions
-0.025
light-0.024
 awoke-0.019
?'-0.019
 whenever-0.019
TokenThe
Token ID464
Feature activation+3.60e-02

Pos logit contributions
arl+0.036
ps+0.032
ags+0.031
 stern+0.030
agged+0.029
Neg logit contributions
-0.026
light-0.025
 awoke-0.021
?'-0.021
pen-0.021
Token officer
Token ID3818
Feature activation+1.40e-01

Pos logit contributions
arl+0.057
ps+0.050
ags+0.049
agged+0.048
Their+0.047
Neg logit contributions
-0.064
light-0.056
 awoke-0.052
 nap-0.052
?'-0.051
Token shook
Token ID14682
Feature activation+1.24e-02

Pos logit contributions
arl+0.295
ags+0.251
ps+0.247
agged+0.236
orts+0.234
Neg logit contributions
-0.193
light-0.182
 awoke-0.157
pen-0.148
 tucked-0.146
Token his
Token ID465
Feature activation+5.06e-02

Pos logit contributions
arl+0.029
ps+0.025
ses+0.024
ags+0.024
agged+0.024
Neg logit contributions
-0.015
light-0.012
 whenever-0.011
 awoke-0.010
?'-0.010
Token head
Token ID1182
Feature activation-5.78e-03

Pos logit contributions
arl+0.104
agged+0.089
orts+0.088
Their+0.087
outing+0.086
Neg logit contributions
-0.060
light-0.057
 nap-0.052
 awoke-0.046
 tucked-0.046
Token.
Token ID13
Feature activation+1.68e-02

Pos logit contributions
+0.006
light+0.005
 whenever+0.004
 tucked+0.004
's+0.004
Neg logit contributions
arl-0.014
ags-0.013
ses-0.012
orts-0.012
ps-0.012
Token He
Token ID679
Feature activation+2.00e-02

Pos logit contributions
arl+0.036
ps+0.036
ags+0.034
agged+0.033
outing+0.033
Neg logit contributions
-0.019
light-0.014
 whenever-0.014
 awoke-0.013
pen-0.013
Token listen
Token ID6004
Feature activation+4.99e-03

Pos logit contributions
arl+0.145
ps+0.127
ags+0.124
 stern+0.121
agged+0.120
Neg logit contributions
-0.101
light-0.095
 awoke-0.080
?'-0.079
 whenever-0.078
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
arl+0.012
ags+0.011
ps+0.011
ses+0.011
agged+0.011
Neg logit contributions
-0.005
light-0.004
 whenever-0.004
 since-0.003
 awoke-0.003
Token She
Token ID1375
Feature activation+2.05e-03

Pos logit contributions
arl+0.021
ps+0.019
ags+0.018
 stern+0.018
agged+0.018
Neg logit contributions
-0.014
light-0.013
 whenever-0.010
?'-0.010
 awoke-0.010
Token threw
Token ID9617
Feature activation+1.87e-02

Pos logit contributions
arl+0.004
ps+0.003
 stern+0.003
ags+0.003
agged+0.003
Neg logit contributions
-0.003
light-0.003
 awoke-0.002
?'-0.002
pen-0.002
Token a
Token ID257
Feature activation+2.96e-02

Pos logit contributions
arl+0.043
outing+0.037
agged+0.036
ps+0.036
ses+0.035
Neg logit contributions
-0.022
 whenever-0.018
light-0.017
 nap-0.016
pen-0.016
Token tant
Token ID24246
Feature activation+1.40e-01

Pos logit contributions
arl+0.039
ps+0.038
ags+0.037
agged+0.037
uzz+0.033
Neg logit contributions
-0.058
 nap-0.053
light-0.052
 bal-0.049
 awoke-0.049
Tokenrum
Token ID6582
Feature activation+4.15e-02

Pos logit contributions
arl+0.242
ses+0.197
ags+0.194
 stern+0.187
sed+0.185
Neg logit contributions
light-0.214
-0.206
pen-0.187
?'-0.186
 bal-0.184
Token in
Token ID287
Feature activation+1.38e-02

Pos logit contributions
arl+0.097
ags+0.083
ps+0.083
orts+0.081
 stern+0.081
Neg logit contributions
-0.045
light-0.043
 whenever-0.035
 awoke-0.033
?'-0.032
Token the
Token ID262
Feature activation-3.30e-03

Pos logit contributions
arl+0.030
ps+0.026
agged+0.025
ags+0.025
orts+0.025
Neg logit contributions
-0.018
light-0.016
 whenever-0.013
pen-0.013
 nap-0.013
Token store
Token ID3650
Feature activation+6.04e-02

Pos logit contributions
+0.004
light+0.004
 awoke+0.003
 nap+0.003
?'+0.003
Neg logit contributions
arl-0.007
ps-0.006
uzz-0.006
agged-0.006
ags-0.006
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
arl+0.146
ps+0.129
ags+0.128
agged+0.125
orts+0.122
Neg logit contributions
-0.059
light-0.055
 whenever-0.045
pen-0.042
 tucked-0.041
Token and
Token ID290
Feature activation-1.70e-03

Pos logit contributions
arl+0.097
ps+0.084
ags+0.084
 stern+0.082
agged+0.082
Neg logit contributions
-0.043
light-0.041
 whenever-0.032
 awoke-0.031
 tucked-0.030
Token the
Token ID262
Feature activation+9.14e-03

Pos logit contributions
+0.002
light+0.002
 awoke+0.002
?'+0.002
pen+0.002
Neg logit contributions
arl-0.003
ps-0.003
ags-0.003
 stern-0.003
Their-0.003
Token merry
Token ID44570
Feature activation+9.86e-02

Pos logit contributions
arl+0.016
ps+0.013
agged+0.013
ags+0.013
Their+0.012
Neg logit contributions
-0.015
light-0.014
 nap-0.013
 awoke-0.012
 whenever-0.012
Token-
Token ID12
Feature activation+1.08e-01

Pos logit contributions
arl+0.207
 stern+0.185
outing+0.176
ps+0.174
itates+0.173
Neg logit contributions
light-0.115
 whenever-0.101
 night-0.100
 nap-0.099
-0.097
Tokengo
Token ID2188
Feature activation+7.48e-02

Pos logit contributions
arl+0.229
ses+0.215
ps+0.211
 stern+0.205
ags+0.203
Neg logit contributions
-0.126
light-0.112
 bal-0.109
 whenever-0.092
atmeal-0.092
Token-
Token ID12
Feature activation+1.40e-01

Pos logit contributions
arl+0.165
 stern+0.145
ses+0.141
ps+0.139
sed+0.137
Neg logit contributions
-0.094
light-0.093
 whenever-0.076
 nap-0.071
's-0.071
Tokenround
Token ID744
Feature activation+4.70e-02

Pos logit contributions
arl+0.257
ps+0.231
ses+0.231
 sorry+0.222
 stern+0.222
Neg logit contributions
-0.231
light-0.206
 whenever-0.197
 bal-0.185
 tomorrow-0.174
Token.
Token ID13
Feature activation+2.38e-03

Pos logit contributions
arl+0.110
ags+0.095
ps+0.094
agged+0.094
 stern+0.094
Neg logit contributions
-0.052
light-0.049
 whenever-0.040
 awoke-0.036
pen-0.035
Token They
Token ID1119
Feature activation+2.25e-02

Pos logit contributions
arl+0.004
ps+0.003
ags+0.003
 stern+0.003
agged+0.003
Neg logit contributions
-0.005
light-0.004
 awoke-0.004
 whenever-0.004
?'-0.004
Token have
Token ID423
Feature activation+1.30e-02

Pos logit contributions
arl+0.041
ps+0.036
 stern+0.034
ags+0.034
orts+0.032
Neg logit contributions
light-0.035
-0.032
 awoke-0.028
?'-0.028
pen-0.027
Token fun
Token ID1257
Feature activation-9.10e-03

Pos logit contributions
arl+0.025
ps+0.020
orts+0.019
outing+0.019
ags+0.019
Neg logit contributions
-0.021
light-0.019
 nap-0.017
 awoke-0.017
 tucked-0.017
Token thing
Token ID1517
Feature activation-5.59e-03

Pos logit contributions
+0.020
light+0.020
pen+0.018
ridge+0.017
?'+0.016
Neg logit contributions
arl-0.030
uzz-0.027
 stern-0.027
Their-0.026
ps-0.025
Token she
Token ID673
Feature activation-5.08e-02

Pos logit contributions
light+0.007
+0.007
pen+0.007
 awoke+0.006
ridge+0.006
Neg logit contributions
arl-0.010
 stern-0.010
ps-0.009
outs-0.009
ses-0.009
Token always
Token ID1464
Feature activation-3.34e-02

Pos logit contributions
light+0.070
+0.066
?'+0.059
pen+0.056
 whenever+0.056
Neg logit contributions
arl-0.086
ps-0.083
outed-0.079
 stern-0.079
 greedy-0.078
Token hated
Token ID16563
Feature activation+2.16e-02

Pos logit contributions
+0.051
light+0.049
?'+0.043
pen+0.043
 masse+0.042
Neg logit contributions
arl-0.052
ps-0.051
 stern-0.049
ks-0.047
 greedy-0.047
Token was
Token ID373
Feature activation-7.27e-03

Pos logit contributions
arl+0.035
ps+0.032
 stern+0.030
uzz+0.029
ags+0.029
Neg logit contributions
-0.033
light-0.033
 awoke-0.030
pen-0.029
?'-0.029
Token vegetables
Token ID13701
Feature activation-1.58e-01

Pos logit contributions
+0.010
light+0.010
 awoke+0.008
?'+0.008
pen+0.008
Neg logit contributions
arl-0.014
ps-0.013
ags-0.012
 stern-0.012
agged-0.012
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
light+0.262
+0.242
 awoke+0.240
?'+0.230
 recite+0.225
Neg logit contributions
arl-0.242
 stern-0.214
ps-0.212
 greedy-0.200
 innocent-0.191
Token Every
Token ID3887
Feature activation-1.64e-03

Pos logit contributions
light+0.018
+0.017
 awoke+0.015
pen+0.014
 tucked+0.014
Neg logit contributions
arl-0.021
ags-0.018
 stern-0.018
orts-0.018
ps-0.018
Token day
Token ID1110
Feature activation-6.31e-03

Pos logit contributions
light+0.002
pen+0.002
+0.002
ridge+0.002
 recite+0.002
Neg logit contributions
arl-0.003
 stern-0.003
Their-0.002
uzz-0.002
ps-0.002
Token,
Token ID11
Feature activation-4.13e-03

Pos logit contributions
light+0.010
umm+0.010
asts+0.010
 repay+0.010
atmeal+0.010
Neg logit contributions
 stern-0.009
 loud-0.008
 Lia-0.008
 sorry-0.008
ks-0.007
Token when
Token ID618
Feature activation-3.11e-02

Pos logit contributions
light+0.006
+0.006
 awoke+0.005
?'+0.005
pen+0.005
Neg logit contributions
arl-0.007
 stern-0.007
ps-0.007
ags-0.006
Their-0.006
Tokenier
Token ID959
Feature activation-3.06e-02

Pos logit contributions
+0.268
light+0.255
 awoke+0.228
pen+0.222
?'+0.219
Neg logit contributions
arl-0.223
ps-0.187
ags-0.177
agged-0.169
 stern-0.165
Token?
Token ID30
Feature activation-9.13e-03

Pos logit contributions
light+0.048
+0.048
 awoke+0.043
pen+0.041
 laid+0.039
Neg logit contributions
arl-0.056
ps-0.048
ags-0.044
 stern-0.044
ks-0.042
Token You
Token ID921
Feature activation+1.74e-02

Pos logit contributions
light+0.016
+0.016
 awoke+0.015
pen+0.014
?'+0.014
Neg logit contributions
arl-0.014
ps-0.011
ags-0.011
 stern-0.011
orts-0.011
Token are
Token ID389
Feature activation-5.88e-03

Pos logit contributions
arl+0.029
ps+0.027
 greedy+0.025
 stern+0.024
ags+0.024
Neg logit contributions
light-0.027
-0.024
 Tues-0.023
 masse-0.023
 awoke-0.022
Token y
Token ID331
Feature activation-5.52e-02

Pos logit contributions
+0.011
light+0.010
?'+0.009
 awoke+0.009
 whenever+0.009
Neg logit contributions
arl-0.009
ps-0.007
ags-0.007
 stern-0.007
orts-0.007
Tokenumm
Token ID13929
Feature activation-1.57e-01

Pos logit contributions
+0.060
light+0.056
 awoke+0.052
 whenever+0.045
 laid+0.043
Neg logit contributions
arl-0.136
ags-0.115
agged-0.113
ps-0.110
outing-0.109
Tokenier
Token ID959
Feature activation-4.36e-02

Pos logit contributions
+0.293
light+0.279
 awoke+0.250
pen+0.241
?'+0.241
Neg logit contributions
arl-0.246
ps-0.206
ags-0.196
 stern-0.188
agged-0.187
Token.
Token ID13
Feature activation-7.35e-04

Pos logit contributions
+0.072
light+0.071
 awoke+0.065
?'+0.060
pen+0.059
Neg logit contributions
arl-0.074
ps-0.065
 stern-0.060
uzz-0.059
ks-0.058
Token You
Token ID921
Feature activation+1.90e-02

Pos logit contributions
light+0.001
+0.001
 awoke+0.001
pen+0.001
?'+0.001
Neg logit contributions
arl-0.001
ags-0.001
ps-0.001
orts-0.001
ses-0.001
Token are
Token ID389
Feature activation-2.55e-03

Pos logit contributions
arl+0.033
ps+0.031
 stern+0.028
 greedy+0.028
ags+0.027
Neg logit contributions
light-0.029
-0.026
 Tues-0.025
 awoke-0.024
 masse-0.024
Token my
Token ID616
Feature activation-3.20e-02

Pos logit contributions
+0.005
light+0.005
 awoke+0.004
?'+0.004
 whenever+0.004
Neg logit contributions
arl-0.004
ps-0.003
ags-0.003
 stern-0.003
agged-0.003
Token said
Token ID531
Feature activation-1.57e-02

Pos logit contributions
+0.031
light+0.031
 whenever+0.026
 awoke+0.026
?'+0.025
Neg logit contributions
arl-0.039
ps-0.035
 stern-0.034
ags-0.033
agged-0.032
Token,
Token ID11
Feature activation+5.44e-04

Pos logit contributions
+0.012
light+0.011
 awoke+0.008
?'+0.007
 whenever+0.007
Neg logit contributions
arl-0.042
ps-0.037
ags-0.037
agged-0.036
orts-0.035
Token "
Token ID366
Feature activation+3.54e-02

Pos logit contributions
arl+0.001
ags+0.001
ps+0.001
agged+0.001
orts+0.001
Neg logit contributions
-0.001
light-0.001
 awoke-0.001
 whenever-0.001
 tucked-0.001
TokenThat
Token ID2504
Feature activation+1.04e-02

Pos logit contributions
arl+0.072
ps+0.063
ags+0.063
agged+0.059
ses+0.059
Neg logit contributions
-0.053
light-0.049
 whenever-0.039
 awoke-0.039
?'-0.038
Token is
Token ID318
Feature activation+7.76e-04

Pos logit contributions
ps+0.021
 stern+0.021
 greedy+0.019
arl+0.019
outs+0.019
Neg logit contributions
light-0.009
pen-0.009
 Tues-0.008
-0.008
 concentrate-0.008
Token soil
Token ID9260
Feature activation-1.54e-01

Pos logit contributions
arl+0.001
ps+0.001
 stern+0.001
ags+0.001
 greedy+0.001
Neg logit contributions
light-0.002
-0.001
pen-0.001
 awoke-0.001
 whenever-0.001
Token,
Token ID11
Feature activation-3.30e-02

Pos logit contributions
light+0.195
+0.192
 awoke+0.177
pen+0.175
 recite+0.171
Neg logit contributions
arl-0.288
ps-0.267
 stern-0.252
ags-0.235
 innocent-0.232
Token Daisy
Token ID40355
Feature activation-4.88e-02

Pos logit contributions
+0.055
light+0.053
 awoke+0.046
?'+0.045
 whenever+0.045
Neg logit contributions
arl-0.057
ps-0.048
ags-0.046
 stern-0.046
agged-0.044
Token.
Token ID13
Feature activation-9.25e-03

Pos logit contributions
+0.069
light+0.068
 awoke+0.060
pen+0.058
?'+0.057
Neg logit contributions
arl-0.090
ps-0.080
 stern-0.076
ags-0.073
uzz-0.072
Token It
Token ID632
Feature activation+6.93e-03

Pos logit contributions
+0.015
light+0.014
 awoke+0.013
?'+0.012
pen+0.012
Neg logit contributions
arl-0.016
ps-0.014
ags-0.013
 stern-0.013
orts-0.013
Token may
Token ID743
Feature activation-2.12e-02

Pos logit contributions
ps+0.013
arl+0.013
 stern+0.012
 greedy+0.011
ses+0.011
Neg logit contributions
light-0.009
pen-0.008
-0.008
 Tues-0.007
?'-0.007
Token so
Token ID523
Feature activation+1.12e-02

Pos logit contributions
arl+0.048
ps+0.041
ags+0.039
 stern+0.039
agged+0.038
Neg logit contributions
-0.045
light-0.043
 awoke-0.037
?'-0.037
pen-0.036
Token many
Token ID867
Feature activation+2.37e-02

Pos logit contributions
arl+0.016
ps+0.013
ags+0.012
ses+0.012
agged+0.011
Neg logit contributions
-0.023
light-0.021
 whenever-0.020
pen-0.019
 awoke-0.019
Token beautiful
Token ID4950
Feature activation-4.16e-02

Pos logit contributions
arl+0.031
 stern+0.027
ps+0.027
ags+0.026
 innocent+0.025
Neg logit contributions
light-0.047
-0.045
 awoke-0.040
?'-0.039
pen-0.038
Token things
Token ID1243
Feature activation-1.62e-02

Pos logit contributions
+0.087
light+0.087
 awoke+0.078
?'+0.077
 concentrate+0.074
Neg logit contributions
arl-0.043
ps-0.038
 stern-0.036
ags-0.034
outs-0.034
Token:
Token ID25
Feature activation+4.85e-03

Pos logit contributions
light+0.023
+0.022
 awoke+0.019
?'+0.019
pen+0.019
Neg logit contributions
arl-0.029
ps-0.027
 stern-0.026
ags-0.025
uzz-0.024
Token vegetables
Token ID13701
Feature activation-1.54e-01

Pos logit contributions
arl+0.009
ps+0.008
ags+0.007
agged+0.007
 stern+0.007
Neg logit contributions
-0.008
light-0.007
 awoke-0.006
 whenever-0.006
?'-0.006
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
light+0.217
 masse+0.189
 Tues+0.187
+0.185
 awoke+0.184
Neg logit contributions
arl-0.258
ps-0.247
 stern-0.239
ks-0.217
outing-0.213
Token fruits
Token ID15921
Feature activation-1.08e-01

Pos logit contributions
light+0.019
+0.018
?'+0.017
pen+0.017
 awoke+0.017
Neg logit contributions
arl-0.020
ps-0.019
 stern-0.019
 greedy-0.019
 innocent-0.018
Token,
Token ID11
Feature activation-1.34e-02

Pos logit contributions
light+0.157
+0.149
 awoke+0.136
?'+0.133
 Tues+0.131
Neg logit contributions
arl-0.193
ps-0.174
 stern-0.167
ags-0.155
Their-0.153
Token a
Token ID257
Feature activation+1.24e-02

Pos logit contributions
light+0.020
+0.019
 awoke+0.017
?'+0.017
pen+0.016
Neg logit contributions
arl-0.024
ps-0.022
 stern-0.021
ags-0.020
 greedy-0.019
Token big
Token ID1263
Feature activation-1.08e-02

Pos logit contributions
arl+0.019
 stern+0.016
ps+0.015
 greedy+0.014
 innocent+0.014
Neg logit contributions
light-0.022
-0.021
 awoke-0.019
pen-0.019
 whenever-0.019
Tokena
Token ID64
Feature activation-1.01e-02

Pos logit contributions
+0.036
light+0.036
 awoke+0.032
?'+0.031
 whenever+0.031
Neg logit contributions
arl-0.026
ps-0.021
ags-0.020
 stern-0.020
agged-0.019
Token.
Token ID13
Feature activation-2.55e-02

Pos logit contributions
+0.015
light+0.015
 awoke+0.013
pen+0.013
?'+0.013
Neg logit contributions
arl-0.018
ps-0.015
 stern-0.015
uzz-0.015
Their-0.015
Token Ell
Token ID7122
Feature activation-9.89e-03

Pos logit contributions
+0.037
light+0.036
 awoke+0.031
?'+0.030
pen+0.030
Neg logit contributions
arl-0.049
ps-0.042
ags-0.041
 stern-0.040
uzz-0.040
Tokena
Token ID64
Feature activation-5.97e-03

Pos logit contributions
+0.020
light+0.019
 awoke+0.017
?'+0.017
pen+0.017
Neg logit contributions
arl-0.014
ps-0.011
ags-0.011
 stern-0.010
agged-0.010
Token loved
Token ID6151
Feature activation-9.87e-03

Pos logit contributions
light+0.010
pen+0.010
 avail+0.010
ridge+0.009
 supper+0.009
Neg logit contributions
 greedy-0.007
uzz-0.007
arl-0.007
Their-0.007
 stern-0.007
Token vegetables
Token ID13701
Feature activation-1.53e-01

Pos logit contributions
+0.018
light+0.017
 awoke+0.015
pen+0.015
?'+0.015
Neg logit contributions
arl-0.015
ps-0.012
ags-0.012
 stern-0.012
Their-0.012
Token!
Token ID0
Feature activation-1.56e-02

Pos logit contributions
light+0.273
+0.267
 awoke+0.257
 recite+0.239
pen+0.238
Neg logit contributions
arl-0.207
ags-0.180
 stern-0.180
ps-0.180
 greedy-0.168
Token Every
Token ID3887
Feature activation-5.85e-03

Pos logit contributions
light+0.027
+0.025
 awoke+0.023
pen+0.022
 tucked+0.022
Neg logit contributions
arl-0.027
ags-0.022
uzz-0.022
orts-0.021
ps-0.021
Token day
Token ID1110
Feature activation-3.88e-03

Pos logit contributions
pen+0.010
light+0.010
ridge+0.010
+0.009
 recite+0.009
Neg logit contributions
arl-0.007
 loud-0.006
 hard-0.006
 second-0.005
 stern-0.005
Token she
Token ID673
Feature activation-2.37e-02

Pos logit contributions
light+0.006
?'+0.006
 repay+0.006
 recite+0.006
pen+0.006
Neg logit contributions
 stern-0.005
arl-0.004
uzz-0.004
 hello-0.004
 Lia-0.004
Token went
Token ID1816
Feature activation-1.68e-02

Pos logit contributions
light+0.039
 avail+0.038
umm+0.037
+0.035
mmm+0.035
Neg logit contributions
ps-0.028
 greedy-0.027
suits-0.024
co-0.023
outed-0.023
Token said
Token ID531
Feature activation-2.02e-02

Pos logit contributions
+0.035
?'+0.033
light+0.032
 masse+0.031
pen+0.031
Neg logit contributions
 stern-0.028
ps-0.027
 mad-0.026
 greedy-0.026
 angry-0.026
Token,
Token ID11
Feature activation-9.10e-04

Pos logit contributions
+0.019
light+0.018
 awoke+0.014
?'+0.013
pen+0.013
Neg logit contributions
arl-0.049
ps-0.044
ags-0.042
 stern-0.042
agged-0.041
Token "
Token ID366
Feature activation+3.17e-02

Pos logit contributions
+0.002
light+0.001
 whenever+0.001
?'+0.001
 awoke+0.001
Neg logit contributions
arl-0.001
ps-0.001
ags-0.001
agged-0.001
orts-0.001
TokenI
Token ID40
Feature activation-3.63e-02

Pos logit contributions
arl+0.043
Their+0.034
ags+0.034
agged+0.033
ps+0.033
Neg logit contributions
-0.055
light-0.054
 awoke-0.052
?'-0.050
 tucked-0.049
Token love
Token ID1842
Feature activation-1.78e-02

Pos logit contributions
light+0.058
 Tues+0.051
+0.050
 masse+0.050
 nights+0.049
Neg logit contributions
arl-0.057
 stern-0.053
 greedy-0.048
 mad-0.048
uzz-0.047
Token vegetables
Token ID13701
Feature activation-1.53e-01

Pos logit contributions
light+0.032
 awoke+0.031
+0.029
pen+0.027
 recite+0.025
Neg logit contributions
arl-0.023
 greedy-0.023
ps-0.022
 sorry-0.021
 stern-0.021
Token!
Token ID0
Feature activation-1.10e-02

Pos logit contributions
light+0.212
+0.210
 awoke+0.191
pen+0.181
 recite+0.173
Neg logit contributions
arl-0.284
ps-0.249
 stern-0.243
ags-0.234
 greedy-0.230
Token They
Token ID1119
Feature activation-2.08e-03

Pos logit contributions
light+0.019
pen+0.018
 awoke+0.017
 nights+0.017
+0.017
Neg logit contributions
arl-0.018
 stern-0.013
 sorry-0.012
uzz-0.012
ses-0.012
Token're
Token ID821
Feature activation-5.04e-02

Pos logit contributions
light+0.003
 masse+0.003
 Tues+0.003
+0.003
 awoke+0.003
Neg logit contributions
ps-0.003
arl-0.003
 greedy-0.003
 stern-0.003
 innocent-0.002
Token so
Token ID523
Feature activation-1.60e-02

Pos logit contributions
light+0.117
pen+0.107
 awoke+0.105
+0.103
 bal+0.097
Neg logit contributions
arl-0.047
ps-0.044
 sorry-0.044
 greedy-0.043
 stern-0.041
Token delicious
Token ID12625
Feature activation-7.82e-02

Pos logit contributions
+0.030
light+0.030
 awoke+0.026
?'+0.026
pen+0.026
Neg logit contributions
arl-0.023
ps-0.019
ags-0.019
 stern-0.018
agged-0.018
Token them
Token ID606
Feature activation+4.18e-02

Pos logit contributions
+0.003
light+0.002
 whenever+0.002
pen+0.002
 awoke+0.002
Neg logit contributions
arl-0.004
agged-0.003
ags-0.003
ps-0.003
orts-0.003
Token out
Token ID503
Feature activation+1.07e-02

Pos logit contributions
arl+0.087
ps+0.074
ags+0.073
agged+0.073
orts+0.071
Neg logit contributions
-0.057
light-0.053
 whenever-0.045
 awoke-0.043
pen-0.043
Token.
Token ID13
Feature activation+1.99e-04

Pos logit contributions
arl+0.027
ags+0.023
agged+0.023
ses+0.023
ps+0.023
Neg logit contributions
-0.011
light-0.010
 whenever-0.008
 tucked-0.007
 awoke-0.007
Token They
Token ID1119
Feature activation+1.96e-02

Pos logit contributions
arl+0.000
ps+0.000
ags+0.000
agged+0.000
chie+0.000
Neg logit contributions
-0.000
 whenever-0.000
light-0.000
 Tues-0.000
 Little-0.000
Token were
Token ID547
Feature activation-1.21e-03

Pos logit contributions
arl+0.039
ps+0.035
 stern+0.033
ags+0.033
Their+0.032
Neg logit contributions
-0.026
light-0.026
 awoke-0.021
?'-0.021
pen-0.021
Token vegetables
Token ID13701
Feature activation-1.51e-01

Pos logit contributions
+0.002
light+0.002
 tucked+0.002
 whenever+0.002
 laid+0.002
Neg logit contributions
arl-0.002
orts-0.001
agged-0.001
ags-0.001
 apology-0.001
Token!
Token ID0
Feature activation+2.94e-03

Pos logit contributions
light+0.182
 awoke+0.174
+0.168
?'+0.167
 Tues+0.162
Neg logit contributions
arl-0.297
 stern-0.274
ps-0.269
outing-0.246
ks-0.245
Token They
Token ID1119
Feature activation+1.92e-02

Pos logit contributions
arl+0.005
ps+0.005
ags+0.004
 stern+0.004
ses+0.004
Neg logit contributions
-0.005
light-0.004
 whenever-0.004
 Tues-0.004
?'-0.004
Token were
Token ID547
Feature activation-3.87e-03

Pos logit contributions
arl+0.031
ps+0.030
 stern+0.029
uzz+0.028
ks+0.028
Neg logit contributions
light-0.028
-0.027
?'-0.025
 masse-0.025
 Tues-0.025
Token carrots
Token ID34397
Feature activation-1.25e-01

Pos logit contributions
+0.007
light+0.007
 awoke+0.006
 whenever+0.006
?'+0.006
Neg logit contributions
arl-0.006
ps-0.005
ags-0.005
 stern-0.005
agged-0.005
Token,
Token ID11
Feature activation-1.76e-02

Pos logit contributions
light+0.152
+0.144
 awoke+0.143
 Tues+0.140
?'+0.139
Neg logit contributions
arl-0.245
ps-0.219
 stern-0.218
ses-0.201
ks-0.200
Token was
Token ID373
Feature activation-1.17e-02

Pos logit contributions
uzz+0.009
 stern+0.008
Their+0.008
arl+0.008
ps+0.008
Neg logit contributions
light-0.009
 masse-0.009
ridge-0.008
 recite-0.008
-0.008
Token a
Token ID257
Feature activation-4.13e-03

Pos logit contributions
light+0.023
+0.022
 awoke+0.021
pen+0.021
 recite+0.020
Neg logit contributions
arl-0.014
ags-0.012
ps-0.012
Their-0.012
uzz-0.011
Token family
Token ID1641
Feature activation-4.40e-02

Pos logit contributions
+0.007
light+0.007
 awoke+0.006
?'+0.006
 whenever+0.006
Neg logit contributions
arl-0.007
ps-0.006
ags-0.006
agged-0.006
 stern-0.005
Token who
Token ID508
Feature activation+2.39e-02

Pos logit contributions
light+0.081
+0.081
 awoke+0.076
pen+0.071
ridge+0.071
Neg logit contributions
arl-0.056
Their-0.047
uzz-0.047
 stern-0.045
ps-0.044
Token liked
Token ID8288
Feature activation+1.39e-02

Pos logit contributions
arl+0.034
ps+0.032
uzz+0.030
 stern+0.030
ks+0.030
Neg logit contributions
light-0.039
-0.036
?'-0.035
pen-0.034
 masse-0.033
Token vegetables
Token ID13701
Feature activation-1.49e-01

Pos logit contributions
arl+0.026
ps+0.022
 stern+0.022
ags+0.021
Their+0.021
Neg logit contributions
-0.021
light-0.020
 awoke-0.017
pen-0.017
?'-0.017
Token.
Token ID13
Feature activation-2.69e-02

Pos logit contributions
light+0.241
+0.240
 awoke+0.222
pen+0.214
?'+0.212
Neg logit contributions
arl-0.218
ps-0.204
 stern-0.193
ks-0.182
 greedy-0.181
Token They
Token ID1119
Feature activation-4.12e-03

Pos logit contributions
light+0.047
+0.047
 awoke+0.041
pen+0.041
?'+0.039
Neg logit contributions
arl-0.042
ags-0.035
ps-0.034
Their-0.033
 stern-0.033
Token especially
Token ID2592
Feature activation-4.05e-02

Pos logit contributions
light+0.007
 masse+0.007
+0.006
umm+0.006
pen+0.006
Neg logit contributions
 greedy-0.005
ps-0.005
arl-0.005
uzz-0.005
Their-0.004
Token liked
Token ID8288
Feature activation+1.57e-02

Pos logit contributions
light+0.083
+0.082
?'+0.076
 awoke+0.073
pen+0.072
Neg logit contributions
arl-0.046
 greedy-0.041
ps-0.041
 stern-0.039
agged-0.039
Token caul
Token ID46087
Feature activation-1.04e-01

Pos logit contributions
arl+0.020
 stern+0.018
 greedy+0.018
ps+0.018
Their+0.017
Neg logit contributions
light-0.030
-0.029
 awoke-0.027
pen-0.026
 bal-0.025
Token the
Token ID262
Feature activation-3.82e-02

Pos logit contributions
light+0.035
+0.034
 awoke+0.031
pen+0.029
 recite+0.029
Neg logit contributions
arl-0.038
ps-0.035
ags-0.034
Their-0.033
 stern-0.032
Token materials
Token ID5696
Feature activation-2.90e-02

Pos logit contributions
+0.071
light+0.071
 awoke+0.063
?'+0.061
pen+0.060
Neg logit contributions
arl-0.056
ps-0.048
ags-0.045
 stern-0.044
orts-0.042
Token he
Token ID339
Feature activation-2.79e-02

Pos logit contributions
light+0.042
+0.039
 awoke+0.038
 masse+0.035
pen+0.034
Neg logit contributions
arl-0.051
ps-0.047
 stern-0.045
ags-0.043
Their-0.043
Token needed
Token ID2622
Feature activation-3.23e-02

Pos logit contributions
light+0.034
+0.031
 awoke+0.027
 Tues+0.027
 masse+0.026
Neg logit contributions
arl-0.055
ps-0.052
 stern-0.050
 greedy-0.048
ags-0.048
Token:
Token ID25
Feature activation+3.27e-03

Pos logit contributions
light+0.049
+0.047
 awoke+0.044
 masse+0.043
 recite+0.041
Neg logit contributions
arl-0.054
ps-0.050
 stern-0.047
ags-0.046
uzz-0.045
Token soil
Token ID9260
Feature activation-1.47e-01

Pos logit contributions
arl+0.006
ps+0.005
ags+0.005
 stern+0.005
agged+0.005
Neg logit contributions
-0.005
light-0.005
 awoke-0.004
?'-0.004
 whenever-0.004
Token and
Token ID290
Feature activation-2.11e-02

Pos logit contributions
light+0.192
+0.186
 awoke+0.175
 recite+0.169
pen+0.166
Neg logit contributions
arl-0.271
ps-0.250
 stern-0.232
ags-0.222
uzz-0.216
Token pe
Token ID613
Feature activation-3.61e-02

Pos logit contributions
light+0.034
?'+0.033
 awoke+0.033
pen+0.033
+0.033
Neg logit contributions
ps-0.028
arl-0.027
 stern-0.026
 innocent-0.025
 greedy-0.024
Tokenb
Token ID65
Feature activation-2.93e-02

Pos logit contributions
+0.067
light+0.066
 awoke+0.058
?'+0.057
pen+0.057
Neg logit contributions
arl-0.054
ps-0.043
ags-0.042
 stern-0.042
agged-0.040
Tokenbles
Token ID7689
Feature activation-7.51e-03

Pos logit contributions
+0.068
light+0.066
 whenever+0.059
?'+0.059
pen+0.059
Neg logit contributions
arl-0.030
ps-0.022
 stern-0.021
ags-0.021
agged-0.020
Token.
Token ID13
Feature activation-5.51e-03

Pos logit contributions
light+0.010
+0.010
 awoke+0.009
?'+0.009
 masse+0.009
Neg logit contributions
arl-0.014
ps-0.013
 stern-0.012
ags-0.012
uzz-0.011
Token saw
Token ID2497
Feature activation+2.54e-02

Pos logit contributions
light+0.035
+0.033
?'+0.030
umm+0.030
pen+0.030
Neg logit contributions
 stern-0.029
arl-0.028
outed-0.028
uzz-0.026
 greedy-0.026
Token many
Token ID867
Feature activation+2.51e-02

Pos logit contributions
arl+0.046
ps+0.039
ags+0.038
agged+0.036
 stern+0.036
Neg logit contributions
-0.041
light-0.039
 awoke-0.034
 whenever-0.034
?'-0.033
Token interesting
Token ID3499
Feature activation-1.17e-02

Pos logit contributions
arl+0.031
ps+0.026
 stern+0.025
ags+0.025
Their+0.023
Neg logit contributions
light-0.052
-0.052
 awoke-0.046
?'-0.045
pen-0.044
Token things
Token ID1243
Feature activation-1.56e-02

Pos logit contributions
+0.025
light+0.024
 awoke+0.022
?'+0.022
pen+0.021
Neg logit contributions
arl-0.014
ps-0.011
ags-0.011
 stern-0.011
agged-0.010
Token like
Token ID588
Feature activation-8.87e-03

Pos logit contributions
light+0.023
+0.021
 awoke+0.020
?'+0.019
pen+0.019
Neg logit contributions
arl-0.028
ps-0.026
 stern-0.024
ags-0.023
ks-0.023
Token vegetables
Token ID13701
Feature activation-1.47e-01

Pos logit contributions
+0.016
light+0.015
 awoke+0.013
 whenever+0.013
pen+0.013
Neg logit contributions
arl-0.014
ps-0.012
ags-0.012
agged-0.011
 stern-0.011
Token and
Token ID290
Feature activation-1.53e-02

Pos logit contributions
light+0.225
 awoke+0.195
+0.193
 Tues+0.186
?'+0.184
Neg logit contributions
arl-0.253
ps-0.226
 stern-0.219
 greedy-0.203
ks-0.203
Token fruits
Token ID15921
Feature activation-1.08e-01

Pos logit contributions
light+0.025
+0.024
?'+0.022
pen+0.022
 awoke+0.021
Neg logit contributions
arl-0.023
ps-0.022
 stern-0.021
 greedy-0.020
 innocent-0.020
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
light+0.165
+0.161
 awoke+0.144
?'+0.143
 Tues+0.138
Neg logit contributions
arl-0.189
ps-0.166
 stern-0.158
ags-0.153
Their-0.147
Token
Token ID198
Feature activation-1.77e-02

Pos logit contributions
+0.014
light+0.013
 awoke+0.011
?'+0.011
 whenever+0.011
Neg logit contributions
arl-0.022
ps-0.020
ags-0.019
 stern-0.019
agged-0.018
Token
Token ID198
Feature activation-1.53e-02

Pos logit contributions
+0.023
light+0.023
 awoke+0.019
pen+0.018
?'+0.018
Neg logit contributions
arl-0.036
ps-0.031
ags-0.030
 stern-0.030
Their-0.029
Token full
Token ID1336
Feature activation-6.52e-02

Pos logit contributions
+0.086
light+0.082
 awoke+0.075
pen+0.071
?'+0.069
Neg logit contributions
arl-0.102
 stern-0.087
ps-0.086
uzz-0.084
ags-0.084
Token of
Token ID286
Feature activation-2.28e-02

Pos logit contributions
light+0.095
+0.092
pen+0.089
 recite+0.087
 awoke+0.086
Neg logit contributions
arl-0.109
 stern-0.097
ags-0.092
 greedy-0.089
agged-0.088
Token all
Token ID477
Feature activation-3.08e-02

Pos logit contributions
light+0.046
+0.044
 awoke+0.041
pen+0.041
ridge+0.037
Neg logit contributions
arl-0.027
ps-0.025
ags-0.024
 innocent-0.024
 greedy-0.024
Token types
Token ID3858
Feature activation-5.69e-02

Pos logit contributions
light+0.044
+0.043
pen+0.040
 awoke+0.038
 recite+0.036
Neg logit contributions
arl-0.055
ps-0.050
ags-0.049
Their-0.047
 stern-0.046
Token of
Token ID286
Feature activation-1.68e-02

Pos logit contributions
+0.048
light+0.046
 awoke+0.034
?'+0.033
pen+0.032
Neg logit contributions
arl-0.142
ps-0.128
ags-0.125
 stern-0.124
agged-0.122
Token vegetables
Token ID13701
Feature activation-1.47e-01

Pos logit contributions
light+0.032
+0.030
 awoke+0.027
pen+0.026
 bal+0.025
Neg logit contributions
arl-0.023
ps-0.022
ags-0.021
 innocent-0.021
 stern-0.020
Token.
Token ID13
Feature activation-1.75e-02

Pos logit contributions
light+0.245
+0.238
 awoke+0.224
 recite+0.212
pen+0.211
Neg logit contributions
arl-0.224
ps-0.199
 stern-0.189
ks-0.184
uzz-0.179
Token All
Token ID1439
Feature activation-1.60e-03

Pos logit contributions
+0.027
light+0.027
 awoke+0.023
pen+0.023
?'+0.022
Neg logit contributions
arl-0.031
ps-0.026
ags-0.026
 stern-0.025
agged-0.025
Token the
Token ID262
Feature activation-3.77e-02

Pos logit contributions
light+0.002
+0.002
pen+0.002
 awoke+0.002
 recite+0.002
Neg logit contributions
arl-0.003
ps-0.002
ags-0.002
Their-0.002
 stern-0.002
Token vegetables
Token ID13701
Feature activation-1.40e-01

Pos logit contributions
light+0.075
+0.072
 awoke+0.065
pen+0.064
?'+0.063
Neg logit contributions
arl-0.049
ps-0.043
 greedy-0.042
ags-0.041
 innocent-0.040
Token were
Token ID547
Feature activation-1.94e-02

Pos logit contributions
light+0.201
+0.197
 masse+0.188
 Tues+0.178
 Tooth+0.171
Neg logit contributions
arl-0.226
ps-0.221
 stern-0.208
 greedy-0.202
ks-0.199
Token sees
Token ID7224
Feature activation+3.90e-02

Pos logit contributions
light+0.058
+0.053
 awoke+0.049
?'+0.049
 bal+0.049
Neg logit contributions
arl-0.045
ps-0.045
ks-0.038
 stern-0.037
 greedy-0.037
Token tomatoes
Token ID23972
Feature activation-1.33e-01

Pos logit contributions
arl+0.070
ps+0.060
ags+0.058
 stern+0.057
agged+0.056
Neg logit contributions
-0.061
light-0.059
 awoke-0.051
 whenever-0.050
?'-0.050
Token.
Token ID13
Feature activation-1.57e-03

Pos logit contributions
light+0.203
+0.186
 awoke+0.179
?'+0.173
 Tues+0.173
Neg logit contributions
arl-0.230
ps-0.215
 stern-0.185
ags-0.183
ows-0.181
Token She
Token ID1375
Feature activation-1.17e-02

Pos logit contributions
+0.002
light+0.002
 awoke+0.002
?'+0.002
 whenever+0.002
Neg logit contributions
arl-0.003
ps-0.003
ags-0.003
 stern-0.003
agged-0.003
Token likes
Token ID7832
Feature activation+1.52e-02

Pos logit contributions
light+0.026
+0.023
?'+0.022
pen+0.022
 whenever+0.022
Neg logit contributions
arl-0.013
ps-0.013
 stern-0.011
ses-0.010
ags-0.010
Token vegetables
Token ID13701
Feature activation-1.47e-01

Pos logit contributions
arl+0.026
ps+0.022
 stern+0.021
ags+0.021
Their+0.021
Neg logit contributions
light-0.025
-0.025
 awoke-0.022
?'-0.021
pen-0.021
Token.
Token ID13
Feature activation-3.62e-04

Pos logit contributions
light+0.215
+0.205
 awoke+0.191
?'+0.182
pen+0.179
Neg logit contributions
arl-0.260
ps-0.234
 stern-0.231
ags-0.216
Their-0.212
Token She
Token ID1375
Feature activation-9.62e-03

Pos logit contributions
+0.001
light+0.000
 awoke+0.000
?'+0.000
pen+0.000
Neg logit contributions
arl-0.001
ps-0.001
ags-0.001
 stern-0.001
orts-0.001
Token likes
Token ID7832
Feature activation+1.65e-02

Pos logit contributions
light+0.022
+0.019
?'+0.019
pen+0.019
 whenever+0.019
Neg logit contributions
ps-0.010
arl-0.010
 stern-0.008
ses-0.008
 greedy-0.008
Token to
Token ID284
Feature activation-1.29e-02

Pos logit contributions
arl+0.029
ps+0.025
 stern+0.024
ags+0.023
Their+0.023
Neg logit contributions
light-0.026
-0.026
 awoke-0.023
?'-0.022
pen-0.022
Token eat
Token ID4483
Feature activation-4.32e-02

Pos logit contributions
light+0.016
+0.016
 awoke+0.012
pen+0.012
?'+0.011
Neg logit contributions
arl-0.027
ps-0.025
ags-0.024
 stern-0.024
Their-0.023
Token loved
Token ID6151
Feature activation+5.26e-03

Pos logit contributions
ps+0.003
arl+0.003
 stern+0.003
uzz+0.003
ks+0.003
Neg logit contributions
light-0.002
-0.002
?'-0.002
 Tues-0.002
pen-0.002
Token seeing
Token ID4379
Feature activation-1.23e-02

Pos logit contributions
arl+0.011
ps+0.009
ags+0.009
 stern+0.009
agged+0.009
Neg logit contributions
-0.007
light-0.007
 awoke-0.006
?'-0.006
 whenever-0.005
Token what
Token ID644
Feature activation+4.36e-03

Pos logit contributions
+0.020
light+0.019
 awoke+0.017
?'+0.017
pen+0.016
Neg logit contributions
arl-0.021
ps-0.018
ags-0.017
 stern-0.017
uzz-0.016
Token fruits
Token ID15921
Feature activation-1.25e-01

Pos logit contributions
arl+0.007
 stern+0.007
 greedy+0.007
Their+0.007
ps+0.007
Neg logit contributions
light-0.006
-0.005
 awoke-0.005
?'-0.005
pen-0.005
Token and
Token ID290
Feature activation-2.40e-02

Pos logit contributions
light+0.189
+0.169
 masse+0.163
?'+0.160
 awoke+0.157
Neg logit contributions
arl-0.200
ps-0.189
 stern-0.176
 theirs-0.166
ks-0.165
Token vegetables
Token ID13701
Feature activation-1.46e-01

Pos logit contributions
light+0.035
+0.034
?'+0.032
pen+0.031
 awoke+0.030
Neg logit contributions
ps-0.038
arl-0.036
 greedy-0.035
 innocent-0.035
 stern-0.035
Token were
Token ID547
Feature activation-2.23e-02

Pos logit contributions
light+0.208
+0.196
 masse+0.178
 awoke+0.176
?'+0.175
Neg logit contributions
arl-0.250
ps-0.238
 stern-0.223
ks-0.212
Their-0.207
Token in
Token ID287
Feature activation-2.11e-02

Pos logit contributions
light+0.034
+0.033
 awoke+0.029
pen+0.029
?'+0.028
Neg logit contributions
arl-0.038
ps-0.035
 stern-0.034
ags-0.033
 innocent-0.031
Token season
Token ID1622
Feature activation-5.05e-02

Pos logit contributions
+0.031
light+0.031
 awoke+0.027
?'+0.026
pen+0.025
Neg logit contributions
arl-0.038
ps-0.034
ags-0.032
 stern-0.032
Their-0.031
Token,
Token ID11
Feature activation-1.81e-02

Pos logit contributions
+0.070
light+0.068
 awoke+0.065
?'+0.061
 bal+0.059
Neg logit contributions
arl-0.091
ps-0.080
 stern-0.076
ags-0.075
uzz-0.073
Token and
Token ID290
Feature activation-1.16e-02

Pos logit contributions
+0.025
light+0.024
?'+0.021
 awoke+0.021
pen+0.020
Neg logit contributions
arl-0.034
ps-0.031
 stern-0.030
ags-0.029
uzz-0.028
Token herbs
Token ID25411
Feature activation-1.09e-01

Pos logit contributions
+0.035
light+0.032
 awoke+0.027
?'+0.027
 whenever+0.027
Neg logit contributions
arl-0.054
ps-0.047
ags-0.046
agged-0.045
 stern-0.045
Token.
Token ID13
Feature activation-1.42e-02

Pos logit contributions
light+0.174
+0.162
 awoke+0.157
 Tues+0.148
pen+0.145
Neg logit contributions
arl-0.173
ps-0.158
 stern-0.156
ags-0.142
outing-0.142
Token "
Token ID366
Feature activation+2.46e-02

Pos logit contributions
+0.022
light+0.020
 whenever+0.018
 awoke+0.017
?'+0.017
Neg logit contributions
arl-0.027
ps-0.024
ags-0.023
agged-0.022
orts-0.021
TokenThose
Token ID9627
Feature activation-2.15e-02

Pos logit contributions
arl+0.042
ps+0.035
ags+0.034
 stern+0.034
agged+0.033
Neg logit contributions
-0.041
light-0.040
 awoke-0.035
?'-0.034
pen-0.034
Token are
Token ID389
Feature activation-1.39e-02

Pos logit contributions
light+0.033
+0.032
 awoke+0.028
?'+0.027
pen+0.027
Neg logit contributions
arl-0.039
ps-0.034
ags-0.032
 stern-0.032
orts-0.031
Token herbs
Token ID25411
Feature activation-1.46e-01

Pos logit contributions
+0.024
light+0.024
 awoke+0.021
pen+0.020
?'+0.020
Neg logit contributions
arl-0.023
ps-0.019
ags-0.018
 stern-0.018
Their-0.017
Token,
Token ID11
Feature activation-3.94e-02

Pos logit contributions
light+0.195
+0.181
 awoke+0.172
 Tues+0.162
pen+0.161
Neg logit contributions
arl-0.286
ps-0.257
 stern-0.247
ags-0.238
 greedy-0.227
Token Lily
Token ID20037
Feature activation-1.70e-02

Pos logit contributions
+0.067
light+0.065
 awoke+0.057
?'+0.056
pen+0.055
Neg logit contributions
arl-0.065
ps-0.056
ags-0.053
 stern-0.053
agged-0.051
Token,"
Token ID553
Feature activation+2.37e-03

Pos logit contributions
+0.026
light+0.026
 awoke+0.024
 Tues+0.023
pen+0.022
Neg logit contributions
arl-0.029
 stern-0.025
ps-0.025
 greedy-0.023
ags-0.023
Token she
Token ID673
Feature activation-1.52e-02

Pos logit contributions
arl+0.005
ps+0.004
 stern+0.004
ags+0.004
Their+0.004
Neg logit contributions
-0.003
light-0.003
 awoke-0.002
pen-0.002
?'-0.002
Token said
Token ID531
Feature activation-4.69e-03

Pos logit contributions
+0.020
light+0.019
 awoke+0.016
?'+0.015
 whenever+0.015
Neg logit contributions
arl-0.032
ps-0.028
ags-0.027
agged-0.026
 stern-0.026
Token.
Token ID13
Feature activation-7.37e-03

Pos logit contributions
light+0.056
+0.055
 awoke+0.051
?'+0.049
pen+0.046
Neg logit contributions
arl-0.053
ps-0.050
 stern-0.046
ags-0.045
 greedy-0.043
Token Lily
Token ID20037
Feature activation-1.95e-03

Pos logit contributions
+0.011
light+0.011
 awoke+0.009
 whenever+0.009
?'+0.009
Neg logit contributions
arl-0.014
ps-0.012
ags-0.012
 stern-0.011
agged-0.011
Token didn
Token ID1422
Feature activation+2.06e-02

Pos logit contributions
light+0.003
+0.002
?'+0.002
pen+0.002
 awoke+0.002
Neg logit contributions
 stern-0.003
ps-0.003
outed-0.003
arl-0.003
Their-0.003
Token't
Token ID470
Feature activation+5.18e-02

Pos logit contributions
arl+0.029
ps+0.024
Their+0.023
 stern+0.022
ags+0.022
Neg logit contributions
light-0.042
 awoke-0.035
-0.034
 nights-0.034
 laid-0.034
Token like
Token ID588
Feature activation-1.96e-02

Pos logit contributions
arl+0.107
ps+0.094
ags+0.092
 stern+0.090
agged+0.089
Neg logit contributions
-0.069
light-0.063
 awoke-0.054
 whenever-0.054
?'-0.053
Token vegetables
Token ID13701
Feature activation-1.46e-01

Pos logit contributions
light+0.032
+0.030
 awoke+0.027
?'+0.026
pen+0.026
Neg logit contributions
arl-0.032
ps-0.030
 stern-0.030
 greedy-0.028
 innocent-0.027
Token,
Token ID11
Feature activation-1.69e-02

Pos logit contributions
light+0.190
+0.173
 awoke+0.157
?'+0.156
 Tues+0.143
Neg logit contributions
arl-0.289
ps-0.261
 stern-0.261
 greedy-0.246
outing-0.236
Token but
Token ID475
Feature activation-6.41e-02

Pos logit contributions
+0.025
light+0.024
 awoke+0.020
?'+0.020
pen+0.020
Neg logit contributions
arl-0.032
ps-0.028
ags-0.027
 stern-0.027
agged-0.026
Token she
Token ID673
Feature activation-5.12e-02

Pos logit contributions
light+0.089
pen+0.081
+0.081
?'+0.076
 awoke+0.073
Neg logit contributions
arl-0.113
 stern-0.109
ps-0.101
Their-0.100
 innocent-0.099
Token wanted
Token ID2227
Feature activation-7.69e-03

Pos logit contributions
light+0.078
+0.073
?'+0.068
pen+0.067
 awoke+0.065
Neg logit contributions
 stern-0.078
arl-0.074
outed-0.074
ps-0.073
 greedy-0.072
Token to
Token ID284
Feature activation-2.48e-02

Pos logit contributions
light+0.013
+0.013
 awoke+0.011
pen+0.011
?'+0.011
Neg logit contributions
arl-0.011
 stern-0.011
ps-0.011
ags-0.010
Their-0.010
Token healthy
Token ID5448
Feature activation-3.76e-02

Pos logit contributions
arl+0.019
ps+0.016
ags+0.015
 stern+0.015
agged+0.015
Neg logit contributions
-0.021
light-0.020
 awoke-0.018
 whenever-0.018
pen-0.018
Token food
Token ID2057
Feature activation-4.76e-02

Pos logit contributions
+0.075
light+0.071
?'+0.067
 awoke+0.064
pen+0.061
Neg logit contributions
arl-0.049
ps-0.044
 stern-0.041
ags-0.041
orts-0.039
Token.
Token ID13
Feature activation-6.57e-03

Pos logit contributions
light+0.079
+0.076
 awoke+0.070
?'+0.068
pen+0.066
Neg logit contributions
arl-0.074
ps-0.068
 stern-0.065
ks-0.061
uzz-0.059
Token There
Token ID1318
Feature activation-2.08e-02

Pos logit contributions
light+0.011
+0.010
pen+0.009
 awoke+0.009
?'+0.009
Neg logit contributions
arl-0.011
ps-0.010
 stern-0.009
ags-0.009
agged-0.009
Token were
Token ID547
Feature activation-3.86e-05

Pos logit contributions
 masse+0.033
light+0.032
 Tooth+0.031
ridge+0.030
 recite+0.030
Neg logit contributions
 stern-0.028
ps-0.025
outs-0.025
ks-0.025
Their-0.025
Token vegetables
Token ID13701
Feature activation-1.46e-01

Pos logit contributions
+0.000
light+0.000
?'+0.000
 awoke+0.000
pen+0.000
Neg logit contributions
arl-0.000
ps-0.000
 stern-0.000
ags-0.000
Their-0.000
Token,
Token ID11
Feature activation-1.80e-02

Pos logit contributions
light+0.230
+0.209
 Tues+0.209
?'+0.204
 awoke+0.202
Neg logit contributions
arl-0.223
ps-0.208
 stern-0.207
 greedy-0.188
outing-0.180
Token fruits
Token ID15921
Feature activation-1.09e-01

Pos logit contributions
+0.028
light+0.028
?'+0.024
 awoke+0.024
pen+0.024
Neg logit contributions
arl-0.031
ps-0.027
 stern-0.026
ags-0.026
Their-0.025
Token and
Token ID290
Feature activation-6.44e-03

Pos logit contributions
light+0.173
+0.159
?'+0.156
 recite+0.153
 masse+0.151
Neg logit contributions
arl-0.179
ps-0.158
 stern-0.152
 greedy-0.139
outing-0.137
Token nuts
Token ID14380
Feature activation-6.25e-02

Pos logit contributions
light+0.010
+0.009
?'+0.009
pen+0.009
 awoke+0.009
Neg logit contributions
ps-0.009
arl-0.009
 stern-0.009
 greedy-0.009
 innocent-0.009
Token.
Token ID13
Feature activation-4.99e-03

Pos logit contributions
light+0.091
+0.091
 awoke+0.079
?'+0.078
 Tues+0.077
Neg logit contributions
arl-0.113
ps-0.102
 stern-0.095
ags-0.091
uzz-0.090
Token bananas
Token ID35484
Feature activation-5.48e-02

Pos logit contributions
light+0.010
+0.010
?'+0.009
 awoke+0.009
pen+0.009
Neg logit contributions
arl-0.009
ps-0.009
 stern-0.008
Their-0.008
ags-0.008
Token.
Token ID13
Feature activation-7.41e-03

Pos logit contributions
light+0.080
+0.080
 awoke+0.069
?'+0.068
pen+0.065
Neg logit contributions
arl-0.103
ps-0.090
ags-0.085
 stern-0.084
agged-0.081
Token There
Token ID1318
Feature activation-2.25e-02

Pos logit contributions
+0.012
 whenever+0.010
light+0.009
 Tues+0.009
 bal+0.009
Neg logit contributions
arl-0.014
ps-0.012
ags-0.011
 stern-0.011
chie-0.011
Token were
Token ID547
Feature activation+7.72e-04

Pos logit contributions
light+0.032
 masse+0.028
+0.028
?'+0.027
 Tooth+0.027
Neg logit contributions
 stern-0.035
arl-0.035
ps-0.034
outs-0.033
 Lia-0.033
Token also
Token ID635
Feature activation-2.95e-02

Pos logit contributions
arl+0.001
ps+0.001
ags+0.001
 stern+0.001
agged+0.001
Neg logit contributions
-0.001
light-0.001
 awoke-0.001
 whenever-0.001
?'-0.001
Token vegetables
Token ID13701
Feature activation-1.46e-01

Pos logit contributions
+0.051
light+0.050
 awoke+0.044
?'+0.043
pen+0.043
Neg logit contributions
arl-0.047
ps-0.041
ags-0.039
 stern-0.039
Their-0.037
Token,
Token ID11
Feature activation-2.12e-02

Pos logit contributions
light+0.201
+0.185
 awoke+0.175
?'+0.170
 Tues+0.164
Neg logit contributions
arl-0.273
ps-0.250
 stern-0.242
 greedy-0.226
ks-0.222
Token like
Token ID588
Feature activation-1.26e-02

Pos logit contributions
+0.032
light+0.030
 whenever+0.026
 awoke+0.026
?'+0.025
Neg logit contributions
arl-0.040
ps-0.034
ags-0.033
 stern-0.032
agged-0.032
Token carrots
Token ID34397
Feature activation-1.16e-01

Pos logit contributions
+0.019
light+0.018
 awoke+0.016
 whenever+0.016
?'+0.015
Neg logit contributions
arl-0.023
ps-0.020
ags-0.019
 stern-0.019
agged-0.019
Token and
Token ID290
Feature activation-9.41e-03

Pos logit contributions
light+0.154
+0.142
 awoke+0.130
?'+0.129
 Tues+0.125
Neg logit contributions
arl-0.225
ps-0.204
 stern-0.194
ags-0.183
 greedy-0.183
Token peas
Token ID22589
Feature activation-1.02e-01

Pos logit contributions
light+0.015
+0.014
?'+0.013
pen+0.013
 awoke+0.013
Neg logit contributions
arl-0.014
ps-0.014
 stern-0.013
 greedy-0.013
 innocent-0.012
Token old
Token ID1468
Feature activation-5.45e-02

Pos logit contributions
light+0.052
+0.052
 awoke+0.048
pen+0.047
?'+0.046
Neg logit contributions
arl-0.020
 stern-0.013
ags-0.012
ps-0.012
uzz-0.011
Token,
Token ID11
Feature activation-2.68e-02

Pos logit contributions
+0.068
light+0.066
 awoke+0.054
?'+0.053
pen+0.052
Neg logit contributions
arl-0.114
ps-0.101
ags-0.098
 stern-0.097
agged-0.095
Token and
Token ID290
Feature activation-1.72e-02

Pos logit contributions
+0.039
light+0.038
?'+0.033
 awoke+0.033
pen+0.032
Neg logit contributions
arl-0.049
ps-0.043
 stern-0.043
ags-0.041
uzz-0.041
Token he
Token ID339
Feature activation-2.73e-02

Pos logit contributions
+0.019
light+0.017
 whenever+0.014
 awoke+0.013
 Tues+0.013
Neg logit contributions
arl-0.041
ps-0.036
ags-0.036
orts-0.034
agged-0.034
Token loved
Token ID6151
Feature activation+9.46e-04

Pos logit contributions
light+0.047
+0.045
pen+0.042
?'+0.042
 awoke+0.041
Neg logit contributions
arl-0.036
ps-0.034
 stern-0.034
 greedy-0.033
Their-0.033
Token vegetables
Token ID13701
Feature activation-1.46e-01

Pos logit contributions
arl+0.002
ps+0.001
ags+0.001
 stern+0.001
Their+0.001
Neg logit contributions
-0.002
light-0.001
 awoke-0.001
pen-0.001
?'-0.001
Token.
Token ID13
Feature activation-1.46e-02

Pos logit contributions
light+0.238
+0.236
 awoke+0.223
?'+0.206
pen+0.206
Neg logit contributions
arl-0.225
ps-0.195
 stern-0.193
ags-0.185
uzz-0.179
Token Every
Token ID3887
Feature activation+1.93e-04

Pos logit contributions
light+0.021
+0.021
 awoke+0.018
pen+0.018
 tucked+0.017
Neg logit contributions
arl-0.027
ags-0.023
ps-0.022
uzz-0.022
orts-0.022
Token day
Token ID1110
Feature activation-4.99e-03

Pos logit contributions
arl+0.000
 stern+0.000
 loud+0.000
Their+0.000
ags+0.000
Neg logit contributions
pen-0.000
light-0.000
-0.000
ridge-0.000
 recite-0.000
Token he
Token ID339
Feature activation-9.00e-03

Pos logit contributions
light+0.007
pen+0.007
+0.007
 recite+0.007
ridge+0.007
Neg logit contributions
arl-0.008
Their-0.007
 stern-0.007
 loud-0.007
ks-0.007
Token would
Token ID561
Feature activation+5.50e-03

Pos logit contributions
umm+0.015
 avail+0.014
light+0.014
+0.014
mmm+0.013
Neg logit contributions
ps-0.011
 greedy-0.010
suits-0.010
ks-0.009
arl-0.009
Token.
Token ID13
Feature activation-2.29e-03

Pos logit contributions
arl+0.010
ps+0.009
 stern+0.009
ags+0.009
agged+0.008
Neg logit contributions
-0.007
light-0.007
 awoke-0.006
pen-0.006
?'-0.006
Token It
Token ID632
Feature activation+1.29e-02

Pos logit contributions
light+0.003
+0.003
 awoke+0.003
pen+0.003
?'+0.003
Neg logit contributions
arl-0.004
ps-0.004
ags-0.004
 stern-0.003
orts-0.003
Token was
Token ID373
Feature activation-1.00e-02

Pos logit contributions
ps+0.024
 stern+0.023
arl+0.022
 innocent+0.021
Their+0.021
Neg logit contributions
light-0.015
-0.014
?'-0.014
pen-0.013
 Tues-0.013
Token even
Token ID772
Feature activation-6.66e-02

Pos logit contributions
light+0.018
+0.018
pen+0.016
 awoke+0.016
?'+0.016
Neg logit contributions
arl-0.014
ps-0.013
 stern-0.012
ags-0.011
Their-0.011
Token y
Token ID331
Feature activation-4.53e-02

Pos logit contributions
+0.114
light+0.112
 awoke+0.099
?'+0.098
pen+0.096
Neg logit contributions
arl-0.105
ps-0.091
 stern-0.089
ags-0.086
Their-0.082
Tokenumm
Token ID13929
Feature activation-1.46e-01

Pos logit contributions
+0.051
light+0.050
 awoke+0.050
 recite+0.043
?'+0.041
Neg logit contributions
arl-0.105
ags-0.088
agged-0.087
ps-0.084
outing-0.083
Tokenier
Token ID959
Feature activation-2.15e-02

Pos logit contributions
+0.310
light+0.284
 awoke+0.269
 recite+0.267
pen+0.262
Neg logit contributions
arl-0.197
ps-0.162
ags-0.142
agged-0.140
outing-0.132
Token than
Token ID621
Feature activation-6.89e-04

Pos logit contributions
+0.035
light+0.034
 awoke+0.033
?'+0.030
pen+0.030
Neg logit contributions
arl-0.033
ps-0.031
 stern-0.028
uzz-0.027
ks-0.027
Token the
Token ID262
Feature activation-1.82e-02

Pos logit contributions
+0.001
light+0.001
 awoke+0.001
?'+0.001
pen+0.001
Neg logit contributions
arl-0.002
ps-0.001
ags-0.001
 stern-0.001
agged-0.001
Token first
Token ID717
Feature activation+4.71e-02

Pos logit contributions
+0.029
light+0.027
 whenever+0.023
pen+0.023
 nap+0.023
Neg logit contributions
arl-0.034
ps-0.028
ags-0.028
agged-0.027
uzz-0.027
Token card
Token ID2657
Feature activation+5.95e-02

Pos logit contributions
arl+0.095
ps+0.082
ags+0.081
 stern+0.078
agged+0.077
Neg logit contributions
-0.066
light-0.063
 awoke-0.053
?'-0.052
 whenever-0.052
Token with
Token ID351
Feature activation+2.31e-02

Pos logit contributions
arl+0.019
ps+0.017
 stern+0.016
ags+0.016
uzz+0.015
Neg logit contributions
-0.011
light-0.011
 awoke-0.009
?'-0.009
pen-0.009
Token pictures
Token ID5986
Feature activation-2.37e-02

Pos logit contributions
arl+0.042
ps+0.036
ags+0.035
 stern+0.034
agged+0.033
Neg logit contributions
-0.036
light-0.035
 awoke-0.030
?'-0.029
pen-0.029
Token of
Token ID286
Feature activation-1.07e-02

Pos logit contributions
light+0.029
+0.029
 awoke+0.025
?'+0.025
 masse+0.025
Neg logit contributions
arl-0.044
ps-0.041
 stern-0.039
ags-0.037
uzz-0.036
Token fruits
Token ID15921
Feature activation-1.06e-01

Pos logit contributions
+0.017
light+0.017
 awoke+0.014
?'+0.014
pen+0.014
Neg logit contributions
arl-0.018
ps-0.016
ags-0.015
 stern-0.015
Their-0.015
Token,
Token ID11
Feature activation-1.80e-02

Pos logit contributions
light+0.144
+0.137
 awoke+0.126
 masse+0.124
?'+0.123
Neg logit contributions
arl-0.181
 stern-0.160
ps-0.160
ks-0.144
outs-0.142
Token vegetables
Token ID13701
Feature activation-1.46e-01

Pos logit contributions
+0.027
light+0.026
?'+0.023
 awoke+0.023
pen+0.022
Neg logit contributions
arl-0.031
ps-0.028
 stern-0.027
ags-0.026
Their-0.025
Token and
Token ID290
Feature activation-1.16e-02

Pos logit contributions
light+0.191
+0.180
 masse+0.169
 Tues+0.168
?'+0.163
Neg logit contributions
arl-0.256
ps-0.238
 stern-0.230
ks-0.206
 innocent-0.202
Token flowers
Token ID12734
Feature activation-9.03e-02

Pos logit contributions
light+0.018
+0.018
?'+0.017
pen+0.017
 awoke+0.016
Neg logit contributions
ps-0.016
arl-0.016
 stern-0.015
 greedy-0.015
 innocent-0.015
Token to
Token ID284
Feature activation+1.85e-03

Pos logit contributions
+0.124
light+0.121
 awoke+0.105
?'+0.104
pen+0.104
Neg logit contributions
arl-0.164
ps-0.149
 stern-0.140
ags-0.135
uzz-0.133
Token help
Token ID1037
Feature activation-3.22e-03

Pos logit contributions
arl+0.004
ps+0.003
ags+0.003
 stern+0.003
orts+0.003
Neg logit contributions
light-0.002
-0.002
 awoke-0.002
pen-0.002
 bal-0.002
Token remind
Token ID7101
Feature activation-3.82e-03

Pos logit contributions
light+0.006
+0.005
 awoke+0.005
pen+0.005
?'+0.005
Neg logit contributions
arl-0.005
ps-0.004
 stern-0.004
ags-0.004
Their-0.004
Token the
Token ID262
Feature activation+8.66e-03

Pos logit contributions
+0.023
light+0.022
 whenever+0.018
 awoke+0.018
?'+0.018
Neg logit contributions
arl-0.041
ps-0.035
ags-0.034
agged-0.034
 stern-0.034
Token girl
Token ID2576
Feature activation+3.19e-02

Pos logit contributions
arl+0.018
ags+0.016
agged+0.016
ps+0.016
uzz+0.015
Neg logit contributions
-0.012
light-0.010
 nap-0.009
 awoke-0.009
 mashed-0.009
Token and
Token ID290
Feature activation+5.17e-04

Pos logit contributions
arl+0.066
ps+0.058
ags+0.056
 stern+0.056
agged+0.054
Neg logit contributions
-0.041
light-0.039
 awoke-0.033
?'-0.032
pen-0.031
Token sw
Token ID1509
Feature activation+3.38e-02

Pos logit contributions
arl+0.001
ps+0.001
 stern+0.001
Their+0.001
uzz+0.001
Neg logit contributions
-0.001
light-0.001
?'-0.001
pen-0.001
 awoke-0.001
Tokenam
Token ID321
Feature activation-6.47e-04

Pos logit contributions
arl+0.077
 stern+0.064
 amused+0.061
 greedy+0.061
 bluff+0.060
Neg logit contributions
light-0.036
-0.033
pen-0.028
 laid-0.027
en-0.026
Token away
Token ID1497
Feature activation+7.84e-02

Pos logit contributions
+0.001
light+0.001
 whenever+0.001
 awoke+0.001
 tucked+0.001
Neg logit contributions
arl-0.001
ses-0.001
ps-0.001
outing-0.001
ags-0.001
Token back
Token ID736
Feature activation+3.83e-02

Pos logit contributions
arl+0.185
ags+0.163
ps+0.159
agged+0.158
orts+0.154
Neg logit contributions
-0.087
light-0.078
 whenever-0.068
 awoke-0.060
 tucked-0.060
Token into
Token ID656
Feature activation-1.36e-02

Pos logit contributions
arl+0.078
ps+0.068
ags+0.066
 stern+0.065
agged+0.064
Neg logit contributions
-0.052
light-0.050
 whenever-0.041
 awoke-0.041
?'-0.040
Token the
Token ID262
Feature activation+2.45e-03

Pos logit contributions
+0.018
light+0.017
 awoke+0.014
?'+0.014
 whenever+0.014
Neg logit contributions
arl-0.029
ps-0.025
ags-0.024
 stern-0.024
agged-0.024
Token sea
Token ID5417
Feature activation+1.55e-03

Pos logit contributions
arl+0.004
 stern+0.003
ps+0.003
ags+0.003
Their+0.003
Neg logit contributions
light-0.004
-0.004
pen-0.003
 awoke-0.003
 whenever-0.003
Token.
Token ID13
Feature activation+2.05e-03

Pos logit contributions
arl+0.003
ps+0.003
 stern+0.003
ags+0.003
Their+0.002
Neg logit contributions
-0.002
light-0.002
 awoke-0.002
?'-0.002
pen-0.002
Tokenâ
Token ID22940
Feature activation-1.21e-02

Pos logit contributions
+0.000
light+0.000
 awoke+0.000
?'+0.000
 whenever+0.000
Neg logit contributions
arl-0.000
ps-0.000
ags-0.000
 stern-0.000
agged-0.000
Token
Token ID26391
Feature activation+1.56e-02

Pos logit contributions
+0.019
light+0.017
?'+0.015
 awoke+0.014
pen+0.014
Neg logit contributions
arl-0.023
ps-0.020
 stern-0.020
uzz-0.019
ses-0.019
Token\n
Token ID198
Feature activation-1.78e-02

Pos logit contributions
arl+0.029
ps+0.025
ags+0.024
 stern+0.024
orts+0.023
Neg logit contributions
-0.023
light-0.022
 awoke-0.019
?'-0.019
pen-0.018
Token\n
Token ID198
Feature activation-1.67e-02

Pos logit contributions
+0.022
light+0.022
 awoke+0.018
?'+0.018
pen+0.018
Neg logit contributions
arl-0.037
ps-0.032
ags-0.031
 stern-0.031
Their-0.031
TokenThe
Token ID464
Feature activation+1.81e-02

Pos logit contributions
light+0.024
+0.024
 awoke+0.021
?'+0.020
pen+0.020
Neg logit contributions
arl-0.031
ps-0.026
ags-0.026
 stern-0.025
Their-0.025
Token second
Token ID1218
Feature activation+7.98e-02

Pos logit contributions
arl+0.031
ags+0.027
agged+0.027
ps+0.027
orts+0.026
Neg logit contributions
-0.031
light-0.028
 nap-0.025
 bal-0.025
 whenever-0.024
Token friend
Token ID1545
Feature activation+3.93e-02

Pos logit contributions
arl+0.126
ps+0.106
ags+0.101
 stern+0.101
agged+0.097
Neg logit contributions
-0.141
light-0.137
 awoke-0.121
?'-0.120
pen-0.118
Token said
Token ID531
Feature activation-2.52e-02

Pos logit contributions
arl+0.071
ps+0.064
 stern+0.063
uzz+0.061
orts+0.058
Neg logit contributions
light-0.054
-0.051
?'-0.045
 awoke-0.044
 Tues-0.044
Token,
Token ID11
Feature activation-2.55e-04

Pos logit contributions
+0.020
light+0.017
 awoke+0.013
?'+0.012
 whenever+0.012
Neg logit contributions
arl-0.066
ps-0.060
ags-0.059
agged-0.057
ses-0.057
Token
Token ID6184
Feature activation+1.03e-02

Pos logit contributions
+0.000
light+0.000
 awoke+0.000
 whenever+0.000
?'+0.000
Neg logit contributions
arl-0.000
ags-0.000
ps-0.000
agged-0.000
ses-0.000
Token
Token ID95
Feature activation+1.35e-02

Pos logit contributions
arl+0.018
ps+0.017
ags+0.016
 stern+0.015
agged+0.015
Neg logit contributions
-0.017
light-0.015
?'-0.014
pen-0.013
 whenever-0.013
Token.
Token ID13
Feature activation+9.66e-03

Pos logit contributions
+0.018
light+0.018
?'+0.016
 awoke+0.015
pen+0.015
Neg logit contributions
arl-0.027
ps-0.024
ags-0.023
 stern-0.023
ks-0.022
Token And
Token ID843
Feature activation+3.98e-02

Pos logit contributions
arl+0.015
ps+0.013
 stern+0.012
ags+0.012
agged+0.012
Neg logit contributions
-0.018
light-0.017
 whenever-0.015
?'-0.015
 awoke-0.014
Token they
Token ID484
Feature activation+1.77e-02

Pos logit contributions
arl+0.069
ps+0.059
ags+0.057
 stern+0.057
Their+0.055
Neg logit contributions
-0.064
light-0.063
 awoke-0.055
?'-0.054
pen-0.053
Token learned
Token ID4499
Feature activation+5.02e-02

Pos logit contributions
arl+0.035
ps+0.031
ags+0.030
 stern+0.029
agged+0.029
Neg logit contributions
-0.025
light-0.024
 awoke-0.020
?'-0.020
 whenever-0.019
Token that
Token ID326
Feature activation+3.75e-03

Pos logit contributions
arl+0.128
agged+0.116
ps+0.114
ags+0.113
outing+0.111
Neg logit contributions
-0.044
 whenever-0.033
light-0.029
 when-0.028
 --0.025
Token not
Token ID407
Feature activation+7.86e-02

Pos logit contributions
arl+0.005
ps+0.005
 stern+0.005
ses+0.005
 greedy+0.005
Neg logit contributions
light-0.006
-0.006
?'-0.006
 awoke-0.006
pen-0.006
Token all
Token ID477
Feature activation-3.70e-03

Pos logit contributions
arl+0.141
ps+0.122
ags+0.115
 stern+0.115
Their+0.111
Neg logit contributions
-0.122
light-0.122
 awoke-0.104
?'-0.104
pen-0.101
Token flowers
Token ID12734
Feature activation-8.98e-02

Pos logit contributions
+0.006
light+0.006
 awoke+0.005
?'+0.005
pen+0.005
Neg logit contributions
arl-0.007
ps-0.006
ags-0.006
 stern-0.006
agged-0.005
Token and
Token ID290
Feature activation+3.86e-03

Pos logit contributions
light+0.129
+0.126
 awoke+0.109
?'+0.109
pen+0.107
Neg logit contributions
arl-0.166
ps-0.151
ags-0.139
 stern-0.139
ses-0.132
Token pets
Token ID17252
Feature activation+3.59e-03

Pos logit contributions
arl+0.006
ps+0.005
ags+0.005
 stern+0.005
agged+0.005
Neg logit contributions
-0.007
light-0.006
 awoke-0.006
?'-0.006
pen-0.005
Token were
Token ID547
Feature activation-6.04e-04

Pos logit contributions
arl+0.007
ps+0.006
ags+0.006
 stern+0.006
agged+0.006
Neg logit contributions
-0.005
light-0.005
 awoke-0.004
?'-0.004
pen-0.004
Token was
Token ID373
Feature activation+3.09e-03

Pos logit contributions
light+0.038
+0.037
?'+0.032
 Tues+0.031
 awoke+0.031
Neg logit contributions
arl-0.046
ps-0.040
 stern-0.039
orts-0.037
uzz-0.037
Token wrong
Token ID2642
Feature activation+2.18e-02

Pos logit contributions
arl+0.005
uzz+0.004
orts+0.004
agged+0.004
ags+0.004
Neg logit contributions
-0.005
light-0.005
 pol-0.004
 bal-0.004
's-0.004
Token.
Token ID13
Feature activation+5.00e-03

Pos logit contributions
arl+0.036
ps+0.031
 stern+0.029
ags+0.028
ks+0.028
Neg logit contributions
light-0.037
-0.036
 awoke-0.034
pen-0.033
 recite-0.031
Token I
Token ID314
Feature activation-2.55e-02

Pos logit contributions
arl+0.010
ps+0.009
 stern+0.008
ags+0.008
ses+0.008
Neg logit contributions
light-0.007
-0.006
pen-0.006
 awoke-0.006
 whenever-0.005
Token should
Token ID815
Feature activation-1.10e-02

Pos logit contributions
light+0.036
+0.035
?'+0.030
 awoke+0.030
pen+0.029
Neg logit contributions
arl-0.049
ps-0.043
 stern-0.041
ags-0.040
orts-0.039
Token not
Token ID407
Feature activation+7.36e-02

Pos logit contributions
light+0.017
+0.017
 awoke+0.015
pen+0.014
?'+0.014
Neg logit contributions
arl-0.020
ps-0.017
 stern-0.017
ags-0.016
Their-0.016
Token have
Token ID423
Feature activation+1.87e-02

Pos logit contributions
arl+0.134
ps+0.107
 mad+0.105
ags+0.103
Their+0.103
Neg logit contributions
light-0.118
 awoke-0.104
 nights-0.101
illy-0.100
ridge-0.099
Token taken
Token ID2077
Feature activation+2.73e-02

Pos logit contributions
arl+0.034
ps+0.029
ags+0.028
 stern+0.028
agged+0.027
Neg logit contributions
-0.029
light-0.028
 awoke-0.024
?'-0.024
pen-0.023
Token your
Token ID534
Feature activation+1.97e-02

Pos logit contributions
arl+0.047
ps+0.041
ags+0.039
 stern+0.038
orts+0.037
Neg logit contributions
-0.044
light-0.043
 awoke-0.038
?'-0.037
pen-0.037
Token bike
Token ID7161
Feature activation+4.19e-02

Pos logit contributions
arl+0.032
ps+0.027
ags+0.026
 stern+0.026
agged+0.025
Neg logit contributions
-0.034
light-0.033
 awoke-0.029
?'-0.028
 whenever-0.028
Token.
Token ID13
Feature activation+7.40e-03

Pos logit contributions
arl+0.092
ps+0.083
ags+0.081
agged+0.081
ses+0.080
Neg logit contributions
-0.050
light-0.046
 whenever-0.041
?'-0.037
pen-0.036
Token.
Token ID13
Feature activation+3.00e-03

Pos logit contributions
arl+0.031
ps+0.028
ags+0.027
 stern+0.026
Their+0.026
Neg logit contributions
-0.021
light-0.021
 awoke-0.018
?'-0.017
pen-0.017
Token They
Token ID1119
Feature activation+2.30e-02

Pos logit contributions
arl+0.005
ps+0.004
ags+0.004
 stern+0.004
orts+0.004
Neg logit contributions
light-0.005
-0.005
 awoke-0.004
?'-0.004
pen-0.004
Token walk
Token ID2513
Feature activation+3.55e-02

Pos logit contributions
arl+0.044
ps+0.040
 stern+0.037
ags+0.037
uzz+0.036
Neg logit contributions
light-0.031
-0.027
?'-0.025
pen-0.025
 awoke-0.024
Token to
Token ID284
Feature activation+3.07e-02

Pos logit contributions
arl+0.094
outing+0.080
ps+0.079
ses+0.078
agged+0.078
Neg logit contributions
-0.033
 whenever-0.032
light-0.028
 night-0.027
 tomorrow-0.025
Token the
Token ID262
Feature activation+1.97e-02

Pos logit contributions
arl+0.076
agged+0.069
ps+0.067
outing+0.067
ags+0.064
Neg logit contributions
-0.031
light-0.025
 whenever-0.023
 nap-0.023
 prepare-0.022
Token train
Token ID4512
Feature activation+8.39e-02

Pos logit contributions
arl+0.035
agged+0.033
uzz+0.032
ps+0.032
ags+0.031
Neg logit contributions
-0.032
light-0.029
 night-0.025
 tucked-0.025
 visit-0.024
Token station
Token ID4429
Feature activation+8.73e-02

Pos logit contributions
arl+0.157
ps+0.136
ags+0.131
 stern+0.129
agged+0.126
Neg logit contributions
-0.126
light-0.122
 awoke-0.104
?'-0.102
 whenever-0.102
Token,
Token ID11
Feature activation-2.32e-03

Pos logit contributions
arl+0.217
ags+0.189
ps+0.187
agged+0.187
ses+0.179
Neg logit contributions
-0.084
light-0.079
 whenever-0.065
 awoke-0.060
 tucked-0.060
Token following
Token ID1708
Feature activation-1.43e-02

Pos logit contributions
+0.003
light+0.003
 whenever+0.002
 tucked+0.002
 awoke+0.002
Neg logit contributions
arl-0.005
agged-0.004
ags-0.004
ps-0.004
orts-0.004
Token the
Token ID262
Feature activation+1.14e-02

Pos logit contributions
+0.021
light+0.018
 whenever+0.016
 awoke+0.015
 Tues+0.015
Neg logit contributions
arl-0.031
agged-0.025
ps-0.025
ags-0.025
outing-0.024
Token steam
Token ID13324
Feature activation+3.22e-03

Pos logit contributions
arl+0.020
agged+0.019
uzz+0.019
ags+0.018
orts+0.017
Neg logit contributions
-0.018
light-0.017
 night-0.015
 tucked-0.015
 nap-0.014
Token play
Token ID711
Feature activation+3.30e-02

Pos logit contributions
light+0.011
+0.011
 awoke+0.010
?'+0.009
pen+0.009
Neg logit contributions
arl-0.013
ps-0.011
ags-0.010
Their-0.010
 stern-0.010
Token with
Token ID351
Feature activation+3.08e-02

Pos logit contributions
arl+0.070
ps+0.062
ags+0.060
 stern+0.059
agged+0.058
Neg logit contributions
-0.041
light-0.039
 awoke-0.032
?'-0.032
 whenever-0.031
Token Lily
Token ID20037
Feature activation+2.18e-03

Pos logit contributions
arl+0.065
ags+0.055
agged+0.055
ps+0.054
 stern+0.053
Neg logit contributions
-0.039
light-0.038
 whenever-0.033
pen-0.031
 nap-0.030
Token and
Token ID290
Feature activation-6.47e-03

Pos logit contributions
arl+0.005
ps+0.004
ags+0.004
 stern+0.004
agged+0.004
Neg logit contributions
-0.003
light-0.003
 awoke-0.002
?'-0.002
pen-0.002
Token Ben
Token ID3932
Feature activation-1.62e-02

Pos logit contributions
+0.010
light+0.010
 awoke+0.009
 whenever+0.009
?'+0.008
Neg logit contributions
arl-0.012
ps-0.010
ags-0.010
agged-0.009
 stern-0.009
Token anymore
Token ID7471
Feature activation+4.29e-02

Pos logit contributions
+0.021
light+0.020
 awoke+0.017
 whenever+0.016
?'+0.016
Neg logit contributions
arl-0.034
ps-0.029
ags-0.029
 stern-0.028
agged-0.028
Token.
Token ID13
Feature activation+1.79e-03

Pos logit contributions
arl+0.081
ps+0.071
 stern+0.068
ags+0.066
uzz+0.066
Neg logit contributions
-0.062
light-0.060
 awoke-0.052
?'-0.051
pen-0.050
Token He
Token ID679
Feature activation+1.12e-02

Pos logit contributions
arl+0.004
ps+0.004
ags+0.004
agged+0.004
outing+0.003
Neg logit contributions
-0.002
light-0.002
 whenever-0.001
 awoke-0.001
 nap-0.001
Token runs
Token ID4539
Feature activation+4.62e-02

Pos logit contributions
arl+0.018
ps+0.015
ags+0.014
 stern+0.014
agged+0.014
Neg logit contributions
-0.020
light-0.019
 awoke-0.017
?'-0.017
 whenever-0.016
Token back
Token ID736
Feature activation+5.32e-02

Pos logit contributions
arl+0.110
ags+0.100
ses+0.098
outing+0.098
agged+0.098
Neg logit contributions
-0.048
 whenever-0.042
 when-0.035
 since-0.034
 bal-0.034
Token to
Token ID284
Feature activation+3.08e-02

Pos logit contributions
arl+0.118
ps+0.105
ags+0.102
agged+0.101
 stern+0.100
Neg logit contributions
-0.061
light-0.057
 whenever-0.048
 awoke-0.046
?'-0.045
Token!"
Token ID2474
Feature activation+2.10e-02

Pos logit contributions
light+0.055
+0.055
 awoke+0.048
 Tues+0.048
pen+0.046
Neg logit contributions
arl-0.071
ps-0.061
 stern-0.059
outing-0.056
 mad-0.055
Token cried
Token ID16896
Feature activation+2.81e-02

Pos logit contributions
arl+0.034
 stern+0.030
ps+0.030
Their+0.029
ags+0.028
Neg logit contributions
-0.034
light-0.033
 awoke-0.029
pen-0.029
?'-0.028
Token Tim
Token ID5045
Feature activation+3.20e-02

Pos logit contributions
arl+0.053
ps+0.046
ags+0.044
 stern+0.044
agged+0.043
Neg logit contributions
-0.042
light-0.040
 awoke-0.034
?'-0.034
 whenever-0.033
Tokenmy
Token ID1820
Feature activation-6.37e-03

Pos logit contributions
arl+0.077
ps+0.069
ags+0.067
 stern+0.067
agged+0.065
Neg logit contributions
-0.030
light-0.029
 awoke-0.022
?'-0.021
pen-0.021
Token.
Token ID13
Feature activation+9.61e-03

Pos logit contributions
light+0.009
+0.009
 awoke+0.008
?'+0.008
 Tues+0.008
Neg logit contributions
arl-0.011
 stern-0.010
ps-0.010
ags-0.009
uzz-0.009
Token "
Token ID366
Feature activation+4.16e-02

Pos logit contributions
arl+0.018
ps+0.016
ags+0.015
agged+0.015
orts+0.014
Neg logit contributions
-0.015
light-0.014
 whenever-0.012
?'-0.012
 awoke-0.012
TokenMy
Token ID3666
Feature activation-1.74e-02

Pos logit contributions
arl+0.080
ps+0.069
ags+0.067
 stern+0.066
agged+0.065
Neg logit contributions
-0.060
light-0.058
 awoke-0.049
?'-0.048
pen-0.048
Token knee
Token ID10329
Feature activation-1.17e-02

Pos logit contributions
+0.028
light+0.028
 awoke+0.024
?'+0.023
pen+0.023
Neg logit contributions
arl-0.030
ps-0.026
ags-0.025
 stern-0.024
agged-0.024
Token hurts
Token ID20406
Feature activation-3.45e-02

Pos logit contributions
light+0.022
+0.021
 awoke+0.019
 Tues+0.019
pen+0.019
Neg logit contributions
arl-0.017
ps-0.015
 stern-0.014
ses-0.013
 greedy-0.012
Token!"
Token ID2474
Feature activation+2.47e-02

Pos logit contributions
light+0.050
+0.050
pen+0.044
 awoke+0.043
 recite+0.042
Neg logit contributions
arl-0.064
ps-0.055
 stern-0.053
ags-0.052
Their-0.050
Token\n
Token ID198
Feature activation+1.50e-03

Pos logit contributions
arl+0.052
ps+0.045
ags+0.044
agged+0.042
 stern+0.042
Neg logit contributions
-0.033
light-0.031
 awoke-0.026
 whenever-0.026
?'-0.026
Token she
Token ID673
Feature activation-2.49e-02

Pos logit contributions
+0.048
light+0.047
pen+0.041
 awoke+0.039
?'+0.038
Neg logit contributions
arl-0.079
 stern-0.072
ps-0.071
Their-0.068
ags-0.068
Token went
Token ID1816
Feature activation-1.89e-02

Pos logit contributions
+0.033
light+0.032
umm+0.030
 Tues+0.030
 masse+0.029
Neg logit contributions
arl-0.040
 stern-0.037
ps-0.037
outed-0.037
ks-0.036
Token to
Token ID284
Feature activation+1.12e-02

Pos logit contributions
+0.023
light+0.022
 awoke+0.018
?'+0.018
 whenever+0.018
Neg logit contributions
arl-0.041
ps-0.036
ags-0.034
agged-0.034
 stern-0.034
Token her
Token ID607
Feature activation+1.10e-02

Pos logit contributions
arl+0.020
ps+0.018
ags+0.018
 stern+0.017
Their+0.017
Neg logit contributions
light-0.016
-0.016
 awoke-0.014
 bal-0.013
pen-0.013
Token grand
Token ID4490
Feature activation+1.20e-02

Pos logit contributions
arl+0.019
ps+0.017
 stern+0.016
ags+0.016
ses+0.016
Neg logit contributions
-0.017
light-0.016
 awoke-0.015
pen-0.014
?'-0.014
Tokenparent
Token ID8000
Feature activation+2.15e-02

Pos logit contributions
arl+0.027
ps+0.025
ags+0.024
 stern+0.024
agged+0.023
Neg logit contributions
-0.013
light-0.012
 awoke-0.009
 whenever-0.009
 bal-0.009
Token's
Token ID338
Feature activation-4.68e-02

Pos logit contributions
arl+0.045
 stern+0.044
ps+0.041
ks+0.040
outs+0.039
Neg logit contributions
light-0.019
umm-0.016
 bal-0.016
 awoke-0.015
-0.014
Token house
Token ID2156
Feature activation-7.69e-03

Pos logit contributions
+0.058
light+0.055
 awoke+0.046
?'+0.044
pen+0.044
Neg logit contributions
arl-0.100
ps-0.088
ags-0.085
 stern-0.084
agged-0.082
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
+0.011
light+0.011
 awoke+0.009
pen+0.009
?'+0.009
Neg logit contributions
arl-0.014
ps-0.012
 stern-0.012
ags-0.012
uzz-0.011
Token She
Token ID1375
Feature activation-1.61e-02

Pos logit contributions
light+0.021
+0.021
pen+0.018
 awoke+0.018
?'+0.017
Neg logit contributions
arl-0.023
 stern-0.020
ps-0.019
ags-0.019
Their-0.018
Token chose
Token ID7690
Feature activation-2.53e-02

Pos logit contributions
light+0.021
+0.021
pen+0.018
?'+0.018
umm+0.017
Neg logit contributions
arl-0.029
 stern-0.027
ps-0.027
outed-0.026
Their-0.025
Token in
Token ID287
Feature activation-4.02e-03

Pos logit contributions
arl+0.081
ps+0.071
ags+0.069
 stern+0.068
agged+0.067
Neg logit contributions
-0.051
light-0.050
 awoke-0.041
?'-0.040
pen-0.040
Token the
Token ID262
Feature activation-1.57e-02

Pos logit contributions
+0.006
light+0.005
 awoke+0.005
?'+0.004
 whenever+0.004
Neg logit contributions
arl-0.008
ps-0.007
ags-0.007
 stern-0.007
agged-0.007
Token air
Token ID1633
Feature activation-1.59e-02

Pos logit contributions
+0.022
light+0.022
 whenever+0.019
pen+0.019
 awoke+0.018
Neg logit contributions
arl-0.029
ps-0.025
ags-0.025
 stern-0.025
Their-0.023
Token.
Token ID13
Feature activation-8.63e-03

Pos logit contributions
+0.022
light+0.021
 awoke+0.018
pen+0.018
?'+0.018
Neg logit contributions
arl-0.030
ps-0.027
 stern-0.026
ags-0.025
Their-0.024
Token He
Token ID679
Feature activation-1.16e-03

Pos logit contributions
+0.013
light+0.013
 awoke+0.011
?'+0.011
pen+0.011
Neg logit contributions
arl-0.016
ps-0.013
ags-0.013
 stern-0.013
agged-0.012
Token looked
Token ID3114
Feature activation+1.47e-02

Pos logit contributions
light+0.002
+0.002
?'+0.001
pen+0.001
 Tues+0.001
Neg logit contributions
arl-0.002
ps-0.002
 stern-0.002
Their-0.002
ks-0.002
Token around
Token ID1088
Feature activation-1.90e-02

Pos logit contributions
arl+0.028
ps+0.024
ags+0.023
agged+0.022
orts+0.022
Neg logit contributions
-0.023
light-0.021
 awoke-0.019
?'-0.018
 whenever-0.018
Token and
Token ID290
Feature activation-1.26e-02

Pos logit contributions
+0.025
light+0.025
 awoke+0.021
pen+0.020
?'+0.020
Neg logit contributions
arl-0.036
ps-0.033
 stern-0.032
ags-0.031
Their-0.030
Token noticed
Token ID6810
Feature activation-8.93e-04

Pos logit contributions
+0.017
light+0.016
?'+0.014
pen+0.013
 awoke+0.013
Neg logit contributions
arl-0.024
ps-0.022
 stern-0.022
ags-0.020
Their-0.020
Token a
Token ID257
Feature activation+4.41e-03

Pos logit contributions
+0.001
light+0.001
 awoke+0.001
?'+0.001
 whenever+0.001
Neg logit contributions
arl-0.002
ps-0.001
ags-0.001
 stern-0.001
agged-0.001
Token bird
Token ID6512
Feature activation+4.10e-03

Pos logit contributions
arl+0.007
ps+0.006
ags+0.006
agged+0.006
 stern+0.006
Neg logit contributions
-0.008
light-0.008
 awoke-0.007
?'-0.006
pen-0.006
Token that
Token ID326
Feature activation-2.51e-03

Pos logit contributions
arl+0.041
ps+0.037
 stern+0.036
ags+0.035
Their+0.034
Neg logit contributions
light-0.025
-0.025
 awoke-0.021
pen-0.020
?'-0.020
Token day
Token ID1110
Feature activation-1.18e-02

Pos logit contributions
light+0.004
pen+0.003
ridge+0.003
+0.003
 bal+0.003
Neg logit contributions
arl-0.004
 stern-0.004
ps-0.003
 apology-0.003
Their-0.003
Token on
Token ID319
Feature activation+2.42e-02

Pos logit contributions
light+0.022
+0.021
pen+0.019
 awoke+0.019
 laid+0.019
Neg logit contributions
arl-0.017
ps-0.014
 stern-0.014
ags-0.013
agged-0.013
Token,
Token ID11
Feature activation+3.39e-05

Pos logit contributions
 stern+0.042
arl+0.042
ps+0.039
 sorry+0.038
Their+0.037
Neg logit contributions
light-0.032
-0.027
 recite-0.027
 awoke-0.026
pen-0.026
Token he
Token ID339
Feature activation-1.01e-02

Pos logit contributions
arl+0.000
 stern+0.000
ps+0.000
uzz+0.000
orts+0.000
Neg logit contributions
light-0.000
-0.000
 awoke-0.000
?'-0.000
 tucked-0.000
Token was
Token ID373
Feature activation-2.48e-03

Pos logit contributions
light+0.015
+0.013
 supper+0.013
 nap+0.013
?'+0.013
Neg logit contributions
ps-0.016
arl-0.014
 greedy-0.013
ks-0.013
 stern-0.013
Token a
Token ID257
Feature activation+2.33e-03

Pos logit contributions
light+0.004
+0.004
 awoke+0.003
pen+0.003
?'+0.003
Neg logit contributions
arl-0.004
ps-0.004
 stern-0.004
ags-0.004
Their-0.004
Token much
Token ID881
Feature activation+3.13e-02

Pos logit contributions
arl+0.003
 stern+0.003
ps+0.002
Their+0.002
ags+0.002
Neg logit contributions
light-0.004
-0.004
 awoke-0.004
pen-0.004
?'-0.004
Token happier
Token ID23030
Feature activation+1.61e-02

Pos logit contributions
arl+0.050
 stern+0.044
ps+0.042
Their+0.040
ags+0.040
Neg logit contributions
light-0.053
-0.052
 awoke-0.045
?'-0.045
pen-0.045
Token man
Token ID582
Feature activation+3.23e-02

Pos logit contributions
arl+0.024
 stern+0.021
ps+0.019
 greedy+0.019
 innocent+0.018
Neg logit contributions
-0.029
light-0.028
ridge-0.025
pen-0.025
 awoke-0.025
Token.
Token ID13
Feature activation-7.85e-03

Pos logit contributions
arl+0.057
ps+0.052
 stern+0.052
ks+0.049
uzz+0.048
Neg logit contributions
light-0.042
-0.041
 Tues-0.037
?'-0.037
 awoke-0.036
Token be
Token ID307
Feature activation-1.02e-02

Pos logit contributions
arl+0.098
ps+0.089
 stern+0.084
 greedy+0.082
Their+0.081
Neg logit contributions
light-0.103
-0.089
 awoke-0.088
 Tues-0.084
 nights-0.084
Token so
Token ID523
Feature activation-4.29e-03

Pos logit contributions
+0.015
light+0.013
 laid+0.013
 whenever+0.012
pen+0.012
Neg logit contributions
arl-0.020
orts-0.016
ses-0.016
ags-0.015
agged-0.015
Token spoiled
Token ID34781
Feature activation-1.57e-02

Pos logit contributions
+0.007
 chew+0.006
 bal+0.006
 lay+0.006
 whenever+0.006
Neg logit contributions
arl-0.006
ags-0.006
ses-0.006
suits-0.006
agged-0.006
Token anymore
Token ID7471
Feature activation+2.81e-02

Pos logit contributions
light+0.022
+0.022
 awoke+0.019
?'+0.018
pen+0.018
Neg logit contributions
arl-0.030
ps-0.026
 stern-0.025
ags-0.025
uzz-0.024
Token!
Token ID0
Feature activation-1.01e-04

Pos logit contributions
arl+0.044
ps+0.040
 stern+0.038
ks+0.038
uzz+0.038
Neg logit contributions
light-0.044
-0.044
 awoke-0.040
 masse-0.038
?'-0.038
Token<|endoftext|>
Token ID50256
Feature activation-2.63e-02

Pos logit contributions
+0.000
light+0.000
 awoke+0.000
?'+0.000
 whenever+0.000
Neg logit contributions
arl-0.000
ps-0.000
ags-0.000
 stern-0.000
agged-0.000
TokenJim
Token ID18050
Feature activation-1.35e-02

Pos logit contributions
light+0.036
+0.035
 awoke+0.029
?'+0.028
pen+0.028
Neg logit contributions
arl-0.051
ps-0.044
ags-0.043
 stern-0.042
agged-0.042
Token was
Token ID373
Feature activation-2.72e-02

Pos logit contributions
umm+0.024
 masse+0.023
light+0.022
 avail+0.022
 supper+0.021
Neg logit contributions
ps-0.017
ks-0.016
arl-0.016
ighed-0.015
 greedy-0.015
Token a
Token ID257
Feature activation-1.76e-02

Pos logit contributions
+0.044
light+0.044
 awoke+0.038
pen+0.037
 whenever+0.036
Neg logit contributions
arl-0.046
ps-0.040
ags-0.040
 stern-0.039
agged-0.038
Token strong
Token ID1913
Feature activation-3.77e-02

Pos logit contributions
+0.031
light+0.029
?'+0.026
 whenever+0.026
 awoke+0.026
Neg logit contributions
arl-0.029
ps-0.026
ags-0.025
agged-0.024
orts-0.023
Token boy
Token ID2933
Feature activation+1.05e-02

Pos logit contributions
+0.063
light+0.061
 awoke+0.053
?'+0.052
 whenever+0.052
Neg logit contributions
arl-0.064
ps-0.055
ags-0.053
 stern-0.051
agged-0.051
Token pants
Token ID12581
Feature activation-5.34e-02

Pos logit contributions
light+0.037
+0.037
 awoke+0.033
pen+0.032
?'+0.031
Neg logit contributions
arl-0.045
ps-0.038
ags-0.036
uzz-0.034
 stern-0.034
Token!
Token ID0
Feature activation-1.67e-03

Pos logit contributions
light+0.072
+0.071
 awoke+0.064
pen+0.060
?'+0.059
Neg logit contributions
arl-0.104
ps-0.090
ags-0.086
 stern-0.085
uzz-0.083
Token They
Token ID1119
Feature activation+1.65e-02

Pos logit contributions
light+0.003
+0.003
pen+0.003
?'+0.002
 recite+0.002
Neg logit contributions
arl-0.003
ps-0.002
uzz-0.002
ags-0.002
 stern-0.002
Token will
Token ID481
Feature activation+8.75e-03

Pos logit contributions
ps+0.021
arl+0.021
 stern+0.018
 greedy+0.017
 innocent+0.017
Neg logit contributions
light-0.029
 Tues-0.027
 masse-0.026
?'-0.026
-0.026
Token help
Token ID1037
Feature activation-7.58e-04

Pos logit contributions
arl+0.018
ps+0.015
ags+0.014
Their+0.014
 stern+0.014
Neg logit contributions
light-0.012
-0.011
 awoke-0.011
?'-0.010
pen-0.010
Token me
Token ID502
Feature activation-2.58e-02

Pos logit contributions
light+0.001
 awoke+0.001
+0.001
umm+0.001
 bal+0.001
Neg logit contributions
arl-0.001
ps-0.001
Their-0.001
 stern-0.001
 loud-0.001
Token mail
Token ID6920
Feature activation-1.19e-03

Pos logit contributions
light+0.033
+0.032
 awoke+0.031
 Tues+0.028
 nights+0.027
Neg logit contributions
arl-0.051
ps-0.045
ags-0.043
 stern-0.041
Their-0.040
Token letters
Token ID7475
Feature activation-8.77e-03

Pos logit contributions
+0.002
light+0.002
 awoke+0.002
pen+0.002
?'+0.002
Neg logit contributions
arl-0.002
ps-0.002
ags-0.002
 stern-0.002
Their-0.002
Token!"
Token ID2474
Feature activation+3.60e-02

Pos logit contributions
light+0.012
+0.012
 awoke+0.011
 masse+0.011
pen+0.011
Neg logit contributions
arl-0.016
ps-0.014
ags-0.013
uzz-0.013
 stern-0.012
Token\n
Token ID198
Feature activation+1.71e-02

Pos logit contributions
arl+0.061
 stern+0.051
Their+0.048
ags+0.048
 Their+0.046
Neg logit contributions
light-0.061
-0.059
pen-0.052
 awoke-0.051
ridge-0.049
Token\n
Token ID198
Feature activation+1.40e-02

Pos logit contributions
arl+0.031
Their+0.027
 stern+0.025
ags+0.024
ps+0.024
Neg logit contributions
light-0.028
-0.026
umm-0.024
 awoke-0.023
pen-0.023
Token saw
Token ID2497
Feature activation+3.69e-02

Pos logit contributions
+0.002
light+0.002
?'+0.002
pen+0.002
 awoke+0.002
Neg logit contributions
arl-0.003
ps-0.003
 stern-0.003
ags-0.003
Their-0.003
Token planes
Token ID13016
Feature activation-1.72e-02

Pos logit contributions
arl+0.075
ags+0.062
ps+0.061
agged+0.061
orts+0.059
Neg logit contributions
-0.054
light-0.051
 whenever-0.046
 awoke-0.043
 tucked-0.043
Token flying
Token ID7348
Feature activation+2.58e-02

Pos logit contributions
+0.025
light+0.024
 awoke+0.021
?'+0.021
pen+0.020
Neg logit contributions
arl-0.031
ps-0.028
ags-0.026
 stern-0.026
Their-0.025
Token in
Token ID287
Feature activation+1.00e-02

Pos logit contributions
arl+0.060
 apology+0.052
 stern+0.050
ses+0.050
agged+0.050
Neg logit contributions
-0.032
light-0.030
 whenever-0.028
 night-0.023
 since-0.023
Token the
Token ID262
Feature activation-6.80e-03

Pos logit contributions
arl+0.022
ps+0.019
agged+0.018
ags+0.018
 stern+0.018
Neg logit contributions
-0.013
light-0.012
 whenever-0.010
 awoke-0.010
?'-0.009
Token sky
Token ID6766
Feature activation-1.24e-02

Pos logit contributions
+0.009
light+0.009
?'+0.007
pen+0.007
 awoke+0.007
Neg logit contributions
arl-0.013
ps-0.012
ags-0.012
 stern-0.011
agged-0.011
Token.
Token ID13
Feature activation-7.74e-03

Pos logit contributions
+0.018
light+0.017
 awoke+0.016
?'+0.015
pen+0.015
Neg logit contributions
arl-0.022
ps-0.020
ags-0.018
 stern-0.018
uzz-0.018
Token Lily
Token ID20037
Feature activation-6.24e-04

Pos logit contributions
+0.013
light+0.013
 awoke+0.011
?'+0.011
pen+0.011
Neg logit contributions
arl-0.013
ps-0.011
ags-0.010
 stern-0.010
agged-0.010
Token felt
Token ID2936
Feature activation-2.26e-02

Pos logit contributions
umm+0.001
light+0.001
 supper+0.001
 masse+0.001
?'+0.001
Neg logit contributions
ps-0.001
 stern-0.001
ks-0.001
outed-0.001
arl-0.001
Token aw
Token ID3253
Feature activation+9.19e-03

Pos logit contributions
light+0.048
+0.045
pen+0.041
?'+0.041
 awoke+0.041
Neg logit contributions
arl-0.027
ps-0.023
 stern-0.022
ags-0.021
Their-0.020
Tokened
Token ID276
Feature activation-2.64e-03

Pos logit contributions
arl+0.015
ps+0.013
ags+0.013
agged+0.012
 stern+0.012
Neg logit contributions
-0.015
light-0.015
 awoke-0.013
?'-0.013
pen-0.013
Token as
Token ID355
Feature activation+2.11e-02

Pos logit contributions
arl+0.041
ps+0.035
 stern+0.032
ags+0.031
Their+0.030
Neg logit contributions
light-0.042
-0.040
?'-0.038
 awoke-0.038
pen-0.035
Token he
Token ID339
Feature activation-1.53e-02

Pos logit contributions
arl+0.044
ps+0.040
uzz+0.038
 stern+0.038
Their+0.038
Neg logit contributions
-0.024
light-0.024
 awoke-0.021
?'-0.020
pen-0.019
Token could
Token ID714
Feature activation-3.83e-02

Pos logit contributions
+0.020
light+0.019
?'+0.017
 Tues+0.016
 awoke+0.016
Neg logit contributions
arl-0.029
ps-0.027
 stern-0.025
agged-0.025
uzz-0.024
Token.
Token ID13
Feature activation-5.84e-03

Pos logit contributions
light+0.056
+0.055
 masse+0.048
 Tues+0.048
 awoke+0.047
Neg logit contributions
arl-0.068
ps-0.063
 stern-0.059
ags-0.058
uzz-0.058
Token
Token ID220
Feature activation+3.88e-02

Pos logit contributions
light+0.010
+0.010
 awoke+0.009
pen+0.009
?'+0.008
Neg logit contributions
arl-0.009
ags-0.008
ps-0.008
 stern-0.007
orts-0.007
Token\n
Token ID198
Feature activation-1.38e-02

Pos logit contributions
arl+0.076
ps+0.065
ags+0.064
orts+0.062
agged+0.062
Neg logit contributions
light-0.055
-0.053
 awoke-0.047
 tucked-0.043
pen-0.043
Token\n
Token ID198
Feature activation-1.21e-02

Pos logit contributions
light+0.019
+0.018
 awoke+0.015
?'+0.015
pen+0.015
Neg logit contributions
arl-0.028
ps-0.023
ags-0.023
 stern-0.023
Their-0.023
TokenWhen
Token ID2215
Feature activation+3.11e-03

Pos logit contributions
light+0.019
+0.018
 awoke+0.017
?'+0.016
pen+0.016
Neg logit contributions
arl-0.021
Their-0.018
ps-0.017
 stern-0.017
ags-0.016
Token he
Token ID339
Feature activation-2.38e-03

Pos logit contributions
arl+0.006
ps+0.006
 stern+0.006
Their+0.006
ags+0.005
Neg logit contributions
-0.004
light-0.004
pen-0.003
?'-0.003
 awoke-0.003
Token had
Token ID550
Feature activation+5.15e-03

Pos logit contributions
light+0.003
+0.003
?'+0.003
 masse+0.003
umm+0.003
Neg logit contributions
arl-0.004
ps-0.004
Their-0.003
 stern-0.003
 Lia-0.003
Token had
Token ID550
Feature activation+4.24e-04

Pos logit contributions
arl+0.008
ps+0.008
 stern+0.008
Their+0.008
agged+0.008
Neg logit contributions
light-0.008
-0.007
?'-0.006
pen-0.006
 whenever-0.006
Token\n
Token ID198
Feature activation+1.56e-02

Pos logit contributions
arl+0.067
ps+0.066
ags+0.062
agged+0.060
outing+0.058
Neg logit contributions
-0.025
light-0.018
 whenever-0.018
?'-0.015
 bal-0.015
Token\n
Token ID198
Feature activation+1.54e-02

Pos logit contributions
arl+0.033
ps+0.029
ags+0.028
 stern+0.028
agged+0.027
Neg logit contributions
-0.020
light-0.019
 awoke-0.016
?'-0.015
pen-0.015
Token"
Token ID1
Feature activation+4.94e-02

Pos logit contributions
arl+0.032
ps+0.028
ags+0.027
 stern+0.027
agged+0.026
Neg logit contributions
-0.021
light-0.020
 awoke-0.016
?'-0.016
 whenever-0.016
TokenSorry
Token ID14385
Feature activation+1.29e-02

Pos logit contributions
arl+0.091
ps+0.084
ags+0.082
agged+0.077
ses+0.076
Neg logit contributions
-0.082
light-0.066
 whenever-0.063
 bal-0.061
 masse-0.060
Token,
Token ID11
Feature activation+8.00e-03

Pos logit contributions
arl+0.022
 stern+0.020
ps+0.019
 sorry+0.019
 loud+0.019
Neg logit contributions
light-0.018
pen-0.018
 awoke-0.017
-0.017
 drifted-0.016
Token sweet
Token ID6029
Feature activation-4.99e-02

Pos logit contributions
arl+0.014
agged+0.013
ags+0.013
orts+0.012
uzz+0.012
Neg logit contributions
-0.013
light-0.011
 tucked-0.010
 nap-0.010
 whenever-0.010
Tokenie
Token ID494
Feature activation-3.36e-02

Pos logit contributions
+0.071
light+0.068
 awoke+0.058
?'+0.057
pen+0.056
Neg logit contributions
arl-0.098
ps-0.085
ags-0.082
 stern-0.081
agged-0.079
Token,
Token ID11
Feature activation+1.12e-02

Pos logit contributions
+0.045
light+0.043
 awoke+0.036
?'+0.035
pen+0.035
Neg logit contributions
arl-0.068
ps-0.059
 stern-0.057
ags-0.057
orts-0.055
Token we
Token ID356
Feature activation-1.49e-03

Pos logit contributions
arl+0.022
ps+0.019
ags+0.018
 stern+0.018
agged+0.017
Neg logit contributions
-0.016
light-0.016
 awoke-0.013
?'-0.013
pen-0.013
Token don
Token ID836
Feature activation+2.96e-02

Pos logit contributions
light+0.002
+0.002
 Tues+0.002
 awoke+0.002
 masse+0.002
Neg logit contributions
arl-0.003
ps-0.002
 stern-0.002
ags-0.002
 greedy-0.002
Token't
Token ID470
Feature activation+6.03e-02

Pos logit contributions
arl+0.041
ps+0.035
 stern+0.032
ags+0.032
Their+0.032
Neg logit contributions
light-0.060
 awoke-0.052
-0.051
 laid-0.050
 nights-0.049
Token He
Token ID679
Feature activation+1.06e-02

Pos logit contributions
arl+0.010
ps+0.009
ags+0.009
agged+0.009
 stern+0.009
Neg logit contributions
-0.009
light-0.008
 awoke-0.007
?'-0.007
 whenever-0.007
Token knows
Token ID4206
Feature activation+1.10e-02

Pos logit contributions
arl+0.017
ps+0.016
 stern+0.015
ses+0.014
ags+0.013
Neg logit contributions
light-0.018
-0.017
 Tues-0.015
pen-0.015
?'-0.015
Token a
Token ID257
Feature activation+8.98e-03

Pos logit contributions
arl+0.018
ps+0.016
 stern+0.016
uzz+0.015
ags+0.015
Neg logit contributions
light-0.017
-0.017
 awoke-0.015
pen-0.015
?'-0.014
Token lot
Token ID1256
Feature activation+5.37e-02

Pos logit contributions
arl+0.015
ps+0.013
agged+0.013
ags+0.013
orts+0.012
Neg logit contributions
-0.015
light-0.014
 awoke-0.013
?'-0.013
 nap-0.013
Token about
Token ID546
Feature activation+1.83e-02

Pos logit contributions
arl+0.116
ps+0.103
ags+0.101
agged+0.100
 stern+0.099
Neg logit contributions
-0.064
light-0.058
 whenever-0.048
?'-0.048
 awoke-0.048
Token stones
Token ID14966
Feature activation-8.20e-02

Pos logit contributions
arl+0.032
ps+0.027
ags+0.026
 stern+0.026
Their+0.025
Neg logit contributions
-0.029
light-0.029
 awoke-0.025
?'-0.024
pen-0.024
Token and
Token ID290
Feature activation-3.51e-03

Pos logit contributions
light+0.109
+0.109
 awoke+0.092
pen+0.088
?'+0.088
Neg logit contributions
arl-0.161
ps-0.143
 stern-0.135
ags-0.134
orts-0.128
Token metals
Token ID21782
Feature activation-3.83e-02

Pos logit contributions
light+0.006
+0.006
 awoke+0.005
?'+0.005
pen+0.005
Neg logit contributions
arl-0.006
ps-0.005
 stern-0.005
ags-0.005
ses-0.005
Token.
Token ID13
Feature activation+6.60e-03

Pos logit contributions
+0.054
light+0.053
 awoke+0.046
?'+0.044
pen+0.043
Neg logit contributions
arl-0.074
ps-0.065
ags-0.062
 stern-0.061
orts-0.059
Token He
Token ID679
Feature activation+1.02e-02

Pos logit contributions
arl+0.012
ps+0.011
ags+0.011
 stern+0.010
agged+0.010
Neg logit contributions
-0.010
light-0.009
 awoke-0.008
?'-0.008
 whenever-0.008
Token will
Token ID481
Feature activation+1.33e-02

Pos logit contributions
arl+0.014
ps+0.014
 stern+0.013
ses+0.012
 greedy+0.012
Neg logit contributions
light-0.018
-0.017
?'-0.016
 Tues-0.016
 whenever-0.016
Token The
Token ID383
Feature activation-5.36e-03

Pos logit contributions
+0.038
light+0.037
pen+0.033
 awoke+0.033
?'+0.033
Neg logit contributions
arl-0.028
uzz-0.022
ags-0.022
ps-0.021
ses-0.021
Token bear
Token ID6842
Feature activation+1.82e-02

Pos logit contributions
+0.007
light+0.007
 awoke+0.006
 whenever+0.005
?'+0.005
Neg logit contributions
arl-0.011
ps-0.010
ags-0.009
 stern-0.009
Their-0.009
Token loved
Token ID6151
Feature activation-4.80e-03

Pos logit contributions
 greedy+0.023
uzz+0.022
 stern+0.022
arl+0.022
ks+0.022
Neg logit contributions
light-0.029
 avail-0.028
-0.028
umm-0.028
 masse-0.027
Token to
Token ID284
Feature activation-2.29e-02

Pos logit contributions
+0.008
light+0.008
pen+0.007
?'+0.007
 awoke+0.007
Neg logit contributions
arl-0.008
ps-0.007
ags-0.007
 stern-0.006
 greedy-0.006
Token eat
Token ID4483
Feature activation-5.40e-02

Pos logit contributions
+0.034
light+0.034
pen+0.029
 since+0.028
oph+0.028
Neg logit contributions
arl-0.041
ps-0.036
ags-0.035
 apology-0.033
 stern-0.033
Token honey
Token ID12498
Feature activation-1.01e-01

Pos logit contributions
light+0.103
+0.101
 awoke+0.089
?'+0.086
 recite+0.085
Neg logit contributions
arl-0.073
ps-0.064
ags-0.063
 loud-0.061
 stern-0.061
Token from
Token ID422
Feature activation-2.93e-02

Pos logit contributions
+0.161
light+0.152
 awoke+0.145
?'+0.139
pen+0.134
Neg logit contributions
arl-0.158
ps-0.149
 stern-0.140
ags-0.136
ows-0.132
Token the
Token ID262
Feature activation-1.87e-02

Pos logit contributions
+0.047
light+0.045
 awoke+0.038
pen+0.038
?'+0.037
Neg logit contributions
arl-0.051
ps-0.046
ags-0.044
 stern-0.043
ses-0.041
Token be
Token ID307
Feature activation+8.41e-03

Pos logit contributions
+0.031
light+0.030
pen+0.028
?'+0.026
 awoke+0.026
Neg logit contributions
arl-0.032
ps-0.025
ags-0.025
 stern-0.024
 greedy-0.024
Tokeneh
Token ID17231
Feature activation-4.85e-02

Pos logit contributions
arl+0.013
ps+0.011
ags+0.011
agged+0.011
 stern+0.010
Neg logit contributions
-0.015
light-0.014
 awoke-0.013
pen-0.013
?'-0.013
Tokenive
Token ID425
Feature activation-5.63e-02

Pos logit contributions
+0.068
light+0.064
 awoke+0.059
?'+0.057
 Tues+0.053
Neg logit contributions
arl-0.097
ags-0.082
ps-0.080
agged-0.079
orts-0.078
Token it
Token ID340
Feature activation-1.00e-02

Pos logit contributions
+0.013
light+0.013
 awoke+0.011
?'+0.010
pen+0.010
Neg logit contributions
arl-0.020
ps-0.017
ags-0.017
 stern-0.016
Their-0.016
Token,
Token ID11
Feature activation-2.32e-02

Pos logit contributions
+0.016
light+0.016
 awoke+0.014
pen+0.014
ridge+0.013
Neg logit contributions
arl-0.017
ags-0.014
 stern-0.014
ps-0.014
agged-0.013
Token like
Token ID588
Feature activation-1.95e-02

Pos logit contributions
light+0.036
+0.035
 awoke+0.031
pen+0.030
umm+0.029
Neg logit contributions
arl-0.038
 stern-0.035
ps-0.033
 greedy-0.033
ags-0.032
Token apples
Token ID22514
Feature activation-8.46e-02

Pos logit contributions
+0.030
light+0.030
 awoke+0.026
?'+0.025
pen+0.025
Neg logit contributions
arl-0.035
ps-0.031
 stern-0.029
ags-0.029
Their-0.028
Token and
Token ID290
Feature activation-1.55e-02

Pos logit contributions
light+0.123
+0.113
 awoke+0.109
?'+0.103
 recite+0.100
Neg logit contributions
arl-0.158
ps-0.139
 stern-0.130
ags-0.128
Their-0.128
Token carrots
Token ID34397
Feature activation-1.13e-01

Pos logit contributions
light+0.028
+0.025
?'+0.025
 awoke+0.024
pen+0.024
Neg logit contributions
arl-0.022
ps-0.022
 greedy-0.019
 stern-0.019
Their-0.018
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
+0.171
light+0.168
 awoke+0.148
?'+0.143
pen+0.139
Neg logit contributions
arl-0.206
ps-0.180
ags-0.169
 stern-0.167
Their-0.163
Token
Token ID220
Feature activation+2.70e-02

Pos logit contributions
+0.016
light+0.015
 awoke+0.013
?'+0.012
pen+0.012
Neg logit contributions
arl-0.024
ps-0.021
ags-0.020
 stern-0.020
agged-0.020
Token\n
Token ID198
Feature activation-2.21e-02

Pos logit contributions
arl+0.062
ps+0.057
 stern+0.054
ags+0.054
ses+0.053
Neg logit contributions
-0.029
light-0.026
 whenever-0.022
?'-0.022
 awoke-0.020
Token\n
Token ID198
Feature activation-2.02e-02

Pos logit contributions
+0.025
light+0.023
 whenever+0.019
?'+0.019
 awoke+0.019
Neg logit contributions
arl-0.049
ps-0.045
ags-0.043
 stern-0.042
agged-0.041
TokenOne
Token ID3198
Feature activation+2.61e-02

Pos logit contributions
+0.027
light+0.027
 awoke+0.022
?'+0.022
pen+0.022
Neg logit contributions
arl-0.041
ps-0.035
ags-0.034
 stern-0.034
agged-0.033
Token made
Token ID925
Feature activation-1.14e-02

Pos logit contributions
arl+0.015
 stern+0.015
ps+0.014
Their+0.013
uzz+0.013
Neg logit contributions
light-0.013
-0.013
?'-0.012
pen-0.012
 Tues-0.011
Token sure
Token ID1654
Feature activation+8.97e-03

Pos logit contributions
light+0.019
+0.019
 awoke+0.016
pen+0.016
?'+0.016
Neg logit contributions
arl-0.018
ps-0.017
 stern-0.016
ags-0.016
Their-0.015
Token he
Token ID339
Feature activation-3.74e-03

Pos logit contributions
arl+0.017
ps+0.015
 stern+0.015
ags+0.015
Their+0.015
Neg logit contributions
-0.012
light-0.012
 awoke-0.010
?'-0.010
pen-0.010
Token remembered
Token ID12086
Feature activation-2.71e-03

Pos logit contributions
light+0.005
 Tues+0.004
+0.004
?'+0.004
 masse+0.004
Neg logit contributions
ps-0.007
 stern-0.006
arl-0.006
ks-0.006
uzz-0.006
Token his
Token ID465
Feature activation+2.22e-02

Pos logit contributions
light+0.005
+0.004
 awoke+0.004
pen+0.004
 bal+0.004
Neg logit contributions
arl-0.004
 stern-0.004
ps-0.004
Their-0.003
uzz-0.003
Token mum
Token ID25682
Feature activation-8.40e-02

Pos logit contributions
arl+0.046
ps+0.041
 stern+0.041
ags+0.040
ses+0.038
Neg logit contributions
light-0.028
-0.027
 awoke-0.022
pen-0.022
 whenever-0.021
Token's
Token ID338
Feature activation-4.95e-02

Pos logit contributions
light+0.076
 Tues+0.067
+0.064
?'+0.061
 masse+0.060
Neg logit contributions
 stern-0.179
ps-0.172
arl-0.170
ks-0.162
 innocent-0.161
Token lesson
Token ID11483
Feature activation+7.19e-03

Pos logit contributions
+0.087
light+0.084
 awoke+0.074
?'+0.073
 whenever+0.072
Neg logit contributions
arl-0.080
ps-0.068
ags-0.065
 stern-0.063
agged-0.062
Token -
Token ID532
Feature activation-6.85e-03

Pos logit contributions
arl+0.013
ps+0.012
 stern+0.012
uzz+0.011
ags+0.011
Neg logit contributions
light-0.010
-0.010
 awoke-0.008
?'-0.008
pen-0.008
Token never
Token ID1239
Feature activation+6.42e-02

Pos logit contributions
light+0.009
 awoke+0.009
pen+0.008
+0.008
?'+0.008
Neg logit contributions
arl-0.012
 stern-0.011
ps-0.011
uzz-0.010
Their-0.010
Token disturb
Token ID17037
Feature activation-4.96e-03

Pos logit contributions
arl+0.099
 greedy+0.096
 stern+0.094
 mad+0.092
 angry+0.088
Neg logit contributions
light-0.103
-0.088
pen-0.088
 Tues-0.087
?'-0.086
Token stories
Token ID3923
Feature activation-1.80e-02

Pos logit contributions
+0.067
light+0.063
 whenever+0.053
 awoke+0.053
?'+0.053
Neg logit contributions
arl-0.098
ps-0.085
ags-0.083
agged-0.082
 stern-0.081
Token about
Token ID546
Feature activation+1.69e-04

Pos logit contributions
light+0.026
+0.025
pen+0.023
 awoke+0.022
 masse+0.021
Neg logit contributions
arl-0.033
ps-0.028
 stern-0.027
ags-0.027
uzz-0.026
Token hopping
Token ID47153
Feature activation-1.87e-02

Pos logit contributions
arl+0.000
ps+0.000
 stern+0.000
ags+0.000
orts+0.000
Neg logit contributions
-0.000
light-0.000
 awoke-0.000
pen-0.000
?'-0.000
Token little
Token ID1310
Feature activation+2.25e-04

Pos logit contributions
+0.026
light+0.024
?'+0.021
 awoke+0.021
 whenever+0.021
Neg logit contributions
arl-0.037
ps-0.033
ags-0.032
 stern-0.031
agged-0.031
Token b
Token ID275
Feature activation-2.47e-02

Pos logit contributions
 greedy+0.000
arl+0.000
ps+0.000
 stern+0.000
 innocent+0.000
Neg logit contributions
light-0.000
-0.000
 awoke-0.000
?'-0.000
 Tues-0.000
Tokenunn
Token ID20935
Feature activation-8.49e-02

Pos logit contributions
+0.038
light+0.038
 awoke+0.035
?'+0.032
 tucked+0.031
Neg logit contributions
arl-0.048
ps-0.040
orts-0.039
ags-0.038
agged-0.037
Tokenies
Token ID444
Feature activation-6.84e-02

Pos logit contributions
+0.144
light+0.140
 awoke+0.122
?'+0.120
 whenever+0.119
Neg logit contributions
arl-0.141
ps-0.120
ags-0.115
 stern-0.113
agged-0.110
Token who
Token ID508
Feature activation+2.42e-02

Pos logit contributions
light+0.100
+0.095
 awoke+0.087
pen+0.084
 masse+0.081
Neg logit contributions
arl-0.120
ps-0.106
 stern-0.100
ags-0.097
uzz-0.092
Token came
Token ID1625
Feature activation+2.99e-02

Pos logit contributions
arl+0.040
ps+0.038
 stern+0.036
ks+0.034
 greedy+0.034
Neg logit contributions
light-0.036
-0.034
 masse-0.030
 bal-0.030
 awoke-0.029
Token to
Token ID284
Feature activation-8.32e-03

Pos logit contributions
arl+0.058
ps+0.051
ags+0.048
 stern+0.048
orts+0.046
Neg logit contributions
-0.041
light-0.041
 awoke-0.035
pen-0.034
?'-0.033
Token visit
Token ID3187
Feature activation-3.52e-03

Pos logit contributions
+0.012
light+0.012
 awoke+0.010
pen+0.009
?'+0.009
Neg logit contributions
arl-0.016
ps-0.014
ags-0.013
 stern-0.013
Their-0.013