TOP ACTIVATIONS
MAX = 0.119

," Lily makes her doll say . " Hi
box with a sign that says " Do not open ".
," Tom makes his doll say . They pretend
truck with a sign that says " News ". They are
It has a label that says " mail ". They are
," she made the seal say . But then
she had heard her parents say that you must never ,
I can make the doll say hello !" L ila said
as Mia makes the robot say hello and dance and do
It has a sign that says " test ". They are
had heard his Mom my say something very special . She
suddenly he heard a voice say
It has a button that says " power ". L ila
," she made the bear say . She did
they see a sign that says " Do not go in
see her ?" a voice says . " Of
you doing ?" a voice says . They turn
, they heard their mom say that there was a festival
my friends ?" the robot says . " Yes
he heard a tiny voice say , " Hi , weak

MIN ACTIVATIONS
MIN = -0.119

the pony went off on even greater adventures . He was
to stay in . So even though the sc ooter was
being careful driving . So even though the prize was gone
story : " Together , even the smallest of us can
helper . On the way back home , Tim my asked
. He decided to go even further and explore the other
big before ! She could even sit inside it .
to climb again , reaching even higher than before . When
. From then on , even when he was very busy
and be able to help even more people . <|endoftext|> Once
read the book and learned even more about how volcan oes
back another day and spend even more time playing . And
home . On the way back , her mom my explained
On the way back , Ben and Lily saw
said , " Remember , even though something may be bitter
On the way back home , Lily thought about
On the way back home , Betsy could not
to stay out of trouble even when they were being nos
homework . On the way back to school , they saw
day on , they worked even harder for their kind king

INTERVAL 0.059 - 0.119
CONTAINS 1.244%

was very careful not to let the wind take it away
at the desk . He said , " Wow , this
! Happy birthday !" they shouted . L ily
" Mom my ," he said , " it 's so
got scared . The frog said , " Don 't worry

INTERVAL -0.000 - 0.059
CONTAINS 50.605%

but her mom reassured her that it was going to be
They felt miserable . They had wasted their toys and
and every time it rains , he looks for his box
with his new sp ade , his friend Johnny came over
?" Her mom my said , " Let 's take it

INTERVAL -0.060 - -0.000
CONTAINS 47.053%

she saw the bug . Jane wanted to make
. It looked down at Sara and Ben . It waved
time , there was a little girl named Lily . One
pretty . " Mom my , can we go
with a smile on his ch ubby little face . <|endoftext|>

INTERVAL -0.119 - -0.060
CONTAINS 1.098%

the wheels and ride the bike ." L ily
He made sure to check all the places carefully .
does not want to go back . She wants to have
felt bad and said , " I 'm sorry , Mom
. The pirate realized that even though things can be scary
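
The four INTERVAL blocks above report what share of token activations falls in each band between MIN and MAX (the four CONTAINS values sum to 100%). The sketch below shows one way such shares could be computed. It assumes equal-width bins over [-MAX, MAX], which the listed boundaries suggest but the dump does not state. The function name `interval_shares` and its parameters are hypothetical and not taken from the tool that produced this output.

```python
from typing import List, Sequence, Tuple


def interval_shares(
    activations: Sequence[float], max_abs: float, n_bins: int = 4
) -> List[Tuple[float, float, float]]:
    """Return (low, high, percent) per equal-width bin over [-max_abs, max_abs].

    Assumption: bins are equal width and the top edge is inclusive, so a value
    equal to max_abs is counted in the last bin. Bins are returned from lowest
    to highest, the reverse of the order shown in the dump.
    """
    if max_abs <= 0 or n_bins <= 0:
        raise ValueError("max_abs and n_bins must be positive")

    width = 2.0 * max_abs / n_bins
    counts = [0] * n_bins
    for value in activations:
        # Shift into [0, 2 * max_abs], then find the bin index.
        index = int((value + max_abs) / width)
        # Clamp so out-of-range values and the top edge land in a valid bin.
        index = min(max(index, 0), n_bins - 1)
        counts[index] += 1

    total = len(activations)
    shares = []
    for i, count in enumerate(counts):
        low = -max_abs + i * width
        high = low + width
        percent = 100.0 * count / total if total else 0.0
        shares.append((low, high, percent))
    return shares


if __name__ == "__main__":
    # Toy values only; these are not activations from the dump above.
    demo = [-1.0, -0.75, -0.25, 0.25, 0.25, 0.75, 1.0, 0.1]
    for low, high, percent in interval_shares(demo, max_abs=1.0):
        print(f"INTERVAL {low:.3f} - {high:.3f}  CONTAINS {percent:.3f}%")
```

Applied to the real activations with `max_abs=0.119`, a computation of this shape would give bands of width 0.0595, consistent with the 0.059 and 0.060 boundaries printed above after rounding.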
Token,"
Token ID553
Feature activation+2.24e-02

Pos logit contributions
 may+0.079
 seem+0.061
ches+0.057
hell+0.053
ves+0.053
Neg logit contributions
Un-0.100
 dads-0.087
 former-0.086
rous-0.086
har-0.086
Token Lily
Token ID20037
Feature activation+1.72e-03

Pos logit contributions
 may+0.037
 seem+0.030
ves+0.028
ches+0.026
otion+0.025
Neg logit contributions
Un-0.036
burn-0.033
ind-0.033
 sunscreen-0.033
 yourselves-0.033
Token makes
Token ID1838
Feature activation+5.97e-02

Pos logit contributions
 may+0.003
 seem+0.003
ves+0.002
ches+0.002
'd+0.002
Neg logit contributions
Un-0.003
 yourselves-0.002
 sunscreen-0.002
S-0.002
ind-0.002
Token her
Token ID607
Feature activation-8.64e-03

Pos logit contributions
 may+0.132
 seem+0.112
ves+0.108
ches+0.104
 leap+0.100
Neg logit contributions
Un-0.063
 yourselves-0.055
 dads-0.054
 sunscreen-0.054
S-0.054
Token doll
Token ID3654
Feature activation-2.01e-02

Pos logit contributions
Un+0.012
 yourselves+0.010
ind+0.010
S+0.010
 late+0.010
Neg logit contributions
 may-0.017
 seem-0.014
ves-0.013
ches-0.013
 leap-0.013
Token say
Token ID910
Feature activation+1.19e-01

Pos logit contributions
Un+0.028
 sunscreen+0.025
 dads+0.025
 yourselves+0.025
S+0.025
Neg logit contributions
 may-0.037
 seem-0.030
ves-0.029
ches-0.028
'd-0.027
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
 may+0.248
 seem+0.206
ves+0.193
ches+0.192
 leap+0.191
Neg logit contributions
Un-0.132
 each-0.121
 late-0.119
S-0.119
 dads-0.116
Token
Token ID198
Feature activation+1.16e-03

Pos logit contributions
 may+0.030
 seem+0.025
ves+0.023
ches+0.022
otion+0.021
Neg logit contributions
Un-0.026
ind-0.025
 least-0.025
 sunscreen-0.024
 yourselves-0.024
Token
Token ID198
Feature activation-1.99e-03

Pos logit contributions
 may+0.002
 seem+0.002
ves+0.002
ches+0.002
otion+0.002
Neg logit contributions
Un-0.002
 least-0.001
 sunscreen-0.001
 yourselves-0.001
 cher-0.001
Token"
Token ID1
Feature activation-5.35e-02

Pos logit contributions
 least+0.003
elt+0.003
 cher+0.003
cream+0.003
burn+0.003
Neg logit contributions
 may-0.003
ves-0.003
 seem-0.002
ches-0.002
fters-0.002
TokenHi
Token ID17250
Feature activation-4.52e-03

Pos logit contributions
 least+0.066
 home+0.065
ind+0.064
 former+0.064
 property+0.063
Neg logit contributions
 may-0.105
 seem-0.091
ves-0.088
'd-0.083
 opener-0.078
Token box
Token ID3091
Feature activation+3.54e-02

Pos logit contributions
 may+0.031
 seem+0.024
ves+0.022
 leap+0.022
ches+0.021
Neg logit contributions
Un-0.026
 yourselves-0.024
S-0.024
kind-0.024
 late-0.024
Token with
Token ID351
Feature activation+4.43e-02

Pos logit contributions
 may+0.087
 seem+0.074
ches+0.070
ves+0.069
izards+0.067
Neg logit contributions
Un-0.028
 each-0.028
 late-0.028
 next-0.027
Mom-0.026
Token a
Token ID257
Feature activation+3.00e-02

Pos logit contributions
 may+0.093
 seem+0.078
ves+0.075
 must+0.071
hell+0.070
Neg logit contributions
Un-0.053
burn-0.047
 least-0.047
 yourselves-0.044
thank-0.044
Token sign
Token ID1051
Feature activation+3.47e-02

Pos logit contributions
 may+0.054
 seem+0.046
ves+0.043
ches+0.043
otion+0.040
Neg logit contributions
Un-0.042
 cher-0.039
 yourselves-0.038
 sunscreen-0.038
S-0.038
Token that
Token ID326
Feature activation+6.78e-02

Pos logit contributions
 may+0.071
 seem+0.058
ves+0.058
ches+0.054
otion+0.053
Neg logit contributions
Un-0.042
 yourselves-0.039
 least-0.038
 sunscreen-0.038
 former-0.037
Token says
Token ID1139
Feature activation+1.18e-01

Pos logit contributions
 may+0.124
 seem+0.101
ves+0.095
ches+0.090
'd+0.088
Neg logit contributions
Un-0.099
 yourselves-0.091
ind-0.090
 sunscreen-0.090
S-0.089
Token "
Token ID366
Feature activation-3.68e-02

Pos logit contributions
 may+0.213
ches+0.181
 seem+0.176
 leap+0.170
ologists+0.167
Neg logit contributions
Un-0.151
 cher-0.141
 each-0.140
S-0.138
 dads-0.136
TokenDo
Token ID5211
Feature activation+4.69e-02

Pos logit contributions
 sunscreen+0.049
 least+0.049
 dads+0.048
 yourselves+0.048
 former+0.045
Neg logit contributions
 may-0.065
 seem-0.059
ves-0.055
'd-0.052
ches-0.052
Token not
Token ID407
Feature activation-1.66e-02

Pos logit contributions
 may+0.097
 trigger+0.082
ches+0.076
'd+0.076
 leap+0.076
Neg logit contributions
Un-0.054
 each-0.048
S-0.047
 yourselves-0.047
 dads-0.047
Token open
Token ID1280
Feature activation-1.01e-02

Pos logit contributions
Un+0.028
thank+0.027
 sunscreen+0.027
Thank+0.027
burn+0.027
Neg logit contributions
 may-0.025
 seem-0.023
 leap-0.019
 must-0.018
 opener-0.017
Token".
Token ID1911
Feature activation+1.83e-02

Pos logit contributions
Un+0.016
Thank+0.015
 dads+0.014
S+0.014
 sunscreen+0.014
Neg logit contributions
 may-0.017
 seem-0.013
ves-0.013
ches-0.012
 must-0.012
Token,"
Token ID553
Feature activation+2.81e-02

Pos logit contributions
 may+0.014
 seem+0.011
ches+0.011
ves+0.010
hell+0.010
Neg logit contributions
Un-0.020
 former-0.018
har-0.018
 least-0.018
 dads-0.018
Token Tom
Token ID4186
Feature activation+1.21e-02

Pos logit contributions
 may+0.045
 seem+0.036
ves+0.033
ches+0.031
 leap+0.030
Neg logit contributions
Un-0.047
 yourselves-0.044
ind-0.043
 dads-0.043
 sunscreen-0.043
Token makes
Token ID1838
Feature activation+6.00e-02

Pos logit contributions
 may+0.021
 seem+0.017
ves+0.017
ches+0.015
otion+0.015
Neg logit contributions
Un-0.018
 dads-0.017
 least-0.017
 yourselves-0.017
 sunscreen-0.017
Token his
Token ID465
Feature activation-6.66e-03

Pos logit contributions
 may+0.128
 seem+0.109
ves+0.104
ches+0.101
 leap+0.097
Neg logit contributions
Un-0.068
 yourselves-0.059
Thank-0.058
 least-0.058
 dads-0.058
Token doll
Token ID3654
Feature activation-2.14e-02

Pos logit contributions
Un+0.009
 yourselves+0.008
 late+0.008
S+0.008
Thank+0.008
Neg logit contributions
 may-0.013
 seem-0.011
ves-0.010
ches-0.010
 leap-0.010
Token say
Token ID910
Feature activation+1.18e-01

Pos logit contributions
Un+0.030
 yourselves+0.028
 sunscreen+0.027
 dads+0.027
S+0.027
Neg logit contributions
 may-0.040
 seem-0.032
ves-0.031
ches-0.029
'd-0.028
Token.
Token ID13
Feature activation+2.17e-02

Pos logit contributions
 may+0.250
 seem+0.206
ves+0.194
ches+0.191
 leap+0.190
Neg logit contributions
Un-0.132
 late-0.121
 each-0.120
S-0.119
 dads-0.117
Token
Token ID198
Feature activation+4.58e-03

Pos logit contributions
 may+0.041
 seem+0.033
ves+0.032
ches+0.030
'd+0.029
Neg logit contributions
Un-0.030
 yourselves-0.028
 sunscreen-0.028
 dads-0.028
 least-0.027
Token
Token ID198
Feature activation+3.38e-04

Pos logit contributions
 may+0.010
 seem+0.008
ves+0.008
ches+0.007
'd+0.007
Neg logit contributions
Un-0.006
 yourselves-0.005
S-0.005
 late-0.005
 dads-0.005
TokenThey
Token ID2990
Feature activation+1.56e-02

Pos logit contributions
 may+0.001
ves+0.001
 seem+0.000
ches+0.000
'd+0.000
Neg logit contributions
 least-0.000
 cher-0.000
elt-0.000
 sunscreen-0.000
burn-0.000
Token pretend
Token ID16614
Feature activation-6.28e-03

Pos logit contributions
 may+0.027
ves+0.022
 seem+0.022
ches+0.020
otion+0.020
Neg logit contributions
Un-0.022
 sunscreen-0.021
 dads-0.021
 each-0.021
 least-0.021
Token truck
Token ID7779
Feature activation+4.75e-04

Pos logit contributions
 may+0.033
 seem+0.026
ves+0.024
 leap+0.024
ches+0.023
Neg logit contributions
Un-0.027
 yourselves-0.025
S-0.024
kind-0.024
thank-0.024
Token with
Token ID351
Feature activation+4.44e-02

Pos logit contributions
 may+0.001
 seem+0.001
ves+0.001
ches+0.001
 leap+0.001
Neg logit contributions
Un-0.001
 late-0.000
Thank-0.000
 yourselves-0.000
 each-0.000
Token a
Token ID257
Feature activation+3.16e-02

Pos logit contributions
 may+0.084
 seem+0.071
ves+0.069
 must+0.068
 trigger+0.065
Neg logit contributions
 least-0.055
Un-0.053
har-0.052
 former-0.052
ety-0.051
Token sign
Token ID1051
Feature activation+3.47e-02

Pos logit contributions
 may+0.061
 seem+0.050
ves+0.048
ches+0.046
'd+0.044
Neg logit contributions
Un-0.042
 yourselves-0.039
 dads-0.038
 sunscreen-0.038
S-0.038
Token that
Token ID326
Feature activation+6.70e-02

Pos logit contributions
 may+0.062
ves+0.050
 seem+0.049
otion+0.046
 must+0.046
Neg logit contributions
Un-0.051
 former-0.048
 least-0.048
 yourselves-0.047
 sunscreen-0.047
Token says
Token ID1139
Feature activation+1.18e-01

Pos logit contributions
 may+0.115
 seem+0.090
ves+0.084
ches+0.079
'd+0.077
Neg logit contributions
Un-0.108
ind-0.100
 yourselves-0.100
S-0.100
 sunscreen-0.099
Token "
Token ID366
Feature activation-3.63e-02

Pos logit contributions
 may+0.208
ches+0.176
 seem+0.172
 leap+0.165
iggle+0.160
Neg logit contributions
Un-0.161
 each-0.144
 cher-0.143
 dads-0.141
S-0.140
TokenNews
Token ID9980
Feature activation+4.43e-02

Pos logit contributions
 sunscreen+0.049
 yourselves+0.049
 dads+0.048
 least+0.048
 cher+0.046
Neg logit contributions
 may-0.064
 seem-0.058
ves-0.053
ches-0.051
'd-0.050
Token".
Token ID1911
Feature activation+1.77e-02

Pos logit contributions
 may+0.073
 seem+0.058
ves+0.055
ches+0.052
'd+0.049
Neg logit contributions
Un-0.072
 sunscreen-0.067
 yourselves-0.066
 dads-0.066
S-0.066
Token They
Token ID1119
Feature activation+2.67e-02

Pos logit contributions
 may+0.038
 seem+0.032
ves+0.031
'd+0.030
ches+0.029
Neg logit contributions
Un-0.019
S-0.018
 late-0.018
 sunscreen-0.018
 each-0.018
Token are
Token ID389
Feature activation+1.09e-02

Pos logit contributions
 may+0.048
 seem+0.038
ves+0.036
ches+0.034
'd+0.034
Neg logit contributions
Un-0.040
 yourselves-0.037
 sunscreen-0.036
S-0.036
 late-0.036
Token It
Token ID632
Feature activation+3.12e-02

Pos logit contributions
 may+0.011
ves+0.010
 seem+0.009
hell+0.009
ides+0.009
Neg logit contributions
Un-0.006
 late-0.005
 each-0.005
 Dad-0.005
S-0.005
Token has
Token ID468
Feature activation-9.47e-04

Pos logit contributions
ides+0.051
 may+0.050
 trigger+0.050
fters+0.049
hell+0.049
Neg logit contributions
 each-0.038
 then-0.035
 least-0.035
 cher-0.034
Un-0.033
Token a
Token ID257
Feature activation+2.47e-02

Pos logit contributions
Un+0.001
 least+0.001
 dads+0.001
kind+0.001
burn+0.001
Neg logit contributions
 may-0.002
 seem-0.002
ves-0.002
 must-0.002
'd-0.002
Token label
Token ID6167
Feature activation+5.07e-02

Pos logit contributions
 may+0.045
 seem+0.038
ves+0.035
ches+0.034
otion+0.033
Neg logit contributions
Un-0.035
 yourselves-0.031
 cher-0.031
 sunscreen-0.031
 dads-0.031
Token that
Token ID326
Feature activation+7.50e-02

Pos logit contributions
 may+0.113
 seem+0.095
ves+0.092
ches+0.088
'd+0.086
Neg logit contributions
Un-0.053
 yourselves-0.047
 dads-0.046
 sunscreen-0.046
S-0.045
Token says
Token ID1139
Feature activation+1.17e-01

Pos logit contributions
 may+0.126
 seem+0.101
ves+0.096
ches+0.092
'd+0.088
Neg logit contributions
Un-0.116
 yourselves-0.108
 dads-0.106
 sunscreen-0.106
 cher-0.105
Token "
Token ID366
Feature activation-2.95e-02

Pos logit contributions
 may+0.204
 seem+0.169
ches+0.165
 leap+0.162
ologists+0.157
Neg logit contributions
Un-0.157
 cher-0.148
 each-0.146
S-0.144
 yourselves-0.143
Tokenmail
Token ID4529
Feature activation+6.06e-02

Pos logit contributions
 dads+0.039
 sunscreen+0.038
 least+0.038
 yourselves+0.037
Un+0.036
Neg logit contributions
 may-0.055
 seem-0.049
ves-0.045
ches-0.043
'd-0.042
Token".
Token ID1911
Feature activation+2.18e-02

Pos logit contributions
 may+0.115
 seem+0.097
ves+0.089
 leap+0.088
ologists+0.085
Neg logit contributions
Un-0.076
 yourselves-0.070
ind-0.068
 each-0.068
 late-0.068
Token They
Token ID1119
Feature activation+2.68e-02

Pos logit contributions
 may+0.045
 seem+0.037
'd+0.036
ves+0.036
hell+0.035
Neg logit contributions
Un-0.025
 each-0.024
S-0.024
 late-0.024
 yourselves-0.023
Token are
Token ID389
Feature activation+7.98e-03

Pos logit contributions
 may+0.051
ves+0.044
 seem+0.041
ches+0.041
otion+0.039
Neg logit contributions
Un-0.033
 each-0.032
 dads-0.032
 least-0.031
 sunscreen-0.031
Token,"
Token ID553
Feature activation+2.64e-02

Pos logit contributions
 may+0.018
 seem+0.014
hell+0.013
 must+0.012
ves+0.012
Neg logit contributions
Un-0.028
kind-0.026
 former-0.025
har-0.025
Thank-0.025
Token she
Token ID673
Feature activation+2.67e-02

Pos logit contributions
 may+0.050
 seem+0.042
ves+0.039
ches+0.036
otion+0.036
Neg logit contributions
Un-0.037
ind-0.034
 yourselves-0.034
kind-0.033
burn-0.033
Token made
Token ID925
Feature activation+6.03e-02

Pos logit contributions
 may+0.058
 seem+0.048
ves+0.046
otion+0.044
ches+0.044
Neg logit contributions
Un-0.030
S-0.027
 dads-0.027
Thank-0.026
 late-0.026
Token the
Token ID262
Feature activation+1.69e-02

Pos logit contributions
 may+0.140
 seem+0.120
ves+0.115
ches+0.111
'd+0.109
Neg logit contributions
Un-0.057
 yourselves-0.050
 sunscreen-0.049
 dads-0.049
S-0.048
Token seal
Token ID13810
Feature activation+8.11e-03

Pos logit contributions
 may+0.031
 seem+0.026
ves+0.024
ches+0.023
'd+0.023
Neg logit contributions
Un-0.024
 sunscreen-0.023
 dads-0.023
 yourselves-0.022
 least-0.022
Token say
Token ID910
Feature activation+1.17e-01

Pos logit contributions
 may+0.015
 seem+0.012
ves+0.012
ches+0.011
'd+0.011
Neg logit contributions
Un-0.011
 dads-0.010
 cher-0.010
S-0.010
 late-0.010
Token.
Token ID13
Feature activation+2.13e-02

Pos logit contributions
 may+0.267
 seem+0.219
ches+0.205
ves+0.203
 leap+0.202
Neg logit contributions
Un-0.110
 each-0.109
 late-0.105
S-0.102
 responsible-0.099
Token
Token ID198
Feature activation+8.48e-03

Pos logit contributions
 may+0.041
 seem+0.033
ves+0.032
ches+0.030
otion+0.029
Neg logit contributions
Un-0.029
 yourselves-0.027
ind-0.026
 least-0.026
 sunscreen-0.026
Token
Token ID198
Feature activation+4.90e-03

Pos logit contributions
 may+0.016
 seem+0.013
ves+0.013
ches+0.012
otion+0.011
Neg logit contributions
Un-0.011
 yourselves-0.010
 least-0.010
 sunscreen-0.010
 dads-0.010
TokenBut
Token ID1537
Feature activation+2.16e-03

Pos logit contributions
 may+0.009
ves+0.008
 seem+0.008
ches+0.007
fters+0.007
Neg logit contributions
 least-0.006
elt-0.006
burn-0.006
 former-0.006
 sunscreen-0.006
Token then
Token ID788
Feature activation-1.73e-02

Pos logit contributions
 may+0.004
 seem+0.003
ves+0.003
 must+0.003
ches+0.003
Neg logit contributions
Un-0.003
ind-0.003
kind-0.003
view-0.003
elt-0.003
Token she
Token ID673
Feature activation+1.47e-02

Pos logit contributions
Un+0.056
kind+0.052
ind+0.051
thank+0.051
burn+0.050
Neg logit contributions
 may-0.077
 seem-0.064
ves-0.057
 must-0.057
otion-0.055
Token had
Token ID550
Feature activation-2.47e-02

Pos logit contributions
 may+0.024
 seem+0.020
ves+0.020
otion+0.019
ches+0.018
Neg logit contributions
Un-0.022
 least-0.021
 each-0.021
 late-0.020
Thank-0.020
Token heard
Token ID2982
Feature activation+4.56e-02

Pos logit contributions
Un+0.033
ind+0.031
 dads+0.030
kind+0.029
burn+0.029
Neg logit contributions
 may-0.049
 seem-0.040
ves-0.039
'd-0.037
 must-0.037
Token her
Token ID607
Feature activation-4.93e-03

Pos logit contributions
 may+0.089
 seem+0.073
ves+0.070
ches+0.067
 must+0.066
Neg logit contributions
Un-0.061
 dads-0.057
 yourselves-0.055
 sunscreen-0.054
ind-0.054
Token parents
Token ID3397
Feature activation-1.94e-02

Pos logit contributions
Un+0.007
 yourselves+0.006
S+0.006
Thank+0.006
P+0.006
Neg logit contributions
 may-0.010
 seem-0.008
ves-0.008
ches-0.007
 story-0.007
Token say
Token ID910
Feature activation+1.17e-01

Pos logit contributions
Un+0.025
 least+0.023
 late+0.022
 yourselves+0.022
 sunscreen+0.022
Neg logit contributions
 may-0.038
ves-0.030
 seem-0.030
ches-0.029
'd-0.027
Token that
Token ID326
Feature activation+3.03e-02

Pos logit contributions
 may+0.223
 seem+0.193
ches+0.184
ologists+0.178
ves+0.178
Neg logit contributions
Un-0.131
 each-0.131
 cher-0.122
 responsible-0.121
 late-0.119
Token you
Token ID345
Feature activation+3.60e-02

Pos logit contributions
 may+0.058
 seem+0.048
ves+0.045
ches+0.043
'd+0.042
Neg logit contributions
Un-0.041
 yourselves-0.038
 dads-0.037
 sunscreen-0.037
S-0.037
Token must
Token ID1276
Feature activation+1.92e-02

Pos logit contributions
 may+0.052
 seem+0.040
ves+0.039
ches+0.036
otion+0.035
Neg logit contributions
Un-0.062
 least-0.058
 yourselves-0.058
 sunscreen-0.058
 dads-0.058
Token never
Token ID1239
Feature activation-4.43e-02

Pos logit contributions
 may+0.035
 seem+0.028
ves+0.027
ches+0.026
otion+0.025
Neg logit contributions
Un-0.027
 yourselves-0.026
 sunscreen-0.025
 least-0.025
S-0.025
Token,
Token ID11
Feature activation+4.97e-03

Pos logit contributions
Un+0.066
 least+0.061
 yourselves+0.060
 dads+0.060
S+0.060
Neg logit contributions
 may-0.078
 seem-0.062
ves-0.061
ches-0.058
otion-0.055
Token I
Token ID314
Feature activation-1.42e-02

Pos logit contributions
 may+0.033
 seem+0.027
ves+0.026
ches+0.024
'd+0.024
Neg logit contributions
Un-0.022
 yourselves-0.020
ind-0.020
 dads-0.020
S-0.020
Token can
Token ID460
Feature activation-7.96e-04

Pos logit contributions
Un+0.024
 yourselves+0.022
 sunscreen+0.022
 dads+0.022
S+0.022
Neg logit contributions
 may-0.022
 seem-0.018
ves-0.017
ches-0.016
'd-0.015
Token make
Token ID787
Feature activation+4.45e-02

Pos logit contributions
Un+0.001
 least+0.001
 each+0.001
 yourselves+0.001
 sunscreen+0.001
Neg logit contributions
 may-0.001
ves-0.001
 seem-0.001
ches-0.001
hell-0.001
Token the
Token ID262
Feature activation+2.16e-02

Pos logit contributions
 may+0.099
 seem+0.085
ves+0.082
ches+0.079
 leap+0.077
Neg logit contributions
Un-0.047
Thank-0.041
thank-0.041
 dads-0.041
S-0.040
Token doll
Token ID3654
Feature activation-1.28e-02

Pos logit contributions
 may+0.037
 seem+0.028
ves+0.026
 must+0.025
 leap+0.025
Neg logit contributions
Un-0.036
 dads-0.032
S-0.032
elt-0.032
 yourselves-0.031
Token say
Token ID910
Feature activation+1.16e-01

Pos logit contributions
Un+0.022
 yourselves+0.020
 dads+0.020
 sunscreen+0.020
S+0.020
Neg logit contributions
 may-0.021
 seem-0.017
ves-0.015
ches-0.015
'd-0.014
Token hello
Token ID23748
Feature activation+3.54e-02

Pos logit contributions
 may+0.198
 seem+0.160
ves+0.153
ches+0.152
 opener+0.144
Neg logit contributions
Un-0.165
 yourselves-0.159
 cher-0.156
S-0.153
 late-0.152
Token!"
Token ID2474
Feature activation+4.09e-02

Pos logit contributions
 may+0.050
 seem+0.042
ches+0.035
hell+0.035
 must+0.035
Neg logit contributions
Un-0.064
 sunscreen-0.061
 former-0.059
P-0.059
Thank-0.059
Token L
Token ID406
Feature activation+1.36e-02

Pos logit contributions
 may+0.085
 seem+0.070
ves+0.067
ches+0.065
'd+0.064
Neg logit contributions
Un-0.049
S-0.045
 yourselves-0.044
 dads-0.044
 sunscreen-0.044
Tokenila
Token ID10102
Feature activation+2.63e-02

Pos logit contributions
 may+0.030
 seem+0.024
ches+0.023
 trigger+0.022
'd+0.022
Neg logit contributions
S-0.016
 dads-0.014
 yourselves-0.014
P-0.014
Un-0.014
Token said
Token ID531
Feature activation+9.41e-02

Pos logit contributions
 may+0.047
 seem+0.039
ves+0.036
ches+0.036
'd+0.034
Neg logit contributions
Un-0.039
 yourselves-0.037
 sunscreen-0.036
ind-0.036
 dads-0.035
Token as
Token ID355
Feature activation+3.95e-03

Pos logit contributions
 may+0.133
 seem+0.107
ves+0.104
'd+0.103
ches+0.099
Neg logit contributions
Un-0.067
 each-0.064
S-0.062
 yourselves-0.061
 late-0.060
Token Mia
Token ID32189
Feature activation+1.01e-02

Pos logit contributions
 may+0.008
 seem+0.006
ves+0.006
ches+0.006
'd+0.006
Neg logit contributions
Un-0.005
 late-0.005
 yourselves-0.005
 each-0.004
S-0.004
Token makes
Token ID1838
Feature activation+5.08e-02

Pos logit contributions
 may+0.017
 seem+0.014
ves+0.013
ches+0.012
'd+0.012
Neg logit contributions
Un-0.016
 yourselves-0.014
 dads-0.014
 least-0.014
 sunscreen-0.014
Token the
Token ID262
Feature activation+2.18e-02

Pos logit contributions
 may+0.113
 seem+0.094
ves+0.090
ches+0.088
'd+0.087
Neg logit contributions
Un-0.053
 yourselves-0.048
 sunscreen-0.047
 dads-0.046
 late-0.046
Token robot
Token ID9379
Feature activation-1.67e-02

Pos logit contributions
 may+0.040
 seem+0.033
ves+0.031
ches+0.030
'd+0.029
Neg logit contributions
Un-0.032
 yourselves-0.029
 dads-0.029
 sunscreen-0.029
S-0.028
Token say
Token ID910
Feature activation+1.16e-01

Pos logit contributions
Un+0.025
 yourselves+0.023
 sunscreen+0.023
 dads+0.023
S+0.023
Neg logit contributions
 may-0.029
 seem-0.023
ves-0.022
ches-0.021
'd-0.020
Token hello
Token ID23748
Feature activation+3.17e-02

Pos logit contributions
 may+0.226
 seem+0.187
ches+0.180
ves+0.176
 leap+0.175
Neg logit contributions
Un-0.141
 each-0.132
 yourselves-0.127
 late-0.127
 dads-0.125
Token and
Token ID290
Feature activation-2.70e-03

Pos logit contributions
 may+0.052
 seem+0.044
ches+0.037
ves+0.037
'd+0.036
Neg logit contributions
Un-0.049
 sunscreen-0.049
 yourselves-0.048
P-0.047
rous-0.047
Token dance
Token ID9280
Feature activation-1.24e-02

Pos logit contributions
ind+0.005
Un+0.005
P+0.005
S+0.005
elt+0.005
Neg logit contributions
 may-0.004
 seem-0.003
 leap-0.003
 roar-0.003
 must-0.003
Token and
Token ID290
Feature activation-4.59e-03

Pos logit contributions
Un+0.018
 yourselves+0.016
 least+0.016
 sunscreen+0.016
 dads+0.016
Neg logit contributions
 may-0.023
 seem-0.019
ves-0.018
ches-0.017
'd-0.016
Token do
Token ID466
Feature activation+3.64e-02

Pos logit contributions
ind+0.009
Un+0.008
S+0.008
P+0.008
elt+0.008
Neg logit contributions
 may-0.007
 seem-0.006
 leap-0.005
 roar-0.005
ves-0.004
Token It
Token ID632
Feature activation+3.31e-02

Pos logit contributions
 may+0.008
 seem+0.006
ves+0.006
ches+0.006
'd+0.005
Neg logit contributions
Un-0.009
 yourselves-0.008
 sunscreen-0.008
 dads-0.008
S-0.008
Token has
Token ID468
Feature activation+6.78e-05

Pos logit contributions
ides+0.056
 trigger+0.056
 may+0.055
fters+0.055
izards+0.054
Neg logit contributions
 each-0.038
 then-0.035
 cher-0.034
Mom-0.033
 next-0.033
Token a
Token ID257
Feature activation+2.34e-02

Pos logit contributions
 may+0.000
 seem+0.000
ves+0.000
 must+0.000
'd+0.000
Neg logit contributions
Un-0.000
 least-0.000
burn-0.000
 dads-0.000
ind-0.000
Token sign
Token ID1051
Feature activation+2.85e-02

Pos logit contributions
 may+0.043
 seem+0.035
ves+0.033
ches+0.032
otion+0.031
Neg logit contributions
Un-0.033
 yourselves-0.030
 cher-0.030
 sunscreen-0.030
 dads-0.030
Token that
Token ID326
Feature activation+6.60e-02

Pos logit contributions
 may+0.059
 seem+0.048
ves+0.048
ches+0.045
 must+0.044
Neg logit contributions
Un-0.035
 least-0.032
 former-0.032
 yourselves-0.031
 sunscreen-0.031
Token says
Token ID1139
Feature activation+1.16e-01

Pos logit contributions
 may+0.107
 seem+0.084
ves+0.078
ches+0.074
'd+0.071
Neg logit contributions
Un-0.110
 yourselves-0.102
S-0.101
 sunscreen-0.101
 dads-0.100
Token "
Token ID366
Feature activation-3.58e-02

Pos logit contributions
 may+0.204
ches+0.170
 seem+0.167
 leap+0.160
ologists+0.160
Neg logit contributions
Un-0.153
 each-0.142
 cher-0.141
 yourselves-0.141
S-0.140
Tokentest
Token ID9288
Feature activation+1.27e-02

Pos logit contributions
 sunscreen+0.049
 dads+0.048
 least+0.048
 yourselves+0.047
 former+0.045
Neg logit contributions
 may-0.062
 seem-0.058
ves-0.052
'd-0.049
 opener-0.048
Token".
Token ID1911
Feature activation+1.93e-02

Pos logit contributions
 may+0.022
 seem+0.018
ves+0.017
ches+0.016
'd+0.016
Neg logit contributions
Un-0.019
 yourselves-0.018
 dads-0.017
 sunscreen-0.017
S-0.017
Token They
Token ID1119
Feature activation+2.51e-02

Pos logit contributions
 may+0.036
 seem+0.031
ves+0.029
ches+0.026
otion+0.026
Neg logit contributions
Un-0.027
ind-0.025
 least-0.025
kind-0.024
 yourselves-0.024
Token are
Token ID389
Feature activation+8.18e-03

Pos logit contributions
 may+0.046
ves+0.039
 seem+0.037
ches+0.037
otion+0.035
Neg logit contributions
Un-0.033
 dads-0.032
 each-0.031
 sunscreen-0.031
 least-0.031
Token had
Token ID550
Feature activation-2.20e-02

Pos logit contributions
 may+0.025
 seem+0.019
'd+0.018
ves+0.017
ches+0.017
Neg logit contributions
Un-0.029
 yourselves-0.027
S-0.027
 cher-0.027
 dads-0.027
Token heard
Token ID2982
Feature activation+3.81e-02

Pos logit contributions
Un+0.031
ind+0.029
kind+0.029
 dads+0.028
 yourselves+0.028
Neg logit contributions
 may-0.041
'd-0.034
 seem-0.033
 must-0.031
ves-0.031
Token his
Token ID465
Feature activation-6.66e-03

Pos logit contributions
 may+0.067
 seem+0.055
ves+0.052
ches+0.048
 must+0.047
Neg logit contributions
Un-0.057
 least-0.053
 dads-0.053
 yourselves-0.053
Thank-0.052
Token Mom
Token ID11254
Feature activation-1.20e-02

Pos logit contributions
 yourselves+0.009
Un+0.009
S+0.008
Thank+0.008
thank+0.008
Neg logit contributions
 may-0.013
 story-0.011
 seem-0.011
ves-0.010
 must-0.010
Tokenmy
Token ID1820
Feature activation-7.19e-04

Pos logit contributions
Un+0.014
 least+0.012
ind+0.012
 sunscreen+0.012
 dads+0.012
Neg logit contributions
 may-0.025
 seem-0.020
ves-0.020
ches-0.019
otion-0.018
Token say
Token ID910
Feature activation+1.16e-01

Pos logit contributions
Un+0.001
 yourselves+0.001
 dads+0.001
 sunscreen+0.001
S+0.001
Neg logit contributions
 may-0.001
 seem-0.001
ves-0.001
ches-0.001
'd-0.001
Token something
Token ID1223
Feature activation+4.25e-02

Pos logit contributions
 may+0.208
 seem+0.171
ches+0.166
ves+0.162
 leap+0.156
Neg logit contributions
Un-0.158
 each-0.145
 cher-0.140
 dads-0.139
 yourselves-0.138
Token very
Token ID845
Feature activation+7.13e-03

Pos logit contributions
 may+0.089
 seem+0.073
ves+0.072
ches+0.069
otion+0.068
Neg logit contributions
Un-0.048
 cher-0.042
 each-0.042
 late-0.042
 sunscreen-0.041
Token special
Token ID2041
Feature activation+7.66e-03

Pos logit contributions
 may+0.015
 seem+0.012
ves+0.012
ches+0.011
'd+0.011
Neg logit contributions
Un-0.009
 yourselves-0.008
 sunscreen-0.008
 dads-0.008
 least-0.008
Token.
Token ID13
Feature activation+9.83e-04

Pos logit contributions
 may+0.015
 seem+0.013
ves+0.012
ches+0.012
 leap+0.012
Neg logit contributions
Un-0.009
 each-0.008
 least-0.008
 cher-0.008
 yourselves-0.008
Token She
Token ID1375
Feature activation+2.44e-04

Pos logit contributions
 may+0.001
 seem+0.001
'd+0.001
 must+0.001
ves+0.001
Neg logit contributions
 least-0.002
ind-0.002
burn-0.002
Un-0.002
P-0.002
Token suddenly
Token ID6451
Feature activation-9.89e-03

Pos logit contributions
 may+0.050
 seem+0.038
ves+0.036
ches+0.035
hell+0.032
Neg logit contributions
Un-0.066
 late-0.062
 yourselves-0.061
 sunscreen-0.060
 least-0.060
Token he
Token ID339
Feature activation+6.96e-03

Pos logit contributions
Un+0.012
 late+0.011
 each+0.011
 cher+0.011
S+0.010
Neg logit contributions
 may-0.019
 seem-0.016
ves-0.015
otion-0.015
ches-0.015
Token heard
Token ID2982
Feature activation+3.53e-02

Pos logit contributions
 may+0.013
 seem+0.011
otion+0.011
ves+0.010
 opener+0.010
Neg logit contributions
Un-0.009
S-0.008
 least-0.008
 late-0.008
P-0.008
Token a
Token ID257
Feature activation+1.09e-02

Pos logit contributions
 may+0.089
 seem+0.076
ves+0.074
ches+0.072
'd+0.070
Neg logit contributions
Un-0.027
 yourselves-0.023
 sunscreen-0.022
 late-0.022
S-0.022
Token voice
Token ID3809
Feature activation+5.29e-02

Pos logit contributions
 may+0.022
ves+0.019
ches+0.018
izards+0.018
 seem+0.018
Neg logit contributions
Un-0.014
S-0.012
 responsible-0.012
 late-0.011
 cher-0.011
Token say
Token ID910
Feature activation+1.16e-01

Pos logit contributions
 may+0.117
ves+0.095
 seem+0.093
'd+0.090
ches+0.088
Neg logit contributions
Un-0.057
 late-0.050
 each-0.049
 cher-0.048
 least-0.048
Token
Token ID6184
Feature activation+1.53e-02

Pos logit contributions
 may+0.242
 seem+0.211
ches+0.210
oons+0.204
fters+0.204
Neg logit contributions
Un-0.111
ful-0.105
S-0.100
 each-0.100
 cher-0.100
Token
Token ID95
Feature activation+5.37e-03

Pos logit contributions
 may+0.025
ves+0.019
'd+0.019
 seem+0.019
ologists+0.018
Neg logit contributions
Un-0.025
 yourselves-0.023
ind-0.023
S-0.022
 least-0.022
Token
Token ID26391
Feature activation+2.31e-02

Pos logit contributions
 may+0.011
 seem+0.009
ves+0.009
izards+0.009
 leap+0.008
Neg logit contributions
Un-0.007
 yourselves-0.006
Thank-0.006
thank-0.006
 late-0.006
Token
Token ID129
Feature activation+2.74e-02

Pos logit contributions
 may+0.051
 seem+0.044
otion+0.042
ches+0.042
 leap+0.042
Neg logit contributions
Un-0.025
S-0.022
Thank-0.020
 late-0.020
burn-0.019
Token
Token ID241
Feature activation-3.85e-02

Pos logit contributions
 may+0.053
 seem+0.040
'd+0.040
ves+0.040
hell+0.037
Neg logit contributions
Un-0.040
 yourselves-0.037
ind-0.037
 late-0.034
 each-0.034
Token It
Token ID632
Feature activation+2.90e-02

Pos logit contributions
 may+0.015
 seem+0.012
ves+0.012
ches+0.012
'd+0.012
Neg logit contributions
Un-0.007
 yourselves-0.007
S-0.007
 each-0.007
 dads-0.007
Token has
Token ID468
Feature activation-1.99e-03

Pos logit contributions
 may+0.041
ves+0.034
 seem+0.033
otion+0.032
hell+0.032
Neg logit contributions
 least-0.046
 each-0.045
Un-0.044
 yourselves-0.042
 cher-0.042
Token a
Token ID257
Feature activation+2.27e-02

Pos logit contributions
Un+0.002
S+0.002
thank+0.002
 dads+0.002
burn+0.002
Neg logit contributions
 may-0.004
 seem-0.004
ves-0.003
 must-0.003
'd-0.003
Token button
Token ID4936
Feature activation+3.54e-03

Pos logit contributions
 may+0.042
 seem+0.034
ves+0.033
ches+0.031
'd+0.030
Neg logit contributions
Un-0.032
 yourselves-0.029
 dads-0.029
 sunscreen-0.029
S-0.029
Token that
Token ID326
Feature activation+6.82e-02

Pos logit contributions
 may+0.007
 seem+0.006
ves+0.006
ches+0.005
 must+0.005
Neg logit contributions
Un-0.005
S-0.004
 dads-0.004
 yourselves-0.004
Thank-0.004
Token says
Token ID1139
Feature activation+1.16e-01

Pos logit contributions
 may+0.103
 seem+0.080
ves+0.075
ches+0.071
'd+0.067
Neg logit contributions
Un-0.120
 yourselves-0.111
 sunscreen-0.110
 dads-0.110
S-0.109
Token "
Token ID366
Feature activation-3.04e-02

Pos logit contributions
 may+0.192
 seem+0.157
ches+0.146
ves+0.142
 leap+0.142
Neg logit contributions
Un-0.171
 yourselves-0.160
 each-0.157
 cher-0.156
S-0.154
Tokenpower
Token ID6477
Feature activation+2.48e-02

Pos logit contributions
 dads+0.044
 sunscreen+0.043
 yourselves+0.042
 cher+0.041
Un+0.041
Neg logit contributions
 may-0.053
 seem-0.047
ves-0.043
ches-0.042
 opener-0.040
Token".
Token ID1911
Feature activation+2.48e-02

Pos logit contributions
 may+0.043
 seem+0.035
ves+0.033
ches+0.032
'd+0.031
Neg logit contributions
Un-0.037
 yourselves-0.034
 dads-0.033
 sunscreen-0.033
S-0.033
Token L
Token ID406
Feature activation+9.02e-03

Pos logit contributions
 may+0.054
 seem+0.045
ves+0.044
ches+0.043
'd+0.042
Neg logit contributions
 yourselves-0.026
Un-0.026
S-0.025
 sunscreen-0.025
 late-0.024
Tokenila
Token ID10102
Feature activation+2.39e-02

Pos logit contributions
 may+0.020
 seem+0.016
 trigger+0.015
'd+0.015
 leap+0.015
Neg logit contributions
S-0.011
 dads-0.010
Un-0.010
 yourselves-0.010
P-0.009
Token,"
Token ID553
Feature activation+2.75e-02

Pos logit contributions
 may+0.041
 seem+0.033
ves+0.029
hell+0.028
ches+0.028
Neg logit contributions
Un-0.049
 former-0.045
kind-0.045
 least-0.045
 sunscreen-0.044
Token she
Token ID673
Feature activation+2.14e-02

Pos logit contributions
 may+0.051
 seem+0.042
ves+0.039
otion+0.036
 leap+0.036
Neg logit contributions
Un-0.039
ind-0.037
kind-0.037
 yourselves-0.036
burn-0.036
Token made
Token ID925
Feature activation+5.69e-02

Pos logit contributions
 may+0.044
 seem+0.037
ves+0.035
ches+0.034
otion+0.033
Neg logit contributions
Un-0.026
S-0.023
 dads-0.023
Thank-0.023
 late-0.023
Token the
Token ID262
Feature activation+1.46e-02

Pos logit contributions
 may+0.136
 seem+0.115
ves+0.110
'd+0.107
ches+0.107
Neg logit contributions
Un-0.050
 yourselves-0.045
 sunscreen-0.044
 dads-0.044
 late-0.043
Token bear
Token ID6842
Feature activation+1.63e-02

Pos logit contributions
 may+0.029
 seem+0.024
ves+0.023
ches+0.022
otion+0.022
Neg logit contributions
Un-0.018
 sunscreen-0.018
 dads-0.017
 least-0.017
 yourselves-0.017
Token say
Token ID910
Feature activation+1.15e-01

Pos logit contributions
 may+0.030
 seem+0.023
ves+0.023
'd+0.022
ches+0.022
Neg logit contributions
Un-0.022
 late-0.021
 dads-0.021
 cher-0.021
S-0.021
Token.
Token ID13
Feature activation+2.15e-02

Pos logit contributions
 may+0.259
 seem+0.213
ches+0.199
ves+0.199
 leap+0.197
Neg logit contributions
Un-0.114
 each-0.110
 late-0.108
S-0.105
 dads-0.102
Token
Token ID198
Feature activation+6.08e-03

Pos logit contributions
 may+0.039
 seem+0.032
ves+0.030
ches+0.029
otion+0.028
Neg logit contributions
Un-0.030
 yourselves-0.029
 least-0.029
ind-0.029
 sunscreen-0.028
Token
Token ID198
Feature activation+1.93e-03

Pos logit contributions
 may+0.011
 seem+0.009
ves+0.009
ches+0.009
otion+0.008
Neg logit contributions
Un-0.008
 least-0.008
 yourselves-0.008
 sunscreen-0.008
 dads-0.007
TokenShe
Token ID3347
Feature activation+5.03e-03

Pos logit contributions
 may+0.003
ves+0.003
 seem+0.003
ches+0.003
fters+0.003
Neg logit contributions
 least-0.003
elt-0.003
burn-0.002
 former-0.002
 cher-0.002
Token did
Token ID750
Feature activation+4.93e-02

Pos logit contributions
 may+0.009
 seem+0.007
ves+0.007
ches+0.006
'd+0.006
Neg logit contributions
Un-0.008
 yourselves-0.008
ind-0.007
 late-0.007
 sunscreen-0.007
Token they
Token ID484
Feature activation+2.82e-02

Pos logit contributions
 may+0.005
 seem+0.004
ves+0.004
ches+0.004
'd+0.003
Neg logit contributions
Un-0.003
 yourselves-0.003
 sunscreen-0.003
 dads-0.003
 late-0.003
Token see
Token ID766
Feature activation+5.64e-03

Pos logit contributions
 may+0.050
 seem+0.040
ves+0.036
ches+0.036
'd+0.035
Neg logit contributions
Un-0.043
 yourselves-0.040
S-0.039
ind-0.039
 cher-0.039
Token a
Token ID257
Feature activation+1.27e-02

Pos logit contributions
 may+0.010
ves+0.008
 seem+0.008
 must+0.007
izards+0.007
Neg logit contributions
har-0.009
thank-0.008
ind-0.008
 former-0.008
kind-0.008
Token sign
Token ID1051
Feature activation+2.34e-02

Pos logit contributions
 may+0.025
 seem+0.021
ves+0.020
ches+0.019
'd+0.018
Neg logit contributions
Un-0.016
 yourselves-0.015
 sunscreen-0.015
 dads-0.014
 least-0.014
Token that
Token ID326
Feature activation+5.80e-02

Pos logit contributions
 may+0.046
 seem+0.037
ves+0.037
otion+0.034
ches+0.034
Neg logit contributions
Un-0.030
 former-0.029
 yourselves-0.028
 sunscreen-0.028
 least-0.028
Token says
Token ID1139
Feature activation+1.15e-01

Pos logit contributions
 may+0.095
 seem+0.074
ves+0.070
ches+0.065
'd+0.063
Neg logit contributions
Un-0.096
 yourselves-0.089
 sunscreen-0.089
S-0.089
ind-0.088
Token "
Token ID366
Feature activation-3.58e-02

Pos logit contributions
 may+0.196
 seem+0.157
ches+0.157
 leap+0.146
ologists+0.144
Neg logit contributions
Un-0.167
 each-0.153
 cher-0.152
S-0.152
 dads-0.152
TokenDo
Token ID5211
Feature activation+5.02e-02

Pos logit contributions
 sunscreen+0.048
 dads+0.047
 yourselves+0.047
 least+0.046
 cher+0.045
Neg logit contributions
 may-0.064
 seem-0.059
ves-0.053
ches-0.051
'd-0.050
Token not
Token ID407
Feature activation-1.30e-02

Pos logit contributions
 may+0.104
 seem+0.082
ches+0.082
 trigger+0.081
'd+0.080
Neg logit contributions
Un-0.060
 dads-0.053
 yourselves-0.053
S-0.052
 each-0.052
Token go
Token ID467
Feature activation-2.35e-02

Pos logit contributions
Un+0.022
 sunscreen+0.022
thank+0.021
ind+0.021
burn+0.021
Neg logit contributions
 may-0.020
 seem-0.018
 leap-0.014
 must-0.014
 story-0.013
Token in
Token ID287
Feature activation-1.77e-03

Pos logit contributions
Un+0.029
 sunscreen+0.027
elt+0.026
Thank+0.026
 least+0.026
Neg logit contributions
 may-0.047
 seem-0.040
ves-0.037
ches-0.037
 must-0.036
Token see
Token ID766
Feature activation+2.77e-03

Pos logit contributions
ams+0.021
elt+0.021
amed+0.020
kind+0.020
attered+0.020
Neg logit contributions
 may-0.014
 seem-0.011
 roar-0.010
 hop-0.009
 grant-0.009
Token her
Token ID607
Feature activation-5.55e-03

Pos logit contributions
 may+0.005
ches+0.004
 seem+0.004
ves+0.003
 must+0.003
Neg logit contributions
har-0.004
thank-0.004
kind-0.004
Thank-0.004
 former-0.004
Token?"
Token ID1701
Feature activation+2.59e-02

Pos logit contributions
Un+0.009
Thank+0.008
elt+0.008
S+0.007
P+0.007
Neg logit contributions
 may-0.010
 seem-0.008
ches-0.008
ves-0.008
 story-0.008
Token a
Token ID257
Feature activation+1.69e-02

Pos logit contributions
 may+0.053
 seem+0.044
ves+0.042
ches+0.041
'd+0.040
Neg logit contributions
Un-0.032
 yourselves-0.029
S-0.028
 dads-0.028
 late-0.028
Token voice
Token ID3809
Feature activation+5.76e-02

Pos logit contributions
 may+0.034
 seem+0.029
ves+0.027
ches+0.026
otion+0.025
Neg logit contributions
Un-0.022
S-0.019
 late-0.019
 dads-0.019
Thank-0.019
Token says
Token ID1139
Feature activation+1.15e-01

Pos logit contributions
 may+0.118
'd+0.092
 seem+0.091
ves+0.091
 leap+0.087
Neg logit contributions
Un-0.070
 late-0.065
S-0.063
 cher-0.063
 each-0.063
Token.
Token ID13
Feature activation+2.38e-02

Pos logit contributions
 may+0.283
 seem+0.229
ves+0.229
'd+0.227
fters+0.218
Neg logit contributions
 late-0.100
Un-0.093
 each-0.093
elt-0.086
 together-0.085
Token
Token ID198
Feature activation+1.11e-02

Pos logit contributions
 may+0.054
 seem+0.046
ves+0.046
'd+0.045
hell+0.045
Neg logit contributions
Un-0.024
 late-0.023
S-0.021
Thank-0.020
Mom-0.019
Token
Token ID198
Feature activation+6.98e-03

Pos logit contributions
 may+0.026
 seem+0.022
 leap+0.021
ves+0.021
'd+0.021
Neg logit contributions
Un-0.011
S-0.010
 late-0.010
Mom-0.009
 yourselves-0.009
Token"
Token ID1
Feature activation-4.56e-02

Pos logit contributions
 may+0.013
 seem+0.010
ves+0.010
ches+0.009
'd+0.009
Neg logit contributions
 least-0.010
 former-0.010
Un-0.010
burn-0.010
 cher-0.009
TokenOf
Token ID5189
Feature activation-1.11e-02

Pos logit contributions
 former+0.063
ind+0.060
 least+0.057
eline+0.055
 property+0.055
Neg logit contributions
 may-0.089
 seem-0.076
'd-0.068
 opener-0.068
ves-0.066
Token you
Token ID345
Feature activation+3.37e-02

Pos logit contributions
Un+0.014
thank+0.013
 former+0.013
burn+0.013
Thank+0.013
Neg logit contributions
 may-0.019
 seem-0.016
ves-0.015
izards-0.014
ches-0.013
Token doing
Token ID1804
Feature activation+1.90e-02

Pos logit contributions
 may+0.045
 seem+0.035
otion+0.030
ves+0.029
 must+0.028
Neg logit contributions
Un-0.064
kind-0.060
S-0.060
thank-0.060
elt-0.059
Token?"
Token ID1701
Feature activation+2.91e-02

Pos logit contributions
 may+0.038
 seem+0.032
ves+0.030
ches+0.029
otion+0.029
Neg logit contributions
Un-0.024
 dads-0.021
 sunscreen-0.021
 yourselves-0.021
S-0.021
Token a
Token ID257
Feature activation+1.65e-02

Pos logit contributions
 may+0.047
 seem+0.042
ves+0.034
otion+0.032
 must+0.032
Neg logit contributions
burn-0.049
ind-0.048
 former-0.047
 least-0.046
Un-0.046
Token voice
Token ID3809
Feature activation+5.75e-02

Pos logit contributions
 may+0.031
 seem+0.025
ves+0.024
ches+0.023
'd+0.022
Neg logit contributions
Un-0.023
 dads-0.021
S-0.021
 yourselves-0.021
 late-0.020
Token says
Token ID1139
Feature activation+1.15e-01

Pos logit contributions
 may+0.117
 seem+0.090
'd+0.090
ves+0.089
hell+0.084
Neg logit contributions
Un-0.073
 late-0.067
 yourselves-0.066
 dads-0.066
 each-0.065
Token.
Token ID13
Feature activation+2.83e-02

Pos logit contributions
 may+0.271
 seem+0.218
'd+0.215
ves+0.213
ches+0.207
Neg logit contributions
Un-0.107
 late-0.105
 each-0.103
 cher-0.095
 dads-0.095
Token
Token ID198
Feature activation+1.91e-02

Pos logit contributions
 may+0.056
 seem+0.045
ves+0.044
ches+0.043
'd+0.041
Neg logit contributions
Un-0.038
 yourselves-0.034
S-0.034
 late-0.034
 cher-0.033
Token
Token ID198
Feature activation+1.40e-02

Pos logit contributions
 may+0.039
 seem+0.032
ves+0.031
ches+0.030
'd+0.029
Neg logit contributions
Un-0.024
 yourselves-0.022
 dads-0.021
S-0.021
 sunscreen-0.021
TokenThey
Token ID2990
Feature activation+2.10e-02

Pos logit contributions
 may+0.022
 seem+0.018
ves+0.018
hell+0.015
 must+0.015
Neg logit contributions
 former-0.023
 least-0.022
burn-0.022
 sunscreen-0.021
ind-0.021
Token turn
Token ID1210
Feature activation-1.54e-02

Pos logit contributions
 may+0.035
ves+0.031
fs+0.027
ches+0.027
 seem+0.027
Neg logit contributions
 each-0.030
 dads-0.030
Un-0.029
 least-0.029
 sunscreen-0.028
Token,
Token ID11
Feature activation+8.42e-04

Pos logit contributions
Un+0.003
 yourselves+0.002
 least+0.002
ind+0.002
 sunscreen+0.002
Neg logit contributions
 may-0.002
 seem-0.002
ves-0.002
ches-0.002
'd-0.002
Token they
Token ID484
Feature activation+2.62e-02

Pos logit contributions
 may+0.001
 seem+0.001
ves+0.001
otion+0.001
'd+0.001
Neg logit contributions
Un-0.001
ind-0.001
 former-0.001
 least-0.001
kind-0.001
Token heard
Token ID2982
Feature activation+3.71e-02

Pos logit contributions
 may+0.045
 seem+0.035
'd+0.033
 must+0.031
ves+0.030
Neg logit contributions
Un-0.041
 yourselves-0.040
ind-0.038
burn-0.038
 cher-0.038
Token their
Token ID511
Feature activation-1.26e-05

Pos logit contributions
 may+0.063
ves+0.057
 seem+0.054
 story+0.053
 must+0.050
Neg logit contributions
thank-0.054
 least-0.052
Thank-0.051
P-0.050
har-0.050
Token mom
Token ID1995
Feature activation-3.18e-02

Pos logit contributions
Un+0.000
 yourselves+0.000
thank+0.000
Thank+0.000
S+0.000
Neg logit contributions
 may-0.000
 seem-0.000
ves-0.000
 story-0.000
 must-0.000
Token say
Token ID910
Feature activation+1.14e-01

Pos logit contributions
Un+0.044
 each+0.039
 least+0.039
 late+0.039
 cher+0.038
Neg logit contributions
 may-0.061
ves-0.049
 seem-0.047
ches-0.044
hell-0.044
Token that
Token ID326
Feature activation+2.68e-02

Pos logit contributions
 may+0.223
ches+0.193
 seem+0.182
ides+0.181
ves+0.178
Neg logit contributions
Un-0.134
 each-0.127
 cher-0.123
 dads-0.122
 late-0.118
Token there
Token ID612
Feature activation+4.85e-02

Pos logit contributions
 may+0.047
 seem+0.039
ves+0.037
 must+0.034
otion+0.033
Neg logit contributions
ind-0.040
Un-0.039
 yourselves-0.038
kind-0.038
 least-0.037
Token was
Token ID373
Feature activation+3.20e-03

Pos logit contributions
 may+0.081
ches+0.067
 seem+0.064
ides+0.063
ves+0.063
Neg logit contributions
Un-0.069
 each-0.067
 cher-0.064
 yourselves-0.062
 least-0.062
Token a
Token ID257
Feature activation+1.47e-02

Pos logit contributions
 may+0.006
 seem+0.006
ves+0.005
 must+0.005
 story+0.005
Neg logit contributions
thank-0.004
ind-0.004
elt-0.004
Thank-0.004
har-0.004
Token festival
Token ID10876
Feature activation+2.72e-03

Pos logit contributions
 may+0.026
 seem+0.020
ves+0.019
 story+0.019
 leap+0.019
Neg logit contributions
 yourselves-0.021
Un-0.021
elt-0.020
thank-0.020
 least-0.020
Token my
Token ID616
Feature activation+4.71e-03

Pos logit contributions
Un+0.016
burn+0.015
elt+0.015
view+0.015
Sorry+0.014
Neg logit contributions
 may-0.021
 must-0.017
 seem-0.017
 roar-0.017
ches-0.016
Token friends
Token ID2460
Feature activation+1.11e-02

Pos logit contributions
 may+0.009
 seem+0.007
 must+0.007
 story+0.006
ves+0.006
Neg logit contributions
Un-0.007
S-0.006
elt-0.006
 yourselves-0.006
Thank-0.006
Token?"
Token ID1701
Feature activation+2.80e-02

Pos logit contributions
 may+0.017
 seem+0.013
ches+0.012
 story+0.012
 must+0.012
Neg logit contributions
Un-0.019
elt-0.018
 cher-0.017
 dads-0.017
 yourselves-0.017
Token the
Token ID262
Feature activation+1.43e-02

Pos logit contributions
 may+0.053
 seem+0.044
ves+0.041
ches+0.038
otion+0.037
Neg logit contributions
Un-0.039
 sunscreen-0.035
 yourselves-0.035
ind-0.035
burn-0.035
Token robot
Token ID9379
Feature activation-1.35e-02

Pos logit contributions
 may+0.025
 seem+0.020
ves+0.019
ches+0.018
'd+0.018
Neg logit contributions
Un-0.022
 yourselves-0.020
 sunscreen-0.020
 dads-0.020
S-0.019
Token says
Token ID1139
Feature activation+1.14e-01

Pos logit contributions
Un+0.017
 each+0.016
 dads+0.016
 late+0.015
 cher+0.015
Neg logit contributions
 may-0.026
 seem-0.021
ves-0.021
otion-0.020
fs-0.019
Token.
Token ID13
Feature activation+2.52e-02

Pos logit contributions
 may+0.272
 seem+0.215
ves+0.215
'd+0.211
ches+0.207
Neg logit contributions
 late-0.105
Un-0.103
 each-0.101
kind-0.091
S-0.091
Token
Token ID198
Feature activation+1.37e-02

Pos logit contributions
 may+0.056
 seem+0.048
ves+0.046
ches+0.045
 leap+0.044
Neg logit contributions
Un-0.027
S-0.024
 late-0.024
Thank-0.023
Mom-0.022
Token
Token ID198
Feature activation+9.89e-03

Pos logit contributions
 may+0.030
 seem+0.025
ves+0.024
ches+0.023
 leap+0.023
Neg logit contributions
Un-0.015
S-0.014
 late-0.014
 yourselves-0.014
Thank-0.013
Token"
Token ID1
Feature activation-4.32e-02

Pos logit contributions
 may+0.021
 seem+0.017
ves+0.017
ches+0.016
'd+0.015
Neg logit contributions
Un-0.011
 sunscreen-0.011
burn-0.011
 least-0.010
 yourselves-0.010
TokenYes
Token ID5297
Feature activation+3.04e-02

Pos logit contributions
ind+0.050
burn+0.047
 former+0.046
eline+0.045
 yourselves+0.045
Neg logit contributions
 may-0.091
 seem-0.080
 opener-0.073
ves-0.073
'd-0.072
Token he
Token ID339
Feature activation+3.82e-03

Pos logit contributions
 may+0.008
 seem+0.006
ves+0.006
ches+0.006
'd+0.006
Neg logit contributions
Un-0.005
 late-0.004
 yourselves-0.004
Thank-0.004
 cher-0.004
Token heard
Token ID2982
Feature activation+3.18e-02

Pos logit contributions
 may+0.007
 seem+0.006
ves+0.006
ches+0.005
otion+0.005
Neg logit contributions
Un-0.005
 yourselves-0.005
 sunscreen-0.005
 dads-0.005
S-0.005
Token a
Token ID257
Feature activation+5.56e-03

Pos logit contributions
 may+0.067
 seem+0.059
ves+0.057
ches+0.053
 story+0.052
Neg logit contributions
Un-0.033
 least-0.033
 dads-0.033
thank-0.032
 yourselves-0.032
Token tiny
Token ID7009
Feature activation+1.07e-02

Pos logit contributions
 may+0.011
ves+0.009
 seem+0.009
ches+0.009
izards+0.009
Neg logit contributions
Un-0.008
S-0.006
 late-0.006
 responsible-0.006
 cher-0.006
Token voice
Token ID3809
Feature activation+4.71e-02

Pos logit contributions
 may+0.021
ves+0.018
 seem+0.017
'd+0.016
ches+0.016
Neg logit contributions
Un-0.014
ind-0.013
S-0.012
 cher-0.012
 late-0.012
Token say
Token ID910
Feature activation+1.14e-01

Pos logit contributions
 may+0.096
 seem+0.077
ves+0.076
ches+0.072
'd+0.071
Neg logit contributions
Un-0.059
 late-0.052
 yourselves-0.052
 dads-0.051
 least-0.051
Token,
Token ID11
Feature activation+3.23e-02

Pos logit contributions
 may+0.242
 seem+0.206
ches+0.206
oons+0.204
ides+0.203
Neg logit contributions
Un-0.115
ful-0.099
 each-0.099
S-0.098
 home-0.096
Token "
Token ID366
Feature activation-5.22e-02

Pos logit contributions
 may+0.058
ves+0.048
 seem+0.046
ches+0.043
hell+0.043
Neg logit contributions
Un-0.048
 late-0.043
 each-0.042
elt-0.042
 cher-0.042
TokenHi
Token ID17250
Feature activation-2.98e-03

Pos logit contributions
Un+0.042
 yourselves+0.036
 sunscreen+0.035
 dads+0.035
 cher+0.034
Neg logit contributions
 may-0.129
 seem-0.111
ves-0.107
ches-0.104
'd-0.102
Token,
Token ID11
Feature activation+1.31e-03

Pos logit contributions
Un+0.004
 cher+0.003
 each+0.003
S+0.003
 yourselves+0.003
Neg logit contributions
 may-0.006
ves-0.005
 seem-0.005
ches-0.005
'd-0.005
Token weak
Token ID4939
Feature activation-4.65e-03

Pos logit contributions
 may+0.002
ves+0.002
ides+0.002
ches+0.002
 seem+0.002
Neg logit contributions
Un-0.002
 cher-0.001
 dads-0.001
 each-0.001
 yourselves-0.001
Token the
Token ID262
Feature activation-5.72e-03

Pos logit contributions
Un+0.046
 yourselves+0.043
 sunscreen+0.042
ind+0.042
 dads+0.042
Neg logit contributions
 may-0.046
 seem-0.036
ves-0.034
ches-0.032
'd-0.031
Token pony
Token ID26902
Feature activation-2.62e-02

Pos logit contributions
Un+0.007
 yourselves+0.007
elt+0.007
 dads+0.007
S+0.007
Neg logit contributions
 may-0.011
 seem-0.009
ves-0.009
ches-0.008
'd-0.008
Token went
Token ID1816
Feature activation-4.14e-02

Pos logit contributions
Un+0.044
 yourselves+0.043
 sunscreen+0.041
 dads+0.040
 late+0.040
Neg logit contributions
 may-0.045
 seem-0.034
'd-0.032
ves-0.032
ches-0.031
Token off
Token ID572
Feature activation-3.13e-02

Pos logit contributions
Un+0.059
 yourselves+0.057
kind+0.054
 dads+0.054
 least+0.053
Neg logit contributions
 may-0.076
 seem-0.062
ches-0.060
ves-0.060
 leap-0.057
Token on
Token ID319
Feature activation-8.84e-03

Pos logit contributions
Un+0.043
 yourselves+0.042
 dads+0.040
 sunscreen+0.039
S+0.039
Neg logit contributions
 may-0.059
 seem-0.050
ches-0.046
ves-0.046
 leap-0.045
Token even
Token ID772
Feature activation-1.19e-01

Pos logit contributions
elt+0.013
Un+0.013
ety+0.013
 dads+0.013
burn+0.013
Neg logit contributions
 may-0.016
 seem-0.013
ves-0.011
 leap-0.011
 story-0.011
Token greater
Token ID3744
Feature activation-6.46e-02

Pos logit contributions
burn+0.185
 yourselves+0.180
 dads+0.179
Un+0.178
 sunscreen+0.177
Neg logit contributions
 may-0.192
 seem-0.166
'd-0.148
 leap-0.146
ches-0.137
Token adventures
Token ID17545
Feature activation-3.52e-02

Pos logit contributions
Un+0.097
burn+0.094
ety+0.093
elt+0.092
Mom+0.087
Neg logit contributions
 may-0.122
 seem-0.097
ves-0.092
 leap-0.088
 must-0.087
Token.
Token ID13
Feature activation-1.94e-02

Pos logit contributions
Un+0.048
 yourselves+0.044
 dads+0.043
 sunscreen+0.043
S+0.043
Neg logit contributions
 may-0.067
 seem-0.055
ves-0.052
ches-0.050
'd-0.048
Token He
Token ID679
Feature activation-1.30e-02

Pos logit contributions
Un+0.029
 yourselves+0.028
 sunscreen+0.028
ind+0.028
kind+0.028
Neg logit contributions
 may-0.034
 seem-0.027
ves-0.024
ches-0.023
'd-0.023
Token was
Token ID373
Feature activation-6.77e-04

Pos logit contributions
Un+0.023
 yourselves+0.022
 sunscreen+0.022
 dads+0.021
 cher+0.021
Neg logit contributions
 may-0.021
 seem-0.016
'd-0.015
ves-0.014
ches-0.014
Token to
Token ID284
Feature activation+3.33e-02

Pos logit contributions
Un+0.069
Thank+0.064
 yourselves+0.064
S+0.063
P+0.062
Neg logit contributions
 may-0.060
 seem-0.049
ves-0.045
ches-0.043
'd-0.042
Token stay
Token ID2652
Feature activation+9.54e-03

Pos logit contributions
 may+0.051
 seem+0.041
 leap+0.036
ves+0.035
'd+0.034
Neg logit contributions
elt-0.059
Un-0.059
kind-0.056
ind-0.056
 late-0.055
Token in
Token ID287
Feature activation-3.66e-04

Pos logit contributions
 may+0.018
 seem+0.014
ves+0.014
ches+0.014
 must+0.013
Neg logit contributions
Un-0.014
 dads-0.012
elt-0.012
 cher-0.011
 yourselves-0.011
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
Un+0.001
elt+0.001
 dads+0.001
kind+0.001
 yourselves+0.000
Neg logit contributions
 may-0.001
 seem-0.001
ves-0.000
 must-0.000
 leap-0.000
Token So
Token ID1406
Feature activation-4.43e-02

Pos logit contributions
Un+0.019
ind+0.018
 yourselves+0.018
 sunscreen+0.017
 dads+0.017
Neg logit contributions
 may-0.021
 seem-0.017
ves-0.015
ches-0.015
 must-0.014
Token even
Token ID772
Feature activation-1.16e-01

Pos logit contributions
Un+0.075
ind+0.068
 yourselves+0.068
kind+0.067
thank+0.067
Neg logit contributions
 may-0.075
 seem-0.057
ves-0.054
ches-0.050
 leap-0.050
Token though
Token ID996
Feature activation-2.74e-04

Pos logit contributions
Un+0.151
 yourselves+0.148
 sunscreen+0.142
ind+0.140
 dads+0.138
Neg logit contributions
 may-0.223
 seem-0.187
ches-0.175
ves-0.174
'd-0.172
Token the
Token ID262
Feature activation-5.16e-03

Pos logit contributions
Un+0.000
 yourselves+0.000
 dads+0.000
S+0.000
 sunscreen+0.000
Neg logit contributions
 may-0.000
 seem-0.000
ves-0.000
ches-0.000
'd-0.000
Token sc
Token ID629
Feature activation-5.16e-02

Pos logit contributions
Un+0.007
 late+0.007
 yourselves+0.006
elt+0.006
Our+0.006
Neg logit contributions
 may-0.010
 seem-0.008
 story-0.008
'd-0.008
 must-0.007
Tokenooter
Token ID25141
Feature activation-4.19e-02

Pos logit contributions
Un+0.072
 sunscreen+0.064
 yourselves+0.064
 late+0.064
Thank+0.064
Neg logit contributions
 may-0.099
 seem-0.082
ves-0.078
ches-0.075
'd-0.072
Token was
Token ID373
Feature activation+1.06e-03

Pos logit contributions
Un+0.080
 yourselves+0.076
 sunscreen+0.074
 dads+0.074
ind+0.073
Neg logit contributions
 may-0.062
 seem-0.046
ves-0.043
'd-0.040
ches-0.040
Token being
Token ID852
Feature activation-3.94e-02

Pos logit contributions
Un+0.034
elt+0.033
S+0.032
kind+0.032
P+0.032
Neg logit contributions
 may-0.042
 seem-0.034
ves-0.030
 must-0.029
'd-0.028
Token careful
Token ID8161
Feature activation+4.07e-02

Pos logit contributions
Un+0.060
burn+0.060
view+0.058
Mom+0.056
 sunscreen+0.055
Neg logit contributions
 may-0.067
 must-0.057
 seem-0.056
ches-0.049
'd-0.049
Token driving
Token ID5059
Feature activation-5.81e-02

Pos logit contributions
 may+0.084
 seem+0.070
ves+0.067
ches+0.064
'd+0.062
Neg logit contributions
Un-0.051
 sunscreen-0.045
S-0.045
 dads-0.045
 cher-0.044
Token.
Token ID13
Feature activation-1.60e-02

Pos logit contributions
Un+0.088
elt+0.083
burn+0.082
Thank+0.079
S+0.078
Neg logit contributions
 may-0.111
 seem-0.085
ches-0.081
ves-0.080
 must-0.080
Token So
Token ID1406
Feature activation-4.62e-02

Pos logit contributions
Un+0.024
 sunscreen+0.024
burn+0.023
 dads+0.023
ind+0.023
Neg logit contributions
 may-0.028
 seem-0.023
ves-0.020
ches-0.019
 must-0.019
Token even
Token ID772
Feature activation-1.13e-01

Pos logit contributions
Un+0.075
ind+0.068
elt+0.067
 sunscreen+0.067
kind+0.067
Neg logit contributions
 may-0.083
 seem-0.064
ves-0.060
 must-0.056
 leap-0.055
Token though
Token ID996
Feature activation+1.28e-03

Pos logit contributions
Un+0.153
burn+0.150
 sunscreen+0.149
 yourselves+0.147
ind+0.145
Neg logit contributions
 may-0.216
 seem-0.181
'd-0.160
ves-0.160
 must-0.159
Token the
Token ID262
Feature activation-3.66e-03

Pos logit contributions
 may+0.002
 seem+0.002
ves+0.002
 leap+0.002
 must+0.002
Neg logit contributions
Un-0.002
elt-0.002
thank-0.002
kind-0.002
burn-0.002
Token prize
Token ID11596
Feature activation-5.39e-02

Pos logit contributions
Un+0.007
elt+0.006
burn+0.006
ind+0.006
 yourselves+0.006
Neg logit contributions
 may-0.006
 seem-0.005
ves-0.004
 leap-0.004
hell-0.004
Token was
Token ID373
Feature activation-2.47e-03

Pos logit contributions
Un+0.088
 least+0.082
 yourselves+0.082
 dads+0.082
 cher+0.082
Neg logit contributions
 may-0.083
 seem-0.066
ves-0.062
ches-0.060
otion-0.057
Token gone
Token ID3750
Feature activation-1.93e-02

Pos logit contributions
Un+0.003
 yourselves+0.003
P+0.003
S+0.003
elt+0.003
Neg logit contributions
 may-0.005
 seem-0.004
 must-0.004
'd-0.004
ches-0.004
Token story
Token ID1621
Feature activation+5.75e-03

Pos logit contributions
 cher+0.092
Un+0.091
 least+0.091
 dads+0.090
 elderly+0.090
Neg logit contributions
 may-0.048
ves-0.036
 seem-0.033
ches-0.033
ows-0.032
Token:
Token ID25
Feature activation-4.79e-02

Pos logit contributions
 may+0.009
 seem+0.007
ves+0.007
ches+0.006
'd+0.006
Neg logit contributions
Un-0.010
 yourselves-0.009
 sunscreen-0.009
 dads-0.009
S-0.009
Token "
Token ID366
Feature activation-7.51e-02

Pos logit contributions
Un+0.071
 yourselves+0.067
 least+0.066
 late+0.065
Thank+0.065
Neg logit contributions
 may-0.082
 seem-0.066
ves-0.066
ches-0.063
hell-0.060
TokenTogether
Token ID41631
Feature activation-3.00e-02

Pos logit contributions
 sunscreen+0.086
 dads+0.086
 cher+0.079
 yourselves+0.077
 automobile+0.074
Neg logit contributions
 may-0.141
 seem-0.125
'd-0.122
ves-0.112
 must-0.109
Token,
Token ID11
Feature activation-2.22e-02

Pos logit contributions
Un+0.053
 sunscreen+0.048
S+0.048
ind+0.048
 yourselves+0.047
Neg logit contributions
 may-0.048
 seem-0.037
ves-0.034
ches-0.032
 must-0.031
Token even
Token ID772
Feature activation-1.12e-01

Pos logit contributions
Un+0.032
ind+0.028
 sunscreen+0.028
 yourselves+0.028
S+0.028
Neg logit contributions
 may-0.042
 seem-0.034
ves-0.032
ches-0.030
 leap-0.030
Token the
Token ID262
Feature activation+5.49e-04

Pos logit contributions
view+0.186
burn+0.185
Un+0.182
 sunscreen+0.179
har+0.176
Neg logit contributions
 may-0.180
 seem-0.147
 must-0.123
 leap-0.121
'd-0.120
Token smallest
Token ID18197
Feature activation-4.57e-02

Pos logit contributions
 may+0.001
 seem+0.001
ves+0.001
'd+0.001
ches+0.001
Neg logit contributions
Un-0.001
 yourselves-0.001
 late-0.001
 sunscreen-0.001
 dads-0.001
Token of
Token ID286
Feature activation-8.06e-03

Pos logit contributions
Un+0.066
 yourselves+0.059
 sunscreen+0.058
 dads+0.058
S+0.058
Neg logit contributions
 may-0.085
 seem-0.070
ves-0.066
ches-0.063
'd-0.061
Token us
Token ID514
Feature activation+7.86e-03

Pos logit contributions
Un+0.012
burn+0.010
elt+0.010
 late+0.010
thank+0.010
Neg logit contributions
 may-0.016
 seem-0.013
ves-0.012
 leap-0.012
izards-0.012
Token can
Token ID460
Feature activation-3.90e-03

Pos logit contributions
 may+0.013
 seem+0.010
ves+0.009
ches+0.009
'd+0.009
Neg logit contributions
Un-0.013
S-0.012
 sunscreen-0.012
 yourselves-0.012
burn-0.012
Token helper
Token ID31904
Feature activation+4.70e-04

Pos logit contributions
Un+0.025
 yourselves+0.022
 sunscreen+0.021
 least+0.020
 dads+0.020
Neg logit contributions
 may-0.057
 seem-0.049
ves-0.046
ches-0.045
'd-0.044
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
 may+0.001
 seem+0.001
ves+0.001
ches+0.001
'd+0.001
Neg logit contributions
Un-0.001
 yourselves-0.001
 sunscreen-0.001
 dads-0.001
S-0.001
Token On
Token ID1550
Feature activation-8.75e-03

Pos logit contributions
Un+0.020
ind+0.019
 yourselves+0.019
kind+0.019
 sunscreen+0.019
Neg logit contributions
 may-0.022
 seem-0.017
ves-0.015
ches-0.015
 must-0.014
Token the
Token ID262
Feature activation-2.10e-02

Pos logit contributions
Un+0.013
elt+0.012
ind+0.012
 yourselves+0.012
kind+0.012
Neg logit contributions
 may-0.016
ves-0.013
 seem-0.013
ches-0.012
 must-0.011
Token way
Token ID835
Feature activation-3.41e-02

Pos logit contributions
Un+0.041
 yourselves+0.040
elt+0.040
kind+0.039
S+0.039
Neg logit contributions
 may-0.029
 seem-0.021
ves-0.019
ches-0.018
 leap-0.017
Token back
Token ID736
Feature activation-1.11e-01

Pos logit contributions
 home+0.026
 each+0.025
Un+0.024
 late+0.023
 least+0.023
Neg logit contributions
 may-0.082
 seem-0.073
ves-0.071
ches-0.068
ides-0.068
Token home
Token ID1363
Feature activation-6.17e-02

Pos logit contributions
Un+0.110
 yourselves+0.095
kind+0.095
Thank+0.095
ind+0.094
Neg logit contributions
 may-0.261
 seem-0.220
ves-0.207
 must-0.203
 leap-0.203
Token,
Token ID11
Feature activation-1.06e-02

Pos logit contributions
Un+0.108
 sunscreen+0.100
 yourselves+0.099
ind+0.099
S+0.098
Neg logit contributions
 may-0.096
 seem-0.075
ves-0.070
ches-0.064
'd-0.063
Token Tim
Token ID5045
Feature activation-1.43e-02

Pos logit contributions
ind+0.019
Un+0.019
kind+0.018
 yourselves+0.017
 least+0.017
Neg logit contributions
 may-0.016
 seem-0.012
ves-0.011
 must-0.010
'd-0.010
Tokenmy
Token ID1820
Feature activation-3.80e-02

Pos logit contributions
Un+0.012
 yourselves+0.010
 dads+0.010
 least+0.010
 sunscreen+0.010
Neg logit contributions
 may-0.035
 seem-0.030
ves-0.029
ches-0.028
otion-0.027
Token asked
Token ID1965
Feature activation+1.68e-02

Pos logit contributions
Un+0.058
 least+0.052
 sunscreen+0.052
 yourselves+0.052
 late+0.052
Neg logit contributions
 may-0.066
 seem-0.053
ves-0.050
ches-0.047
otion-0.047
Token.
Token ID13
Feature activation-8.93e-03

Pos logit contributions
 may+0.079
 seem+0.065
ves+0.061
ches+0.059
'd+0.056
Neg logit contributions
Un-0.061
 dads-0.056
 yourselves-0.056
 sunscreen-0.056
 cher-0.055
Token He
Token ID679
Feature activation-4.31e-03

Pos logit contributions
Un+0.016
ind+0.015
 yourselves+0.015
burn+0.015
kind+0.015
Neg logit contributions
 may-0.013
 seem-0.010
ves-0.009
ches-0.009
'd-0.008
Token decided
Token ID3066
Feature activation-3.66e-02

Pos logit contributions
Un+0.007
 yourselves+0.007
 late+0.006
 sunscreen+0.006
 cher+0.006
Neg logit contributions
 may-0.007
 seem-0.006
ves-0.005
ches-0.005
'd-0.005
Token to
Token ID284
Feature activation+2.27e-03

Pos logit contributions
Un+0.053
ind+0.051
kind+0.048
 former+0.048
burn+0.048
Neg logit contributions
 may-0.070
 seem-0.057
ves-0.053
'd-0.051
 must-0.050
Token go
Token ID467
Feature activation-5.07e-02

Pos logit contributions
 may+0.004
 seem+0.003
ves+0.003
ches+0.003
 leap+0.002
Neg logit contributions
Un-0.004
 yourselves-0.004
 late-0.004
elt-0.003
ind-0.003
Token even
Token ID772
Feature activation-1.11e-01

Pos logit contributions
Un+0.068
 yourselves+0.059
kind+0.059
Thank+0.058
 cher+0.057
Neg logit contributions
 may-0.103
 seem-0.088
ches-0.083
 leap-0.079
ves-0.078
Token further
Token ID2252
Feature activation-5.60e-02

Pos logit contributions
Un+0.184
elt+0.181
ind+0.173
 cher+0.172
ich+0.171
Neg logit contributions
 may-0.157
 seem-0.144
ches-0.127
 leap-0.123
'd-0.114
Token and
Token ID290
Feature activation-3.37e-02

Pos logit contributions
Un+0.109
Sweet+0.100
elt+0.100
Mom+0.098
burn+0.097
Neg logit contributions
 may-0.079
 seem-0.066
ches-0.061
 might-0.057
 leap-0.053
Token explore
Token ID7301
Feature activation-2.48e-02

Pos logit contributions
Un+0.054
ind+0.053
elt+0.051
 yourselves+0.050
 cher+0.050
Neg logit contributions
 may-0.058
 seem-0.049
 leap-0.042
'd-0.041
ches-0.040
Token the
Token ID262
Feature activation-7.65e-03

Pos logit contributions
Un+0.034
 dads+0.030
 yourselves+0.030
burn+0.030
S+0.030
Neg logit contributions
 may-0.049
 seem-0.040
ves-0.038
ches-0.036
'd-0.035
Token other
Token ID584
Feature activation-6.42e-02

Pos logit contributions
Un+0.013
elt+0.012
burn+0.012
 dads+0.011
S+0.011
Neg logit contributions
 may-0.013
 seem-0.009
 story-0.009
 must-0.009
'd-0.009
Token big
Token ID1263
Feature activation+1.42e-02

Pos logit contributions
 may+0.065
 seem+0.050
ves+0.048
'd+0.047
ches+0.046
Neg logit contributions
Un-0.065
elt-0.059
 dads-0.058
 yourselves-0.058
S-0.058
Token before
Token ID878
Feature activation+3.48e-02

Pos logit contributions
 may+0.026
 seem+0.020
ves+0.018
 leap+0.018
 must+0.018
Neg logit contributions
Un-0.022
S-0.020
 yourselves-0.020
kind-0.020
ind-0.020
Token!
Token ID0
Feature activation-1.04e-02

Pos logit contributions
 may+0.063
 seem+0.051
ves+0.049
ches+0.045
'd+0.044
Neg logit contributions
Un-0.051
 dads-0.047
kind-0.047
 yourselves-0.047
ind-0.047
Token She
Token ID1375
Feature activation-3.39e-03

Pos logit contributions
Un+0.017
ind+0.017
 dads+0.017
kind+0.016
 sunscreen+0.016
Neg logit contributions
 may-0.016
 seem-0.013
ves-0.011
ches-0.011
 must-0.011
Token could
Token ID714
Feature activation-4.77e-03

Pos logit contributions
Un+0.005
 yourselves+0.005
 sunscreen+0.005
 dads+0.005
S+0.005
Neg logit contributions
 may-0.006
 seem-0.005
ves-0.004
ches-0.004
'd-0.004
Token even
Token ID772
Feature activation-1.11e-01

Pos logit contributions
Un+0.007
 yourselves+0.007
 sunscreen+0.007
 dads+0.007
 least+0.007
Neg logit contributions
 may-0.008
 seem-0.007
ves-0.006
ches-0.006
'd-0.006
Token sit
Token ID1650
Feature activation-6.65e-03

Pos logit contributions
Un+0.200
ind+0.194
 yourselves+0.194
S+0.192
kind+0.191
Neg logit contributions
 may-0.161
 seem-0.141
 leap-0.120
'd-0.111
ves-0.111
Token inside
Token ID2641
Feature activation-3.29e-03

Pos logit contributions
Un+0.009
 dads+0.009
 cher+0.009
 least+0.008
 former+0.008
Neg logit contributions
 may-0.012
 seem-0.010
ves-0.009
ches-0.009
'd-0.009
Token it
Token ID340
Feature activation+7.19e-03

Pos logit contributions
Un+0.004
 dads+0.004
 yourselves+0.004
 sunscreen+0.004
S+0.004
Neg logit contributions
 may-0.007
 seem-0.006
ves-0.005
ches-0.005
'd-0.005
Token.
Token ID13
Feature activation-8.15e-03

Pos logit contributions
 may+0.013
 seem+0.011
ches+0.010
ves+0.010
 must+0.009
Neg logit contributions
Un-0.011
 dads-0.011
 yourselves-0.010
S-0.010
kind-0.010
Token␣
Token ID220
Feature activation+1.68e-02

Pos logit contributions
Un+0.013
ind+0.013
kind+0.012
 yourselves+0.012
 dads+0.012
Neg logit contributions
 may-0.014
 seem-0.011
ves-0.009
ches-0.009
 must-0.009
Token to
Token ID284
Feature activation+2.23e-02

Pos logit contributions
Un+0.031
Thank+0.028
 yourselves+0.028
S+0.028
kind+0.028
Neg logit contributions
 may-0.059
 seem-0.049
ves-0.047
ches-0.046
 leap-0.044
Token climb
Token ID12080
Feature activation-2.26e-02

Pos logit contributions
 may+0.035
 seem+0.030
 leap+0.025
ves+0.025
 must+0.024
Neg logit contributions
Un-0.039
elt-0.038
ind-0.036
S-0.036
kind-0.036
Token again
Token ID757
Feature activation-6.41e-02

Pos logit contributions
Un+0.041
 dads+0.038
Thank+0.038
elt+0.038
kind+0.037
Neg logit contributions
 may-0.036
 seem-0.028
ches-0.026
ves-0.026
 must-0.024
Token,
Token ID11
Feature activation+4.01e-03

Pos logit contributions
kind+0.122
Un+0.120
 dads+0.119
burn+0.118
ind+0.116
Neg logit contributions
 may-0.086
 seem-0.072
 must-0.057
 leap-0.057
 might-0.055
Token reaching
Token ID8978
Feature activation+1.05e-02

Pos logit contributions
 may+0.007
 seem+0.006
 leap+0.005
ves+0.005
'd+0.005
Neg logit contributions
Un-0.006
ind-0.006
kind-0.006
P-0.006
elt-0.006
Token even
Token ID772
Feature activation-1.10e-01

Pos logit contributions
 may+0.019
 seem+0.015
ches+0.014
 leap+0.014
 must+0.013
Neg logit contributions
Un-0.017
Thank-0.015
S-0.015
kind-0.015
Sorry-0.015
Token higher
Token ID2440
Feature activation-3.01e-02

Pos logit contributions
Un+0.163
ind+0.163
burn+0.161
 dads+0.154
kind+0.153
Neg logit contributions
 may-0.186
 seem-0.169
 leap-0.145
ches-0.139
'd-0.138
Token than
Token ID621
Feature activation-3.77e-02

Pos logit contributions
Un+0.048
burn+0.044
kind+0.044
elt+0.043
Thank+0.042
Neg logit contributions
 may-0.052
 seem-0.044
ves-0.038
 leap-0.037
ches-0.037
Token before
Token ID878
Feature activation+3.04e-02

Pos logit contributions
Un+0.061
ind+0.059
burn+0.057
S+0.056
kind+0.056
Neg logit contributions
 may-0.065
 seem-0.055
'd-0.046
 leap-0.045
ves-0.044
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
 may+0.050
 seem+0.040
ves+0.036
 leap+0.034
 must+0.033
Neg logit contributions
Un-0.050
 dads-0.048
kind-0.048
ind-0.048
S-0.046
Token When
Token ID1649
Feature activation-1.83e-03

Pos logit contributions
Un+0.020
ind+0.020
burn+0.019
kind+0.019
 yourselves+0.019
Neg logit contributions
 may-0.019
 seem-0.015
ves-0.013
ches-0.012
'd-0.012
Token.
Token ID13
Feature activation-6.33e-03

Pos logit contributions
 may+0.062
 seem+0.051
ves+0.049
ches+0.045
'd+0.045
Neg logit contributions
Un-0.046
burn-0.041
 cher-0.041
 dads-0.041
S-0.041
Token From
Token ID3574
Feature activation-3.89e-02

Pos logit contributions
Un+0.010
burn+0.010
ind+0.010
 sunscreen+0.010
 yourselves+0.010
Neg logit contributions
 may-0.010
 seem-0.008
ves-0.007
ches-0.007
 must-0.007
Token then
Token ID788
Feature activation-6.20e-02

Pos logit contributions
burn+0.066
Un+0.063
elt+0.061
Sweet+0.059
 former+0.059
Neg logit contributions
 may-0.067
 must-0.048
ves-0.046
ches-0.044
 might-0.043
Token on
Token ID319
Feature activation-4.11e-03

Pos logit contributions
Un+0.070
 yourselves+0.061
 late+0.061
S+0.060
Thank+0.060
Neg logit contributions
 may-0.131
 seem-0.112
ves-0.107
ches-0.103
'd-0.100
Token,
Token ID11
Feature activation-2.24e-02

Pos logit contributions
Un+0.007
 yourselves+0.006
 dads+0.006
 sunscreen+0.006
S+0.006
Neg logit contributions
 may-0.007
 seem-0.005
ves-0.005
ches-0.005
'd-0.005
Token even
Token ID772
Feature activation-1.10e-01

Pos logit contributions
Un+0.031
 yourselves+0.029
ind+0.029
 sunscreen+0.029
S+0.028
Neg logit contributions
 may-0.043
 seem-0.035
ves-0.033
ches-0.031
 leap-0.030
Token when
Token ID618
Feature activation+1.86e-02

Pos logit contributions
 yourselves+0.144
 sunscreen+0.142
Un+0.139
ind+0.139
burn+0.139
Neg logit contributions
 may-0.217
 seem-0.184
'd-0.163
 must-0.160
ches-0.157
Token he
Token ID339
Feature activation-9.65e-03

Pos logit contributions
 may+0.029
 seem+0.023
ves+0.020
 must+0.018
 leap+0.018
Neg logit contributions
Un-0.033
elt-0.031
S-0.031
ind-0.031
thank-0.030
Token was
Token ID373
Feature activation-1.19e-02

Pos logit contributions
Un+0.017
 yourselves+0.016
 sunscreen+0.016
 dads+0.016
 cher+0.015
Neg logit contributions
 may-0.015
 seem-0.012
ves-0.011
ches-0.010
'd-0.010
Token very
Token ID845
Feature activation-1.93e-02

Pos logit contributions
Un+0.019
S+0.018
 yourselves+0.017
ind+0.017
Thank+0.017
Neg logit contributions
 may-0.020
 seem-0.017
ves-0.015
ches-0.015
'd-0.015
Token busy
Token ID8179
Feature activation-1.54e-02

Pos logit contributions
Un+0.025
burn+0.024
 yourselves+0.023
 cher+0.022
 sunscreen+0.022
Neg logit contributions
 may-0.038
 seem-0.034
ves-0.029
'd-0.029
 leap-0.029
Token and
Token ID290
Feature activation+1.33e-02

Pos logit contributions
Un+0.002
 dads+0.002
 sunscreen+0.002
 cher+0.002
 yourselves+0.002
Neg logit contributions
 may-0.003
 seem-0.003
ches-0.002
'd-0.002
ves-0.002
Token be
Token ID307
Feature activation+7.00e-03

Pos logit contributions
 may+0.022
 seem+0.018
 leap+0.015
'd+0.015
 must+0.015
Neg logit contributions
Un-0.024
ind-0.023
S-0.023
kind-0.022
elt-0.022
Token able
Token ID1498
Feature activation+1.75e-02

Pos logit contributions
 may+0.011
ves+0.010
'd+0.010
 seem+0.009
 must+0.009
Neg logit contributions
Un-0.010
elt-0.009
 cher-0.009
 least-0.008
 dads-0.008
Token to
Token ID284
Feature activation+1.06e-02

Pos logit contributions
 may+0.038
 seem+0.032
ves+0.030
'd+0.029
ches+0.029
Neg logit contributions
Un-0.020
S-0.018
 yourselves-0.018
 sunscreen-0.018
ind-0.017
Token help
Token ID1037
Feature activation+3.07e-02

Pos logit contributions
 may+0.014
 seem+0.011
ves+0.010
 leap+0.009
ches+0.009
Neg logit contributions
Un-0.021
elt-0.020
S-0.019
 dads-0.019
kind-0.019
Token even
Token ID772
Feature activation-1.10e-01

Pos logit contributions
 may+0.054
 seem+0.045
ves+0.042
ches+0.040
 leap+0.037
Neg logit contributions
Un-0.046
elt-0.044
Thank-0.044
kind-0.043
 sunscreen-0.042
Token more
Token ID517
Feature activation-5.77e-02

Pos logit contributions
Un+0.186
elt+0.185
kind+0.178
ind+0.178
eline+0.177
Neg logit contributions
 may-0.165
 seem-0.146
 leap-0.121
ves-0.112
'd-0.110
Token people
Token ID661
Feature activation-2.86e-02

Pos logit contributions
Un+0.084
elt+0.082
burn+0.079
ety+0.078
Thank+0.077
Neg logit contributions
 may-0.106
 seem-0.086
ves-0.083
ches-0.074
 leap-0.074
Token.
Token ID13
Feature activation-5.34e-03

Pos logit contributions
Un+0.040
S+0.036
 yourselves+0.036
 sunscreen+0.036
 dads+0.036
Neg logit contributions
 may-0.055
 seem-0.046
ves-0.042
ches-0.041
'd-0.040
Token<|endoftext|>
Token ID50256
Feature activation+3.17e-02

Pos logit contributions
Un+0.008
 sunscreen+0.008
ind+0.008
kind+0.007
 dads+0.007
Neg logit contributions
 may-0.009
 seem-0.008
ves-0.007
ches-0.007
'd-0.006
TokenOnce
Token ID7454
Feature activation+2.03e-02

Pos logit contributions
 may+0.079
 seem+0.070
ves+0.069
 trigger+0.068
 leap+0.067
Neg logit contributions
 late-0.023
S-0.022
Un-0.021
Mom-0.019
P-0.018
Token read
Token ID1100
Feature activation+4.72e-03

Pos logit contributions
 yourselves+0.014
Un+0.014
 late+0.013
 sunscreen+0.013
burn+0.013
Neg logit contributions
 may-0.013
'd-0.010
 seem-0.009
ves-0.009
 must-0.009
Token the
Token ID262
Feature activation+6.40e-04

Pos logit contributions
 may+0.009
ves+0.007
 seem+0.007
ches+0.007
 must+0.007
Neg logit contributions
Un-0.006
 yourselves-0.006
 least-0.006
burn-0.006
ind-0.006
Token book
Token ID1492
Feature activation-6.85e-03

Pos logit contributions
 may+0.001
 seem+0.001
ves+0.001
ches+0.001
 leap+0.001
Neg logit contributions
Un-0.001
 yourselves-0.001
S-0.001
 dads-0.001
 late-0.001
Token and
Token ID290
Feature activation-2.92e-03

Pos logit contributions
Un+0.010
 yourselves+0.010
 sunscreen+0.009
ind+0.009
 dads+0.009
Neg logit contributions
 may-0.013
 seem-0.011
ves-0.010
ches-0.010
 must-0.010
Token learned
Token ID4499
Feature activation-1.50e-03

Pos logit contributions
ind+0.005
Un+0.005
S+0.005
 yourselves+0.005
kind+0.005
Neg logit contributions
 may-0.005
 seem-0.004
'd-0.003
ves-0.003
 must-0.003
Token even
Token ID772
Feature activation-1.09e-01

Pos logit contributions
Un+0.002
elt+0.002
Thank+0.002
thank+0.002
burn+0.002
Neg logit contributions
 may-0.003
 seem-0.002
ves-0.002
 must-0.002
 story-0.002
Token more
Token ID517
Feature activation-5.35e-02

Pos logit contributions
Un+0.131
 sunscreen+0.128
 yourselves+0.128
burn+0.127
 dads+0.122
Neg logit contributions
 may-0.228
 seem-0.196
ches-0.169
 must-0.168
'd-0.167
Token about
Token ID546
Feature activation-1.90e-02

Pos logit contributions
Un+0.095
 yourselves+0.089
 sunscreen+0.089
 dads+0.088
 least+0.088
Neg logit contributions
 may-0.081
 seem-0.063
ves-0.058
ches-0.054
 must-0.051
Token how
Token ID703
Feature activation-1.88e-02

Pos logit contributions
Un+0.027
elt+0.026
S+0.025
burn+0.025
 dads+0.024
Neg logit contributions
 may-0.036
 seem-0.030
ves-0.027
 must-0.025
ches-0.025
Token volcan
Token ID17516
Feature activation-2.66e-02

Pos logit contributions
Un+0.021
 yourselves+0.018
 sunscreen+0.018
 dads+0.018
 least+0.018
Neg logit contributions
 may-0.041
 seem-0.035
ves-0.033
ches-0.032
'd-0.031
Tokenoes
Token ID3028
Feature activation+2.55e-02

Pos logit contributions
Un+0.044
 sunscreen+0.042
 yourselves+0.041
Thank+0.041
S+0.041
Neg logit contributions
 may-0.045
 seem-0.038
ches-0.032
ves-0.031
'd-0.030
Token back
Token ID736
Feature activation-7.30e-02

Pos logit contributions
 may+0.057
 seem+0.047
ches+0.044
 must+0.043
'd+0.041
Neg logit contributions
Un-0.049
Thank-0.046
 cher-0.045
P-0.044
 yourselves-0.044
Token another
Token ID1194
Feature activation-4.43e-02

Pos logit contributions
elt+0.135
kind+0.130
Thank+0.130
P+0.127
Sorry+0.124
Neg logit contributions
 may-0.094
 seem-0.074
 must-0.069
 might-0.068
ves-0.066
Token day
Token ID1110
Feature activation+3.82e-03

Pos logit contributions
elt+0.085
 yourselves+0.083
Un+0.078
kind+0.077
cream+0.073
Neg logit contributions
 may-0.065
 story-0.053
 seem-0.050
'd-0.047
 leap-0.047
Token and
Token ID290
Feature activation-5.91e-03

Pos logit contributions
 may+0.006
 seem+0.005
 must+0.004
ves+0.004
'd+0.004
Neg logit contributions
Un-0.007
kind-0.007
ind-0.006
burn-0.006
Thank-0.006
Token spend
Token ID4341
Feature activation-4.57e-02

Pos logit contributions
ind+0.011
Un+0.011
P+0.010
elt+0.010
kind+0.010
Neg logit contributions
 may-0.009
 seem-0.008
 must-0.007
'd-0.006
 might-0.006
Token even
Token ID772
Feature activation-1.09e-01

Pos logit contributions
elt+0.090
Un+0.088
ich+0.085
kind+0.080
burn+0.079
Neg logit contributions
 may-0.064
ves-0.057
 seem-0.050
ches-0.046
hell-0.045
Token more
Token ID517
Feature activation-5.95e-02

Pos logit contributions
Un+0.135
ind+0.135
ich+0.131
elt+0.130
 yourselves+0.127
Neg logit contributions
 may-0.209
 seem-0.191
'd-0.172
ches-0.170
 leap-0.168
Token time
Token ID640
Feature activation+3.95e-03

Pos logit contributions
Un+0.098
elt+0.093
 least+0.091
burn+0.090
Thank+0.087
Neg logit contributions
 may-0.099
ves-0.082
 seem-0.082
hell-0.074
ches-0.073
Token playing
Token ID2712
Feature activation-2.53e-02

Pos logit contributions
 may+0.007
 seem+0.006
ves+0.006
ches+0.005
'd+0.005
Neg logit contributions
Un-0.006
 least-0.005
elt-0.005
 dads-0.005
 sunscreen-0.005
Token.
Token ID13
Feature activation-1.37e-02

Pos logit contributions
Un+0.041
elt+0.038
 dads+0.037
ety+0.037
burn+0.036
Neg logit contributions
 may-0.042
ves-0.035
 seem-0.034
ches-0.031
 must-0.030
Token And
Token ID843
Feature activation-4.89e-02

Pos logit contributions
Un+0.022
ind+0.022
kind+0.021
 yourselves+0.021
thank+0.021
Neg logit contributions
 may-0.023
 seem-0.018
ves-0.016
ches-0.016
 must-0.015
Token home
Token ID1363
Feature activation-3.30e-02

Pos logit contributions
Un+0.042
kind+0.035
Thank+0.035
 yourselves+0.034
S+0.032
Neg logit contributions
 may-0.097
 seem-0.084
ches-0.081
 leap-0.077
ves-0.077
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
Un+0.067
P+0.065
Thank+0.065
kind+0.064
ind+0.062
Neg logit contributions
 may-0.039
 seem-0.034
ves-0.026
 opener-0.026
 must-0.025
Token On
Token ID1550
Feature activation-8.71e-03

Pos logit contributions
Un+0.023
ind+0.023
kind+0.022
 least+0.022
 dads+0.022
Neg logit contributions
 may-0.024
 seem-0.019
ves-0.017
ches-0.016
 must-0.016
Token the
Token ID262
Feature activation-1.54e-02

Pos logit contributions
Un+0.013
elt+0.012
kind+0.012
ind+0.011
 dads+0.011
Neg logit contributions
 may-0.016
 seem-0.013
ves-0.013
ches-0.012
 must-0.012
Token way
Token ID835
Feature activation-3.09e-02

Pos logit contributions
Un+0.030
kind+0.030
 yourselves+0.030
elt+0.030
S+0.029
Neg logit contributions
 may-0.020
 seem-0.015
ves-0.013
 leap-0.013
ches-0.013
Token back
Token ID736
Feature activation-1.08e-01

Pos logit contributions
Un+0.020
 each+0.018
 late+0.018
 home+0.018
 sunscreen+0.018
Neg logit contributions
 may-0.078
 seem-0.069
ves-0.067
ches-0.064
hell-0.064
Token,
Token ID11
Feature activation-1.41e-02

Pos logit contributions
Un+0.114
kind+0.104
Thank+0.101
 yourselves+0.098
ind+0.098
Neg logit contributions
 may-0.249
 seem-0.210
 leap-0.196
ves-0.193
 must-0.193
Token her
Token ID607
Feature activation-2.37e-02

Pos logit contributions
Un+0.022
ind+0.021
kind+0.021
elt+0.020
 dads+0.020
Neg logit contributions
 may-0.025
 seem-0.020
ves-0.018
otion-0.017
 must-0.016
Token mom
Token ID1995
Feature activation-5.38e-02

Pos logit contributions
Un+0.034
 yourselves+0.032
S+0.029
P+0.029
Thank+0.029
Neg logit contributions
 may-0.047
ves-0.035
 seem-0.035
 story-0.034
 must-0.034
Tokenmy
Token ID1820
Feature activation-3.89e-02

Pos logit contributions
Un+0.052
 yourselves+0.047
 sunscreen+0.045
 dads+0.044
 cher+0.044
Neg logit contributions
 may-0.124
 seem-0.105
ves-0.101
ches-0.099
'd-0.097
Token explained
Token ID4893
Feature activation+1.14e-02

Pos logit contributions
Un+0.050
 dads+0.044
S+0.044
 late+0.044
 sunscreen+0.043
Neg logit contributions
 may-0.078
 seem-0.065
ves-0.061
otion-0.058
ches-0.057
Token↵
Token ID198
Feature activation-1.14e-02

Pos logit contributions
 may+0.004
 seem+0.003
ves+0.003
ches+0.003
 opener+0.003
Neg logit contributions
Un-0.003
ind-0.003
burn-0.003
 least-0.003
kind-0.003
Token↵
Token ID198
Feature activation-1.05e-02

Pos logit contributions
Un+0.013
S+0.011
 late+0.011
Mom+0.011
 yourselves+0.011
Neg logit contributions
 may-0.026
 seem-0.022
 leap-0.020
ches-0.020
ves-0.020
TokenOn
Token ID2202
Feature activation-2.25e-03

Pos logit contributions
 least+0.014
burn+0.013
elt+0.013
 former+0.013
 cher+0.013
Neg logit contributions
 may-0.019
ves-0.017
 seem-0.015
 opener-0.014
ches-0.014
Token the
Token ID262
Feature activation-1.91e-02

Pos logit contributions
Un+0.003
elt+0.003
ind+0.003
kind+0.003
thank+0.003
Neg logit contributions
 may-0.004
ves-0.003
 seem-0.003
ches-0.003
 leap-0.003
Token way
Token ID835
Feature activation-3.00e-02

Pos logit contributions
Un+0.031
 sunscreen+0.028
 yourselves+0.028
 dads+0.028
 least+0.028
Neg logit contributions
 may-0.032
 seem-0.026
ves-0.024
ches-0.023
otion-0.022
Token back
Token ID736
Feature activation-1.08e-01

Pos logit contributions
 home+0.025
 each+0.023
 dads+0.021
Mom+0.020
Un+0.020
Neg logit contributions
 may-0.071
 seem-0.065
ves-0.062
hell-0.062
ides-0.061
Token,
Token ID11
Feature activation-1.28e-02

Pos logit contributions
Un+0.109
 yourselves+0.094
Thank+0.094
kind+0.092
 sunscreen+0.092
Neg logit contributions
 may-0.247
 seem-0.208
ves-0.200
ches-0.191
 must-0.190
Token Ben
Token ID3932
Feature activation+6.60e-03

Pos logit contributions
Un+0.018
ind+0.017
kind+0.017
thank+0.017
 yourselves+0.016
Neg logit contributions
 may-0.024
 seem-0.019
ves-0.018
otion-0.017
 leap-0.017
Token and
Token ID290
Feature activation-1.40e-03

Pos logit contributions
 may+0.012
 seem+0.010
ves+0.010
ches+0.009
'd+0.009
Neg logit contributions
Un-0.009
 yourselves-0.008
 sunscreen-0.008
 dads-0.008
S-0.008
Token Lily
Token ID20037
Feature activation-1.03e-02

Pos logit contributions
Un+0.002
 yourselves+0.002
ind+0.002
 least+0.002
kind+0.002
Neg logit contributions
 may-0.002
 seem-0.002
ves-0.002
ches-0.002
'd-0.002
Token saw
Token ID2497
Feature activation-5.27e-03

Pos logit contributions
Un+0.015
 yourselves+0.014
 sunscreen+0.014
 dads+0.014
S+0.014
Neg logit contributions
 may-0.018
 seem-0.015
ves-0.014
ches-0.013
'd-0.013
Token said
Token ID531
Feature activation+7.02e-02

Pos logit contributions
 yourselves+0.039
Un+0.038
ind+0.037
 least+0.036
 sunscreen+0.035
Neg logit contributions
 may-0.039
ves-0.030
 seem-0.029
ches-0.029
 must-0.027
Token,
Token ID11
Feature activation+8.90e-03

Pos logit contributions
 may+0.125
 seem+0.101
ves+0.095
ches+0.093
'd+0.090
Neg logit contributions
Un-0.103
S-0.095
 cher-0.095
 dads-0.095
 yourselves-0.094
Token "
Token ID366
Feature activation-6.56e-02

Pos logit contributions
 may+0.012
 seem+0.009
ves+0.008
ches+0.007
'd+0.007
Neg logit contributions
Un-0.017
 yourselves-0.017
ind-0.017
 least-0.017
 sunscreen-0.016
TokenRemember
Token ID16676
Feature activation-3.42e-02

Pos logit contributions
 sunscreen+0.076
 dads+0.074
 cher+0.073
 yourselves+0.072
 least+0.072
Neg logit contributions
 may-0.129
'd-0.110
 seem-0.106
ves-0.101
ches-0.097
Token,
Token ID11
Feature activation-2.17e-02

Pos logit contributions
Un+0.052
Thank+0.047
thank+0.047
S+0.046
kind+0.046
Neg logit contributions
 may-0.063
 seem-0.051
ves-0.047
ches-0.045
 must-0.043
Token even
Token ID772
Feature activation-1.08e-01

Pos logit contributions
Un+0.029
 yourselves+0.025
ind+0.025
thank+0.025
kind+0.025
Neg logit contributions
 may-0.043
 seem-0.035
ves-0.033
ches-0.032
 leap-0.031
Token though
Token ID996
Feature activation+7.78e-03

Pos logit contributions
view+0.184
har+0.180
burn+0.179
eline+0.177
Un+0.175
Neg logit contributions
 may-0.157
 seem-0.131
 must-0.105
 hop-0.102
 opener-0.100
Token something
Token ID1223
Feature activation+2.40e-02

Pos logit contributions
 may+0.014
 seem+0.011
ves+0.011
 must+0.010
 might+0.009
Neg logit contributions
Un-0.012
thank-0.012
kind-0.012
elt-0.011
burn-0.011
Token may
Token ID743
Feature activation+2.72e-02

Pos logit contributions
 may+0.032
 seem+0.023
ves+0.022
ches+0.020
'd+0.019
Neg logit contributions
Un-0.047
 yourselves-0.044
 sunscreen-0.043
 dads-0.043
S-0.043
Token be
Token ID307
Feature activation+3.05e-03

Pos logit contributions
 may+0.042
 seem+0.035
ves+0.029
ches+0.028
'd+0.027
Neg logit contributions
Un-0.049
S-0.045
 yourselves-0.045
Thank-0.045
 dads-0.045
Token bitter
Token ID12922
Feature activation+1.26e-02

Pos logit contributions
 may+0.006
 seem+0.005
ches+0.005
 must+0.004
'd+0.004
Neg logit contributions
Un-0.005
Mom-0.004
view-0.004
Thank-0.004
Sorry-0.004
Token↵
Token ID198
Feature activation-1.80e-02

Pos logit contributions
 may+0.023
 seem+0.019
ves+0.018
ches+0.017
'd+0.017
Neg logit contributions
Un-0.015
 yourselves-0.013
 sunscreen-0.013
 dads-0.013
 least-0.013
Token↵
Token ID198
Feature activation-1.82e-02

Pos logit contributions
Un+0.022
S+0.020
 yourselves+0.019
 late+0.019
Thank+0.019
Neg logit contributions
 may-0.038
 seem-0.031
ves-0.029
ches-0.029
 leap-0.028
TokenOn
Token ID2202
Feature activation-9.70e-03

Pos logit contributions
Un+0.018
 dads+0.017
 yourselves+0.017
 sunscreen+0.017
 least+0.016
Neg logit contributions
 may-0.040
ves-0.034
 seem-0.034
ches-0.032
'd-0.031
Token the
Token ID262
Feature activation-1.87e-02

Pos logit contributions
elt+0.016
Un+0.015
ind+0.015
kind+0.015
 yourselves+0.015
Neg logit contributions
 may-0.016
ves-0.013
 seem-0.012
 story-0.012
ches-0.011
Token way
Token ID835
Feature activation-3.01e-02

Pos logit contributions
Un+0.034
elt+0.034
 yourselves+0.033
ind+0.032
kind+0.032
Neg logit contributions
 may-0.028
 seem-0.022
ves-0.020
ches-0.018
'd-0.018
Token back
Token ID736
Feature activation-1.08e-01

Pos logit contributions
Un+0.012
 home+0.012
 late+0.011
 each+0.011
 least+0.010
Neg logit contributions
 may-0.082
 seem-0.074
ves-0.072
ches-0.070
hell-0.069
Token home
Token ID1363
Feature activation-5.90e-02

Pos logit contributions
Un+0.095
ind+0.083
kind+0.082
 yourselves+0.081
Thank+0.080
Neg logit contributions
 may-0.261
 seem-0.223
ves-0.210
 must-0.209
 opener-0.204
Token,
Token ID11
Feature activation-5.50e-03

Pos logit contributions
Un+0.109
 sunscreen+0.102
ind+0.100
 yourselves+0.099
burn+0.098
Neg logit contributions
 may-0.087
 seem-0.068
ves-0.063
otion-0.055
ches-0.055
Token Lily
Token ID20037
Feature activation-1.64e-02

Pos logit contributions
Un+0.009
ind+0.009
kind+0.009
elt+0.009
view+0.008
Neg logit contributions
 may-0.009
 seem-0.007
ves-0.006
 must-0.006
otion-0.006
Token thought
Token ID1807
Feature activation+2.61e-02

Pos logit contributions
Un+0.026
 yourselves+0.024
 sunscreen+0.024
 dads+0.024
S+0.024
Neg logit contributions
 may-0.028
 seem-0.022
ves-0.021
ches-0.020
'd-0.019
Token about
Token ID546
Feature activation-1.38e-02

Pos logit contributions
 may+0.045
 seem+0.037
 trigger+0.036
ves+0.035
ides+0.035
Neg logit contributions
Un-0.035
 each-0.035
 late-0.034
 cher-0.033
 responsible-0.032
Token↵
Token ID198
Feature activation-1.43e-02

Pos logit contributions
Un+0.010
ind+0.010
 yourselves+0.010
kind+0.010
burn+0.010
Neg logit contributions
 may-0.009
 seem-0.007
ves-0.006
ches-0.006
 must-0.006
Token↵
Token ID198
Feature activation-1.42e-02

Pos logit contributions
Un+0.017
S+0.015
 yourselves+0.015
 late+0.015
 dads+0.015
Neg logit contributions
 may-0.030
 seem-0.025
ves-0.024
ches-0.023
 leap-0.022
TokenOn
Token ID2202
Feature activation-5.28e-03

Pos logit contributions
 least+0.015
 yourselves+0.015
burn+0.015
 sunscreen+0.015
 dads+0.015
Neg logit contributions
 may-0.029
ves-0.025
 seem-0.025
'd-0.023
ches-0.023
Token the
Token ID262
Feature activation-1.58e-02

Pos logit contributions
Un+0.007
elt+0.007
 yourselves+0.007
ind+0.007
 dads+0.006
Neg logit contributions
 may-0.010
ves-0.008
 seem-0.008
ches-0.007
 must-0.007
Token way
Token ID835
Feature activation-2.88e-02

Pos logit contributions
Un+0.026
 yourselves+0.025
elt+0.024
 late+0.024
S+0.023
Neg logit contributions
 may-0.027
 seem-0.022
ves-0.020
 leap-0.019
'd-0.019
Token back
Token ID736
Feature activation-1.08e-01

Pos logit contributions
Un+0.026
 each+0.023
 dads+0.023
 cher+0.023
 sunscreen+0.022
Neg logit contributions
 may-0.066
 seem-0.057
ves-0.055
ches-0.053
hell-0.053
Token home
Token ID1363
Feature activation-5.78e-02

Pos logit contributions
Un+0.143
Thank+0.132
ind+0.128
kind+0.127
 yourselves+0.126
Neg logit contributions
 may-0.218
 seem-0.178
 must-0.166
ves-0.165
 leap-0.165
Token,
Token ID11
Feature activation-5.91e-03

Pos logit contributions
Un+0.100
 sunscreen+0.092
ind+0.091
 yourselves+0.091
S+0.090
Neg logit contributions
 may-0.092
 seem-0.073
ves-0.068
ches-0.062
'd-0.061
Token Betsy
Token ID45775
Feature activation-8.52e-04

Pos logit contributions
Un+0.009
ind+0.008
kind+0.008
 yourselves+0.008
 dads+0.008
Neg logit contributions
 may-0.011
 seem-0.009
ves-0.008
 leap-0.007
 must-0.007
Token could
Token ID714
Feature activation-9.49e-03

Pos logit contributions
Un+0.001
 yourselves+0.001
 dads+0.001
 sunscreen+0.001
S+0.001
Neg logit contributions
 may-0.001
 seem-0.001
ves-0.001
ches-0.001
'd-0.001
Token not
Token ID407
Feature activation-3.58e-02

Pos logit contributions
Un+0.015
 yourselves+0.014
S+0.014
ind+0.014
 dads+0.014
Neg logit contributions
 may-0.016
 seem-0.013
ves-0.012
ches-0.012
'd-0.011
Token to
Token ID284
Feature activation+4.47e-03

Pos logit contributions
Un+0.026
ind+0.023
 yourselves+0.023
 dads+0.022
S+0.022
Neg logit contributions
 may-0.044
 seem-0.037
ves-0.034
ches-0.033
'd-0.033
Token stay
Token ID2652
Feature activation-5.30e-03

Pos logit contributions
 may+0.007
 seem+0.006
ves+0.005
 leap+0.005
ches+0.005
Neg logit contributions
Un-0.008
ind-0.008
elt-0.008
S-0.007
kind-0.007
Token out
Token ID503
Feature activation-1.98e-02

Pos logit contributions
Un+0.008
elt+0.007
 cher+0.007
 dads+0.007
Some+0.007
Neg logit contributions
 may-0.009
 seem-0.007
ches-0.007
 must-0.007
 leap-0.007
Token of
Token ID286
Feature activation-4.00e-03

Pos logit contributions
Un+0.036
Thank+0.032
 yourselves+0.032
ind+0.032
 dads+0.032
Neg logit contributions
 may-0.030
 seem-0.024
ves-0.021
ches-0.021
 leap-0.021
Token trouble
Token ID5876
Feature activation-1.22e-03

Pos logit contributions
Un+0.007
elt+0.007
ind+0.006
ety+0.006
burn+0.006
Neg logit contributions
 may-0.007
 seem-0.005
 leap-0.005
 must-0.005
 battle-0.005
Token even
Token ID772
Feature activation-1.07e-01

Pos logit contributions
Un+0.002
S+0.002
 dads+0.002
Thank+0.002
 sunscreen+0.002
Neg logit contributions
 may-0.002
 seem-0.002
ves-0.001
ches-0.001
 must-0.001
Token when
Token ID618
Feature activation+1.34e-02

Pos logit contributions
burn+0.154
ind+0.154
elt+0.151
ich+0.149
Un+0.148
Neg logit contributions
 may-0.188
 seem-0.168
ches-0.147
 leap-0.143
'd-0.143
Token they
Token ID484
Feature activation+6.05e-03

Pos logit contributions
 may+0.021
 seem+0.017
ves+0.015
 must+0.014
'd+0.014
Neg logit contributions
Un-0.024
ind-0.022
elt-0.022
 dads-0.022
S-0.021
Token were
Token ID547
Feature activation-1.12e-02

Pos logit contributions
 may+0.009
 seem+0.007
ves+0.007
ches+0.007
'd+0.006
Neg logit contributions
Un-0.010
 yourselves-0.010
 dads-0.009
 sunscreen-0.009
S-0.009
Token being
Token ID852
Feature activation-2.58e-02

Pos logit contributions
Un+0.016
S+0.014
Thank+0.014
 yourselves+0.014
 dads+0.013
Neg logit contributions
 may-0.021
 seem-0.018
ches-0.017
ves-0.017
 must-0.016
Token nos
Token ID43630
Feature activation-5.01e-03

Pos logit contributions
Un+0.039
view+0.035
S+0.034
burn+0.034
 former+0.034
Neg logit contributions
 may-0.044
 must-0.037
 seem-0.037
ches-0.036
'd-0.034
Token homework
Token ID26131
Feature activation-3.30e-02

Pos logit contributions
Un+0.024
Thank+0.022
S+0.021
 yourselves+0.021
ind+0.021
Neg logit contributions
 may-0.026
 seem-0.020
 story-0.020
ves-0.020
 might-0.018
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
Un+0.058
ind+0.057
 yourselves+0.056
rous+0.055
elt+0.054
Neg logit contributions
 may-0.050
 seem-0.039
ves-0.039
'd-0.035
ches-0.035
Token On
Token ID1550
Feature activation-2.74e-03

Pos logit contributions
ind+0.022
Un+0.021
kind+0.021
 yourselves+0.021
 least+0.021
Neg logit contributions
 may-0.021
 seem-0.016
 opener-0.014
ves-0.014
ches-0.014
Token the
Token ID262
Feature activation-1.26e-02

Pos logit contributions
elt+0.004
Un+0.004
ind+0.004
thank+0.004
kind+0.004
Neg logit contributions
 may-0.005
ves-0.004
 seem-0.004
ches-0.004
 must-0.004
Token way
Token ID835
Feature activation-2.90e-02

Pos logit contributions
elt+0.025
Un+0.025
S+0.024
 yourselves+0.024
Thank+0.024
Neg logit contributions
 may-0.016
 seem-0.013
 story-0.011
ves-0.011
ches-0.010
Token back
Token ID736
Feature activation-1.07e-01

Pos logit contributions
Un+0.027
 each+0.025
 late+0.024
 least+0.024
 dads+0.024
Neg logit contributions
 may-0.065
 seem-0.057
ves-0.055
ches-0.053
hell-0.052
Token to
Token ID284
Feature activation+8.47e-04

Pos logit contributions
Un+0.165
Thank+0.155
kind+0.153
ind+0.152
thank+0.150
Neg logit contributions
 may-0.193
 seem-0.158
ves-0.141
 must-0.141
 opener-0.138
Token school
Token ID1524
Feature activation-7.10e-02

Pos logit contributions
 may+0.001
 seem+0.001
 story+0.001
 opener+0.001
otion+0.001
Neg logit contributions
elt-0.001
kind-0.001
P-0.001
ind-0.001
Un-0.001
Token,
Token ID11
Feature activation-4.49e-03

Pos logit contributions
Un+0.122
 yourselves+0.114
 sunscreen+0.113
S+0.112
ind+0.112
Neg logit contributions
 may-0.112
 seem-0.088
ves-0.082
ches-0.076
'd-0.074
Token they
Token ID484
Feature activation+1.85e-02

Pos logit contributions
Un+0.008
ind+0.008
kind+0.008
elt+0.007
 yourselves+0.007
Neg logit contributions
 may-0.007
 seem-0.006
ves-0.005
 must-0.005
 leap-0.004
Token saw
Token ID2497
Feature activation-1.63e-03

Pos logit contributions
 may+0.031
 seem+0.025
ves+0.023
ches+0.022
'd+0.022
Neg logit contributions
Un-0.030
 yourselves-0.028
S-0.027
 sunscreen-0.027
 cher-0.027
Token day
Token ID1110
Feature activation-1.91e-02

Pos logit contributions
Un+0.031
 yourselves+0.028
 sunscreen+0.028
 dads+0.028
S+0.027
Neg logit contributions
 may-0.063
 seem-0.053
ves-0.051
ches-0.049
'd-0.048
Token on
Token ID319
Feature activation-1.06e-02

Pos logit contributions
Un+0.022
 yourselves+0.019
 late+0.019
S+0.019
Thank+0.018
Neg logit contributions
 may-0.041
 seem-0.035
ves-0.033
ches-0.032
'd-0.031
Token,
Token ID11
Feature activation-2.95e-02

Pos logit contributions
Un+0.016
 yourselves+0.015
 sunscreen+0.015
 dads+0.015
 cher+0.015
Neg logit contributions
 may-0.018
 seem-0.015
ves-0.014
ches-0.013
'd-0.013
Token they
Token ID484
Feature activation+9.53e-03

Pos logit contributions
Un+0.037
 yourselves+0.033
 sunscreen+0.033
 dads+0.032
ind+0.032
Neg logit contributions
 may-0.060
 seem-0.050
ves-0.047
ches-0.045
'd-0.044
Token worked
Token ID3111
Feature activation-3.65e-02

Pos logit contributions
 may+0.017
 seem+0.013
ves+0.012
ches+0.012
'd+0.012
Neg logit contributions
Un-0.015
 yourselves-0.014
 sunscreen-0.013
 late-0.013
S-0.013
Token even
Token ID772
Feature activation-1.07e-01

Pos logit contributions
Un+0.032
 yourselves+0.030
 sunscreen+0.028
elt+0.027
view+0.027
Neg logit contributions
 may-0.089
ves-0.075
 seem-0.075
ches-0.073
 must-0.070
Token harder
Token ID7069
Feature activation-6.00e-02

Pos logit contributions
 yourselves+0.141
ind+0.134
Un+0.133
 sunscreen+0.129
burn+0.128
Neg logit contributions
 may-0.214
 seem-0.184
'd-0.166
ches-0.163
 leap-0.157
Token for
Token ID329
Feature activation+9.21e-03

Pos logit contributions
Un+0.076
burn+0.076
 sunscreen+0.073
 yourselves+0.072
Thank+0.072
Neg logit contributions
 may-0.120
 seem-0.102
ves-0.095
ches-0.088
 must-0.088
Token their
Token ID511
Feature activation-2.20e-02

Pos logit contributions
 may+0.016
 seem+0.013
ves+0.012
 battle+0.012
 must+0.011
Neg logit contributions
elt-0.015
burn-0.014
ety-0.014
S-0.014
view-0.014
Token kind
Token ID1611
Feature activation-2.11e-02

Pos logit contributions
Un+0.033
burn+0.031
 yourselves+0.031
 late+0.031
elt+0.031
Neg logit contributions
 may-0.041
 seem-0.033
 leap-0.029
'd-0.029
ves-0.029
Token king
Token ID5822
Feature activation-5.25e-02

Pos logit contributions
Un+0.030
 yourselves+0.027
 sunscreen+0.026
 late+0.026
 least+0.026
Neg logit contributions
 may-0.040
 seem-0.033
ves-0.031
ches-0.029
'd-0.029
Token was
Token ID373
Feature activation+1.94e-03

Pos logit contributions
Un+0.010
 yourselves+0.010
 sunscreen+0.010
 late+0.009
 dads+0.009
Neg logit contributions
 may-0.012
 seem-0.009
ves-0.009
'd-0.009
ches-0.009
Token very
Token ID845
Feature activation-8.24e-03

Pos logit contributions
 may+0.003
'd+0.003
 seem+0.003
ves+0.003
ches+0.003
Neg logit contributions
 yourselves-0.003
elt-0.003
Un-0.003
acher-0.003
ind-0.003
Token careful
Token ID8161
Feature activation+3.27e-02

Pos logit contributions
Un+0.011
burn+0.010
 yourselves+0.010
view+0.010
elt+0.010
Neg logit contributions
 may-0.016
 seem-0.013
ves-0.012
'd-0.012
 leap-0.012
Token not
Token ID407
Feature activation-5.05e-03

Pos logit contributions
 may+0.073
 seem+0.062
ves+0.059
ches+0.057
'd+0.056
Neg logit contributions
Un-0.034
 yourselves-0.030
 sunscreen-0.030
 dads-0.030
S-0.029
Token to
Token ID284
Feature activation+2.68e-02

Pos logit contributions
Un+0.006
 yourselves+0.005
 sunscreen+0.005
ind+0.005
S+0.005
Neg logit contributions
 may-0.011
 seem-0.010
ves-0.009
'd-0.009
ches-0.009
Token let
Token ID1309
Feature activation+7.62e-02

Pos logit contributions
 may+0.042
 seem+0.038
 leap+0.030
 must+0.030
'd+0.028
Neg logit contributions
Un-0.045
elt-0.044
 sunscreen-0.042
kind-0.042
Thank-0.042
Token the
Token ID262
Feature activation+2.22e-03

Pos logit contributions
 may+0.135
ves+0.113
otion+0.106
 seem+0.106
hell+0.105
Neg logit contributions
Un-0.104
 each-0.102
 yourselves-0.097
 least-0.095
 sunscreen-0.094
Token wind
Token ID2344
Feature activation+1.08e-06

Pos logit contributions
 may+0.004
 seem+0.003
ves+0.003
ches+0.003
'd+0.003
Neg logit contributions
Un-0.003
 yourselves-0.003
 dads-0.003
 sunscreen-0.003
S-0.003
Token take
Token ID1011
Feature activation+2.66e-02

Feature not active on prev token; no predictions were affected.
Token it
Token ID340
Feature activation+1.41e-02

Pos logit contributions
 may+0.046
 seem+0.037
ves+0.034
ches+0.033
 leap+0.032
Neg logit contributions
Un-0.042
 dads-0.038
 yourselves-0.038
S-0.038
Thank-0.038
Token away
Token ID1497
Feature activation+5.88e-03

Pos logit contributions
 may+0.029
 seem+0.025
ches+0.023
hell+0.022
ves+0.022
Neg logit contributions
Un-0.018
 dads-0.016
 yourselves-0.016
Thank-0.016
elt-0.015
Token at
Token ID379
Feature activation+1.64e-02

Pos logit contributions
Un+0.007
 yourselves+0.007
 sunscreen+0.007
S+0.007
Thank+0.007
Neg logit contributions
 may-0.010
 seem-0.008
ves-0.008
ches-0.008
'd-0.007
Token the
Token ID262
Feature activation+5.84e-03

Pos logit contributions
 may+0.024
 seem+0.020
 story+0.019
 must+0.018
 opener+0.018
Neg logit contributions
elt-0.027
Sorry-0.026
kind-0.025
S-0.025
P-0.025
Token desk
Token ID6915
Feature activation-3.77e-02

Pos logit contributions
 may+0.010
 seem+0.008
'd+0.008
ves+0.008
 story+0.008
Neg logit contributions
elt-0.009
S-0.008
Un-0.008
ety-0.008
 dads-0.008
Token.
Token ID13
Feature activation-4.68e-03

Pos logit contributions
Un+0.058
 sunscreen+0.053
 dads+0.053
 yourselves+0.052
Thank+0.052
Neg logit contributions
 may-0.066
 seem-0.054
ves-0.052
ches-0.048
'd-0.048
Token He
Token ID679
Feature activation-8.74e-03

Pos logit contributions
ind+0.008
kind+0.008
P+0.008
 least+0.008
 dads+0.008
Neg logit contributions
 may-0.007
 seem-0.006
ves-0.005
 must-0.005
 opener-0.004
Token said
Token ID531
Feature activation+6.51e-02

Pos logit contributions
 yourselves+0.016
Un+0.015
 sunscreen+0.015
burn+0.014
S+0.014
Neg logit contributions
 may-0.014
'd-0.010
ves-0.010
 seem-0.010
 must-0.010
Token,
Token ID11
Feature activation+8.43e-03

Pos logit contributions
 may+0.105
 seem+0.083
ves+0.078
ches+0.071
'd+0.070
Neg logit contributions
Un-0.109
 yourselves-0.102
 least-0.101
ind-0.101
 sunscreen-0.100
Token "
Token ID366
Feature activation-6.55e-02

Pos logit contributions
 may+0.011
 seem+0.008
ves+0.008
'd+0.007
ches+0.007
Neg logit contributions
Un-0.016
 yourselves-0.016
ind-0.015
 least-0.015
 sunscreen-0.015
TokenWow
Token ID22017
Feature activation+2.81e-02

Pos logit contributions
 dads+0.085
 sunscreen+0.082
 yourselves+0.077
 property+0.077
 cher+0.076
Neg logit contributions
 may-0.114
'd-0.103
 seem-0.099
hell-0.087
 Would-0.086
Token,
Token ID11
Feature activation+1.83e-03

Pos logit contributions
 may+0.053
 seem+0.045
ves+0.044
 opener+0.042
ches+0.042
Neg logit contributions
 yourselves-0.033
 each-0.033
Un-0.033
 cher-0.032
S-0.031
Token this
Token ID428
Feature activation+9.82e-03

Pos logit contributions
 may+0.004
 seem+0.003
ves+0.003
 must+0.003
otion+0.003
Neg logit contributions
Un-0.002
thank-0.002
ind-0.002
elt-0.002
kind-0.002
Token!
Token ID0
Feature activation+2.49e-03

Pos logit contributions
Un+0.004
 yourselves+0.004
 dads+0.004
 sunscreen+0.004
S+0.004
Neg logit contributions
 may-0.005
 seem-0.004
ves-0.004
ches-0.004
'd-0.004
Token Happy
Token ID14628
Feature activation+3.11e-02

Pos logit contributions
 may+0.005
 seem+0.004
ves+0.004
ches+0.004
hell+0.004
Neg logit contributions
Un-0.003
 yourselves-0.003
S-0.003
elt-0.003
Thank-0.003
Token birthday
Token ID10955
Feature activation-2.25e-02

Pos logit contributions
 may+0.067
 seem+0.055
ves+0.054
 story+0.051
'd+0.051
Neg logit contributions
Un-0.036
 sunscreen-0.032
 late-0.031
 yourselves-0.031
kind-0.030
Token!"
Token ID2474
Feature activation+3.49e-02

Pos logit contributions
Un+0.033
 yourselves+0.031
 dads+0.030
S+0.030
 sunscreen+0.030
Neg logit contributions
 may-0.040
 seem-0.032
ves-0.031
ches-0.029
'd-0.028
Token they
Token ID484
Feature activation+3.20e-02

Pos logit contributions
 may+0.066
 seem+0.055
ves+0.052
ches+0.049
'd+0.048
Neg logit contributions
Un-0.048
 yourselves-0.044
 sunscreen-0.043
 dads-0.043
ind-0.043
Token shouted
Token ID17293
Feature activation+8.29e-02

Pos logit contributions
 may+0.061
 seem+0.050
ves+0.047
ches+0.046
'd+0.044
Neg logit contributions
Un-0.044
 yourselves-0.040
 sunscreen-0.039
S-0.039
 dads-0.039
Token.
Token ID13
Feature activation+2.31e-02

Pos logit contributions
 may+0.167
 seem+0.135
ves+0.133
ches+0.126
'd+0.125
Neg logit contributions
Un-0.103
 late-0.094
 dads-0.093
 each-0.093
 cher-0.092
Token\n
Token ID198
Feature activation+1.09e-02

Pos logit contributions
 may+0.039
 seem+0.031
ves+0.029
ches+0.027
'd+0.026
Neg logit contributions
Un-0.037
 yourselves-0.035
 least-0.034
ind-0.034
 sunscreen-0.034
Token\n
Token ID198
Feature activation+6.76e-03

Pos logit contributions
 may+0.019
ves+0.016
 seem+0.016
ches+0.014
hell+0.013
Neg logit contributions
Un-0.016
 least-0.015
burn-0.015
 sunscreen-0.015
 cher-0.015
TokenL
Token ID43
Feature activation-1.50e-02

Pos logit contributions
 may+0.011
ves+0.010
 seem+0.009
ches+0.008
 opener+0.008
Neg logit contributions
burn-0.011
 least-0.010
elt-0.010
 cher-0.010
 former-0.010
Tokenily
Token ID813
Feature activation-4.99e-03

Pos logit contributions
 yourselves+0.022
Un+0.021
S+0.021
 late+0.020
 least+0.020
Neg logit contributions
 may-0.028
 seem-0.022
ves-0.021
 leap-0.020
'd-0.020
Token"
Token ID1
Feature activation-6.08e-02

Pos logit contributions
Un+0.017
S+0.015
 late+0.015
Thank+0.014
 yourselves+0.014
Neg logit contributions
 may-0.032
 seem-0.027
ves-0.026
ches-0.025
otion-0.025
TokenMom
Token ID29252
Feature activation-4.21e-02

Pos logit contributions
 dads+0.057
 yourselves+0.057
 cher+0.055
 sunscreen+0.054
ind+0.054
Neg logit contributions
 may-0.131
'd-0.114
 seem-0.110
ves-0.103
 must-0.099
Tokenmy
Token ID1820
Feature activation-3.24e-02

Pos logit contributions
Un+0.046
 yourselves+0.041
 dads+0.040
 sunscreen+0.040
S+0.040
Neg logit contributions
 may-0.091
 seem-0.077
ves-0.074
ches-0.071
'd-0.069
Token,"
Token ID553
Feature activation+1.77e-02

Pos logit contributions
Un+0.041
 cher+0.040
 least+0.040
 dads+0.040
 yourselves+0.039
Neg logit contributions
 may-0.059
ves-0.050
 seem-0.050
ches-0.046
izards-0.045
Token he
Token ID339
Feature activation-3.53e-03

Pos logit contributions
 may+0.029
 seem+0.023
ves+0.022
ches+0.021
'd+0.020
Neg logit contributions
Un-0.029
 yourselves-0.026
 sunscreen-0.026
 dads-0.026
S-0.026
Token said
Token ID531
Feature activation+8.03e-02

Pos logit contributions
Un+0.005
 yourselves+0.005
 sunscreen+0.005
ind+0.005
 least+0.005
Neg logit contributions
 may-0.006
 seem-0.005
ves-0.005
ches-0.005
'd-0.004
Token,
Token ID11
Feature activation+2.37e-02

Pos logit contributions
 may+0.138
 seem+0.115
ves+0.106
ches+0.099
 leap+0.097
Neg logit contributions
Un-0.124
 yourselves-0.118
ind-0.114
 sunscreen-0.114
 least-0.113
Token "
Token ID366
Feature activation-4.56e-02

Pos logit contributions
 may+0.029
 seem+0.023
ves+0.019
'd+0.018
otion+0.018
Neg logit contributions
Un-0.045
ind-0.045
 yourselves-0.044
 least-0.044
 dads-0.044
Tokenit
Token ID270
Feature activation+2.87e-02

Pos logit contributions
 dads+0.056
 cher+0.055
 sunscreen+0.054
 least+0.050
rous+0.048
Neg logit contributions
 may-0.085
 seem-0.078
'd-0.076
 opener-0.070
ves-0.068
Token's
Token ID338
Feature activation+8.11e-03

Pos logit contributions
 may+0.046
 seem+0.037
ves+0.036
otion+0.032
ches+0.032
Neg logit contributions
Un-0.046
 yourselves-0.043
 least-0.043
 late-0.042
Thank-0.042
Token so
Token ID523
Feature activation+1.14e-02

Pos logit contributions
 may+0.014
 seem+0.012
 must+0.011
'd+0.010
ches+0.010
Neg logit contributions
elt-0.012
Un-0.012
 dads-0.012
S-0.012
P-0.012
Token got
Token ID1392
Feature activation-3.49e-02

Pos logit contributions
 may+0.015
 seem+0.012
ves+0.011
ches+0.010
otion+0.010
Neg logit contributions
Un-0.013
 dads-0.012
 least-0.012
 yourselves-0.012
 sunscreen-0.012
Token scared
Token ID12008
Feature activation+1.73e-02

Pos logit contributions
Un+0.039
 yourselves+0.035
 dads+0.034
 sunscreen+0.034
S+0.034
Neg logit contributions
 may-0.075
 seem-0.063
ves-0.060
ches-0.058
'd-0.057
Token.
Token ID13
Feature activation-9.88e-03

Pos logit contributions
 may+0.032
 seem+0.026
ves+0.024
ches+0.024
'd+0.023
Neg logit contributions
Un-0.025
 yourselves-0.023
 dads-0.022
 sunscreen-0.022
S-0.022
Token The
Token ID383
Feature activation-6.54e-03

Pos logit contributions
Un+0.015
 yourselves+0.014
 dads+0.014
ind+0.014
 sunscreen+0.014
Neg logit contributions
 may-0.017
 seem-0.014
ves-0.013
ches-0.012
'd-0.012
Token frog
Token ID21264
Feature activation+3.34e-03

Pos logit contributions
Un+0.009
 yourselves+0.008
 dads+0.008
 sunscreen+0.008
S+0.008
Neg logit contributions
 may-0.012
 seem-0.010
ves-0.010
ches-0.009
'd-0.009
Token said
Token ID531
Feature activation+6.55e-02

Pos logit contributions
 may+0.006
 seem+0.005
ves+0.004
ches+0.004
'd+0.004
Neg logit contributions
Un-0.005
 yourselves-0.005
 sunscreen-0.005
 dads-0.005
 cher-0.005
Token,
Token ID11
Feature activation+9.91e-03

Pos logit contributions
 may+0.122
 seem+0.100
ves+0.095
ches+0.093
'd+0.088
Neg logit contributions
Un-0.091
 yourselves-0.083
 cher-0.082
S-0.082
 dads-0.082
Token "
Token ID366
Feature activation-6.46e-02

Pos logit contributions
 may+0.014
 seem+0.011
ves+0.010
ches+0.009
'd+0.009
Neg logit contributions
Un-0.018
 yourselves-0.017
 sunscreen-0.017
 dads-0.017
ind-0.017
TokenDon
Token ID3987
Feature activation+1.43e-02

Pos logit contributions
 dads+0.075
 cher+0.072
 sunscreen+0.072
 yourselves+0.068
Un+0.068
Neg logit contributions
 may-0.128
 seem-0.111
'd-0.103
ches-0.100
ves-0.100
Token't
Token ID470
Feature activation-4.04e-02

Pos logit contributions
 may+0.020
izards+0.017
hell+0.017
eways+0.016
oons+0.016
Neg logit contributions
Un-0.024
Mom-0.023
 late-0.022
 least-0.022
S-0.022
Token worry
Token ID5490
Feature activation+1.25e-02

Pos logit contributions
Un+0.060
 yourselves+0.054
 sunscreen+0.053
ind+0.053
 cher+0.053
Neg logit contributions
 may-0.075
 seem-0.064
 leap-0.055
'd-0.053
ves-0.053
Token but
Token ID475
Feature activation-1.68e-04

Pos logit contributions
 may+0.019
 seem+0.015
'd+0.013
 must+0.012
 leap+0.012
Neg logit contributions
ind-0.022
kind-0.022
Un-0.022
 yourselves-0.021
rous-0.021
Token her
Token ID607
Feature activation-1.07e-02

Pos logit contributions
Un+0.000
 yourselves+0.000
ind+0.000
thank+0.000
 dads+0.000
Neg logit contributions
 may-0.000
 seem-0.000
ves-0.000
 must-0.000
'd-0.000
Token mom
Token ID1995
Feature activation-4.16e-02

Pos logit contributions
Un+0.013
 yourselves+0.013
S+0.012
thank+0.012
Thank+0.012
Neg logit contributions
 may-0.022
 seem-0.017
ves-0.017
 story-0.017
 must-0.016
Token reassured
Token ID48171
Feature activation+1.87e-02

Pos logit contributions
Un+0.058
 yourselves+0.054
 sunscreen+0.053
 dads+0.052
S+0.052
Neg logit contributions
 may-0.078
 seem-0.064
ves-0.060
ches-0.058
'd-0.057
Token her
Token ID607
Feature activation-2.45e-02

Pos logit contributions
 may+0.042
 seem+0.035
ves+0.034
ches+0.033
'd+0.032
Neg logit contributions
Un-0.019
 yourselves-0.017
 dads-0.017
 sunscreen-0.016
S-0.016
Token that
Token ID326
Feature activation+5.10e-03

Pos logit contributions
Un+0.027
 each+0.025
 late+0.025
 cher+0.025
 responsible+0.025
Neg logit contributions
 may-0.050
 seem-0.041
ves-0.040
ches-0.039
 leap-0.039
Token it
Token ID340
Feature activation+1.85e-02

Pos logit contributions
 may+0.009
 seem+0.007
ves+0.007
'd+0.006
ches+0.006
Neg logit contributions
Un-0.008
 yourselves-0.008
 dads-0.007
ind-0.007
 sunscreen-0.007
Token was
Token ID373
Feature activation+1.85e-02

Pos logit contributions
 may+0.029
 seem+0.024
ves+0.023
 leap+0.022
otion+0.022
Neg logit contributions
Un-0.029
 least-0.027
 each-0.027
 late-0.026
kind-0.026
Token going
Token ID1016
Feature activation-6.73e-03

Pos logit contributions
 may+0.032
 seem+0.026
ches+0.024
ves+0.024
'd+0.023
Neg logit contributions
Un-0.029
 dads-0.027
elt-0.027
 yourselves-0.027
S-0.026
Token to
Token ID284
Feature activation+3.00e-02

Pos logit contributions
Un+0.008
 yourselves+0.008
 dads+0.008
 cher+0.007
 sunscreen+0.007
Neg logit contributions
 may-0.014
 seem-0.012
ches-0.011
 must-0.011
ves-0.011
Token be
Token ID307
Feature activation-1.73e-03

Pos logit contributions
 may+0.048
 seem+0.041
ves+0.033
 must+0.032
 leap+0.032
Neg logit contributions
Un-0.052
elt-0.050
kind-0.049
ind-0.049
 dads-0.048
Token\n
Token ID198
Feature activation+9.63e-03

Pos logit contributions
 may+0.029
 seem+0.024
ves+0.023
ches+0.022
otion+0.021
Neg logit contributions
Un-0.021
 sunscreen-0.019
 dads-0.019
 least-0.019
 yourselves-0.019
TokenThey
Token ID2990
Feature activation+1.46e-02

Pos logit contributions
 may+0.016
ves+0.014
 opener+0.013
 seem+0.013
hell+0.012
Neg logit contributions
burn-0.014
 least-0.014
 sunscreen-0.013
 cher-0.013
 former-0.013
Token felt
Token ID2936
Feature activation+7.63e-03

Pos logit contributions
 may+0.027
 seem+0.022
ves+0.021
ches+0.020
'd+0.019
Neg logit contributions
Un-0.021
 yourselves-0.019
 sunscreen-0.019
 dads-0.019
S-0.019
Token miserable
Token ID22444
Feature activation-2.99e-02

Pos logit contributions
 may+0.014
 seem+0.012
ves+0.011
ches+0.011
 leap+0.010
Neg logit contributions
Un-0.011
 yourselves-0.010
S-0.010
 dads-0.010
ind-0.010
Token.
Token ID13
Feature activation+3.12e-03

Pos logit contributions
Un+0.048
 dads+0.045
 sunscreen+0.044
S+0.043
ind+0.043
Neg logit contributions
 may-0.052
 seem-0.042
ves-0.037
otion-0.036
ches-0.036
Token They
Token ID1119
Feature activation+1.95e-02

Pos logit contributions
 may+0.005
 seem+0.004
 must+0.003
 opener+0.003
ves+0.003
Neg logit contributions
burn-0.006
ind-0.006
Un-0.006
 cream-0.005
 sunscreen-0.005
Token had
Token ID550
Feature activation-3.27e-02

Pos logit contributions
 may+0.036
 seem+0.030
ves+0.030
ches+0.028
otion+0.027
Neg logit contributions
Un-0.026
 least-0.024
 yourselves-0.024
 dads-0.024
 sunscreen-0.024
Token wasted
Token ID18359
Feature activation-1.21e-02

Pos logit contributions
Un+0.048
ind+0.044
 yourselves+0.043
S+0.043
elt+0.042
Neg logit contributions
 may-0.061
 seem-0.051
'd-0.045
 must-0.045
ves-0.045
Token their
Token ID511
Feature activation-1.54e-02

Pos logit contributions
Un+0.019
ind+0.017
elt+0.017
P+0.016
burn+0.016
Neg logit contributions
 may-0.023
 seem-0.018
 must-0.017
ves-0.016
 might-0.016
Token toys
Token ID14958
Feature activation-5.02e-02

Pos logit contributions
Un+0.022
burn+0.019
 yourselves+0.019
S+0.019
ind+0.019
Neg logit contributions
 may-0.031
 seem-0.025
ves-0.024
 leap-0.022
ches-0.022
Token and
Token ID290
Feature activation-5.54e-03

Pos logit contributions
Un+0.088
elt+0.082
ind+0.080
 yourselves+0.080
Thank+0.079
Neg logit contributions
 may-0.083
 seem-0.067
ves-0.059
hell-0.059
ches-0.058
Token and
Token ID290
Feature activation-1.93e-02

Pos logit contributions
Un+0.001
 yourselves+0.001
 dads+0.001
 sunscreen+0.001
S+0.001
Neg logit contributions
 may-0.001
 seem-0.001
ves-0.001
ches-0.001
'd-0.001
Token every
Token ID790
Feature activation-3.19e-02

Pos logit contributions
Un+0.032
 yourselves+0.030
S+0.030
ind+0.030
 sunscreen+0.030
Neg logit contributions
 may-0.033
 seem-0.025
ves-0.023
'd-0.022
ches-0.021
Token time
Token ID640
Feature activation-1.51e-03

Pos logit contributions
Un+0.049
 late+0.045
 sunscreen+0.045
 yourselves+0.045
S+0.044
Neg logit contributions
 may-0.055
 seem-0.044
ves-0.042
ches-0.040
otion-0.038
Token it
Token ID340
Feature activation+1.53e-02

Pos logit contributions
Un+0.002
 each+0.002
 late+0.002
 cher+0.002
 sunscreen+0.002
Neg logit contributions
 may-0.002
 seem-0.002
ves-0.002
ches-0.002
otion-0.002
Token rains
Token ID29424
Feature activation-1.64e-02

Pos logit contributions
 may+0.024
 seem+0.019
ves+0.018
ches+0.017
otion+0.016
Neg logit contributions
Un-0.025
 least-0.024
 late-0.023
 yourselves-0.023
 sunscreen-0.023
Token,
Token ID11
Feature activation+7.23e-03

Pos logit contributions
Un+0.027
 dads+0.025
 yourselves+0.025
S+0.025
ind+0.025
Neg logit contributions
 may-0.027
 seem-0.021
ves-0.020
ches-0.019
'd-0.018
Token he
Token ID339
Feature activation+2.66e-03

Pos logit contributions
 may+0.011
 seem+0.009
ves+0.008
ches+0.007
 leap+0.007
Neg logit contributions
Un-0.013
 yourselves-0.012
ind-0.012
 dads-0.012
S-0.012
Token looks
Token ID3073
Feature activation+8.28e-03

Pos logit contributions
 may+0.004
 seem+0.003
'd+0.003
ches+0.003
ves+0.003
Neg logit contributions
Un-0.005
 yourselves-0.005
 cher-0.005
S-0.005
 sunscreen-0.005
Token for
Token ID329
Feature activation+2.05e-02

Pos logit contributions
 may+0.017
 seem+0.014
ches+0.013
 leap+0.013
ves+0.013
Neg logit contributions
Un-0.011
 yourselves-0.010
S-0.009
 dads-0.009
Thank-0.009
Token his
Token ID465
Feature activation-1.75e-02

Pos logit contributions
 may+0.043
 seem+0.035
ves+0.034
ches+0.032
 leap+0.032
Neg logit contributions
Un-0.026
burn-0.023
Thank-0.023
S-0.023
ind-0.023
Token box
Token ID3091
Feature activation+6.18e-03

Pos logit contributions
Un+0.025
 yourselves+0.023
S+0.022
Thank+0.022
ind+0.021
Neg logit contributions
 may-0.035
 seem-0.027
ves-0.026
 leap-0.025
ches-0.025
Token with
Token ID351
Feature activation-1.45e-03

Pos logit contributions
Un+0.020
 yourselves+0.018
 sunscreen+0.018
 dads+0.018
S+0.018
Neg logit contributions
 may-0.037
 seem-0.031
ves-0.029
ches-0.028
'd-0.028
Token his
Token ID465
Feature activation-1.68e-02

Pos logit contributions
Un+0.002
ind+0.002
kind+0.002
Thank+0.002
thank+0.002
Neg logit contributions
 may-0.002
 seem-0.002
ves-0.002
ches-0.002
 leap-0.002
Token new
Token ID649
Feature activation-4.73e-02

Pos logit contributions
Un+0.026
 yourselves+0.024
S+0.023
Thank+0.023
thank+0.022
Neg logit contributions
 may-0.032
ves-0.023
ches-0.023
 might-0.023
 seem-0.023
Token sp
Token ID599
Feature activation-5.17e-03

Pos logit contributions
Un+0.077
 yourselves+0.067
Thank+0.066
 late+0.065
S+0.064
Neg logit contributions
 may-0.087
 seem-0.064
 leap-0.064
ves-0.064
 story-0.062
Tokenade
Token ID671
Feature activation-1.96e-02

Pos logit contributions
Un+0.009
ind+0.009
 yourselves+0.009
S+0.009
 dads+0.009
Neg logit contributions
 may-0.007
 seem-0.006
ves-0.005
'd-0.004
otion-0.004
Token,
Token ID11
Feature activation+1.59e-02

Pos logit contributions
Un+0.033
 yourselves+0.030
ind+0.030
S+0.030
Thank+0.030
Neg logit contributions
 may-0.032
 seem-0.025
ves-0.023
ches-0.023
'd-0.022
Token his
Token ID465
Feature activation-7.08e-03

Pos logit contributions
 may+0.014
 seem+0.009
ves+0.007
'd+0.006
 must+0.006
Neg logit contributions
Un-0.037
ind-0.037
kind-0.037
 yourselves-0.036
 dads-0.036
Token friend
Token ID1545
Feature activation-1.13e-02

Pos logit contributions
Un+0.010
 yourselves+0.009
S+0.009
burn+0.009
Thank+0.008
Neg logit contributions
 may-0.014
 seem-0.010
ves-0.010
 leap-0.010
'd-0.010
Token Johnny
Token ID15470
Feature activation+1.03e-02

Pos logit contributions
Un+0.018
 yourselves+0.017
ind+0.016
 sunscreen+0.016
kind+0.016
Neg logit contributions
 may-0.019
 seem-0.015
ves-0.014
ches-0.014
'd-0.014
Token came
Token ID1625
Feature activation+7.49e-05

Pos logit contributions
 may+0.020
 seem+0.017
ves+0.016
ches+0.015
 leap+0.015
Neg logit contributions
Un-0.013
 least-0.012
 dads-0.012
 late-0.012
 each-0.012
Token over
Token ID625
Feature activation-3.50e-02

Pos logit contributions
 may+0.000
ches+0.000
 seem+0.000
 leap+0.000
'd+0.000
Neg logit contributions
Un-0.000
 yourselves-0.000
 cher-0.000
 sunscreen-0.000
Thank-0.000
Token?"
Token ID1701
Feature activation+1.55e-02

Pos logit contributions
Un+0.014
 dads+0.013
 yourselves+0.013
S+0.013
ind+0.013
Neg logit contributions
 may-0.015
 seem-0.012
ches-0.011
ves-0.011
 leap-0.010
Token Her
Token ID2332
Feature activation-1.17e-02

Pos logit contributions
 may+0.024
 seem+0.018
ves+0.016
 story+0.014
 must+0.014
Neg logit contributions
ind-0.027
 yourselves-0.027
Un-0.027
kind-0.026
 least-0.025
Token mom
Token ID1995
Feature activation-2.84e-02

Pos logit contributions
Un+0.011
 yourselves+0.010
 sunscreen+0.009
S+0.009
 least+0.009
Neg logit contributions
 may-0.028
 seem-0.023
ves-0.023
ches-0.022
'd-0.021
Tokenmy
Token ID1820
Feature activation-1.26e-02

Pos logit contributions
Un+0.023
 dads+0.019
S+0.019
 yourselves+0.019
 least+0.019
Neg logit contributions
 may-0.070
 seem-0.061
ves-0.059
ches-0.056
 leap-0.055
Token said
Token ID531
Feature activation+8.60e-02

Pos logit contributions
Un+0.016
S+0.015
 each+0.015
 dads+0.015
Thank+0.015
Neg logit contributions
 may-0.025
 seem-0.021
ves-0.020
otion-0.019
izards-0.019
Token,
Token ID11
Feature activation+2.55e-02

Pos logit contributions
 may+0.169
 seem+0.143
ches+0.140
izards+0.136
ves+0.135
Neg logit contributions
 each-0.102
 cher-0.095
Un-0.094
S-0.092
 then-0.090
Token "
Token ID366
Feature activation-5.62e-02

Pos logit contributions
 may+0.035
 seem+0.026
ves+0.023
ches+0.022
'd+0.022
Neg logit contributions
Un-0.048
 yourselves-0.047
ind-0.046
 sunscreen-0.046
 least-0.046
TokenLet
Token ID5756
Feature activation+2.59e-02

Pos logit contributions
 sunscreen+0.060
 former+0.059
 yourselves+0.059
ind+0.058
Un+0.056
Neg logit contributions
 may-0.121
 seem-0.100
'd-0.096
 might-0.092
 must-0.091
Token's
Token ID338
Feature activation-4.27e-02

Pos logit contributions
 may+0.046
 seem+0.037
ves+0.036
ches+0.033
otion+0.033
Neg logit contributions
Un-0.038
 yourselves-0.035
S-0.035
 sunscreen-0.035
 dads-0.035
Token take
Token ID1011
Feature activation+8.20e-03

Pos logit contributions
Un+0.081
elt+0.075
S+0.074
ind+0.074
 late+0.074
Neg logit contributions
 may-0.063
 seem-0.050
 leap-0.043
 must-0.041
ves-0.039
Token it
Token ID340
Feature activation+5.27e-03

Pos logit contributions
 may+0.013
 seem+0.010
ches+0.010
 leap+0.010
 must+0.009
Neg logit contributions
Un-0.015
view-0.014
 former-0.014
ind-0.013
ich-0.013
Token she
Token ID673
Feature activation+1.47e-02

Pos logit contributions
 may+0.093
 seem+0.077
ves+0.070
'd+0.065
otion+0.064
Neg logit contributions
Un-0.079
 dads-0.072
ind-0.072
 sunscreen-0.072
burn-0.071
Token saw
Token ID2497
Feature activation+1.04e-02

Pos logit contributions
 may+0.025
 seem+0.020
ves+0.018
ches+0.018
'd+0.018
Neg logit contributions
Un-0.024
 yourselves-0.022
 dads-0.022
 sunscreen-0.022
S-0.022
Token the
Token ID262
Feature activation+1.99e-02

Pos logit contributions
 may+0.016
 seem+0.013
ves+0.011
 must+0.011
ches+0.010
Neg logit contributions
Un-0.019
burn-0.017
 dads-0.017
Thank-0.017
S-0.017
Token bug
Token ID5434
Feature activation+1.44e-02

Pos logit contributions
 may+0.038
 seem+0.031
ves+0.030
ches+0.029
otion+0.028
Neg logit contributions
Un-0.027
 yourselves-0.025
 cher-0.024
 least-0.024
 late-0.024
Token.
Token ID13
Feature activation-7.18e-05

Pos logit contributions
 may+0.028
 seem+0.023
ves+0.022
ches+0.021
'd+0.021
Neg logit contributions
Un-0.018
 yourselves-0.016
 cher-0.016
 least-0.016
 late-0.016
Token\n
Token ID198
Feature activation-8.61e-03

Pos logit contributions
Un+0.000
burn+0.000
ind+0.000
 sunscreen+0.000
 least+0.000
Neg logit contributions
 may-0.000
 seem-0.000
ves-0.000
 must-0.000
otion-0.000
Token\n
Token ID198
Feature activation-8.72e-03

Pos logit contributions
Un+0.009
S+0.009
 late+0.008
Mom+0.008
 yourselves+0.008
Neg logit contributions
 may-0.019
 seem-0.016
ches-0.015
ves-0.015
 leap-0.015
TokenJane
Token ID41083
Feature activation-9.15e-03

Pos logit contributions
Un+0.009
 least+0.009
burn+0.009
 sunscreen+0.009
 dads+0.009
Neg logit contributions
 may-0.018
 seem-0.016
ves-0.015
'd-0.014
ches-0.014
Token wanted
Token ID2227
Feature activation-3.34e-02

Pos logit contributions
Un+0.016
 yourselves+0.015
 sunscreen+0.015
 late+0.014
 dads+0.014
Neg logit contributions
 may-0.015
 seem-0.011
ves-0.011
ches-0.010
'd-0.010
Token to
Token ID284
Feature activation+1.91e-02

Pos logit contributions
Un+0.047
ind+0.043
burn+0.043
 yourselves+0.042
Thank+0.041
Neg logit contributions
 may-0.066
 seem-0.056
 must-0.050
ves-0.049
ches-0.048
Token make
Token ID787
Feature activation+3.67e-02

Pos logit contributions
 may+0.029
 seem+0.023
ves+0.020
 leap+0.019
'd+0.019
Neg logit contributions
Un-0.035
 late-0.032
ind-0.032
elt-0.032
S-0.032
Token.
Token ID13
Feature activation+3.57e-03

Pos logit contributions
 may+0.000
 seem+0.000
ves+0.000
ches+0.000
'd+0.000
Neg logit contributions
Un-0.000
 yourselves-0.000
 dads-0.000
 sunscreen-0.000
S-0.000
Token It
Token ID632
Feature activation+1.71e-02

Pos logit contributions
 may+0.007
 seem+0.005
ves+0.005
ches+0.005
otion+0.005
Neg logit contributions
Un-0.005
ind-0.005
 dads-0.005
 least-0.005
kind-0.005
Token looked
Token ID3114
Feature activation+2.49e-02

Pos logit contributions
 may+0.028
 seem+0.021
ves+0.020
ches+0.020
'd+0.019
Neg logit contributions
Un-0.029
 yourselves-0.027
 dads-0.027
 sunscreen-0.027
ind-0.027
Token down
Token ID866
Feature activation-1.98e-02

Pos logit contributions
 may+0.051
 seem+0.040
ches+0.038
ves+0.037
 leap+0.037
Neg logit contributions
Un-0.034
 yourselves-0.030
burn-0.030
 dads-0.029
ind-0.029
Token at
Token ID379
Feature activation+1.52e-02

Pos logit contributions
Un+0.031
 dads+0.029
 sunscreen+0.028
 least+0.028
 yourselves+0.028
Neg logit contributions
 may-0.035
 seem-0.027
ves-0.025
'd-0.024
ches-0.024
Token Sara
Token ID24799
Feature activation-8.50e-03

Pos logit contributions
 may+0.022
 seem+0.019
ves+0.015
otion+0.015
 might+0.015
Neg logit contributions
kind-0.027
 responsible-0.026
Un-0.026
elt-0.025
 former-0.024
Token and
Token ID290
Feature activation+1.42e-02

Pos logit contributions
Un+0.010
 yourselves+0.009
 late+0.009
 least+0.009
 cher+0.009
Neg logit contributions
 may-0.018
 seem-0.015
ves-0.014
'd-0.013
ches-0.013
Token Ben
Token ID3932
Feature activation+2.13e-02

Pos logit contributions
 may+0.017
 popped+0.012
 leap+0.012
 nib+0.012
 hops+0.012
Neg logit contributions
ind-0.030
kind-0.029
 former-0.027
 yourselves-0.027
 property-0.026
Token.
Token ID13
Feature activation+3.70e-03

Pos logit contributions
 may+0.046
 seem+0.038
ves+0.038
ches+0.035
'd+0.035
Neg logit contributions
 each-0.021
Un-0.021
 least-0.020
 cher-0.020
 late-0.020
Token It
Token ID632
Feature activation+1.89e-02

Pos logit contributions
 may+0.006
 seem+0.004
ves+0.004
ches+0.004
otion+0.004
Neg logit contributions
Un-0.007
 least-0.006
 yourselves-0.006
ind-0.006
 dads-0.006
Token waved
Token ID26834
Feature activation+9.12e-03

Pos logit contributions
 may+0.030
 seem+0.023
ves+0.022
ches+0.021
'd+0.020
Neg logit contributions
Un-0.032
 yourselves-0.030
 dads-0.030
 sunscreen-0.030
ind-0.029
Token time
Token ID640
Feature activation+6.18e-03

Pos logit contributions
Un+0.058
S+0.054
 late+0.053
 sunscreen+0.053
 dads+0.053
Neg logit contributions
 may-0.044
 seem-0.034
ves-0.031
ches-0.029
otion-0.028
Token,
Token ID11
Feature activation+2.76e-03

Pos logit contributions
 may+0.012
ches+0.010
ves+0.010
 seem+0.010
 trigger+0.010
Neg logit contributions
Un-0.007
S-0.007
 late-0.007
 cher-0.007
 each-0.006
Token there
Token ID612
Feature activation+3.48e-02

Pos logit contributions
 may+0.006
 seem+0.005
ves+0.005
ches+0.004
 leap+0.004
Neg logit contributions
Un-0.003
 late-0.003
 yourselves-0.003
S-0.003
 cher-0.003
Token was
Token ID373
Feature activation-1.11e-02

Pos logit contributions
 may+0.070
ides+0.067
 trigger+0.066
ches+0.065
fters+0.063
Neg logit contributions
 each-0.038
Un-0.034
S-0.033
Mom-0.033
 least-0.031
Token a
Token ID257
Feature activation+4.33e-03

Pos logit contributions
Un+0.012
kind+0.012
 dads+0.012
 yourselves+0.011
ind+0.011
Neg logit contributions
 may-0.022
 seem-0.019
ves-0.018
izards-0.017
ches-0.017
Token little
Token ID1310
Feature activation-1.99e-02

Pos logit contributions
 may+0.009
 seem+0.007
ves+0.007
ches+0.006
'd+0.006
Neg logit contributions
Un-0.006
 yourselves-0.005
 sunscreen-0.005
 dads-0.005
 least-0.005
Token girl
Token ID2576
Feature activation+3.05e-02

Pos logit contributions
Un+0.027
 yourselves+0.026
 former+0.025
view+0.024
thank+0.024
Neg logit contributions
 may-0.038
 seem-0.029
 leap-0.028
ches-0.027
 must-0.027
Token named
Token ID3706
Feature activation+3.94e-02

Pos logit contributions
 may+0.052
ves+0.042
ches+0.041
 seem+0.041
 must+0.039
Neg logit contributions
 sunscreen-0.042
Un-0.042
 yourselves-0.041
 former-0.041
 dads-0.041
Token Lily
Token ID20037
Feature activation+3.05e-02

Pos logit contributions
 may+0.079
 seem+0.066
ves+0.062
ches+0.060
'd+0.058
Neg logit contributions
Un-0.051
 yourselves-0.046
 dads-0.045
 sunscreen-0.045
 least-0.044
Token.
Token ID13
Feature activation-9.25e-03

Pos logit contributions
 may+0.049
 seem+0.039
ves+0.036
ches+0.036
'd+0.035
Neg logit contributions
Un-0.048
 sunscreen-0.047
 yourselves-0.047
 dads-0.046
 former-0.045
Token One
Token ID1881
Feature activation-4.47e-02

Pos logit contributions
Un+0.008
S+0.008
Thank+0.008
 late+0.008
Mom+0.007
Neg logit contributions
 may-0.022
ves-0.020
 seem-0.018
ches-0.018
 leap-0.018
Token pretty
Token ID2495
Feature activation+1.33e-02

Pos logit contributions
 may+0.017
 seem+0.014
ves+0.013
ches+0.013
'd+0.012
Neg logit contributions
Un-0.009
burn-0.009
 yourselves-0.009
 least-0.008
 dads-0.008
Token.
Token ID13
Feature activation-7.78e-03

Pos logit contributions
 may+0.024
 seem+0.019
ves+0.017
ches+0.017
 must+0.017
Neg logit contributions
Un-0.021
 dads-0.019
 yourselves-0.019
Thank-0.018
 sunscreen-0.018
Token
Token ID198
Feature activation-1.71e-02

Pos logit contributions
Un+0.012
ind+0.011
 yourselves+0.011
burn+0.011
kind+0.011
Neg logit contributions
 may-0.013
 seem-0.010
ves-0.009
ches-0.009
'd-0.009
Token
Token ID198
Feature activation-1.54e-02

Pos logit contributions
Un+0.019
S+0.018
 late+0.017
 yourselves+0.017
Mom+0.016
Neg logit contributions
 may-0.038
 seem-0.032
ves-0.029
ches-0.029
 leap-0.029
Token"
Token ID1
Feature activation-6.06e-02

Pos logit contributions
 least+0.016
 cher+0.016
burn+0.016
Un+0.016
elt+0.015
Neg logit contributions
 may-0.032
ves-0.027
 seem-0.027
ches-0.025
'd-0.025
TokenMom
Token ID29252
Feature activation-3.94e-02

Pos logit contributions
 home+0.091
 least+0.085
ind+0.085
 dads+0.082
 yourselves+0.081
Neg logit contributions
 may-0.101
'd-0.081
ves-0.080
 seem-0.078
 opener-0.074
Tokenmy
Token ID1820
Feature activation-3.01e-02

Pos logit contributions
Un+0.060
 yourselves+0.055
 sunscreen+0.054
ind+0.054
S+0.053
Neg logit contributions
 may-0.071
 seem-0.056
ves-0.051
ches-0.051
'd-0.050
Token,
Token ID11
Feature activation+1.27e-02

Pos logit contributions
Un+0.044
 cher+0.041
 dads+0.041
 yourselves+0.041
 least+0.041
Neg logit contributions
 may-0.052
 seem-0.043
ves-0.042
ches-0.039
otion-0.037
Token can
Token ID460
Feature activation-1.95e-02

Pos logit contributions
 may+0.019
 seem+0.014
ves+0.012
ches+0.012
'd+0.012
Neg logit contributions
Un-0.024
ind-0.022
 yourselves-0.022
thank-0.022
S-0.022
Token we
Token ID356
Feature activation-3.69e-03

Pos logit contributions
Un+0.025
 yourselves+0.023
 sunscreen+0.022
 dads+0.022
 least+0.022
Neg logit contributions
 may-0.038
 seem-0.031
ves-0.031
ches-0.030
'd-0.028
Token go
Token ID467
Feature activation-4.34e-02

Pos logit contributions
Un+0.007
 yourselves+0.007
 late+0.006
ind+0.006
elt+0.006
Neg logit contributions
 may-0.006
 seem-0.004
 must-0.004
'd-0.004
 leap-0.004
Token with
Token ID351
Feature activation-1.27e-02

Pos logit contributions
Un+0.065
 yourselves+0.059
P+0.059
ind+0.058
 sunscreen+0.057
Neg logit contributions
 may-0.068
 seem-0.057
ves-0.053
ches-0.049
'd-0.049
Token a
Token ID257
Feature activation-5.66e-03

Pos logit contributions
Un+0.016
 yourselves+0.015
ind+0.014
 dads+0.013
elt+0.013
Neg logit contributions
 may-0.026
 seem-0.022
ves-0.020
 leap-0.020
ches-0.019
Token smile
Token ID8212
Feature activation+9.14e-03

Pos logit contributions
Un+0.008
 yourselves+0.008
ind+0.008
kind+0.007
 least+0.007
Neg logit contributions
 may-0.011
 seem-0.008
ves-0.008
 leap-0.008
'd-0.008
Token on
Token ID319
Feature activation+2.77e-02

Pos logit contributions
 may+0.018
 seem+0.014
ves+0.014
ches+0.013
'd+0.013
Neg logit contributions
Un-0.012
 yourselves-0.012
 least-0.011
 dads-0.011
 sunscreen-0.011
Token his
Token ID465
Feature activation-1.11e-02

Pos logit contributions
 may+0.051
 seem+0.042
ves+0.039
ches+0.037
 leap+0.036
Neg logit contributions
Un-0.040
 yourselves-0.037
 least-0.037
 dads-0.036
elt-0.036
Token ch
Token ID442
Feature activation-9.27e-03

Pos logit contributions
Un+0.020
 yourselves+0.019
Thank+0.019
 late+0.019
S+0.019
Neg logit contributions
 may-0.016
 seem-0.012
ves-0.012
ches-0.011
 leap-0.011
Tokenubby
Token ID38393
Feature activation-5.24e-03

Pos logit contributions
Un+0.017
 yourselves+0.016
 dads+0.016
 sunscreen+0.016
 late+0.016
Neg logit contributions
 may-0.013
 seem-0.010
ves-0.009
ches-0.009
 leap-0.008
Token little
Token ID1310
Feature activation-3.00e-02

Pos logit contributions
Un+0.009
 yourselves+0.008
 late+0.008
 least+0.008
S+0.008
Neg logit contributions
 may-0.009
 seem-0.007
ves-0.006
 leap-0.006
ches-0.006
Token face
Token ID1986
Feature activation+1.58e-02

Pos logit contributions
Un+0.050
 yourselves+0.046
 late+0.044
 least+0.044
ind+0.043
Neg logit contributions
 may-0.052
 leap-0.040
 seem-0.040
ves-0.035
'd-0.035
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
 may+0.027
 seem+0.021
ves+0.020
ches+0.019
'd+0.019
Neg logit contributions
Un-0.025
 yourselves-0.024
 dads-0.023
 least-0.023
ind-0.023
Token<|endoftext|>
Token ID50256
Feature activation+2.40e-02

Pos logit contributions
Un+0.019
 yourselves+0.018
ind+0.018
 dads+0.018
 sunscreen+0.018
Neg logit contributions
 may-0.021
 seem-0.016
ves-0.015
ches-0.014
'd-0.014
Token the
Token ID262
Feature activation+3.43e-03

Pos logit contributions
Un+0.064
 dads+0.059
elt+0.059
thank+0.058
Thank+0.057
Neg logit contributions
 may-0.048
 seem-0.035
ches-0.033
ves-0.031
 must-0.030
Token wheels
Token ID13666
Feature activation-2.26e-02

Pos logit contributions
 may+0.006
 seem+0.005
ves+0.005
hell+0.005
 trigger+0.005
Neg logit contributions
Un-0.005
elt-0.005
S-0.005
 former-0.005
ind-0.005
Token and
Token ID290
Feature activation+8.07e-03

Pos logit contributions
Un+0.037
 former+0.035
 dads+0.034
S+0.033
Thank+0.033
Neg logit contributions
 may-0.040
 seem-0.033
ches-0.030
ves-0.028
'd-0.027
Token ride
Token ID6594
Feature activation-2.73e-02

Pos logit contributions
 may+0.012
 seem+0.010
 leap+0.008
 hop+0.007
 roar+0.007
Neg logit contributions
ind-0.016
S-0.016
elt-0.016
Un-0.015
P-0.015
Token the
Token ID262
Feature activation-2.37e-03

Pos logit contributions
Un+0.042
elt+0.042
 former+0.041
 least+0.040
Thank+0.040
Neg logit contributions
 may-0.048
ves-0.037
 seem-0.037
ches-0.036
 must-0.033
Token bike
Token ID7161
Feature activation-6.15e-02

Pos logit contributions
Un+0.003
elt+0.003
 former+0.003
S+0.003
ind+0.003
Neg logit contributions
 may-0.005
 seem-0.004
ves-0.004
hell-0.003
'd-0.003
Token."
Token ID526
Feature activation+1.70e-02

Pos logit contributions
Un+0.103
 sunscreen+0.099
 dads+0.099
elt+0.096
Thank+0.095
Neg logit contributions
 may-0.103
ches-0.083
 seem-0.081
ves-0.076
 might-0.072
Token
Token ID198
Feature activation+7.06e-03

Pos logit contributions
 may+0.032
 seem+0.026
ves+0.024
ches+0.023
otion+0.023
Neg logit contributions
Un-0.024
ind-0.022
kind-0.022
 yourselves-0.021
 sunscreen-0.021
Token
Token ID198
Feature activation+2.88e-03

Pos logit contributions
 may+0.015
 seem+0.012
ves+0.012
ches+0.011
'd+0.011
Neg logit contributions
Un-0.009
 yourselves-0.008
S-0.008
 dads-0.008
 sunscreen-0.008
TokenL
Token ID43
Feature activation-1.81e-02

Pos logit contributions
 may+0.005
ves+0.004
 seem+0.004
 opener+0.004
ches+0.004
Neg logit contributions
 least-0.004
burn-0.004
elt-0.004
 cher-0.004
thank-0.003
Tokenily
Token ID813
Feature activation-7.31e-03

Pos logit contributions
Un+0.030
 yourselves+0.028
S+0.028
 least+0.027
 late+0.027
Neg logit contributions
 may-0.030
 seem-0.023
ves-0.022
ches-0.021
 leap-0.020
Token He
Token ID679
Feature activation-9.81e-03

Pos logit contributions
Un+0.009
 yourselves+0.008
ind+0.008
 sunscreen+0.008
 dads+0.008
Neg logit contributions
 may-0.008
 seem-0.006
ves-0.006
ches-0.005
'd-0.005
Token made
Token ID925
Feature activation+3.77e-02

Pos logit contributions
Un+0.016
 yourselves+0.015
 sunscreen+0.014
 dads+0.014
S+0.014
Neg logit contributions
 may-0.017
 seem-0.013
ves-0.013
ches-0.012
'd-0.012
Token sure
Token ID1654
Feature activation+1.54e-03

Pos logit contributions
 may+0.092
 seem+0.078
ves+0.074
otion+0.074
'd+0.073
Neg logit contributions
Un-0.029
 late-0.028
 yourselves-0.027
 each-0.027
 sunscreen-0.026
Token to
Token ID284
Feature activation+1.47e-02

Pos logit contributions
 may+0.003
 seem+0.002
ves+0.002
ches+0.002
'd+0.002
Neg logit contributions
Un-0.002
ind-0.002
S-0.002
 yourselves-0.002
thank-0.002
Token check
Token ID2198
Feature activation+2.57e-03

Pos logit contributions
 may+0.023
 seem+0.019
 leap+0.016
ves+0.016
'd+0.016
Neg logit contributions
elt-0.026
Un-0.026
kind-0.024
burn-0.023
 late-0.023
Token all
Token ID477
Feature activation-7.08e-02

Pos logit contributions
 may+0.005
 seem+0.004
ves+0.004
ches+0.004
'd+0.003
Neg logit contributions
Un-0.004
Thank-0.003
thank-0.003
S-0.003
 dads-0.003
Token the
Token ID262
Feature activation-2.34e-03

Pos logit contributions
Un+0.102
 yourselves+0.091
 dads+0.090
S+0.090
 sunscreen+0.090
Neg logit contributions
 may-0.133
 seem-0.108
ves-0.101
ches-0.099
 leap-0.094
Token places
Token ID4113
Feature activation-1.77e-02

Pos logit contributions
Un+0.003
 yourselves+0.003
 cher+0.003
 late+0.003
 least+0.003
Neg logit contributions
 may-0.004
 seem-0.004
ves-0.003
ches-0.003
otion-0.003
Token carefully
Token ID7773
Feature activation-1.92e-02

Pos logit contributions
Un+0.025
 yourselves+0.023
 dads+0.023
S+0.023
 sunscreen+0.023
Neg logit contributions
 may-0.033
 seem-0.027
ves-0.025
ches-0.024
'd-0.023
Token.
Token ID13
Feature activation-8.74e-03

Pos logit contributions
Un+0.029
 dads+0.026
S+0.026
 yourselves+0.026
ind+0.026
Neg logit contributions
 may-0.036
 seem-0.029
ves-0.027
ches-0.026
'd-0.025
Token
Token ID198
Feature activation-1.83e-02

Pos logit contributions
Un+0.014
 yourselves+0.013
ind+0.013
 sunscreen+0.013
 dads+0.013
Neg logit contributions
 may-0.014
 seem-0.011
ves-0.010
ches-0.010
'd-0.010
Token does
Token ID857
Feature activation+4.30e-02

Pos logit contributions
 may+0.008
ves+0.007
 seem+0.006
otion+0.006
fs+0.006
Neg logit contributions
Un-0.006
 least-0.006
 dads-0.006
 sunscreen-0.006
 cher-0.006
Token not
Token ID407
Feature activation-2.29e-02

Pos logit contributions
 may+0.083
 seem+0.068
ves+0.065
ches+0.062
'd+0.060
Neg logit contributions
Un-0.058
 yourselves-0.053
 dads-0.052
 sunscreen-0.052
 cher-0.052
Token want
Token ID765
Feature activation-1.37e-02

Pos logit contributions
Un+0.034
 yourselves+0.031
 dads+0.031
 sunscreen+0.031
S+0.031
Neg logit contributions
 may-0.041
 seem-0.033
ves-0.031
ches-0.030
'd-0.029
Token to
Token ID284
Feature activation+1.71e-02

Pos logit contributions
Un+0.022
ind+0.021
thank+0.021
Thank+0.021
eline+0.020
Neg logit contributions
 may-0.023
 seem-0.020
 must-0.017
 might-0.017
 opener-0.016
Token go
Token ID467
Feature activation-3.40e-02

Pos logit contributions
 may+0.026
 seem+0.023
ves+0.018
 leap+0.018
ches+0.017
Neg logit contributions
Un-0.030
elt-0.029
kind-0.028
Thank-0.027
 late-0.027
Token back
Token ID736
Feature activation-9.07e-02

Pos logit contributions
Un+0.054
 former+0.049
Thank+0.048
 cher+0.046
thank+0.045
Neg logit contributions
 may-0.055
 seem-0.051
ches-0.045
 must-0.042
 opener-0.042
Token.
Token ID13
Feature activation-5.21e-04

Pos logit contributions
Un+0.164
Thank+0.163
Sorry+0.154
thank+0.153
elt+0.152
Neg logit contributions
 may-0.123
 seem-0.105
 might-0.090
ches-0.089
ves-0.088
Token She
Token ID1375
Feature activation+1.93e-04

Pos logit contributions
burn+0.001
ind+0.001
thank+0.001
kind+0.001
 sunscreen+0.001
Neg logit contributions
 may-0.001
 seem-0.001
 might-0.001
 must-0.001
otion-0.001
Token wants
Token ID3382
Feature activation-2.89e-02

Pos logit contributions
 may+0.000
 seem+0.000
ves+0.000
otion+0.000
hell+0.000
Neg logit contributions
Un-0.000
 least-0.000
 dads-0.000
 yourselves-0.000
 sunscreen-0.000
Token to
Token ID284
Feature activation+1.36e-02

Pos logit contributions
ind+0.047
Un+0.045
thank+0.045
 former+0.044
Thank+0.044
Neg logit contributions
 may-0.043
 seem-0.039
 opener-0.032
 might-0.032
ves-0.031
Token have
Token ID423
Feature activation-4.19e-02

Pos logit contributions
 may+0.021
 seem+0.018
ves+0.015
 leap+0.015
ches+0.014
Neg logit contributions
Un-0.024
elt-0.023
ind-0.022
kind-0.022
 late-0.022
Token felt
Token ID2936
Feature activation-9.32e-04

Pos logit contributions
Un+0.044
 yourselves+0.041
 sunscreen+0.041
 dads+0.040
 cher+0.040
Neg logit contributions
 may-0.045
 seem-0.035
ves-0.033
ches-0.032
'd-0.031
Token bad
Token ID2089
Feature activation-1.81e-02

Pos logit contributions
Un+0.001
ind+0.001
burn+0.001
 yourselves+0.001
 dads+0.001
Neg logit contributions
 may-0.002
 seem-0.001
 must-0.001
ves-0.001
ches-0.001
Token and
Token ID290
Feature activation-4.85e-03

Pos logit contributions
Un+0.019
 late+0.018
 yourselves+0.018
 least+0.017
S+0.017
Neg logit contributions
 may-0.039
 seem-0.033
ves-0.032
ches-0.031
'd-0.030
Token said
Token ID531
Feature activation+6.81e-02

Pos logit contributions
Un+0.007
 yourselves+0.007
ind+0.007
 sunscreen+0.007
burn+0.007
Neg logit contributions
 may-0.009
 seem-0.007
ves-0.006
ches-0.006
'd-0.006
Token,
Token ID11
Feature activation+1.20e-02

Pos logit contributions
 may+0.124
 seem+0.102
ches+0.099
ves+0.097
 leap+0.092
Neg logit contributions
Un-0.094
 cher-0.087
S-0.087
 late-0.087
 dads-0.086
Token "
Token ID366
Feature activation-6.36e-02

Pos logit contributions
 may+0.017
 seem+0.013
ves+0.012
ches+0.011
'd+0.010
Neg logit contributions
Un-0.022
 yourselves-0.021
ind-0.021
 sunscreen-0.021
 dads-0.021
TokenI
Token ID40
Feature activation-4.42e-02

Pos logit contributions
 sunscreen+0.061
 dads+0.060
 cher+0.057
Un+0.053
 least+0.053
Neg logit contributions
 may-0.142
 seem-0.119
'd-0.116
ves-0.115
 must-0.109
Token'm
Token ID1101
Feature activation+6.00e-03

Pos logit contributions
Un+0.062
 least+0.057
S+0.057
 late+0.057
 dads+0.057
Neg logit contributions
 may-0.080
 seem-0.065
ves-0.064
ches-0.060
otion-0.058
Token sorry
Token ID7926
Feature activation+3.81e-03

Pos logit contributions
 may+0.008
 seem+0.007
otion+0.006
 must+0.006
ves+0.005
Neg logit contributions
Un-0.011
ind-0.011
thank-0.011
view-0.010
S-0.010
Token,
Token ID11
Feature activation+3.32e-03

Pos logit contributions
 may+0.008
ves+0.007
 seem+0.006
ches+0.006
'd+0.006
Neg logit contributions
Un-0.005
 yourselves-0.004
 each-0.004
 late-0.004
 dads-0.004
Token Mom
Token ID11254
Feature activation-3.59e-02

Pos logit contributions
 may+0.008
 seem+0.007
ves+0.006
'd+0.006
 must+0.006
Neg logit contributions
Un-0.003
ind-0.003
thank-0.003
kind-0.002
 yourselves-0.002
Token.
Token ID13
Feature activation-8.78e-03

Pos logit contributions
Un+0.016
 yourselves+0.014
 dads+0.014
 sunscreen+0.014
 least+0.014
Neg logit contributions
 may-0.023
 seem-0.019
ves-0.018
ches-0.017
 leap-0.017
Token The
Token ID383
Feature activation-4.05e-03

Pos logit contributions
Un+0.013
 late+0.012
 yourselves+0.012
Thank+0.012
 cher+0.011
Neg logit contributions
 may-0.016
 seem-0.013
ves-0.013
ches-0.012
'd-0.012
Token pirate
Token ID25868
Feature activation-2.19e-02

Pos logit contributions
Un+0.006
 least+0.006
 dads+0.006
 sunscreen+0.006
 late+0.006
Neg logit contributions
 may-0.007
 seem-0.006
ves-0.006
ches-0.006
otion-0.005
Token realized
Token ID6939
Feature activation-1.84e-02

Pos logit contributions
Un+0.034
 yourselves+0.031
 sunscreen+0.031
 dads+0.031
S+0.031
Neg logit contributions
 may-0.038
 seem-0.030
ves-0.029
ches-0.027
'd-0.026
Token that
Token ID326
Feature activation+9.81e-04

Pos logit contributions
Un+0.025
thank+0.023
kind+0.023
elt+0.023
burn+0.023
Neg logit contributions
 may-0.036
 seem-0.029
ves-0.028
 must-0.026
ches-0.025
Token even
Token ID772
Feature activation-8.58e-02

Pos logit contributions
 may+0.002
 seem+0.001
ves+0.001
 leap+0.001
ches+0.001
Neg logit contributions
Un-0.002
 yourselves-0.001
 dads-0.001
 sunscreen-0.001
S-0.001
Token though
Token ID996
Feature activation+1.70e-02

Pos logit contributions
Un+0.120
Mom+0.114
 yourselves+0.114
ind+0.113
 dads+0.113
Neg logit contributions
 may-0.161
 seem-0.135
ches-0.118
 must-0.116
 leap-0.113
Token things
Token ID1243
Feature activation+4.95e-03

Pos logit contributions
 may+0.026
 seem+0.020
ves+0.018
ches+0.017
'd+0.017
Neg logit contributions
Un-0.030
 yourselves-0.028
 dads-0.028
 sunscreen-0.028
S-0.028
Token can
Token ID460
Feature activation+2.51e-02

Pos logit contributions
 may+0.006
ves+0.005
 seem+0.005
ches+0.004
otion+0.004
Neg logit contributions
Un-0.009
 least-0.009
 yourselves-0.009
 sunscreen-0.009
 each-0.009
Token be
Token ID307
Feature activation+5.20e-03

Pos logit contributions
 may+0.042
ves+0.032
 seem+0.032
ches+0.031
otion+0.030
Neg logit contributions
Un-0.039
 least-0.036
 yourselves-0.035
 each-0.035
 sunscreen-0.035
Token scary
Token ID14343
Feature activation+1.02e-02

Pos logit contributions
 may+0.010
 seem+0.009
ches+0.008
 must+0.008
'd+0.008
Neg logit contributions
Un-0.007
Mom-0.007
 yourselves-0.006
ind-0.006
Thank-0.006