TOP ACTIVATIONS
MAX = 8.210

They had never seen a mer maid before .
swimming around her . The mer maid smiled and said ,
, she saw a beautiful mer maid swimming around her .
beach , he saw a mer maid . She had beautiful
. He even saw a mer maid !
again . He said the mer maid and a dolphin were
demands , crab ," the mer maid said . " I
sun . A mer maid that lived in the
passport belonged to a magical mer maid who had been visiting
castle so much . The mer maid said , " I
Jack asked the mer maid why the waves were
yes . She followed the mer maid through the hidden door
water . The mer maid smiled at Melody .
made . The mer maid learnt that taking risks
the clam stepped a beautiful mer maid . " I heard
was so thankful that the mer maid had forgiven him .
sometimes pretend to be a mer maid .
Joe waved goodbye to the mer maid , and hopped back
. " I am a mer maid . Do you want
telling a story about a mer maid who lived in the

MIN ACTIVATIONS
MIN = -7.217

. It was covered in de w from the wet rain
a special job every day during nap time and he was
for helping them stay safe during the storm . <|endoftext|> Once
, the farmer in the de ll ". Suddenly , he
do can prevent us from feeling upset in the long run
<|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> L ila and Ben are friends
. Later , during recess , Lily 's friend
ometer wherever it took him during his journey , and he
awake at night and sleep during the day ." Lily thought
and looked at the ost ric hes . They saw one
exciting things it had found during its journey . The purple
sack , it disappeared right before his eyes !
the pond modest , just between them . <|endoftext|> R andy
. They were called ost ric hes . Lily and Ben
the water and forget about feeling uncomfortable . <|endoftext|> Once upon
of water from the morning de w . Then
and his family lived in before . Still , he had
higher and faster than ever before . So it hopped on
and run to the ost ric hes . "
ie . Unlike other ost ric hes , his feathers gl

INTERVAL 4.353 - 8.210
CONTAINS 0.554%

apples to Mrs . Lee , who lived across the street
girl nodded and said , " Okay . I don 't
. She asked her mom , " Mom my , can
the child . " Child , I regret many
She asked her mom my , " What is that big

INTERVAL 0.496 - 4.353
CONTAINS 38.192%

't stop crying . He missed his cap and felt powerless
clay . He mixed it together and formed it into a
you , thank you ! Can I try it on ?"
listened to the owl 's warning and started to look for
uffy white clouds . " What a beautiful

INTERVAL -3.360 - 0.496
CONTAINS 58.270%

the trees , low on the ground and in all the
it and sang songs to it . Little by
their pictures . She gave away their dog to a stranger
friend 's birthday party . Lucy was so happy and couldn
too ," said the family . " Thank you !" <|endoftext|>

INTERVAL -7.217 - -3.360
CONTAINS 2.984%

and choking . From then on , John decided to be
they were happy . <|endoftext|> S ara loved to visit the
friends . One day , while they were playing , they
day he decided to go for a walk . As he
new doll . She was so happy ! She thanked her
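The four INTERVAL blocks above are consistent with four equal-width bins spanning MIN = -7.217 to MAX = 8.210 (width about 3.857), and their CONTAINS percentages sum to 100%. A minimal sketch of how such shares could be computed follows. It assumes the activations are available as a flat list of floats; the function names `bin_edges` and `interval_shares` are hypothetical and not taken from the tool that produced this dump.

```python
def bin_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


def interval_shares(activations, n_bins=4):
    """Return (low_edge, high_edge, percent) per bin, lowest bin first.

    Assumption: bins are equal-width between the observed minimum and
    maximum, which matches the interval boundaries listed above.
    """
    lo, hi = min(activations), max(activations)
    if hi == lo:
        raise ValueError("all activations are identical; cannot form bins")
    edges = bin_edges(lo, hi, n_bins)
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for a in activations:
        # Clamp the index so the maximum value falls in the last bin.
        idx = min(int((a - lo) / width), n_bins - 1)
        counts[idx] += 1
    total = len(activations)
    return [
        (edges[i], edges[i + 1], 100.0 * counts[i] / total)
        for i in range(n_bins)
    ]


# Edges implied by the MIN and MAX reported in this dump.
edges = bin_edges(-7.217, 8.210)

# Toy data (not from the dump) to exercise the share computation.
toy_shares = interval_shares([0, 1, 2, 3, 4, 5, 6, 8])
```

The sketch returns bins from lowest to highest, whereas the dump lists them from highest to lowest. The edges it derives from the reported MIN and MAX agree with the listed boundaries to within rounding.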
Token They
Token ID1119
Feature activation+6.65e-01

Pos logit contributions
 glad+0.006
 Lightning+0.006
 much+0.006
 And+0.006
 Thunder+0.005
Neg logit contributions
ware-0.022
ge-0.022
rily-0.021
pron-0.020
yt-0.020
Token had
Token ID550
Feature activation-9.38e-02

Pos logit contributions
ware+0.017
rily+0.017
ro+0.015
ge+0.015
bell+0.015
Neg logit contributions
 glad-0.012
ow-0.011
e-0.011
 ost-0.011
 much-0.011
Token never
Token ID1239
Feature activation+3.18e-01

Pos logit contributions
 glad+0.001
ow+0.001
 ost+0.001
e+0.001
 much+0.001
Neg logit contributions
ware-0.003
rily-0.003
ro-0.003
ge-0.002
bell-0.002
Token seen
Token ID1775
Feature activation+1.23e+00

Pos logit contributions
ware+0.008
rily+0.008
ro+0.007
ge+0.007
bell+0.007
Neg logit contributions
 glad-0.006
 ost-0.005
ow-0.005
e-0.005
 much-0.005
Token a
Token ID257
Feature activation+4.13e+00

Pos logit contributions
ware+0.037
rily+0.037
ro+0.035
ge+0.034
bell+0.034
Neg logit contributions
 glad-0.015
ow-0.014
 ost-0.014
e-0.013
 much-0.013
Token mer
Token ID4017
Feature activation+8.21e+00

Pos logit contributions
rily+0.103
ware+0.098
ge+0.096
driver+0.095
ro+0.093
Neg logit contributions
 glad-0.077
 much-0.072
 ost-0.072
 very-0.071
 afraid-0.071
Tokenmaid
Token ID23151
Feature activation+1.28e+00

Pos logit contributions
driver+0.138
ge+0.128
ware+0.125
 opener+0.120
Exc+0.119
Neg logit contributions
ow-0.196
 glad-0.184
e-0.181
 afraid-0.181
ap-0.179
Token before
Token ID878
Feature activation-4.71e+00

Pos logit contributions
ware+0.030
rily+0.029
ro+0.027
driver+0.027
ge+0.026
Neg logit contributions
e-0.024
ow-0.024
 glad-0.023
 ost-0.023
 very-0.022
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 glad+0.080
ow+0.077
 ost+0.076
e+0.075
 much+0.075
Neg logit contributions
ware-0.119
rily-0.117
ro-0.110
ge-0.108
bell-0.106
Token'\n'
Token ID198
Feature activation-3.48e-02

Pos logit contributions
 glad+0.012
 much+0.011
ow+0.010
 Lightning+0.010
 And+0.010
Neg logit contributions
ware-0.032
rily-0.031
ge-0.031
ro-0.028
yt-0.028
Token'\n'
Token ID198
Feature activation+5.19e-01

Pos logit contributions
 glad+0.001
ow+0.001
 ost+0.001
e+0.001
 much+0.000
Neg logit contributions
ware-0.001
rily-0.001
ro-0.001
ge-0.001
bell-0.001
Token swimming
Token ID14899
Feature activation+2.47e+00

Pos logit contributions
ware+0.021
rily+0.021
ro+0.020
ge+0.020
bell+0.019
Neg logit contributions
 glad-0.011
ow-0.010
 ost-0.010
e-0.010
 much-0.010
Token around
Token ID1088
Feature activation-5.60e-01

Pos logit contributions
ware+0.073
rily+0.070
ge+0.068
ro+0.066
bell+0.066
Neg logit contributions
 glad-0.031
ow-0.031
e-0.030
 much-0.030
 ost-0.030
Token her
Token ID607
Feature activation+7.95e-01

Pos logit contributions
 glad+0.007
e+0.007
ow+0.007
 ost+0.007
 much+0.007
Neg logit contributions
ware-0.016
rily-0.016
ro-0.015
driver-0.015
ge-0.015
Token.
Token ID13
Feature activation-7.50e-01

Pos logit contributions
ware+0.019
rily+0.019
ro+0.018
ge+0.017
bell+0.017
Neg logit contributions
 glad-0.014
ow-0.014
 ost-0.014
e-0.014
 much-0.013
Token The
Token ID383
Feature activation+1.93e-01

Pos logit contributions
 glad+0.012
ow+0.012
e+0.012
 ost+0.012
 much+0.011
Neg logit contributions
ware-0.019
rily-0.019
ro-0.018
bell-0.017
ge-0.017
Token mer
Token ID4017
Feature activation+8.18e+00

Pos logit contributions
ware+0.003
rily+0.003
ro+0.003
ge+0.003
bell+0.003
Neg logit contributions
 glad-0.005
 ost-0.005
ow-0.005
e-0.005
 much-0.005
Tokenmaid
Token ID23151
Feature activation+1.47e-01

Pos logit contributions
driver+0.124
ge+0.121
 opener+0.114
 afford+0.113
ware+0.112
Neg logit contributions
 glad-0.200
ow-0.197
 lives-0.195
 ost-0.193
 afraid-0.188
Token smiled
Token ID13541
Feature activation+2.87e-01

Pos logit contributions
ware+0.004
rily+0.003
ro+0.003
ge+0.003
bell+0.003
Neg logit contributions
 glad-0.003
ow-0.003
 ost-0.003
e-0.003
 much-0.003
Token and
Token ID290
Feature activation-9.89e-01

Pos logit contributions
ware+0.008
ge+0.007
ro+0.007
rily+0.007
driver+0.007
Neg logit contributions
 glad-0.005
 much-0.005
 very-0.005
ow-0.005
 afraid-0.004
Token said
Token ID531
Feature activation+1.39e+00

Pos logit contributions
ow+0.020
e+0.020
 glad+0.019
 ost+0.019
 much+0.018
Neg logit contributions
rily-0.021
ware-0.021
ro-0.019
bell-0.019
ge-0.018
Token,
Token ID11
Feature activation+2.91e+00

Pos logit contributions
ware+0.039
rily+0.038
ro+0.037
ge+0.036
bell+0.035
Neg logit contributions
 glad-0.020
ow-0.019
 much-0.019
 ost-0.019
e-0.018
Token,
Token ID11
Feature activation+9.78e-01

Pos logit contributions
 glad+0.008
ow+0.007
 much+0.007
e+0.007
 ost+0.007
Neg logit contributions
ware-0.016
rily-0.016
ro-0.015
ge-0.015
bell-0.015
Token she
Token ID673
Feature activation+1.78e-01

Pos logit contributions
ware+0.021
rily+0.021
ge+0.020
ro+0.019
bell+0.019
Neg logit contributions
e-0.019
ow-0.019
 glad-0.019
 ost-0.019
 much-0.017
Token saw
Token ID2497
Feature activation+2.74e-01

Pos logit contributions
ware+0.004
rily+0.004
ro+0.004
ge+0.004
driver+0.004
Neg logit contributions
 glad-0.003
ow-0.003
e-0.003
 much-0.003
 ost-0.003
Token a
Token ID257
Feature activation+3.17e+00

Pos logit contributions
ware+0.008
rily+0.008
ro+0.008
fully+0.008
ge+0.008
Neg logit contributions
 glad-0.003
ow-0.003
e-0.003
 ost-0.003
 much-0.003
Token beautiful
Token ID4950
Feature activation+2.52e+00

Pos logit contributions
ware+0.083
rily+0.083
ro+0.077
ge+0.076
bell+0.074
Neg logit contributions
 glad-0.052
 ost-0.049
 much-0.048
ow-0.047
e-0.047
Token mer
Token ID4017
Feature activation+8.16e+00

Pos logit contributions
ware+0.061
rily+0.061
ro+0.056
ge+0.056
bell+0.054
Neg logit contributions
 glad-0.045
 ost-0.043
ow-0.042
e-0.042
 much-0.042
Tokenmaid
Token ID23151
Feature activation+7.57e-01

Pos logit contributions
driver+0.123
ge+0.120
ware+0.117
 afford+0.108
 opener+0.104
Neg logit contributions
ow-0.198
 glad-0.195
e-0.193
 afraid-0.192
 very-0.191
Token swimming
Token ID14899
Feature activation+2.47e+00

Pos logit contributions
ware+0.021
rily+0.021
ro+0.020
ge+0.020
bell+0.019
Neg logit contributions
 glad-0.011
ow-0.010
 ost-0.010
e-0.010
 much-0.010
Token around
Token ID1088
Feature activation-5.60e-01

Pos logit contributions
ware+0.073
rily+0.070
ge+0.068
ro+0.066
bell+0.066
Neg logit contributions
 glad-0.031
ow-0.031
e-0.030
 much-0.030
 ost-0.030
Token her
Token ID607
Feature activation+7.95e-01

Pos logit contributions
 glad+0.007
e+0.007
ow+0.007
 ost+0.007
 much+0.007
Neg logit contributions
ware-0.016
rily-0.016
ro-0.015
driver-0.015
ge-0.015
Token.
Token ID13
Feature activation-7.50e-01

Pos logit contributions
ware+0.019
rily+0.019
ro+0.018
ge+0.017
bell+0.017
Neg logit contributions
 glad-0.014
ow-0.014
 ost-0.014
e-0.014
 much-0.013
Token beach
Token ID10481
Feature activation-1.85e-01

Pos logit contributions
 glad+0.024
ow+0.023
e+0.023
 ost+0.022
 much+0.022
Neg logit contributions
ware-0.024
rily-0.024
ro-0.022
ge-0.022
bell-0.021
Token,
Token ID11
Feature activation+3.66e-01

Pos logit contributions
 glad+0.003
ow+0.003
 ost+0.003
e+0.003
 much+0.003
Neg logit contributions
ware-0.005
rily-0.005
ro-0.005
ge-0.005
bell-0.004
Token he
Token ID339
Feature activation+6.19e-01

Pos logit contributions
ware+0.008
rily+0.008
ro+0.008
ge+0.007
bell+0.007
Neg logit contributions
 glad-0.007
ow-0.007
 ost-0.007
e-0.007
 much-0.007
Token saw
Token ID2497
Feature activation-1.42e-02

Pos logit contributions
rily+0.015
ware+0.015
ro+0.013
ge+0.013
bell+0.013
Neg logit contributions
 glad-0.012
ow-0.011
 ost-0.011
e-0.011
 much-0.011
Token a
Token ID257
Feature activation+2.89e+00

Pos logit contributions
ow+0.000
e+0.000
 glad+0.000
 ost+0.000
opl+0.000
Neg logit contributions
ware-0.000
rily-0.000
ro-0.000
fully-0.000
driver-0.000
Token mer
Token ID4017
Feature activation+8.08e+00

Pos logit contributions
ware+0.074
rily+0.074
ro+0.068
ge+0.067
bell+0.066
Neg logit contributions
 glad-0.048
 ost-0.046
ow-0.045
 much-0.045
e-0.044
Tokenmaid
Token ID23151
Feature activation+1.23e+00

Pos logit contributions
driver+0.116
ge+0.115
ware+0.111
ro+0.103
 opener+0.099
Neg logit contributions
ow-0.204
 glad-0.200
 afraid-0.199
 very-0.197
e-0.195
Token.
Token ID13
Feature activation-3.78e-01

Pos logit contributions
ware+0.036
rily+0.035
ro+0.033
ge+0.033
bell+0.032
Neg logit contributions
 glad-0.016
ow-0.015
 ost-0.015
e-0.015
 much-0.015
Token She
Token ID1375
Feature activation+1.09e+00

Pos logit contributions
 glad+0.005
 much+0.005
 Lightning+0.005
 And+0.005
 Thunder+0.005
Neg logit contributions
ge-0.011
ware-0.011
rily-0.011
ro-0.010
yt-0.010
Token had
Token ID550
Feature activation-7.79e-02

Pos logit contributions
ware+0.027
rily+0.027
ro+0.025
ge+0.025
driver+0.024
Neg logit contributions
 glad-0.019
 ost-0.019
ow-0.018
e-0.018
 much-0.018
Token beautiful
Token ID4950
Feature activation-7.61e-02

Pos logit contributions
 glad+0.001
ow+0.001
 ost+0.001
e+0.001
 much+0.001
Neg logit contributions
ware-0.002
rily-0.002
ro-0.002
ge-0.002
bell-0.002
Token.
Token ID13
Feature activation-7.22e-01

Pos logit contributions
 glad+0.006
ow+0.006
 ost+0.006
e+0.006
 much+0.005
Neg logit contributions
ware-0.010
rily-0.010
ro-0.009
ge-0.009
bell-0.009
Token He
Token ID679
Feature activation+1.14e+00

Pos logit contributions
 glad+0.012
 ost+0.012
ow+0.012
e+0.012
 afraid+0.011
Neg logit contributions
ware-0.018
rily-0.017
ro-0.017
bell-0.016
driver-0.016
Token even
Token ID772
Feature activation-8.73e-01

Pos logit contributions
ware+0.028
rily+0.028
ro+0.026
ge+0.025
bell+0.025
Neg logit contributions
 glad-0.020
ow-0.020
 ost-0.019
e-0.019
 much-0.019
Token saw
Token ID2497
Feature activation+2.67e-01

Pos logit contributions
 glad+0.015
e+0.015
 ost+0.015
 much+0.014
ow+0.014
Neg logit contributions
rily-0.022
ware-0.022
ro-0.020
ge-0.020
bell-0.019
Token a
Token ID257
Feature activation+2.37e+00

Pos logit contributions
ware+0.008
rily+0.008
ro+0.008
ge+0.008
bell+0.008
Neg logit contributions
 glad-0.003
ow-0.003
 ost-0.003
e-0.003
 much-0.003
Token mer
Token ID4017
Feature activation+8.03e+00

Pos logit contributions
rily+0.063
ware+0.063
ro+0.058
ge+0.058
bell+0.056
Neg logit contributions
 glad-0.038
 ost-0.036
 much-0.036
e-0.034
 afraid-0.034
Tokenmaid
Token ID23151
Feature activation+7.04e-01

Pos logit contributions
driver+0.127
ge+0.124
ware+0.117
Exc+0.111
 opener+0.110
Neg logit contributions
 glad-0.193
ow-0.186
 lives-0.185
 very-0.185
e-0.185
Token!
Token ID0
Feature activation-7.43e-01

Pos logit contributions
ware+0.019
rily+0.019
ro+0.017
ge+0.017
bell+0.017
Neg logit contributions
 glad-0.011
ow-0.010
 ost-0.010
e-0.010
 much-0.010
Token' '
Token ID220
Feature activation-1.33e+00

Pos logit contributions
 glad+0.012
 ost+0.012
e+0.012
ow+0.012
 afraid+0.011
Neg logit contributions
ware-0.018
rily-0.018
ro-0.018
bell-0.017
driver-0.016
Token'\n'
Token ID198
Feature activation+1.10e+00

Pos logit contributions
 glad+0.020
 ost+0.020
ow+0.019
e+0.019
 much+0.019
Neg logit contributions
ware-0.035
rily-0.035
ro-0.033
ge-0.032
bell-0.032
Token'\n'
Token ID198
Feature activation+1.47e+00

Pos logit contributions
ware+0.034
rily+0.034
ge+0.031
bell+0.031
ro+0.031
Neg logit contributions
 glad-0.014
 much-0.013
e-0.013
ow-0.012
 ost-0.012
Token again
Token ID757
Feature activation-2.33e+00

Pos logit contributions
ware+0.034
rily+0.034
ro+0.032
ge+0.031
yt+0.030
Neg logit contributions
 much-0.019
 glad-0.019
ow-0.019
 ost-0.018
e-0.017
Token.
Token ID13
Feature activation-5.41e-01

Pos logit contributions
 glad+0.039
ow+0.037
 ost+0.037
 much+0.037
e+0.037
Neg logit contributions
ware-0.060
rily-0.059
ro-0.055
ge-0.054
bell-0.053
Token He
Token ID679
Feature activation+1.21e+00

Pos logit contributions
 glad+0.010
 ost+0.010
e+0.009
ow+0.009
 much+0.009
Neg logit contributions
ware-0.013
rily-0.013
ro-0.012
bell-0.012
ge-0.012
Token said
Token ID531
Feature activation+1.63e+00

Pos logit contributions
ware+0.030
rily+0.030
ro+0.028
ge+0.027
bell+0.026
Neg logit contributions
 glad-0.021
ow-0.020
 ost-0.020
e-0.020
 much-0.020
Token the
Token ID262
Feature activation+3.23e-01

Pos logit contributions
ware+0.039
rily+0.039
ro+0.036
ge+0.035
bell+0.035
Neg logit contributions
 glad-0.030
ow-0.029
 ost-0.029
e-0.028
 much-0.028
Token mer
Token ID4017
Feature activation+8.00e+00

Pos logit contributions
ware+0.008
rily+0.008
bell+0.007
ro+0.007
ge+0.007
Neg logit contributions
ow-0.005
 glad-0.005
e-0.005
 ost-0.005
 much-0.005
Tokenmaid
Token ID23151
Feature activation+7.85e-01

Pos logit contributions
ge+0.135
driver+0.134
 opener+0.122
coat+0.117
 afford+0.116
Neg logit contributions
ow-0.184
 lives-0.179
 glad-0.177
e-0.169
ap-0.169
Token and
Token ID290
Feature activation+1.66e+00

Pos logit contributions
ware+0.018
rily+0.017
ro+0.016
bell+0.016
ge+0.016
Neg logit contributions
 glad-0.015
 ost-0.014
e-0.014
ow-0.014
 much-0.014
Token a
Token ID257
Feature activation+3.39e+00

Pos logit contributions
ware+0.042
rily+0.041
ro+0.038
ge+0.037
bell+0.037
Neg logit contributions
 glad-0.028
ow-0.027
e-0.027
 ost-0.027
 much-0.026
Token dolphin
Token ID44136
Feature activation+1.45e-01

Pos logit contributions
ware+0.082
rily+0.079
bell+0.075
ro+0.074
ge+0.072
Neg logit contributions
ow-0.060
e-0.059
 ost-0.057
 glad-0.056
aw-0.053
Token were
Token ID547
Feature activation-2.42e+00

Pos logit contributions
ware+0.004
rily+0.004
ro+0.003
ge+0.003
bell+0.003
Neg logit contributions
 glad-0.003
ow-0.002
 ost-0.002
e-0.002
 much-0.002
Token demands
Token ID8665
Feature activation+1.26e+00

Pos logit contributions
ware+0.040
rily+0.040
ro+0.037
ge+0.037
bell+0.036
Neg logit contributions
 glad-0.026
ow-0.025
 ost-0.025
e-0.024
 much-0.024
Token,
Token ID11
Feature activation+2.71e+00

Pos logit contributions
ware+0.028
rily+0.028
ro+0.026
ge+0.025
bell+0.025
Neg logit contributions
 glad-0.025
ow-0.024
 ost-0.024
e-0.024
 much-0.023
Token crab
Token ID32202
Feature activation-8.80e-01

Pos logit contributions
ware+0.066
rily+0.065
ro+0.061
ge+0.061
bell+0.059
Neg logit contributions
 glad-0.049
 ost-0.047
ow-0.046
 much-0.046
e-0.045
Token,"
Token ID553
Feature activation+6.19e-01

Pos logit contributions
 glad+0.019
 ost+0.018
ow+0.018
e+0.017
 much+0.017
Neg logit contributions
rily-0.018
ware-0.018
ro-0.017
ge-0.016
bell-0.016
Token the
Token ID262
Feature activation+4.64e-01

Pos logit contributions
rily+0.014
ware+0.014
ro+0.012
bell+0.012
ge+0.011
Neg logit contributions
 glad-0.012
ow-0.012
opl-0.011
ained-0.011
adow-0.011
Token mer
Token ID4017
Feature activation+7.94e+00

Pos logit contributions
ware+0.010
rily+0.010
ge+0.009
ro+0.009
bell+0.009
Neg logit contributions
 glad-0.010
 ost-0.009
ow-0.009
 afraid-0.009
 much-0.009
Tokenmaid
Token ID23151
Feature activation+7.77e-03

Pos logit contributions
driver+0.108
coat+0.101
pron+0.100
 ribbon+0.099
 markers+0.099
Neg logit contributions
ow-0.208
 glad-0.205
 afraid-0.197
e-0.196
 lives-0.192
Token said
Token ID531
Feature activation+2.01e+00

Pos logit contributions
rily+0.000
ware+0.000
ro+0.000
ge+0.000
bell+0.000
Neg logit contributions
 glad-0.000
 ost-0.000
ow-0.000
e-0.000
 much-0.000
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
ware+0.050
rily+0.049
ro+0.046
ge+0.045
bell+0.045
Neg logit contributions
 glad-0.035
ow-0.034
 ost-0.033
e-0.033
 much-0.033
Token "
Token ID366
Feature activation+4.64e+00

Pos logit contributions
 glad+0.002
ow+0.002
 ost+0.002
e+0.002
 much+0.002
Neg logit contributions
ware-0.004
rily-0.003
ro-0.003
ge-0.003
bell-0.003
TokenI
Token ID40
Feature activation+1.82e+00

Pos logit contributions
ware+0.125
rily+0.122
ge+0.111
yt+0.110
bell+0.110
Neg logit contributions
e-0.076
 glad-0.074
 ost-0.069
 much-0.068
ow-0.067
Token sun
Token ID4252
Feature activation-4.54e-01

Pos logit contributions
ow+0.076
 glad+0.069
e+0.068
 ost+0.067
 afraid+0.063
Neg logit contributions
ware-0.088
rily-0.086
ro-0.084
bell-0.082
ly-0.077
Token.
Token ID13
Feature activation-8.33e-01

Pos logit contributions
ow+0.007
 ost+0.006
e+0.006
 glad+0.006
 much+0.006
Neg logit contributions
ware-0.012
rily-0.012
ge-0.012
ro-0.011
driver-0.011
Token'\n'
Token ID198
Feature activation+1.27e+00

Pos logit contributions
 glad+0.015
 ost+0.015
ow+0.015
e+0.015
 afraid+0.014
Neg logit contributions
ware-0.019
ro-0.019
rily-0.018
bell-0.018
fully-0.017
Token'\n'
Token ID198
Feature activation+1.28e+00

Pos logit contributions
ware+0.033
rily+0.032
ro+0.031
ge+0.030
bell+0.030
Neg logit contributions
 glad-0.020
ow-0.020
 ost-0.020
e-0.019
 much-0.018
TokenA
Token ID32
Feature activation+2.73e+00

Pos logit contributions
ro+0.021
ware+0.021
bell+0.020
ge+0.020
rily+0.019
Neg logit contributions
ow-0.032
 glad-0.031
 ost-0.030
e-0.029
 sorry-0.029
Token mer
Token ID4017
Feature activation+7.93e+00

Pos logit contributions
ware+0.049
rily+0.047
ro+0.045
bell+0.043
ge+0.042
Neg logit contributions
ow-0.065
 glad-0.064
e-0.064
 ost-0.062
 much-0.060
Tokenmaid
Token ID23151
Feature activation+3.45e-01

Pos logit contributions
ge+0.116
driver+0.112
ware+0.106
clips+0.104
 opener+0.103
Neg logit contributions
ow-0.207
 glad-0.199
 lives-0.196
 afraid-0.193
e-0.191
Token that
Token ID326
Feature activation+1.53e+00

Pos logit contributions
ware+0.009
rily+0.009
ro+0.009
ge+0.008
bell+0.008
Neg logit contributions
 glad-0.005
ow-0.005
 ost-0.005
 much-0.005
e-0.005
Token lived
Token ID5615
Feature activation+4.80e-02

Pos logit contributions
ware+0.026
yt+0.021
ro+0.020
bell+0.020
 stuffed+0.019
Neg logit contributions
ow-0.041
aw-0.036
opl-0.036
 glad-0.036
e-0.036
Token in
Token ID287
Feature activation-3.27e+00

Pos logit contributions
ware+0.001
rily+0.001
ro+0.001
ge+0.001
bell+0.001
Neg logit contributions
 glad-0.001
ow-0.001
 ost-0.001
e-0.001
 much-0.001
Token the
Token ID262
Feature activation-2.62e+00

Pos logit contributions
 ost+0.052
 glad+0.051
aw+0.050
ow+0.050
orry+0.048
Neg logit contributions
ware-0.079
rily-0.076
ro-0.075
 unw-0.070
bell-0.069
Token passport
Token ID17981
Feature activation+3.35e-01

Pos logit contributions
ware+0.006
rily+0.006
ro+0.006
bell+0.006
ge+0.006
Neg logit contributions
 glad-0.005
ow-0.005
e-0.005
 ost-0.005
 much-0.005
Token belonged
Token ID19611
Feature activation-1.43e+00

Pos logit contributions
rily+0.009
ware+0.009
ro+0.008
ge+0.008
bell+0.008
Neg logit contributions
 glad-0.005
ow-0.005
 ost-0.005
e-0.005
 much-0.005
Token to
Token ID284
Feature activation-1.68e-01

Pos logit contributions
 glad+0.031
ow+0.031
 much+0.031
e+0.030
 ost+0.030
Neg logit contributions
ware-0.030
rily-0.029
ro-0.026
ge-0.026
bell-0.025
Token a
Token ID257
Feature activation+2.31e+00

Pos logit contributions
 glad+0.002
ow+0.002
 ost+0.002
e+0.002
 much+0.002
Neg logit contributions
ware-0.005
rily-0.005
ro-0.005
ge-0.005
bell-0.005
Token magical
Token ID10883
Feature activation+3.19e+00

Pos logit contributions
ware+0.060
rily+0.060
ro+0.055
ge+0.055
bell+0.053
Neg logit contributions
 glad-0.038
 ost-0.036
 much-0.036
e-0.035
ow-0.035
Token mer
Token ID4017
Feature activation+7.83e+00

Pos logit contributions
rily+0.078
ware+0.078
ge+0.072
ro+0.072
bell+0.069
Neg logit contributions
 glad-0.056
 ost-0.054
 much-0.053
ow-0.053
e-0.053
Tokenmaid
Token ID23151
Feature activation-1.47e-01

Pos logit contributions
driver+0.117
 opener+0.116
ge+0.111
Exc+0.107
uit+0.104
Neg logit contributions
 lives-0.186
adow-0.185
ow-0.185
 glad-0.181
 very-0.181
Token who
Token ID508
Feature activation-1.50e+00

Pos logit contributions
 glad+0.002
ow+0.002
e+0.002
 ost+0.002
 much+0.002
Neg logit contributions
ware-0.004
rily-0.004
ro-0.004
ge-0.004
bell-0.003
Token had
Token ID550
Feature activation-7.60e-01

Pos logit contributions
 glad+0.025
 ost+0.024
ow+0.024
 much+0.023
e+0.022
Neg logit contributions
ware-0.038
rily-0.037
ro-0.035
bell-0.034
ge-0.034
Token been
Token ID587
Feature activation-2.70e+00

Pos logit contributions
 glad+0.012
ow+0.012
 ost+0.011
e+0.011
 much+0.011
Neg logit contributions
ware-0.020
rily-0.020
ro-0.019
ge-0.018
bell-0.018
Token visiting
Token ID10013
Feature activation+6.75e-01

Pos logit contributions
 glad+0.043
ow+0.041
 ost+0.041
e+0.040
 much+0.039
Neg logit contributions
ware-0.069
rily-0.069
ro-0.064
ge-0.064
bell-0.063
Token castle
Token ID16669
Feature activation+1.07e+00

Pos logit contributions
ware+0.032
rily+0.032
ge+0.030
ro+0.030
bell+0.029
Neg logit contributions
 glad-0.026
ow-0.025
 ost-0.025
e-0.025
 much-0.024
Token so
Token ID523
Feature activation-3.55e+00

Pos logit contributions
rily+0.031
ware+0.031
ro+0.028
ge+0.027
oph+0.027
Neg logit contributions
 much-0.015
 very-0.015
ow-0.015
adow-0.014
e-0.013
Token much
Token ID881
Feature activation-2.26e+00

Pos logit contributions
 glad+0.052
e+0.052
 ost+0.051
ow+0.051
 much+0.046
Neg logit contributions
rily-0.096
ware-0.096
ro-0.089
bell-0.088
ge-0.087
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
 much+0.029
 glad+0.028
 afraid+0.027
 ost+0.026
 very+0.026
Neg logit contributions
rily-0.069
ware-0.067
ge-0.063
ro-0.062
driver-0.061
Token The
Token ID383
Feature activation+1.52e-01

Pos logit contributions
ware+0.000
rily+0.000
ge+0.000
yt+0.000
ro+0.000
Neg logit contributions
 glad-0.000
 Lightning-0.000
ow-0.000
 Thunder-0.000
 much-0.000
Token mer
Token ID4017
Feature activation+7.73e+00

Pos logit contributions
ware+0.003
rily+0.002
ro+0.002
ge+0.002
bell+0.002
Neg logit contributions
 glad-0.004
 ost-0.004
ow-0.004
e-0.004
 much-0.004
Tokenmaid
Token ID23151
Feature activation+3.63e-01

Pos logit contributions
ge+0.112
driver+0.108
 opener+0.097
ware+0.096
coat+0.095
Neg logit contributions
 glad-0.198
ow-0.198
 lives-0.197
e-0.189
 afraid-0.186
Token said
Token ID531
Feature activation+1.79e+00

Pos logit contributions
ware+0.009
rily+0.009
ro+0.008
ge+0.008
bell+0.008
Neg logit contributions
 glad-0.006
ow-0.006
 ost-0.006
e-0.006
 much-0.006
Token,
Token ID11
Feature activation+3.17e+00

Pos logit contributions
ware+0.050
rily+0.049
ro+0.046
ge+0.046
bell+0.045
Neg logit contributions
 glad-0.026
ow-0.025
 ost-0.025
e-0.024
 much-0.024
Token "
Token ID366
Feature activation+4.13e+00

Pos logit contributions
ware+0.078
rily+0.076
ge+0.073
ro+0.072
yt+0.069
Neg logit contributions
 glad-0.059
 much-0.054
ow-0.053
 ost-0.053
 afraid-0.052
TokenI
Token ID40
Feature activation+1.46e+00

Pos logit contributions
ware+0.105
rily+0.104
ro+0.094
ge+0.094
bell+0.093
Neg logit contributions
e-0.069
 glad-0.069
 ost-0.068
ow-0.067
 much-0.066
Token'\n'
Token ID198
Feature activation+7.45e-01

Pos logit contributions
 glad+0.017
 much+0.016
 And+0.016
 Lightning+0.015
 Thunder+0.015
Neg logit contributions
ware-0.025
ge-0.025
rily-0.024
ro-0.023
bell-0.022
Token'\n'
Token ID198
Feature activation+1.22e+00

Pos logit contributions
ware+0.026
rily+0.025
ro+0.024
ge+0.024
bell+0.023
Neg logit contributions
 glad-0.007
 much-0.007
 many-0.006
<|endoftext|>-0.006
e-0.006
TokenJack
Token ID14295
Feature activation-3.28e-01

Pos logit contributions
ware+0.028
rily+0.027
ro+0.024
ge+0.024
bell+0.023
Neg logit contributions
 glad-0.026
 much-0.026
e-0.025
 many-0.023
aw-0.023
Token asked
Token ID1965
Feature activation+2.02e+00

Pos logit contributions
 glad+0.006
ow+0.006
 ost+0.006
e+0.006
 much+0.006
Neg logit contributions
ware-0.008
rily-0.007
ro-0.007
ge-0.007
bell-0.007
Token the
Token ID262
Feature activation+8.21e-01

Pos logit contributions
ware+0.070
rily+0.070
ro+0.068
ge+0.067
bell+0.065
Neg logit contributions
 glad-0.016
 much-0.015
ow-0.014
 ost-0.014
e-0.013
Token mer
Token ID4017
Feature activation+7.71e+00

Pos logit contributions
rily+0.016
ware+0.015
ge+0.015
driver+0.014
ro+0.014
Neg logit contributions
 glad-0.019
 ost-0.018
 afraid-0.018
 much-0.018
 very-0.018
Tokenmaid
Token ID23151
Feature activation+6.84e-01

Pos logit contributions
ge+0.124
driver+0.124
coat+0.119
 afford+0.117
 checkout+0.115
Neg logit contributions
ow-0.178
 lives-0.169
 ost-0.165
 glad-0.165
e-0.163
Token why
Token ID1521
Feature activation+3.34e-02

Pos logit contributions
ware+0.017
rily+0.017
ro+0.016
ge+0.016
bell+0.015
Neg logit contributions
 glad-0.012
e-0.011
 ost-0.011
ow-0.011
 much-0.011
Token the
Token ID262
Feature activation+3.23e-01

Pos logit contributions
ware+0.001
rily+0.001
ro+0.001
ge+0.001
bell+0.001
Neg logit contributions
 glad-0.001
ow-0.001
 ost-0.001
e-0.001
 much-0.001
Token waves
Token ID9813
Feature activation+1.57e+00

Pos logit contributions
rily+0.006
ware+0.006
ge+0.006
driver+0.006
ro+0.006
Neg logit contributions
 glad-0.007
 ost-0.007
 afraid-0.007
 much-0.007
 lives-0.007
Token were
Token ID547
Feature activation-2.09e+00

Pos logit contributions
ware+0.040
rily+0.040
ro+0.037
ge+0.036
driver+0.035
Neg logit contributions
 glad-0.026
ow-0.026
e-0.025
 ost-0.025
 much-0.025
Token yes
Token ID3763
Feature activation+3.20e+00

Pos logit contributions
ware+0.045
rily+0.044
ge+0.042
ro+0.042
bell+0.040
Neg logit contributions
 glad-0.034
 much-0.033
ow-0.032
 ost-0.030
e-0.030
Token.
Token ID13
Feature activation+4.61e-01

Pos logit contributions
ware+0.109
rily+0.098
ro+0.094
ge+0.091
driver+0.091
Neg logit contributions
 much-0.040
 very-0.039
ow-0.039
 glad-0.036
e-0.030
Token She
Token ID1375
Feature activation+1.85e+00

Pos logit contributions
ware+0.015
rily+0.014
ge+0.014
ro+0.013
pron+0.013
Neg logit contributions
 glad-0.005
 much-0.005
 Lightning-0.005
 And-0.005
 many-0.005
Token followed
Token ID3940
Feature activation+2.42e+00

Pos logit contributions
ware+0.047
rily+0.046
ge+0.044
ro+0.043
driver+0.041
Neg logit contributions
 glad-0.031
ow-0.030
 much-0.030
e-0.029
 ost-0.029
Token the
Token ID262
Feature activation+1.11e-01

Pos logit contributions
ware+0.078
rily+0.076
ro+0.072
ge+0.071
bell+0.069
Neg logit contributions
 glad-0.025
ow-0.024
e-0.024
 much-0.023
 ost-0.023
Token mer
Token ID4017
Feature activation+7.61e+00

Pos logit contributions
ware+0.003
rily+0.003
ro+0.002
ge+0.002
bell+0.002
Neg logit contributions
 glad-0.002
ow-0.002
 ost-0.002
e-0.002
 much-0.002
Tokenmaid
Token ID23151
Feature activation-4.95e-01

Pos logit contributions
driver+0.118
coat+0.116
ge+0.115
pron+0.107
ware+0.104
Neg logit contributions
 lives-0.176
ow-0.170
ap-0.170
e-0.169
 ost-0.166
Token through
Token ID832
Feature activation-2.58e+00

Pos logit contributions
 glad+0.009
ow+0.009
 ost+0.009
e+0.009
 much+0.009
Neg logit contributions
ware-0.012
rily-0.011
ro-0.011
ge-0.010
bell-0.010
Token the
Token ID262
Feature activation-2.30e+00

Pos logit contributions
 glad+0.025
ow+0.023
 ost+0.023
e+0.022
 much+0.021
Neg logit contributions
ware-0.084
rily-0.084
ro-0.080
ge-0.079
bell-0.077
Token hidden
Token ID7104
Feature activation+2.67e+00

Pos logit contributions
ow+0.041
 glad+0.039
e+0.038
 ost+0.035
 afraid+0.034
Neg logit contributions
ware-0.057
rily-0.056
ro-0.054
bell-0.054
yt-0.051
Token door
Token ID3420
Feature activation-1.69e+00

Pos logit contributions
ware+0.065
rily+0.065
ro+0.060
ge+0.060
bell+0.058
Neg logit contributions
 glad-0.046
 ost-0.045
ow-0.044
e-0.044
 much-0.044
Token water
Token ID1660
Feature activation-3.91e-01

Pos logit contributions
ow+0.023
 glad+0.022
e+0.021
 ost+0.020
 afraid+0.020
Neg logit contributions
ware-0.029
rily-0.029
ro-0.027
bell-0.026
ge-0.025
Token.
Token ID13
Feature activation-7.41e-01

Pos logit contributions
 ost+0.005
 glad+0.005
ow+0.005
 much+0.005
e+0.005
Neg logit contributions
ware-0.011
rily-0.011
ge-0.010
ro-0.010
bell-0.010
Token'\n'
Token ID198
Feature activation+4.76e-01

Pos logit contributions
 glad+0.010
 much+0.009
ow+0.009
e+0.009
 ost+0.009
Neg logit contributions
ware-0.022
rily-0.022
ge-0.020
ro-0.020
bell-0.019
Token'\n'
Token ID198
Feature activation+1.23e+00

Pos logit contributions
ware+0.014
rily+0.014
ro+0.013
ge+0.013
bell+0.012
Neg logit contributions
 glad-0.007
 much-0.006
ow-0.006
e-0.006
 ost-0.006
TokenThe
Token ID464
Feature activation-1.29e-01

Pos logit contributions
ware+0.033
rily+0.033
ge+0.029
ro+0.029
yt+0.028
Neg logit contributions
 glad-0.020
 much-0.020
e-0.020
 many-0.018
ow-0.018
Token mer
Token ID4017
Feature activation+7.59e+00

Pos logit contributions
ow+0.003
 glad+0.003
e+0.003
 ost+0.003
 much+0.003
Neg logit contributions
ware-0.003
rily-0.002
ro-0.002
bell-0.002
ge-0.002
Tokenmaid
Token ID23151
Feature activation-3.93e-01

Pos logit contributions
ge+0.112
driver+0.109
coat+0.107
pron+0.099
ware+0.098
Neg logit contributions
 lives-0.186
 glad-0.183
e-0.176
ap-0.176
aw-0.174
Token smiled
Token ID13541
Feature activation+1.16e-01

Pos logit contributions
 glad+0.007
ow+0.007
 ost+0.007
e+0.007
 much+0.007
Neg logit contributions
ware-0.009
rily-0.009
ro-0.008
ge-0.008
bell-0.008
Token at
Token ID379
Feature activation-9.93e-02

Pos logit contributions
ware+0.003
rily+0.003
ge+0.003
ro+0.003
driver+0.003
Neg logit contributions
 glad-0.002
 much-0.002
ow-0.002
 very-0.002
 afraid-0.002
Token Melody
Token ID38906
Feature activation-2.19e+00

Pos logit contributions
 glad+0.002
e+0.002
ow+0.001
 much+0.001
 ost+0.001
Neg logit contributions
ware-0.003
rily-0.003
ro-0.002
ge-0.002
bell-0.002
Token.
Token ID13
Feature activation+3.43e-01

Pos logit contributions
 glad+0.046
 ost+0.046
ow+0.045
e+0.044
 afraid+0.043
Neg logit contributions
rily-0.046
ware-0.045
ro-0.042
ge-0.041
bell-0.040
Token made
Token ID925
Feature activation-4.88e-01

Pos logit contributions
ware+0.025
rily+0.024
ro+0.023
ge+0.022
bell+0.021
Neg logit contributions
 glad-0.023
ow-0.022
e-0.022
 ost-0.021
 much-0.021
Token.
Token ID13
Feature activation-1.19e+00

Pos logit contributions
 ost+0.008
 glad+0.008
ow+0.008
 afraid+0.008
e+0.007
Neg logit contributions
rily-0.012
ware-0.012
fully-0.010
ge-0.010
ro-0.010
Token
Token ID220
Feature activation-1.77e+00

Pos logit contributions
 ost+0.023
ow+0.021
 afraid+0.021
e+0.021
 glad+0.021
Neg logit contributions
rily-0.027
ro-0.026
ware-0.025
bell-0.024
oph-0.023
Token
Token ID198
Feature activation+3.46e-01

Pos logit contributions
 ost+0.028
 glad+0.028
ow+0.027
e+0.027
 afraid+0.026
Neg logit contributions
rily-0.045
ware-0.045
ro-0.043
ge-0.041
bell-0.040
TokenThe
Token ID464
Feature activation-5.61e-01

Pos logit contributions
rily+0.009
ware+0.009
ro+0.009
ge+0.008
bell+0.008
Neg logit contributions
ow-0.005
 ost-0.005
 glad-0.005
 afraid-0.005
e-0.005
Token mer
Token ID4017
Feature activation+7.55e+00

Pos logit contributions
ow+0.008
 glad+0.008
e+0.008
 ost+0.008
 much+0.007
Neg logit contributions
ware-0.015
rily-0.015
ro-0.014
bell-0.014
ge-0.014
Tokenmaid
Token ID23151
Feature activation-2.38e-01

Pos logit contributions
driver+0.114
 opener+0.112
ge+0.110
ware+0.102
coat+0.100
Neg logit contributions
ow-0.181
 glad-0.180
 lives-0.180
ap-0.175
ang-0.170
Token learnt
Token ID26338
Feature activation+3.07e-01

Pos logit contributions
 glad+0.005
ow+0.004
 ost+0.004
e+0.004
 much+0.004
Neg logit contributions
ware-0.005
rily-0.005
ro-0.005
ge-0.005
bell-0.005
Token that
Token ID326
Feature activation+1.33e+00

Pos logit contributions
rily+0.009
ware+0.009
ro+0.008
ge+0.008
bell+0.008
Neg logit contributions
 glad-0.004
e-0.004
ow-0.004
 ost-0.004
 afraid-0.004
Token taking
Token ID2263
Feature activation-6.61e-02

Pos logit contributions
ware+0.030
rily+0.028
ge+0.028
ro+0.027
fully+0.027
Neg logit contributions
ow-0.025
aw-0.023
 ost-0.023
e-0.023
 glad-0.023
Token risks
Token ID7476
Feature activation+8.40e-02

Pos logit contributions
 glad+0.001
 ost+0.001
ow+0.001
 afraid+0.001
adow+0.001
Neg logit contributions
rily-0.002
ware-0.001
ro-0.001
ly-0.001
fully-0.001
Token the
Token ID262
Feature activation-1.86e+00

Pos logit contributions
 glad+0.028
 ost+0.027
ow+0.026
e+0.025
 much+0.025
Neg logit contributions
ware-0.050
rily-0.049
ro-0.046
ge-0.046
bell-0.045
Token clam
Token ID30726
Feature activation-1.37e+00

Pos logit contributions
 glad+0.037
 ost+0.035
 much+0.034
ow+0.034
e+0.033
Neg logit contributions
ware-0.042
rily-0.042
ro-0.040
ge-0.039
bell-0.038
Token stepped
Token ID10764
Feature activation+2.97e-01

Pos logit contributions
 glad+0.026
ow+0.025
 ost+0.025
e+0.024
 much+0.024
Neg logit contributions
ware-0.032
rily-0.032
ro-0.029
ge-0.029
bell-0.028
Token a
Token ID257
Feature activation+2.27e+00

Pos logit contributions
ware+0.007
rily+0.007
ro+0.007
ge+0.007
bell+0.007
Neg logit contributions
 glad-0.005
 ost-0.005
ow-0.005
e-0.005
 much-0.005
Token beautiful
Token ID4950
Feature activation+1.96e+00

Pos logit contributions
ware+0.060
rily+0.060
ro+0.055
ge+0.055
bell+0.053
Neg logit contributions
 glad-0.037
 much-0.034
 ost-0.034
e-0.033
ow-0.033
Token mer
Token ID4017
Feature activation+7.46e+00

Pos logit contributions
rily+0.045
ware+0.044
ge+0.042
driver+0.042
ro+0.040
Neg logit contributions
 glad-0.036
 ost-0.035
 much-0.035
e-0.035
 very-0.034
Tokenmaid
Token ID23151
Feature activation-4.75e-02

Pos logit contributions
driver+0.115
ge+0.107
ware+0.100
coat+0.100
 opener+0.099
Neg logit contributions
ow-0.178
 glad-0.172
 lives-0.170
 very-0.170
 ost-0.169
Token.
Token ID13
Feature activation-5.81e-01

Pos logit contributions
 glad+0.001
ow+0.001
e+0.001
 ost+0.001
 much+0.001
Neg logit contributions
ware-0.001
rily-0.001
ro-0.001
ge-0.001
bell-0.001
Token "
Token ID366
Feature activation+3.76e+00

Pos logit contributions
 glad+0.009
 ost+0.009
e+0.008
ow+0.008
 much+0.008
Neg logit contributions
ware-0.016
rily-0.016
ro-0.015
bell-0.014
ge-0.014
TokenI
Token ID40
Feature activation+1.37e+00

Pos logit contributions
ware+0.088
rily+0.086
ge+0.078
ro+0.078
driver+0.076
Neg logit contributions
e-0.073
 glad-0.072
 ost-0.071
 much-0.068
ow-0.068
Token heard
Token ID2982
Feature activation-8.51e-01

Pos logit contributions
rily+0.036
ware+0.036
ro+0.032
ge+0.032
bell+0.031
Neg logit contributions
 glad-0.023
ow-0.022
e-0.022
 much-0.022
 afraid-0.021
Token was
Token ID373
Feature activation-1.72e+00

Pos logit contributions
ware+0.047
rily+0.046
ro+0.043
ge+0.043
bell+0.041
Neg logit contributions
 glad-0.035
ow-0.033
e-0.033
 ost-0.032
 much-0.032
Token so
Token ID523
Feature activation-4.86e+00

Pos logit contributions
 ost+0.023
 lives+0.020
ow+0.020
 Lightning+0.020
 Thunder+0.020
Neg logit contributions
rily-0.049
ware-0.044
bell-0.044
ge-0.043
fully-0.042
Token thankful
Token ID25535
Feature activation+1.44e+00

Pos logit contributions
 ost+0.070
ow+0.062
e+0.060
 Lightning+0.059
 Thunder+0.058
Neg logit contributions
rily-0.133
bell-0.124
ware-0.124
ge-0.121
ro-0.120
Token that
Token ID326
Feature activation+4.80e-01

Pos logit contributions
ware+0.040
rily+0.040
ro+0.037
ge+0.037
bell+0.036
Neg logit contributions
 glad-0.022
ow-0.020
 much-0.020
 ost-0.020
e-0.019
Token the
Token ID262
Feature activation-2.49e-01

Pos logit contributions
ware+0.010
rily+0.010
ro+0.009
ge+0.009
bell+0.009
Neg logit contributions
ow-0.010
 glad-0.009
e-0.009
 ost-0.009
aw-0.009
Token mer
Token ID4017
Feature activation+7.43e+00

Pos logit contributions
 glad+0.005
 ost+0.005
 much+0.005
e+0.005
ow+0.005
Neg logit contributions
rily-0.006
ware-0.006
ro-0.005
ge-0.005
bell-0.005
Tokenmaid
Token ID23151
Feature activation+2.44e-01

Pos logit contributions
driver+0.110
coat+0.110
ge+0.105
 ribbon+0.102
uit+0.099
Neg logit contributions
 lives-0.182
 glad-0.177
 ost-0.165
e-0.165
ow-0.164
Token had
Token ID550
Feature activation-3.50e-01

Pos logit contributions
ware+0.006
rily+0.006
ro+0.005
ge+0.005
bell+0.005
Neg logit contributions
 glad-0.005
ow-0.004
 ost-0.004
e-0.004
 much-0.004
Token forgiven
Token ID32350
Feature activation+1.87e+00

Pos logit contributions
 glad+0.006
ow+0.005
e+0.005
 much+0.005
 ost+0.005
Neg logit contributions
ware-0.009
rily-0.009
ro-0.009
ge-0.008
bell-0.008
Token him
Token ID683
Feature activation+2.97e-01

Pos logit contributions
ware+0.053
rily+0.052
ge+0.049
ro+0.048
bell+0.048
Neg logit contributions
 glad-0.027
ow-0.026
e-0.026
 much-0.025
 ost-0.025
Token.
Token ID13
Feature activation-9.34e-01

Pos logit contributions
ware+0.007
rily+0.007
ro+0.007
ge+0.007
bell+0.006
Neg logit contributions
 glad-0.005
ow-0.005
 ost-0.005
e-0.005
 much-0.005
Token sometimes
Token ID3360
Feature activation-3.53e-01

Pos logit contributions
rily+0.004
ware+0.004
ro+0.003
ge+0.003
bell+0.003
Neg logit contributions
 glad-0.003
 ost-0.003
e-0.003
ow-0.002
 much-0.002
Token pretend
Token ID16614
Feature activation+1.37e+00

Pos logit contributions
 glad+0.006
ow+0.006
 ost+0.006
e+0.006
 much+0.006
Neg logit contributions
ware-0.008
rily-0.008
ro-0.008
ge-0.008
bell-0.008
Token to
Token ID284
Feature activation+1.27e+00

Pos logit contributions
ware+0.029
rily+0.027
ro+0.026
bell+0.025
ge+0.024
Neg logit contributions
 glad-0.029
ow-0.028
e-0.028
 ost-0.027
 afraid-0.026
Token be
Token ID307
Feature activation-3.12e+00

Pos logit contributions
ware+0.028
rily+0.028
ro+0.026
ge+0.026
bell+0.025
Neg logit contributions
 glad-0.025
ow-0.024
 ost-0.024
e-0.024
 much-0.023
Token a
Token ID257
Feature activation+2.31e+00

Pos logit contributions
 glad+0.045
ow+0.044
 ost+0.042
e+0.041
 much+0.039
Neg logit contributions
ware-0.086
rily-0.084
ro-0.081
ge-0.078
bell-0.076
Token mer
Token ID4017
Feature activation+7.40e+00

Pos logit contributions
rily+0.061
ware+0.060
ge+0.058
ro+0.056
driver+0.054
Neg logit contributions
 glad-0.039
 much-0.038
 ost-0.036
 very-0.035
 afraid-0.035
Tokenmaid
Token ID23151
Feature activation+3.20e-01

Pos logit contributions
driver+0.119
ge+0.116
boards+0.111
 opener+0.110
ware+0.108
Neg logit contributions
ow-0.178
ap-0.168
e-0.163
 ost-0.162
 lives-0.160
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
rily+0.008
ware+0.008
ge+0.008
ro+0.008
bell+0.007
Neg logit contributions
ow-0.005
 ost-0.005
 glad-0.005
e-0.005
 much-0.005
Token
Token ID220
Feature activation-1.20e+00

Pos logit contributions
 glad+0.001
 much+0.001
ow+0.001
 ost+0.001
e+0.001
Neg logit contributions
rily-0.003
ware-0.003
ge-0.003
ro-0.003
bell-0.003
Token
Token ID198
Feature activation+1.71e+00

Pos logit contributions
 glad+0.017
ow+0.016
 much+0.015
e+0.015
 ost+0.015
Neg logit contributions
ware-0.034
rily-0.034
ro-0.032
ge-0.032
bell-0.031
Token
Token ID198
Feature activation+1.97e+00

Pos logit contributions
ware+0.053
rily+0.053
ro+0.050
ge+0.049
driver+0.048
Neg logit contributions
 glad-0.020
 much-0.019
e-0.018
ow-0.018
 ost-0.017
Token Joe
Token ID5689
Feature activation-9.98e-01

Pos logit contributions
 ost+0.015
 glad+0.014
e+0.014
ow+0.014
 afraid+0.013
Neg logit contributions
rily-0.015
ware-0.015
ro-0.014
bell-0.014
ge-0.013
Token waved
Token ID26834
Feature activation+3.81e-01

Pos logit contributions
 glad+0.019
ow+0.018
 ost+0.018
e+0.018
 much+0.018
Neg logit contributions
ware-0.023
rily-0.023
ro-0.021
ge-0.021
bell-0.021
Token goodbye
Token ID24829
Feature activation-2.97e-01

Pos logit contributions
ware+0.011
ge+0.010
rily+0.010
ro+0.010
driver+0.009
Neg logit contributions
 glad-0.006
 much-0.005
ow-0.005
 very-0.005
e-0.005
Token to
Token ID284
Feature activation+1.11e+00

Pos logit contributions
 glad+0.006
ow+0.006
 much+0.006
 very+0.006
 ost+0.006
Neg logit contributions
ware-0.007
rily-0.006
ge-0.006
ro-0.006
bell-0.006
Token the
Token ID262
Feature activation-7.79e-01

Pos logit contributions
ware+0.033
rily+0.033
ro+0.031
ge+0.031
bell+0.030
Neg logit contributions
 glad-0.014
ow-0.013
 ost-0.013
e-0.012
 much-0.012
Token mer
Token ID4017
Feature activation+7.34e+00

Pos logit contributions
ow+0.020
 glad+0.020
e+0.020
 ost+0.020
 much+0.020
Neg logit contributions
ware-0.012
rily-0.012
ro-0.011
bell-0.010
ge-0.010
Tokenmaid
Token ID23151
Feature activation-6.06e-01

Pos logit contributions
driver+0.134
ge+0.117
 opener+0.112
coat+0.108
 afford+0.107
Neg logit contributions
 glad-0.166
ow-0.165
 lives-0.163
 ost-0.160
e-0.160
Token,
Token ID11
Feature activation+4.07e+00

Pos logit contributions
 glad+0.012
 ost+0.012
ow+0.012
e+0.012
 much+0.012
Neg logit contributions
ware-0.013
rily-0.013
ro-0.012
ge-0.012
bell-0.012
Token and
Token ID290
Feature activation+4.34e-01

Pos logit contributions
ware+0.097
rily+0.094
ro+0.088
ge+0.086
bell+0.084
Neg logit contributions
 glad-0.080
ow-0.074
 ost-0.073
 much-0.073
e-0.072
Token hopped
Token ID45230
Feature activation+2.39e+00

Pos logit contributions
rily+0.010
ware+0.009
ro+0.009
ge+0.009
fully+0.008
Neg logit contributions
 glad-0.008
ow-0.008
 ost-0.008
e-0.008
 much-0.008
Token back
Token ID736
Feature activation-1.31e+00

Pos logit contributions
ware+0.062
rily+0.058
ge+0.056
ro+0.055
bell+0.054
Neg logit contributions
 glad-0.041
 much-0.039
ow-0.039
e-0.039
 very-0.037
Token.
Token ID13
Feature activation-6.04e-01

Pos logit contributions
ware+0.042
rily+0.041
ge+0.038
ro+0.038
bell+0.037
Neg logit contributions
ow-0.024
 glad-0.024
e-0.023
 much-0.023
 ost-0.022
Token "
Token ID366
Feature activation+4.23e+00

Pos logit contributions
 glad+0.010
 much+0.009
 Lightning+0.009
 afraid+0.009
 And+0.009
Neg logit contributions
ware-0.017
rily-0.016
ge-0.016
yt-0.015
uing-0.015
TokenI
Token ID40
Feature activation+1.73e+00

Pos logit contributions
ware+0.099
rily+0.098
ge+0.087
yt+0.087
bell+0.085
Neg logit contributions
e-0.083
 glad-0.082
 much-0.077
 ost-0.076
ow-0.075
Token am
Token ID716
Feature activation-2.03e+00

Pos logit contributions
rily+0.051
ware+0.050
ge+0.046
ro+0.045
driver+0.045
Neg logit contributions
e-0.024
 glad-0.024
ow-0.023
 much-0.022
 afraid-0.022
Token a
Token ID257
Feature activation+2.55e+00

Pos logit contributions
ow+0.029
opl+0.028
aw+0.028
 ost+0.028
e+0.027
Neg logit contributions
rily-0.051
ware-0.048
ro-0.044
bell-0.044
yt-0.043
Token mer
Token ID4017
Feature activation+7.32e+00

Pos logit contributions
ware+0.065
rily+0.065
ge+0.060
ro+0.060
bell+0.058
Neg logit contributions
 glad-0.044
 ost-0.041
 much-0.041
e-0.040
ow-0.040
Tokenmaid
Token ID23151
Feature activation-3.02e-01

Pos logit contributions
ware+0.105
ge+0.103
driver+0.102
Exc+0.097
boards+0.094
Neg logit contributions
ow-0.193
 glad-0.182
 sorry-0.179
ap-0.178
e-0.177
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
 glad+0.006
ow+0.006
 ost+0.006
e+0.006
 much+0.006
Neg logit contributions
ware-0.007
rily-0.007
ro-0.006
ge-0.006
bell-0.006
Token Do
Token ID2141
Feature activation+1.19e+00

Pos logit contributions
 glad+0.033
 ost+0.032
ow+0.032
e+0.032
 much+0.031
Neg logit contributions
ware-0.038
rily-0.038
ro-0.036
bell-0.034
ge-0.034
Token you
Token ID345
Feature activation+4.92e-01

Pos logit contributions
ware+0.026
ro+0.025
rily+0.024
fully+0.023
bell+0.023
Neg logit contributions
 glad-0.021
 afraid-0.020
 ost-0.020
ow-0.019
e-0.019
Token want
Token ID765
Feature activation+9.20e-01

Pos logit contributions
ware+0.012
rily+0.011
ro+0.011
ge+0.011
bell+0.010
Neg logit contributions
 glad-0.009
 ost-0.009
ow-0.009
 much-0.008
e-0.008
Token telling
Token ID5149
Feature activation-8.54e-02

Pos logit contributions
 ost+0.032
 lives+0.028
opl+0.028
 glad+0.027
ow+0.027
Neg logit contributions
rily-0.062
ware-0.062
bell-0.059
ro-0.058
ge-0.058
Token a
Token ID257
Feature activation+3.89e+00

Pos logit contributions
 glad+0.001
 ost+0.001
ow+0.001
e+0.001
 much+0.001
Neg logit contributions
ware-0.002
rily-0.002
ro-0.002
ge-0.002
bell-0.002
Token story
Token ID1621
Feature activation+1.59e+00

Pos logit contributions
ware+0.099
rily+0.096
ro+0.092
bell+0.092
ge+0.090
Neg logit contributions
 glad-0.063
ow-0.062
e-0.061
 ost-0.059
aw-0.055
Token about
Token ID546
Feature activation-1.36e+00

Pos logit contributions
rily+0.049
ware+0.048
ro+0.045
ge+0.045
bell+0.043
Neg logit contributions
ow-0.019
 glad-0.019
 much-0.019
e-0.018
 ost-0.017
Token a
Token ID257
Feature activation+2.75e+00

Pos logit contributions
ow+0.017
 glad+0.016
e+0.015
 ost+0.015
aw+0.014
Neg logit contributions
ware-0.040
rily-0.038
ge-0.036
ro-0.036
bell-0.036
Token mer
Token ID4017
Feature activation+7.32e+00

Pos logit contributions
ware+0.072
rily+0.071
ro+0.067
ge+0.066
bell+0.064
Neg logit contributions
 glad-0.044
ow-0.042
 ost-0.042
e-0.041
 much-0.041
Tokenmaid
Token ID23151
Feature activation+3.72e-01

Pos logit contributions
driver+0.125
 opener+0.119
ge+0.117
Exc+0.109
ware+0.108
Neg logit contributions
ow-0.168
adow-0.159
 lives-0.156
 afraid-0.155
ap-0.155
Token who
Token ID508
Feature activation-1.40e+00

Pos logit contributions
ware+0.010
rily+0.010
ro+0.009
ge+0.009
bell+0.009
Neg logit contributions
 glad-0.006
ow-0.006
 ost-0.006
e-0.006
 much-0.005
Token lived
Token ID5615
Feature activation-8.55e-01

Pos logit contributions
 glad+0.024
 ost+0.024
ow+0.023
 much+0.022
e+0.022
Neg logit contributions
ware-0.034
rily-0.033
bell-0.031
ro-0.031
ge-0.031
Token in
Token ID287
Feature activation-4.18e+00

Pos logit contributions
 glad+0.012
ow+0.012
e+0.012
 ost+0.012
 much+0.011
Neg logit contributions
ware-0.024
rily-0.024
ro-0.022
ge-0.022
bell-0.022
Token the
Token ID262
Feature activation-3.59e+00

Pos logit contributions
 ost+0.053
 glad+0.051
ow+0.050
aw+0.049
adow+0.044
Neg logit contributions
ware-0.120
rily-0.118
ro-0.112
ge-0.110
bell-0.109
Token.
Token ID13
Feature activation-9.79e-01

Pos logit contributions
 much+0.012
e+0.012
ow+0.012
 glad+0.011
 very+0.011
Neg logit contributions
ware-0.025
rily-0.024
ro-0.023
ge-0.022
driver-0.022
Token It
Token ID632
Feature activation+1.12e+00

Pos logit contributions
 ost+0.018
e+0.017
 glad+0.017
ow+0.017
aw+0.016
Neg logit contributions
rily-0.022
ware-0.022
ro-0.021
bell-0.020
ge-0.019
Token was
Token ID373
Feature activation-2.04e+00

Pos logit contributions
ware+0.030
rily+0.029
ro+0.028
ge+0.027
bell+0.026
Neg logit contributions
 glad-0.017
ow-0.017
 much-0.017
 ost-0.017
e-0.017
Token covered
Token ID5017
Feature activation+5.30e-01

Pos logit contributions
 glad+0.022
ow+0.020
 much+0.020
 ost+0.020
e+0.020
Neg logit contributions
ware-0.064
rily-0.064
ro-0.061
ge-0.060
bell-0.059
Token in
Token ID287
Feature activation-2.07e+00

Pos logit contributions
ware+0.015
rily+0.014
ro+0.014
ge+0.014
bell+0.014
Neg logit contributions
e-0.007
 glad-0.007
ow-0.007
 much-0.007
 ost-0.007
Token de
Token ID390
Feature activation-6.79e+00

Pos logit contributions
 glad+0.032
 ost+0.030
ow+0.030
e+0.030
 much+0.029
Neg logit contributions
ware-0.056
rily-0.055
ro-0.052
ge-0.051
bell-0.050
Tokenw
Token ID86
Feature activation-1.84e+00

Pos logit contributions
 glad+0.125
 ost+0.124
ow+0.118
 afraid+0.115
 Thunder+0.114
Neg logit contributions
ware-0.160
rily-0.157
ge-0.151
ro-0.146
pron-0.146
Token from
Token ID422
Feature activation-3.16e+00

Pos logit contributions
 glad+0.036
ow+0.035
e+0.035
 much+0.034
 ost+0.034
Neg logit contributions
ware-0.042
rily-0.041
ro-0.038
ge-0.037
bell-0.037
Token the
Token ID262
Feature activation-1.24e+00

Pos logit contributions
 glad+0.037
ow+0.035
 ost+0.034
e+0.034
 much+0.033
Neg logit contributions
ware-0.097
rily-0.096
ro-0.090
ge-0.090
bell-0.087
Token wet
Token ID9583
Feature activation+2.04e+00

Pos logit contributions
 glad+0.022
ow+0.021
 ost+0.021
e+0.021
 much+0.020
Neg logit contributions
ware-0.030
rily-0.030
ro-0.028
ge-0.027
bell-0.027
Token rain
Token ID6290
Feature activation+4.88e-01

Pos logit contributions
rily+0.053
ware+0.052
ge+0.050
bell+0.047
uing+0.046
Neg logit contributions
 glad-0.032
e-0.031
 much-0.031
 ost-0.030
 Lightning-0.030
Token a
Token ID257
Feature activation+3.39e+00

Pos logit contributions
ware+0.023
rily+0.022
ro+0.021
ge+0.021
bell+0.021
Neg logit contributions
 glad-0.012
ow-0.011
 much-0.011
e-0.011
 ost-0.011
Token special
Token ID2041
Feature activation+3.75e+00

Pos logit contributions
rily+0.086
ware+0.084
ro+0.077
ge+0.077
fully+0.074
Neg logit contributions
 glad-0.063
 much-0.061
 very-0.057
 afraid-0.055
 ost-0.055
Token job
Token ID1693
Feature activation+4.73e-01

Pos logit contributions
rily+0.072
ware+0.069
ge+0.063
ro+0.062
bell+0.060
Neg logit contributions
 glad-0.087
ow-0.084
 much-0.084
e-0.083
 ost-0.083
Token every
Token ID790
Feature activation-1.21e+00

Pos logit contributions
rily+0.012
ware+0.012
ro+0.011
ge+0.011
bell+0.010
Neg logit contributions
 glad-0.008
 much-0.008
ow-0.008
 ost-0.008
e-0.008
Token day
Token ID1110
Feature activation-7.98e-01

Pos logit contributions
 glad+0.010
ow+0.009
 ost+0.009
e+0.009
 much+0.008
Neg logit contributions
ware-0.041
rily-0.040
ro-0.039
ge-0.038
bell-0.038
Token during
Token ID1141
Feature activation-6.78e+00

Pos logit contributions
 glad+0.013
 much+0.013
ow+0.013
e+0.013
 ost+0.012
Neg logit contributions
ware-0.021
rily-0.021
ro-0.019
ge-0.019
bell-0.018
Token nap
Token ID25422
Feature activation-1.59e+00

Pos logit contributions
ow+0.094
 ost+0.092
 glad+0.091
aw+0.088
adow+0.084
Neg logit contributions
ware-0.188
rily-0.181
ge-0.174
ro-0.172
bell-0.169
Token time
Token ID640
Feature activation-1.58e+00

Pos logit contributions
 glad+0.023
e+0.023
 much+0.023
ow+0.022
 ost+0.021
Neg logit contributions
ware-0.043
rily-0.043
ro-0.041
bell-0.041
ge-0.041
Token and
Token ID290
Feature activation-1.59e+00

Pos logit contributions
 much+0.023
 glad+0.022
e+0.022
ow+0.021
 very+0.020
Neg logit contributions
ware-0.044
rily-0.044
ro-0.041
bell-0.040
ge-0.040
Token he
Token ID339
Feature activation+3.30e-01

Pos logit contributions
 glad+0.028
ow+0.028
 ost+0.028
e+0.027
 much+0.026
Neg logit contributions
ware-0.038
rily-0.038
ro-0.035
ge-0.035
bell-0.034
Token was
Token ID373
Feature activation-3.07e+00

Pos logit contributions
ware+0.008
rily+0.008
ro+0.007
bell+0.007
ge+0.007
Neg logit contributions
 glad-0.006
 ost-0.006
ow-0.006
e-0.006
 much-0.005
Token for
Token ID329
Feature activation-1.77e+00

Pos logit contributions
 glad+0.030
 ost+0.029
ow+0.029
e+0.028
 much+0.028
Neg logit contributions
ware-0.043
rily-0.042
ro-0.040
ge-0.039
bell-0.038
Token helping
Token ID5742
Feature activation+1.85e+00

Pos logit contributions
 glad+0.024
e+0.022
ow+0.022
 ost+0.022
 much+0.022
Neg logit contributions
ware-0.051
rily-0.051
ro-0.048
bell-0.047
ge-0.047
Token them
Token ID606
Feature activation+2.51e-01

Pos logit contributions
ware+0.050
rily+0.048
ro+0.046
bell+0.045
yt+0.044
Neg logit contributions
e-0.029
 glad-0.028
 much-0.028
ow-0.028
 ost-0.027
Token stay
Token ID2652
Feature activation-2.89e+00

Pos logit contributions
ware+0.006
rily+0.006
ro+0.006
bell+0.006
ge+0.006
Neg logit contributions
 glad-0.004
ow-0.004
 much-0.004
e-0.004
 ost-0.004
Token safe
Token ID3338
Feature activation-1.86e+00

Pos logit contributions
 glad+0.045
 much+0.042
e+0.042
ow+0.040
 ost+0.040
Neg logit contributions
ware-0.081
rily-0.076
ro-0.074
ge-0.072
bell-0.071
Token during
Token ID1141
Feature activation-6.75e+00

Pos logit contributions
 much+0.027
 glad+0.026
 very+0.026
 many+0.023
e+0.023
Neg logit contributions
ware-0.054
rily-0.050
bell-0.050
ro-0.048
oph-0.047
Token the
Token ID262
Feature activation-3.02e+00

Pos logit contributions
ow+0.072
 glad+0.071
 ost+0.068
e+0.065
 afraid+0.062
Neg logit contributions
ware-0.208
rily-0.207
ro-0.195
ge-0.194
bell-0.188
Token storm
Token ID6388
Feature activation-5.09e-01

Pos logit contributions
ow+0.043
 glad+0.042
e+0.041
 ost+0.040
 afraid+0.038
Neg logit contributions
ware-0.083
rily-0.082
ro-0.078
ge-0.076
bell-0.076
Token.
Token ID13
Feature activation-4.04e-01

Pos logit contributions
ow+0.008
 glad+0.008
 much+0.008
e+0.008
 ost+0.007
Neg logit contributions
ware-0.014
rily-0.014
bell-0.012
ro-0.012
ge-0.012
Token<|endoftext|>
Token ID50256
Feature activation-1.41e+00

Pos logit contributions
 ost+0.006
ow+0.006
 glad+0.006
e+0.006
 afraid+0.006
Neg logit contributions
rily-0.010
ware-0.010
ro-0.010
bell-0.009
ge-0.009
TokenOnce
Token ID7454
Feature activation-5.91e-01

Pos logit contributions
 glad+0.021
 ost+0.020
ow+0.020
e+0.019
 much+0.019
Neg logit contributions
ware-0.038
rily-0.038
ro-0.036
ge-0.035
bell-0.034
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
 glad+0.072
 ost+0.071
ow+0.069
e+0.069
 afraid+0.066
Neg logit contributions
rily-0.098
ware-0.098
ro-0.090
ge-0.088
bell-0.088
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
ware+0.035
rily+0.034
ro+0.032
ge+0.032
bell+0.031
Neg logit contributions
 glad-0.022
ow-0.020
 much-0.020
 ost-0.020
e-0.020
Token farmer
Token ID18739
Feature activation-1.34e+00

Pos logit contributions
 glad+0.015
ow+0.015
e+0.015
 ost+0.014
 much+0.014
Neg logit contributions
ware-0.028
rily-0.028
ro-0.026
ge-0.026
bell-0.025
Token in
Token ID287
Feature activation-3.21e+00

Pos logit contributions
 glad+0.027
 ost+0.026
ow+0.026
 much+0.026
e+0.026
Neg logit contributions
ware-0.029
rily-0.028
ro-0.026
ge-0.026
bell-0.025
Token the
Token ID262
Feature activation-2.85e+00

Pos logit contributions
 glad+0.037
ow+0.036
 ost+0.036
 afraid+0.032
e+0.031
Neg logit contributions
rily-0.095
ware-0.095
ro-0.090
ge-0.088
bell-0.086
Token de
Token ID390
Feature activation-6.70e+00

Pos logit contributions
ow+0.047
 glad+0.045
e+0.043
 ost+0.042
 afraid+0.041
Neg logit contributions
ware-0.072
rily-0.071
ro-0.066
bell-0.066
pron-0.063
Tokenll
Token ID297
Feature activation+4.20e+00

Pos logit contributions
 ost+0.160
 glad+0.157
 Thunder+0.153
ow+0.152
 Lightning+0.151
Neg logit contributions
ware-0.114
rily-0.110
ge-0.108
pron-0.104
ro-0.102
Token".
Token ID1911
Feature activation-4.12e-02

Pos logit contributions
ware+0.101
rily+0.097
ro+0.092
ge+0.088
bell+0.088
Neg logit contributions
 glad-0.080
e-0.078
ow-0.077
 ost-0.075
 much-0.074
Token Suddenly
Token ID24975
Feature activation-8.78e-01

Pos logit contributions
 ost+0.001
 glad+0.001
e+0.001
ow+0.001
 much+0.001
Neg logit contributions
rily-0.001
ware-0.001
ro-0.001
ge-0.001
bell-0.001
Token,
Token ID11
Feature activation+5.15e-01

Pos logit contributions
 glad+0.017
 ost+0.017
e+0.017
ow+0.016
opl+0.016
Neg logit contributions
ware-0.019
rily-0.019
ro-0.017
fully-0.017
ge-0.017
Token he
Token ID339
Feature activation+8.81e-01

Pos logit contributions
rily+0.010
ware+0.010
fully+0.010
ro+0.009
ge+0.009
Neg logit contributions
e-0.011
 ost-0.010
opl-0.010
ow-0.010
 lives-0.010
Token do
Token ID466
Feature activation-5.36e-01

Pos logit contributions
rily+0.017
ware+0.017
bell+0.015
ro+0.015
ge+0.014
Neg logit contributions
 glad-0.011
 much-0.011
e-0.011
ow-0.010
 ost-0.010
Token can
Token ID460
Feature activation+2.63e+00

Pos logit contributions
 glad+0.008
 much+0.007
e+0.007
ow+0.007
 ost+0.007
Neg logit contributions
ware-0.015
rily-0.015
ro-0.014
ge-0.014
bell-0.014
Token prevent
Token ID2948
Feature activation-3.48e+00

Pos logit contributions
rily+0.077
ware+0.073
yt+0.065
ro+0.064
oph+0.062
Neg logit contributions
 much-0.042
e-0.042
ow-0.041
 very-0.039
 glad-0.036
Token us
Token ID514
Feature activation-2.19e+00

Pos logit contributions
 glad+0.057
 ost+0.054
ow+0.054
e+0.053
adow+0.052
Neg logit contributions
rily-0.087
ware-0.084
ge-0.083
ro-0.080
fully-0.076
Token from
Token ID422
Feature activation-5.90e+00

Pos logit contributions
 glad+0.038
ow+0.037
 ost+0.036
e+0.036
 much+0.035
Neg logit contributions
ware-0.055
rily-0.054
ro-0.050
ge-0.049
bell-0.049
Token feeling
Token ID4203
Feature activation-6.62e+00

Pos logit contributions
 glad+0.079
ow+0.078
 ost+0.077
e+0.075
 much+0.073
Neg logit contributions
ware-0.166
rily-0.164
ge-0.156
ro-0.154
bell-0.148
Token upset
Token ID9247
Feature activation-7.27e-01

Pos logit contributions
 ost+0.084
ow+0.082
 glad+0.078
e+0.077
 Thunder+0.070
Neg logit contributions
rily-0.190
ware-0.189
ro-0.179
ge-0.179
bell-0.172
Token in
Token ID287
Feature activation-3.10e+00

Pos logit contributions
 much+0.010
 glad+0.010
 afraid+0.010
e+0.009
ow+0.009
Neg logit contributions
ware-0.021
rily-0.020
bell-0.019
pron-0.019
ro-0.018
Token the
Token ID262
Feature activation-4.02e+00

Pos logit contributions
 glad+0.039
ow+0.037
 ost+0.037
e+0.036
 much+0.035
Neg logit contributions
ware-0.091
rily-0.091
ro-0.086
ge-0.085
bell-0.083
Token long
Token ID890
Feature activation+1.59e-01

Pos logit contributions
ow+0.066
 ost+0.063
 glad+0.063
e+0.062
adow+0.055
Neg logit contributions
ware-0.100
ro-0.096
rily-0.095
bell-0.092
ge-0.089
Token run
Token ID1057
Feature activation+1.01e+00

Pos logit contributions
rily+0.005
ware+0.005
ge+0.004
bell+0.004
ro+0.004
Neg logit contributions
e-0.002
 glad-0.002
 much-0.002
 ost-0.002
ow-0.002
Token<|endoftext|>
Token ID50256
Feature activation-1.68e+00

Pos logit contributions
 glad+0.024
ow+0.024
 ost+0.024
e+0.023
 much+0.022
Neg logit contributions
ware-0.041
rily-0.041
ro-0.039
ge-0.038
bell-0.038
Token<|endoftext|>
Token ID50256
Feature activation-2.43e+00

Pos logit contributions
 glad+0.024
 ost+0.023
ow+0.022
 afraid+0.021
 much+0.020
Neg logit contributions
ware-0.047
ro-0.045
rily-0.045
ge-0.042
bell-0.042
Token<|endoftext|>
Token ID50256
Feature activation-2.80e+00

Pos logit contributions
 glad+0.036
ow+0.034
 ost+0.034
 afraid+0.032
 much+0.031
Neg logit contributions
ware-0.066
rily-0.063
ro-0.063
ge-0.059
bell-0.058
Token<|endoftext|>
Token ID50256
Feature activation-2.95e+00

Pos logit contributions
 glad+0.042
ow+0.040
 ost+0.039
 afraid+0.037
 much+0.036
Neg logit contributions
ware-0.075
ro-0.072
rily-0.072
ge-0.068
bell-0.066
Token<|endoftext|>
Token ID50256
Feature activation-3.01e+00

Pos logit contributions
 glad+0.044
ow+0.041
 ost+0.041
 afraid+0.038
 much+0.038
Neg logit contributions
ware-0.080
ro-0.076
rily-0.076
ge-0.072
bell-0.070
TokenL
Token ID43
Feature activation-6.60e+00

Pos logit contributions
 glad+0.044
ow+0.041
 ost+0.041
 afraid+0.038
 much+0.038
Neg logit contributions
ware-0.082
ro-0.079
rily-0.079
ge-0.075
bell-0.072
Tokenila
Token ID10102
Feature activation-6.08e+00

Pos logit contributions
 glad+0.111
ow+0.108
e+0.106
 ost+0.105
 much+0.104
Neg logit contributions
ware-0.167
rily-0.167
ro-0.155
ge-0.153
bell-0.150
Token and
Token ID290
Feature activation-7.03e-01

Pos logit contributions
 glad+0.145
 ost+0.142
ow+0.141
 much+0.137
e+0.136
Neg logit contributions
ware-0.108
rily-0.106
ro-0.100
bell-0.094
ge-0.094
Token Ben
Token ID3932
Feature activation-3.26e+00

Pos logit contributions
 ost+0.013
ang+0.011
 dangers+0.011
ards+0.011
opl+0.011
Neg logit contributions
rily-0.014
fully-0.014
bell-0.013
ro-0.013
ge-0.012
Token are
Token ID389
Feature activation-3.45e+00

Pos logit contributions
 ost+0.078
 glad+0.076
ow+0.074
 Lightning+0.073
 afraid+0.072
Neg logit contributions
ware-0.055
ro-0.054
rily-0.053
ge-0.048
bell-0.048
Token friends
Token ID2460
Feature activation-3.37e+00

Pos logit contributions
 ost+0.044
e+0.041
ow+0.040
 Thunder+0.039
elope+0.039
Neg logit contributions
rily-0.091
ware-0.089
yt-0.086
bell-0.085
ro-0.084
Token.
Token ID13
Feature activation-7.41e-01

Pos logit contributions
ware+0.005
rily+0.005
ro+0.004
ge+0.004
bell+0.004
Neg logit contributions
 glad-0.003
ow-0.003
e-0.003
 much-0.003
 ost-0.003
Token
Token ID198
Feature activation+1.84e-02

Pos logit contributions
 ost+0.013
ow+0.013
 glad+0.013
e+0.012
 much+0.012
Neg logit contributions
rily-0.017
ware-0.017
ro-0.017
bell-0.016
ge-0.015
Token
Token ID198
Feature activation+4.50e-01

Pos logit contributions
rily+0.000
ro+0.000
ware+0.000
ge+0.000
bell+0.000
Neg logit contributions
 ost-0.000
ow-0.000
e-0.000
 glad-0.000
 afraid-0.000
TokenLater
Token ID18602
Feature activation-9.99e-01

Pos logit contributions
ro+0.007
rily+0.006
ge+0.006
ware+0.006
bell+0.006
Neg logit contributions
ow-0.012
 ost-0.012
 glad-0.011
 afraid-0.011
 Thunder-0.011
Token,
Token ID11
Feature activation+1.04e+00

Pos logit contributions
 glad+0.016
 ost+0.016
e+0.016
ow+0.015
 afraid+0.015
Neg logit contributions
ware-0.025
rily-0.024
bell-0.023
ro-0.023
fully-0.022
Token during
Token ID1141
Feature activation-6.57e+00

Pos logit contributions
ware+0.022
rily+0.021
ro+0.020
ge+0.019
bell+0.019
Neg logit contributions
 glad-0.022
ow-0.021
 ost-0.021
e-0.021
 much-0.020
Token recess
Token ID28836
Feature activation-1.73e+00

Pos logit contributions
aw+0.117
adow+0.113
ow+0.111
 ost+0.110
ang+0.103
Neg logit contributions
ro-0.140
ware-0.137
bell-0.131
yt-0.131
 ribbon-0.131
Token,
Token ID11
Feature activation+6.04e-01

Pos logit contributions
 glad+0.026
ow+0.025
 ost+0.025
e+0.024
 much+0.024
Neg logit contributions
ware-0.047
rily-0.046
ro-0.044
ge-0.043
bell-0.042
Token Lily
Token ID20037
Feature activation-1.40e+00

Pos logit contributions
ware+0.013
rily+0.012
ro+0.012
ge+0.011
bell+0.011
Neg logit contributions
 glad-0.013
ow-0.012
 ost-0.012
e-0.012
 much-0.012
Token's
Token ID338
Feature activation+8.53e-01

Pos logit contributions
 glad+0.027
 ost+0.026
ow+0.026
 much+0.025
e+0.025
Neg logit contributions
ware-0.031
rily-0.031
ro-0.030
ge-0.028
bell-0.028
Token friend
Token ID1545
Feature activation-9.05e-01

Pos logit contributions
ware+0.021
rily+0.021
ro+0.019
ge+0.019
bell+0.019
Neg logit contributions
 glad-0.015
ow-0.014
 ost-0.014
e-0.014
 much-0.014
Tokenometer
Token ID15635
Feature activation-2.21e+00

Pos logit contributions
ware+0.014
rily+0.014
ro+0.013
ge+0.012
bell+0.012
Neg logit contributions
 glad-0.014
e-0.014
 ost-0.014
 much-0.013
ow-0.013
Token wherever
Token ID14530
Feature activation-1.94e+00

Pos logit contributions
 glad+0.043
 ost+0.041
ow+0.041
e+0.040
 much+0.040
Neg logit contributions
ware-0.050
rily-0.050
ro-0.046
ge-0.046
bell-0.045
Token it
Token ID340
Feature activation-5.05e-01

Pos logit contributions
 glad+0.042
ow+0.041
 ost+0.040
e+0.040
 afraid+0.038
Neg logit contributions
ware-0.040
rily-0.039
ro-0.036
ge-0.036
bell-0.035
Token took
Token ID1718
Feature activation-1.65e+00

Pos logit contributions
 glad+0.009
ow+0.009
 ost+0.008
e+0.008
 much+0.008
Neg logit contributions
rily-0.013
ware-0.012
ro-0.011
ge-0.011
bell-0.011
Token him
Token ID683
Feature activation-5.76e-01

Pos logit contributions
 glad+0.029
 ost+0.027
ow+0.027
e+0.026
 much+0.026
Neg logit contributions
rily-0.040
ware-0.040
ro-0.037
ge-0.037
bell-0.036
Token during
Token ID1141
Feature activation-6.57e+00

Pos logit contributions
 glad+0.010
ow+0.010
 ost+0.010
e+0.010
 much+0.010
Neg logit contributions
ware-0.014
rily-0.014
ro-0.013
ge-0.013
bell-0.012
Token his
Token ID465
Feature activation-6.46e-01

Pos logit contributions
 glad+0.072
ow+0.069
 ost+0.066
e+0.065
 much+0.060
Neg logit contributions
ware-0.205
rily-0.202
ro-0.192
ge-0.190
bell-0.187
Token journey
Token ID7002
Feature activation-7.59e-01

Pos logit contributions
 glad+0.012
ow+0.012
 ost+0.011
e+0.011
 much+0.011
Neg logit contributions
ware-0.015
rily-0.014
ro-0.014
bell-0.013
ge-0.013
Token,
Token ID11
Feature activation+2.57e+00

Pos logit contributions
 much+0.011
ow+0.010
 glad+0.010
e+0.010
 very+0.010
Neg logit contributions
ware-0.022
rily-0.022
ro-0.020
ge-0.019
bell-0.019
Token and
Token ID290
Feature activation-7.43e-01

Pos logit contributions
ware+0.064
rily+0.063
ro+0.058
ge+0.057
bell+0.056
Neg logit contributions
 glad-0.048
 ost-0.044
 much-0.044
ow-0.043
e-0.043
Token he
Token ID339
Feature activation+2.62e-01

Pos logit contributions
 glad+0.014
ow+0.013
 ost+0.013
e+0.013
 much+0.013
Neg logit contributions
ware-0.018
rily-0.018
ro-0.016
ge-0.016
bell-0.016
Token awake
Token ID21693
Feature activation+1.26e+00

Pos logit contributions
 glad+0.038
ow+0.037
 ost+0.037
e+0.035
 Lightning+0.033
Neg logit contributions
rily-0.080
ware-0.078
ro-0.074
ge-0.073
bell-0.073
Token at
Token ID379
Feature activation-1.07e+00

Pos logit contributions
ware+0.035
ge+0.032
rily+0.032
ro+0.032
oph+0.031
Neg logit contributions
 much-0.020
 very-0.019
 glad-0.018
 many-0.018
 afraid-0.018
Token night
Token ID1755
Feature activation-1.21e+00

Pos logit contributions
 glad+0.014
 ost+0.013
ow+0.013
e+0.013
aw+0.012
Neg logit contributions
ware-0.030
rily-0.029
ro-0.028
ge-0.027
bell-0.027
Token and
Token ID290
Feature activation-2.56e+00

Pos logit contributions
 glad+0.023
ow+0.022
 much+0.022
e+0.022
 ost+0.022
Neg logit contributions
ware-0.028
rily-0.028
ge-0.026
ro-0.025
bell-0.025
Token sleep
Token ID3993
Feature activation-1.62e-01

Pos logit contributions
 glad+0.043
 ost+0.042
ow+0.042
e+0.042
 much+0.039
Neg logit contributions
ware-0.062
rily-0.061
ro-0.058
ge-0.056
bell-0.056
Token during
Token ID1141
Feature activation-6.54e+00

Pos logit contributions
 glad+0.003
 much+0.003
e+0.003
ow+0.003
 ost+0.003
Neg logit contributions
ware-0.004
rily-0.004
ro-0.003
ge-0.003
bell-0.003
Token the
Token ID262
Feature activation-3.18e+00

Pos logit contributions
ow+0.095
 ost+0.093
ang+0.092
 glad+0.092
aw+0.090
Neg logit contributions
ware-0.168
rily-0.161
ro-0.153
bell-0.150
ge-0.144
Token day
Token ID1110
Feature activation-2.52e+00

Pos logit contributions
 glad+0.034
ow+0.034
e+0.033
 ost+0.032
aw+0.028
Neg logit contributions
ware-0.095
rily-0.093
bell-0.091
ro-0.091
pron-0.087
Token."
Token ID526
Feature activation+1.37e-01

Pos logit contributions
 glad+0.052
ow+0.050
 ost+0.050
e+0.050
 much+0.049
Neg logit contributions
ware-0.054
rily-0.054
ro-0.049
ge-0.049
bell-0.047
Token Lily
Token ID20037
Feature activation+1.98e-01

Pos logit contributions
ware+0.004
rily+0.004
ro+0.003
ge+0.003
bell+0.003
Neg logit contributions
 glad-0.002
ow-0.002
 ost-0.002
e-0.002
 much-0.002
Token thought
Token ID1807
Feature activation+3.52e+00

Pos logit contributions
ware+0.004
rily+0.004
ro+0.004
ge+0.004
bell+0.004
Neg logit contributions
 glad-0.004
 ost-0.004
ow-0.004
 much-0.004
e-0.004
Token and
Token ID290
Feature activation-3.46e-01

Pos logit contributions
 much+0.003
e+0.003
 glad+0.003
 lived+0.003
opl+0.003
Neg logit contributions
ge-0.004
rily-0.004
ware-0.004
ro-0.004
fully-0.004
Token looked
Token ID3114
Feature activation+8.85e-01

Pos logit contributions
opl+0.009
 glad+0.008
 much+0.008
ained+0.008
 ost+0.008
Neg logit contributions
ware-0.006
rily-0.006
bell-0.005
ro-0.005
ge-0.005
Token at
Token ID379
Feature activation-4.19e-01

Pos logit contributions
rily+0.023
ware+0.022
ge+0.020
bell+0.020
ly+0.020
Neg logit contributions
e-0.015
ow-0.014
 ost-0.014
opl-0.013
 glad-0.013
Token the
Token ID262
Feature activation+7.22e-02

Pos logit contributions
 glad+0.007
adow+0.007
e+0.007
ow+0.007
 afraid+0.006
Neg logit contributions
rily-0.010
ware-0.010
ro-0.009
ge-0.009
fully-0.009
Token ost
Token ID23619
Feature activation-6.74e-01

Pos logit contributions
ware+0.002
rily+0.002
bell+0.002
ro+0.002
ge+0.002
Neg logit contributions
e-0.001
ow-0.001
 glad-0.001
adow-0.001
 much-0.001
Tokenric
Token ID1173
Feature activation-6.50e+00

Pos logit contributions
 much+0.009
 glad+0.009
 sorry+0.008
 afraid+0.008
 Lightning+0.008
Neg logit contributions
ware-0.018
rily-0.018
ge-0.018
ro-0.018
bell-0.017
Tokenhes
Token ID956
Feature activation-1.86e+00

Pos logit contributions
 ost+0.087
 glad+0.084
 lives+0.076
elope+0.073
 haste+0.073
Neg logit contributions
ware-0.176
rily-0.176
ro-0.158
fully-0.157
driver-0.156
Token.
Token ID13
Feature activation-6.68e-01

Pos logit contributions
 glad+0.032
ow+0.031
 ost+0.031
e+0.031
 much+0.030
Neg logit contributions
ware-0.046
rily-0.045
ro-0.042
ge-0.042
bell-0.041
Token They
Token ID1119
Feature activation+2.41e-01

Pos logit contributions
 glad+0.010
e+0.010
ow+0.010
 ost+0.010
 much+0.009
Neg logit contributions
ware-0.018
rily-0.018
ro-0.017
bell-0.016
ge-0.016
Token saw
Token ID2497
Feature activation+5.57e-01

Pos logit contributions
ware+0.005
rily+0.005
ro+0.005
ge+0.005
bell+0.005
Neg logit contributions
 glad-0.005
ow-0.004
 ost-0.004
e-0.004
 much-0.004
Token one
Token ID530
Feature activation+6.19e-01

Pos logit contributions
ware+0.015
rily+0.015
bell+0.013
ro+0.013
fully+0.013
Neg logit contributions
e-0.008
adow-0.008
opl-0.008
 glad-0.008
ow-0.008
Token exciting
Token ID7895
Feature activation+7.46e-01

Pos logit contributions
 glad+0.038
 ost+0.036
 much+0.035
e+0.035
ow+0.034
Neg logit contributions
rily-0.069
ware-0.069
ro-0.064
ge-0.063
bell-0.062
Token things
Token ID1243
Feature activation-6.15e-01

Pos logit contributions
rily+0.020
ware+0.019
ro+0.018
ge+0.018
bell+0.018
Neg logit contributions
 glad-0.012
 ost-0.011
ow-0.011
 much-0.011
e-0.011
Token it
Token ID340
Feature activation+3.09e-01

Pos logit contributions
 glad+0.010
 much+0.009
ow+0.009
 ost+0.009
e+0.009
Neg logit contributions
ware-0.016
rily-0.016
ro-0.016
ge-0.015
bell-0.015
Token had
Token ID550
Feature activation-1.87e+00

Pos logit contributions
rily+0.009
ware+0.008
ro+0.008
ge+0.008
bell+0.008
Neg logit contributions
 glad-0.005
 much-0.005
 lived-0.004
e-0.004
ow-0.004
Token found
Token ID1043
Feature activation+3.19e-01

Pos logit contributions
 much+0.029
e+0.029
 glad+0.029
ow+0.027
 very+0.027
Neg logit contributions
rily-0.052
ware-0.051
ro-0.046
ge-0.046
driver-0.046
Token during
Token ID1141
Feature activation-6.49e+00

Pos logit contributions
ware+0.009
rily+0.008
ro+0.008
ge+0.008
bell+0.008
Neg logit contributions
 glad-0.005
 much-0.005
e-0.005
ow-0.005
 ost-0.005
Token its
Token ID663
Feature activation-1.74e+00

Pos logit contributions
 glad+0.067
ow+0.064
 ost+0.062
e+0.060
 much+0.057
Neg logit contributions
ware-0.205
rily-0.204
ro-0.193
ge-0.191
bell-0.188
Token journey
Token ID7002
Feature activation-6.68e-01

Pos logit contributions
 glad+0.033
ow+0.032
 ost+0.032
e+0.031
 much+0.030
Neg logit contributions
ware-0.039
rily-0.039
ro-0.037
ge-0.036
bell-0.036
Token.
Token ID13
Feature activation-9.18e-01

Pos logit contributions
 much+0.010
 glad+0.009
ow+0.009
e+0.009
 very+0.009
Neg logit contributions
ware-0.019
rily-0.019
ro-0.017
bell-0.017
ge-0.017
Token The
Token ID383
Feature activation-1.50e-01

Pos logit contributions
 glad+0.014
 ost+0.014
ow+0.014
e+0.014
 afraid+0.013
Neg logit contributions
rily-0.024
ware-0.023
ro-0.022
bell-0.021
ge-0.021
Token purple
Token ID14032
Feature activation+2.54e+00

Pos logit contributions
ow+0.003
 glad+0.003
e+0.003
 ost+0.003
 much+0.003
Neg logit contributions
ware-0.004
rily-0.003
ro-0.003
bell-0.003
ge-0.003
Token sack
Token ID23704
Feature activation-8.51e-02

Pos logit contributions
 very+0.008
 much+0.008
 glad+0.008
 many+0.008
 lives+0.008
Neg logit contributions
ware-0.011
rily-0.011
ge-0.011
ro-0.010
bell-0.010
Token,
Token ID11
Feature activation+1.51e-01

Pos logit contributions
ow+0.001
 much+0.001
 glad+0.001
e+0.001
 very+0.001
Neg logit contributions
ware-0.003
rily-0.003
ge-0.002
ro-0.002
yt-0.002
Token it
Token ID340
Feature activation-4.11e-01

Pos logit contributions
ware+0.003
rily+0.003
driver+0.003
ro+0.003
ge+0.003
Neg logit contributions
ow-0.003
 glad-0.003
e-0.003
 ost-0.003
adow-0.003
Token disappeared
Token ID12120
Feature activation+1.34e+00

Pos logit contributions
 glad+0.007
ow+0.007
e+0.007
 much+0.007
 ost+0.007
Neg logit contributions
ware-0.010
rily-0.010
ro-0.009
ge-0.009
bell-0.009
Token right
Token ID826
Feature activation-5.63e-01

Pos logit contributions
ware+0.038
rily+0.034
ge+0.033
ro+0.033
uing+0.033
Neg logit contributions
 much-0.020
 very-0.019
 glad-0.019
ow-0.018
 afraid-0.018
Token before
Token ID878
Feature activation-6.48e+00

Pos logit contributions
ow+0.007
 very+0.007
 glad+0.007
 afraid+0.007
 much+0.007
Neg logit contributions
ware-0.016
rily-0.015
ge-0.015
driver-0.015
bell-0.015
Token his
Token ID465
Feature activation+3.35e-01

Pos logit contributions
 glad+0.112
ow+0.107
 ost+0.107
e+0.105
 afraid+0.099
Neg logit contributions
ware-0.160
rily-0.160
ro-0.148
ge-0.147
bell-0.145
Token eyes
Token ID2951
Feature activation-6.51e-01

Pos logit contributions
ware+0.009
rily+0.009
ge+0.008
ro+0.008
bell+0.007
Neg logit contributions
 glad-0.006
ow-0.005
 much-0.005
 ost-0.005
e-0.005
Token!
Token ID0
Feature activation-1.70e+00

Pos logit contributions
 much+0.010
e+0.010
ow+0.010
 scared+0.009
 very+0.009
Neg logit contributions
ware-0.018
rily-0.017
ro-0.016
ge-0.016
driver-0.016
Token
Token ID220
Feature activation-1.70e+00

Pos logit contributions
 ost+0.030
 glad+0.029
e+0.028
ow+0.028
 afraid+0.027
Neg logit contributions
ware-0.041
ro-0.040
rily-0.040
driver-0.038
bell-0.037
Token
Token ID198
Feature activation+7.84e-01

Pos logit contributions
 glad+0.026
 ost+0.025
ow+0.025
e+0.024
 much+0.023
Neg logit contributions
ware-0.046
rily-0.045
ro-0.043
ge-0.042
bell-0.041
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
ow+0.026
 glad+0.024
adow+0.023
 much+0.023
 ost+0.023
Neg logit contributions
rily-0.044
ware-0.043
ge-0.040
fully-0.039
uing-0.038
Token pond
Token ID16723
Feature activation-8.87e-01

Pos logit contributions
 glad+0.023
ow+0.022
e+0.022
 ost+0.022
 much+0.021
Neg logit contributions
ware-0.030
rily-0.030
ro-0.028
ge-0.027
bell-0.027
Token modest
Token ID12949
Feature activation+2.09e-01

Pos logit contributions
 glad+0.014
ow+0.013
 much+0.013
 ost+0.013
e+0.013
Neg logit contributions
ware-0.024
rily-0.023
ro-0.022
ge-0.022
bell-0.021
Token,
Token ID11
Feature activation+3.10e+00

Pos logit contributions
ware+0.005
rily+0.005
bell+0.005
ge+0.005
ro+0.005
Neg logit contributions
 glad-0.004
e-0.004
 much-0.004
ow-0.004
 ost-0.004
Token just
Token ID655
Feature activation-2.10e+00

Pos logit contributions
ware+0.072
rily+0.071
ro+0.066
ge+0.065
bell+0.063
Neg logit contributions
 glad-0.060
ow-0.057
 ost-0.057
e-0.056
 much-0.056
Token between
Token ID1022
Feature activation-6.47e+00

Pos logit contributions
 glad+0.037
 ost+0.035
ow+0.035
e+0.035
 much+0.034
Neg logit contributions
ware-0.051
rily-0.051
ro-0.048
ge-0.047
bell-0.046
Token them
Token ID606
Feature activation-1.96e+00

Pos logit contributions
ow+0.098
 glad+0.098
 ost+0.096
e+0.092
 afraid+0.090
Neg logit contributions
rily-0.167
ware-0.164
ge-0.154
ro-0.154
bell-0.152
Token.
Token ID13
Feature activation-1.16e+00

Pos logit contributions
 glad+0.035
ow+0.034
 ost+0.033
e+0.033
 much+0.033
Neg logit contributions
ware-0.048
rily-0.047
ro-0.044
ge-0.043
bell-0.042
Token<|endoftext|>
Token ID50256
Feature activation-1.70e+00

Pos logit contributions
 ost+0.019
ow+0.018
 glad+0.018
e+0.018
 afraid+0.018
Neg logit contributions
rily-0.029
ware-0.028
ro-0.028
bell-0.026
ge-0.026
TokenR
Token ID49
Feature activation-3.94e+00

Pos logit contributions
 glad+0.021
ow+0.020
 ost+0.020
e+0.019
 much+0.018
Neg logit contributions
ware-0.051
rily-0.050
ro-0.048
ge-0.046
bell-0.046
Tokenandy
Token ID10757
Feature activation-4.05e+00

Pos logit contributions
 glad+0.081
 ost+0.078
 afraid+0.077
 Lightning+0.075
 sorry+0.075
Neg logit contributions
rily-0.086
ware-0.085
ro-0.084
ge-0.079
bell-0.075
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
ware+0.006
rily+0.006
ro+0.005
ge+0.005
bell+0.005
Neg logit contributions
 glad-0.003
ow-0.003
 much-0.003
 ost-0.003
e-0.003
Token They
Token ID1119
Feature activation+4.52e-01

Pos logit contributions
 glad+0.015
ow+0.014
 ost+0.014
e+0.014
 much+0.014
Neg logit contributions
ware-0.030
rily-0.029
ro-0.028
ge-0.027
bell-0.027
Token were
Token ID547
Feature activation-1.97e+00

Pos logit contributions
ware+0.011
rily+0.010
ro+0.010
ge+0.010
bell+0.009
Neg logit contributions
 glad-0.008
ow-0.008
 ost-0.008
e-0.008
 much-0.008
Token called
Token ID1444
Feature activation-3.41e-01

Pos logit contributions
e+0.026
opl+0.025
 Lightning+0.024
ow+0.024
 Fire+0.024
Neg logit contributions
rily-0.051
bell-0.044
ware-0.043
clips-0.043
yt-0.043
Token ost
Token ID23619
Feature activation-1.52e+00

Pos logit contributions
 glad+0.006
e+0.006
 much+0.005
ow+0.005
 afraid+0.005
Neg logit contributions
rily-0.008
ware-0.008
ro-0.008
bell-0.007
ge-0.007
Tokenric
Token ID1173
Feature activation-6.44e+00

Pos logit contributions
 glad+0.018
 much+0.017
e+0.015
 afraid+0.015
 ost+0.014
Neg logit contributions
ware-0.045
rily-0.045
ge-0.043
ro-0.043
bell-0.042
Tokenhes
Token ID956
Feature activation-1.45e+00

Pos logit contributions
 glad+0.069
 ost+0.068
 lives+0.063
ow+0.063
 Thunder+0.060
Neg logit contributions
rily-0.188
ware-0.186
ro-0.169
driver-0.169
fully-0.168
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
 glad+0.028
 ost+0.026
ow+0.026
e+0.025
 much+0.025
Neg logit contributions
rily-0.033
ware-0.033
ro-0.031
bell-0.030
ge-0.030
Token Lily
Token ID20037
Feature activation-7.93e-01

Pos logit contributions
 glad+0.020
ow+0.020
e+0.019
 ost+0.019
 much+0.018
Neg logit contributions
rily-0.036
ware-0.036
ro-0.034
bell-0.033
ge-0.032
Token and
Token ID290
Feature activation+2.05e+00

Pos logit contributions
 glad+0.015
ow+0.014
e+0.014
 ost+0.014
 much+0.014
Neg logit contributions
ware-0.019
rily-0.019
ro-0.017
ge-0.017
bell-0.016
Token Ben
Token ID3932
Feature activation-6.06e-01

Pos logit contributions
ware+0.047
ro+0.045
rily+0.044
ge+0.040
driver+0.040
Neg logit contributions
 glad-0.038
 much-0.036
adow-0.036
e-0.035
aks-0.034
Token the
Token ID262
Feature activation-3.42e+00

Pos logit contributions
 glad+0.030
ow+0.029
 ost+0.028
e+0.027
 much+0.026
Neg logit contributions
ware-0.089
rily-0.088
ro-0.083
ge-0.082
bell-0.081
Token water
Token ID1660
Feature activation-1.10e+00

Pos logit contributions
ow+0.065
e+0.062
 glad+0.060
 ost+0.058
 much+0.055
Neg logit contributions
ware-0.079
rily-0.076
ro-0.074
bell-0.071
yt-0.068
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
 glad+0.018
ow+0.018
 much+0.018
e+0.017
 ost+0.017
Neg logit contributions
ware-0.028
rily-0.028
ge-0.026
bell-0.025
ro-0.025
Token forget
Token ID6044
Feature activation-1.87e+00

Pos logit contributions
 glad+0.019
ow+0.018
 ost+0.018
e+0.018
 much+0.017
Neg logit contributions
ware-0.026
rily-0.025
ro-0.024
ge-0.023
bell-0.023
Token about
Token ID546
Feature activation-2.72e+00

Pos logit contributions
 glad+0.027
ow+0.026
e+0.026
 ost+0.026
 afraid+0.024
Neg logit contributions
rily-0.051
ware-0.051
ro-0.047
ge-0.047
bell-0.046
Token feeling
Token ID4203
Feature activation-6.43e+00

Pos logit contributions
ow+0.034
 glad+0.033
e+0.033
 ost+0.032
 much+0.030
Neg logit contributions
ware-0.079
rily-0.078
ge-0.074
ro-0.074
bell-0.070
Token uncomfortable
Token ID12916
Feature activation-4.14e-01

Pos logit contributions
ow+0.091
 ost+0.088
elope+0.081
e+0.078
opl+0.074
Neg logit contributions
rily-0.176
ware-0.167
ro-0.159
ge-0.155
bell-0.150
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
 glad+0.006
 much+0.006
ow+0.006
e+0.006
 ost+0.006
Neg logit contributions
ware-0.011
rily-0.011
bell-0.011
ge-0.010
ro-0.010
Token<|endoftext|>
Token ID50256
Feature activation-1.30e+00

Pos logit contributions
 ost+0.026
ow+0.026
e+0.025
 afraid+0.025
 glad+0.024
Neg logit contributions
ro-0.028
ware-0.027
rily-0.027
bell-0.026
ge-0.025
TokenOnce
Token ID7454
Feature activation-7.00e-01

Pos logit contributions
 glad+0.019
ow+0.019
 ost+0.018
e+0.018
 much+0.018
Neg logit contributions
ware-0.035
rily-0.035
ro-0.033
ge-0.032
bell-0.032
Token upon
Token ID2402
Feature activation-1.57e+00

Pos logit contributions
 glad+0.018
 ost+0.017
ow+0.017
e+0.017
 much+0.017
Neg logit contributions
ware-0.012
rily-0.012
ro-0.010
ge-0.010
bell-0.010
Token of
Token ID286
Feature activation-1.76e+00

Pos logit contributions
 glad+0.019
 much+0.018
e+0.018
ow+0.018
 ost+0.017
Neg logit contributions
ware-0.034
rily-0.034
ro-0.031
ge-0.031
bell-0.031
Token water
Token ID1660
Feature activation+4.46e-01

Pos logit contributions
 glad+0.031
ow+0.031
 ost+0.030
e+0.030
 afraid+0.028
Neg logit contributions
ware-0.043
rily-0.042
ro-0.039
ge-0.038
bell-0.037
Token from
Token ID422
Feature activation-3.95e+00

Pos logit contributions
ware+0.011
rily+0.011
ge+0.011
bell+0.011
ro+0.010
Neg logit contributions
 much-0.007
ow-0.007
 glad-0.007
e-0.007
 very-0.007
Token the
Token ID262
Feature activation-1.50e+00

Pos logit contributions
 glad+0.045
ow+0.042
 ost+0.042
e+0.040
 much+0.039
Neg logit contributions
ware-0.123
rily-0.122
ro-0.115
ge-0.113
bell-0.111
Token morning
Token ID3329
Feature activation-2.61e-01

Pos logit contributions
 glad+0.025
ow+0.024
e+0.024
 ost+0.023
 much+0.022
Neg logit contributions
ware-0.038
rily-0.037
ro-0.036
bell-0.034
ge-0.034
Token de
Token ID390
Feature activation-6.42e+00

Pos logit contributions
e+0.005
 glad+0.005
 much+0.005
ow+0.005
 ost+0.005
Neg logit contributions
ware-0.006
rily-0.006
ge-0.006
ro-0.005
bell-0.005
Tokenw
Token ID86
Feature activation-9.40e-01

Pos logit contributions
 glad+0.098
 ost+0.096
 Thunder+0.092
ow+0.091
 Lightning+0.090
Neg logit contributions
ware-0.172
rily-0.165
ge-0.164
pron-0.157
ro-0.156
Token.
Token ID13
Feature activation-3.19e-01

Pos logit contributions
ow+0.019
e+0.019
 glad+0.019
 much+0.019
 ost+0.018
Neg logit contributions
ware-0.020
rily-0.019
bell-0.018
ro-0.018
ge-0.018
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
 ost+0.005
 glad+0.005
ow+0.005
e+0.005
 afraid+0.005
Neg logit contributions
ware-0.008
rily-0.008
ro-0.008
ge-0.007
bell-0.007
Token
Token ID198
Feature activation+1.75e+00

Pos logit contributions
ware+0.038
rily+0.038
ro+0.036
ge+0.035
bell+0.034
Neg logit contributions
 glad-0.022
ow-0.021
 ost-0.021
e-0.020
 much-0.020
TokenThen
Token ID6423
Feature activation-1.60e+00

Pos logit contributions
ware+0.036
rily+0.035
ro+0.034
ge+0.033
bell+0.031
Neg logit contributions
 glad-0.038
ow-0.037
 ost-0.037
e-0.036
 afraid-0.035
Token and
Token ID290
Feature activation+3.93e-01

Pos logit contributions
ware+0.026
rily+0.025
ro+0.024
ge+0.023
bell+0.023
Neg logit contributions
 glad-0.023
 ost-0.023
ow-0.022
 much-0.022
e-0.022
Token his
Token ID465
Feature activation+1.35e+00

Pos logit contributions
ware+0.008
rily+0.008
ro+0.008
bell+0.007
ge+0.007
Neg logit contributions
 glad-0.008
ow-0.008
e-0.008
 ost-0.007
 much-0.007
Token family
Token ID1641
Feature activation-3.27e-01

Pos logit contributions
ware+0.035
rily+0.034
ro+0.033
bell+0.031
ge+0.031
Neg logit contributions
e-0.021
 glad-0.021
ow-0.021
 ost-0.020
 much-0.020
Token lived
Token ID5615
Feature activation-9.30e-01

Pos logit contributions
 glad+0.006
ow+0.006
 ost+0.006
e+0.005
 much+0.005
Neg logit contributions
ware-0.008
rily-0.008
ro-0.007
ge-0.007
bell-0.007
Token in
Token ID287
Feature activation-4.25e+00

Pos logit contributions
 glad+0.014
 ost+0.014
ow+0.014
e+0.013
 Thunder+0.013
Neg logit contributions
ware-0.024
rily-0.024
ro-0.023
ge-0.023
bell-0.022
Token before
Token ID878
Feature activation-6.41e+00

Pos logit contributions
ow+0.064
 ost+0.063
 glad+0.060
aw+0.059
adow+0.057
Neg logit contributions
ware-0.104
ge-0.102
rily-0.101
ro-0.098
bell-0.097
Token.
Token ID13
Feature activation-2.08e-01

Pos logit contributions
 glad+0.120
ow+0.119
 ost+0.116
e+0.115
 afraid+0.106
Neg logit contributions
ware-0.146
rily-0.142
ge-0.134
bell-0.134
ro-0.132
Token Still
Token ID7831
Feature activation-9.06e-01

Pos logit contributions
 glad+0.004
 ost+0.004
e+0.004
ow+0.004
 afraid+0.003
Neg logit contributions
ware-0.005
rily-0.005
bell-0.005
ro-0.005
driver-0.004
Token,
Token ID11
Feature activation+3.22e-01

Pos logit contributions
ang+0.020
 ost+0.019
e+0.019
 glad+0.019
ow+0.018
Neg logit contributions
ware-0.015
ge-0.014
ro-0.014
fully-0.013
driver-0.013
Token he
Token ID339
Feature activation+7.65e-01

Pos logit contributions
ware+0.006
bell+0.005
rily+0.005
fully+0.005
ge+0.005
Neg logit contributions
ow-0.007
adow-0.007
e-0.007
 glad-0.007
 ost-0.007
Token had
Token ID550
Feature activation-5.59e-01

Pos logit contributions
ware+0.018
rily+0.018
ro+0.017
ge+0.016
bell+0.016
Neg logit contributions
 glad-0.014
 ost-0.014
ow-0.014
e-0.014
 much-0.013
Token higher
Token ID2440
Feature activation-9.78e-01

Pos logit contributions
 much+0.002
 glad+0.002
e+0.002
ow+0.002
 ost+0.002
Neg logit contributions
ware-0.004
rily-0.004
ro-0.004
ge-0.004
bell-0.004
Token and
Token ID290
Feature activation-3.01e+00

Pos logit contributions
 ost+0.016
 very+0.016
 much+0.016
 glad+0.016
e+0.015
Neg logit contributions
ware-0.025
rily-0.024
bell-0.023
driver-0.022
pron-0.022
Token faster
Token ID5443
Feature activation-4.39e-01

Pos logit contributions
 glad+0.059
ow+0.057
 ost+0.057
e+0.056
 much+0.056
Neg logit contributions
ware-0.068
rily-0.067
ro-0.062
ge-0.061
bell-0.060
Token than
Token ID621
Feature activation-2.14e+00

Pos logit contributions
 much+0.006
 glad+0.006
e+0.006
 ost+0.006
 very+0.006
Neg logit contributions
ware-0.013
rily-0.012
bell-0.012
pron-0.012
ge-0.011
Token ever
Token ID1683
Feature activation-2.69e+00

Pos logit contributions
 glad+0.036
 much+0.034
ow+0.034
 ost+0.034
e+0.033
Neg logit contributions
rily-0.056
ware-0.056
ro-0.051
ge-0.050
bell-0.050
Token before
Token ID878
Feature activation-6.40e+00

Pos logit contributions
 glad+0.051
 much+0.049
ow+0.049
e+0.049
 ost+0.049
Neg logit contributions
ware-0.065
rily-0.064
ro-0.058
bell-0.057
driver-0.057
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
 glad+0.112
ow+0.108
 ost+0.107
e+0.106
 much+0.104
Neg logit contributions
ware-0.158
rily-0.157
ro-0.146
ge-0.144
bell-0.141
Token So
Token ID1406
Feature activation-1.31e+00

Pos logit contributions
 ost+0.019
 glad+0.019
e+0.018
ow+0.018
 afraid+0.018
Neg logit contributions
rily-0.023
ware-0.022
ro-0.022
bell-0.020
fully-0.020
Token it
Token ID340
Feature activation-1.59e-01

Pos logit contributions
opl+0.029
ow+0.028
 ost+0.028
 glad+0.027
adow+0.027
Neg logit contributions
rily-0.025
ware-0.024
fully-0.023
ro-0.022
yt-0.022
Token hopped
Token ID45230
Feature activation+2.37e+00

Pos logit contributions
 glad+0.002
 ost+0.002
ow+0.002
e+0.002
 much+0.002
Neg logit contributions
ware-0.004
rily-0.004
ro-0.004
ge-0.004
bell-0.004
Token on
Token ID319
Feature activation-2.66e+00

Pos logit contributions
ware+0.063
rily+0.060
ge+0.057
ro+0.056
bell+0.056
Neg logit contributions
 glad-0.039
 much-0.038
ow-0.037
e-0.036
 ost-0.036
Token and
Token ID290
Feature activation+1.58e-01

Pos logit contributions
 glad+0.035
ow+0.035
 much+0.034
e+0.034
 ost+0.033
Neg logit contributions
ware-0.042
rily-0.041
ge-0.038
ro-0.038
bell-0.036
Token run
Token ID1057
Feature activation+2.41e+00

Pos logit contributions
ware+0.003
rily+0.003
ro+0.003
bell+0.003
ge+0.003
Neg logit contributions
 glad-0.004
 much-0.003
 ost-0.003
ow-0.003
e-0.003
Token to
Token ID284
Feature activation+7.63e-01

Pos logit contributions
ware+0.057
rily+0.055
ge+0.049
pron+0.049
bell+0.048
Neg logit contributions
ow-0.047
 much-0.046
 ost-0.046
 glad-0.046
e-0.045
Token the
Token ID262
Feature activation-5.96e-02

Pos logit contributions
rily+0.022
ware+0.022
ge+0.021
ro+0.021
bell+0.020
Neg logit contributions
 glad-0.010
ow-0.009
 ost-0.009
e-0.009
 much-0.009
Token ost
Token ID23619
Feature activation-7.17e-01

Pos logit contributions
 glad+0.001
ow+0.001
e+0.001
 much+0.001
 ost+0.001
Neg logit contributions
ware-0.001
rily-0.001
ro-0.001
bell-0.001
ge-0.001
Tokenric
Token ID1173
Feature activation-6.39e+00

Pos logit contributions
 glad+0.009
 much+0.009
 sorry+0.008
 afraid+0.008
 Lightning+0.008
Neg logit contributions
ware-0.020
rily-0.020
ro-0.020
ge-0.020
bell-0.019
Tokenhes
Token ID956
Feature activation-2.03e+00

Pos logit contributions
 ost+0.081
 glad+0.081
 lives+0.072
 haste+0.070
elope+0.070
Neg logit contributions
rily-0.173
ware-0.172
ro-0.159
driver-0.154
fully-0.154
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
 glad+0.037
 ost+0.035
ow+0.035
e+0.034
 much+0.034
Neg logit contributions
ware-0.049
rily-0.049
ro-0.045
ge-0.044
bell-0.044
Token
Token ID198
Feature activation-3.68e-01

Pos logit contributions
 glad+0.024
e+0.024
ow+0.023
 ost+0.023
 much+0.022
Neg logit contributions
ware-0.036
rily-0.036
ro-0.035
bell-0.033
ge-0.032
Token
Token ID198
Feature activation+2.32e-02

Pos logit contributions
 glad+0.006
ow+0.006
 ost+0.006
e+0.006
 afraid+0.006
Neg logit contributions
ware-0.009
rily-0.009
ro-0.008
ge-0.008
bell-0.008
Token"
Token ID1
Feature activation+3.56e+00

Pos logit contributions
ro+0.000
ge+0.000
rily+0.000
ware+0.000
bell+0.000
Neg logit contributions
ow-0.001
 glad-0.001
 ost-0.001
adow-0.001
 Thunder-0.001
Tokenie
Token ID494
Feature activation-1.16e+00

Pos logit contributions
ow+0.002
 glad+0.002
e+0.002
 ost+0.002
 much+0.002
Neg logit contributions
ware-0.002
rily-0.002
ro-0.002
bell-0.002
ge-0.002
Token.
Token ID13
Feature activation-4.61e-01

Pos logit contributions
 glad+0.027
 ost+0.025
ow+0.025
 Lightning+0.025
 afraid+0.024
Neg logit contributions
rily-0.021
ware-0.020
ro-0.019
ge-0.018
fully-0.018
Token Unlike
Token ID12101
Feature activation+7.71e-02

Pos logit contributions
 glad+0.009
e+0.009
 ost+0.008
ow+0.008
 afraid+0.008
Neg logit contributions
ware-0.010
ro-0.010
rily-0.010
bell-0.009
ge-0.009
Token other
Token ID584
Feature activation+7.49e-01

Pos logit contributions
ware+0.002
ge+0.002
rily+0.002
ro+0.002
fully+0.002
Neg logit contributions
aw-0.001
 Lightning-0.001
e-0.001
 glad-0.001
ow-0.001
Token ost
Token ID23619
Feature activation-2.17e+00

Pos logit contributions
ware+0.019
rily+0.019
ro+0.018
ge+0.018
yt+0.018
Neg logit contributions
 glad-0.011
e-0.010
 much-0.010
ow-0.010
 Lightning-0.009
Tokenric
Token ID1173
Feature activation-6.39e+00

Pos logit contributions
 glad+0.023
 much+0.022
 afraid+0.020
 ost+0.020
 Lightning+0.019
Neg logit contributions
ware-0.066
rily-0.065
ge-0.064
ro-0.064
bell-0.062
Tokenhes
Token ID956
Feature activation-1.78e+00

Pos logit contributions
 ost+0.075
 glad+0.066
 lives+0.062
ow+0.061
elope+0.059
Neg logit contributions
rily-0.186
ware-0.186
ro-0.172
fully-0.172
driver-0.168
Token,
Token ID11
Feature activation+1.63e+00

Pos logit contributions
 glad+0.033
 ost+0.029
ow+0.028
 afraid+0.027
 Lightning+0.027
Neg logit contributions
ware-0.043
rily-0.041
ro-0.041
ge-0.038
fully-0.037
Token his
Token ID465
Feature activation+1.40e+00

Pos logit contributions
ware+0.036
rily+0.035
ro+0.033
bell+0.032
ge+0.031
Neg logit contributions
e-0.031
ow-0.031
 glad-0.029
opl-0.028
adow-0.028
Token feathers
Token ID27400
Feature activation-2.86e-01

Pos logit contributions
ware+0.033
rily+0.032
ro+0.031
bell+0.029
ge+0.029
Neg logit contributions
 glad-0.024
e-0.023
 much-0.023
ow-0.022
 Lightning-0.022
Token gl
Token ID1278
Feature activation+3.06e+00

Pos logit contributions
 glad+0.005
ow+0.005
 ost+0.005
e+0.004
 much+0.004
Neg logit contributions
ware-0.007
rily-0.007
ro-0.007
ge-0.007
bell-0.007
Token apples
Token ID22514
Feature activation-4.98e-02

Pos logit contributions
 glad+0.017
ow+0.016
 ost+0.016
e+0.016
 much+0.016
Neg logit contributions
ware-0.017
rily-0.017
ro-0.015
ge-0.015
bell-0.015
Token to
Token ID284
Feature activation+1.16e+00

Pos logit contributions
ow+0.001
 very+0.001
 much+0.001
 glad+0.001
 ost+0.001
Neg logit contributions
ware-0.001
rily-0.001
ro-0.001
bell-0.001
yt-0.001
Token Mrs
Token ID9074
Feature activation-5.20e-01

Pos logit contributions
ware+0.030
rily+0.030
ro+0.028
ge+0.028
yt+0.027
Neg logit contributions
 glad-0.019
 much-0.019
e-0.019
ow-0.018
 ost-0.018
Token.
Token ID13
Feature activation+3.34e+00

Pos logit contributions
 glad+0.008
ow+0.008
 ost+0.008
e+0.008
 much+0.007
Neg logit contributions
rily-0.014
ware-0.014
ro-0.013
ge-0.013
bell-0.012
Token Lee
Token ID5741
Feature activation-1.00e+00

Pos logit contributions
rily+0.088
ware+0.087
ge+0.081
uing+0.078
fully+0.076
Neg logit contributions
 glad-0.056
 Lightning-0.052
 Thunder-0.052
 much-0.052
 afraid-0.051
Token,
Token ID11
Feature activation+4.75e+00

Pos logit contributions
ow+0.016
 glad+0.016
 much+0.015
 very+0.014
e+0.014
Neg logit contributions
rily-0.027
ware-0.026
ro-0.026
ge-0.024
yt-0.024
Token who
Token ID508
Feature activation-2.34e-01

Pos logit contributions
ware+0.118
rily+0.116
ro+0.108
ge+0.107
bell+0.103
Neg logit contributions
 glad-0.090
 much-0.083
ow-0.082
 ost-0.081
e-0.080
Token lived
Token ID5615
Feature activation-1.42e-03

Pos logit contributions
 glad+0.004
 ost+0.004
ow+0.004
e+0.004
 much+0.004
Neg logit contributions
ware-0.005
rily-0.005
ro-0.005
bell-0.005
ge-0.005
Token across
Token ID1973
Feature activation-3.24e+00

Pos logit contributions
 glad+0.000
 much+0.000
e+0.000
ow+0.000
 very+0.000
Neg logit contributions
ware-0.000
rily-0.000
ro-0.000
ge-0.000
bell-0.000
Token the
Token ID262
Feature activation-2.00e+00

Pos logit contributions
 glad+0.038
ow+0.038
 ost+0.037
adow+0.033
e+0.033
Neg logit contributions
ware-0.097
rily-0.094
ro-0.090
bell-0.088
ge-0.087
Token street
Token ID4675
Feature activation-1.99e+00

Pos logit contributions
ow+0.036
 glad+0.035
e+0.034
 ost+0.033
 afraid+0.032
Neg logit contributions
ware-0.048
rily-0.047
ro-0.046
bell-0.044
pron-0.042
Token girl
Token ID2576
Feature activation-1.15e+00

Pos logit contributions
ware+0.012
rily+0.012
ro+0.011
ge+0.011
bell+0.010
Neg logit contributions
 glad-0.014
ow-0.014
 ost-0.014
e-0.014
 much-0.013
Token nodded
Token ID14464
Feature activation+2.28e+00

Pos logit contributions
 glad+0.022
ow+0.021
 ost+0.021
e+0.021
 much+0.020
Neg logit contributions
ware-0.026
rily-0.026
ro-0.024
ge-0.024
bell-0.023
Token and
Token ID290
Feature activation-4.77e-01

Pos logit contributions
ware+0.066
ge+0.061
ro+0.061
rily+0.061
driver+0.060
Neg logit contributions
 very-0.035
 glad-0.034
 much-0.032
ow-0.032
e-0.032
Token said
Token ID531
Feature activation+1.72e+00

Pos logit contributions
 glad+0.010
 ost+0.010
ow+0.010
e+0.009
opl+0.009
Neg logit contributions
ware-0.010
rily-0.010
ro-0.009
ge-0.009
bell-0.009
Token,
Token ID11
Feature activation+3.40e+00

Pos logit contributions
ware+0.047
rily+0.047
ro+0.044
ge+0.044
bell+0.043
Neg logit contributions
 glad-0.026
ow-0.024
 ost-0.024
e-0.024
 much-0.024
Token "
Token ID366
Feature activation+5.14e+00

Pos logit contributions
ware+0.088
rily+0.085
ge+0.084
ro+0.081
yt+0.077
Neg logit contributions
 glad-0.061
 much-0.055
 very-0.053
ow-0.053
 ost-0.052
TokenOkay
Token ID16454
Feature activation+1.35e+00

Pos logit contributions
ware+0.129
rily+0.126
ge+0.116
bell+0.111
driver+0.111
Neg logit contributions
 glad-0.095
e-0.091
 much-0.088
 ost-0.086
 Thunder-0.085
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
ware+0.041
rily+0.039
ge+0.038
ro+0.038
driver+0.035
Neg logit contributions
 glad-0.018
 much-0.017
ow-0.017
 very-0.016
 sorry-0.016
Token I
Token ID314
Feature activation+2.40e+00

Pos logit contributions
 glad+0.024
 ost+0.024
ow+0.023
e+0.023
 much+0.023
Neg logit contributions
ware-0.026
rily-0.025
ro-0.024
bell-0.023
ge-0.023
Token don
Token ID836
Feature activation-1.15e-01

Pos logit contributions
rily+0.058
ware+0.058
ge+0.050
ro+0.050
fully+0.048
Neg logit contributions
 glad-0.047
ow-0.047
e-0.046
 much-0.045
 afraid-0.044
Token't
Token ID470
Feature activation+5.08e-01

Pos logit contributions
 glad+0.003
ow+0.003
 ost+0.003
e+0.003
 much+0.003
Neg logit contributions
ware-0.002
rily-0.002
ro-0.002
ge-0.002
bell-0.002
Token.
Token ID13
Feature activation-1.90e-01

Pos logit contributions
rily+0.007
ware+0.007
ro+0.006
bell+0.006
ge+0.006
Neg logit contributions
 glad-0.004
 much-0.004
 ost-0.004
ow-0.004
e-0.004
Token She
Token ID1375
Feature activation+1.12e+00

Pos logit contributions
 glad+0.002
 much+0.002
ow+0.002
 Lightning+0.002
 Thunder+0.002
Neg logit contributions
ware-0.006
rily-0.006
ge-0.006
ro-0.005
yt-0.005
Token asked
Token ID1965
Feature activation+1.76e+00

Pos logit contributions
rily+0.025
ware+0.025
ro+0.023
ge+0.022
bell+0.022
Neg logit contributions
 glad-0.023
ow-0.022
e-0.022
 ost-0.022
 much-0.022
Token her
Token ID607
Feature activation+1.43e+00

Pos logit contributions
ware+0.045
rily+0.044
ro+0.041
fully+0.040
bell+0.040
Neg logit contributions
 glad-0.030
 ost-0.029
e-0.028
ow-0.028
 afraid-0.027
Token mom
Token ID1995
Feature activation+1.16e+00

Pos logit contributions
rily+0.037
ware+0.037
ro+0.035
ge+0.034
bell+0.033
Neg logit contributions
 glad-0.025
 ost-0.023
ow-0.023
 much-0.023
e-0.022
Token,
Token ID11
Feature activation+4.92e+00

Pos logit contributions
rily+0.033
ware+0.032
ro+0.031
ge+0.029
yt+0.029
Neg logit contributions
 glad-0.018
 much-0.018
ow-0.018
 very-0.017
 many-0.016
Token "
Token ID366
Feature activation+5.41e+00

Pos logit contributions
ware+0.130
rily+0.129
ro+0.122
ge+0.118
bell+0.116
Neg logit contributions
 glad-0.093
 much-0.080
ow-0.080
 very-0.080
 afraid-0.079
TokenMom
Token ID29252
Feature activation+1.20e-01

Pos logit contributions
rily+0.141
ware+0.134
ro+0.130
yt+0.126
uing+0.126
Neg logit contributions
e-0.095
 very-0.081
 Fire-0.080
 ost-0.079
 middle-0.079
Tokenmy
Token ID1820
Feature activation-4.75e-01

Pos logit contributions
rily+0.004
ware+0.004
ro+0.003
ge+0.003
driver+0.003
Neg logit contributions
ow-0.002
 glad-0.002
e-0.002
 very-0.002
 much-0.002
Token,
Token ID11
Feature activation+3.08e+00

Pos logit contributions
 glad+0.006
ow+0.006
e+0.006
 ost+0.005
 much+0.005
Neg logit contributions
ware-0.014
rily-0.014
ro-0.013
ge-0.013
bell-0.013
Token can
Token ID460
Feature activation+3.02e+00

Pos logit contributions
rily+0.082
ware+0.080
ro+0.077
ge+0.075
bell+0.071
Neg logit contributions
 glad-0.053
 ost-0.048
 much-0.048
ow-0.048
 afraid-0.046
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
ware+0.065
rily+0.065
ro+0.061
ge+0.061
bell+0.060
Neg logit contributions
 glad-0.021
 ost-0.019
ow-0.019
e-0.018
 much-0.018
Token child
Token ID1200
Feature activation-4.05e-01

Pos logit contributions
ware+0.025
rily+0.024
ro+0.023
bell+0.022
ge+0.022
Neg logit contributions
ow-0.026
 glad-0.026
e-0.026
 ost-0.026
 much-0.025
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
 glad+0.007
ow+0.007
 ost+0.007
e+0.007
 much+0.007
Neg logit contributions
ware-0.010
rily-0.010
ro-0.009
ge-0.009
bell-0.009
Token\n
Token ID198
Feature activation+2.06e+00

Pos logit contributions
ware+0.029
rily+0.029
ro+0.027
ge+0.027
bell+0.026
Neg logit contributions
 glad-0.015
ow-0.014
 ost-0.014
e-0.014
 much-0.014
Token\n
Token ID198
Feature activation+2.37e+00

Pos logit contributions
ware+0.059
rily+0.058
ro+0.054
ge+0.054
bell+0.053
Neg logit contributions
 glad-0.029
ow-0.027
e-0.027
 ost-0.027
 much-0.027
Token"
Token ID1
Feature activation+4.53e+00

Pos logit contributions
ware+0.058
rily+0.058
ro+0.053
ge+0.052
bell+0.051
Neg logit contributions
 glad-0.042
e-0.041
ow-0.041
 ost-0.040
 much-0.040
TokenChild
Token ID16424
Feature activation-1.42e-01

Pos logit contributions
rily+0.120
ware+0.116
ge+0.107
ro+0.106
yt+0.103
Neg logit contributions
 glad-0.077
 much-0.074
 ost-0.073
 afraid-0.072
ow-0.070
Token,
Token ID11
Feature activation+3.41e+00

Pos logit contributions
 glad+0.002
ow+0.002
 ost+0.002
e+0.002
 much+0.002
Neg logit contributions
ware-0.004
rily-0.004
ro-0.003
ge-0.003
bell-0.003
Token I
Token ID314
Feature activation+2.35e+00

Pos logit contributions
ware+0.086
rily+0.085
ro+0.080
ge+0.078
bell+0.077
Neg logit contributions
 glad-0.057
ow-0.055
 ost-0.055
e-0.054
 much-0.053
Token regret
Token ID13721
Feature activation+1.78e-01

Pos logit contributions
rily+0.064
ware+0.063
ro+0.058
ge+0.057
bell+0.056
Neg logit contributions
 glad-0.038
ow-0.037
e-0.036
 much-0.035
 afraid-0.035
Token many
Token ID867
Feature activation-8.80e-01

Pos logit contributions
ware+0.005
rily+0.005
ro+0.005
ge+0.005
bell+0.004
Neg logit contributions
 glad-0.003
ow-0.002
 ost-0.002
e-0.002
 much-0.002
TokenShe
Token ID3347
Feature activation+1.22e+00

Pos logit contributions
ware+0.059
rily+0.056
ge+0.050
ro+0.049
driver+0.048
Neg logit contributions
 glad-0.045
 much-0.044
e-0.042
 very-0.041
 ost-0.041
Token asked
Token ID1965
Feature activation+1.88e+00

Pos logit contributions
ware+0.026
rily+0.025
ro+0.023
ge+0.023
bell+0.023
Neg logit contributions
 glad-0.026
ow-0.025
 ost-0.025
e-0.025
 much-0.024
Token her
Token ID607
Feature activation+1.66e+00

Pos logit contributions
ware+0.047
rily+0.046
bell+0.043
ro+0.042
fully+0.042
Neg logit contributions
 glad-0.031
 ost-0.031
e-0.031
ow-0.030
 afraid-0.029
Token mom
Token ID1995
Feature activation+1.76e+00

Pos logit contributions
ware+0.039
rily+0.038
ro+0.035
bell+0.035
ge+0.035
Neg logit contributions
 glad-0.031
e-0.030
ow-0.030
 ost-0.030
 much-0.028
Tokenmy
Token ID1820
Feature activation+1.25e+00

Pos logit contributions
rily+0.053
ware+0.051
ro+0.050
ge+0.046
yt+0.045
Neg logit contributions
 glad-0.026
ow-0.026
 very-0.026
 much-0.026
 many-0.023
Token,
Token ID11
Feature activation+4.92e+00

Pos logit contributions
rily+0.035
ware+0.035
ro+0.034
ge+0.032
bell+0.031
Neg logit contributions
 glad-0.019
ow-0.018
 much-0.017
 ost-0.016
 very-0.016
Token "
Token ID366
Feature activation+5.49e+00

Pos logit contributions
ware+0.129
rily+0.128
ro+0.121
ge+0.117
bell+0.113
Neg logit contributions
 glad-0.093
 much-0.082
 very-0.080
ow-0.080
 afraid-0.080
TokenWhat
Token ID2061
Feature activation+1.64e+00

Pos logit contributions
rily+0.153
ware+0.145
ro+0.143
ge+0.141
uing+0.134
Neg logit contributions
 Fire-0.083
 ost-0.082
e-0.081
 mist-0.079
 named-0.078
Token is
Token ID318
Feature activation-1.48e+00

Pos logit contributions
ware+0.040
bell+0.037
rily+0.037
driver+0.036
ro+0.036
Neg logit contributions
 glad-0.026
 ost-0.025
ow-0.024
opl-0.023
ang-0.023
Token that
Token ID326
Feature activation+1.03e+00

Pos logit contributions
opl+0.028
adow+0.028
e+0.027
ow+0.027
 glad+0.027
Neg logit contributions
ware-0.035
rily-0.032
driver-0.029
bell-0.029
fully-0.028
Token big
Token ID1263
Feature activation+1.06e+00

Pos logit contributions
ware+0.024
rily+0.023
ro+0.022
bell+0.022
ge+0.021
Neg logit contributions
 glad-0.019
e-0.018
ow-0.018
 ost-0.018
 much-0.017
Token't
Token ID470
Feature activation-7.89e-01

Pos logit contributions
ware+0.002
rily+0.002
ro+0.002
bell+0.002
ge+0.002
Neg logit contributions
 glad-0.003
 ost-0.003
ow-0.003
e-0.003
 afraid-0.002
Token stop
Token ID2245
Feature activation-2.25e+00

Pos logit contributions
e+0.013
ow+0.012
 much+0.012
 glad+0.012
 very+0.012
Neg logit contributions
ware-0.022
rily-0.022
ro-0.019
driver-0.019
clips-0.019
Token crying
Token ID13774
Feature activation+1.24e+00

Pos logit contributions
 glad+0.028
ow+0.026
 ost+0.026
e+0.026
 much+0.025
Neg logit contributions
ware-0.068
rily-0.067
ro-0.063
ge-0.062
bell-0.061
Token.
Token ID13
Feature activation-3.49e-01

Pos logit contributions
ware+0.033
rily+0.032
ro+0.030
bell+0.030
ge+0.029
Neg logit contributions
 much-0.020
 glad-0.020
ow-0.019
e-0.019
 afraid-0.018
Token He
Token ID679
Feature activation+1.34e+00

Pos logit contributions
 ost+0.006
e+0.006
 glad+0.006
ow+0.006
 much+0.006
Neg logit contributions
rily-0.008
ware-0.008
ro-0.008
bell-0.007
ge-0.007
Token missed
Token ID6825
Feature activation+6.78e-01

Pos logit contributions
ware+0.030
rily+0.030
ro+0.028
bell+0.028
ge+0.027
Neg logit contributions
 glad-0.026
 ost-0.025
ow-0.025
e-0.024
 much-0.024
Token his
Token ID465
Feature activation+2.55e+00

Pos logit contributions
ware+0.019
rily+0.018
ro+0.017
ge+0.017
bell+0.017
Neg logit contributions
 glad-0.010
ow-0.010
 ost-0.009
e-0.009
 much-0.009
Token cap
Token ID1451
Feature activation+1.43e+00

Pos logit contributions
rily+0.062
ware+0.058
ge+0.054
fully+0.052
driver+0.052
Neg logit contributions
 glad-0.050
ow-0.047
 much-0.047
e-0.046
 ost-0.046
Token and
Token ID290
Feature activation+5.15e-01

Pos logit contributions
ware+0.040
rily+0.038
ge+0.035
ro+0.034
yt+0.034
Neg logit contributions
 much-0.023
ow-0.023
 very-0.022
 glad-0.021
e-0.021
Token felt
Token ID2936
Feature activation-3.65e+00

Pos logit contributions
ware+0.013
rily+0.013
ro+0.012
ge+0.012
bell+0.011
Neg logit contributions
ow-0.009
 glad-0.009
 ost-0.009
e-0.008
 much-0.008
Token powerless
Token ID34209
Feature activation+2.34e+00

Pos logit contributions
 ost+0.048
ow+0.045
elope+0.040
e+0.040
adow+0.038
Neg logit contributions
rily-0.107
ware-0.100
ge-0.095
ro-0.095
bell-0.095
Token clay
Token ID21558
Feature activation+1.00e-01

Pos logit contributions
ge+0.009
ro+0.009
rily+0.009
ware+0.009
bell+0.008
Neg logit contributions
 glad-0.005
 very-0.005
 much-0.005
 afraid-0.005
 sorry-0.005
Token.
Token ID13
Feature activation+3.51e-02

Pos logit contributions
ware+0.002
rily+0.002
ro+0.002
ge+0.002
bell+0.002
Neg logit contributions
ow-0.002
 glad-0.002
e-0.002
 much-0.002
 ost-0.002
Token He
Token ID679
Feature activation+1.65e+00

Pos logit contributions
ware+0.001
rily+0.001
ro+0.001
ge+0.001
bell+0.001
Neg logit contributions
 glad-0.001
ow-0.001
 ost-0.001
 much-0.001
e-0.001
Token mixed
Token ID7668
Feature activation+1.73e+00

Pos logit contributions
rily+0.040
ware+0.040
ro+0.036
ge+0.036
bell+0.035
Neg logit contributions
 glad-0.029
e-0.029
 much-0.028
ow-0.028
 ost-0.027
Token it
Token ID340
Feature activation+4.89e-02

Pos logit contributions
ware+0.052
rily+0.051
bell+0.050
ro+0.050
ge+0.049
Neg logit contributions
 much-0.021
 very-0.019
 many-0.019
e-0.019
 glad-0.019
Token together
Token ID1978
Feature activation+9.64e-01

Pos logit contributions
ware+0.001
rily+0.001
ro+0.001
ge+0.001
bell+0.001
Neg logit contributions
 much-0.001
 glad-0.001
ow-0.001
e-0.001
 very-0.001
Token and
Token ID290
Feature activation-6.53e-01

Pos logit contributions
ware+0.024
rily+0.024
bell+0.023
ro+0.022
yt+0.022
Neg logit contributions
 much-0.017
 very-0.016
 glad-0.016
e-0.015
 many-0.015
Token formed
Token ID7042
Feature activation+1.46e-01

Pos logit contributions
 glad+0.012
ow+0.012
e+0.011
 ost+0.011
 much+0.011
Neg logit contributions
ware-0.016
rily-0.015
ro-0.014
ge-0.014
bell-0.014
Token it
Token ID340
Feature activation-1.20e+00

Pos logit contributions
ware+0.005
rily+0.005
ro+0.005
ge+0.005
bell+0.005
Neg logit contributions
 glad-0.001
 much-0.001
e-0.001
ow-0.001
 ost-0.001
Token into
Token ID656
Feature activation-2.33e+00

Pos logit contributions
 glad+0.020
 much+0.020
ow+0.020
e+0.019
 ost+0.019
Neg logit contributions
ware-0.030
rily-0.030
ro-0.028
ge-0.027
bell-0.027
Token a
Token ID257
Feature activation+8.20e-01

Pos logit contributions
 glad+0.022
 much+0.021
e+0.020
 ost+0.019
 many+0.018
Neg logit contributions
ware-0.076
rily-0.076
ro-0.073
bell-0.071
ge-0.071
Token you
Token ID345
Feature activation-4.57e-01

Pos logit contributions
rily+0.016
ware+0.016
ro+0.015
bell+0.014
ge+0.014
Neg logit contributions
 glad-0.011
 afraid-0.010
 much-0.010
 ost-0.010
ow-0.010
Token,
Token ID11
Feature activation+3.79e+00

Pos logit contributions
 glad+0.009
ow+0.009
 much+0.009
e+0.009
 ost+0.009
Neg logit contributions
ware-0.011
rily-0.010
ro-0.010
ge-0.009
bell-0.009
Token thank
Token ID5875
Feature activation+6.85e-01

Pos logit contributions
ware+0.093
rily+0.093
ro+0.086
ge+0.084
bell+0.084
Neg logit contributions
 glad-0.065
ow-0.063
 ost-0.063
e-0.062
 much-0.060
Token you
Token ID345
Feature activation-4.16e-01

Pos logit contributions
rily+0.017
ware+0.017
ro+0.016
bell+0.016
ge+0.015
Neg logit contributions
 glad-0.011
 afraid-0.010
 much-0.010
 ost-0.010
ow-0.010
Token!
Token ID0
Feature activation-2.06e+00

Pos logit contributions
 glad+0.008
 much+0.008
ow+0.008
e+0.008
 ost+0.008
Neg logit contributions
ware-0.010
rily-0.009
ge-0.009
ro-0.009
bell-0.008
Token Can
Token ID1680
Feature activation+1.37e+00

Pos logit contributions
 glad+0.038
 ost+0.038
ow+0.037
 afraid+0.036
 much+0.036
Neg logit contributions
rily-0.046
ware-0.046
ro-0.044
bell-0.042
driver-0.041
Token I
Token ID314
Feature activation+2.94e+00

Pos logit contributions
rily+0.037
ware+0.036
ge+0.034
ro+0.034
yt+0.032
Neg logit contributions
 glad-0.023
ow-0.021
e-0.021
 much-0.021
 ost-0.021
Token try
Token ID1949
Feature activation+1.55e+00

Pos logit contributions
rily+0.093
ware+0.089
ge+0.083
ro+0.078
driver+0.077
Neg logit contributions
e-0.044
 named-0.043
ow-0.042
 glad-0.040
 very-0.039
Token it
Token ID340
Feature activation+1.19e+00

Pos logit contributions
rily+0.046
ro+0.045
ware+0.044
ge+0.041
bell+0.041
Neg logit contributions
 glad-0.021
 much-0.021
e-0.020
ow-0.019
 very-0.019
Token on
Token ID319
Feature activation-2.54e+00

Pos logit contributions
rily+0.040
ware+0.038
ro+0.037
ge+0.037
 trigger+0.035
Neg logit contributions
 very-0.014
 much-0.013
ow-0.013
 many-0.012
e-0.012
Token?"
Token ID1701
Feature activation-6.26e-01

Pos logit contributions
e+0.030
 very+0.025
 ost+0.024
 much+0.024
 glad+0.024
Neg logit contributions
rily-0.080
ware-0.080
ro-0.079
ge-0.078
driver-0.076
Token listened
Token ID16399
Feature activation+1.34e+00

Pos logit contributions
 glad+0.008
 ost+0.007
ow+0.007
e+0.007
 much+0.007
Neg logit contributions
ware-0.008
rily-0.008
ro-0.008
bell-0.008
ge-0.007
Token to
Token ID284
Feature activation+1.27e+00

Pos logit contributions
ware+0.030
rily+0.028
ro+0.027
ge+0.026
yt+0.025
Neg logit contributions
 much-0.029
 glad-0.028
 very-0.028
 many-0.027
 afraid-0.026
Token the
Token ID262
Feature activation+1.48e-01

Pos logit contributions
ware+0.035
rily+0.035
ro+0.033
fully+0.033
ge+0.033
Neg logit contributions
ow-0.017
 ost-0.017
e-0.016
 glad-0.016
aw-0.016
Token owl
Token ID39610
Feature activation-1.44e+00

Pos logit contributions
ware+0.003
rily+0.003
ro+0.003
bell+0.003
ge+0.003
Neg logit contributions
e-0.003
ow-0.003
 glad-0.003
 ost-0.002
 much-0.002
Token's
Token ID338
Feature activation+1.00e+00

Pos logit contributions
 glad+0.033
 ost+0.032
e+0.031
 much+0.030
ow+0.030
Neg logit contributions
rily-0.027
ware-0.027
ge-0.026
ro-0.025
bell-0.024
Token warning
Token ID6509
Feature activation+1.73e+00

Pos logit contributions
ware+0.021
rily+0.021
ro+0.020
ge+0.019
bell+0.019
Neg logit contributions
 glad-0.020
e-0.020
 ost-0.019
ow-0.019
aw-0.019
Token and
Token ID290
Feature activation-4.78e-01

Pos logit contributions
ware+0.036
rily+0.035
ro+0.033
ge+0.032
bell+0.031
Neg logit contributions
 glad-0.037
 ost-0.036
ow-0.035
e-0.035
 much-0.034
Token started
Token ID2067
Feature activation-1.06e+00

Pos logit contributions
 glad+0.009
ow+0.009
 ost+0.009
e+0.009
 much+0.009
Neg logit contributions
ware-0.011
rily-0.010
ro-0.010
ge-0.010
bell-0.009
Token to
Token ID284
Feature activation+7.91e-01

Pos logit contributions
aw+0.028
 glad+0.028
opl+0.028
e+0.028
 ost+0.027
Neg logit contributions
ge-0.014
ware-0.014
rily-0.014
ro-0.013
bell-0.012
Token look
Token ID804
Feature activation-1.44e-01

Pos logit contributions
ware+0.018
rily+0.018
ro+0.016
ge+0.016
bell+0.016
Neg logit contributions
 glad-0.016
ow-0.015
 ost-0.015
e-0.015
 much-0.015
Token for
Token ID329
Feature activation-2.36e+00

Pos logit contributions
ow+0.002
e+0.002
 ost+0.002
 glad+0.002
 Thunder+0.002
Neg logit contributions
rily-0.004
ware-0.004
ge-0.003
ro-0.003
bell-0.003
Tokenuffy
Token ID15352
Feature activation-1.90e+00

Pos logit contributions
 glad+0.010
e+0.009
 ost+0.009
ow+0.009
 much+0.009
Neg logit contributions
ware-0.008
rily-0.008
ro-0.008
ge-0.008
bell-0.007
Token white
Token ID2330
Feature activation-3.05e-01

Pos logit contributions
 glad+0.037
 ost+0.036
ow+0.036
e+0.034
 much+0.034
Neg logit contributions
rily-0.043
ware-0.042
ro-0.039
ge-0.038
bell-0.037
Token clouds
Token ID15114
Feature activation-1.25e+00

Pos logit contributions
 glad+0.005
 ost+0.005
ow+0.005
e+0.005
 much+0.005
Neg logit contributions
rily-0.007
ware-0.007
ro-0.007
ge-0.007
bell-0.007
Token.
Token ID13
Feature activation-1.59e-01

Pos logit contributions
 glad+0.020
ow+0.020
 ost+0.019
 much+0.019
e+0.019
Neg logit contributions
ware-0.032
rily-0.032
ro-0.030
ge-0.029
bell-0.028
Token
Token ID220
Feature activation-9.32e-01

Pos logit contributions
 glad+0.003
ow+0.003
 ost+0.003
 much+0.003
e+0.003
Neg logit contributions
ware-0.004
rily-0.004
ro-0.004
ge-0.004
bell-0.004
Token\n
Token ID198
Feature activation+1.35e+00

Pos logit contributions
 glad+0.014
 ost+0.014
ow+0.014
e+0.013
 much+0.013
Neg logit contributions
ware-0.025
rily-0.025
ro-0.023
ge-0.023
bell-0.022
Token\n
Token ID198
Feature activation+1.30e+00

Pos logit contributions
ware+0.041
rily+0.040
ro+0.037
ge+0.037
bell+0.037
Neg logit contributions
 glad-0.018
 much-0.017
e-0.016
ow-0.015
 ost-0.015
Token"
Token ID1
Feature activation+4.40e+00

Pos logit contributions
ware+0.028
rily+0.027
ro+0.024
ge+0.024
bell+0.024
Neg logit contributions
 glad-0.028
 much-0.027
e-0.027
ow-0.026
 ost-0.026
TokenWhat
Token ID2061
Feature activation+1.26e+00

Pos logit contributions
ware+0.110
rily+0.107
ge+0.101
driver+0.097
ro+0.096
Neg logit contributions
 glad-0.079
e-0.078
 ost-0.077
 much-0.076
 Lightning-0.071
Token a
Token ID257
Feature activation+3.08e+00

Pos logit contributions
ware+0.031
rily+0.029
bell+0.028
ro+0.028
 markers+0.027
Neg logit contributions
 glad-0.020
 ost-0.019
ow-0.019
opl-0.018
ang-0.018
Token beautiful
Token ID4950
Feature activation+3.03e+00

Pos logit contributions
ware+0.073
rily+0.072
bell+0.071
ro+0.068
yt+0.065
Neg logit contributions
ow-0.054
e-0.050
 glad-0.049
 ost-0.048
 afraid-0.045
Token the
Token ID262
Feature activation-3.09e+00

Pos logit contributions
 glad+0.017
 much+0.016
 ost+0.015
e+0.015
 many+0.013
Neg logit contributions
ware-0.102
rily-0.102
ro-0.097
ge-0.095
bell-0.095
Token trees
Token ID7150
Feature activation-2.25e+00

Pos logit contributions
ow+0.066
 glad+0.062
e+0.060
 ost+0.058
 afraid+0.057
Neg logit contributions
ware-0.068
ro-0.065
bell-0.062
rily-0.061
pron-0.059
Token,
Token ID11
Feature activation+3.45e+00

Pos logit contributions
 glad+0.043
ow+0.041
 ost+0.041
e+0.040
 much+0.040
Neg logit contributions
ware-0.052
rily-0.051
ro-0.048
ge-0.047
bell-0.046
Token low
Token ID1877
Feature activation+4.36e-02

Pos logit contributions
ware+0.070
rily+0.069
ro+0.063
ge+0.062
bell+0.061
Neg logit contributions
 glad-0.077
 ost-0.073
ow-0.073
e-0.072
 much-0.071
Token on
Token ID319
Feature activation-2.75e+00

Pos logit contributions
rily+0.001
ware+0.001
bell+0.001
ge+0.001
pron+0.001
Neg logit contributions
 ost-0.001
ow-0.001
 glad-0.001
 much-0.001
 very-0.001
Token the
Token ID262
Feature activation-8.18e-01

Pos logit contributions
 glad+0.024
 ost+0.022
ow+0.022
e+0.022
 much+0.021
Neg logit contributions
ware-0.092
rily-0.091
ro-0.087
ge-0.086
bell-0.084
Token ground
Token ID2323
Feature activation-1.74e+00

Pos logit contributions
 glad+0.014
ow+0.014
e+0.013
 much+0.013
 ost+0.013
Neg logit contributions
ware-0.021
rily-0.020
ro-0.019
bell-0.018
ge-0.018
Token and
Token ID290
Feature activation-5.44e-01

Pos logit contributions
 glad+0.032
 ost+0.030
ow+0.030
e+0.030
 much+0.029
Neg logit contributions
ware-0.042
rily-0.041
ro-0.039
ge-0.038
bell-0.037
Token in
Token ID287
Feature activation-3.32e+00

Pos logit contributions
 glad+0.010
ow+0.010
 ost+0.010
e+0.009
 much+0.009
Neg logit contributions
ware-0.013
rily-0.013
ro-0.012
ge-0.012
bell-0.011
Token all
Token ID477
Feature activation-3.82e+00

Pos logit contributions
 glad+0.037
ow+0.035
 ost+0.034
e+0.033
 much+0.032
Neg logit contributions
ware-0.103
rily-0.102
ro-0.097
ge-0.096
bell-0.094
Token the
Token ID262
Feature activation-2.47e+00

Pos logit contributions
ow+0.056
 ost+0.054
 glad+0.052
e+0.051
 Lightning+0.049
Neg logit contributions
rily-0.100
ware-0.098
ge-0.097
bell-0.091
ro-0.091
Token it
Token ID340
Feature activation-1.57e-01

Pos logit contributions
ware+0.037
rily+0.036
ro+0.034
ge+0.033
bell+0.033
Neg logit contributions
 glad-0.024
ow-0.023
e-0.023
 ost-0.022
 much-0.022
Token and
Token ID290
Feature activation-6.82e-02

Pos logit contributions
 glad+0.003
ow+0.003
 much+0.003
e+0.003
 ost+0.003
Neg logit contributions
ware-0.004
rily-0.004
ro-0.003
ge-0.003
bell-0.003
Token sang
Token ID25889
Feature activation+5.97e-01

Pos logit contributions
 glad+0.001
ow+0.001
 ost+0.001
e+0.001
 much+0.001
Neg logit contributions
ware-0.002
rily-0.002
ro-0.001
ge-0.001
bell-0.001
Token songs
Token ID7259
Feature activation+1.44e+00

Pos logit contributions
ware+0.018
rily+0.018
ge+0.018
bell+0.017
ro+0.017
Neg logit contributions
 much-0.008
 glad-0.007
e-0.007
 very-0.007
 many-0.007
Token to
Token ID284
Feature activation+1.82e+00

Pos logit contributions
ware+0.040
rily+0.039
ge+0.037
ro+0.037
bell+0.036
Neg logit contributions
 much-0.021
ow-0.020
 very-0.020
 glad-0.020
e-0.020
Token it
Token ID340
Feature activation-7.37e-01

Pos logit contributions
ware+0.048
rily+0.047
ro+0.044
ge+0.043
bell+0.043
Neg logit contributions
 glad-0.030
e-0.029
 much-0.028
ow-0.028
 ost-0.028
Token.
Token ID13
Feature activation-5.42e-01

Pos logit contributions
 glad+0.012
 much+0.012
ow+0.012
e+0.012
 ost+0.011
Neg logit contributions
ware-0.019
rily-0.019
ge-0.017
ro-0.017
bell-0.017
Token\n
Token ID198
Feature activation+3.91e-01

Pos logit contributions
 glad+0.008
 ost+0.008
ow+0.008
e+0.008
 much+0.008
Neg logit contributions
ware-0.014
rily-0.014
ro-0.013
ge-0.013
bell-0.013
Token\n
Token ID198
Feature activation+1.05e+00

Pos logit contributions
ware+0.010
rily+0.010
ro+0.010
ge+0.010
bell+0.009
Neg logit contributions
 glad-0.006
 ost-0.006
ow-0.006
e-0.006
 much-0.005
TokenLittle
Token ID22253
Feature activation-1.13e+00

Pos logit contributions
ware+0.023
rily+0.023
ro+0.021
ge+0.020
bell+0.020
Neg logit contributions
 glad-0.022
e-0.021
 much-0.021
ow-0.021
 ost-0.021
Token by
Token ID416
Feature activation-1.04e+00

Pos logit contributions
ow+0.017
e+0.017
 glad+0.016
 Lightning+0.016
 Thunder+0.016
Neg logit contributions
rily-0.032
ware-0.032
ge-0.029
ro-0.028
bell-0.027
Token their
Token ID511
Feature activation+1.32e+00

Pos logit contributions
ware+0.019
rily+0.019
ro+0.018
ge+0.018
bell+0.017
Neg logit contributions
 glad-0.012
ow-0.012
 ost-0.012
e-0.012
 much-0.012
Token pictures
Token ID5986
Feature activation+1.64e-01

Pos logit contributions
rily+0.032
ware+0.032
ro+0.030
ge+0.030
bell+0.029
Neg logit contributions
 glad-0.023
 ost-0.022
ow-0.022
e-0.022
 much-0.021
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
rily+0.005
ware+0.004
ge+0.004
ro+0.004
yt+0.004
Neg logit contributions
 much-0.002
 glad-0.002
ow-0.002
 ost-0.002
e-0.002
Token She
Token ID1375
Feature activation+9.00e-01

Pos logit contributions
ow+0.017
 glad+0.016
 ost+0.016
e+0.016
 afraid+0.015
Neg logit contributions
rily-0.025
ware-0.025
ro-0.024
bell-0.023
ge-0.022
Token gave
Token ID2921
Feature activation+3.49e-01

Pos logit contributions
rily+0.025
ware+0.024
ro+0.023
ge+0.023
bell+0.022
Neg logit contributions
 glad-0.014
ow-0.013
 ost-0.013
e-0.013
 much-0.012
Token away
Token ID1497
Feature activation-4.05e-01

Pos logit contributions
ware+0.006
rily+0.006
ro+0.006
ge+0.006
bell+0.006
Neg logit contributions
 glad-0.008
ow-0.008
 ost-0.008
e-0.008
 much-0.008
Token their
Token ID511
Feature activation+2.08e+00

Pos logit contributions
 glad+0.007
e+0.006
ow+0.006
 afraid+0.006
opl+0.006
Neg logit contributions
ware-0.010
rily-0.010
ro-0.009
bell-0.009
driver-0.009
Token dog
Token ID3290
Feature activation-1.33e+00

Pos logit contributions
ware+0.049
rily+0.048
ro+0.045
ge+0.044
bell+0.043
Neg logit contributions
 glad-0.039
ow-0.037
 ost-0.037
e-0.037
 much-0.036
Token to
Token ID284
Feature activation+1.32e+00

Pos logit contributions
 glad+0.028
 ost+0.026
e+0.025
 much+0.025
ow+0.025
Neg logit contributions
ware-0.027
ro-0.026
rily-0.026
driver-0.024
bell-0.024
Token a
Token ID257
Feature activation+3.09e+00

Pos logit contributions
ware+0.035
rily+0.035
ro+0.033
ge+0.032
bell+0.032
Neg logit contributions
 glad-0.020
ow-0.019
 ost-0.019
e-0.019
 much-0.019
Token stranger
Token ID16195
Feature activation+1.96e-01

Pos logit contributions
rily+0.081
ware+0.080
ge+0.072
ro+0.072
fully+0.070
Neg logit contributions
 glad-0.054
 much-0.052
 ost-0.051
 very-0.048
e-0.048
Token friend
Token ID1545
Feature activation+2.40e-01

Pos logit contributions
rily+0.008
ware+0.008
ro+0.008
ge+0.008
driver+0.007
Neg logit contributions
 glad-0.006
e-0.005
ow-0.005
 much-0.005
 ost-0.005
Token's
Token ID338
Feature activation+9.25e-02

Pos logit contributions
ware+0.007
rily+0.007
ro+0.007
ge+0.006
bell+0.006
Neg logit contributions
 glad-0.003
ow-0.003
e-0.003
 ost-0.003
 much-0.003
Token birthday
Token ID10955
Feature activation+2.58e+00

Pos logit contributions
rily+0.002
ware+0.002
ge+0.002
ro+0.002
driver+0.002
Neg logit contributions
 glad-0.002
e-0.001
ow-0.001
 much-0.001
 ost-0.001
Token party
Token ID2151
Feature activation+2.48e+00

Pos logit contributions
rily+0.071
ge+0.069
ware+0.068
ro+0.064
driver+0.063
Neg logit contributions
 very-0.040
 much-0.039
 named-0.039
 glad-0.037
ow-0.037
Token.
Token ID13
Feature activation+3.26e-03

Pos logit contributions
rily+0.074
ware+0.073
ge+0.071
ro+0.067
driver+0.065
Neg logit contributions
ow-0.031
 named-0.031
 much-0.030
 very-0.030
e-0.030
Token Lucy
Token ID22162
Feature activation-1.07e+00

Pos logit contributions
ware+0.000
rily+0.000
ro+0.000
ge+0.000
bell+0.000
Neg logit contributions
 glad-0.000
ow-0.000
 ost-0.000
e-0.000
 much-0.000
Token was
Token ID373
Feature activation-1.69e+00

Pos logit contributions
 glad+0.017
ow+0.017
e+0.017
 ost+0.016
 much+0.016
Neg logit contributions
ware-0.028
rily-0.028
ro-0.026
ge-0.026
driver-0.025
Token so
Token ID523
Feature activation-4.93e+00

Pos logit contributions
 ost+0.026
ow+0.024
e+0.023
aw+0.023
opl+0.023
Neg logit contributions
rily-0.044
ware-0.041
bell-0.040
ro-0.039
ge-0.038
Token happy
Token ID3772
Feature activation+8.04e-01

Pos logit contributions
 ost+0.062
ow+0.061
 glad+0.060
e+0.059
 much+0.054
Neg logit contributions
rily-0.143
ware-0.142
ro-0.134
bell-0.132
ge-0.131
Token and
Token ID290
Feature activation-2.07e+00

Pos logit contributions
rily+0.020
ware+0.020
ro+0.018
ge+0.018
bell+0.018
Neg logit contributions
 glad-0.015
ow-0.014
 much-0.014
 ost-0.014
e-0.013
Token couldn
Token ID3521
Feature activation+1.75e-02

Pos logit contributions
 glad+0.038
ow+0.036
 ost+0.036
 much+0.036
e+0.036
Neg logit contributions
ware-0.050
rily-0.049
ro-0.046
ge-0.045
bell-0.044
Token too
Token ID1165
Feature activation-3.96e+00

Pos logit contributions
aw+0.012
 ost+0.011
ow+0.011
opl+0.011
e+0.011
Neg logit contributions
rily-0.021
ware-0.018
clips-0.017
pron-0.017
fully-0.017
Token,"
Token ID553
Feature activation+1.55e+00

Pos logit contributions
e+0.064
ow+0.063
 ost+0.061
 glad+0.057
aw+0.056
Neg logit contributions
rily-0.100
ge-0.094
ware-0.093
yt-0.092
bell-0.091
Token said
Token ID531
Feature activation+2.08e+00

Pos logit contributions
rily+0.035
ware+0.034
ro+0.031
bell+0.030
ge+0.030
Neg logit contributions
 glad-0.030
ow-0.029
 ost-0.028
e-0.028
 much-0.028
Token the
Token ID262
Feature activation+1.33e+00

Pos logit contributions
rily+0.057
ware+0.057
ro+0.052
bell+0.051
fully+0.051
Neg logit contributions
 glad-0.030
 ost-0.027
adow-0.027
 much-0.027
opl-0.027
Token family
Token ID1641
Feature activation+5.71e-01

Pos logit contributions
rily+0.030
ware+0.030
ro+0.028
ge+0.028
bell+0.026
Neg logit contributions
 glad-0.026
 ost-0.025
ow-0.025
 much-0.025
e-0.024
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
rily+0.013
ware+0.013
ro+0.012
ge+0.012
bell+0.012
Neg logit contributions
 glad-0.011
 ost-0.011
e-0.010
ow-0.010
 much-0.010
Token "
Token ID366
Feature activation+4.84e+00

Pos logit contributions
ware+0.003
rily+0.003
ge+0.003
ro+0.003
bell+0.003
Neg logit contributions
 glad-0.002
ow-0.002
 much-0.002
 ost-0.002
e-0.002
TokenThank
Token ID10449
Feature activation+1.90e+00

Pos logit contributions
ware+0.130
rily+0.130
yt+0.120
ge+0.119
bell+0.117
Neg logit contributions
e-0.079
 glad-0.078
 much-0.074
 ost-0.066
ow-0.064
Token you
Token ID345
Feature activation+3.69e-01

Pos logit contributions
ware+0.046
rily+0.045
fully+0.042
ro+0.042
ge+0.040
Neg logit contributions
 glad-0.032
 ost-0.030
 afraid-0.030
ow-0.030
 much-0.028
Token!"
Token ID2474
Feature activation+5.92e-01

Pos logit contributions
rily+0.009
ware+0.009
ro+0.008
ge+0.008
bell+0.008
Neg logit contributions
 glad-0.006
 ost-0.006
ow-0.006
e-0.006
 afraid-0.006
Token<|endoftext|>
Token ID50256
Feature activation-9.68e-01

Pos logit contributions
rily+0.015
ware+0.015
ro+0.014
ge+0.014
bell+0.014
Neg logit contributions
 glad-0.009
 ost-0.009
ow-0.009
e-0.009
 much-0.009
Token and
Token ID290
Feature activation-1.49e+00

Pos logit contributions
ware+0.026
rily+0.025
ge+0.023
ro+0.023
bell+0.023
Neg logit contributions
 much-0.019
 glad-0.019
ow-0.019
e-0.019
 very-0.018
Token choking
Token ID39302
Feature activation+2.55e+00

Pos logit contributions
 glad+0.024
ow+0.023
 ost+0.023
e+0.023
 much+0.022
Neg logit contributions
ware-0.038
rily-0.038
ro-0.036
ge-0.035
bell-0.034
Token.
Token ID13
Feature activation-1.68e+00

Pos logit contributions
ware+0.067
rily+0.065
ro+0.061
ge+0.061
bell+0.060
Neg logit contributions
ow-0.040
 glad-0.040
 much-0.040
e-0.039
 ost-0.037
Token From
Token ID3574
Feature activation-2.56e+00

Pos logit contributions
 glad+0.027
 ost+0.026
ow+0.026
e+0.026
 much+0.025
Neg logit contributions
ware-0.043
rily-0.043
ro-0.040
ge-0.040
bell-0.039
Token then
Token ID788
Feature activation-7.40e-01

Pos logit contributions
 glad+0.042
ow+0.041
 ost+0.041
e+0.041
 much+0.039
Neg logit contributions
ware-0.064
rily-0.064
ro-0.059
ge-0.058
bell-0.057
Token on
Token ID319
Feature activation-3.92e+00

Pos logit contributions
 ost+0.013
 glad+0.012
adow+0.012
ow+0.012
 afraid+0.012
Neg logit contributions
driver-0.016
ware-0.016
rily-0.016
bell-0.016
ge-0.016
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
 glad+0.060
ow+0.057
 ost+0.056
e+0.056
 much+0.055
Neg logit contributions
ware-0.106
rily-0.105
ro-0.099
ge-0.097
bell-0.095
Token John
Token ID1757
Feature activation-1.78e+00

Pos logit contributions
ware+0.022
rily+0.022
ge+0.021
ro+0.020
bell+0.020
Neg logit contributions
ow-0.020
 glad-0.020
 ost-0.020
e-0.020
 afraid-0.019
Token decided
Token ID3066
Feature activation+1.96e+00

Pos logit contributions
 glad+0.030
 ost+0.030
ow+0.029
 much+0.028
e+0.027
Neg logit contributions
ware-0.043
rily-0.043
ro-0.041
ge-0.040
bell-0.039
Token to
Token ID284
Feature activation+1.74e+00

Pos logit contributions
ware+0.043
rily+0.043
ro+0.039
ge+0.037
bell+0.036
Neg logit contributions
 glad-0.043
 much-0.041
ow-0.039
 ost-0.039
 very-0.039
Token be
Token ID307
Feature activation-1.79e+00

Pos logit contributions
ware+0.041
rily+0.040
ge+0.037
ro+0.036
bell+0.035
Neg logit contributions
e-0.033
ow-0.033
 glad-0.033
 much-0.031
 ost-0.031
Token they
Token ID484
Feature activation-6.60e-01

Pos logit contributions
ow+0.013
 glad+0.012
 ost+0.012
e+0.012
 much+0.011
Neg logit contributions
ware-0.024
rily-0.024
ro-0.023
bell-0.022
ge-0.022
Token were
Token ID547
Feature activation-3.05e+00

Pos logit contributions
 glad+0.012
ow+0.012
 ost+0.012
e+0.011
 much+0.011
Neg logit contributions
ware-0.016
rily-0.015
ro-0.014
bell-0.014
ge-0.014
Token happy
Token ID3772
Feature activation-1.72e-01

Pos logit contributions
ow+0.050
opl+0.048
aw+0.046
crow+0.045
 Lightning+0.044
Neg logit contributions
rily-0.073
ware-0.063
 stuffed-0.061
 groom-0.061
clips-0.060
Token.
Token ID13
Feature activation-2.14e+00

Pos logit contributions
 glad+0.003
ow+0.003
 ost+0.003
e+0.003
 much+0.003
Neg logit contributions
ware-0.004
rily-0.004
ro-0.004
ge-0.004
bell-0.004
Token<|endoftext|>
Token ID50256
Feature activation-2.07e+00

Pos logit contributions
ow+0.039
e+0.038
 ost+0.038
 glad+0.037
adow+0.034
Neg logit contributions
ware-0.049
ro-0.049
rily-0.048
bell-0.046
driver-0.043
TokenS
Token ID50
Feature activation-3.75e+00

Pos logit contributions
 glad+0.027
 ost+0.025
ow+0.024
 afraid+0.022
e+0.022
Neg logit contributions
ware-0.061
rily-0.059
ro-0.058
ge-0.055
bell-0.054
Tokenara
Token ID3301
Feature activation-3.35e+00

Pos logit contributions
 glad+0.074
ow+0.074
e+0.072
 ost+0.072
 much+0.070
Neg logit contributions
rily-0.085
ware-0.085
ro-0.079
ge-0.076
fully-0.074
Token loved
Token ID6151
Feature activation+9.37e-01

Pos logit contributions
 glad+0.070
 ost+0.069
ow+0.067
 much+0.066
 Lightning+0.064
Neg logit contributions
ware-0.071
rily-0.068
ro-0.067
ge-0.062
bell-0.061
Token to
Token ID284
Feature activation+9.08e-01

Pos logit contributions
ware+0.018
rily+0.018
ro+0.017
ge+0.016
bell+0.016
Neg logit contributions
 glad-0.021
ow-0.021
 ost-0.020
e-0.020
 much-0.020
Token visit
Token ID3187
Feature activation+5.34e-02

Pos logit contributions
rily+0.022
ware+0.021
ge+0.020
ro+0.019
driver+0.018
Neg logit contributions
 glad-0.017
e-0.017
ow-0.017
 much-0.016
 ost-0.016
Token the
Token ID262
Feature activation-2.13e+00

Pos logit contributions
ware+0.001
rily+0.001
bell+0.001
ro+0.001
ge+0.001
Neg logit contributions
 glad-0.001
 ost-0.001
 afraid-0.001
ow-0.001
 much-0.001
Token friends
Token ID2460
Feature activation-6.02e-01

Pos logit contributions
ware+0.013
rily+0.013
ro+0.012
ge+0.012
bell+0.012
Neg logit contributions
 glad-0.006
 ost-0.005
ow-0.005
e-0.005
 much-0.005
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
 glad+0.009
ow+0.009
e+0.009
 much+0.009
 ost+0.009
Neg logit contributions
rily-0.016
ware-0.016
ro-0.015
ge-0.015
bell-0.015
Token One
Token ID1881
Feature activation+1.45e-01

Pos logit contributions
rily+0.004
ware+0.004
ge+0.004
driver+0.004
ro+0.003
Neg logit contributions
 much-0.002
 glad-0.001
 And-0.001
ow-0.001
 many-0.001
Token day
Token ID1110
Feature activation-1.14e-02

Pos logit contributions
ro+0.003
bell+0.003
fully+0.003
ly+0.003
pron+0.003
Neg logit contributions
cks-0.003
 ost-0.003
icks-0.003
 sorry-0.003
 terrified-0.003
Token,
Token ID11
Feature activation+1.13e+00

Pos logit contributions
 ost+0.000
 glad+0.000
opl+0.000
ow+0.000
e+0.000
Neg logit contributions
ware-0.000
ro-0.000
rily-0.000
ge-0.000
fully-0.000
Token while
Token ID981
Feature activation-4.58e+00

Pos logit contributions
rily+0.033
ware+0.032
ge+0.031
ro+0.030
bell+0.029
Neg logit contributions
 glad-0.016
 much-0.016
 very-0.015
 many-0.015
 named-0.014
Token they
Token ID484
Feature activation-8.64e-01

Pos logit contributions
ow+0.087
adow+0.082
 ost+0.081
aw+0.081
e+0.079
Neg logit contributions
ware-0.103
bell-0.094
ge-0.093
rily-0.091
ro-0.090
Token were
Token ID547
Feature activation-2.35e+00

Pos logit contributions
 glad+0.013
ow+0.013
e+0.013
 much+0.013
 ost+0.013
Neg logit contributions
ware-0.023
rily-0.023
ro-0.021
ge-0.021
bell-0.021
Token playing
Token ID2712
Feature activation+1.39e+00

Pos logit contributions
elope+0.036
opl+0.035
e+0.032
 ost+0.031
 Fire+0.031
Neg logit contributions
ware-0.056
ge-0.054
ro-0.053
rily-0.053
driver-0.053
Token,
Token ID11
Feature activation+6.72e-01

Pos logit contributions
ware+0.033
rily+0.031
ro+0.031
fully+0.029
driver+0.029
Neg logit contributions
 glad-0.025
 ost-0.025
ow-0.023
 much-0.022
 afraid-0.022
Token they
Token ID484
Feature activation-6.10e-01

Pos logit contributions
ware+0.013
rily+0.013
ro+0.012
bell+0.011
fully+0.011
Neg logit contributions
ow-0.015
 glad-0.015
opl-0.015
 ost-0.015
e-0.015
Token day
Token ID1110
Feature activation+4.52e-01

Pos logit contributions
cks+0.014
opl+0.013
icks+0.013
 elbows+0.012
 ost+0.012
Neg logit contributions
bell-0.012
ro-0.012
ory-0.012
ly-0.011
ware-0.011
Token he
Token ID339
Feature activation+6.05e-01

Pos logit contributions
ware+0.011
rily+0.011
ro+0.010
ge+0.010
bell+0.010
Neg logit contributions
 ost-0.008
 glad-0.008
ow-0.008
e-0.007
 much-0.007
Token decided
Token ID3066
Feature activation+2.30e+00

Pos logit contributions
rily+0.015
ware+0.015
ro+0.014
ge+0.014
driver+0.013
Neg logit contributions
 glad-0.011
e-0.010
ow-0.010
 much-0.010
 ost-0.010
Token to
Token ID284
Feature activation+1.80e+00

Pos logit contributions
rily+0.052
ware+0.051
ro+0.046
ge+0.044
bell+0.043
Neg logit contributions
 glad-0.049
 much-0.048
 ost-0.046
 afraid-0.046
ow-0.046
Token go
Token ID467
Feature activation+7.05e-01

Pos logit contributions
rily+0.040
ware+0.040
ro+0.036
ge+0.036
bell+0.035
Neg logit contributions
ow-0.035
e-0.035
 glad-0.035
 ost-0.034
 much-0.034
Token for
Token ID329
Feature activation-3.93e+00

Pos logit contributions
rily+0.018
ware+0.018
ro+0.017
ge+0.017
bell+0.016
Neg logit contributions
 glad-0.012
ow-0.011
 much-0.011
 ost-0.011
e-0.011
Token a
Token ID257
Feature activation+2.13e+00

Pos logit contributions
 glad+0.049
 ost+0.048
ow+0.047
aw+0.043
adow+0.043
Neg logit contributions
ware-0.112
rily-0.104
ro-0.103
ge-0.103
driver-0.101
Token walk
Token ID2513
Feature activation+1.05e+00

Pos logit contributions
rily+0.046
ware+0.044
ro+0.040
ge+0.038
driver+0.036
Neg logit contributions
 much-0.047
 glad-0.047
ow-0.045
 ost-0.045
 very-0.043
Token.
Token ID13
Feature activation-5.81e-02

Pos logit contributions
ware+0.028
rily+0.028
ro+0.025
ge+0.025
bell+0.025
Neg logit contributions
 glad-0.016
ow-0.016
e-0.016
 ost-0.016
 much-0.016
Token As
Token ID1081
Feature activation-3.51e+00

Pos logit contributions
 glad+0.001
 much+0.001
 Lightning+0.001
ow+0.001
 Thunder+0.001
Neg logit contributions
ware-0.002
rily-0.002
ge-0.002
yt-0.001
ro-0.001
Token he
Token ID339
Feature activation+6.69e-01

Pos logit contributions
 glad+0.067
 much+0.064
 ost+0.062
e+0.062
ow+0.062
Neg logit contributions
ware-0.084
rily-0.084
ro-0.077
ge-0.075
bell-0.074
Token new
Token ID649
Feature activation+2.84e+00

Pos logit contributions
rily+0.094
ge+0.090
ware+0.090
ro+0.084
bell+0.083
Neg logit contributions
 glad-0.071
 much-0.070
 very-0.069
 ost-0.068
 mist-0.068
Token doll
Token ID3654
Feature activation-8.97e-01

Pos logit contributions
rily+0.069
ware+0.067
ge+0.065
ro+0.063
bell+0.061
Neg logit contributions
 glad-0.050
 much-0.049
 ost-0.048
 very-0.048
 mist-0.046
Token.
Token ID13
Feature activation-7.15e-01

Pos logit contributions
e+0.015
 glad+0.015
ow+0.015
 ost+0.014
 much+0.014
Neg logit contributions
ware-0.023
rily-0.023
ro-0.021
ge-0.021
bell-0.020
Token She
Token ID1375
Feature activation+9.56e-01

Pos logit contributions
 glad+0.012
 ost+0.012
e+0.012
ow+0.012
 afraid+0.012
Neg logit contributions
ware-0.017
rily-0.017
ro-0.017
bell-0.017
driver-0.015
Token was
Token ID373
Feature activation-1.77e+00

Pos logit contributions
rily+0.025
ware+0.024
ro+0.023
ge+0.023
driver+0.022
Neg logit contributions
 glad-0.016
 lived-0.015
 very-0.015
 named-0.015
e-0.015
Token so
Token ID523
Feature activation-5.05e+00

Pos logit contributions
 ost+0.027
ow+0.026
e+0.025
 Lightning+0.025
 glad+0.025
Neg logit contributions
rily-0.046
ware-0.045
bell-0.042
ro-0.041
ge-0.041
Token happy
Token ID3772
Feature activation+2.40e-01

Pos logit contributions
 glad+0.058
ow+0.057
 ost+0.057
e+0.055
 much+0.052
Neg logit contributions
rily-0.153
ware-0.153
ro-0.144
ge-0.142
bell-0.141
Token!
Token ID0
Feature activation-1.51e+00

Pos logit contributions
ware+0.005
rily+0.005
ro+0.005
ge+0.005
bell+0.005
Neg logit contributions
 glad-0.005
ow-0.005
 ost-0.005
 much-0.005
e-0.005
Token She
Token ID1375
Feature activation+8.96e-01

Pos logit contributions
 ost+0.029
e+0.029
ow+0.028
 glad+0.028
 afraid+0.027
Neg logit contributions
ware-0.033
rily-0.032
ro-0.032
bell-0.031
oph-0.028
Token thanked
Token ID26280
Feature activation+2.88e+00

Pos logit contributions
ware+0.023
rily+0.023
ge+0.022
ro+0.021
driver+0.021
Neg logit contributions
 glad-0.015
 lived-0.014
 named-0.014
e-0.014
 much-0.014
Token her
Token ID607
Feature activation+2.36e+00

Pos logit contributions
ware+0.090
rily+0.089
ro+0.083
bell+0.083
ge+0.083
Neg logit contributions
 glad-0.031
ow-0.029
 ost-0.029
e-0.028
 much-0.028