TOP ACTIVATIONS
MAX = 4.979

. They were called ost ric hes . Lily and Ben
ostr ich pen . Ost ric hes are big birds with
but be careful . Ost ric hes can be gr umpy
ostr ich pen . Ost ric hes are big birds with
and run to the ost ric hes . "
and looked at the ost ric hes . They saw one
. She hugs the re ind eer . "
harder to push the wheel bar row . But her backyard
and he gave the re ind eer a big hug .
cried . " The ost ric hes are mean ! They
mom and to the ost ric hes . They learned their
and gave her the t eddy bear . "
The elf thanked the re ind eer for helping him and
smiled and gave her t eddy bear a big hug .
should not tease the ost ric hes . They are not
, " This is ind igo , and this is violet
. He told his re ind eer to stay put and
shut and hugged her t eddy bear tightly .
and he brought his t eddy bear with him .
smiled and hugged her t eddy bear . <|endoftext|> Once upon

MIN ACTIVATIONS
MIN = -3.813

and finally Mark felt something tug ging at the line .
bled up all the ch ocol ates , and she was
save his pup from the injuries and the pup died soon
with a picnic in the shelter . <|endoftext|> Once upon a
saw all the tasty ch ocol ates inside . She couldn
he was known as Prince Mitt ens and he lived happily
the information and continued his stroll . He wasn 't so
forth , while the small bush would w iggle and j
was the same color as Mitt ens ! Lily was so
dream called a " night mare ". She woke up crying
and put them under the shelter . Then they sat down
k ang aroo in the bush ! <|endoftext|> Once upon a
the house in their new boots . They laughed and played
took out his hand ker chief and carefully picked up the
lady was known as the blanket covering silly lady . <|endoftext|>
, Tim my felt something tug on his line . He
other while they â re facing something that â
know you value it a lot ." They looked everywhere and
they shared together on those walks . Spot always
to pick them off the bush . She popped

INTERVAL 2.781 - 4.979
CONTAINS 0.015%

know what the dog was doing there but she was happy
the rock . He kept doing it until the sun went
she wondered what it was doing . She watched it fly
a big jungle and was always looking out for his animal
Sam got angry and said : " No , Lily .

INTERVAL 0.583 - 2.781
CONTAINS 14.220%

has a name . The name is Hal ley . Tom
<|endoftext|> One day , a boy named Tim went to the
looked down and saw a brightly - colored bird , who
decided to paint a big picture together , but it was
happen . At last , a miracle happened

INTERVAL -1.615 - 0.583
CONTAINS 85.381%

They both laughed and enjoyed their day at the park .
Finally , he found the squirrel and told him about the
is blue and has stars on it . He likes it
. It â s so much fun ! â
hit the bad guys ." She showed Max how she could

INTERVAL -3.813 - -1.615
CONTAINS 0.384%

whipped cream . S ara and Tom want to
'm proud of you and Spot . Now , let 's
. She hugs him and wipes his face with a towel
They take their toys and books and TV and fun .
, sometimes they kicked too hard or too soft , sometimes
Token.
Token ID13
Feature activation+8.19e-04

Pos logit contributions
 himself+0.021
 seek+0.020
lot+0.020
dy+0.019
die+0.019
Neg logit contributions
 course-0.020
ocol-0.019
isher-0.018
ement-0.018
-0.017
Token They
Token ID1119
Feature activation-6.19e-01

Pos logit contributions
 seek+0.000
 himself+0.000
lot+0.000
 active+0.000
dy+0.000
Neg logit contributions
 course-0.000
ocol-0.000
-0.000
ement-0.000
isher-0.000
Token were
Token ID547
Feature activation+2.46e-01

Pos logit contributions
 course+0.022
ocol+0.018
ir+0.018
isher+0.017
 ribbon+0.017
Neg logit contributions
 himself-0.019
lot-0.018
 seek-0.018
dy-0.018
die-0.017
Token called
Token ID1444
Feature activation+1.63e+00

Pos logit contributions
 seek+0.007
 himself+0.007
rated+0.007
lot+0.007
athan+0.007
Neg logit contributions
 course-0.007
ocol-0.006
ir-0.006
 grace-0.006
ement-0.006
Token ost
Token ID23619
Feature activation+1.38e+00

Pos logit contributions
 himself+0.059
 seek+0.059
lot+0.057
dy+0.056
die+0.055
Neg logit contributions
 course-0.046
ocol-0.043
isher-0.039
ement-0.038
ir-0.038
Tokenric
Token ID1173
Feature activation+4.88e+00

Pos logit contributions
 himself+0.050
 seek+0.050
lot+0.046
 active+0.046
dy+0.046
Neg logit contributions
 course-0.040
ocol-0.037
isher-0.036
ement-0.034
ir-0.033
Tokenhes
Token ID956
Feature activation-6.92e-01

Pos logit contributions
 seek+0.182
dy+0.163
die+0.161
 active+0.160
 himself+0.154
Neg logit contributions
 course-0.143
ir-0.134
ocol-0.132
ement-0.120
isher-0.111
Token.
Token ID13
Feature activation+1.22e-01

Pos logit contributions
 course+0.024
ocol+0.021
ir+0.021
isher+0.021
ement+0.020
Neg logit contributions
 seek-0.022
 himself-0.021
lot-0.021
die-0.020
dy-0.020
Token Lily
Token ID20037
Feature activation-5.71e-02

Pos logit contributions
 seek+0.004
 himself+0.004
lot+0.004
 active+0.004
dy+0.003
Neg logit contributions
 course-0.004
ocol-0.004
-0.004
ement-0.004
ir-0.004
Token and
Token ID290
Feature activation-3.07e-01

Pos logit contributions
 course+0.002
 ribbon+0.001
ement+0.001
+0.001
ir+0.001
Neg logit contributions
 himself-0.002
 afloat-0.002
 seek-0.002
 late-0.002
lot-0.002
Token Ben
Token ID3932
Feature activation-4.92e-01

Pos logit contributions
 course+0.011
ocol+0.010
isher+0.009
+0.009
ir+0.009
Neg logit contributions
 seek-0.009
lot-0.009
 himself-0.009
 active-0.009
dy-0.009
Token ostr
Token ID42387
Feature activation+1.47e+00

Pos logit contributions
 himself+0.009
lot+0.009
 seek+0.009
 active+0.009
dy+0.009
Neg logit contributions
 course-0.010
ocol-0.008
isher-0.008
-0.007
 ribbon-0.007
Tokenich
Token ID488
Feature activation-1.31e-01

Pos logit contributions
 himself+0.036
lot+0.034
 seek+0.034
dy+0.033
die+0.030
Neg logit contributions
 course-0.059
ocol-0.057
isher-0.056
ement-0.052
-0.051
Token pen
Token ID3112
Feature activation-1.75e-01

Pos logit contributions
 course+0.005
ocol+0.005
isher+0.005
ement+0.005
+0.005
Neg logit contributions
 himself-0.003
 seek-0.003
lot-0.003
dy-0.003
die-0.003
Token.
Token ID13
Feature activation-1.76e-02

Pos logit contributions
 course+0.007
ocol+0.007
isher+0.006
ement+0.006
+0.006
Neg logit contributions
 himself-0.005
 seek-0.004
lot-0.004
dy-0.004
die-0.004
Token Ost
Token ID38919
Feature activation+3.93e-01

Pos logit contributions
 course+0.001
ocol+0.001
isher+0.001
ement+0.001
+0.001
Neg logit contributions
 himself-0.000
 seek-0.000
lot-0.000
dy-0.000
die-0.000
Tokenric
Token ID1173
Feature activation+4.51e+00

Pos logit contributions
 himself+0.014
 seek+0.013
lot+0.013
dy+0.013
 active+0.012
Neg logit contributions
 course-0.012
ocol-0.011
isher-0.011
ement-0.010
-0.010
Tokenhes
Token ID956
Feature activation-1.44e-01

Pos logit contributions
 seek+0.167
 active+0.150
dy+0.149
die+0.148
 himself+0.142
Neg logit contributions
 course-0.138
ir-0.123
ocol-0.121
ement-0.111
isher-0.101
Token are
Token ID389
Feature activation+3.71e-01

Pos logit contributions
 course+0.004
ocol+0.003
isher+0.003
ir+0.003
ement+0.003
Neg logit contributions
 seek-0.006
 himself-0.006
lot-0.006
dy-0.005
die-0.005
Token big
Token ID1263
Feature activation-1.84e-01

Pos logit contributions
athan+0.011
lot+0.011
rated+0.011
any+0.011
 himself+0.011
Neg logit contributions
 course-0.011
ocol-0.010
 ribbon-0.010
 rib-0.009
ir-0.009
Token birds
Token ID10087
Feature activation+6.53e-01

Pos logit contributions
 course+0.007
ocol+0.007
isher+0.006
ement+0.006
ir+0.006
Neg logit contributions
 himself-0.005
 seek-0.005
lot-0.005
dy-0.004
 active-0.004
Token with
Token ID351
Feature activation-1.28e-01

Pos logit contributions
 himself+0.021
 seek+0.020
lot+0.020
dy+0.019
die+0.019
Neg logit contributions
 course-0.022
ocol-0.020
isher-0.019
ement-0.018
ir-0.018
Token but
Token ID475
Feature activation-3.33e-01

Pos logit contributions
 course+0.023
ocol+0.022
isher+0.022
+0.021
ement+0.021
Neg logit contributions
 himself-0.029
 seek-0.027
dy-0.026
lot-0.026
die-0.025
Token be
Token ID307
Feature activation+1.43e-01

Pos logit contributions
 course+0.013
ocol+0.012
isher+0.012
ement+0.012
+0.011
Neg logit contributions
 himself-0.009
 seek-0.009
dy-0.008
lot-0.008
die-0.008
Token careful
Token ID8161
Feature activation-8.49e-02

Pos logit contributions
 seek+0.004
lot+0.004
rated+0.004
 himself+0.004
athan+0.003
Neg logit contributions
 course-0.005
ocol-0.005
isher-0.004
ir-0.004
 ribbon-0.004
Token.
Token ID13
Feature activation-2.84e-01

Pos logit contributions
 course+0.002
ocol+0.002
isher+0.002
ement+0.002
ir+0.002
Neg logit contributions
lot-0.003
 seek-0.003
die-0.003
 himself-0.003
idi-0.003
Token Ost
Token ID38919
Feature activation+4.50e-01

Pos logit contributions
 course+0.011
isher+0.011
ocol+0.010
ement+0.010
ir+0.009
Neg logit contributions
 himself-0.008
dy-0.007
 seek-0.007
lot-0.007
die-0.006
Tokenric
Token ID1173
Feature activation+4.50e+00

Pos logit contributions
 himself+0.015
 seek+0.015
lot+0.014
dy+0.014
die+0.014
Neg logit contributions
 course-0.014
ocol-0.013
isher-0.013
ement-0.012
-0.012
Tokenhes
Token ID956
Feature activation-2.57e-01

Pos logit contributions
 seek+0.156
dy+0.141
 active+0.139
die+0.139
 himself+0.136
Neg logit contributions
 course-0.147
ocol-0.133
ir-0.126
isher-0.119
ement-0.117
Token can
Token ID460
Feature activation-3.38e-01

Pos logit contributions
 course+0.007
ocol+0.006
isher+0.006
ement+0.005
ir+0.005
Neg logit contributions
 himself-0.010
 seek-0.010
lot-0.010
dy-0.009
die-0.009
Token be
Token ID307
Feature activation+3.46e-01

Pos logit contributions
 course+0.012
ocol+0.011
isher+0.011
ement+0.010
+0.010
Neg logit contributions
 himself-0.010
 seek-0.010
lot-0.010
dy-0.009
die-0.009
Token gr
Token ID1036
Feature activation+8.94e-01

Pos logit contributions
 himself+0.010
 seek+0.009
lot+0.009
athan+0.009
die+0.008
Neg logit contributions
 course-0.012
ocol-0.012
ir-0.011
isher-0.011
 ribbon-0.010
Tokenumpy
Token ID32152
Feature activation+2.07e+00

Pos logit contributions
 himself+0.021
 seek+0.021
lot+0.019
dy+0.019
die+0.018
Neg logit contributions
 course-0.037
ocol-0.036
isher-0.035
ir-0.033
ement-0.033
Token ostr
Token ID42387
Feature activation+1.46e+00

Pos logit contributions
 himself+0.007
lot+0.007
 seek+0.007
 active+0.007
dy+0.006
Neg logit contributions
 course-0.007
isher-0.006
ocol-0.006
-0.006
 ribbon-0.006
Tokenich
Token ID488
Feature activation-3.03e-01

Pos logit contributions
 himself+0.037
lot+0.035
 seek+0.035
dy+0.034
die+0.031
Neg logit contributions
 course-0.058
ocol-0.055
isher-0.054
ement-0.051
-0.050
Token pen
Token ID3112
Feature activation-1.36e-01

Pos logit contributions
 course+0.012
ocol+0.011
isher+0.011
ement+0.011
+0.010
Neg logit contributions
 himself-0.008
 seek-0.007
lot-0.007
dy-0.007
die-0.007
Token.
Token ID13
Feature activation-3.17e-02

Pos logit contributions
 course+0.005
ocol+0.005
isher+0.005
ement+0.005
+0.004
Neg logit contributions
 himself-0.004
 seek-0.004
lot-0.003
dy-0.003
die-0.003
Token Ost
Token ID38919
Feature activation+3.53e-01

Pos logit contributions
 course+0.001
ocol+0.001
isher+0.001
ement+0.001
+0.001
Neg logit contributions
 himself-0.001
 seek-0.001
lot-0.001
dy-0.001
 active-0.001
Tokenric
Token ID1173
Feature activation+4.37e+00

Pos logit contributions
 seek+0.012
 himself+0.012
lot+0.012
dy+0.012
 active+0.012
Neg logit contributions
 course-0.011
ocol-0.010
isher-0.009
ement-0.009
-0.009
Tokenhes
Token ID956
Feature activation-1.12e-01

Pos logit contributions
 seek+0.168
 active+0.152
dy+0.151
die+0.149
 afloat+0.143
Neg logit contributions
 course-0.127
ir-0.114
ocol-0.109
ement-0.103
isher-0.089
Token are
Token ID389
Feature activation+2.51e-01

Pos logit contributions
 course+0.003
ocol+0.003
isher+0.003
ir+0.002
ement+0.002
Neg logit contributions
 seek-0.004
lot-0.004
 himself-0.004
die-0.004
dy-0.004
Token big
Token ID1263
Feature activation-1.63e-01

Pos logit contributions
rated+0.007
lot+0.007
 seek+0.007
 himself+0.007
athan+0.007
Neg logit contributions
 course-0.007
ocol-0.007
 ribbon-0.007
 rib-0.006
 icing-0.006
Token birds
Token ID10087
Feature activation+6.68e-01

Pos logit contributions
 course+0.006
ocol+0.006
isher+0.006
ement+0.005
ir+0.005
Neg logit contributions
 himself-0.004
 seek-0.004
lot-0.004
dy-0.004
 active-0.004
Token with
Token ID351
Feature activation-1.55e-01

Pos logit contributions
 himself+0.020
 seek+0.020
lot+0.019
dy+0.018
die+0.018
Neg logit contributions
 course-0.023
ocol-0.022
isher-0.021
ement-0.020
-0.020
Token and
Token ID290
Feature activation-2.58e-01

Pos logit contributions
 course+0.022
ocol+0.022
isher+0.021
ement+0.020
+0.020
Neg logit contributions
 himself-0.012
 seek-0.011
lot-0.011
dy-0.010
 active-0.010
Token run
Token ID1057
Feature activation+1.81e-01

Pos logit contributions
 course+0.009
ocol+0.008
isher+0.008
ement+0.008
+0.008
Neg logit contributions
 himself-0.008
 seek-0.008
lot-0.007
dy-0.007
die-0.007
Token to
Token ID284
Feature activation-2.29e-01

Pos logit contributions
 himself+0.005
 seek+0.005
lot+0.005
dy+0.004
die+0.004
Neg logit contributions
 course-0.007
ocol-0.006
isher-0.006
ement-0.006
-0.006
Token the
Token ID262
Feature activation+3.26e-01

Pos logit contributions
 course+0.010
ocol+0.010
isher+0.010
ement+0.010
+0.009
Neg logit contributions
 himself-0.005
 seek-0.004
lot-0.004
dy-0.004
die-0.004
Token ost
Token ID23619
Feature activation+5.59e-01

Pos logit contributions
 himself+0.009
lot+0.009
 seek+0.009
dy+0.008
 active+0.008
Neg logit contributions
 course-0.012
ocol-0.011
isher-0.011
 ribbon-0.010
-0.010
Tokenric
Token ID1173
Feature activation+4.24e+00

Pos logit contributions
 himself+0.019
 seek+0.018
lot+0.018
dy+0.018
die+0.017
Neg logit contributions
 course-0.017
ocol-0.016
isher-0.016
ement-0.015
-0.014
Tokenhes
Token ID956
Feature activation-8.84e-01

Pos logit contributions
 seek+0.139
dy+0.128
 himself+0.127
die+0.126
 active+0.125
Neg logit contributions
 course-0.142
ocol-0.129
ir-0.127
ement-0.120
isher-0.119
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
 course+0.029
ocol+0.027
isher+0.026
ement+0.025
ir+0.025
Neg logit contributions
 himself-0.028
 seek-0.028
lot-0.027
dy-0.026
die-0.026
Token
Token ID198
Feature activation-1.43e-01

Pos logit contributions
 seek+0.006
 active+0.005
die+0.005
dy+0.005
 himself+0.005
Neg logit contributions
 course-0.006
-0.006
ocol-0.006
ement-0.006
isher-0.006
Token
Token ID198
Feature activation-2.83e-01

Pos logit contributions
 course+0.005
ir+0.004
+0.004
ocol+0.004
ement+0.004
Neg logit contributions
 seek-0.004
 active-0.004
lot-0.004
dy-0.004
die-0.004
Token"
Token ID1
Feature activation-1.08e+00

Pos logit contributions
 course+0.009
ocol+0.007
ir+0.007
+0.007
isher+0.007
Neg logit contributions
 himself-0.009
lot-0.009
dy-0.009
die-0.009
 active-0.008
Token and
Token ID290
Feature activation+3.57e-01

Pos logit contributions
 seek+0.011
lot+0.010
 himself+0.010
die+0.010
dy+0.010
Neg logit contributions
 course-0.014
ocol-0.014
isher-0.013
ir-0.013
ement-0.013
Token looked
Token ID3114
Feature activation+3.66e-01

Pos logit contributions
 himself+0.008
 seek+0.008
lot+0.008
dy+0.007
die+0.007
Neg logit contributions
ocol-0.015
 course-0.015
isher-0.014
ement-0.014
ir-0.013
Token at
Token ID379
Feature activation+7.35e-01

Pos logit contributions
 seek+0.012
 himself+0.012
lot+0.012
dy+0.011
die+0.011
Neg logit contributions
 course-0.011
ocol-0.011
isher-0.011
ement-0.010
ir-0.010
Token the
Token ID262
Feature activation+2.07e-01

Pos logit contributions
 himself+0.016
dy+0.014
 seek+0.013
lot+0.013
die+0.013
Neg logit contributions
 course-0.032
ocol-0.032
isher-0.030
ement-0.029
-0.029
Token ost
Token ID23619
Feature activation+5.45e-01

Pos logit contributions
 himself+0.006
 seek+0.006
lot+0.005
dy+0.005
die+0.005
Neg logit contributions
 course-0.008
ocol-0.007
isher-0.007
ement-0.007
-0.007
Tokenric
Token ID1173
Feature activation+4.17e+00

Pos logit contributions
 himself+0.018
 seek+0.018
lot+0.018
dy+0.017
die+0.017
Neg logit contributions
 course-0.017
ocol-0.016
isher-0.016
ement-0.014
-0.014
Tokenhes
Token ID956
Feature activation-8.74e-01

Pos logit contributions
 seek+0.144
dy+0.132
die+0.130
 himself+0.128
 active+0.128
Neg logit contributions
 course-0.135
ocol-0.123
ir-0.120
ement-0.115
isher-0.112
Token.
Token ID13
Feature activation+5.86e-02

Pos logit contributions
 course+0.029
ocol+0.027
isher+0.027
ement+0.026
+0.025
Neg logit contributions
 himself-0.027
 seek-0.027
lot-0.026
dy-0.026
die-0.025
Token They
Token ID1119
Feature activation-7.27e-01

Pos logit contributions
 seek+0.002
 himself+0.002
lot+0.002
 active+0.002
dy+0.002
Neg logit contributions
 course-0.002
ocol-0.002
-0.002
isher-0.002
ement-0.002
Token saw
Token ID2497
Feature activation-6.51e-01

Pos logit contributions
 course+0.028
ocol+0.026
isher+0.025
ement+0.024
+0.024
Neg logit contributions
 himself-0.020
 seek-0.019
lot-0.019
dy-0.018
die-0.017
Token one
Token ID530
Feature activation+7.93e-01

Pos logit contributions
 course+0.025
ocol+0.023
isher+0.022
ement+0.021
ir+0.021
Neg logit contributions
 seek-0.019
lot-0.018
 himself-0.017
 active-0.017
die-0.017
Token.
Token ID13
Feature activation-2.01e-02

Pos logit contributions
 course+0.045
ocol+0.044
isher+0.043
+0.039
ir+0.037
Neg logit contributions
 himself-0.022
 having-0.018
lot-0.017
dy-0.016
 late-0.016
Token She
Token ID1375
Feature activation-1.98e-01

Pos logit contributions
 course+0.001
isher+0.001
ocol+0.001
ement+0.001
ir+0.001
Neg logit contributions
 himself-0.000
 seek-0.000
lot-0.000
dy-0.000
die-0.000
Token hugs
Token ID40657
Feature activation-6.96e-01

Pos logit contributions
ocol+0.008
isher+0.008
 course+0.008
ir+0.007
+0.007
Neg logit contributions
 himself-0.005
dy-0.004
lot-0.004
die-0.004
 having-0.004
Token the
Token ID262
Feature activation-1.15e-01

Pos logit contributions
ocol+0.027
 course+0.027
isher+0.026
ement+0.025
+0.024
Neg logit contributions
 himself-0.020
dy-0.016
 seek-0.016
lot-0.016
die-0.015
Token re
Token ID302
Feature activation+1.79e+00

Pos logit contributions
 course+0.004
ocol+0.004
isher+0.003
ement+0.003
 ribbon+0.003
Neg logit contributions
 himself-0.004
 seek-0.004
lot-0.003
dy-0.003
die-0.003
Tokenind
Token ID521
Feature activation+4.03e+00

Pos logit contributions
 himself+0.061
lot+0.058
dy+0.057
 seek+0.057
die+0.055
Neg logit contributions
 course-0.057
ocol-0.051
isher-0.049
-0.049
ement-0.047
Tokeneer
Token ID28153
Feature activation+2.56e-01

Pos logit contributions
 seek+0.088
 himself+0.084
lot+0.078
dy+0.076
 active+0.074
Neg logit contributions
 course-0.171
isher-0.168
ocol-0.168
ement-0.155
ir-0.153
Token.
Token ID13
Feature activation+1.43e-01

Pos logit contributions
 seek+0.007
 himself+0.007
lot+0.007
dy+0.006
die+0.006
Neg logit contributions
 course-0.009
ocol-0.009
isher-0.009
ement-0.008
ir-0.008
Token
Token ID198
Feature activation-1.74e-01

Pos logit contributions
 himself+0.002
lot+0.002
 seek+0.002
dy+0.002
die+0.001
Neg logit contributions
 course-0.007
isher-0.007
ocol-0.007
ement-0.007
ir-0.007
Token
Token ID198
Feature activation-2.82e-01

Pos logit contributions
 course+0.008
ocol+0.007
isher+0.007
ement+0.007
+0.007
Neg logit contributions
 seek-0.003
 himself-0.003
lot-0.003
dy-0.003
 active-0.003
Token"
Token ID1
Feature activation-1.04e+00

Pos logit contributions
 course+0.011
ocol+0.011
isher+0.010
+0.010
ement+0.010
Neg logit contributions
 himself-0.007
 seek-0.007
lot-0.007
dy-0.006
 active-0.006
Token harder
Token ID7069
Feature activation+1.30e-01

Pos logit contributions
lot+0.027
dy+0.025
 himself+0.025
die+0.024
 seek+0.024
Neg logit contributions
 course-0.032
ocol-0.028
isher-0.027
 ribbon-0.026
 rib-0.026
Token to
Token ID284
Feature activation+5.32e-03

Pos logit contributions
 seek+0.005
lot+0.005
die+0.005
 active+0.004
dy+0.004
Neg logit contributions
 course-0.004
ocol-0.003
ement-0.003
ir-0.003
isher-0.003
Token push
Token ID4574
Feature activation-3.87e-01

Pos logit contributions
lot+0.000
dy+0.000
die+0.000
 seek+0.000
 himself+0.000
Neg logit contributions
 course-0.000
ocol-0.000
-0.000
isher-0.000
 ribbon-0.000
Token the
Token ID262
Feature activation+2.80e-01

Pos logit contributions
 course+0.014
ocol+0.012
isher+0.012
ement+0.011
ir+0.011
Neg logit contributions
lot-0.012
 seek-0.012
dy-0.011
die-0.011
 active-0.011
Token wheel
Token ID7825
Feature activation+8.75e-01

Pos logit contributions
 himself+0.010
lot+0.009
 seek+0.009
dy+0.009
die+0.009
Neg logit contributions
 course-0.009
ocol-0.008
isher-0.008
 ribbon-0.007
ement-0.007
Tokenbar
Token ID5657
Feature activation+4.02e+00

Pos logit contributions
 seek+0.025
 himself+0.025
lot+0.024
dy+0.024
die+0.024
Neg logit contributions
 course-0.031
isher-0.029
ocol-0.028
ement-0.027
-0.026
Tokenrow
Token ID808
Feature activation+1.02e+00

Pos logit contributions
 seek+0.130
die+0.124
lot+0.123
dy+0.119
 active+0.119
Neg logit contributions
 course-0.132
isher-0.129
ocol-0.118
ement-0.113
ir-0.112
Token.
Token ID13
Feature activation+1.13e-02

Pos logit contributions
 seek+0.031
lot+0.031
die+0.031
dy+0.030
 himself+0.029
Neg logit contributions
 course-0.034
isher-0.031
ement-0.030
ocol-0.029
-0.029
Token But
Token ID887
Feature activation+1.50e-01

Pos logit contributions
 seek+0.000
lot+0.000
 active+0.000
dy+0.000
 himself+0.000
Neg logit contributions
 course-0.000
-0.000
ocol-0.000
ement-0.000
isher-0.000
Token her
Token ID607
Feature activation+3.43e-01

Pos logit contributions
lot+0.005
die+0.005
dy+0.005
 active+0.004
 seek+0.004
Neg logit contributions
 course-0.005
ocol-0.004
isher-0.004
-0.004
ement-0.004
Token backyard
Token ID24296
Feature activation-4.11e-02

Pos logit contributions
 himself+0.010
 seek+0.010
lot+0.010
die+0.009
dy+0.009
Neg logit contributions
 course-0.012
isher-0.011
ocol-0.011
ement-0.010
-0.010
Token and
Token ID290
Feature activation+2.80e-02

Pos logit contributions
 course+0.018
isher+0.015
ocol+0.015
ement+0.014
+0.014
Neg logit contributions
 seek-0.019
die-0.019
lot-0.018
dy-0.018
 active-0.018
Token he
Token ID339
Feature activation+9.24e-01

Pos logit contributions
lot+0.001
die+0.001
dy+0.001
 seek+0.001
 afloat+0.001
Neg logit contributions
 course-0.001
 grace-0.001
isher-0.001
 ribbon-0.001
 please-0.001
Token gave
Token ID2921
Feature activation+5.72e-01

Pos logit contributions
 himself+0.027
lot+0.026
 seek+0.025
 afloat+0.025
 late+0.024
Neg logit contributions
 course-0.029
 ribbon-0.025
isher-0.024
-0.023
ir-0.023
Token the
Token ID262
Feature activation+2.37e-01

Pos logit contributions
 seek+0.015
lot+0.014
 himself+0.014
dy+0.014
 active+0.014
Neg logit contributions
 course-0.022
isher-0.021
ocol-0.021
ement-0.019
ir-0.019
Token re
Token ID302
Feature activation+1.86e+00

Pos logit contributions
 himself+0.006
 seek+0.006
lot+0.006
dy+0.006
die+0.006
Neg logit contributions
 course-0.009
ocol-0.009
isher-0.008
ement-0.008
-0.008
Tokenind
Token ID521
Feature activation+3.86e+00

Pos logit contributions
 himself+0.061
lot+0.058
dy+0.058
 seek+0.056
die+0.055
Neg logit contributions
 course-0.063
ocol-0.056
-0.056
isher-0.053
ement-0.052
Tokeneer
Token ID28153
Feature activation+5.87e-01

Pos logit contributions
 himself+0.073
 seek+0.069
lot+0.066
dy+0.063
die+0.060
Neg logit contributions
 course-0.178
ocol-0.171
isher-0.169
ement-0.161
-0.159
Token a
Token ID257
Feature activation-3.43e-02

Pos logit contributions
 seek+0.015
lot+0.015
 himself+0.014
dy+0.014
die+0.014
Neg logit contributions
 course-0.022
ocol-0.021
isher-0.021
ir-0.019
ement-0.019
Token big
Token ID1263
Feature activation+6.44e-01

Pos logit contributions
ocol+0.002
 course+0.001
isher+0.001
ement+0.001
ir+0.001
Neg logit contributions
 seek-0.001
 himself-0.001
lot-0.001
dy-0.001
die-0.001
Token hug
Token ID16225
Feature activation-7.41e-01

Pos logit contributions
 seek+0.022
dy+0.021
 himself+0.021
 active+0.020
die+0.020
Neg logit contributions
ocol-0.019
 course-0.018
ement-0.018
isher-0.018
-0.017
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
 course+0.025
ocol+0.023
isher+0.023
ement+0.022
+0.021
Neg logit contributions
 himself-0.023
 seek-0.022
lot-0.022
dy-0.021
die-0.021
Token cried
Token ID16896
Feature activation-6.98e-01

Pos logit contributions
 course+0.002
ocol+0.002
isher+0.002
ement+0.002
ir+0.002
Neg logit contributions
 himself-0.001
 seek-0.001
lot-0.001
dy-0.001
die-0.001
Token.
Token ID13
Feature activation+7.37e-03

Pos logit contributions
 course+0.025
ocol+0.024
isher+0.023
ement+0.022
+0.022
Neg logit contributions
 himself-0.021
 seek-0.019
lot-0.019
dy-0.019
 active-0.018
Token "
Token ID366
Feature activation-3.67e-02

Pos logit contributions
 seek+0.000
 himself+0.000
lot+0.000
dy+0.000
 active+0.000
Neg logit contributions
 course-0.000
-0.000
ement-0.000
ocol-0.000
isher-0.000
TokenThe
Token ID464
Feature activation-1.39e-01

Pos logit contributions
 course+0.001
ocol+0.001
+0.001
isher+0.001
ir+0.001
Neg logit contributions
lot-0.001
 seek-0.001
 himself-0.001
dy-0.001
die-0.001
Token ost
Token ID23619
Feature activation+6.10e-01

Pos logit contributions
 course+0.005
ocol+0.004
isher+0.004
ement+0.004
+0.004
Neg logit contributions
 himself-0.004
 seek-0.004
lot-0.004
dy-0.004
die-0.004
Tokenric
Token ID1173
Feature activation+3.84e+00

Pos logit contributions
 himself+0.018
lot+0.017
dy+0.017
 seek+0.016
die+0.015
Neg logit contributions
 course-0.021
isher-0.020
ocol-0.020
ement-0.019
-0.019
Tokenhes
Token ID956
Feature activation-6.59e-01

Pos logit contributions
 seek+0.126
dy+0.115
 himself+0.115
die+0.114
 active+0.111
Neg logit contributions
 course-0.130
ocol-0.119
ir-0.113
isher-0.112
ement-0.108
Token are
Token ID389
Feature activation-3.72e-02

Pos logit contributions
 course+0.017
ocol+0.015
isher+0.015
ement+0.013
ir+0.013
Neg logit contributions
 himself-0.026
 seek-0.025
lot-0.025
dy-0.025
die-0.024
Token mean
Token ID1612
Feature activation+1.45e+00

Pos logit contributions
 course+0.001
ocol+0.001
isher+0.001
ir+0.001
ement+0.001
Neg logit contributions
 himself-0.001
lot-0.001
 seek-0.001
dy-0.001
die-0.001
Token!
Token ID0
Feature activation-5.80e-02

Pos logit contributions
 seek+0.050
 himself+0.049
lot+0.049
die+0.046
dy+0.046
Neg logit contributions
 course-0.045
isher-0.040
ocol-0.039
ir-0.036
ement-0.035
Token They
Token ID1119
Feature activation-1.46e+00

Pos logit contributions
 course+0.002
ocol+0.002
isher+0.002
+0.001
ement+0.001
Neg logit contributions
 seek-0.002
 himself-0.002
lot-0.002
dy-0.002
die-0.002
Token mom
Token ID1995
Feature activation-2.00e-01

Pos logit contributions
 course+0.021
isher+0.021
ocol+0.021
ement+0.020
+0.019
Neg logit contributions
 himself-0.009
 seek-0.006
 having-0.006
dy-0.006
lot-0.006
Token and
Token ID290
Feature activation+4.96e-01

Pos logit contributions
 course+0.008
ocol+0.007
isher+0.007
ement+0.007
+0.007
Neg logit contributions
 himself-0.005
 seek-0.005
lot-0.005
dy-0.005
die-0.005
Token to
Token ID284
Feature activation+9.47e-01

Pos logit contributions
 himself+0.013
 seek+0.013
lot+0.013
dy+0.012
die+0.012
Neg logit contributions
 course-0.019
isher-0.018
ocol-0.018
ement-0.016
-0.016
Token the
Token ID262
Feature activation+1.05e+00

Pos logit contributions
 himself+0.022
 seek+0.018
 having+0.018
dy+0.017
lot+0.016
Neg logit contributions
isher-0.041
ocol-0.041
ement-0.040
 course-0.040
-0.038
Token ost
Token ID23619
Feature activation+6.51e-01

Pos logit contributions
 himself+0.026
 seek+0.025
dy+0.023
lot+0.023
 active+0.022
Neg logit contributions
 course-0.042
ocol-0.040
isher-0.040
ement-0.038
-0.038
Tokenric
Token ID1173
Feature activation+3.76e+00

Pos logit contributions
 himself+0.021
 seek+0.020
lot+0.020
dy+0.020
die+0.019
Neg logit contributions
 course-0.021
ocol-0.020
isher-0.019
ement-0.018
-0.018
Tokenhes
Token ID956
Feature activation-8.41e-01

Pos logit contributions
 seek+0.122
dy+0.111
 himself+0.110
die+0.109
 active+0.107
Neg logit contributions
 course-0.127
ocol-0.117
isher-0.112
ir-0.112
ement-0.107
Token.
Token ID13
Feature activation+3.30e-01

Pos logit contributions
 course+0.030
ocol+0.028
isher+0.028
ement+0.026
+0.026
Neg logit contributions
 himself-0.025
 seek-0.024
lot-0.023
dy-0.023
die-0.022
Token They
Token ID1119
Feature activation-7.97e-01

Pos logit contributions
 himself+0.009
 seek+0.009
lot+0.008
dy+0.008
die+0.008
Neg logit contributions
 course-0.012
ocol-0.011
isher-0.011
-0.011
ement-0.011
Token learned
Token ID4499
Feature activation-8.99e-02

Pos logit contributions
ocol+0.043
isher+0.039
 course+0.037
ement+0.037
ir+0.036
Neg logit contributions
 himself-0.013
 seek-0.011
dy-0.011
 even-0.010
lot-0.010
Token their
Token ID511
Feature activation-3.65e-01

Pos logit contributions
 course+0.004
ocol+0.004
isher+0.004
ement+0.004
+0.003
Neg logit contributions
 himself-0.002
 seek-0.002
lot-0.002
dy-0.002
 active-0.002
Token and
Token ID290
Feature activation+9.19e-01

Pos logit contributions
 course+0.020
 ribbon+0.016
+0.016
 rib+0.015
 please+0.014
Neg logit contributions
rated-0.023
 himself-0.021
lling-0.021
lot-0.021
 active-0.021
Token gave
Token ID2921
Feature activation+7.54e-01

Pos logit contributions
lot+0.027
dy+0.025
 himself+0.025
die+0.024
 seek+0.024
Neg logit contributions
 course-0.032
ocol-0.028
isher-0.027
 ribbon-0.026
 rib-0.026
Token her
Token ID607
Feature activation+4.30e-01

Pos logit contributions
 seek+0.025
 himself+0.024
lot+0.024
dy+0.023
 active+0.023
Neg logit contributions
 course-0.024
ocol-0.023
isher-0.022
ement-0.020
ir-0.020
Token the
Token ID262
Feature activation+1.59e-01

Pos logit contributions
lot+0.012
 seek+0.012
 himself+0.012
dy+0.011
die+0.011
Neg logit contributions
 course-0.016
ocol-0.014
isher-0.014
ement-0.013
-0.013
Token t
Token ID256
Feature activation+1.09e+00

Pos logit contributions
 himself+0.006
 seek+0.005
lot+0.005
dy+0.005
die+0.005
Neg logit contributions
 course-0.005
ocol-0.004
isher-0.004
ement-0.004
-0.004
Tokeneddy
Token ID21874
Feature activation+3.74e+00

Pos logit contributions
 himself+0.037
 seek+0.034
lot+0.034
dy+0.034
die+0.033
Neg logit contributions
 course-0.036
ocol-0.032
isher-0.030
-0.030
ement-0.029
Token bear
Token ID6842
Feature activation-3.97e-01

Pos logit contributions
 himself+0.094
 seek+0.090
lot+0.087
dy+0.085
die+0.081
Neg logit contributions
 course-0.149
ocol-0.141
isher-0.139
ement-0.132
-0.130
Token.
Token ID13
Feature activation-3.61e-01

Pos logit contributions
 course+0.013
isher+0.011
ocol+0.011
ir+0.010
ement+0.010
Neg logit contributions
 seek-0.013
lot-0.012
 himself-0.012
die-0.012
dy-0.012
Token
Token ID198
Feature activation-5.23e-01

Pos logit contributions
 course+0.013
+0.011
ocol+0.011
isher+0.010
ement+0.010
Neg logit contributions
 seek-0.011
lot-0.010
 himself-0.010
 afloat-0.010
 active-0.010
Token
Token ID198
Feature activation-6.06e-01

Pos logit contributions
 course+0.020
ir+0.017
ocol+0.017
+0.016
ay+0.015
Neg logit contributions
 seek-0.014
 afloat-0.014
 active-0.014
lot-0.013
 miners-0.013
Token"
Token ID1
Feature activation-1.15e+00

Pos logit contributions
 course+0.023
ocol+0.019
ir+0.019
+0.019
isher+0.018
Neg logit contributions
lot-0.017
 himself-0.017
dy-0.016
 afloat-0.016
die-0.015
Token The
Token ID383
Feature activation-6.99e-01

Pos logit contributions
 course+0.013
+0.012
ocol+0.011
ement+0.010
ir+0.010
Neg logit contributions
lot-0.012
 afloat-0.011
lling-0.011
 seek-0.011
 active-0.011
Token elf
Token ID23878
Feature activation+1.05e-01

Pos logit contributions
 course+0.025
ocol+0.022
isher+0.022
 ribbon+0.020
ement+0.019
Neg logit contributions
 himself-0.022
lot-0.021
 seek-0.020
dy-0.019
 active-0.019
Token thanked
Token ID26280
Feature activation-5.31e-01

Pos logit contributions
 himself+0.003
 seek+0.003
lot+0.003
dy+0.003
die+0.003
Neg logit contributions
 course-0.004
ement-0.003
isher-0.003
ocol-0.003
ir-0.003
Token the
Token ID262
Feature activation+4.05e-01

Pos logit contributions
 course+0.021
ocol+0.019
isher+0.018
ement+0.017
ir+0.017
Neg logit contributions
 seek-0.014
lot-0.014
dy-0.014
 himself-0.013
 active-0.013
Token re
Token ID302
Feature activation+1.82e+00

Pos logit contributions
lot+0.012
 himself+0.012
 seek+0.011
dy+0.011
die+0.011
Neg logit contributions
 course-0.014
ocol-0.013
isher-0.013
ement-0.012
 ribbon-0.012
Tokenind
Token ID521
Feature activation+3.71e+00

Pos logit contributions
 himself+0.062
lot+0.058
 seek+0.058
dy+0.058
die+0.056
Neg logit contributions
 course-0.058
ocol-0.052
-0.051
isher-0.051
ement-0.049
Tokeneer
Token ID28153
Feature activation+4.72e-01

Pos logit contributions
 seek+0.091
 himself+0.082
lot+0.078
dy+0.077
 active+0.076
Neg logit contributions
 course-0.152
isher-0.149
ocol-0.147
ement-0.135
ir-0.134
Token for
Token ID329
Feature activation+3.85e-02

Pos logit contributions
 seek+0.016
lot+0.015
 himself+0.014
 active+0.014
dy+0.014
Neg logit contributions
 course-0.015
isher-0.013
ocol-0.013
ement-0.013
ir-0.012
Token helping
Token ID5742
Feature activation+5.60e-01

Pos logit contributions
 himself+0.001
 seek+0.001
lot+0.001
dy+0.001
die+0.001
Neg logit contributions
 course-0.002
ocol-0.002
isher-0.002
ement-0.001
-0.001
Token him
Token ID683
Feature activation+4.44e-01

Pos logit contributions
 seek+0.020
lot+0.020
dy+0.019
die+0.019
 active+0.018
Neg logit contributions
 course-0.016
ocol-0.015
isher-0.013
ir-0.013
ement-0.012
Token and
Token ID290
Feature activation-2.59e-01

Pos logit contributions
lot+0.015
die+0.015
 seek+0.015
dy+0.014
Andrew+0.014
Neg logit contributions
 course-0.012
ocol-0.010
isher-0.010
-0.010
ir-0.010
Token smiled
Token ID13541
Feature activation-2.81e-01

Pos logit contributions
 course+0.005
 ribbon+0.004
ement+0.004
isher+0.004
ipped+0.004
Neg logit contributions
 himself-0.006
 late-0.006
 afloat-0.006
 seek-0.006
lot-0.006
Token and
Token ID290
Feature activation+3.16e-01

Pos logit contributions
 course+0.012
ocol+0.011
isher+0.011
ement+0.010
+0.010
Neg logit contributions
 himself-0.007
 seek-0.007
lot-0.006
dy-0.006
die-0.006
Token gave
Token ID2921
Feature activation+7.96e-01

Pos logit contributions
 himself+0.010
lot+0.010
 seek+0.009
dy+0.009
die+0.009
Neg logit contributions
 course-0.010
isher-0.009
 ribbon-0.008
-0.008
ocol-0.008
Token her
Token ID607
Feature activation+1.89e-01

Pos logit contributions
 himself+0.025
 seek+0.022
dy+0.022
lot+0.021
die+0.021
Neg logit contributions
 course-0.028
ocol-0.028
isher-0.027
-0.026
ement-0.025
Token t
Token ID256
Feature activation+9.21e-01

Pos logit contributions
 himself+0.005
 seek+0.005
dy+0.004
lot+0.004
die+0.004
Neg logit contributions
 course-0.007
ocol-0.007
isher-0.007
ement-0.007
ir-0.007
Tokeneddy
Token ID21874
Feature activation+3.70e+00

Pos logit contributions
 himself+0.028
dy+0.025
lot+0.024
die+0.024
 seek+0.024
Neg logit contributions
 course-0.036
ocol-0.032
-0.030
ement-0.030
isher-0.028
Token bear
Token ID6842
Feature activation+3.57e-03

Pos logit contributions
 himself+0.082
dy+0.074
 seek+0.073
lot+0.071
die+0.070
Neg logit contributions
 course-0.161
ocol-0.159
isher-0.153
ement-0.149
-0.149
Token a
Token ID257
Feature activation+9.63e-02

Pos logit contributions
 himself+0.000
 seek+0.000
lot+0.000
dy+0.000
die+0.000
Neg logit contributions
 course-0.000
ocol-0.000
isher-0.000
ement-0.000
-0.000
Token big
Token ID1263
Feature activation+7.08e-01

Pos logit contributions
 seek+0.002
 himself+0.002
lot+0.002
dy+0.002
 active+0.002
Neg logit contributions
ocol-0.004
isher-0.004
 course-0.004
ement-0.004
ir-0.004
Token hug
Token ID16225
Feature activation-7.59e-01

Pos logit contributions
 seek+0.023
 himself+0.021
dy+0.020
 active+0.019
 late+0.019
Neg logit contributions
ocol-0.024
isher-0.022
ement-0.022
 course-0.021
ir-0.020
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
 course+0.026
ocol+0.025
isher+0.024
ement+0.023
+0.022
Neg logit contributions
 himself-0.023
 seek-0.022
lot-0.022
dy-0.021
die-0.021
Token should
Token ID815
Feature activation-1.07e-01

Pos logit contributions
 course+0.025
ocol+0.023
isher+0.023
ement+0.021
+0.021
Neg logit contributions
 himself-0.025
 seek-0.025
lot-0.024
dy-0.024
die-0.023
Token not
Token ID407
Feature activation+5.46e-01

Pos logit contributions
 course+0.005
ocol+0.004
isher+0.004
ement+0.004
+0.004
Neg logit contributions
 himself-0.002
 seek-0.002
lot-0.002
dy-0.002
 active-0.002
Token tease
Token ID34691
Feature activation-4.83e-01

Pos logit contributions
 himself+0.018
 seek+0.017
lot+0.017
dy+0.016
die+0.016
Neg logit contributions
 course-0.018
ocol-0.017
isher-0.016
-0.015
ement-0.015
Token the
Token ID262
Feature activation+5.68e-01

Pos logit contributions
 course+0.015
ocol+0.013
isher+0.013
ir+0.012
ement+0.012
Neg logit contributions
 seek-0.017
lot-0.017
 himself-0.017
dy-0.016
die-0.016
Token ost
Token ID23619
Feature activation+7.24e-01

Pos logit contributions
 himself+0.014
 seek+0.013
lot+0.013
dy+0.013
 active+0.012
Neg logit contributions
 course-0.023
ocol-0.022
isher-0.022
ement-0.021
-0.020
Tokenric
Token ID1173
Feature activation+3.69e+00

Pos logit contributions
 himself+0.024
lot+0.022
 seek+0.022
dy+0.022
die+0.021
Neg logit contributions
 course-0.023
ocol-0.022
isher-0.022
ement-0.020
-0.020
Tokenhes
Token ID956
Feature activation-1.30e+00

Pos logit contributions
 seek+0.120
 himself+0.110
dy+0.109
die+0.108
 active+0.105
Neg logit contributions
 course-0.127
ocol-0.116
ir-0.111
isher-0.108
ement-0.105
Token.
Token ID13
Feature activation-1.71e-01

Pos logit contributions
 course+0.044
ocol+0.039
isher+0.038
ir+0.036
ement+0.036
Neg logit contributions
 seek-0.041
lot-0.041
 himself-0.041
dy-0.040
die-0.039
Token They
Token ID1119
Feature activation-1.14e+00

Pos logit contributions
 course+0.007
ocol+0.006
isher+0.006
ement+0.006
+0.006
Neg logit contributions
 himself-0.005
 seek-0.004
lot-0.004
dy-0.004
 active-0.004
Token are
Token ID389
Feature activation-1.04e-01

Pos logit contributions
ocol+0.033
isher+0.032
 course+0.031
ement+0.030
+0.027
Neg logit contributions
 himself-0.041
 seek-0.039
dy-0.038
lot-0.038
 active-0.037
Token not
Token ID407
Feature activation+9.97e-01

Pos logit contributions
 course+0.004
ocol+0.004
isher+0.004
ir+0.003
 ribbon+0.003
Neg logit contributions
 seek-0.003
 himself-0.003
lot-0.003
dy-0.002
die-0.002
Token,
Token ID11
Feature activation+7.88e-01

Pos logit contributions
 himself+0.004
 seek+0.003
dy+0.003
lot+0.003
 active+0.003
Neg logit contributions
 course-0.006
isher-0.006
ocol-0.005
-0.005
ement-0.005
Token "
Token ID366
Feature activation+2.96e-01

Pos logit contributions
 himself+0.021
 seek+0.021
lot+0.020
dy+0.019
die+0.019
Neg logit contributions
 course-0.030
ocol-0.028
-0.027
isher-0.027
ement-0.026
TokenThis
Token ID1212
Feature activation+1.01e-01

Pos logit contributions
 himself+0.007
 seek+0.007
lot+0.007
dy+0.006
die+0.006
Neg logit contributions
ocol-0.011
isher-0.011
 course-0.011
ement-0.011
-0.010
Token is
Token ID318
Feature activation-1.13e-01

Pos logit contributions
 himself+0.003
 seek+0.003
dy+0.003
lot+0.003
 having+0.003
Neg logit contributions
isher-0.003
ocol-0.003
 course-0.003
ement-0.003
-0.003
Token ind
Token ID773
Feature activation+1.18e+00

Pos logit contributions
 course+0.004
ocol+0.004
isher+0.004
ement+0.004
+0.004
Neg logit contributions
 himself-0.003
 seek-0.003
lot-0.003
dy-0.003
 active-0.003
Tokenigo
Token ID14031
Feature activation+3.69e+00

Pos logit contributions
dy+0.032
 himself+0.031
lot+0.029
die+0.028
 seek+0.028
Neg logit contributions
 course-0.043
isher-0.042
ocol-0.040
-0.037
 emanating-0.034
Token,
Token ID11
Feature activation-2.45e-01

Pos logit contributions
 himself+0.098
 seek+0.093
lot+0.091
dy+0.091
die+0.085
Neg logit contributions
 course-0.137
isher-0.134
ocol-0.134
ement-0.123
-0.122
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
isher+0.008
 course+0.007
ocol+0.007
ement+0.007
+0.006
Neg logit contributions
 seek-0.008
 himself-0.008
dy-0.007
lot-0.007
 having-0.007
Token this
Token ID428
Feature activation+3.99e-01

Pos logit contributions
 himself+0.046
 seek+0.046
lot+0.043
dy+0.043
 having+0.042
Neg logit contributions
isher-0.037
 course-0.036
ocol-0.035
ement-0.033
-0.030
Token is
Token ID318
Feature activation+3.22e-01

Pos logit contributions
 himself+0.013
 seek+0.013
lot+0.013
dy+0.012
die+0.012
Neg logit contributions
 course-0.012
ocol-0.012
isher-0.012
ement-0.011
-0.011
Token violet
Token ID45725
Feature activation+4.96e-01

Pos logit contributions
 himself+0.009
 seek+0.009
lot+0.008
dy+0.008
die+0.008
Neg logit contributions
 course-0.012
ocol-0.011
isher-0.011
ement-0.010
-0.010
Token.
Token ID13
Feature activation+1.91e-01

Pos logit contributions
 course+0.007
ocol+0.007
isher+0.007
ement+0.006
+0.006
Neg logit contributions
 himself-0.006
 seek-0.006
lot-0.006
dy-0.006
die-0.006
Token He
Token ID679
Feature activation+3.86e-02

Pos logit contributions
 seek+0.005
 active+0.005
rated+0.005
dy+0.005
lot+0.004
Neg logit contributions
 course-0.007
-0.007
ocol-0.006
ement-0.006
isher-0.006
Token told
Token ID1297
Feature activation+5.40e-01

Pos logit contributions
 himself+0.001
lot+0.001
 seek+0.001
dy+0.001
 active+0.001
Neg logit contributions
 course-0.001
isher-0.001
 ribbon-0.001
ocol-0.001
ement-0.001
Token his
Token ID465
Feature activation+3.43e-01

Pos logit contributions
 seek+0.016
lot+0.015
dy+0.014
 active+0.014
die+0.014
Neg logit contributions
 course-0.020
ocol-0.018
isher-0.017
ir-0.017
ement-0.016
Token re
Token ID302
Feature activation+2.02e+00

Pos logit contributions
 seek+0.011
 himself+0.011
lot+0.010
die+0.009
 afloat+0.009
Neg logit contributions
 course-0.011
ocol-0.010
isher-0.010
 ribbon-0.010
ement-0.010
Tokenind
Token ID521
Feature activation+3.65e+00

Pos logit contributions
 himself+0.069
lot+0.066
dy+0.066
 seek+0.066
die+0.064
Neg logit contributions
 course-0.063
ocol-0.058
isher-0.055
-0.055
ement-0.053
Tokeneer
Token ID28153
Feature activation+5.25e-01

Pos logit contributions
 seek+0.089
 himself+0.079
 active+0.075
dy+0.072
 afloat+0.072
Neg logit contributions
 course-0.149
isher-0.144
ocol-0.144
ir-0.133
ement-0.130
Token to
Token ID284
Feature activation+6.25e-01

Pos logit contributions
 seek+0.017
 himself+0.016
 active+0.016
lot+0.016
die+0.016
Neg logit contributions
 course-0.017
ocol-0.014
isher-0.014
ement-0.013
ir-0.013
Token stay
Token ID2652
Feature activation+4.34e-02

Pos logit contributions
 himself+0.020
 seek+0.019
lot+0.019
dy+0.019
die+0.018
Neg logit contributions
 course-0.021
ocol-0.019
isher-0.019
ement-0.017
-0.017
Token put
Token ID1234
Feature activation+7.66e-01

Pos logit contributions
lot+0.001
 himself+0.001
 seek+0.001
dy+0.001
die+0.001
Neg logit contributions
 course-0.002
ocol-0.002
isher-0.001
ement-0.001
ir-0.001
Token and
Token ID290
Feature activation+9.21e-01

Pos logit contributions
 himself+0.020
 seek+0.020
lot+0.019
dy+0.018
die+0.018
Neg logit contributions
 course-0.030
ocol-0.028
isher-0.028
ement-0.026
-0.026
Token shut
Token ID4423
Feature activation-5.27e-01

Pos logit contributions
 course+0.034
ocol+0.034
isher+0.033
ement+0.031
+0.031
Neg logit contributions
 himself-0.017
 seek-0.015
lot-0.014
dy-0.014
 active-0.013
Token and
Token ID290
Feature activation+3.90e-01

Pos logit contributions
 course+0.022
ocol+0.021
isher+0.021
ement+0.020
+0.019
Neg logit contributions
 himself-0.012
 seek-0.012
lot-0.011
dy-0.011
die-0.011
Token hugged
Token ID41130
Feature activation-2.06e-01

Pos logit contributions
 himself+0.010
 seek+0.010
lot+0.009
dy+0.009
 active+0.009
Neg logit contributions
 course-0.015
ocol-0.015
isher-0.015
ement-0.014
-0.014
Token her
Token ID607
Feature activation-1.98e-01

Pos logit contributions
ocol+0.008
 course+0.008
isher+0.008
+0.007
ement+0.007
Neg logit contributions
 himself-0.006
 seek-0.005
lot-0.005
dy-0.005
die-0.005
Token t
Token ID256
Feature activation+8.43e-01

Pos logit contributions
 course+0.008
ocol+0.008
isher+0.007
ement+0.007
+0.007
Neg logit contributions
 himself-0.005
 seek-0.005
dy-0.005
lot-0.005
 active-0.005
Tokeneddy
Token ID21874
Feature activation+3.64e+00

Pos logit contributions
 himself+0.025
dy+0.023
lot+0.023
 seek+0.022
die+0.021
Neg logit contributions
 course-0.034
ocol-0.030
-0.028
ement-0.027
isher-0.026
Token bear
Token ID6842
Feature activation-1.79e-01

Pos logit contributions
 himself+0.087
 seek+0.080
lot+0.078
dy+0.078
die+0.074
Neg logit contributions
 course-0.151
ocol-0.147
isher-0.143
-0.136
ement-0.136
Token tightly
Token ID17707
Feature activation-2.77e-01

Pos logit contributions
 course+0.005
ocol+0.005
isher+0.005
ement+0.005
+0.004
Neg logit contributions
 himself-0.006
 seek-0.006
lot-0.006
dy-0.006
die-0.006
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
 course+0.010
ocol+0.010
isher+0.010
ement+0.009
+0.009
Neg logit contributions
 himself-0.008
 seek-0.007
lot-0.007
dy-0.007
 active-0.007
Token
Token ID198
Feature activation+1.09e-01

Pos logit contributions
 himself+0.003
 seek+0.003
lot+0.003
dy+0.003
 active+0.003
Neg logit contributions
 course-0.007
ocol-0.006
isher-0.006
-0.006
ement-0.006
Token
Token ID198
Feature activation-4.73e-02

Pos logit contributions
 seek+0.003
 himself+0.002
dy+0.002
lot+0.002
 active+0.002
Neg logit contributions
 course-0.004
ir-0.004
-0.004
ement-0.004
ocol-0.004
Token and
Token ID290
Feature activation+9.19e-01

Pos logit contributions
 course+0.006
ocol+0.005
isher+0.005
ement+0.004
ir+0.004
Neg logit contributions
lot-0.006
dy-0.006
die-0.006
rated-0.006
athan-0.005
Token he
Token ID339
Feature activation+6.36e-01

Pos logit contributions
lot+0.027
dy+0.025
 himself+0.025
die+0.024
 seek+0.024
Neg logit contributions
 course-0.032
ocol-0.028
isher-0.027
 ribbon-0.026
 rib-0.026
Token brought
Token ID3181
Feature activation+3.30e-01

Pos logit contributions
 seek+0.018
lot+0.018
 himself+0.018
dy+0.017
die+0.017
Neg logit contributions
 course-0.022
isher-0.019
ocol-0.018
 ribbon-0.017
ement-0.017
Token his
Token ID465
Feature activation+1.86e-01

Pos logit contributions
 seek+0.011
lot+0.010
dy+0.010
die+0.010
 active+0.010
Neg logit contributions
 course-0.011
ocol-0.010
isher-0.010
ement-0.009
ir-0.009
Token t
Token ID256
Feature activation+1.00e+00

Pos logit contributions
lot+0.006
 himself+0.006
 seek+0.006
dy+0.006
die+0.005
Neg logit contributions
 course-0.006
ocol-0.006
isher-0.005
ement-0.005
 ribbon-0.005
Tokeneddy
Token ID21874
Feature activation+3.63e+00

Pos logit contributions
 himself+0.036
 seek+0.034
lot+0.033
dy+0.033
die+0.032
Neg logit contributions
 course-0.030
ocol-0.028
isher-0.026
ement-0.025
-0.025
Token bear
Token ID6842
Feature activation-4.24e-01

Pos logit contributions
 himself+0.092
 seek+0.090
lot+0.086
dy+0.083
 active+0.081
Neg logit contributions
 course-0.144
isher-0.134
ocol-0.134
ement-0.125
-0.123
Token with
Token ID351
Feature activation+8.53e-02

Pos logit contributions
 course+0.014
isher+0.012
ocol+0.012
 ribbon+0.011
ir+0.011
Neg logit contributions
lot-0.013
 seek-0.013
die-0.013
 himself-0.012
dy-0.012
Token him
Token ID683
Feature activation-5.30e-01

Pos logit contributions
 himself+0.002
 seek+0.002
lot+0.002
dy+0.002
die+0.002
Neg logit contributions
 course-0.003
ocol-0.003
isher-0.003
ement-0.003
-0.003
Token.
Token ID13
Feature activation-3.76e-01

Pos logit contributions
 course+0.014
ocol+0.013
isher+0.012
ement+0.011
+0.011
Neg logit contributions
 seek-0.020
lot-0.019
 himself-0.019
die-0.018
dy-0.018
Token
Token ID220
Feature activation+2.82e-02

Pos logit contributions
 course+0.013
+0.012
ocol+0.011
isher+0.011
ement+0.011
Neg logit contributions
 seek-0.011
 active-0.011
 afloat-0.010
rated-0.010
lling-0.010
Token smiled
Token ID13541
Feature activation-3.20e-01

Pos logit contributions
 course+0.011
isher+0.010
+0.010
ement+0.010
ipped+0.009
Neg logit contributions
 seek-0.009
 himself-0.009
lot-0.009
 active-0.008
die-0.008
Token and
Token ID290
Feature activation+4.45e-01

Pos logit contributions
 course+0.012
isher+0.011
ocol+0.011
ement+0.011
+0.010
Neg logit contributions
 seek-0.009
 himself-0.008
lot-0.008
dy-0.008
die-0.008
Token hugged
Token ID41130
Feature activation-1.16e-01

Pos logit contributions
 himself+0.013
 seek+0.013
lot+0.013
dy+0.012
die+0.012
Neg logit contributions
 course-0.016
isher-0.014
ocol-0.014
-0.013
ement-0.013
Token her
Token ID607
Feature activation-6.57e-02

Pos logit contributions
 course+0.004
ocol+0.004
isher+0.004
ement+0.004
+0.004
Neg logit contributions
 himself-0.004
 seek-0.003
lot-0.003
dy-0.003
die-0.003
Token t
Token ID256
Feature activation+7.72e-01

Pos logit contributions
 course+0.003
ocol+0.003
isher+0.003
ir+0.003
ement+0.003
Neg logit contributions
 himself-0.001
 seek-0.001
dy-0.001
 active-0.001
lot-0.001
Tokeneddy
Token ID21874
Feature activation+3.62e+00

Pos logit contributions
 himself+0.023
dy+0.021
lot+0.021
 seek+0.020
die+0.020
Neg logit contributions
 course-0.030
ocol-0.027
-0.025
ement-0.024
isher-0.023
Token bear
Token ID6842
Feature activation-9.61e-02

Pos logit contributions
 himself+0.094
 seek+0.088
lot+0.085
dy+0.085
die+0.081
Neg logit contributions
 course-0.142
ocol-0.137
isher-0.134
ement-0.127
-0.126
Token.
Token ID13
Feature activation+2.41e-01

Pos logit contributions
 course+0.003
isher+0.003
ocol+0.003
ement+0.002
+0.002
Neg logit contributions
 himself-0.003
 seek-0.003
lot-0.003
dy-0.003
die-0.003
Token<|endoftext|>
Token ID50256
Feature activation+9.31e-02

Pos logit contributions
 himself+0.007
 seek+0.007
lot+0.006
dy+0.006
 active+0.006
Neg logit contributions
 course-0.009
ocol-0.008
isher-0.008
-0.008
ement-0.008
TokenOnce
Token ID7454
Feature activation+4.78e-01

Pos logit contributions
 seek+0.002
 himself+0.002
dy+0.002
 active+0.002
lot+0.002
Neg logit contributions
 course-0.004
-0.004
ocol-0.004
ement-0.003
isher-0.003
Token upon
Token ID2402
Feature activation+2.09e-01

Pos logit contributions
 himself+0.013
 seek+0.013
lot+0.012
dy+0.012
 active+0.011
Neg logit contributions
 course-0.018
ocol-0.017
isher-0.017
ement-0.016
-0.016
Token and
Token ID290
Feature activation+9.36e-01

Pos logit contributions
 himself+0.024
 seek+0.024
lot+0.024
dy+0.023
die+0.023
Neg logit contributions
 course-0.032
ocol-0.030
isher-0.029
ement-0.028
-0.028
Token finally
Token ID3443
Feature activation+6.16e-01

Pos logit contributions
 himself+0.026
 seek+0.025
lot+0.024
dy+0.024
die+0.023
Neg logit contributions
 course-0.035
ocol-0.033
isher-0.032
ement-0.031
-0.030
Token Mark
Token ID2940
Feature activation-2.79e-01

Pos logit contributions
 active+0.019
die+0.019
lot+0.019
 himself+0.018
dy+0.018
Neg logit contributions
 course-0.018
isher-0.016
ocol-0.015
ir-0.015
 ribbon-0.015
Token felt
Token ID2936
Feature activation-3.35e-01

Pos logit contributions
 course+0.009
isher+0.008
ement+0.008
+0.008
ipped+0.007
Neg logit contributions
 seek-0.009
 active-0.008
 himself-0.008
die-0.008
 late-0.007
Token something
Token ID1223
Feature activation-9.45e-01

Pos logit contributions
 course+0.011
ocol+0.011
ement+0.010
isher+0.010
+0.010
Neg logit contributions
 seek-0.010
lot-0.010
dy-0.010
die-0.009
any-0.008
Token tug
Token ID27762
Feature activation-3.35e+00

Pos logit contributions
 course+0.038
ocol+0.035
isher+0.035
ement+0.033
+0.033
Neg logit contributions
 himself-0.025
 seek-0.023
dy-0.022
lot-0.022
 active-0.021
Tokenging
Token ID2667
Feature activation+3.87e-01

Pos logit contributions
 course+0.108
ocol+0.106
isher+0.104
ement+0.098
ir+0.096
Neg logit contributions
 seek-0.107
lot-0.105
 himself-0.102
die-0.101
dy-0.100
Token at
Token ID379
Feature activation+2.05e-01

Pos logit contributions
 seek+0.011
lot+0.011
dy+0.010
die+0.010
 himself+0.010
Neg logit contributions
 course-0.014
ocol-0.013
isher-0.013
ement-0.013
ir-0.012
Token the
Token ID262
Feature activation+1.55e-04

Pos logit contributions
 seek+0.005
 himself+0.005
lot+0.005
dy+0.004
 active+0.004
Neg logit contributions
 course-0.009
ocol-0.008
isher-0.008
ement-0.008
-0.007
Token line
Token ID1627
Feature activation-2.17e-01

Pos logit contributions
 seek+0.000
 himself+0.000
lot+0.000
dy+0.000
die+0.000
Neg logit contributions
 course-0.000
ocol-0.000
isher-0.000
ement-0.000
-0.000
Token.
Token ID13
Feature activation+7.74e-02

Pos logit contributions
 course+0.007
isher+0.006
ocol+0.006
ement+0.006
+0.005
Neg logit contributions
die-0.008
 seek-0.008
dy-0.008
lot-0.007
 himself-0.007
Tokenbled
Token ID9342
Feature activation+1.11e+00

Pos logit contributions
 course+0.019
ocol+0.018
isher+0.018
ement+0.017
+0.017
Neg logit contributions
 himself-0.015
 seek-0.014
lot-0.013
dy-0.013
die-0.013
Token up
Token ID510
Feature activation-7.71e-01

Pos logit contributions
 seek+0.035
lot+0.033
 active+0.031
 himself+0.031
dy+0.030
Neg logit contributions
 course-0.040
ocol-0.036
isher-0.036
ement-0.034
ir-0.032
Token all
Token ID477
Feature activation-1.18e-03

Pos logit contributions
 course+0.031
ocol+0.029
isher+0.028
ement+0.027
+0.027
Neg logit contributions
 seek-0.019
lot-0.019
 himself-0.018
dy-0.018
die-0.018
Token the
Token ID262
Feature activation-7.79e-01

Pos logit contributions
 course+0.000
ocol+0.000
isher+0.000
+0.000
ement+0.000
Neg logit contributions
 himself-0.000
 seek-0.000
lot-0.000
dy-0.000
die-0.000
Token ch
Token ID442
Feature activation-1.02e+00

Pos logit contributions
 course+0.026
ocol+0.025
isher+0.024
+0.023
ement+0.023
Neg logit contributions
 himself-0.025
 seek-0.024
lot-0.023
dy-0.023
die-0.022
Tokenocol
Token ID4668
Feature activation-3.34e+00

Pos logit contributions
 course+0.013
+0.008
 ribbon+0.006
ement+0.006
isher+0.005
Neg logit contributions
 himself-0.055
dy-0.050
lot-0.050
ages-0.049
 active-0.049
Tokenates
Token ID689
Feature activation-3.06e-01

Pos logit contributions
 course+0.134
ocol+0.123
isher+0.123
ement+0.119
+0.118
Neg logit contributions
 himself-0.084
lot-0.078
 seek-0.076
dy-0.074
die-0.071
Token,
Token ID11
Feature activation-5.08e-01

Pos logit contributions
 course+0.012
isher+0.011
ocol+0.011
ement+0.010
+0.010
Neg logit contributions
 seek-0.008
lot-0.008
 himself-0.008
dy-0.008
die-0.008
Token and
Token ID290
Feature activation-1.36e-01

Pos logit contributions
 course+0.020
ocol+0.018
isher+0.018
ement+0.017
+0.017
Neg logit contributions
 himself-0.013
lot-0.012
 seek-0.012
dy-0.012
die-0.011
Token she
Token ID673
Feature activation+5.53e-01

Pos logit contributions
 course+0.005
isher+0.004
 ribbon+0.004
ocol+0.004
 rib+0.004
Neg logit contributions
lot-0.004
dy-0.004
die-0.004
 himself-0.004
 active-0.004
Token was
Token ID373
Feature activation-5.82e-02

Pos logit contributions
lot+0.015
 himself+0.015
 seek+0.015
 active+0.014
dy+0.014
Neg logit contributions
 course-0.020
isher-0.018
-0.017
ocol-0.016
ement-0.016
Token save
Token ID3613
Feature activation-3.68e-01

Pos logit contributions
 himself+0.003
lot+0.003
dy+0.002
 seek+0.002
die+0.002
Neg logit contributions
 course-0.003
ocol-0.002
isher-0.002
-0.002
ir-0.002
Token his
Token ID465
Feature activation-3.45e-01

Pos logit contributions
 course+0.013
ocol+0.012
isher+0.011
ement+0.011
ir+0.010
Neg logit contributions
 seek-0.012
lot-0.010
dy-0.010
die-0.010
 active-0.010
Token pup
Token ID15552
Feature activation-2.53e-01

Pos logit contributions
 course+0.012
isher+0.010
ocol+0.010
 ribbon+0.010
+0.010
Neg logit contributions
 himself-0.011
lot-0.010
 seek-0.010
dy-0.009
die-0.009
Token from
Token ID422
Feature activation+1.30e+00

Pos logit contributions
 course+0.007
+0.005
isher+0.005
ement+0.005
ocol+0.005
Neg logit contributions
die-0.009
 seek-0.009
lot-0.009
dy-0.009
athan-0.008
Token the
Token ID262
Feature activation-4.76e-01

Pos logit contributions
 seek+0.035
lot+0.035
dy+0.035
die+0.033
 active+0.032
Neg logit contributions
 course-0.051
ocol-0.045
isher-0.043
ement-0.041
-0.040
Token injuries
Token ID6821
Feature activation-3.29e+00

Pos logit contributions
 course+0.017
ocol+0.016
isher+0.016
+0.015
ement+0.015
Neg logit contributions
 himself-0.014
lot-0.013
 seek-0.013
dy-0.013
die-0.012
Token and
Token ID290
Feature activation-1.65e-02

Pos logit contributions
 course+0.099
ocol+0.087
ement+0.085
isher+0.084
+0.084
Neg logit contributions
 seek-0.112
 himself-0.111
dy-0.111
lot-0.109
die-0.109
Token the
Token ID262
Feature activation-1.08e-01

Pos logit contributions
 course+0.001
isher+0.001
ocol+0.001
 ribbon+0.000
+0.000
Neg logit contributions
lot-0.000
dy-0.000
 seek-0.000
die-0.000
 himself-0.000
Token pup
Token ID15552
Feature activation-3.24e-01

Pos logit contributions
 course+0.004
ocol+0.003
isher+0.003
+0.003
 ribbon+0.003
Neg logit contributions
 himself-0.003
lot-0.003
dy-0.003
 seek-0.003
die-0.003
Token died
Token ID3724
Feature activation-2.47e-01

Pos logit contributions
 course+0.011
+0.009
isher+0.009
ement+0.009
ocol+0.009
Neg logit contributions
 seek-0.009
die-0.009
 himself-0.009
dy-0.009
lot-0.009
Token soon
Token ID2582
Feature activation+5.56e-01

Pos logit contributions
 course+0.007
isher+0.006
ocol+0.006
ement+0.006
+0.006
Neg logit contributions
 seek-0.009
lot-0.009
die-0.008
dy-0.008
 himself-0.008
Token with
Token ID351
Feature activation-3.21e-01

Pos logit contributions
lot+0.061
die+0.058
dy+0.058
 seek+0.055
 himself+0.055
Neg logit contributions
 course-0.043
ement-0.034
isher-0.033
ocol-0.032
-0.031
Token a
Token ID257
Feature activation-3.76e-01

Pos logit contributions
 course+0.011
ocol+0.011
isher+0.011
ement+0.010
+0.010
Neg logit contributions
 himself-0.009
 seek-0.009
lot-0.008
dy-0.008
 active-0.008
Token picnic
Token ID35715
Feature activation-3.58e-01

Pos logit contributions
 course+0.014
ocol+0.013
isher+0.013
ement+0.012
+0.012
Neg logit contributions
 himself-0.011
 seek-0.011
lot-0.010
dy-0.010
die-0.010
Token in
Token ID287
Feature activation-5.28e-01

Pos logit contributions
 course+0.012
isher+0.010
ocol+0.010
+0.010
ement+0.010
Neg logit contributions
 himself-0.012
lot-0.011
 seek-0.011
dy-0.011
die-0.011
Token the
Token ID262
Feature activation-6.32e-01

Pos logit contributions
 course+0.022
ocol+0.022
isher+0.022
ement+0.020
+0.020
Neg logit contributions
 himself-0.012
 seek-0.011
lot-0.010
dy-0.010
 active-0.010
Token shelter
Token ID11772
Feature activation-3.27e+00

Pos logit contributions
 course+0.025
ocol+0.024
isher+0.023
ement+0.022
+0.022
Neg logit contributions
 himself-0.016
 seek-0.016
lot-0.015
dy-0.015
 active-0.014
Token.
Token ID13
Feature activation+2.04e-02

Pos logit contributions
 course+0.101
isher+0.088
ocol+0.088
ement+0.085
+0.083
Neg logit contributions
 himself-0.112
lot-0.109
 seek-0.108
die-0.106
dy-0.106
Token<|endoftext|>
Token ID50256
Feature activation-6.81e-02

Pos logit contributions
 himself+0.000
 seek+0.000
lot+0.000
dy+0.000
die+0.000
Neg logit contributions
 course-0.001
ocol-0.001
isher-0.001
ement-0.001
-0.001
TokenOnce
Token ID7454
Feature activation+5.40e-01

Pos logit contributions
 course+0.003
+0.003
ement+0.002
ocol+0.002
ir+0.002
Neg logit contributions
 active-0.002
 seek-0.002
 himself-0.002
dy-0.002
lot-0.001
Token upon
Token ID2402
Feature activation-9.23e-03

Pos logit contributions
 himself+0.014
 seek+0.013
lot+0.013
dy+0.012
die+0.012
Neg logit contributions
 course-0.021
ocol-0.020
isher-0.020
ement-0.019
-0.019
Token a
Token ID257
Feature activation+3.49e-01

Pos logit contributions
 course+0.000
ocol+0.000
isher+0.000
ement+0.000
+0.000
Neg logit contributions
 himself-0.000
 seek-0.000
lot-0.000
dy-0.000
die-0.000
Token saw
Token ID2497
Feature activation-1.41e+00

Pos logit contributions
 himself+0.018
 seek+0.017
lot+0.016
dy+0.016
 active+0.015
Neg logit contributions
 course-0.029
ocol-0.028
isher-0.027
ement-0.026
-0.026
Token all
Token ID477
Feature activation+8.30e-01

Pos logit contributions
 course+0.059
ocol+0.057
isher+0.056
+0.054
ement+0.053
Neg logit contributions
 himself-0.034
 seek-0.030
lot-0.029
dy-0.028
die-0.026
Token the
Token ID262
Feature activation+4.70e-02

Pos logit contributions
 himself+0.021
 seek+0.020
lot+0.020
dy+0.019
die+0.019
Neg logit contributions
 course-0.033
ocol-0.031
isher-0.031
ement-0.029
-0.028
Token tasty
Token ID25103
Feature activation-5.09e-01

Pos logit contributions
 himself+0.001
 seek+0.001
lot+0.001
dy+0.001
 active+0.001
Neg logit contributions
 course-0.002
ocol-0.002
isher-0.002
-0.001
ement-0.001
Token ch
Token ID442
Feature activation-7.25e-01

Pos logit contributions
 course+0.018
isher+0.017
ocol+0.016
+0.016
ir+0.016
Neg logit contributions
 himself-0.016
 seek-0.014
dy-0.014
lot-0.014
 active-0.013
Tokenocol
Token ID4668
Feature activation-3.21e+00

Pos logit contributions
 course+0.011
+0.006
ement+0.005
isher+0.005
 ribbon+0.005
Neg logit contributions
 himself-0.038
lot-0.035
dy-0.034
 seek-0.034
 miners-0.034
Tokenates
Token ID689
Feature activation-2.59e-01

Pos logit contributions
 course+0.131
isher+0.120
ocol+0.119
ement+0.116
+0.114
Neg logit contributions
 himself-0.080
lot-0.074
 seek-0.072
dy-0.070
die-0.067
Token inside
Token ID2641
Feature activation-7.17e-01

Pos logit contributions
 course+0.009
ocol+0.008
isher+0.008
ement+0.008
+0.008
Neg logit contributions
 himself-0.008
 seek-0.008
lot-0.007
dy-0.007
die-0.007
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
 course+0.026
isher+0.026
ocol+0.026
+0.024
ement+0.023
Neg logit contributions
 himself-0.020
 seek-0.018
dy-0.017
lot-0.017
 having-0.016
Token She
Token ID1375
Feature activation+9.03e-02

Pos logit contributions
 himself+0.004
 seek+0.004
lot+0.004
dy+0.003
 active+0.003
Neg logit contributions
 course-0.005
ocol-0.004
isher-0.004
-0.004
ement-0.004
Token couldn
Token ID3521
Feature activation-4.67e-01

Pos logit contributions
lot+0.003
 himself+0.003
 seek+0.002
 afloat+0.002
 active+0.002
Neg logit contributions
 course-0.003
isher-0.003
 ribbon-0.003
ocol-0.003
ement-0.003
Token he
Token ID339
Feature activation+2.89e-01

Pos logit contributions
lot+0.001
 seek+0.001
die+0.001
 himself+0.001
dy+0.001
Neg logit contributions
 course-0.001
-0.001
ocol-0.001
ir-0.001
isher-0.001
Token was
Token ID373
Feature activation-6.96e-01

Pos logit contributions
 himself+0.009
 seek+0.008
lot+0.008
 afloat+0.008
dy+0.008
Neg logit contributions
 course-0.011
isher-0.009
-0.008
ir-0.008
ocol-0.008
Token known
Token ID1900
Feature activation-1.59e-01

Pos logit contributions
 course+0.023
+0.017
ocol+0.017
 yeah+0.017
isher+0.017
Neg logit contributions
dy-0.021
rated-0.021
die-0.021
lot-0.021
 afloat-0.020
Token as
Token ID355
Feature activation+2.03e-02

Pos logit contributions
 course+0.005
ocol+0.004
isher+0.004
ement+0.004
ir+0.004
Neg logit contributions
 seek-0.005
lot-0.005
 himself-0.005
dy-0.005
die-0.005
Token Prince
Token ID9005
Feature activation-6.74e-01

Pos logit contributions
 seek+0.000
dy+0.000
lot+0.000
die+0.000
 himself+0.000
Neg logit contributions
 course-0.001
ocol-0.001
isher-0.001
ement-0.001
-0.001
Token Mitt
Token ID16627
Feature activation-3.18e+00

Pos logit contributions
 course+0.023
ocol+0.021
+0.021
isher+0.020
ir+0.020
Neg logit contributions
 himself-0.020
 seek-0.019
 active-0.017
die-0.017
dy-0.017
Tokenens
Token ID641
Feature activation-2.51e+00

Pos logit contributions
 course+0.118
ocol+0.116
isher+0.115
ement+0.110
ir+0.104
Neg logit contributions
 himself-0.087
 seek-0.082
lot-0.081
dy-0.079
 active-0.075
Token and
Token ID290
Feature activation+1.17e-01

Pos logit contributions
 course+0.092
+0.075
ocol+0.071
ement+0.071
isher+0.070
Neg logit contributions
 himself-0.072
die-0.070
 seek-0.069
dy-0.069
lot-0.069
Token he
Token ID339
Feature activation+6.07e-01

Pos logit contributions
 afloat+0.003
lot+0.003
dy+0.003
 himself+0.003
athan+0.003
Neg logit contributions
 course-0.004
ocol-0.003
 rib-0.003
isher-0.003
 ribbon-0.003
Token lived
Token ID5615
Feature activation-7.41e-01

Pos logit contributions
 himself+0.019
 seek+0.018
 afloat+0.018
lot+0.017
 late+0.017
Neg logit contributions
 course-0.021
-0.017
isher-0.017
ir-0.016
ement-0.015
Token happily
Token ID18177
Feature activation-1.12e+00

Pos logit contributions
 course+0.023
ocol+0.020
isher+0.020
ir+0.019
+0.019
Neg logit contributions
dy-0.023
 himself-0.023
lot-0.023
die-0.022
 seek-0.022
Token the
Token ID262
Feature activation+4.05e-01

Pos logit contributions
 course+0.011
ocol+0.010
isher+0.010
ement+0.010
+0.010
Neg logit contributions
 himself-0.005
 seek-0.005
lot-0.005
dy-0.005
die-0.005
Token information
Token ID1321
Feature activation-7.94e-01

Pos logit contributions
 himself+0.013
lot+0.013
 seek+0.013
dy+0.013
die+0.012
Neg logit contributions
 course-0.013
ocol-0.012
isher-0.011
-0.011
ement-0.011
Token and
Token ID290
Feature activation-5.95e-02

Pos logit contributions
 course+0.031
ocol+0.029
isher+0.029
ement+0.028
+0.027
Neg logit contributions
 himself-0.020
 seek-0.020
lot-0.019
dy-0.019
die-0.018
Token continued
Token ID3767
Feature activation+1.11e+00

Pos logit contributions
 course+0.002
isher+0.002
ocol+0.002
 ribbon+0.002
ement+0.002
Neg logit contributions
lot-0.002
dy-0.002
 himself-0.002
 seek-0.002
die-0.002
Token his
Token ID465
Feature activation-1.02e+00

Pos logit contributions
 seek+0.030
lot+0.030
dy+0.029
die+0.029
 himself+0.029
Neg logit contributions
 course-0.040
ocol-0.038
ir-0.035
-0.034
ipped-0.033
Token stroll
Token ID41360
Feature activation-3.15e+00

Pos logit contributions
 course+0.036
ocol+0.031
ement+0.030
+0.030
isher+0.030
Neg logit contributions
 himself-0.031
lot-0.030
 seek-0.030
dy-0.029
die-0.029
Token.
Token ID13
Feature activation+8.17e-02

Pos logit contributions
 course+0.110
isher+0.100
ocol+0.099
ement+0.098
ir+0.093
Neg logit contributions
 seek-0.090
lot-0.089
die-0.089
dy-0.088
 himself-0.087
Token He
Token ID679
Feature activation+1.32e-01

Pos logit contributions
 seek+0.003
 active+0.002
 afloat+0.002
dy+0.002
lling+0.002
Neg logit contributions
 course-0.003
-0.003
ocol-0.002
ement-0.002
ay-0.002
Token wasn
Token ID2492
Feature activation-2.73e-01

Pos logit contributions
 himself+0.004
 seek+0.004
lot+0.004
 late+0.003
 afloat+0.003
Neg logit contributions
 course-0.005
isher-0.004
ement-0.004
-0.004
ir-0.004
Token't
Token ID470
Feature activation+5.08e-01

Pos logit contributions
 course+0.007
+0.006
ocol+0.006
ir+0.006
ement+0.006
Neg logit contributions
 seek-0.010
 himself-0.010
dy-0.010
lot-0.010
die-0.010
Token so
Token ID523
Feature activation+5.09e-01

Pos logit contributions
 seek+0.015
Back+0.014
Later+0.014
 rink+0.014
lot+0.014
Neg logit contributions
ocol-0.015
 course-0.014
ir-0.014
 emanating-0.013
isher-0.012
Token forth
Token ID6071
Feature activation+1.78e-01

Pos logit contributions
lot+0.003
 himself+0.003
dy+0.003
die+0.003
 seek+0.003
Neg logit contributions
 course-0.004
ocol-0.003
isher-0.003
 ribbon-0.003
ement-0.003
Token,
Token ID11
Feature activation-7.62e-02

Pos logit contributions
 himself+0.005
 seek+0.005
lot+0.005
dy+0.005
die+0.005
Neg logit contributions
 course-0.006
ocol-0.006
isher-0.006
ement-0.006
ir-0.005
Token while
Token ID981
Feature activation-1.59e+00

Pos logit contributions
 course+0.003
ocol+0.002
ement+0.002
isher+0.002
ir+0.002
Neg logit contributions
lot-0.002
 himself-0.002
 seek-0.002
dy-0.002
die-0.002
Token the
Token ID262
Feature activation+1.17e-01

Pos logit contributions
 course+0.058
ocol+0.054
+0.051
ir+0.050
ement+0.048
Neg logit contributions
lot-0.047
die-0.045
dy-0.045
 himself-0.044
 seek-0.043
Token small
Token ID1402
Feature activation+1.15e-02

Pos logit contributions
 himself+0.003
lot+0.003
dy+0.003
 seek+0.003
die+0.003
Neg logit contributions
 course-0.004
ocol-0.004
isher-0.004
-0.004
ement-0.004
Token bush
Token ID24484
Feature activation-3.14e+00

Pos logit contributions
 himself+0.000
lot+0.000
 seek+0.000
dy+0.000
die+0.000
Neg logit contributions
 course-0.000
ocol-0.000
isher-0.000
ement-0.000
-0.000
Token would
Token ID561
Feature activation-3.71e-01

Pos logit contributions
 course+0.119
ement+0.105
isher+0.102
+0.100
ocol+0.100
Neg logit contributions
 himself-0.089
lot-0.085
die-0.083
dy-0.083
 seek-0.082
Token w
Token ID266
Feature activation-1.20e+00

Pos logit contributions
 course+0.012
ocol+0.011
isher+0.011
ement+0.010
+0.010
Neg logit contributions
 himself-0.012
lot-0.011
 seek-0.011
dy-0.011
die-0.011
Tokeniggle
Token ID24082
Feature activation-5.93e-01

Pos logit contributions
 course+0.035
isher+0.034
ocol+0.032
ement+0.029
+0.028
Neg logit contributions
 himself-0.043
 seek-0.039
 active-0.038
dy-0.038
lot-0.038
Token and
Token ID290
Feature activation+1.99e-01

Pos logit contributions
 course+0.020
ocol+0.018
ir+0.017
isher+0.017
ement+0.016
Neg logit contributions
lot-0.018
 seek-0.018
 himself-0.017
die-0.017
dy-0.017
Token j
Token ID474
Feature activation-4.64e-01

Pos logit contributions
lot+0.007
dy+0.007
 himself+0.007
die+0.006
athan+0.006
Neg logit contributions
 course-0.006
ocol-0.005
 ribbon-0.005
 rib-0.004
isher-0.004
Token was
Token ID373
Feature activation+1.52e-01

Pos logit contributions
 course+0.015
ocol+0.015
isher+0.014
ement+0.014
+0.014
Neg logit contributions
 himself-0.008
 seek-0.008
lot-0.007
dy-0.007
 active-0.007
Token the
Token ID262
Feature activation+1.38e-01

Pos logit contributions
 himself+0.004
 seek+0.003
lot+0.003
dy+0.003
die+0.003
Neg logit contributions
 course-0.006
ocol-0.006
isher-0.006
ement-0.005
-0.005
Token same
Token ID976
Feature activation+1.20e-02

Pos logit contributions
 himself+0.003
 seek+0.003
 active+0.003
 having+0.003
lot+0.002
Neg logit contributions
ocol-0.006
 course-0.006
isher-0.006
ir-0.005
ement-0.005
Token color
Token ID3124
Feature activation+1.15e+00

Pos logit contributions
 himself+0.000
 seek+0.000
dy+0.000
lot+0.000
 active+0.000
Neg logit contributions
 course-0.000
ocol-0.000
isher-0.000
-0.000
ement-0.000
Token as
Token ID355
Feature activation-3.12e-01

Pos logit contributions
 himself+0.031
 seek+0.030
dy+0.028
lot+0.027
 active+0.026
Neg logit contributions
 course-0.044
ocol-0.043
isher-0.041
-0.038
ement-0.038
Token Mitt
Token ID16627
Feature activation-3.14e+00

Pos logit contributions
 course+0.012
ocol+0.011
isher+0.011
ement+0.011
+0.010
Neg logit contributions
 himself-0.009
 seek-0.008
lot-0.008
dy-0.008
 active-0.007
Tokenens
Token ID641
Feature activation-2.30e+00

Pos logit contributions
isher+0.131
ocol+0.130
 course+0.130
ement+0.125
ir+0.118
Neg logit contributions
 himself-0.069
 seek-0.064
dy-0.061
lot-0.059
 active-0.056
Token!
Token ID0
Feature activation-2.43e-01

Pos logit contributions
 course+0.089
ocol+0.085
isher+0.083
ement+0.079
+0.077
Neg logit contributions
 himself-0.060
 seek-0.058
lot-0.056
dy-0.054
die-0.052
Token Lily
Token ID20037
Feature activation-2.38e-01

Pos logit contributions
 course+0.009
ocol+0.008
isher+0.008
ement+0.008
+0.008
Neg logit contributions
 himself-0.007
 seek-0.007
lot-0.007
dy-0.006
die-0.006
Token was
Token ID373
Feature activation-3.40e-01

Pos logit contributions
 course+0.010
isher+0.009
ocol+0.009
ement+0.008
+0.008
Neg logit contributions
 himself-0.006
 seek-0.005
lot-0.005
dy-0.005
die-0.005
Token so
Token ID523
Feature activation+1.46e-01

Pos logit contributions
 course+0.012
ement+0.010
+0.010
isher+0.010
ocol+0.010
Neg logit contributions
lot-0.009
die-0.009
dy-0.009
 seek-0.009
 himself-0.009
Token dream
Token ID4320
Feature activation-4.74e-01

Pos logit contributions
 himself+0.025
 seek+0.024
lot+0.023
dy+0.022
die+0.022
Neg logit contributions
 course-0.034
ocol-0.032
isher-0.031
ement-0.030
-0.029
Token called
Token ID1444
Feature activation+2.94e-01

Pos logit contributions
 course+0.015
isher+0.013
ocol+0.013
ement+0.013
+0.013
Neg logit contributions
 himself-0.015
dy-0.015
die-0.015
 seek-0.015
lot-0.014
Token a
Token ID257
Feature activation+1.74e-01

Pos logit contributions
 himself+0.008
 seek+0.008
lot+0.008
dy+0.008
die+0.007
Neg logit contributions
 course-0.011
ocol-0.010
isher-0.010
ement-0.009
-0.009
Token "
Token ID366
Feature activation-2.43e-01

Pos logit contributions
 himself+0.004
 seek+0.004
lot+0.004
dy+0.004
 active+0.003
Neg logit contributions
 course-0.007
ocol-0.007
isher-0.007
ement-0.007
-0.006
Tokennight
Token ID3847
Feature activation-2.89e-01

Pos logit contributions
 course+0.008
ocol+0.007
isher+0.007
+0.007
ir+0.007
Neg logit contributions
dy-0.008
 himself-0.008
lot-0.007
die-0.007
 seek-0.007
Tokenmare
Token ID11449
Feature activation-3.11e+00

Pos logit contributions
 course+0.008
ocol+0.007
isher+0.007
+0.007
ir+0.007
Neg logit contributions
 himself-0.010
 seek-0.009
lot-0.009
dy-0.009
 active-0.009
Token".
Token ID1911
Feature activation-2.38e-01

Pos logit contributions
 course+0.102
ocol+0.093
isher+0.092
ir+0.087
+0.087
Neg logit contributions
 himself-0.099
 seek-0.093
lot-0.091
dy-0.090
die-0.088
Token She
Token ID1375
Feature activation+1.06e-01

Pos logit contributions
 course+0.008
ocol+0.008
isher+0.008
ement+0.007
+0.007
Neg logit contributions
 himself-0.008
 seek-0.007
lot-0.007
dy-0.007
die-0.007
Token woke
Token ID19092
Feature activation-5.21e-01

Pos logit contributions
 himself+0.003
 seek+0.003
lot+0.003
dy+0.002
 active+0.002
Neg logit contributions
 course-0.004
isher-0.004
ocol-0.003
ement-0.003
 ribbon-0.003
Token up
Token ID510
Feature activation-2.98e-01

Pos logit contributions
 course+0.023
ocol+0.022
isher+0.021
ement+0.021
+0.020
Neg logit contributions
 himself-0.011
 seek-0.011
lot-0.010
dy-0.010
die-0.009
Token crying
Token ID13774
Feature activation-1.06e-02

Pos logit contributions
 course+0.012
ocol+0.011
ement+0.010
isher+0.010
+0.010
Neg logit contributions
 seek-0.007
 himself-0.007
lot-0.007
dy-0.007
die-0.007
Token and
Token ID290
Feature activation+5.16e-02

Pos logit contributions
 course+0.055
ocol+0.052
isher+0.051
ement+0.049
+0.048
Neg logit contributions
 himself-0.032
 seek-0.031
lot-0.030
dy-0.029
die-0.028
Token put
Token ID1234
Feature activation+5.04e-01

Pos logit contributions
lot+0.002
 himself+0.001
dy+0.001
die+0.001
edient+0.001
Neg logit contributions
 course-0.002
isher-0.002
 ribbon-0.002
ocol-0.001
 rib-0.001
Token them
Token ID606
Feature activation-1.61e-01

Pos logit contributions
 himself+0.015
 seek+0.013
dy+0.013
lot+0.013
die+0.012
Neg logit contributions
 course-0.018
ocol-0.017
isher-0.016
ement-0.016
-0.016
Token under
Token ID739
Feature activation-8.17e-01

Pos logit contributions
 course+0.005
isher+0.005
ocol+0.005
+0.005
ipped+0.005
Neg logit contributions
lot-0.005
die-0.004
dy-0.004
 himself-0.004
 seek-0.004
Token the
Token ID262
Feature activation-3.54e-01

Pos logit contributions
 course+0.034
ocol+0.032
isher+0.032
ement+0.030
+0.030
Neg logit contributions
 himself-0.019
 seek-0.018
lot-0.018
dy-0.017
die-0.016
Token shelter
Token ID11772
Feature activation-3.08e+00

Pos logit contributions
 course+0.011
ocol+0.011
isher+0.010
ement+0.010
+0.009
Neg logit contributions
 himself-0.012
 seek-0.012
dy-0.011
lot-0.011
 active-0.011
Token.
Token ID13
Feature activation+5.75e-02

Pos logit contributions
 course+0.098
isher+0.090
ocol+0.089
ement+0.085
+0.084
Neg logit contributions
 himself-0.100
 seek-0.098
lot-0.097
dy-0.095
die-0.094
Token Then
Token ID3244
Feature activation+4.20e-01

Pos logit contributions
 seek+0.002
 himself+0.002
lot+0.002
 active+0.002
dy+0.002
Neg logit contributions
 course-0.002
-0.002
isher-0.002
ement-0.002
ocol-0.002
Token they
Token ID484
Feature activation+3.67e-01

Pos logit contributions
lot+0.015
 active+0.014
 himself+0.014
die+0.014
dy+0.013
Neg logit contributions
 course-0.013
isher-0.010
-0.010
 ribbon-0.010
ocol-0.010
Token sat
Token ID3332
Feature activation-3.86e-02

Pos logit contributions
 himself+0.010
lot+0.009
 seek+0.009
dy+0.009
 active+0.008
Neg logit contributions
 course-0.014
isher-0.013
ocol-0.013
ir-0.012
ement-0.012
Token down
Token ID866
Feature activation-6.34e-01

Pos logit contributions
 course+0.002
ocol+0.002
isher+0.002
ement+0.002
+0.001
Neg logit contributions
 himself-0.001
 seek-0.001
lot-0.001
dy-0.001
die-0.001
Token k
Token ID479
Feature activation-1.21e+00

Pos logit contributions
 himself+0.001
lot+0.001
 seek+0.001
dy+0.001
die+0.001
Neg logit contributions
 course-0.002
isher-0.001
ement-0.001
ocol-0.001
-0.001
Tokenang
Token ID648
Feature activation+3.95e-01

Pos logit contributions
 course+0.034
ocol+0.033
isher+0.031
ement+0.029
ir+0.028
Neg logit contributions
 himself-0.043
 seek-0.043
lot-0.041
 active-0.040
dy-0.039
Tokenaroo
Token ID38049
Feature activation-3.77e-01

Pos logit contributions
 himself+0.014
 seek+0.014
dy+0.014
lot+0.014
die+0.014
Neg logit contributions
 course-0.011
isher-0.010
ocol-0.010
ement-0.009
ir-0.009
Token in
Token ID287
Feature activation-1.62e+00

Pos logit contributions
 course+0.014
isher+0.012
ement+0.012
ocol+0.012
+0.011
Neg logit contributions
 himself-0.010
die-0.010
 seek-0.010
dy-0.010
lot-0.010
Token the
Token ID262
Feature activation-6.62e-01

Pos logit contributions
 course+0.064
ocol+0.056
ir+0.052
+0.052
isher+0.050
Neg logit contributions
lot-0.047
dy-0.046
ages-0.046
die-0.044
 seek-0.044
Token bush
Token ID24484
Feature activation-3.06e+00

Pos logit contributions
 course+0.026
ocol+0.021
+0.021
ement+0.021
isher+0.021
Neg logit contributions
 himself-0.020
lot-0.018
dy-0.018
 seek-0.018
die-0.017
Token!
Token ID0
Feature activation-5.12e-01

Pos logit contributions
 course+0.110
ement+0.092
+0.092
isher+0.092
ocol+0.092
Neg logit contributions
 himself-0.092
lot-0.090
dy-0.089
 seek-0.089
die-0.088
Token<|endoftext|>
Token ID50256
Feature activation-2.85e-02

Pos logit contributions
 course+0.019
+0.017
ement+0.016
ir+0.015
ay+0.015
Neg logit contributions
 seek-0.015
 active-0.014
lot-0.013
 himself-0.013
rated-0.012
TokenOnce
Token ID7454
Feature activation+6.13e-01

Pos logit contributions
 course+0.001
+0.001
ement+0.001
ocol+0.001
ir+0.001
Neg logit contributions
 seek-0.001
 active-0.001
lot-0.001
 himself-0.001
dy-0.001
Token upon
Token ID2402
Feature activation+1.37e-01

Pos logit contributions
 seek+0.018
 himself+0.018
lot+0.017
 active+0.016
dy+0.016
Neg logit contributions
 course-0.022
ocol-0.020
isher-0.020
ement-0.019
ir-0.019
Token a
Token ID257
Feature activation+3.61e-01

Pos logit contributions
 seek+0.004
lot+0.003
dy+0.003
 himself+0.003
die+0.003
Neg logit contributions
 course-0.005
ocol-0.005
isher-0.004
-0.004
ement-0.004
Token the
Token ID262
Feature activation-8.83e-02

Pos logit contributions
 course+0.041
ocol+0.036
isher+0.034
+0.034
ement+0.033
Neg logit contributions
lot-0.030
 himself-0.029
die-0.028
 seek-0.028
dy-0.028
Token house
Token ID2156
Feature activation-8.24e-01

Pos logit contributions
 course+0.003
isher+0.002
+0.002
ocol+0.002
ay+0.002
Neg logit contributions
lot-0.003
 himself-0.003
ages-0.003
athan-0.002
 seek-0.002
Token in
Token ID287
Feature activation-5.30e-01

Pos logit contributions
 course+0.028
ement+0.025
isher+0.024
+0.024
ocol+0.024
Neg logit contributions
die-0.025
 himself-0.025
lot-0.025
dy-0.024
 seek-0.024
Token their
Token ID511
Feature activation-1.01e+00

Pos logit contributions
 course+0.022
ocol+0.020
isher+0.020
ement+0.019
+0.019
Neg logit contributions
 himself-0.013
lot-0.013
 seek-0.012
dy-0.012
die-0.011
Token new
Token ID649
Feature activation-7.03e-01

Pos logit contributions
 course+0.039
ocol+0.035
isher+0.035
ement+0.033
+0.033
Neg logit contributions
 himself-0.028
lot-0.026
 seek-0.026
die-0.024
dy-0.024
Token boots
Token ID14412
Feature activation-3.06e+00

Pos logit contributions
 course+0.022
ocol+0.020
isher+0.020
ir+0.018
ement+0.018
Neg logit contributions
lot-0.024
 himself-0.023
 seek-0.021
die-0.021
athan-0.020
Token.
Token ID13
Feature activation+9.82e-03

Pos logit contributions
 course+0.110
isher+0.099
ocol+0.098
ement+0.095
+0.095
Neg logit contributions
 himself-0.089
 seek-0.087
lot-0.085
dy-0.084
die-0.083
Token They
Token ID1119
Feature activation-5.25e-01

Pos logit contributions
 seek+0.000
lot+0.000
 himself+0.000
dy+0.000
 active+0.000
Neg logit contributions
 course-0.000
-0.000
ocol-0.000
ement-0.000
isher-0.000
Token laughed
Token ID13818
Feature activation-4.01e-01

Pos logit contributions
 course+0.018
isher+0.015
 ribbon+0.015
ocol+0.014
ir+0.014
Neg logit contributions
 himself-0.016
lot-0.016
die-0.015
dy-0.015
 late-0.015
Token and
Token ID290
Feature activation-2.24e-01

Pos logit contributions
 course+0.014
ement+0.013
isher+0.012
ocol+0.012
+0.012
Neg logit contributions
 seek-0.012
lot-0.011
 himself-0.011
dy-0.011
die-0.011
Token played
Token ID2826
Feature activation+3.82e-01

Pos logit contributions
 course+0.008
+0.006
 ribbon+0.006
isher+0.006
ocol+0.006
Neg logit contributions
lot-0.007
 himself-0.007
dy-0.006
die-0.006
 seek-0.006
Token took
Token ID1718
Feature activation-4.40e-01

Pos logit contributions
 course+0.005
isher+0.005
ocol+0.005
ement+0.004
 ribbon+0.004
Neg logit contributions
 himself-0.005
lot-0.005
 seek-0.005
die-0.005
dy-0.005
Token out
Token ID503
Feature activation-9.79e-01

Pos logit contributions
 course+0.019
ocol+0.018
isher+0.018
ement+0.017
+0.017
Neg logit contributions
 himself-0.010
 seek-0.009
lot-0.009
dy-0.009
die-0.008
Token his
Token ID465
Feature activation-8.70e-01

Pos logit contributions
 course+0.043
ocol+0.040
isher+0.040
ement+0.038
+0.037
Neg logit contributions
 himself-0.021
 seek-0.020
lot-0.020
dy-0.019
die-0.018
Token hand
Token ID1021
Feature activation-3.73e-01

Pos logit contributions
 course+0.032
ocol+0.031
isher+0.030
ement+0.029
ir+0.028
Neg logit contributions
 himself-0.024
 seek-0.023
dy-0.023
lot-0.022
 active-0.022
Tokenker
Token ID6122
Feature activation+9.47e-01

Pos logit contributions
 course+0.016
ocol+0.016
isher+0.015
ement+0.015
+0.014
Neg logit contributions
 himself-0.008
 seek-0.008
lot-0.007
dy-0.007
 active-0.007
Tokenchief
Token ID17351
Feature activation-3.05e+00

Pos logit contributions
 himself+0.040
 seek+0.039
lot+0.039
dy+0.038
 active+0.037
Neg logit contributions
 course-0.021
ocol-0.019
isher-0.019
ement-0.017
-0.017
Token and
Token ID290
Feature activation+2.50e-01

Pos logit contributions
 course+0.109
isher+0.099
ement+0.096
+0.094
ir+0.093
Neg logit contributions
 seek-0.087
 himself-0.082
lot-0.079
dy-0.079
die-0.078
Token carefully
Token ID7773
Feature activation-1.89e-01

Pos logit contributions
 himself+0.007
lot+0.007
 seek+0.007
dy+0.007
die+0.007
Neg logit contributions
 course-0.009
isher-0.008
ocol-0.008
-0.007
 ribbon-0.007
Token picked
Token ID6497
Feature activation-3.58e-01

Pos logit contributions
 course+0.007
ocol+0.007
isher+0.007
ement+0.006
+0.006
Neg logit contributions
 himself-0.005
 seek-0.005
lot-0.005
dy-0.005
 active-0.004
Token up
Token ID510
Feature activation-3.59e-01

Pos logit contributions
 course+0.012
ocol+0.012
isher+0.012
ement+0.011
+0.011
Neg logit contributions
 himself-0.011
 seek-0.010
lot-0.010
dy-0.010
die-0.010
Token the
Token ID262
Feature activation-3.57e-01

Pos logit contributions
 course+0.015
ocol+0.013
isher+0.013
ement+0.012
+0.012
Neg logit contributions
 seek-0.009
lot-0.009
 himself-0.008
dy-0.008
die-0.008
Token lady
Token ID10846
Feature activation-9.12e-01

Pos logit contributions
 seek+0.007
 himself+0.007
lot+0.006
 active+0.006
dy+0.006
Neg logit contributions
 course-0.011
ocol-0.011
isher-0.010
ement-0.010
ir-0.010
Token was
Token ID373
Feature activation-8.33e-01

Pos logit contributions
 course+0.033
isher+0.026
ement+0.025
+0.025
ipped+0.024
Neg logit contributions
 himself-0.028
 seek-0.027
 afloat-0.027
lot-0.025
die-0.024
Token known
Token ID1900
Feature activation-2.15e-01

Pos logit contributions
 course+0.028
ocol+0.022
isher+0.021
ir+0.020
+0.020
Neg logit contributions
lot-0.026
dy-0.025
 himself-0.025
 afloat-0.024
die-0.024
Token as
Token ID355
Feature activation+9.42e-02

Pos logit contributions
 course+0.007
ocol+0.006
isher+0.006
ement+0.006
+0.006
Neg logit contributions
 himself-0.007
 seek-0.007
lot-0.007
dy-0.007
die-0.006
Token the
Token ID262
Feature activation+2.89e-01

Pos logit contributions
lot+0.003
dy+0.003
 himself+0.002
 seek+0.002
die+0.002
Neg logit contributions
 course-0.003
ocol-0.003
isher-0.003
ement-0.003
-0.003
Token blanket
Token ID18447
Feature activation-3.05e+00

Pos logit contributions
 himself+0.007
 seek+0.007
lot+0.006
dy+0.006
 active+0.006
Neg logit contributions
 course-0.012
ocol-0.012
isher-0.011
ement-0.011
-0.011
Token covering
Token ID9505
Feature activation-5.94e-01

Pos logit contributions
 course+0.112
ocol+0.102
isher+0.100
ement+0.097
+0.096
Neg logit contributions
 himself-0.087
lot-0.083
 seek-0.083
dy-0.081
die-0.080
Token silly
Token ID14397
Feature activation+8.84e-01

Pos logit contributions
 course+0.021
ocol+0.019
isher+0.018
ement+0.018
ir+0.017
Neg logit contributions
lot-0.018
 seek-0.018
 himself-0.017
dy-0.017
die-0.016
Token lady
Token ID10846
Feature activation-4.86e-02

Pos logit contributions
lot+0.028
 himself+0.027
 seek+0.026
die+0.025
dy+0.025
Neg logit contributions
 course-0.030
ocol-0.028
isher-0.026
ement-0.026
 ribbon-0.024
Token.
Token ID13
Feature activation+1.91e-02

Pos logit contributions
 course+0.002
ocol+0.002
ement+0.002
isher+0.002
ir+0.002
Neg logit contributions
 himself-0.001
lot-0.001
 seek-0.001
die-0.001
dy-0.001
Token<|endoftext|>
Token ID50256
Feature activation-9.99e-02

Pos logit contributions
 seek+0.000
 himself+0.000
lot+0.000
 active+0.000
dy+0.000
Neg logit contributions
 course-0.001
-0.001
ocol-0.001
ement-0.001
isher-0.001
Token,
Token ID11
Feature activation+3.78e-02

Pos logit contributions
 course+0.000
ocol+0.000
isher+0.000
ement+0.000
+0.000
Neg logit contributions
 himself-0.000
 seek-0.000
dy-0.000
lot-0.000
 active-0.000
Token Tim
Token ID5045
Feature activation-5.56e-02

Pos logit contributions
 himself+0.001
 seek+0.001
lot+0.001
dy+0.001
 active+0.001
Neg logit contributions
 course-0.001
ocol-0.001
isher-0.001
ement-0.001
ir-0.001
Tokenmy
Token ID1820
Feature activation-3.21e-03

Pos logit contributions
 course+0.001
isher+0.001
el+0.001
ipped+0.001
ir+0.001
Neg logit contributions
 seek-0.002
 himself-0.002
 afloat-0.002
 mesmer-0.002
 active-0.002
Token felt
Token ID2936
Feature activation-1.23e-01

Pos logit contributions
 course+0.000
ipped+0.000
our+0.000
aw+0.000
 can+0.000
Neg logit contributions
 himself-0.000
 seek-0.000
Back-0.000
 afloat-0.000
lot-0.000
Token something
Token ID1223
Feature activation-7.69e-01

Pos logit contributions
 course+0.005
ocol+0.004
isher+0.004
+0.004
ement+0.004
Neg logit contributions
 seek-0.003
lot-0.003
die-0.003
dy-0.003
 himself-0.003
Token tug
Token ID27762
Feature activation-3.04e+00

Pos logit contributions
 course+0.032
ocol+0.030
isher+0.030
ement+0.028
+0.028
Neg logit contributions
 himself-0.019
 seek-0.017
dy-0.017
lot-0.016
 active-0.016
Token on
Token ID319
Feature activation-9.88e-01

Pos logit contributions
 course+0.099
ocol+0.098
isher+0.095
ir+0.089
ement+0.086
Neg logit contributions
 seek-0.100
lot-0.098
die-0.093
dy-0.090
 himself-0.087
Token his
Token ID465
Feature activation-4.62e-01

Pos logit contributions
 course+0.043
ocol+0.041
isher+0.038
ement+0.037
ir+0.037
Neg logit contributions
 seek-0.024
lot-0.023
die-0.020
 active-0.020
dy-0.020
Token line
Token ID1627
Feature activation-4.04e-01

Pos logit contributions
 course+0.016
ocol+0.015
isher+0.015
ement+0.014
+0.014
Neg logit contributions
 himself-0.014
 seek-0.013
lot-0.013
dy-0.012
die-0.012
Token.
Token ID13
Feature activation-7.52e-02

Pos logit contributions
 course+0.012
isher+0.010
ement+0.010
ir+0.010
ocol+0.010
Neg logit contributions
die-0.014
dy-0.014
 seek-0.014
lot-0.014
 himself-0.013
Token He
Token ID679
Feature activation+1.96e-01

Pos logit contributions
+0.003
ocol+0.003
 course+0.003
isher+0.003
ement+0.003
Neg logit contributions
 seek-0.002
lot-0.002
die-0.002
dy-0.002
 afloat-0.002
Token other
Token ID584
Feature activation+1.36e+00

Pos logit contributions
 course+0.017
ocol+0.015
isher+0.014
ir+0.014
ement+0.014
Neg logit contributions
lot-0.010
 seek-0.010
dy-0.010
 himself-0.010
die-0.009
Token while
Token ID981
Feature activation-2.19e+00

Pos logit contributions
 seek+0.041
lot+0.041
die+0.039
 himself+0.038
dy+0.038
Neg logit contributions
 course-0.043
isher-0.037
ocol-0.035
ement-0.035
 ribbon-0.033
Token they
Token ID484
Feature activation-3.60e-01

Pos logit contributions
 course+0.073
ocol+0.072
isher+0.064
+0.063
ement+0.062
Neg logit contributions
lot-0.071
die-0.067
dy-0.066
 seek-0.065
 himself-0.065
Tokenâ
Token ID22940
Feature activation-4.11e-01

Pos logit contributions
 course+0.011
ocol+0.010
isher+0.010
ement+0.009
+0.009
Neg logit contributions
 himself-0.013
 seek-0.012
lot-0.012
dy-0.012
die-0.011
Token
Token ID26391
Feature activation+8.19e-01

Pos logit contributions
 course+0.016
ocol+0.016
isher+0.015
ement+0.015
+0.014
Neg logit contributions
 himself-0.010
 seek-0.010
lot-0.010
dy-0.009
die-0.009
Token
Token ID8151
Feature activation-3.00e+00

Pos logit contributions
 himself+0.023
 seek+0.021
die+0.020
dy+0.020
lot+0.019
Neg logit contributions
 course-0.031
-0.030
ocol-0.028
ement-0.027
isher-0.026
Tokenre
Token ID260
Feature activation-9.26e-01

Pos logit contributions
 course+0.085
+0.075
ocol+0.075
ement+0.072
ir+0.072
Neg logit contributions
 seek-0.112
 himself-0.110
lot-0.107
die-0.105
dy-0.105
Token facing
Token ID6476
Feature activation-2.67e-01

Pos logit contributions
 course+0.031
ocol+0.028
ir+0.026
isher+0.026
ement+0.026
Neg logit contributions
 seek-0.027
lot-0.027
 himself-0.026
die-0.025
dy-0.025
Token something
Token ID1223
Feature activation-1.04e+00

Pos logit contributions
 course+0.009
ocol+0.008
isher+0.008
ement+0.007
ir+0.007
Neg logit contributions
 seek-0.009
lot-0.008
 himself-0.008
dy-0.007
 active-0.007
Token that
Token ID326
Feature activation-1.24e-02

Pos logit contributions
 course+0.032
ocol+0.029
ement+0.028
isher+0.028
ir+0.025
Neg logit contributions
 seek-0.035
lot-0.035
 himself-0.034
die-0.033
dy-0.032
Tokenâ
Token ID22940
Feature activation-1.05e-01

Pos logit contributions
 course+0.000
isher+0.000
ocol+0.000
ement+0.000
+0.000
Neg logit contributions
 himself-0.000
 seek-0.000
lot-0.000
dy-0.000
die-0.000
Token know
Token ID760
Feature activation-5.59e-01

Pos logit contributions
 course+0.018
ocol+0.018
ement+0.018
isher+0.017
ir+0.016
Neg logit contributions
 himself-0.015
 seek-0.015
dy-0.014
lot-0.014
 active-0.013
Token you
Token ID345
Feature activation-1.57e-01

Pos logit contributions
 course+0.018
isher+0.018
ocol+0.018
+0.017
ement+0.017
Neg logit contributions
 himself-0.018
 seek-0.016
lot-0.016
dy-0.015
 having-0.015
Token value
Token ID1988
Feature activation-6.87e-01

Pos logit contributions
 course+0.005
ocol+0.005
isher+0.005
ement+0.004
+0.004
Neg logit contributions
 himself-0.005
 seek-0.005
lot-0.005
dy-0.005
 active-0.005
Token it
Token ID340
Feature activation-3.13e-01

Pos logit contributions
 course+0.023
ocol+0.021
isher+0.021
ement+0.020
+0.019
Neg logit contributions
 himself-0.022
 seek-0.021
lot-0.021
dy-0.020
die-0.020
Token a
Token ID257
Feature activation-1.18e+00

Pos logit contributions
 course+0.010
isher+0.009
ement+0.008
ocol+0.008
+0.008
Neg logit contributions
 seek-0.010
 himself-0.010
lot-0.010
dy-0.010
die-0.009
Token lot
Token ID1256
Feature activation-3.00e+00

Pos logit contributions
 course+0.044
ocol+0.043
isher+0.042
ement+0.040
+0.039
Neg logit contributions
 himself-0.032
 seek-0.032
lot-0.030
dy-0.030
 active-0.029
Token."
Token ID526
Feature activation-3.88e-02

Pos logit contributions
 course+0.116
ocol+0.110
isher+0.108
ement+0.102
+0.102
Neg logit contributions
 himself-0.080
 seek-0.076
lot-0.074
dy-0.072
 active-0.069
Token They
Token ID1119
Feature activation-3.50e-02

Pos logit contributions
 course+0.002
ocol+0.002
isher+0.002
ement+0.002
+0.002
Neg logit contributions
 himself-0.001
 seek-0.001
lot-0.001
dy-0.001
 active-0.001
Token looked
Token ID3114
Feature activation-3.54e-01

Pos logit contributions
ocol+0.002
 course+0.002
ement+0.002
isher+0.002
+0.002
Neg logit contributions
 himself-0.001
 seek-0.001
dy-0.001
 even-0.001
lot-0.001
Token everywhere
Token ID8347
Feature activation-9.11e-01

Pos logit contributions
 course+0.015
ocol+0.015
isher+0.014
+0.014
ement+0.013
Neg logit contributions
 himself-0.008
dy-0.008
 active-0.007
 seek-0.007
die-0.007
Token and
Token ID290
Feature activation-1.01e-01

Pos logit contributions
 course+0.033
ocol+0.031
isher+0.030
ement+0.029
+0.028
Neg logit contributions
 himself-0.026
 seek-0.025
lot-0.025
dy-0.024
die-0.023
Token they
Token ID484
Feature activation-1.06e-01

Pos logit contributions
 course+0.035
ocol+0.033
isher+0.032
ement+0.031
+0.030
Neg logit contributions
 himself-0.028
 seek-0.027
lot-0.026
dy-0.026
die-0.025
Token shared
Token ID4888
Feature activation+1.93e-01

Pos logit contributions
 course+0.005
ocol+0.005
isher+0.005
ement+0.004
+0.004
Neg logit contributions
 himself-0.002
 seek-0.002
lot-0.002
 active-0.002
dy-0.002
Token together
Token ID1978
Feature activation-4.41e-01

Pos logit contributions
 seek+0.006
lot+0.006
dy+0.006
 himself+0.005
die+0.005
Neg logit contributions
 course-0.007
ocol-0.006
isher-0.005
ement-0.005
 ribbon-0.005
Token on
Token ID319
Feature activation-1.05e+00

Pos logit contributions
 course+0.013
ement+0.010
ocol+0.010
isher+0.010
 ribbon+0.009
Neg logit contributions
lot-0.016
die-0.016
dy-0.016
 seek-0.015
 himself-0.015
Token those
Token ID883
Feature activation+3.71e-01

Pos logit contributions
 course+0.040
ocol+0.036
+0.033
ement+0.033
isher+0.033
Neg logit contributions
 seek-0.029
lot-0.028
dy-0.027
die-0.027
 himself-0.026
Token walks
Token ID11114
Feature activation-3.00e+00

Pos logit contributions
lot+0.010
 seek+0.010
die+0.010
 himself+0.010
dy+0.010
Neg logit contributions
 course-0.014
ement-0.012
ocol-0.012
isher-0.012
ir-0.011
Token.
Token ID13
Feature activation+2.23e-02

Pos logit contributions
 course+0.097
isher+0.082
ocol+0.080
ement+0.078
+0.077
Neg logit contributions
 himself-0.100
 seek-0.096
dy-0.095
die-0.095
lot-0.094
Token
Token ID198
Feature activation+5.39e-02

Pos logit contributions
 seek+0.001
 active+0.001
 himself+0.001
dy+0.001
lot+0.001
Neg logit contributions
 course-0.001
-0.001
ement-0.001
ocol-0.001
ay-0.001
Token
Token ID198
Feature activation-7.63e-02

Pos logit contributions
 seek+0.002
 miners+0.002
 active+0.002
dy+0.002
die+0.002
Neg logit contributions
 course-0.002
-0.002
ay-0.001
ir-0.001
uggle-0.001
TokenSpot
Token ID32565
Feature activation-7.49e-01

Pos logit contributions
 course+0.003
+0.003
 ribbon+0.002
ir+0.002
ocol+0.002
Neg logit contributions
 himself-0.002
dy-0.002
die-0.002
lot-0.002
 seek-0.002
Token always
Token ID1464
Feature activation+1.12e+00

Pos logit contributions
 course+0.024
ipped+0.017
+0.017
 ribbon+0.017
ement+0.017
Neg logit contributions
 afloat-0.024
 himself-0.023
 late-0.023
 seek-0.022
lot-0.022
Token to
Token ID284
Feature activation-4.51e-02

Pos logit contributions
 course+0.049
ocol+0.048
isher+0.047
ement+0.045
+0.044
Neg logit contributions
 himself-0.025
 seek-0.023
lot-0.021
dy-0.021
 active-0.020
Token pick
Token ID2298
Feature activation-1.01e+00

Pos logit contributions
 course+0.002
isher+0.002
ocol+0.002
ement+0.002
+0.001
Neg logit contributions
 himself-0.001
 seek-0.001
dy-0.001
lot-0.001
 active-0.001
Token them
Token ID606
Feature activation+2.73e-01

Pos logit contributions
 course+0.034
ocol+0.032
isher+0.031
ement+0.029
ir+0.028
Neg logit contributions
 seek-0.032
lot-0.031
 himself-0.031
dy-0.029
 active-0.029
Token off
Token ID572
Feature activation-4.45e-01

Pos logit contributions
lot+0.007
 seek+0.007
 himself+0.007
die+0.007
dy+0.006
Neg logit contributions
 course-0.010
ocol-0.010
isher-0.009
ement-0.009
-0.009
Token the
Token ID262
Feature activation-8.12e-01

Pos logit contributions
 course+0.017
ocol+0.015
isher+0.015
ement+0.014
+0.014
Neg logit contributions
 himself-0.012
lot-0.012
 seek-0.012
dy-0.012
die-0.011
Token bush
Token ID24484
Feature activation-2.99e+00

Pos logit contributions
 course+0.026
ocol+0.023
isher+0.023
+0.022
ement+0.022
Neg logit contributions
lot-0.027
 himself-0.027
 seek-0.025
dy-0.024
die-0.024
Token.
Token ID13
Feature activation+4.79e-02

Pos logit contributions
 course+0.112
ocol+0.106
isher+0.105
ement+0.099
+0.098
Neg logit contributions
 himself-0.082
 seek-0.079
lot-0.076
dy-0.075
die-0.072
Token
Token ID198
Feature activation+1.99e-01

Pos logit contributions
 himself+0.001
 seek+0.001
lot+0.001
dy+0.001
die+0.001
Neg logit contributions
 course-0.002
ocol-0.002
isher-0.002
ement-0.002
-0.002
Token
Token ID198
Feature activation+4.84e-02

Pos logit contributions
 himself+0.004
 seek+0.004
dy+0.004
lot+0.004
 active+0.004
Neg logit contributions
 course-0.009
ement-0.007
ocol-0.007
-0.007
ir-0.007
TokenShe
Token ID3347
Feature activation-4.86e-01

Pos logit contributions
 himself+0.001
lot+0.001
dy+0.001
 seek+0.001
 afloat+0.001
Neg logit contributions
 course-0.002
ocol-0.002
isher-0.002
-0.002
ir-0.002
Token popped
Token ID22928
Feature activation+1.15e-01

Pos logit contributions
 course+0.017
isher+0.015
 ribbon+0.014
ocol+0.014
ir+0.013
Neg logit contributions
 himself-0.015
lot-0.014
 afloat-0.013
 seek-0.013
 late-0.013
Token know
Token ID760
Feature activation-5.44e-01

Pos logit contributions
lot+0.004
dy+0.004
die+0.004
 himself+0.004
 colony+0.004
Neg logit contributions
 course-0.003
ocol-0.003
ir-0.003
isher-0.003
 ribbon-0.003
Token what
Token ID644
Feature activation+8.26e-01

Pos logit contributions
 course+0.018
ocol+0.016
isher+0.016
ement+0.015
+0.014
Neg logit contributions
 himself-0.018
 seek-0.018
lot-0.017
dy-0.017
die-0.016
Token the
Token ID262
Feature activation+6.02e-03

Pos logit contributions
 himself+0.026
 seek+0.025
 afloat+0.025
dy+0.025
lot+0.025
Neg logit contributions
 course-0.028
ocol-0.023
 ribbon-0.021
isher-0.021
-0.021
Token dog
Token ID3290
Feature activation-1.09e+00

Pos logit contributions
 himself+0.000
lot+0.000
 seek+0.000
dy+0.000
die+0.000
Neg logit contributions
 course-0.000
ocol-0.000
isher-0.000
-0.000
ement-0.000
Token was
Token ID373
Feature activation-4.95e-01

Pos logit contributions
 course+0.044
ocol+0.039
isher+0.038
+0.038
ement+0.038
Neg logit contributions
 himself-0.028
 seek-0.027
lot-0.026
dy-0.025
die-0.025
Token doing
Token ID1804
Feature activation+2.89e+00

Pos logit contributions
 course+0.016
ement+0.013
ocol+0.013
our+0.013
ir+0.012
Neg logit contributions
die-0.016
lot-0.015
 seek-0.015
dy-0.015
 himself-0.015
Token there
Token ID612
Feature activation+1.21e+00

Pos logit contributions
lot+0.105
die+0.103
dy+0.101
 seek+0.094
 himself+0.092
Neg logit contributions
 course-0.084
ement-0.071
ocol-0.068
isher-0.067
-0.065
Token but
Token ID475
Feature activation-3.59e-02

Pos logit contributions
lot+0.045
die+0.043
dy+0.043
 seek+0.041
 active+0.041
Neg logit contributions
 course-0.037
ocol-0.028
ement-0.028
-0.027
ir-0.025
Token she
Token ID673
Feature activation+8.07e-01

Pos logit contributions
 course+0.001
ocol+0.001
isher+0.001
ement+0.001
+0.001
Neg logit contributions
lot-0.001
dy-0.001
 himself-0.001
 seek-0.001
die-0.001
Token was
Token ID373
Feature activation-4.53e-02

Pos logit contributions
 himself+0.023
lot+0.023
 seek+0.022
 afloat+0.021
 late+0.021
Neg logit contributions
 course-0.028
isher-0.025
ement-0.024
ocol-0.024
-0.024
Token happy
Token ID3772
Feature activation-5.12e-01

Pos logit contributions
 course+0.001
ocol+0.001
ement+0.001
ir+0.001
+0.001
Neg logit contributions
 seek-0.001
lot-0.001
dy-0.001
die-0.001
 himself-0.001
Token the
Token ID262
Feature activation+1.76e-01

Pos logit contributions
 course+0.014
ocol+0.014
isher+0.013
ement+0.013
 ribbon+0.012
Neg logit contributions
lot-0.013
 seek-0.013
die-0.012
dy-0.012
 active-0.012
Token rock
Token ID3881
Feature activation+3.76e-01

Pos logit contributions
 himself+0.005
lot+0.005
 seek+0.005
dy+0.005
die+0.005
Neg logit contributions
 course-0.006
isher-0.006
ocol-0.006
ement-0.005
-0.005
Token.
Token ID13
Feature activation-9.48e-02

Pos logit contributions
die+0.013
lot+0.012
 seek+0.012
dy+0.012
athan+0.011
Neg logit contributions
 course-0.012
isher-0.010
ement-0.010
ocol-0.009
ir-0.009
Token He
Token ID679
Feature activation+8.69e-02

Pos logit contributions
 course+0.003
+0.003
ement+0.003
 yeah+0.003
 goodness+0.002
Neg logit contributions
 seek-0.003
 afloat-0.003
 active-0.003
edient-0.003
lling-0.003
Token kept
Token ID4030
Feature activation+1.01e+00

Pos logit contributions
 himself+0.003
 seek+0.003
 afloat+0.002
lot+0.002
 late+0.002
Neg logit contributions
 course-0.003
isher-0.003
 ribbon-0.002
ement-0.002
 rib-0.002
Token doing
Token ID1804
Feature activation+3.25e+00

Pos logit contributions
 seek+0.029
dy+0.028
lot+0.027
die+0.027
 himself+0.026
Neg logit contributions
 course-0.037
ocol-0.034
isher-0.033
ement-0.032
-0.031
Token it
Token ID340
Feature activation-5.05e-02

Pos logit contributions
 seek+0.106
lot+0.101
dy+0.100
die+0.099
 himself+0.094
Neg logit contributions
 course-0.106
ocol-0.093
ement-0.091
isher-0.087
 ribbon-0.082
Token until
Token ID1566
Feature activation-1.36e-02

Pos logit contributions
 course+0.002
ement+0.001
isher+0.001
ocol+0.001
+0.001
Neg logit contributions
die-0.002
lot-0.002
dy-0.002
 seek-0.002
athan-0.002
Token the
Token ID262
Feature activation-6.51e-04

Pos logit contributions
 course+0.001
+0.000
ement+0.000
ocol+0.000
isher+0.000
Neg logit contributions
dy-0.000
die-0.000
lot-0.000
 active-0.000
ages-0.000
Token sun
Token ID4252
Feature activation-9.77e-02

Pos logit contributions
 course+0.000
isher+0.000
ocol+0.000
ement+0.000
+0.000
Neg logit contributions
lot-0.000
 himself-0.000
dy-0.000
 seek-0.000
die-0.000
Token went
Token ID1816
Feature activation+9.81e-02

Pos logit contributions
 course+0.003
isher+0.003
ocol+0.003
ement+0.003
+0.003
Neg logit contributions
 himself-0.003
 seek-0.003
lot-0.003
dy-0.003
die-0.003
Token she
Token ID673
Feature activation+3.03e-01

Pos logit contributions
 course+0.015
ocol+0.013
isher+0.013
 ribbon+0.012
ement+0.012
Neg logit contributions
 himself-0.012
lot-0.011
 seek-0.011
dy-0.011
 active-0.011
Token wondered
Token ID14028
Feature activation-2.32e-01

Pos logit contributions
 himself+0.009
lot+0.009
 seek+0.009
 afloat+0.008
 late+0.008
Neg logit contributions
 course-0.010
 ribbon-0.009
ement-0.009
isher-0.009
ir-0.009
Token what
Token ID644
Feature activation+1.24e+00

Pos logit contributions
 course+0.008
ocol+0.008
isher+0.008
+0.007
ement+0.007
Neg logit contributions
 himself-0.007
 seek-0.006
lot-0.006
dy-0.006
 active-0.006
Token it
Token ID340
Feature activation+8.58e-02

Pos logit contributions
 himself+0.042
 seek+0.040
lot+0.040
dy+0.039
die+0.038
Neg logit contributions
 course-0.040
ocol-0.035
isher-0.034
ement-0.033
-0.031
Token was
Token ID373
Feature activation-5.71e-01

Pos logit contributions
 himself+0.002
lot+0.002
 seek+0.002
dy+0.002
die+0.002
Neg logit contributions
 course-0.003
ocol-0.003
isher-0.003
ement-0.003
-0.003
Token doing
Token ID1804
Feature activation+2.96e+00

Pos logit contributions
 course+0.014
ocol+0.012
ement+0.011
ir+0.011
isher+0.011
Neg logit contributions
die-0.021
lot-0.021
 seek-0.021
 himself-0.021
dy-0.021
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
lot+0.107
dy+0.107
die+0.106
 himself+0.100
 seek+0.099
Neg logit contributions
 course-0.084
ement-0.069
ocol-0.069
isher-0.068
ir-0.063
Token She
Token ID1375
Feature activation+3.79e-02

Pos logit contributions
 seek+0.004
 himself+0.004
lot+0.004
 afloat+0.004
dy+0.004
Neg logit contributions
 course-0.004
-0.004
ement-0.004
ocol-0.004
isher-0.004
Token watched
Token ID7342
Feature activation+3.15e-01

Pos logit contributions
 himself+0.001
lot+0.001
 seek+0.001
 afloat+0.001
 active+0.001
Neg logit contributions
 course-0.001
isher-0.001
 ribbon-0.001
ocol-0.001
ement-0.001
Token it
Token ID340
Feature activation+4.92e-03

Pos logit contributions
 himself+0.009
 seek+0.008
lot+0.008
dy+0.007
 active+0.007
Neg logit contributions
 course-0.012
isher-0.011
ocol-0.011
ement-0.011
-0.011
Token fly
Token ID6129
Feature activation-1.72e-01

Pos logit contributions
lot+0.000
 himself+0.000
 seek+0.000
dy+0.000
die+0.000
Neg logit contributions
 course-0.000
ocol-0.000
isher-0.000
ement-0.000
ir-0.000
Token a
Token ID257
Feature activation-7.25e-01

Pos logit contributions
 course+0.003
ocol+0.003
isher+0.003
+0.003
ir+0.003
Neg logit contributions
dy-0.003
 seek-0.003
lot-0.002
die-0.002
 himself-0.002
Token big
Token ID1263
Feature activation+2.72e-01

Pos logit contributions
 course+0.030
ocol+0.027
isher+0.026
+0.025
ir+0.025
Neg logit contributions
 himself-0.019
lot-0.016
 seek-0.016
dy-0.015
athan-0.014
Token jungle
Token ID20712
Feature activation+1.40e-01

Pos logit contributions
 himself+0.006
lot+0.006
 seek+0.006
dy+0.005
die+0.005
Neg logit contributions
 course-0.012
ocol-0.011
isher-0.011
ement-0.010
-0.010
Token and
Token ID290
Feature activation+5.62e-02

Pos logit contributions
 himself+0.004
die+0.004
dy+0.004
lot+0.004
 seek+0.004
Neg logit contributions
 course-0.005
isher-0.004
ocol-0.004
ement-0.004
ir-0.004
Token was
Token ID373
Feature activation+8.38e-01

Pos logit contributions
dy+0.002
 afloat+0.002
lot+0.002
die+0.002
athan+0.002
Neg logit contributions
 course-0.002
 may-0.001
ocol-0.001
 grace-0.001
 rib-0.001
Token always
Token ID1464
Feature activation+3.08e+00

Pos logit contributions
rated+0.028
 seek+0.028
keeping+0.027
dy+0.027
athan+0.026
Neg logit contributions
 course-0.022
ocol-0.017
 grace-0.015
 please-0.015
ir-0.015
Token looking
Token ID2045
Feature activation+8.65e-01

Pos logit contributions
lot+0.084
dy+0.078
 himself+0.076
die+0.075
athan+0.074
Neg logit contributions
 course-0.101
ocol-0.088
isher-0.086
 emanating-0.083
ement-0.078
Token out
Token ID503
Feature activation-1.22e+00

Pos logit contributions
 seek+0.030
 himself+0.030
lot+0.029
dy+0.028
die+0.028
Neg logit contributions
 course-0.026
ocol-0.025
isher-0.023
ement-0.022
ir-0.021
Token for
Token ID329
Feature activation+2.40e-01

Pos logit contributions
 course+0.042
ocol+0.040
isher+0.038
ement+0.037
ir+0.035
Neg logit contributions
 seek-0.037
 himself-0.036
lot-0.036
dy-0.035
die-0.034
Token his
Token ID465
Feature activation-5.64e-02

Pos logit contributions
lot+0.006
dy+0.006
 seek+0.006
die+0.006
 himself+0.006
Neg logit contributions
 course-0.009
ocol-0.009
isher-0.008
ement-0.008
ir-0.007
Token animal
Token ID5044
Feature activation-4.39e-01

Pos logit contributions
 course+0.002
ocol+0.002
isher+0.002
 ribbon+0.002
+0.002
Neg logit contributions
 himself-0.002
lot-0.001
 seek-0.001
die-0.001
dy-0.001
TokenSam
Token ID16305
Feature activation-5.29e-01

Pos logit contributions
 course+0.022
ir+0.017
ocol+0.017
+0.017
isher+0.017
Neg logit contributions
 himself-0.016
lot-0.015
 having-0.015
dy-0.015
 afloat-0.014
Token got
Token ID1392
Feature activation+1.47e+00

Pos logit contributions
 course+0.017
ement+0.013
isher+0.013
ipped+0.012
 ribbon+0.012
Neg logit contributions
 seek-0.018
 himself-0.018
 afloat-0.018
lot-0.017
 active-0.017
Token angry
Token ID7954
Feature activation+1.22e+00

Pos logit contributions
 seek+0.042
lot+0.041
dy+0.037
die+0.036
 himself+0.035
Neg logit contributions
 course-0.053
ocol-0.051
isher-0.051
ir-0.046
ement-0.045
Token and
Token ID290
Feature activation+7.42e-01

Pos logit contributions
 seek+0.037
lot+0.036
die+0.035
dy+0.034
 himself+0.034
Neg logit contributions
 course-0.042
ocol-0.038
ement-0.038
isher-0.037
ir-0.035
Token said
Token ID531
Feature activation+2.02e-01

Pos logit contributions
 seek+0.020
lot+0.019
 himself+0.019
dy+0.018
die+0.018
Neg logit contributions
 course-0.028
isher-0.025
ocol-0.025
ement-0.023
-0.023
Token:
Token ID25
Feature activation+3.27e+00

Pos logit contributions
 himself+0.006
 seek+0.006
lot+0.006
dy+0.006
die+0.006
Neg logit contributions
 course-0.007
ocol-0.007
isher-0.006
ement-0.006
-0.006
Token "
Token ID366
Feature activation+3.50e-01

Pos logit contributions
 seek+0.089
 himself+0.087
 active+0.084
dy+0.083
lot+0.083
Neg logit contributions
 course-0.121
-0.114
ocol-0.112
ement-0.103
ir-0.103
TokenNo
Token ID2949
Feature activation-5.50e-01

Pos logit contributions
lot+0.012
 himself+0.012
 seek+0.011
dy+0.011
die+0.011
Neg logit contributions
 course-0.011
isher-0.010
ocol-0.009
ir-0.009
-0.009
Token,
Token ID11
Feature activation-7.77e-01

Pos logit contributions
 course+0.020
ocol+0.018
isher+0.018
ement+0.017
+0.017
Neg logit contributions
 himself-0.017
 seek-0.016
lot-0.016
dy-0.015
die-0.015
Token Lily
Token ID20037
Feature activation-2.59e-01

Pos logit contributions
 course+0.031
ocol+0.029
isher+0.028
ement+0.026
+0.026
Neg logit contributions
 himself-0.020
lot-0.019
 seek-0.019
dy-0.018
die-0.018
Token.
Token ID13
Feature activation-7.35e-01

Pos logit contributions
 course+0.009
ocol+0.008
isher+0.008
ement+0.007
+0.007
Neg logit contributions
 himself-0.008
 seek-0.008
lot-0.008
dy-0.008
die-0.007
Token has
Token ID468
Feature activation+3.83e-01

Pos logit contributions
 himself+0.018
 seek+0.018
lot+0.017
dy+0.017
die+0.016
Neg logit contributions
 course-0.016
ocol-0.015
isher-0.015
-0.014
ement-0.014
Token a
Token ID257
Feature activation+1.95e-02

Pos logit contributions
 seek+0.010
 himself+0.010
dy+0.010
lot+0.010
 active+0.010
Neg logit contributions
 course-0.014
ocol-0.013
isher-0.013
ement-0.012
 ribbon-0.012
Token name
Token ID1438
Feature activation+1.35e+00

Pos logit contributions
 seek+0.001
 himself+0.001
lot+0.001
dy+0.001
die+0.001
Neg logit contributions
 course-0.001
isher-0.001
ocol-0.001
ement-0.001
-0.001
Token.
Token ID13
Feature activation+4.37e-01

Pos logit contributions
 himself+0.035
 seek+0.032
lot+0.031
dy+0.029
die+0.027
Neg logit contributions
 course-0.053
isher-0.051
ocol-0.051
-0.048
ement-0.047
Token The
Token ID383
Feature activation-1.46e-01

Pos logit contributions
 himself+0.014
 seek+0.014
lot+0.013
dy+0.013
die+0.012
Neg logit contributions
 course-0.014
ocol-0.014
isher-0.013
ement-0.012
-0.012
Token name
Token ID1438
Feature activation+1.55e+00

Pos logit contributions
ocol+0.004
 course+0.004
isher+0.004
+0.004
ir+0.004
Neg logit contributions
 himself-0.005
 seek-0.005
lot-0.005
 active-0.004
dy-0.004
Token is
Token ID318
Feature activation-6.60e-01

Pos logit contributions
 himself+0.046
 seek+0.041
lot+0.040
dy+0.038
 having+0.037
Neg logit contributions
ocol-0.056
isher-0.054
-0.051
 course-0.051
ir-0.048
Token Hal
Token ID11023
Feature activation+9.08e-01

Pos logit contributions
 course+0.020
ocol+0.018
 rib+0.015
ement+0.015
 ribbon+0.015
Neg logit contributions
rated-0.021
 afloat-0.021
 seek-0.021
 active-0.021
 himself-0.020
Tokenley
Token ID1636
Feature activation-2.67e-01

Pos logit contributions
 seek+0.029
 himself+0.029
lot+0.027
dy+0.026
 active+0.026
Neg logit contributions
 course-0.030
ocol-0.028
isher-0.028
ement-0.026
-0.024
Token.
Token ID13
Feature activation+2.15e-01

Pos logit contributions
 course+0.008
ocol+0.007
isher+0.007
ement+0.007
ir+0.006
Neg logit contributions
 seek-0.009
 himself-0.009
lot-0.009
 active-0.009
die-0.009
Token Tom
Token ID4186
Feature activation-1.74e-01

Pos logit contributions
 himself+0.004
 seek+0.004
lot+0.004
dy+0.004
die+0.003
Neg logit contributions
 course-0.010
ocol-0.009
isher-0.009
ement-0.009
-0.009
Token<|endoftext|>
Token ID50256
Feature activation+3.11e-01

Pos logit contributions
 course+0.001
+0.001
ocol+0.001
ement+0.001
ir+0.001
Neg logit contributions
 seek-0.001
 active-0.000
 himself-0.000
lot-0.000
 afloat-0.000
TokenOne
Token ID3198
Feature activation-3.34e-01

Pos logit contributions
 active+0.008
 seek+0.008
dy+0.008
 himself+0.008
lot+0.008
Neg logit contributions
 course-0.012
-0.011
ement-0.010
ocol-0.010
ir-0.010
Token day
Token ID1110
Feature activation+2.07e-01

Pos logit contributions
 course+0.014
ement+0.011
ocol+0.011
ir+0.010
isher+0.010
Neg logit contributions
lot-0.009
 miners-0.008
 himself-0.008
die-0.008
ages-0.008
Token,
Token ID11
Feature activation-1.34e-01

Pos logit contributions
lot+0.008
 himself+0.008
die+0.008
 seek+0.008
 active+0.008
Neg logit contributions
 course-0.005
ocol-0.005
-0.004
ir-0.004
ipped-0.004
Token a
Token ID257
Feature activation-3.55e-01

Pos logit contributions
 course+0.005
ocol+0.004
isher+0.004
+0.004
ir+0.004
Neg logit contributions
 seek-0.004
lot-0.004
 himself-0.004
 active-0.003
dy-0.003
Token boy
Token ID2933
Feature activation+6.59e-01

Pos logit contributions
 course+0.015
ocol+0.015
isher+0.015
ement+0.014
+0.013
Neg logit contributions
 himself-0.008
 seek-0.007
dy-0.007
lot-0.007
 active-0.007
Token named
Token ID3706
Feature activation-4.57e-01

Pos logit contributions
 himself+0.019
 seek+0.019
dy+0.018
lot+0.018
 active+0.017
Neg logit contributions
 course-0.023
ocol-0.023
isher-0.022
ement-0.021
-0.021
Token Tim
Token ID5045
Feature activation+7.67e-01

Pos logit contributions
 course+0.013
ocol+0.012
isher+0.012
ir+0.010
+0.010
Neg logit contributions
 seek-0.016
lot-0.016
 active-0.015
die-0.015
 himself-0.015
Token went
Token ID1816
Feature activation+6.32e-01

Pos logit contributions
 seek+0.022
lot+0.021
 himself+0.020
 active+0.020
die+0.019
Neg logit contributions
 course-0.028
isher-0.024
ir-0.022
ocol-0.022
ipped-0.022
Token to
Token ID284
Feature activation-4.33e-02

Pos logit contributions
die+0.019
 himself+0.019
dy+0.019
 seek+0.019
lot+0.018
Neg logit contributions
 course-0.021
ocol-0.019
isher-0.019
-0.018
ipped-0.017
Token the
Token ID262
Feature activation+3.41e-01

Pos logit contributions
 course+0.002
ocol+0.001
isher+0.001
+0.001
ir+0.001
Neg logit contributions
lot-0.001
dy-0.001
 himself-0.001
 seek-0.001
die-0.001
Token looked
Token ID3114
Feature activation+1.53e-01

Pos logit contributions
 himself+0.010
 seek+0.009
lot+0.009
 active+0.009
 afloat+0.009
Neg logit contributions
 course-0.011
 ribbon-0.009
ir-0.009
ement-0.009
isher-0.009
Token down
Token ID866
Feature activation+1.51e-01

Pos logit contributions
 seek+0.004
 himself+0.004
lot+0.003
dy+0.003
die+0.003
Neg logit contributions
 course-0.006
ocol-0.006
isher-0.006
ement-0.006
ir-0.005
Token and
Token ID290
Feature activation+2.56e-01

Pos logit contributions
 himself+0.004
 seek+0.004
lot+0.003
dy+0.003
die+0.003
Neg logit contributions
 course-0.006
ocol-0.006
isher-0.006
ement-0.005
-0.005
Token saw
Token ID2497
Feature activation-4.13e-01

Pos logit contributions
 himself+0.007
 seek+0.007
lot+0.007
dy+0.006
 active+0.006
Neg logit contributions
 course-0.010
ocol-0.009
isher-0.009
ement-0.008
ir-0.008
Token a
Token ID257
Feature activation+4.02e-01

Pos logit contributions
 course+0.015
ocol+0.013
isher+0.013
ement+0.013
ir+0.012
Neg logit contributions
 seek-0.012
lot-0.012
 active-0.011
die-0.011
dy-0.011
Token brightly
Token ID35254
Feature activation+1.52e+00

Pos logit contributions
 himself+0.010
lot+0.009
 seek+0.009
dy+0.009
 active+0.009
Neg logit contributions
 course-0.016
ocol-0.015
isher-0.014
-0.014
ement-0.014
Token-
Token ID12
Feature activation+1.79e+00

Pos logit contributions
 himself+0.046
lot+0.044
 seek+0.043
dy+0.042
die+0.041
Neg logit contributions
 course-0.054
ocol-0.049
isher-0.047
ement-0.046
ir-0.045
Tokencolored
Token ID25717
Feature activation+2.20e+00

Pos logit contributions
 himself+0.048
 seek+0.045
lot+0.044
dy+0.042
 active+0.042
Neg logit contributions
ocol-0.068
 course-0.067
isher-0.066
-0.065
ement-0.063
Token bird
Token ID6512
Feature activation+2.84e-01

Pos logit contributions
 seek+0.060
 himself+0.059
dy+0.056
lot+0.053
die+0.052
Neg logit contributions
 course-0.080
isher-0.078
ocol-0.077
ement-0.073
-0.071
Token,
Token ID11
Feature activation-2.53e-01

Pos logit contributions
 himself+0.007
 seek+0.007
dy+0.007
lot+0.007
die+0.006
Neg logit contributions
 course-0.011
isher-0.011
ocol-0.011
-0.010
ement-0.010
Token who
Token ID508
Feature activation-1.73e-01

Pos logit contributions
 course+0.009
ocol+0.007
 ribbon+0.007
ement+0.007
+0.007
Neg logit contributions
 seek-0.007
lot-0.007
 active-0.007
 himself-0.007
dy-0.007
Token decided
Token ID3066
Feature activation+9.85e-01

Pos logit contributions
 himself+0.004
 seek+0.004
lot+0.004
dy+0.003
die+0.003
Neg logit contributions
 course-0.008
isher-0.008
ocol-0.008
ement-0.007
-0.007
Token to
Token ID284
Feature activation-8.48e-03

Pos logit contributions
die+0.034
 himself+0.034
dy+0.033
 seek+0.033
lot+0.033
Neg logit contributions
 course-0.027
isher-0.021
 ribbon-0.020
-0.020
 please-0.020
Token paint
Token ID7521
Feature activation+1.08e-01

Pos logit contributions
ocol+0.000
 course+0.000
isher+0.000
ement+0.000
+0.000
Neg logit contributions
 seek-0.000
 himself-0.000
 active-0.000
dy-0.000
 having-0.000
Token a
Token ID257
Feature activation-2.22e-01

Pos logit contributions
 seek+0.003
 himself+0.003
lot+0.003
dy+0.003
die+0.003
Neg logit contributions
 course-0.004
ocol-0.004
isher-0.004
ement-0.004
-0.004
Token big
Token ID1263
Feature activation+7.72e-01

Pos logit contributions
 course+0.008
ocol+0.008
isher+0.008
ement+0.007
+0.007
Neg logit contributions
 himself-0.006
 seek-0.006
lot-0.006
dy-0.006
die-0.006
Token picture
Token ID4286
Feature activation+1.27e+00

Pos logit contributions
 seek+0.021
dy+0.019
 himself+0.019
 active+0.018
 having+0.017
Neg logit contributions
ocol-0.027
isher-0.027
 course-0.026
ement-0.025
-0.025
Token together
Token ID1978
Feature activation-2.95e-01

Pos logit contributions
 himself+0.040
 seek+0.039
lot+0.038
dy+0.038
die+0.037
Neg logit contributions
 course-0.043
ocol-0.039
isher-0.039
ement-0.036
-0.036
Token,
Token ID11
Feature activation-3.63e-01

Pos logit contributions
 course+0.010
ocol+0.009
isher+0.009
ement+0.008
+0.008
Neg logit contributions
 himself-0.010
 seek-0.009
lot-0.009
dy-0.009
die-0.009
Token but
Token ID475
Feature activation-3.12e-01

Pos logit contributions
 course+0.014
ocol+0.013
isher+0.012
+0.012
ement+0.012
Neg logit contributions
 himself-0.009
lot-0.009
dy-0.009
 seek-0.009
die-0.009
Token it
Token ID340
Feature activation-1.23e-01

Pos logit contributions
 course+0.010
isher+0.009
ocol+0.008
+0.008
ement+0.008
Neg logit contributions
lot-0.010
 himself-0.010
dy-0.010
die-0.010
 seek-0.009
Token was
Token ID373
Feature activation+1.28e-01

Pos logit contributions
 course+0.005
isher+0.005
ocol+0.005
ement+0.004
+0.004
Neg logit contributions
 himself-0.003
 seek-0.003
lot-0.003
dy-0.003
die-0.003
Token happen
Token ID1645
Feature activation+9.38e-01

Pos logit contributions
 course+0.037
ocol+0.033
isher+0.032
+0.030
ement+0.030
Neg logit contributions
 himself-0.036
lot-0.036
dy-0.035
 seek-0.035
die-0.034
Token.
Token ID13
Feature activation+1.98e-01

Pos logit contributions
lot+0.030
dy+0.028
 seek+0.028
 himself+0.028
die+0.028
Neg logit contributions
 course-0.031
isher-0.026
ement-0.026
ocol-0.025
-0.024
Token
Token ID220
Feature activation+2.12e-01

Pos logit contributions
 seek+0.005
lot+0.005
 himself+0.005
 active+0.005
dy+0.005
Neg logit contributions
 course-0.007
-0.007
ement-0.007
ocol-0.006
isher-0.006
Token
Token ID198
Feature activation-1.15e-01

Pos logit contributions
 seek+0.006
 active+0.005
 himself+0.005
dy+0.005
lot+0.005
Neg logit contributions
 course-0.008
-0.008
ocol-0.007
ement-0.007
ir-0.007
Token
Token ID198
Feature activation-2.80e-01

Pos logit contributions
 course+0.004
+0.004
ir+0.003
ement+0.003
ocol+0.003
Neg logit contributions
 miners-0.004
 seek-0.003
rated-0.003
 active-0.003
 afloat-0.003
TokenAt
Token ID2953
Feature activation+1.04e+00

Pos logit contributions
 course+0.010
+0.008
ir+0.008
ocol+0.007
 ribbon+0.007
Neg logit contributions
lot-0.008
dy-0.008
 afloat-0.008
 miners-0.008
 himself-0.008
Token last
Token ID938
Feature activation+4.50e-01

Pos logit contributions
lot+0.031
dy+0.030
 active+0.030
 seek+0.029
die+0.029
Neg logit contributions
 course-0.039
isher-0.033
ocol-0.032
-0.032
ir-0.031
Token,
Token ID11
Feature activation+4.15e-01

Pos logit contributions
lot+0.017
dy+0.016
die+0.016
 himself+0.015
 seek+0.014
Neg logit contributions
 course-0.015
ocol-0.013
-0.012
ement-0.012
ir-0.012
Token a
Token ID257
Feature activation-1.39e-01

Pos logit contributions
lot+0.014
dy+0.013
die+0.013
 himself+0.013
 seek+0.012
Neg logit contributions
 course-0.014
-0.012
ocol-0.012
isher-0.011
ement-0.011
Token miracle
Token ID20820
Feature activation-4.42e-01

Pos logit contributions
 course+0.005
ocol+0.005
isher+0.005
+0.005
ement+0.004
Neg logit contributions
 himself-0.004
lot-0.004
dy-0.003
 seek-0.003
die-0.003
Token happened
Token ID3022
Feature activation-4.93e-01

Pos logit contributions
 course+0.014
ocol+0.013
isher+0.012
ement+0.012
+0.011
Neg logit contributions
 himself-0.015
 seek-0.015
lot-0.014
dy-0.014
die-0.014
Token They
Token ID1119
Feature activation-1.15e-01

Pos logit contributions
 course+0.005
+0.005
ement+0.004
ocol+0.004
ir+0.004
Neg logit contributions
 seek-0.004
lot-0.004
 active-0.003
die-0.003
dy-0.003
Token both
Token ID1111
Feature activation-1.28e-01

Pos logit contributions
 course+0.004
ocol+0.004
isher+0.004
ement+0.004
+0.004
Neg logit contributions
 himself-0.003
 seek-0.003
lot-0.003
dy-0.003
die-0.003
Token laughed
Token ID13818
Feature activation-4.50e-02

Pos logit contributions
 course+0.005
isher+0.004
ocol+0.004
ement+0.004
ir+0.004
Neg logit contributions
 himself-0.004
 seek-0.003
lot-0.003
dy-0.003
die-0.003
Token and
Token ID290
Feature activation-1.30e-01

Pos logit contributions
 course+0.002
ocol+0.002
isher+0.002
ement+0.002
+0.002
Neg logit contributions
 himself-0.001
 seek-0.001
lot-0.001
dy-0.001
die-0.001
Token enjoyed
Token ID8359
Feature activation+6.80e-01

Pos logit contributions
 course+0.005
isher+0.004
ocol+0.004
+0.004
ement+0.004
Neg logit contributions
 himself-0.004
lot-0.004
 seek-0.004
dy-0.003
die-0.003
Token their
Token ID511
Feature activation-3.88e-01

Pos logit contributions
 himself+0.016
 seek+0.016
lot+0.016
dy+0.015
die+0.015
Neg logit contributions
 course-0.028
ocol-0.026
isher-0.025
ement-0.024
-0.024
Token day
Token ID1110
Feature activation-2.85e-01

Pos logit contributions
 course+0.015
ocol+0.014
isher+0.014
ement+0.013
+0.013
Neg logit contributions
 himself-0.010
 seek-0.010
lot-0.010
dy-0.009
 active-0.009
Token at
Token ID379
Feature activation+2.30e-01

Pos logit contributions
 course+0.010
ocol+0.010
isher+0.010
ement+0.009
+0.009
Neg logit contributions
 himself-0.008
 seek-0.008
lot-0.008
dy-0.008
die-0.007
Token the
Token ID262
Feature activation-3.83e-02

Pos logit contributions
 himself+0.005
 seek+0.005
lot+0.005
dy+0.005
die+0.005
Neg logit contributions
 course-0.010
ocol-0.009
isher-0.009
ement-0.009
-0.009
Token park
Token ID3952
Feature activation-1.22e+00

Pos logit contributions
 course+0.001
ocol+0.001
isher+0.001
ement+0.001
+0.001
Neg logit contributions
 himself-0.001
 seek-0.001
lot-0.001
dy-0.001
die-0.001
Token.
Token ID13
Feature activation+9.85e-02

Pos logit contributions
 course+0.037
ement+0.031
isher+0.030
ocol+0.030
+0.029
Neg logit contributions
die-0.042
 himself-0.041
 seek-0.041
lot-0.041
dy-0.041
TokenFinally
Token ID11158
Feature activation+2.74e-01

Pos logit contributions
 course+0.001
+0.001
ir+0.001
ocol+0.001
isher+0.001
Neg logit contributions
dy-0.001
 seek-0.001
 afloat-0.001
die-0.001
 himself-0.001
Token,
Token ID11
Feature activation+8.85e-01

Pos logit contributions
lot+0.009
 seek+0.009
 himself+0.009
dy+0.009
die+0.009
Neg logit contributions
 course-0.009
ocol-0.008
isher-0.007
ement-0.007
-0.007
Token he
Token ID339
Feature activation+7.55e-01

Pos logit contributions
dy+0.026
die+0.024
 seek+0.024
lot+0.023
 active+0.023
Neg logit contributions
 course-0.033
-0.030
ocol-0.028
isher-0.027
ement-0.026
Token found
Token ID1043
Feature activation-1.21e+00

Pos logit contributions
 himself+0.024
 active+0.022
lot+0.021
 seek+0.021
dy+0.020
Neg logit contributions
 course-0.027
ir-0.021
-0.021
isher-0.021
 ribbon-0.021
Token the
Token ID262
Feature activation-8.62e-02

Pos logit contributions
 course+0.049
ocol+0.045
isher+0.045
ement+0.042
+0.041
Neg logit contributions
 seek-0.030
 himself-0.030
lot-0.029
dy-0.028
 active-0.027
Token squirrel
Token ID33039
Feature activation-8.08e-01

Pos logit contributions
 course+0.003
ocol+0.003
isher+0.003
ement+0.003
+0.003
Neg logit contributions
 himself-0.003
 seek-0.003
lot-0.002
dy-0.002
die-0.002
Token and
Token ID290
Feature activation-2.20e-01

Pos logit contributions
 course+0.033
isher+0.028
ocol+0.028
+0.027
ement+0.027
Neg logit contributions
 himself-0.020
 seek-0.020
lot-0.019
die-0.018
 active-0.018
Token told
Token ID1297
Feature activation+3.16e-01

Pos logit contributions
 course+0.008
isher+0.006
 ribbon+0.006
 rib+0.006
ocol+0.006
Neg logit contributions
lot-0.006
 active-0.006
dy-0.006
 himself-0.006
die-0.006
Token him
Token ID683
Feature activation+5.80e-01

Pos logit contributions
 seek+0.009
lot+0.008
dy+0.008
die+0.008
 himself+0.008
Neg logit contributions
 course-0.012
ocol-0.011
isher-0.011
ement-0.010
ir-0.010
Token about
Token ID546
Feature activation+1.89e+00

Pos logit contributions
 seek+0.021
die+0.019
lot+0.019
 afloat+0.019
 active+0.019
Neg logit contributions
 course-0.016
ocol-0.014
ement-0.013
isher-0.013
ir-0.012
Token the
Token ID262
Feature activation-2.93e-02

Pos logit contributions
 seek+0.046
lot+0.044
 himself+0.043
dy+0.043
 active+0.041
Neg logit contributions
 course-0.078
ocol-0.072
isher-0.069
ement-0.066
ir-0.064
Token is
Token ID318
Feature activation+3.75e-01

Pos logit contributions
 himself+0.020
 seek+0.019
lot+0.019
dy+0.019
 active+0.018
Neg logit contributions
 course-0.022
ocol-0.020
isher-0.020
ement-0.019
-0.018
Token blue
Token ID4171
Feature activation-1.01e-02

Pos logit contributions
lot+0.011
die+0.011
dy+0.011
 seek+0.010
rated+0.010
Neg logit contributions
 course-0.011
ocol-0.011
 icing-0.010
isher-0.010
ement-0.010
Token and
Token ID290
Feature activation+7.90e-01

Pos logit contributions
 course+0.000
ocol+0.000
isher+0.000
ement+0.000
+0.000
Neg logit contributions
 himself-0.000
 seek-0.000
lot-0.000
dy-0.000
die-0.000
Token has
Token ID468
Feature activation+9.59e-01

Pos logit contributions
 himself+0.022
lot+0.022
 seek+0.022
dy+0.020
die+0.020
Neg logit contributions
 course-0.028
ocol-0.026
isher-0.026
ement-0.024
-0.023
Token stars
Token ID5788
Feature activation+8.16e-01

Pos logit contributions
 seek+0.027
lot+0.026
 himself+0.025
dy+0.025
 active+0.025
Neg logit contributions
 course-0.035
ocol-0.032
isher-0.031
 ribbon-0.029
ement-0.029
Token on
Token ID319
Feature activation+1.03e-01

Pos logit contributions
 himself+0.020
 seek+0.020
lot+0.019
dy+0.019
die+0.018
Neg logit contributions
 course-0.033
ocol-0.031
isher-0.030
ement-0.029
-0.028
Token it
Token ID340
Feature activation-2.10e-01

Pos logit contributions
 himself+0.003
 seek+0.003
lot+0.003
dy+0.003
die+0.003
Neg logit contributions
 course-0.004
isher-0.004
ocol-0.004
ement-0.003
-0.003
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
 course+0.006
ocol+0.006
isher+0.005
ement+0.005
+0.005
Neg logit contributions
 seek-0.007
lot-0.007
dy-0.007
die-0.007
 himself-0.007
Token He
Token ID679
Feature activation-4.64e-02

Pos logit contributions
 seek+0.005
 afloat+0.004
lot+0.004
 active+0.004
rated+0.004
Neg logit contributions
 course-0.006
-0.006
ocol-0.006
ement-0.005
isher-0.005
Token likes
Token ID7832
Feature activation-6.10e-01

Pos logit contributions
 course+0.002
ocol+0.001
isher+0.001
ement+0.001
+0.001
Neg logit contributions
 himself-0.001
 seek-0.001
lot-0.001
dy-0.001
 active-0.001
Token it
Token ID340
Feature activation-4.75e-02

Pos logit contributions
 course+0.022
ocol+0.020
isher+0.019
ir+0.018
 ribbon+0.018
Neg logit contributions
 seek-0.018
lot-0.018
 himself-0.017
dy-0.017
die-0.016
Token.
Token ID13
Feature activation+3.01e-02

Pos logit contributions
lot+0.032
 seek+0.031
 himself+0.030
dy+0.029
die+0.029
Neg logit contributions
 course-0.032
ocol-0.029
isher-0.028
ir-0.026
ement-0.026
Token It
Token ID632
Feature activation-7.93e-01

Pos logit contributions
 himself+0.001
 seek+0.001
lot+0.001
dy+0.001
die+0.001
Neg logit contributions
 course-0.001
ocol-0.001
isher-0.001
-0.001
ement-0.001
Tokenâ
Token ID22940
Feature activation+5.40e-02

Pos logit contributions
 course+0.027
isher+0.022
ement+0.022
ocol+0.022
+0.021
Neg logit contributions
 himself-0.026
 seek-0.024
lot-0.024
dy-0.022
 active-0.022
Token
Token ID26391
Feature activation+1.51e+00

Pos logit contributions
 himself+0.001
 seek+0.001
lot+0.001
 active+0.001
dy+0.001
Neg logit contributions
ocol-0.002
 course-0.002
isher-0.002
ement-0.002
ir-0.002
Token
Token ID8151
Feature activation-1.90e+00

Pos logit contributions
 himself+0.044
 seek+0.041
lot+0.040
dy+0.040
die+0.039
Neg logit contributions
 course-0.055
ocol-0.051
isher-0.049
-0.048
ement-0.047
Tokens
Token ID82
Feature activation-1.19e+00

Pos logit contributions
 course+0.060
ocol+0.053
isher+0.052
ement+0.051
+0.051
Neg logit contributions
 himself-0.065
 seek-0.064
lot-0.062
 active-0.060
dy-0.060
Token so
Token ID523
Feature activation-3.49e-01

Pos logit contributions
 course+0.042
ocol+0.033
ement+0.033
ir+0.032
isher+0.031
Neg logit contributions
lot-0.036
 himself-0.035
 seek-0.035
dy-0.033
die-0.033
Token much
Token ID881
Feature activation-1.19e+00

Pos logit contributions
 course+0.013
ocol+0.012
isher+0.012
ement+0.011
+0.011
Neg logit contributions
 himself-0.010
lot-0.009
 seek-0.009
dy-0.009
die-0.009
Token fun
Token ID1257
Feature activation-6.92e-01

Pos logit contributions
 course+0.043
ocol+0.040
isher+0.039
ement+0.037
+0.036
Neg logit contributions
 himself-0.035
 seek-0.033
lot-0.033
dy-0.033
die-0.031
Token!
Token ID0
Feature activation-1.73e-01

Pos logit contributions
 course+0.021
isher+0.017
ocol+0.017
ement+0.016
+0.015
Neg logit contributions
 himself-0.025
lot-0.025
dy-0.023
die-0.023
 seek-0.023
Tokenâ
Token ID22940
Feature activation+1.24e-01

Pos logit contributions
 course+0.006
ocol+0.006
isher+0.006
+0.005
ement+0.005
Neg logit contributions
 himself-0.005
 seek-0.005
lot-0.005
dy-0.005
die-0.004
Token hit
Token ID2277
Feature activation+8.16e-02

Pos logit contributions
 course+0.017
ocol+0.014
isher+0.014
 ribbon+0.013
 rib+0.013
Neg logit contributions
dy-0.018
lot-0.017
die-0.017
 himself-0.017
 active-0.017
Token the
Token ID262
Feature activation+4.88e-01

Pos logit contributions
 seek+0.003
lot+0.003
dy+0.003
 himself+0.003
 active+0.003
Neg logit contributions
 course-0.002
ocol-0.002
isher-0.002
ement-0.002
ir-0.002
Token bad
Token ID2089
Feature activation+1.02e+00

Pos logit contributions
lot+0.015
 himself+0.015
dy+0.014
die+0.014
 seek+0.014
Neg logit contributions
 course-0.017
isher-0.014
ocol-0.014
 ribbon-0.013
ement-0.012
Token guys
Token ID3730
Feature activation+1.51e+00

Pos logit contributions
 himself+0.031
lot+0.030
 seek+0.030
die+0.029
 active+0.028
Neg logit contributions
 course-0.035
isher-0.032
ocol-0.031
ement-0.027
 ribbon-0.027
Token."
Token ID526
Feature activation-3.33e-01

Pos logit contributions
lot+0.047
die+0.047
 himself+0.047
dy+0.047
 seek+0.046
Neg logit contributions
 course-0.050
isher-0.044
ocol-0.043
ir-0.041
ement-0.040
Token She
Token ID1375
Feature activation+8.21e-02

Pos logit contributions
 course+0.012
+0.011
ement+0.010
ocol+0.010
ir+0.010
Neg logit contributions
 seek-0.010
 active-0.009
lot-0.009
 afloat-0.009
dy-0.009
Token showed
Token ID3751
Feature activation+6.41e-01

Pos logit contributions
 himself+0.002
 seek+0.002
lot+0.002
dy+0.002
die+0.002
Neg logit contributions
 course-0.003
ocol-0.003
isher-0.003
ement-0.003
-0.003
Token Max
Token ID5436
Feature activation+7.45e-01

Pos logit contributions
 himself+0.019
lot+0.015
dy+0.015
die+0.014
 his+0.014
Neg logit contributions
-0.024
ocol-0.024
 course-0.023
isher-0.023
ement-0.022
Token how
Token ID703
Feature activation+1.85e-02

Pos logit contributions
 seek+0.027
lot+0.026
 active+0.025
die+0.024
rated+0.024
Neg logit contributions
 course-0.022
isher-0.018
ocol-0.017
ement-0.016
ir-0.016
Token she
Token ID673
Feature activation+4.74e-01

Pos logit contributions
 himself+0.000
 seek+0.000
lot+0.000
dy+0.000
die+0.000
Neg logit contributions
 course-0.001
ocol-0.001
isher-0.001
ement-0.001
ir-0.001
Token could
Token ID714
Feature activation-1.52e-01

Pos logit contributions
 himself+0.011
 seek+0.011
lot+0.010
dy+0.010
 active+0.009
Neg logit contributions
 course-0.020
ocol-0.020
isher-0.019
ement-0.018
-0.018
Token whipped
Token ID30712
Feature activation+9.11e-01

Pos logit contributions
 course+0.005
ocol+0.005
isher+0.004
 ribbon+0.004
 icing+0.004
Neg logit contributions
lot-0.006
 seek-0.006
 himself-0.006
dy-0.006
die-0.006
Token cream
Token ID8566
Feature activation+2.01e-01

Pos logit contributions
 seek+0.031
 himself+0.031
lot+0.030
dy+0.029
die+0.029
Neg logit contributions
 course-0.028
ocol-0.026
isher-0.026
ement-0.024
-0.023
Token.
Token ID13
Feature activation+3.24e-01

Pos logit contributions
 himself+0.006
 seek+0.006
lot+0.006
dy+0.006
die+0.006
Neg logit contributions
 course-0.007
ocol-0.006
isher-0.006
ement-0.006
-0.006
Token
Token ID198
Feature activation+4.67e-02

Pos logit contributions
 seek+0.009
 himself+0.008
lot+0.008
dy+0.008
 active+0.008
Neg logit contributions
 course-0.012
-0.012
ocol-0.012
ement-0.011
isher-0.011
Token
Token ID198
Feature activation-6.14e-02

Pos logit contributions
 seek+0.001
 active+0.001
lot+0.001
rated+0.001
die+0.001
Neg logit contributions
 course-0.002
-0.001
ir-0.001
ocol-0.001
ement-0.001
TokenS
Token ID50
Feature activation-1.64e+00

Pos logit contributions
 course+0.002
ocol+0.002
isher+0.002
+0.002
ir+0.002
Neg logit contributions
lot-0.002
 himself-0.002
dy-0.002
die-0.002
 seek-0.002
Tokenara
Token ID3301
Feature activation-2.34e-01

Pos logit contributions
isher+0.046
 course+0.045
ocol+0.043
ternally+0.039
 emanating+0.038
Neg logit contributions
lot-0.055
 himself-0.055
dy-0.052
 seek-0.048
 early-0.048
Token and
Token ID290
Feature activation+2.21e-01

Pos logit contributions
 course+0.007
isher+0.006
ement+0.006
ocol+0.006
ipped+0.006
Neg logit contributions
 seek-0.008
 himself-0.008
 afloat-0.008
lot-0.008
 active-0.007
Token Tom
Token ID4186
Feature activation-4.08e-01

Pos logit contributions
 himself+0.006
 seek+0.006
lot+0.005
dy+0.005
die+0.005
Neg logit contributions
 course-0.009
ocol-0.008
isher-0.008
ement-0.008
-0.007
Token want
Token ID765
Feature activation-3.55e-01

Pos logit contributions
 course+0.011
ocol+0.010
isher+0.010
+0.009
ement+0.009
Neg logit contributions
 himself-0.015
 seek-0.015
lot-0.015
dy-0.015
die-0.014
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
 course+0.012
ocol+0.011
isher+0.011
+0.010
ement+0.010
Neg logit contributions
 seek-0.011
lot-0.010
 himself-0.010
dy-0.010
 active-0.010
Token'm
Token ID1101
Feature activation+2.41e-01

Pos logit contributions
 course+0.011
+0.010
isher+0.010
ocol+0.010
ement+0.009
Neg logit contributions
 himself-0.012
lot-0.012
 seek-0.011
dy-0.011
die-0.011
Token proud
Token ID6613
Feature activation-4.45e-01

Pos logit contributions
 himself+0.006
 seek+0.005
lot+0.005
dy+0.005
die+0.005
Neg logit contributions
 course-0.010
ocol-0.010
isher-0.010
ement-0.009
-0.009
Token of
Token ID286
Feature activation+3.23e-01

Pos logit contributions
 course+0.013
ocol+0.012
isher+0.012
ement+0.011
+0.011
Neg logit contributions
 himself-0.016
 seek-0.015
lot-0.015
dy-0.015
die-0.014
Token you
Token ID345
Feature activation-4.71e-01

Pos logit contributions
 himself+0.013
 seek+0.012
dy+0.011
lot+0.011
die+0.011
Neg logit contributions
isher-0.008
 course-0.008
ocol-0.008
ement-0.007
-0.007
Token and
Token ID290
Feature activation-2.27e-01

Pos logit contributions
 course+0.013
ocol+0.010
ement+0.010
isher+0.010
ir+0.010
Neg logit contributions
lot-0.017
 seek-0.017
dy-0.016
die-0.016
 himself-0.016
Token Spot
Token ID15899
Feature activation-1.76e+00

Pos logit contributions
 course+0.007
isher+0.006
ocol+0.006
+0.005
 yeah+0.005
Neg logit contributions
lot-0.007
 himself-0.007
dy-0.007
 afloat-0.007
 seek-0.007
Token.
Token ID13
Feature activation+2.74e-02

Pos logit contributions
 course+0.044
isher+0.034
ement+0.032
ocol+0.031
+0.029
Neg logit contributions
 seek-0.069
lot-0.067
 himself-0.065
die-0.064
Back-0.063
Token Now
Token ID2735
Feature activation-1.91e-01

Pos logit contributions
 seek+0.001
 himself+0.001
lot+0.001
dy+0.001
 active+0.001
Neg logit contributions
 course-0.001
ocol-0.001
isher-0.001
-0.001
ement-0.001
Token,
Token ID11
Feature activation+6.38e-01

Pos logit contributions
 course+0.006
ocol+0.005
isher+0.005
+0.004
ement+0.004
Neg logit contributions
lot-0.007
 himself-0.006
 seek-0.006
 active-0.006
dy-0.006
Token let
Token ID1309
Feature activation+3.84e-01

Pos logit contributions
lot+0.021
 himself+0.020
 seek+0.020
dy+0.020
die+0.020
Neg logit contributions
 course-0.020
ocol-0.018
isher-0.017
 ribbon-0.015
 please-0.015
Token's
Token ID338
Feature activation+4.11e-01

Pos logit contributions
 seek+0.010
lot+0.009
 himself+0.009
 active+0.008
dy+0.008
Neg logit contributions
 course-0.016
ocol-0.014
isher-0.014
ement-0.013
ir-0.013
Token.
Token ID13
Feature activation+2.48e-01

Pos logit contributions
 course+0.019
ement+0.016
+0.015
ocol+0.015
isher+0.014
Neg logit contributions
die-0.032
 seek-0.032
lot-0.032
dy-0.032
 active-0.030
Token She
Token ID1375
Feature activation-1.69e-01

Pos logit contributions
 seek+0.007
 himself+0.007
dy+0.006
lot+0.006
 active+0.006
Neg logit contributions
 course-0.009
-0.009
ocol-0.009
ement-0.008
isher-0.008
Token hugs
Token ID40657
Feature activation-8.09e-01

Pos logit contributions
 course+0.006
ocol+0.005
isher+0.005
+0.005
ement+0.005
Neg logit contributions
 himself-0.005
dy-0.005
 seek-0.005
lot-0.005
die-0.005
Token him
Token ID683
Feature activation-1.86e-01

Pos logit contributions
 course+0.029
ocol+0.028
isher+0.028
ement+0.026
+0.026
Neg logit contributions
 himself-0.025
 seek-0.022
lot-0.022
dy-0.021
die-0.020
Token and
Token ID290
Feature activation+4.54e-02

Pos logit contributions
 course+0.008
ocol+0.008
isher+0.008
ement+0.007
+0.007
Neg logit contributions
 himself-0.004
 seek-0.004
lot-0.004
dy-0.004
die-0.004
Token wipes
Token ID46142
Feature activation-1.67e+00

Pos logit contributions
 seek+0.002
 himself+0.002
lot+0.002
dy+0.002
 active+0.002
Neg logit contributions
 course-0.001
ocol-0.001
isher-0.001
ement-0.001
-0.001
Token his
Token ID465
Feature activation+2.11e-01

Pos logit contributions
 course+0.075
ocol+0.071
isher+0.070
ement+0.068
ir+0.065
Neg logit contributions
 seek-0.035
lot-0.032
 active-0.029
 himself-0.028
dy-0.028
Token face
Token ID1986
Feature activation+1.10e+00

Pos logit contributions
 himself+0.006
 seek+0.006
lot+0.006
dy+0.006
die+0.005
Neg logit contributions
 course-0.008
ocol-0.007
isher-0.007
ement-0.007
-0.007
Token with
Token ID351
Feature activation+2.37e-01

Pos logit contributions
 seek+0.034
 himself+0.034
lot+0.034
dy+0.033
die+0.032
Neg logit contributions
 course-0.036
ocol-0.033
isher-0.033
ement-0.031
-0.030
Token a
Token ID257
Feature activation-1.39e-01

Pos logit contributions
 himself+0.005
 his+0.003
 seek+0.003
 having+0.002
dy+0.002
Neg logit contributions
isher-0.011
 course-0.011
ement-0.011
ocol-0.011
-0.011
Token towel
Token ID24808
Feature activation-7.15e-01

Pos logit contributions
 course+0.005
isher+0.005
ocol+0.005
ement+0.004
+0.004
Neg logit contributions
 himself-0.004
 seek-0.004
dy-0.004
lot-0.004
die-0.004
Token They
Token ID1119
Feature activation-9.30e-01

Pos logit contributions
 seek+0.011
lot+0.011
dy+0.011
die+0.011
 himself+0.011
Neg logit contributions
 course-0.011
-0.011
ocol-0.011
isher-0.010
ement-0.010
Token take
Token ID1011
Feature activation+3.22e-01

Pos logit contributions
 course+0.032
ocol+0.031
isher+0.031
ement+0.029
+0.028
Neg logit contributions
 himself-0.028
 seek-0.027
lot-0.025
dy-0.025
 active-0.024
Token their
Token ID511
Feature activation-8.26e-01

Pos logit contributions
 himself+0.010
 seek+0.010
lot+0.010
dy+0.009
die+0.009
Neg logit contributions
 course-0.011
ocol-0.010
isher-0.010
ement-0.009
ir-0.009
Token toys
Token ID14958
Feature activation-4.25e-01

Pos logit contributions
 course+0.030
ocol+0.028
isher+0.028
ement+0.026
+0.026
Neg logit contributions
 himself-0.024
 seek-0.023
lot-0.023
dy-0.022
die-0.021
Token and
Token ID290
Feature activation+1.22e-01

Pos logit contributions
 course+0.019
ocol+0.018
isher+0.018
ement+0.017
+0.017
Neg logit contributions
 himself-0.009
 seek-0.008
lot-0.008
dy-0.007
 active-0.007
Token books
Token ID3835
Feature activation-1.64e+00

Pos logit contributions
lot+0.004
 himself+0.004
dy+0.004
die+0.004
 seek+0.004
Neg logit contributions
 course-0.004
ocol-0.003
isher-0.003
 ribbon-0.003
 rib-0.003
Token and
Token ID290
Feature activation-1.98e-01

Pos logit contributions
ocol+0.073
 course+0.071
isher+0.070
ement+0.065
+0.064
Neg logit contributions
 himself-0.037
 seek-0.031
 late-0.030
lot-0.028
 having-0.026
Token TV
Token ID3195
Feature activation+1.78e-01

Pos logit contributions
 course+0.006
ocol+0.005
isher+0.005
 ribbon+0.005
 rib+0.004
Neg logit contributions
lot-0.007
dy-0.007
 himself-0.007
die-0.007
 seek-0.006
Token and
Token ID290
Feature activation-1.32e-01

Pos logit contributions
 himself+0.005
 seek+0.005
lot+0.005
dy+0.005
 active+0.005
Neg logit contributions
 course-0.006
ocol-0.006
isher-0.006
ement-0.006
-0.005
Token fun
Token ID1257
Feature activation-7.06e-01

Pos logit contributions
 course+0.004
ocol+0.004
isher+0.003
 ribbon+0.003
 rib+0.003
Neg logit contributions
lot-0.005
dy-0.005
die-0.004
 himself-0.004
 seek-0.004
Token.
Token ID13
Feature activation+5.26e-01

Pos logit contributions
 course+0.019
ocol+0.018
isher+0.017
ement+0.015
+0.014
Neg logit contributions
lot-0.027
die-0.026
dy-0.025
 seek-0.025
 himself-0.025
Token,
Token ID11
Feature activation-7.28e-02

Pos logit contributions
 himself+0.003
lot+0.003
 seek+0.003
die+0.003
dy+0.003
Neg logit contributions
 course-0.004
isher-0.003
ocol-0.003
ement-0.003
-0.003
Token sometimes
Token ID3360
Feature activation+1.18e+00

Pos logit contributions
 course+0.003
ocol+0.002
isher+0.002
+0.002
ement+0.002
Neg logit contributions
lot-0.002
 seek-0.002
die-0.002
dy-0.002
 himself-0.002
Token they
Token ID484
Feature activation+3.33e-01

Pos logit contributions
 himself+0.042
lot+0.041
dy+0.039
die+0.039
 seek+0.038
Neg logit contributions
 course-0.036
 ribbon-0.027
isher-0.026
ocol-0.026
ement-0.025
Token kicked
Token ID12165
Feature activation-9.76e-01

Pos logit contributions
 himself+0.010
 seek+0.010
lot+0.010
dy+0.010
die+0.009
Neg logit contributions
 course-0.012
isher-0.010
ocol-0.010
-0.010
ir-0.010
Token too
Token ID1165
Feature activation-2.37e-01

Pos logit contributions
 course+0.037
ocol+0.032
isher+0.031
ement+0.030
ir+0.029
Neg logit contributions
 seek-0.028
lot-0.028
 himself-0.027
dy-0.025
die-0.025
Token hard
Token ID1327
Feature activation-1.67e+00

Pos logit contributions
 course+0.010
ocol+0.009
isher+0.009
ement+0.009
+0.009
Neg logit contributions
 himself-0.006
 seek-0.005
lot-0.005
dy-0.005
 active-0.005
Token or
Token ID393
Feature activation+9.00e-01

Pos logit contributions
 course+0.052
ocol+0.046
isher+0.045
ement+0.043
ir+0.042
Neg logit contributions
 seek-0.057
lot-0.055
 himself-0.054
die-0.054
dy-0.054
Token too
Token ID1165
Feature activation-1.39e-01

Pos logit contributions
lot+0.026
 himself+0.025
dy+0.025
 seek+0.025
die+0.025
Neg logit contributions
 course-0.031
ocol-0.027
isher-0.026
-0.025
 ribbon-0.025
Token soft
Token ID2705
Feature activation-1.81e+00

Pos logit contributions
 course+0.005
ocol+0.005
isher+0.005
+0.005
ement+0.005
Neg logit contributions
 himself-0.004
 seek-0.003
lot-0.003
dy-0.003
die-0.003
Token,
Token ID11
Feature activation-1.89e-01

Pos logit contributions
 course+0.058
ocol+0.053
isher+0.053
ement+0.050
+0.049
Neg logit contributions
 seek-0.059
 himself-0.059
lot-0.057
dy-0.056
die-0.055
Token sometimes
Token ID3360
Feature activation+1.28e+00

Pos logit contributions
 course+0.007
ocol+0.006
isher+0.006
+0.006
ement+0.006
Neg logit contributions
lot-0.005
 seek-0.005
 himself-0.005
die-0.005
dy-0.005