TOP ACTIVATIONS
MAX = 3.525

temper and often threw temper tant rums . One day ,
to cry and threw a tant rum . She didn 't
sad . She threw a tant rum and cried loudly .
and so she threw a tant rum when her parents said
kept crying and throwing a tant rum . Finally
his feet and threw a tant rum . "
and started to throw a tant rum . Suddenly , she
her feet and threw a tant rum . She grabbed the
them and never threw a tant rum again . <|endoftext|> Once
way forward . She stayed foc ussed . On the third
She cried and threw a tant rum . Her mom tried
. He kept throwing a tant rum . His mom was
stubborn . She throws a tant rum . She kicks and
started walking very slowly as tort o ises do , but
surprise . He threw a tant rum . He kicked and
found it !" Charl otte and her mom brought
the same urge to explore Ur ge Land everyday , and
listen . She threw a tant rum in the store .
the shed . Charl otte ran to the shed
and the harsh queen was overth rown . The

MIN ACTIVATIONS
MIN = -2.069

One , two , three , go !" They
. " One , two , three , go !"
. " One , two , three , four , five
. " One , two , three !" They
One , two , three , four , five ..." She
. " One , two , three , four ," she
One , two , three , four , five , six
. " One , two , three !" she said .
" One , two , three , four , five
. " One , two , three ..." L
One , two , three , four ," she counts .
to three . " One , two , three !"
One , two , three , four , five , six
her cone . " One , two , three !" she
One , two , three , four ," she said .
" One , two , three , four , five
to count . " One , two , three , four
. " One , two , three , four ," Ben
. " One , two , three , four ," she
One , two , three , four , five , six

INTERVAL 2.126 - 3.525
CONTAINS 0.011%

noticed something expensive shining and gl int ing in the sun
friend , a dog called Tri x . Mill ie wanted
in the fence . He wh ines and w ags his
The white cow che wed and che wed on
dad and felt a bit bra ver . She held his

INTERVAL 0.728 - 2.126
CONTAINS 2.461%

a T - rex and pret ends to chase his toys
looking . He sc ur ried to the back
the dump ster . She thinks maybe Tom is inside .
it was especially mild so O ll ie decided he wanted
to find , but the entrance was deep in the forest

INTERVAL -0.671 - 0.728
CONTAINS 94.061%

She lly took out some paints and started painting .
and climbed down the drain . He lifted Sara and her
ready to go ! One day , the hipp
each other . <|endoftext|> L ily and Ben love to play
. She sees Tom and Amy fighting . She sees the

INTERVAL -2.069 - -0.671
CONTAINS 3.466%

fast . So it ignored the sign and kept going fast
not like children . He is a guard dog . You
The cat laughed and said , " You 'll never get
put it down and said ,
One day , a gr umpy old man named Mr .
Token temper
Token ID4124
Feature activation+3.16e-01

Pos logit contributions
rily+0.004
ittered+0.004
immer+0.003
inations+0.003
leness+0.003
Neg logit contributions
 a-0.003
 two-0.003
-0.003
 the-0.003
 big-0.002
Token and
Token ID290
Feature activation-3.66e-02

Pos logit contributions
ittered+0.034
rily+0.034
immer+0.034
lights+0.033
inations+0.033
Neg logit contributions
 a-0.007
 two-0.006
 big-0.005
 the-0.005
-0.005
Token often
Token ID1690
Feature activation+6.55e-01

Pos logit contributions
 a+0.001
 two+0.001
 big+0.001
+0.001
 and+0.001
Neg logit contributions
ittered-0.003
inations-0.003
rily-0.003
lights-0.003
leness-0.003
Token threw
Token ID9617
Feature activation-7.59e-01

Pos logit contributions
ittered+0.055
rily+0.054
immer+0.054
inations+0.053
leness+0.052
Neg logit contributions
 a-0.030
 two-0.028
 big-0.028
-0.027
 the-0.027
Token temper
Token ID4124
Feature activation-1.89e-01

Pos logit contributions
 two+0.024
+0.023
 a+0.023
 big+0.022
 next+0.021
Neg logit contributions
rily-0.068
ittered-0.066
keeping-0.065
immer-0.065
lights-0.064
Token tant
Token ID24246
Feature activation+3.52e+00

Pos logit contributions
 a+0.010
 two+0.010
 big+0.010
+0.010
 the+0.010
Neg logit contributions
rily-0.014
ittered-0.014
immer-0.013
lights-0.013
leness-0.013
Tokenrums
Token ID45241
Feature activation+2.88e-01

Pos logit contributions
rily+0.109
immer+0.107
lights+0.102
acket+0.101
vation+0.099
Neg logit contributions
 a-0.343
 and-0.330
 two-0.329
 the-0.328
.-0.322
Token.
Token ID13
Feature activation+1.33e-01

Pos logit contributions
rily+0.030
ittered+0.030
immer+0.029
arians+0.029
lights+0.029
Neg logit contributions
 a-0.007
 big-0.007
 two-0.007
 the-0.006
-0.006
Token One
Token ID1881
Feature activation-2.82e-01

Pos logit contributions
rily+0.011
immer+0.011
ittered+0.011
lights+0.010
istent+0.010
Neg logit contributions
 a-0.006
 big-0.006
 two-0.006
 the-0.005
 and-0.005
Token day
Token ID1110
Feature activation+2.82e-01

Pos logit contributions
 a+0.011
 two+0.010
+0.010
 big+0.009
 the+0.009
Neg logit contributions
immer-0.023
rated-0.023
lights-0.023
inations-0.023
ittered-0.023
Token,
Token ID11
Feature activation+2.83e-01

Pos logit contributions
ittered+0.029
rily+0.028
immer+0.028
 avail+0.028
inations+0.028
Neg logit contributions
 big-0.006
 a-0.006
 two-0.006
-0.005
 and-0.005
Token to
Token ID284
Feature activation+7.09e-01

Pos logit contributions
ittered+0.010
epad+0.010
rily+0.010
ge+0.010
keeping+0.010
Neg logit contributions
 two-0.002
 a-0.002
 big-0.002
-0.002
 the-0.001
Token cry
Token ID3960
Feature activation-5.98e-01

Pos logit contributions
immer+0.047
ittered+0.047
rily+0.046
 avail+0.045
inations+0.045
Neg logit contributions
 a-0.043
 two-0.042
 big-0.041
-0.040
 the-0.040
Token and
Token ID290
Feature activation+4.10e-01

Pos logit contributions
 two+0.013
 big+0.012
 a+0.012
 the+0.010
+0.009
Neg logit contributions
rily-0.060
immer-0.060
istent-0.059
lights-0.058
ittered-0.058
Token threw
Token ID9617
Feature activation-5.03e-01

Pos logit contributions
ittered+0.035
 avail+0.035
rily+0.034
immer+0.034
inations+0.034
Neg logit contributions
 a-0.015
 two-0.015
 big-0.015
-0.014
 and-0.014
Token a
Token ID257
Feature activation+5.13e-01

Pos logit contributions
 two+0.012
+0.011
 big+0.011
 a+0.011
 little+0.010
Neg logit contributions
rily-0.047
 avail-0.046
ittered-0.045
immer-0.045
keeping-0.045
Token tant
Token ID24246
Feature activation+3.50e+00

Pos logit contributions
rily+0.035
immer+0.035
ittered+0.035
inations+0.034
lights+0.033
Neg logit contributions
 a-0.031
 two-0.029
 the-0.028
-0.028
 and-0.028
Tokenrum
Token ID6582
Feature activation+2.67e-01

Pos logit contributions
rily+0.100
acket+0.099
arians+0.098
inations+0.094
lights+0.094
Neg logit contributions
 a-0.350
 and-0.343
 the-0.334
 two-0.333
.-0.331
Token.
Token ID13
Feature activation+1.91e-02

Pos logit contributions
rily+0.030
ittered+0.030
lights+0.029
immer+0.029
ently+0.029
Neg logit contributions
 big-0.004
 a-0.004
 two-0.004
 the-0.003
-0.003
Token She
Token ID1375
Feature activation+1.95e-01

Pos logit contributions
rily+0.001
ittered+0.001
immer+0.001
istent+0.001
lights+0.001
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
 the-0.001
 and-0.001
Token didn
Token ID1422
Feature activation+2.59e-01

Pos logit contributions
ittered+0.015
rily+0.015
inations+0.015
aned+0.014
immer+0.014
Neg logit contributions
 a-0.009
 two-0.009
 big-0.009
-0.009
 the-0.008
Token't
Token ID470
Feature activation+6.19e-02

Pos logit contributions
rily+0.019
ittered+0.019
immer+0.018
inations+0.018
istent+0.018
Neg logit contributions
 a-0.014
 two-0.014
 big-0.013
 the-0.013
 and-0.013
Token sad
Token ID6507
Feature activation-5.24e-01

Pos logit contributions
 a+0.014
 two+0.014
 big+0.014
+0.013
 and+0.013
Neg logit contributions
rily-0.021
leness-0.021
ittered-0.021
aned-0.020
istent-0.020
Token.
Token ID13
Feature activation-3.92e-01

Pos logit contributions
 big+0.012
 two+0.012
 a+0.011
 the+0.009
 little+0.008
Neg logit contributions
rily-0.052
lights-0.051
ittered-0.051
ently-0.051
immer-0.050
Token She
Token ID1375
Feature activation-5.50e-01

Pos logit contributions
 a+0.014
 big+0.013
 two+0.013
 the+0.011
 and+0.011
Neg logit contributions
rily-0.035
ittered-0.034
ently-0.034
keeping-0.034
istent-0.034
Token threw
Token ID9617
Feature activation-5.41e-01

Pos logit contributions
 a+0.027
 two+0.026
 big+0.026
 the+0.025
+0.024
Neg logit contributions
aned-0.040
rily-0.040
ittered-0.040
rated-0.039
heit-0.039
Token a
Token ID257
Feature activation+3.48e-01

Pos logit contributions
+0.008
 two+0.007
 big+0.007
 a+0.007
 next+0.006
Neg logit contributions
rily-0.056
ittered-0.054
immer-0.054
 avail-0.054
lights-0.053
Token tant
Token ID24246
Feature activation+3.47e+00

Pos logit contributions
rily+0.020
immer+0.020
ittered+0.019
inations+0.018
lights+0.018
Neg logit contributions
 a-0.025
 two-0.024
 the-0.023
-0.023
 and-0.023
Tokenrum
Token ID6582
Feature activation+1.73e-01

Pos logit contributions
rily+0.098
acket+0.097
arians+0.095
lights+0.092
immer+0.091
Neg logit contributions
 a-0.348
 and-0.341
 two-0.332
 the-0.331
 big-0.326
Token and
Token ID290
Feature activation+3.29e-01

Pos logit contributions
rily+0.019
ittered+0.019
lights+0.019
ently+0.019
keeping+0.018
Neg logit contributions
 big-0.003
 a-0.003
 two-0.002
 the-0.002
-0.002
Token cried
Token ID16896
Feature activation-2.94e-01

Pos logit contributions
ittered+0.025
rily+0.025
 avail+0.024
keeping+0.024
leness+0.024
Neg logit contributions
 a-0.016
 big-0.016
 two-0.015
-0.015
 and-0.014
Token loudly
Token ID23112
Feature activation-4.41e-01

Pos logit contributions
 two+0.006
 big+0.006
 a+0.006
+0.005
 the+0.005
Neg logit contributions
ittered-0.029
rily-0.029
immer-0.029
istent-0.029
lights-0.028
Token.
Token ID13
Feature activation-2.05e-01

Pos logit contributions
 a+0.007
 big+0.007
 two+0.007
+0.006
 the+0.005
Neg logit contributions
rily-0.048
ittered-0.048
immer-0.047
epad-0.046
istent-0.046
Token and
Token ID290
Feature activation-2.71e-01

Pos logit contributions
 a+0.001
 two+0.001
 big+0.001
 the+0.001
+0.001
Neg logit contributions
rily-0.009
ittered-0.009
istent-0.009
lights-0.009
inations-0.009
Token so
Token ID523
Feature activation-2.30e-01

Pos logit contributions
 two+0.013
 a+0.012
+0.012
 big+0.012
 and+0.012
Neg logit contributions
ittered-0.020
rated-0.020
rily-0.020
leness-0.020
istent-0.020
Token she
Token ID673
Feature activation-2.21e-01

Pos logit contributions
 a+0.009
 two+0.009
 and+0.008
 big+0.008
 right+0.008
Neg logit contributions
ittered-0.019
lights-0.019
rily-0.018
leness-0.018
 avail-0.018
Token threw
Token ID9617
Feature activation-8.02e-01

Pos logit contributions
 a+0.010
 big+0.009
 two+0.009
+0.009
 the+0.009
Neg logit contributions
rily-0.018
ittered-0.018
inations-0.017
arians-0.017
aned-0.017
Token a
Token ID257
Feature activation+2.93e-01

Pos logit contributions
+0.019
 big+0.018
 little+0.017
 two+0.017
 be+0.015
Neg logit contributions
rily-0.072
 avail-0.070
immer-0.070
ittered-0.070
keeping-0.069
Token tant
Token ID24246
Feature activation+3.45e+00

Pos logit contributions
rily+0.024
immer+0.024
ittered+0.024
inations+0.023
lights+0.023
Neg logit contributions
 a-0.014
 two-0.013
 the-0.013
-0.012
 and-0.012
Tokenrum
Token ID6582
Feature activation+5.80e-01

Pos logit contributions
rily+0.100
acket+0.098
arians+0.097
immer+0.095
vation+0.094
Neg logit contributions
 a-0.344
 and-0.338
 the-0.328
.-0.328
 two-0.327
Token when
Token ID618
Feature activation+3.35e-01

Pos logit contributions
rily+0.066
ittered+0.065
immer+0.064
lights+0.063
keeping+0.063
Neg logit contributions
 a-0.009
 big-0.008
 two-0.008
 the-0.007
-0.006
Token her
Token ID607
Feature activation-2.04e-01

Pos logit contributions
ittered+0.030
 avail+0.030
rily+0.029
lights+0.029
immer+0.029
Neg logit contributions
 a-0.011
 and-0.011
 two-0.011
 big-0.011
-0.010
Token parents
Token ID3397
Feature activation+7.29e-02

Pos logit contributions
 a+0.007
+0.007
 the+0.006
 and+0.006
 two+0.006
Neg logit contributions
lights-0.018
rily-0.018
ittered-0.018
fer-0.018
 footing-0.018
Token said
Token ID531
Feature activation+3.20e-01

Pos logit contributions
ittered+0.006
immer+0.006
rily+0.006
inations+0.006
istent+0.006
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
 the-0.003
-0.003
Token kept
Token ID4030
Feature activation+5.41e-02

Pos logit contributions
 a+0.007
 big+0.007
 two+0.007
+0.007
 the+0.007
Neg logit contributions
ittered-0.012
rily-0.012
aned-0.012
inations-0.012
istent-0.012
Token crying
Token ID13774
Feature activation-3.70e-01

Pos logit contributions
rily+0.003
ittered+0.003
 avail+0.003
keeping+0.003
epad+0.003
Neg logit contributions
 a-0.003
 big-0.003
 two-0.003
-0.003
 little-0.003
Token and
Token ID290
Feature activation+3.68e-01

Pos logit contributions
 a+0.007
 two+0.006
 big+0.006
 the+0.005
+0.005
Neg logit contributions
rily-0.039
lights-0.039
immer-0.038
ently-0.038
ittered-0.038
Token throwing
Token ID9644
Feature activation-3.84e-01

Pos logit contributions
ittered+0.027
rily+0.026
immer+0.025
inations+0.025
 avail+0.025
Neg logit contributions
 a-0.020
 big-0.020
 two-0.019
-0.018
 and-0.018
Token a
Token ID257
Feature activation+3.36e-01

Pos logit contributions
 two+0.009
 a+0.009
 big+0.009
+0.008
 next+0.008
Neg logit contributions
rily-0.038
immer-0.037
ittered-0.037
lights-0.036
 avail-0.035
Token tant
Token ID24246
Feature activation+3.45e+00

Pos logit contributions
rily+0.019
immer+0.019
ittered+0.018
lights+0.018
inations+0.018
Neg logit contributions
 a-0.025
 two-0.023
 the-0.023
-0.022
 and-0.022
Tokenrum
Token ID6582
Feature activation+2.97e-01

Pos logit contributions
rily+0.099
arians+0.097
acket+0.095
lights+0.092
inations+0.090
Neg logit contributions
 a-0.348
 and-0.338
 two-0.331
 the-0.330
 big-0.327
Token.
Token ID13
Feature activation-3.96e-01

Pos logit contributions
rily+0.033
ittered+0.033
lights+0.032
immer+0.032
ently+0.032
Neg logit contributions
 a-0.005
 big-0.005
 two-0.004
 the-0.003
-0.003
Token
Token ID198
Feature activation-3.33e-01

Pos logit contributions
 a+0.013
 big+0.013
 two+0.013
 the+0.012
 and+0.012
Neg logit contributions
ittered-0.035
rily-0.035
immer-0.034
ently-0.034
lights-0.034
Token
Token ID198
Feature activation-3.39e-01

Pos logit contributions
 a+0.005
 two+0.004
 big+0.004
 the+0.003
 and+0.003
Neg logit contributions
immer-0.037
ittered-0.037
rily-0.036
inations-0.036
ently-0.036
TokenFinally
Token ID11158
Feature activation+5.98e-02

Pos logit contributions
 a+0.023
 the+0.022
 big+0.022
 two+0.022
 and+0.022
Neg logit contributions
immer-0.020
ittered-0.019
rily-0.019
inations-0.019
aned-0.019
Token his
Token ID465
Feature activation+5.66e-01

Pos logit contributions
 big+0.002
 a+0.002
 two+0.002
+0.002
 and+0.002
Neg logit contributions
rily-0.008
ittered-0.008
immer-0.007
lights-0.007
 avail-0.007
Token feet
Token ID3625
Feature activation-5.78e-01

Pos logit contributions
rily+0.040
ittered+0.039
immer+0.038
inations+0.037
lights+0.037
Neg logit contributions
 a-0.034
 two-0.032
 the-0.031
-0.031
 big-0.031
Token and
Token ID290
Feature activation+1.80e-01

Pos logit contributions
 a+0.007
 two+0.007
 big+0.006
 the+0.005
+0.005
Neg logit contributions
rily-0.066
immer-0.065
ge-0.064
ittered-0.064
istent-0.063
Token threw
Token ID9617
Feature activation-4.03e-01

Pos logit contributions
rily+0.013
ittered+0.013
immer+0.013
fer+0.013
leness+0.013
Neg logit contributions
 a-0.009
 two-0.009
 big-0.009
-0.008
 and-0.008
Token a
Token ID257
Feature activation+3.70e-01

Pos logit contributions
+0.010
 big+0.010
 two+0.010
 a+0.010
 and+0.009
Neg logit contributions
rily-0.040
ittered-0.037
immer-0.037
lights-0.036
pan-0.036
Token tant
Token ID24246
Feature activation+3.44e+00

Pos logit contributions
rily+0.027
immer+0.026
ittered+0.026
inations+0.025
leness+0.025
Neg logit contributions
 a-0.021
 two-0.020
 the-0.020
-0.019
 and-0.019
Tokenrum
Token ID6582
Feature activation+3.16e-01

Pos logit contributions
rily+0.098
lights+0.096
immer+0.095
acket+0.093
inations+0.089
Neg logit contributions
 a-0.339
 and-0.332
 two-0.328
 the-0.323
.-0.322
Token.
Token ID13
Feature activation-2.77e-01

Pos logit contributions
rily+0.037
ittered+0.036
immer+0.035
lights+0.035
ently+0.034
Neg logit contributions
 a-0.005
 big-0.004
 two-0.004
 the-0.003
-0.003
Token
Token ID198
Feature activation-4.00e-01

Pos logit contributions
 a+0.007
 big+0.006
 two+0.006
 and+0.006
 the+0.005
Neg logit contributions
rily-0.028
ittered-0.027
lights-0.026
immer-0.026
acket-0.025
Token
Token ID198
Feature activation-3.99e-01

Pos logit contributions
 a+0.006
 two+0.006
 big+0.005
 and+0.005
 the+0.004
Neg logit contributions
immer-0.042
ittered-0.042
rily-0.042
inations-0.042
lights-0.041
Token"
Token ID1
Feature activation+2.68e-01

Pos logit contributions
 a+0.018
 big+0.017
 and+0.016
 the+0.016
 two+0.016
Neg logit contributions
immer-0.032
fer-0.031
rily-0.031
leness-0.030
ittered-0.030
Token and
Token ID290
Feature activation-4.00e-02

Pos logit contributions
 big+0.010
 a+0.009
 two+0.009
+0.007
 the+0.007
Neg logit contributions
rily-0.048
istent-0.047
ently-0.046
lights-0.046
iously-0.046
Token started
Token ID2067
Feature activation-8.08e-02

Pos logit contributions
 a+0.002
 two+0.002
+0.002
 big+0.002
 and+0.002
Neg logit contributions
rily-0.003
ittered-0.003
aned-0.003
leness-0.003
istent-0.003
Token to
Token ID284
Feature activation+6.80e-01

Pos logit contributions
 a+0.002
 two+0.002
 big+0.001
+0.001
 the+0.001
Neg logit contributions
rily-0.008
ittered-0.008
ge-0.008
keeping-0.008
epad-0.008
Token throw
Token ID3714
Feature activation-4.13e-01

Pos logit contributions
rily+0.039
ittered+0.038
immer+0.038
 avail+0.037
inations+0.037
Neg logit contributions
 a-0.048
 two-0.047
 big-0.046
-0.045
 and-0.045
Token a
Token ID257
Feature activation+9.24e-01

Pos logit contributions
 two+0.009
 a+0.008
 big+0.008
+0.008
 next+0.008
Neg logit contributions
rily-0.043
ittered-0.039
immer-0.039
 avail-0.039
lights-0.038
Token tant
Token ID24246
Feature activation+3.42e+00

Pos logit contributions
rily+0.062
immer+0.061
ittered+0.061
inations+0.059
lights+0.057
Neg logit contributions
 a-0.060
 two-0.057
 the-0.055
 big-0.055
 and-0.055
Tokenrum
Token ID6582
Feature activation+2.61e-01

Pos logit contributions
rily+0.099
acket+0.096
arians+0.095
lights+0.093
ittered+0.092
Neg logit contributions
 a-0.342
 and-0.335
 the-0.327
 two-0.326
.-0.324
Token.
Token ID13
Feature activation-1.68e-01

Pos logit contributions
rily+0.030
ittered+0.029
lights+0.028
immer+0.028
ently+0.028
Neg logit contributions
 big-0.004
 a-0.004
 two-0.003
 the-0.003
-0.003
Token Suddenly
Token ID24975
Feature activation+1.88e-01

Pos logit contributions
 a+0.009
 two+0.009
 big+0.009
 the+0.008
 and+0.008
Neg logit contributions
rily-0.012
ittered-0.011
ently-0.011
aned-0.011
istent-0.011
Token,
Token ID11
Feature activation+7.83e-02

Pos logit contributions
ittered+0.020
immer+0.020
rily+0.019
 avail+0.019
lights+0.019
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
-0.003
 and-0.003
Token she
Token ID673
Feature activation+2.43e-01

Pos logit contributions
 avail+0.007
rily+0.007
ittered+0.007
lights+0.007
comb+0.007
Neg logit contributions
 and-0.002
-0.002
 big-0.002
 a-0.002
 two-0.002
Token her
Token ID607
Feature activation+6.15e-01

Pos logit contributions
 two+0.008
 a+0.007
 big+0.007
+0.007
 next+0.006
Neg logit contributions
rily-0.030
ittered-0.029
keeping-0.028
lights-0.028
immer-0.027
Token feet
Token ID3625
Feature activation-5.54e-01

Pos logit contributions
rily+0.045
ittered+0.043
lights+0.042
arians+0.042
immer+0.041
Neg logit contributions
 a-0.036
 two-0.034
-0.033
 big-0.032
 the-0.032
Token and
Token ID290
Feature activation+3.85e-02

Pos logit contributions
 a+0.008
 two+0.007
 big+0.006
+0.006
 the+0.005
Neg logit contributions
rily-0.062
immer-0.062
ittered-0.061
ge-0.060
istent-0.060
Token threw
Token ID9617
Feature activation-6.63e-01

Pos logit contributions
rily+0.003
ittered+0.003
leness+0.003
fer+0.003
 avail+0.003
Neg logit contributions
 a-0.002
 big-0.002
 two-0.002
-0.002
 and-0.002
Token a
Token ID257
Feature activation+3.72e-01

Pos logit contributions
+0.016
 big+0.016
 a+0.016
 two+0.016
 and+0.014
Neg logit contributions
rily-0.065
ittered-0.062
immer-0.062
rol-0.060
lights-0.060
Token tant
Token ID24246
Feature activation+3.42e+00

Pos logit contributions
rily+0.015
immer+0.014
ittered+0.014
inations+0.013
leness+0.013
Neg logit contributions
 a-0.034
 two-0.033
 the-0.032
-0.032
 and-0.032
Tokenrum
Token ID6582
Feature activation+2.25e-01

Pos logit contributions
acket+0.095
rily+0.094
arians+0.092
inations+0.090
lights+0.089
Neg logit contributions
 a-0.342
 and-0.336
 the-0.326
 two-0.326
.-0.325
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
rily+0.026
ittered+0.025
lights+0.024
immer+0.024
ently+0.024
Neg logit contributions
 a-0.003
 big-0.003
 two-0.003
 the-0.002
-0.002
Token She
Token ID1375
Feature activation+4.71e-02

Pos logit contributions
 a+0.005
 big+0.005
 two+0.005
 the+0.004
 and+0.004
Neg logit contributions
rily-0.008
ittered-0.008
istent-0.007
ently-0.007
arians-0.007
Token grabbed
Token ID13176
Feature activation-4.46e-01

Pos logit contributions
ittered+0.003
rily+0.003
inations+0.003
arians+0.003
aned+0.003
Neg logit contributions
 a-0.002
 two-0.002
 big-0.002
-0.002
 the-0.002
Token the
Token ID262
Feature activation+4.42e-01

Pos logit contributions
+0.007
 big+0.006
 and+0.006
 a+0.005
 two+0.005
Neg logit contributions
rily-0.049
ittered-0.048
immer-0.046
 avail-0.045
lights-0.045
Token them
Token ID606
Feature activation+1.84e-01

Pos logit contributions
rily+0.009
ittered+0.009
inations+0.009
immer+0.009
leness+0.009
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
-0.002
 and-0.002
Token and
Token ID290
Feature activation-4.60e-02

Pos logit contributions
ittered+0.019
rily+0.019
immer+0.019
inations+0.019
lights+0.019
Neg logit contributions
 a-0.004
 two-0.004
 big-0.004
-0.003
 the-0.003
Token never
Token ID1239
Feature activation+1.14e-01

Pos logit contributions
 a+0.002
 two+0.002
 big+0.002
+0.002
 and+0.001
Neg logit contributions
rily-0.004
ittered-0.004
keeping-0.004
inations-0.004
leness-0.004
Token threw
Token ID9617
Feature activation-5.10e-01

Pos logit contributions
ittered+0.009
immer+0.008
epad+0.008
rily+0.008
leness+0.008
Neg logit contributions
 a-0.006
 two-0.006
 big-0.005
-0.005
 and-0.005
Token a
Token ID257
Feature activation+1.01e+00

Pos logit contributions
 two+0.020
 a+0.020
 big+0.019
+0.019
 and+0.018
Neg logit contributions
rily-0.044
ittered-0.042
immer-0.041
keeping-0.041
lights-0.040
Token tant
Token ID24246
Feature activation+3.32e+00

Pos logit contributions
rily+0.069
immer+0.069
ittered+0.069
inations+0.067
lights+0.065
Neg logit contributions
 a-0.063
 two-0.060
 the-0.058
 big-0.058
 and-0.057
Tokenrum
Token ID6582
Feature activation+1.39e-01

Pos logit contributions
rily+0.097
immer+0.092
lights+0.092
acket+0.091
plets+0.086
Neg logit contributions
 a-0.326
 and-0.318
 two-0.315
.-0.311
 the-0.310
Token again
Token ID757
Feature activation-2.74e-01

Pos logit contributions
rily+0.016
ittered+0.016
immer+0.015
lights+0.015
keeping+0.015
Neg logit contributions
 a-0.002
 big-0.002
 two-0.002
 the-0.002
-0.002
Token.
Token ID13
Feature activation-1.77e-01

Pos logit contributions
 a+0.005
 two+0.005
 big+0.004
+0.003
 the+0.003
Neg logit contributions
rily-0.029
immer-0.029
ittered-0.029
ge-0.028
 avail-0.028
Token<|endoftext|>
Token ID50256
Feature activation-4.85e-01

Pos logit contributions
 a+0.006
 big+0.005
 two+0.005
 the+0.005
 and+0.005
Neg logit contributions
rily-0.017
ittered-0.016
immer-0.016
lor-0.016
lights-0.016
TokenOnce
Token ID7454
Feature activation-3.66e-01

Pos logit contributions
 a+0.006
 two+0.006
 big+0.006
 little+0.004
 the+0.004
Neg logit contributions
ittered-0.052
rily-0.052
immer-0.052
keeping-0.052
lor-0.051
Token way
Token ID835
Feature activation+8.53e-02

Pos logit contributions
rily+0.051
ittered+0.050
immer+0.049
 avail+0.049
inations+0.048
Neg logit contributions
 a-0.040
 two-0.038
 big-0.037
 and-0.037
-0.036
Token forward
Token ID2651
Feature activation-6.61e-02

Pos logit contributions
rily+0.008
immer+0.008
ittered+0.008
ge+0.007
 avail+0.007
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
-0.003
 little-0.003
Token.
Token ID13
Feature activation+6.05e-01

Pos logit contributions
 a+0.001
 two+0.001
 big+0.001
 the+0.001
+0.001
Neg logit contributions
ittered-0.007
rily-0.007
immer-0.007
ge-0.007
keeping-0.007
Token She
Token ID1375
Feature activation+5.06e-02

Pos logit contributions
rily+0.049
ittered+0.049
immer+0.047
arians+0.046
lights+0.046
Neg logit contributions
 a-0.029
 big-0.028
 two-0.027
 and-0.026
 the-0.026
Token stayed
Token ID9658
Feature activation+2.97e-01

Pos logit contributions
ittered+0.004
rily+0.004
inations+0.004
fer+0.004
immer+0.004
Neg logit contributions
 a-0.002
 big-0.002
 two-0.002
-0.002
 the-0.002
Token foc
Token ID2133
Feature activation+3.26e+00

Pos logit contributions
rily+0.030
ittered+0.030
immer+0.029
 avail+0.028
inations+0.028
Neg logit contributions
 a-0.008
 two-0.008
-0.007
 big-0.007
 the-0.006
Tokenussed
Token ID29569
Feature activation+4.66e-02

Pos logit contributions
igi+0.102
 avail+0.098
inations+0.096
leness+0.095
rob+0.090
Neg logit contributions
 and-0.304
 a-0.300
 two-0.289
 at-0.285
.-0.281
Token.
Token ID13
Feature activation+5.13e-01

Pos logit contributions
rily+0.005
ittered+0.005
immer+0.005
lights+0.005
ently+0.005
Neg logit contributions
 a-0.001
 two-0.001
 big-0.000
-0.000
 the-0.000
Token On
Token ID1550
Feature activation+5.70e-01

Pos logit contributions
rily+0.043
ittered+0.043
immer+0.042
arians+0.041
istent+0.041
Neg logit contributions
 a-0.023
 big-0.022
 two-0.021
 the-0.020
 and-0.020
Token the
Token ID262
Feature activation+7.48e-01

Pos logit contributions
ittered+0.065
rily+0.064
immer+0.063
 avail+0.063
lights+0.063
Neg logit contributions
 a-0.008
 big-0.007
 two-0.006
 and-0.006
-0.006
Token third
Token ID2368
Feature activation+2.46e-01

Pos logit contributions
rily+0.058
immer+0.056
ittered+0.056
 avail+0.056
inations+0.056
Neg logit contributions
 a-0.039
 two-0.036
-0.036
 and-0.036
 big-0.036
Token She
Token ID1375
Feature activation-3.06e-01

Pos logit contributions
 a+0.013
 big+0.012
 two+0.012
 the+0.011
 and+0.011
Neg logit contributions
rily-0.021
ently-0.020
lights-0.020
ittered-0.020
istent-0.020
Token cried
Token ID16896
Feature activation-2.84e-01

Pos logit contributions
 a+0.013
 big+0.013
 two+0.012
 the+0.012
+0.012
Neg logit contributions
ittered-0.024
rily-0.024
inations-0.024
immer-0.024
arians-0.023
Token and
Token ID290
Feature activation-6.57e-02

Pos logit contributions
 two+0.004
 big+0.004
 a+0.003
+0.003
 the+0.003
Neg logit contributions
rily-0.031
ittered-0.030
immer-0.030
lights-0.030
istent-0.030
Token threw
Token ID9617
Feature activation-2.85e-01

Pos logit contributions
 big+0.003
 a+0.003
 two+0.003
+0.003
 and+0.003
Neg logit contributions
ittered-0.005
rily-0.005
 avail-0.005
leness-0.005
immer-0.005
Token a
Token ID257
Feature activation+3.54e-01

Pos logit contributions
 two+0.007
+0.007
 big+0.007
 a+0.007
 next+0.006
Neg logit contributions
rily-0.027
ittered-0.026
immer-0.026
keeping-0.026
lights-0.026
Token tant
Token ID24246
Feature activation+3.24e+00

Pos logit contributions
rily+0.020
immer+0.020
ittered+0.020
inations+0.019
leness+0.019
Neg logit contributions
 a-0.025
 two-0.024
 the-0.024
-0.023
 and-0.023
Tokenrum
Token ID6582
Feature activation+1.29e-01

Pos logit contributions
rily+0.092
acket+0.087
immer+0.086
arians+0.086
ittered+0.085
Neg logit contributions
 a-0.327
 and-0.321
 two-0.313
 the-0.313
 big-0.311
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
rily+0.014
ittered+0.014
lights+0.014
ently+0.014
keeping+0.014
Neg logit contributions
 big-0.002
 a-0.002
 two-0.002
 the-0.001
-0.001
Token Her
Token ID2332
Feature activation-7.39e-02

Pos logit contributions
 a+0.007
 big+0.006
 two+0.006
 the+0.006
 and+0.006
Neg logit contributions
rily-0.008
ittered-0.008
ently-0.008
lights-0.008
immer-0.008
Token mom
Token ID1995
Feature activation-8.67e-02

Pos logit contributions
 a+0.002
+0.002
 the+0.002
 two+0.002
 and+0.002
Neg logit contributions
lights-0.007
ittered-0.007
inations-0.007
rily-0.007
 footing-0.007
Token tried
Token ID3088
Feature activation+8.37e-02

Pos logit contributions
 a+0.004
 big+0.003
 two+0.003
 the+0.003
+0.003
Neg logit contributions
ittered-0.007
immer-0.007
inations-0.007
istent-0.007
rily-0.007
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
 two+0.010
 a+0.009
 big+0.009
 the+0.007
 little+0.007
Neg logit contributions
rily-0.050
ge-0.049
immer-0.049
lights-0.048
epad-0.048
Token He
Token ID679
Feature activation-2.34e-01

Pos logit contributions
 a+0.006
 two+0.005
 big+0.005
 the+0.005
 and+0.005
Neg logit contributions
rily-0.009
lights-0.009
keeping-0.009
epad-0.008
 avail-0.008
Token kept
Token ID4030
Feature activation-1.32e-02

Pos logit contributions
 a+0.009
 two+0.008
 big+0.008
 the+0.008
+0.008
Neg logit contributions
ittered-0.020
rily-0.020
immer-0.020
arians-0.019
fer-0.019
Token throwing
Token ID9644
Feature activation-5.36e-01

Pos logit contributions
 a+0.001
 big+0.001
 two+0.001
+0.001
 and+0.001
Neg logit contributions
rily-0.001
keeping-0.001
ittered-0.001
fer-0.001
 avail-0.001
Token a
Token ID257
Feature activation+1.45e-01

Pos logit contributions
+0.015
 a+0.015
 two+0.015
 big+0.015
 next+0.013
Neg logit contributions
rily-0.050
immer-0.048
ittered-0.047
lights-0.047
keeping-0.047
Token tant
Token ID24246
Feature activation+3.14e+00

Pos logit contributions
rily+0.011
immer+0.011
ittered+0.011
inations+0.011
leness+0.011
Neg logit contributions
 a-0.008
 two-0.007
 the-0.007
-0.007
 and-0.006
Tokenrum
Token ID6582
Feature activation+8.93e-02

Pos logit contributions
rily+0.084
ittered+0.079
arians+0.077
lights+0.077
immer+0.077
Neg logit contributions
 a-0.323
 and-0.315
 two-0.311
 big-0.308
 the-0.308
Token.
Token ID13
Feature activation-8.16e-02

Pos logit contributions
rily+0.010
ittered+0.010
lights+0.010
immer+0.010
ently+0.010
Neg logit contributions
 big-0.001
 a-0.001
 two-0.001
 the-0.001
 little-0.001
Token His
Token ID2399
Feature activation+2.41e-01

Pos logit contributions
 a+0.004
 big+0.003
 two+0.003
 and+0.003
 the+0.003
Neg logit contributions
rily-0.006
lights-0.006
ittered-0.006
immer-0.006
epad-0.006
Token mom
Token ID1995
Feature activation-2.03e-01

Pos logit contributions
rily+0.024
lights+0.024
ittered+0.023
keeping+0.023
 footing+0.023
Neg logit contributions
 a-0.007
 the-0.006
 two-0.006
-0.006
 and-0.005
Token was
Token ID373
Feature activation-1.75e-01

Pos logit contributions
 a+0.008
 big+0.008
 two+0.008
 the+0.007
+0.007
Neg logit contributions
ittered-0.017
immer-0.017
rily-0.017
inations-0.017
istent-0.016
Token stubborn
Token ID24444
Feature activation-4.70e-01

Pos logit contributions
+0.023
 two+0.021
 and+0.021
 a+0.021
 big+0.020
Neg logit contributions
ittered-0.038
rily-0.037
epad-0.036
immer-0.036
inations-0.035
Token.
Token ID13
Feature activation+1.20e-01

Pos logit contributions
 big+0.009
 a+0.009
 two+0.008
 the+0.007
+0.007
Neg logit contributions
rily-0.048
epad-0.047
lights-0.047
keeping-0.046
ittered-0.046
Token She
Token ID1375
Feature activation-4.59e-01

Pos logit contributions
rily+0.010
ittered+0.010
istent+0.010
lights+0.010
immer+0.010
Neg logit contributions
 a-0.005
 big-0.005
 two-0.005
 the-0.004
 and-0.004
Token throws
Token ID12542
Feature activation-6.42e-01

Pos logit contributions
 a+0.026
 two+0.025
+0.025
 the+0.025
 big+0.024
Neg logit contributions
ittered-0.030
leness-0.029
rily-0.029
inations-0.029
arians-0.029
Token a
Token ID257
Feature activation+2.41e-01

Pos logit contributions
+0.011
 big+0.011
 two+0.011
 a+0.010
 little+0.009
Neg logit contributions
rily-0.067
ittered-0.062
immer-0.062
pan-0.062
lights-0.061
Token tant
Token ID24246
Feature activation+3.12e+00

Pos logit contributions
rily+0.013
immer+0.012
ittered+0.012
inations+0.011
leness+0.011
Neg logit contributions
 a-0.019
 two-0.018
 the-0.018
-0.017
 and-0.017
Tokenrum
Token ID6582
Feature activation+1.67e-01

Pos logit contributions
rily+0.086
immer+0.081
acket+0.081
ittered+0.078
lights+0.078
Neg logit contributions
 a-0.318
 and-0.310
 two-0.306
 the-0.304
 big-0.300
Token.
Token ID13
Feature activation+1.58e-01

Pos logit contributions
rily+0.019
ittered+0.019
lights+0.018
ently+0.018
immer+0.018
Neg logit contributions
 big-0.003
 a-0.002
 two-0.002
 the-0.002
 little-0.002
Token She
Token ID1375
Feature activation-2.25e-02

Pos logit contributions
rily+0.013
ittered+0.013
immer+0.013
lights+0.013
istent+0.013
Neg logit contributions
 a-0.006
 big-0.006
 two-0.006
 the-0.005
 and-0.005
Token kicks
Token ID17364
Feature activation-5.31e-01

Pos logit contributions
 a+0.001
 two+0.001
+0.001
 big+0.001
 the+0.001
Neg logit contributions
ittered-0.001
rily-0.001
inations-0.001
leness-0.001
arians-0.001
Token and
Token ID290
Feature activation-4.30e-01

Pos logit contributions
 big+0.007
 a+0.006
 two+0.006
+0.005
 little+0.004
Neg logit contributions
rily-0.056
ittered-0.053
lights-0.053
immer-0.053
ently-0.052
Token started
Token ID2067
Feature activation+1.95e-01

Pos logit contributions
ittered+0.015
inations+0.015
immer+0.014
rily+0.014
arians+0.014
Neg logit contributions
 a-0.008
 two-0.008
 big-0.008
-0.007
 the-0.007
Token walking
Token ID6155
Feature activation-1.26e-01

Pos logit contributions
ittered+0.019
rily+0.019
ge+0.018
inations+0.018
epad+0.018
Neg logit contributions
 a-0.005
 two-0.005
 big-0.005
-0.005
 little-0.004
Token very
Token ID845
Feature activation+2.79e-01

Pos logit contributions
 two+0.004
 a+0.003
 big+0.003
+0.003
 little+0.003
Neg logit contributions
rily-0.012
ittered-0.012
lights-0.012
immer-0.011
eway-0.011
Token slowly
Token ID6364
Feature activation-7.95e-02

Pos logit contributions
rily+0.023
ittered+0.023
ge+0.023
immer+0.023
lights+0.022
Neg logit contributions
 a-0.012
 two-0.012
-0.010
 big-0.010
 the-0.010
Token as
Token ID355
Feature activation+8.43e-01

Pos logit contributions
 a+0.002
 two+0.002
 big+0.001
+0.001
 the+0.001
Neg logit contributions
immer-0.008
ittered-0.008
rily-0.008
ge-0.008
lights-0.008
Token tort
Token ID7619
Feature activation+3.08e+00

Pos logit contributions
rily+0.080
ittered+0.080
immer+0.078
lights+0.077
inations+0.077
Neg logit contributions
 a-0.028
 two-0.026
 big-0.025
 and-0.024
-0.024
Tokeno
Token ID78
Feature activation+2.05e-01

Pos logit contributions
rily+0.116
lights+0.115
immer+0.098
rier+0.097
 avail+0.096
Neg logit contributions
 a-0.287
 two-0.286
 and-0.274
 the-0.273
-0.268
Tokenises
Token ID2696
Feature activation-3.16e-01

Pos logit contributions
ittered+0.021
rily+0.021
immer+0.020
istent+0.020
fer+0.020
Neg logit contributions
 a-0.006
 big-0.005
 two-0.005
 the-0.005
-0.005
Token do
Token ID466
Feature activation-2.35e-01

Pos logit contributions
 a+0.006
 big+0.005
 two+0.005
+0.004
 little+0.004
Neg logit contributions
ittered-0.034
rily-0.034
immer-0.033
arians-0.033
istent-0.033
Token,
Token ID11
Feature activation+5.37e-02

Pos logit contributions
 a+0.006
 two+0.006
 big+0.005
+0.005
 and+0.005
Neg logit contributions
ittered-0.023
inations-0.023
rily-0.023
immer-0.023
leness-0.023
Token but
Token ID475
Feature activation-3.90e-02

Pos logit contributions
rily+0.005
ittered+0.005
inations+0.004
immer+0.004
istent+0.004
Neg logit contributions
 a-0.002
 big-0.002
 two-0.002
-0.002
 the-0.002
Token surprise
Token ID5975
Feature activation+2.63e-01

Pos logit contributions
rily+0.023
immer+0.023
inations+0.022
ittered+0.022
 avail+0.022
Neg logit contributions
 a-0.018
 the-0.017
-0.017
 two-0.016
 and-0.016
Token.
Token ID13
Feature activation-2.41e-01

Pos logit contributions
rily+0.029
immer+0.029
lights+0.028
ittered+0.028
inations+0.028
Neg logit contributions
 a-0.005
 two-0.004
 big-0.004
-0.003
 the-0.003
Token He
Token ID679
Feature activation-2.79e-01

Pos logit contributions
 a+0.010
 big+0.010
 two+0.010
 and+0.010
 the+0.009
Neg logit contributions
rily-0.018
lights-0.018
immer-0.018
keeping-0.018
ittered-0.017
Token threw
Token ID9617
Feature activation-7.26e-01

Pos logit contributions
 a+0.011
 big+0.010
 two+0.010
 the+0.010
+0.010
Neg logit contributions
ittered-0.023
rily-0.023
arians-0.022
immer-0.022
aned-0.022
Token a
Token ID257
Feature activation+6.31e-02

Pos logit contributions
+0.014
 two+0.013
 big+0.012
 a+0.012
 next+0.012
Neg logit contributions
rily-0.073
ittered-0.068
immer-0.068
keeping-0.068
lights-0.067
Token tant
Token ID24246
Feature activation+3.07e+00

Pos logit contributions
rily+0.004
immer+0.004
ittered+0.004
inations+0.004
leness+0.004
Neg logit contributions
 a-0.004
 two-0.004
 the-0.004
-0.003
 and-0.003
Tokenrum
Token ID6582
Feature activation+1.42e-01

Pos logit contributions
rily+0.083
immer+0.079
ittered+0.078
acket+0.077
arians+0.076
Neg logit contributions
 a-0.314
 and-0.307
 two-0.302
 the-0.299
 big-0.298
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
rily+0.016
ittered+0.016
lights+0.015
ently+0.015
immer+0.015
Neg logit contributions
 big-0.002
 a-0.002
 two-0.002
 the-0.002
-0.001
Token He
Token ID679
Feature activation+4.46e-03

Pos logit contributions
 a+0.005
 big+0.004
 two+0.004
 and+0.004
 the+0.004
Neg logit contributions
rily-0.009
lights-0.009
ittered-0.009
immer-0.009
keeping-0.009
Token kicked
Token ID12165
Feature activation-6.26e-01

Pos logit contributions
ittered+0.000
rily+0.000
inations+0.000
arians+0.000
aned+0.000
Neg logit contributions
 a-0.000
 two-0.000
 big-0.000
-0.000
 the-0.000
Token and
Token ID290
Feature activation-1.81e-01

Pos logit contributions
 big+0.007
 two+0.007
 a+0.006
+0.006
 next+0.005
Neg logit contributions
rily-0.066
immer-0.061
ge-0.061
ittered-0.061
lights-0.061
Token found
Token ID1043
Feature activation-7.57e-01

Pos logit contributions
 a+0.002
 two+0.002
 the+0.002
 big+0.001
+0.001
Neg logit contributions
ittered-0.003
immer-0.003
inations-0.003
istent-0.003
comb-0.003
Token it
Token ID340
Feature activation+3.70e-01

Pos logit contributions
 big+0.020
+0.019
 and+0.019
 two+0.017
 a+0.017
Neg logit contributions
rily-0.071
ittered-0.068
immer-0.067
lor-0.066
keeping-0.066
Token!"
Token ID2474
Feature activation+1.71e-01

Pos logit contributions
rily+0.031
ittered+0.031
immer+0.031
inations+0.030
comb+0.030
Neg logit contributions
 a-0.016
 two-0.015
 big-0.015
 the-0.015
-0.015
Token
Token ID198
Feature activation-4.43e-02

Pos logit contributions
rily+0.016
ittered+0.016
istent+0.015
immer+0.015
lights+0.015
Neg logit contributions
 a-0.005
 two-0.005
 big-0.005
 the-0.004
 and-0.004
Token
Token ID198
Feature activation+3.54e-01

Pos logit contributions
 a+0.001
 two+0.000
 big+0.000
 the+0.000
 and+0.000
Neg logit contributions
rily-0.005
ittered-0.005
immer-0.005
lor-0.005
inations-0.005
TokenCharl
Token ID24453
Feature activation+3.03e+00

Pos logit contributions
rily+0.022
immer+0.022
ittered+0.021
arians+0.021
lor+0.021
Neg logit contributions
 a-0.024
 big-0.022
 two-0.022
 the-0.022
 and-0.022
Tokenotte
Token ID11404
Feature activation-2.14e-01

Pos logit contributions
immer+0.121
lights+0.114
inations+0.112
ittered+0.111
rily+0.106
Neg logit contributions
 a-0.276
 two-0.270
 the-0.268
 and-0.268
-0.263
Token and
Token ID290
Feature activation+2.36e-01

Pos logit contributions
 a+0.009
 big+0.009
 two+0.009
 the+0.008
+0.008
Neg logit contributions
ittered-0.018
rily-0.018
immer-0.017
arians-0.017
istent-0.017
Token her
Token ID607
Feature activation-2.72e-02

Pos logit contributions
rily+0.023
 avail+0.022
ittered+0.022
immer+0.021
arians+0.021
Neg logit contributions
 a-0.007
 big-0.006
 two-0.006
 and-0.006
-0.005
Token mom
Token ID1995
Feature activation-1.83e-02

Pos logit contributions
 a+0.001
+0.001
 the+0.001
 and+0.001
 two+0.001
Neg logit contributions
lights-0.003
ittered-0.003
rily-0.003
arians-0.002
 footing-0.002
Token brought
Token ID3181
Feature activation-2.85e-01

Pos logit contributions
 a+0.001
 big+0.001
+0.001
 two+0.001
 the+0.001
Neg logit contributions
ittered-0.002
immer-0.002
rily-0.001
rol-0.001
inations-0.001
Token the
Token ID262
Feature activation+1.95e-01

Pos logit contributions
rily+0.008
immer+0.008
ittered+0.008
leness+0.008
inations+0.008
Neg logit contributions
 two-0.004
 a-0.003
 big-0.003
 and-0.003
-0.003
Token same
Token ID976
Feature activation+6.18e-02

Pos logit contributions
immer+0.016
rily+0.016
ittered+0.015
inations+0.015
vation+0.015
Neg logit contributions
 a-0.009
-0.008
 the-0.008
 and-0.008
 two-0.008
Token urge
Token ID14960
Feature activation-2.16e-01

Pos logit contributions
rily+0.005
immer+0.005
ittered+0.005
inations+0.005
lights+0.005
Neg logit contributions
 a-0.002
 two-0.002
-0.002
 big-0.002
 the-0.002
Token to
Token ID284
Feature activation-1.68e-01

Pos logit contributions
 a+0.004
 two+0.004
 big+0.004
+0.003
 the+0.003
Neg logit contributions
ittered-0.023
rily-0.022
immer-0.022
inations-0.022
lights-0.022
Token explore
Token ID7301
Feature activation+2.59e-01

Pos logit contributions
 a+0.007
 two+0.007
 big+0.007
 and+0.006
+0.006
Neg logit contributions
 avail-0.014
immer-0.014
ittered-0.014
inations-0.014
rily-0.013
Token Ur
Token ID8799
Feature activation+3.01e+00

Pos logit contributions
rily+0.028
ittered+0.028
immer+0.028
lights+0.028
inations+0.027
Neg logit contributions
 two-0.004
 a-0.004
 big-0.004
-0.003
 little-0.003
Tokenge
Token ID469
Feature activation+3.39e-01

Pos logit contributions
rily+0.058
lights+0.057
 avail+0.053
ittered+0.051
comb+0.050
Neg logit contributions
 a-0.316
 and-0.316
 the-0.313
.-0.310
 two-0.309
Token Land
Token ID6379
Feature activation+3.58e-01

Pos logit contributions
rily+0.038
ittered+0.038
immer+0.037
lights+0.037
inations+0.036
Neg logit contributions
 a-0.006
 two-0.005
 big-0.005
-0.004
 the-0.004
Token everyday
Token ID10908
Feature activation+1.03e-01

Pos logit contributions
rily+0.040
ittered+0.040
immer+0.040
lights+0.039
inations+0.039
Neg logit contributions
 a-0.006
 two-0.005
 big-0.005
 the-0.004
-0.004
Token,
Token ID11
Feature activation+2.14e-02

Pos logit contributions
rily+0.011
ittered+0.011
immer+0.011
inations+0.011
lights+0.011
Neg logit contributions
 a-0.002
 two-0.002
 big-0.001
-0.001
 the-0.001
Token and
Token ID290
Feature activation-3.20e-01

Pos logit contributions
inations+0.002
rily+0.002
ittered+0.002
immer+0.002
istent+0.002
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
-0.001
 the-0.001
Token listen
Token ID6004
Feature activation-6.68e-01

Pos logit contributions
 a+0.004
 big+0.004
 two+0.004
+0.004
 the+0.004
Neg logit contributions
immer-0.005
leness-0.005
epad-0.005
inations-0.005
ides-0.005
Token.
Token ID13
Feature activation+3.68e-01

Pos logit contributions
 two+0.015
 a+0.014
 big+0.013
 the+0.011
+0.010
Neg logit contributions
rily-0.065
ently-0.065
lights-0.065
ittered-0.064
epad-0.064
Token She
Token ID1375
Feature activation-2.38e-01

Pos logit contributions
rily+0.032
ittered+0.032
immer+0.031
lights+0.031
istent+0.031
Neg logit contributions
 a-0.015
 two-0.014
 big-0.014
 the-0.013
 and-0.012
Token threw
Token ID9617
Feature activation-4.76e-01

Pos logit contributions
 a+0.010
 big+0.010
 the+0.010
 two+0.009
+0.009
Neg logit contributions
ittered-0.020
rily-0.019
immer-0.019
inations-0.019
arians-0.019
Token a
Token ID257
Feature activation+2.35e-01

Pos logit contributions
+0.009
 two+0.008
 big+0.008
 a+0.008
 little+0.007
Neg logit contributions
rily-0.048
ittered-0.047
immer-0.046
lights-0.046
keeping-0.045
Token tant
Token ID24246
Feature activation+2.94e+00

Pos logit contributions
rily+0.012
immer+0.012
ittered+0.012
inations+0.012
leness+0.012
Neg logit contributions
 a-0.018
 two-0.017
 the-0.017
-0.016
 and-0.016
Tokenrum
Token ID6582
Feature activation+1.39e-02

Pos logit contributions
rily+0.082
immer+0.077
acket+0.077
arians+0.076
ittered+0.075
Neg logit contributions
 a-0.300
 and-0.292
 two-0.288
 the-0.286
 big-0.285
Token in
Token ID287
Feature activation-4.14e-01

Pos logit contributions
rily+0.002
ittered+0.002
lights+0.002
ently+0.001
keeping+0.001
Neg logit contributions
 big-0.000
 a-0.000
 two-0.000
 the-0.000
-0.000
Token the
Token ID262
Feature activation-2.71e-01

Pos logit contributions
 a+0.006
+0.006
 big+0.006
 two+0.006
 and+0.005
Neg logit contributions
rily-0.042
 avail-0.041
keeping-0.041
ittered-0.041
immer-0.041
Token store
Token ID3650
Feature activation+1.09e-01

Pos logit contributions
 a+0.013
 and+0.012
 the+0.012
+0.012
 two+0.011
Neg logit contributions
rily-0.020
immer-0.020
lights-0.020
leness-0.020
acket-0.020
Token.
Token ID13
Feature activation+2.58e-01

Pos logit contributions
rily+0.012
ittered+0.012
immer+0.012
arians+0.012
keeping+0.012
Neg logit contributions
 a-0.002
 two-0.001
 big-0.001
-0.001
 the-0.001
Token the
Token ID262
Feature activation-8.58e-02

Pos logit contributions
rily+0.103
ittered+0.103
immer+0.102
 avail+0.101
inations+0.099
Neg logit contributions
 a-0.012
 two-0.011
 big-0.010
 and-0.009
-0.008
Token shed
Token ID14999
Feature activation+9.17e-02

Pos logit contributions
 a+0.005
 two+0.005
 and+0.005
+0.005
 the+0.004
Neg logit contributions
rily-0.006
arians-0.006
comb-0.006
stack-0.006
keeping-0.006
Token.
Token ID13
Feature activation+4.63e-01

Pos logit contributions
rily+0.010
immer+0.010
ittered+0.010
lights+0.010
inations+0.010
Neg logit contributions
 a-0.001
 big-0.001
 two-0.001
 the-0.001
-0.001
Token
Token ID198
Feature activation-3.19e-02

Pos logit contributions
rily+0.032
ittered+0.031
immer+0.030
istent+0.030
arians+0.030
Neg logit contributions
 a-0.028
 two-0.027
 big-0.027
 the-0.025
 and-0.025
Token
Token ID198
Feature activation+2.50e-01

Pos logit contributions
 a+0.001
 two+0.000
 big+0.000
 the+0.000
 and+0.000
Neg logit contributions
rily-0.003
immer-0.003
ittered-0.003
lor-0.003
inations-0.003
TokenCharl
Token ID24453
Feature activation+2.92e+00

Pos logit contributions
immer+0.018
rily+0.018
ittered+0.018
arians+0.018
inations+0.018
Neg logit contributions
 a-0.014
 big-0.013
 two-0.013
 the-0.013
 and-0.012
Tokenotte
Token ID11404
Feature activation-6.95e-01

Pos logit contributions
immer+0.112
ittered+0.105
lights+0.104
inations+0.103
rily+0.098
Neg logit contributions
 a-0.271
 two-0.265
 and-0.264
 the-0.263
-0.259
Token ran
Token ID4966
Feature activation-4.45e-01

Pos logit contributions
 a+0.026
 big+0.025
 two+0.023
 the+0.022
+0.021
Neg logit contributions
ittered-0.061
rily-0.060
immer-0.060
arians-0.059
istent-0.059
Token to
Token ID284
Feature activation+2.47e-02

Pos logit contributions
 two+0.014
 big+0.013
 a+0.013
+0.012
 little+0.012
Neg logit contributions
rily-0.040
keeping-0.039
ittered-0.039
immer-0.039
ge-0.039
Token the
Token ID262
Feature activation-4.62e-01

Pos logit contributions
 avail+0.003
rily+0.003
ittered+0.003
immer+0.003
irs+0.003
Neg logit contributions
 two-0.000
 big-0.000
 a-0.000
 and-0.000
-0.000
Token shed
Token ID14999
Feature activation+2.55e-02

Pos logit contributions
 a+0.029
+0.027
 and+0.027
 the+0.027
 two+0.026
Neg logit contributions
rily-0.028
stack-0.027
comb-0.027
arians-0.027
keeping-0.026
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
ittered+0.067
rily+0.067
immer+0.066
inations+0.065
lights+0.065
Neg logit contributions
 a-0.012
 two-0.011
 big-0.010
-0.009
 the-0.008
Token the
Token ID262
Feature activation-3.25e-01

Pos logit contributions
rily+0.010
ittered+0.009
inations+0.009
lights+0.009
immer+0.009
Neg logit contributions
 a-0.004
 big-0.004
-0.004
 two-0.004
 and-0.003
Token harsh
Token ID11859
Feature activation-7.38e-02

Pos logit contributions
 a+0.019
+0.018
 and+0.018
 the+0.018
 big+0.017
Neg logit contributions
epad-0.021
inations-0.021
immer-0.020
rily-0.020
 avail-0.020
Token queen
Token ID16599
Feature activation+1.49e-01

Pos logit contributions
 a+0.005
 two+0.004
 big+0.004
 the+0.004
+0.004
Neg logit contributions
ittered-0.005
immer-0.005
rily-0.005
lights-0.004
epad-0.004
Token was
Token ID373
Feature activation+5.15e-01

Pos logit contributions
ittered+0.014
rily+0.014
immer+0.014
inations+0.013
fer+0.013
Neg logit contributions
 a-0.005
 big-0.005
 two-0.005
 the-0.005
-0.005
Token overth
Token ID34514
Feature activation+2.92e+00

Pos logit contributions
ittered+0.040
rily+0.040
immer+0.040
inations+0.038
lights+0.037
Neg logit contributions
 a-0.026
 two-0.025
 big-0.025
-0.024
 and-0.024
Tokenrown
Token ID2053
Feature activation-2.53e-01

Pos logit contributions
rily+0.114
 avail+0.111
lights+0.106
leness+0.106
inations+0.103
Neg logit contributions
 a-0.249
 two-0.249
 and-0.244
-0.244
.-0.238
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
 a+0.012
 big+0.011
 two+0.011
 the+0.011
+0.011
Neg logit contributions
ittered-0.020
rily-0.020
immer-0.020
ge-0.019
istent-0.019
Token
Token ID198
Feature activation+2.15e-01

Pos logit contributions
rily+0.019
ittered+0.018
immer+0.018
arians+0.018
rol+0.018
Neg logit contributions
 a-0.006
 big-0.006
 two-0.005
 and-0.005
 the-0.005
Token
Token ID198
Feature activation+3.33e-01

Pos logit contributions
rily+0.024
ittered+0.024
immer+0.024
inations+0.024
lor+0.024
Neg logit contributions
 a-0.003
 two-0.002
 big-0.002
 and-0.002
 the-0.002
TokenThe
Token ID464
Feature activation-3.00e-01

Pos logit contributions
immer+0.026
rily+0.026
ittered+0.026
inations+0.026
lor+0.025
Neg logit contributions
 a-0.017
 big-0.016
 and-0.015
 two-0.015
 the-0.015
TokenOne
Token ID3198
Feature activation-4.66e-01

Pos logit contributions
 a+0.014
 big+0.013
 the+0.013
 and+0.013
 two+0.013
Neg logit contributions
immer-0.013
ittered-0.013
rily-0.013
epad-0.012
inations-0.012
Token,
Token ID11
Feature activation-1.83e+00

Pos logit contributions
 a+0.010
 two+0.008
+0.007
 big+0.007
 and+0.007
Neg logit contributions
immer-0.047
lights-0.046
ittered-0.046
inations-0.045
adows-0.045
Token two
Token ID734
Feature activation-3.08e-01

Pos logit contributions
+0.029
.+0.023
 big+0.023
 and+0.023
 a+0.018
Neg logit contributions
 avail-0.167
icians-0.167
arians-0.166
plets-0.166
inations-0.165
Token,
Token ID11
Feature activation-2.06e+00

Pos logit contributions
 a+0.006
+0.005
 big+0.005
 two+0.005
 and+0.004
Neg logit contributions
ittered-0.032
immer-0.032
rily-0.032
inations-0.031
arians-0.031
Token three
Token ID1115
Feature activation-3.00e-01

Pos logit contributions
+0.059
 big+0.056
 and+0.053
.+0.052
 the+0.051
Neg logit contributions
icians-0.170
arians-0.170
inations-0.168
immer-0.168
comb-0.166
Token,
Token ID11
Feature activation-2.07e+00

Pos logit contributions
 a+0.007
 two+0.006
 big+0.006
+0.006
 the+0.005
Neg logit contributions
rily-0.030
ittered-0.030
inations-0.030
lights-0.030
immer-0.030
Token go
Token ID467
Feature activation-3.22e-01

Pos logit contributions
+0.063
 big+0.063
 the+0.056
 and+0.056
.+0.056
Neg logit contributions
immer-0.166
arians-0.166
icians-0.165
inations-0.163
comb-0.162
Token!"
Token ID2474
Feature activation-4.04e-01

Pos logit contributions
 a+0.014
 two+0.014
 big+0.013
+0.013
 little+0.012
Neg logit contributions
ittered-0.025
rily-0.025
inations-0.025
immer-0.025
ge-0.025
Token
Token ID198
Feature activation-1.57e-01

Pos logit contributions
 big+0.009
 two+0.009
 a+0.009
 little+0.008
 next+0.008
Neg logit contributions
rily-0.039
lights-0.037
immer-0.036
plets-0.036
ittered-0.036
Token
Token ID198
Feature activation+1.96e-01

Pos logit contributions
 a+0.002
 two+0.002
 big+0.002
 the+0.002
 and+0.002
Neg logit contributions
immer-0.017
rily-0.017
ittered-0.017
igi-0.016
ently-0.016
TokenThey
Token ID2990
Feature activation-3.62e-01

Pos logit contributions
immer+0.015
rily+0.015
ittered+0.015
fer+0.015
vation+0.014
Neg logit contributions
 a-0.010
 big-0.009
 two-0.009
 the-0.009
 and-0.009
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
 a+0.006
 two+0.005
 big+0.004
+0.004
 the+0.003
Neg logit contributions
rily-0.030
lights-0.030
ittered-0.029
immer-0.029
inations-0.029
Token "
Token ID366
Feature activation-2.11e-01

Pos logit contributions
rily+0.020
ittered+0.020
immer+0.020
lights+0.019
epad+0.019
Neg logit contributions
 a-0.009
 big-0.008
 two-0.008
 the-0.008
 and-0.008
TokenOne
Token ID3198
Feature activation-4.66e-01

Pos logit contributions
 a+0.014
 big+0.013
 the+0.013
 and+0.013
 two+0.013
Neg logit contributions
immer-0.013
ittered-0.013
rily-0.013
epad-0.012
inations-0.012
Token,
Token ID11
Feature activation-1.83e+00

Pos logit contributions
 a+0.010
 two+0.008
+0.007
 big+0.007
 and+0.007
Neg logit contributions
immer-0.047
lights-0.046
ittered-0.046
inations-0.045
adows-0.045
Token two
Token ID734
Feature activation-3.08e-01

Pos logit contributions
+0.029
.+0.023
 big+0.023
 and+0.023
 a+0.018
Neg logit contributions
 avail-0.167
icians-0.167
arians-0.166
plets-0.166
inations-0.165
Token,
Token ID11
Feature activation-2.06e+00

Pos logit contributions
 a+0.006
+0.005
 big+0.005
 two+0.005
 and+0.004
Neg logit contributions
ittered-0.032
immer-0.032
rily-0.032
inations-0.031
arians-0.031
Token three
Token ID1115
Feature activation-3.00e-01

Pos logit contributions
+0.059
 big+0.056
 and+0.053
.+0.052
 the+0.051
Neg logit contributions
icians-0.170
arians-0.170
inations-0.168
immer-0.168
comb-0.166
Token,
Token ID11
Feature activation-2.07e+00

Pos logit contributions
 a+0.007
 two+0.006
 big+0.006
+0.006
 the+0.005
Neg logit contributions
rily-0.030
ittered-0.030
inations-0.030
lights-0.030
immer-0.030
Token go
Token ID467
Feature activation-3.22e-01

Pos logit contributions
+0.063
 big+0.063
 the+0.056
 and+0.056
.+0.056
Neg logit contributions
immer-0.166
arians-0.166
icians-0.165
inations-0.163
comb-0.162
Token!"
Token ID2474
Feature activation-4.04e-01

Pos logit contributions
 a+0.014
 two+0.014
 big+0.013
+0.013
 little+0.012
Neg logit contributions
ittered-0.025
rily-0.025
inations-0.025
immer-0.025
ge-0.025
Token
Token ID198
Feature activation-1.57e-01

Pos logit contributions
 big+0.009
 two+0.009
 a+0.009
 little+0.008
 next+0.008
Neg logit contributions
rily-0.039
lights-0.037
immer-0.036
plets-0.036
ittered-0.036
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
 a+0.010
 two+0.010
 big+0.008
 little+0.008
+0.007
Neg logit contributions
rily-0.064
lights-0.063
ittered-0.062
immer-0.061
arians-0.060
Token "
Token ID366
Feature activation-3.04e-01

Pos logit contributions
rily+0.012
ittered+0.012
lights+0.012
immer+0.011
rol+0.011
Neg logit contributions
 a-0.006
 big-0.006
 two-0.006
 the-0.005
 and-0.005
TokenOne
Token ID3198
Feature activation-4.22e-01

Pos logit contributions
 a+0.018
 big+0.017
 the+0.017
 and+0.017
 two+0.016
Neg logit contributions
immer-0.020
ittered-0.020
rily-0.020
inations-0.020
epad-0.020
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
 a+0.009
 two+0.008
+0.007
 the+0.007
 and+0.007
Neg logit contributions
immer-0.042
inations-0.041
rated-0.041
ittered-0.040
adows-0.040
Token two
Token ID734
Feature activation-1.99e-01

Pos logit contributions
+0.043
.+0.037
 and+0.036
 big+0.036
 a+0.033
Neg logit contributions
arians-0.160
inations-0.159
 avail-0.159
comb-0.158
epad-0.157
Token,
Token ID11
Feature activation-2.03e+00

Pos logit contributions
 a+0.004
+0.004
 big+0.003
 two+0.003
 the+0.003
Neg logit contributions
ittered-0.021
immer-0.020
inations-0.020
rily-0.020
arians-0.020
Token three
Token ID1115
Feature activation-8.99e-02

Pos logit contributions
+0.064
 big+0.060
 and+0.056
.+0.056
 the+0.056
Neg logit contributions
arians-0.164
inations-0.163
icians-0.162
epad-0.161
immer-0.158
Token,
Token ID11
Feature activation-1.90e+00

Pos logit contributions
 a+0.003
 two+0.002
 big+0.002
+0.002
 the+0.002
Neg logit contributions
inations-0.009
ittered-0.009
rily-0.009
lights-0.009
immer-0.009
Token four
Token ID1440
Feature activation-1.19e-01

Pos logit contributions
+0.066
 big+0.064
 a+0.061
 the+0.060
.+0.058
Neg logit contributions
arians-0.147
icians-0.145
inations-0.145
immer-0.144
epad-0.142
Token,
Token ID11
Feature activation-1.72e+00

Pos logit contributions
 a+0.003
 big+0.003
+0.003
 two+0.002
 the+0.002
Neg logit contributions
ittered-0.012
inations-0.012
rily-0.012
immer-0.012
lights-0.012
Token five
Token ID1936
Feature activation-3.98e-02

Pos logit contributions
 big+0.073
+0.073
 a+0.071
 the+0.070
.+0.066
Neg logit contributions
arians-0.122
immer-0.119
icians-0.119
rily-0.117
ittered-0.117
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
 a+0.005
 two+0.004
 big+0.004
+0.004
 the+0.003
Neg logit contributions
rily-0.033
lights-0.032
ittered-0.032
inations-0.031
immer-0.031
Token "
Token ID366
Feature activation-7.13e-01

Pos logit contributions
 a+0.003
 two+0.003
 big+0.003
 the+0.003
 and+0.003
Neg logit contributions
ittered-0.011
rily-0.011
lights-0.011
immer-0.010
eway-0.010
TokenOne
Token ID3198
Feature activation-1.74e-01

Pos logit contributions
 a+0.044
 the+0.043
 big+0.042
 and+0.041
 little+0.041
Neg logit contributions
inations-0.043
rily-0.043
immer-0.043
ittered-0.042
icians-0.042
Token,
Token ID11
Feature activation-1.90e+00

Pos logit contributions
 a+0.003
 two+0.003
 big+0.003
+0.002
 the+0.002
Neg logit contributions
immer-0.018
ittered-0.018
lights-0.017
inations-0.017
rily-0.017
Token two
Token ID734
Feature activation-5.14e-02

Pos logit contributions
+0.038
.+0.031
 and+0.031
 a+0.028
 big+0.027
Neg logit contributions
icians-0.176
inations-0.176
arians-0.173
plets-0.171
lights-0.171
Token,
Token ID11
Feature activation-2.03e+00

Pos logit contributions
 a+0.001
+0.001
 big+0.001
 two+0.001
 the+0.001
Neg logit contributions
ittered-0.005
immer-0.005
rily-0.005
inations-0.005
arians-0.005
Token three
Token ID1115
Feature activation-1.96e-01

Pos logit contributions
+0.062
 and+0.054
 big+0.054
.+0.053
 the+0.052
Neg logit contributions
icians-0.169
inations-0.168
arians-0.167
immer-0.163
rily-0.162
Token!"
Token ID2474
Feature activation-7.55e-01

Pos logit contributions
 a+0.005
 two+0.005
 big+0.005
+0.004
 the+0.004
Neg logit contributions
rily-0.019
ittered-0.019
inations-0.019
lights-0.019
immer-0.019
Token
Token ID198
Feature activation-4.89e-01

Pos logit contributions
 big+0.022
 two+0.021
 a+0.021
 next+0.021
 right+0.020
Neg logit contributions
lights-0.065
rily-0.063
eway-0.060
ittered-0.060
plets-0.059
Token
Token ID198
Feature activation-3.50e-01

Pos logit contributions
 a+0.008
 two+0.008
 big+0.006
 and+0.006
 right+0.005
Neg logit contributions
rily-0.051
ittered-0.051
immer-0.050
lights-0.050
igi-0.049
TokenThey
Token ID2990
Feature activation-2.16e-01

Pos logit contributions
 a+0.016
 big+0.015
 and+0.014
 the+0.014
 two+0.014
Neg logit contributions
immer-0.027
ittered-0.026
igi-0.026
rily-0.026
lights-0.026
TokenOne
Token ID3198
Feature activation-3.18e-01

Pos logit contributions
immer+0.043
rily+0.043
ittered+0.043
arians+0.042
leness+0.042
Neg logit contributions
 a-0.040
 big-0.038
 the-0.037
 two-0.037
 and-0.036
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
 a+0.007
 two+0.006
+0.005
 big+0.005
 the+0.005
Neg logit contributions
immer-0.032
ittered-0.031
lights-0.031
inations-0.030
 avail-0.030
Token two
Token ID734
Feature activation+6.80e-02

Pos logit contributions
+0.029
 big+0.022
.+0.022
 and+0.022
 a+0.021
Neg logit contributions
arians-0.145
icians-0.145
epad-0.143
inations-0.143
rily-0.142
Token,
Token ID11
Feature activation-1.86e+00

Pos logit contributions
ittered+0.007
immer+0.007
rily+0.007
inations+0.007
arians+0.007
Neg logit contributions
 a-0.001
-0.001
 two-0.001
 big-0.001
 the-0.001
Token three
Token ID1115
Feature activation-1.67e-01

Pos logit contributions
+0.058
 big+0.052
 and+0.050
 a+0.050
.+0.049
Neg logit contributions
arians-0.154
icians-0.152
inations-0.151
immer-0.151
rily-0.150
Token,
Token ID11
Feature activation-2.00e+00

Pos logit contributions
 a+0.004
 two+0.003
 big+0.003
+0.003
 the+0.003
Neg logit contributions
rily-0.017
ittered-0.017
lights-0.017
inations-0.017
immer-0.017
Token four
Token ID1440
Feature activation-1.49e-01

Pos logit contributions
+0.069
 big+0.065
 a+0.061
 the+0.060
 was+0.059
Neg logit contributions
arians-0.157
immer-0.157
icians-0.155
rily-0.154
inations-0.153
Token,
Token ID11
Feature activation-1.69e+00

Pos logit contributions
 a+0.004
 two+0.003
 big+0.003
+0.003
 the+0.003
Neg logit contributions
ittered-0.015
rily-0.015
immer-0.015
inations-0.015
lights-0.015
Token five
Token ID1936
Feature activation-1.13e-01

Pos logit contributions
+0.075
 big+0.072
 a+0.071
 the+0.069
.+0.066
Neg logit contributions
immer-0.119
arians-0.118
rily-0.118
ittered-0.117
icians-0.116
Token..."
Token ID9313
Feature activation+3.88e-02

Pos logit contributions
 a+0.003
 two+0.002
+0.002
 big+0.002
 the+0.002
Neg logit contributions
rily-0.012
ittered-0.012
immer-0.011
inations-0.011
lights-0.011
Token She
Token ID1375
Feature activation-6.73e-01

Pos logit contributions
rily+0.004
immer+0.004
ittered+0.004
inations+0.004
lights+0.004
Neg logit contributions
 big-0.001
 a-0.001
 two-0.001
 the-0.001
 little-0.001
Token.
Token ID13
Feature activation-3.93e-02

Pos logit contributions
rily+0.002
immer+0.002
ittered+0.002
lights+0.002
epad+0.002
Neg logit contributions
 a-0.000
 two-0.000
 big-0.000
-0.000
 little-0.000
Token "
Token ID366
Feature activation-2.23e-01

Pos logit contributions
 a+0.001
 big+0.001
 two+0.001
 the+0.001
 and+0.001
Neg logit contributions
rily-0.004
immer-0.004
ittered-0.004
lights-0.004
 avail-0.004
TokenOne
Token ID3198
Feature activation-1.86e-02

Pos logit contributions
 a+0.012
 the+0.011
 big+0.011
 two+0.011
 and+0.011
Neg logit contributions
rily-0.016
immer-0.016
ittered-0.016
epad-0.016
arians-0.016
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
 a+0.001
 two+0.001
+0.001
 the+0.001
 big+0.001
Neg logit contributions
immer-0.002
inations-0.002
lights-0.002
ittered-0.002
adows-0.002
Token two
Token ID734
Feature activation+3.14e-01

Pos logit contributions
+0.030
.+0.027
 big+0.024
 and+0.024
 a+0.023
Neg logit contributions
arians-0.144
 avail-0.142
rily-0.141
icians-0.141
epad-0.140
Token,
Token ID11
Feature activation-1.98e+00

Pos logit contributions
ittered+0.034
rily+0.034
immer+0.034
inations+0.033
arians+0.033
Neg logit contributions
 a-0.006
-0.005
 two-0.005
 big-0.005
 the-0.005
Token three
Token ID1115
Feature activation+3.41e-01

Pos logit contributions
+0.059
 big+0.055
.+0.053
 a+0.051
 and+0.051
Neg logit contributions
arians-0.169
icians-0.166
rily-0.164
immer-0.163
inations-0.163
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
rily+0.033
ittered+0.033
inations+0.032
immer+0.032
lights+0.032
Neg logit contributions
 a-0.011
 two-0.010
 big-0.010
-0.009
 the-0.009
Token four
Token ID1440
Feature activation+3.55e-01

Pos logit contributions
+0.057
 big+0.056
.+0.052
 a+0.051
 the+0.048
Neg logit contributions
arians-0.159
immer-0.157
icians-0.156
rily-0.156
inations-0.154
Token,"
Token ID553
Feature activation-7.91e-01

Pos logit contributions
ittered+0.037
rily+0.037
immer+0.037
inations+0.037
lights+0.036
Neg logit contributions
 a-0.009
 big-0.007
 two-0.007
-0.007
 the-0.006
Token she
Token ID673
Feature activation-1.74e-01

Pos logit contributions
 big+0.027
 a+0.027
 and+0.027
 two+0.026
+0.024
Neg logit contributions
 avail-0.066
rily-0.063
lights-0.063
ittered-0.062
immer-0.061
TokenOne
Token ID3198
Feature activation-6.75e-02

Pos logit contributions
ittered+0.005
immer+0.005
rily+0.005
arians+0.005
inations+0.005
Neg logit contributions
 a-0.006
 big-0.006
 and-0.006
 the-0.006
 two-0.006
Token,
Token ID11
Feature activation-1.77e+00

Pos logit contributions
 a+0.001
 two+0.001
+0.001
 big+0.001
 the+0.001
Neg logit contributions
immer-0.007
inations-0.007
ittered-0.007
lights-0.007
 avail-0.007
Token two
Token ID734
Feature activation+2.07e-01

Pos logit contributions
+0.029
 and+0.024
 big+0.024
.+0.023
 a+0.020
Neg logit contributions
inations-0.171
icians-0.169
 avail-0.167
epad-0.167
rily-0.167
Token,
Token ID11
Feature activation-1.95e+00

Pos logit contributions
ittered+0.022
immer+0.022
rily+0.022
inations+0.022
lights+0.022
Neg logit contributions
 a-0.004
-0.003
 big-0.003
 two-0.003
 the-0.003
Token three
Token ID1115
Feature activation+5.04e-02

Pos logit contributions
+0.055
 big+0.052
 and+0.050
 the+0.047
.+0.047
Neg logit contributions
icians-0.167
inations-0.166
arians-0.166
immer-0.163
rily-0.162
Token,
Token ID11
Feature activation-1.96e+00

Pos logit contributions
rily+0.005
inations+0.005
ittered+0.005
immer+0.005
lights+0.005
Neg logit contributions
 a-0.002
 big-0.001
 two-0.001
-0.001
 the-0.001
Token four
Token ID1440
Feature activation+1.28e-01

Pos logit contributions
+0.060
 big+0.059
 a+0.053
 the+0.053
 and+0.053
Neg logit contributions
icians-0.163
arians-0.162
immer-0.162
inations-0.160
rily-0.159
Token,
Token ID11
Feature activation-1.74e+00

Pos logit contributions
ittered+0.012
rily+0.012
inations+0.012
immer+0.012
lights+0.012
Neg logit contributions
 a-0.004
 big-0.003
-0.003
 two-0.003
 the-0.003
Token five
Token ID1936
Feature activation+1.70e-01

Pos logit contributions
+0.070
 big+0.070
 a+0.067
 the+0.065
 and+0.063
Neg logit contributions
immer-0.129
arians-0.129
icians-0.128
rily-0.127
ittered-0.126
Token,
Token ID11
Feature activation-1.63e+00

Pos logit contributions
rily+0.016
ittered+0.016
inations+0.016
immer+0.016
lights+0.015
Neg logit contributions
 a-0.006
 big-0.005
 two-0.005
-0.005
 the-0.005
Token six
Token ID2237
Feature activation+2.10e-01

Pos logit contributions
 big+0.070
+0.070
 a+0.067
 the+0.065
.+0.063
Neg logit contributions
icians-0.116
arians-0.116
immer-0.115
rily-0.115
ittered-0.113
Token.
Token ID13
Feature activation-2.28e-01

Pos logit contributions
rily+0.019
immer+0.019
ittered+0.018
comb+0.018
lights+0.018
Neg logit contributions
 a-0.003
 two-0.002
 big-0.002
 the-0.002
-0.002
Token "
Token ID366
Feature activation-1.71e-01

Pos logit contributions
 a+0.007
 big+0.006
 two+0.006
 the+0.006
 and+0.006
Neg logit contributions
ittered-0.022
rily-0.021
immer-0.021
rol-0.021
keeping-0.021
TokenOne
Token ID3198
Feature activation-3.62e-01

Pos logit contributions
 a+0.010
 big+0.010
 the+0.010
 and+0.010
 two+0.009
Neg logit contributions
rily-0.011
immer-0.011
ittered-0.011
rol-0.011
epad-0.011
Token,
Token ID11
Feature activation-1.90e+00

Pos logit contributions
 a+0.009
 two+0.008
+0.007
 and+0.007
 big+0.007
Neg logit contributions
immer-0.036
ittered-0.034
rily-0.034
lights-0.034
rated-0.034
Token two
Token ID734
Feature activation-3.51e-01

Pos logit contributions
+0.037
.+0.033
 and+0.031
 big+0.029
 a+0.026
Neg logit contributions
rily-0.168
comb-0.166
 avail-0.165
 pounds-0.164
heit-0.163
Token,
Token ID11
Feature activation-1.95e+00

Pos logit contributions
 a+0.008
+0.007
 big+0.007
 two+0.006
 and+0.006
Neg logit contributions
immer-0.036
rily-0.036
ittered-0.035
inations-0.035
arians-0.034
Token three
Token ID1115
Feature activation-1.50e-01

Pos logit contributions
+0.051
 big+0.047
 and+0.045
.+0.045
 a+0.043
Neg logit contributions
rily-0.168
immer-0.165
inations-0.164
 pounds-0.164
icians-0.162
Token!"
Token ID2474
Feature activation-7.64e-01

Pos logit contributions
 a+0.005
 two+0.005
 big+0.005
+0.004
 the+0.004
Neg logit contributions
rily-0.014
ittered-0.014
immer-0.014
inations-0.014
lights-0.013
Token she
Token ID673
Feature activation-7.84e-02

Pos logit contributions
 a+0.032
 two+0.032
 big+0.032
 little+0.030
 and+0.029
Neg logit contributions
rily-0.057
lights-0.055
ittered-0.055
 avail-0.055
immer-0.054
Token said
Token ID531
Feature activation+4.86e-01

Pos logit contributions
 a+0.003
 big+0.003
 two+0.003
 the+0.003
+0.003
Neg logit contributions
rily-0.007
ittered-0.007
inations-0.007
immer-0.006
fer-0.006
Token.
Token ID13
Feature activation-3.87e-01

Pos logit contributions
rily+0.055
ittered+0.055
immer+0.054
lights+0.052
comb+0.052
Neg logit contributions
 a-0.007
 big-0.006
 two-0.006
-0.005
 the-0.005
Token
Token ID198
Feature activation-3.71e-01

Pos logit contributions
 a+0.008
 two+0.007
 big+0.006
 the+0.005
 and+0.005
Neg logit contributions
immer-0.049
rily-0.048
ittered-0.048
inations-0.048
lights-0.047
Token"
Token ID1
Feature activation+9.34e-02

Pos logit contributions
 a+0.016
 big+0.015
 the+0.015
 two+0.014
 and+0.014
Neg logit contributions
immer-0.031
rily-0.030
inations-0.029
leness-0.029
ge-0.029
TokenOne
Token ID3198
Feature activation-6.75e-02

Pos logit contributions
ittered+0.005
immer+0.005
rily+0.005
arians+0.005
inations+0.005
Neg logit contributions
 a-0.006
 big-0.006
 and-0.006
 the-0.006
 two-0.006
Token,
Token ID11
Feature activation-1.77e+00

Pos logit contributions
 a+0.001
 two+0.001
+0.001
 big+0.001
 the+0.001
Neg logit contributions
immer-0.007
inations-0.007
ittered-0.007
lights-0.007
 avail-0.007
Token two
Token ID734
Feature activation+2.07e-01

Pos logit contributions
+0.029
 and+0.024
 big+0.024
.+0.023
 a+0.020
Neg logit contributions
inations-0.171
icians-0.169
 avail-0.167
epad-0.167
rily-0.167
Token,
Token ID11
Feature activation-1.95e+00

Pos logit contributions
ittered+0.022
immer+0.022
rily+0.022
inations+0.022
lights+0.022
Neg logit contributions
 a-0.004
-0.003
 big-0.003
 two-0.003
 the-0.003
Token three
Token ID1115
Feature activation+5.04e-02

Pos logit contributions
+0.055
 big+0.052
 and+0.050
 the+0.047
.+0.047
Neg logit contributions
icians-0.167
inations-0.166
arians-0.166
immer-0.163
rily-0.162
Token,
Token ID11
Feature activation-1.96e+00

Pos logit contributions
rily+0.005
inations+0.005
ittered+0.005
immer+0.005
lights+0.005
Neg logit contributions
 a-0.002
 big-0.001
 two-0.001
-0.001
 the-0.001
Token four
Token ID1440
Feature activation+1.28e-01

Pos logit contributions
+0.060
 big+0.059
 a+0.053
 the+0.053
 and+0.053
Neg logit contributions
icians-0.163
arians-0.162
immer-0.162
inations-0.160
rily-0.159
Token,
Token ID11
Feature activation-1.74e+00

Pos logit contributions
ittered+0.012
rily+0.012
inations+0.012
immer+0.012
lights+0.012
Neg logit contributions
 a-0.004
 big-0.003
-0.003
 two-0.003
 the-0.003
Token five
Token ID1936
Feature activation+1.70e-01

Pos logit contributions
+0.070
 big+0.070
 a+0.067
 the+0.065
 and+0.063
Neg logit contributions
immer-0.129
arians-0.129
icians-0.128
rily-0.127
ittered-0.126
Token.
Token ID13
Feature activation-8.18e-02

Pos logit contributions
rily+0.061
ittered+0.061
lights+0.060
immer+0.060
inations+0.058
Neg logit contributions
 a-0.008
 two-0.008
 big-0.007
-0.006
 little-0.005
Token "
Token ID366
Feature activation-3.17e-01

Pos logit contributions
 a+0.002
 two+0.002
 big+0.002
 the+0.002
 and+0.002
Neg logit contributions
rily-0.008
lights-0.008
ittered-0.008
immer-0.008
inations-0.008
TokenOne
Token ID3198
Feature activation+6.43e-02

Pos logit contributions
 a+0.018
 the+0.018
 big+0.017
 two+0.017
 and+0.017
Neg logit contributions
immer-0.022
ittered-0.022
rily-0.021
inations-0.021
arians-0.021
Token,
Token ID11
Feature activation-1.79e+00

Pos logit contributions
immer+0.007
ittered+0.007
inations+0.007
lights+0.007
adows+0.006
Neg logit contributions
 a-0.001
 two-0.001
 the-0.001
-0.001
 big-0.001
Token two
Token ID734
Feature activation+1.73e-01

Pos logit contributions
+0.029
 and+0.022
.+0.022
 big+0.018
 the+0.017
Neg logit contributions
inations-0.177
icians-0.174
lights-0.171
arians-0.171
comb-0.170
Token,
Token ID11
Feature activation-1.93e+00

Pos logit contributions
ittered+0.018
immer+0.018
rily+0.018
inations+0.018
arians+0.018
Neg logit contributions
 a-0.004
-0.003
 two-0.003
 big-0.003
 the-0.003
Token three
Token ID1115
Feature activation+5.87e-02

Pos logit contributions
+0.051
 and+0.043
 big+0.042
.+0.042
 the+0.042
Neg logit contributions
inations-0.171
icians-0.169
arians-0.168
comb-0.166
immer-0.166
Token..."
Token ID9313
Feature activation-5.70e-01

Pos logit contributions
ittered+0.006
rily+0.006
inations+0.006
lights+0.006
immer+0.006
Neg logit contributions
 a-0.002
 two-0.001
 big-0.001
-0.001
 the-0.001
Token
Token ID198
Feature activation-2.34e-01

Pos logit contributions
 big+0.009
 a+0.008
 two+0.007
 the+0.007
 little+0.005
Neg logit contributions
rily-0.062
immer-0.061
lights-0.060
inations-0.059
ittered-0.059
Token
Token ID198
Feature activation+1.24e-01

Pos logit contributions
 a+0.003
 two+0.003
 big+0.002
 the+0.002
 and+0.002
Neg logit contributions
lights-0.025
rily-0.025
immer-0.025
ittered-0.025
inations-0.025
TokenL
Token ID43
Feature activation+9.05e-01

Pos logit contributions
immer+0.009
rily+0.009
lights+0.009
inations+0.009
ittered+0.009
Neg logit contributions
 a-0.006
 big-0.006
 the-0.006
 two-0.006
 and-0.006
TokenOne
Token ID3198
Feature activation-1.35e-02

Pos logit contributions
immer+0.003
rily+0.003
ittered+0.003
inations+0.003
fer+0.003
Neg logit contributions
 a-0.003
 big-0.003
 the-0.003
 two-0.003
 and-0.003
Token,
Token ID11
Feature activation-1.53e+00

Pos logit contributions
 a+0.000
 two+0.000
+0.000
 the+0.000
 big+0.000
Neg logit contributions
immer-0.001
lights-0.001
inations-0.001
ittered-0.001
rily-0.001
Token two
Token ID734
Feature activation-4.44e-02

Pos logit contributions
+0.023
 and+0.019
 big+0.016
.+0.015
 next+0.010
Neg logit contributions
lights-0.150
rily-0.149
inations-0.148
 avail-0.147
ittered-0.143
Token,
Token ID11
Feature activation-1.82e+00

Pos logit contributions
 a+0.001
+0.001
 big+0.001
 two+0.001
 the+0.001
Neg logit contributions
inations-0.005
immer-0.005
ittered-0.005
lights-0.004
arians-0.004
Token three
Token ID1115
Feature activation+4.10e-02

Pos logit contributions
+0.041
 big+0.036
 and+0.036
.+0.032
 the+0.030
Neg logit contributions
inations-0.168
icians-0.165
immer-0.164
rily-0.164
arians-0.164
Token,
Token ID11
Feature activation-1.92e+00

Pos logit contributions
inations+0.004
ittered+0.004
rily+0.004
lights+0.004
immer+0.004
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
-0.001
 the-0.001
Token four
Token ID1440
Feature activation-9.40e-03

Pos logit contributions
+0.047
 big+0.043
 and+0.039
.+0.038
 the+0.036
Neg logit contributions
inations-0.171
immer-0.169
icians-0.168
arians-0.167
rily-0.167
Token,"
Token ID553
Feature activation-6.61e-01

Pos logit contributions
 a+0.000
 big+0.000
+0.000
 two+0.000
 the+0.000
Neg logit contributions
ittered-0.001
inations-0.001
immer-0.001
lights-0.001
rily-0.001
Token she
Token ID673
Feature activation-4.20e-01

Pos logit contributions
 big+0.020
 a+0.020
 and+0.019
 two+0.019
 right+0.018
Neg logit contributions
 avail-0.055
lights-0.054
rily-0.054
ittered-0.052
inations-0.051
Token counts
Token ID9853
Feature activation-5.27e-02

Pos logit contributions
 a+0.024
 the+0.022
 big+0.022
 two+0.022
+0.021
Neg logit contributions
rily-0.027
inations-0.027
immer-0.027
ittered-0.027
arians-0.027
Token.
Token ID13
Feature activation-5.67e-01

Pos logit contributions
 a+0.001
 two+0.001
 big+0.001
 little+0.001
 and+0.001
Neg logit contributions
rily-0.006
ittered-0.006
immer-0.005
lights-0.005
ge-0.005
Token to
Token ID284
Feature activation-4.34e-01

Pos logit contributions
 two+0.001
 a+0.001
 big+0.001
 little+0.001
+0.001
Neg logit contributions
rily-0.008
ittered-0.008
lights-0.008
immer-0.007
ge-0.007
Token three
Token ID1115
Feature activation-2.98e-01

Pos logit contributions
 a+0.016
+0.016
 two+0.016
 big+0.016
 and+0.016
Neg logit contributions
ittered-0.035
rily-0.034
 avail-0.034
epad-0.033
immer-0.033
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
 a+0.005
 two+0.004
 big+0.004
+0.004
 the+0.003
Neg logit contributions
rily-0.033
lights-0.032
ittered-0.032
inations-0.031
immer-0.031
Token "
Token ID366
Feature activation-7.13e-01

Pos logit contributions
 a+0.003
 two+0.003
 big+0.003
 the+0.003
 and+0.003
Neg logit contributions
ittered-0.011
rily-0.011
lights-0.011
immer-0.010
eway-0.010
TokenOne
Token ID3198
Feature activation-1.74e-01

Pos logit contributions
 a+0.044
 the+0.043
 big+0.042
 and+0.041
 little+0.041
Neg logit contributions
inations-0.043
rily-0.043
immer-0.043
ittered-0.042
icians-0.042
Token,
Token ID11
Feature activation-1.90e+00

Pos logit contributions
 a+0.003
 two+0.003
 big+0.003
+0.002
 the+0.002
Neg logit contributions
immer-0.018
ittered-0.018
lights-0.017
inations-0.017
rily-0.017
Token two
Token ID734
Feature activation-5.14e-02

Pos logit contributions
+0.038
.+0.031
 and+0.031
 a+0.028
 big+0.027
Neg logit contributions
icians-0.176
inations-0.176
arians-0.173
plets-0.171
lights-0.171
Token,
Token ID11
Feature activation-2.03e+00

Pos logit contributions
 a+0.001
+0.001
 big+0.001
 two+0.001
 the+0.001
Neg logit contributions
ittered-0.005
immer-0.005
rily-0.005
inations-0.005
arians-0.005
Token three
Token ID1115
Feature activation-1.96e-01

Pos logit contributions
+0.062
 and+0.054
 big+0.054
.+0.053
 the+0.052
Neg logit contributions
icians-0.169
inations-0.168
arians-0.167
immer-0.163
rily-0.162
Token!"
Token ID2474
Feature activation-7.55e-01

Pos logit contributions
 a+0.005
 two+0.005
 big+0.005
+0.004
 the+0.004
Neg logit contributions
rily-0.019
ittered-0.019
inations-0.019
lights-0.019
immer-0.019
Token
Token ID198
Feature activation-4.89e-01

Pos logit contributions
 big+0.022
 two+0.021
 a+0.021
 next+0.021
 right+0.020
Neg logit contributions
lights-0.065
rily-0.063
eway-0.060
ittered-0.060
plets-0.059
TokenOne
Token ID3198
Feature activation-4.22e-01

Pos logit contributions
 a+0.018
 big+0.017
 the+0.017
 and+0.017
 two+0.016
Neg logit contributions
immer-0.020
ittered-0.020
rily-0.020
inations-0.020
epad-0.020
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
 a+0.009
 two+0.008
+0.007
 the+0.007
 and+0.007
Neg logit contributions
immer-0.042
inations-0.041
rated-0.041
ittered-0.040
adows-0.040
Token two
Token ID734
Feature activation-1.99e-01

Pos logit contributions
+0.043
.+0.037
 and+0.036
 big+0.036
 a+0.033
Neg logit contributions
arians-0.160
inations-0.159
 avail-0.159
comb-0.158
epad-0.157
Token,
Token ID11
Feature activation-2.03e+00

Pos logit contributions
 a+0.004
+0.004
 big+0.003
 two+0.003
 the+0.003
Neg logit contributions
ittered-0.021
immer-0.020
inations-0.020
rily-0.020
arians-0.020
Token three
Token ID1115
Feature activation-8.99e-02

Pos logit contributions
+0.064
 big+0.060
 and+0.056
.+0.056
 the+0.056
Neg logit contributions
arians-0.164
inations-0.163
icians-0.162
epad-0.161
immer-0.158
Token,
Token ID11
Feature activation-1.90e+00

Pos logit contributions
 a+0.003
 two+0.002
 big+0.002
+0.002
 the+0.002
Neg logit contributions
inations-0.009
ittered-0.009
rily-0.009
lights-0.009
immer-0.009
Token four
Token ID1440
Feature activation-1.19e-01

Pos logit contributions
+0.066
 big+0.064
 a+0.061
 the+0.060
.+0.058
Neg logit contributions
arians-0.147
icians-0.145
inations-0.145
immer-0.144
epad-0.142
Token,
Token ID11
Feature activation-1.72e+00

Pos logit contributions
 a+0.003
 big+0.003
+0.003
 two+0.002
 the+0.002
Neg logit contributions
ittered-0.012
inations-0.012
rily-0.012
immer-0.012
lights-0.012
Token five
Token ID1936
Feature activation-3.98e-02

Pos logit contributions
 big+0.073
+0.073
 a+0.071
 the+0.070
.+0.066
Neg logit contributions
arians-0.122
immer-0.119
icians-0.119
rily-0.117
ittered-0.117
Token,
Token ID11
Feature activation-1.60e+00

Pos logit contributions
 a+0.001
 two+0.001
+0.001
 big+0.001
 the+0.001
Neg logit contributions
inations-0.004
ittered-0.004
rily-0.004
immer-0.004
lights-0.004
Token six
Token ID2237
Feature activation-2.93e-02

Pos logit contributions
 big+0.070
+0.069
 a+0.068
 the+0.067
.+0.063
Neg logit contributions
arians-0.112
icians-0.110
immer-0.109
rily-0.109
ittered-0.108
Token her
Token ID607
Feature activation+1.10e-01

Pos logit contributions
 two+0.006
 big+0.006
 a+0.006
+0.005
 little+0.005
Neg logit contributions
rily-0.042
immer-0.041
 avail-0.041
ittered-0.041
comb-0.040
Token cone
Token ID27763
Feature activation+1.66e-01

Pos logit contributions
rily+0.008
ittered+0.008
immer+0.008
 footing+0.007
lights+0.007
Neg logit contributions
 a-0.006
 two-0.006
 the-0.006
-0.005
 big-0.005
Token.
Token ID13
Feature activation-2.28e-01

Pos logit contributions
rily+0.019
immer+0.019
ittered+0.018
comb+0.018
lights+0.018
Neg logit contributions
 a-0.003
 two-0.002
 big-0.002
 the-0.002
-0.002
Token "
Token ID366
Feature activation-1.71e-01

Pos logit contributions
 a+0.007
 big+0.006
 two+0.006
 the+0.006
 and+0.006
Neg logit contributions
ittered-0.022
rily-0.021
immer-0.021
rol-0.021
keeping-0.021
TokenOne
Token ID3198
Feature activation-3.62e-01

Pos logit contributions
 a+0.010
 big+0.010
 the+0.010
 and+0.010
 two+0.009
Neg logit contributions
rily-0.011
immer-0.011
ittered-0.011
rol-0.011
epad-0.011
Token,
Token ID11
Feature activation-1.90e+00

Pos logit contributions
 a+0.009
 two+0.008
+0.007
 and+0.007
 big+0.007
Neg logit contributions
immer-0.036
ittered-0.034
rily-0.034
lights-0.034
rated-0.034
Token two
Token ID734
Feature activation-3.51e-01

Pos logit contributions
+0.037
.+0.033
 and+0.031
 big+0.029
 a+0.026
Neg logit contributions
rily-0.168
comb-0.166
 avail-0.165
 pounds-0.164
heit-0.163
Token,
Token ID11
Feature activation-1.95e+00

Pos logit contributions
 a+0.008
+0.007
 big+0.007
 two+0.006
 and+0.006
Neg logit contributions
immer-0.036
rily-0.036
ittered-0.035
inations-0.035
arians-0.034
Token three
Token ID1115
Feature activation-1.50e-01

Pos logit contributions
+0.051
 big+0.047
 and+0.045
.+0.045
 a+0.043
Neg logit contributions
rily-0.168
immer-0.165
inations-0.164
 pounds-0.164
icians-0.162
Token!"
Token ID2474
Feature activation-7.64e-01

Pos logit contributions
 a+0.005
 two+0.005
 big+0.005
+0.004
 the+0.004
Neg logit contributions
rily-0.014
ittered-0.014
immer-0.014
inations-0.014
lights-0.013
Token she
Token ID673
Feature activation-7.84e-02

Pos logit contributions
 a+0.032
 two+0.032
 big+0.032
 little+0.030
 and+0.029
Neg logit contributions
rily-0.057
lights-0.055
ittered-0.055
 avail-0.055
immer-0.054
TokenOne
Token ID3198
Feature activation-1.86e-02

Pos logit contributions
 a+0.012
 the+0.011
 big+0.011
 two+0.011
 and+0.011
Neg logit contributions
rily-0.016
immer-0.016
ittered-0.016
epad-0.016
arians-0.016
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
 a+0.001
 two+0.001
+0.001
 the+0.001
 big+0.001
Neg logit contributions
immer-0.002
inations-0.002
lights-0.002
ittered-0.002
adows-0.002
Token two
Token ID734
Feature activation+3.14e-01

Pos logit contributions
+0.030
.+0.027
 big+0.024
 and+0.024
 a+0.023
Neg logit contributions
arians-0.144
 avail-0.142
rily-0.141
icians-0.141
epad-0.140
Token,
Token ID11
Feature activation-1.98e+00

Pos logit contributions
ittered+0.034
rily+0.034
immer+0.034
inations+0.033
arians+0.033
Neg logit contributions
 a-0.006
-0.005
 two-0.005
 big-0.005
 the-0.005
Token three
Token ID1115
Feature activation+3.41e-01

Pos logit contributions
+0.059
 big+0.055
.+0.053
 a+0.051
 and+0.051
Neg logit contributions
arians-0.169
icians-0.166
rily-0.164
immer-0.163
inations-0.163
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
rily+0.033
ittered+0.033
inations+0.032
immer+0.032
lights+0.032
Neg logit contributions
 a-0.011
 two-0.010
 big-0.010
-0.009
 the-0.009
Token four
Token ID1440
Feature activation+3.55e-01

Pos logit contributions
+0.057
 big+0.056
.+0.052
 a+0.051
 the+0.048
Neg logit contributions
arians-0.159
immer-0.157
icians-0.156
rily-0.156
inations-0.154
Token,"
Token ID553
Feature activation-7.91e-01

Pos logit contributions
ittered+0.037
rily+0.037
immer+0.037
inations+0.037
lights+0.036
Neg logit contributions
 a-0.009
 big-0.007
 two-0.007
-0.007
 the-0.006
Token she
Token ID673
Feature activation-1.74e-01

Pos logit contributions
 big+0.027
 a+0.027
 and+0.027
 two+0.026
+0.024
Neg logit contributions
 avail-0.066
rily-0.063
lights-0.063
ittered-0.062
immer-0.061
Token said
Token ID531
Feature activation+8.28e-01

Pos logit contributions
 a+0.006
 big+0.006
 the+0.006
 two+0.006
 and+0.006
Neg logit contributions
ittered-0.015
inations-0.015
immer-0.015
rily-0.015
arians-0.015
Token.
Token ID13
Feature activation-2.66e-01

Pos logit contributions
rily+0.095
ittered+0.095
immer+0.094
lights+0.091
inations+0.091
Neg logit contributions
 a-0.012
 two-0.011
 big-0.011
 the-0.009
 and-0.008
Token
Token ID198
Feature activation+6.61e-02

Pos logit contributions
rily+0.007
immer+0.007
ittered+0.007
inations+0.007
lights+0.007
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
 the-0.000
 and-0.000
Token"
Token ID1
Feature activation+1.24e-01

Pos logit contributions
immer+0.006
rily+0.006
inations+0.006
ittered+0.006
leness+0.006
Neg logit contributions
 a-0.003
 big-0.002
 two-0.002
 the-0.002
 and-0.002
TokenOne
Token ID3198
Feature activation-2.03e-01

Pos logit contributions
rily+0.007
immer+0.007
ittered+0.007
arians+0.007
inations+0.007
Neg logit contributions
 a-0.008
 big-0.008
 the-0.008
 and-0.008
 two-0.008
Token,
Token ID11
Feature activation-1.80e+00

Pos logit contributions
 a+0.004
 two+0.003
+0.003
 big+0.003
 the+0.003
Neg logit contributions
immer-0.021
inations-0.021
ittered-0.021
lights-0.020
rily-0.020
Token two
Token ID734
Feature activation+1.66e-01

Pos logit contributions
+0.028
 big+0.023
.+0.021
 and+0.020
 a+0.020
Neg logit contributions
rily-0.179
inations-0.178
arians-0.177
immer-0.176
icians-0.176
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
ittered+0.017
immer+0.017
rily+0.017
inations+0.017
arians+0.017
Neg logit contributions
 a-0.004
 big-0.003
-0.003
 two-0.003
 the-0.003
Token three
Token ID1115
Feature activation+2.82e-02

Pos logit contributions
+0.052
 big+0.048
 and+0.045
 a+0.045
.+0.044
Neg logit contributions
arians-0.165
inations-0.164
rily-0.164
icians-0.163
immer-0.163
Token,
Token ID11
Feature activation-1.79e+00

Pos logit contributions
rily+0.003
ittered+0.003
immer+0.003
inations+0.003
lights+0.003
Neg logit contributions
 a-0.001
 big-0.001
 two-0.001
-0.001
 the-0.001
Token four
Token ID1440
Feature activation+1.14e-01

Pos logit contributions
+0.059
 big+0.057
 a+0.053
 the+0.051
 and+0.050
Neg logit contributions
arians-0.148
immer-0.148
rily-0.147
inations-0.147
icians-0.146
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
rily+0.011
ittered+0.011
immer+0.011
inations+0.011
arians+0.010
Neg logit contributions
 a-0.004
 big-0.003
 two-0.003
-0.003
 the-0.003
Token five
Token ID1936
Feature activation+3.24e-02

Pos logit contributions
+0.068
 big+0.067
 a+0.065
 the+0.063
 and+0.061
Neg logit contributions
rily-0.111
immer-0.111
arians-0.110
icians-0.108
ittered-0.108
Token to
Token ID284
Feature activation-5.90e-01

Pos logit contributions
 and+0.016
 two+0.014
 a+0.014
+0.013
 little+0.011
Neg logit contributions
epad-0.070
inations-0.070
ittered-0.069
lights-0.067
rily-0.067
Token count
Token ID954
Feature activation-5.97e-01

Pos logit contributions
 a+0.037
 two+0.036
 big+0.035
+0.035
 little+0.034
Neg logit contributions
ittered-0.036
immer-0.036
inations-0.035
comb-0.034
 avail-0.034
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
 a+0.010
 two+0.010
 big+0.008
 little+0.008
+0.007
Neg logit contributions
rily-0.064
lights-0.063
ittered-0.062
immer-0.061
arians-0.060
Token "
Token ID366
Feature activation-3.04e-01

Pos logit contributions
rily+0.012
ittered+0.012
lights+0.012
immer+0.011
rol+0.011
Neg logit contributions
 a-0.006
 big-0.006
 two-0.006
 the-0.005
 and-0.005
TokenOne
Token ID3198
Feature activation-4.22e-01

Pos logit contributions
 a+0.018
 big+0.017
 the+0.017
 and+0.017
 two+0.016
Neg logit contributions
immer-0.020
ittered-0.020
rily-0.020
inations-0.020
epad-0.020
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
 a+0.009
 two+0.008
+0.007
 the+0.007
 and+0.007
Neg logit contributions
immer-0.042
inations-0.041
rated-0.041
ittered-0.040
adows-0.040
Token two
Token ID734
Feature activation-1.99e-01

Pos logit contributions
+0.043
.+0.037
 and+0.036
 big+0.036
 a+0.033
Neg logit contributions
arians-0.160
inations-0.159
 avail-0.159
comb-0.158
epad-0.157
Token,
Token ID11
Feature activation-2.03e+00

Pos logit contributions
 a+0.004
+0.004
 big+0.003
 two+0.003
 the+0.003
Neg logit contributions
ittered-0.021
immer-0.020
inations-0.020
rily-0.020
arians-0.020
Token three
Token ID1115
Feature activation-8.99e-02

Pos logit contributions
+0.064
 big+0.060
 and+0.056
.+0.056
 the+0.056
Neg logit contributions
arians-0.164
inations-0.163
icians-0.162
epad-0.161
immer-0.158
Token,
Token ID11
Feature activation-1.90e+00

Pos logit contributions
 a+0.003
 two+0.002
 big+0.002
+0.002
 the+0.002
Neg logit contributions
inations-0.009
ittered-0.009
rily-0.009
lights-0.009
immer-0.009
Token four
Token ID1440
Feature activation-1.19e-01

Pos logit contributions
+0.066
 big+0.064
 a+0.061
 the+0.060
.+0.058
Neg logit contributions
arians-0.147
icians-0.145
inations-0.145
immer-0.144
epad-0.142
Token.
Token ID13
Feature activation+5.80e-01

Pos logit contributions
 a+0.010
 two+0.010
 big+0.010
+0.009
 next+0.008
Neg logit contributions
rily-0.040
ittered-0.038
lights-0.038
immer-0.037
ants-0.035
Token "
Token ID366
Feature activation+1.50e-01

Pos logit contributions
rily+0.053
ittered+0.052
immer+0.052
lights+0.051
inations+0.050
Neg logit contributions
 a-0.022
 big-0.020
 two-0.020
 the-0.019
 and-0.019
TokenOne
Token ID3198
Feature activation+5.88e-02

Pos logit contributions
immer+0.010
rily+0.010
ittered+0.010
inations+0.010
leness+0.010
Neg logit contributions
 a-0.009
 big-0.009
 the-0.008
 and-0.008
 two-0.008
Token,
Token ID11
Feature activation-1.76e+00

Pos logit contributions
immer+0.006
ittered+0.006
inations+0.006
rily+0.006
lights+0.006
Neg logit contributions
 a-0.001
 two-0.001
-0.001
 big-0.001
 the-0.001
Token two
Token ID734
Feature activation+1.58e-01

Pos logit contributions
+0.031
.+0.024
 and+0.024
 big+0.021
 a+0.020
Neg logit contributions
rily-0.168
inations-0.168
icians-0.167
arians-0.167
 avail-0.165
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
ittered+0.017
immer+0.017
rily+0.017
inations+0.016
arians+0.016
Neg logit contributions
 a-0.003
-0.003
 big-0.003
 two-0.003
 the-0.003
Token three
Token ID1115
Feature activation+1.55e-01

Pos logit contributions
+0.052
 big+0.044
 and+0.044
.+0.043
 the+0.042
Neg logit contributions
arians-0.163
icians-0.163
rily-0.163
inations-0.162
immer-0.161
Token,
Token ID11
Feature activation-1.77e+00

Pos logit contributions
rily+0.014
ittered+0.014
inations+0.014
immer+0.014
leness+0.014
Neg logit contributions
 a-0.006
 big-0.005
 two-0.005
-0.005
 the-0.005
Token four
Token ID1440
Feature activation+8.41e-02

Pos logit contributions
+0.057
 big+0.053
.+0.049
 the+0.049
 a+0.049
Neg logit contributions
arians-0.146
rily-0.145
icians-0.145
immer-0.145
inations-0.143
Token,"
Token ID553
Feature activation-4.97e-01

Pos logit contributions
ittered+0.008
rily+0.008
immer+0.008
inations+0.008
leness+0.008
Neg logit contributions
 a-0.003
 big-0.002
 two-0.002
-0.002
 the-0.002
Token Ben
Token ID3932
Feature activation-2.30e-01

Pos logit contributions
 big+0.017
 a+0.015
 and+0.014
 two+0.013
 little+0.013
Neg logit contributions
rily-0.043
lights-0.041
 avail-0.041
inations-0.040
leness-0.040
Token.
Token ID13
Feature activation-1.33e-01

Pos logit contributions
 a+0.006
 two+0.006
 big+0.005
+0.005
 little+0.004
Neg logit contributions
rily-0.030
lights-0.029
ittered-0.029
immer-0.029
ge-0.028
Token "
Token ID366
Feature activation+8.07e-02

Pos logit contributions
 a+0.006
 big+0.005
 two+0.005
 the+0.005
 and+0.005
Neg logit contributions
rily-0.011
ittered-0.011
lights-0.011
comb-0.010
arians-0.010
TokenOne
Token ID3198
Feature activation-2.96e-01

Pos logit contributions
immer+0.006
rily+0.006
ittered+0.006
inations+0.006
arians+0.006
Neg logit contributions
 a-0.005
 the-0.004
 big-0.004
 and-0.004
 two-0.004
Token,
Token ID11
Feature activation-1.62e+00

Pos logit contributions
 a+0.008
 two+0.007
 the+0.006
 big+0.006
+0.006
Neg logit contributions
immer-0.029
ittered-0.028
inations-0.028
adows-0.028
lights-0.028
Token two
Token ID734
Feature activation-1.35e-01

Pos logit contributions
+0.039
.+0.035
 a+0.035
 and+0.034
 big+0.034
Neg logit contributions
arians-0.145
icians-0.143
inations-0.142
plets-0.142
rily-0.142
Token,
Token ID11
Feature activation-1.87e+00

Pos logit contributions
 a+0.005
 two+0.004
 big+0.004
+0.004
 the+0.004
Neg logit contributions
immer-0.012
ittered-0.012
rily-0.012
inations-0.012
arians-0.012
Token three
Token ID1115
Feature activation+8.78e-02

Pos logit contributions
+0.057
 big+0.053
 a+0.051
 and+0.051
.+0.051
Neg logit contributions
arians-0.158
icians-0.156
immer-0.155
inations-0.154
rily-0.153
Token,
Token ID11
Feature activation-1.85e+00

Pos logit contributions
ittered+0.007
rily+0.007
immer+0.007
inations+0.007
lights+0.007
Neg logit contributions
 a-0.004
 two-0.004
 big-0.003
-0.003
 the-0.003
Token four
Token ID1440
Feature activation+8.08e-03

Pos logit contributions
+0.060
 big+0.058
 a+0.055
 the+0.053
.+0.053
Neg logit contributions
arians-0.151
immer-0.149
icians-0.149
rily-0.147
inations-0.147
Token,"
Token ID553
Feature activation-7.23e-01

Pos logit contributions
ittered+0.001
immer+0.001
rily+0.001
inations+0.001
lights+0.001
Neg logit contributions
 a-0.000
 two-0.000
 big-0.000
-0.000
 the-0.000
Token she
Token ID673
Feature activation-5.83e-01

Pos logit contributions
 big+0.026
 a+0.025
 and+0.025
 two+0.025
 right+0.023
Neg logit contributions
rily-0.054
comb-0.054
 avail-0.053
lights-0.053
ittered-0.051
TokenOne
Token ID3198
Feature activation-4.79e-01

Pos logit contributions
immer+0.025
ittered+0.025
rily+0.024
vation+0.024
inations+0.023
Neg logit contributions
 a-0.029
 big-0.028
 the-0.028
 and-0.028
 two-0.027
Token,
Token ID11
Feature activation-1.64e+00

Pos logit contributions
 a+0.010
 two+0.009
+0.008
 big+0.008
 and+0.007
Neg logit contributions
immer-0.048
lights-0.047
ently-0.047
rated-0.046
adows-0.046
Token two
Token ID734
Feature activation-2.92e-01

Pos logit contributions
+0.025
 big+0.020
.+0.019
 and+0.017
 to+0.015
Neg logit contributions
inations-0.153
rily-0.153
 avail-0.152
icians-0.152
lights-0.152
Token,
Token ID11
Feature activation-1.87e+00

Pos logit contributions
 a+0.006
+0.005
 big+0.005
 two+0.005
 the+0.005
Neg logit contributions
immer-0.030
ittered-0.030
rily-0.030
inations-0.030
lights-0.029
Token three
Token ID1115
Feature activation-3.47e-01

Pos logit contributions
+0.045
 big+0.042
.+0.038
 and+0.037
 the+0.034
Neg logit contributions
icians-0.166
inations-0.165
arians-0.165
immer-0.164
rily-0.163
Token,
Token ID11
Feature activation-1.87e+00

Pos logit contributions
 a+0.009
 two+0.007
 big+0.007
+0.007
 the+0.006
Neg logit contributions
rily-0.035
inations-0.035
ittered-0.035
lights-0.035
immer-0.034
Token four
Token ID1440
Feature activation-3.25e-01

Pos logit contributions
+0.051
 big+0.049
.+0.043
 and+0.041
 a+0.041
Neg logit contributions
immer-0.160
arians-0.159
icians-0.158
rily-0.158
inations-0.158
Token,
Token ID11
Feature activation-1.63e+00

Pos logit contributions
 a+0.009
 big+0.007
+0.007
 two+0.007
 the+0.006
Neg logit contributions
rily-0.032
ittered-0.032
immer-0.032
inations-0.032
lights-0.032
Token five
Token ID1936
Feature activation-2.54e-01

Pos logit contributions
+0.065
 big+0.064
 a+0.060
 the+0.059
.+0.059
Neg logit contributions
immer-0.123
rily-0.122
arians-0.119
ittered-0.119
comb-0.119
Token,
Token ID11
Feature activation-1.51e+00

Pos logit contributions
 a+0.007
+0.006
 two+0.006
 big+0.006
 the+0.006
Neg logit contributions
rily-0.024
inations-0.024
ittered-0.024
immer-0.024
lights-0.024
Token six
Token ID2237
Feature activation-3.04e-01

Pos logit contributions
 big+0.064
+0.064
 a+0.060
.+0.058
 the+0.058
Neg logit contributions
rily-0.109
immer-0.109
icians-0.107
ittered-0.106
arians-0.106
Token noticed
Token ID6810
Feature activation+3.45e-01

Pos logit contributions
ittered+0.046
rily+0.045
immer+0.045
inations+0.044
leness+0.044
Neg logit contributions
 a-0.029
 two-0.027
 big-0.027
-0.026
 the-0.026
Token something
Token ID1223
Feature activation+2.65e-01

Pos logit contributions
rily+0.036
ittered+0.036
immer+0.035
lights+0.035
inations+0.034
Neg logit contributions
 a-0.007
 big-0.007
 two-0.007
-0.006
 and-0.006
Token expensive
Token ID5789
Feature activation+7.39e-01

Pos logit contributions
ittered+0.024
rily+0.024
immer+0.024
inations+0.023
lights+0.023
Neg logit contributions
 a-0.009
 two-0.009
-0.008
 big-0.008
 the-0.008
Token shining
Token ID22751
Feature activation+8.37e-01

Pos logit contributions
rily+0.083
ittered+0.083
immer+0.082
lights+0.080
inations+0.079
Neg logit contributions
 a-0.013
 two-0.011
 big-0.011
-0.010
 the-0.009
Token and
Token ID290
Feature activation+3.95e-01

Pos logit contributions
rily+0.089
immer+0.088
ittered+0.087
lights+0.085
 avail+0.083
Neg logit contributions
 a-0.021
 two-0.019
 big-0.018
-0.018
 the-0.017
Token gl
Token ID1278
Feature activation+2.24e+00

Pos logit contributions
ittered+0.034
rily+0.034
immer+0.034
lights+0.033
 avail+0.033
Neg logit contributions
 a-0.016
 two-0.015
 and-0.015
-0.015
 big-0.015
Tokenint
Token ID600
Feature activation+4.60e-01

Pos logit contributions
 avail+0.036
leness+0.033
keeping+0.032
arians+0.030
inations+0.030
Neg logit contributions
 a-0.253
 and-0.245
 big-0.243
 two-0.242
.-0.239
Tokening
Token ID278
Feature activation+2.01e-01

Pos logit contributions
ittered+0.028
rily+0.028
immer+0.027
lights+0.026
rol+0.025
Neg logit contributions
 a-0.033
 big-0.031
 two-0.031
-0.031
 the-0.030
Token in
Token ID287
Feature activation-1.65e-01

Pos logit contributions
rily+0.021
ittered+0.021
immer+0.021
lights+0.020
ge+0.019
Neg logit contributions
 a-0.005
 two-0.005
-0.005
 big-0.004
 the-0.004
Token the
Token ID262
Feature activation+1.75e-01

Pos logit contributions
 a+0.002
+0.002
 two+0.002
 big+0.002
 and+0.002
Neg logit contributions
rily-0.018
 avail-0.018
immer-0.018
lights-0.018
ittered-0.017
Token sun
Token ID4252
Feature activation-2.74e-02

Pos logit contributions
rily+0.013
immer+0.013
lights+0.012
acket+0.012
ittered+0.012
Neg logit contributions
 a-0.010
 two-0.009
-0.009
 the-0.009
 and-0.009
Token friend
Token ID1545
Feature activation-6.78e-02

Pos logit contributions
 the+0.025
 a+0.025
 two+0.022
+0.022
 and+0.022
Neg logit contributions
ittered-0.056
immer-0.055
rily-0.055
inations-0.055
lights-0.054
Token,
Token ID11
Feature activation+4.00e-01

Pos logit contributions
 big+0.001
 a+0.001
 two+0.001
 next+0.001
 and+0.001
Neg logit contributions
ittered-0.007
immer-0.006
rily-0.006
 avail-0.006
inations-0.006
Token a
Token ID257
Feature activation-1.47e-01

Pos logit contributions
ittered+0.037
rily+0.037
inations+0.036
immer+0.036
lights+0.036
Neg logit contributions
 big-0.012
 a-0.011
 two-0.011
 and-0.010
 the-0.010
Token dog
Token ID3290
Feature activation-1.47e-01

Pos logit contributions
 a+0.003
 two+0.003
 the+0.003
 and+0.003
+0.002
Neg logit contributions
rily-0.015
ittered-0.015
fer-0.015
immer-0.014
inations-0.014
Token called
Token ID1444
Feature activation+6.90e-02

Pos logit contributions
 two+0.009
 a+0.009
 big+0.009
 the+0.008
 little+0.008
Neg logit contributions
ittered-0.009
immer-0.009
arians-0.009
rily-0.009
inations-0.009
Token Tri
Token ID7563
Feature activation+2.40e+00

Pos logit contributions
rily+0.005
ittered+0.005
immer+0.005
 avail+0.005
arians+0.005
Neg logit contributions
 big-0.003
 a-0.003
 two-0.003
 and-0.003
-0.003
Tokenx
Token ID87
Feature activation+6.38e-01

Pos logit contributions
rily+0.096
ittered+0.093
immer+0.092
lights+0.091
leness+0.090
Neg logit contributions
 a-0.215
 two-0.209
 and-0.207
 the-0.205
 big-0.204
Token.
Token ID13
Feature activation+2.12e-01

Pos logit contributions
ittered+0.031
rily+0.030
immer+0.030
inations+0.029
lights+0.028
Neg logit contributions
 a-0.053
 big-0.051
 two-0.050
-0.049
 the-0.049
Token Mill
Token ID9212
Feature activation+8.46e-01

Pos logit contributions
ittered+0.018
rily+0.018
immer+0.018
lights+0.017
 avail+0.017
Neg logit contributions
 a-0.009
 two-0.009
 big-0.009
 the-0.008
 and-0.008
Tokenie
Token ID494
Feature activation-6.88e-01

Pos logit contributions
ittered+0.038
rily+0.038
immer+0.037
inations+0.035
lights+0.034
Neg logit contributions
 a-0.074
 two-0.071
 big-0.070
 the-0.069
-0.068
Token wanted
Token ID2227
Feature activation-4.59e-01

Pos logit contributions
 a+0.023
 two+0.023
 big+0.022
 the+0.020
 next+0.018
Neg logit contributions
ittered-0.060
inations-0.058
arians-0.057
istent-0.057
leness-0.057
Token in
Token ID287
Feature activation-5.20e-01

Pos logit contributions
rily+0.045
ittered+0.044
immer+0.044
lights+0.042
ge+0.042
Neg logit contributions
 a-0.008
 two-0.008
 big-0.008
-0.007
 the-0.006
Token the
Token ID262
Feature activation-5.19e-01

Pos logit contributions
 big+0.003
 two+0.003
 a+0.003
+0.002
 and+0.002
Neg logit contributions
rily-0.056
ge-0.056
keeping-0.056
 avail-0.056
ittered-0.055
Token fence
Token ID13990
Feature activation-2.09e-01

Pos logit contributions
 a+0.037
 the+0.035
+0.035
 and+0.035
 two+0.035
Neg logit contributions
keeping-0.024
stack-0.024
comb-0.024
arians-0.024
rated-0.024
Token.
Token ID13
Feature activation-1.27e-01

Pos logit contributions
 a+0.003
 two+0.003
 big+0.003
+0.002
 the+0.002
Neg logit contributions
rily-0.023
ittered-0.023
immer-0.023
lights-0.022
ently-0.022
Token He
Token ID679
Feature activation+5.56e-02

Pos logit contributions
 a+0.004
 two+0.003
 big+0.003
 and+0.003
 the+0.003
Neg logit contributions
ittered-0.012
rily-0.012
lights-0.012
immer-0.012
istent-0.012
Token wh
Token ID348
Feature activation+2.14e+00

Pos logit contributions
ittered+0.004
rily+0.004
arians+0.003
inations+0.003
lights+0.003
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
-0.003
 the-0.003
Tokenines
Token ID1127
Feature activation-3.48e-01

Pos logit contributions
rily+0.089
ittered+0.082
immer+0.082
 avail+0.080
arians+0.080
Neg logit contributions
 a-0.196
 two-0.191
 and-0.190
 big-0.189
 the-0.188
Token and
Token ID290
Feature activation-1.71e-01

Pos logit contributions
 two+0.004
 a+0.004
 big+0.004
+0.003
 the+0.002
Neg logit contributions
rily-0.039
ittered-0.038
lights-0.038
immer-0.038
ently-0.037
Token w
Token ID266
Feature activation+1.35e+00

Pos logit contributions
 a+0.011
 two+0.011
 big+0.011
+0.011
 and+0.011
Neg logit contributions
ittered-0.010
rily-0.010
leness-0.009
istent-0.009
heit-0.009
Tokenags
Token ID3775
Feature activation+3.71e-01

Pos logit contributions
rily+0.058
ittered+0.057
immer+0.057
inations+0.054
lights+0.052
Neg logit contributions
 a-0.121
 two-0.116
 big-0.116
 the-0.115
 and-0.114
Token his
Token ID465
Feature activation-9.94e-02

Pos logit contributions
ittered+0.037
rily+0.036
immer+0.035
lights+0.035
eway+0.035
Neg logit contributions
 two-0.011
 big-0.010
 a-0.010
 little-0.009
 and-0.009
Token
Token ID198
Feature activation+4.25e-01

Pos logit contributions
 a+0.002
 two+0.002
 big+0.002
 the+0.001
 and+0.001
Neg logit contributions
ittered-0.018
immer-0.018
rily-0.017
inations-0.017
ently-0.017
Token
Token ID198
Feature activation+6.53e-01

Pos logit contributions
ittered+0.050
rily+0.050
immer+0.049
inations+0.049
leness+0.048
Neg logit contributions
 a-0.005
 two-0.004
 big-0.003
 the-0.002
 and-0.002
TokenThe
Token ID464
Feature activation-3.84e-01

Pos logit contributions
ittered+0.048
immer+0.047
rily+0.047
inations+0.047
leness+0.045
Neg logit contributions
 a-0.037
 two-0.035
 big-0.035
 the-0.034
 and-0.034
Token white
Token ID2330
Feature activation-1.96e-01

Pos logit contributions
 a+0.014
+0.013
 the+0.012
 and+0.012
 two+0.012
Neg logit contributions
immer-0.032
rily-0.032
ittered-0.032
rated-0.031
comb-0.031
Token cow
Token ID9875
Feature activation-2.56e-01

Pos logit contributions
 a+0.012
 two+0.011
+0.011
 the+0.011
 big+0.011
Neg logit contributions
rily-0.013
ittered-0.013
immer-0.012
comb-0.012
lights-0.012
Token che
Token ID1125
Feature activation+2.39e+00

Pos logit contributions
 a+0.009
 big+0.008
 two+0.008
+0.007
 the+0.007
Neg logit contributions
ittered-0.024
rily-0.023
comb-0.023
immer-0.023
inations-0.022
Tokenwed
Token ID19103
Feature activation-2.77e-01

Pos logit contributions
rily+0.102
 avail+0.098
arians+0.097
lights+0.097
immer+0.096
Neg logit contributions
 a-0.211
 two-0.205
 and-0.201
 the-0.198
 big-0.195
Token and
Token ID290
Feature activation+5.09e-01

Pos logit contributions
 a+0.006
 two+0.006
 big+0.006
+0.005
 little+0.005
Neg logit contributions
rily-0.028
ittered-0.027
immer-0.027
ently-0.026
lights-0.026
Token che
Token ID1125
Feature activation+2.54e+00

Pos logit contributions
ittered+0.039
rily+0.037
immer+0.037
inations+0.037
leness+0.036
Neg logit contributions
 a-0.026
 big-0.025
 two-0.025
-0.024
 and-0.024
Tokenwed
Token ID19103
Feature activation-2.71e-01

Pos logit contributions
rily+0.108
arians+0.107
 avail+0.106
lights+0.105
vation+0.105
Neg logit contributions
 a-0.222
 two-0.212
 and-0.211
 the-0.209
 big-0.204
Token on
Token ID319
Feature activation-1.96e-01

Pos logit contributions
 a+0.010
 two+0.010
 big+0.010
 little+0.009
+0.009
Neg logit contributions
rily-0.024
ittered-0.023
immer-0.023
ently-0.022
ge-0.022
Token dad
Token ID9955
Feature activation-3.09e-01

Pos logit contributions
rily+0.040
ittered+0.039
 avail+0.039
immer+0.039
lights+0.039
Neg logit contributions
 a-0.023
 two-0.021
-0.021
 big-0.021
 the-0.021
Token and
Token ID290
Feature activation+6.41e-02

Pos logit contributions
 a+0.005
 big+0.005
 two+0.004
+0.004
 the+0.003
Neg logit contributions
rily-0.034
immer-0.034
ittered-0.034
ge-0.033
istent-0.033
Token felt
Token ID2936
Feature activation+3.60e-01

Pos logit contributions
ittered+0.005
rily+0.005
immer+0.005
 avail+0.005
fer+0.005
Neg logit contributions
 a-0.003
 big-0.003
 two-0.003
-0.003
 the-0.002
Token a
Token ID257
Feature activation+4.31e-01

Pos logit contributions
rily+0.033
ittered+0.032
immer+0.032
leness+0.031
inations+0.031
Neg logit contributions
 two-0.013
 a-0.012
 big-0.012
 and-0.012
-0.011
Token bit
Token ID1643
Feature activation+1.57e-01

Pos logit contributions
immer+0.040
rily+0.040
ittered+0.040
inations+0.039
istent+0.038
Neg logit contributions
 a-0.016
 two-0.014
 the-0.014
-0.014
 and-0.013
Token bra
Token ID8290
Feature activation+2.37e+00

Pos logit contributions
immer+0.012
rily+0.012
ittered+0.011
leness+0.011
istent+0.011
Neg logit contributions
 a-0.008
 two-0.008
 the-0.008
-0.008
 big-0.008
Tokenver
Token ID332
Feature activation+1.34e-01

Pos logit contributions
lights+0.065
rily+0.063
 avail+0.063
inations+0.062
ittered+0.062
Neg logit contributions
 a-0.245
 two-0.238
 and-0.235
 the-0.232
 big-0.231
Token.
Token ID13
Feature activation+6.19e-01

Pos logit contributions
rily+0.015
immer+0.014
ge+0.014
ittered+0.014
keeping+0.014
Neg logit contributions
 a-0.003
 big-0.003
 two-0.002
 the-0.002
-0.002
Token She
Token ID1375
Feature activation-3.55e-01

Pos logit contributions
rily+0.055
ittered+0.054
immer+0.053
lights+0.052
istent+0.052
Neg logit contributions
 a-0.026
 big-0.025
 two-0.024
 the-0.023
 and-0.022
Token held
Token ID2714
Feature activation+3.46e-01

Pos logit contributions
 a+0.015
 big+0.014
 two+0.014
 the+0.014
+0.013
Neg logit contributions
ittered-0.030
inations-0.029
immer-0.028
rily-0.028
comb-0.027
Token his
Token ID465
Feature activation+3.51e-01

Pos logit contributions
rily+0.032
ittered+0.032
immer+0.031
 avail+0.030
ently+0.030
Neg logit contributions
 a-0.012
 big-0.012
 two-0.011
-0.011
 and-0.011
Token a
Token ID257
Feature activation-1.92e-01

Pos logit contributions
 a+0.001
 two+0.001
 big+0.001
+0.001
 and+0.001
Neg logit contributions
rily-0.003
ittered-0.003
inations-0.003
lights-0.003
epad-0.003
Token T
Token ID309
Feature activation+7.35e-01

Pos logit contributions
 a+0.012
 the+0.011
 two+0.011
 and+0.010
+0.010
Neg logit contributions
rily-0.012
inations-0.012
ittered-0.012
immer-0.012
epad-0.012
Token-
Token ID12
Feature activation-1.44e-01

Pos logit contributions
ittered+0.038
rily+0.038
immer+0.038
inations+0.036
ides+0.035
Neg logit contributions
 a-0.059
 two-0.057
 big-0.056
 the-0.056
-0.054
Tokenrex
Token ID21510
Feature activation-4.46e-01

Pos logit contributions
 a+0.013
 two+0.013
 the+0.012
 big+0.012
 and+0.012
Neg logit contributions
ittered-0.006
immer-0.006
rily-0.006
inations-0.005
epad-0.005
Token and
Token ID290
Feature activation-2.64e-01

Pos logit contributions
 a+0.008
 two+0.007
 big+0.006
+0.006
 the+0.006
Neg logit contributions
ittered-0.047
rily-0.047
arians-0.046
immer-0.046
ides-0.046
Token pret
Token ID2181
Feature activation+9.34e-01

Pos logit contributions
 a+0.011
 two+0.010
 big+0.010
+0.010
 and+0.010
Neg logit contributions
rily-0.021
ittered-0.021
leness-0.020
icians-0.020
epad-0.020
Tokenends
Token ID2412
Feature activation+4.86e-01

Pos logit contributions
ittered+0.044
rily+0.044
immer+0.043
inations+0.041
arians+0.040
Neg logit contributions
 a-0.079
 two-0.076
 big-0.076
 the-0.075
 and-0.074
Token to
Token ID284
Feature activation+2.76e-01

Pos logit contributions
rily+0.053
ittered+0.053
inations+0.052
lights+0.051
immer+0.051
Neg logit contributions
 a-0.009
 two-0.009
 big-0.007
 the-0.006
 and-0.006
Token chase
Token ID15505
Feature activation-7.18e-01

Pos logit contributions
immer+0.022
ittered+0.022
rily+0.022
inations+0.022
 avail+0.021
Neg logit contributions
 a-0.013
 two-0.013
 big-0.012
 and-0.011
 the-0.011
Token his
Token ID465
Feature activation+8.61e-02

Pos logit contributions
 two+0.025
 a+0.024
 big+0.022
+0.021
 next+0.020
Neg logit contributions
rily-0.065
ittered-0.061
leness-0.060
keeping-0.059
ge-0.059
Token toys
Token ID14958
Feature activation-5.14e-01

Pos logit contributions
rily+0.005
ittered+0.005
arians+0.005
inations+0.005
immer+0.005
Neg logit contributions
 a-0.006
 two-0.005
-0.005
 the-0.005
 big-0.005
Token looking
Token ID2045
Feature activation+3.59e-01

Pos logit contributions
rily+0.071
ittered+0.070
immer+0.069
inations+0.068
 avail+0.066
Neg logit contributions
 a-0.042
 two-0.040
 big-0.039
-0.038
 and-0.038
Token.
Token ID13
Feature activation+8.49e-02

Pos logit contributions
rily+0.040
immer+0.039
ittered+0.039
inations+0.038
lights+0.037
Neg logit contributions
 a-0.007
 two-0.006
 big-0.005
-0.005
 the-0.005
Token
Token ID198
Feature activation+1.90e-01

Pos logit contributions
rily+0.007
immer+0.006
ittered+0.006
istent+0.006
acket+0.006
Neg logit contributions
 a-0.004
 big-0.004
 two-0.004
 the-0.004
 and-0.004
Token
Token ID198
Feature activation+3.72e-01

Pos logit contributions
immer+0.022
rily+0.021
ittered+0.021
inations+0.021
lor+0.021
Neg logit contributions
 a-0.003
 two-0.002
 big-0.002
 the-0.002
 and-0.001
TokenHe
Token ID1544
Feature activation-9.63e-02

Pos logit contributions
immer+0.033
rily+0.033
ittered+0.032
inations+0.032
lor+0.031
Neg logit contributions
 a-0.016
 two-0.014
 big-0.014
 the-0.014
 and-0.013
Token sc
Token ID629
Feature activation+1.38e+00

Pos logit contributions
 a+0.004
 big+0.004
 two+0.004
+0.004
 the+0.004
Neg logit contributions
rily-0.008
ittered-0.008
immer-0.007
inations-0.007
comb-0.007
Tokenur
Token ID333
Feature activation+2.09e+00

Pos logit contributions
rily+0.052
ittered+0.052
immer+0.051
inations+0.048
lights+0.047
Neg logit contributions
 a-0.131
 two-0.126
 big-0.125
 the-0.124
 and-0.123
Tokenried
Token ID2228
Feature activation-5.16e-01

Pos logit contributions
immer+0.067
rily+0.065
istent+0.064
inations+0.064
leness+0.064
Neg logit contributions
 a-0.208
 two-0.204
 the-0.200
 and-0.199
 big-0.197
Token to
Token ID284
Feature activation-8.36e-02

Pos logit contributions
 a+0.023
 big+0.022
 two+0.022
+0.022
 little+0.021
Neg logit contributions
rily-0.041
ittered-0.039
immer-0.038
keeping-0.037
rol-0.037
Token the
Token ID262
Feature activation-6.72e-02

Pos logit contributions
 two+0.001
 and+0.001
 big+0.001
 a+0.001
+0.001
Neg logit contributions
rily-0.009
 avail-0.009
immer-0.009
ittered-0.009
keeping-0.009
Token back
Token ID736
Feature activation-7.05e-02

Pos logit contributions
 a+0.004
+0.004
 the+0.004
 and+0.004
 two+0.003
Neg logit contributions
rily-0.005
keeping-0.005
comb-0.004
arians-0.004
stack-0.004
Token the
Token ID262
Feature activation-1.91e-01

Pos logit contributions
rily+0.035
ittered+0.035
lights+0.035
immer+0.034
inations+0.033
Neg logit contributions
 a-0.008
 two-0.007
 big-0.007
 and-0.007
-0.007
Token dump
Token ID10285
Feature activation+1.81e-01

Pos logit contributions
 a+0.010
+0.009
 the+0.009
 and+0.009
 two+0.009
Neg logit contributions
rily-0.013
keeping-0.013
arians-0.013
ittered-0.013
immer-0.013
Tokenster
Token ID1706
Feature activation+1.29e-01

Pos logit contributions
rily+0.015
ittered+0.015
immer+0.015
lights+0.015
fer+0.014
Neg logit contributions
 a-0.008
 two-0.008
 big-0.008
-0.008
 the-0.007
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
rily+0.014
immer+0.014
ittered+0.014
comb+0.013
lights+0.013
Neg logit contributions
 a-0.003
 two-0.002
 big-0.002
-0.002
 the-0.002
Token She
Token ID1375
Feature activation+3.30e-01

Pos logit contributions
rily+0.014
ittered+0.014
immer+0.014
keeping+0.014
 avail+0.014
Neg logit contributions
 a-0.006
 two-0.006
 big-0.006
 and-0.006
 little-0.006
Token thinks
Token ID6834
Feature activation+7.75e-01

Pos logit contributions
ittered+0.025
rily+0.025
immer+0.024
lights+0.024
inations+0.024
Neg logit contributions
 a-0.017
 two-0.016
 big-0.015
-0.015
 the-0.015
Token maybe
Token ID3863
Feature activation+6.94e-02

Pos logit contributions
rily+0.075
ittered+0.075
immer+0.074
inations+0.073
lights+0.072
Neg logit contributions
 a-0.024
 two-0.023
 big-0.022
-0.020
 and-0.020
Token Tom
Token ID4186
Feature activation+3.37e-01

Pos logit contributions
 avail+0.006
rily+0.006
ittered+0.006
immer+0.006
epad+0.006
Neg logit contributions
 a-0.002
 big-0.002
 two-0.002
 and-0.002
-0.002
Token is
Token ID318
Feature activation+1.44e-02

Pos logit contributions
ittered+0.031
rily+0.030
immer+0.030
inations+0.029
arians+0.029
Neg logit contributions
 a-0.012
 two-0.012
 big-0.012
 the-0.011
-0.010
Token inside
Token ID2641
Feature activation+1.32e-01

Pos logit contributions
ittered+0.001
rily+0.001
immer+0.001
epad+0.001
inations+0.001
Neg logit contributions
-0.001
 two-0.001
 a-0.001
 and-0.001
 big-0.001
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
rily+0.015
immer+0.015
ittered+0.014
lights+0.014
 avail+0.014
Neg logit contributions
 two-0.002
 a-0.002
 big-0.002
-0.001
 and-0.001
Token it
Token ID340
Feature activation-4.02e-01

Pos logit contributions
 big+0.005
 a+0.005
 two+0.005
 and+0.004
 next+0.004
Neg logit contributions
ittered-0.022
rily-0.022
immer-0.022
fer-0.022
ge-0.022
Token was
Token ID373
Feature activation+3.10e-01

Pos logit contributions
 a+0.013
 two+0.012
 big+0.011
+0.011
 the+0.011
Neg logit contributions
rated-0.036
ittered-0.036
immer-0.036
rily-0.036
lights-0.036
Token especially
Token ID2592
Feature activation+1.71e-01

Pos logit contributions
immer+0.030
ittered+0.030
rily+0.029
lights+0.029
inations+0.029
Neg logit contributions
 two-0.008
 a-0.008
-0.007
 big-0.007
 and-0.007
Token mild
Token ID11607
Feature activation+2.59e-01

Pos logit contributions
ittered+0.015
immer+0.015
rily+0.015
lights+0.015
rated+0.014
Neg logit contributions
 two-0.006
 a-0.006
-0.006
 big-0.006
 and-0.005
Token so
Token ID523
Feature activation-2.12e-01

Pos logit contributions
lights+0.028
rily+0.028
immer+0.028
ittered+0.028
istent+0.027
Neg logit contributions
 a-0.005
 two-0.005
 big-0.004
-0.004
 the-0.004
Token O
Token ID440
Feature activation+1.18e+00

Pos logit contributions
 two+0.009
 a+0.009
 and+0.008
+0.008
 big+0.008
Neg logit contributions
lights-0.017
ittered-0.017
rily-0.017
immer-0.017
inations-0.017
Tokenll
Token ID297
Feature activation+1.34e+00

Pos logit contributions
ittered+0.053
rily+0.052
immer+0.052
inations+0.049
lights+0.047
Neg logit contributions
 a-0.105
 two-0.100
 big-0.100
 the-0.099
 and-0.098
Tokenie
Token ID494
Feature activation+1.69e-01

Pos logit contributions
ittered+0.066
rily+0.066
immer+0.065
inations+0.062
lights+0.060
Neg logit contributions
 a-0.111
 two-0.107
 big-0.106
 the-0.105
 and-0.104
Token decided
Token ID3066
Feature activation-1.04e-01

Pos logit contributions
ittered+0.013
rily+0.013
inations+0.013
immer+0.013
fer+0.013
Neg logit contributions
 a-0.008
 two-0.008
 big-0.008
-0.008
 the-0.007
Token he
Token ID339
Feature activation+2.87e-01

Pos logit contributions
 two+0.002
 a+0.002
 big+0.002
+0.002
 and+0.002
Neg logit contributions
rily-0.010
fer-0.010
keeping-0.010
ittered-0.010
inations-0.010
Token wanted
Token ID2227
Feature activation-2.04e-01

Pos logit contributions
ittered+0.025
rily+0.025
inations+0.024
immer+0.024
istent+0.024
Neg logit contributions
 a-0.012
 two-0.011
 big-0.011
 the-0.010
-0.010
Token to
Token ID284
Feature activation+1.30e-01

Pos logit contributions
rily+0.024
inations+0.024
ittered+0.024
immer+0.024
keeping+0.024
Neg logit contributions
 a-0.004
 two-0.004
 big-0.003
-0.003
 the-0.003
Token find
Token ID1064
Feature activation+3.31e-02

Pos logit contributions
immer+0.010
ittered+0.010
 avail+0.010
rily+0.010
inations+0.010
Neg logit contributions
 a-0.006
 two-0.006
-0.006
 big-0.006
 and-0.006
Token,
Token ID11
Feature activation+1.69e-01

Pos logit contributions
rily+0.003
immer+0.003
ittered+0.003
 avail+0.003
inations+0.003
Neg logit contributions
 two-0.001
 a-0.001
-0.001
 big-0.001
 and-0.001
Token but
Token ID475
Feature activation-8.87e-02

Pos logit contributions
rily+0.015
inations+0.015
immer+0.015
ittered+0.015
istent+0.014
Neg logit contributions
 a-0.006
 two-0.006
 big-0.006
-0.005
 the-0.005
Token the
Token ID262
Feature activation-2.33e-01

Pos logit contributions
 big+0.004
 two+0.004
 a+0.004
+0.003
 and+0.003
Neg logit contributions
rily-0.007
comb-0.007
 avail-0.007
ittered-0.007
immer-0.007
Token entrance
Token ID10384
Feature activation+7.68e-01

Pos logit contributions
 a+0.012
 and+0.012
+0.012
 the+0.011
 two+0.011
Neg logit contributions
immer-0.016
rily-0.016
 avail-0.015
arians-0.015
rier-0.015
Token was
Token ID373
Feature activation+3.02e-01

Pos logit contributions
ittered+0.075
rily+0.074
immer+0.073
inations+0.072
lights+0.072
Neg logit contributions
 a-0.025
 two-0.023
 big-0.023
 the-0.021
-0.021
Token deep
Token ID2769
Feature activation+3.21e-01

Pos logit contributions
immer+0.027
ittered+0.027
rily+0.027
inations+0.026
epad+0.026
Neg logit contributions
 a-0.011
 two-0.010
-0.010
 and-0.010
 big-0.010
Token in
Token ID287
Feature activation+1.55e-01

Pos logit contributions
rily+0.036
ittered+0.034
immer+0.034
lights+0.034
 avail+0.033
Neg logit contributions
 a-0.006
 two-0.006
 big-0.005
-0.005
 the-0.004
Token the
Token ID262
Feature activation-1.90e-01

Pos logit contributions
 avail+0.018
rily+0.017
immer+0.017
lights+0.017
ittered+0.017
Neg logit contributions
 a-0.001
 and-0.001
 big-0.001
 two-0.001
-0.001
Token forest
Token ID8222
Feature activation+4.29e-01

Pos logit contributions
 a+0.011
 and+0.010
 two+0.010
+0.010
 the+0.010
Neg logit contributions
immer-0.012
lights-0.012
rily-0.012
rated-0.012
 avail-0.012
Token
Token ID198
Feature activation+3.35e-01

Pos logit contributions
rily+0.014
ittered+0.014
immer+0.014
inations+0.014
igi+0.014
Neg logit contributions
 a-0.002
 two-0.001
 big-0.001
 the-0.001
 and-0.001
TokenShe
Token ID3347
Feature activation-1.51e-01

Pos logit contributions
rily+0.023
immer+0.023
ittered+0.022
lor+0.021
rol+0.021
Neg logit contributions
 a-0.021
 two-0.019
 the-0.019
 big-0.019
 and-0.019
Tokenlly
Token ID12810
Feature activation-2.11e-01

Pos logit contributions
 a+0.006
 two+0.006
 big+0.006
+0.006
 the+0.006
Neg logit contributions
ittered-0.013
rily-0.012
inations-0.012
immer-0.012
lights-0.012
Token took
Token ID1718
Feature activation-3.20e-01

Pos logit contributions
 a+0.008
 two+0.007
 big+0.007
+0.007
 the+0.007
Neg logit contributions
ittered-0.019
rily-0.018
inations-0.018
immer-0.018
owed-0.017
Token out
Token ID503
Feature activation-4.90e-01

Pos logit contributions
 next+0.005
+0.005
 and+0.004
 two+0.004
 big+0.004
Neg logit contributions
ittered-0.033
rily-0.033
 avail-0.033
immer-0.032
lights-0.032
Token some
Token ID617
Feature activation-1.63e-01

Pos logit contributions
+0.007
 next+0.005
 be+0.005
 and+0.005
 big+0.005
Neg logit contributions
rily-0.051
lights-0.051
ittered-0.050
immer-0.050
inations-0.049
Token paints
Token ID32281
Feature activation-3.39e-01

Pos logit contributions
 a+0.008
+0.008
 two+0.008
 the+0.007
 big+0.007
Neg logit contributions
inations-0.012
immer-0.012
lights-0.012
rily-0.012
ittered-0.012
Token and
Token ID290
Feature activation-1.84e-01

Pos logit contributions
 a+0.004
 two+0.004
 big+0.004
+0.003
 little+0.003
Neg logit contributions
immer-0.038
lights-0.038
rily-0.037
ittered-0.037
pan-0.036
Token started
Token ID2067
Feature activation-1.03e-01

Pos logit contributions
 a+0.010
+0.010
 two+0.010
 big+0.010
 and+0.010
Neg logit contributions
ittered-0.012
lights-0.012
inations-0.012
rily-0.012
immer-0.011
Token painting
Token ID12036
Feature activation-1.55e-01

Pos logit contributions
 a+0.004
 two+0.004
+0.004
 big+0.004
 and+0.004
Neg logit contributions
ittered-0.009
rily-0.009
inations-0.009
immer-0.008
epad-0.008
Token.
Token ID13
Feature activation+2.36e-01

Pos logit contributions
 two+0.002
 a+0.002
 big+0.002
+0.002
 little+0.002
Neg logit contributions
lights-0.017
rily-0.017
ittered-0.016
immer-0.016
 avail-0.016
Token and
Token ID290
Feature activation-1.48e-01

Pos logit contributions
 a+0.002
 two+0.002
 big+0.002
 the+0.001
 little+0.001
Neg logit contributions
immer-0.017
rily-0.017
lights-0.017
ittered-0.017
ge-0.017
Token climbed
Token ID19952
Feature activation-7.93e-02

Pos logit contributions
 big+0.006
 a+0.006
+0.006
 two+0.005
 and+0.005
Neg logit contributions
rily-0.012
ittered-0.012
lights-0.012
inations-0.012
immer-0.011
Token down
Token ID866
Feature activation-6.49e-01

Pos logit contributions
 big+0.003
 a+0.003
 two+0.003
 little+0.003
+0.003
Neg logit contributions
rily-0.007
ittered-0.007
immer-0.006
keeping-0.006
lights-0.006
Token the
Token ID262
Feature activation+3.14e-01

Pos logit contributions
 two+0.008
 big+0.008
 little+0.007
 a+0.006
 me+0.004
Neg logit contributions
rily-0.069
keeping-0.068
lights-0.067
ge-0.067
eway-0.066
Token drain
Token ID14782
Feature activation+1.86e-01

Pos logit contributions
rily+0.015
lights+0.014
immer+0.014
ittered+0.014
keeping+0.014
Neg logit contributions
 a-0.025
 two-0.024
 the-0.023
 big-0.023
-0.023
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
rily+0.021
ittered+0.021
immer+0.021
lights+0.021
keeping+0.020
Neg logit contributions
 a-0.003
 two-0.002
 big-0.002
-0.002
 the-0.002
Token He
Token ID679
Feature activation-5.37e-02

Pos logit contributions
rily+0.012
ittered+0.011
lights+0.011
immer+0.011
keeping+0.010
Neg logit contributions
 a-0.006
 two-0.005
 big-0.005
 the-0.005
 and-0.005
Token lifted
Token ID13663
Feature activation-6.54e-01

Pos logit contributions
 a+0.002
 two+0.002
 big+0.002
 the+0.002
+0.002
Neg logit contributions
ittered-0.004
inations-0.004
rily-0.004
immer-0.004
lights-0.004
Token Sara
Token ID24799
Feature activation-1.58e-01

Pos logit contributions
 big+0.017
 two+0.016
 a+0.015
 next+0.015
+0.015
Neg logit contributions
rily-0.065
ittered-0.059
immer-0.058
rol-0.058
keeping-0.057
Token and
Token ID290
Feature activation-4.46e-02

Pos logit contributions
 a+0.005
 big+0.005
 two+0.005
+0.004
 the+0.004
Neg logit contributions
rily-0.015
ittered-0.015
immer-0.015
ge-0.015
owed-0.014
Token her
Token ID607
Feature activation+2.52e-01

Pos logit contributions
 big+0.002
 a+0.002
+0.002
 two+0.002
 and+0.002
Neg logit contributions
rily-0.004
ittered-0.004
lights-0.004
immer-0.004
fer-0.004
Token ready
Token ID3492
Feature activation+6.64e-02

Pos logit contributions
ittered+0.042
immer+0.040
inations+0.040
rily+0.040
leness+0.040
Neg logit contributions
 a-0.019
 two-0.018
 big-0.018
 and-0.017
 the-0.017
Token to
Token ID284
Feature activation-2.94e-01

Pos logit contributions
ittered+0.007
rily+0.007
inations+0.007
epad+0.007
immer+0.007
Neg logit contributions
 two-0.002
 a-0.001
 big-0.001
-0.001
 little-0.001
Token go
Token ID467
Feature activation-2.39e-01

Pos logit contributions
 a+0.011
 two+0.011
 and+0.010
+0.010
 big+0.010
Neg logit contributions
 avail-0.025
inations-0.025
ittered-0.025
immer-0.025
rily-0.024
Token!
Token ID0
Feature activation+2.71e-02

Pos logit contributions
 two+0.007
 a+0.007
 little+0.006
 big+0.006
 the+0.006
Neg logit contributions
inations-0.022
immer-0.022
ittered-0.022
rily-0.021
lights-0.021
Token
Token ID198
Feature activation+2.32e-01

Pos logit contributions
immer+0.003
rily+0.003
ittered+0.003
acket+0.003
lights+0.003
Neg logit contributions
 a-0.001
 big-0.001
 two-0.001
 the-0.001
 and-0.001
Token
Token ID198
Feature activation+4.00e-01

Pos logit contributions
immer+0.027
ittered+0.027
rily+0.027
inations+0.027
lights+0.026
Neg logit contributions
 a-0.003
 two-0.002
 big-0.002
 the-0.001
 and-0.001
TokenOne
Token ID3198
Feature activation-8.38e-01

Pos logit contributions
immer+0.029
inations+0.029
ittered+0.029
rily+0.028
ge+0.028
Neg logit contributions
 a-0.021
 two-0.020
 the-0.020
 big-0.020
 and-0.019
Token day
Token ID1110
Feature activation+1.10e-01

Pos logit contributions
 two+0.037
 a+0.034
+0.031
 and+0.031
 the+0.031
Neg logit contributions
rated-0.063
immer-0.062
adows-0.061
inations-0.059
ittered-0.059
Token,
Token ID11
Feature activation+1.11e-01

Pos logit contributions
ittered+0.011
immer+0.011
rily+0.011
 avail+0.011
fer+0.011
Neg logit contributions
 a-0.002
 big-0.002
 two-0.002
 and-0.002
-0.002
Token the
Token ID262
Feature activation-9.47e-01

Pos logit contributions
rily+0.011
ittered+0.011
 avail+0.011
fer+0.011
lights+0.011
Neg logit contributions
 two-0.002
 big-0.002
 a-0.002
 and-0.002
-0.002
Token hipp
Token ID18568
Feature activation+3.79e-02

Pos logit contributions
 a+0.063
 and+0.059
 the+0.057
 to+0.056
 two+0.056
Neg logit contributions
rily-0.050
immer-0.048
lights-0.047
igi-0.047
acket-0.046
Token each
Token ID1123
Feature activation+2.43e-01

Pos logit contributions
rily+0.027
 avail+0.027
immer+0.026
ittered+0.026
lights+0.026
Neg logit contributions
 a-0.008
 two-0.008
 big-0.007
 and-0.007
-0.007
Token other
Token ID584
Feature activation-2.59e-02

Pos logit contributions
rily+0.025
 avail+0.024
inations+0.024
immer+0.024
leness+0.024
Neg logit contributions
 a-0.006
 two-0.005
-0.005
 and-0.005
 big-0.005
Token.
Token ID13
Feature activation+2.06e-01

Pos logit contributions
 a+0.000
 two+0.000
 big+0.000
+0.000
 the+0.000
Neg logit contributions
rily-0.003
ge-0.003
immer-0.003
keeping-0.003
inations-0.003
Token<|endoftext|>
Token ID50256
Feature activation-5.30e-01

Pos logit contributions
rily+0.020
ittered+0.020
immer+0.020
lights+0.020
inations+0.020
Neg logit contributions
 a-0.005
 big-0.005
 two-0.005
 the-0.004
 and-0.004
TokenL
Token ID43
Feature activation+8.03e-01

Pos logit contributions
 a+0.024
 big+0.023
 two+0.023
 little+0.022
 the+0.021
Neg logit contributions
ittered-0.040
rily-0.040
lor-0.039
immer-0.039
keeping-0.039
Tokenily
Token ID813
Feature activation-2.98e-01

Pos logit contributions
rily+0.060
ittered+0.060
immer+0.059
inations+0.058
lights+0.056
Neg logit contributions
 a-0.045
 two-0.042
 big-0.042
 the-0.041
 and-0.040
Token and
Token ID290
Feature activation-5.29e-02

Pos logit contributions
 a+0.007
 two+0.006
 big+0.006
 the+0.005
+0.005
Neg logit contributions
ittered-0.031
rily-0.030
immer-0.030
keeping-0.029
inations-0.029
Token Ben
Token ID3932
Feature activation-3.97e-01

Pos logit contributions
 big+0.002
 two+0.001
 a+0.001
 next+0.001
 and+0.001
Neg logit contributions
 avail-0.005
ittered-0.005
keeping-0.005
rily-0.005
fer-0.004
Token love
Token ID1842
Feature activation-8.73e-02

Pos logit contributions
 a+0.014
 big+0.014
 two+0.014
 the+0.012
+0.012
Neg logit contributions
ittered-0.035
immer-0.034
rily-0.033
leness-0.033
acket-0.033
Token to
Token ID284
Feature activation+1.04e-01

Pos logit contributions
 two+0.002
 a+0.002
 big+0.002
+0.002
 little+0.001
Neg logit contributions
rily-0.009
keeping-0.009
ittered-0.008
lights-0.008
inations-0.008
Token play
Token ID711
Feature activation-5.19e-01

Pos logit contributions
ittered+0.009
rily+0.009
immer+0.009
inations+0.009
 avail+0.009
Neg logit contributions
 two-0.003
 a-0.003
 big-0.003
 and-0.003
 next-0.003
Token.
Token ID13
Feature activation-6.66e-01

Pos logit contributions
 two+0.018
 a+0.016
 big+0.015
 little+0.011
+0.010
Neg logit contributions
rily-0.091
lights-0.089
ge-0.088
epad-0.086
immer-0.085
Token She
Token ID1375
Feature activation-1.40e-01

Pos logit contributions
 two+0.031
 a+0.031
 big+0.029
 little+0.028
 one+0.028
Neg logit contributions
keeping-0.044
lights-0.044
iders-0.044
 outage-0.044
plets-0.042
Token sees
Token ID7224
Feature activation-5.80e-02

Pos logit contributions
 a+0.008
 two+0.007
 big+0.007
+0.007
 the+0.007
Neg logit contributions
ittered-0.010
arians-0.009
leness-0.009
inations-0.009
immer-0.009
Token Tom
Token ID4186
Feature activation-2.83e-01

Pos logit contributions
 a+0.001
 big+0.001
 two+0.001
 and+0.001
+0.001
Neg logit contributions
lights-0.006
rily-0.006
irs-0.005
ittered-0.005
immer-0.005
Token and
Token ID290
Feature activation-2.14e-01

Pos logit contributions
 a+0.004
 big+0.003
 two+0.003
 the+0.002
+0.002
Neg logit contributions
rily-0.031
ittered-0.031
ge-0.031
immer-0.031
istent-0.030
Token Amy
Token ID14235
Feature activation-1.90e-01

Pos logit contributions
 a+0.008
 two+0.008
+0.008
 and+0.007
 big+0.007
Neg logit contributions
lights-0.017
rily-0.017
 avail-0.017
ittered-0.017
leness-0.016
Token fighting
Token ID4330
Feature activation-2.60e-01

Pos logit contributions
 a+0.007
 big+0.007
 two+0.007
+0.007
 the+0.006
Neg logit contributions
rily-0.016
ittered-0.016
immer-0.016
ge-0.016
keeping-0.016
Token.
Token ID13
Feature activation-5.75e-01

Pos logit contributions
 two+0.006
 a+0.005
 big+0.005
 little+0.005
+0.004
Neg logit contributions
ittered-0.026
lights-0.026
immer-0.026
rily-0.026
epad-0.026
Token She
Token ID1375
Feature activation-9.92e-02

Pos logit contributions
 a+0.025
 two+0.025
 big+0.023
 little+0.023
 the+0.022
Neg logit contributions
keeping-0.041
lights-0.041
iders-0.040
 outage-0.039
arians-0.039
Token sees
Token ID7224
Feature activation+3.05e-02

Pos logit contributions
 a+0.005
 two+0.004
 big+0.004
+0.004
 the+0.004
Neg logit contributions
ittered-0.008
leness-0.007
arians-0.007
heit-0.007
inations-0.007
Token the
Token ID262
Feature activation+6.42e-02

Pos logit contributions
lights+0.003
rily+0.003
ittered+0.003
irs+0.003
immer+0.003
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
 and-0.001
-0.001
Token fast
Token ID3049
Feature activation+2.50e-01

Pos logit contributions
+0.003
 two+0.002
 a+0.002
 big+0.002
 and+0.002
Neg logit contributions
ittered-0.008
immer-0.008
inations-0.008
rily-0.008
fer-0.008
Token.
Token ID13
Feature activation-2.30e-01

Pos logit contributions
rily+0.027
ittered+0.027
immer+0.027
inations+0.026
ge+0.026
Neg logit contributions
 a-0.005
 two-0.004
 big-0.004
-0.004
 the-0.004
Token So
Token ID1406
Feature activation-1.44e-01

Pos logit contributions
 big+0.010
 a+0.010
 two+0.010
 the+0.009
 and+0.009
Neg logit contributions
rily-0.018
istent-0.017
immer-0.017
arians-0.017
acket-0.017
Token it
Token ID340
Feature activation-7.99e-01

Pos logit contributions
 big+0.003
 a+0.003
 two+0.003
+0.002
 and+0.002
Neg logit contributions
 avail-0.015
rily-0.014
lights-0.014
ittered-0.013
immer-0.013
Token ignored
Token ID9514
Feature activation+2.59e-01

Pos logit contributions
 a+0.039
 two+0.037
+0.036
 big+0.036
 the+0.035
Neg logit contributions
ittered-0.060
rated-0.058
rily-0.058
immer-0.058
inations-0.057
Token the
Token ID262
Feature activation-7.93e-01

Pos logit contributions
rily+0.029
ittered+0.028
 avail+0.028
immer+0.028
ently+0.028
Neg logit contributions
 big-0.004
 two-0.003
 a-0.003
-0.002
 little-0.002
Token sign
Token ID1051
Feature activation-3.37e-01

Pos logit contributions
 a+0.040
 and+0.037
 the+0.036
 two+0.035
+0.035
Neg logit contributions
immer-0.056
leness-0.055
arians-0.055
epad-0.054
 avail-0.054
Token and
Token ID290
Feature activation-2.26e-04

Pos logit contributions
 a+0.005
 two+0.005
 big+0.005
+0.004
 the+0.003
Neg logit contributions
rily-0.036
ittered-0.036
immer-0.036
lights-0.035
inations-0.035
Token kept
Token ID4030
Feature activation-1.26e-01

Pos logit contributions
 big+0.000
 a+0.000
 two+0.000
+0.000
 and+0.000
Neg logit contributions
ittered-0.000
 avail-0.000
keeping-0.000
fer-0.000
inations-0.000
Token going
Token ID1016
Feature activation+1.37e-01

Pos logit contributions
 big+0.007
 a+0.007
 two+0.007
+0.007
 little+0.007
Neg logit contributions
ittered-0.008
 avail-0.008
rily-0.008
fer-0.008
ge-0.008
Token fast
Token ID3049
Feature activation+3.20e-01

Pos logit contributions
immer+0.014
rily+0.014
ge+0.014
ittered+0.014
inations+0.014
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
-0.003
 the-0.003
Token not
Token ID407
Feature activation-4.08e-01

Pos logit contributions
 two+0.007
 and+0.007
+0.007
 a+0.007
 big+0.007
Neg logit contributions
immer-0.015
inations-0.015
ittered-0.015
rily-0.015
epad-0.015
Token like
Token ID588
Feature activation-1.69e-01

Pos logit contributions
 two+0.019
 a+0.018
+0.018
 the+0.017
 big+0.017
Neg logit contributions
immer-0.031
ittered-0.031
inations-0.031
leness-0.030
epad-0.030
Token children
Token ID1751
Feature activation+2.29e-01

Pos logit contributions
 two+0.006
 a+0.006
+0.006
 big+0.006
 and+0.005
Neg logit contributions
rily-0.014
 avail-0.014
ittered-0.014
keeping-0.014
leness-0.014
Token.
Token ID13
Feature activation+3.23e-02

Pos logit contributions
rily+0.025
ittered+0.024
immer+0.024
inations+0.024
fer+0.024
Neg logit contributions
 a-0.005
 two-0.004
 big-0.004
 the-0.003
-0.003
Token He
Token ID679
Feature activation-5.31e-01

Pos logit contributions
rily+0.003
ittered+0.003
keeping+0.003
immer+0.003
lights+0.003
Neg logit contributions
 a-0.001
 big-0.001
 two-0.001
 the-0.001
 and-0.001
Token is
Token ID318
Feature activation-1.05e+00

Pos logit contributions
 a+0.022
 two+0.021
 big+0.020
 the+0.020
+0.019
Neg logit contributions
ittered-0.045
inations-0.044
leness-0.043
arians-0.043
rily-0.043
Token a
Token ID257
Feature activation-5.82e-01

Pos logit contributions
 two+0.035
+0.033
 and+0.031
 a+0.030
 big+0.030
Neg logit contributions
rily-0.086
ittered-0.085
inations-0.084
epad-0.084
icians-0.084
Token guard
Token ID4860
Feature activation-2.32e-01

Pos logit contributions
 a+0.022
 two+0.021
 the+0.020
+0.019
 and+0.019
Neg logit contributions
fer-0.049
epad-0.047
rily-0.047
inations-0.047
 outage-0.047
Token dog
Token ID3290
Feature activation-2.18e-01

Pos logit contributions
 a+0.005
 two+0.005
 big+0.004
+0.004
 the+0.004
Neg logit contributions
rily-0.025
ittered-0.024
fer-0.024
immer-0.024
inations-0.024
Token.
Token ID13
Feature activation-5.44e-02

Pos logit contributions
 a+0.004
 two+0.004
 big+0.004
 the+0.003
+0.003
Neg logit contributions
rily-0.023
ittered-0.023
arians-0.022
immer-0.022
fer-0.022
Token You
Token ID921
Feature activation-3.21e-01

Pos logit contributions
 a+0.002
 big+0.002
 two+0.002
 and+0.002
 the+0.002
Neg logit contributions
rily-0.004
keeping-0.004
ittered-0.004
immer-0.004
lights-0.004
Token The
Token ID383
Feature activation-4.66e-02

Pos logit contributions
 two+0.014
 a+0.014
 big+0.013
 next+0.012
 the+0.012
Neg logit contributions
rily-0.026
istent-0.025
ittered-0.024
immer-0.024
leness-0.024
Token cat
Token ID3797
Feature activation-6.03e-01

Pos logit contributions
 a+0.002
 the+0.002
+0.002
 and+0.002
 two+0.002
Neg logit contributions
arians-0.004
comb-0.004
immer-0.004
 avail-0.004
iders-0.004
Token laughed
Token ID13818
Feature activation-7.04e-01

Pos logit contributions
 a+0.021
 two+0.019
 big+0.018
+0.017
 the+0.017
Neg logit contributions
ittered-0.053
rily-0.053
aned-0.052
immer-0.051
istent-0.051
Token and
Token ID290
Feature activation+1.51e-02

Pos logit contributions
 two+0.012
 big+0.010
 a+0.009
+0.008
 be+0.007
Neg logit contributions
rily-0.076
ittered-0.072
immer-0.071
istent-0.070
lights-0.069
Token said
Token ID531
Feature activation+1.73e-01

Pos logit contributions
ittered+0.001
rily+0.001
leness+0.001
istent+0.001
immer+0.001
Neg logit contributions
 a-0.001
 big-0.001
 two-0.001
-0.001
 and-0.001
Token,
Token ID11
Feature activation-8.14e-01

Pos logit contributions
ittered+0.019
rily+0.018
immer+0.018
leness+0.018
istent+0.018
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
 and-0.002
-0.002
Token "
Token ID366
Feature activation+4.04e-01

Pos logit contributions
 a+0.029
 big+0.029
 two+0.028
 the+0.024
 and+0.022
Neg logit contributions
rily-0.072
istent-0.070
ittered-0.069
immer-0.068
arians-0.067
TokenYou
Token ID1639
Feature activation-3.23e-01

Pos logit contributions
rily+0.026
ittered+0.025
immer+0.025
leness+0.025
istent+0.024
Neg logit contributions
 a-0.027
 big-0.026
 two-0.025
 and-0.025
 the-0.025
Token'll
Token ID1183
Feature activation-1.79e-01

Pos logit contributions
 a+0.015
+0.015
 and+0.015
 next+0.015
 big+0.015
Neg logit contributions
inations-0.023
ittered-0.023
istent-0.023
immer-0.023
rily-0.022
Token never
Token ID1239
Feature activation-3.28e-02

Pos logit contributions
 a+0.007
 two+0.007
 big+0.007
 the+0.007
 and+0.006
Neg logit contributions
comb-0.014
immer-0.014
ittered-0.014
istent-0.014
inations-0.014
Token get
Token ID651
Feature activation-1.78e-01

Pos logit contributions
 a+0.002
 two+0.002
 the+0.001
 big+0.001
+0.001
Neg logit contributions
immer-0.003
ittered-0.003
inations-0.002
istent-0.002
rated-0.002
Token put
Token ID1234
Feature activation-6.72e-01

Pos logit contributions
 a+0.007
 big+0.007
+0.007
 two+0.007
 and+0.006
Neg logit contributions
rily-0.016
ittered-0.016
immer-0.016
fer-0.015
 avail-0.015
Token it
Token ID340
Feature activation-2.35e-01

Pos logit contributions
 big+0.018
+0.018
 little+0.017
 two+0.017
 a+0.016
Neg logit contributions
rily-0.064
ittered-0.061
immer-0.060
 avail-0.058
rol-0.057
Token down
Token ID866
Feature activation-3.18e-01

Pos logit contributions
 a+0.006
+0.006
 two+0.006
 big+0.006
 the+0.005
Neg logit contributions
ittered-0.024
immer-0.023
rily-0.023
lights-0.022
comb-0.022
Token and
Token ID290
Feature activation+1.79e-01

Pos logit contributions
 two+0.005
 a+0.005
 big+0.005
+0.004
 little+0.004
Neg logit contributions
rily-0.034
ittered-0.034
immer-0.033
lights-0.033
ently-0.033
Token said
Token ID531
Feature activation+3.14e-01

Pos logit contributions
rily+0.014
ittered+0.014
immer+0.014
inations+0.013
lights+0.013
Neg logit contributions
 a-0.008
 big-0.008
 two-0.008
-0.008
 and-0.007
Token,
Token ID11
Feature activation-6.77e-01

Pos logit contributions
rily+0.034
ittered+0.033
immer+0.033
fer+0.032
leness+0.031
Neg logit contributions
 a-0.007
 two-0.006
 big-0.006
 and-0.005
-0.005
Token
Token ID6184
Feature activation+1.13e+00

Pos logit contributions
 a+0.024
 big+0.023
 two+0.023
 the+0.021
 little+0.019
Neg logit contributions
rily-0.062
ittered-0.059
immer-0.059
istent-0.059
epad-0.057
Token
Token ID95
Feature activation+4.99e-01

Pos logit contributions
ittered+0.047
rily+0.047
immer+0.046
inations+0.044
lights+0.042
Neg logit contributions
 a-0.103
 two-0.099
 big-0.098
 the-0.097
 and-0.097
Token
Token ID26391
Feature activation+6.13e-01

Pos logit contributions
ittered+0.026
immer+0.025
rily+0.025
istent+0.024
inations+0.024
Neg logit contributions
 a-0.040
 two-0.039
 big-0.038
 the-0.038
 and-0.037
Token
Token ID129
Feature activation+4.27e-01

Pos logit contributions
ittered+0.028
immer+0.027
rily+0.027
inations+0.027
istent+0.026
Neg logit contributions
 a-0.052
 two-0.050
 big-0.049
 the-0.049
 and-0.048
Token
Token ID241
Feature activation-2.92e-01

Pos logit contributions
ittered+0.020
rily+0.020
immer+0.020
istent+0.019
leness+0.019
Neg logit contributions
 a-0.036
 two-0.035
 big-0.034
 the-0.033
 and-0.033
TokenOne
Token ID3198
Feature activation-6.55e-01

Pos logit contributions
 two+0.011
 a+0.011
 big+0.010
 little+0.009
 the+0.007
Neg logit contributions
ittered-0.089
rily-0.088
keeping-0.088
immer-0.088
lor-0.086
Token day
Token ID1110
Feature activation+2.41e-02

Pos logit contributions
 two+0.025
 a+0.024
 and+0.023
+0.021
 the+0.021
Neg logit contributions
immer-0.051
inations-0.051
rated-0.051
adows-0.050
ittered-0.050
Token,
Token ID11
Feature activation-1.29e-01

Pos logit contributions
ittered+0.003
immer+0.003
rily+0.002
inations+0.002
 avail+0.002
Neg logit contributions
 a-0.000
 two-0.000
 big-0.000
 and-0.000
 the-0.000
Token a
Token ID257
Feature activation-8.83e-01

Pos logit contributions
 two+0.001
 big+0.000
 and+0.000
 a+0.000
-0.000
Neg logit contributions
ittered-0.015
rily-0.014
immer-0.014
 avail-0.014
comb-0.014
Token gr
Token ID1036
Feature activation+7.53e-01

Pos logit contributions
 two+0.019
 a+0.019
 the+0.016
 and+0.015
 all+0.014
Neg logit contributions
fer-0.085
immer-0.085
rily-0.084
ittered-0.084
inations-0.083
Tokenumpy
Token ID32152
Feature activation-7.94e-01

Pos logit contributions
ittered+0.036
rily+0.036
immer+0.036
inations+0.034
lights+0.033
Neg logit contributions
 a-0.063
 two-0.060
 the-0.059
 big-0.059
 and-0.058
Token old
Token ID1468
Feature activation-6.47e-02

Pos logit contributions
 a+0.033
 two+0.033
+0.030
 the+0.029
 big+0.028
Neg logit contributions
rily-0.063
immer-0.062
ittered-0.062
fer-0.062
acket-0.061
Token man
Token ID582
Feature activation+2.05e-01

Pos logit contributions
 a+0.003
 two+0.003
+0.003
 the+0.003
 big+0.003
Neg logit contributions
rily-0.005
fer-0.005
 avail-0.005
ittered-0.005
immer-0.005
Token named
Token ID3706
Feature activation+4.15e-01

Pos logit contributions
ittered+0.017
immer+0.017
rily+0.017
inations+0.017
fer+0.016
Neg logit contributions
 a-0.008
 two-0.008
 big-0.008
 the-0.007
-0.007
Token Mr
Token ID1770
Feature activation+6.09e-01

Pos logit contributions
rily+0.037
ittered+0.037
immer+0.036
 avail+0.036
inations+0.035
Neg logit contributions
 a-0.014
 and-0.014
 two-0.014
 big-0.013
-0.013
Token.
Token ID13
Feature activation+9.19e-01

Pos logit contributions
ittered+0.071
rily+0.071
immer+0.070
inations+0.068
keeping+0.068
Neg logit contributions
 a-0.008
 two-0.006
 big-0.006
 the-0.005
 and-0.004