TOP ACTIVATIONS
MAX = 0.087

temper and often threw temper tant rums . One day ,
to cry and threw a tant rum . She didn 't
sad . She threw a tant rum and cried loudly .
and so she threw a tant rum when her parents said
kept crying and throwing a tant rum . Finally
his feet and threw a tant rum . "
and started to throw a tant rum . Suddenly , she
her feet and threw a tant rum . She grabbed the
them and never threw a tant rum again . <|endoftext|> Once
way forward . She stayed foc ussed . On the third
She cried and threw a tant rum . Her mom tried
. He kept throwing a tant rum . His mom was
stubborn . She throws a tant rum . She kicks and
surprise . He threw a tant rum . He kicked and
started walking very slowly as tort o ises do , but
found it !" Charl otte and her mom brought
the same urge to explore Ur ge Land everyday , and
listen . She threw a tant rum in the store .
and the harsh queen was overth rown . The
the shed . Charl otte ran to the shed

MIN ACTIVATIONS
MIN = -0.052

One , two , three , go !" They
. " One , two , three , go !"
. " One , two , three !" They
. " One , two , three , four , five
One , two , three , four , five ..." She
. " One , two , three , four ," she
One , two , three , four , five , six
. " One , two , three !" she said .
" One , two , three , four , five
. " One , two , three ..." L
One , two , three , four ," she counts .
to three . " One , two , three !"
One , two , three , four , five , six
her cone . " One , two , three !" she
" One , two , three , four , five
One , two , three , four ," she said .
. " One , two , three , four ," Ben
. " One , two , three , four ," she
to count . " One , two , three , four
" One , two , three , four , five

INTERVAL 0.053 - 0.087
CONTAINS 0.011%

man looking for something . Ho ppy hopped up to him
delicious vegetables and bright - col oured flowers .
. He che wed and che wed until it was all
them to their owners . Ho ppy felt bad for taking
their favorite place . They gard ened together every day ,

INTERVAL 0.018 - 0.053
CONTAINS 2.745%

hard until the pond was even smoother than before .
to play with her toys and spend time with her family
," said the duck , " and look at this beautiful
went searching together . They searched and searched ,
if they didn ’ t start listening to her

INTERVAL -0.017 - 0.018
CONTAINS 94.195%

into the mailbox , she felt happy that she did the
friendly . They hugged it and talked to it and made
was important . Mary didn 't like the gym , it
and more and spoiled their sandbox . " M
Once there was a girl called L ola . L ola

INTERVAL -0.052 - -0.017
CONTAINS 3.048%

. One day , Lily 's mom brought home a newspaper
upon a time there was a judge . She was very
Fisher . J enny smiled and went home feeling
Tom thought for a moment . Then he said ,
. One day , Lily had to go to the toilet
Token temper
Token ID4124
Feature activation+8.06e-03

Pos logit contributions
rily+0.003
ittered+0.003
immer+0.003
inations+0.003
lights+0.003
Neg logit contributions
 a-0.002
 two-0.002
-0.002
 big-0.002
 the-0.002
Token and
Token ID290
Feature activation-5.67e-04

Pos logit contributions
rily+0.035
ittered+0.034
immer+0.034
inations+0.034
lights+0.034
Neg logit contributions
 a-0.006
 two-0.006
 big-0.005
-0.005
 the-0.005
Token often
Token ID1690
Feature activation+1.67e-02

Pos logit contributions
 a+0.001
 two+0.001
 big+0.001
+0.001
 and+0.001
Neg logit contributions
ittered-0.002
rily-0.002
inations-0.002
lights-0.002
immer-0.002
Token threw
Token ID9617
Feature activation-1.90e-02

Pos logit contributions
ittered+0.057
rily+0.056
immer+0.055
inations+0.055
lights+0.054
Neg logit contributions
 a-0.029
 two-0.028
 big-0.028
-0.026
 the-0.026
Token temper
Token ID4124
Feature activation-4.96e-03

Pos logit contributions
 two+0.024
+0.023
 a+0.023
 big+0.022
 next+0.021
Neg logit contributions
rily-0.068
ittered-0.066
keeping-0.065
immer-0.065
lights-0.065
Token tant
Token ID24246
Feature activation+8.75e-02

Pos logit contributions
 a+0.010
 two+0.010
 big+0.010
+0.010
 the+0.010
Neg logit contributions
rily-0.015
ittered-0.014
immer-0.014
lights-0.014
inations-0.014
Tokenrums
Token ID45241
Feature activation+7.13e-03

Pos logit contributions
rily+0.109
immer+0.105
lights+0.102
comb+0.099
acket+0.098
Neg logit contributions
 a-0.333
 two-0.325
 and-0.321
 the-0.317
.-0.312
Token.
Token ID13
Feature activation+3.39e-03

Pos logit contributions
rily+0.030
ittered+0.029
immer+0.029
lights+0.028
inations+0.028
Neg logit contributions
 a-0.007
 big-0.007
 two-0.007
-0.006
 the-0.006
Token One
Token ID1881
Feature activation-6.91e-03

Pos logit contributions
rily+0.011
immer+0.011
lights+0.011
ittered+0.011
inations+0.011
Neg logit contributions
 a-0.006
 big-0.006
 two-0.006
 the-0.005
 and-0.005
Token day
Token ID1110
Feature activation+7.19e-03

Pos logit contributions
 a+0.010
 two+0.009
+0.009
 big+0.009
 the+0.008
Neg logit contributions
immer-0.023
lights-0.023
inations-0.023
rated-0.023
rily-0.023
Token,
Token ID11
Feature activation+7.36e-03

Pos logit contributions
rily+0.029
ittered+0.029
immer+0.029
 avail+0.028
inations+0.028
Neg logit contributions
 big-0.006
 a-0.006
 two-0.006
-0.005
 and-0.005
Token to
Token ID284
Feature activation+1.81e-02

Pos logit contributions
ittered+0.010
epad+0.010
rily+0.010
ge+0.009
keeping+0.009
Neg logit contributions
 two-0.002
 a-0.002
 big-0.002
-0.001
 and-0.001
Token cry
Token ID3960
Feature activation-1.51e-02

Pos logit contributions
immer+0.048
ittered+0.048
rily+0.048
inations+0.047
 avail+0.047
Neg logit contributions
 a-0.043
 two-0.042
 big-0.041
-0.040
 and-0.039
Token and
Token ID290
Feature activation+1.05e-02

Pos logit contributions
 two+0.013
 big+0.012
 a+0.012
 the+0.010
+0.009
Neg logit contributions
rily-0.061
immer-0.060
lights-0.059
istent-0.058
ittered-0.058
Token threw
Token ID9617
Feature activation-1.26e-02

Pos logit contributions
ittered+0.036
rily+0.036
 avail+0.036
immer+0.035
inations+0.035
Neg logit contributions
 two-0.016
 a-0.015
 big-0.015
-0.014
 and-0.014
Token a
Token ID257
Feature activation+1.27e-02

Pos logit contributions
 two+0.012
 big+0.012
+0.011
 little+0.010
 a+0.010
Neg logit contributions
rily-0.046
 avail-0.045
ittered-0.045
keeping-0.045
immer-0.045
Token tant
Token ID24246
Feature activation+8.69e-02

Pos logit contributions
rily+0.035
immer+0.035
ittered+0.034
inations+0.033
lights+0.033
Neg logit contributions
 a-0.029
 two-0.029
 the-0.027
-0.027
 big-0.027
Tokenrum
Token ID6582
Feature activation+6.61e-03

Pos logit contributions
rily+0.098
arians+0.093
acket+0.093
lights+0.092
inations+0.092
Neg logit contributions
 a-0.343
 and-0.335
 two-0.331
 big-0.327
 the-0.326
Token.
Token ID13
Feature activation+5.54e-04

Pos logit contributions
rily+0.030
ittered+0.029
lights+0.029
immer+0.029
ently+0.028
Neg logit contributions
 big-0.004
 a-0.004
 two-0.004
-0.003
 the-0.003
Token She
Token ID1375
Feature activation+4.85e-03

Pos logit contributions
rily+0.002
ittered+0.002
lights+0.002
immer+0.002
istent+0.002
Neg logit contributions
 two-0.001
 a-0.001
 big-0.001
 the-0.001
 and-0.001
Token didn
Token ID1422
Feature activation+6.34e-03

Pos logit contributions
ittered+0.015
rily+0.015
inations+0.015
immer+0.014
lights+0.014
Neg logit contributions
 a-0.009
 two-0.009
 big-0.009
-0.008
 the-0.008
Token't
Token ID470
Feature activation+1.50e-03

Pos logit contributions
rily+0.018
ittered+0.018
immer+0.018
inations+0.017
istent+0.017
Neg logit contributions
 a-0.014
 two-0.013
 big-0.013
 the-0.012
 and-0.012
Token sad
Token ID6507
Feature activation-1.34e-02

Pos logit contributions
 two+0.014
 a+0.014
 big+0.013
+0.013
 and+0.012
Neg logit contributions
rily-0.021
ittered-0.020
leness-0.020
aned-0.020
istent-0.020
Token.
Token ID13
Feature activation-9.82e-03

Pos logit contributions
 two+0.012
 big+0.012
 a+0.011
 the+0.009
 little+0.008
Neg logit contributions
rily-0.053
lights-0.052
ittered-0.052
immer-0.051
ently-0.051
Token She
Token ID1375
Feature activation-1.40e-02

Pos logit contributions
 a+0.013
 two+0.013
 big+0.013
 the+0.011
 little+0.010
Neg logit contributions
rily-0.034
ittered-0.034
ently-0.034
keeping-0.034
immer-0.033
Token threw
Token ID9617
Feature activation-1.35e-02

Pos logit contributions
 a+0.026
 two+0.026
 big+0.026
+0.024
 the+0.024
Neg logit contributions
rily-0.040
aned-0.040
ittered-0.040
rated-0.039
arians-0.039
Token a
Token ID257
Feature activation+8.47e-03

Pos logit contributions
+0.008
 two+0.008
 big+0.007
 a+0.007
 next+0.006
Neg logit contributions
rily-0.056
ittered-0.054
immer-0.054
 avail-0.053
lights-0.053
Token tant
Token ID24246
Feature activation+8.62e-02

Pos logit contributions
rily+0.019
immer+0.019
ittered+0.019
inations+0.018
lights+0.018
Neg logit contributions
 a-0.024
 two-0.023
 the-0.022
-0.022
 big-0.022
Tokenrum
Token ID6582
Feature activation+4.36e-03

Pos logit contributions
rily+0.097
acket+0.091
lights+0.090
arians+0.090
immer+0.089
Neg logit contributions
 a-0.341
 and-0.333
 two-0.330
 big-0.323
 the-0.322
Token and
Token ID290
Feature activation+8.31e-03

Pos logit contributions
rily+0.019
ittered+0.019
lights+0.019
ently+0.019
immer+0.019
Neg logit contributions
 big-0.003
 a-0.003
 two-0.002
 the-0.002
-0.002
Token cried
Token ID16896
Feature activation-7.41e-03

Pos logit contributions
ittered+0.025
rily+0.025
 avail+0.025
keeping+0.025
immer+0.025
Neg logit contributions
 a-0.016
 big-0.016
 two-0.016
-0.015
 and-0.014
Token loudly
Token ID23112
Feature activation-1.13e-02

Pos logit contributions
 two+0.006
 big+0.006
 a+0.006
+0.005
 the+0.005
Neg logit contributions
rily-0.030
ittered-0.029
immer-0.029
lights-0.029
istent-0.028
Token.
Token ID13
Feature activation-5.00e-03

Pos logit contributions
 two+0.007
 big+0.007
 a+0.007
+0.006
 the+0.005
Neg logit contributions
rily-0.049
ittered-0.049
immer-0.048
epad-0.047
lights-0.046
Token and
Token ID290
Feature activation-6.76e-03

Pos logit contributions
 two+0.001
 a+0.001
 big+0.001
+0.001
 the+0.001
Neg logit contributions
rily-0.010
ittered-0.010
istent-0.009
lights-0.009
inations-0.009
Token so
Token ID523
Feature activation-5.84e-03

Pos logit contributions
 two+0.012
 a+0.012
+0.012
 big+0.012
 and+0.011
Neg logit contributions
ittered-0.020
rily-0.020
leness-0.020
rated-0.020
immer-0.020
Token she
Token ID673
Feature activation-5.69e-03

Pos logit contributions
 two+0.009
 a+0.009
 big+0.008
 and+0.008
 right+0.008
Neg logit contributions
ittered-0.019
lights-0.019
rily-0.019
 avail-0.018
leness-0.018
Token threw
Token ID9617
Feature activation-2.02e-02

Pos logit contributions
 a+0.010
 two+0.009
 big+0.009
+0.009
 the+0.009
Neg logit contributions
rily-0.018
ittered-0.018
inations-0.018
arians-0.017
immer-0.017
Token a
Token ID257
Feature activation+7.16e-03

Pos logit contributions
+0.019
 big+0.019
 two+0.018
 little+0.018
 be+0.016
Neg logit contributions
rily-0.072
 avail-0.070
immer-0.069
ittered-0.069
keeping-0.069
Token tant
Token ID24246
Feature activation+8.59e-02

Pos logit contributions
rily+0.023
immer+0.023
ittered+0.023
inations+0.023
lights+0.022
Neg logit contributions
 a-0.013
 two-0.012
-0.012
 the-0.012
 and-0.011
Tokenrum
Token ID6582
Feature activation+1.46e-02

Pos logit contributions
rily+0.098
acket+0.093
immer+0.093
arians+0.092
lights+0.090
Neg logit contributions
 a-0.337
 and-0.330
 two-0.325
 big-0.320
 the-0.320
Token when
Token ID618
Feature activation+8.99e-03

Pos logit contributions
rily+0.066
ittered+0.065
immer+0.064
lights+0.064
keeping+0.063
Neg logit contributions
 a-0.008
 big-0.008
 two-0.008
 the-0.006
-0.006
Token her
Token ID607
Feature activation-5.11e-03

Pos logit contributions
ittered+0.032
rily+0.032
 avail+0.032
immer+0.031
lights+0.031
Neg logit contributions
 a-0.012
 two-0.012
 big-0.012
 and-0.011
-0.011
Token parents
Token ID3397
Feature activation+1.94e-03

Pos logit contributions
 a+0.007
+0.006
 the+0.006
 two+0.006
 and+0.006
Neg logit contributions
lights-0.018
rily-0.018
ittered-0.018
inations-0.018
fer-0.018
Token said
Token ID531
Feature activation+8.01e-03

Pos logit contributions
ittered+0.007
immer+0.007
rily+0.007
inations+0.007
istent+0.006
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
 the-0.003
-0.003
Token kept
Token ID4030
Feature activation+1.23e-03

Pos logit contributions
 a+0.007
 two+0.007
 big+0.007
+0.007
 the+0.007
Neg logit contributions
rily-0.012
ittered-0.012
inations-0.012
aned-0.012
immer-0.012
Token crying
Token ID13774
Feature activation-9.44e-03

Pos logit contributions
rily+0.003
ittered+0.003
 avail+0.003
keeping+0.003
epad+0.003
Neg logit contributions
 a-0.003
 big-0.003
 two-0.003
-0.003
 little-0.003
Token and
Token ID290
Feature activation+9.28e-03

Pos logit contributions
 a+0.007
 two+0.007
 big+0.006
+0.005
 the+0.005
Neg logit contributions
rily-0.040
lights-0.039
immer-0.039
ittered-0.038
ently-0.038
Token throwing
Token ID9644
Feature activation-9.48e-03

Pos logit contributions
ittered+0.027
rily+0.026
immer+0.026
inations+0.025
 avail+0.025
Neg logit contributions
 big-0.020
 a-0.020
 two-0.019
-0.018
 and-0.018
Token a
Token ID257
Feature activation+8.26e-03

Pos logit contributions
 two+0.009
 big+0.009
 a+0.009
+0.008
 next+0.008
Neg logit contributions
rily-0.037
immer-0.036
ittered-0.036
lights-0.035
 avail-0.035
Token tant
Token ID24246
Feature activation+8.57e-02

Pos logit contributions
rily+0.019
immer+0.019
ittered+0.018
lights+0.018
inations+0.017
Neg logit contributions
 a-0.023
 two-0.023
 the-0.022
-0.021
 big-0.021
Tokenrum
Token ID6582
Feature activation+7.48e-03

Pos logit contributions
rily+0.097
arians+0.091
lights+0.090
acket+0.089
inations+0.088
Neg logit contributions
 a-0.341
 and-0.330
 two-0.329
 big-0.324
 the-0.322
Token.
Token ID13
Feature activation-9.95e-03

Pos logit contributions
rily+0.033
ittered+0.033
lights+0.032
immer+0.032
ently+0.032
Neg logit contributions
 big-0.004
 a-0.004
 two-0.004
-0.003
 the-0.003
Token
Token ID198
Feature activation-8.18e-03

Pos logit contributions
 a+0.013
 big+0.013
 two+0.013
 the+0.011
 and+0.011
Neg logit contributions
rily-0.035
ittered-0.035
immer-0.034
ently-0.034
lights-0.034
Token
Token ID198
Feature activation-8.56e-03

Pos logit contributions
 a+0.005
 two+0.004
 big+0.004
 the+0.003
 and+0.003
Neg logit contributions
immer-0.036
ittered-0.035
rily-0.035
inations-0.035
ently-0.035
TokenFinally
Token ID11158
Feature activation+1.57e-03

Pos logit contributions
 a+0.023
 two+0.022
 big+0.022
 the+0.022
 and+0.021
Neg logit contributions
immer-0.020
ittered-0.019
rily-0.019
inations-0.019
aned-0.018
Token his
Token ID465
Feature activation+1.40e-02

Pos logit contributions
 big+0.002
 two+0.002
 a+0.002
+0.002
 next+0.002
Neg logit contributions
rily-0.008
ittered-0.008
immer-0.008
lights-0.008
 avail-0.007
Token feet
Token ID3625
Feature activation-1.42e-02

Pos logit contributions
rily+0.040
ittered+0.039
immer+0.038
inations+0.037
lights+0.037
Neg logit contributions
 a-0.032
 two-0.031
 big-0.030
-0.030
 the-0.030
Token and
Token ID290
Feature activation+4.33e-03

Pos logit contributions
 a+0.007
 two+0.007
 big+0.006
+0.005
 the+0.004
Neg logit contributions
rily-0.064
immer-0.064
ittered-0.063
ge-0.063
istent-0.061
Token threw
Token ID9617
Feature activation-1.00e-02

Pos logit contributions
rily+0.013
ittered+0.013
immer+0.012
lights+0.012
fer+0.012
Neg logit contributions
 two-0.008
 a-0.008
 big-0.008
-0.008
 and-0.008
Token a
Token ID257
Feature activation+8.97e-03

Pos logit contributions
 two+0.010
 big+0.010
+0.010
 a+0.009
 little+0.009
Neg logit contributions
rily-0.039
ittered-0.037
immer-0.037
lights-0.036
keeping-0.036
Token tant
Token ID24246
Feature activation+8.52e-02

Pos logit contributions
rily+0.026
immer+0.025
ittered+0.025
inations+0.024
lights+0.024
Neg logit contributions
 a-0.020
 two-0.019
 the-0.018
-0.018
 and-0.018
Tokenrum
Token ID6582
Feature activation+7.92e-03

Pos logit contributions
rily+0.096
lights+0.094
immer+0.092
acket+0.088
inations+0.087
Neg logit contributions
 a-0.331
 two-0.324
 and-0.323
 the-0.314
.-0.313
Token.
Token ID13
Feature activation-6.75e-03

Pos logit contributions
rily+0.036
ittered+0.036
immer+0.035
lights+0.035
keeping+0.034
Neg logit contributions
 a-0.004
 big-0.004
 two-0.004
 the-0.003
-0.003
Token
Token ID198
Feature activation-9.68e-03

Pos logit contributions
 a+0.006
 big+0.006
 two+0.006
 and+0.005
 the+0.005
Neg logit contributions
rily-0.027
ittered-0.026
lights-0.026
immer-0.025
fer-0.025
Token
Token ID198
Feature activation-9.89e-03

Pos logit contributions
 a+0.006
 two+0.005
 big+0.005
 and+0.004
 the+0.004
Neg logit contributions
immer-0.041
rily-0.041
ittered-0.040
inations-0.040
lights-0.040
Token"
Token ID1
Feature activation+6.39e-03

Pos logit contributions
 a+0.017
 big+0.016
 two+0.016
 and+0.016
 all+0.015
Neg logit contributions
immer-0.031
fer-0.031
rily-0.030
leness-0.030
inations-0.030
Token and
Token ID290
Feature activation-9.10e-04

Pos logit contributions
 big+0.010
 two+0.009
 a+0.008
+0.007
 the+0.006
Neg logit contributions
rily-0.048
istent-0.047
lights-0.047
ently-0.046
iously-0.046
Token started
Token ID2067
Feature activation-2.32e-03

Pos logit contributions
 two+0.002
 a+0.002
 big+0.002
+0.002
 and+0.001
Neg logit contributions
rily-0.003
ittered-0.003
leness-0.003
aned-0.003
inations-0.003
Token to
Token ID284
Feature activation+1.72e-02

Pos logit contributions
 two+0.002
 a+0.002
 big+0.002
+0.001
 and+0.001
Neg logit contributions
rily-0.009
ittered-0.009
ge-0.009
keeping-0.009
epad-0.009
Token throw
Token ID3714
Feature activation-1.05e-02

Pos logit contributions
rily+0.040
ittered+0.039
immer+0.039
 avail+0.038
inations+0.037
Neg logit contributions
 a-0.047
 two-0.047
 big-0.046
-0.044
 and-0.044
Token a
Token ID257
Feature activation+2.28e-02

Pos logit contributions
 two+0.009
 big+0.009
 a+0.008
+0.008
 next+0.008
Neg logit contributions
rily-0.043
ittered-0.040
immer-0.040
 avail-0.039
lights-0.039
Token tant
Token ID24246
Feature activation+8.50e-02

Pos logit contributions
rily+0.061
immer+0.060
ittered+0.060
inations+0.058
lights+0.057
Neg logit contributions
 a-0.058
 two-0.056
 big-0.054
 the-0.053
-0.053
Tokenrum
Token ID6582
Feature activation+6.60e-03

Pos logit contributions
rily+0.097
lights+0.091
arians+0.089
ittered+0.089
acket+0.089
Neg logit contributions
 a-0.336
 and-0.328
 two-0.324
 big-0.320
 the-0.319
Token.
Token ID13
Feature activation-4.23e-03

Pos logit contributions
rily+0.030
ittered+0.029
lights+0.029
immer+0.029
ently+0.028
Neg logit contributions
 big-0.004
 a-0.004
 two-0.004
-0.003
 the-0.002
Token Suddenly
Token ID24975
Feature activation+4.89e-03

Pos logit contributions
 a+0.009
 two+0.009
 big+0.009
 the+0.008
 and+0.008
Neg logit contributions
rily-0.012
ittered-0.011
ently-0.011
lights-0.011
istent-0.011
Token,
Token ID11
Feature activation+2.16e-03

Pos logit contributions
ittered+0.020
immer+0.020
rily+0.020
 avail+0.020
lights+0.020
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
-0.003
 and-0.003
Token she
Token ID673
Feature activation+5.87e-03

Pos logit contributions
 avail+0.008
rily+0.008
ittered+0.008
lights+0.008
comb+0.008
Neg logit contributions
 big-0.002
-0.002
 and-0.002
 two-0.002
 a-0.002
Token her
Token ID607
Feature activation+1.54e-02

Pos logit contributions
 two+0.008
 a+0.007
 big+0.007
+0.007
 next+0.006
Neg logit contributions
rily-0.030
ittered-0.029
keeping-0.028
lights-0.028
immer-0.028
Token feet
Token ID3625
Feature activation-1.37e-02

Pos logit contributions
rily+0.045
ittered+0.044
lights+0.042
immer+0.042
inations+0.042
Neg logit contributions
 a-0.034
 two-0.033
 big-0.032
-0.032
 the-0.031
Token and
Token ID290
Feature activation+8.79e-04

Pos logit contributions
 a+0.007
 two+0.007
 big+0.006
+0.005
 the+0.004
Neg logit contributions
rily-0.061
immer-0.061
ittered-0.060
ge-0.059
istent-0.058
Token threw
Token ID9617
Feature activation-1.66e-02

Pos logit contributions
rily+0.003
ittered+0.003
 avail+0.002
fer+0.002
leness+0.002
Neg logit contributions
 a-0.002
 two-0.002
 big-0.002
-0.002
 and-0.002
Token a
Token ID257
Feature activation+8.94e-03

Pos logit contributions
 two+0.016
 big+0.016
+0.016
 a+0.015
 next+0.014
Neg logit contributions
rily-0.065
ittered-0.062
immer-0.062
lights-0.060
keeping-0.060
Token tant
Token ID24246
Feature activation+8.48e-02

Pos logit contributions
rily+0.014
immer+0.014
ittered+0.014
inations+0.013
lights+0.012
Neg logit contributions
 a-0.032
 two-0.031
-0.030
 the-0.030
 and-0.029
Tokenrum
Token ID6582
Feature activation+5.60e-03

Pos logit contributions
rily+0.092
acket+0.089
lights+0.088
inations+0.088
arians+0.087
Neg logit contributions
 a-0.335
 and-0.329
 two-0.324
 big-0.320
 the-0.318
Token.
Token ID13
Feature activation-2.54e-03

Pos logit contributions
rily+0.025
ittered+0.025
lights+0.024
immer+0.024
keeping+0.024
Neg logit contributions
 a-0.003
 big-0.003
 two-0.003
-0.002
 the-0.002
Token She
Token ID1375
Feature activation+1.10e-03

Pos logit contributions
 a+0.005
 big+0.005
 two+0.004
 and+0.004
 the+0.004
Neg logit contributions
rily-0.008
ittered-0.008
lights-0.008
immer-0.008
ently-0.008
Token grabbed
Token ID13176
Feature activation-1.12e-02

Pos logit contributions
ittered+0.003
rily+0.003
inations+0.003
arians+0.003
lights+0.003
Neg logit contributions
 a-0.002
 two-0.002
 big-0.002
-0.002
 the-0.002
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
+0.007
 big+0.006
 and+0.006
 two+0.005
 a+0.005
Neg logit contributions
rily-0.048
ittered-0.047
immer-0.046
 avail-0.045
lights-0.045
Token them
Token ID606
Feature activation+4.86e-03

Pos logit contributions
rily+0.009
ittered+0.009
inations+0.009
immer+0.009
 avail+0.009
Neg logit contributions
 two-0.003
 big-0.003
 a-0.003
-0.002
 and-0.002
Token and
Token ID290
Feature activation-1.21e-03

Pos logit contributions
rily+0.021
ittered+0.021
immer+0.020
inations+0.020
lights+0.020
Neg logit contributions
 a-0.004
 two-0.004
 big-0.004
-0.003
 the-0.003
Token never
Token ID1239
Feature activation+2.56e-03

Pos logit contributions
 two+0.002
 a+0.002
 big+0.002
+0.002
 and+0.002
Neg logit contributions
rily-0.004
ittered-0.004
inations-0.004
keeping-0.004
lights-0.004
Token threw
Token ID9617
Feature activation-1.26e-02

Pos logit contributions
ittered+0.008
immer+0.008
rily+0.008
epad+0.007
inations+0.007
Neg logit contributions
 a-0.005
 two-0.005
 big-0.005
-0.005
 and-0.005
Token a
Token ID257
Feature activation+2.54e-02

Pos logit contributions
 two+0.020
 a+0.019
 big+0.019
+0.018
 and+0.017
Neg logit contributions
rily-0.043
ittered-0.041
immer-0.041
keeping-0.041
lights-0.040
Token tant
Token ID24246
Feature activation+8.22e-02

Pos logit contributions
rily+0.070
ittered+0.070
immer+0.069
inations+0.068
lights+0.066
Neg logit contributions
 a-0.061
 two-0.059
 big-0.058
 the-0.056
 and-0.056
Tokenrum
Token ID6582
Feature activation+3.44e-03

Pos logit contributions
rily+0.096
lights+0.091
immer+0.090
acket+0.087
comb+0.084
Neg logit contributions
 a-0.317
 two-0.310
 and-0.308
.-0.302
 the-0.300
Token again
Token ID757
Feature activation-6.86e-03

Pos logit contributions
rily+0.015
ittered+0.015
immer+0.015
lights+0.015
keeping+0.015
Neg logit contributions
 a-0.002
 big-0.002
 two-0.002
 the-0.002
-0.001
Token.
Token ID13
Feature activation-4.51e-03

Pos logit contributions
 two+0.005
 a+0.004
 big+0.004
+0.003
 the+0.003
Neg logit contributions
rily-0.030
immer-0.029
ittered-0.029
 avail-0.029
ge-0.028
Token<|endoftext|>
Token ID50256
Feature activation-1.19e-02

Pos logit contributions
 big+0.005
 a+0.005
 two+0.005
 the+0.004
 and+0.004
Neg logit contributions
rily-0.017
ittered-0.017
immer-0.017
lights-0.016
keeping-0.016
TokenOnce
Token ID7454
Feature activation-9.51e-03

Pos logit contributions
 two+0.006
 a+0.006
 big+0.005
 little+0.004
 and+0.003
Neg logit contributions
ittered-0.051
rily-0.051
immer-0.050
keeping-0.050
lights-0.049
Token way
Token ID835
Feature activation+2.22e-03

Pos logit contributions
rily+0.051
ittered+0.050
immer+0.050
 avail+0.049
inations+0.049
Neg logit contributions
 a-0.038
 two-0.037
 big-0.036
-0.035
 and-0.035
Token forward
Token ID2651
Feature activation-1.47e-03

Pos logit contributions
rily+0.008
immer+0.008
ittered+0.008
ge+0.008
 avail+0.008
Neg logit contributions
 two-0.003
 a-0.003
 big-0.003
-0.003
 little-0.003
Token.
Token ID13
Feature activation+1.51e-02

Pos logit contributions
 two+0.001
 big+0.001
 a+0.001
+0.001
 little+0.001
Neg logit contributions
rily-0.006
ittered-0.006
immer-0.006
keeping-0.006
ge-0.006
Token She
Token ID1375
Feature activation+1.11e-03

Pos logit contributions
rily+0.049
ittered+0.048
immer+0.047
lights+0.046
arians+0.046
Neg logit contributions
 a-0.028
 two-0.027
 big-0.027
 and-0.025
 the-0.025
Token stayed
Token ID9658
Feature activation+7.38e-03

Pos logit contributions
ittered+0.004
rily+0.004
inations+0.004
immer+0.004
fer+0.004
Neg logit contributions
 a-0.002
 two-0.002
 big-0.002
-0.002
 the-0.002
Token foc
Token ID2133
Feature activation+8.09e-02

Pos logit contributions
rily+0.029
ittered+0.029
immer+0.029
inations+0.028
 avail+0.028
Neg logit contributions
 two-0.008
 a-0.008
-0.007
 big-0.007
 little-0.006
Tokenussed
Token ID29569
Feature activation+1.24e-03

Pos logit contributions
igi+0.100
 avail+0.098
inations+0.096
leness+0.094
lights+0.090
Neg logit contributions
 and-0.295
 a-0.291
 two-0.285
 at-0.280
-0.274
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
rily+0.006
ittered+0.006
immer+0.006
lights+0.005
inations+0.005
Neg logit contributions
 a-0.001
 two-0.001
 big-0.000
-0.000
 the-0.000
Token On
Token ID1550
Feature activation+1.43e-02

Pos logit contributions
rily+0.043
ittered+0.043
immer+0.042
lights+0.041
inations+0.041
Neg logit contributions
 a-0.022
 big-0.021
 two-0.021
 and-0.020
 the-0.020
Token the
Token ID262
Feature activation+1.84e-02

Pos logit contributions
rily+0.064
ittered+0.064
immer+0.063
lights+0.063
 avail+0.063
Neg logit contributions
 a-0.008
 big-0.007
 two-0.007
 and-0.006
-0.006
Token third
Token ID2368
Feature activation+6.01e-03

Pos logit contributions
rily+0.057
ittered+0.055
immer+0.055
 avail+0.055
inations+0.054
Neg logit contributions
 a-0.037
 two-0.036
-0.035
 big-0.035
 and-0.034
Token She
Token ID1375
Feature activation-7.72e-03

Pos logit contributions
 a+0.013
 two+0.012
 big+0.012
 the+0.011
 and+0.010
Neg logit contributions
rily-0.021
lights-0.021
ently-0.021
ittered-0.020
istent-0.020
Token cried
Token ID16896
Feature activation-7.19e-03

Pos logit contributions
 a+0.013
 big+0.013
 two+0.013
+0.012
 the+0.012
Neg logit contributions
ittered-0.025
rily-0.024
inations-0.024
immer-0.024
arians-0.023
Token and
Token ID290
Feature activation-1.84e-03

Pos logit contributions
 two+0.004
 big+0.004
 a+0.003
+0.003
 the+0.003
Neg logit contributions
rily-0.031
ittered-0.030
immer-0.030
lights-0.030
istent-0.030
Token threw
Token ID9617
Feature activation-7.10e-03

Pos logit contributions
 big+0.003
 two+0.003
 a+0.003
+0.003
 and+0.003
Neg logit contributions
ittered-0.006
rily-0.006
 avail-0.006
leness-0.006
immer-0.006
Token a
Token ID257
Feature activation+8.73e-03

Pos logit contributions
 two+0.007
 big+0.007
+0.007
 a+0.007
 next+0.006
Neg logit contributions
rily-0.027
ittered-0.026
immer-0.026
keeping-0.026
lights-0.026
Token tant
Token ID24246
Feature activation+8.05e-02

Pos logit contributions
rily+0.020
immer+0.020
ittered+0.019
inations+0.019
lights+0.019
Neg logit contributions
 a-0.024
 two-0.024
 the-0.022
-0.022
 and-0.022
Tokenrum
Token ID6582
Feature activation+3.28e-03

Pos logit contributions
rily+0.090
immer+0.083
ittered+0.082
inations+0.082
lights+0.081
Neg logit contributions
 a-0.320
 and-0.313
 two-0.311
 big-0.308
 the-0.305
Token.
Token ID13
Feature activation-3.18e-03

Pos logit contributions
rily+0.015
ittered+0.015
lights+0.014
immer+0.014
keeping+0.014
Neg logit contributions
 big-0.002
 a-0.002
 two-0.002
-0.001
 the-0.001
Token Her
Token ID2332
Feature activation-1.94e-03

Pos logit contributions
 a+0.007
 two+0.007
 big+0.007
 the+0.006
 and+0.006
Neg logit contributions
rily-0.009
ittered-0.009
lights-0.008
ently-0.008
immer-0.008
Token mom
Token ID1995
Feature activation-1.89e-03

Pos logit contributions
 a+0.002
+0.002
 two+0.002
 the+0.002
 and+0.002
Neg logit contributions
lights-0.007
ittered-0.007
inations-0.007
rily-0.007
 footing-0.007
Token tried
Token ID3088
Feature activation+2.30e-03

Pos logit contributions
 a+0.003
 big+0.003
 two+0.003
+0.003
 the+0.003
Neg logit contributions
ittered-0.006
inations-0.006
immer-0.006
rily-0.006
istent-0.006
Token.
Token ID13
Feature activation-3.01e-03

Pos logit contributions
 two+0.010
 a+0.009
 big+0.009
 little+0.006
 the+0.006
Neg logit contributions
rily-0.050
ge-0.049
immer-0.048
lights-0.048
epad-0.048
Token He
Token ID679
Feature activation-5.88e-03

Pos logit contributions
 two+0.005
 a+0.005
 big+0.005
 the+0.005
 and+0.005
Neg logit contributions
rily-0.009
lights-0.009
keeping-0.009
epad-0.009
 avail-0.008
Token kept
Token ID4030
Feature activation-3.32e-04

Pos logit contributions
 a+0.008
 two+0.008
 big+0.008
 the+0.008
+0.008
Neg logit contributions
ittered-0.020
rily-0.020
immer-0.020
arians-0.019
inations-0.019
Token throwing
Token ID9644
Feature activation-1.33e-02

Pos logit contributions
 two+0.001
 big+0.001
 a+0.001
+0.001
 little+0.001
Neg logit contributions
rily-0.001
ittered-0.001
keeping-0.001
fer-0.001
 avail-0.001
Token a
Token ID257
Feature activation+3.46e-03

Pos logit contributions
 two+0.015
+0.015
 a+0.015
 big+0.014
 next+0.013
Neg logit contributions
rily-0.049
immer-0.048
ittered-0.047
lights-0.047
keeping-0.046
Token tant
Token ID24246
Feature activation+7.78e-02

Pos logit contributions
rily+0.011
immer+0.011
ittered+0.010
inations+0.010
lights+0.010
Neg logit contributions
 a-0.007
 two-0.007
 the-0.006
-0.006
 and-0.006
Tokenrum
Token ID6582
Feature activation+2.33e-03

Pos logit contributions
rily+0.082
ittered+0.076
lights+0.075
immer+0.074
arians+0.072
Neg logit contributions
 a-0.316
 two-0.308
 and-0.308
 big-0.305
 the-0.301
Token.
Token ID13
Feature activation-1.99e-03

Pos logit contributions
rily+0.010
ittered+0.010
lights+0.010
immer+0.010
ently+0.010
Neg logit contributions
 big-0.001
 a-0.001
 two-0.001
 the-0.001
 little-0.001
Token His
Token ID2399
Feature activation+6.22e-03

Pos logit contributions
 two+0.003
 a+0.003
 big+0.003
 and+0.003
 the+0.003
Neg logit contributions
rily-0.006
lights-0.006
ittered-0.006
epad-0.006
immer-0.006
Token mom
Token ID1995
Feature activation-4.82e-03

Pos logit contributions
rily+0.024
lights+0.024
ittered+0.024
keeping+0.024
immer+0.024
Neg logit contributions
 a-0.007
 two-0.006
-0.006
 the-0.005
 and-0.005
Token was
Token ID373
Feature activation-4.41e-03

Pos logit contributions
 a+0.008
 big+0.007
 two+0.007
+0.007
 the+0.006
Neg logit contributions
ittered-0.016
immer-0.016
rily-0.016
inations-0.016
istent-0.015
Token stubborn
Token ID24444
Feature activation-1.18e-02

Pos logit contributions
+0.022
 two+0.022
 a+0.021
 and+0.021
 big+0.020
Neg logit contributions
ittered-0.038
rily-0.038
epad-0.037
immer-0.036
inations-0.036
Token.
Token ID13
Feature activation+3.14e-03

Pos logit contributions
 big+0.009
 a+0.008
 two+0.008
+0.006
 the+0.006
Neg logit contributions
rily-0.048
lights-0.047
epad-0.047
keeping-0.047
ittered-0.047
Token She
Token ID1375
Feature activation-1.17e-02

Pos logit contributions
rily+0.010
ittered+0.010
lights+0.010
immer+0.010
istent+0.010
Neg logit contributions
 two-0.005
 a-0.005
 big-0.005
 the-0.004
 and-0.004
Token throws
Token ID12542
Feature activation-1.61e-02

Pos logit contributions
 a+0.026
 two+0.025
+0.024
 big+0.024
 the+0.024
Neg logit contributions
ittered-0.030
rily-0.030
inations-0.030
leness-0.029
arians-0.029
Token a
Token ID257
Feature activation+5.82e-03

Pos logit contributions
 two+0.011
 big+0.011
+0.011
 a+0.010
 little+0.009
Neg logit contributions
rily-0.066
ittered-0.062
immer-0.062
pan-0.061
lights-0.061
Token tant
Token ID24246
Feature activation+7.73e-02

Pos logit contributions
rily+0.012
immer+0.012
ittered+0.012
inations+0.011
lights+0.011
Neg logit contributions
 a-0.018
 two-0.017
-0.016
 the-0.016
 and-0.016
Tokenrum
Token ID6582
Feature activation+4.30e-03

Pos logit contributions
rily+0.085
immer+0.079
lights+0.077
ittered+0.076
acket+0.076
Neg logit contributions
 a-0.311
 two-0.303
 and-0.302
 big-0.296
 the-0.296
Token.
Token ID13
Feature activation+4.13e-03

Pos logit contributions
rily+0.019
ittered+0.019
lights+0.019
immer+0.019
ently+0.018
Neg logit contributions
 big-0.003
 a-0.002
 two-0.002
 the-0.002
 little-0.002
Token She
Token ID1375
Feature activation-7.56e-04

Pos logit contributions
rily+0.014
ittered+0.014
lights+0.014
immer+0.014
istent+0.013
Neg logit contributions
 a-0.006
 big-0.006
 two-0.006
 the-0.005
 and-0.005
Token kicks
Token ID17364
Feature activation-1.32e-02

Pos logit contributions
 a+0.002
 two+0.002
 big+0.002
+0.002
 the+0.002
Neg logit contributions
rily-0.002
ittered-0.002
inations-0.002
leness-0.002
lights-0.002
Token and
Token ID290
Feature activation-1.10e-02

Pos logit contributions
 big+0.007
 two+0.007
 a+0.006
+0.005
 little+0.005
Neg logit contributions
rily-0.056
lights-0.053
ittered-0.053
immer-0.052
ently-0.052
Token surprise
Token ID5975
Feature activation+6.68e-03

Pos logit contributions
rily+0.023
immer+0.023
inations+0.022
ittered+0.022
 avail+0.022
Neg logit contributions
 a-0.017
 two-0.016
-0.016
 the-0.016
 big-0.016
Token.
Token ID13
Feature activation-6.03e-03

Pos logit contributions
rily+0.030
immer+0.029
ittered+0.029
lights+0.029
inations+0.028
Neg logit contributions
 a-0.005
 two-0.004
 big-0.004
-0.003
 the-0.003
Token He
Token ID679
Feature activation-7.14e-03

Pos logit contributions
 big+0.010
 two+0.010
 a+0.010
 and+0.009
 the+0.009
Neg logit contributions
rily-0.018
lights-0.018
keeping-0.018
immer-0.018
ittered-0.017
Token threw
Token ID9617
Feature activation-1.82e-02

Pos logit contributions
 a+0.011
 two+0.011
 big+0.011
+0.010
 the+0.010
Neg logit contributions
ittered-0.024
rily-0.024
immer-0.023
arians-0.023
inations-0.023
Token a
Token ID257
Feature activation+1.32e-03

Pos logit contributions
+0.014
 two+0.013
 big+0.013
 next+0.012
 a+0.012
Neg logit contributions
rily-0.073
ittered-0.068
immer-0.068
keeping-0.068
lights-0.067
Token tant
Token ID24246
Feature activation+7.62e-02

Pos logit contributions
rily+0.004
immer+0.004
ittered+0.003
inations+0.003
epad+0.003
Neg logit contributions
 a-0.003
 two-0.003
 the-0.003
-0.003
 and-0.003
Tokenrum
Token ID6582
Feature activation+3.63e-03

Pos logit contributions
rily+0.081
immer+0.076
ittered+0.075
lights+0.074
inations+0.072
Neg logit contributions
 a-0.307
 and-0.300
 two-0.299
 big-0.295
 the-0.292
Token.
Token ID13
Feature activation-2.80e-03

Pos logit contributions
rily+0.016
ittered+0.016
lights+0.016
immer+0.015
keeping+0.015
Neg logit contributions
 big-0.002
 a-0.002
 two-0.002
 the-0.001
-0.001
Token He
Token ID679
Feature activation+1.65e-05

Pos logit contributions
 big+0.004
 two+0.004
 a+0.004
 and+0.004
 the+0.004
Neg logit contributions
rily-0.009
lights-0.009
ittered-0.009
immer-0.009
keeping-0.009
Token kicked
Token ID12165
Feature activation-1.56e-02

Pos logit contributions
rily+0.000
ittered+0.000
inations+0.000
lights+0.000
immer+0.000
Neg logit contributions
 a-0.000
 two-0.000
 big-0.000
-0.000
 the-0.000
Token and
Token ID290
Feature activation-4.80e-03

Pos logit contributions
 two+0.008
 big+0.008
+0.006
 a+0.006
 next+0.006
Neg logit contributions
rily-0.065
immer-0.061
lights-0.060
ittered-0.060
ge-0.060
Token started
Token ID2067
Feature activation+4.79e-03

Pos logit contributions
ittered+0.015
rily+0.015
inations+0.015
immer+0.015
fer+0.014
Neg logit contributions
 a-0.008
 two-0.008
 big-0.007
-0.007
 the-0.007
Token walking
Token ID6155
Feature activation-3.11e-03

Pos logit contributions
ittered+0.018
rily+0.018
ge+0.018
inations+0.018
epad+0.018
Neg logit contributions
 two-0.005
 a-0.005
 big-0.005
-0.004
 little-0.004
Token very
Token ID845
Feature activation+7.36e-03

Pos logit contributions
 two+0.003
 a+0.003
 big+0.003
+0.003
 little+0.003
Neg logit contributions
rily-0.012
ittered-0.012
lights-0.012
immer-0.011
eway-0.011
Token slowly
Token ID6364
Feature activation-2.01e-03

Pos logit contributions
rily+0.024
ittered+0.024
immer+0.024
ge+0.024
lights+0.024
Neg logit contributions
 two-0.012
 a-0.012
 big-0.011
-0.011
 the-0.010
Token as
Token ID355
Feature activation+2.18e-02

Pos logit contributions
 a+0.002
 two+0.002
 big+0.001
+0.001
 the+0.001
Neg logit contributions
immer-0.008
ittered-0.008
rily-0.008
lights-0.008
ge-0.008
Token tort
Token ID7619
Feature activation+7.61e-02

Pos logit contributions
rily+0.083
ittered+0.082
immer+0.081
lights+0.080
inations+0.079
Neg logit contributions
 a-0.029
 two-0.028
 big-0.027
 and-0.025
-0.025
Tokeno
Token ID78
Feature activation+5.01e-03

Pos logit contributions
rily+0.115
lights+0.114
immer+0.096
 avail+0.095
inations+0.094
Neg logit contributions
 two-0.280
 a-0.277
 and-0.265
 the-0.263
-0.261
Tokenises
Token ID2696
Feature activation-7.78e-03

Pos logit contributions
rily+0.020
ittered+0.020
immer+0.020
inations+0.019
fer+0.019
Neg logit contributions
 a-0.005
 big-0.005
 two-0.005
-0.004
 the-0.004
Token do
Token ID466
Feature activation-5.81e-03

Pos logit contributions
 a+0.005
 two+0.005
 big+0.005
+0.004
 little+0.004
Neg logit contributions
ittered-0.033
rily-0.033
immer-0.033
inations-0.032
lights-0.032
Token,
Token ID11
Feature activation+1.57e-03

Pos logit contributions
 two+0.006
 a+0.006
 big+0.005
+0.005
 and+0.004
Neg logit contributions
ittered-0.023
rily-0.023
inations-0.023
immer-0.023
lights-0.023
Token but
Token ID475
Feature activation-1.17e-03

Pos logit contributions
rily+0.005
ittered+0.005
inations+0.005
immer+0.005
lights+0.005
Neg logit contributions
 a-0.002
 big-0.002
 two-0.002
-0.002
 the-0.002
Token found
Token ID1043
Feature activation-1.90e-02

Pos logit contributions
 two+0.002
 a+0.002
 big+0.002
 the+0.002
+0.002
Neg logit contributions
ittered-0.003
immer-0.003
inations-0.003
comb-0.003
rily-0.003
Token it
Token ID340
Feature activation+9.29e-03

Pos logit contributions
 big+0.020
+0.019
 and+0.018
 two+0.018
 a+0.017
Neg logit contributions
rily-0.071
ittered-0.068
immer-0.067
keeping-0.066
 avail-0.066
Token!"
Token ID2474
Feature activation+4.57e-03

Pos logit contributions
rily+0.031
ittered+0.031
immer+0.031
inations+0.030
comb+0.030
Neg logit contributions
 a-0.016
 two-0.015
 big-0.015
-0.014
 the-0.014
Token
Token ID198
Feature activation-1.06e-03

Pos logit contributions
rily+0.017
ittered+0.017
immer+0.016
lights+0.016
istent+0.016
Neg logit contributions
 two-0.005
 a-0.005
 big-0.005
 the-0.004
 next-0.004
Token
Token ID198
Feature activation+8.65e-03

Pos logit contributions
 a+0.001
 two+0.000
 big+0.000
 and+0.000
 the+0.000
Neg logit contributions
rily-0.005
ittered-0.005
immer-0.005
inations-0.005
lor-0.005
TokenCharl
Token ID24453
Feature activation+7.51e-02

Pos logit contributions
rily+0.021
immer+0.021
ittered+0.021
arians+0.020
inations+0.020
Neg logit contributions
 a-0.022
 two-0.022
 big-0.021
 the-0.021
 and-0.021
Tokenotte
Token ID11404
Feature activation-5.12e-03

Pos logit contributions
immer+0.116
lights+0.112
inations+0.109
ittered+0.107
rily+0.104
Neg logit contributions
 a-0.269
 two-0.268
 and-0.262
 the-0.261
-0.259
Token and
Token ID290
Feature activation+5.88e-03

Pos logit contributions
 a+0.008
 big+0.008
 two+0.008
+0.007
 the+0.007
Neg logit contributions
ittered-0.017
rily-0.017
immer-0.017
arians-0.016
comb-0.016
Token her
Token ID607
Feature activation-7.44e-04

Pos logit contributions
rily+0.022
 avail+0.022
ittered+0.022
immer+0.021
lights+0.021
Neg logit contributions
 a-0.007
 two-0.006
 big-0.006
 and-0.006
-0.005
Token mom
Token ID1995
Feature activation-7.86e-04

Pos logit contributions
 a+0.001
+0.001
 two+0.001
 and+0.001
 the+0.001
Neg logit contributions
lights-0.003
rily-0.003
ittered-0.003
arians-0.003
 footing-0.003
Token brought
Token ID3181
Feature activation-7.12e-03

Pos logit contributions
 a+0.001
 big+0.001
+0.001
 two+0.001
 the+0.001
Neg logit contributions
ittered-0.003
immer-0.003
rily-0.003
inations-0.002
rol-0.002
Token the
Token ID262
Feature activation+5.07e-03

Pos logit contributions
rily+0.009
immer+0.009
ittered+0.009
inations+0.009
leness+0.009
Neg logit contributions
 two-0.004
 a-0.004
 big-0.004
-0.003
 and-0.003
Token same
Token ID976
Feature activation+1.59e-03

Pos logit contributions
immer+0.017
rily+0.016
ittered+0.016
inations+0.016
 avail+0.016
Neg logit contributions
 a-0.009
-0.008
 the-0.008
 and-0.008
 two-0.008
Token urge
Token ID14960
Feature activation-5.23e-03

Pos logit contributions
rily+0.006
immer+0.006
ittered+0.006
inations+0.005
lights+0.005
Neg logit contributions
 a-0.002
 two-0.002
-0.002
 big-0.002
 the-0.002
Token to
Token ID284
Feature activation-4.08e-03

Pos logit contributions
 a+0.004
 two+0.004
 big+0.004
+0.003
 the+0.003
Neg logit contributions
rily-0.022
ittered-0.022
immer-0.022
inations-0.021
lights-0.021
Token explore
Token ID7301
Feature activation+6.48e-03

Pos logit contributions
 a+0.007
 two+0.006
 big+0.006
 and+0.006
+0.006
Neg logit contributions
 avail-0.013
immer-0.013
ittered-0.013
inations-0.013
rily-0.013
Token Ur
Token ID8799
Feature activation+7.47e-02

Pos logit contributions
rily+0.028
ittered+0.028
immer+0.028
lights+0.028
inations+0.027
Neg logit contributions
 two-0.004
 a-0.004
 big-0.004
-0.003
 little-0.003
Tokenge
Token ID469
Feature activation+8.48e-03

Pos logit contributions
lights+0.058
rily+0.058
 avail+0.053
comb+0.051
ittered+0.050
Neg logit contributions
 and-0.308
 a-0.308
 two-0.304
 the-0.304
.-0.302
Token Land
Token ID6379
Feature activation+8.99e-03

Pos logit contributions
rily+0.038
ittered+0.038
immer+0.037
lights+0.037
inations+0.036
Neg logit contributions
 a-0.005
 two-0.005
 big-0.005
-0.004
 the-0.003
Token everyday
Token ID10908
Feature activation+2.75e-03

Pos logit contributions
rily+0.040
ittered+0.040
immer+0.040
lights+0.039
inations+0.039
Neg logit contributions
 a-0.006
 two-0.005
 big-0.005
-0.004
 the-0.004
Token,
Token ID11
Feature activation+6.56e-04

Pos logit contributions
rily+0.012
ittered+0.012
immer+0.012
inations+0.012
lights+0.012
Neg logit contributions
 a-0.002
 two-0.002
 big-0.001
-0.001
 the-0.001
Token and
Token ID290
Feature activation-8.35e-03

Pos logit contributions
rily+0.002
inations+0.002
ittered+0.002
immer+0.002
epad+0.002
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
-0.001
 the-0.001
Token listen
Token ID6004
Feature activation-1.66e-02

Pos logit contributions
 a+0.004
 two+0.004
 big+0.004
+0.004
 the+0.004
Neg logit contributions
immer-0.005
epad-0.005
leness-0.005
inations-0.005
ittered-0.005
Token.
Token ID13
Feature activation+9.38e-03

Pos logit contributions
 two+0.015
 a+0.013
 big+0.012
+0.010
 the+0.010
Neg logit contributions
rily-0.066
lights-0.065
ittered-0.064
ently-0.064
epad-0.064
Token She
Token ID1375
Feature activation-5.97e-03

Pos logit contributions
rily+0.033
ittered+0.032
immer+0.032
lights+0.032
istent+0.031
Neg logit contributions
 a-0.014
 two-0.014
 big-0.014
 the-0.012
 and-0.012
Token threw
Token ID9617
Feature activation-1.18e-02

Pos logit contributions
 a+0.010
 big+0.010
 two+0.009
 the+0.009
+0.009
Neg logit contributions
ittered-0.020
rily-0.019
inations-0.019
immer-0.019
arians-0.019
Token a
Token ID257
Feature activation+5.70e-03

Pos logit contributions
 two+0.009
+0.008
 big+0.008
 a+0.008
 little+0.007
Neg logit contributions
rily-0.048
ittered-0.047
immer-0.046
lights-0.046
keeping-0.045
Token tant
Token ID24246
Feature activation+7.29e-02

Pos logit contributions
rily+0.012
immer+0.012
ittered+0.012
inations+0.011
lights+0.011
Neg logit contributions
 a-0.017
 two-0.016
 the-0.016
-0.016
 and-0.015
Tokenrum
Token ID6582
Feature activation+4.93e-04

Pos logit contributions
rily+0.080
immer+0.075
ittered+0.073
inations+0.073
 avail+0.072
Neg logit contributions
 a-0.293
 two-0.285
 and-0.285
 big-0.281
 the-0.279
Token in
Token ID287
Feature activation-1.04e-02

Pos logit contributions
rily+0.002
ittered+0.002
lights+0.002
ently+0.002
keeping+0.002
Neg logit contributions
 big-0.000
 a-0.000
 two-0.000
-0.000
 the-0.000
Token the
Token ID262
Feature activation-7.01e-03

Pos logit contributions
 a+0.006
+0.006
 big+0.006
 two+0.006
 and+0.005
Neg logit contributions
rily-0.042
 avail-0.041
keeping-0.041
ittered-0.041
immer-0.041
Token store
Token ID3650
Feature activation+2.98e-03

Pos logit contributions
 a+0.013
 and+0.012
+0.012
 two+0.012
 the+0.012
Neg logit contributions
rily-0.021
lights-0.021
immer-0.020
leness-0.020
acket-0.020
Token.
Token ID13
Feature activation+6.63e-03

Pos logit contributions
rily+0.013
ittered+0.013
immer+0.013
lights+0.013
arians+0.013
Neg logit contributions
 a-0.002
 two-0.002
 big-0.001
-0.001
 the-0.001
Token and
Token ID290
Feature activation+2.52e-03

Pos logit contributions
rily+0.067
ittered+0.067
immer+0.066
inations+0.065
lights+0.064
Neg logit contributions
 a-0.011
 two-0.011
 big-0.010
-0.008
 the-0.008
Token the
Token ID262
Feature activation-8.19e-03

Pos logit contributions
rily+0.009
ittered+0.009
inations+0.009
lights+0.009
immer+0.009
Neg logit contributions
 a-0.004
 big-0.003
 two-0.003
-0.003
 and-0.003
Token harsh
Token ID11859
Feature activation-1.95e-03

Pos logit contributions
 a+0.019
+0.018
 and+0.018
 the+0.017
 two+0.017
Neg logit contributions
epad-0.021
inations-0.021
rily-0.021
immer-0.021
 avail-0.020
Token queen
Token ID16599
Feature activation+3.90e-03

Pos logit contributions
 a+0.005
 two+0.005
 big+0.005
+0.005
 the+0.004
Neg logit contributions
rily-0.005
ittered-0.005
immer-0.005
lights-0.005
epad-0.005
Token was
Token ID373
Feature activation+1.30e-02

Pos logit contributions
ittered+0.014
rily+0.014
immer+0.014
inations+0.014
lights+0.014
Neg logit contributions
 a-0.005
 two-0.005
 big-0.005
-0.005
 the-0.005
Token overth
Token ID34514
Feature activation+7.25e-02

Pos logit contributions
rily+0.040
ittered+0.040
immer+0.040
inations+0.038
lights+0.038
Neg logit contributions
 a-0.026
 two-0.026
 big-0.025
-0.024
 and-0.024
Tokenrown
Token ID2053
Feature activation-6.58e-03

Pos logit contributions
rily+0.114
 avail+0.111
lights+0.107
leness+0.104
inations+0.103
Neg logit contributions
 two-0.245
 a-0.242
-0.238
 and-0.237
.-0.230
Token.
Token ID13
Feature activation+4.73e-03

Pos logit contributions
 a+0.012
 two+0.012
 big+0.012
 the+0.011
+0.011
Neg logit contributions
rily-0.021
ittered-0.021
immer-0.020
ge-0.020
keeping-0.020
Token
Token ID198
Feature activation+5.53e-03

Pos logit contributions
rily+0.019
ittered+0.018
immer+0.018
inations+0.018
arians+0.018
Neg logit contributions
 a-0.006
 big-0.005
 two-0.005
 and-0.005
 the-0.005
Token
Token ID198
Feature activation+8.03e-03

Pos logit contributions
rily+0.025
ittered+0.025
immer+0.025
inations+0.025
lights+0.024
Neg logit contributions
 a-0.003
 two-0.002
 big-0.002
 and-0.002
 the-0.002
TokenThe
Token ID464
Feature activation-7.58e-03

Pos logit contributions
immer+0.025
rily+0.025
ittered+0.025
inations+0.025
fer+0.024
Neg logit contributions
 a-0.016
 big-0.015
 two-0.015
 and-0.014
 the-0.014
Token the
Token ID262
Feature activation-2.12e-03

Pos logit contributions
rily+0.105
ittered+0.104
immer+0.103
 avail+0.102
inations+0.101
Neg logit contributions
 a-0.012
 two-0.011
 big-0.011
 and-0.008
-0.008
Token shed
Token ID14999
Feature activation+2.27e-03

Pos logit contributions
 a+0.005
 two+0.005
+0.004
 and+0.004
 big+0.004
Neg logit contributions
rily-0.006
arians-0.006
comb-0.006
keeping-0.005
immer-0.005
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
rily+0.010
immer+0.010
ittered+0.010
lights+0.010
inations+0.010
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
-0.001
 the-0.001
Token
Token ID198
Feature activation-8.82e-04

Pos logit contributions
rily+0.032
ittered+0.030
immer+0.030
arians+0.029
istent+0.029
Neg logit contributions
 a-0.027
 two-0.027
 big-0.026
 the-0.025
 and-0.024
Token
Token ID198
Feature activation+5.90e-03

Pos logit contributions
 a+0.001
 two+0.000
 big+0.000
 the+0.000
 and+0.000
Neg logit contributions
rily-0.004
immer-0.004
ittered-0.004
inations-0.004
lor-0.004
TokenCharl
Token ID24453
Feature activation+7.24e-02

Pos logit contributions
rily+0.017
immer+0.017
ittered+0.017
inations+0.017
arians+0.017
Neg logit contributions
 a-0.013
 two-0.012
 big-0.012
 the-0.011
 and-0.011
Tokenotte
Token ID11404
Feature activation-1.72e-02

Pos logit contributions
immer+0.108
lights+0.102
ittered+0.101
inations+0.101
rily+0.097
Neg logit contributions
 a-0.264
 two-0.262
 and-0.258
 the-0.255
-0.254
Token ran
Token ID4966
Feature activation-1.09e-02

Pos logit contributions
 a+0.025
 big+0.024
 two+0.023
+0.021
 the+0.021
Neg logit contributions
ittered-0.061
rily-0.060
immer-0.059
arians-0.058
istent-0.058
Token to
Token ID284
Feature activation+9.30e-04

Pos logit contributions
 two+0.014
 big+0.013
 a+0.012
+0.011
 little+0.011
Neg logit contributions
rily-0.038
keeping-0.038
ittered-0.038
immer-0.037
ge-0.037
Token the
Token ID262
Feature activation-1.18e-02

Pos logit contributions
 avail+0.004
rily+0.004
ittered+0.004
immer+0.004
keeping+0.004
Neg logit contributions
 two-0.000
 big-0.000
 a-0.000
 and-0.000
-0.000
Token shed
Token ID14999
Feature activation+7.86e-04

Pos logit contributions
 a+0.029
+0.027
 two+0.027
 and+0.026
 the+0.026
Neg logit contributions
rily-0.029
stack-0.028
comb-0.028
arians-0.028
keeping-0.027
TokenOne
Token ID3198
Feature activation-1.14e-02

Pos logit contributions
 a+0.014
 big+0.014
 the+0.014
 and+0.014
 two+0.013
Neg logit contributions
immer-0.014
rily-0.013
ittered-0.013
epad-0.013
inations-0.013
Token,
Token ID11
Feature activation-4.59e-02

Pos logit contributions
 a+0.009
 two+0.008
+0.007
 big+0.007
 and+0.007
Neg logit contributions
immer-0.046
lights-0.045
ittered-0.045
inations-0.044
adows-0.044
Token two
Token ID734
Feature activation-7.77e-03

Pos logit contributions
+0.027
 big+0.022
 and+0.020
.+0.020
 a+0.016
Neg logit contributions
 avail-0.168
icians-0.167
inations-0.167
arians-0.166
plets-0.165
Token,
Token ID11
Feature activation-5.20e-02

Pos logit contributions
 a+0.006
+0.005
 big+0.005
 two+0.005
 and+0.004
Neg logit contributions
ittered-0.032
immer-0.032
rily-0.032
inations-0.032
lights-0.031
Token three
Token ID1115
Feature activation-7.43e-03

Pos logit contributions
+0.058
 big+0.055
 and+0.050
.+0.049
 the+0.047
Neg logit contributions
icians-0.171
arians-0.170
inations-0.170
immer-0.169
comb-0.168
Token,
Token ID11
Feature activation-5.22e-02

Pos logit contributions
 a+0.007
 two+0.006
 big+0.006
+0.006
 the+0.005
Neg logit contributions
rily-0.030
ittered-0.030
inations-0.030
lights-0.029
immer-0.029
Token go
Token ID467
Feature activation-8.01e-03

Pos logit contributions
+0.061
 big+0.061
 and+0.053
.+0.053
 a+0.053
Neg logit contributions
immer-0.167
arians-0.166
icians-0.166
inations-0.165
comb-0.164
Token!"
Token ID2474
Feature activation-9.93e-03

Pos logit contributions
 a+0.014
 two+0.014
 big+0.013
+0.012
 little+0.012
Neg logit contributions
rily-0.025
ittered-0.025
inations-0.025
immer-0.024
ge-0.024
Token
Token ID198
Feature activation-3.74e-03

Pos logit contributions
 two+0.009
 big+0.009
 a+0.009
 next+0.007
 little+0.007
Neg logit contributions
rily-0.038
lights-0.036
immer-0.035
ittered-0.035
plets-0.035
Token
Token ID198
Feature activation+4.83e-03

Pos logit contributions
 two+0.002
 a+0.002
 big+0.002
 one+0.001
 the+0.001
Neg logit contributions
rily-0.016
immer-0.016
ittered-0.016
igi-0.015
lights-0.015
TokenThey
Token ID2990
Feature activation-8.96e-03

Pos logit contributions
immer+0.015
rily+0.015
fer+0.014
ittered+0.014
vation+0.014
Neg logit contributions
 a-0.009
 big-0.009
 two-0.009
 the-0.008
 and-0.008
Token.
Token ID13
Feature activation+5.96e-03

Pos logit contributions
 a+0.005
 two+0.005
 big+0.004
+0.004
 the+0.003
Neg logit contributions
rily-0.030
lights-0.029
ittered-0.029
immer-0.029
inations-0.029
Token "
Token ID366
Feature activation-5.68e-03

Pos logit contributions
rily+0.021
ittered+0.021
immer+0.021
lights+0.020
epad+0.020
Neg logit contributions
 a-0.009
 two-0.008
 big-0.008
 the-0.008
 and-0.008
TokenOne
Token ID3198
Feature activation-1.14e-02

Pos logit contributions
 a+0.014
 big+0.014
 the+0.014
 and+0.014
 two+0.013
Neg logit contributions
immer-0.014
rily-0.013
ittered-0.013
epad-0.013
inations-0.013
Token,
Token ID11
Feature activation-4.59e-02

Pos logit contributions
 a+0.009
 two+0.008
+0.007
 big+0.007
 and+0.007
Neg logit contributions
immer-0.046
lights-0.045
ittered-0.045
inations-0.044
adows-0.044
Token two
Token ID734
Feature activation-7.77e-03

Pos logit contributions
+0.027
 big+0.022
 and+0.020
.+0.020
 a+0.016
Neg logit contributions
 avail-0.168
icians-0.167
inations-0.167
arians-0.166
plets-0.165
Token,
Token ID11
Feature activation-5.20e-02

Pos logit contributions
 a+0.006
+0.005
 big+0.005
 two+0.005
 and+0.004
Neg logit contributions
ittered-0.032
immer-0.032
rily-0.032
inations-0.032
lights-0.031
Token three
Token ID1115
Feature activation-7.43e-03

Pos logit contributions
+0.058
 big+0.055
 and+0.050
.+0.049
 the+0.047
Neg logit contributions
icians-0.171
arians-0.170
inations-0.170
immer-0.169
comb-0.168
Token,
Token ID11
Feature activation-5.22e-02

Pos logit contributions
 a+0.007
 two+0.006
 big+0.006
+0.006
 the+0.005
Neg logit contributions
rily-0.030
ittered-0.030
inations-0.030
lights-0.029
immer-0.029
Token go
Token ID467
Feature activation-8.01e-03

Pos logit contributions
+0.061
 big+0.061
 and+0.053
.+0.053
 a+0.053
Neg logit contributions
immer-0.167
arians-0.166
icians-0.166
inations-0.165
comb-0.164
Token!"
Token ID2474
Feature activation-9.93e-03

Pos logit contributions
 a+0.014
 two+0.014
 big+0.013
+0.012
 little+0.012
Neg logit contributions
rily-0.025
ittered-0.025
inations-0.025
immer-0.024
ge-0.024
Token
Token ID198
Feature activation-3.74e-03

Pos logit contributions
 two+0.009
 big+0.009
 a+0.009
 next+0.007
 little+0.007
Neg logit contributions
rily-0.038
lights-0.036
immer-0.035
ittered-0.035
plets-0.035
Token.
Token ID13
Feature activation-2.72e-03

Pos logit contributions
 a+0.005
 two+0.005
 big+0.004
+0.004
 right+0.003
Neg logit contributions
rily-0.032
lights-0.032
ittered-0.032
inations-0.031
immer-0.031
Token "
Token ID366
Feature activation-1.82e-02

Pos logit contributions
 a+0.003
 two+0.003
 big+0.003
 little+0.003
 the+0.003
Neg logit contributions
rily-0.010
ittered-0.010
lights-0.010
immer-0.010
eway-0.010
TokenOne
Token ID3198
Feature activation-4.16e-03

Pos logit contributions
 a+0.043
 big+0.042
 the+0.042
 two+0.041
 little+0.041
Neg logit contributions
inations-0.043
rily-0.043
immer-0.043
epad-0.043
icians-0.043
Token,
Token ID11
Feature activation-4.81e-02

Pos logit contributions
 a+0.003
 two+0.003
 big+0.003
+0.002
 the+0.002
Neg logit contributions
immer-0.017
ittered-0.017
lights-0.017
inations-0.017
rily-0.016
Token two
Token ID734
Feature activation-1.34e-03

Pos logit contributions
+0.036
.+0.029
 and+0.028
 big+0.026
 a+0.025
Neg logit contributions
icians-0.177
inations-0.177
arians-0.173
lights-0.173
plets-0.171
Token,
Token ID11
Feature activation-5.13e-02

Pos logit contributions
 a+0.001
 two+0.001
 big+0.001
+0.001
 the+0.001
Neg logit contributions
ittered-0.006
rily-0.006
immer-0.006
inations-0.005
lights-0.005
Token three
Token ID1115
Feature activation-4.85e-03

Pos logit contributions
+0.060
 big+0.054
 and+0.052
.+0.050
 a+0.049
Neg logit contributions
icians-0.170
inations-0.170
arians-0.168
immer-0.164
rily-0.164
Token!"
Token ID2474
Feature activation-1.86e-02

Pos logit contributions
 a+0.005
 two+0.005
 big+0.005
+0.004
 the+0.004
Neg logit contributions
rily-0.019
ittered-0.019
inations-0.019
lights-0.019
immer-0.019
Token
Token ID198
Feature activation-1.20e-02

Pos logit contributions
 big+0.021
 two+0.021
 next+0.020
 a+0.020
 right+0.019
Neg logit contributions
lights-0.064
rily-0.062
eway-0.059
ittered-0.059
immer-0.058
Token
Token ID198
Feature activation-8.93e-03

Pos logit contributions
 two+0.008
 a+0.008
 big+0.006
 right+0.005
 and+0.005
Neg logit contributions
rily-0.049
ittered-0.049
lights-0.049
immer-0.049
inations-0.048
TokenThey
Token ID2990
Feature activation-5.39e-03

Pos logit contributions
 a+0.016
 big+0.015
 two+0.014
 and+0.014
 the+0.014
Neg logit contributions
immer-0.027
rily-0.027
ittered-0.026
igi-0.026
lights-0.026
Token.
Token ID13
Feature activation+3.53e-03

Pos logit contributions
 two+0.010
 a+0.009
 big+0.008
 little+0.007
+0.007
Neg logit contributions
rily-0.063
lights-0.063
ittered-0.061
immer-0.061
epad-0.059
Token "
Token ID366
Feature activation-8.05e-03

Pos logit contributions
rily+0.012
ittered+0.012
lights+0.011
immer+0.011
keeping+0.011
Neg logit contributions
 a-0.006
 two-0.006
 big-0.006
 the-0.005
 and-0.005
TokenOne
Token ID3198
Feature activation-1.05e-02

Pos logit contributions
 a+0.018
 big+0.018
 the+0.017
 and+0.017
 two+0.017
Neg logit contributions
rily-0.021
immer-0.021
ittered-0.021
epad-0.021
inations-0.021
Token,
Token ID11
Feature activation-4.73e-02

Pos logit contributions
 a+0.009
 two+0.008
+0.007
 big+0.007
 the+0.007
Neg logit contributions
immer-0.041
inations-0.040
ittered-0.040
adows-0.039
rated-0.039
Token two
Token ID734
Feature activation-5.06e-03

Pos logit contributions
+0.041
 big+0.035
.+0.035
 and+0.034
 a+0.031
Neg logit contributions
arians-0.160
inations-0.160
 avail-0.160
epad-0.159
comb-0.159
Token,
Token ID11
Feature activation-5.13e-02

Pos logit contributions
 a+0.004
+0.004
 two+0.003
 big+0.003
 the+0.003
Neg logit contributions
ittered-0.021
inations-0.021
immer-0.021
rily-0.021
lights-0.020
Token three
Token ID1115
Feature activation-2.27e-03

Pos logit contributions
+0.063
 big+0.059
 and+0.053
.+0.053
 a+0.053
Neg logit contributions
inations-0.165
arians-0.165
icians-0.163
epad-0.163
immer-0.159
Token,
Token ID11
Feature activation-4.80e-02

Pos logit contributions
 a+0.003
 two+0.002
 big+0.002
+0.002
 the+0.002
Neg logit contributions
rily-0.009
inations-0.009
ittered-0.009
lights-0.009
immer-0.009
Token four
Token ID1440
Feature activation-3.06e-03

Pos logit contributions
+0.065
 big+0.064
 a+0.058
 the+0.057
.+0.055
Neg logit contributions
arians-0.148
inations-0.147
icians-0.146
immer-0.145
epad-0.144
Token,
Token ID11
Feature activation-4.36e-02

Pos logit contributions
 a+0.003
 big+0.003
 two+0.003
+0.003
 the+0.002
Neg logit contributions
ittered-0.012
rily-0.012
inations-0.012
immer-0.012
lights-0.012
Token five
Token ID1936
Feature activation-1.09e-03

Pos logit contributions
 big+0.072
+0.072
 a+0.069
 the+0.067
.+0.063
Neg logit contributions
arians-0.123
immer-0.121
icians-0.121
rily-0.120
inations-0.119
TokenOne
Token ID3198
Feature activation-7.83e-03

Pos logit contributions
rily+0.042
immer+0.042
ittered+0.042
arians+0.041
inations+0.041
Neg logit contributions
 a-0.037
 big-0.036
 two-0.036
 the-0.035
 and-0.034
Token,
Token ID11
Feature activation-3.89e-02

Pos logit contributions
 a+0.006
 two+0.006
+0.005
 big+0.005
 and+0.005
Neg logit contributions
immer-0.031
ittered-0.030
lights-0.030
inations-0.030
 avail-0.030
Token two
Token ID734
Feature activation+1.64e-03

Pos logit contributions
+0.028
 big+0.022
 and+0.020
.+0.020
 a+0.019
Neg logit contributions
icians-0.146
arians-0.146
epad-0.145
inations-0.145
rily-0.144
Token,
Token ID11
Feature activation-4.71e-02

Pos logit contributions
ittered+0.007
rily+0.007
immer+0.007
inations+0.007
lights+0.007
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
-0.001
 the-0.001
Token three
Token ID1115
Feature activation-4.16e-03

Pos logit contributions
+0.058
 big+0.051
 and+0.048
 a+0.048
.+0.047
Neg logit contributions
arians-0.155
icians-0.154
inations-0.153
rily-0.152
immer-0.152
Token,
Token ID11
Feature activation-5.06e-02

Pos logit contributions
 a+0.004
 two+0.003
 big+0.003
+0.003
 the+0.002
Neg logit contributions
rily-0.017
ittered-0.017
lights-0.017
inations-0.017
immer-0.017
Token four
Token ID1440
Feature activation-3.82e-03

Pos logit contributions
+0.067
 big+0.064
 a+0.058
 the+0.057
 was+0.057
Neg logit contributions
arians-0.158
immer-0.158
icians-0.156
rily-0.156
inations-0.155
Token,
Token ID11
Feature activation-4.30e-02

Pos logit contributions
 a+0.003
 two+0.003
 big+0.003
+0.003
 the+0.002
Neg logit contributions
ittered-0.016
rily-0.016
immer-0.015
inations-0.015
lights-0.015
Token five
Token ID1936
Feature activation-2.97e-03

Pos logit contributions
+0.074
 big+0.072
 a+0.069
 the+0.067
.+0.064
Neg logit contributions
immer-0.121
rily-0.120
arians-0.119
ittered-0.119
icians-0.117
Token..."
Token ID9313
Feature activation+1.43e-03

Pos logit contributions
 a+0.003
 two+0.002
 big+0.002
+0.002
 the+0.002
Neg logit contributions
rily-0.012
ittered-0.012
inations-0.012
immer-0.012
lights-0.012
Token She
Token ID1375
Feature activation-1.71e-02

Pos logit contributions
rily+0.006
immer+0.006
ittered+0.006
inations+0.006
lights+0.006
Neg logit contributions
 big-0.001
 a-0.001
 two-0.001
 the-0.001
 little-0.001
Token.
Token ID13
Feature activation-1.08e-03

Pos logit contributions
rily+0.004
immer+0.004
ittered+0.004
lights+0.004
inations+0.003
Neg logit contributions
 two-0.001
 a-0.001
 big-0.001
-0.000
 little-0.000
Token "
Token ID366
Feature activation-6.14e-03

Pos logit contributions
 a+0.001
 two+0.001
 big+0.001
 the+0.001
 and+0.001
Neg logit contributions
rily-0.004
immer-0.004
ittered-0.004
lights-0.004
 avail-0.004
TokenOne
Token ID3198
Feature activation-4.55e-04

Pos logit contributions
 a+0.013
 big+0.012
 the+0.012
 two+0.012
 and+0.011
Neg logit contributions
rily-0.018
immer-0.018
ittered-0.017
epad-0.017
 avail-0.017
Token,
Token ID11
Feature activation-3.91e-02

Pos logit contributions
 a+0.001
 two+0.001
+0.001
 big+0.001
 the+0.001
Neg logit contributions
immer-0.002
inations-0.002
lights-0.002
ittered-0.002
adows-0.002
Token two
Token ID734
Feature activation+7.55e-03

Pos logit contributions
+0.030
.+0.025
 big+0.024
 and+0.022
 a+0.021
Neg logit contributions
arians-0.143
 avail-0.142
rily-0.142
icians-0.141
epad-0.141
Token,
Token ID11
Feature activation-5.02e-02

Pos logit contributions
rily+0.032
ittered+0.032
immer+0.032
inations+0.032
lights+0.031
Neg logit contributions
 a-0.006
 two-0.005
 big-0.005
-0.005
 the-0.004
Token three
Token ID1115
Feature activation+8.35e-03

Pos logit contributions
+0.059
 big+0.055
.+0.051
 and+0.049
 a+0.049
Neg logit contributions
arians-0.169
icians-0.167
rily-0.166
inations-0.164
immer-0.164
Token,
Token ID11
Feature activation-4.76e-02

Pos logit contributions
rily+0.032
ittered+0.032
inations+0.031
immer+0.031
lights+0.031
Neg logit contributions
 a-0.011
 two-0.010
 big-0.010
-0.009
 the-0.009
Token four
Token ID1440
Feature activation+8.66e-03

Pos logit contributions
+0.056
 big+0.055
.+0.049
 a+0.049
 the+0.045
Neg logit contributions
arians-0.159
rily-0.157
immer-0.157
icians-0.157
inations-0.155
Token,"
Token ID553
Feature activation-1.97e-02

Pos logit contributions
rily+0.036
ittered+0.036
inations+0.035
immer+0.035
lights+0.035
Neg logit contributions
 a-0.008
 two-0.007
 big-0.007
-0.007
 the-0.006
Token she
Token ID673
Feature activation-4.48e-03

Pos logit contributions
 big+0.027
 a+0.027
 two+0.026
 and+0.026
 right+0.024
Neg logit contributions
 avail-0.065
rily-0.063
lights-0.063
ittered-0.061
immer-0.060
TokenOne
Token ID3198
Feature activation-1.49e-03

Pos logit contributions
ittered+0.004
rily+0.004
immer+0.004
inations+0.004
arians+0.004
Neg logit contributions
 a-0.005
 big-0.005
 and-0.005
 the-0.005
 two-0.005
Token,
Token ID11
Feature activation-4.46e-02

Pos logit contributions
 a+0.001
 two+0.001
+0.001
 big+0.001
 the+0.001
Neg logit contributions
immer-0.006
inations-0.006
lights-0.006
ittered-0.006
 avail-0.006
Token two
Token ID734
Feature activation+5.12e-03

Pos logit contributions
+0.027
 big+0.023
 and+0.022
.+0.020
 a+0.017
Neg logit contributions
inations-0.172
icians-0.170
 avail-0.169
epad-0.169
rily-0.169
Token,
Token ID11
Feature activation-4.93e-02

Pos logit contributions
ittered+0.022
rily+0.022
immer+0.022
inations+0.022
lights+0.021
Neg logit contributions
 a-0.004
 big-0.003
 two-0.003
-0.003
 the-0.003
Token three
Token ID1115
Feature activation+1.31e-03

Pos logit contributions
+0.054
 big+0.051
 and+0.047
 a+0.045
.+0.044
Neg logit contributions
icians-0.168
inations-0.168
arians-0.166
immer-0.165
rily-0.164
Token,
Token ID11
Feature activation-4.96e-02

Pos logit contributions
rily+0.005
ittered+0.005
inations+0.005
immer+0.005
lights+0.005
Neg logit contributions
 a-0.002
 two-0.001
 big-0.001
-0.001
 the-0.001
Token four
Token ID1440
Feature activation+3.18e-03

Pos logit contributions
+0.058
 big+0.058
 a+0.050
 and+0.050
 the+0.049
Neg logit contributions
icians-0.164
arians-0.163
immer-0.163
inations-0.162
rily-0.161
Token,
Token ID11
Feature activation-4.42e-02

Pos logit contributions
rily+0.012
ittered+0.012
inations+0.012
immer+0.012
lights+0.012
Neg logit contributions
 a-0.004
 big-0.003
 two-0.003
-0.003
 the-0.003
Token five
Token ID1936
Feature activation+4.18e-03

Pos logit contributions
 big+0.069
+0.069
 a+0.064
 the+0.062
 and+0.061
Neg logit contributions
immer-0.131
rily-0.130
icians-0.130
arians-0.130
inations-0.127
Token,
Token ID11
Feature activation-4.14e-02

Pos logit contributions
rily+0.015
ittered+0.015
inations+0.015
immer+0.015
lights+0.015
Neg logit contributions
 a-0.006
 two-0.005
 big-0.005
-0.005
 the-0.005
Token six
Token ID2237
Feature activation+5.17e-03

Pos logit contributions
 big+0.070
+0.069
 a+0.065
 the+0.063
 was+0.061
Neg logit contributions
icians-0.118
arians-0.118
rily-0.118
immer-0.117
ittered-0.115
Token.
Token ID13
Feature activation-5.60e-03

Pos logit contributions
rily+0.019
immer+0.019
ittered+0.018
comb+0.018
lights+0.018
Neg logit contributions
 a-0.003
 two-0.003
 big-0.002
 the-0.002
-0.002
Token "
Token ID366
Feature activation-4.90e-03

Pos logit contributions
 a+0.006
 big+0.006
 two+0.006
 little+0.005
 the+0.005
Neg logit contributions
ittered-0.021
rily-0.021
immer-0.020
keeping-0.020
eway-0.020
TokenOne
Token ID3198
Feature activation-8.98e-03

Pos logit contributions
 a+0.011
 big+0.011
 the+0.011
 and+0.011
 two+0.011
Neg logit contributions
rily-0.013
immer-0.013
ittered-0.013
epad-0.012
rol-0.012
Token,
Token ID11
Feature activation-4.78e-02

Pos logit contributions
 a+0.009
 two+0.008
+0.007
 big+0.007
 and+0.007
Neg logit contributions
immer-0.035
rily-0.034
lights-0.034
ittered-0.034
inations-0.034
Token two
Token ID734
Feature activation-9.02e-03

Pos logit contributions
+0.036
.+0.031
 big+0.029
 and+0.029
 a+0.025
Neg logit contributions
rily-0.169
comb-0.167
 avail-0.166
 pounds-0.164
heit-0.162
Token,
Token ID11
Feature activation-4.94e-02

Pos logit contributions
 a+0.008
+0.007
 two+0.007
 big+0.007
 and+0.006
Neg logit contributions
rily-0.036
immer-0.036
ittered-0.036
inations-0.036
lights-0.035
Token three
Token ID1115
Feature activation-3.83e-03

Pos logit contributions
+0.050
 big+0.047
 and+0.043
.+0.043
 a+0.040
Neg logit contributions
rily-0.170
immer-0.166
inations-0.166
 pounds-0.164
icians-0.164
Token!"
Token ID2474
Feature activation-1.89e-02

Pos logit contributions
 a+0.005
 two+0.005
 big+0.005
+0.004
 the+0.004
Neg logit contributions
rily-0.014
ittered-0.014
immer-0.014
inations-0.014
lights-0.014
Token she
Token ID673
Feature activation-1.86e-03

Pos logit contributions
 two+0.032
 big+0.032
 a+0.032
 little+0.029
 next+0.029
Neg logit contributions
rily-0.056
lights-0.054
 avail-0.054
ittered-0.054
immer-0.053
Token said
Token ID531
Feature activation+1.23e-02

Pos logit contributions
 a+0.003
 big+0.003
 two+0.003
+0.002
 the+0.002
Neg logit contributions
rily-0.006
ittered-0.006
inations-0.006
immer-0.006
fer-0.006
Token.
Token ID13
Feature activation-9.59e-03

Pos logit contributions
rily+0.056
ittered+0.055
immer+0.055
lights+0.053
 avail+0.053
Neg logit contributions
 a-0.007
 two-0.006
 big-0.006
-0.005
 and-0.004
Token\n
Token ID198
Feature activation-9.47e-03

Pos logit contributions
 a+0.007
 two+0.007
 big+0.006
 and+0.004
 little+0.004
Neg logit contributions
immer-0.047
rily-0.047
ittered-0.047
inations-0.046
lights-0.046
Token"
Token ID1
Feature activation+1.89e-03

Pos logit contributions
 a+0.016
 big+0.015
 the+0.014
 two+0.014
 and+0.014
Neg logit contributions
immer-0.031
rily-0.030
inations-0.030
ge-0.029
leness-0.029
TokenOne
Token ID3198
Feature activation-1.49e-03

Pos logit contributions
ittered+0.004
rily+0.004
immer+0.004
inations+0.004
arians+0.004
Neg logit contributions
 a-0.005
 big-0.005
 and-0.005
 the-0.005
 two-0.005
Token,
Token ID11
Feature activation-4.46e-02

Pos logit contributions
 a+0.001
 two+0.001
+0.001
 big+0.001
 the+0.001
Neg logit contributions
immer-0.006
inations-0.006
lights-0.006
ittered-0.006
 avail-0.006
Token two
Token ID734
Feature activation+5.12e-03

Pos logit contributions
+0.027
 big+0.023
 and+0.022
.+0.020
 a+0.017
Neg logit contributions
inations-0.172
icians-0.170
 avail-0.169
epad-0.169
rily-0.169
Token,
Token ID11
Feature activation-4.93e-02

Pos logit contributions
ittered+0.022
rily+0.022
immer+0.022
inations+0.022
lights+0.021
Neg logit contributions
 a-0.004
 big-0.003
 two-0.003
-0.003
 the-0.003
Token three
Token ID1115
Feature activation+1.31e-03

Pos logit contributions
+0.054
 big+0.051
 and+0.047
 a+0.045
.+0.044
Neg logit contributions
icians-0.168
inations-0.168
arians-0.166
immer-0.165
rily-0.164
Token,
Token ID11
Feature activation-4.96e-02

Pos logit contributions
rily+0.005
ittered+0.005
inations+0.005
immer+0.005
lights+0.005
Neg logit contributions
 a-0.002
 two-0.001
 big-0.001
-0.001
 the-0.001
Token four
Token ID1440
Feature activation+3.18e-03

Pos logit contributions
+0.058
 big+0.058
 a+0.050
 and+0.050
 the+0.049
Neg logit contributions
icians-0.164
arians-0.163
immer-0.163
inations-0.162
rily-0.161
Token,
Token ID11
Feature activation-4.42e-02

Pos logit contributions
rily+0.012
ittered+0.012
inations+0.012
immer+0.012
lights+0.012
Neg logit contributions
 a-0.004
 big-0.003
 two-0.003
-0.003
 the-0.003
Token five
Token ID1936
Feature activation+4.18e-03

Pos logit contributions
 big+0.069
+0.069
 a+0.064
 the+0.062
 and+0.061
Neg logit contributions
immer-0.131
rily-0.130
icians-0.130
arians-0.130
inations-0.127
Token.
Token ID13
Feature activation-1.96e-03

Pos logit contributions
rily+0.062
ittered+0.062
lights+0.060
immer+0.060
inations+0.059
Neg logit contributions
 a-0.008
 two-0.008
 big-0.007
-0.005
 little-0.005
Token "
Token ID366
Feature activation-8.35e-03

Pos logit contributions
 a+0.002
 two+0.002
 big+0.002
 the+0.001
 and+0.001
Neg logit contributions
rily-0.008
lights-0.008
ittered-0.008
immer-0.008
inations-0.007
TokenOne
Token ID3198
Feature activation+1.75e-03

Pos logit contributions
 a+0.018
 big+0.018
 the+0.018
 two+0.017
 and+0.017
Neg logit contributions
immer-0.023
ittered-0.023
rily-0.022
inations-0.022
epad-0.022
Token,
Token ID11
Feature activation-4.52e-02

Pos logit contributions
immer+0.007
ittered+0.007
inations+0.007
lights+0.007
adows+0.007
Neg logit contributions
 a-0.001
 two-0.001
-0.001
 big-0.001
 the-0.001
Token two
Token ID734
Feature activation+4.23e-03

Pos logit contributions
+0.028
 and+0.020
.+0.019
 big+0.018
 the+0.014
Neg logit contributions
inations-0.178
icians-0.174
lights-0.173
comb-0.172
 avail-0.171
Token,
Token ID11
Feature activation-4.89e-02

Pos logit contributions
ittered+0.018
rily+0.018
immer+0.018
inations+0.017
lights+0.017
Neg logit contributions
 a-0.003
 two-0.003
-0.003
 big-0.003
 the-0.003
Token three
Token ID1115
Feature activation+1.56e-03

Pos logit contributions
+0.050
 big+0.042
 and+0.041
.+0.039
 the+0.039
Neg logit contributions
inations-0.172
icians-0.170
arians-0.168
comb-0.168
immer-0.167
Token..."
Token ID9313
Feature activation-1.40e-02

Pos logit contributions
rily+0.006
ittered+0.006
inations+0.006
lights+0.006
immer+0.006
Neg logit contributions
 a-0.002
 two-0.001
 big-0.001
-0.001
 the-0.001
Token\n
Token ID198
Feature activation-5.69e-03

Pos logit contributions
 big+0.008
 a+0.007
 two+0.007
 the+0.006
 little+0.005
Neg logit contributions
rily-0.061
immer-0.060
lights-0.059
inations-0.058
ittered-0.058
Token\n
Token ID198
Feature activation+2.96e-03

Pos logit contributions
 two+0.003
 a+0.003
 big+0.002
 the+0.002
 and+0.002
Neg logit contributions
lights-0.025
rily-0.025
immer-0.024
ittered-0.024
inations-0.024
TokenL
Token ID43
Feature activation+2.22e-02

Pos logit contributions
immer+0.009
rily+0.009
lights+0.009
inations+0.009
ittered+0.008
Neg logit contributions
 a-0.006
 big-0.006
 the-0.005
 two-0.005
 and-0.005
TokenOne
Token ID3198
Feature activation-2.39e-04

Pos logit contributions
rily+0.003
immer+0.003
ittered+0.003
inations+0.003
epad+0.002
Neg logit contributions
 a-0.002
 big-0.002
 two-0.002
 the-0.002
 and-0.002
Token,
Token ID11
Feature activation-3.86e-02

Pos logit contributions
 a+0.000
 two+0.000
+0.000
 big+0.000
 the+0.000
Neg logit contributions
immer-0.001
lights-0.001
inations-0.001
ittered-0.001
rily-0.001
Token two
Token ID734
Feature activation-1.29e-03

Pos logit contributions
+0.023
 and+0.017
 big+0.016
.+0.014
 next+0.011
Neg logit contributions
lights-0.150
rily-0.149
inations-0.148
 avail-0.147
epad-0.143
Token,
Token ID11
Feature activation-4.61e-02

Pos logit contributions
 a+0.001
+0.001
 big+0.001
 two+0.001
 the+0.001
Neg logit contributions
inations-0.005
ittered-0.005
immer-0.005
lights-0.005
rily-0.005
Token three
Token ID1115
Feature activation+1.01e-03

Pos logit contributions
+0.040
 big+0.036
 and+0.034
.+0.031
 the+0.027
Neg logit contributions
inations-0.169
icians-0.165
rily-0.165
immer-0.165
 avail-0.164
Token,
Token ID11
Feature activation-4.85e-02

Pos logit contributions
inations+0.004
rily+0.004
ittered+0.004
lights+0.004
immer+0.004
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
-0.001
 the-0.001
Token four
Token ID1440
Feature activation-3.46e-04

Pos logit contributions
+0.046
 big+0.043
 and+0.037
.+0.036
 was+0.034
Neg logit contributions
inations-0.171
immer-0.169
icians-0.168
rily-0.168
arians-0.167
Token,"
Token ID553
Feature activation-1.62e-02

Pos logit contributions
 a+0.000
 two+0.000
 big+0.000
+0.000
 the+0.000
Neg logit contributions
inations-0.001
ittered-0.001
rily-0.001
lights-0.001
immer-0.001
Token she
Token ID673
Feature activation-1.07e-02

Pos logit contributions
 big+0.020
 a+0.019
 two+0.019
 and+0.019
 right+0.017
Neg logit contributions
 avail-0.054
lights-0.053
rily-0.053
ittered-0.051
inations-0.050
Token counts
Token ID9853
Feature activation-1.35e-03

Pos logit contributions
 a+0.023
 big+0.022
 two+0.022
 the+0.022
+0.021
Neg logit contributions
rily-0.028
inations-0.027
immer-0.027
ittered-0.027
arians-0.027
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
 two+0.001
 a+0.001
 big+0.001
 little+0.001
 and+0.001
Neg logit contributions
rily-0.006
ittered-0.006
lights-0.006
immer-0.006
ge-0.005
Token to
Token ID284
Feature activation-1.08e-02

Pos logit contributions
 two+0.001
 a+0.001
 big+0.001
 little+0.001
+0.001
Neg logit contributions
rily-0.007
ittered-0.007
lights-0.007
immer-0.007
ge-0.007
Token three
Token ID1115
Feature activation-7.49e-03

Pos logit contributions
 two+0.016
 a+0.016
+0.016
 big+0.016
 and+0.015
Neg logit contributions
rily-0.035
ittered-0.035
 avail-0.034
epad-0.034
immer-0.033
Token.
Token ID13
Feature activation-2.72e-03

Pos logit contributions
 a+0.005
 two+0.005
 big+0.004
+0.004
 right+0.003
Neg logit contributions
rily-0.032
lights-0.032
ittered-0.032
inations-0.031
immer-0.031
Token "
Token ID366
Feature activation-1.82e-02

Pos logit contributions
 a+0.003
 two+0.003
 big+0.003
 little+0.003
 the+0.003
Neg logit contributions
rily-0.010
ittered-0.010
lights-0.010
immer-0.010
eway-0.010
TokenOne
Token ID3198
Feature activation-4.16e-03

Pos logit contributions
 a+0.043
 big+0.042
 the+0.042
 two+0.041
 little+0.041
Neg logit contributions
inations-0.043
rily-0.043
immer-0.043
epad-0.043
icians-0.043
Token,
Token ID11
Feature activation-4.81e-02

Pos logit contributions
 a+0.003
 two+0.003
 big+0.003
+0.002
 the+0.002
Neg logit contributions
immer-0.017
ittered-0.017
lights-0.017
inations-0.017
rily-0.016
Token two
Token ID734
Feature activation-1.34e-03

Pos logit contributions
+0.036
.+0.029
 and+0.028
 big+0.026
 a+0.025
Neg logit contributions
icians-0.177
inations-0.177
arians-0.173
lights-0.173
plets-0.171
Token,
Token ID11
Feature activation-5.13e-02

Pos logit contributions
 a+0.001
 two+0.001
 big+0.001
+0.001
 the+0.001
Neg logit contributions
ittered-0.006
rily-0.006
immer-0.006
inations-0.005
lights-0.005
Token three
Token ID1115
Feature activation-4.85e-03

Pos logit contributions
+0.060
 big+0.054
 and+0.052
.+0.050
 a+0.049
Neg logit contributions
icians-0.170
inations-0.170
arians-0.168
immer-0.164
rily-0.164
Token!"
Token ID2474
Feature activation-1.86e-02

Pos logit contributions
 a+0.005
 two+0.005
 big+0.005
+0.004
 the+0.004
Neg logit contributions
rily-0.019
ittered-0.019
inations-0.019
lights-0.019
immer-0.019
Token\n
Token ID198
Feature activation-1.20e-02

Pos logit contributions
 big+0.021
 two+0.021
 next+0.020
 a+0.020
 right+0.019
Neg logit contributions
lights-0.064
rily-0.062
eway-0.059
ittered-0.059
immer-0.058
TokenOne
Token ID3198
Feature activation-1.05e-02

Pos logit contributions
 a+0.018
 big+0.018
 the+0.017
 and+0.017
 two+0.017
Neg logit contributions
rily-0.021
immer-0.021
ittered-0.021
epad-0.021
inations-0.021
Token,
Token ID11
Feature activation-4.73e-02

Pos logit contributions
 a+0.009
 two+0.008
+0.007
 big+0.007
 the+0.007
Neg logit contributions
immer-0.041
inations-0.040
ittered-0.040
adows-0.039
rated-0.039
Token two
Token ID734
Feature activation-5.06e-03

Pos logit contributions
+0.041
 big+0.035
.+0.035
 and+0.034
 a+0.031
Neg logit contributions
arians-0.160
inations-0.160
 avail-0.160
epad-0.159
comb-0.159
Token,
Token ID11
Feature activation-5.13e-02

Pos logit contributions
 a+0.004
+0.004
 two+0.003
 big+0.003
 the+0.003
Neg logit contributions
ittered-0.021
inations-0.021
immer-0.021
rily-0.021
lights-0.020
Token three
Token ID1115
Feature activation-2.27e-03

Pos logit contributions
+0.063
 big+0.059
 and+0.053
.+0.053
 a+0.053
Neg logit contributions
inations-0.165
arians-0.165
icians-0.163
epad-0.163
immer-0.159
Token,
Token ID11
Feature activation-4.80e-02

Pos logit contributions
 a+0.003
 two+0.002
 big+0.002
+0.002
 the+0.002
Neg logit contributions
rily-0.009
inations-0.009
ittered-0.009
lights-0.009
immer-0.009
Token four
Token ID1440
Feature activation-3.06e-03

Pos logit contributions
+0.065
 big+0.064
 a+0.058
 the+0.057
.+0.055
Neg logit contributions
arians-0.148
inations-0.147
icians-0.146
immer-0.145
epad-0.144
Token,
Token ID11
Feature activation-4.36e-02

Pos logit contributions
 a+0.003
 big+0.003
 two+0.003
+0.003
 the+0.002
Neg logit contributions
ittered-0.012
rily-0.012
inations-0.012
immer-0.012
lights-0.012
Token five
Token ID1936
Feature activation-1.09e-03

Pos logit contributions
 big+0.072
+0.072
 a+0.069
 the+0.067
.+0.063
Neg logit contributions
arians-0.123
immer-0.121
icians-0.121
rily-0.120
inations-0.119
Token,
Token ID11
Feature activation-4.06e-02

Pos logit contributions
 a+0.001
 two+0.001
 big+0.001
+0.001
 the+0.001
Neg logit contributions
rily-0.004
inations-0.004
ittered-0.004
immer-0.004
lights-0.004
Token six
Token ID2237
Feature activation-8.46e-04

Pos logit contributions
 big+0.070
+0.068
 a+0.066
 the+0.064
 two+0.061
Neg logit contributions
arians-0.113
icians-0.111
rily-0.111
immer-0.111
ittered-0.110
Token her
Token ID607
Feature activation+2.56e-03

Pos logit contributions
 two+0.006
 big+0.006
 a+0.005
+0.005
 little+0.005
Neg logit contributions
rily-0.040
immer-0.039
ittered-0.039
 avail-0.039
comb-0.038
Token cone
Token ID27763
Feature activation+4.19e-03

Pos logit contributions
rily+0.008
ittered+0.007
immer+0.007
lights+0.007
 footing+0.007
Neg logit contributions
 a-0.005
 two-0.005
 big-0.005
-0.005
 the-0.005
Token.
Token ID13
Feature activation-5.60e-03

Pos logit contributions
rily+0.019
immer+0.019
ittered+0.018
comb+0.018
lights+0.018
Neg logit contributions
 a-0.003
 two-0.003
 big-0.002
 the-0.002
-0.002
Token "
Token ID366
Feature activation-4.90e-03

Pos logit contributions
 a+0.006
 big+0.006
 two+0.006
 little+0.005
 the+0.005
Neg logit contributions
ittered-0.021
rily-0.021
immer-0.020
keeping-0.020
eway-0.020
TokenOne
Token ID3198
Feature activation-8.98e-03

Pos logit contributions
 a+0.011
 big+0.011
 the+0.011
 and+0.011
 two+0.011
Neg logit contributions
rily-0.013
immer-0.013
ittered-0.013
epad-0.012
rol-0.012
Token,
Token ID11
Feature activation-4.78e-02

Pos logit contributions
 a+0.009
 two+0.008
+0.007
 big+0.007
 and+0.007
Neg logit contributions
immer-0.035
rily-0.034
lights-0.034
ittered-0.034
inations-0.034
Token two
Token ID734
Feature activation-9.02e-03

Pos logit contributions
+0.036
.+0.031
 big+0.029
 and+0.029
 a+0.025
Neg logit contributions
rily-0.169
comb-0.167
 avail-0.166
 pounds-0.164
heit-0.162
Token,
Token ID11
Feature activation-4.94e-02

Pos logit contributions
 a+0.008
+0.007
 two+0.007
 big+0.007
 and+0.006
Neg logit contributions
rily-0.036
immer-0.036
ittered-0.036
inations-0.036
lights-0.035
Token three
Token ID1115
Feature activation-3.83e-03

Pos logit contributions
+0.050
 big+0.047
 and+0.043
.+0.043
 a+0.040
Neg logit contributions
rily-0.170
immer-0.166
inations-0.166
 pounds-0.164
icians-0.164
Token!"
Token ID2474
Feature activation-1.89e-02

Pos logit contributions
 a+0.005
 two+0.005
 big+0.005
+0.004
 the+0.004
Neg logit contributions
rily-0.014
ittered-0.014
immer-0.014
inations-0.014
lights-0.014
Token she
Token ID673
Feature activation-1.86e-03

Pos logit contributions
 two+0.032
 big+0.032
 a+0.032
 little+0.029
 next+0.029
Neg logit contributions
rily-0.056
lights-0.054
 avail-0.054
ittered-0.054
immer-0.053
Token\n
Token ID198
Feature activation+1.44e-03

Pos logit contributions
rily+0.009
immer+0.009
ittered+0.009
inations+0.008
lights+0.008
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
 the-0.000
 and-0.000
Token"
Token ID1
Feature activation+2.48e-03

Pos logit contributions
immer+0.005
rily+0.005
inations+0.005
ittered+0.005
leness+0.005
Neg logit contributions
 a-0.002
 big-0.002
 two-0.002
 the-0.002
 and-0.002
TokenOne
Token ID3198
Feature activation-5.00e-03

Pos logit contributions
rily+0.006
immer+0.006
inations+0.006
arians+0.006
ittered+0.006
Neg logit contributions
 a-0.007
 big-0.006
 and-0.006
 the-0.006
 two-0.006
Token,
Token ID11
Feature activation-4.54e-02

Pos logit contributions
 a+0.004
 two+0.004
 big+0.003
+0.003
 the+0.003
Neg logit contributions
immer-0.021
inations-0.020
ittered-0.020
lights-0.020
rily-0.020
Token two
Token ID734
Feature activation+3.91e-03

Pos logit contributions
+0.026
 big+0.022
.+0.018
 a+0.017
 and+0.017
Neg logit contributions
rily-0.182
inations-0.181
arians-0.178
immer-0.178
icians-0.177
Token,
Token ID11
Feature activation-4.77e-02

Pos logit contributions
rily+0.016
ittered+0.016
immer+0.016
inations+0.016
lights+0.016
Neg logit contributions
 a-0.003
 big-0.003
 two-0.003
-0.003
 the-0.003
Token three
Token ID1115
Feature activation+5.63e-04

Pos logit contributions
+0.051
 big+0.048
 and+0.043
 a+0.043
.+0.041
Neg logit contributions
inations-0.166
rily-0.166
arians-0.166
icians-0.165
immer-0.165
Token,
Token ID11
Feature activation-4.54e-02

Pos logit contributions
rily+0.002
ittered+0.002
immer+0.002
inations+0.002
lights+0.002
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
-0.001
 the-0.001
Token four
Token ID1440
Feature activation+2.64e-03

Pos logit contributions
+0.057
 big+0.056
 a+0.051
 the+0.048
 and+0.048
Neg logit contributions
rily-0.150
immer-0.149
arians-0.149
inations-0.149
icians-0.148
Token,
Token ID11
Feature activation-3.93e-02

Pos logit contributions
rily+0.010
ittered+0.010
immer+0.010
inations+0.010
lights+0.009
Neg logit contributions
 a-0.004
 big-0.003
 two-0.003
-0.003
 the-0.003
Token five
Token ID1936
Feature activation+5.85e-04

Pos logit contributions
+0.067
 big+0.066
 a+0.064
 the+0.061
 and+0.059
Neg logit contributions
rily-0.114
immer-0.113
arians-0.111
inations-0.110
icians-0.109
TokenOne
Token ID3198
Feature activation-4.55e-04

Pos logit contributions
 a+0.013
 big+0.012
 the+0.012
 two+0.012
 and+0.011
Neg logit contributions
rily-0.018
immer-0.018
ittered-0.017
epad-0.017
 avail-0.017
Token,
Token ID11
Feature activation-3.91e-02

Pos logit contributions
 a+0.001
 two+0.001
+0.001
 big+0.001
 the+0.001
Neg logit contributions
immer-0.002
inations-0.002
lights-0.002
ittered-0.002
adows-0.002
Token two
Token ID734
Feature activation+7.55e-03

Pos logit contributions
+0.030
.+0.025
 big+0.024
 and+0.022
 a+0.021
Neg logit contributions
arians-0.143
 avail-0.142
rily-0.142
icians-0.141
epad-0.141
Token,
Token ID11
Feature activation-5.02e-02

Pos logit contributions
rily+0.032
ittered+0.032
immer+0.032
inations+0.032
lights+0.031
Neg logit contributions
 a-0.006
 two-0.005
 big-0.005
-0.005
 the-0.004
Token three
Token ID1115
Feature activation+8.35e-03

Pos logit contributions
+0.059
 big+0.055
.+0.051
 and+0.049
 a+0.049
Neg logit contributions
arians-0.169
icians-0.167
rily-0.166
inations-0.164
immer-0.164
Token,
Token ID11
Feature activation-4.76e-02

Pos logit contributions
rily+0.032
ittered+0.032
inations+0.031
immer+0.031
lights+0.031
Neg logit contributions
 a-0.011
 two-0.010
 big-0.010
-0.009
 the-0.009
Token four
Token ID1440
Feature activation+8.66e-03

Pos logit contributions
+0.056
 big+0.055
.+0.049
 a+0.049
 the+0.045
Neg logit contributions
arians-0.159
rily-0.157
immer-0.157
icians-0.157
inations-0.155
Token,"
Token ID553
Feature activation-1.97e-02

Pos logit contributions
rily+0.036
ittered+0.036
inations+0.035
immer+0.035
lights+0.035
Neg logit contributions
 a-0.008
 two-0.007
 big-0.007
-0.007
 the-0.006
Token she
Token ID673
Feature activation-4.48e-03

Pos logit contributions
 big+0.027
 a+0.027
 two+0.026
 and+0.026
 right+0.024
Neg logit contributions
 avail-0.065
rily-0.063
lights-0.063
ittered-0.061
immer-0.060
Token said
Token ID531
Feature activation+2.09e-02

Pos logit contributions
 a+0.006
 big+0.006
 two+0.006
 the+0.006
 and+0.006
Neg logit contributions
ittered-0.015
inations-0.015
immer-0.015
rily-0.015
 avail-0.015
Token.
Token ID13
Feature activation-6.67e-03

Pos logit contributions
rily+0.095
ittered+0.095
immer+0.094
lights+0.092
inations+0.092
Neg logit contributions
 a-0.012
 two-0.011
 big-0.011
-0.008
 the-0.008
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
 two+0.010
 a+0.010
 big+0.010
+0.009
 next+0.008
Neg logit contributions
rily-0.040
ittered-0.038
lights-0.038
immer-0.037
 avail-0.035
Token "
Token ID366
Feature activation+3.30e-03

Pos logit contributions
rily+0.054
ittered+0.053
immer+0.052
lights+0.052
inations+0.051
Neg logit contributions
 a-0.021
 two-0.020
 big-0.020
 and-0.018
 the-0.018
TokenOne
Token ID3198
Feature activation+1.64e-03

Pos logit contributions
rily+0.009
immer+0.009
ittered+0.009
inations+0.009
leness+0.009
Neg logit contributions
 a-0.008
 big-0.007
 the-0.007
 two-0.007
 and-0.007
Token,
Token ID11
Feature activation-4.45e-02

Pos logit contributions
immer+0.007
ittered+0.007
rily+0.006
inations+0.006
lights+0.006
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
-0.001
 the-0.001
Token two
Token ID734
Feature activation+3.87e-03

Pos logit contributions
+0.029
 and+0.021
.+0.021
 big+0.020
 a+0.017
Neg logit contributions
rily-0.171
inations-0.170
icians-0.169
arians-0.168
 avail-0.167
Token,
Token ID11
Feature activation-4.76e-02

Pos logit contributions
ittered+0.016
rily+0.016
immer+0.016
inations+0.016
lights+0.016
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
-0.003
 the-0.002
Token three
Token ID1115
Feature activation+3.87e-03

Pos logit contributions
+0.051
 big+0.044
 and+0.042
.+0.041
 the+0.039
Neg logit contributions
rily-0.166
icians-0.165
arians-0.164
inations-0.164
immer-0.163
Token,
Token ID11
Feature activation-4.49e-02

Pos logit contributions
rily+0.014
ittered+0.014
inations+0.014
immer+0.014
lights+0.014
Neg logit contributions
 a-0.005
 two-0.005
 big-0.005
-0.005
 the-0.004
Token four
Token ID1440
Feature activation+2.01e-03

Pos logit contributions
+0.056
 big+0.052
.+0.047
 a+0.047
 the+0.046
Neg logit contributions
rily-0.148
icians-0.147
immer-0.147
arians-0.147
inations-0.145
Token,"
Token ID553
Feature activation-1.22e-02

Pos logit contributions
rily+0.008
ittered+0.008
immer+0.008
inations+0.007
lights+0.007
Neg logit contributions
 a-0.003
 big-0.002
 two-0.002
-0.002
 the-0.002
Token Ben
Token ID3932
Feature activation-5.78e-03

Pos logit contributions
 big+0.016
 a+0.014
 two+0.014
 and+0.013
 little+0.013
Neg logit contributions
rily-0.042
lights-0.040
 avail-0.040
inations-0.039
leness-0.039
Token.
Token ID13
Feature activation-3.42e-03

Pos logit contributions
 a+0.006
 two+0.006
 big+0.005
+0.005
 little+0.004
Neg logit contributions
rily-0.030
lights-0.029
ittered-0.029
immer-0.029
ge-0.028
Token "
Token ID366
Feature activation+1.61e-03

Pos logit contributions
 a+0.006
 two+0.005
 big+0.005
 and+0.005
 the+0.005
Neg logit contributions
rily-0.011
ittered-0.011
lights-0.011
comb-0.011
arians-0.011
TokenOne
Token ID3198
Feature activation-7.36e-03

Pos logit contributions
immer+0.005
rily+0.005
ittered+0.004
inations+0.004
arians+0.004
Neg logit contributions
 a-0.004
 big-0.003
 two-0.003
 the-0.003
 and-0.003
Token,
Token ID11
Feature activation-4.11e-02

Pos logit contributions
 a+0.008
 two+0.007
 big+0.006
+0.006
 the+0.006
Neg logit contributions
immer-0.029
inations-0.028
ittered-0.028
adows-0.028
lights-0.028
Token two
Token ID734
Feature activation-3.75e-03

Pos logit contributions
+0.038
 big+0.033
.+0.033
 a+0.033
 and+0.033
Neg logit contributions
arians-0.146
icians-0.144
inations-0.143
rily-0.143
comb-0.142
Token,
Token ID11
Feature activation-4.75e-02

Pos logit contributions
 a+0.005
 two+0.005
 big+0.004
+0.004
 the+0.004
Neg logit contributions
ittered-0.014
immer-0.014
rily-0.014
inations-0.014
arians-0.013
Token three
Token ID1115
Feature activation+2.08e-03

Pos logit contributions
+0.057
 big+0.053
 a+0.049
 and+0.049
.+0.048
Neg logit contributions
arians-0.158
icians-0.157
immer-0.156
inations-0.156
rily-0.155
Token,
Token ID11
Feature activation-4.67e-02

Pos logit contributions
rily+0.007
ittered+0.007
immer+0.007
inations+0.007
lights+0.007
Neg logit contributions
 a-0.004
 two-0.003
 big-0.003
-0.003
 the-0.003
Token four
Token ID1440
Feature activation-4.97e-05

Pos logit contributions
+0.058
 big+0.057
 a+0.053
.+0.050
 the+0.050
Neg logit contributions
arians-0.152
icians-0.150
immer-0.150
rily-0.149
inations-0.148
Token,"
Token ID553
Feature activation-1.79e-02

Pos logit contributions
 a+0.000
 two+0.000
 big+0.000
+0.000
 the+0.000
Neg logit contributions
ittered-0.000
rily-0.000
immer-0.000
inations-0.000
lights-0.000
Token she
Token ID673
Feature activation-1.47e-02

Pos logit contributions
 big+0.026
 two+0.025
 a+0.025
 and+0.025
 right+0.024
Neg logit contributions
rily-0.053
comb-0.053
 avail-0.052
lights-0.052
ittered-0.050
Token to
Token ID284
Feature activation-1.47e-02

Pos logit contributions
 and+0.015
 two+0.015
 a+0.013
+0.013
 big+0.011
Neg logit contributions
epad-0.070
inations-0.070
ittered-0.069
lights-0.067
rily-0.067
Token count
Token ID954
Feature activation-1.49e-02

Pos logit contributions
 a+0.036
 two+0.035
 big+0.035
+0.034
 little+0.033
Neg logit contributions
ittered-0.036
immer-0.036
inations-0.035
 avail-0.034
comb-0.034
Token.
Token ID13
Feature activation+3.53e-03

Pos logit contributions
 two+0.010
 a+0.009
 big+0.008
 little+0.007
+0.007
Neg logit contributions
rily-0.063
lights-0.063
ittered-0.061
immer-0.061
epad-0.059
Token "
Token ID366
Feature activation-8.05e-03

Pos logit contributions
rily+0.012
ittered+0.012
lights+0.011
immer+0.011
keeping+0.011
Neg logit contributions
 a-0.006
 two-0.006
 big-0.006
 the-0.005
 and-0.005
TokenOne
Token ID3198
Feature activation-1.05e-02

Pos logit contributions
 a+0.018
 big+0.018
 the+0.017
 and+0.017
 two+0.017
Neg logit contributions
rily-0.021
immer-0.021
ittered-0.021
epad-0.021
inations-0.021
Token,
Token ID11
Feature activation-4.73e-02

Pos logit contributions
 a+0.009
 two+0.008
+0.007
 big+0.007
 the+0.007
Neg logit contributions
immer-0.041
inations-0.040
ittered-0.040
adows-0.039
rated-0.039
Token two
Token ID734
Feature activation-5.06e-03

Pos logit contributions
+0.041
 big+0.035
.+0.035
 and+0.034
 a+0.031
Neg logit contributions
arians-0.160
inations-0.160
 avail-0.160
epad-0.159
comb-0.159
Token,
Token ID11
Feature activation-5.13e-02

Pos logit contributions
 a+0.004
+0.004
 two+0.003
 big+0.003
 the+0.003
Neg logit contributions
ittered-0.021
inations-0.021
immer-0.021
rily-0.021
lights-0.020
Token three
Token ID1115
Feature activation-2.27e-03

Pos logit contributions
+0.063
 big+0.059
 and+0.053
.+0.053
 a+0.053
Neg logit contributions
inations-0.165
arians-0.165
icians-0.163
epad-0.163
immer-0.159
Token,
Token ID11
Feature activation-4.80e-02

Pos logit contributions
 a+0.003
 two+0.002
 big+0.002
+0.002
 the+0.002
Neg logit contributions
rily-0.009
inations-0.009
ittered-0.009
lights-0.009
immer-0.009
Token four
Token ID1440
Feature activation-3.06e-03

Pos logit contributions
+0.065
 big+0.064
 a+0.058
 the+0.057
.+0.055
Neg logit contributions
arians-0.148
inations-0.147
icians-0.146
immer-0.145
epad-0.144
Token\n
Token ID198
Feature activation-5.38e-03

Pos logit contributions
 a+0.004
 two+0.003
 big+0.003
 the+0.002
 little+0.002
Neg logit contributions
rily-0.030
immer-0.030
ittered-0.030
inations-0.030
ently-0.029
Token"
Token ID1
Feature activation+1.01e-02

Pos logit contributions
 a+0.008
 big+0.008
 two+0.008
 little+0.007
 the+0.007
Neg logit contributions
immer-0.018
ge-0.018
ittered-0.018
inations-0.018
rily-0.018
TokenOne
Token ID3198
Feature activation-1.19e-02

Pos logit contributions
immer+0.024
ittered+0.023
rily+0.023
fer+0.022
lights+0.022
Neg logit contributions
 a-0.027
 big-0.027
 and-0.026
 the-0.026
 two-0.026
Token,
Token ID11
Feature activation-4.12e-02

Pos logit contributions
 a+0.010
 two+0.009
+0.008
 big+0.008
 and+0.007
Neg logit contributions
immer-0.047
lights-0.046
ently-0.046
adows-0.045
inations-0.045
Token two
Token ID734
Feature activation-7.48e-03

Pos logit contributions
+0.023
 big+0.019
.+0.016
 and+0.015
 to+0.013
Neg logit contributions
inations-0.155
rily-0.154
 avail-0.154
lights-0.153
icians-0.153
Token,
Token ID11
Feature activation-4.73e-02

Pos logit contributions
 a+0.006
+0.005
 big+0.005
 two+0.005
 the+0.004
Neg logit contributions
immer-0.030
ittered-0.030
rily-0.030
inations-0.030
lights-0.030
Token three
Token ID1115
Feature activation-8.67e-03

Pos logit contributions
+0.044
 big+0.041
.+0.035
 and+0.035
 was+0.032
Neg logit contributions
inations-0.167
icians-0.167
arians-0.165
rily-0.165
immer-0.165
Token,
Token ID11
Feature activation-4.72e-02

Pos logit contributions
 a+0.008
 two+0.007
 big+0.007
+0.007
 the+0.006
Neg logit contributions
rily-0.035
inations-0.034
ittered-0.034
lights-0.034
immer-0.034
Token four
Token ID1440
Feature activation-8.20e-03

Pos logit contributions
+0.049
 big+0.049
.+0.040
 and+0.038
 a+0.038
Neg logit contributions
immer-0.161
rily-0.160
inations-0.159
icians-0.159
arians-0.159
Token,
Token ID11
Feature activation-4.14e-02

Pos logit contributions
 a+0.008
 big+0.007
 two+0.007
+0.007
 the+0.006
Neg logit contributions
rily-0.032
ittered-0.032
inations-0.032
immer-0.032
lights-0.032
Token five
Token ID1936
Feature activation-6.48e-03

Pos logit contributions
+0.064
 big+0.064
 a+0.058
.+0.056
 the+0.056
Neg logit contributions
immer-0.124
rily-0.124
comb-0.121
ittered-0.121
arians-0.121
Token man
Token ID582
Feature activation+1.58e-02

Pos logit contributions
rily+0.021
ittered+0.021
immer+0.021
fer+0.020
comb+0.020
Neg logit contributions
 a-0.014
 two-0.014
 big-0.013
-0.013
 the-0.013
Token looking
Token ID2045
Feature activation+7.35e-03

Pos logit contributions
rily+0.050
immer+0.050
ittered+0.050
inations+0.048
fer+0.048
Neg logit contributions
 a-0.031
 two-0.030
 big-0.030
 the-0.028
-0.027
Token for
Token ID329
Feature activation+1.35e-03

Pos logit contributions
rily+0.028
ittered+0.027
immer+0.027
inations+0.026
epad+0.026
Neg logit contributions
 two-0.009
 a-0.009
-0.009
 big-0.009
 the-0.008
Token something
Token ID1223
Feature activation+6.50e-03

Pos logit contributions
rily+0.005
immer+0.005
ittered+0.005
 avail+0.005
inations+0.005
Neg logit contributions
 two-0.001
 a-0.001
 big-0.001
-0.001
 and-0.001
Token.
Token ID13
Feature activation+5.10e-03

Pos logit contributions
rily+0.025
immer+0.025
ittered+0.025
inations+0.025
ge+0.024
Neg logit contributions
 two-0.007
 a-0.007
 big-0.006
-0.006
 the-0.006
Token Ho
Token ID9544
Feature activation+6.02e-02

Pos logit contributions
rily+0.018
ittered+0.017
immer+0.017
istent+0.016
arians+0.016
Neg logit contributions
 a-0.008
 two-0.008
 big-0.008
 the-0.007
 and-0.007
Tokenppy
Token ID14097
Feature activation-1.93e-02

Pos logit contributions
lights+0.108
rily+0.104
inations+0.102
 avail+0.102
keeping+0.100
Neg logit contributions
 a-0.200
 two-0.197
 and-0.194
-0.189
 big-0.188
Token hopped
Token ID45230
Feature activation-3.63e-03

Pos logit contributions
 a+0.033
 big+0.031
 two+0.030
 the+0.029
+0.027
Neg logit contributions
rily-0.062
ittered-0.062
immer-0.061
fer-0.060
inations-0.060
Token up
Token ID510
Feature activation-3.87e-03

Pos logit contributions
 two+0.007
 a+0.007
 big+0.007
 little+0.006
+0.006
Neg logit contributions
rily-0.011
immer-0.011
ittered-0.011
rol-0.010
eway-0.010
Token to
Token ID284
Feature activation+3.84e-03

Pos logit contributions
 two+0.003
 a+0.002
 big+0.002
 little+0.002
+0.002
Neg logit contributions
rily-0.016
ittered-0.016
immer-0.016
keeping-0.015
ge-0.015
Token him
Token ID683
Feature activation+4.58e-04

Pos logit contributions
rily+0.016
 avail+0.016
ittered+0.016
immer+0.016
leness+0.015
Neg logit contributions
 two-0.003
 big-0.003
 a-0.003
 and-0.002
-0.002
Token delicious
Token ID12625
Feature activation-5.50e-03

Pos logit contributions
+0.011
 a+0.011
 two+0.011
 and+0.010
 big+0.010
Neg logit contributions
rily-0.026
lights-0.025
ittered-0.025
immer-0.025
keeping-0.025
Token vegetables
Token ID13701
Feature activation-5.95e-03

Pos logit contributions
 a+0.011
 two+0.011
+0.011
 big+0.010
 the+0.010
Neg logit contributions
rily-0.016
lights-0.015
ittered-0.015
immer-0.015
inations-0.015
Token and
Token ID290
Feature activation+5.90e-03

Pos logit contributions
 two+0.003
 a+0.003
 big+0.003
+0.002
 the+0.002
Neg logit contributions
rily-0.026
immer-0.026
comb-0.025
ittered-0.025
inations-0.025
Token bright
Token ID6016
Feature activation+5.43e-03

Pos logit contributions
rily+0.016
lights+0.016
ittered+0.016
inations+0.016
immer+0.015
Neg logit contributions
 a-0.013
 two-0.013
-0.012
 big-0.012
 and-0.012
Token-
Token ID12
Feature activation+1.64e-02

Pos logit contributions
rily+0.018
lights+0.017
ittered+0.017
immer+0.017
inations+0.017
Neg logit contributions
 a-0.010
 two-0.009
 big-0.009
-0.009
 the-0.009
Tokencol
Token ID4033
Feature activation+5.95e-02

Pos logit contributions
rily+0.035
ittered+0.034
immer+0.034
inations+0.033
lights+0.032
Neg logit contributions
 a-0.050
 two-0.049
 big-0.048
 the-0.047
 and-0.047
Tokenoured
Token ID8167
Feature activation-1.20e-03

Pos logit contributions
immer+0.056
inations+0.055
rily+0.055
lights+0.053
ittered+0.053
Neg logit contributions
 a-0.241
 two-0.236
 and-0.233
 big-0.232
 the-0.229
Token flowers
Token ID12734
Feature activation-2.15e-03

Pos logit contributions
 a+0.003
 two+0.003
+0.003
 big+0.003
 the+0.002
Neg logit contributions
rily-0.003
lights-0.003
ittered-0.003
keeping-0.003
immer-0.003
Token.
Token ID13
Feature activation+8.89e-03

Pos logit contributions
 two+0.001
 a+0.001
 big+0.001
 little+0.001
 the+0.001
Neg logit contributions
rily-0.010
ittered-0.009
immer-0.009
lights-0.009
inations-0.009
Token
Token ID220
Feature activation-9.32e-03

Pos logit contributions
rily+0.031
immer+0.030
ittered+0.030
lights+0.030
 avail+0.029
Neg logit contributions
 a-0.014
 two-0.014
 big-0.014
 and-0.012
 the-0.012
Token
Token ID198
Feature activation+3.57e-03

Pos logit contributions
 a+0.005
 two+0.004
 big+0.004
 the+0.003
 and+0.002
Neg logit contributions
rily-0.042
immer-0.042
ittered-0.042
lights-0.041
inations-0.041
Token.
Token ID13
Feature activation+4.85e-03

Pos logit contributions
rily+0.025
ittered+0.024
immer+0.024
comb+0.024
inations+0.024
Neg logit contributions
 a-0.005
 two-0.005
 big-0.005
-0.004
 the-0.004
Token He
Token ID679
Feature activation+5.07e-03

Pos logit contributions
rily+0.016
ittered+0.016
immer+0.015
istent+0.015
lights+0.015
Neg logit contributions
 a-0.008
 two-0.008
 big-0.008
 the-0.007
 and-0.007
Token che
Token ID1125
Feature activation+5.97e-02

Pos logit contributions
rily+0.018
ittered+0.018
inations+0.018
immer+0.018
lights+0.017
Neg logit contributions
 a-0.007
 two-0.007
 big-0.007
-0.006
 the-0.006
Tokenwed
Token ID19103
Feature activation-9.27e-03

Pos logit contributions
rily+0.104
lights+0.100
 avail+0.100
arians+0.099
immer+0.097
Neg logit contributions
 a-0.203
 two-0.200
 and-0.193
 big-0.190
 the-0.189
Token and
Token ID290
Feature activation+1.05e-02

Pos logit contributions
 big+0.008
 a+0.008
 two+0.008
+0.007
 little+0.007
Neg logit contributions
rily-0.038
immer-0.036
ittered-0.036
lights-0.034
rol-0.034
Token che
Token ID1125
Feature activation+6.09e-02

Pos logit contributions
ittered+0.028
rily+0.028
immer+0.027
inations+0.027
leness+0.026
Neg logit contributions
 a-0.025
 two-0.024
 big-0.024
-0.023
 and-0.023
Tokenwed
Token ID19103
Feature activation-8.67e-03

Pos logit contributions
rily+0.104
arians+0.101
lights+0.101
 avail+0.101
immer+0.099
Neg logit contributions
 a-0.209
 two-0.203
 and-0.199
 the-0.195
 big-0.195
Token until
Token ID1566
Feature activation-4.44e-03

Pos logit contributions
 a+0.011
 two+0.011
 big+0.011
+0.009
 little+0.009
Neg logit contributions
rily-0.033
immer-0.032
ittered-0.031
lights-0.030
ently-0.030
Token it
Token ID340
Feature activation-7.16e-03

Pos logit contributions
 two+0.005
 a+0.005
 big+0.005
+0.005
 and+0.004
Neg logit contributions
rily-0.017
 avail-0.016
ittered-0.016
lights-0.016
immer-0.016
Token was
Token ID373
Feature activation+1.65e-02

Pos logit contributions
 two+0.009
 a+0.009
 big+0.008
+0.007
 the+0.007
Neg logit contributions
ittered-0.027
rily-0.027
immer-0.027
comb-0.026
lights-0.026
Token all
Token ID477
Feature activation+1.41e-02

Pos logit contributions
rily+0.063
immer+0.062
ittered+0.062
inations+0.060
lights+0.060
Neg logit contributions
 a-0.021
 two-0.021
 big-0.019
-0.019
 and-0.019
Token them
Token ID606
Feature activation-2.30e-03

Pos logit contributions
rily+0.009
keeping+0.009
immer+0.009
ittered+0.009
 avail+0.009
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
-0.002
 little-0.002
Token to
Token ID284
Feature activation+3.76e-03

Pos logit contributions
 two+0.002
 a+0.002
 big+0.002
+0.002
 the+0.002
Neg logit contributions
rily-0.009
immer-0.009
ittered-0.009
inations-0.009
keeping-0.009
Token their
Token ID511
Feature activation+7.34e-03

Pos logit contributions
rily+0.015
immer+0.015
ittered+0.014
inations+0.014
 avail+0.014
Neg logit contributions
 two-0.004
 a-0.004
 big-0.004
-0.003
 and-0.003
Token owners
Token ID4393
Feature activation+6.64e-03

Pos logit contributions
rily+0.023
ittered+0.022
immer+0.022
inations+0.022
lights+0.022
Neg logit contributions
 a-0.015
 two-0.014
 big-0.014
-0.014
 the-0.013
Token.
Token ID13
Feature activation+6.34e-03

Pos logit contributions
rily+0.029
immer+0.029
ittered+0.028
inations+0.028
keeping+0.028
Neg logit contributions
 a-0.005
 two-0.005
 big-0.004
-0.003
 the-0.003
Token Ho
Token ID9544
Feature activation+6.03e-02

Pos logit contributions
rily+0.025
ittered+0.024
immer+0.023
lor+0.023
arians+0.023
Neg logit contributions
 a-0.008
 big-0.008
 two-0.008
 the-0.007
 and-0.006
Tokenppy
Token ID14097
Feature activation-1.53e-02

Pos logit contributions
lights+0.107
rily+0.104
 avail+0.104
inations+0.101
keeping+0.099
Neg logit contributions
 a-0.199
 two-0.197
 and-0.195
-0.189
 big-0.188
Token felt
Token ID2936
Feature activation+1.12e-03

Pos logit contributions
 a+0.023
 big+0.022
 two+0.021
 the+0.020
+0.019
Neg logit contributions
rily-0.053
ittered-0.053
immer-0.051
inations-0.050
leness-0.050
Token bad
Token ID2089
Feature activation-7.11e-03

Pos logit contributions
rily+0.004
immer+0.004
ittered+0.004
leness+0.004
inations+0.004
Neg logit contributions
 two-0.002
 a-0.002
 big-0.002
 and-0.002
-0.002
Token for
Token ID329
Feature activation+7.69e-03

Pos logit contributions
 big+0.007
 two+0.007
 a+0.007
+0.006
 the+0.006
Neg logit contributions
rily-0.028
ittered-0.027
lights-0.027
ently-0.027
istent-0.026
Token taking
Token ID2263
Feature activation-1.43e-02

Pos logit contributions
rily+0.025
ittered+0.025
immer+0.024
 avail+0.024
inations+0.024
Neg logit contributions
 a-0.013
 two-0.013
 big-0.013
-0.012
 and-0.012
Token their
Token ID511
Feature activation+5.45e-04

Pos logit contributions
rily+0.071
ittered+0.071
immer+0.071
lights+0.069
inations+0.069
Neg logit contributions
 a-0.017
 two-0.017
 big-0.016
-0.015
 and-0.015
Token favorite
Token ID4004
Feature activation+6.09e-03

Pos logit contributions
ittered+0.002
rily+0.002
immer+0.002
inations+0.002
lights+0.002
Neg logit contributions
 a-0.001
 two-0.001
-0.001
 big-0.001
 the-0.001
Token place
Token ID1295
Feature activation+5.57e-03

Pos logit contributions
rily+0.019
ittered+0.019
immer+0.019
inations+0.018
lights+0.018
Neg logit contributions
 a-0.012
 two-0.011
 big-0.011
-0.011
 the-0.010
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
rily+0.024
ittered+0.024
immer+0.024
lights+0.023
inations+0.023
Neg logit contributions
 two-0.004
 a-0.004
 big-0.004
-0.003
 the-0.003
Token They
Token ID1119
Feature activation+7.17e-03

Pos logit contributions
rily+0.034
ittered+0.033
immer+0.033
lights+0.032
inations+0.032
Neg logit contributions
 a-0.024
 big-0.024
 two-0.024
 and-0.022
 the-0.022
Token gard
Token ID16985
Feature activation+5.61e-02

Pos logit contributions
ittered+0.020
rily+0.020
immer+0.019
inations+0.019
lights+0.018
Neg logit contributions
 a-0.017
 two-0.016
 big-0.016
-0.016
 the-0.015
Tokenened
Token ID2945
Feature activation+7.77e-03

Pos logit contributions
immer+0.086
inations+0.085
lights+0.083
rily+0.082
ittered+0.082
Neg logit contributions
 a-0.201
 two-0.199
 and-0.195
 the-0.191
 big-0.187
Token together
Token ID1978
Feature activation+2.50e-03

Pos logit contributions
rily+0.030
ittered+0.030
immer+0.029
lights+0.029
inations+0.029
Neg logit contributions
 a-0.009
 two-0.009
 big-0.009
-0.008
 and-0.007
Token every
Token ID790
Feature activation+8.54e-03

Pos logit contributions
ittered+0.011
rily+0.011
immer+0.011
inations+0.010
lights+0.010
Neg logit contributions
 a-0.002
 two-0.002
 big-0.002
-0.002
 the-0.001
Token day
Token ID1110
Feature activation-9.76e-03

Pos logit contributions
rily+0.030
immer+0.030
ittered+0.030
inations+0.029
lights+0.029
Neg logit contributions
 a-0.013
 two-0.013
 big-0.012
-0.012
 and-0.011
Token,
Token ID11
Feature activation+6.84e-03

Pos logit contributions
 two+0.008
 a+0.008
 big+0.007
+0.006
 the+0.005
Neg logit contributions
rily-0.041
ittered-0.040
immer-0.040
inations-0.040
lights-0.039
Token hard
Token ID1327
Feature activation+9.10e-03

Pos logit contributions
rily+0.056
ittered+0.055
immer+0.055
lights+0.054
inations+0.053
Neg logit contributions
 a-0.027
 two-0.027
 big-0.025
-0.024
 the-0.024
Token until
Token ID1566
Feature activation+1.68e-03

Pos logit contributions
rily+0.039
ittered+0.038
immer+0.038
inations+0.038
lights+0.038
Neg logit contributions
 a-0.008
 two-0.007
 big-0.007
-0.006
 the-0.006
Token the
Token ID262
Feature activation-1.15e-02

Pos logit contributions
ittered+0.006
 avail+0.006
rily+0.006
lights+0.006
immer+0.006
Neg logit contributions
 big-0.002
 two-0.002
 a-0.002
-0.002
 and-0.001
Token pond
Token ID16723
Feature activation+6.90e-03

Pos logit contributions
 a+0.018
+0.017
 and+0.017
 the+0.016
 two+0.015
Neg logit contributions
immer-0.037
rily-0.037
lights-0.037
 avail-0.037
 footing-0.036
Token was
Token ID373
Feature activation+1.23e-02

Pos logit contributions
rily+0.028
ittered+0.028
immer+0.027
lights+0.027
inations+0.026
Neg logit contributions
 a-0.008
 two-0.007
 big-0.007
-0.006
 the-0.006
Token even
Token ID772
Feature activation+2.45e-02

Pos logit contributions
immer+0.042
rily+0.042
ittered+0.042
inations+0.040
lights+0.040
Neg logit contributions
 a-0.021
 two-0.020
 big-0.019
-0.019
 and-0.019
Token smoother
Token ID32686
Feature activation+6.54e-03

Pos logit contributions
rily+0.076
ittered+0.075
immer+0.075
inations+0.073
lights+0.073
Neg logit contributions
 a-0.050
 two-0.048
 big-0.047
 and-0.045
-0.045
Token than
Token ID621
Feature activation+2.11e-02

Pos logit contributions
rily+0.027
ittered+0.027
immer+0.026
lights+0.026
inations+0.026
Neg logit contributions
 a-0.007
 two-0.006
 big-0.006
-0.006
 the-0.005
Token before
Token ID878
Feature activation+6.98e-03

Pos logit contributions
rily+0.070
ittered+0.070
immer+0.069
inations+0.068
lights+0.067
Neg logit contributions
 a-0.038
 two-0.037
 big-0.036
-0.034
 and-0.034
Token.
Token ID13
Feature activation+2.07e-03

Pos logit contributions
rily+0.030
ittered+0.029
immer+0.029
lights+0.029
inations+0.029
Neg logit contributions
 a-0.005
 two-0.005
 big-0.005
-0.004
 the-0.004
Token
Token ID220
Feature activation+1.55e-03

Pos logit contributions
ittered+0.007
rily+0.007
lights+0.007
immer+0.007
inations+0.007
Neg logit contributions
 a-0.003
 big-0.003
 two-0.003
 and-0.003
 the-0.003
Token to
Token ID284
Feature activation+1.21e-02

Pos logit contributions
 two+0.003
 a+0.002
+0.002
 big+0.002
 and+0.002
Neg logit contributions
rily-0.012
keeping-0.012
inations-0.012
epad-0.012
lights-0.012
Token play
Token ID711
Feature activation-8.90e-03

Pos logit contributions
ittered+0.041
immer+0.041
inations+0.041
rily+0.041
 avail+0.040
Neg logit contributions
 two-0.019
 a-0.019
 big-0.018
-0.017
 and-0.017
Token with
Token ID351
Feature activation+2.57e-03

Pos logit contributions
 two+0.012
 a+0.011
 big+0.011
+0.010
 the+0.009
Neg logit contributions
rily-0.032
ittered-0.031
immer-0.031
inations-0.030
fer-0.030
Token her
Token ID607
Feature activation-5.64e-03

Pos logit contributions
rily+0.010
lights+0.009
ittered+0.009
 avail+0.009
inations+0.009
Neg logit contributions
 two-0.003
-0.003
 a-0.003
 and-0.002
 big-0.002
Token toys
Token ID14958
Feature activation-9.98e-03

Pos logit contributions
 a+0.009
 two+0.008
+0.008
 the+0.008
 and+0.008
Neg logit contributions
rily-0.018
lights-0.018
ittered-0.018
inations-0.017
arians-0.017
Token and
Token ID290
Feature activation+1.92e-02

Pos logit contributions
 two+0.007
 big+0.007
 a+0.006
+0.006
 little+0.005
Neg logit contributions
rily-0.040
immer-0.039
ittered-0.039
lights-0.039
arians-0.039
Token spend
Token ID4341
Feature activation-5.86e-03

Pos logit contributions
rily+0.065
ittered+0.064
inations+0.064
immer+0.064
 avail+0.063
Neg logit contributions
 a-0.032
 two-0.031
 big-0.029
-0.028
 and-0.028
Token time
Token ID640
Feature activation-1.81e-03

Pos logit contributions
 two+0.007
 a+0.007
 big+0.007
 next+0.006
+0.006
Neg logit contributions
ittered-0.021
inations-0.021
lights-0.021
rily-0.021
leness-0.020
Token with
Token ID351
Feature activation+7.64e-03

Pos logit contributions
 a+0.002
 two+0.002
 big+0.002
 the+0.001
+0.001
Neg logit contributions
ittered-0.007
rily-0.007
inations-0.007
immer-0.007
epad-0.007
Token her
Token ID607
Feature activation-1.66e-03

Pos logit contributions
rily+0.030
lights+0.030
ittered+0.029
 avail+0.029
inations+0.029
Neg logit contributions
 two-0.007
 a-0.007
 and-0.007
-0.007
 big-0.007
Token family
Token ID1641
Feature activation-9.17e-03

Pos logit contributions
 a+0.003
 two+0.002
+0.002
 and+0.002
 the+0.002
Neg logit contributions
lights-0.006
ittered-0.005
inations-0.005
rily-0.005
arians-0.005
Token,"
Token ID553
Feature activation-1.44e-03

Pos logit contributions
rily+0.042
ittered+0.042
immer+0.041
inations+0.041
lights+0.040
Neg logit contributions
 a-0.025
 two-0.025
 big-0.024
 the-0.023
-0.022
Token said
Token ID531
Feature activation+1.45e-02

Pos logit contributions
 big+0.001
 a+0.001
 two+0.001
 and+0.001
+0.001
Neg logit contributions
rily-0.006
 avail-0.006
ittered-0.006
leness-0.006
immer-0.006
Token the
Token ID262
Feature activation-4.59e-03

Pos logit contributions
rily+0.068
ittered+0.067
immer+0.066
 avail+0.066
fer+0.065
Neg logit contributions
 a-0.005
 two-0.005
 big-0.005
 and-0.004
-0.003
Token duck
Token ID22045
Feature activation+5.93e-03

Pos logit contributions
 a+0.010
 and+0.009
 the+0.009
+0.009
 two+0.009
Neg logit contributions
rily-0.013
immer-0.013
ittered-0.012
fer-0.012
lights-0.012
Token,
Token ID11
Feature activation+1.37e-02

Pos logit contributions
rily+0.026
immer+0.026
ittered+0.026
inations+0.025
lights+0.025
Neg logit contributions
 a-0.004
 two-0.004
 big-0.004
 the-0.003
-0.003
Token "
Token ID366
Feature activation+2.00e-02

Pos logit contributions
rily+0.042
ittered+0.041
immer+0.040
inations+0.039
fer+0.038
Neg logit contributions
 a-0.028
 big-0.028
 two-0.028
 the-0.025
-0.025
Tokenand
Token ID392
Feature activation+1.59e-02

Pos logit contributions
rily+0.059
ittered+0.058
immer+0.057
inations+0.056
lights+0.055
Neg logit contributions
 a-0.046
 big-0.044
 two-0.044
 the-0.042
 and-0.041
Token look
Token ID804
Feature activation-3.94e-03

Pos logit contributions
ittered+0.060
rily+0.060
immer+0.059
inations+0.057
leness+0.057
Neg logit contributions
 a-0.022
 two-0.021
 big-0.021
-0.019
 and-0.018
Token at
Token ID379
Feature activation-2.68e-04

Pos logit contributions
 two+0.004
 a+0.004
 big+0.003
+0.003
 the+0.003
Neg logit contributions
rily-0.016
immer-0.016
ittered-0.016
inations-0.015
lights-0.015
Token this
Token ID428
Feature activation+1.43e-02

Pos logit contributions
 two+0.000
 a+0.000
 big+0.000
+0.000
 and+0.000
Neg logit contributions
rily-0.001
ittered-0.001
immer-0.001
lights-0.001
 avail-0.001
Token beautiful
Token ID4950
Feature activation+1.11e-02

Pos logit contributions
rily+0.046
immer+0.045
ittered+0.045
inations+0.044
lights+0.044
Neg logit contributions
 a-0.028
 two-0.027
 big-0.026
-0.025
 the-0.025
Token went
Token ID1816
Feature activation-8.01e-03

Pos logit contributions
ittered+0.001
immer+0.001
rily+0.001
inations+0.001
lights+0.001
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
-0.001
 and-0.001
Token searching
Token ID10342
Feature activation+1.32e-02

Pos logit contributions
 a+0.010
 two+0.010
 big+0.009
 little+0.009
+0.008
Neg logit contributions
ittered-0.029
immer-0.029
rily-0.029
inations-0.028
keeping-0.028
Token together
Token ID1978
Feature activation+6.12e-03

Pos logit contributions
rily+0.054
ittered+0.053
immer+0.052
inations+0.051
lights+0.051
Neg logit contributions
 two-0.014
 a-0.014
 big-0.012
-0.011
 the-0.010
Token.
Token ID13
Feature activation+7.79e-03

Pos logit contributions
rily+0.027
ittered+0.027
immer+0.027
inations+0.026
lights+0.026
Neg logit contributions
 a-0.004
 two-0.004
 big-0.003
-0.003
 little-0.002
Token
Token ID198
Feature activation+2.02e-02

Pos logit contributions
rily+0.032
ittered+0.032
immer+0.031
lights+0.030
inations+0.030
Neg logit contributions
 a-0.008
 big-0.007
 two-0.007
 and-0.006
 the-0.006
Token
Token ID198
Feature activation+2.48e-02

Pos logit contributions
rily+0.096
ittered+0.096
immer+0.095
inations+0.095
lights+0.093
Neg logit contributions
 a-0.007
 two-0.006
 big-0.005
 the-0.003
 and-0.003
TokenThey
Token ID2990
Feature activation+5.05e-03

Pos logit contributions
rily+0.080
ittered+0.079
immer+0.079
inations+0.078
lights+0.076
Neg logit contributions
 a-0.049
 two-0.046
 big-0.046
 the-0.044
 and-0.043
Token searched
Token ID16499
Feature activation+2.01e-02

Pos logit contributions
ittered+0.017
immer+0.016
rily+0.016
inations+0.016
comb+0.015
Neg logit contributions
 a-0.009
 two-0.009
 big-0.009
-0.009
 and-0.008
Token and
Token ID290
Feature activation+1.46e-02

Pos logit contributions
rily+0.083
ittered+0.083
immer+0.082
inations+0.080
lights+0.080
Neg logit contributions
 a-0.020
 two-0.019
 big-0.017
-0.016
 the-0.015
Token searched
Token ID16499
Feature activation+1.71e-02

Pos logit contributions
ittered+0.034
rily+0.033
immer+0.033
inations+0.032
lights+0.031
Neg logit contributions
 a-0.041
 two-0.040
 big-0.040
 and-0.038
-0.038
Token,
Token ID11
Feature activation+2.13e-02

Pos logit contributions
rily+0.068
ittered+0.068
immer+0.067
inations+0.066
lights+0.066
Neg logit contributions
 a-0.019
 two-0.018
 big-0.017
-0.015
 the-0.015
Token if
Token ID611
Feature activation-2.02e-02

Pos logit contributions
rily+0.010
ittered+0.010
inations+0.010
lights+0.010
immer+0.010
Neg logit contributions
 a-0.005
 big-0.004
 two-0.004
-0.004
 and-0.004
Token they
Token ID484
Feature activation-6.16e-03

Pos logit contributions
 big+0.031
 a+0.031
 and+0.030
+0.029
 two+0.029
Neg logit contributions
 avail-0.065
inations-0.064
lights-0.063
rily-0.063
fer-0.062
Token didn
Token ID1422
Feature activation-1.10e-02

Pos logit contributions
 a+0.012
 big+0.012
 two+0.012
+0.011
 the+0.011
Neg logit contributions
ittered-0.018
inations-0.018
istent-0.018
rily-0.018
immer-0.018
Tokenâ
Token ID22940
Feature activation+2.00e-02

Pos logit contributions
 a+0.025
 two+0.024
 big+0.024
 little+0.023
 and+0.023
Neg logit contributions
rily-0.028
immer-0.028
istent-0.028
ittered-0.027
inations-0.027
Token€
Token ID26391
Feature activation+2.44e-02

Pos logit contributions
ittered+0.039
immer+0.039
rily+0.038
inations+0.036
leness+0.035
Neg logit contributions
 a-0.065
 big-0.064
 two-0.063
 the-0.061
 and-0.061
Token™
Token ID8151
Feature activation+1.77e-02

Pos logit contributions
ittered+0.030
rily+0.030
immer+0.029
inations+0.028
lights+0.027
Neg logit contributions
 a-0.095
 two-0.094
 big-0.093
 the-0.090
 and-0.090
Tokent
Token ID83
Feature activation-1.32e-02

Pos logit contributions
rily+0.044
ittered+0.044
immer+0.044
inations+0.042
lights+0.041
Neg logit contributions
 a-0.047
 two-0.046
 big-0.045
 the-0.043
 and-0.043
Token start
Token ID923
Feature activation+4.19e-03

Pos logit contributions
 a+0.024
 two+0.024
 big+0.023
+0.022
 the+0.021
Neg logit contributions
immer-0.041
inations-0.040
ge-0.040
epad-0.040
istent-0.039
Token listening
Token ID8680
Feature activation-3.81e-03

Pos logit contributions
ittered+0.016
rily+0.016
inations+0.015
immer+0.015
lights+0.015
Neg logit contributions
 a-0.005
 big-0.005
 two-0.005
-0.004
 the-0.004
Token to
Token ID284
Feature activation-1.08e-02

Pos logit contributions
 two+0.002
 a+0.002
 big+0.002
+0.002
 the+0.002
Neg logit contributions
rily-0.016
ittered-0.016
immer-0.016
lights-0.016
inations-0.016
Token her
Token ID607
Feature activation+1.11e-02

Pos logit contributions
 two+0.012
 a+0.012
 big+0.012
+0.011
 and+0.010
Neg logit contributions
rily-0.040
inations-0.039
ittered-0.039
 avail-0.039
immer-0.039
Token into
Token ID656
Feature activation-8.87e-03

Pos logit contributions
rily+0.041
immer+0.040
ittered+0.040
lights+0.039
comb+0.039
Neg logit contributions
 a-0.006
 two-0.006
 big-0.005
-0.005
 the-0.004
Token the
Token ID262
Feature activation+5.97e-03

Pos logit contributions
 big+0.005
 a+0.004
+0.004
 two+0.004
 and+0.004
Neg logit contributions
rily-0.038
 avail-0.037
ittered-0.037
keeping-0.037
immer-0.037
Token mailbox
Token ID37282
Feature activation+4.60e-03

Pos logit contributions
rily+0.017
immer+0.017
ittered+0.017
lights+0.017
 avail+0.017
Neg logit contributions
 a-0.013
 two-0.012
-0.012
 big-0.012
 and-0.012
Token,
Token ID11
Feature activation-3.97e-03

Pos logit contributions
rily+0.020
immer+0.020
ittered+0.020
lights+0.019
inations+0.019
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
-0.002
 the-0.002
Token she
Token ID673
Feature activation+1.61e-02

Pos logit contributions
 big+0.004
 two+0.004
 a+0.004
+0.004
 and+0.004
Neg logit contributions
rily-0.015
ittered-0.015
 avail-0.014
immer-0.014
lights-0.014
Token felt
Token ID2936
Feature activation+1.09e-02

Pos logit contributions
rily+0.055
ittered+0.054
immer+0.054
inations+0.053
lights+0.052
Neg logit contributions
 a-0.028
 two-0.027
 big-0.026
-0.025
 the-0.025
Token happy
Token ID3772
Feature activation+3.59e-03

Pos logit contributions
rily+0.044
ittered+0.044
immer+0.044
inations+0.042
leness+0.042
Neg logit contributions
 two-0.011
 a-0.010
 big-0.010
-0.009
 and-0.009
Token that
Token ID326
Feature activation-3.07e-04

Pos logit contributions
rily+0.015
ittered+0.015
immer+0.015
lights+0.015
inations+0.014
Neg logit contributions
 two-0.003
 big-0.003
 a-0.003
-0.002
 the-0.002
Token she
Token ID673
Feature activation+1.34e-02

Pos logit contributions
 two+0.000
 big+0.000
 a+0.000
+0.000
 and+0.000
Neg logit contributions
ittered-0.001
rily-0.001
 avail-0.001
immer-0.001
epad-0.001
Token did
Token ID750
Feature activation+4.47e-03

Pos logit contributions
ittered+0.048
rily+0.047
inations+0.046
immer+0.046
fer+0.045
Neg logit contributions
 a-0.021
 two-0.021
 big-0.020
-0.019
 the-0.018
Token the
Token ID262
Feature activation+1.69e-03

Pos logit contributions
ittered+0.017
rily+0.017
immer+0.017
inations+0.017
epad+0.017
Neg logit contributions
 two-0.005
 a-0.005
 big-0.005
-0.004
 and-0.004
Token friendly
Token ID8030
Feature activation+6.79e-03

Pos logit contributions
ittered+0.015
rily+0.015
inations+0.014
lights+0.014
immer+0.014
Neg logit contributions
 two-0.007
 a-0.007
 big-0.007
-0.007
 and-0.007
Token.
Token ID13
Feature activation+3.73e-03

Pos logit contributions
ittered+0.030
rily+0.030
lights+0.029
immer+0.029
keeping+0.029
Neg logit contributions
 two-0.004
 a-0.004
 big-0.004
-0.003
 the-0.003
Token They
Token ID1119
Feature activation+2.55e-03

Pos logit contributions
rily+0.014
lights+0.014
ittered+0.014
immer+0.014
inations+0.014
Neg logit contributions
 a-0.004
 big-0.004
 two-0.004
 and-0.003
 the-0.003
Token hugged
Token ID41130
Feature activation+2.01e-03

Pos logit contributions
ittered+0.008
rily+0.008
immer+0.008
inations+0.008
lights+0.007
Neg logit contributions
 a-0.005
 two-0.005
 big-0.005
-0.004
 the-0.004
Token it
Token ID340
Feature activation-1.22e-03

Pos logit contributions
rily+0.008
lights+0.008
ittered+0.007
arians+0.007
keeping+0.007
Neg logit contributions
 two-0.002
 big-0.002
 a-0.002
-0.002
 next-0.002
Token and
Token ID290
Feature activation+1.18e-02

Pos logit contributions
 two+0.001
 a+0.001
 big+0.000
+0.000
 the+0.000
Neg logit contributions
rily-0.005
ittered-0.005
lights-0.005
immer-0.005
inations-0.005
Token talked
Token ID6619
Feature activation-2.83e-04

Pos logit contributions
ittered+0.039
rily+0.039
lights+0.038
immer+0.038
inations+0.038
Neg logit contributions
 a-0.019
 two-0.019
 big-0.019
-0.018
 and-0.017
Token to
Token ID284
Feature activation+2.04e-03

Pos logit contributions
 two+0.000
 big+0.000
 a+0.000
+0.000
 little+0.000
Neg logit contributions
rily-0.001
ittered-0.001
immer-0.001
lights-0.001
epad-0.001
Token it
Token ID340
Feature activation-3.07e-03

Pos logit contributions
rily+0.007
 avail+0.007
ittered+0.007
lights+0.007
arians+0.007
Neg logit contributions
 two-0.003
 big-0.002
 a-0.002
 and-0.002
-0.002
Token and
Token ID290
Feature activation+9.30e-03

Pos logit contributions
 two+0.003
 a+0.003
 big+0.003
 the+0.002
+0.002
Neg logit contributions
rily-0.012
ittered-0.012
immer-0.012
lights-0.012
inations-0.012
Token made
Token ID925
Feature activation-2.00e-03

Pos logit contributions
ittered+0.030
rily+0.030
inations+0.029
keeping+0.029
immer+0.029
Neg logit contributions
 a-0.016
 two-0.016
 big-0.016
-0.015
 and-0.015
Token was
Token ID373
Feature activation+9.35e-04

Pos logit contributions
ittered+0.001
rily+0.001
immer+0.001
lights+0.001
inations+0.001
Neg logit contributions
 a-0.000
 two-0.000
 big-0.000
-0.000
 the-0.000
Token important
Token ID1593
Feature activation+5.52e-03

Pos logit contributions
immer+0.003
rily+0.003
ittered+0.003
 avail+0.003
inations+0.003
Neg logit contributions
 two-0.001
 a-0.001
 big-0.001
-0.001
 and-0.001
Token.
Token ID13
Feature activation+4.33e-03

Pos logit contributions
rily+0.023
ittered+0.023
lights+0.023
keeping+0.023
inations+0.023
Neg logit contributions
 two-0.004
 a-0.004
 big-0.004
-0.003
 the-0.003
Token Mary
Token ID5335
Feature activation-1.54e-02

Pos logit contributions
rily+0.016
immer+0.015
ittered+0.015
lights+0.015
inations+0.015
Neg logit contributions
 a-0.006
 big-0.006
 two-0.006
 the-0.005
 and-0.005
Token didn
Token ID1422
Feature activation-5.99e-03

Pos logit contributions
 a+0.023
 big+0.022
 two+0.022
+0.020
 the+0.019
Neg logit contributions
rily-0.054
ittered-0.054
immer-0.052
inations-0.051
istent-0.051
Token't
Token ID470
Feature activation-3.80e-03

Pos logit contributions
 a+0.013
 two+0.013
 big+0.013
 little+0.012
 the+0.012
Neg logit contributions
rily-0.016
immer-0.016
istent-0.016
ittered-0.016
inations-0.015
Token like
Token ID588
Feature activation+1.79e-03

Pos logit contributions
 a+0.008
 two+0.008
 big+0.008
+0.008
 the+0.008
Neg logit contributions
immer-0.011
istent-0.010
inations-0.010
leness-0.010
epad-0.010
Token the
Token ID262
Feature activation+9.96e-03

Pos logit contributions
rily+0.007
inations+0.007
ittered+0.007
keeping+0.007
lights+0.007
Neg logit contributions
 two-0.002
 a-0.002
 big-0.002
-0.002
 and-0.002
Token gym
Token ID11550
Feature activation-3.26e-03

Pos logit contributions
rily+0.030
immer+0.030
ittered+0.030
inations+0.029
 avail+0.029
Neg logit contributions
 a-0.020
 two-0.019
-0.019
 big-0.018
 the-0.018
Token,
Token ID11
Feature activation+5.32e-03

Pos logit contributions
 a+0.004
 two+0.004
 big+0.003
+0.003
 the+0.003
Neg logit contributions
ittered-0.013
rily-0.013
immer-0.013
lights-0.013
inations-0.012
Token it
Token ID340
Feature activation-1.19e-02

Pos logit contributions
rily+0.020
ittered+0.019
lights+0.019
immer+0.019
inations+0.019
Neg logit contributions
 a-0.007
 two-0.007
 big-0.007
-0.006
 the-0.006
Token and
Token ID290
Feature activation+1.91e-02

Pos logit contributions
rily+0.045
ittered+0.044
immer+0.044
lights+0.043
inations+0.043
Neg logit contributions
 a-0.024
 two-0.024
 big-0.022
-0.022
 the-0.021
Token more
Token ID517
Feature activation+1.22e-02

Pos logit contributions
ittered+0.072
rily+0.071
immer+0.071
inations+0.070
lights+0.070
Neg logit contributions
 a-0.025
 two-0.024
 big-0.024
-0.022
 and-0.022
Token and
Token ID290
Feature activation+1.02e-02

Pos logit contributions
rily+0.037
ittered+0.037
immer+0.036
inations+0.036
lights+0.036
Neg logit contributions
 a-0.025
 two-0.025
 big-0.023
-0.023
 the-0.023
Token spoiled
Token ID34781
Feature activation+1.11e-02

Pos logit contributions
rily+0.038
ittered+0.038
lights+0.038
immer+0.037
inations+0.037
Neg logit contributions
 a-0.013
 two-0.012
 big-0.012
-0.011
 and-0.011
Token their
Token ID511
Feature activation+5.18e-03

Pos logit contributions
rily+0.048
ittered+0.047
lights+0.046
keeping+0.046
immer+0.046
Neg logit contributions
 a-0.009
 big-0.009
 two-0.009
-0.008
 and-0.006
Token sandbox
Token ID35204
Feature activation+6.03e-03

Pos logit contributions
ittered+0.015
rily+0.015
inations+0.015
lights+0.015
immer+0.015
Neg logit contributions
 a-0.011
 two-0.011
-0.010
 big-0.010
 the-0.010
Token.
Token ID13
Feature activation+7.42e-03

Pos logit contributions
rily+0.027
ittered+0.027
immer+0.027
lights+0.027
keeping+0.026
Neg logit contributions
 a-0.003
 two-0.003
 big-0.003
-0.003
 the-0.002
Token
Token ID198
Feature activation+4.09e-03

Pos logit contributions
rily+0.032
ittered+0.031
immer+0.031
lights+0.031
inations+0.031
Neg logit contributions
 a-0.005
 big-0.005
 two-0.005
 and-0.004
 the-0.003
Token
Token ID198
Feature activation+3.27e-03

Pos logit contributions
rily+0.018
ittered+0.018
immer+0.018
inations+0.018
lights+0.018
Neg logit contributions
 a-0.002
 two-0.002
 big-0.002
 and-0.001
 the-0.001
Token"
Token ID1
Feature activation+1.96e-02

Pos logit contributions
immer+0.011
inations+0.011
ittered+0.010
rily+0.010
leness+0.010
Neg logit contributions
 a-0.006
 big-0.005
 two-0.005
 the-0.005
 and-0.005
TokenM
Token ID44
Feature activation+1.23e-02

Pos logit contributions
rily+0.053
ittered+0.052
immer+0.052
inations+0.050
lights+0.050
Neg logit contributions
 a-0.048
 big-0.046
 two-0.045
 and-0.045
 the-0.044
TokenOnce
Token ID7454
Feature activation-3.59e-03

Pos logit contributions
 two+0.009
 a+0.009
 big+0.009
 little+0.007
 and+0.005
Neg logit contributions
ittered-0.070
rily-0.070
immer-0.069
keeping-0.069
lor-0.068
Token there
Token ID612
Feature activation-6.41e-03

Pos logit contributions
 a+0.008
 two+0.008
 big+0.007
 and+0.007
 the+0.007
Neg logit contributions
ittered-0.010
rily-0.010
immer-0.010
lights-0.010
inations-0.009
Token was
Token ID373
Feature activation-6.37e-03

Pos logit contributions
 two+0.006
 a+0.005
 big+0.005
 and+0.004
 next+0.004
Neg logit contributions
ittered-0.026
rily-0.026
immer-0.025
 avail-0.025
lights-0.024
Token a
Token ID257
Feature activation-2.15e-02

Pos logit contributions
 two+0.001
 big+0.001
 a+0.000
 and+0.000
 the+0.000
Neg logit contributions
rily-0.029
ittered-0.029
immer-0.029
inations-0.029
 avail-0.028
Token girl
Token ID2576
Feature activation-7.23e-03

Pos logit contributions
 a+0.018
 two+0.017
 the+0.014
+0.012
 and+0.012
Neg logit contributions
rily-0.084
fer-0.083
immer-0.082
inations-0.081
comb-0.081
Token called
Token ID1444
Feature activation+1.09e-02

Pos logit contributions
 two+0.017
 a+0.017
 big+0.016
 the+0.015
+0.015
Neg logit contributions
immer-0.018
ittered-0.018
rily-0.018
inations-0.017
lights-0.017
Token L
Token ID406
Feature activation+2.87e-02

Pos logit contributions
ittered+0.033
rily+0.033
immer+0.033
keeping+0.032
inations+0.032
Neg logit contributions
 a-0.021
 two-0.021
 big-0.021
 and-0.020
-0.019
Tokenola
Token ID5708
Feature activation+4.85e-03

Pos logit contributions
rily+0.065
ittered+0.065
immer+0.064
inations+0.062
lights+0.060
Neg logit contributions
 a-0.084
 two-0.082
 big-0.081
 the-0.079
 and-0.078
Token.
Token ID13
Feature activation-8.06e-03

Pos logit contributions
rily+0.021
ittered+0.021
immer+0.021
inations+0.020
lights+0.020
Neg logit contributions
 two-0.003
 a-0.003
 big-0.003
 the-0.002
-0.002
Token L
Token ID406
Feature activation+2.44e-02

Pos logit contributions
 a+0.013
 two+0.013
 big+0.012
 the+0.011
 and+0.011
Neg logit contributions
ittered-0.027
lights-0.027
immer-0.026
rily-0.026
rol-0.026
Tokenola
Token ID5708
Feature activation-2.01e-02

Pos logit contributions
ittered+0.039
rily+0.039
immer+0.038
inations+0.037
lights+0.035
Neg logit contributions
 a-0.087
 two-0.086
 big-0.085
 the-0.082
 and-0.082
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
 two+0.005
 big+0.004
 a+0.004
 little+0.003
 the+0.003
Neg logit contributions
ittered-0.023
immer-0.023
rily-0.023
ge-0.023
keeping-0.023
Token One
Token ID1881
Feature activation-1.65e-02

Pos logit contributions
ittered+0.057
rily+0.057
immer+0.056
lights+0.055
inations+0.054
Neg logit contributions
 a-0.022
 two-0.021
 big-0.020
 the-0.018
 and-0.018
Token day
Token ID1110
Feature activation+1.82e-03

Pos logit contributions
 two+0.027
 a+0.024
 the+0.022
 big+0.022
+0.021
Neg logit contributions
rated-0.051
immer-0.051
comb-0.050
adows-0.050
ittered-0.050
Token,
Token ID11
Feature activation-1.04e-03

Pos logit contributions
ittered+0.008
immer+0.007
rily+0.007
inations+0.007
leness+0.007
Neg logit contributions
 two-0.001
 a-0.001
 big-0.001
 and-0.001
 the-0.001
Token Lily
Token ID20037
Feature activation-6.07e-03

Pos logit contributions
 two+0.002
 big+0.001
 a+0.001
 and+0.001
 right+0.001
Neg logit contributions
rily-0.003
ittered-0.003
immer-0.003
lights-0.003
 avail-0.003
Token's
Token ID338
Feature activation-1.76e-02

Pos logit contributions
 two+0.006
 a+0.006
 big+0.005
 the+0.005
+0.004
Neg logit contributions
rily-0.023
ittered-0.023
keeping-0.023
immer-0.023
rated-0.023
Token mom
Token ID1995
Feature activation-1.23e-02

Pos logit contributions
 the+0.021
 a+0.020
 two+0.019
+0.018
 and+0.018
Neg logit contributions
comb-0.060
rily-0.059
 footing-0.057
immer-0.057
ittered-0.057
Token brought
Token ID3181
Feature activation-2.18e-02

Pos logit contributions
 a+0.023
 big+0.022
 two+0.022
 the+0.020
+0.019
Neg logit contributions
immer-0.035
inations-0.034
pp-0.034
ittered-0.034
rily-0.034
Token home
Token ID1363
Feature activation-1.24e-02

Pos logit contributions
 two+0.021
 big+0.021
 be+0.019
+0.018
 little+0.017
Neg logit contributions
rily-0.081
keeping-0.078
uggets-0.076
immer-0.075
ittered-0.075
Token a
Token ID257
Feature activation+1.97e-02

Pos logit contributions
 two+0.004
 big+0.004
+0.003
 a+0.002
 little+0.002
Neg logit contributions
immer-0.054
ittered-0.054
rily-0.054
inations-0.053
lights-0.053
Token newspaper
Token ID7533
Feature activation+8.21e-03

Pos logit contributions
rily+0.069
immer+0.068
ittered+0.068
inations+0.067
 avail+0.065
Neg logit contributions
 a-0.032
 two-0.031
 big-0.028
 the-0.028
 and-0.028
Token upon
Token ID2402
Feature activation-2.43e-02

Pos logit contributions
 a+0.017
 two+0.017
 big+0.016
 and+0.016
 the+0.015
Neg logit contributions
ittered-0.019
rily-0.019
immer-0.019
lights-0.018
inations-0.018
Token a
Token ID257
Feature activation-2.71e-03

Pos logit contributions
 two+0.006
 big+0.005
 and+0.004
 next+0.003
 the+0.003
Neg logit contributions
rily-0.105
ittered-0.104
inations-0.104
immer-0.104
epad-0.103
Token time
Token ID640
Feature activation-7.00e-03

Pos logit contributions
 a+0.004
 two+0.003
 the+0.003
 big+0.003
 and+0.003
Neg logit contributions
immer-0.010
ittered-0.010
inations-0.010
rily-0.010
istent-0.010
Token there
Token ID612
Feature activation-9.76e-03

Pos logit contributions
 a+0.006
 two+0.006
 big+0.005
 the+0.004
 and+0.004
Neg logit contributions
ittered-0.029
immer-0.029
rily-0.028
inations-0.027
lights-0.027
Token was
Token ID373
Feature activation-4.69e-03

Pos logit contributions
 two+0.009
 a+0.009
 big+0.008
 and+0.006
 next+0.006
Neg logit contributions
ittered-0.039
rily-0.038
immer-0.038
lights-0.037
ge-0.036
Token a
Token ID257
Feature activation-1.82e-02

Pos logit contributions
 big+0.001
 two+0.001
 and+0.000
 the+0.000
 a+0.000
Neg logit contributions
rily-0.022
ittered-0.021
immer-0.021
inations-0.021
 avail-0.020
Token judge
Token ID5052
Feature activation+7.72e-04

Pos logit contributions
 a+0.015
 two+0.014
 the+0.011
+0.010
 and+0.010
Neg logit contributions
rily-0.073
immer-0.072
fer-0.071
inations-0.070
maid-0.070
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
rily+0.003
ittered+0.003
immer+0.003
fer+0.003
lights+0.003
Neg logit contributions
 two-0.001
 big-0.001
 a-0.001
 the-0.001
-0.001
Token She
Token ID1375
Feature activation+1.44e-03

Pos logit contributions
rily+0.056
ittered+0.055
immer+0.055
 avail+0.053
epad+0.053
Neg logit contributions
 a-0.022
 two-0.021
 big-0.021
 and-0.019
 the-0.018
Token was
Token ID373
Feature activation-9.95e-03

Pos logit contributions
ittered+0.005
rily+0.005
immer+0.005
inations+0.005
fer+0.004
Neg logit contributions
 two-0.002
 a-0.002
 big-0.002
 the-0.002
-0.002
Token very
Token ID845
Feature activation-1.52e-02

Pos logit contributions
+0.009
 two+0.009
 and+0.008
 a+0.008
.+0.007
Neg logit contributions
rily-0.038
ittered-0.038
immer-0.037
epad-0.037
chard-0.036
Token Fisher
Token ID14388
Feature activation-6.24e-04

Pos logit contributions
rily+0.019
immer+0.019
ittered+0.019
fer+0.018
 avail+0.018
Neg logit contributions
 a-0.024
 big-0.023
 two-0.022
 the-0.022
 and-0.022
Token.
Token ID13
Feature activation-1.11e-03

Pos logit contributions
 a+0.002
 big+0.002
 two+0.002
 the+0.002
+0.002
Neg logit contributions
rily-0.002
ittered-0.002
immer-0.002
comb-0.001
inations-0.001
Token
Token ID198
Feature activation-4.61e-03

Pos logit contributions
 a+0.001
 big+0.001
 two+0.001
 the+0.001
 and+0.001
Neg logit contributions
rily-0.004
immer-0.004
ittered-0.004
fer-0.004
istent-0.004
Token
Token ID198
Feature activation+8.59e-04

Pos logit contributions
 a+0.003
 two+0.003
 big+0.003
 the+0.002
 and+0.002
Neg logit contributions
rily-0.020
immer-0.019
inations-0.019
ittered-0.019
lights-0.019
TokenJ
Token ID41
Feature activation+1.63e-02

Pos logit contributions
rily+0.003
immer+0.003
inations+0.003
fer+0.003
leness+0.003
Neg logit contributions
 a-0.002
 big-0.001
 two-0.001
 the-0.001
 and-0.001
Tokenenny
Token ID11870
Feature activation-1.78e-02

Pos logit contributions
rily+0.021
ittered+0.020
immer+0.020
inations+0.019
lights+0.018
Neg logit contributions
 a-0.064
 two-0.062
 big-0.062
 the-0.061
-0.060
Token smiled
Token ID13541
Feature activation-2.48e-02

Pos logit contributions
 a+0.037
 big+0.036
 two+0.035
+0.034
 the+0.033
Neg logit contributions
ittered-0.051
rily-0.051
immer-0.049
fer-0.048
comb-0.048
Token and
Token ID290
Feature activation-6.03e-03

Pos logit contributions
 two+0.020
 big+0.018
 a+0.017
 the+0.015
+0.014
Neg logit contributions
rily-0.102
immer-0.097
ittered-0.095
lights-0.093
rier-0.092
Token went
Token ID1816
Feature activation-6.18e-03

Pos logit contributions
 two+0.012
 big+0.012
 a+0.012
+0.011
 and+0.011
Neg logit contributions
rily-0.017
ittered-0.017
fer-0.016
immer-0.016
comb-0.016
Token home
Token ID1363
Feature activation-8.03e-03

Pos logit contributions
 a+0.008
 two+0.008
 big+0.007
 little+0.007
+0.007
Neg logit contributions
rily-0.022
immer-0.022
ittered-0.022
keeping-0.022
epad-0.021
Token feeling
Token ID4203
Feature activation+1.17e-03

Pos logit contributions
 two+0.008
 a+0.007
 big+0.007
+0.005
 the+0.005
Neg logit contributions
rily-0.032
ge-0.032
ittered-0.032
immer-0.031
keeping-0.031
Token
Token ID198
Feature activation+3.57e-03

Pos logit contributions
rily+0.014
immer+0.014
ittered+0.014
inations+0.014
lights+0.013
Neg logit contributions
 a-0.002
 two-0.001
 big-0.001
 and-0.001
 the-0.001
TokenTom
Token ID13787
Feature activation-6.95e-03

Pos logit contributions
immer+0.011
rily+0.011
inations+0.011
ittered+0.011
fer+0.011
Neg logit contributions
 a-0.007
 big-0.006
 two-0.006
 the-0.006
 and-0.006
Token thought
Token ID1807
Feature activation+1.73e-02

Pos logit contributions
 a+0.012
 big+0.011
 two+0.011
 the+0.010
+0.010
Neg logit contributions
ittered-0.023
immer-0.023
rily-0.023
istent-0.022
inations-0.022
Token for
Token ID329
Feature activation+8.56e-03

Pos logit contributions
rily+0.067
ittered+0.067
immer+0.066
inations+0.065
lights+0.065
Neg logit contributions
 a-0.021
 two-0.021
 big-0.020
-0.018
 and-0.017
Token a
Token ID257
Feature activation+2.16e-02

Pos logit contributions
immer+0.041
ittered+0.041
rily+0.040
inations+0.040
lights+0.040
Neg logit contributions
 big-0.001
 two-0.001
-0.000
 a-0.000
 and-0.000
Token moment
Token ID2589
Feature activation-2.22e-02

Pos logit contributions
immer+0.056
ittered+0.056
rily+0.056
inations+0.055
lights+0.052
Neg logit contributions
 a-0.054
 two-0.053
 big-0.051
 the-0.050
 and-0.050
Token.
Token ID13
Feature activation-4.04e-03

Pos logit contributions
 big+0.017
 two+0.016
 a+0.013
 little+0.011
+0.010
Neg logit contributions
immer-0.093
rily-0.091
ittered-0.091
comb-0.090
ge-0.089
Token Then
Token ID3244
Feature activation-6.92e-03

Pos logit contributions
 a+0.006
 big+0.006
 two+0.006
 the+0.005
 and+0.005
Neg logit contributions
rily-0.014
lights-0.014
immer-0.013
ittered-0.013
arians-0.013
Token he
Token ID339
Feature activation+3.93e-04

Pos logit contributions
 big+0.011
 a+0.011
 two+0.011
+0.010
 and+0.010
Neg logit contributions
 avail-0.022
rily-0.022
fer-0.022
ittered-0.022
immer-0.021
Token said
Token ID531
Feature activation+7.20e-03

Pos logit contributions
ittered+0.001
immer+0.001
rily+0.001
inations+0.001
lights+0.001
Neg logit contributions
 a-0.001
 two-0.001
 big-0.001
 the-0.001
-0.001
Token,
Token ID11
Feature activation-7.99e-03

Pos logit contributions
rily+0.031
ittered+0.031
immer+0.031
fer+0.030
lights+0.029
Neg logit contributions
 a-0.005
 two-0.005
 big-0.005
-0.004
 and-0.004
Token.
Token ID13
Feature activation+5.61e-03

Pos logit contributions
 two+0.007
 big+0.006
 a+0.006
 little+0.004
 the+0.004
Neg logit contributions
ittered-0.032
immer-0.032
keeping-0.032
ge-0.032
rily-0.031
Token One
Token ID1881
Feature activation-1.95e-02

Pos logit contributions
ittered+0.020
rily+0.020
immer+0.020
lights+0.019
epad+0.019
Neg logit contributions
 a-0.008
 two-0.008
 big-0.008
 the-0.007
 and-0.007
Token day
Token ID1110
Feature activation+4.82e-04

Pos logit contributions
 two+0.032
 a+0.029
 the+0.027
+0.026
 big+0.026
Neg logit contributions
rated-0.060
immer-0.060
comb-0.058
ittered-0.058
adows-0.058
Token,
Token ID11
Feature activation-3.38e-03

Pos logit contributions
ittered+0.002
immer+0.002
rily+0.002
lights+0.002
inations+0.002
Neg logit contributions
 two-0.000
 a-0.000
 big-0.000
 and-0.000
 the-0.000
Token Lily
Token ID20037
Feature activation-9.83e-03

Pos logit contributions
 two+0.005
 big+0.005
 a+0.004
 and+0.004
+0.004
Neg logit contributions
ittered-0.011
rily-0.011
lights-0.011
immer-0.011
 avail-0.011
Token had
Token ID550
Feature activation-1.77e-02

Pos logit contributions
 two+0.011
 a+0.010
 big+0.010
 the+0.009
+0.008
Neg logit contributions
keeping-0.036
ittered-0.036
rily-0.036
rated-0.035
immer-0.035
Token to
Token ID284
Feature activation-3.39e-03

Pos logit contributions
 two+0.007
 and+0.006
 big+0.005
 next+0.005
+0.004
Neg logit contributions
keeping-0.072
epad-0.071
ittered-0.069
rily-0.068
 avail-0.068
Token go
Token ID467
Feature activation-5.79e-03

Pos logit contributions
 two+0.005
 a+0.005
+0.005
 big+0.005
 and+0.005
Neg logit contributions
immer-0.012
 avail-0.012
rily-0.011
ittered-0.011
inations-0.011
Token to
Token ID284
Feature activation-7.08e-03

Pos logit contributions
 two+0.006
 a+0.005
+0.005
 big+0.005
 little+0.005
Neg logit contributions
inations-0.022
immer-0.022
ittered-0.022
rily-0.021
ge-0.021
Token the
Token ID262
Feature activation-1.21e-02

Pos logit contributions
 two+0.004
 and+0.004
 big+0.003
 a+0.003
+0.003
Neg logit contributions
 avail-0.030
rily-0.028
epad-0.028
keeping-0.028
inations-0.028
Token toilet
Token ID16146
Feature activation+7.63e-03

Pos logit contributions
 and+0.023
 two+0.022
 a+0.022
+0.021
 the+0.021
Neg logit contributions
 avail-0.035
rily-0.034
lor-0.034
arians-0.034
immer-0.034